TW202313972A - New nrg1 fusions, fusion junctions and methods for detecting them - Google Patents
New nrg1 fusions, fusion junctions and methods for detecting them Download PDFInfo
- Publication number
- TW202313972A TW202313972A TW111120470A TW111120470A TW202313972A TW 202313972 A TW202313972 A TW 202313972A TW 111120470 A TW111120470 A TW 111120470A TW 111120470 A TW111120470 A TW 111120470A TW 202313972 A TW202313972 A TW 202313972A
- Authority
- TW
- Taiwan
- Prior art keywords
- exon
- nrg1
- seq
- sequence
- fusion
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Abstract
Description
本揭露係有關神經調節蛋白1 (NRG1)融合物的領域、檢測這類融合物的方法、鑑定或診斷具有這類融合物之患者的方法、以及治療包含有NRG1融合物之癌症、腫瘤或異常細胞的方法。此外,本揭露係有關用於治療患有包含有NRG1融合物之ErbB-2/ErbB-3陽性癌症之個體的治療性(人類)抗體領域。The present disclosure pertains to the field of neuregulin 1 (NRG1) fusions, methods of detecting such fusions, methods of identifying or diagnosing patients with such fusions, and treating cancers, tumors or disorders comprising NRG1 fusions cell method. Furthermore, the disclosure is in the field of therapeutic (human) antibodies for use in the treatment of individuals with ErbB-2/ErbB-3 positive cancers comprising NRG1 fusions.
跨膜NRG1同功型之胞外結構域的蛋白分解處理會釋放可溶性因子。HRG1-β1係其中一種由NRG1基因編碼的蛋白質。NRG1含有Ig結構域及EGF樣結構域,該EGF樣域對於直接結合至受體酪胺酸激酶ErbB-3及ErbB-4而言是必要的。NRG1基因及同功型異形體已知有許多不同的別名,諸如:神經調節蛋白1 (Neuregulin 1);Pro-NRG1;HRGA;SMDF;HGL;GGF;NDF;NRG1內切轉錄2 (Intronic Transcript 2,不編碼蛋白質);Heregulin, α (45kD, ERBB2 P185-活化子);乙醯膽鹼受體誘導活性(Acetylcholine Receptor-Inducing Activity);原神經調節蛋白-1 (Pro-Neuregulin-1),為膜結合同功異形體;感覺及運動神經元衍生因子;新分化因子(Neu Differentiation Factor);神經膠質生長因子2;NRG1-IT2;MSTP131;MST131;ARIA;GGF2;HRG1;及HRG。NRG1基因的外部Id為HGNC: 7997;Entrez Gene: 3084;Ensembl: ENSG00000157168;OMIM: 142445及UniProtKB: Q02297。Proteolytic processing of the extracellular domain of the transmembrane NRG1 isoform releases soluble factors. HRG1-β1 is one of the proteins encoded by the NRG1 gene. NRG1 contains an Ig domain and an EGF-like domain that is necessary for direct binding to the receptor tyrosine kinases ErbB-3 and ErbB-4. The NRG1 gene and its isoforms are known by many different aliases, such as: Neuregulin 1; Pro-NRG1; HRGA; SMDF; HGL; GGF; NDF; , does not encode a protein); Heregulin, α (45kD, ERBB2 P185-activator); Acetylcholine Receptor-Inducing Activity (Acetylcholine Receptor-Inducing Activity); Pro-Neuregulin-1 (Pro-Neuregulin-1), for Membrane-bound isoforms; sensory and motor neuron-derived factors; Neu Differentiation Factor; glial growth factor 2; NRG1-IT2; MSTP131; MST131; ARIA; GGF2; HRG1; and HRG. The external Ids of the NRG1 gene are HGNC: 7997; Entrez Gene: 3084; Ensembl: ENSG00000157168; OMIM: 142445 and UniProtKB: Q02297.
NRG1之同功異形體係藉由選擇性剪接製造,且包括跨膜、外部膜結合、脫落(shed)、分泌、或胞內的形式(Falls之Exp Cell Res 284: 14–30, 2003; Hayes及Gullick之J. Mammary Gland Biol Neoplasia 13: 205-214, 2008)。彼等係結合至ErbB-3或ErbB-4,這可理解為促進與ErbB-2 (HER2)形成異二聚體。雖然通常將NRG1編碼蛋白視為有絲分裂原(mitogen),但其亦可強力促進細胞凋亡:尤其是在細胞中表現NRG1可造成該表現細胞的細胞凋亡(查閱Weinstein等人之oncogene 17: 2107–2113, 1998)。Isomorphic systems of NRG1 are produced by alternative splicing and include transmembrane, external membrane-bound, shed, secreted, or intracellular forms (Exp Cell Res 284: 14–30 in Falls, 2003; Hayes et al. Gullick, J. Mammary Gland Biol Neoplasia 13: 205-214, 2008). These bind to ErbB-3 or ErbB-4, which is understood to facilitate heterodimer formation with ErbB-2 (HER2). Although the NRG1-encoded protein is generally considered a mitogen, it is also a potent promoter of apoptosis: in particular, expression of NRG1 in cells can cause apoptosis in such expressing cells (see Weinstein et al. oncogene 17: 2107 –2113, 1998).
鑒於各種不同具有NRG1融合的腫瘤類型,基於新的融合伴隨體以及NRG1與融合伴隨體之間的新的斷點,需要有更快且更穩健的診斷方法,且需要有鑑定前所未知的NRG融合物的方法。Given the variety of tumor types with NRG1 fusions, there is a need for faster and more robust diagnostic methods based on novel fusion partners and new breakpoints between NRG1 and fusion partners, and the need to identify previously unknown NRGs Fusion method.
本揭露一般而言係提供涉及NRG1與新穎融合配偶體之融合物,以及由其所編碼之多肽融合物。本揭露提供一種涉及以NRG1及VAPB、NRG1及PVALB、NRG1及DAAM1、NRG1及ASPH、NRG1及ZFAT或是NRG1及DSCAML1作為新穎融合配偶體之融合物,在此一般來說分別指的是VAPB-NRG1、PVALB-NRG1、DAAM1- NRG1、ASPH-NRG1、ZFAT-NRG1及DSCAML1-NRG1。更特別的是,本揭露之此態樣係有關: 包含有VAPB核酸序列或VAPB核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸,或 包含有PVALB核酸序列或PVALB核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸,或 包含有DAAM1核酸序列或DAAM1核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸,或 包含有ASPH核酸序列或ASPH核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸,或 包含有ZFAT核酸序列或ZFAT核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸,或 包含有DSCAML1核酸序列或DSCAML1核酸序列的一部分且其與NRG1核酸序列的一部分或NRG1核酸序列的一部分融合之多核苷酸。 The present disclosure generally provides fusions involving NRG1 and novel fusion partners, as well as polypeptide fusions encoded thereby. The present disclosure provides a fusion involving NRG1 and VAPB, NRG1 and PVALB, NRG1 and DAAM1, NRG1 and ASPH, NRG1 and ZFAT, or NRG1 and DSCAML1 as novel fusion partners, generally referred to herein as VAPB- NRG1, PVALB-NRG1, DAAM1-NRG1, ASPH-NRG1, ZFAT-NRG1 and DSCAML1-NRG1. More particularly, this aspect of this disclosure relates to: a polynucleotide comprising a VAPB nucleic acid sequence or a portion of a VAPB nucleic acid sequence fused to a portion of an NRG1 nucleic acid sequence or a portion of an NRG1 nucleic acid sequence, or A polynucleotide comprising a PVALB nucleic acid sequence or a portion of a PVALB nucleic acid sequence fused to a portion of an NRG1 nucleic acid sequence or a portion of an NRG1 nucleic acid sequence, or A polynucleotide comprising a DAAM1 nucleic acid sequence or a portion of a DAAM1 nucleic acid sequence fused to a portion of an NRG1 nucleic acid sequence or a portion of an NRG1 nucleic acid sequence, or a polynucleotide comprising an ASPH nucleic acid sequence or a portion of an ASPH nucleic acid sequence fused to a portion of an NRG1 nucleic acid sequence or a portion of an NRG1 nucleic acid sequence, or A polynucleotide comprising a ZFAT nucleic acid sequence or a part of a ZFAT nucleic acid sequence fused to a part of an NRG1 nucleic acid sequence or a part of an NRG1 nucleic acid sequence, or A polynucleotide comprising a DSCAML1 nucleic acid sequence or a part of a DSCAML1 nucleic acid sequence fused to a part of an NRG1 nucleic acid sequence or a part of an NRG1 nucleic acid sequence.
較佳地,該VAPB核酸序列包含序列SEQ ID NO: 17-23中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 17-23中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the VAPB nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 17-23 or consists of it, or comprises an allele variation of any one of the SEQ ID NOs: 17-23 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有VAPB之外顯子1或外顯子1之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,VAPB之外顯子1包含SEQ ID NO: 17或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 17的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising exon 1 or exon 1 of VAPB and its allele with exon 2 or exon 2 of NRG1 A polynucleotide in which a portion of a variant is fused. Preferably, VAPB exon 1 comprises or consists of SEQ ID NO: 17, and its allele variant is preferably a variant of SEQ ID NO: 17. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
較佳地,該PVALB核酸序列包含序列SEQ ID NO: 439-444中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 439-444中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the PVALB nucleic acid sequence comprises (a part of) any of the sequences SEQ ID NO: 439-444 or consists of, or comprises an allele variation of any of SEQ ID NO: 439-444 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有PVALB之外顯子4或外顯子4之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,PVALB之外顯子4包含SEQ ID NO: 422或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 422的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising exon 4 or exon 4 of PVALB and its allele with exon 6 or exon 6 of NRG1 A polynucleotide in which a portion of a variant is fused. Preferably, exon 4 of PVALB comprises or consists of SEQ ID NO: 422, and its allele variant is preferably a variant of SEQ ID NO: 422. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
較佳地,該DAAM1核酸序列包含序列SEQ ID NO: 606-631中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 606-631中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the DAAM1 nucleic acid sequence comprises (a part of) any of the sequences SEQ ID NO: 606-631 or consists of, or comprises an allele variation of any of SEQ ID NO: 606-631 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有DAAM1之外顯子1或外顯子1之對偶基因變異體的一部分且其與NRG1之外顯子1或外顯子1之對偶基因變異體的一部分融合之多核苷酸。較佳地,DAAM1之外顯子1包含SEQ ID NO: 606或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 606的變異體。NRG1之外顯子1包含SEQ ID NO: 125或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 125的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising a portion of DAAM1 exon 1 or exon 1 allele variant with NRG1 exon 1 or exon 1 allele A polynucleotide in which a portion of a variant is fused. Preferably, exon 1 of DAAM1 comprises or consists of SEQ ID NO: 606, and its allele variant is preferably a variant of SEQ ID NO: 606. Exon 1 of NRG1 comprises or consists of SEQ ID NO: 125, and its allele variant is preferably a variant of SEQ ID NO: 125.
較佳地,該ASPH核酸序列包含序列SEQ ID NO: 637-662中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 637-662中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the ASPH nucleic acid sequence comprises (a part of) any of the sequences SEQ ID NO: 637-662 or consists of, or comprises an allele variation of any of SEQ ID NO: 637-662 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有ASPH之外顯子22或外顯子22之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,ASPH之外顯子22包含SEQ ID NO: 658或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 658的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising exon 22 or exon 22 of ASPH and its allele with exon 2 or exon 2 of NRG1 A polynucleotide in which a portion of a variant is fused. Preferably, ASPH exon 22 comprises or consists of SEQ ID NO: 658, and the allele variant thereof is preferably a variant of SEQ ID NO: 658. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
較佳地,該ZFAT核酸序列包含序列SEQ ID NO: 830-846中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 830-846中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the ZFAT nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 830-846 or consists of, or comprises an allele variation of any one of the SEQ ID NOs: 830-846 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有ZFAT之外顯子12或外顯子12之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,ZFAT之外顯子12包含SEQ ID NO: 841或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 841的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising exon 12 or exon 12 of ZFAT and its allele with exon 6 or exon 6 of NRG1 A polynucleotide in which a portion of a variant is fused. Preferably, ZFAT exon 12 comprises or consists of SEQ ID NO: 841, and its allele variant is preferably a variant of SEQ ID NO: 841. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
較佳地,該DSCAML1核酸序列包含序列SEQ ID NO: 870-903中之任一者(的一部分)或由其組成,或是包含SEQ ID NO: 870-903中之任一者之對偶基因變異體(的一部分)或由其組成。較佳地,該NRG1核酸序列包含序列SEQ ID NO: 125-138中之任一者(的一部分)或由其組成,或是包含序列SEQ ID NO: 125-138中之任一者之對偶基因變異體(的一部分)或由其組成。Preferably, the DSCAML1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 870-903 or consists of, or comprises an allele variation of any one of SEQ ID NO: 870-903 body (part of) or consists of. Preferably, the NRG1 nucleic acid sequence comprises (a part of) any one of the sequences SEQ ID NO: 125-138 or consists of, or comprises an allele of any one of the sequences SEQ ID NO: 125-138 or consisting of (part of) a variant.
替代性地或除此之外,本揭露提供一種包含有DSCAML1之外顯子3或外顯子3之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,DSCAML1之外顯子3包含SEQ ID NO: 872或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 872的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。Alternatively or in addition, the present disclosure provides an allele variant comprising exon 3 or exon 3 of DSCAML1 and its allele with exon 2 or exon 2 of NRG1 A polynucleotide in which a portion of a variant is fused. Preferably, exon 3 of DSCAML1 comprises or consists of SEQ ID NO: 872, and the allele variant thereof is preferably a variant of SEQ ID NO: 872. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
另外地,本揭露提供涉及具先前未經揭露的融合接合處的NRG1之融合物,以及由其所編碼之多肽融合物。尤其,本揭露另外提供涉及NRG1與CADM1之融合物,在此一般來說指的是CADM1-NRG1。本揭露也提供涉及NRG1與CD44之融合物,在此一般來說指的是CD44-NRG1。本揭露也提供涉及NRG1與SLC3A2之融合物,在此一般來說指的是SLC3A2-NRG1。本揭露也提供涉及NRG1與VTCN1之融合物,在此一般來說指的是VTCN1-NRG1。本揭露也提供涉及NRG1與CDH1之融合物,在此一般來說指的是CDH1-NRG1。本揭露也提供涉及NRG1與CXADR之融合物,在此一般來說指的是CXADR-NRG1。本揭露也提供涉及NRG1與GTF2E2之融合物,在此一般來說指的是GTF2E2-NRG1。本揭露也提供涉及NRG1與CSMD1之融合物,在此一般來說指的是CSMD1-NRG1。本揭露也提供涉及NRG1與PTN之融合物,在此一般來說指的是PTN-NRG1。本揭露也提供涉及NRG1與ST14之融合物,在此一般來說指的是ST14-NRG1。本揭露也提供涉及NRG1與THBS1之融合物,在此一般來說指的是THBS1-NRG1。本揭露也提供涉及NRG1與AGRN之融合物,在此一般來說指的是AGRN-NRG1。本揭露也提供涉及NRG1與APP之融合物,在此一般來說指的是APP-NRG1。本揭露也提供涉及NRG1與WRN之融合物,在此一般來說指的是WRN-NRG1。本揭露也提供涉及NRG1與NRG1之融合物,在此一般來說指的是NRG1-NRG1。本揭露也提供涉及NRG1與NOTCH2之融合物,在此一般來說指的是NOTCH2-NRG1。本揭露也提供涉及NRG1與CD74之融合物,在此一般來說指的是CD74-NRG1。本揭露也提供涉及NRG1與SDC4之融合物,在此一般來說指的是SDC4-NRG1。本揭露也提供涉及NRG1與SLC4A4之融合物,在此一般來說指的是SLC4A4-NRG1。Additionally, the present disclosure provides fusions involving NRG1 having previously undisclosed fusion junctions, as well as polypeptide fusions encoded thereby. In particular, the present disclosure additionally provides fusions involving NRG1 and CADM1, generally referred to herein as CADM1-NRG1. The present disclosure also provides fusions involving NRG1 and CD44, generally referred to herein as CD44-NRG1. The present disclosure also provides fusions involving NRG1 and SLC3A2, generally referred to herein as SLC3A2-NRG1. The present disclosure also provides fusions involving NRG1 and VTCN1, generally referred to herein as VTCN1-NRG1. The present disclosure also provides fusions involving NRG1 and CDH1, generally referred to herein as CDH1-NRG1. The present disclosure also provides fusions involving NRG1 and CXADR, generally referred to herein as CXADR-NRG1. The present disclosure also provides fusions involving NRG1 and GTF2E2, generally referred to herein as GTF2E2-NRG1. The present disclosure also provides fusions involving NRG1 and CSMD1, generally referred to herein as CSMD1-NRG1. The present disclosure also provides fusions involving NRG1 and PTN, generally referred to herein as PTN-NRG1. The present disclosure also provides fusions involving NRG1 and ST14, generally referred to herein as ST14-NRG1. The present disclosure also provides fusions involving NRG1 and THBS1, generally referred to herein as THBS1-NRG1. The present disclosure also provides fusions involving NRG1 and AGRN, generally referred to herein as AGRN-NRG1. The present disclosure also provides fusions involving NRG1 and APP, generally referred to herein as APP-NRG1. The present disclosure also provides fusions involving NRG1 and WRN, generally referred to herein as WRN-NRG1. The present disclosure also provides fusions involving NRG1 and NRG1, generally referred to herein as NRG1-NRG1. The present disclosure also provides fusions involving NRG1 and NOTCH2, generally referred to herein as NOTCH2-NRG1. The present disclosure also provides fusions involving NRG1 and CD74, generally referred to herein as CD74-NRG1. The present disclosure also provides fusions involving NRG1 and SDC4, generally referred to herein as SDC4-NRG1. The present disclosure also provides fusions involving NRG1 and SLC4A4, generally referred to herein as SLC4A4-NRG1.
本揭露提供以下新穎融合接合處。尤其,本揭露提供一種包含有CADM1之外顯子7或外顯子7之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,CADM1之外顯子7包含SEQ ID NO: 39或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 39的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。The present disclosure provides the following novel fusion junctions. In particular, the present disclosure provides a multikaryon comprising exon 7 or a portion of an allogenic variant of exon 7 of CADM1 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 7 of CADM1 comprises or consists of SEQ ID NO: 39, and its allele variant is preferably a variant of SEQ ID NO: 39. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有CD44之外顯子5或外顯子5之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,CD44之外顯子5包含SEQ ID NO: 65或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 65的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the disclosure provides a multinuclear CD44 comprising exon 5 or a portion of an allogenic variant of exon 5 fused to exon 2 or a portion of an allogenic variant of NRG1 glycosides. Preferably, exon 5 of CD44 comprises or consists of SEQ ID NO: 65, and the allele variant thereof is preferably a variant of SEQ ID NO: 65. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有CD44之外顯子5或外顯子5之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,CD44之外顯子5包含SEQ ID NO: 65或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 65的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the disclosure provides a multinuclear CD44 comprising exon 5 or a portion of an allogenic variant of exon 5 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 5 of CD44 comprises or consists of SEQ ID NO: 65, and the allele variant thereof is preferably a variant of SEQ ID NO: 65. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有SLC3A2之轉錄本6之外顯子1或轉錄本6之外顯子1之對偶基因變異體的一部分且其與NRG1之外顯子5或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,所述SLC3A2之外顯子包含SEQ ID NO: 103或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 103的變異體。NRG1之外顯子5包含SEQ ID NO: 129或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 129的變異體。In particular, the present disclosure provides an allele variant comprising exon 1 of transcript 6 of SLC3A2 or a part of an allele variant of exon 1 of transcript 6 and paired with exon 5 or exon 2 of NRG1 A polynucleotide in which a portion of a gene variant is fused. Preferably, the exon of SLC3A2 comprises or consists of SEQ ID NO: 103, and the allele variant thereof is preferably a variant of SEQ ID NO: 103. Exon 5 of NRG1 comprises or consists of SEQ ID NO: 129, and the allele variant thereof is preferably a variant of SEQ ID NO: 129.
尤其,本揭露提供一種包含有VTCN1之外顯子2或外顯子2之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,VTCN1之外顯子2包含SEQ ID NO: 169或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 169的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 2 or part of an allogenic variant of exon 2 of VTCN1 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, VTCN1 exon 2 comprises or consists of SEQ ID NO: 169, and its allele variant is preferably a variant of SEQ ID NO: 169. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有CDH1之外顯子11或外顯子11之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,CDH1之外顯子11包含SEQ ID NO: 198或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 198的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 11 or part of an allogenic variant of exon 11 of CDH1 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, exon 11 of CDH1 comprises or consists of SEQ ID NO: 198, and the allele variant thereof is preferably a variant of SEQ ID NO: 198. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有CXADR之外顯子1或外顯子1之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,CXADR之外顯子1包含SEQ ID NO: 219或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 219的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multinuclear CXADR comprising exon 1 or part of an allogenic variant of exon 1 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, CXADR exon 1 comprises or consists of SEQ ID NO: 219, and its allele variant is preferably a variant of SEQ ID NO: 219. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有GTF2E2之外顯子2或外顯子2之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,GTF2E2之外顯子2包含SEQ ID NO: 236或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 236的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 2 or part of an allogenic variant of exon 2 of GTF2E2 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, exon 2 of GTF2E2 comprises or consists of SEQ ID NO: 236, and the allele variant thereof is preferably a variant of SEQ ID NO: 236. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有CSMD1之外顯子23或外顯子23之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,CSMD1之外顯子23包含SEQ ID NO: 279或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 279的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 23 or a portion of an allogenic variant of exon 23 of CSMD1 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 23 of CSMD1 comprises or consists of SEQ ID NO: 279, and its allele variant is preferably a variant of SEQ ID NO: 279. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有PTN之外顯子4或外顯子4之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,PTN之外顯子4包含SEQ ID NO: 318或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 318的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 4 or part of an allogenic variant of exon 4 of PTN fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, PTN exon 4 comprises or consists of SEQ ID NO: 318, and its allele variant is preferably a variant of SEQ ID NO: 318. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有ST14之外顯子11或外顯子11之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,ST14之外顯子11包含SEQ ID NO: 342或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 342的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 11 or a portion of an allogenic variant of exon 11 of ST14 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 11 of ST14 comprises or consists of SEQ ID NO: 342, and the allele variant thereof is preferably a variant of SEQ ID NO: 342. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有THBS1之外顯子9或外顯子9之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,THBS1之外顯子9包含SEQ ID NO: 386或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 386的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 9 or part of an allogenic variant of exon 9 of THBS1 fused to exon 6 or part of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 9 of THBS1 comprises or consists of SEQ ID NO: 386, and the allele variant thereof is preferably a variant of SEQ ID NO: 386. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有AGRN之外顯子12或外顯子12之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,AGRN之外顯子12包含SEQ ID NO: 416或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 416的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 12 or a portion of an allogenic variant of exon 12 of AGRN fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 12 of AGRN comprises or consists of SEQ ID NO: 416, and the allele variant thereof is preferably a variant of SEQ ID NO: 416. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有PVALB之外顯子4或外顯子4之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,PVALB之外顯子4包含SEQ ID NO: 442或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 442的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multinuclear polynucleotide comprising exon 4 or a portion of an allogenic variant of exon 4 of PVALB fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 4 of PVALB comprises or consists of SEQ ID NO: 442, and the allele variant thereof is preferably a variant of SEQ ID NO: 442. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有SLC3A2之轉錄本3之外顯子2或轉錄本3之外顯子2之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,所述SLC3A2之外顯子包含SEQ ID NO: 457或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 457的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a part of an allele variant comprising exon 2 of transcript 3 or exon 2 of transcript 3 of SLC3A2 and its pairing with exon 6 or exon 6 of NRG1 A polynucleotide in which a portion of a gene variant is fused. Preferably, the exon of SLC3A2 comprises or consists of SEQ ID NO: 457, and its allele variant is preferably a variant of SEQ ID NO: 457. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有APP之外顯子14或外顯子14之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,APP之外顯子14包含SEQ ID NO: 501或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 501的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 14 or a portion of an allogenic variant of exon 14 of APP fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, APP exon 14 comprises or consists of SEQ ID NO: 501, and its allele variant is preferably a variant of SEQ ID NO: 501. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有WRN之外顯子33或外顯子33之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,WRN之外顯子33包含SEQ ID NO: 562或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 562的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multinuclear WRN comprising exon 33 or a portion of an allogenic variant of exon 33 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, WRN exon 33 comprises or consists of SEQ ID NO: 562, and its allele variant is preferably a variant of SEQ ID NO: 562. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有NOTCH2之外顯子6或外顯子6之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,NOTCH2之外顯子6包含SEQ ID NO: 700或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 700的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multikaryon comprising exon 6 or a portion of an allogenic variant of exon 6 of NOTCH2 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 6 of NOTCH2 comprises or consists of SEQ ID NO: 700, and the allele variant thereof is preferably a variant of SEQ ID NO: 700. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
尤其,本揭露提供一種包含有CD74之外顯子2或外顯子2之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,CD74之外顯子2包含SEQ ID NO: 720或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 720的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multinuclear CD74 comprising exon 2 or part of an allogenic variant of exon 2 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, exon 2 of CD74 comprises or consists of SEQ ID NO: 720, and the allele variant thereof is preferably a variant of SEQ ID NO: 720. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有SDC4之外顯子2或外顯子2之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,SDC4之外顯子2包含SEQ ID NO: 746或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 746的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 2 or part of an allogenic variant of exon 2 of SDC4 fused to exon 2 or part of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, exon 2 of SDC4 comprises or consists of SEQ ID NO: 746, and the allele variant thereof is preferably a variant of SEQ ID NO: 746. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有SDC4之外顯子4或外顯子4之對偶基因變異體的一部分且其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合之多核苷酸。較佳地,SDC4之外顯子4包含SEQ ID NO: 748或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 748的變異體。NRG1之外顯子2包含SEQ ID NO: 126或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 126的變異體。In particular, the present disclosure provides a multikaryon comprising exon 4 or a portion of an allogenic variant of exon 4 of SDC4 fused to exon 2 or a portion of an allogenic variant of exon 2 of NRG1 glycosides. Preferably, exon 4 of SDC4 comprises or consists of SEQ ID NO: 748, and the allele variant thereof is preferably a variant of SEQ ID NO: 748. NRG1 exon 2 comprises or consists of SEQ ID NO: 126, and its allele variant is preferably a variant of SEQ ID NO: 126.
尤其,本揭露提供一種包含有SLC4A4之外顯子14或外顯子14之對偶基因變異體的一部分且其與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分融合之多核苷酸。較佳地,SLC4A4之外顯子14包含SEQ ID NO: 780或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 780的變異體。NRG1之外顯子6包含SEQ ID NO: 130或由其組成,且其對偶基因變異體較佳為SEQ ID NO: 130的變異體。In particular, the present disclosure provides a multinuclear polynucleotide comprising exon 14 or a portion of an allogenic variant of exon 14 of SLC4A4 fused to exon 6 or a portion of an allogenic variant of exon 6 of NRG1 glycosides. Preferably, exon 14 of SLC4A4 comprises or consists of SEQ ID NO: 780, and the allele variant thereof is preferably a variant of SEQ ID NO: 780. Exon 6 of NRG1 comprises or consists of SEQ ID NO: 130, and its allele variant is preferably a variant of SEQ ID NO: 130.
本文所提供之NRG1融合物皆已在源自患者的樣本中觀察到,尤其是經診斷患有癌症之患者。The NRG1 fusions provided herein have all been observed in samples derived from patients, particularly patients diagnosed with cancer.
所揭露之鑑定此等NRG1融合物及基因重排可提供用於判定NRG1多核苷酸融合物或多肽是否存在於或來自於生物樣本中的新方法,用於測定NRG1多肽融合物之活性的方法,用於診斷會表現NRG1多肽融合物之癌症、腫瘤或異常細胞的方法,用於測定NRG1多肽融合物之活性的方法,用於治療會表現NRG1多肽融合物之癌症、腫瘤或異常細胞的方法,及/或抑制特徵為表現NRG1多核苷酸融合物或多肽之腫瘤形成之病程的方法。本揭露因此亦提供此等或更多種態樣並詳細描述於下文中。The disclosed identification of these NRG1 fusions and gene rearrangements may provide new methods for determining whether NRG1 polynucleotide fusions or polypeptides are present in or derived from biological samples, methods for determining the activity of NRG1 polypeptide fusions , a method for diagnosing a cancer, tumor or abnormal cell expressing an NRG1 polypeptide fusion, a method for determining the activity of an NRG1 polypeptide fusion, a method for treating a cancer, tumor or abnormal cell expressing an NRG1 polypeptide fusion , and/or a method of inhibiting the course of a neoplasia characterized by NRG1 polynucleotide fusions or polypeptides. The present disclosure therefore also provides these or more aspects and are described in detail below.
本揭露亦提供由選自以下之任一種多核苷酸融合物所編碼的多肽融合物:VAPB-NRG1、CADM1-NRG1 CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1。當本揭露之任一種多肽融合物被本文所提及之異常細胞表現時,其較佳地亦包含NRG1之EGF樣結構域。當本揭露之任一種多核苷酸融合物被含在本文所提及之異常細胞時,其較佳地包含編碼EGF樣結構域的NRG1外顯子,諸如NRG1之外顯子6、7或8,更佳地包含NRG1多核苷酸序列之外顯子6及7。The disclosure also provides a polypeptide fusion encoded by any polynucleotide fusion selected from the group consisting of: VAPB-NRG1, CADM1-NRG1 CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4- NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1. When any polypeptide fusion of the present disclosure is expressed by the abnormal cells mentioned herein, it preferably also comprises the EGF-like domain of NRG1. When any one of the polynucleotide fusions of the present disclosure is contained in the abnormal cells mentioned herein, it preferably comprises an NRG1 exon encoding an EGF-like domain, such as NRG1 exon 6, 7 or 8 , preferably comprising exons 6 and 7 of the NRG1 polynucleotide sequence.
亦提供一種包含有本揭露之多核苷酸融合物的載體、一種包含有所述多核苷酸及/或所述載體之宿主細胞。亦提供一種製造本揭露之多肽的方法,包含將所述重組宿主細胞維持在適合該多核苷酸表現的條件下,藉以表現該多核苷酸融合物且產生一多肽融合物,隨後分離或純化該多肽。亦提供一種製造重組宿主細胞的方法,包含將所述載體導入宿主細胞內。A vector comprising the polynucleotide fusion of the present disclosure, a host cell comprising the polynucleotide and/or the vector are also provided. Also provided is a method of producing a polypeptide of the present disclosure, comprising maintaining the recombinant host cell under conditions suitable for expression of the polynucleotide, thereby expressing the polynucleotide fusion and producing a polypeptide fusion, followed by isolation or purification The peptide. Also provided is a method of producing a recombinant host cell, comprising introducing the vector into the host cell.
亦提供一種用於檢測本揭露之多核苷酸融合物的核酸探針、引子或引子對以及一種包含有用於檢測本揭露之多核苷酸融合物的存在之核酸探針、引子或引子對的檢測測定法。本揭露之這類核酸探針、引子或引子對較佳的長度為10-40個核苷酸。Also provided are a nucleic acid probe, primer or primer pair for detecting a polynucleotide fusion of the present disclosure and an assay comprising a nucleic acid probe, primer or primer pair for detecting the presence of a polynucleotide fusion of the present disclosure Assay. Such nucleic acid probes, primers or primer pairs of the present disclosure preferably have a length of 10-40 nucleotides.
亦提供一種用於檢測本揭露之多核苷酸融合物所編碼的多肽之第一抗體或第一與第二抗體對組,以及一種包含有該第一抗體或該第一與第二抗體對組以用於檢測本揭露之多核苷酸融合物所編碼之多肽的存在的檢測測定法。較佳地,該第一抗體與選自以下之多肽融合物結合:VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1,而該第一與第二抗體對組分別與VAPB及NRG1、或CADM1及NRG1、或CD44及NRG1、或SLC3A2及NRG1、或VTCN1及NRG1、或CDH1及NRG1、或CXADR及NRG1、或GTF2E2及NRG1、或CSMD1及NRG1、或PTN及NRG1、或ST14及NRG1、或THBS1及NRG1、或AGRN及NRG1、或PVALB及NRG1、APP及NRG1、或WRN及NRG1、或ASPH及NRG1、或NOTCH2及NRG1、或CD74及NRG1、或SDC4及NRG1、或SLC4A4及NRG1、或ZFAT及NRG1、或DSCAML1及NRG1結合。Also provided are a first antibody or a pair of first and second antibodies for detecting a polypeptide encoded by a polynucleotide fusion of the present disclosure, and a pair of antibodies comprising the first antibody or the pair of first and second antibodies With detection assays for detecting the presence of polypeptides encoded by polynucleotide fusions of the present disclosure. Preferably, the first antibody binds to a polypeptide fusion selected from: VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4- NRG1, ZFAT-NRG1 and DSCAML1-NRG1, and the first and second antibody pair groups are respectively associated with VAPB and NRG1, or CADM1 and NRG1, or CD44 and NRG1, or SLC3A2 and NRG1, or VTCN1 and NRG1, or CDH1 and NRG1 , or CXADR and NRG1, or GTF2E2 and NRG1, or CSMD1 and NRG1, or PTN and NRG1, or ST14 and NRG1, or THBS1 and NRG1, or AGRN and NRG1, or PVALB and NRG1, APP and NRG1, or WRN and NRG1, Or ASPH and NRG1, or NOTCH2 and NRG1, or CD74 and NRG1, or SDC4 and NRG1, or SLC4A4 and NRG1, or ZFAT and NRG1, or DSCAML1 and NRG1 combined.
本揭露包括了鑑定或檢測人類個體中之NRG1融合物之方法,且進一步涵蓋了診斷、治療選擇、及藥物治療之方法及其組合,以改善NRG1融合物相關的疾病,包括實體瘤。除此之外,本揭露提供直接了斷的手段或測定法來快速評估個體是否罹患或易於罹患癌症、腫瘤、或異常細胞。這類NRG1融合物資訊可有利作為診斷工具中之生物標誌物使用。The present disclosure includes methods of identifying or detecting NRG1 fusions in a human subject, and further encompasses methods of diagnosis, treatment selection, and drug therapy, and combinations thereof, to ameliorate NRG1 fusion-associated diseases, including solid tumors. Among other things, the present disclosure provides a straightforward means or assay to quickly assess whether an individual has or is predisposed to cancer, tumor, or abnormal cells. Such NRG1 fusion information can be advantageously used as a biomarker in a diagnostic tool.
因此,亦提供一種於樣本中鑑定如本文所提及之任一多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。Accordingly, there is also provided a method of identifying in a sample any polynucleotide fusion as referred to herein, or a polypeptide encoded thereby, which method comprises testing a sample obtained from an individual to detect in the sample Whether the fusion exists.
亦提供一種於樣本中檢測如本文所提及之任一多核苷酸融合物、或由其所編碼之多肽是否存在的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。Also provided is a method of detecting the presence in a sample of any polynucleotide fusion as referred to herein, or a polypeptide encoded thereby, said method comprising testing a sample obtained from an individual to detect in the sample Whether the fusion exists.
亦提供一種確定來自個體之異常細胞(諸如癌細胞或腫瘤細胞)是否包含本文所提及之任一多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自該個體之細胞或所述細胞之多核苷酸或多肽內容物是否於樣本中存在該融合物。Also provided is a method of determining whether an abnormal cell from an individual, such as a cancer cell or a tumor cell, comprises any of the polynucleotide fusions mentioned herein, or a polypeptide encoded thereby, comprising testing the Whether the cells of the individual or the polynucleotide or polypeptide content of the cells are present in the sample with the fusion.
亦提供一種鑑定一個體帶有本文所提及之多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。較佳地該檢驗之後為將該樣本中檢測到的融合物與帶有該融合物之個體聯繫在一起的步驟。Also provided is a method of identifying an individual carrying a polynucleotide fusion referred to herein, or a polypeptide encoded thereby, said method comprising testing a sample obtained from an individual to detect the presence of the fusion in the sample . Preferably the test is followed by the step of associating the fusion detected in the sample with the individual carrying the fusion.
本文所提及之樣本的檢驗較佳地係為活體外分析或這類方法、或離體分析或這類方法的一部分。The testing of samples referred to herein is preferably part of an in vitro analysis or such method, or an ex vivo analysis or such method.
較佳地,該個體為哺乳動物或人類個體,且該多核苷酸或多肽為哺乳動物或人類多核苷酸或多肽。Preferably, the individual is a mammalian or human individual and the polynucleotide or polypeptide is a mammalian or human polynucleotide or polypeptide.
較佳地,本文所提及之任一多核苷酸融合物及/或本文所提及之由其所編碼之多肽係經分離及/或純化或是實質上分離或實質上純化。Preferably, any of the polynucleotide fusions mentioned herein and/or the polypeptides encoded thereby mentioned herein are isolated and/or purified or substantially isolated or substantially purified.
本揭露係有關於罹患癌症或懷疑罹患癌症(尤其是實體癌或實體瘤)之個體。尤其,這類癌症為腺癌,更尤其是黏液腺癌、胰臟癌,更尤其是胰臟腺癌、或腎細胞癌、膽管癌、腦癌、神經膠質母細胞瘤、膽管癌、神經膠質瘤、胰管腺癌、肉瘤、膀胱癌、大腸癌、直腸癌、大腸直腸癌、膽囊癌、頭頸癌、前列腺癌、子宮癌、乳癌、卵巢癌、肝癌、子宮內膜癌、肺癌,尤其是非小細胞肺癌、或浸潤性黏液腺癌。該癌症可為原發性癌或轉移性癌。This disclosure relates to individuals suffering from or suspected of having cancer, especially solid cancer or tumors. In particular, such cancer is adenocarcinoma, more especially mucinous adenocarcinoma, pancreatic cancer, more especially pancreatic adenocarcinoma, or renal cell carcinoma, cholangiocarcinoma, brain cancer, glioblastoma, cholangiocarcinoma, glial Tumor, pancreatic duct adenocarcinoma, sarcoma, bladder cancer, colorectal cancer, rectal cancer, colorectal cancer, gallbladder cancer, head and neck cancer, prostate cancer, uterine cancer, breast cancer, ovarian cancer, liver cancer, endometrial cancer, lung cancer, especially non- Small cell lung cancer, or invasive mucinous adenocarcinoma. The cancer can be a primary cancer or a metastatic cancer.
本揭露亦提供一種治療患有ErbB-2及/或ErbB-3陽性異常細胞(諸如癌症或腫瘤)之個體的方法,所述異常細胞包含了本文所提及之多核苷酸融合物,且/或這類細胞表現了由其所編碼之多肽融合物,所述方法包含向該個體投與治療量之ErbB-2及/或ErbB-3標靶劑。較佳地,投與所述標靶劑之前係檢測本文所提及之多核苷酸或多肽融合物。The present disclosure also provides a method of treating an individual suffering from ErbB-2 and/or ErbB-3 positive abnormal cells (such as cancer or tumor), said abnormal cells comprising the polynucleotide fusions mentioned herein, and/ Or such cells express a polypeptide fusion encoded thereby, the method comprising administering to the individual a therapeutic amount of an ErbB-2 and/or ErbB-3 targeting agent. Preferably, the polynucleotide or polypeptide fusions mentioned herein are detected prior to administration of said targeting agent.
本揭露亦提供一種抑制患有ErbB-2及ErbB-3陽性異常細胞之個體病程的方法,所述異常細胞包含了本文所提及之多核苷酸融合物,且/或表現了由其所編碼之多肽融合物,所述方法包含向該個體投與治療量之ErbB-2及/或ErbB-3標靶劑。較佳地,投與所述標靶劑之前係檢測本文所提及之多核苷酸或多肽融合物。The present disclosure also provides a method of inhibiting the disease course of an individual suffering from ErbB-2 and ErbB-3 positive abnormal cells comprising and/or expressing the polynucleotide fusions mentioned herein. A polypeptide fusion, the method comprising administering to the individual a therapeutic amount of an ErbB-2 and/or ErbB-3 targeting agent. Preferably, the polynucleotide or polypeptide fusions mentioned herein are detected prior to administration of said targeting agent.
本揭露亦提供一種供使用於患有ErbB-2及ErbB-3陽性癌症或腫瘤之個體的治療中的ErbB-2及/或ErbB-3標靶劑,所述癌症或腫瘤包含了本文所提及之多核苷酸融合物,且/或表現了由其所編碼之多肽融合物,所述治療包含向該個體投與治療量之ErbB-2及/或ErbB-3標靶劑。較佳地,投與所述標靶劑之前係檢測本文所提及之多核苷酸或多肽融合物。The disclosure also provides an ErbB-2 and/or ErbB-3 targeting agent for use in the treatment of an individual with an ErbB-2 and ErbB-3 positive cancer or tumor, including the cancers or tumors mentioned herein and a polynucleotide fusion thereof, and/or a polypeptide fusion encoded thereby, wherein the treatment comprises administering to the individual a therapeutic amount of an ErbB-2 and/or ErbB-3 targeting agent. Preferably, the polynucleotide or polypeptide fusions mentioned herein are detected prior to administration of said targeting agent.
本揭露亦提供一種診斷帶有包含了本文所提及之多核苷酸融合物由其所編碼之多肽之異常細胞的個體的方法,該方法包含檢驗一獲自個體的樣本以檢測該樣本中是否存在該融合物。The disclosure also provides a method of diagnosing an individual with abnormal cells comprising a polypeptide encoded by a polynucleotide fusion mentioned herein, the method comprising testing a sample obtained from the individual to detect whether This fusion exists.
本揭露亦提供一種ErbB-2及/或ErbB-3標靶劑之治療方法或用途,其中係篩選個體是否存有本文所提及之NGR1融合物,隨後投與ErbB-2及/或ErbB-3標靶劑。The present disclosure also provides a treatment method or use of an ErbB-2 and/or ErbB-3 targeting agent, wherein an individual is screened for the presence of the NGR1 fusion mentioned herein, and then administered ErbB-2 and/or ErbB- 3 targeting agents.
本揭露亦包括活體內模型,諸如表現在其基因體內或是包含有本文所提及之多核苷酸融合物的移植異常細胞之異種移植或基因轉殖動物模型,且/或表現由其所編碼之多肽融合物,以及使用ERB2及/或Erb3標靶劑或其他標靶劑之這類模型的治療。較佳地,該動物模型為非人類動物模型。The present disclosure also includes in vivo models, such as xenograft or transgenic animal models of transplanted abnormal cells expressing in their genes or comprising the polynucleotide fusions mentioned herein, and/or expressing the genes encoded by them. and treatment of such models using ERB2 and/or Erb3 targeting agents or other targeting agents. Preferably, the animal model is a non-human animal model.
本揭露提供涉及NRG1與新穎融合配偶體之融合物,以及由其所編碼之多肽融合物。此外,本揭露提供涉及具先前未經揭露的融合接合處的NRG1之融合物,以及由其所編碼之多肽融合物。The present disclosure provides fusions involving NRG1 and novel fusion partners, as well as polypeptide fusions encoded thereby. In addition, the present disclosure provides fusions involving NRG1 with previously undisclosed fusion junctions, as well as polypeptide fusions encoded thereby.
一般而言,本揭露提供涉及NRG1與融合配偶體之融合物,該融合配偶體係選自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1。所述融合物在此指的是VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1。存在本揭露之NRG1融合物中之任一者時,無論是多核苷酸或由其轉譯的多肽,都表示了異常細胞(諸如癌症或腫瘤)的存在。 NRG1 In general, the present disclosure provides fusions involving NRG1 and a fusion partner selected from the group consisting of VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB , APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1. Said fusions are referred to herein as VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14 -NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 or DSCAML1-NRG1. The presence of any of the NRG1 fusions of the present disclosure, whether polynucleotides or polypeptides translated therefrom, indicates the presence of abnormal cells such as cancer or tumors. NRG1
NRG1基因編碼各種NRG1的同功型異形體。各種同功型異形體及其預期功能在Adelaide等人之Genes Chromosomes Cancer, Aug;37(4):333-45 (2003)中進行了描述。GGF及GGF2同功型異形體含有kringle樣序列加上Ig及EGF樣結構域;並且SMDF同功型異形體與其他同功型異形體僅共用EGF樣結構域。NRG1同功型異形體的受體是酪胺酸激酶跨膜受體的ErbB家族。該家族也被稱作人類表皮生長因子(EGF)受體家族(HER)。該家族有四個成員:ErbB(紅血球母細胞瘤)-1、ErbB-2、ErbB-3及ErbB-4。受體(綜述於Yarden及Pines,Nat Rev Cancer. 2012 Jul 12;12(8):553-63)係廣泛表現於上皮細胞上。HER受體或其配位體,諸如神經調理因子(heregulin, HRG)或表皮生長因子(EGF)之表現量上調,是人類癌症中常見的事件(Wilson, Fridlyand等人,Nature. 2012 Jul 26; 487(7408): 505–509)。ErbB-1及ErbB-2的過度表現特別發生在上皮腫瘤中,並且與腫瘤侵襲、轉移、化療耐藥性及預後不良有關(Zhang, H., Berezov, A., Wang, Q., Zhang, G., Drebin, J., Murali, R.等人(2007) Erbb receptors: from oncogenes to targeted cancer therapies. J. Clin. Invest., 117, 2051- 2058)。在正常乳房中,ErbB-3已被證明在管腔上皮的生長及分化中很重要。例如,ErbB-3的丟失/抑制會導致在管腔上皮上的基底選擇性擴增(Balko, Miller等人,2012 PNAS January 3, 2012 109 (1) 221-226)。配體與酪胺酸激酶受體的細胞外結構域的結合誘導了受體二聚化,兩者介於相同(同二聚化(homodimerization))及不同(異二聚化(heterodimerization)受體亞型之間。二聚化可以活化細胞內的酪胺酸激酶結構域,使這些結構域進行自磷酸化且繼而活化許多下游的促增生信號傳導路徑,包括由促分裂原活化蛋白激酶(MAPK)及促存活路徑Akt所介導的路徑(綜述於Yarden及Pines,2012)。ErbB-3可以藉由其配體的接合而被活化。這些配體包括但不限於神經調節蛋白(NRG)及神經調理因子(HRG),以及NRG1融合物。The NRG1 gene encodes various isoforms of NRG1. Various isoforms and their predicted functions are described in Adelaide et al. Genes Chromosomes Cancer, Aug; 37(4):333-45 (2003). The GGF and GGF2 isoforms contain kringle-like sequences plus Ig and EGF-like domains; and the SMDF isoforms share only the EGF-like domain with the other isoforms. The receptors for the NRG1 isoforms are the ErbB family of tyrosine kinase transmembrane receptors. This family is also known as the human epidermal growth factor (EGF) receptor family (HER). There are four members of this family: ErbB (erythroblastoma)-1, ErbB-2, ErbB-3, and ErbB-4. The receptor (reviewed in Yarden and Pines, Nat Rev Cancer. 2012 Jul 12;12(8):553-63) is widely expressed on epithelial cells. Upregulation of HER receptors or their ligands, such as heregulin (HRG) or epidermal growth factor (EGF), is a common event in human cancers (Wilson, Fridlyand et al., Nature. 2012 Jul 26; 487(7408): 505–509). Overexpression of ErbB-1 and ErbB-2 occurs particularly in epithelial tumors and is associated with tumor invasion, metastasis, chemotherapy resistance, and poor prognosis (Zhang, H., Berezov, A., Wang, Q., Zhang, G., Drebin, J., Murali, R. et al. (2007) Erbb receptors: from oncogenes to targeted cancer therapies. J. Clin. Invest., 117, 2051- 2058). In the normal breast, ErbB-3 has been shown to be important in the growth and differentiation of the luminal epithelium. For example, loss/inhibition of ErbB-3 leads to basal-selective expansion on luminal epithelium (Balko, Miller et al., 2012 PNAS January 3, 2012 109 (1) 221-226). Ligand binding to the extracellular domain of a tyrosine kinase receptor induces receptor dimerization between the same (homodimerization) and different (heterodimerization) receptors Between isoforms. Dimerization can activate intracellular tyrosine kinase domains, allowing these domains to autophosphorylate and subsequently activate many downstream pro-proliferative signaling pathways, including mitogen-activated protein kinase (MAPK ) and the pathway mediated by the pro-survival pathway Akt (reviewed in Yarden and Pines, 2012). ErbB-3 can be activated by engagement of its ligands. These ligands include but are not limited to neuregulin (NRG) and Neuromodulatory factor (HRG), and NRG1 fusions.
各種NRG1融合基因係描述於Dhanasekaran等人Nature Communications 5:5893, 2014,以及Jonna等人,Clin Cancer Res August 15 2019, 25 (16) 4966-4972。Various NRG1 fusion gene lines are described in Dhanasekaran et al. Nature Communications 5:5893, 2014, and Jonna et al., Clin Cancer Res August 15 2019, 25 (16) 4966-4972.
本文所使用之術語「NRG1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。NRG1基因及同功型異形體已知有許多不同的別名,諸如:神經調節蛋白1 (Neuregulin 1);Pro-NRG1;HRGA;SMDF;HGL;GGF;NDF;NRG1內切轉錄2 (Intronic Transcript 2,不編碼蛋白質);Heregulin, α (45kD, ERBB2 P185-活化子);乙醯膽鹼受體誘導活性(Acetylcholine Receptor-Inducing Activity);原神經調節蛋白-1 (Pro-Neuregulin-1),為膜結合同功異形體;感覺及運動神經元衍生因子;新分化因子(Neu Differentiation Factor);神經膠質生長因子2;NRG1-IT2;MSTP131;MST131;ARIA;GGF2;HRG1;及HRG。NRG1基因的外部Id為HGNC: 7997;Entrez Gene: 3084;Ensembl: ENSG00000157168;OMIM: 142445及UniProtKB: Q02297。為了避免疑慮,帶有NRG1序列係指帶有來自諸如NCBI參考序列NM_001159999.3或NM_001159999.3之對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。較佳地,本揭露之NRG1的全多核苷酸序列係根據SEQ ID NO: 138轉錄,而經轉錄的NRG1多肽序列係根據SEQ ID NO: 152。The term "NRG1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. The NRG1 gene and its isoforms are known by many different aliases, such as: Neuregulin 1; Pro-NRG1; HRGA; SMDF; HGL; GGF; NDF; , does not encode a protein); Heregulin, α (45kD, ERBB2 P185-activator); Acetylcholine Receptor-Inducing Activity (Acetylcholine Receptor-Inducing Activity); Pro-Neuregulin-1 (Pro-Neuregulin-1), for Membrane-bound isoforms; sensory and motor neuron-derived factors; Neu Differentiation Factor; glial growth factor 2; NRG1-IT2; MSTP131; MST131; ARIA; GGF2; HRG1; and HRG. The external Ids of the NRG1 gene are HGNC: 7997; Entrez Gene: 3084; Ensembl: ENSG00000157168; OMIM: 142445 and UniProtKB: Q02297. For the avoidance of doubt, a sequence with NRG1 refers to any polynucleotide sequence, including coding sequence or any exon, but also any A polypeptide sequence, or any portion thereof. Preferably, the entire polynucleotide sequence of NRG1 disclosed herein is transcribed according to SEQ ID NO: 138, and the transcribed polypeptide sequence of NRG1 is transcribed according to SEQ ID NO: 152.
較佳地,該編碼NRG1蛋白序列或NRG1外顯子的多核苷酸融合物部分進一步包含或編碼NRG1的EGF樣結構域。於該融合物中之NRG1基因(例如NRG1基因的3’端)較佳地包含外顯子6、7或8的編碼序列。此結構域係由位於NRG1基因(例如外顯子6-8)3’處的部分編碼,且對於結合至ErbB-3而言是必要的。NRG1融合基因較佳地包含編碼NRG1的EGF樣結構域之核酸序列。本揭露之該等NRG1融合物較佳地在該融合物分子的3’端保有此EGF樣結構域的符合讀框編碼區。EGF樣結構域一般為約三十至四十個胺基酸殘基長之序列,其原型係發現於表皮生長因子(EGF)之序列[PMID: 2288911, PMID: 6334307, PMID: 1522591, PMID: 6607417, PMID: 3282918, PMID: 11498013]中。已知其係以大致保守的形式存在於其他眾多、主要為動物蛋白質之中。EGF樣結構域的共同特徵是,其等係發現於膜結合蛋白之細胞外結構域或已知會分泌的蛋白質中(例外:前列腺素G/H合成酶)。EGF樣結構域一般包括六個半胱胺酸殘基,已證明其等(在EGF中)涉及二硫化物鍵。主要結構為一個雙鏈β摺板,然後是一個環接到C端的短雙鏈摺板。保守半胱胺酸之間的次結構域長度不一。本揭露之例示性EGF樣結構域較佳地係由NCBI參考序列NM_001159999.3之外顯子6-8組成,更佳地由外顯子6及7組成,但本揭露是有關於NRG1的任何功能性EGF樣結構域。NRG1之EGF樣結構域因此較佳地包含例示性序列HLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASF (SEQ ID: 163),或與其有至少85%一致性、或與其有至少90%、92%、94%、95%、96%或甚至至少98%一致性的對偶基因變異體。Preferably, the polynucleotide fusion part encoding NRG1 protein sequence or NRG1 exon further comprises or encodes the EGF-like domain of NRG1. The NRG1 gene in the fusion (eg the 3' end of the NRG1 gene) preferably comprises the coding sequence of exon 6, 7 or 8. This domain is encoded by the portion located 3' of the NRG1 gene (eg, exons 6-8) and is necessary for binding to ErbB-3. The NRG1 fusion gene preferably comprises a nucleic acid sequence encoding the EGF-like domain of NRG1. The NRG1 fusions of the present disclosure preferably retain the in-frame coding region of the EGF-like domain at the 3' end of the fusion molecule. The EGF-like domain is generally a sequence of about thirty to forty amino acid residues long, and its prototype is found in the sequence of epidermal growth factor (EGF) [PMID: 2288911, PMID: 6334307, PMID: 1522591, PMID: 6607417, PMID: 3282918, PMID: 11498013]. It is known to exist in a roughly conserved form among many other, mainly animal, proteins. A common feature of EGF-like domains is that they are found in the extracellular domain of membrane-bound proteins or proteins known to be secreted (exception: prostaglandin G/H synthase). EGF-like domains generally include six cysteine residues, which (in EGF) have been shown to involve disulfide bonds. The main structure is a double-stranded beta flap followed by a short double-stranded flap looped to the C-terminus. The subdomains between conserved cysteines vary in length. The exemplary EGF-like domain of the present disclosure is preferably composed of exons 6-8 of NCBI reference sequence NM_001159999.3, more preferably exons 6 and 7, but the present disclosure is not related to any aspect of NRG1 Functional EGF-like domain. The EGF-like domain of NRG1 therefore preferably comprises, or is at least 85% identical to, or at least 90%, 92%, 94%, 95%, 96% or Even allele variants with at least 98% identity.
本揭露之任一NRG1融合多核苷酸較佳地係至少包含VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1的5’部分來作為融合配偶體,以與NRG1核酸序列的3’部分融合。經轉譯的多肽融合物包含該NRG1融合多肽的游離C端以及該融合配偶體的游離N端。為了在分子融合物層級反應出此配向,係先提及NRG1的任一融合配偶體,接著是作為其融合配偶體的NRG1。 VAPB Any NRG1 fusion polynucleotide disclosed herein preferably at least comprises VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, The 5' part of ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1 was used as a fusion partner to fuse with the 3' part of the NRG1 nucleic acid sequence. The translated polypeptide fusion comprises the free C-terminus of the NRG1 fusion polypeptide and the free N-terminus of the fusion partner. To reflect this alignment at the molecular fusion level, either fusion partner of NRG1 is mentioned first, followed by NRG1 as its fusion partner. VAPB
VAPB或囊泡相關膜蛋白之相關蛋白B/C已知有許多不同名稱,諸如VAMP相關蛋白B及C;VAMP(囊泡相關膜蛋白)相關蛋白B及C;VAP-B;VAPB;ALS8;VAMP相關33 KDa蛋白;VAMP相關蛋白B/C;VAMP-B/VAMP-C;VAP-B/VAP-C;VAMP-B及VAP-C。VAPB基因的外部Id為HGNC: 12649;Entrez Gene: 9217;Ensembl: ENSG00000124164;OMIM: 605704;及UniProtKB: O95292。本文所使用之術語「VAPB」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有VAPB序列係指帶有來自諸如NCBI參考序列NM_004738.4或其任一對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CADM1 VAPB or vesicle-associated membrane protein-associated protein B/C are known by many different names, such as VAMP-associated protein B and C; VAMP (vesicle-associated membrane protein)-associated protein B and C; VAP-B; VAPB; ALS8; VAMP-related 33 KDa protein; VAMP-related protein B/C; VAMP-B/VAMP-C; VAP-B/VAP-C; VAMP-B and VAP-C. The external Ids of the VAPB gene are HGNC: 12649; Entrez Gene: 9217; Ensembl: ENSG00000124164; OMIM: 605704; and UniProtKB: 095292. The term "VAPB" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a VAPB sequence refers to any polynucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_004738.4 or any allogenic variant thereof sequence, or any part thereof. CADM1
CADM1或細胞黏附分子1已知有許多不同的名稱,諸如肺癌腫瘤抑制因子1;TSLC1;TSLC-1;生精免疫球蛋白超家族;SgIgSF;SgIGSF;免疫球蛋白超家族成員4;IGSF4;IgSF4;IGSF4A;突觸細胞黏附分子;SynCAM1;SYNCAM1;SynCAM;SYNCAM;Nectin樣蛋白2;NECL2;NECL-2;Nectin樣2;Necl-2;RA175;ST17;BL2;免疫球蛋白超家族之成員4D變異體1;免疫球蛋白超家族之成員4D變異體2;免疫球蛋白超家族之成員4;TSLC1/Nectin樣2/IGSF4;截斷的CADM1蛋白1及STSLC-1。CADM1的外部Id為HGNC: 5951;Entrez Gene: 23705;Ensembl: ENSG00000182985;OMIM: 605686;及UniProtKB: Q9BY67。本文所使用之術語「CADM1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有CADM1序列係指帶有來自諸如NCBI參考序列NM_001301045.1或其任一對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CD44 CADM1 or cell adhesion molecule 1 is known by many different names such as lung cancer tumor suppressor 1; TSLC1; TSLC-1; spermatogenic immunoglobulin superfamily; SgIgSF; SgIGSF; immunoglobulin superfamily member 4; IGSF4; IgSF4 ; IGSF4A; Synaptic cell adhesion molecule; SynCAM1; SYNCAM1; SynCAM; SYNCAM; Nectin-like protein 2; NECL2; NECL-2; Nectin-like 2; Necl-2; RA175; ST17; BL2; Variant 1; member of the immunoglobulin superfamily 4D variant 2; member of the immunoglobulin superfamily 4; TSLC1/Nectin-like 2/IGSF4; truncated CADM1 protein 1 and STSLC-1. The external Ids of CADM1 are HGNC: 5951; Entrez Gene: 23705; Ensembl: ENSG00000182985; OMIM: 605686; and UniProtKB: Q9BY67. The term "CADM1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a CADM1 sequence means carrying any polynucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI Reference Sequence NM_001301045.1 or any allogenic variant thereof sequence, or any part thereof. CD44
CD44已知有許多不同的名稱,諸如CD44分子(印度血型);造血細胞E-及L-選滯蛋白配體;GP90淋巴細胞歸巢/黏附受體;硫酸軟骨素蛋白聚醣;細胞外基質受體III;硫酸軟骨素蛋白聚醣;吞噬細胞醣蛋白1;玻尿酸受體;赫爾墨斯抗原(Hermes Antigen);CD44抗原;ECMR-III;HUTCH-I;Epican;MDU2;MDU3;MIC4;LHR;CD44抗原(歸巢功能及印度血型系統);歸巢功能及印度血型系統;細胞表面醣蛋白CD44;印度血型抗原;吞噬細胞醣蛋白I;可溶性CD44;CDW44;CSPG8;HCELL;CDw44;PGP-1;MC56;Pgp1;及In。CD44基因的外部Id為HGNC: 1681;Entrez Gene: 960;Ensembl: ENSG00000026508;OMIM: 107269;及UniProtKB: P16070。本文所使用之術語「CD44」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有NRG1序列係指帶有來自諸如CD44參考序列NM_000610.4或其任一對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 SLC3A2 CD44 is known by many different names such as CD44 molecule (Indian blood group); hematopoietic E- and L-selectin ligand; GP90 lymphocyte homing/adhesion receptor; chondroitin sulfate proteoglycan; extracellular matrix Receptor III; Chondroitin Sulfate Proteoglycan; Phagocyte Glycoprotein 1; Hyaluronic Acid Receptor; Hermes Antigen; CD44 Antigen; ECMR-III; HUTCH-I; Epican; MDU2; MDU3; MIC4; LHR; CD44 antigen (homing function and Indian blood group system); homing function and Indian blood group system; cell surface glycoprotein CD44; Indian blood group antigen; phagocyte glycoprotein I; soluble CD44; CDW44; CSPG8; HCELL; CDw44; PGP -1; MC56; Pgpl; and In. The external Ids of the CD44 gene are HGNC: 1681; Entrez Gene: 960; Ensembl: ENSG00000026508; OMIM: 107269; and UniProtKB: P16070. The term "CD44" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying NRG1 sequence refers to carrying any polynucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as CD44 reference sequence NM_000610. sequence, or any part thereof. SLC3A2
SLC3A2或溶質載體家族3成員2已知有許多不同的名稱,諸如淋巴細胞活化抗原4F2大次單元;溶質載體家族3(二元(Dibasic)及中性胺基酸轉運的活化子),成員2;由單株抗體4F2、TRA1.10、TROP4及T43所識別的抗原;溶質載體家族3(胺基酸轉運蛋白重鏈),成員2;4F2細胞表面抗原重鏈;CD98重鏈;4F2HC;MDU1;由單株抗體4F2定義的抗原,重鏈;由單株抗體4F2定義的抗原;4F2重鏈抗原;4F2重鏈;CD98抗原;CD98HC;4T2HC;NACAE;CD98及4F2。SLC3A2的外部Id為HGNC: 11026;Entrez Gene: 6520;Ensembl: ENSG00000168003;OMIM: 158070;及UniProtKB: P08195。本文所使用之術語「SLC3A2」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有SLC3A2轉錄本6序列係指帶有來自諸如NCBI參考序列NM_001013251.3或其任一對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。而帶有SLC3A2轉錄本3序列係指帶有來自諸如NCBI參考序列NM_001013251.3或其任一對偶基因變異體的任何多核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 VTCN1 SLC3A2 or solute carrier family 3 member 2 is known by many different names, such as lymphocyte activation antigen 4F2 large subunit; solute carrier family 3 (activators of dibasic and neutral amino acid transport), member 2 ; antigen recognized by monoclonal antibodies 4F2, TRA1.10, TROP4, and T43; solute carrier family 3 (amino acid transporter heavy chain), member 2; 4F2 cell surface antigen heavy chain; CD98 heavy chain; 4F2HC; MDU1 Antigen defined by monoclonal antibody 4F2, heavy chain; Antigen defined by monoclonal antibody 4F2; 4F2 heavy chain antigen; 4F2 heavy chain; CD98 antigen; CD98HC; 4T2HC; NACAE; CD98 and 4F2. The external Ids of SLC3A2 are HGNC: 11026; Entrez Gene: 6520; Ensembl: ENSG00000168003; OMIM: 158070; and UniProtKB: P08195. The term "SLC3A2" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a sequence with SLC3A2 transcript 6 refers to any polynucleotide sequence, including coding sequence or any exon, from such as NCBI reference sequence NM_001013251.3 or any of its allele variants, but also Include any polypeptide sequence, or any portion thereof. And with SLC3A2 transcript 3 sequence refers to any polynucleotide sequence from such as NCBI reference sequence NM_001013251.3 or any of its allele variants, including coding sequence or any exon, but also includes any polypeptide sequence, or any part thereof. VTCN1
VTCN1已知有許多不同的名稱,諸如含有T細胞活化抑制劑1的V-Set結構域;B7-H4;B7H4;免疫共刺激蛋白B7-H4;B7超級家族成員1;B7家族成員,H4;B7同源物4;B7h.5;B7S1;T細胞共刺激分子B7x;蛋白質B7S1;FLJ22418;PRO1291;VCTN1;VTCN1及B7X。VTCN1的外部Id為HGNC: 28873;NCBI Entrez Gene: 79679;Ensembl: ENSG00000134258;OMIM®: 608162及UniProtKB/Swiss-Prot: Q7Z7D3。本文所使用之術語「VTCN1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有VTCN1序列係指帶有來自諸如NCBI參考序列NM_024626.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CDH1 VTCN1 is known by many different names, such as V-Set domain containing inhibitor of T cell activation 1; B7-H4; B7H4; immune co-stimulatory protein B7-H4; B7 superfamily member 1; B7 family member, H4; B7 homologue 4; B7h.5; B7S1; T cell co-stimulatory molecule B7x; protein B7S1; The external Ids of VTCN1 are HGNC: 28873; NCBI Entrez Gene: 79679; Ensembl: ENSG00000134258; OMIM®: 608162 and UniProtKB/Swiss-Prot: Q7Z7D3. The term "VTCN1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a VTCN1 sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI reference sequence NM_024626. sequence, or any part thereof. CDH1
CDH1或鈣黏蛋白1已知有許多不同的名稱,諸如桑椹胚黏著蛋白(Uvomorulin);鈣黏蛋白1,第1型,E-鈣黏蛋白(上皮);上皮鈣黏蛋白;E-鈣黏蛋白;鈣黏蛋白-1;CAM 120/80;CD324;CDHE;UVO;鈣依賴性黏附蛋白,上皮;副睪分泌精子結合蛋白;鈣黏蛋白1,E-鈣黏蛋白(上皮);細胞-CAM 120/80;CD324抗原;E-鈣黏蛋白;Arc-1;BCDS1;ECAD;及LCAM。CDH1的外部Id為HGNC: 1748;NCBI Entrez Gene: 999;Ensembl: ENSG00000039068;OMIM®: 192090及UniProtKB/Swiss-Prot: P12830。本文所使用之術語「CDH1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有CDH1序列係指帶有來自諸如NCBI參考序列NM_001317185.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CXADR CDH1 or Cadherin 1 is known by many different names such as Uvomorulin; Cadherin 1, type 1, E-cadherin (epithelial); Epithelial-cadherin; E-cadherin Protein; Cadherin-1; CAM 120/80; CD324; CDHE; UVO; CAM 120/80; CD324 antigen; E-cadherin; Arc-1; BCDS1; ECAD; and LCAM. The external Ids of CDH1 are HGNC: 1748; NCBI Entrez Gene: 999; Ensembl: ENSG00000039068; OMIM®: 192090 and UniProtKB/Swiss-Prot: P12830. The term "CDH1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a CDH1 sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_001317185.2 or any allogenic variant sequence, or any part thereof. CXADR
CXADR或柯薩奇病毒及腺病毒受體是B組柯薩奇病毒及C亞組腺病毒的第I型膜受體。已發現到此基因的若干編碼不同同功型異形體之轉錄變異體。與CXADR相關的疾病包括心肌炎及角膜結膜炎。其相關途徑包括黏附及同種異體移植排斥。它是上皮細胞頂部接合複合物的一個組成部分,可作用為嗜同性細胞黏附分子,對細胞緊密連接的完整性至關重要。其也涉及通過與白血球膜質膜的跨膜蛋白JAML黏附交互作用所進行的白血球跨上皮細胞遷移。這兩種受體之間的交互作用也介導了駐留在上皮細胞中的一種T細胞次族群γ-δ T細胞的活化,並涉及組織內環境穩定與修復。在上皮細胞CXADR一結合時,JAML會通過PI3激酶及MAP激酶來介導γ-δ T細胞中的下游細胞信號傳導事件。這藉由T細胞相應地刺激上皮細胞組織修復而造成細胞介素及生長因子的增生與生成。CXADR的外部Id包括HGNC: 2559 NCBI;Entrez Gene: 1525;Ensembl: ENSG00000154639;OMIM®: 602621;UniProtKB/Swiss-Prot: P78310。本文所使用之術語「CXADR」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有CXADR序列係指帶有來自諸如NCBI參考序列NM_001207063.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 GTF2E2 CXADR, or coxsackievirus and adenovirus receptor, is a type I membrane receptor for group B coxsackieviruses and subgroup C adenoviruses. Several transcript variants encoding different isoforms of this gene have been found. Diseases associated with CXADR include myocarditis and keratoconjunctivitis. Associated pathways include adhesion and allograft rejection. It is a component of the apical junctional complex of epithelial cells, functions as a homophilic cell adhesion molecule, and is critical for the integrity of cellular tight junctions. It is also involved in the migration of leukocytes across epithelial cells through adhesive interactions with the transmembrane protein JAML of the leukocyte membrane plasma membrane. Interactions between these two receptors also mediate the activation of γ-δ T cells, an epithelial-resident T cell subpopulation, and are involved in tissue homeostasis and repair. Upon CXADR binding in epithelial cells, JAML mediates downstream cell signaling events in γ-δ T cells through PI3 kinase and MAP kinase. This stimulates the proliferation and production of cytokines and growth factors by T cells which in turn stimulate epithelial cell tissue repair. External Ids of CXADR include HGNC: 2559 NCBI; Entrez Gene: 1525; Ensembl: ENSG00000154639; OMIM®: 602621; UniProtKB/Swiss-Prot: P78310. The term "CXADR" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a sequence with CXADR means any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_001207063.2 or any of its allele variants sequence, or any part thereof. GTF2E2
GTF2E2或共用轉錄因子IIE (TFIIE)是RNA聚合酶II轉錄起始複合體的一部分,其吸收TFIIH且對RNA聚合酶II清除啟動子是絕對必要的。TFIIE是一種含α及β次單元的雜二聚體(且有時為雜四聚體)。此基因所編碼的蛋白質表現了TFIIE的β次單元。與GTF2E2相關的疾病包括髮硫發育障礙症6、非光敏性髮硫發育障礙症。其相關途徑有滑液膜纖維母細胞中之細胞凋亡路徑以及巨噬細胞中的CCR5路徑。GTF2E2的外部Id包括HGNC: 4651;NCBI Entrez Gene: 2961;Ensembl: ENSG00000197265;OMIM®: 189964;UniProtKB/Swiss-Prot: P29084。本文所使用之術語「GTF2E2」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有GTF2E2序列係指帶有來自諸如NCBI參考序列NM_002095.6或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CSMD1 GTF2E2 or shared transcription factor IIE (TFIIE) is part of the RNA polymerase II transcription initiation complex that takes up TFIIH and is absolutely essential for RNA polymerase II to clear the promoter. TFIIE is a heterodimer (and sometimes a heterotetramer) containing α and β subunits. The protein encoded by this gene expresses the beta subunit of TFIIE. Diseases associated with GTF2E2 include sulfodysplasia6, non-photosensitive sulfodysplasia. The related pathways are the apoptotic pathway in synovial fibroblasts and the CCR5 pathway in macrophages. External Ids of GTF2E2 include HGNC: 4651; NCBI Entrez Gene: 2961; Ensembl: ENSG00000197265; OMIM®: 189964; UniProtKB/Swiss-Prot: P29084. The term "GTF2E2" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a GTF2E2 sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_002095.6 or any of its allele variants sequence, or any part thereof. CSMD1
CSMD1(或CUB與Sushi多重功能域1)是一種與包括了自閉症及思覺失調症之疾病相關的蛋白質。CSMD1的外部Id包括HGNC: 14026;NCBI Entrez Gene: 64478;Ensembl: ENSG00000183117;OMIM®: 608397;UniProtKB/Swiss-Prot: Q96PZ7。本文所使用之術語「CSMD1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有CSMD1序列係指帶有來自諸如NCBI參考序列NM_033225.6或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 PTN CSMD1 (or CUB and Sushi Multiple Domain 1) is a protein associated with disorders including autism and schizophrenia. External Ids of CSMD1 include HGNC: 14026; NCBI Entrez Gene: 64478; Ensembl: ENSG00000183117; OMIM®: 608397; UniProtKB/Swiss-Prot: Q96PZ7. The term "CSMD1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a sequence with CSMD1 means any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_033225.6 or any of its allele variants sequence, or any part thereof. PTN
多效生長因子(Pleiotrophin或PTN)蛋白質是一種分泌型肝素結合生長因子。該蛋白質在細胞生長與存活、細胞遷移、血管生成以及腫瘤生成中具有重要的作用。PTN (Pleiotrophin)是一種蛋白編碼基因。與PTN相關的疾病包括佩洛尼氏病(Peyronie's Disease)及鼻腔嗅神經母細胞瘤。其相關路徑有滑液膜纖維母細胞中的GPCR路徑及細胞凋亡路徑。PTN的外部Id包括HGNC: 9630;NCBI Entrez Gene: 5764;Ensembl: ENSG00000105894;OMIM®: 162095;UniProtKB/Swiss-Prot: P21246。本文所使用之術語「PTN」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有PTN序列係指帶有來自諸如NCBI參考序列NM_001321386.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 ST14 Pleiotrophin (PTN) protein is a secreted heparin-binding growth factor. This protein plays an important role in cell growth and survival, cell migration, angiogenesis, and tumorigenesis. PTN (Pleiotrophin) is a protein-coding gene. Diseases associated with PTN include Peyronie's Disease and nasal olfactory neuroblastoma. The related pathways include GPCR pathway and apoptosis pathway in synovial membrane fibroblasts. External Ids of PTN include HGNC: 9630; NCBI Entrez Gene: 5764; Ensembl: ENSG00000105894; OMIM®: 162095; UniProtKB/Swiss-Prot: P21246. The term "PTN" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a PTN sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_001321386.2 or any allogenic variant thereof sequence, or any part thereof. ST14
ST14或ST14跨膜絲胺酸蛋白酶Matriptase是一種上皮細胞衍生的膜主體蛋白絲胺酸蛋白酶。此蛋白酶與Kunitz型絲胺酸蛋白酶抑制劑HAI-1形成複合物,且被發現到會被神經鞘胺醇1-磷酸鹽活化。此蛋白酶已被證明可裂解及活化肝細胞生長因子/分射因子,以及尿激酶纖溶酶原激活酶,這表明此蛋白酶作為其他蛋白酶與潛在生長因子之上皮細胞膜活化子的功能。與ST14相關的疾病包括不同類型的魚鱗病。其相關路徑包括發育生物學及黏附。ST14的外部Id包括HGNC: 11344;NCBI Entrez Gene: 6768;Ensembl: ENSG00000149418;OMIM®: 606797;UniProtKB/Swiss-Prot: Q9Y5Y6。本文所使用之術語「ST14」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有ST14序列係指帶有來自諸如NCBI參考序列NM_021978.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 THBS1 ST14 or ST14 transmembrane serine protease Matriptase is an epithelial cell-derived membrane host protein serine protease. This protease forms a complex with the Kunitz-type serine protease inhibitor HAI-1 and was found to be activated by sphingosine 1-phosphate. This protease has been shown to cleave and activate hepatocyte growth factor/separator, as well as urokinase plasminogen activator, suggesting a function of this protease as an epithelial membrane activator of other proteases and potential growth factors. Diseases associated with ST14 include different types of ichthyosis. Related pathways include developmental biology and adhesion. External Ids of ST14 include HGNC: 11344; NCBI Entrez Gene: 6768; Ensembl: ENSG00000149418; OMIM®: 606797; UniProtKB/Swiss-Prot: Q9Y5Y6. The term "ST14" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a sequence with ST14 means any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_021978.4 or any of its allele variants sequence, or any part thereof. THBS1
THBS1或血小板反應素1是一種雙硫鍵連接的同源三聚體蛋白次單元。此蛋白質是一種黏附性醣蛋白並介導細胞與細胞的交互作用以及細胞與基質的交互作用。此蛋白可結合至纖維蛋白原、纖維連接蛋白、層黏蛋白、V型膠原蛋白、及整聯蛋白α-V/β-1。此蛋白已被證明在血小板凝集、血管生成以及腫瘤生成中扮演重要角色。與THBS1相關的疾病包括血栓形成性血小板減少性紫瘢病及Peters-Plus症候群。其相關路徑有癌症中的蛋白多醣及胞外基質的降解。與此基因相關的基因本體(GO)標記包括鈣離子結合及肝素結合。THBS1的外部Id包括HGNC: 11785;NCBI Entrez Gene: 7057;Ensembl: ENSG00000137801;OMIM®: 188060;UniProtKB/Swiss-Prot: P07996。本文所使用之術語「THBS1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有THBS1序列係指帶有來自諸如NCBI參考序列NM_003246.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 AGRN THBS1 or Thrombospondin 1 is a disulfide-linked homotrimeric protein subunit. This protein is an adhesion glycoprotein and mediates cell-cell and cell-matrix interactions. This protein can bind to fibrinogen, fibronectin, laminin, collagen type V, and integrin α-V/β-1. This protein has been shown to play an important role in platelet aggregation, angiogenesis, and tumorigenesis. Diseases associated with THBS1 include thrombotic thrombocytopenic purpura and Peters-Plus syndrome. The related pathways are proteoglycan and extracellular matrix degradation in cancer. Gene Ontology (GO) markers associated with this gene include calcium binding and heparin binding. External Ids of THBS1 include HGNC: 11785; NCBI Entrez Gene: 7057; Ensembl: ENSG00000137801; OMIM®: 188060; UniProtKB/Swiss-Prot: P07996. The term "THBS1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a THBS1 sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI reference sequence NM_003246.4 or any allele variant sequence, or any part thereof. AGRN
AGRN或聚集蛋白(Agrin)是神經肌肉接合發育中重要的幾種蛋白質之一。所編碼的蛋白質包含數個層黏蛋白G、Kazal型絲胺酸蛋白抑制劑、及表皮生長因子結構域。其他的轉譯後修飾出現在添加葡萄糖胺聚醣及雙硫鍵。在與影響肢帶肌的先天性肌無力症候群有關的一個基因家族中發現到此基因的突變。與AGRN相關的疾病包括肌無力症候群、先天性、8及突觸前先天性肌無力症候群。其相關路徑有在神經肌肉接合、血腦障壁、及免疫細胞移位處的Agrin交互作用。AGRN的外部Id包括HGNC: 329;NCBI Entrez Gene: 375790;Ensembl: ENSG00000188157;OMIM®: 103320;UniProtKB/Swiss-Prot: O00468。本文所使用之術語「AGRN」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有AGRN序列係指帶有來自諸如NCBI參考序列NM_001305275.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 PVALB AGRN, or Agrin, is one of several proteins important in the development of the neuromuscular junction. The encoded protein contains several laminin G, Kazal-type serine protein inhibitors, and epidermal growth factor domains. Other post-translational modifications occur in the addition of glycosaminoglycans and disulfide bonds. Mutations in this gene have been found in a family of genes associated with congenital myasthenic syndromes affecting the limb-girdle muscles. Disorders associated with AGRN include myasthenic syndrome, congenital, 8, and presynaptic congenital myasthenic syndrome. Related pathways are Agrin interactions at the neuromuscular junction, the blood-brain barrier, and translocation of immune cells. External Ids of AGRN include HGNC: 329; NCBI Entrez Gene: 375790; Ensembl: ENSG00000188157; OMIM®: 103320; UniProtKB/Swiss-Prot: O00468. The term "AGRN" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having an AGRN sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_001305275.2 or any of its allele variants sequence, or any part thereof. PVALB
PVALB或微小白蛋白(Parvalbumin)是一種高親合性鈣離子結合蛋白,其在結構上與作用上與鈣調素及旋光素C相似。所編碼的蛋白被認為涉及到肌肉放鬆。與PVALB相關的疾病包括魚類過敏及胎兒酒精症候群。與此基因相關的基因本體標記包括鈣離子結合及蛋白雜二聚化活性。PVALB的外部Id包括HGNC: 9704;NCBI Entrez Gene: 5816;Ensembl: ENSG00000100362;OMIM®: 168890;UniProtKB/Swiss-Prot: P20472。本文所使用之術語「PVALB」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有PVALB序列係指帶有來自諸如NCBI參考序列NM_002854.3或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 APP PVALB or Parvalbumin is a high-affinity calcium ion-binding protein, which is similar to calmodulin and optic C in structure and function. The encoded protein is thought to be involved in muscle relaxation. Disorders associated with PVALB include fish allergy and fetal alcohol syndrome. Gene ontology markers associated with this gene include calcium ion binding and protein heterodimerization activity. External Ids for PVALB include HGNC: 9704; NCBI Entrez Gene: 5816; Ensembl: ENSG00000100362; OMIM®: 168890; UniProtKB/Swiss-Prot: P20472. The term "PVALB" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a PVALB sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_002854.3 or any of its allele variants sequence, or any part thereof. app
APP已知有許多不同的名稱,諸如類澱粉蛋白β前驅蛋白、α-SAPP、AD1、阿茲海默病類澱粉蛋白A4蛋白同源物、類澱粉蛋白β (A4)前驅蛋白、阿茲海默病類澱粉蛋白、腦血管類澱粉蛋白肽、類澱粉蛋白β前驅蛋白、類澱粉蛋白前驅蛋白、肽酶連接蛋白(Nexin)-II、蛋白酶連接蛋白-II、PN-II、PreA4、ABPP、APPI、CVAP、β-類澱粉蛋白前驅蛋白、睪丸組織蛋白Li 2、β-類澱粉蛋白肽(1-40)、β-類澱粉蛋白肽(1-42)、類澱粉蛋白β-A4蛋白、β-類澱粉蛋白肽、阿茲海默病、CTFγ、ABETA、AAA、PN2及A4。APP基因的外部ID是HGNC:620;Entrez Gene:351;Ensembl:ENSG00000142192;OMIM®:104760及UniProtKB/Swiss-Prot:P05067。APP基因的外部Id為HGNC: 620;Entrez Gene: 351;Ensembl: ENSG00000142192;OMIM®: 104760及UniProtKB/Swiss-Prot: P05067。本文所使用之術語「APP」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有APP序列係指帶有來自諸如NCBI參考序列NM_001136130.3或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 WRN APP is known by many different names such as amyloid beta precursor protein, alpha-SAPP, AD1, Alzheimer's disease amyloid A4 protein homologue, amyloid beta (A4) precursor protein, Alzheimer's disease Silent disease amyloid, cerebrovascular amyloid peptide, amyloid β precursor protein, amyloid precursor protein, peptidase-linked protein (Nexin)-II, protease-linked protein-II, PN-II, PreA4, ABPP, APPI, CVAP, β-amyloid precursor protein, testicular tissue protein Li 2, β-amyloid peptide (1-40), β-amyloid peptide (1-42), amyloid β-A4 protein, β-amyloid peptide, Alzheimer's disease, CTFγ, ABETA, AAA, PN2 and A4. The external IDs of the APP gene are HGNC: 620; Entrez Gene: 351; Ensembl: ENSG00000142192; OMIM®: 104760 and UniProtKB/Swiss-Prot: P05067. The external Ids of the APP gene are HGNC: 620; Entrez Gene: 351; Ensembl: ENSG00000142192; OMIM®: 104760 and UniProtKB/Swiss-Prot: P05067. The term "APP" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having an APP sequence means having any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI Reference Sequence NM_001136130.3 or any allogenic variant thereof sequence, or any part thereof. WRN
WRN或維爾納氏症候群(Werner Syndrome) ATP依賴性解旋酶已知有許多不同的名稱,諸如WRN RecQ樣解旋酶;RECQL2;RECQ3;維爾納氏症候群RecQ樣解旋酶;DNA解旋酶、RecQ樣第3型;RecQ蛋白樣2;核酸外切酶WRN;維爾納氏症候群,RecQ解旋酶樣;維爾納氏症候群;EC 3.6.4.12;RECQL3;及RecQ3。WRN基因的外部ID為HGNC: 12791;NCBI Entrez Gene: 7486;Ensembl: ENSG00000165392;OMIM®: 604611;及UniProtKB/Swiss-Prot: Q14191。本文所使用之術語「WRN」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有WRN序列係指帶有來自諸如NCBI參考序列NM_000553.6或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 DAAM1 WRN or Werner Syndrome ATP-dependent helicase is known by many different names such as WRN RecQ-like helicase; RECQL2; RECQ3; Werner syndrome RecQ-like helicase; DNA helicase , RecQ-like type 3; RecQ protein-like 2; exonuclease WRN; Werner syndrome, RecQ helicase-like; Werner syndrome; EC 3.6.4.12; RECQL3; and RecQ3. The external IDs of the WRN gene are HGNC: 12791; NCBI Entrez Gene: 7486; Ensembl: ENSG00000165392; OMIM®: 604611; and UniProtKB/Swiss-Prot: Q14191. The term "WRN" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a WRN sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_000553.6 or any allele variant sequence, or any part thereof. DAAM1
DAAM1或形態發生紊亂相關活化子1已知有許多不同的名稱,諸如KIAA0666;及形態發生紊亂相關活化子1。DAAM1基因的外部Id為HGNC: 18142;NCBI Entrez Gene: 23002;Ensembl: ENSG00000100592;OMIM®: 606626;及UniProtKB/Swiss-Prot: Q9Y4D1。本文所使用之術語「DAAM1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有DAAM1序列係指帶有來自諸如NCBI參考序列NM_001270520.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 ASPH DAAM1 or Morphogenesis Disorder-Associated Activator 1 is known by many different names such as KIAA0666; and Morphogenesis Disorder-Associated Activator 1 . The external Ids of the DAAM1 gene are HGNC: 18142; NCBI Entrez Gene: 23002; Ensembl: ENSG00000100592; OMIM®: 606626; and UniProtKB/Swiss-Prot: Q9Y4D1. The term "DAAM1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a DAAM1 sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_001270520.2 or any allogenic variant sequence, or any part thereof. ASPH
ASPH或天門冬胺酸β-羥化酶已知有許多不同的名稱,諸如BAH、CASQ2BP1、JCTN、HAAH、天門冬胺酸/天冬醯胺基β-羥化酶、肽-天門冬胺酸β-二氧酶、ASP β-羥化酶、Junctate、Junctin、Humbug、Cardiac Junctin、EC 1.14.11.16、A β H-J-J、FDLAB及AAH。ASPH基因的外部Id為HGNC: 757;Entrez Gene: 444;Ensembl: ENSG00000198363;OMIM®: 600582及UniProtKB/Swiss-Prot: Q12797。本文所使用之術語「ASPH」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有ASPH序列係指帶有來自諸如NCBI參考序列NM_001164750.2或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 NOTCH2 ASPH or Aspartate β-Hydroxylase is known by many different names such as BAH, CASQ2BP1, JCTN, HAAH, Aspartate/Asparagine-based β-Hydroxylase, Peptide-Aspartate β-Dioxygenase, ASP β-Hydroxylase, Junctate, Junctin, Humbug, Cardiac Junctin, EC 1.14.11.16, AβHJJ, FDLAB, and AAH. The external Ids of the ASPH gene are HGNC: 757; Entrez Gene: 444; Ensembl: ENSG00000198363; OMIM®: 600582 and UniProtKB/Swiss-Prot: Q12797. The term "ASPH" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, bearing an ASPH sequence means bearing any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide sequence, or any part thereof. NOTCH2
NOTCH2或Notch 受體2已知有許多不同的名稱,諸如Notch 2;神經源性位點缺口同源蛋白2;HN2;Notch(果蠅)同源物2;Notch同源物2(果蠅);Notch同源物2;HJCYS;及AGS2。NOTCH2基因的外部Id為HGNC: 7882;NCBI Entrez Gene: 4853;Ensembl: ENSG00000134250;OMIM®: 600275;及UniProtKB/Swiss-Prot: Q04721。本文所使用之術語「NOTCH2」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有NOTCH2序列係指帶有來自諸如NCBI參考序列NM_024408.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 CD74 NOTCH2 or Notch receptor 2 is known by many different names such as Notch 2; Neurogenic site Notch homolog 2; HN2; Notch (Drosophila) homolog 2; Notch homolog 2 (Drosophila) ; Notch homolog 2; HJCYS; and AGS2. The external Ids of the NOTCH2 gene are HGNC: 7882; NCBI Entrez Gene: 4853; Ensembl: ENSG00000134250; OMIM®: 600275; and UniProtKB/Swiss-Prot: Q04721. The term "NOTCH2" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, a NOTCH2 sequence refers to any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as the NCBI reference sequence NM_024408.4 or any of its allele variants sequence, or any part thereof. CD74
CD74已知有許多不同的名稱,諸如CD74分子;DHLAG;CD74分子,主要組織相容性複合物,II類不變鏈;HLA II類組織相容性抗原γ鏈;II類MHC相關不變鏈肽;HLA-DR抗原相關不變鏈;II類抗原的γ鏈;Ia相關不變鏈;MHC HLA-DR γ鏈;HLA-DR-γ;CLIP;Ia抗原相關不變鏈;CD74抗原;Ia-γ;HLADG;P33;II;及Ii;CD74基因的外部Id為HGNC: 1697;NCBI Entrez Gene: 972;Ensembl: ENSG00000019582;OMIM®: 142790;及UniProtKB/Swiss-Prot: P04233。本文所使用之術語「CD74」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有CD74序列係指帶有來自諸如NCBI參考序列NM_001025159.3或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 SDC4 CD74 is known by many different names such as CD74 molecule; DHLAG; CD74 molecule, major histocompatibility complex, class II invariant chain; HLA class II histocompatibility antigen gamma chain; MHC class II associated invariant chain Peptide; HLA-DR antigen-associated invariant chain; gamma chain of class II antigen; Ia-associated invariant chain; MHC HLA-DR gamma chain; HLA-DR-γ; CLIP; Ia antigen-associated invariant chain; CD74 antigen; Ia -γ; HLADG; P33; II; The term "CD74" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a CD74 sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_001025159.3 or any allogenic variant sequence, or any part thereof. SDC4
SDC4或多配體聚醣4 (Syndecan 4)已知有許多不同的名稱,諸如雙棲蛋白聚醣(Amphiglycan);SYND4;多配體聚醣4 (Amphiglycan, Ryudocan);多配體聚醣蛋白聚醣4;Ryudocan核心蛋白;多配體聚醣-4;Ryudocan;及Ryudocan雙棲蛋白聚醣。SDC4基因的外部Id為HGNC: 10661;NCBI Entrez Gene: 6385;Ensembl: ENSG00000124145;OMIM®: 600017;及UniProtKB/Swiss-Prot: P31431。本文所使用之術語「SDC4」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有SDC4序列係指帶有來自諸如NCBI參考序列NM_002999.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 SLC4A4 SDC4 or Syndecan 4 is known by many different names such as Amphiglycan; SYND4; Amphiglycan, Ryudocan; Syndecan protein Glycan 4; Ryudocan core protein; Syndecan-4; Ryudocan; and Ryudocan amphiphilic proteoglycan. The external Ids of the SDC4 gene are HGNC: 10661; NCBI Entrez Gene: 6385; Ensembl: ENSG00000124145; OMIM®: 600017; and UniProtKB/Swiss-Prot: P31431. The term "SDC4" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a SDC4 sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI reference sequence NM_002999.4 or any of its allele variants sequence, or any part thereof. SLC4A4
SLC4A4或溶質載體家族4成員4已知有許多不同的名稱,諸如NBC1;HNBC1;HhNMC;NBC2;PNBC;溶質載體家族4(碳酸氫鈉共轉運蛋白),成員4;產電位碳酸氫鈉共轉運蛋白1;Na(+)/HCO3(-)共轉運蛋白;SLC4A5;KNBC1;碳酸氫鈉共轉運蛋白1(碳酸氫鈉共轉運蛋白,腎臟;碳酸氫鈉共轉運蛋白,胰臟);溶質載體家族4,碳酸氫鈉共轉運蛋白,成員4,腦型;溶質載體家族4,碳酸氫鈉共轉運蛋白,成員4;溶質載體家族4,碳酸氫鈉共轉運蛋白,成員5;碳酸氫鈉共轉運蛋白;NBCe1-A;NBCE1;KNBC;及NBC。SLC4A4基因的外部Id為HGNC: 11030;NCBI Entrez Gene: 8671;Ensembl: ENSG00000080493;OMIM®: 603345;及UniProtKB/Swiss-Prot: Q9Y6R1。本文所使用之術語「SLC4A4」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有SLC4A4序列係指帶有來自諸如NCBI參考序列NM_001098484.3或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 ZFAT SLC4A4 or solute carrier family 4 member 4 is known by many different names such as NBC1; HNBC1; HhNMC; NBC2; PNBC; solute carrier family 4 (sodium bicarbonate cotransporter), member 4; protein 1; Na(+)/HCO3(-) cotransporter; SLC4A5; KNBC1; sodium bicarbonate cotransporter 1 (sodium bicarbonate cotransporter, kidney; sodium bicarbonate cotransporter, pancreas); solute carrier Family 4, sodium bicarbonate cotransporter, member 4, brain type; solute carrier family 4, sodium bicarbonate cotransporter, member 4; solute carrier family 4, sodium bicarbonate cotransporter, member 5; sodium bicarbonate cotransporter Transporter; NBCel-A; NBCE1; KNBC; and NBC. The external Ids of the SLC4A4 gene are HGNC: 11030; NCBI Entrez Gene: 8671; Ensembl: ENSG00000080493; OMIM®: 603345; and UniProtKB/Swiss-Prot: Q9Y6R1. The term "SLC4A4" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, having a SLC4A4 sequence means having any nucleotide sequence, including coding sequence or any exon, but also any polypeptide, derived from such as NCBI Reference Sequence NM_001098484.3 or any of its allele variants sequence, or any part thereof. ZFAT
ZFAT或含AT-鈎結構域的鋅指(Zing Finger And AT-Hook Domain Containing)已知有許多不同的名稱,諸如鋅指蛋白406;KIAA1485;ZNF406;ZFAT1;鋅指蛋白ZFAT;在自體免疫性甲狀腺疾病的鋅指基因;在AITD易感區的鋅指基因;及AITD3。ZFAT基因的外部Id為HGNC: 19899;NCBI Entrez Gene: 57623;Ensembl: ENSG00000066827;OMIM®: 610931;及UniProtKB/Swiss-Prot: Q9P243。本文所使用之術語「ZFAT」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有ZFAT序列係指帶有來自諸如NCBI參考序列NM_020863.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 DSCAML1 ZFAT or Zing Finger And AT-Hook Domain Containing is known by many different names, such as zinc finger protein 406; KIAA1485; ZNF406; ZFAT1; zinc finger protein ZFAT; Zinc finger genes in prevalent thyroid disease; zinc finger genes in the AITD susceptibility region; and AITD3. The external Ids of ZFAT genes are HGNC: 19899; NCBI Entrez Gene: 57623; Ensembl: ENSG00000066827; OMIM®: 610931; and UniProtKB/Swiss-Prot: Q9P243. The term "ZFAT" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a ZFAT sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI reference sequence NM_020863. sequence, or any part thereof. DSCAML1
DSCAML1或類DS細胞黏附分子1已知有許多不同的名稱,諸如KIAA1132;類唐氏症細胞黏附分子蛋白1;唐氏症細胞黏附分子2;DSCAM2;類唐氏症細胞黏附分子1;類唐氏症細胞黏附分子1;及類DSCAM 1。DSCAML1基因的外部Id為HGNC: 14656;NCBI Entrez Gene: 57453;Ensembl: ENSG00000177103;OMIM®: 611782;及UniProtKB/Swiss-Prot: Q8TD84。本文所使用之術語「DSCAML1」並不意味著侷限於任何諸如蛋白質、RNA、mRNA、cDNA或DNA序列之形式,而是包括所有這類的形式,除非上下文清楚指出是哪種形式。為了避免疑慮,帶有DSCAML1序列係指帶有來自諸如NCBI參考序列NM_020693.4或其任一對偶基因變異體的任何核苷酸序列,包括編碼序列或是任何外顯子,但也包括任何多肽序列,或其任何部分。 NRG1 多核苷酸融合物 DSCAML1 or DS-like cell adhesion molecule 1 is known by many different names such as KIAA1132; Down's syndrome-like cell adhesion molecule protein 1; Down's syndrome-like cell adhesion molecule 2; DSCAM2; Down's syndrome-like cell adhesion molecule 1; Syndrome Cell Adhesion Molecule 1; and DSCAM-like 1. The external Ids of the DSCAML1 gene are HGNC: 14656; NCBI Entrez Gene: 57453; Ensembl: ENSG00000177103; OMIM®: 611782; and UniProtKB/Swiss-Prot: Q8TD84. The term "DSCAML1" as used herein is not meant to be limited to any form such as protein, RNA, mRNA, cDNA or DNA sequence, but includes all such forms unless the context clearly indicates which form. For the avoidance of doubt, carrying a DSCAML1 sequence means carrying any nucleotide sequence, including coding sequence or any exon, but also including any polypeptide, derived from such as NCBI reference sequence NM_020693.4 or any allogenic variant sequence, or any part thereof. NRG1 polynucleotide fusion
根據本揭露,現提供了會導致NRG1與融合配偶體融合之NRG1融合產物表現的前所未知的基因重排。尤其,提供了包含有NRG1之多核苷酸融合物,包括VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1。尤其,在被診斷患有癌症的人類患者中係存在或已鑑定出這類融合物,並在下節中更詳細地提及。In light of the present disclosure, previously unknown gene rearrangements that result in the expression of NRG1 fusion products that result in the fusion of NRG1 to a fusion partner are provided. In particular, polynucleotide fusions comprising NRG1 are provided, including VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1 , PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4 - NRG1, ZFAT-NRG1 or DSCAML1-NRG1. In particular, such fusions are present or have been identified in human patients diagnosed with cancer and are mentioned in more detail in the next section.
在某些實施例中,該NRG1融合物在編碼該EGF樣結構域之序列的3’處可包含另外的下游融合配偶體。 VAPB-NRG1 多核苷酸融合物 In certain embodiments, the NRG1 fusion may comprise an additional downstream fusion partner 3' to the sequence encoding the EGF-like domain. VAPB-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有VAPB核酸序列(或VAPB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了VAPB及NRG1核酸序列的對偶基因變異體。According to the present disclosure, there is also provided a polynucleotide comprising a fusion of a VAPB nucleic acid sequence (or a part of the VAPB nucleic acid sequence) and an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). Allele variants of VAPB and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該VAPB核酸序列(或其部分)包含SEQ ID NO: 17-23中之任一者或SEQ ID NO: 17-23中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該VAPB核酸序列包含SEQ ID NO: 23或SEQ ID NO: 23的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 17-22分別對應根據NM_004738.4之VAPB的個別外顯子1-6。SEQ ID NO: 23對應根據NM_004738.4之VAPB的外顯子1-5。SEQ ID NO: 125-137分別對應根據NM_001159999之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999之NRG1的外顯子1-13。Preferably, the VAPB nucleic acid sequence (or part thereof) comprises, or consists of, any one of SEQ ID NOs: 17-23 or an allele variant of any one of SEQ ID NOs: 17-23 , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the VAPB nucleic acid sequence comprises a part of SEQ ID NO: 23 or an allele variant of SEQ ID NO: 23, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 17-22 correspond to individual exons 1-6, respectively, of VAPB according to NM_004738.4. SEQ ID NO: 23 corresponds to exons 1-5 of VAPB according to NM_004738.4. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.
在一較佳實施例中,該VAPB核酸序列部分包含來自SEQ ID NO: 17-23中之任一者(或SEQ ID NO: 17-23中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the VAPB nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
較佳地,該VAPB核酸序列或其部分係位於NRG1核酸序列或其部分的5’處。Preferably, the VAPB nucleic acid sequence or part thereof is located 5' to the NRG1 nucleic acid sequence or part thereof.
較佳地,該包含有VAPB之VAPB核酸序列(或所述序列的一部分)與NRG1核酸序列(或所述序列的一部分)融合之多核苷酸係包含或編碼NRG1的EGF樣結構域。包含有該VAPB-NRG1多核苷酸融合物或表現該多肽融合物之異常細胞係包含或編碼NRG1的EGF樣結構域。為了檢測、診斷或鑑定之目的,只需證明介於VAPB與NRG1之間的融合接合處是符合讀框的並且發生在可使得所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising VAPB nucleic acid sequence (or a part of said sequence) fused with NRG1 nucleic acid sequence (or a part of said sequence) of VAPB comprises or encodes the EGF-like domain of NRG1. Abnormal cell lines containing the VAPB-NRG1 polynucleotide fusion or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between VAPB and NRG1 is in-frame and occurs at a position such that the resulting fusion product, nucleic acid or protein contains the EGF-like domain of NRG1 That's it. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該VAPB核酸序列之對偶基因變異體與SEQ ID NO: 17-23中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of the VAPB nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 17-23, preferably at least 90% identity, and 92%, 94% identity with it , 96% or better at least 98% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, preferably at least 90% identity, 92%, 94%, 96% or better at least 98% sequence identity thereto.
在一較佳實施例中,該包含有VAPB核酸與NRG1核酸融合之多核苷酸係包含來自SEQ ID NO: 3之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且較佳地包括位置43及44的核酸。連續核酸的數目可為SEQ ID NO: 3之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置53及54的核酸。In a preferred embodiment, the polynucleotide comprising the fusion of VAPB nucleic acid and NRG1 nucleic acid comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 3 Contiguous nucleic acids, and preferably include the nucleic acids at positions 43 and 44. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least Nucleic acids at positions 53 and 54 are included.
在一較佳實施例中,提供了一種根據SEQ ID NO: 3之多核苷酸,或一種包含了來自SEQ ID NO: 3之約20、約30、約40個或全部的連續核酸且較佳地包括了位置43及44的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 3之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置43及44的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 3, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 3 and preferably Polynucleotides that include nucleic acids at positions 43 and 44. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 43 and 44.
較佳地,該NRG1核酸序列之多核苷酸部分(或其對偶基因變異體)係編碼NRG1之EGF樣結構域,較佳為根據SEQ ID NO: 163之EGF樣結構域。Preferably, the polynucleotide portion of the NRG1 nucleic acid sequence (or an allele variant thereof) encodes an EGF-like domain of NRG1, preferably an EGF-like domain according to SEQ ID NO: 163.
在一替代態樣中,提供了一種包含有VAPB之外顯子1或其對偶基因變異體的一部分與NRG1之外顯子2或其對偶基因變異體的一部分融合之多核苷酸。較佳地,VAPB之外顯子1為SEQ ID NO: 17之外顯子。較佳地,NRG1之外顯子2為SEQ ID NO: 126之外顯子。所述VAPB外顯子1的部分較佳地包含SEQ ID NO: 1或其對偶基因變異體或是由其組成。所述NRG1外顯子2的部分較佳地包含SEQ ID NO: 2或其對偶基因變異體或是由其組成。當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 17及126可能就足夠了。來自NRG1外顯子2之3’處的任一序列係包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。In an alternative aspect, there is provided a polynucleotide comprising a portion of VAPB exon 1 or an allogenic variant thereof fused to a portion of NRG1 exon 2 or an allogenic variant thereof. Preferably, VAPB exon 1 is the exon of SEQ ID NO: 17. Preferably, exon 2 of NRG1 is the exon of SEQ ID NO: 126. Said part of VAPB exon 1 preferably comprises or consists of SEQ ID NO: 1 or its allele variant. The portion of exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 2 or an allele variant thereof. When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further includes any sequence 3' to exon 2 of NRG1, but in order to be able to use polynucleotide-based detection Assays to detect fusion junctions, the presence of at least SEQ ID NO: 17 and 126 may be sufficient. Any sequence from 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is derived from it composition.
較佳地,該VAPB之外顯子1之對偶基因變異體與SEQ ID NO: 17有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1之外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 1 of VAPB has at least 85% identity to SEQ ID NO: 17, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of exon 2 of NRG1 is at least 85% identical to SEQ ID NO: 126, preferably at least 90% identical to its 92 %, 94%, 96% or better at least 98% sequence identity.
在一較佳實施例中,VAPB之外顯子1的部分係包含SEQ ID NO: 1或由其組成,且其對偶基因變異體與SEQ ID NO: 1有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此融合物中之NRG1之外顯子2的部分係包含SEQ ID NO: 2或由其組成,且其對偶基因變異體與SEQ ID NO: 2有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。這類短的多核苷酸序列特別有利於檢測VAPB與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。更佳地,VAPB之外顯子1的部分包含來自SEQ ID NO: 1之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置43的核酸。連續核酸的數目可為SEQ ID NO: 1之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置43的核酸。更佳地,VAPB之外顯子1的部分係包含或根據SEQ ID NO: 1或其對偶基因變異體。In a preferred embodiment, the part of exon 1 of VAPB comprises or consists of SEQ ID NO: 1, and its allele variant has at least 85% identity with SEQ ID NO: 1, preferably At least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity. The portion of exon 2 of NRG1 in the fusion comprises or consists of SEQ ID NO: 2, and its allele variant is at least 85% identical to SEQ ID NO: 2, preferably at least 90% identical % identity, 92%, 94%, 96% or better at least 98% sequence identity with it. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between VAPB and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion. More preferably, the part of VAPB exon 1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 1, and At least the nucleic acid at position 43 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 43 nucleic acid. More preferably, the part of exon 1 of VAPB comprises or is according to SEQ ID NO: 1 or an allele variant thereof.
替代性地,VAPB外顯子1的部分包含SEQ ID NO: 17(或SEQ ID NO: 17之對偶基因變異體)或由其組成,至少包括位置399的核酸。較佳地,VAPB外顯子1的部分包含來自SEQ ID NO: 17或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置399的核酸。連續核酸的數目可為SEQ ID NO: 17之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置399的核酸。在此替代例中,VAPB外顯子1的部分更佳地包含或根據SEQ ID NO: 17或其對偶基因變異體。較佳地,在與VAPB的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of VAPB exon 1 comprises or consists of SEQ ID NO: 17 (or an allele variant of SEQ ID NO: 17), including at least the nucleic acid at position 399. Preferably, the portion of VAPB exon 1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 17 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 399. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 399 nucleic acid. In this alternative, the part of VAPB exon 1 preferably comprises or is according to SEQ ID NO: 17 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with VAPB comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有VAPB外顯子1的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 3之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置43及44的核酸。連續核酸的數目可為SEQ ID NO: 3之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置43及44的核酸。SEQ ID NO: 3包括介於VAPB與NRG1之間的接合處,尤其該接合處係介於源於VAPB之位置43的核酸與源於NRG1之位置44的核酸之間。較佳地,該包含有VAPB外顯子1的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 3之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of VAPB exon 1 fused to a portion of NRG1 exon 2 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 43 and 44. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 43 and 44 nucleic acids. SEQ ID NO: 3 includes a junction between VAPB and NRG1, in particular the junction is between a nucleic acid derived from position 43 of VAPB and a nucleic acid derived from position 44 of NRG1. Preferably, the polynucleotide comprising a part of exon 1 of VAPB fused with a part of exon 2 of NRG1 has the polynucleotide sequence of SEQ ID NO: 3 or an allele variant thereof.
較佳地,本文所提供之任一VAPB-NRG1多核苷酸融合物為VAPB與NRG1之符合讀框的融合。更佳地所述融合物為包含有VAPB之外顯子1或外顯子1的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 3或其對偶基因變異體的融合物。Preferably, any of the VAPB-NRG1 polynucleotide fusions provided herein is an in-frame fusion of VAPB and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 1 or a part of exon 1 of VAPB and exon 2 or a part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 3 or an allele variant thereof.
較佳地,VAPB外顯子1(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有VAPB的N端和NRG1的C端。此外,本文所提供之VAPB-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自VAPB的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該VAPB-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌或腺癌,尤其是肺腺癌或非小細胞肺癌。 CADM1-NRG1 多核苷酸融合物 Preferably, the part of VAPB exon 1 (or its allegenic variant) is located 5' to NRG1 exon 2 (or its allegene variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of VAPB and the C-terminus of NRG1. In addition, the VAPB-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from VAPB and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The VAPB-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung or adenocarcinoma, especially lung adenocarcinoma or non-small cell lung cancer. CADM1-NRG1 polynucleotide fusion
亦提供了一種包含有CADM1之外顯子7的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。CADM1之外顯子7較佳為SEQ ID NO: 39或SEQ ID NO: 39之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 7 of CADM1 fused to a portion of exon 6 of NRG1. CADM1 exon 7 is preferably the exon of SEQ ID NO: 39 or an allogenic variant of SEQ ID NO: 39, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
較佳地,CADM1外顯子7之對偶基因變異體與SEQ ID NO: 39有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CADM1 exon 7 is at least 85% identical to SEQ ID NO: 39, preferably at least 90% identical, 92%, 94%, 96% or better identical thereto At least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity with SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94% , 96% or better at least 98% sequence identity.
較佳地,CADM1之外顯子7的部分係包含或根據SEQ ID NO: 5,且其對偶基因變異體與SEQ ID NO: 5有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CADM1的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 6之序列,且其對偶基因變異體與SEQ ID NO: 6有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。這類短的多核苷酸序列特別有利於檢測CADM1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。更佳地,CADM1之外顯子7的部分包含來自SEQ ID NO: 5之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置43的核酸。連續核酸的數目可為SEQ ID NO: 5之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置53的核酸。更佳地,CADM1之外顯子7的部分係包含或根據SEQ ID NO: 5或其對偶基因變異體。Preferably, the part of exon 7 of CADM1 comprises or is according to SEQ ID NO: 5, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 5 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with CADM1 is preferably or comprises a sequence such as SEQ ID NO: 6, and its allele variant has at least 85% identity with SEQ ID NO: 6, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CADM1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion. More preferably, the part of exon 7 of CADM1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 5, and At least the nucleic acid at position 43 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 53 nucleic acid. More preferably, the part of exon 7 of CADM1 comprises or is according to SEQ ID NO: 5 or an allele variant thereof.
替代性地,CADM1外顯子7的部分包含SEQ ID NO: 39(或SEQ ID NO: 39之對偶基因變異體)或由其組成,至少包括位置173的核酸。較佳地,CADM1外顯子7的部分包含來自SEQ ID NO: 39或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置173的核酸。連續核酸的數目可為SEQ ID NO: 39之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置173的核酸。在此替代例中,CADM1外顯子7的部分更佳地包含或根據SEQ ID NO: 39或其對偶基因變異體。較佳地,在與CADM1的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of CADM1 exon 7 comprises or consists of SEQ ID NO: 39 (or an allele variant of SEQ ID NO: 39), including at least the nucleic acid at position 173. Preferably, the portion of CADM1 exon 7 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 39 or its allele variant or is composed of It consists of, and at least includes, the nucleic acid at position 173. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 173 nucleic acid. In this alternative, the portion of CADM1 exon 7 preferably comprises or is according to SEQ ID NO: 39 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with CADM1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有CADM1外顯子7的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 7之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置53及54的核酸。連續核酸的數目可為SEQ ID NO: 7之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置53及54的核酸。SEQ ID NO: 7包括介於CADM1與NRG1之間的接合處,尤其該接合處係介於源於CADM1之位置53的核酸與源於NRG1之位置54的核酸之間。較佳地,該包含有CADM1外顯子7的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 7之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 7 of CADM1 fused to a part of exon 6 of NRG1 is comprised of sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 53 and 54 nucleic acids. SEQ ID NO: 7 includes the junction between CADM1 and NRG1, in particular the junction is between the nucleic acid derived from position 53 of CADM1 and the nucleic acid derived from position 54 of NRG1. Preferably, the polynucleotide comprising a part of exon 7 of CADM1 fused with a part of exon 6 of NRG1 has the polynucleotide sequence of SEQ ID NO: 7 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 7之多核苷酸,或一種包含了來自SEQ ID NO: 7之約20、約30、約40個或全部的連續核酸且較佳地包括了位置53及54的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 7之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置53及54的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 7, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 7 and preferably A polynucleotide that includes the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 53 and 54.
較佳地,本文所提供之任一CADM1-NRG1多核苷酸融合物為CADM1與NRG1之符合讀框的融合。更佳地所述融合物為包含有CADM1之外顯子7或外顯子7的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 7或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 7有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the CADM1-NRG1 polynucleotide fusions provided herein is an in-frame fusion of CADM1 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 7 or a part of exon 7 of CADM1 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 7 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 7, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
較佳地,該包含有CADM1外顯子7(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CADM1或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CADM1與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of CADM1 exon 7 (or its allele variant) fused to a part of NRG1 exon 6 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CADM1 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between CADM1 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CADM1外顯子7(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CADM1的N端和NRG1的C端。此外,本文所提供之CADM1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CADM1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CADM1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌或腺癌,尤其是肺腺癌。Preferably, the part of exon 7 of CADM1 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CADM1 and the C-terminus of NRG1. In addition, the CADM1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from CADM1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CADM1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung or adenocarcinoma, especially lung adenocarcinoma.
當存在於患者或個體的異常細胞中時,所述CADM1-NRG1多核苷酸融合物較佳地進一步包括CADM1外顯子7之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 39及130可能就足夠了。來自CADM1外顯子7之5’處的任一序列包含SEQ ID NO: 33-38(或SEQ ID NO: 33-38之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之任一者或全部,或是由其組成。 CD44-NRG1 多核苷酸融合物 When present in abnormal cells of a patient or individual, the CADM1-NRG1 polynucleotide fusion preferably further comprises any sequence 5' to exon 7 of CADM1 and 3' to exon 6 of NRG1 However, in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 39 and 130 may be sufficient. Any sequence from 5' of CADM1 exon 7 comprising, or consisting of, one or all of SEQ ID NO: 33-38 (or any allele variant of SEQ ID NO: 33-38) , and any sequence from 3' of NRG1 exon 6 comprises any or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it. CD44-NRG1 polynucleotide fusion
亦提供了一種包含有CD44之外顯子5的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。CD44之外顯子5較佳為SEQ ID NO: 65或SEQ ID NO: 65之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 5 of CD44 fused to a portion of exon 2 of NRG1. CD44 exon 5 is preferably the exon of SEQ ID NO: 65 or an allogenic variant of SEQ ID NO: 65, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
較佳地,該CD44外顯子5之對偶基因變異體與SEQ ID NO: 65有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CD44 exon 5 is at least 85% identical to SEQ ID NO: 65, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,CD44之外顯子5的部分係包含或根據SEQ ID NO: 9,且其對偶基因變異體與SEQ ID NO: 9有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CD44的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 10之序列,且其對偶基因變異體與SEQ ID NO: 10有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CD44之外顯子5的部分包含來自SEQ ID NO: 9之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置52的核酸。連續核酸的數目可為SEQ ID NO: 9之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置52的核酸。更佳地,CD44之外顯子5的部分係包含或根據SEQ ID NO: 9或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CD44與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 5 of CD44 comprises or is according to SEQ ID NO: 9, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 9 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with CD44 is preferably or comprises a sequence such as SEQ ID NO: 10, and its allele variant has at least 85% identity with SEQ ID NO: 10, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 5 of CD44 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 9, and At least the nucleic acid at position 52 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 52 nucleic acid. More preferably, the portion of exon 5 of CD44 comprises or is according to SEQ ID NO: 9 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CD44 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,CD44外顯子5的部分包含SEQ ID NO: 65(或SEQ ID NO: 65之對偶基因變異體)或由其組成,至少包括位置231的核酸。較佳地,CD44外顯子5的部分包含來自SEQ ID NO: 65或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置231的核酸。連續核酸的數目可為SEQ ID NO: 65之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置231的核酸。在此替代例中,CD44外顯子5的部分更佳地包含或根據SEQ ID NO: 231或其對偶基因變異體。較佳地,在與CD44的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 5 of CD44 comprises or consists of SEQ ID NO: 65 (or an allele variant of SEQ ID NO: 65), including at least the nucleic acid at position 231. Preferably, the portion of CD44 exon 5 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 65 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 231. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 231 nucleic acid. In this alternative, the portion of CD44 exon 5 preferably comprises or is according to SEQ ID NO: 231 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with CD44 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有CD44外顯子5的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 11之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置52及53的核酸。連續核酸的數目可為SEQ ID NO: 11之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置52及53的核酸。SEQ ID NO: 11包括介於CD44與NRG1之間的接合處,尤其該接合處係介於源於CD44之位置52的核酸與源於NRG1之位置53的核酸之間。較佳地,該包含有CD44外顯子5的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 15之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 5 of CD44 fused to a part of exon 2 of NRG1 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 52 and 53. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 52 and 53 nucleic acids. SEQ ID NO: 11 includes a junction between CD44 and NRG1, in particular the junction is between a nucleic acid derived from position 52 of CD44 and a nucleic acid derived from position 53 of NRG1. Preferably, the polynucleotide comprising a part of exon 5 of CD44 fused with a part of exon 2 of NRG1 has the polynucleotide sequence of SEQ ID NO: 15 or its allele variant.
在一較佳實施例中,提供了一種根據SEQ ID NO: 11之多核苷酸,或一種包含了來自SEQ ID NO: 11之約20、約30、約40個或全部的連續核酸且較佳地包括了位置52及53的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 11之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置52及53的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 11, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 11 and preferably A polynucleotide that includes the nucleic acids at positions 52 and 53. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 52 and 53.
較佳地,本文所提供之任一CD44-NRG1多核苷酸融合物為CD44與NRG1之符合讀框的融合。更佳地所述融合物為包含有CD44之外顯子5 (或外顯子5的一部分)以及NRG1之外顯子2 (或NRG1之外顯子2的一部分)之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 11或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 11有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the CD44-NRG1 polynucleotide fusions provided herein are in-frame fusions of CD44 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 5 (or a part of exon 5) of CD44 and exon 2 of NRG1 (or a part of exon 2 of NRG1). The in-frame fusion is preferably a fusion of SEQ ID NO: 11 or its allogeneic variant, and the allogeneic variant has at least 85% identity to SEQ ID NO: 11, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
較佳地,該包含有CD44外顯子5(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CD44或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CD44與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 5 of CD44 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CD44 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between CD44 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CD44外顯子5(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CD44的N端和NRG1的C端。此外,本文所提供之CD44-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CD44的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CD44-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括胰臟癌或胰臟腺癌,尤其是胰臟導管腺癌(或PDAC)。Preferably, the portion of exon 5 (or its allele variant) of CD44 is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CD44 and the C-terminus of NRG1. In addition, the CD44-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from CD44 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CD44-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including pancreatic cancer or pancreatic adenocarcinoma, especially pancreatic ductal adenocarcinoma (or PDAC).
當存在於患者或個體的異常細胞中時,所述CD44-NRG1多核苷酸融合物較佳地進一步包括CD44外顯子5之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 65及126可能就足夠了。來自CD44外顯子5之5’處的任一序列包含SEQ ID NO: 61-64(或SEQ ID NO: 61-64之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the CD44-NRG1 polynucleotide fusion preferably further comprises any sequence 5' to exon 5 of CD44 and 3' to exon 2 of NRG1 However, in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 65 and 126 may be sufficient. Any sequence from 5' of exon 5 of CD44 comprising or consisting of one or all of SEQ ID NO: 61-64 (or any allele variant of SEQ ID NO: 61-64) , and any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is derived from its composition.
再者,提供了一種包含有CD44之外顯子5的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。所述CD44之外顯子5較佳為SEQ ID NO: 65或SEQ ID NO: 65之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Furthermore, a polynucleotide fusion comprising a portion of exon 5 of CD44 fused to a portion of exon 6 of NRG1 is provided. The CD44 exon 5 is preferably the exon of SEQ ID NO: 65 or the allele variant of SEQ ID NO: 65, and the NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: Exon of the allele variant of 130.
較佳地,所述CD44外顯子5之對偶基因變異體與SEQ ID NO: 65有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CD44 exon 5 has at least 85% identity to SEQ ID NO: 65, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 is at least 85% identical to SEQ ID NO: 130, preferably at least 90% identical to 92% identical to SEQ ID NO: 130 , 94%, 96% or better at least 98% sequence identity.
較佳地,所述CD44之外顯子5的部分係包含或根據SEQ ID NO: 759,且其對偶基因變異體與SEQ ID NO: 9有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CD44的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 760之序列,且其對偶基因變異體與SEQ ID NO: 760有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CD44之外顯子5的部分包含來自SEQ ID NO: 759之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 759之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,所述CD44之外顯子5的部分係包含或根據SEQ ID NO: 759或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CD44與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 5 of CD44 comprises or is according to SEQ ID NO: 759, and its allele variant has at least 85% identity with SEQ ID NO: 9, preferably at least 90% 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. The part of exon 6 of NRG1 in the fusion with CD44 is preferably or comprises a sequence such as SEQ ID NO: 760, and its allele variant has at least 85% identity with SEQ ID NO: 760, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 5 of CD44 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 759, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of exon 5 of CD44 comprises or is according to SEQ ID NO: 759 or its allele variant. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CD44 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,所述CD44外顯子5的部分包含SEQ ID NO: 65(或SEQ ID NO: 65之對偶基因變異體)或由其組成,至少包括位置231的核酸。較佳地,所述CD44外顯子5的部分包含來自SEQ ID NO: 65或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置231的核酸。連續核酸的數目可為SEQ ID NO: 65之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置231的核酸。在此替代例中,CD44外顯子5的部分更佳地包含或根據SEQ ID NO: 65或其對偶基因變異體。較佳地,在與CD44的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 5 of CD44 comprises or consists of SEQ ID NO: 65 (or an allele variant of SEQ ID NO: 65), at least including the nucleic acid at position 231. Preferably, the portion of exon 5 of CD44 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 65 or its allele variant or consists of and includes at least the nucleic acid at position 231. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 231 nucleic acid. In this alternative, the portion of CD44 exon 5 preferably comprises or is according to SEQ ID NO: 65 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with CD44 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一CD44-NRG1多核苷酸融合物為CD44與NRG1之符合讀框的融合。更佳地所述融合物為包含有CD44之外顯子5或外顯子5的一部分以及NRG1之外顯子6或NRG1之外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 761或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 761有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the CD44-NRG1 polynucleotide fusions provided herein are in-frame fusions of CD44 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 5 or a part of exon 5 of CD44 and exon 6 of NRG1 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 761 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 761, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有所述CD44外顯子5的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 761之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 761之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 761包括介於CD44與NRG1之間的接合處,尤其該接合處係介於源於CD44之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有CD44外顯子5的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 761之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 5 of CD44 fused with a part of exon 6 of NRG1 comprises from 2 to about 10 of SEQ ID NO: 761, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 761 includes a junction between CD44 and NRG1, in particular the junction is between the nucleic acid derived from position 75 of CD44 and the nucleic acid derived from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 5 of CD44 fused with a part of exon 6 of NRG1 has the polynucleotide sequence of SEQ ID NO: 761 or its allele variant.
在一較佳實施例中,提供了一種根據SEQ ID NO: 761之多核苷酸,或一種包含了來自SEQ ID NO: 761之約20、約30、約40個或全部的連續核酸且較佳地包括了位置75及76的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 761之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 761, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 761 and preferably A polynucleotide that includes the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有CD44外顯子5(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及所述CD44或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CD44與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of CD44 exon 5 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving said CD44 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between CD44 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CD44外顯子5(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CD44的N端和NRG1的C端。此外,本文所提供之CD44-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CD44的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CD44-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是胰臟癌。Preferably, the portion of exon 5 (or its allele variant) of CD44 is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CD44 and the C-terminus of NRG1. In addition, the CD44-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from CD44 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CD44-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially pancreatic cancer.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括CD44外顯子5之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 65及130可能就足夠了。來自CD44外顯子5之5’處的任一序列包含SEQ ID NO: 61-64(或SEQ ID NO: 61-64之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。 SLC3A2-NRG1 多核苷酸融合物 When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of exon 5 of CD44 and any sequence at the 3' of exon 6 of NRG1. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 65 and 130 may be sufficient. Any sequence from 5' of exon 5 of CD44 comprising, or consisting of, one or all of SEQ ID NO: 61-64 (or any allele variant of SEQ ID NO: 61-64) , and any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is derived from its composition. SLC3A2-NRG1 polynucleotide fusion
亦提供了一種包含有SLC3A2轉錄本6之外顯子1的一部分與NRG1之外顯子5的一部分融合之多核苷酸融合物。所述SLC3A2之外顯子1較佳為SEQ ID NO: 103或SEQ ID NO: 103之對偶基因變異體的外顯子,而NRG1之外顯子5較佳為SEQ ID NO: 129或SEQ ID NO: 129之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 1 of SLC3A2 transcript fused to a portion of exon 5 of NRG1. The SLC3A2 exon 1 is preferably the exon of SEQ ID NO: 103 or the allele variant of SEQ ID NO: 103, and the NRG1 exon 5 is preferably SEQ ID NO: 129 or SEQ ID NO: Exon of the allele variant of 129.
較佳地,所述SLC3A2外顯子1之對偶基因變異體與SEQ ID NO: 103有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子5之對偶基因變異體與SEQ ID NO: 129有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 1 of SLC3A2 has at least 85% identity to SEQ ID NO: 103, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of NRG1 exon 5 has at least 85% identity with SEQ ID NO: 129, preferably at least 90% identity, and 92% with SEQ ID NO: 129 , 94%, 96% or better at least 98% sequence identity.
較佳地,所述SLC3A2之外顯子1的部分係包含或根據SEQ ID NO: 13,且其對偶基因變異體與SEQ ID NO: 13有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與SLC3A2的融合物中之NRG1之外顯子5的部分較佳為或包含如SEQ ID NO: 14之序列,且其對偶基因變異體與SEQ ID NO: 14有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,所述SLC3A2之外顯子1的部分包含來自SEQ ID NO: 13之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置53的核酸。連續核酸的數目可為SEQ ID NO: 13之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置53的核酸。更佳地,所述SLC3A2之外顯子1的部分係包含或根據SEQ ID NO: 13或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測SLC3A2與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 1 of SLC3A2 comprises or is according to SEQ ID NO: 13, and its allele variant has at least 85% identity with SEQ ID NO: 13, preferably at least 90% 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. The part of exon 5 of NRG1 in the fusion with SLC3A2 is preferably or comprises a sequence such as SEQ ID NO: 14, and its allele variant has at least 85% identity with SEQ ID NO: 14, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 1 of the SLC3A2 comprises or consists of continuous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 13 , and at least including the nucleic acid at position 53. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 53 nucleic acid. More preferably, the part of exon 1 of SLC3A2 comprises or is according to SEQ ID NO: 13 or its allele variant. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between SLC3A2 and NRG1 and determining whether such fusions are oncogenic in frame containing the EGF-like domain of NRG1 sexual fusion.
替代性地,所述SLC3A2外顯子1的部分包含SEQ ID NO: 103(或SEQ ID NO: 103之對偶基因變異體)或由其組成,至少包括位置552的核酸。較佳地,所述SLC3A2外顯子1的部分包含來自SEQ ID NO: 103或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置552的核酸。連續核酸的數目可為SEQ ID NO: 103之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置552的核酸。在此替代例中,SLC3A2外顯子1的部分更佳地包含或根據SEQ ID NO: 103或其對偶基因變異體。較佳地,在與所述SLC3A2的融合物中之NRG1外顯子5的部分包含來自SEQ ID NO: 157或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 157之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 1 of SLC3A2 comprises or consists of SEQ ID NO: 103 (or an allele variant of SEQ ID NO: 103), at least including the nucleic acid at position 552. Preferably, the portion of exon 1 of SLC3A2 comprises 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 103 or its allele variants or consists of and includes at least the nucleic acid at position 552. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 552 nucleic acid. In this alternative, the portion of exon 1 of SLC3A2 preferably comprises or is according to SEQ ID NO: 103 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 5 in the fusion with said SLC3A2 comprises from 2 to about 10, about 20, about 30, or up to about 40 of SEQ ID NO: 157 or an allele variant thereof or even all of the contiguous nucleic acids or consist thereof, and at least including the nucleic acid at position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一SLC3A2-NRG1多核苷酸融合物為SLC3A2與NRG1之符合讀框的融合。更佳地所述融合物為包含有SLC3A2之轉錄本6之外顯子1或外顯子1的一部分以及NRG1之外顯子5或外顯子5的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 15或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 15有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any SLC3A2-NRG1 polynucleotide fusion provided herein is an in-frame fusion of SLC3A2 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 1 or a part of exon 1 of transcript 6 of SLC3A2 and exon 5 or a part of exon 5 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 15 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 15, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有所述SLC3A2外顯子1的一部分與NRG1外顯子5的一部分融合之多核苷酸係包含來自SEQ ID NO: 15之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置53及54的核酸。連續核酸的數目可為SEQ ID NO: 15之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置53及54的核酸。SEQ ID NO: 15包括介於SLC3A2與NRG1之間的接合處,尤其該接合處係介於源於SLC3A2之位置53的核酸與源於NRG1之位置54的核酸之間。較佳地,該包含有SLC3A2外顯子1的一部分與NRG1外顯子5的一部分融合之多核苷酸係具有SEQ ID NO: 15之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 1 of SLC3A2 fused with a part of exon 5 of NRG1 is a polynucleotide comprising from 2 to about 10 of SEQ ID NO: 15, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 53 and 54 nucleic acids. SEQ ID NO: 15 includes the junction between SLC3A2 and NRG1, in particular the junction is between the nucleic acid derived from position 53 of SLC3A2 and the nucleic acid derived from position 54 of NRG1. Preferably, the polynucleotide comprising a part of exon 1 of SLC3A2 fused with a part of exon 5 of NRG1 has a polynucleotide sequence of SEQ ID NO: 15 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 15之多核苷酸,或一種包含了來自SEQ ID NO: 15之約20、約30、約40個或全部的連續核酸且較佳地包括了位置53及54的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 15之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置53及54的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 15, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 15 and preferably A polynucleotide that includes the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 53 and 54.
較佳地,該包含有SLC3A2轉錄本6之外顯子1(或其對偶基因變異體)的一部分與NRG1外顯子5(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及所述SLC3A2或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於所述SLC3A2與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 1 of SLC3A2 transcript 6 (or its allele variant) fused with a part of NRG1 exon 5 (or its allele variant) is a A portion of a longer polynucleotide further comprising or encoding the EGF-like domain of NRG1. Abnormal cell lines containing polynucleotide fusions involving said SLC3A2 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between said SLC3A2 and NRG1 is in-frame and occurs when the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 Such a location will suffice. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該SLC3A2轉錄本6之外顯子1(或其對偶基因變異體)的部分係位於NRG1外顯子5(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有SLC3A2的N端和NRG1的C端。此外,本文所提供之SLC3A2-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自SLC3A2的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該SLC3A2-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌或腺癌,尤其是肺腺癌。Preferably, the part of exon 1 of the SLC3A2 transcript 6 (or its allele variant) is located at the 5' position of NRG1 exon 5 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of SLC3A2 and the C-terminus of NRG1. In addition, the SLC3A2-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from SLC3A2 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The SLC3A2-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung or adenocarcinoma, especially lung adenocarcinoma.
當存在於患者或個體的異常細胞中時,所述SCL3A2-NRG1多核苷酸融合物較佳地進一步包括NRG1外顯子5之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 103及129可能就足夠了。來自NRG1外顯子5之3’處的任一序列係包含SEQ ID NO: 130-137(或SEQ ID NO: 130-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the SCL3A2-NRG1 polynucleotide fusion preferably further includes any sequence at the 3' of exon 5 of NRG1, but in order to be able to use the polynucleotide as For basic detection assays to detect fusion junctions, the presence of at least SEQ ID NO: 103 and 129 may be sufficient. Any sequence from 3' of NRG1 exon 5 comprises one or all of SEQ ID NO: 130-137 (or any allele variant of SEQ ID NO: 130-137), or is derived from it composition.
再者,提供了一種包含有SLC3A2轉錄本3之外顯子2的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。所述SLC3A2之外顯子2較佳為SEQ ID NO: 457或SEQ ID NO: 457之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Furthermore, a polynucleotide fusion comprising a part of exon 2 of SLC3A2 transcript 3 fused to a part of exon 6 of NRG1 is provided. The SLC3A2 exon 2 is preferably the exon of SEQ ID NO: 457 or the allele variant of SEQ ID NO: 457, and the NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: Exon of the allele variant of 130.
較佳地,所述SLC3A2外顯子2之對偶基因變異體與SEQ ID NO: 457有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 2 of SLC3A2 has at least 85% identity to SEQ ID NO: 457, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 is at least 85% identical to SEQ ID NO: 130, preferably at least 90% identical to 92% identical to SEQ ID NO: 130 , 94%, 96% or better at least 98% sequence identity.
較佳地,所述SLC3A2之外顯子2的部分係包含或根據SEQ ID NO: 452,且其對偶基因變異體與SEQ ID NO: 452有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與SLC3A2的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 453之序列,且其對偶基因變異體與SEQ ID NO: 453有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,SLC3A2之外顯子2的部分包含來自SEQ ID NO: 452之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置93的核酸。連續核酸的數目可為SEQ ID NO: 452之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置93的核酸。更佳地,所述SLC3A2之外顯子2的部分係包含或根據SEQ ID NO: 452或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測SLC3A2與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 2 of SLC3A2 comprises or is according to SEQ ID NO: 452, and its allele variant has at least 85% identity with SEQ ID NO: 452, preferably at least 90% 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. The part of exon 6 of NRG1 in the fusion with SLC3A2 is preferably or comprises a sequence such as SEQ ID NO: 453, and its allele variant has at least 85% identity with SEQ ID NO: 453, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 2 of SLC3A2 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 452, and At least the nucleic acid at position 93 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 93 nucleic acid. More preferably, the part of exon 2 of SLC3A2 comprises or is according to SEQ ID NO: 452 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between SLC3A2 and NRG1 and determining whether such fusions are oncogenic in frame containing the EGF-like domain of NRG1 sexual fusion.
替代性地,所述SLC3A2外顯子2的部分包含SEQ ID NO: 457(或SEQ ID NO: 457之對偶基因變異體)或由其組成,至少包括位置93的核酸。較佳地,所述SLC3A2外顯子2的部分包含來自SEQ ID NO: 457或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置93的核酸。連續核酸的數目可為SEQ ID NO: 39之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置457的核酸。在此替代例中,SLC3A2外顯子2的部分更佳地包含或根據SEQ ID NO: 457或其對偶基因變異體。較佳地,在與SLC3A2轉錄本3的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 2 of SLC3A2 comprises or consists of SEQ ID NO: 457 (or an allele variant of SEQ ID NO: 457), including at least the nucleic acid at position 93. Preferably, the portion of exon 2 of SLC3A2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 457 or its allele variant or consists of and includes at least the nucleic acid at position 93. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 457 nucleic acid. In this alternative, the portion of exon 2 of SLC3A2 preferably comprises or is according to SEQ ID NO: 457 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with SLC3A2 transcript 3 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids are or consist of them, at least including the nucleic acid at position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一SLC3A2-NRG1多核苷酸融合物為SLC3A2與NRG1之符合讀框的融合。更佳地所述融合物為包含有SLC3A2之轉錄本3之外顯子2或外顯子2的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 454或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 454有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any SLC3A2-NRG1 polynucleotide fusion provided herein is an in-frame fusion of SLC3A2 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 2 or a part of exon 2 of transcript 3 of SLC3A2 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 454 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 454, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有所述SLC3A2外顯子2的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 454之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置93及94的核酸。連續核酸的數目可為SEQ ID NO: 454之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置93及94的核酸。SEQ ID NO: 454包括介於所述SLC3A2與NRG1之間的接合處,尤其該接合處係介於源於SLC3A2之位置93的核酸與源於NRG1之位置94的核酸之間。較佳地,該包含有SLC3A2外顯子2的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 454之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising the fusion of a part of exon 2 of SLC3A2 and a part of exon 6 of NRG1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 93 and 94. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 93 And 94 nucleic acids. SEQ ID NO: 454 includes the junction between said SLC3A2 and NRG1, in particular the junction is between the nucleic acid derived from position 93 of SLC3A2 and the nucleic acid derived from position 94 of NRG1. Preferably, the polynucleotide comprising a part of exon 2 of SLC3A2 fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 454 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 454之多核苷酸,或一種包含了來自SEQ ID NO: 454之約20、約30、約40個或全部的連續核酸且較佳地包括了位置93及94的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 454之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置93及94的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 454, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 454 and preferably A polynucleotide that includes the nucleic acids at positions 93 and 94. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 93 and 94.
較佳地,該包含有SLC3A2轉錄本3之外顯子2(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及所述SLC3A2或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於所述SLC3A2與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 2 of SLC3A2 transcript 3 (or its allele variant) fused with a part of NRG1 exon 6 (or its allele variant) is a A portion of a longer polynucleotide further comprising or encoding the EGF-like domain of NRG1. Abnormal cell lines containing polynucleotide fusions involving said SLC3A2 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between said SLC3A2 and NRG1 is in-frame and occurs when the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 Such a location will suffice. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該SLC3A2轉錄本3之外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有SLC3A2的N端和NRG1的C端。此外,本文所提供之SLC3A2-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自SLC3A2的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CADM1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌或腺癌,尤其是肺腺癌。Preferably, the part of exon 2 (or its allele variant) of the SLC3A2 transcript 3 is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of SLC3A2 and the C-terminus of NRG1. In addition, the SLC3A2-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from SLC3A2 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CADM1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung or adenocarcinoma, especially lung adenocarcinoma.
當存在於患者或個體的異常細胞中時,所述SCL3A2-NRG1多核苷酸融合物較佳地進一步包括NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 457及130可能就足夠了。來自NRG1外顯子6之3’處的任一序列係包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。 VTCN1-NRG1 多核苷酸融合物 When present in abnormal cells of a patient or individual, the SCL3A2-NRG1 polynucleotide fusion preferably further includes any sequence at the 3' of exon 6 of NRG1, but in order to be able to use the polynucleotide as For basic detection assays to detect fusion junctions, the presence of at least SEQ ID NO: 457 and 130 may be sufficient. Any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is derived from it composition. VTCN1-NRG1 polynucleotide fusion
亦提供了一種包含有VTCN1之外顯子2的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。VTCN1之外顯子2較佳為SEQ ID NO: 169或SEQ ID NO: 169之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 2 of VTCN1 fused to a portion of exon 2 of NRG1. VTCN1 exon 2 is preferably the exon of SEQ ID NO: 169 or an allogenic variant of SEQ ID NO: 169, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
較佳地,該VTCN1外顯子2之對偶基因變異體與SEQ ID NO: 169有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of VTCN1 exon 2 is at least 85% identical to SEQ ID NO: 169, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,VTCN1之外顯子2的部分係包含或根據SEQ ID NO: 164,且其對偶基因變異體與SEQ ID NO: 164有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與VTCN1的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 165之序列,且其對偶基因變異體與SEQ ID NO: 165有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,VTCN1之外顯子2的部分包含來自SEQ ID NO: 164之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置65的核酸。連續核酸的數目可為SEQ ID NO: 164之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置65的核酸。更佳地,VTCN1之外顯子2的部分係包含或根據SEQ ID NO: 164或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測VTCN1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of VTCN1 exon 2 comprises or is according to SEQ ID NO: 164, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 164 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with VTCN1 is preferably or comprises a sequence such as SEQ ID NO: 165, and its allele variant has at least 85% identity with SEQ ID NO: 165, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 2 of VTCN1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 164, and At least the nucleic acid at position 65 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 65 nucleic acid. More preferably, the part of exon 2 of VTCN1 comprises or is according to SEQ ID NO: 164 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between VTCN1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
較佳地,本文所提供之任一VTCN1-NRG1多核苷酸融合物為VTCN1與NRG1之符合讀框的融合。更佳地所述融合物為包含有VTCN1之外顯子2或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 166或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 166有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any VTCN1-NRG1 polynucleotide fusion provided herein is an in-frame fusion of VTCN1 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 2 or a part of exon 2 of VTCN1 and exon 2 or a part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 166 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 166, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有VTCN1外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 166之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置65及66的核酸。連續核酸的數目可為SEQ ID NO: 166之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置65及66的核酸。SEQ ID NO: 166包括介於VTCN1與NRG1之間的接合處,尤其該接合處係介於源於VTCN1之位置65的核酸與源於NRG1之位置66的核酸之間。較佳地,該包含有VTCN1外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 166之多核苷酸序列或其對偶基因變異體。較佳地,在與VTCN1的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。In an alternative preferred embodiment, the polynucleotide comprising a part of VTCN1 exon 2 fused to a part of NRG1 exon 2 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 65 and 66. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 65 and 66 nucleic acids. SEQ ID NO: 166 includes the junction between VTCN1 and NRG1, in particular the junction is between the nucleic acid at position 65 from VTCN1 and the nucleic acid at position 66 from NRG1. Preferably, the polynucleotide comprising a part of exon 2 of VTCN1 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 166 or an allele variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with VTCN1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一較佳實施例中,提供了一種根據SEQ ID NO: 166之多核苷酸,或一種包含了來自SEQ ID NO: 166之約20、約30、約40個或全部的連續核酸且較佳地包括了位置65及66的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 166之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置65及66的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 166, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 166 and preferably A polynucleotide that includes the nucleic acids at positions 65 and 66. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 65 and 66.
較佳地,該包含有VTCN1外顯子2(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及VTCN1或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於VTCN1與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of VTCN1 exon 2 (or its allogeneic variant) fused to a part of NRG1 exon 2 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving VTCN1 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between VTCN1 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該VTCN1外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有VTCN1的N端和NRG1的C端。此外,本文所提供之VTCN1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自VTCN1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該VTCN1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌。Preferably, the part of VTCN1 exon 2 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of VTCN1 and the C-terminus of NRG1. In addition, the VTCN1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from VTCN1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The VTCN1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinomas.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括VTCN1外顯子2之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 169及126可能就足夠了。來自VTCN1外顯子2之5’處的任一序列包含SEQ ID NO: 168(或SEQ ID NO: 168之任一對偶基因變異體)或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。 CDH1-NRG1 多核苷酸融合物 When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of VTCN1 exon 2 and any sequence at the 3' of NRG1 exon 2 sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 169 and 126 may be sufficient. Any sequence from 5' of VTCN1 exon 2 comprising or consisting of SEQ ID NO: 168 (or any allele variant of SEQ ID NO: 168), and from 3 of NRG1 exon 2 Any sequence at ' comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it. CDH1-NRG1 polynucleotide fusion
亦提供了一種包含有CDH1之外顯子11的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。CDH1之外顯子11較佳為SEQ ID NO: 198或SEQ ID NO: 198之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 11 of CDH1 fused to a portion of exon 2 of NRG1. CDH1 exon 11 is preferably the exon of SEQ ID NO: 198 or an allogenic variant of SEQ ID NO: 198, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括CDH1外顯子11之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 198及126可能就足夠了。來自CDH1外顯子11之5’處的任一序列包含SEQ ID NO: 188-197(或SEQ ID NO: 188-197之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of CDH1 exon 11 and any sequence at the 3' of NRG1 exon 2. sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 198 and 126 may be sufficient. Any sequence from the 5' of exon 11 of CDH1 comprises or consists of one or all of SEQ ID NO: 188-197 (or any allele variant of SEQ ID NO: 188-197) , and any sequence from 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is formed by its composition.
較佳地,該CDH1外顯子11之對偶基因變異體與SEQ ID NO: 198有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CDH1 exon 11 is at least 85% identical to SEQ ID NO: 198, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,CDH1之外顯子11的部分係包含或根據SEQ ID NO: 184,且其對偶基因變異體與SEQ ID NO: 184有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CDH1的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 185之序列,且其對偶基因變異體與SEQ ID NO: 185有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CDH1之外顯子11的部分包含來自SEQ ID NO: 184之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置119的核酸。連續核酸的數目可為SEQ ID NO: 184之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置119的核酸。更佳地,CDH1之外顯子11的部分係包含或根據SEQ ID NO: 184或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CDH1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 11 of CDH1 comprises or is according to SEQ ID NO: 184, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 184 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with CDH1 is preferably or comprises a sequence such as SEQ ID NO: 185, and its allele variant has at least 85% identity with SEQ ID NO: 185, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 11 of CDH1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 184, and At least the nucleic acid at position 119 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 119 nucleic acid. More preferably, the portion of exon 11 of CDH1 comprises or is according to SEQ ID NO: 184 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CDH1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,CDH1外顯子11的部分包含SEQ ID NO: 198(或SEQ ID NO: 198之對偶基因變異體)或由其組成,至少包括位置146的核酸。較佳地,CDH1外顯子11的部分包含來自SEQ ID NO: 198或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置146的核酸。連續核酸的數目可為SEQ ID NO: 198之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置146的核酸。在此替代例中,CDH1外顯子11的部分更佳地包含或根據SEQ ID NO: 198或其對偶基因變異體。Alternatively, the portion of CDH1 exon 11 comprises or consists of SEQ ID NO: 198 (or an allele variant of SEQ ID NO: 198), including at least the nucleic acid at position 146. Preferably, the portion of CDH1 exon 11 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 198 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 146. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 146 nucleic acid. In this alternative, the portion of exon 11 of CDH1 preferably comprises or is according to SEQ ID NO: 198 or an allegenic variant thereof.
較佳地,本文所提供之任一CDH1-NRG1多核苷酸融合物為CDH1與NRG1之符合讀框的融合。更佳地所述融合物為包含有CDH1之外顯子11或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 186或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 186有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any CDH1-NRG1 polynucleotide fusion provided herein is an in-frame fusion of CDH1 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 11 or a part of exon 2 of CDH1 and exon 2 or a part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 186 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 186, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有CDH1外顯子11的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 186之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置119及120的核酸。連續核酸的數目可為SEQ ID NO: 186之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置119及120的核酸。SEQ ID NO: 186包括介於CDH1與NRG1之間的接合處,尤其該接合處係介於源於CDH1之位置119的核酸與源於NRG1之位置120的核酸之間。較佳地,該包含有CDH1外顯子11的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 186之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 11 of CDH1 fused to a part of exon 2 of NRG1 is a polynucleotide comprising from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 119 and 120. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 119 and 120 nucleic acids. SEQ ID NO: 186 includes the junction between CDH1 and NRG1, in particular the junction is between the nucleic acid from position 119 of CDH1 and the nucleic acid from position 120 of NRG1. Preferably, the polynucleotide comprising a part of exon 11 of CDH1 fused with a part of exon 2 of NRG1 has the polynucleotide sequence of SEQ ID NO: 186 or its allele variant.
在一較佳實施例中,提供了一種根據SEQ ID NO: 186之多核苷酸,或一種包含了來自SEQ ID NO: 186之約20、約30、約40個或全部的連續核酸且較佳地包括了位置119及120的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 186之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置119及120的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 186, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 186 and preferably A polynucleotide that includes the nucleic acids at positions 119 and 120. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 119 and 120.
較佳地,該包含有CDH1外顯子11(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CDH1或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CDH1與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 11 of CDH1 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CDH1 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between CDH1 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CDH1外顯子11(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CDH1的N端和NRG1的C端。此外,本文所提供之CDH1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CDH1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CDH1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌或腺癌,尤其是肺腺癌。 CXADR-NRG1 多核苷酸融合物 Preferably, the part of exon 11 of CDH1 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CDH1 and the C-terminus of NRG1. In addition, the CDH1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from CDH1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CDH1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung or adenocarcinoma, especially lung adenocarcinoma. CXADR-NRG1 polynucleotide fusion
亦提供了一種包含有CXADR之外顯子1的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。CXADR之外顯子1較佳為SEQ ID NO: 2199或SEQ ID NO: 219之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 1 of CXADR fused to a portion of exon 2 of NRG1. CXADR exon 1 is preferably the exon of the allele variant of SEQ ID NO: 2199 or SEQ ID NO: 219, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括CXADR外顯子1之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 219及126可能就足夠了。NRG1外顯子2之3’處的任一序列係包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of CXADR exon 1 and any sequence at the 3' of NRG1 exon 2. sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 219 and 126 may be sufficient. Any sequence at the 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it .
較佳地,該CXADR外顯子1之對偶基因變異體與SEQ ID NO: 219有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the CXADR exon 1 allele variant is at least 85% identical to SEQ ID NO: 219, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,CXADR之外顯子1的部分係包含或根據SEQ ID NO: 215,且其對偶基因變異體與SEQ ID NO: 215有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CXADR的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 216之序列,且其對偶基因變異體與SEQ ID NO: 216有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CXADR之外顯子1的部分包含來自SEQ ID NO: 215之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置43的核酸。連續核酸的數目可為SEQ ID NO: 215之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置43的核酸。更佳地,CXADR之外顯子1的部分係包含或根據SEQ ID NO: 215或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CXADR與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 1 of CXADR comprises or is according to SEQ ID NO: 215, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 215 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with CXADR is preferably or comprises a sequence such as SEQ ID NO: 216, and its allele variant has at least 85% identity with SEQ ID NO: 216, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 1 of CXADR comprises or consists of contiguous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 215, and At least the nucleic acid at position 43 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 43 nucleic acid. More preferably, the part of exon 1 of CXADR comprises or is according to SEQ ID NO: 215 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CXADR and NRG1 and determining whether such fusions are oncogenic in frame containing the EGF-like domain of NRG1 sexual fusion.
替代性地,CXADR外顯子1的部分包含SEQ ID NO: 219(或SEQ ID NO: 219之對偶基因變異體)或由其組成,至少包括位置130的核酸。較佳地,CXADR外顯子1的部分包含來自SEQ ID NO: 219或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置130的核酸。連續核酸的數目可為SEQ ID NO: 219之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置130的核酸。在此替代例中,CXADR外顯子1的部分更佳地包含或根據SEQ ID NO: 219或其對偶基因變異體。Alternatively, the portion of CXADR exon 1 comprises or consists of SEQ ID NO: 219 (or an allele variant of SEQ ID NO: 219), including at least the nucleic acid at position 130. Preferably, the portion of CXADR exon 1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 219 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 130. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 130 nucleic acid. In this alternative, the part of CXADR exon 1 preferably comprises or is according to SEQ ID NO: 219 or an allegenic variant thereof.
較佳地,本文所提供之任一CXADR-NRG1多核苷酸融合物為CXADR與NRG1之符合讀框的融合。更佳地所述融合物為包含有CXADR之外顯子1或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 217或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 217有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the CXADR-NRG1 polynucleotide fusions provided herein are in-frame fusions of CXADR and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 1 or part of exon 2 of CXADR and exon 2 or part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 217 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 217, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有CXADR外顯子1的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 217之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置43及44的核酸。連續核酸的數目可為SEQ ID NO: 217之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置43及44的核酸。SEQ ID NO: 217包括介於CXADR與NRG1之間的接合處,尤其該接合處係介於源於CXADR之位置43的核酸與源於NRG1之位置44的核酸之間。較佳地,該包含有CXADR外顯子1的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 217之多核苷酸序列或其對偶基因變異體。較佳地,在與CXADR的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。In an alternative preferred embodiment, the polynucleotide comprising a portion of CXADR exon 1 fused to a portion of NRG1 exon 2 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 43 and 44. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 43 and 44 nucleic acids. SEQ ID NO: 217 includes a junction between CXADR and NRG1, in particular the junction is between a nucleic acid derived from position 43 of CXADR and a nucleic acid derived from position 44 of NRG1. Preferably, the polynucleotide comprising a part of exon 1 of CXADR fused with a part of exon 2 of NRG1 has the polynucleotide sequence of SEQ ID NO: 217 or an allele variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with CXADR comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一較佳實施例中,提供了一種根據SEQ ID NO: 217之多核苷酸,或一種包含了來自SEQ ID NO: 217之約20、約30、約40個或全部的連續核酸且較佳地包括了位置43及44的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 217之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置43及44的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 217, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 217 and preferably Polynucleotides that include nucleic acids at positions 43 and 44. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 43 and 44.
較佳地,該包含有CXADR外顯子1(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CXADR或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CXADR與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of CXADR exon 1 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CXADR or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between CXADR and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CXADR外顯子1(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CXADR的N端和NRG1的C端。此外,本文所提供之CXADR-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CXADR的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CXADR-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是大腸癌。 GTF2E2-NRG1 多核苷酸融合物 Preferably, the part of CXADR exon 1 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CXADR and the C-terminus of NRG1. In addition, the CXADR-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from CXADR and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CXADR-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially colorectal cancer. GTF2E2-NRG1 polynucleotide fusion
亦提供了一種包含有GTF2E2之外顯子2的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。GTF2E2之外顯子2較佳為SEQ ID NO: 236或SEQ ID NO: 236之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 2 of GTF2E2 fused to a portion of exon 2 of NRG1. GTF2E2 exon 2 is preferably the exon of SEQ ID NO: 236 or an allogenic variant of SEQ ID NO: 236, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括GTF2E2外顯子2之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 236及126可能就足夠了。來自GTF2E2外顯子2之5’處的任一序列包含SEQ ID NO: 235(或SEQ ID NO: 235之任一對偶基因變異體)或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of GTF2E2 exon 2 and any sequence at the 3' of NRG1 exon 2 sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 236 and 126 may be sufficient. Any sequence from 5' of exon 2 of GTF2E2 comprising or consisting of SEQ ID NO: 235 (or any allele variant of SEQ ID NO: 235), and from 3 of exon 2 of NRG1 Any sequence at ' comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it.
較佳地,該GTF2E2外顯子2之對偶基因變異體與SEQ ID NO: 236有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of GTF2E2 exon 2 is at least 85% identical to SEQ ID NO: 236, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,GTF2E2之外顯子2的部分係包含或根據SEQ ID NO: 231,且其對偶基因變異體與SEQ ID NO: 231有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與GTF2E2的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 232之序列,且其對偶基因變異體與SEQ ID NO: 232有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,GTF2E2之外顯子2的部分包含來自SEQ ID NO: 231之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置141的核酸。連續核酸的數目可為SEQ ID NO: 231之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置141的核酸。更佳地,GTF2E2之外顯子2的部分係包含或根據SEQ ID NO: 231或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測GTF2E2與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the portion of exon 2 of GTF2E2 comprises or is according to SEQ ID NO: 231, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 231 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with GTF2E2 is preferably or comprises a sequence such as SEQ ID NO: 232, and its allele variant has at least 85% identity with SEQ ID NO: 232, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 2 of GTF2E2 comprises or consists of contiguous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 231, and At least the nucleic acid at position 141 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 141 nucleic acid. More preferably, the portion of exon 2 of GTF2E2 comprises or is according to SEQ ID NO: 231 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between GTF2E2 and NRG1 and determining whether such fusions are oncogenic in frame containing the EGF-like domain of NRG1 sexual fusion.
替代性地,GTF2E2外顯子2的部分包含SEQ ID NO: 236(或SEQ ID NO: 236之對偶基因變異體)或由其組成,至少包括位置170的核酸。較佳地,GTF2E2外顯子2的部分包含來自SEQ ID NO: 236或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置170的核酸。連續核酸的數目可為SEQ ID NO: 236之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置170的核酸。在此替代例中,GTF2E2外顯子2的部分更佳地包含或根據SEQ ID NO: 236或其對偶基因變異體。較佳地,在與GTF2E2的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of GTF2E2 exon 2 comprises or consists of SEQ ID NO: 236 (or an allele variant of SEQ ID NO: 236), including at least the nucleic acid at position 170. Preferably, the portion of GTF2E2 exon 2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 236 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 170. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 170 nucleic acid. In this alternative, the portion of exon 2 of GTF2E2 preferably comprises or is according to SEQ ID NO: 236 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with GTF2E2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一GTF2E2-NRG1多核苷酸融合物為GTF2E2與NRG1之符合讀框的融合。更佳地所述融合物為包含有GTF2E2之外顯子2或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 233或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 233有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the GTF2E2-NRG1 polynucleotide fusions provided herein is an in-frame fusion of GTF2E2 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 2 or part of exon 2 of GTF2E2 and exon 2 or part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 233 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 233, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有GTF2E2外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 233之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置141及142的核酸。連續核酸的數目可為SEQ ID NO: 233之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置141及142的核酸。SEQ ID NO: 233包括介於GTF2E2與NRG1之間的接合處,尤其該接合處係介於源於GTF2E2之位置141的核酸與源於NRG1之位置142的核酸之間。較佳地,該包含有GTF2E2外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 233之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of exon 2 of GTF2E2 fused to a portion of exon 2 of NRG1 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 141 and 142. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 141 and 142 nucleic acids. SEQ ID NO: 233 includes a junction between GTF2E2 and NRG1, in particular the junction is between the nucleic acid from position 141 of GTF2E2 and the nucleic acid from position 142 of NRG1. Preferably, the polynucleotide comprising a part of exon 2 of GTF2E2 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 233 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 233之多核苷酸,或一種包含了來自SEQ ID NO: 233之約20、約30、約40個或全部的連續核酸且較佳地包括了位置141及142的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 233之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置141及142的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 233, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 233 and preferably A polynucleotide that includes the nucleic acids at positions 141 and 142. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 141 and 142.
較佳地,該包含有GTF2E2外顯子2(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及GTF2E2或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於GTF2E2與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 2 of GTF2E2 (or its allele variant) fused with a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving GTF2E2 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between GTF2E2 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該GTF2E2外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有GTF2E2的N端和NRG1的C端。此外,本文所提供之GTF2E2-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自GTF2E2的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該GTF2E2-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,更尤其是(轉移性)乳腺癌NOS。 CSMD1-NRG1 多核苷酸融合物 Preferably, the part of GTF2E2 exon 2 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of GTF2E2 and the C-terminus of NRG1. In addition, the GTF2E2-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from GTF2E2 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The GTF2E2-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, more particularly (metastatic) breast cancer NOS. CSMD1-NRG1 polynucleotide fusion
亦提供了一種包含有CSMD1之外顯子23的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。CSMD1之外顯子23較佳為SEQ ID NO: 279或SEQ ID NO: 279之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 23 of CSMD1 fused to a portion of exon 6 of NRG1. CSMD1 exon 23 is preferably the exon of SEQ ID NO: 279 or an allogenic variant of SEQ ID NO: 279, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括CSMD1外顯子23之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 279及130可能就足夠了。來自CSMD1外顯子23之5’處的任一序列包含SEQ ID NO: 257-278(或SEQ ID NO: 257-278之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of CSMD1 exon 23 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 279 and 130 may be sufficient. Any sequence from 5' of CSMD1 exon 23 comprising any or all of SEQ ID NO: 257-278 (or any allele variant of SEQ ID NO: 257-278), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該CSMD1外顯子23之對偶基因變異體與SEQ ID NO: 279有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CSMD1 exon 23 is at least 85% identical to SEQ ID NO: 279, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,CSMD1之外顯子23的部分係包含或根據SEQ ID NO: 253,且其對偶基因變異體與SEQ ID NO: 253有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CSMD1的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 254之序列,且其對偶基因變異體與SEQ ID NO: 254有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CSMD1之外顯子23的部分包含來自SEQ ID NO: 253之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置88的核酸。連續核酸的數目可為SEQ ID NO: 253之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置88的核酸。更佳地,CSMD1之外顯子23的部分係包含或根據SEQ ID NO: 253或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CSMD1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 23 of CSMD1 comprises or is according to SEQ ID NO: 253, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 253 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with CSMD1 is preferably or comprises a sequence such as SEQ ID NO: 254, and its allele variant has at least 85% identity with SEQ ID NO: 254, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 23 of CSMD1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 253, and At least the nucleic acid at position 88 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 88 nucleic acid. More preferably, the portion of exon 23 of CSMD1 comprises or is according to SEQ ID NO: 253 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CSMD1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,CSMD1外顯子23的部分包含SEQ ID NO: 279(或SEQ ID NO: 279之對偶基因變異體)或由其組成,至少包括位置157的核酸。較佳地,CSMD1外顯子23的部分包含來自SEQ ID NO: 279或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置157的核酸。連續核酸的數目可為SEQ ID NO: 279之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置157的核酸。在此替代例中,CSMD1外顯子23的部分更佳地包含或根據SEQ ID NO: 279或其對偶基因變異體。較佳地,在與CSMD1的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of CSMD1 exon 23 comprises or consists of SEQ ID NO: 279 (or an allele variant of SEQ ID NO: 279), including at least the nucleic acid at position 157. Preferably, the portion of CSMD1 exon 23 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 279 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 157. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 157 nucleic acid. In this alternative, the portion of exon 23 of CSMD1 preferably comprises or is according to SEQ ID NO: 279 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with CSMD1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一CSMD1-NRG1多核苷酸融合物為CSMD1與NRG1之符合讀框的融合。更佳地所述融合物為包含有CSMD1之外顯子23或外顯子6的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 255或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 255有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any CSMD1-NRG1 polynucleotide fusion provided herein is an in-frame fusion of CSMD1 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 23 or a part of exon 6 of CSMD1 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 255 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 255, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有CSMD1外顯子23的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 255之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置88及89的核酸。連續核酸的數目可為SEQ ID NO: 255之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置88及89的核酸。SEQ ID NO: 255包括介於CSMD1與NRG1之間的接合處,尤其該接合處係介於源於CSMD1之位置88的核酸與源於NRG1之位置89的核酸之間。較佳地,該包含有CSMD1外顯子23的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 255之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of CSMD1 exon 23 fused to a portion of NRG1 exon 6 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 88 and 89. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 88 and 89 nucleic acids. SEQ ID NO: 255 includes the junction between CSMD1 and NRG1, in particular the junction is between the nucleic acid from position 88 of CSMD1 and the nucleic acid from position 89 of NRG1. Preferably, the polynucleotide comprising a part of exon 23 of CSMD1 fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 255 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 255之多核苷酸,或一種包含了來自SEQ ID NO: 255之約20、約30、約40個或全部的連續核酸且較佳地包括了位置88及89的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 255之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置88及89的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 255, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 255 and preferably Polynucleotides that include nucleic acids at positions 88 and 89. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 88 and 89.
較佳地,該包含有CSMD1外顯子23(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CSMD1或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CSMD1與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of CSMD1 exon 23 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CSMD1 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between CSMD1 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CSMD1外顯子23(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CSMD1的N端和NRG1的C端。此外,本文所提供之CSMD1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CSMD1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CSMD1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 PTN-NRG1 多核苷酸融合物 Preferably, the part of exon 23 of CSMD1 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CSMD1 and the C-terminus of NRG1. In addition, the CSMD1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from CSMD1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CSMD1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. PTN-NRG1 polynucleotide fusion
亦提供了一種包含有PTN之外顯子4的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。PTN之外顯子4較佳為SEQ ID NO: 318或SEQ ID NO: 318之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 4 of PTN fused to a portion of exon 2 of NRG1. PTN exon 4 is preferably the exon of SEQ ID NO: 318 or an allogenic variant of SEQ ID NO: 318, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括PTN外顯子4之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 318及126可能就足夠了。來自PTN外顯子4之5’處的任一序列包含SEQ ID NO: 315-317(或SEQ ID NO: 315-317之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of PTN exon 4 and any sequence at the 3' of NRG1 exon 2. sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 318 and 126 may be sufficient. Any sequence from 5' of PTN exon 4 comprising any or all of SEQ ID NO: 315-317 (or any allele variant of SEQ ID NO: 315-317), or derived from its composition, and any sequence from 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is composed of it.
較佳地,該PTN外顯子4之對偶基因變異體與SEQ ID NO: 318有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of PTN exon 4 is at least 85% identical to SEQ ID NO: 318, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,PTN之外顯子4的部分係包含或根據SEQ ID NO: 311,且其對偶基因變異體與SEQ ID NO: 311有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與PTN的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 312之序列,且其對偶基因變異體與SEQ ID NO: 312有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,PTN之外顯子4的部分包含來自SEQ ID NO: 311之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置102的核酸。連續核酸的數目可為SEQ ID NO: 311之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置102的核酸。更佳地,PTN之外顯子4的部分係包含或根據SEQ ID NO: 311或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測PTN與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 4 of PTN comprises or is according to SEQ ID NO: 311, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 311 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with PTN is preferably or comprises a sequence such as SEQ ID NO: 312, and its allele variant has at least 85% identity with SEQ ID NO: 312, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 4 of PTN comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 311, and At least the nucleic acid at position 102 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 102 nucleic acid. More preferably, the portion of exon 4 of PTN comprises or is according to SEQ ID NO: 311 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between PTN and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,PTN外顯子4的部分包含SEQ ID NO: 318(或SEQ ID NO: 318之對偶基因變異體)或由其組成,至少包括位置162的核酸。較佳地,PTN外顯子4的部分包含來自SEQ ID NO: 318或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置162的核酸。連續核酸的數目可為SEQ ID NO: 318之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置162的核酸。在此替代例中,PTN外顯子4的部分更佳地包含或根據SEQ ID NO: 318或其對偶基因變異體。較佳地,在與PTN的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of PTN exon 4 comprises or consists of SEQ ID NO: 318 (or an allele variant of SEQ ID NO: 318), including at least the nucleic acid at position 162. Preferably, the portion of PTN exon 4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 318 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 162. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 162 nucleic acid. In this alternative, the portion of PTN exon 4 preferably comprises or is according to SEQ ID NO: 318 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with PTN comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一PTN-NRG1多核苷酸融合物為PTN與NRG1之符合讀框的融合。更佳地所述融合物為包含有PTN之外顯子4或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 313或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 313有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the PTN-NRG1 polynucleotide fusions provided herein is an in-frame fusion of PTN and NRG1. More preferably, the fusion is an in-frame fusion comprising a part of exon 4 or exon 2 of PTN and a part of exon 2 or exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 313 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 313, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有PTN外顯子4的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 313之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置102及103的核酸。連續核酸的數目可為SEQ ID NO: 313之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置102及103的核酸。SEQ ID NO: 313包括介於PTN與NRG1之間的接合處,尤其該接合處係介於源於PTN之位置102的核酸與源於NRG1之位置103的核酸之間。較佳地,該包含有PTN外顯子4的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 313之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of PTN exon 4 fused to a portion of NRG1 exon 2 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 102 and 103. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 102 and 103 nucleic acids. SEQ ID NO: 313 includes a junction between PTN and NRG1, in particular the junction is between the nucleic acid from position 102 of PTN and the nucleic acid from position 103 of NRG1. Preferably, the polynucleotide comprising a part of PTN exon 4 fused with a part of NRG1 exon 2 is a polynucleotide sequence of SEQ ID NO: 313 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 313之多核苷酸,或一種包含了來自SEQ ID NO: 313之約20、約30、約40個或全部的連續核酸且較佳地包括了位置102及103的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 313之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置102及103的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 313, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 313 and preferably A polynucleotide that includes the nucleic acids at positions 102 and 103. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 102 and 103.
較佳地,該包含有PTN外顯子4(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及PTN或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於PTN與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of PTN exon 4 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving PTN or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between PTN and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該PTN外顯子4(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有PTN的N端和NRG1的C端。此外,本文所提供之PTN-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自PTN的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該PTN-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 ST14-NRG1 多核苷酸融合物 Preferably, the part of PTN exon 4 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of PTN and the C-terminus of NRG1. In addition, the PTN-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from PTN and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The PTN-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. ST14-NRG1 polynucleotide fusion
亦提供了一種包含有ST14之外顯子11的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。ST14之外顯子11較佳為SEQ ID NO: 342或SEQ ID NO: 342之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 11 of ST14 fused to a portion of exon 6 of NRG1. ST14 exon 11 is preferably the exon of SEQ ID NO: 342 or an allogenic variant of SEQ ID NO: 342, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括ST14外顯子11之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 324及130可能就足夠了。來自ST14外顯子11之5’處的任一序列包含SEQ ID NO: 332-341(或SEQ ID NO: 332-341之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of ST14 exon 11 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 324 and 130 may be sufficient. Any sequence from the 5' of ST14 exon 11 comprising any or all of SEQ ID NO: 332-341 (or any allele variant of SEQ ID NO: 332-341), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該ST14外顯子11之對偶基因變異體與SEQ ID NO: 342有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the ST14 exon 11 allele variant is at least 85% identical to SEQ ID NO: 342, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,ST14之外顯子11的部分係包含或根據SEQ ID NO: 328,且其對偶基因變異體與SEQ ID NO: 328有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與ST14的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 329之序列,且其對偶基因變異體與SEQ ID NO: 329有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,ST14之外顯子11的部分包含來自SEQ ID NO: 328之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置95的核酸。連續核酸的數目可為SEQ ID NO: 328之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置95的核酸。更佳地,ST14之外顯子11的部分係包含或根據SEQ ID NO: 328或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測ST14與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 11 of ST14 comprises or is according to SEQ ID NO: 328, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 328 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with ST14 is preferably or comprises a sequence such as SEQ ID NO: 329, and its allele variant has at least 85% identity with SEQ ID NO: 329, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 11 of ST14 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 328, and At least the nucleic acid at position 95 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 95 nucleic acid. More preferably, the portion of exon 11 of ST14 comprises or is according to SEQ ID NO: 328 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between ST14 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,ST14外顯子11的部分包含SEQ ID NO: 342(或SEQ ID NO: 342之對偶基因變異體)或由其組成,至少包括位置131的核酸。較佳地,ST14外顯子11的部分包含來自SEQ ID NO: 342或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置131的核酸。連續核酸的數目可為SEQ ID NO: 342之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置131的核酸。在此替代例中,ST14外顯子11的部分更佳地包含或根據SEQ ID NO: 342或其對偶基因變異體。較佳地,在與ST14 的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of ST14 exon 11 comprises or consists of SEQ ID NO: 342 (or an allele variant of SEQ ID NO: 342), including at least the nucleic acid at position 131. Preferably, the portion of ST14 exon 11 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 342 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 131. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 131 nucleic acid. In this alternative, the portion of ST14 exon 11 preferably comprises or is according to SEQ ID NO: 342 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with ST14 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一ST14-NRG1多核苷酸融合物為ST14與NRG1之符合讀框的融合。更佳地所述融合物為包含有ST14之外顯子11或外顯子6的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 330或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 330有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the ST14-NRG1 polynucleotide fusions provided herein is an in-frame fusion of ST14 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 11 or a part of exon 6 of ST14 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 330 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 330, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有ST14外顯子11的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 330之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置95及96的核酸。連續核酸的數目可為SEQ ID NO: 330之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置95及96的核酸。SEQ ID NO: 330包括介於ST14 與NRG1之間的接合處,尤其該接合處係介於源於ST14之位置95的核酸與源於NRG1之位置96的核酸之間。較佳地,該包含有ST14外顯子11的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 330之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of ST14 exon 11 fused to a portion of NRG1 exon 6 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 95 and 96. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 95 And 96 nucleic acids. SEQ ID NO: 330 includes the junction between ST14 and NRG1, in particular the junction is between the nucleic acid derived from position 95 of ST14 and the nucleic acid derived from position 96 of NRG1. Preferably, the polynucleotide comprising a part of ST14 exon 11 fused with a part of NRG1 exon 6 has the polynucleotide sequence of SEQ ID NO: 330 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 330之多核苷酸,或一種包含了來自SEQ ID NO: 330之約20、約30、約40個或全部的連續核酸且較佳地包括了位置95及96的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 330之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置95及96的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 330, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 330 and preferably Polynucleotides that include nucleic acids at positions 95 and 96. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 95 and 96.
較佳地,該包含有ST14外顯子11(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及ST14或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於ST14與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of ST14 exon 11 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving ST14 or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between ST14 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該ST14外顯子11(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有ST14的N端和NRG1的C端。此外,本文所提供之ST14-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自ST14的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該ST14-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 THBS1-NRG1 多核苷酸融合物 Preferably, the portion of ST14 exon 11 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of ST14 and the C-terminus of NRG1. In addition, the ST14-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from ST14 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The ST14-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinomas, more particularly pancreatic ductal adenocarcinomas. THBS1-NRG1 polynucleotide fusion
亦提供了一種包含有THBS1之外顯子9的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。THBS1之外顯子9較佳為SEQ ID NO: 386或SEQ ID NO: 386之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 9 of THBS1 fused to a portion of exon 6 of NRG1. THBS1 exon 9 is preferably the exon of SEQ ID NO: 386 or an allogenic variant of SEQ ID NO: 386, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括THBS1外顯子9之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 386及130可能就足夠了。來自THBS1外顯子9之5’處的任一序列包含SEQ ID NO: 378-385(或SEQ ID NO: 378-385之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of THBS1 exon 9 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 386 and 130 may be sufficient. Any sequence from 5' of THBS1 exon 9 comprising any or all of SEQ ID NO: 378-385 (or any allele variant of SEQ ID NO: 378-385), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該THBS1外顯子9之對偶基因變異體與SEQ ID NO: 386有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of THBS1 exon 9 is at least 85% identical to SEQ ID NO: 386, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,THBS1之外顯子9的部分係包含或根據SEQ ID NO: 374,且其對偶基因變異體與SEQ ID NO: 374有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與THBS1的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 375之序列,且其對偶基因變異體與SEQ ID NO: 375有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,THBS1之外顯子9的部分包含來自SEQ ID NO: 374之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置56的核酸。連續核酸的數目可為SEQ ID NO: 374之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置56的核酸。更佳地,THBS1之外顯子9的部分係包含或根據SEQ ID NO: 374或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測THBS1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 9 of THBS1 comprises or is according to SEQ ID NO: 374, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 374 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with THBS1 is preferably or comprises a sequence such as SEQ ID NO: 375, and its allele variant has at least 85% identity with SEQ ID NO: 375, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 9 of THBS1 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 374, and At least the nucleic acid at position 56 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 56 nucleic acid. More preferably, the portion of exon 9 of THBS1 comprises or is according to SEQ ID NO: 374 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between THBS1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,THBS1外顯子9的部分包含SEQ ID NO: 386(或SEQ ID NO: 386之對偶基因變異體)或由其組成,至少包括位置177的核酸。較佳地,THBS1外顯子9的部分包含來自SEQ ID NO: 386或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置177的核酸。連續核酸的數目可為SEQ ID NO: 386之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置177的核酸。在此替代例中,THBS1外顯子9的部分更佳地包含或根據SEQ ID NO: 386或其對偶基因變異體。較佳地,在與THBS1的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of THBS1 exon 9 comprises or consists of SEQ ID NO: 386 (or an allele variant of SEQ ID NO: 386), including at least the nucleic acid at position 177. Preferably, the portion of THBS1 exon 9 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 386 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 177. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 177 nucleic acid. In this alternative, the portion of THBS1 exon 9 preferably comprises or is according to SEQ ID NO: 386 or an allegenic variant thereof. Preferably, the portion of exon 6 of NRG1 in the fusion with THBS1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一THBS1-NRG1多核苷酸融合物為THBS1與NRG1之符合讀框的融合。更佳地所述融合物為包含有THBS1之外顯子9或外顯子6的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 376或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 376有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any THBS1-NRG1 polynucleotide fusion provided herein is an in-frame fusion of THBS1 and NRG1. More preferably, the fusion is an in-frame fusion comprising a part of exon 9 or exon 6 of THBS1 and a part of exon 6 or exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 376 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 376, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有THBS1外顯子9的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 376之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置56及57的核酸。連續核酸的數目可為SEQ ID NO: 376之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置56及57的核酸。SEQ ID NO: 376包括介於THBS1與NRG1之間的接合處,尤其該接合處係介於源於THBS1之位置56的核酸與源於NRG1之位置57的核酸之間。較佳地,該包含有THBS1外顯子9的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 376之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 9 of THBS1 fused to a part of exon 6 of NRG1 is comprised of 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 56 and 57. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 56 and 57 nucleic acids. SEQ ID NO: 376 includes the junction between THBS1 and NRG1, in particular the junction is between the nucleic acid derived from position 56 of THBS1 and the nucleic acid derived from position 57 of NRG1. Preferably, the polynucleotide comprising a part of exon 9 of THBS1 fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 376 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 376之多核苷酸,或一種包含了來自SEQ ID NO: 376之約20、約30、約40個或全部的連續核酸且較佳地包括了位置56及57的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 376之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置56及57的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 376, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 376 and preferably A polynucleotide that includes the nucleic acids at positions 56 and 57. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 56 and 57.
較佳地,該包含有THBS1外顯子9(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及THBS1或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於THBS1與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of THBS1 exon 9 (or its allele variant) fused to a part of NRG1 exon 6 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving THBS1 or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between THBS1 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該THBS1外顯子9(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有THBS1的N端和NRG1的C端。此外,本文所提供之THBS1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自THBS1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該THBS1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 AGRN-NRG1 多核苷酸融合物 Preferably, the portion of exon 9 of THBS1 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of THBS1 and the C-terminus of NRG1. In addition, the THBS1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from THBS1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The THBS1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. AGRN-NRG1 polynucleotide fusion
亦提供了一種包含有AGRN之外顯子12的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。AGRN之外顯子12較佳為SEQ ID NO: 416或SEQ ID NO: 416之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 12 of AGRN fused to a portion of exon 6 of NRG1. AGRN exon 12 is preferably the exon of SEQ ID NO: 416 or an allogenic variant of SEQ ID NO: 416, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括AGRN外顯子12之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 416及130可能就足夠了。來自AGRN外顯子12之5’處的任一序列包含SEQ ID NO: 405-415(或SEQ ID NO: 405-415之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of AGRN exon 12 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 416 and 130 may be sufficient. Any sequence from 5' of AGRN exon 12 comprising any or all of SEQ ID NO: 405-415 (or any allele variant of SEQ ID NO: 405-415), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該AGRN外顯子12之對偶基因變異體與SEQ ID NO: 416有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of AGRN exon 12 is at least 85% identical to SEQ ID NO: 416, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,AGRN之外顯子12的部分係包含或根據SEQ ID NO: 401,且其對偶基因變異體與SEQ ID NO: 401有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與AGRN的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 402之序列,且其對偶基因變異體與SEQ ID NO: 402有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,AGRN之外顯子12的部分包含來自SEQ ID NO: 401之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置106的核酸。連續核酸的數目可為SEQ ID NO: 401之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置106的核酸。更佳地,AGRN之外顯子12的部分係包含或根據SEQ ID NO: 401或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測AGRN與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 12 of AGRN comprises or is according to SEQ ID NO: 401, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 401 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with AGRN is preferably or comprises a sequence such as SEQ ID NO: 402, and its allele variant has at least 85% identity with SEQ ID NO: 402, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 12 of AGRN comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 401, and At least the nucleic acid at position 106 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 106 nucleic acid. More preferably, the portion of exon 12 of AGRN comprises or is according to SEQ ID NO: 401 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between AGRN and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,AGRN外顯子12的部分包含SEQ ID NO: 416(或SEQ ID NO: 416之對偶基因變異體)或由其組成,至少包括位置106的核酸。較佳地,AGRN外顯子12的部分包含來自SEQ ID NO: 416或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置106的核酸。連續核酸的數目可為SEQ ID NO: 416之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置106的核酸。在此替代例中,AGRN外顯子12的部分更佳地包含或根據SEQ ID NO: 416或其對偶基因變異體。較佳地,在與AGRN的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of AGRN exon 12 comprises or consists of SEQ ID NO: 416 (or an allele variant of SEQ ID NO: 416), including at least the nucleic acid at position 106. Preferably, the portion of AGRN exon 12 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 416 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 106. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 106 nucleic acid. In this alternative, the portion of AGRN exon 12 preferably comprises or is according to SEQ ID NO: 416 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with AGRN comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一AGRN-NRG1多核苷酸融合物為AGRN與NRG1之符合讀框的融合。更佳地所述融合物為包含有AGRN之外顯子12或外顯子12的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 403或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 403有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any AGRN-NRG1 polynucleotide fusion provided herein is an in-frame fusion of AGRN and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 12 or a part of exon 12 of AGRN and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 403 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 403, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有AGRN外顯子12的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 403之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置106及107的核酸。連續核酸的數目可為SEQ ID NO: 403之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置106及107的核酸。SEQ ID NO: 403包括介於AGRN與NRG1之間的接合處,尤其該接合處係介於源於AGRN之位置106的核酸與源於NRG1之位置107的核酸之間。較佳地,該包含有AGRN外顯子12的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 403之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of AGRN exon 12 fused to a portion of NRG1 exon 6 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 106 and 107. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 106 and 107 nucleic acids. SEQ ID NO: 403 includes a junction between AGRN and NRG1, in particular the junction is between the nucleic acid from position 106 of AGRN and the nucleic acid from position 107 of NRG1. Preferably, the polynucleotide comprising a part of exon 12 of AGRN fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 403 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 403之多核苷酸,或一種包含了來自SEQ ID NO: 403之約20、約30、約40個或全部的連續核酸且較佳地包括了位置106及107的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 403之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置106及107的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 403, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 403 and preferably A polynucleotide that includes the nucleic acids at positions 106 and 107. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 106 and 107.
較佳地,該包含有AGRN外顯子12(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及AGRN或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於AGRN與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of AGRN exon 12 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving AGRN or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between AGRN and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該AGRN外顯子12(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有AGRN的N端和NRG1的C端。此外,本文所提供之AGRN-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自AGRN的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該AGRN-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 PVALB-NRG1 多核苷酸融合物 Preferably, the part of AGRN exon 12 (or its allogeneic variant) is located at the 5' position of NRG1 exon 6 (or its allogeneic variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of AGRN and the C-terminus of NRG1. In addition, the AGRN-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from AGRN and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The AGRN-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. PVALB-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有PVALB核酸序列(或PVALB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了PVALB及NRG1核酸序列的對偶基因變異體。According to the present disclosure, there is also provided a polynucleotide comprising the fusion of the PVALB nucleic acid sequence (or a part of the PVALB nucleic acid sequence) and the NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). Allele variants of PVALB and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該PVALB核酸序列(或其部分)包含SEQ ID NO: 439-444中之任一者或SEQ ID NO: 439-444中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該PVALB核酸序列包含SEQ ID NO: 444或SEQ ID NO: 444的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 439-443分別對應根據NM_002854.3之PVALB的個別外顯子1-5。SEQ ID NO: 444對應根據NM_002854.3之PVALB的外顯子1-5。SEQ ID NO: 125-137分別對應根據NM_001159999之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999之NRG1的外顯子1-13。Preferably, the PVALB nucleic acid sequence (or part thereof) comprises any one of SEQ ID NO: 439-444 or an allele variant of any one of SEQ ID NO: 439-444, or consists of it , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the PVALB nucleic acid sequence comprises a part of SEQ ID NO: 444 or an allele variant of SEQ ID NO: 444, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 439-443 correspond to individual exons 1-5 of PVALB according to NM_002854.3, respectively. SEQ ID NO: 444 corresponds to exons 1-5 of PVALB according to NM_002854.3. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.
在一較佳實施例中,該PVALB核酸序列部分包含來自SEQ ID NO: 439-444中之任一者(或SEQ ID NO: 439-444中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the PVALB nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
亦提供了一種包含有PVALB之外顯子4的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。PVALB之外顯子4較佳為SEQ ID NO: 442或SEQ ID NO: 442之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 4 of PVALB fused to a portion of exon 6 of NRG1. PVALB exon 4 is preferably the exon of SEQ ID NO: 442 or an allogenic variant of SEQ ID NO: 442, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括PVALB外顯子4之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 422及130可能就足夠了。來自PVALB外顯子4之5’處的任一序列包含SEQ ID NO: 439-441(或SEQ ID NO: 439-441之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of PVALB exon 4 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 422 and 130 may be sufficient. Any sequence from 5' of PVALB exon 4 comprising any or all of SEQ ID NO: 439-441 (or any allele variant of SEQ ID NO: 439-441), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該PVALB外顯子4之對偶基因變異體與SEQ ID NO: 442有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of PVALB exon 4 is at least 85% identical to SEQ ID NO: 442, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,PVALB之外顯子4的部分係包含或根據SEQ ID NO: 435,且其對偶基因變異體與SEQ ID NO: 435有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與PVALB的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 436之序列,且其對偶基因變異體與SEQ ID NO: 436有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,PVALB之外顯子4的部分包含來自SEQ ID NO: 435之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置102的核酸。連續核酸的數目可為SEQ ID NO: 435之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置102的核酸。更佳地,PVALB之外顯子4的部分係包含或根據SEQ ID NO: 435或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測PVALB與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the portion of exon 4 of PVALB comprises or is according to SEQ ID NO: 435, and its allele variant is at least 85% identical to SEQ ID NO: 435, preferably at least 90% identical 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with PVALB is preferably or comprises a sequence such as SEQ ID NO: 436, and its allele variant has at least 85% identity with SEQ ID NO: 436, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 4 of PVALB comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 435, and At least the nucleic acid at position 102 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 102 nucleic acid. More preferably, the portion of exon 4 of PVALB comprises or is according to SEQ ID NO: 435 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between PVALB and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,PVALB外顯子4的部分包含SEQ ID NO: 442(或SEQ ID NO: 442之對偶基因變異體)或由其組成,至少包括位置110的核酸。較佳地,PVALB外顯子4的部分包含來自SEQ ID NO: 442或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置110的核酸。連續核酸的數目可為SEQ ID NO: 442之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置110的核酸。在此替代例中,PVALB外顯子4的部分更佳地包含或根據SEQ ID NO: 442或其對偶基因變異體。較佳地,在與PVALB的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of PVALB exon 4 comprises or consists of SEQ ID NO: 442 (or an allele variant of SEQ ID NO: 442), including at least the nucleic acid at position 110. Preferably, the portion of PVALB exon 4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 442 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 110. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 110 nucleic acid. In this alternative, the portion of PVALB exon 4 preferably comprises or is according to SEQ ID NO: 442 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with PVALB comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一PVALB-NRG1多核苷酸融合物為PVALB與NRG1之符合讀框的融合。更佳地所述融合物為包含有PVALB之外顯子4或外顯子4的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 437或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 437有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the PVALB-NRG1 polynucleotide fusions provided herein is an in-frame fusion of PVALB and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 4 or a part of exon 4 of PVALB and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 437 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 437, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有PVALB外顯子4的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 437之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置102及103的核酸。連續核酸的數目可為SEQ ID NO: 437之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置102及103的核酸。SEQ ID NO: 437包括介於PVALB與NRG1之間的接合處,尤其該接合處係介於源於PVALB之位置102的核酸與源於NRG1之位置103的核酸之間。較佳地,該包含有PVALB外顯子4的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 437之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of PVALB exon 4 fused to a portion of NRG1 exon 6 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 102 and 103. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 102 and 103 nucleic acids. SEQ ID NO: 437 includes a junction between PVALB and NRG1, in particular the junction is between the nucleic acid at position 102 from PVALB and the nucleic acid at position 103 from NRG1. Preferably, the polynucleotide comprising a part of exon 4 of PVALB fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 437 or an allele variant thereof.
較佳地,該包含有PVALB外顯子4(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及PVALB或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於PVALB與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 4 of PVALB (or its allele variant) fused to a part of NRG1 exon 6 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving PVALB or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between PVALB and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該PVALB外顯子4(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有PVALB的N端和NRG1的C端。此外,本文所提供之PVALB-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自PVALB的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該PVALB-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 APP-NRG1 多核苷酸融合物 Preferably, the part of PVALB exon 4 (or its allegenic variant) is located 5' to NRG1 exon 6 (or its allegene variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of PVALB and the C-terminus of NRG1. In addition, the PVALB-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from PVALB and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The PVALB-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. APP-NRG1 polynucleotide fusion
亦提供了一種包含有APP之外顯子14的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。APP之外顯子14較佳為SEQ ID NO: 501或SEQ ID NO: 501之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of APP exon 14 fused to a portion of NRG1 exon 6. APP exon 14 is preferably the exon of SEQ ID NO: 501 or an allogenic variant of SEQ ID NO: 501, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括APP外顯子14之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 501及130可能就足夠了。來自APP外顯子14之5’處的任一序列包含SEQ ID NO: 488-500(或SEQ ID NO: 488-500之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of APP exon 14 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 501 and 130 may be sufficient. Any sequence from the 5' of APP exon 14 comprising any or all of SEQ ID NO: 488-500 (or any allele variant of SEQ ID NO: 488-500), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該APP外顯子14之對偶基因變異體與SEQ ID NO: 501有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the APP exon 14 allele variant is at least 85% identical to SEQ ID NO: 501, preferably at least 90% identical, 92%, 94%, 96% or more identical to SEQ ID NO: 501 Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,APP之外顯子14的部分係包含或根據SEQ ID NO: 484,且其對偶基因變異體與SEQ ID NO: 484有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與APP的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 485之序列,且其對偶基因變異體與SEQ ID NO: 485有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,APP之外顯子14的部分包含來自SEQ ID NO: 484之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置54的核酸。連續核酸的數目可為SEQ ID NO: 484之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置54的核酸。更佳地,APP之外顯子14的部分係包含或根據SEQ ID NO: 484或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測APP與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 14 of APP comprises or is according to SEQ ID NO: 484, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 484 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with APP is preferably or comprises a sequence such as SEQ ID NO: 485, and its allele variant has at least 85% identity with SEQ ID NO: 485, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 14 of APP comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 484, and At least the nucleic acid at position 54 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 54 nucleic acid. More preferably, the portion of APP exon 14 comprises or is according to SEQ ID NO: 484 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between APP and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,APP外顯子14的部分包含SEQ ID NO: 501(或SEQ ID NO: 501之對偶基因變異體)或由其組成,至少包括位置54的核酸。較佳地,APP外顯子14的部分包含來自SEQ ID NO: 501或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置54的核酸。連續核酸的數目可為SEQ ID NO: 501之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置54的核酸。在此替代例中,APP外顯子14的部分更佳地包含或根據SEQ ID NO: 501或其對偶基因變異體。較佳地,在與APP的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of APP exon 14 comprises or consists of SEQ ID NO: 501 (or an allele variant of SEQ ID NO: 501), including at least the nucleic acid at position 54. Preferably, the portion of APP exon 14 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 501 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 54. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 54 nucleic acid. In this alternative, the portion of APP exon 14 preferably comprises or is according to SEQ ID NO: 501 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with APP comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一APP-NRG1多核苷酸融合物為APP與NRG1之符合讀框的融合。更佳地所述融合物為包含有APP之外顯子14或外顯子14的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 486或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 486有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any APP-NRG1 polynucleotide fusion provided herein is an in-frame fusion of APP and NRG1. More preferably, the fusion is an in-frame fusion comprising APP exon 14 or a part of exon 14 and NRG1 exon 6 or a part of exon 6. The in-frame fusion is preferably a fusion of SEQ ID NO: 486 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 486, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有APP外顯子14的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 486之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置54及55的核酸。連續核酸的數目可為SEQ ID NO: 486之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置54及55的核酸。SEQ ID NO: 486包括介於APP與NRG1之間的接合處,尤其該接合處係介於源於APP之位置54的核酸與源於NRG1之位置54的核酸之間。較佳地,該包含有APP外顯子14的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 486之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of APP exon 14 fused to a portion of NRG1 exon 6 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 54 and 55. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 54 And 55 nucleic acids. SEQ ID NO: 486 includes a junction between APP and NRG1, in particular the junction is between the nucleic acid at position 54 from APP and the nucleic acid at position 54 from NRG1. Preferably, the polynucleotide comprising a part of APP exon 14 fused with a part of NRG1 exon 6 has the polynucleotide sequence of SEQ ID NO: 486 or its allele variant.
在一較佳實施例中,提供了一種根據SEQ ID NO: 486之多核苷酸,或一種包含了來自SEQ ID NO: 486之約20、約30、約40個或全部的連續核酸且較佳地包括了位置54及55的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 486之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置54及55的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 486, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 486 and preferably A polynucleotide that includes the nucleic acids at positions 54 and 55. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 54 and 55.
較佳地,該包含有APP外顯子14(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及APP或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於APP與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of APP exon 14 (or its allele variant) fused to a part of NRG1 exon 6 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving APP or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between APP and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該APP外顯子14(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有APP的N端和NRG1的C端。此外,本文所提供之APP-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自APP的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該APP-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 WRN-NRG1 多核苷酸融合物 Preferably, the APP exon 14 (or its allogeneic variant) part is located at the 5' of NRG1 exon 6 (or its allogeneic variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of APP and the C-terminus of NRG1. In addition, the APP-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from APP and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The APP-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. WRN-NRG1 polynucleotide fusion
亦提供了一種包含有WRN之外顯子33的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。WRN之外顯子33較佳為SEQ ID NO: 562或SEQ ID NO: 562之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of WRN exon 33 fused to a portion of NRG1 exon 6. WRN exon 33 is preferably the exon of SEQ ID NO: 562 or an allogenic variant of SEQ ID NO: 562, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括WRN外顯子33之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 562及130可能就足夠了。來自WRN外顯子33之5’處的任一序列包含SEQ ID NO: 530-561(或SEQ ID NO: 530-561之任一對偶基因變異體)中之任一者或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of WRN exon 33 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 562 and 130 may be sufficient. Any sequence from 5' of WRN exon 33 comprising any or all of SEQ ID NO: 530-561 (or any allele variant of SEQ ID NO: 530-561), or derived from its composition, and any sequence from 3' of exon 6 of NRG1 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is composed of it.
較佳地,該WRN外顯子33之對偶基因變異體與SEQ ID NO: 562有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of WRN exon 33 is at least 85% identical to SEQ ID NO: 562, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,WRN之外顯子33的部分係包含或根據SEQ ID NO: 526,且其對偶基因變異體與SEQ ID NO: 526有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與WRN的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 527之序列,且其對偶基因變異體與SEQ ID NO: 527有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,WRN之外顯子33的部分包含來自SEQ ID NO: 526之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置96的核酸。連續核酸的數目可為SEQ ID NO: 526之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置96的核酸。更佳地,WRN之外顯子33的部分係包含或根據SEQ ID NO: 526或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測WRN與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of WRN exon 33 comprises or is according to SEQ ID NO: 526, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 526 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with WRN is preferably or comprises a sequence such as SEQ ID NO: 527, and its allele variant has at least 85% identity with SEQ ID NO: 527, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 33 of WRN comprises or consists of contiguous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 526, and At least the nucleic acid at position 96 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 96 nucleic acid. More preferably, the portion of WRN exon 33 comprises or is according to SEQ ID NO: 526 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between WRN and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,WRN外顯子33的部分包含SEQ ID NO: 562(或SEQ ID NO: 562之對偶基因變異體)或由其組成,至少包括位置163的核酸。較佳地,WRN外顯子33的部分包含來自SEQ ID NO: 562或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置163的核酸。連續核酸的數目可為SEQ ID NO: 562之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置163的核酸。在此替代例中,WRN外顯子33的部分更佳地包含或根據SEQ ID NO: 562或其對偶基因變異體。較佳地,在與WRN的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of WRN exon 33 comprises or consists of SEQ ID NO: 562 (or an allele variant of SEQ ID NO: 562), including at least the nucleic acid at position 163. Preferably, the portion of WRN exon 33 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 562 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 163. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 163 nucleic acid. In this alternative, the part of WRN exon 33 preferably comprises or is according to SEQ ID NO: 562 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with WRN comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一WRN-NRG1多核苷酸融合物為WRN與NRG1之符合讀框的融合。更佳地所述融合物為包含有WRN之外顯子33或外顯子33的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 528或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 528有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any WRN-NRG1 polynucleotide fusion provided herein is an in-frame fusion of WRN and NRG1. More preferably, the fusion is an in-frame fusion comprising WRN exon 33 or a part of exon 33 and NRG1 exon 6 or a part of exon 6. The in-frame fusion is preferably a fusion of SEQ ID NO: 528 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 528, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有WRN外顯子33的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 528之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置96及97的核酸。連續核酸的數目可為SEQ ID NO: 528之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置96及97的核酸。SEQ ID NO: 528包括介於WRN與NRG1之間的接合處,尤其該接合處係介於源於WRN之位置96的核酸與源於NRG1之位置97的核酸之間。較佳地,該包含有WRN外顯子33的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 528之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of WRN exon 33 fused to a portion of NRG1 exon 6 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 96 and 97. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 96 And 97 nucleic acids. SEQ ID NO: 528 includes a junction between WRN and NRG1, in particular the junction is between a nucleic acid derived from position 96 of WRN and a nucleic acid derived from position 97 of NRG1. Preferably, the polynucleotide comprising a part of WRN exon 33 fused with a part of NRG1 exon 6 has the polynucleotide sequence of SEQ ID NO: 528 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 528之多核苷酸,或一種包含了來自SEQ ID NO: 528之約20、約30、約40個或全部的連續核酸且較佳地包括了位置96及97的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 528之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置96及97的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 528, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 528 and preferably A polynucleotide that includes the nucleic acids at positions 96 and 97. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 96 and 97.
較佳地,該包含有WRN外顯子33(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及WRN或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於WRN與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of WRN exon 33 (or its allele variant) fused to a part of NRG1 exon 6 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving WRN or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between WRN and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該WRN外顯子33(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有WRN的N端和NRG1的C端。此外,本文所提供之WRN-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自WRN的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該WRN-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是乳癌。 DAAM1-NRG1 多核苷酸融合物 Preferably, the part of WRN exon 33 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of WRN and the C-terminus of NRG1. In addition, the WRN-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from WRN and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The WRN-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially breast cancer. DAAM1-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有DAAM1核酸序列(或DAAM1核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了DAAM1及NRG1核酸序列的對偶基因變異體。According to the present disclosure, there is also provided a polynucleotide comprising a fusion of DAAM1 nucleic acid sequence (or a part of DAAM1 nucleic acid sequence) and NRG1 nucleic acid sequence (or a part of NRG1 nucleic acid sequence). Allele variants of DAAM1 and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該DAAM1核酸序列(或其部分)包含SEQ ID NO: 606-631中之任一者或SEQ ID NO: 606-631中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該DAAM1核酸序列包含SEQ ID NO: 631或SEQ ID NO: 631的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 606-630分別對應根據NM_001270520.2之DAAM1的個別外顯子1-25。SEQ ID NO: 631對應根據NM_001270520.2之DAAM1的外顯子1-25。SEQ ID NO: 125-137分別對應根據NM_001159999.3之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999.3之NRG1的外顯子1-13。Preferably, the DAAM1 nucleic acid sequence (or part thereof) comprises, or consists of, any one of SEQ ID NOs: 606-631 or an allele variant of any one of SEQ ID NOs: 606-631 , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the DAAM1 nucleic acid sequence comprises SEQ ID NO: 631 or a part of the allele variant of SEQ ID NO: 631, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 606-630 correspond to individual exons 1-25 of DAAM1 according to NM_001270520.2, respectively. SEQ ID NO: 631 corresponds to exons 1-25 of DAAM1 according to NM_001270520.2. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999.3, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.3.
在一較佳實施例中,該DAAM1核酸序列部分包含來自SEQ ID NO: 606-631中之任一者(或SEQ ID NO: 606-631中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the DAAM1 nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
較佳地,該DAAM1核酸序列之對偶基因變異體與SEQ ID NO: 606-631中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of the DAAM1 nucleic acid sequence is at least 85% identical to any one of SEQ ID NO: 606-631, preferably at least 90% identical, and 92%, 94% identical to any one of SEQ ID NO: 606-631 , 96% or better at least 98% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, preferably at least 90% identity, 92%, 94%, 96% or better at least 98% sequence identity thereto.
較佳地,該DAAM1核酸序列或其部分係位於NRG1核酸序列或其部分的5’處。Preferably, the DAAM1 nucleic acid sequence or part thereof is located 5' to the NRG1 nucleic acid sequence or part thereof.
較佳地,該包含有DAAM1之DAAM1核酸序列(或所述序列的一部分)與NRG1核酸序列(或所述序列的一部分)融合之多核苷酸係包含或編碼NRG1的EGF樣結構域。包含有該DAAM1-NRG1多核苷酸融合物之異常細胞係包含或編碼NRG1的EGF樣結構域。為了檢測、診斷或鑑定之目的,只需證明介於DAAM1與NRG1之間的融合接合處是符合讀框的並且發生在可使得所得融合核酸編碼NRG1的EGF樣結構域的位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising DAAM1 nucleic acid sequence (or a part of said sequence) fused with NRG1 nucleic acid sequence (or a part of said sequence) of DAAM1 comprises or encodes the EGF-like domain of NRG1. Abnormal cell lines containing the DAAM1-NRG1 polynucleotide fusion contain or encode the EGF-like domain of NRG1. For detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between DAAM1 and NRG1 is in-frame and occurs at a position such that the resulting fusion nucleic acid encodes the EGF-like domain of NRG1. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
亦提供了一種包含有DAAM1之外顯子1的一部分與NRG1之外顯子1的一部分融合之多核苷酸融合物。DAAM1之外顯子1較佳為SEQ ID NO: 606或SEQ ID NO: 606之對偶基因變異體的外顯子,而NRG1之外顯子1較佳為SEQ ID NO: 125或SEQ ID NO: 125之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 1 of DAAM1 fused to a portion of exon 1 of NRG1. DAAM1 exon 1 is preferably the exon of SEQ ID NO: 606 or an allogenic variant of SEQ ID NO: 606, and NRG1 exon 1 is preferably SEQ ID NO: 125 or SEQ ID NO: Exons of 125 allele variants.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括DAAM1外顯子1之5’處的任一序列以及NRG1外顯子1之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 606及125可能就足夠了。來自DAAM1外顯子1之5’處的任一序列包含SEQ ID NO: 606(或SEQ ID NO: 606之任一對偶基因變異體)或是由其組成,而來自NRG1外顯子1之3’處的任一序列包含SEQ ID NO: 126-137(或SEQ ID NO: 126-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of DAAM1 exon 1 and any sequence at the 3' of NRG1 exon 1 sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 606 and 125 may be sufficient. Any sequence from 5' of DAAM1 exon 1 comprising or consisting of SEQ ID NO: 606 (or any allele variant of SEQ ID NO: 606), and from 3 of NRG1 exon 1 Any sequence at ' comprises one or all of SEQ ID NO: 126-137 (or any allele variant of SEQ ID NO: 126-137), or consists of it.
較佳地,該DAAM1外顯子1之對偶基因變異體與SEQ ID NO: 606有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子1之對偶基因變異體與SEQ ID NO: 125有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of DAAM1 exon 1 is at least 85% identical to SEQ ID NO: 606, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 1 has at least 85% identity to SEQ ID NO: 125, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,DAAM1之外顯子1的部分係包含或根據SEQ ID NO: 603,且其對偶基因變異體與SEQ ID NO: 603有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與DAAM1的融合物中之NRG1之外顯子1的部分較佳為或包含如SEQ ID NO: 604之序列,且其對偶基因變異體與SEQ ID NO: 604有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,DAAM1之外顯子1的部分包含來自SEQ ID NO: 603之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 603之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,DAAM1之外顯子1的部分係包含或根據SEQ ID NO: 603或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測DAAM1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 1 of DAAM1 comprises or is according to SEQ ID NO: 603, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 603 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 1 of NRG1 in the fusion with DAAM1 is preferably or comprises a sequence such as SEQ ID NO: 604, and its allele variant has at least 85% identity with SEQ ID NO: 604, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 1 of DAAM1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 603, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of exon 1 of DAAM1 comprises or is according to SEQ ID NO: 603 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between DAAM1 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,DAAM1外顯子1的部分包含SEQ ID NO: 606(或SEQ ID NO: 606之對偶基因變異體)或由其組成,至少包括位置102的核酸。較佳地,DAAM1外顯子1的部分包含來自SEQ ID NO: 606或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置102的核酸。連續核酸的數目可為SEQ ID NO: 606之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置102的核酸。在此替代例中,DAAM1外顯子1的部分更佳地包含或根據SEQ ID NO: 606或其對偶基因變異體。較佳地,在與DAAM1的融合物中之NRG1外顯子1的部分包含來自SEQ ID NO: 125或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 125之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of DAAM1 exon 1 comprises or consists of SEQ ID NO: 606 (or an allele variant of SEQ ID NO: 606), including at least the nucleic acid at position 102. Preferably, the portion of DAAM1 exon 1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 606 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 102. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 102 nucleic acid. In this alternative, the portion of DAAM1 exon 1 preferably comprises or is according to SEQ ID NO: 606 or an allegenic variant thereof. Preferably, the portion of exon 1 of NRG1 in the fusion with DAAM1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有DAAM1外顯子1的一部分與NRG1外顯子1的一部分融合之多核苷酸係包含來自SEQ ID NO: 605之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 605之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 605包括介於DAAM1與NRG1之間的接合處,尤其該接合處係介於源於DAAM1之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有DAAM1外顯子1的一部分與NRG1外顯子1的一部分融合之多核苷酸係具有SEQ ID NO: 605之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of DAAM1 exon 1 fused to a portion of NRG1 exon 1 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 605 includes a junction between DAAM1 and NRG1, in particular the junction is between the nucleic acid derived from position 75 of DAAM1 and the nucleic acid derived from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 1 of DAAM1 fused with a part of exon 1 of NRG1 has a polynucleotide sequence of SEQ ID NO: 605 or an allele variant thereof.
較佳地,本文提供之任一DAAM1-NRG1多核苷酸融合物為DAAM1之未轉譯區域的一部分與NRG1之未轉譯區域的一部分之融合物。更佳地,所述融合物為一種包含有DAAM1之外顯子1(或外顯子1的一部分)與NRG1之外顯子1(或NRG1外顯子1的一部分)的融合物。所述融合物較佳為SEQ ID NO: 605或其對偶基因變異體之融合物,且與SEQ ID NO: 605有至少85%的一致性,較佳地與其至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the DAAM1-NRG1 polynucleotide fusions provided herein is a fusion of a portion of the untranslated region of DAAM1 with a portion of the untranslated region of NRG1. More preferably, the fusion is a fusion comprising DAAM1 exon 1 (or a part of exon 1) and NRG1 exon 1 (or a part of NRG1 exon 1). The fusion is preferably a fusion of SEQ ID NO: 605 or an allele variant thereof, and is at least 85% identical to SEQ ID NO: 605, preferably at least 90%, 92%, 94% identical thereto , 96% or even 98% agreement.
較佳地,DAAM1外顯子1(或其對偶基因變異體)的部分係位於NRG1外顯子1(或其對偶基因變異體)的5’處。該NRG1蛋白因而變成了位在DAAM1啟動子的下游並且期望能在轉錄上受所述啟動子的控制。所得融合物因而導致了非蛋白質融合的NRG1蛋白質表現,且因此含有EGF樣結構域。隨之而來的是驅動一小群人類癌症增生及存活的能力,尤其是乳癌。 ASPH-NRG1 多核苷酸融合物 Preferably, the part of DAAM1 exon 1 (or its allegenic variant) is located 5' to NRG1 exon 1 (or its allegene variant). The NRG1 protein thus becomes downstream of the DAAM1 promoter and is expected to be transcriptionally controlled by said promoter. The resulting fusion thus results in expression of the NRG1 protein that is not a protein fusion, and thus contains an EGF-like domain. With it came the ability to drive the proliferation and survival of a small group of human cancers, especially breast cancer. ASPH-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有ASPH核酸序列(或ASPH核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了ASPH及NRG1核酸序列的對偶基因變異體。According to the disclosure, there is also provided a polynucleotide comprising an ASPH nucleic acid sequence (or a part of the ASPH nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). Allele variants of ASPH and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該ASPH核酸序列(或其部分)包含SEQ ID NO: 637-662中之任一者或SEQ ID NO: 637-662中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該ASPH核酸序列包含SEQ ID NO: 662或SEQ ID NO: 662的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 637-661分別對應根據NM_001164750.2之ASPH的個別外顯子1-25。SEQ ID NO: 662對應根據NM_001164750.2之ASPH的外顯子1-25。SEQ ID NO: 125-137分別對應根據NM_001159999.3之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999.3之NRG1的外顯子1-13。Preferably, the ASPH nucleic acid sequence (or part thereof) comprises, or consists of, any one of SEQ ID NOs: 637-662 or an allele variant of any one of SEQ ID NOs: 637-662 , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the ASPH nucleic acid sequence comprises SEQ ID NO: 662 or a part of the allele variant of SEQ ID NO: 662, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 637-661 correspond to individual exons 1-25 of ASPH according to NM_001164750.2, respectively. SEQ ID NO: 662 corresponds to exons 1-25 of ASPH according to NM_001164750.2. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999.3, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.3.
在一較佳實施例中,該ASPH核酸序列部分包含來自SEQ ID NO: 637-662中之任一者(或SEQ ID NO: 637-662中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the ASPH nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
較佳地,該ASPH核酸序列之對偶基因變異體與SEQ ID NO: 637-662中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of the ASPH nucleic acid sequence is at least 85% identical to any one of SEQ ID NO: 637-662, preferably at least 90% identical, and 92%, 94% identical to any one of SEQ ID NO: 637-662 , 96% or better at least 98% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, preferably at least 90% identity, 92%, 94%, 96% or better at least 98% sequence identity thereto.
較佳地,該ASPH核酸序列或其部分係位於NRG1核酸序列或其部分的5’處。Preferably, the ASPH nucleic acid sequence or part thereof is located 5' to the NRG1 nucleic acid sequence or part thereof.
較佳地,該包含有ASPH之ASPH核酸序列(或所述序列的一部分)與NRG1核酸序列(或所述序列的一部分)融合之多核苷酸係包含或編碼NRG1的EGF樣結構域。包含有該ASPH-NRG1多核苷酸融合物或表現該多肽融合物之異常細胞係包含或編碼NRG1的EGF樣結構域。為了檢測、診斷或鑑定之目的,只需證明介於ASPH與NRG1之間的融合接合處是符合讀框的並且發生在可使得所得融合產物、核酸或蛋白質包含NRG1的EGF樣結構域的位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising ASPH nucleic acid sequence (or a part of said sequence) fused to NRG1 nucleic acid sequence (or a part of said sequence) comprises or encodes the EGF-like domain of NRG1. Abnormal cell lines containing the ASPH-NRG1 polynucleotide fusion or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For the purposes of detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between ASPH and NRG1 is in-frame and occurs at a position such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1. Can. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
亦提供了一種包含有ASPH之外顯子22的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。ASPH之外顯子22較佳為SEQ ID NO: 658或SEQ ID NO: 658之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of ASPH exon 22 fused to a portion of NRG1 exon 2. ASPH exon 22 is preferably the exon of SEQ ID NO: 658 or an allogenic variant of SEQ ID NO: 658, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括ASPH外顯子22之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 658及126可能就足夠了。來自ASPH外顯子22之5’處的任一序列包含SEQ ID NO: 637-657(或SEQ ID NO: 637-657之任一對偶基因變異體)中之一或全部,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence 5' to ASPH exon 22 and any sequence 3' to NRG1 exon 2. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 658 and 126 may be sufficient. Any sequence from the 5' of ASPH exon 22 comprises one or all of SEQ ID NO: 637-657 (or any allele variant of SEQ ID NO: 637-657), while the sequence from NRG1 exon Any sequence at the 3' of sub2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it.
較佳地,該ASPH外顯子22之對偶基因變異體與SEQ ID NO: 658有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of ASPH exon 22 is at least 85% identical to SEQ ID NO: 658, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,ASPH之外顯子22的部分係包含或根據SEQ ID NO: 633,且其對偶基因變異體與SEQ ID NO: 633有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與ASPH的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 634之序列,且其對偶基因變異體與SEQ ID NO: 634有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,ASPH之外顯子22的部分包含來自SEQ ID NO: 633之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 633之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,ASPH之外顯子22的部分係包含或根據SEQ ID NO: 633或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測ASPH與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the portion of exon 22 of ASPH comprises or is according to SEQ ID NO: 633, and the allele variant thereof is at least 85% identical, preferably at least 90% identical to SEQ ID NO: 633 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with ASPH is preferably or comprises a sequence such as SEQ ID NO: 634, and its allele variant has at least 85% identity with SEQ ID NO: 634, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of ASPH exon 22 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 633, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the portion of exon 22 of ASPH comprises or is according to SEQ ID NO: 633 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between ASPH and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,ASPH外顯子22的部分包含SEQ ID NO: 658(或SEQ ID NO: 658之對偶基因變異體)或由其組成,至少包括位置136的核酸。較佳地,ASPH外顯子22的部分包含來自SEQ ID NO: 658或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置136的核酸。連續核酸的數目可為SEQ ID NO: 658之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置136的核酸。在此替代例中,ASPH外顯子22的部分更佳地包含或根據SEQ ID NO: 658或其對偶基因變異體。較佳地,在與ASPH的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of ASPH exon 22 comprises or consists of SEQ ID NO: 658 (or an allele variant of SEQ ID NO: 658), including at least the nucleic acid at position 136. Preferably, the portion of ASPH exon 22 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 658 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 136. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 136 nucleic acid. In this alternative, the portion of ASPH exon 22 preferably comprises or is according to SEQ ID NO: 658 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with ASPH comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有ASPH外顯子22的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 635之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 635之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 635包括介於ASPH與NRG1之間的接合處,尤其該接合處係介於源於ASPH之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有ASPH外顯子22的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 635之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of ASPH exon 22 fused to a portion of NRG1 exon 2 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 635 includes a junction between ASPH and NRG1, in particular the junction is between a nucleic acid derived from position 75 of ASPH and a nucleic acid derived from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 22 of ASPH fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 635 or an allele variant thereof.
較佳地,本文所提供之任一ASPH-NRG1多核苷酸融合物為ASPH與NRG1之符合讀框的融合。更佳地所述融合物為包含有ASPH之外顯子22或外顯子22的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 635或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 635有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any ASPH-NRG1 polynucleotide fusion provided herein is an in-frame fusion of ASPH and NRG1. More preferably, the fusion is an in-frame fusion comprising ASPH exon 22 or a part of exon 22 and NRG1 exon 2 or a part of exon 2. The in-frame fusion is preferably a fusion of SEQ ID NO: 635 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 635, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
較佳地,ASPH外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有ASPH的N端和NRG1的C端。此外,本文所提供之ASPH-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自ASPH的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該ASPH-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是大腸直腸腺癌。 NOTCH2-NRG1 多核苷酸融合物 Preferably, the portion of ASPH exon 2 (or its allegenic variant) is located 5' to NRG1 exon 2 (or its allelic variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of ASPH and the C-terminus of NRG1. In addition, the ASPH-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from ASPH and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The ASPH-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinomas, more particularly colorectal adenocarcinomas. NOTCH2-NRG1 polynucleotide fusion
亦提供了一種包含有NOTCH2之外顯子6的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。NOTCH2之外顯子6較佳為SEQ ID NO: 700或SEQ ID NO: 700之對偶基因變異體的外顯子,而NRG1之外顯子1較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 6 of NOTCH2 fused to a portion of exon 6 of NRG1. Exon 6 of NOTCH2 is preferably the exon of SEQ ID NO: 700 or an allogenic variant of SEQ ID NO: 700, and exon 1 of NRG1 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括NOTCH2外顯子6之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 700及130可能就足夠了。來自NOTCH2外顯子6之5’處的任一序列包含SEQ ID NO: 695-699(或SEQ ID NO: 695-699之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of NOTCH2 exon 6 and any sequence at the 3' of NRG1 exon 6 sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 700 and 130 may be sufficient. Any sequence from the 5' of NOTCH2 exon 6 comprising or consisting of one or all of SEQ ID NO: 695-699 (or any allele variant of SEQ ID NO: 695-699) , and any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is derived from its composition.
較佳地,該NOTCH2外顯子6之對偶基因變異體與SEQ ID NO: 700有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of NOTCH2 exon 6 is at least 85% identical to SEQ ID NO: 700, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,NOTCH2之外顯子6的部分係包含或根據SEQ ID NO: 691,且其對偶基因變異體與SEQ ID NO: 691有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與NOTCH2的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 692之序列,且其對偶基因變異體與SEQ ID NO: 692有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,NOTCH2之外顯子6的部分包含來自SEQ ID NO: 691之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 691之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,NOTCH2之外顯子6的部分係包含或根據SEQ ID NO: 691或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測NOTCH2與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the portion of exon 6 of NOTCH2 comprises or is according to SEQ ID NO: 691, and the allele variant thereof is at least 85% identical, preferably at least 90% identical to SEQ ID NO: 691 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with NOTCH2 is preferably or comprises a sequence such as SEQ ID NO: 692, and its allele variant has at least 85% identity with SEQ ID NO: 692, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 6 of NOTCH2 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 691, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the portion of exon 6 of NOTCH2 comprises or is according to SEQ ID NO: 691 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between NOTCH2 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,NOTCH2外顯子6的部分包含SEQ ID NO: 700(或SEQ ID NO: 700之對偶基因變異體)或由其組成,至少包括位置234的核酸。較佳地,NOTCH2外顯子6的部分包含來自SEQ ID NO: 700或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置234的核酸。連續核酸的數目可為SEQ ID NO: 700之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置234的核酸。在此替代例中,NOTCH2外顯子6的部分更佳地包含或根據SEQ ID NO: 700或其對偶基因變異體。較佳地,在與NOTCH2的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of NOTCH2 exon 6 comprises or consists of SEQ ID NO: 700 (or an allele variant of SEQ ID NO: 700), including at least the nucleic acid at position 234. Preferably, the portion of NOTCH2 exon 6 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 700 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 234. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 234 nucleic acid. In this alternative, the portion of NOTCH2 exon 6 preferably comprises or is according to SEQ ID NO: 700 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with NOTCH2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一NOTCH2-NRG1多核苷酸融合物為NOTCH2與NRG1之符合讀框的融合。更佳地所述融合物為包含有NOTCH2之外顯子6或外顯子6的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 693或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 693有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any NOTCH2-NRG1 polynucleotide fusion provided herein is an in-frame fusion of NOTCH2 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 6 or a part of exon 6 of NOTCH2 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 693 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 693, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有NOTCH2外顯子6的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 693之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 693之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 693包括介於NOTCH2與NRG1之間的接合處,尤其該接合處係介於源於NOTCH2之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有NOTCH2外顯子6的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 693之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of NOTCH2 exon 6 fused to a portion of NRG1 exon 6 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 693 includes the junction between NOTCH2 and NRG1, in particular the junction is between the nucleic acid from position 75 of NOTCH2 and the nucleic acid from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 6 of NOTCH2 fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 693 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 693之多核苷酸,或一種包含了來自SEQ ID NO: 693之約20、約30、約40個或全部的連續核酸且較佳地包括了位置75及76的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 693之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 693, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 693 and preferably A polynucleotide that includes the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有NOTCH2外顯子6(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及NOTCH2或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於NOTCH2與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of NOTCH2 exon 6 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving NOTCH2 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For the purposes of rapid detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between NOTCH2 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該NOTCH2外顯子6(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有NOTCH2的N端和NRG1的C端。此外,本文所提供之NOTCH2-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自NOTCH2的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該NOTCH2-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。 CD74-NRG1 多核苷酸融合物 Preferably, the portion of NOTCH2 exon 6 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of NOTCH2 and the C-terminus of NRG1. In addition, the NOTCH2-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from NOTCH2 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The NOTCH2-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma. CD74-NRG1 polynucleotide fusion
亦提供了一種包含有CD74之外顯子2的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。CD74之外顯子2較佳為SEQ ID NO: 720或SEQ ID NO: 720之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 2 of CD74 fused to a portion of exon 2 of NRG1. CD74 exon 2 is preferably the exon of SEQ ID NO: 720 or an allogenic variant of SEQ ID NO: 720, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括CD74外顯子2之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 720及126可能就足夠了。來自CD74外顯子2之5’處的任一序列包含SEQ ID NO: 719(或SEQ ID NO: 719之任一對偶基因變異體)或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of exon 2 of CD74 and any sequence at the 3' of exon 2 of NRG1 sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 720 and 126 may be sufficient. Any sequence from 5' of exon 2 of CD74 comprising or consisting of SEQ ID NO: 719 (or any allele variant of SEQ ID NO: 719), and from 3 of NRG1 exon 2 Any sequence at ' comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it.
較佳地,該CD74外顯子2之對偶基因變異體與SEQ ID NO: 720有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of CD74 exon 2 is at least 85% identical to SEQ ID NO: 720, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,CD74之外顯子2的部分係包含或根據SEQ ID NO: 715,且其對偶基因變異體與SEQ ID NO: 715有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與CD74的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 716之序列,且其對偶基因變異體與SEQ ID NO: 716有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,CD74之外顯子2的部分包含來自SEQ ID NO: 715之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 715之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,CD74之外顯子2的部分係包含或根據SEQ ID NO: 715或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測CD74與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 2 of CD74 comprises or is according to SEQ ID NO: 715, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 715 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with CD74 is preferably or comprises a sequence such as SEQ ID NO: 716, and its allele variant has at least 85% identity with SEQ ID NO: 716, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 2 of CD74 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 715, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the portion of exon 2 of CD74 comprises or is according to SEQ ID NO: 715 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between CD74 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,CD74外顯子2的部分包含SEQ ID NO: 720(或SEQ ID NO: 720之對偶基因變異體)或由其組成,至少包括位置173的核酸。較佳地,CD74外顯子2的部分包含來自SEQ ID NO: 720或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置173的核酸。連續核酸的數目可為SEQ ID NO: 720之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置173的核酸。在此替代例中,CD74外顯子2的部分更佳地包含或根據SEQ ID NO: 720或其對偶基因變異體。較佳地,在與CD74的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of CD74 exon 2 comprises or consists of SEQ ID NO: 720 (or an allele variant of SEQ ID NO: 720), including at least the nucleic acid at position 173. Preferably, the portion of CD74 exon 2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 720 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 173. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 173 nucleic acid. In this alternative, the portion of CD74 exon 2 preferably comprises or is according to SEQ ID NO: 720 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with CD74 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一CD74-NRG1多核苷酸融合物為CD74與NRG1之符合讀框的融合。更佳地所述融合物為包含有CD74之外顯子2或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 717或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 717有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any CD74-NRG1 polynucleotide fusion provided herein is an in-frame fusion of CD74 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 2 or part of exon 2 of CD74 and exon 2 or part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 717 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 717, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有CD74外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 717之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置53及54的核酸。連續核酸的數目可為SEQ ID NO: 7之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 717包括介於CD74與NRG1之間的接合處,尤其該接合處係介於源於CD74之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有CD74外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 717之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of exon 2 of CD74 fused to a portion of exon 2 of NRG1 comprises sequences from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 717 includes a junction between CD74 and NRG1, in particular the junction is between the nucleic acid derived from position 75 of CD74 and the nucleic acid derived from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 2 of CD74 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 717 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 717之多核苷酸,或一種包含了來自SEQ ID NO: 717之約20、約30、約40個或全部的連續核酸且較佳地包括了位置75及76的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 717之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 717, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 717 and preferably A polynucleotide that includes the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有CD74外顯子2(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及CD74或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於CD74與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of exon 2 of CD74 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving CD74 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between CD74 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該CD74外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有CD74的N端和NRG1的C端。此外,本文所提供之CD74-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自CD74的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該CD74-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是肺癌。 SDC4-NRG1 多核苷酸融合物 Preferably, the portion of exon 2 of CD74 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of CD74 and the C-terminus of NRG1. In addition, the CD74-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from CD74 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The CD74-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially lung cancer. SDC4-NRG1 polynucleotide fusion
亦提供了一種包含有SDC4之外顯子2的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。所述SDC4之外顯子2較佳為SEQ ID NO: 746或SEQ ID NO: 746之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 2 of SDC4 fused to a portion of exon 2 of NRG1. The SDC4 exon 2 is preferably the exon of SEQ ID NO: 746 or the allele variant of SEQ ID NO: 746, and the NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: Exon of the allele variant of 126.
較佳地,所述SDC4外顯子2之對偶基因變異體與SEQ ID NO: 746有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of SDC4 exon 2 has at least 85% identity to SEQ ID NO: 746, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,所述SDC4之外顯子2的部分係包含或根據SEQ ID NO: 741,且其對偶基因變異體與SEQ ID NO: 741有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與SDC4的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 742之序列,且其對偶基因變異體與SEQ ID NO: 742有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,SDC4之外顯子2的部分包含來自SEQ ID NO: 741之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 741之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,所述SDC4之外顯子2的部分係包含或根據SEQ ID NO: 741或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測SDC4與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 2 of SDC4 comprises or is according to SEQ ID NO: 741, and its allele variant has at least 85% identity with SEQ ID NO: 741, preferably at least 90% 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. The part of exon 2 of NRG1 in the fusion with SDC4 is preferably or comprises a sequence such as SEQ ID NO: 742, and its allele variant has at least 85% identity with SEQ ID NO: 742, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 2 of SDC4 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 741, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of exon 2 of SDC4 comprises or is according to SEQ ID NO: 741 or its allele variant. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between SDC4 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,所述SDC4外顯子2的部分包含SEQ ID NO: 746(或SEQ ID NO: 746之對偶基因變異體)或由其組成,至少包括位置139的核酸。較佳地,所述SDC4外顯子2的部分包含來自SEQ ID NO: 746或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置139的核酸。連續核酸的數目可為SEQ ID NO: 746之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置139的核酸。在此替代例中,SDC4外顯子2的部分更佳地包含或根據SEQ ID NO: 746或其對偶基因變異體。較佳地,在與SDC4的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, said portion of SDC4 exon 2 comprises or consists of SEQ ID NO: 746 (or an allele variant of SEQ ID NO: 746), at least including the nucleic acid at position 139. Preferably, the portion of the SDC4 exon 2 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 746 or its allele variant or consists of and includes at least the nucleic acid at position 139. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 139 nucleic acid. In this alternative, the part of SDC4 exon 2 preferably comprises or is according to SEQ ID NO: 746 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with SDC4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一SDC4-NRG1多核苷酸融合物為SDC4與NRG1之符合讀框的融合。更佳地所述融合物為包含有SDC4之外顯子2或外顯子2的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 743或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 743有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the SDC4-NRG1 polynucleotide fusions provided herein is an in-frame fusion of SDC4 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 2 or part of exon 2 of SDC4 and exon 2 or part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 743 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 743, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有所述SDC4外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 743之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 743之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 743包括介於SDC4與NRG1之間的接合處,尤其該接合處係介於源於SDC4之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有SDC4外顯子2的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 743之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 2 of SDC4 fused with a part of exon 2 of NRG1 comprises from 2 to about 10 of SEQ ID NO: 743, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 743 includes the junction between SDC4 and NRG1, in particular the junction is between the nucleic acid from position 75 of SDC4 and the nucleic acid from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 2 of SDC4 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 743 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 743之多核苷酸,或一種包含了來自SEQ ID NO: 743之約20、約30、約40個或全部的連續核酸且較佳地包括了位置75及76的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 743之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 743, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 743 and preferably A polynucleotide that includes the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有SDC4外顯子2(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及所述SDC4或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於SDC4與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of SDC4 exon 2 (or its allele variant) fused to a part of NRG1 exon 2 (or its allele variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Abnormal cell lines containing polynucleotide fusions involving said SDC4 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between SDC4 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該SDC4外顯子2(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有SDC4的N端和NRG1的C端。此外,本文所提供之SDC4-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自SDC4的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該SDC4-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是肺癌。Preferably, the part of exon 2 of SDC4 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of SDC4 and the C-terminus of NRG1. In addition, the SDC4-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from SDC4 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The SDC4-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially lung cancer.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括SDC4外顯子2之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 746及126可能就足夠了。來自SDC4外顯子2之5’處的任一序列包含SEQ ID NO: 745(或SEQ ID NO: 745之任一對偶基因變異體)或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of exon 2 of SDC4 and any sequence at the 3' of exon 2 of NRG1 sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 746 and 126 may be sufficient. Any sequence from 5' of exon 2 of SDC4 comprising or consisting of SEQ ID NO: 745 (or any allele variant of SEQ ID NO: 745), and from 3 of exon 2 of NRG1 Any sequence at ' comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or consists of it.
再者,提供了一種包含有SDC4之外顯子4的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。所述SDC4之外顯子4較佳為SEQ ID NO: 748或SEQ ID NO: 748之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Furthermore, a polynucleotide fusion comprising a portion of exon 4 of SDC4 fused to a portion of exon 2 of NRG1 is provided. The SDC4 exon 4 is preferably the exon of SEQ ID NO: 748 or the allele variant of SEQ ID NO: 748, and the NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: Exon of the allele variant of 126.
較佳地,所述SDC4外顯子4之對偶基因變異體與SEQ ID NO: 748有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 4 of SDC4 has at least 85% identity to SEQ ID NO: 748, preferably at least 90% identity, 92%, 94%, 96% or More preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 is at least 85% identical to SEQ ID NO: 126, preferably at least 90% identical to 92% identical to SEQ ID NO: 126 , 94%, 96% or better at least 98% sequence identity.
較佳地,所述SDC4之外顯子4的部分係包含或根據SEQ ID NO: 822,且其對偶基因變異體與SEQ ID NO: 822有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與SDC4的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 823之序列,且其對偶基因變異體與SEQ ID NO: 823有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,SDC4之外顯子4的部分包含來自SEQ ID NO: 822之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 822之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,所述SDC4之外顯子4的部分係包含或根據SEQ ID NO: 822或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測SDC4與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 4 of SDC4 comprises or is according to SEQ ID NO: 822, and its allele variant has at least 85% identity with SEQ ID NO: 822, preferably at least 90% 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. The part of exon 2 of NRG1 in the fusion with SDC4 is preferably or comprises a sequence such as SEQ ID NO: 823, and its allele variant has at least 85% identity with SEQ ID NO: 823, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the portion of exon 4 of SDC4 comprises or consists of contiguous nucleic acids from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 822, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of exon 4 of SDC4 comprises or is according to SEQ ID NO: 822 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between SDC4 and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1. sexual fusion.
替代性地,所述SDC4外顯子4的部分包含SEQ ID NO: 748(或SEQ ID NO: 748之對偶基因變異體)或由其組成,至少包括位置199的核酸。較佳地,所述SDC4外顯子4的部分包含來自SEQ ID NO: 748或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置199的核酸。連續核酸的數目可為SEQ ID NO: 748之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置199的核酸。在此替代例中,SDC4外顯子4的部分更佳地包含或根據SEQ ID NO: 748或其對偶基因變異體。較佳地,在與SDC4的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 4 of SDC4 comprises or consists of SEQ ID NO: 748 (or an allele variant of SEQ ID NO: 748), at least including the nucleic acid at position 199. Preferably, the portion of the SDC4 exon 4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 748 or its allele variant or consists of and includes at least the nucleic acid at position 199. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 199 nucleic acid. In this alternative, the portion of exon 4 of SDC4 preferably comprises or is according to SEQ ID NO: 748 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with SDC4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一SDC4-NRG1多核苷酸融合物為SDC4與NRG1之符合讀框的融合。更佳地所述融合物為包含有SDC4之外顯子4或外顯子4的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 824或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 824有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the SDC4-NRG1 polynucleotide fusions provided herein is an in-frame fusion of SDC4 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 4 or a part of exon 4 of SDC4 and exon 2 or a part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 824 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 824, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有所述SDC4外顯子4的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 824之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 824之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 824包括介於SDC4與NRG1之間的接合處,尤其該接合處係介於源於SDC4之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有SDC4外顯子4的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 824之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a part of exon 4 of SDC4 fused with a part of exon 2 of NRG1 comprises from 2 to about 10 of SEQ ID NO: 824, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 824 includes the junction between SDC4 and NRG1, in particular the junction is between the nucleic acid from position 75 of SDC4 and the nucleic acid from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 4 of SDC4 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 824 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 824之多核苷酸,或一種包含了來自SEQ ID NO: 824之約20、約30、約40個或全部的連續核酸且較佳地包括了位置53及54的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 824之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 824, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 824 and preferably A polynucleotide that includes the nucleic acids at positions 53 and 54. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有SDC4外顯子4(或其對偶基因變異體)的一部分與NRG1外顯子2(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及所述SDC4或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於SDC4與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of SDC4 exon 4 (or its allogeneic variant) fused to a part of NRG1 exon 2 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Abnormal cell lines containing polynucleotide fusions involving said SDC4 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between SDC4 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該SDC4外顯子4(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有SDC4的N端和NRG1的C端。此外,本文所提供之SDC4-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自SDC4的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該SDC4-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,包括肺癌,尤其是非小細胞肺癌。Preferably, the part of exon 4 of SDC4 (or its allele variant) is located at the 5' position of NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of SDC4 and the C-terminus of NRG1. In addition, the SDC4-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from SDC4 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The SDC4-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, including lung cancer, especially non-small cell lung cancer.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括SDC4外顯子4之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 748及126可能就足夠了。來自SDC4外顯子4之5’處的任一序列包含SEQ ID NO: 745-747(或SEQ ID NO: 745-747之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。 SLC4A4-NRG1 多核苷酸融合物 When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of exon 4 of SDC4 and any sequence at the 3' of exon 2 of NRG1. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 748 and 126 may be sufficient. Any sequence from the 5' of exon 4 of SDC4 comprising or consisting of one or all of SEQ ID NO: 745-747 (or any allele variant of SEQ ID NO: 745-747) , and any sequence from 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is derived from its composition. SLC4A4-NRG1 polynucleotide fusion
亦提供了一種包含有SLC4A4之外顯子14的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。SLC4A4之外顯子14較佳為SEQ ID NO: 780或SEQ ID NO: 780之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 14 of SLC4A4 fused to a portion of exon 6 of NRG1. SLC4A4 exon 14 is preferably the exon of SEQ ID NO: 780 or an allele variant of SEQ ID NO: 780, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括SLC4A4外顯子14之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 780及130可能就足夠了。來自SLC4A4外顯子14之5’處的任一序列包含SEQ ID NO: 767-779(或SEQ ID NO: 767-779之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of exon 14 of SLC4A4 and any sequence at the 3' of exon 6 of NRG1. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 780 and 130 may be sufficient. Any sequence from the 5' of exon 14 of SLC4A4 comprising or consisting of one or all of SEQ ID NO: 767-779 (or any allele variant of SEQ ID NO: 767-779) , and any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is derived from its composition.
較佳地,該SLC4A4外顯子14之對偶基因變異體與SEQ ID NO: 780有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 14 of SLC4A4 is at least 85% identical to SEQ ID NO: 780, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,SLC4A4之外顯子14的部分係包含或根據SEQ ID NO: 763,且其對偶基因變異體與SEQ ID NO: 763有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與SLC4A4的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 764之序列,且其對偶基因變異體與SEQ ID NO: 764有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,SLC4A4之外顯子14的部分包含來自SEQ ID NO: 763之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 763之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,SLC4A4之外顯子14的部分係包含或根據SEQ ID NO: 763或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測SLC4A4與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 14 of SLC4A4 comprises or is according to SEQ ID NO: 763, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 763 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with SLC4A4 is preferably or comprises a sequence such as SEQ ID NO: 764, and its allele variant has at least 85% identity with SEQ ID NO: 764, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 14 of SLC4A4 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 763, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the portion of exon 14 of SLC4A4 comprises or is according to SEQ ID NO: 763 or an allele variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between SLC4A4 and NRG1 and determining whether such fusions are oncogenic in frame containing the EGF-like domain of NRG1 sexual fusion.
替代性地,SLC4A4外顯子14的部分包含SEQ ID NO: 780(或SEQ ID NO: 780之對偶基因變異體)或由其組成,至少包括位置272的核酸。較佳地,SLC4A4外顯子14的部分包含來自SEQ ID NO: 780或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置272的核酸。連續核酸的數目可為SEQ ID NO: 780之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置272的核酸。在此替代例中,SLC4A4外顯子14的部分更佳地包含或根據SEQ ID NO: 780或其對偶基因變異體。較佳地,在與SLC4A4的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 14 of SLC4A4 comprises or consists of SEQ ID NO: 780 (or an allele variant of SEQ ID NO: 780), including at least the nucleic acid at position 272. Preferably, the portion of SLC4A4 exon 14 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acid from SEQ ID NO: 780 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 272. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 272 nucleic acid. In this alternative, the portion of exon 14 of SLC4A4 preferably comprises or is according to SEQ ID NO: 780 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with SLC4A4 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
較佳地,本文所提供之任一SLC4A4-NRG1多核苷酸融合物為SLC4A4與NRG1之符合讀框的融合。更佳地所述融合物為包含有SLC4A4之外顯子14或外顯子14的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 765或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 765有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any of the SLC4A4-NRG1 polynucleotide fusions provided herein is an in-frame fusion of SLC4A4 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 14 or a part of exon 14 of SLC4A4 and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 765 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 765, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
在一替代性較佳實施例中,該包含有SLC4A4外顯子14的一部分與NRG1外顯子6的一部分融合之多核苷酸係包含來自SEQ ID NO: 765之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 765之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 765包括介於SLC4A4與NRG1之間的接合處,尤其該接合處係介於源於SLC4A4之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有SLC4A4外顯子14的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 765之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of exon 14 of SLC4A4 fused to a portion of exon 6 of NRG1 comprises a sequence from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 765 includes the junction between SLC4A4 and NRG1, in particular the junction is between the nucleic acid from position 75 of SLC4A4 and the nucleic acid from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 14 of SLC4A4 fused with a part of exon 6 of NRG1 has a polynucleotide sequence of SEQ ID NO: 765 or an allele variant thereof.
在一較佳實施例中,提供了一種根據SEQ ID NO: 765之多核苷酸,或一種包含了來自SEQ ID NO: 765之約20、約30、約40個或全部的連續核酸且較佳地包括了位置75及76的核酸之多核苷酸。連續核酸的數目可為SEQ ID NO: 765之18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且較佳地至少包括位置75及76的核酸。In a preferred embodiment, there is provided a polynucleotide according to SEQ ID NO: 765, or a contiguous nucleic acid comprising about 20, about 30, about 40 or all of SEQ ID NO: 765 and preferably A polynucleotide that includes the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and preferably at least including the nucleic acids at positions 75 and 76.
較佳地,該包含有SLC4A4外顯子14(或其對偶基因變異體)的一部分與NRG1外顯子6(或其對偶基因變異體)的一部分融合之多核苷酸係為一種進一步包含了或編碼了NRG1的EGF樣結構域之較長多核苷酸的一部分。包含有涉及SLC4A4或表現該多肽融合物之多核苷酸融合物的異常細胞係包含或編碼NRG1的EGF樣結構域。為了快速檢測、診斷或鑑定之目的,只需證明介於SLC4A4與NRG1之間的融合接合處是符合讀框的並且發生在所得融合產物、核酸或蛋白質包含了NRG1之EGF樣結構域的這樣一個位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polynucleotide comprising a part of SLC4A4 exon 14 (or its allogeneic variant) fused to a part of NRG1 exon 6 (or its allogeneic variant) is a polynucleotide further comprising or Part of a longer polynucleotide encoding the EGF-like domain of NRG1. Aberrant cell lines containing polynucleotide fusions involving SLC4A4 or expressing such polypeptide fusions contain or encode the EGF-like domain of NRG1. For rapid detection, diagnostic or identification purposes, it is only necessary to demonstrate that the fusion junction between SLC4A4 and NRG1 is in-frame and occurs at one such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1 The location is fine. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
較佳地,該SLC4A4外顯子14(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有SLC4A4的N端和NRG1的C端。此外,本文所提供之SLC4A4-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自SLC4A4的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該SLC4A4-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是胰臟癌。 ZFAT-NRG1 多核苷酸融合物 Preferably, the part of exon 14 of SLC4A4 (or its allogeneic variant) is located at the 5' of NRG1 exon 6 (or its allogeneic variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of SLC4A4 and the C-terminus of NRG1. In addition, the SLC4A4-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is the polypeptide sequence from SLC4A4 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The SLC4A4-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially pancreatic cancer. ZFAT-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有ZFAT核酸序列(或ZFAT核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了ZFAT及NRG1核酸序列的對偶基因變異體。According to the present disclosure, there is also provided a polynucleotide comprising a fusion of a ZFAT nucleic acid sequence (or a part of the ZFAT nucleic acid sequence) and an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). Allele variants of ZFAT and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該ZFAT核酸序列(或其部分)包含SEQ ID NO: 830-846中之任一者或SEQ ID NO: 830-846中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該ZFAT核酸序列包含SEQ ID NO: 846或SEQ ID NO: 846的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 830-845分別對應根據NM_020863.4之ZFAT的個別外顯子1-16。SEQ ID NO: 846對應根據NM_020863.4之ZFAT的外顯子1-16。SEQ ID NO: 125-137分別對應根據NM_001159999.3之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999.3之NRG1的外顯子1-13。Preferably, the ZFAT nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 830-846 or an allele variant of any one of SEQ ID NOs: 830-846, or consists of it , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the ZFAT nucleic acid sequence comprises SEQ ID NO: 846 or a part of the allele variant of SEQ ID NO: 846, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 830-845 correspond to individual exons 1-16 of the ZFAT according to NM_020863.4, respectively. SEQ ID NO: 846 corresponds to exons 1-16 of the ZFAT according to NM_020863.4. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999.3, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.3.
在一較佳實施例中,該ZFAT核酸序列部分包含來自SEQ ID NO: 830-846中之任一者(或SEQ ID NO: 830-846中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the ZFAT nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
較佳地,該ZFAT核酸序列之對偶基因變異體與SEQ ID NO: 830-846中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of the ZFAT nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 830-846, preferably at least 90% identity, and its 92%, 94% identity , 96% or better at least 98% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, preferably at least 90% identity, 92%, 94%, 96% or better at least 98% sequence identity thereto.
較佳地,該ZFAT核酸序列或其部分係位於NRG1核酸序列或其部分的5’處。Preferably, the ZFAT nucleic acid sequence or part thereof is located 5' to the NRG1 nucleic acid sequence or part thereof.
較佳地,該包含有ZFAT之ZFAT核酸序列(或所述序列的一部分)與NRG1核酸序列(或所述序列的一部分)融合之多核苷酸係包含或編碼NRG1的EGF樣結構域。Preferably, the polynucleotide comprising a ZFAT nucleic acid sequence (or a part of said sequence) fused to an NRG1 nucleic acid sequence (or a part of said sequence) comprises or encodes the EGF-like domain of NRG1.
包含有該ZFAT-NRG1多核苷酸融合物或表現該多肽融合物之異常細胞係包含或編碼NRG1的EGF樣結構域。為了檢測、診斷或鑑定之目的,只需證明介於ZFAT與NRG1之間的融合接合處是符合讀框的並且發生在可使得所得融合產物、核酸或蛋白質包含NRG1的EGF樣結構域的位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Abnormal cell lines containing the ZFAT-NRG1 polynucleotide fusion or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For the purposes of detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between ZFAT and NRG1 is in-frame and occurs at a position such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1. Can. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
亦提供了一種包含有ZFAT之外顯子12的一部分與NRG1之外顯子6的一部分融合之多核苷酸融合物。ZFAT之外顯子12較佳為SEQ ID NO: 841或SEQ ID NO: 841之對偶基因變異體的外顯子,而NRG1之外顯子6較佳為SEQ ID NO: 130或SEQ ID NO: 130之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 12 of ZFAT fused to a portion of exon 6 of NRG1. ZFAT exon 12 is preferably the exon of SEQ ID NO: 841 or an allogenic variant of SEQ ID NO: 841, and NRG1 exon 6 is preferably SEQ ID NO: 130 or SEQ ID NO: 130 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括ZFAT外顯子12之5’處的任一序列以及NRG1外顯子6之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 841及130可能就足夠了。來自ZFAT外顯子12之5’處的任一序列包含SEQ ID NO: 830-840(或SEQ ID NO: 830-840之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子6之3’處的任一序列包含SEQ ID NO: 131-137(或SEQ ID NO: 131-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of ZFAT exon 12 and any sequence at the 3' of NRG1 exon 6. sequence, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 841 and 130 may be sufficient. Any sequence from the 5' of ZFAT exon 12 comprises one or all of SEQ ID NO: 830-840 (or any allele variant of SEQ ID NO: 830-840), or consists of it , and any sequence from 3' of NRG1 exon 6 comprises one or all of SEQ ID NO: 131-137 (or any allele variant of SEQ ID NO: 131-137), or is derived from its composition.
較佳地,該ZFAT外顯子12之對偶基因變異體與SEQ ID NO: 841有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of ZFAT exon 12 is at least 85% identical to SEQ ID NO: 841, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 6 has at least 85% identity to SEQ ID NO: 130, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,ZFAT之外顯子12的部分係包含或根據SEQ ID NO: 826,且其對偶基因變異體與SEQ ID NO: 826有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與ZFAT的融合物中之NRG1之外顯子6的部分較佳為或包含如SEQ ID NO: 827之序列,且其對偶基因變異體與SEQ ID NO: 827有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,ZFAT之外顯子12的部分包含來自SEQ ID NO: 826之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 826之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,ZFAT之外顯子12的部分係包含或根據SEQ ID NO: 826或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測ZFAT與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of ZFAT exon 12 comprises or is according to SEQ ID NO: 826, and its allele variant has at least 85% identity with SEQ ID NO: 826, preferably at least 90% identity 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 6 of NRG1 in the fusion with ZFAT is preferably or comprises a sequence such as SEQ ID NO: 827, and its allele variant has at least 85% identity with SEQ ID NO: 827, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of ZFAT exon 12 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 826, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of ZFAT exon 12 comprises or is according to SEQ ID NO: 826 or its allegenic variant. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between ZFAT and NRG1 and determining whether such fusions are oncogenic containing in-frame EGF-like domains of NRG1 sexual fusion.
替代性地,ZFAT外顯子12的部分包含SEQ ID NO: 841(或SEQ ID NO: 841之對偶基因變異體)或由其組成,至少包括位置139的核酸。較佳地,ZFAT外顯子12的部分包含來自SEQ ID NO: 841或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置139的核酸。連續核酸的數目可為SEQ ID NO: 841之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置139的核酸。在此替代例中,ZFAT外顯子12的部分更佳地包含或根據SEQ ID NO: 841或其對偶基因變異體。較佳地,在與ZFAT的融合物中之NRG1外顯子6的部分包含來自SEQ ID NO: 130或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 130之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of ZFAT exon 12 comprises or consists of SEQ ID NO: 841 (or an allele variant of SEQ ID NO: 841), including at least the nucleic acid at position 139. Preferably, the portion of ZFAT exon 12 comprises 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 841 or its allele variants or is composed of It consists of, and at least includes, the nucleic acid at position 139. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 139 nucleic acid. In this alternative, the portion of ZFAT exon 12 preferably comprises or is according to SEQ ID NO: 841 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 6 in the fusion with ZFAT comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有ZFAT外顯子12的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 828之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 828之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 828包括介於ZFAT與NRG1之間的接合處,尤其該接合處係介於源於ZFAT之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有ZFAT外顯子12的一部分與NRG1外顯子6的一部分融合之多核苷酸係具有SEQ ID NO: 828之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of ZFAT exon 12 fused to a portion of NRG1 exon 2 comprises from 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 828 includes a junction between ZFAT and NRG1, in particular the junction is between a nucleic acid derived from position 75 of ZFAT and a nucleic acid derived from position 76 of NRG1. Preferably, the polynucleotide comprising a part of ZFAT exon 12 fused with a part of NRG1 exon 6 has a polynucleotide sequence of SEQ ID NO: 828 or an allele variant thereof.
較佳地,本文所提供之任一ZFAT-NRG1多核苷酸融合物為ZFAT與NRG1之符合讀框的融合。更佳地所述融合物為包含有ZFAT之外顯子12或外顯子12的一部分以及NRG1之外顯子6或外顯子6的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 828或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 828有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any ZFAT-NRG1 polynucleotide fusion provided herein is an in-frame fusion of ZFAT and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 12 or a part of exon 12 of ZFAT and exon 6 or a part of exon 6 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 828 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 828, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
較佳地,該ZFAT外顯子12(或其對偶基因變異體)的部分係位於NRG1外顯子6(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有ZFAT的N端和NRG1的C端。此外,本文所提供之ZFAT-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自ZFAT的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該ZFAT-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是非小細胞肺癌。 DSCAML1-NRG1 多核苷酸融合物 Preferably, the part of ZFAT exon 12 (or its allele variant) is located at the 5' position of NRG1 exon 6 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of ZFAT and the C-terminus of NRG1. In addition, the ZFAT-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from ZFAT and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The ZFAT-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially non-small cell lung cancer. DSCAML1-NRG1 polynucleotide fusion
根據本揭露,亦提供一種包含有DSCAML1核酸序列(或DSCAML1核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸。所述融合物中也包括了DSCAML1及NRG1核酸序列的對偶基因變異體。According to the present disclosure, there is also provided a polynucleotide comprising a fusion of DSCAML1 nucleic acid sequence (or a part of DSCAML1 nucleic acid sequence) and NRG1 nucleic acid sequence (or a part of NRG1 nucleic acid sequence). Allele variants of DSCAML1 and NRG1 nucleic acid sequences are also included in the fusion.
較佳地,該DSCAML1核酸序列(或其部分)包含SEQ ID NO: 870-903中之任一者或SEQ ID NO: 870-903中之任一者的對偶基因變異體,或是由其組成,而該NRG1核酸序列(或其部分)包含SEQ ID NO: 125-138中之任一者或SEQ ID NO: 125-138中之任一者的對偶基因變異體,或是由其組成。更佳地,該DSCAML1核酸序列包含SEQ ID NO: 903或SEQ ID NO: 903的對偶基因變異體之一部分,或是由其組成,而該NRG1核酸序列包含SEQ ID NO: 138或SEQ ID NO: 138的對偶基因變異體之一部分,或是由其組成。SEQ ID NO: 870-902分別對應根據NM_020693.4之DSCAML1的個別外顯子1-33。SEQ ID NO: 903對應根據NM_020693.4之DSCAML1的外顯子1-33。SEQ ID NO: 125-137分別對應根據NM_001159999.3之NRG1的個別外顯子1-13。SEQ ID NO: 138對應根據NM_001159999.3之NRG1的外顯子1-13。Preferably, the DSCAML1 nucleic acid sequence (or part thereof) comprises, or consists of, any one of SEQ ID NOs: 870-903 or an allele variant of any one of SEQ ID NOs: 870-903 , and the NRG1 nucleic acid sequence (or part thereof) comprises any one of SEQ ID NOs: 125-138 or an allele variant of any one of SEQ ID NOs: 125-138, or consists of it. More preferably, the DSCAML1 nucleic acid sequence comprises a part of SEQ ID NO: 903 or an allele variant of SEQ ID NO: 903, or consists of it, and the NRG1 nucleic acid sequence comprises SEQ ID NO: 138 or SEQ ID NO: 138 is part of, or consists of, the allele variant. SEQ ID NO: 870-902 correspond to individual exons 1-33 of DSCAML1 according to NM_020693.4, respectively. SEQ ID NO: 903 corresponds to exons 1-33 of DSCAML1 according to NM_020693.4. SEQ ID NO: 125-137 correspond to individual exons 1-13 of NRG1 according to NM_001159999.3, respectively. SEQ ID NO: 138 corresponds to exons 1-13 of NRG1 according to NM_001159999.3.
在一較佳實施例中,該DSCAML1核酸序列部分包含來自SEQ ID NO: 870-903中之任一者(或SEQ ID NO: 870-903中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,而該NRG1核酸序列部分包含來自SEQ ID NO: 125-138中之任一者(或SEQ ID NO: 125-138中之任一者之對偶基因變異體)之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸。In a preferred embodiment, the DSCAML1 nucleic acid sequence part comprises 2 to About 10, about 20, about 30, or up to about 40 or even all of the continuous nucleic acid, and the NRG1 nucleic acid sequence part comprises from any one of SEQ ID NO: 125-138 (or SEQ ID NO: 125-138 Allele variants of any) from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids.
較佳地,該DSCAML1核酸序列之對偶基因變異體與SEQ ID NO: 870-903中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of the DSCAML1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 870-903, preferably at least 90% identity, and 92%, 94% identity with it , 96% or better at least 98% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, preferably at least 90% identity, 92%, 94%, 96% or better at least 98% sequence identity thereto.
較佳地,該DSCAML1核酸序列或其部分係位於NRG1核酸序列或其部分的5’處。Preferably, the DSCAML1 nucleic acid sequence or part thereof is located 5' to the NRG1 nucleic acid sequence or part thereof.
較佳地,該包含有DSCAML1之DSCAML1核酸序列(或所述序列的一部分)與NRG1核酸序列(或所述序列的一部分)融合之多核苷酸係包含或編碼NRG1的EGF樣結構域。Preferably, the polynucleotide comprising DSCAML1 nucleic acid sequence (or a part of said sequence) fused with NRG1 nucleic acid sequence (or a part of said sequence) of DSCAML1 comprises or encodes the EGF-like domain of NRG1.
包含有該DSCAML1-NRG1多核苷酸融合物或表現該多肽融合物之異常細胞係包含或編碼NRG1的EGF樣結構域。為了檢測、診斷或鑑定之目的,只需證明介於DSCAML1與NRG1之間的融合接合處是符合讀框的並且發生在可使得所得融合產物、核酸或蛋白質包含NRG1的EGF樣結構域的位置即可。所述EGF樣結構域較佳為根據SEQ ID NO: 163或其對偶基因變異體的EGF樣結構域並與SEQ ID NO: 163有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Abnormal cell lines containing the DSCAML1-NRG1 polynucleotide fusion or expressing the polypeptide fusion contain or encode the EGF-like domain of NRG1. For the purposes of detection, diagnosis or identification, it is only necessary to demonstrate that the fusion junction between DSCAML1 and NRG1 is in-frame and occurs at a position such that the resulting fusion product, nucleic acid or protein comprises the EGF-like domain of NRG1. Can. The EGF-like domain is preferably an EGF-like domain according to SEQ ID NO: 163 or an allele variant thereof and has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 163 , with 92%, 94%, 96% or better at least 98% sequence identity therewith.
亦提供了一種包含有DSCAML1之外顯子3的一部分與NRG1之外顯子2的一部分融合之多核苷酸融合物。DSCAML1之外顯子3較佳為SEQ ID NO: 872或SEQ ID NO: 872之對偶基因變異體的外顯子,而NRG1之外顯子2較佳為SEQ ID NO: 126或SEQ ID NO: 126之對偶基因變異體的外顯子。Also provided is a polynucleotide fusion comprising a portion of exon 3 of DSCAML1 fused to a portion of exon 2 of NRG1. DSCAML1 exon 3 is preferably the exon of SEQ ID NO: 872 or an allogenic variant of SEQ ID NO: 872, and NRG1 exon 2 is preferably SEQ ID NO: 126 or SEQ ID NO: 126 exons of the allele variant.
當存在於患者或個體的異常細胞中時,所述多核苷酸融合物較佳地進一步包括DSCAML1外顯子3之5’處的任一序列以及NRG1外顯子2之3’處的任一序列,但為了能使用以多核苷酸為基礎的檢測測定法來檢測融合接合處,至少存在SEQ ID NO: 872及126可能就足夠了。來自DSCAML1外顯子3之5’處的任一序列包含SEQ ID NO: 870-871(或SEQ ID NO: 870-871之任一對偶基因變異體)中之一或全部,或是由其組成,而來自NRG1外顯子2之3’處的任一序列包含SEQ ID NO: 127-137(或SEQ ID NO: 127-137之任一對偶基因變異體)中之一或全部,或是由其組成。When present in abnormal cells of a patient or individual, the polynucleotide fusion preferably further comprises any sequence at the 5' of DSCAML1 exon 3 and any sequence at the 3' of NRG1 exon 2 sequences, but in order to be able to detect fusion junctions using polynucleotide-based detection assays, the presence of at least SEQ ID NO: 872 and 126 may be sufficient. Any sequence from 5' of exon 3 of DSCAML1 comprising or consisting of one or all of SEQ ID NO: 870-871 (or any allele variant of SEQ ID NO: 870-871) , and any sequence from 3' of NRG1 exon 2 comprises one or all of SEQ ID NO: 127-137 (or any allele variant of SEQ ID NO: 127-137), or is formed by its composition.
較佳地,該DSCAML1外顯子3之對偶基因變異體與SEQ ID NO: 872有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性,而該NRG1外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the allele variant of exon 3 of DSCAML1 is at least 85% identical to SEQ ID NO: 872, preferably at least 90% identical, 92%, 94%, 96% or more identical thereto Preferably at least 98% sequence identity, and the allele variant of NRG1 exon 2 has at least 85% identity to SEQ ID NO: 126, preferably at least 90% identity, and its 92%, 94%, 96% or better still at least 98% sequence identity.
較佳地,DSCAML1之外顯子3的部分係包含或根據SEQ ID NO: 866,且其對偶基因變異體與SEQ ID NO: 866有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在與DSCAML1的融合物中之NRG1之外顯子2的部分較佳為或包含如SEQ ID NO: 867之序列,且其對偶基因變異體與SEQ ID NO: 867有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。更佳地,DSCAML1之外顯子3的部分包含來自SEQ ID NO: 866之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置75的核酸。連續核酸的數目可為SEQ ID NO: 826之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75的核酸。更佳地,DSCAML1之外顯子3的部分係包含或根據SEQ ID NO: 866或其對偶基因變異體。這類短的多核苷酸序列特別有利於檢測DSCAML1與NRG1之間是否存在有較大的多核苷酸融合物並確定這類融合物是否為包含有NRG1之EGF樣結構域之符合讀框的致癌性融合物。Preferably, the part of exon 3 of DSCAML1 comprises or is according to SEQ ID NO: 866, and its allele variant has at least 85% identity, preferably at least 90% identity with SEQ ID NO: 866 92%, 94%, 96% or better at least 98% sequence identity therewith. The part of exon 2 of NRG1 in the fusion with DSCAML1 is preferably or comprises a sequence such as SEQ ID NO: 867, and its allele variant has at least 85% identity with SEQ ID NO: 867, Preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. More preferably, the part of exon 3 of DSCAML1 comprises or consists of continuous nucleic acid from 2 to about 10, about 20, about 30, or up to about 40 or even all of SEQ ID NO: 866, and At least the nucleic acid at position 75 is included. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 nucleic acid. More preferably, the part of exon 3 of DSCAML1 comprises or is according to SEQ ID NO: 866 or an allegenic variant thereof. Such short polynucleotide sequences are particularly useful for detecting the presence of larger polynucleotide fusions between DSCAML1 and NRG1 and determining whether such fusions are in-frame oncogenic agents containing the EGF-like domain of NRG1. sexual fusion.
替代性地,DSCAML1外顯子3的部分包含SEQ ID NO: 872(或SEQ ID NO: 872之對偶基因變異體)或由其組成,至少包括位置147的核酸。較佳地,DSCAML1外顯子3的部分包含來自SEQ ID NO: 872或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置147的核酸。連續核酸的數目可為SEQ ID NO: 872之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置147的核酸。在此替代例中,DSCAML1外顯子3的部分更佳地包含或根據SEQ ID NO: 872或其對偶基因變異體。較佳地,在與DSCAML1的融合物中之NRG1外顯子2的部分包含來自SEQ ID NO: 126或其對偶基因變異體之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸或是由其所組成,且至少包括位置1的核酸。連續核酸的數目可為SEQ ID NO: 126之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置1的核酸。Alternatively, the portion of exon 3 of DSCAML1 comprises or consists of SEQ ID NO: 872 (or an allele variant of SEQ ID NO: 872), including at least the nucleic acid at position 147. Preferably, the portion of exon 3 of DSCAML1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or even all of the contiguous nucleic acids from SEQ ID NO: 872 or an allele variant thereof or is composed of It consists of, and at least includes, the nucleic acid at position 147. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 147 nucleic acid. In this alternative, the portion of exon 3 of DSCAML1 preferably comprises or is according to SEQ ID NO: 872 or an allegenic variant thereof. Preferably, the portion of NRG1 exon 2 in the fusion with DSCAML1 comprises from 2 to about 10, about 20, about 30, or up to about 40 or Even the entire continuous nucleic acid or consists of it, and at least includes the nucleic acid in position 1. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, including at least position 1 nucleic acid.
在一替代性較佳實施例中,該包含有DSCAML1外顯子3的一部分與NRG1外顯子2的一部分融合之多核苷酸係包含來自SEQ ID NO: 868之2至約10、約20、約30、或高達約40個或甚至全部的連續核酸,且包括位置75及76的核酸。連續核酸的數目可為SEQ ID NO: 868之2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40個或全部的核酸,且至少包括位置75及76的核酸。SEQ ID NO: 868包括介於DSCAML1與NRG1之間的接合處,尤其該接合處係介於源於DSCAML1之位置75的核酸與源於NRG1之位置76的核酸之間。較佳地,該包含有DSCAML1外顯子3的一部分與NRG1外顯子2的一部分融合之多核苷酸係具有SEQ ID NO: 868之多核苷酸序列或其對偶基因變異體。In an alternative preferred embodiment, the polynucleotide comprising a portion of exon 3 of DSCAML1 fused to a portion of exon 2 of NRG1 is comprised of 2 to about 10, about 20, About 30, or up to about 40 or even all of the contiguous nucleic acids, and including the nucleic acids at positions 75 and 76. The number of consecutive nucleic acids can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 or all of the nucleic acids, and at least including position 75 and 76 nucleic acids. SEQ ID NO: 868 includes the junction between DSCAML1 and NRG1, in particular the junction is between the nucleic acid from position 75 of DSCAML1 and the nucleic acid from position 76 of NRG1. Preferably, the polynucleotide comprising a part of exon 3 of DSCAML1 fused with a part of exon 2 of NRG1 has a polynucleotide sequence of SEQ ID NO: 868 or an allele variant thereof.
較佳地,本文所提供之任一DSCAML1-NRG1多核苷酸融合物為DSCAML1與NRG1之符合讀框的融合。更佳地所述融合物為包含有DSCAML1之外顯子3或外顯子3的一部分以及NRG1之外顯子2或外顯子2的一部分之符合讀框的融合。所述符合讀框的融合較佳為SEQ ID NO: 868或其對偶基因變異體的融合物,且該對偶基因變異體與SEQ ID NO: 868有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。Preferably, any DSCAML1-NRG1 polynucleotide fusion provided herein is an in-frame fusion of DSCAML1 and NRG1. More preferably, the fusion is an in-frame fusion comprising exon 3 or a part of exon 3 of DSCAML1 and exon 2 or a part of exon 2 of NRG1. The in-frame fusion is preferably a fusion of SEQ ID NO: 868 or an allele variant thereof, and the allele variant has at least 85% identity to SEQ ID NO: 868, preferably at least 90%, 92%, 94%, 96% or even 98% agreement.
較佳地,該DSCAML1外顯子3(或其對偶基因變異體)的部分係位於NRG1外顯子2(或其對偶基因變異體)的5’處。此核酸層級的配向造成融合多肽產物含有DSCAML1的N端和NRG1的C端。此外,本文所提供之DSCAML1-NRG1多核苷酸融合物產生了一種蛋白質融合物,其中從N端到該融合接合處的部分係為來自DSCAML1的多肽序列,而從該接合處到C端的部分為NRG1多肽序列,其中該NRG1部分也提供了其EGF樣結構域。該DSCAML1-NRG1融合蛋白因此保留了NRG1的EGF樣結構域以及驅動一小群人類癌症增生及存活的能力,尤其是腺癌,更尤其是胰臟導管腺癌。Preferably, the part of exon 3 of DSCAML1 (or its allele variant) is located 5' to NRG1 exon 2 (or its allele variant). This nucleic acid-level alignment results in a fusion polypeptide product containing the N-terminus of DSCAML1 and the C-terminus of NRG1. In addition, the DSCAML1-NRG1 polynucleotide fusions provided herein result in a protein fusion in which the portion from the N-terminus to the fusion junction is a polypeptide sequence from DSCAML1 and the portion from the junction to the C-terminus is NRG1 polypeptide sequence, wherein the NRG1 portion also provides its EGF-like domain. The DSCAML1-NRG1 fusion protein thus retains the EGF-like domain of NRG1 and the ability to drive proliferation and survival of a small group of human cancers, especially adenocarcinoma, and more particularly pancreatic ductal adenocarcinoma.
本文所提及之每個包含有NRG1之多核苷酸融合物,包括VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1,較佳地係被分離出來。本發明之任一方法較佳地包含從樣本分離出一或多種含有多核苷酸的組分。該一或多種含有多核苷酸的組分通常從樣本中的任一細胞或細胞材料中分離出來。 NRG1 多肽融合物 Each polynucleotide fusion containing NRG1 mentioned herein includes VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4- NRG1, SLC4A4-NRG1, ZFAT-NRG1 or DSCAML1-NRG1, are preferably isolated. Any of the methods of the invention preferably comprises isolating one or more polynucleotide-containing fractions from the sample. The one or more polynucleotide-containing components are typically isolated from any cells or cellular material in the sample. NRG1 polypeptide fusion
根據本揭露,現提供了包含有NRG1之多肽融合物,包括VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1。尤其,在被診斷患有癌症的人類患者中係存在或已鑑定出這類融合物,並在下節中更詳細地提及。 VAPB-NRG1 多肽融合物 According to the present disclosure, there are now provided polypeptide fusions comprising NRG1, including VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1- NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1. In particular, such fusions are present or have been identified in human patients diagnosed with cancer and are mentioned in more detail in the next section. VAPB-NRG1 polypeptide fusion
根據本揭露,亦提供一種由包含有VAPB核酸序列(或VAPB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸所編碼之多肽融合物。該VAPB核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 24-30中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。該NRG1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 139-152中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。According to the disclosure, there is also provided a polypeptide fusion encoded by a polynucleotide comprising a VAPB nucleic acid sequence (or a part of the VAPB nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). The VAPB nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 24-30 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences. The NRG1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 139-152 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences.
SEQ ID NO: 24-30中之任一者之VAPB對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。SEQ ID NO: 139-152中之任一者之NRG1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。The VAPB allele variant of any one of SEQ ID NO: 24-30 preferably has at least 85% sequence identity with these sequences, more preferably 90%, 92%, 94%, 96% therewith Or even better at least 98% sequence identity. The NRG1 allele variant of any one of SEQ ID NOs: 139-152 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity.
較佳地,所述融合物之VAPB核酸序列部分係編碼VAPB之多肽部分,此多肽部分包含來自SEQ ID NO: 24-30中之任一者(或SEQ ID NO: 24-30中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。較佳地,所述融合物之NRG1核酸序列部分係編碼NRG1之多肽部分,此多肽部分包含來自SEQ ID NO: 139-152中之任一者(或SEQ ID NO: 139-152中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。Preferably, the VAPB nucleic acid sequence part of the fusion is a polypeptide part of encoding VAPB, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 24-30) from SEQ ID NO: 24-30. or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of Preferably, the NRG1 nucleic acid sequence part of the fusion is a polypeptide part of encoding NRG1, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 139-152) from SEQ ID NO: 139-152 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of
較佳地,本揭露之任一VAPB-NRG1多肽融合物包括SEQ ID NO: 24-30中之任一者的多肽序列,此多肽序列具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變添加、刪除或取代SEQ ID NO: 24-30之多肽的任一胺基酸。較佳地,所述多肽融合物包括SEQ ID NO: 24-30中之任一者的多肽序列,此多肽序列具有1、2、3、4或5個點突變添加、刪除或取代SEQ ID NO: 24-30之多肽的任一胺基酸。更佳地,所述多核苷酸融合物包括SEQ ID NO: 24-30中之任一者的多肽序列,此多肽序列具有1、2或3個點突變添加、刪除或取代SEQ ID NO: 24-30之多肽的任一胺基酸。Preferably, any VAPB-NRG1 polypeptide fusion of the present disclosure includes the polypeptide sequence of any one of SEQ ID NO: 24-30, and this polypeptide sequence has one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutation to add, delete or replace any amino acid of the polypeptide of SEQ ID NO: 24-30. Preferably, the polypeptide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 24-30, and this polypeptide sequence has 1, 2, 3, 4 or 5 point mutations added, deleted or substituted for SEQ ID NO : Any amino acid of the polypeptide of 24-30. More preferably, the polynucleotide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 24-30, and this polypeptide sequence has 1, 2 or 3 point mutations to add, delete or replace SEQ ID NO: 24 Any amino acid of the polypeptide of -30.
在一較佳實施例中,亦提供一種由包含有VAPB之外顯子1 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該VAPB之外顯子1所編碼的多肽較佳地包含SEQ ID NO: 24或SEQ ID NO: 24之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 141-151中之任一者或全部,或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可根據SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體,其序列係對應外顯子2-13全部所編碼的多肽序列。In a preferred embodiment, there is also provided a polynucleoside comprising a part of VAPB exon 1 (or its allele variant) and a part of NRG1 exon 2 (or its allele variant) Acid-encoded polypeptide fusion. The polypeptide encoded by exon 1 of VAPB preferably comprises or consists of SEQ ID NO: 24 or the allele variant of SEQ ID NO: 24. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any or all of SEQ ID NO: 141-151, or any allele variant of SEQ ID NO: 141-151. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be according to SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該VAPB之外顯子1部分所編碼的多肽包含來自SEQ ID NO: 24或SEQ ID NO: 24之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 24有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 1 part of VAPB comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 24 or the allele variant of SEQ ID NO: 24 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 24, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一VAPB-NRG1多肽融合物包括如SEQ ID NO: 4或其對偶基因變異體之多肽序列。SEQ ID NO: 4含有一個在VAPB與NRG1之間的融合接合處,該融合係位於位置14之胺基酸(其係與VAPB的柄位置1-13處的胺基酸一起)與位置16之胺基酸(其係與NRG1的柄位置17-31處的胺基酸一起)之間。在位置15處,因為NRG1與VAPB出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該VAPB-NRG1多肽融合物包括來自SEQ ID NO: 4或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置14、15及16的胺基酸。Preferably, any VAPB-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 4 or its allele variant. SEQ ID NO: 4 contains a fusion junction between VAPB and NRG1 between the amino acid at position 14 (with amino acids at positions 1-13 of the stalk of VAPB) and between position 16 between the amino acids (which tie together with the amino acids at positions 17-31 of the handle of NRG1). At position 15, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with VAPB. Preferably, the VAPB-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 4 or its allele variant, at least including positions 14, 15 and 16 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 4之多肽序列,或是一種包含來自SEQ ID NO: 4或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置14、15及16處的胺基酸。所述多肽序列與SEQ ID NO: 4有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 4, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 4 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 14, 15 and 16. The polypeptide sequence has at least 85% identity to SEQ ID NO: 4, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一VAPB-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 4的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 4之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 4的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 4之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 4的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 4之多肽的任一胺基酸。Preferably, any VAPB-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 4, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 4. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 4 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 4. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 4 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 4 of any amino acid.
較佳地,本文所提供之介於NRG1與VAPB之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自VAPB的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有VAPB-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CADM1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and VAPB are aligned so that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from VAPB, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the VAPB-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CADM1-NRG1 polypeptide fusion
亦提供一種由包含有CADM1之外顯子7 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CADM1之外顯子7所編碼的多肽較佳地包含SEQ ID NO: 51或SEQ ID NO: 51之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 45-50中之任一者或全部,或是SEQ ID NO: 45-50中之任一者之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151中之任一者,或是SEQ ID NO: 145-151中之任一者之任一對偶基因變異體。SEQ ID NO: 45-50分別對應CADM1之外顯子1-7所編碼的個別多肽序列。SEQ ID NO: 145-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CADM1 exon 7 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 7 of CADM1 preferably comprises or consists of SEQ ID NO: 51 or the allele variant of SEQ ID NO: 51. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any or all of SEQ ID NO: 45-50, or any allele variant of any of SEQ ID NO: 45-50, and is more Preferably further comprising any one of SEQ ID NO: 145-151, or any allogeneic variant of any one of SEQ ID NO: 145-151. SEQ ID NOs: 45-50 correspond to individual polypeptide sequences encoded by exons 1-7 of CADM1, respectively. SEQ ID NOs: 145-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該CADM1之外顯子7部分所編碼的多肽包含來自SEQ ID NO: 51或SEQ ID NO: 51之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 51有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 7 of CADM1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 51 or the allele variant of SEQ ID NO: 51 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 51, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CADM1-NRG1多肽融合物包括如SEQ ID NO: 8或其對偶基因變異體之多肽序列。SEQ ID NO: 8含有一個在CADM1與NRG1之間的融合接合處,該融合係位於位置17之胺基酸(其係與CADM1的柄位置1-16處的胺基酸一起)與位置19之胺基酸(其係與NRG1的柄位置20-28處的胺基酸一起)之間。在位置18處,因為NRG1與CADM1出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該CADM1-NRG1多肽融合物包括來自SEQ ID NO: 8或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置17、18及19的胺基酸。Preferably, any CADM1-NRG1 polypeptide fusion of the present disclosure comprises a polypeptide sequence such as SEQ ID NO: 8 or an allele variant thereof. SEQ ID NO: 8 contains a fusion junction between CADM1 and NRG1 between the amino acid at position 17 (with the amino acids at positions 1-16 of the stalk of CADM1) and between position 19 between the amino acids (which tie together with the amino acids at positions 20-28 of the handle of NRG1). At position 18, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 and CADM1. Preferably, the CADM1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 8 or its allele variant, including at least positions 17, 18 and 19 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 8之多肽序列,或是一種包含來自SEQ ID NO: 8或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置17、18及19處的胺基酸。所述多肽序列與SEQ ID NO: 8有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 8, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 8 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 17, 18 and 19. The polypeptide sequence has at least 85% identity to SEQ ID NO: 8, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CADM1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 8的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 8之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 8的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 8之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 8的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 8之多肽的任一胺基酸。Preferably, any CADM1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 8, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 8. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 8 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 8. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 8 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 8 of any amino acid.
較佳地,本文所提供之介於NRG1與CADM1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CADM1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CADM1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CD44-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and CADM1 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from CADM1, and the portion spanning the fusion junction to the C-terminus Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CADM1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CD44-NRG1 polypeptide fusion
亦提供一種由包含有CD44之外顯子5 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CD44之外顯子5所編碼的多肽較佳地包含SEQ ID NO: 84或SEQ ID NO: 84之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 80-83中之任一者或全部,或是SEQ ID NO: 80-83中之任一者之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 141-151中之任一者之任一對偶基因變異體。SEQ ID NO: 80-83分別對應CD44之外顯子1-4所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CD44 exon 5 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 5 of CD44 preferably comprises or consists of SEQ ID NO: 84 or an allele variant of SEQ ID NO: 84. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any or all of SEQ ID NO: 80-83, or any allele variant of any of SEQ ID NO: 80-83, and more Preferably further comprising any one of SEQ ID NO: 141-151, or any allogeneic variant of any one of SEQ ID NO: 141-151. SEQ ID NOs: 80-83 correspond to individual polypeptide sequences encoded by exons 1-4 of CD44, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該CD44之外顯子5部分所編碼的多肽包含來自SEQ ID NO: 84或SEQ ID NO: 84之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 84有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 5 part of CD44 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 84 or SEQ ID NO: 84 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 84, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD44-NRG1多肽融合物包括如SEQ ID NO: 12或其對偶基因變異體之多肽序列。SEQ ID NO: 12含有一個在CD44與NRG1之間的融合接合處,該融合係位於位置17之胺基酸(其係與CD44的柄位置1-16處的胺基酸一起)與位置19之胺基酸(其係與NRG1的柄位置20-36處的胺基酸一起)之間。在位置18處,因為NRG1與CD44出乎意料產生符合讀框的融合而有蘇胺酸(T, Thr)殘基的存在。較佳地,該CD44-NRG1多肽融合物包括來自SEQ ID NO: 12或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置17、18及19的胺基酸。Preferably, any CD44-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 12 or its allele variant. SEQ ID NO: 12 contains a fusion junction between CD44 and NRG1 between the amino acid at position 17 (with the amino acids at positions 1-16 of the stalk of CD44) and between position 19 Amino acids (which tie together with the amino acids at positions 20-36 of the handle of NRG1). At position 18, a threonine (T, Thr) residue was present due to an unexpected in-frame fusion of NRG1 with CD44. Preferably, the CD44-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 12 or its allele variant, including at least positions 17, 18 and 19 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 12之多肽序列,或是一種包含來自SEQ ID NO: 12或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置17、18及19處的胺基酸。所述多肽序列與SEQ ID NO: 12有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 12, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 12 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 17, 18 and 19. The polypeptide sequence has at least 85% identity to SEQ ID NO: 12, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD44-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 12的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 12之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 12的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 12之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 12的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 12之多肽的任一胺基酸。Preferably, any CD44-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 12, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 12. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 12 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 12. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 12 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 12 of any amino acid.
較佳地,本文所提供之介於NRG1與CD44之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CD44的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CD44-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。Preferably, the polypeptide fusions provided herein between NRG1 and CD44 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from CD44, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CD44-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein.
亦提供一種由包含有CD44之外顯子5 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CD44之外顯子5所編碼的多肽較佳地包含SEQ ID NO: 84或SEQ ID NO: 84之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 80-83中之任一者,或是SEQ ID NO: 80-83之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 80-83分別對應CD44之外顯子1-4所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CD44 exon 5 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 5 of CD44 preferably comprises or consists of SEQ ID NO: 84 or an allele variant of SEQ ID NO: 84. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 80-83, or any allele variant of SEQ ID NO: 80-83, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 80-83 correspond to individual polypeptide sequences encoded by exons 1-4 of CD44, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該CD44之外顯子5部分所編碼的多肽包含來自SEQ ID NO: 84或SEQ ID NO: 84之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 84有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 5 part of CD44 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 84 or SEQ ID NO: 84 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 84, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD44--NRG1多肽融合物包括如SEQ ID NO: 762或其對偶基因變異體之多肽序列。SEQ ID NO: 762含有一個在CD44-與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與CD44-的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與CD44-出乎意料產生符合讀框的融合而有蘇胺酸(T, Thr)殘基的存在。較佳地,該CD44--NRG1多肽融合物包括來自SEQ ID NO: 762或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any CD44-NRG1 polypeptide fusion of the present disclosure comprises a polypeptide sequence such as SEQ ID NO: 762 or an allele variant thereof. SEQ ID NO: 762 contains a fusion junction between CD44- and NRG1 with the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of CD44-) and position Between the amino acids of 26 (which are together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, a threonine (T, Thr) residue is present due to an unexpected in-frame fusion of NRG1 to CD44-. Preferably, the CD44--NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 762 or its allele variant, including at least position 24 , 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 762之多肽序列,或是一種包含來自SEQ ID NO: 762或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 762有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 762, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 762 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 762, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD44-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 762的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 762之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 762的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 762之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 762的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 762之多肽的任一胺基酸。Preferably, any CD44-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 762, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 762. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 762 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 762 : Any amino acid of the polypeptide of 762. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 762 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 762 of any amino acid.
較佳地,本文所提供之介於CD44與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CD44的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CD44-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 SLC3A2-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between CD44 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from CD44, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CD44-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. SLC3A2-NRG1 polypeptide fusion
亦提供一種由包含有SLC3A2之轉錄本6之外顯子1 (或其對偶基因變異體)的一部分以及NRG1之外顯子5 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。所述SLC3A2外顯子1所編碼的多肽較佳地包含SEQ ID NO: 113或SEQ ID NO: 113之對偶基因變異體或是由其組成。該NRG1之外顯子5所編碼的多肽較佳地包含SEQ ID NO: 143或SEQ ID NO: 143之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 144-151中之任一者,或是SEQ ID NO: 144-151中之任一者之任一對偶基因變異體。SEQ ID NO: 144-151分別對應NRG1之外顯子6-13所編碼的個別多肽序列。所述NRG1之外顯子5部分亦可由SEQ ID NO: 158或SEQ ID NO: 158之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列且包括一個由外顯子5之最3’符合讀框的三聯體所編碼之額外絲胺酸殘基。Also provided is a polynucleotide encoded by a portion of exon 1 (or its allele variant) of transcript 6 of SLC3A2 and a portion of NRG1 exon 5 (or its allele variant) Polypeptide Fusions. The polypeptide encoded by exon 1 of SLC3A2 preferably comprises or consists of SEQ ID NO: 113 or the allele variant of SEQ ID NO: 113. The polypeptide encoded by exon 5 of NRG1 preferably comprises or consists of SEQ ID NO: 143 or an allele variant of SEQ ID NO: 143. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 144-151, or any allele variant of any one of SEQ ID NO: 144-151. SEQ ID NOs: 144-151 correspond to individual polypeptide sequences encoded by exons 6-13 of NRG1, respectively. The exon 5 part of NRG1 may also be composed of SEQ ID NO: 158 or an allele variant of SEQ ID NO: 158, the sequence of which corresponds to the polypeptide sequence encoded by all exons 6-13 and includes a sequence consisting of The extra serine residue encoded by the most 3' in-frame triplet of exon 5.
較佳地,該SLC3A2之轉錄本6之外顯子1部分所編碼的多肽包含來自SEQ ID NO: 113或SEQ ID NO: 113之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 113有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在一較佳實施例中,該SLC3A2-NRG1融合物中之NRG1之外顯子5部分至少包含SEQ ID NO: 143位置16的胺基酸,或SEQ ID NO: 143之對偶基因變異體的對應胺基酸。該對偶基因變異體與SEQ ID NO: 143有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 1 part of exon 6 of the transcript of SLC3A2 comprises 8, 9, 10, 11, 12, 13 from SEQ ID NO: 113 or the allele variant of SEQ ID NO: 113 or 14 consecutive amino acids, or consists of them. The allele variant has at least 85% identity to SEQ ID NO: 113, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. In a preferred embodiment, the NRG1 exon 5 part in the SLC3A2-NRG1 fusion comprises at least the amino acid at position 16 of SEQ ID NO: 143, or the corresponding allele variant of SEQ ID NO: 143 amino acids. The allele variant has at least 85% identity to SEQ ID NO: 143, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之SLC3A2-NRG1多肽融合物之任一轉錄本6包括如SEQ ID NO: 16或其對偶基因變異體之多肽序列。SEQ ID NO: 16含有一個在SLC3A2與NRG1之間的融合接合處,該融合係位於位置17之胺基酸(其係與SLC3A2的柄位置1-16處的胺基酸一起)與位置19之胺基酸(其係與NRG1的柄位置20-29處的胺基酸一起)之間。在位置18處,因為NRG1與該SLC3A2轉錄本出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該SLC3A2-NRG1多肽融合物包括來自SEQ ID NO: 16或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置17、18及19的胺基酸。Preferably, any transcript 6 of the SLC3A2-NRG1 polypeptide fusion disclosed herein includes a polypeptide sequence such as SEQ ID NO: 16 or its allele variant. SEQ ID NO: 16 contains a fusion junction between SLC3A2 and NRG1 between the amino acid at position 17 (with the amino acids at positions 1-16 of the stalk of SLC3A2) and between position 19 Amino acids (which tie together with the amino acids at positions 20-29 of the handle of NRG1). At position 18, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 to the SLC3A2 transcript. Preferably, the SLC3A2-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 16 or its allele variant, including at least positions 17 and 18 and 19 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 16之多肽序列,或是一種包含來自SEQ ID NO: 16或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置17、18及19處的胺基酸。所述多肽序列與SEQ ID NO: 16有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 16, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 16 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 17, 18 and 19. The polypeptide sequence has at least 85% identity to SEQ ID NO: 16, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之SLC3A2-NRG1多肽融合物之任一轉錄本6包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 16的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 16之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 16的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 16之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 16的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 16之多肽的任一胺基酸。Preferably, any transcript 6 of the SLC3A2-NRG1 polypeptide fusion disclosed herein includes one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) points A mutated polypeptide sequence of SEQ ID NO: 16, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 16. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 16 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 16. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 16 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 16 of any amino acid.
較佳地,本文所提供之介於NRG1與SLC3A2之轉錄本6之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自SLC3A2的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有SLC3A2-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。Preferably, the polypeptide fusions provided herein between NRG1 and transcript 6 of SLC3A2 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from SLC3A2, and the portion spanning the fusion junction The part to the C-terminus is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the SLC3A2-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein.
亦提供一種由包含有SLC3A2之轉錄本3之外顯子2 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。所述SLC3A2外顯子2所編碼的多肽較佳地包含SEQ ID NO: 470或SEQ ID NO: 470之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 469或145-151中之任一者,或是SEQ ID NO: 469或145-151之任一對偶基因變異體。SEQ ID NO: 145-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polynucleotide encoded by a portion of exon 2 (or its allele variant) of transcript 3 of SLC3A2 and a portion of NRG1 exon 6 (or its allele variant) Polypeptide Fusions. The polypeptide encoded by exon 2 of SLC3A2 preferably comprises or consists of SEQ ID NO: 470 or the allele variant of SEQ ID NO: 470. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 469 or 145-151, or any allele variant of SEQ ID NO: 469 or 145-151. SEQ ID NOs: 145-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該SLC3A2之轉錄本3之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 470或SEQ ID NO: 470之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 470有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。在一較佳實施例中,該SLC3A2-NRG1融合物中之NRG1之外顯子6部分包含或者是根據SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的序列。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 2 part of transcript 3 of the SLC3A2 comprises 8, 9, 10, 11, 12, 13 from SEQ ID NO: 470 or the allele variant of SEQ ID NO: 470 or 14 consecutive amino acids, or consists of them. The allele variant has at least 85% identity to SEQ ID NO: 470, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. In a preferred embodiment, the portion of exon 6 of NRG1 in the SLC3A2-NRG1 fusion comprises or is the sequence according to SEQ ID NO: 144 or the allele variant of SEQ ID NO: 144. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之SLC3A2-NRG1多肽融合物之任一轉錄本3包括如SEQ ID NO: 455或其對偶基因變異體之多肽序列。SEQ ID NO: 455含有一個在SLC3A2與NRG1之間的融合接合處,該融合係位於位置30之胺基酸(其係與SLC3A2的柄位置1-29處的胺基酸一起)與位置32之胺基酸(其係與NRG1的柄位置33-39處的胺基酸一起)之間。在位置31處,因為NRG1與該SLC3A2轉錄本出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該SLC3A2-NRG1多肽融合物包括來自SEQ ID NO: 455或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置30、31及32的胺基酸。Preferably, any transcript 3 of the SLC3A2-NRG1 polypeptide fusion disclosed herein includes a polypeptide sequence such as SEQ ID NO: 455 or its allele variant. SEQ ID NO: 455 contains a fusion junction between SLC3A2 and NRG1 between the amino acid at position 30 (together with the amino acids at positions 1-29 of the stalk of SLC3A2) and between position 32 between the amino acids (which tie together with the amino acids at positions 33-39 of the handle of NRG1). At position 31, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 to the SLC3A2 transcript. Preferably, the SLC3A2-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 455 or its allele variants, including at least positions 30 and 31 and 32 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 455之多肽序列,或是一種包含來自SEQ ID NO: 455或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置30、31及32處的胺基酸。所述多肽序列與SEQ ID NO: 455有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 455, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 455 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 30, 31 and 32. The polypeptide sequence has at least 85% identity to SEQ ID NO: 455, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之SLC3A2-NRG1多肽融合物之任一轉錄本3包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 455的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 455之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 455的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 455之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 455的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 455之多肽的任一胺基酸。Preferably, any transcript 3 of the SLC3A2-NRG1 polypeptide fusion disclosed herein includes one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) points A mutated polypeptide sequence of SEQ ID NO: 455, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 455. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 455 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 455. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 455 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 455 of any amino acid.
較佳地,本文所提供之介於NRG1與SLC3A2之轉錄本6之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自SLC3A2的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有SLC3A2-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 VTCN1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and transcript 6 of SLC3A2 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from SLC3A2 and spanning the fusion junction The part to the C-terminus is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the SLC3A2-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. VTCN1-NRG1 polypeptide fusion
亦提供一種由包含有VTCN1之外顯子2 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該VTCN1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 176或SEQ ID NO: 176之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 175或SEQ ID NO: 175之對偶基因變異體,且較佳地進一步包括SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 141-151中之任一者之任一對偶基因變異體。SEQ ID NO: 175對應VTCN1之外顯子1所編碼的多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of VTCN1 exon 2 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 2 of VTCN1 preferably comprises or consists of SEQ ID NO: 176 or the allele variant of SEQ ID NO: 176. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises SEQ ID NO: 175 or an allele variant of SEQ ID NO: 175, and preferably further comprises any one of SEQ ID NO: 141-151, or SEQ ID NO: 141-151, or SEQ ID NO: 175 Any allele variant of any of ID NOs: 141-151. SEQ ID NO: 175 corresponds to the polypeptide sequence encoded by exon 1 of VTCN1. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該VTCN1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 176或SEQ ID NO: 176之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 176有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 2 part of VTCN1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 176 or the allele variant of SEQ ID NO: 176 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 176, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一VTCN1-NRG1多肽融合物包括如SEQ ID NO: 167或其對偶基因變異體之多肽序列。SEQ ID NO: 167含有一個在VTCN1與NRG1之間的融合接合處,該融合係位於位置21之胺基酸(其係與VTCN1的柄位置1-20處的胺基酸一起)與位置23之胺基酸(其係與NRG1的柄位置24-30處的胺基酸一起)之間。在位置22處,因為NRG1與VTCN1出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該VTCN1-NRG1多肽融合物包括來自SEQ ID NO: 167或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置21、22及23的胺基酸。Preferably, any VTCN1-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 167 or an allele variant thereof. SEQ ID NO: 167 contains a fusion junction between VTCN1 and NRG1 between the amino acid at position 21 (with the amino acids at positions 1-20 of the stalk of VTCN1 ) and between position 23 between the amino acids (which tie together with the amino acids at positions 24-30 of the handle of NRG1). At position 22, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 and VTCN1. Preferably, the VTCN1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 167 or its allele variant, including at least positions 21, 22 and 23 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 167之多肽序列,或是一種包含來自SEQ ID NO: 167或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置21、22及23處的胺基酸。所述多肽序列與SEQ ID NO: 167有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 167, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 167 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 21, 22 and 23. The polypeptide sequence has at least 85% identity to SEQ ID NO: 167, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一VTCN1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 167的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 167之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 167的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 167之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 167的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 167之多肽的任一胺基酸。Preferably, any VTCN1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 167, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 167. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 167 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 167. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 167 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 167 of any amino acid.
較佳地,本文所提供之介於VTCN1與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自VTCN1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有VTCN1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CDH1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between VTCN1 and NRG1 are aligned so that the part spanning the N-terminus to the fusion junction is the polypeptide sequence from VTCN1, and the part spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the VTCN1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CDH1-NRG1 polypeptide fusion
亦提供一種由包含有CDH1之外顯子11 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CDH1之外顯子11所編碼的多肽較佳地包含SEQ ID NO: 206或SEQ ID NO: 206之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 205或SEQ ID NO: 205之對偶基因變異體,且較佳地進一步包括SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 141-151中之任一者之任一對偶基因變異體。SEQ ID NO: 205對應CDH1之外顯子10所編碼的多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CDH1 exon 11 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 11 of CDH1 preferably comprises or consists of SEQ ID NO: 206 or the allele variant of SEQ ID NO: 206. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises SEQ ID NO: 205 or an allele variant of SEQ ID NO: 205, and preferably further comprises any one of SEQ ID NO: 141-151, or SEQ ID NO: 141-151, or SEQ ID NO: Any allele variant of any of ID NOs: 141-151. SEQ ID NO: 205 corresponds to the polypeptide sequence encoded by exon 10 of CDH1. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該CDH1之外顯子11部分所編碼的多肽包含來自SEQ ID NO: 206或SEQ ID NO: 206之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 206有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 11 part of CDH1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 206 or SEQ ID NO: 206 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 206, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CDH1-NRG1多肽融合物包括如SEQ ID NO: 187或其對偶基因變異體之多肽序列。SEQ ID NO: 187含有一個在CDH1與NRG1之間的融合接合處,該融合係位於位置39之胺基酸(其係與CDH1的柄位置1-38處的胺基酸一起)與位置41之胺基酸(其係與NRG1的柄位置42-49處的胺基酸一起)之間。在位置40處,因為NRG1與CDH1出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該CDH1-NRG1多肽融合物包括來自SEQ ID NO: 187或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置39、40及41的胺基酸。Preferably, any CDH1-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 187 or its allele variant. SEQ ID NO: 187 contains a fusion junction between CDH1 and NRG1 between the amino acid at position 39 (together with the amino acids at positions 1-38 of the handle of CDH1) and between position 41 between the amino acids (which tie together with the amino acids at positions 42-49 of the handle of NRG1). At position 40, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 and CDH1. Preferably, the CDH1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 187 or its allele variant, including at least positions 39, 40 and 41 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 187之多肽序列,或是一種包含來自SEQ ID NO: 187或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置39、40及41處的胺基酸。所述多肽序列與SEQ ID NO: 187有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 187, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 187 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 39, 40 and 41. The polypeptide sequence has at least 85% identity to SEQ ID NO: 187, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CDH1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 187的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 187之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 187的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 187之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 187的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 187之多肽的任一胺基酸。Preferably, any CDH1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 187, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 187. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 187 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 187. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 187 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 187 of any amino acid.
較佳地,本文所提供之介於CDH1與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CDH1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CDH1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CXADR-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between CDH1 and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from CDH1, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CDH1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CXADR-NRG1 polypeptide fusion
亦提供一種由包含有CXADR之外顯子1 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CXADR之外顯子1所編碼的多肽較佳地包含SEQ ID NO: 225或SEQ ID NO: 225之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 141-151,或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 225對應CXADR之外顯子1所編碼的多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CXADR exon 1 (or its allelic variant) and a portion of NRG1 exon 2 (or its allelic variant). The polypeptide encoded by exon 1 of CXADR preferably comprises or consists of SEQ ID NO: 225 or an allele variant of SEQ ID NO: 225. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises SEQ ID NO: 141-151, or any allele variant of SEQ ID NO: 141-151. SEQ ID NO: 225 corresponds to the polypeptide sequence encoded by exon 1 of CXADR. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該CXADR之外顯子1部分所編碼的多肽包含來自SEQ ID NO: 225或SEQ ID NO: 225之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 225有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the CXADR exon 1 part comprises 8, 9, 10, 11, 12, 13 or 14 contiguous Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 225, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CXADR-NRG1多肽融合物包括如SEQ ID NO: 218或其對偶基因變異體之多肽序列。SEQ ID NO: 218含有一個在CXADR與NRG1之間的融合接合處,該融合係位於位置14之胺基酸(其係與CXADR的柄位置1-13處的胺基酸一起)與位置16之胺基酸(其係與NRG1的柄位置17-33處的胺基酸一起)之間。在位置15處,因為NRG1與CXADR出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該CXADR-NRG1多肽融合物包括來自SEQ ID NO: 218或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置14、15及16的胺基酸。Preferably, any CXADR-NRG1 polypeptide fusion of the present disclosure comprises a polypeptide sequence such as SEQ ID NO: 218 or an allele variant thereof. SEQ ID NO: 218 contains a fusion junction between CXADR and NRG1 between the amino acid at position 14 (together with the amino acids at positions 1-13 of the stalk of CXADR) and position 16 Amino acids (which tie together with the amino acids at positions 17-33 of the handle of NRG1). At position 15, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 to CXADR. Preferably, the CXADR-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 218 or its allele variant, including at least positions 14, 14, 15 and 16 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 218之多肽序列,或是一種包含來自SEQ ID NO: 218或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置14、15及16處的胺基酸。所述多肽序列與SEQ ID NO: 218有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 218, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 218 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 14, 15 and 16. The polypeptide sequence has at least 85% identity to SEQ ID NO: 218, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CXADR-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 218的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 218之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 218的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 218之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 218的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 218之多肽的任一胺基酸。Preferably, any CXADR-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 218, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 218. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 218 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 218. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 218 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 218 of any amino acid.
較佳地,本文所提供之介於CXADR與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CXADR的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CXADR-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 GTF2E2-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between CXADR and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from CXADR, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CXADR-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. GTF2E2-NRG1 polypeptide fusion
亦提供一種由包含有GTF2E2之外顯子2 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該GTF2E2之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 244或SEQ ID NO: 244之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 141-151,或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of GTF2E2 exon 2 (or its allelic variant) and a portion of NRG1 exon 2 (or its allelic variant). The polypeptide encoded by exon 2 of GTF2E2 preferably comprises or consists of SEQ ID NO: 244 or an allele variant of SEQ ID NO: 244. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises SEQ ID NO: 141-151, or any allele variant of SEQ ID NO: 141-151. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該GTF2E2之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 244或SEQ ID NO: 244之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 244有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 2 part of GTF2E2 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 244 or an allele variant of SEQ ID NO: 244 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 244, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一GTF2E2-NRG1多肽融合物包括如SEQ ID NO: 234或其對偶基因變異體之多肽序列。SEQ ID NO: 234含有一個在GTF2E2與NRG1之間的融合接合處,該融合係位於位置46之胺基酸(其係與GTF2E2的柄位置1-45處的胺基酸一起)與位置48之胺基酸(其係與NRG1的柄位置49-88處的胺基酸一起)之間。在位置47處,因為NRG1與GTF2E2出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該GTF2E2-NRG1多肽融合物包括來自SEQ ID NO: 234或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置46、47及48的胺基酸。Preferably, any GTF2E2-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 234 or an allele variant thereof. SEQ ID NO: 234 contains a fusion junction between GTF2E2 and NRG1 between the amino acid at position 46 (together with the amino acids at positions 1-45 of the handle of GTF2E2) and between position 48 Amino acids (which tie together with the amino acids at positions 49-88 of the handle of NRG1). At position 47, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion between NRG1 and GTF2E2. Preferably, the GTF2E2-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 234 or its allele variant, including at least positions 46, 47 and 48 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 234之多肽序列,或是一種包含來自SEQ ID NO: 234或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置46、47及48處的胺基酸。所述多肽序列與SEQ ID NO: 234有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 234, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 234 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 46, 47 and 48. The polypeptide sequence has at least 85% identity to SEQ ID NO: 234, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一GTF2E2-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 234的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 234之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 234的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 234之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 234的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 234之多肽的任一胺基酸。Preferably, any GTF2E2-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 234, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 234. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 234 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 234 : Any amino acid of the polypeptide of 234. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 234 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 234 of any amino acid.
較佳地,本文所提供之介於GTF2E2與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自GTF2E2的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有GTF2E2-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CSMD1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between GTF2E2 and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from GTF2E2, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the GTF2E2-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CSMD1-NRG1 polypeptide fusion
亦提供一種由包含有CSMD1之外顯子23 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CSMD1之外顯子23所編碼的多肽較佳地包含SEQ ID NO: 305或SEQ ID NO: 305之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 283-304中之任一者,或SEQ ID NO: 283-304之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151,或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 283-304分別對應CSMD1之外顯子1-22所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CSMD1 exon 23 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 23 of CSMD1 preferably comprises or consists of SEQ ID NO: 305 or an allele variant of SEQ ID NO: 305. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 283-304, or any allele variant of SEQ ID NO: 283-304, and preferably further comprises SEQ ID NO: 145-151, or any allele variant of SEQ ID NO: 145-151. SEQ ID NO: 283-304 correspond to individual polypeptide sequences encoded by exons 1-22 of CSMD1, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該CSMD1之外顯子23部分所編碼的多肽包含來自SEQ ID NO: 305或SEQ ID NO: 305之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 305有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 23 of CSMD1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 305 or the allele variant of SEQ ID NO: 305 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 305, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CSMD1-NRG1多肽融合物包括如SEQ ID NO: 256或其對偶基因變異體之多肽序列。SEQ ID NO: 256含有一個在CSMD1與NRG1之間的融合接合處,該融合係位於位置29之胺基酸(其係與CSMD1的柄位置1-28處的胺基酸一起)與位置31之胺基酸(其係與NRG1的柄位置32-50處的胺基酸一起)之間。在位置30處,因為NRG1與CSMD1出乎意料產生符合讀框的融合而有蘇胺酸(T, Thr)殘基的存在。較佳地,該CSMD1-NRG1多肽融合物包括來自SEQ ID NO: 256或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置29、30及31的胺基酸。Preferably, any CSMD1-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 256 or an allele variant thereof. SEQ ID NO: 256 contains a fusion junction between CSMD1 and NRG1 between the amino acid at position 29 (with the amino acids at positions 1-28 of the stalk of CSMD1) and between position 31 between the amino acids (which tie together with the amino acids at positions 32-50 of the handle of NRG1). At position 30, a threonine (T, Thr) residue was present due to an unexpected in-frame fusion of NRG1 and CSMD1. Preferably, the CSMD1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 256 or its allele variant, including at least positions 29, 30 and 31 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 256之多肽序列,或是一種包含來自SEQ ID NO: 256或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置29、30及31處的胺基酸。所述多肽序列與SEQ ID NO: 256有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 256, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 256 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 29, 30 and 31. The polypeptide sequence has at least 85% identity to SEQ ID NO: 256, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CSMD1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 256的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 256之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 218的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 256之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 256的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 256之多肽的任一胺基酸。Preferably, any CSMD1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 256, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 256. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 218 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 256. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 256 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 256 of any amino acid.
較佳地,本文所提供之介於CSMD1與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CSMD1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CSMD1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 PTN-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between CSMD1 and NRG1 are aligned so that the portion spanning the N-terminal to the fusion junction is the polypeptide sequence from CSMD1, and the portion spanning the fusion junction to the C-terminus Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing CSMD1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. PTN-NRG1 polypeptide fusion
亦提供一種由包含有PTN之外顯子4 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該PTN之外顯子4所編碼的多肽較佳地包含SEQ ID NO: 323或SEQ ID NO: 323之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 321及/或322或是SEQ ID NO: 321及/或322之對偶基因變異體,且較佳地進一步包括SEQ ID NO: 141-151,或是SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 321、322及323分別對應PTN之外顯子2、3、4所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of PTN exon 4 (or its allelic variant) and a portion of NRG1 exon 2 (or its allelic variant). The polypeptide encoded by exon 4 of PTN preferably comprises or consists of SEQ ID NO: 323 or an allele variant of SEQ ID NO: 323. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises SEQ ID NO: 321 and/or 322 or an allele variant of SEQ ID NO: 321 and/or 322, and preferably further comprises SEQ ID NO: 141-151 , or any allele variant of SEQ ID NO: 141-151. SEQ ID NO: 321, 322 and 323 correspond to the individual polypeptide sequences encoded by exons 2, 3 and 4 of PTN, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該PTN之外顯子4部分所編碼的多肽包含來自SEQ ID NO: 323或SEQ ID NO: 323之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 323有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 4 of the PTN comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 323 or an allele variant of SEQ ID NO: 323 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 323, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一PTN-NRG1多肽融合物包括如SEQ ID NO: 314或其對偶基因變異體之多肽序列。SEQ ID NO: 314含有一個在PTN與NRG1之間的融合接合處,該融合係位於位置33之胺基酸(其係與PTN的柄位置1-32處的胺基酸一起)與位置35之胺基酸(其係與NRG1的柄位置36-67處的胺基酸一起)之間。在位置34處,因為NRG1與PTN出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該PTN-NRG1多肽融合物包括來自SEQ ID NO: 314或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置33、34及35的胺基酸。Preferably, any PTN-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 314 or an allele variant thereof. SEQ ID NO: 314 contains a fusion junction between PTN and NRG1 between the amino acid at position 33 (with the amino acids at positions 1-32 of the stalk of PTN) and between position 35 Amino acids (which tie together with the amino acids at positions 36-67 of the handle of NRG1). At position 34, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with PTN. Preferably, the PTN-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 314 or its allele variant, at least including positions 33, 34 and 35 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 314之多肽序列,或是一種包含來自SEQ ID NO: 314或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置33、34及35處的胺基酸。所述多肽序列與SEQ ID NO: 314有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 314, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 314 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 33, 34 and 35. The polypeptide sequence has at least 85% identity to SEQ ID NO: 314, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一PTN-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 314的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 314之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 314的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 314之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 314的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 314之多肽的任一胺基酸。Preferably, any PTN-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 314, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 314. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 314 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 314 : Any amino acid of the polypeptide of 314. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 314 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 314 of any amino acid.
較佳地,本文所提供之介於PTN與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自PTN的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有PTN-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 ST14-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between PTN and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from PTN, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the PTN-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. ST14-NRG1 polypeptide fusion
亦提供一種由包含有ST14之外顯子11 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該ST14之外顯子11所編碼的多肽較佳地包含SEQ ID NO: 362或SEQ ID NO: 362之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 352-361中之任一者,或是SEQ ID NO: 352-361之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 352-361分別對應ST14之外顯子1-10所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of ST14 exon 11 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 11 of ST14 preferably comprises or consists of SEQ ID NO: 362 or an allele variant of SEQ ID NO: 362. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 352-361, or any allele variant of SEQ ID NO: 352-361, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 352-361 correspond to individual polypeptide sequences encoded by exons 1-10 of ST14, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該ST14之外顯子11部分所編碼的多肽包含來自SEQ ID NO: 362或SEQ ID NO: 362之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 362有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 11 part of ST14 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 362 or SEQ ID NO: 362 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 362, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ST14-NRG1多肽融合物包括如SEQ ID NO: 331或其對偶基因變異體之多肽序列。SEQ ID NO: 331含有一個在ST14與NRG1之間的融合接合處,該融合係位於位置31之胺基酸(其係與ST14的柄位置1-30處的胺基酸一起)與位置33之胺基酸(其係與NRG1的柄位置34-60處的胺基酸一起)之間。在位置32處,因為NRG1與ST14出乎意料產生符合讀框的融合而有脯胺酸(P, pro)殘基的存在。較佳地,該ST14-NRG1多肽融合物包括來自SEQ ID NO: 331或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置31、32及33的胺基酸。Preferably, any ST14-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 331 or its allele variant. SEQ ID NO: 331 contains a fusion junction between ST14 and NRG1 between the amino acid at position 31 (together with the amino acids at positions 1-30 of the stalk of ST14) and position 33 Amino acids (which tie together with the amino acids at positions 34-60 of the handle of NRG1). At position 32, a proline (P, pro) residue was present due to an unexpected in-frame fusion of NRG1 with ST14. Preferably, the ST14-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 331 or its allele variant, including at least positions 31, 32 and 33 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 331之多肽序列,或是一種包含來自SEQ ID NO: 331或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置31、32及33處的胺基酸。所述多肽序列與SEQ ID NO: 455有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 331, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 331 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 31, 32 and 33. The polypeptide sequence has at least 85% identity to SEQ ID NO: 455, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ST14-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 331的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 331之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 331的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 331之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 331的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 331之多肽的任一胺基酸。Preferably, any ST14-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 331, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 331. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 331 having 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 331 : Any amino acid of the polypeptide of 331. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 331 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 331 of any amino acid.
較佳地,本文所提供之介於ST14與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自ST14的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有ST14-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 THBS1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between ST14 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from ST14, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the ST14-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. THBS1-NRG1 polypeptide fusion
亦提供一種由包含有THBS1之外顯子9 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該THBS1之外顯子9所編碼的多肽較佳地包含SEQ ID NO: 396或SEQ ID NO: 396之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 389-395中之任一者,或是SEQ ID NO: 389-395之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 389-395分別對應THBS1之外顯子2-8所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of THBS1 exon 9 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 9 of THBS1 preferably comprises or consists of SEQ ID NO: 396 or an allele variant of SEQ ID NO: 396. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 389-395, or any allele variant of SEQ ID NO: 389-395, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 389-395 correspond to individual polypeptide sequences encoded by exons 2-8 of THBS1, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該THBS1之外顯子9部分所編碼的多肽包含來自SEQ ID NO: 396或SEQ ID NO: 396之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 396有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 9 of THBS1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 396 or the allele variant of SEQ ID NO: 396 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 396, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一THBS1-NRG1多肽融合物包括如SEQ ID NO: 377或其對偶基因變異體之多肽序列。SEQ ID NO: 377含有一個在THBS1與NRG1之間的融合接合處,該融合係位於位置18之胺基酸(其係與THBS1的柄位置1-16處的胺基酸一起)與位置20之胺基酸(其係與NRG1的柄位置20-48處的胺基酸一起)之間。在位置19處,因為NRG1與THBS1出乎意料產生符合讀框的融合而有蘇胺酸(T, Thr)殘基的存在。較佳地,該THBS1-NRG1多肽融合物包括來自SEQ ID NO: 377或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置18、19及20的胺基酸。Preferably, any THBS1-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 377 or its allele variant. SEQ ID NO: 377 contains a fusion junction between THBS1 and NRG1 between the amino acid at position 18 (together with the amino acids at positions 1-16 of the handle of THBS1) and between position 20 Amino acids (which tie together with amino acids at positions 20-48 of the handle of NRG1). At position 19, a threonine (T, Thr) residue was present due to an unexpected in-frame fusion of NRG1 with THBS1. Preferably, the THBS1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 377 or its allele variant, including at least positions 18, Amino acids 19 and 20.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 377之多肽序列,或是一種包含來自SEQ ID NO: 377或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置18、19及20處的胺基酸。所述多肽序列與SEQ ID NO: 377有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 377, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 377 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 18, 19 and 20. The polypeptide sequence has at least 85% identity to SEQ ID NO: 377, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一THBS1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 377的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 377之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 377的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 377之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 377的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 377之多肽的任一胺基酸。Preferably, any THBS1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 377, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 377. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 377 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 377. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 377 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 377 of any amino acid.
較佳地,本文所提供之介於THBS1與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自THBS1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有THBS1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 AGRN-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between THBS1 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from THBS1, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing THBS1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. AGRN-NRG1 polypeptide fusion
亦提供一種由包含有AGRN之外顯子12 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該AGRN之外顯子12所編碼的多肽較佳地包含SEQ ID NO: 430或SEQ ID NO: 430之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 419-429中之任一者,或是SEQ ID NO: 419-429之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 419-429分別對應AGRN之外顯子1-11所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of AGRN exon 12 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 12 of AGRN preferably comprises or consists of SEQ ID NO: 430 or an allele variant of SEQ ID NO: 430. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 419-429, or any allele variant of SEQ ID NO: 419-429, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NO: 419-429 correspond to individual polypeptide sequences encoded by AGRN exons 1-11, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該AGRN之外顯子12部分所編碼的多肽包含來自SEQ ID NO: 430或SEQ ID NO: 430之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 430有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 12 part of AGRN comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 430 or SEQ ID NO: 430 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 430, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一AGRN-NRG1多肽融合物包括如SEQ ID NO: 404或其對偶基因變異體之多肽序列。SEQ ID NO: 404含有一個在AGRN與NRG1之間的融合接合處,該融合係位於位置35之胺基酸(其係與AGRN的柄位置1-34處的胺基酸一起)與位置37之胺基酸(其係與NRG1的柄位置38-69處的胺基酸一起)之間。在位置36處,因為NRG1與AGRN出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該AGRN-NRG1多肽融合物包括來自SEQ ID NO: 404或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置35、36及37的胺基酸。Preferably, any AGRN-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 404 or an allele variant thereof. SEQ ID NO: 404 contains a fusion junction between AGRN and NRG1 between the amino acid at position 35 (together with the amino acids at positions 1-34 of the stalk of AGRN) and position 37 Amino acids (which tie together with the amino acids at positions 38-69 of the handle of NRG1). At position 36, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with AGRN. Preferably, the AGRN-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 404 or its allele variant, including at least positions 35, 36 and 37 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 404之多肽序列,或是一種包含來自SEQ ID NO: 404或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置35、36及37處的胺基酸。所述多肽序列與SEQ ID NO: 404有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 404, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 404 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 35, 36 and 37. The polypeptide sequence has at least 85% identity to SEQ ID NO: 404, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一AGRN-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 404 的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 404 之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 404 的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 404 之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 404 的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 404 之多肽的任一胺基酸。Preferably, any AGRN-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 404, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 404. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 404 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 404. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 404 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 404 of any amino acid.
較佳地,本文所提供之介於AGRN與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自AGRN的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有AGRN-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 PVALB-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between AGRN and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from AGRN, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the AGRN-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. PVALB-NRG1 polypeptide fusion
根據本揭露,亦提供一種由包含有PVALB核酸序列(或PVALB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸所編碼之多肽融合物。該PVALB核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 445-449中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。該NRG1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 139-152中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。According to the present disclosure, there is also provided a polypeptide fusion encoded by a polynucleotide comprising a PVALB nucleic acid sequence (or a part of the PVALB nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). The PVALB nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 445-449 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences. The NRG1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 139-152 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences.
SEQ ID NO: 445-449中之任一者之PVALB對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。SEQ ID NO: 139-152中之任一者之NRG1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。The PVALB allele variant of any one of SEQ ID NOs: 445-449 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity. The NRG1 allele variant of any one of SEQ ID NOs: 139-152 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity.
較佳地,所述融合物之PVALB核酸序列部分係編碼PVALB之多肽部分,此多肽部分包含來自SEQ ID NO: 445-449中之任一者(或SEQ ID NO: 445-449中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。較佳地,所述融合物之NRG1核酸序列部分係編碼NRG1之多肽部分,此多肽部分包含來自SEQ ID NO: 139-152中之任一者(或SEQ ID NO: 139-152中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。Preferably, the PVALB nucleic acid sequence part of the fusion is a polypeptide part of coding PVALB, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 445-449) from SEQ ID NO: 445-449 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of Preferably, the NRG1 nucleic acid sequence part of the fusion is a polypeptide part of encoding NRG1, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 139-152) from SEQ ID NO: 139-152 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of
較佳地,本揭露之任一PVALB-NRG1多肽融合物包括SEQ ID NO: 445-449中之任一者的多肽序列,此多肽序列具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變添加、刪除或取代SEQ ID NO: 445-449之多肽的任一胺基酸。較佳地,所述多肽融合物包括SEQ ID NO: 445-449中之任一者的多肽序列,此多肽序列具有1、2、3、4或5個點突變添加、刪除或取代SEQ ID NO: 445-449之多肽的任一胺基酸。更佳地,所述多核苷酸融合物包括SEQ ID NO: 445-449中之任一者的多肽序列,此多肽序列具有1、2或3個點突變添加、刪除或取代SEQ ID NO: 445-449之多肽的任一胺基酸。Preferably, any PVALB-NRG1 polypeptide fusion of the present disclosure comprises the polypeptide sequence of any one of SEQ ID NO: 445-449, and this polypeptide sequence has one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutation to add, delete or replace any amino acid of the polypeptide of SEQ ID NO: 445-449. Preferably, the polypeptide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 445-449, and this polypeptide sequence has 1, 2, 3, 4 or 5 point mutations to add, delete or replace SEQ ID NO : Any amino acid of the polypeptide of 445-449. More preferably, the polynucleotide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 445-449, and this polypeptide sequence has 1, 2 or 3 point mutations added, deleted or substituted for SEQ ID NO: 445 Any amino acid of the polypeptide of -449.
在一較佳實施例中,亦提供一種由包含有PVALB之外顯子4 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該PVALB之外顯子4所編碼的多肽較佳地包含SEQ ID NO: 447或SEQ ID NO: 447之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 445及446中之任一者,以及SEQ ID NO: 141-151中之任一者,或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 145-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可根據SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體,其序列係對應外顯子6-13全部所編碼的多肽序列。In a preferred embodiment, there is also provided a polynucleoside comprising a part of exon 4 of PVALB (or its allele variant) and a part of NRG1 exon 6 (or its allele variant) Acid-encoded polypeptide fusion. The polypeptide encoded by exon 4 of PVALB preferably comprises or consists of SEQ ID NO: 447 or an allele variant of SEQ ID NO: 447. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NOs: 445 and 446, and any one of SEQ ID NOs: 141-151, or any pair of SEQ ID NOs: 141-151 genetic variants. SEQ ID NOs: 145-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be according to SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該PVALB之外顯子4部分所編碼的多肽包含來自SEQ ID NO: 447或SEQ ID NO: 447之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 447有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 4 of PVALB comprises 8, 9, 10, 11, 12, 13 or 14 contiguous sequences from SEQ ID NO: 447 or an allele variant of SEQ ID NO: 447 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 447, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一PVALB-NRG1多肽融合物包括如SEQ ID NO: 438或其對偶基因變異體之多肽序列。SEQ ID NO: 438含有一個在PVALB與NRG1之間的融合接合處,該融合係位於位置33之胺基酸(其係與PVALB的柄位置1-32處的胺基酸一起)與位置35之胺基酸(其係與NRG1的柄位置36-75處的胺基酸一起)之間。在位置34處,因為NRG1與PVALB出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該PVALB-NRG1多肽融合物包括來自SEQ ID NO: 438或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置33、34及35的胺基酸。Preferably, any PVALB-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 438 or its allele variant. SEQ ID NO: 438 contains a fusion junction between PVALB and NRG1 between the amino acid at position 33 (together with the amino acids at positions 1-32 of the stalk of PVALB) and position 35 Amino acids (which tie together with the amino acids at positions 36-75 of the handle of NRG1). At position 34, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with PVALB. Preferably, the PVALB-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 438 or its allele variant, at least including positions 33, 34 and 35 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 438之多肽序列,或是一種包含來自SEQ ID NO: 438或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置33、34及35處的胺基酸。所述多肽序列與SEQ ID NO: 438有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 438, or a polypeptide comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 438 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 33, 34 and 35. The polypeptide sequence has at least 85% identity to SEQ ID NO: 438, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一PVALB-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 438的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 438之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 438的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 438之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 438的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 438之多肽的任一胺基酸。Preferably, any PVALB-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 438, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 438. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 438 having 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 438. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 438 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 438 of any amino acid.
較佳地,本文所提供之介於NRG1與PVALB之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自PVALB的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有PVALB-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 APP-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and PVALB are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from PVALB, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the PVALB-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. APP-NRG1 polypeptide fusion
亦提供一種由包含有APP之外顯子14 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該APP之外顯子14所編碼的多肽較佳地包含SEQ ID NO: 519或SEQ ID NO: 519之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 506-518中之任一者,或是SEQ ID NO: 506-518之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 506-518分別對應APP之外顯子1-3所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of APP exon 14 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 14 of APP preferably comprises or consists of SEQ ID NO: 519 or the allele variant of SEQ ID NO: 519. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 506-518, or any allele variant of SEQ ID NO: 506-518, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NO: 506-518 correspond to the individual polypeptide sequences encoded by APP exons 1-3, respectively. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該APP之外顯子14部分所編碼的多肽包含來自SEQ ID NO: 519或SEQ ID NO: 519之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 519有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 14 of the APP comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 519 or SEQ ID NO: 519 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 519, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一APP-NRG1多肽融合物包括如SEQ ID NO: 487或其對偶基因變異體之多肽序列。SEQ ID NO: 487含有一個在APP與NRG1之間的融合接合處,該融合係位於位置17之胺基酸(其係與APP的柄位置1-16處的胺基酸一起)與位置19之胺基酸(其係與NRG1的柄位置20-46處的胺基酸一起)之間。在位置18處,因為NRG1與APP出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該APP-NRG1多肽融合物包括來自SEQ ID NO: 487或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置17、18及19的胺基酸。Preferably, any APP-NRG1 polypeptide fusion of the present disclosure comprises a polypeptide sequence such as SEQ ID NO: 487 or its allele variant. SEQ ID NO: 487 contains a fusion junction between APP and NRG1 between the amino acid at position 17 (together with the amino acids at positions 1-16 of the handle of APP) and position 19 Amino acids (which tie together with the amino acids at positions 20-46 of the handle of NRG1). At position 18, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with APP. Preferably, the APP-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 487 or its allele variant, including at least positions 17, 18 and 19 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 487之多肽序列,或是一種包含來自SEQ ID NO: 487或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置17、18及19處的胺基酸。所述多肽序列與SEQ ID NO: 487有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 487, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 487 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 17, 18 and 19. The polypeptide sequence has at least 85% identity to SEQ ID NO: 487, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一APP-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 487的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 487之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 487的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 487之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 487的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 487之多肽的任一胺基酸。Preferably, any APP-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 487, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 487. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 487 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 487. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 487 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 487 of any amino acid.
較佳地,本文所提供之介於APP與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自APP的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有APP-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 WRN-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between APP and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from APP, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the APP-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. WRN-NRG1 polypeptide fusion
亦提供一種由包含有WRN之外顯子33 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該WRN之外顯子33所編碼的多肽較佳地包含SEQ ID NO: 597或SEQ ID NO: 597之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 566-596中之任一者,或是SEQ ID NO: 566-596之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 566-596對應WRN之外顯子2-32所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of WRN exon 33 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 33 of WRN preferably comprises or consists of SEQ ID NO: 597 or an allele variant of SEQ ID NO: 597. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 566-596, or any allele variant of SEQ ID NO: 566-596, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 566-596 correspond to individual polypeptide sequences encoded by WRN exons 2-32. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該WRN之外顯子33部分所編碼的多肽包含來自SEQ ID NO: 597或SEQ ID NO: 597之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 597有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 33 portion of WRN comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 597 or an allele variant of SEQ ID NO: 597 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 597, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一WRN-NRG1多肽融合物包括如SEQ ID NO: 528或其對偶基因變異體之多肽序列。SEQ ID NO: 528含有一個在WRN與NRG1之間的融合接合處,該融合係位於位置31之胺基酸(其係與WRN的柄位置1-30處的胺基酸一起)與位置33之胺基酸(其係與NRG1的柄位置34-60處的胺基酸一起)之間。在位置32處,因為NRG1與WRN出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該WRN-NRG1多肽融合物包括來自SEQ ID NO: 528或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置31、32及33的胺基酸。Preferably, any WRN-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 528 or an allele variant thereof. SEQ ID NO: 528 contains a fusion junction between WRN and NRG1 between the amino acid at position 31 (together with the amino acids at positions 1-30 of the stalk of WRN) and position 33 Amino acids (which tie together with the amino acids at positions 34-60 of the handle of NRG1). At position 32, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 to WRN. Preferably, the WRN-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 528 or its allele variant, including at least positions 31, 32 and 33 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 528之多肽序列,或是一種包含來自SEQ ID NO: 528或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置31、32及33處的胺基酸。所述多肽序列與SEQ ID NO: 528有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 528, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 528 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 31, 32 and 33. The polypeptide sequence has at least 85% identity to SEQ ID NO: 528, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一WRN-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 528的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 528之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 528的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 528之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 528的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 528之多肽的任一胺基酸。Preferably, any WRN-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 528, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 528. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 528 having 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 528. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 528 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 528 of any amino acid.
較佳地,本文所提供之介於WRN與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自WRN的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有WRN-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 ASPH-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between WRN and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from WRN, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the WRN-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. ASPH-NRG1 polypeptide fusion
根據本揭露,亦提供一種由包含有ASPH核酸序列(或ASPH核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸所編碼之多肽融合物。該ASPH核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 663-688中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。該NRG1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 139-152中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。According to the present disclosure, there is also provided a polypeptide fusion encoded by a polynucleotide comprising an ASPH nucleic acid sequence (or a part of the ASPH nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). The ASPH nucleic acid sequence (or part thereof) preferably encodes a DNA comprising any one of SEQ ID NOs: 663-688 (or an allele variant of any one of these SEQ ID NOs) or is derived from its constituent sequences. The NRG1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 139-152 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences.
SEQ ID NO: 663-688中之任一者之ASPH對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。SEQ ID NO: 139-152中之任一者之NRG1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。ASPH allele variants of any one of SEQ ID NO: 663-688 preferably have at least 85% sequence identity with these sequences, more preferably 90%, 92%, 94%, 96% therewith Or even better at least 98% sequence identity. The NRG1 allele variant of any one of SEQ ID NOs: 139-152 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity.
較佳地,所述融合物之ASPH核酸序列部分係編碼ASPH之多肽部分,此多肽部分包含來自SEQ ID NO: 663-688中之任一者(或SEQ ID NO: 663-688中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。較佳地,所述融合物之NRG1核酸序列部分係編碼NRG1之多肽部分,此多肽部分包含來自SEQ ID NO: 139-152中之任一者(或SEQ ID NO: 139-152中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。Preferably, the ASPH nucleic acid sequence portion of the fusion is a polypeptide portion encoding ASPH, and this polypeptide portion comprises any one of SEQ ID NOs: 663-688 (or any one of SEQ ID NOs: 663-688) or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of Preferably, the NRG1 nucleic acid sequence part of the fusion is a polypeptide part of encoding NRG1, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 139-152) from SEQ ID NO: 139-152 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of
較佳地,本揭露之任一ASPH-NRG1多肽融合物包括SEQ ID NO: 663-688中之任一者的多肽序列,此多肽序列具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變添加、刪除或取代SEQ ID NO: 663-688之多肽的任一胺基酸。較佳地,所述多肽融合物包括SEQ ID NO: 663-688中之任一者的多肽序列,此多肽序列具有1、2、3、4或5個點突變添加、刪除或取代SEQ ID NO: 663-688之多肽的任一胺基酸。更佳地,所述多核苷酸融合物包括SEQ ID NO: 663-688中之任一者的多肽序列,此多肽序列具有1、2或3個點突變添加、刪除或取代SEQ ID NO: 663-688之多肽的任一胺基酸。Preferably, any ASPH-NRG1 polypeptide fusion of the present disclosure includes the polypeptide sequence of any one of SEQ ID NO: 663-688, and this polypeptide sequence has one or more (ie 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutation to add, delete or replace any amino acid of the polypeptide of SEQ ID NO: 663-688. Preferably, the polypeptide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 663-688, and this polypeptide sequence has 1, 2, 3, 4 or 5 point mutations to add, delete or replace SEQ ID NO : Any amino acid of the polypeptide of 663-688. More preferably, the polynucleotide fusion includes the polypeptide sequence of any one of SEQ ID NO: 663-688, and this polypeptide sequence has 1, 2 or 3 point mutations to add, delete or replace SEQ ID NO: 663 Any amino acid of the polypeptide of -688.
在一較佳實施例中,亦提供一種由包含有ASPH之外顯子22 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該ASPH之外顯子22所編碼的多肽較佳地包含SEQ ID NO: 684或SEQ ID NO: 684之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 663-683中之任一者以及SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 663-683或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 663-683分別對應ASPH之外顯子1-21所編碼的個別多肽序列,而SEQ ID NO: 141-151 分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可根據SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體,其序列係對應外顯子3-13全部所編碼的多肽序列。In a preferred embodiment, there is also provided a polynucleoside comprising a part of ASPH exon 22 (or its allele variant) and a part of NRG1 exon 2 (or its allele variant) Acid-encoded polypeptide fusion. The polypeptide encoded by exon 22 of ASPH preferably comprises or consists of SEQ ID NO: 684 or an allele variant of SEQ ID NO: 684. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 663-683 and any one of SEQ ID NO: 141-151, or SEQ ID NO: 663-683 or SEQ ID NO : Any allele variant of 141-151. SEQ ID NOs: 663-683 correspond to individual polypeptide sequences encoded by ASPH exons 1-21, respectively, and SEQ ID NOs: 141-151 respectively correspond to individual polypeptide sequences encoded by NRG1 exons 3-13. The part of exon 2 of NRG1 can also be according to SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 3-13.
較佳地,該ASPH之外顯子22部分所編碼的多肽包含來自SEQ ID NO: 684或SEQ ID NO: 684之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 684有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the ASPH exon 22 encoded polypeptide comprises 8, 9, 10, 11, 12, 13 or 14 contiguous sequences from SEQ ID NO: 684 or an allele variant of SEQ ID NO: 684 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 684, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ASPH-NRG1多肽融合物包括如SEQ ID NO: 636或其對偶基因變異體之多肽序列。SEQ ID NO: 636含有一個在ASPH與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與ASPH的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與ASPH出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該ASPH-NRG1多肽融合物包括來自SEQ ID NO: 636或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置24、25及26的胺基酸。Preferably, any ASPH-NRG1 polypeptide fusion of the present disclosure comprises a polypeptide sequence such as SEQ ID NO: 636 or an allele variant thereof. SEQ ID NO: 636 contains a fusion junction between ASPH and NRG1 between the amino acid at position 24 (with the amino acids at positions 1-23 of the stalk of ASPH) and between position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with ASPH. Preferably, the ASPH-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 636 or an allele variant thereof, at least including positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 636之多肽序列,或是一種包含來自SEQ ID NO: 636或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 636有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 636, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 636 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 636, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ASPH-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 636的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 636之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 636的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 636之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 636的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 636之多肽的任一胺基酸。Preferably, any ASPH-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 636, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 636. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 636 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 636 : Any amino acid of the polypeptide of 636. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 636 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 636 of any amino acid.
較佳地,本文所提供之介於NRG1與ASPH之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自ASPH的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有ASPH-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 NOTCH2-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and ASPH are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from ASPH, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the ASPH-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. NOTCH2-NRG1 polypeptide fusion
亦提供一種由包含有NOTCH2之外顯子6 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該NOTCH2之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 709或SEQ ID NO: 709之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 704-708中之任一者,或是SEQ ID NO: 704-708之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 566-596對應NOTCH2之外顯子1-5所編碼的個別多肽序列。SEQ ID NO: 141-151 分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of NOTCH2 exon 6 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 6 of NOTCH2 preferably comprises or consists of SEQ ID NO: 709 or an allele variant of SEQ ID NO: 709. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 704-708, or any allele variant of SEQ ID NO: 704-708, and preferably further comprises SEQ ID NO : 145-151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 566-596 correspond to individual polypeptide sequences encoded by exons 1-5 of NOTCH2. SEQ ID NO: 141-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該NOTCH2之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 709或SEQ ID NO: 709之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 709有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 6 of NOTCH2 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 709 or an allele variant of SEQ ID NO: 709 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 709, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一NOTCH2-NRG1多肽融合物包括如SEQ ID NO: 694或其對偶基因變異體之多肽序列。SEQ ID NO: 694含有一個在NOTCH2與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與NOTCH2的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與NOTCH2出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該NOTCH2-NRG1多肽融合物包括來自SEQ ID NO: 694或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any NOTCH2-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 694 or an allele variant thereof. SEQ ID NO: 694 contains a fusion junction between NOTCH2 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of NOTCH2) and between position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with NOTCH2. Preferably, the NOTCH2-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 694 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 694之多肽序列,或是一種包含來自SEQ ID NO: 694或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 694有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 694, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 694 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 694, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一NOTCH2-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 694的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 694之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 694的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 694之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 694的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 694之多肽的任一胺基酸。Preferably, any NOTCH2-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 694, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 694. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 694 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 694. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 694 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 694 of any amino acid.
較佳地,本文所提供之介於NOTCH2與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自NOTCH2的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有NOTCH2-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 CD74-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NOTCH2 and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is a polypeptide sequence from NOTCH2, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the NOTCH2-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. CD74-NRG1 polypeptide fusion
亦提供一種由包含有CD74之外顯子2 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該CD74之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 730或SEQ ID NO: 730之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 729或SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 729或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 729對應CD74之外顯子1所編碼的多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of CD74 exon 2 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 2 of CD74 preferably comprises or consists of SEQ ID NO: 730 or an allele variant of SEQ ID NO: 730. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 729 or SEQ ID NO: 141-151, or any allele of SEQ ID NO: 729 or SEQ ID NO: 141-151 variant. SEQ ID NO: 729 corresponds to the polypeptide sequence encoded by exon 1 of CD74. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該CD74之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 730或SEQ ID NO: 730之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 730有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 2 part of CD74 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 730 or the allele variant of SEQ ID NO: 730 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 730, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD74-NRG1多肽融合物包括如SEQ ID NO: 718或其對偶基因變異體之多肽序列。SEQ ID NO: 718含有一個在CD74與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與CD74的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與CD74出乎意料產生符合讀框的融合而有脯胺酸(P, pro)殘基的存在。較佳地,該CD74-NRG1多肽融合物包括來自SEQ ID NO: 718或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any CD74-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 718 or its allele variant. SEQ ID NO: 718 contains a fusion junction between CD74 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of CD74) and position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, a proline (P, pro) residue was present due to an unexpected in-frame fusion of NRG1 with CD74. Preferably, the CD74-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 718 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 718之多肽序列,或是一種包含來自SEQ ID NO: 718或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 718有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 718, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 718 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 718, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一CD74-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 718的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 718之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 718的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 718之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 718的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 718之多肽的任一胺基酸。Preferably, any CD74-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 718, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 718. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 718 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 718 : Any amino acid of the polypeptide of 718. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 718 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 718 of any amino acid.
較佳地,本文所提供之介於CD74與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自CD74的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有CD74-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 SDC4-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between CD74 and NRG1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from CD74, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the CD74-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. SDC4-NRG1 polypeptide fusion
亦提供一種由包含有SDC4之外顯子2 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該SDC4之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 752或SEQ ID NO: 752之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 751或SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 751或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 751對應SDC4之外顯子1所編碼的多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of SDC4 exon 2 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 2 of SDC4 preferably comprises or consists of SEQ ID NO: 752 or an allele variant of SEQ ID NO: 752. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 751 or SEQ ID NO: 141-151, or any allele of SEQ ID NO: 751 or SEQ ID NO: 141-151 variant. SEQ ID NO: 751 corresponds to the polypeptide sequence encoded by exon 1 of SDC4. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該SDC4之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 752或SEQ ID NO: 752之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 752有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 2 part of SDC4 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 752 or an allele variant of SEQ ID NO: 752 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 752, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SDC4-NRG1多肽融合物包括如SEQ ID NO: 744或其對偶基因變異體之多肽序列。SEQ ID NO: 744含有一個在SDC4與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與SDC4的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與SDC4出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該SDC4-NRG1多肽融合物包括來自SEQ ID NO: 744或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any SDC4-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 744 or an allele variant thereof. SEQ ID NO: 744 contains a fusion junction between SDC4 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of SDC4) and position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with SDC4. Preferably, the SDC4-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 744 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 744之多肽序列,或是一種包含來自SEQ ID NO: 744或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 744有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 744, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 744 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 744, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SDC4-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 744的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 744之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 744的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 744之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 744的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 744之多肽的任一胺基酸。Preferably, any SDC4-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 744, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 744. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 744 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 744. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 744 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 744 of any amino acid.
較佳地,本文所提供之介於SDC4與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自SDC4的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有SDC4-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。Preferably, the polypeptide fusions provided herein between SDC4 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from SDC4, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the SDC4-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein.
亦提供一種由包含有SDC4之外顯子4 (或其對偶基因變異體)的一部分以及NRG1之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該SDC4之外顯子4所編碼的多肽較佳地包含SEQ ID NO: 754或SEQ ID NO: 754之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 751-753或SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 751-753或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 751-753對應SDC4之外顯子1-3所編碼的個別多肽序列。SEQ ID NO: 141-151分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可由SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體所組成,其序列係對應外顯子2-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of SDC4 exon 4 (or its allogeneic variant) and a portion of NRG1 exon 2 (or its allogeneic variant). The polypeptide encoded by exon 4 of SDC4 preferably comprises or consists of SEQ ID NO: 754 or an allele variant of SEQ ID NO: 754. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 751-753 or SEQ ID NO: 141-151, or one of SEQ ID NO: 751-753 or SEQ ID NO: 141-151 Either allele variant. SEQ ID NO: 751-753 correspond to the individual polypeptide sequences encoded by exons 1-3 of SDC4. SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be composed of SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該SDC4之外顯子4部分所編碼的多肽包含來自SEQ ID NO: 754或SEQ ID NO: 754之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 754有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 4 part of SDC4 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 754 or the allele variant of SEQ ID NO: 754 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 754, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SDC4-NRG1多肽融合物包括如SEQ ID NO: 825或其對偶基因變異體之多肽序列。SEQ ID NO: 825含有一個在SDC4與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與SDC4的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與SDC4出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該SDC4-NRG1多肽融合物包括來自SEQ ID NO: 825或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any SDC4-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 825 or an allele variant thereof. SEQ ID NO: 825 contains a fusion junction between SDC4 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of SDC4) and position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with SDC4. Preferably, the SDC4-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 825 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 825之多肽序列,或是一種包含來自SEQ ID NO: 825或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 825有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 825, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 825 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 825, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SDC4-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 825的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 825之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 825的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 825之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 825的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 825之多肽的任一胺基酸。Preferably, any SDC4-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 825, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 825. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 825 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 825 : Any amino acid of the polypeptide of 825. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 825 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 825 of any amino acid.
較佳地,本文所提供之介於SDC4與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自SDC4的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有SDC4-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 SLC4A4-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between SDC4 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from SDC4, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the SDC4-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. SLC4A4-NRG1 polypeptide fusion
亦提供一種由包含有SLC4A4之外顯子14 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該SLC4A4之外顯子14所編碼的多肽較佳地包含SEQ ID NO: 806或SEQ ID NO: 806之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 794-805中之任一者或SEQ ID NO: 794-805之任一對偶基因變異體,且較佳地進一步包括SEQ ID NO: 145-151或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 794-805對應SLC4A4之外顯子2-13所編碼的個別多肽序列。SEQ ID NO: 145-151分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可由SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體所組成,其序列係對應外顯子6-13全部所編碼的多肽序列。Also provided is a polypeptide fusion encoded by a polynucleotide comprising a portion of SLC4A4 exon 14 (or its allogeneic variant) and a portion of NRG1 exon 6 (or its allogeneic variant). The polypeptide encoded by exon 14 of SLC4A4 preferably comprises or consists of SEQ ID NO: 806 or an allele variant of SEQ ID NO: 806. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 794-805 or any allele variant of SEQ ID NO: 794-805, and preferably further comprises SEQ ID NO: 145 -151 or any allele variant of SEQ ID NO: 145-151. SEQ ID NOs: 794-805 correspond to individual polypeptide sequences encoded by exons 2-13 of SLC4A4. SEQ ID NOs: 145-151 correspond to individual polypeptide sequences encoded by exons 7-13 of NRG1, respectively. The part of exon 6 of NRG1 can also be composed of SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該SLC4A4之外顯子14部分所編碼的多肽包含來自SEQ ID NO: 806或SEQ ID NO: 806之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 806有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 14 part of SLC4A4 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 806 or the allele variant of SEQ ID NO: 806 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 806, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SLC4A4-NRG1多肽融合物包括如SEQ ID NO: 766或其對偶基因變異體之多肽序列。SEQ ID NO: 766含有一個在SLC4A4與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與SLC4A4的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與SLC4A4出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該SLC4A4-NRG1多肽融合物包括來自SEQ ID NO: 766或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括其位置24、25及26的胺基酸。Preferably, any SLC4A4-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 766 or an allele variant thereof. SEQ ID NO: 766 contains a fusion junction between SLC4A4 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the stalk of SLC4A4) and between position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with SLC4A4. Preferably, the SLC4A4-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 766 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 766之多肽序列,或是一種包含來自SEQ ID NO: 766或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 766有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 766, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 766 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 766, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一SLC4A4-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 766的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 766之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 766的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 766之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 766的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 766之多肽的任一胺基酸。Preferably, any SLC4A4-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 766, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 766. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 766 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: : Any amino acid of the polypeptide of 766. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 766 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 766 of any amino acid.
較佳地,本文所提供之介於SLC4A4與NRG1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自SLC4A4的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有SLC4A4-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 ZFAT-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between SLC4A4 and NRG1 are aligned so that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from SLC4A4, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the SLC4A4-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. ZFAT-NRG1 polypeptide fusion
根據本揭露,亦提供一種由包含有ZFAT核酸序列(或ZFAT核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸所編碼之多肽融合物。該ZFAT核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 847-863中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。該NRG1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 139-152中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。According to the disclosure, there is also provided a polypeptide fusion encoded by a polynucleotide comprising a ZFAT nucleic acid sequence (or a part of the ZFAT nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of the NRG1 nucleic acid sequence). The ZFAT nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 847-863 (or any allele variant of any one of these SEQ ID NOs) or is composed of its constituent sequences. The NRG1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 139-152 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences.
SEQ ID NO: 847-863中之任一者之ZFAT對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。SEQ ID NO: 139-152中之任一者之NRG1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。The ZFAT allele variant of any one of SEQ ID NOs: 847-863 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity. The NRG1 allele variant of any one of SEQ ID NOs: 139-152 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity.
較佳地,所述融合物之ZFAT核酸序列部分係編碼ZFAT之多肽部分,此多肽部分包含來自SEQ ID NO: 847-863中之任一者(或SEQ ID NO: 847-863中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。較佳地,所述融合物之NRG1核酸序列部分係編碼NRG1之多肽部分,此多肽部分包含來自SEQ ID NO: 139-152中之任一者(或SEQ ID NO: 139-152中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。Preferably, the ZFAT nucleic acid sequence part of the fusion is a polypeptide part of coding ZFAT, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 847-863) from SEQ ID NO: 847-863 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of Preferably, the NRG1 nucleic acid sequence part of the fusion is a polypeptide part of encoding NRG1, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 139-152) from SEQ ID NO: 139-152 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of
較佳地,本揭露之任一ZFAT-NRG1多肽融合物包括SEQ ID NO: 847-863中之任一者的多肽序列,此多肽序列具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變添加、刪除或取代SEQ ID NO: 847-863之多肽的任一胺基酸。較佳地,所述多肽融合物包括SEQ ID NO: 847-863中之任一者的多肽序列,此多肽序列具有1、2、3、4或5個點突變添加、刪除或取代SEQ ID NO: 847-863之多肽的任一胺基酸。更佳地,所述多核苷酸融合物包括SEQ ID NO: 847-863中之任一者的多肽序列,此多肽序列具有1、2或3個點突變添加、刪除或取代SEQ ID NO: 847-863之多肽的任一胺基酸。Preferably, any ZFAT-NRG1 polypeptide fusion of the present disclosure includes the polypeptide sequence of any one of SEQ ID NO: 847-863, and this polypeptide sequence has one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutation to add, delete or replace any amino acid of the polypeptide of SEQ ID NO: 847-863. Preferably, the polypeptide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 847-863, and this polypeptide sequence has 1, 2, 3, 4 or 5 point mutations to add, delete or replace SEQ ID NO : Any amino acid of the polypeptide of 847-863. More preferably, the polynucleotide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 847-863, and this polypeptide sequence has 1, 2 or 3 point mutations added, deleted or substituted for SEQ ID NO: 847 Any amino acid of the polypeptide of -863.
在一較佳實施例中,亦提供一種由包含有ZFAT之外顯子12 (或其對偶基因變異體)的一部分以及NRG1之外顯子6 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該ZFAT之外顯子12所編碼的多肽較佳地包含SEQ ID NO: 858或SEQ ID NO: 858之對偶基因變異體或是由其組成。該NRG1之外顯子6所編碼的多肽較佳地包含SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 847-857中之任一者以及SEQ ID NO: 145-151中之任一者,或是SEQ ID NO: 847-857或SEQ ID NO: 145-151之任一對偶基因變異體。SEQ ID NO: 847-857分別對應ZFAT之外顯子1-11所編碼的個別多肽序列,而SEQ ID NO: 145-151 分別對應NRG1之外顯子7-13所編碼的個別多肽序列。所述NRG1之外顯子6部分亦可根據SEQ ID NO: 156或SEQ ID NO: 156之對偶基因變異體,其序列係對應外顯子6-13全部所編碼的多肽序列。In a preferred embodiment, a polynucleoside comprising a part of ZFAT exon 12 (or its allogeneic variant) and a part of NRG1 exon 6 (or its allogeneic variant) is also provided Acid-encoded polypeptide fusion. The polypeptide encoded by exon 12 of ZFAT preferably comprises or consists of SEQ ID NO: 858 or an allele variant of SEQ ID NO: 858. The polypeptide encoded by exon 6 of NRG1 preferably comprises or consists of SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 847-857 and any one of SEQ ID NO: 145-151, or SEQ ID NO: 847-857 or SEQ ID NO : Any allele variant of 145-151. SEQ ID NOs: 847-857 correspond to individual polypeptide sequences encoded by ZFAT exons 1-11, respectively, and SEQ ID NOs: 145-151 respectively correspond to individual polypeptide sequences encoded by NRG1 exons 7-13. The part of exon 6 of NRG1 can also be according to SEQ ID NO: 156 or an allele variant of SEQ ID NO: 156, and its sequence corresponds to the polypeptide sequence encoded by all exons 6-13.
較佳地,該ZFAT之外顯子12部分所編碼的多肽包含來自SEQ ID NO: 858或SEQ ID NO: 858之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 858有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子6部分所編碼的多肽包含來自SEQ ID NO: 144或SEQ ID NO: 144之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 144有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by exon 12 of the ZFAT comprises 8, 9, 10, 11, 12, 13 or 14 consecutive allele variants from SEQ ID NO: 858 or SEQ ID NO: 858 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 858, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 6 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 144 or an allele variant of SEQ ID NO: 144 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 144, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ZFAT-NRG1多肽融合物包括如SEQ ID NO: 829或其對偶基因變異體之多肽序列。SEQ ID NO: 829含有一個在ZFAT與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與ZFAT的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與ZFAT出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該ZFAT-NRG1多肽融合物包括來自SEQ ID NO: 829或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置24、25及26的胺基酸。Preferably, any ZFAT-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 829 or an allele variant thereof. SEQ ID NO: 829 contains a fusion junction between ZFAT and NRG1 between the amino acid at position 24 (with the amino acids at positions 1-23 of the handle of ZFAT) and between position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion of NRG1 with ZFAT. Preferably, the ZFAT-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 829 or its allele variant, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 829之多肽序列,或是一種包含來自SEQ ID NO: 829或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 829有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 829, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 829 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 829, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一ZFAT-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 829的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 829之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 829的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 829之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 829的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 829之多肽的任一胺基酸。Preferably, any ZFAT-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 829, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 829. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 829 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 829 : Any amino acid of the polypeptide of 829. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 829 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 829 of any amino acid.
較佳地,本文所提供之介於NRG1與ZFAT之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自ZFAT的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有ZFAT-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。 DSCAML1-NRG1 多肽融合物 Preferably, the polypeptide fusions provided herein between NRG1 and ZFAT are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from ZFAT, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the ZFAT-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein. DSCAML1-NRG1 polypeptide fusion
根據本揭露,亦提供一種由包含有DSCAML1核酸序列(或DSCAML1核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部分)融合的多核苷酸所編碼之多肽融合物。該DSCAML1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 904-937中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。該NRG1核酸序列(或其部分)較佳地係編碼一包含有SEQ ID NO: 139-152中之任一者(或此等SEQ ID NO中之任一者之對偶基因變異體)或是由其組成的序列。According to the present disclosure, there is also provided a polypeptide fusion encoded by a polynucleotide comprising a fusion of DSCAML1 nucleic acid sequence (or a part of DSCAML1 nucleic acid sequence) and NRG1 nucleic acid sequence (or a part of NRG1 nucleic acid sequence). The DSCAML1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 904-937 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences. The NRG1 nucleic acid sequence (or part thereof) preferably encodes a sequence comprising any one of SEQ ID NOs: 139-152 (or an allele variant of any one of these SEQ ID NOs) or derived from its constituent sequences.
SEQ ID NO: 904-937中之任一者之DSCAML1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。SEQ ID NO: 139-152中之任一者之NRG1對偶基因變異體較佳地與該等序列有至少85%的序列一致性,更佳地與其有90%、92%、94%、96%或甚至更佳地至少98%的序列一致性。The DSCAML1 allele variant of any one of SEQ ID NOs: 904-937 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity. The NRG1 allele variant of any one of SEQ ID NOs: 139-152 preferably has at least 85% sequence identity to these sequences, more preferably 90%, 92%, 94%, 96% thereto Or even better at least 98% sequence identity.
較佳地,所述融合物之DSCAML1核酸序列部分係編碼DSCAML1之多肽部分,此多肽部分包含來自SEQ ID NO: 904-937中之任一者(或SEQ ID NO: 904-937中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。較佳地,所述融合物之NRG1核酸序列部分係編碼NRG1之多肽部分,此多肽部分包含來自SEQ ID NO: 139-152中之任一者(或SEQ ID NO: 139-152中之任一者之對偶基因變異體)之8、9、10、11、12、13或14個連續胺基酸,或是由其組成。Preferably, the DSCAML1 nucleic acid sequence portion of the fusion is a polypeptide portion encoding DSCAML1, and this polypeptide portion comprises any one of SEQ ID NOs: 904-937 (or any one of SEQ ID NOs: 904-937) or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of Preferably, the NRG1 nucleic acid sequence part of the fusion is a polypeptide part of encoding NRG1, and this polypeptide part comprises any one (or any one of SEQ ID NOs: 139-152) from SEQ ID NO: 139-152 or consist of 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids of an allele variant of
較佳地,本揭露之任一DSCAML1-NRG1多肽融合物包括SEQ ID NO: 904-937中之任一者的多肽序列,此多肽序列具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變添加、刪除或取代SEQ ID NO: 904-937之多肽的任一胺基酸。較佳地,所述多肽融合物包括SEQ ID NO: 904-937中之任一者的多肽序列,此多肽序列具有1、2、3、4或5個點突變添加、刪除或取代SEQ ID NO: 904-937之多肽的任一胺基酸。更佳地,所述多核苷酸融合物包括SEQ ID NO: 904-937中之任一者的多肽序列,此多肽序列具有1、2或3個點突變添加、刪除或取代SEQ ID NO: 904-937之多肽的任一胺基酸。Preferably, any DSCAML1-NRG1 polypeptide fusion of the present disclosure includes the polypeptide sequence of any one of SEQ ID NO: 904-937, and this polypeptide sequence has one or more (ie 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutation to add, delete or replace any amino acid of the polypeptide of SEQ ID NO: 904-937. Preferably, the polypeptide fusion comprises the polypeptide sequence of any one of SEQ ID NO: 904-937, and this polypeptide sequence has 1, 2, 3, 4 or 5 point mutations to add, delete or replace SEQ ID NO : Any amino acid of the polypeptide of 904-937. More preferably, the polynucleotide fusion includes the polypeptide sequence of any one of SEQ ID NO: 904-937, and this polypeptide sequence has 1, 2 or 3 point mutations to add, delete or replace SEQ ID NO: 904 Any amino acid of the polypeptide of -937.
在一較佳實施例中,亦提供一種由包含有DSCAML1之外顯子3 (或其對偶基因變異體)的一部分以及NRG1或其之外顯子2 (或其對偶基因變異體)的一部分之多核苷酸所編碼的多肽融合物。該DSCAML1之外顯子3所編碼的多肽較佳地包含SEQ ID NO: 906或SEQ ID NO: 906之對偶基因變異體或是由其組成。該NRG1之外顯子2所編碼的多肽較佳地包含SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體或是由其組成。較佳地,所述多肽融合物進一步包括SEQ ID NO: 904-905中之任一者以及SEQ ID NO: 141-151中之任一者,或是SEQ ID NO: 904-905或SEQ ID NO: 141-151之任一對偶基因變異體。SEQ ID NO: 904-905分別對應DSCAML1之外顯子1-2所編碼的個別多肽序列,而SEQ ID NO: 141-151 分別對應NRG1之外顯子3-13所編碼的個別多肽序列。所述NRG1之外顯子2部分亦可根據SEQ ID NO: 154或SEQ ID NO: 154之對偶基因變異體,其序列係對應外顯子2-13全部所編碼的多肽序列。In a preferred embodiment, there is also provided a part comprising DSCAML1 exon 3 (or its allogeneic variant) and NRG1 or its exon 2 (or its allogeneic variant) Polypeptide fusions encoded by polynucleotides. The polypeptide encoded by exon 3 of DSCAML1 preferably comprises or consists of SEQ ID NO: 906 or an allele variant of SEQ ID NO: 906. The polypeptide encoded by exon 2 of NRG1 preferably comprises or consists of SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140. Preferably, the polypeptide fusion further comprises any one of SEQ ID NO: 904-905 and any one of SEQ ID NO: 141-151, or SEQ ID NO: 904-905 or SEQ ID NO : Any allele variant of 141-151. SEQ ID NOs: 904-905 correspond to individual polypeptide sequences encoded by exons 1-2 of DSCAML1, respectively, while SEQ ID NOs: 141-151 correspond to individual polypeptide sequences encoded by exons 3-13 of NRG1, respectively. The part of exon 2 of NRG1 can also be according to SEQ ID NO: 154 or an allele variant of SEQ ID NO: 154, and its sequence corresponds to the polypeptide sequence encoded by all exons 2-13.
較佳地,該DSCAML1之外顯子3部分所編碼的多肽包含來自SEQ ID NO: 906或SEQ ID NO: 906之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 906有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。此外,該NRG1之外顯子2部分所編碼的多肽包含來自SEQ ID NO: 140或SEQ ID NO: 140之對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,或是由其組成。該對偶基因變異體與SEQ ID NO: 140有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。Preferably, the polypeptide encoded by the exon 3 part of DSCAML1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive sequences from SEQ ID NO: 906 or the allele variant of SEQ ID NO: 906 Amino acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 906, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto. Additionally, the polypeptide encoded by the exon 2 portion of NRG1 comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amine groups from SEQ ID NO: 140 or an allele variant of SEQ ID NO: 140 Acids, or consisting of them. The allele variant has at least 85% identity to SEQ ID NO: 140, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一DSCAML1-NRG1多肽融合物包括如SEQ ID NO: 869或其對偶基因變異體之多肽序列。SEQ ID NO: 869含有一個在DSCAML1與NRG1之間的融合接合處,該融合係位於位置24之胺基酸(其係與DSCAML1的柄位置1-23處的胺基酸一起)與位置26之胺基酸(其係與NRG1的柄位置27-49處的胺基酸一起)之間。在位置25處,因為NRG1與DSCAML1出乎意料產生符合讀框的融合而有丙胺酸(A, Ala)殘基的存在。較佳地,該DSCAML1-NRG1多肽融合物包括來自SEQ ID NO: 869或其對偶基因變異體的8、9、10、11、12、13或14個連續胺基酸,至少包括位置24、25及26的胺基酸。Preferably, any DSCAML1-NRG1 polypeptide fusion of the present disclosure includes a polypeptide sequence such as SEQ ID NO: 869 or its allele variant. SEQ ID NO: 869 contains a fusion junction between DSCAML1 and NRG1 between the amino acid at position 24 (together with the amino acids at positions 1-23 of the handle of DSCAML1) and between position 26 between the amino acids (which tie together with the amino acids at positions 27-49 of the handle of NRG1). At position 25, an alanine (A, Ala) residue was present due to an unexpected in-frame fusion between NRG1 and DSCAML1. Preferably, the DSCAML1-NRG1 polypeptide fusion comprises 8, 9, 10, 11, 12, 13 or 14 consecutive amino acids from SEQ ID NO: 869 or its allele variants, including at least positions 24, 25 and 26 amino acids.
在一較佳實施例中,其提供了一種根據SEQ ID NO: 869之多肽序列,或是一種包含來自SEQ ID NO: 869或其對偶基因變異體的8、9、10、11、12、13或14個或全部的連續胺基酸之多肽,至少包括位置24、25及26處的胺基酸。所述多肽序列與SEQ ID NO: 869有至少85%的一致性,較佳地至少90%的一致性,與其92%、94%、96%或更佳地至少98%的序列一致性。In a preferred embodiment, it provides a polypeptide sequence according to SEQ ID NO: 869, or a polypeptide sequence comprising 8, 9, 10, 11, 12, 13 from SEQ ID NO: 869 or its allele variants Or a polypeptide of 14 or all consecutive amino acids, at least including amino acids at positions 24, 25 and 26. The polypeptide sequence has at least 85% identity to SEQ ID NO: 869, preferably at least 90% identity, 92%, 94%, 96% or more preferably at least 98% sequence identity thereto.
較佳地,本揭露之任一DSCAML1-NRG1多肽融合物包括了具有一或多個(即1、2、3、4、5、6、7、8、9或10個)點突變之SEQ ID NO: 869的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 869之多肽的任一胺基酸。較佳地,所述多肽融合物包括了具有1、2、3、4或5個點突變之SEQ ID NO: 869的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 869之多肽的任一胺基酸。更佳地,所述多肽融合物包括了具有1、2或3個點突變之SEQ ID NO: 869的多肽序列,且所述點突變添加、刪除或取代該包含有SEQ ID NO: 869之多肽的任一胺基酸。Preferably, any DSCAML1-NRG1 polypeptide fusion of the present disclosure includes SEQ ID with one or more (i.e. 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10) point mutations The polypeptide sequence of NO: 869, and the point mutation adds, deletes or replaces any amino acid of the polypeptide comprising SEQ ID NO: 869. Preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 869 with 1, 2, 3, 4 or 5 point mutations, and the point mutations add, delete or replace the polypeptide sequence comprising SEQ ID NO: 869 : Any amino acid of the polypeptide of 869. More preferably, the polypeptide fusion comprises the polypeptide sequence of SEQ ID NO: 869 with 1, 2 or 3 point mutations, and the point mutations add, delete or replace the polypeptide comprising SEQ ID NO: 869 of any amino acid.
較佳地,本文所提供之介於NRG1與DSCAML1之間的多肽融合物係經配向,使跨越N端至該融合接合處的部分為來自DSCAML1的多肽序列,而跨越該融合接合處至C端的部分為NRG1多肽序列。較佳地,該NRG1多肽序列包含或編碼EGF樣結構域。所述含有DSCAML1-NRG1多肽融合物之EGF樣結構域較佳地係由本文所提及之異常細胞所組成。Preferably, the polypeptide fusions provided herein between NRG1 and DSCAML1 are aligned such that the portion spanning the N-terminus to the fusion junction is the polypeptide sequence from DSCAML1, and the portion spanning the fusion junction to the C-terminus is Part of it is the NRG1 polypeptide sequence. Preferably, the NRG1 polypeptide sequence comprises or encodes an EGF-like domain. The EGF-like domain containing the DSCAML1-NRG1 polypeptide fusion is preferably composed of the abnormal cells mentioned herein.
本文所提及之每個包含有NRG1之多核苷酸融合物,包括VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1,較佳地係被分離出來。本發明之任一方法較佳地包含從樣本分離出一或多種含有多肽的組分。該一或多種含有多肽的組分通常從樣本中的任一細胞或細胞材料中分離出來。 用於檢測或鑑定 NRG1 融合物之測定法 Each polynucleotide fusion containing NRG1 mentioned herein includes VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4- NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1, are preferably isolated. Any method of the invention preferably comprises isolating one or more polypeptide-containing fractions from the sample. The one or more polypeptide-containing components are typically isolated from any cells or cellular material in the sample. Assays for detecting or identifying NRG1 fusions
一般來說,待檢測之任一多核苷酸或多肽融合物係獲自或衍生自表現了包含有NRG1的EGF樣結構域之多核苷酸融合物的異常細胞。檢測測定法可以不是針對鑑定EGF樣結構域是否存在,因為其只需僅僅檢測實際的融合接合處以指示出進一步含有NRG1的EGF樣結構域的多核苷酸或多肽的存在即可。EGF樣結構域的存在可以從特異性地檢測或鑑定NRG1與任一融合配偶體之間為符合讀框的多核苷酸融合而推斷出,這樣的融合接合點會在NRG1的EGF樣結構域的5’處發現到。Generally, any polynucleotide or polypeptide fusion to be tested is obtained or derived from an abnormal cell expressing a polynucleotide fusion comprising the EGF-like domain of NRG1. The detection assay may not be directed to identifying the presence or absence of an EGF-like domain, as it need only detect the actual fusion junction to indicate the presence of a polynucleotide or polypeptide further containing the EGF-like domain of NRG1. The presence of an EGF-like domain can be inferred from the specific detection or identification of in-frame polynucleotide fusions between NRG1 and either fusion partner, such fusion junctions would be in the EGF-like domain of NRG1 5' found.
熟習該項技術者可採用多種不同技術來檢測獲自人類個體樣本中的已知多核苷酸或多肽融合物,其每一者通常包括了會與所感興趣的目標特異性地結合的結合劑。這類結合劑係指一種可選擇性地與目標序列結合以達到特異性地檢測該目標之試劑,諸如引子、引子對、探針、或抗體。所述結合包括了與多核苷酸序列雜交或黏合,通常是為了擴增及/或檢測該目標,或藉由具高親合性及特異性之抗體結合表位以致使目標的檢測。Those skilled in the art can employ a variety of different techniques to detect known polynucleotide or polypeptide fusions in samples obtained from human subjects, each of which typically includes a binding agent that specifically binds a target of interest. Such a binding agent refers to a reagent, such as a primer, a pair of primers, a probe, or an antibody, that selectively binds to a target sequence to achieve specific detection of the target. Said binding involves hybridization or adhesion to polynucleotide sequences, usually for the purpose of amplifying and/or detecting the target, or by binding an epitope with high affinity and specificity antibodies to enable detection of the target.
以特異性地與多核苷酸融合物結合之引子、引子對或探針的使用為基礎之技術在本領域中是眾所周知的。同樣地,以結合劑為基礎來檢測多肽的技術在本領域中是眾所周知的,特別是可以使用特異性地結合多肽的多肽。Techniques based on the use of primers, primer pairs or probes that specifically bind to polynucleotide fusions are well known in the art. Likewise, techniques for the detection of polypeptides based on binding agents are well known in the art, and in particular polypeptides that specifically bind polypeptides may be used.
其他使用本揭露之引子、引子對或探針的方式包括次世代定序(NGS),或是使用包含有多重免疫檢測技術之套組(panel)(例如利用Anchored Multiplex PCR (AMP™)技術靶向已知外顯子的Archer FusionPlex™ Custom Solid Panel)來檢測融合物,或例如使用QX200™ AutoDG™微滴式數位PCR系統(由BioRad提供)的微滴式數位PCR (ddPCR™)。其他手段包括分子信標的使用或是使用TaqMan™探針或嵌合型螢光染料(諸如SYBR™ Green)的定量PCR (Q-PCR)、螢光原位雜交(FISH)、次世代定序(NGS)、ddPCR™、Anchored Multiplex PCR、半定量PCR或定量PCR。Other ways of using the disclosed primers, primer pairs, or probes include next-generation sequencing (NGS), or using panels that include multiplex immunoassay techniques (e.g., using Anchored Multiplex PCR (AMP™) technology to target Fusions are detected to the Archer FusionPlex™ Custom Solid Panel) with known exons, or droplet digital PCR (ddPCR™) such as using the QX200™ AutoDG™ Droplet Digital PCR System (supplied by BioRad). Other approaches include the use of molecular beacons or quantitative PCR (Q-PCR) using TaqMan™ probes or chimeric fluorescent dyes such as SYBR™ Green, fluorescence in situ hybridization (FISH), next-generation sequencing ( NGS), ddPCR™, Anchored Multiplex PCR, semi-quantitative PCR, or quantitative PCR.
還有其他使用本揭露之探針來檢測本文所提及之NRG1與融合配偶體的多核苷酸融合物的方法,如IHC或FISH(諸如斷裂分離FISH),其中所感興趣的基因的兩端係以不同顏色標記。為檢測包含有本文所提及融合物之NRG1,使用正股hg38 chr8:31,639,222-32,764,405來設計適宜探針以標記NRG1基因的兩端以致使任何融合物的檢測。There are other methods using the probes of the present disclosure to detect polynucleotide fusions of NRG1 and fusion partners mentioned herein, such as IHC or FISH (such as fracture separation FISH), wherein the two ends of the gene of interest are linked Marked in different colors. To detect NRG1 containing fusions mentioned herein, the main strand hg38 chr8:31,639,222-32,764,405 was used to design appropriate probes to label both ends of the NRG1 gene to enable detection of any fusions.
本揭露提供一種用於檢測本文所提及之多核苷酸融合物的核酸探針、引子或引子對。亦提供一種包含有所述核酸探針、引子或引子對以用於檢測本文所提及之多核苷酸融合物的存在的檢測測定法。這類核酸探針、引子或引子對的長度使其可檢測所感興趣的多核苷酸,但較佳的長度為約10至約40個核苷酸。The present disclosure provides a nucleic acid probe, primer or pair of primers for detecting the polynucleotide fusions mentioned herein. Also provided is a detection assay comprising said nucleic acid probe, primer or pair of primers for detecting the presence of a polynucleotide fusion referred to herein. Such nucleic acid probes, primers or primer pairs are of a length such that they detect the polynucleotide of interest, but are preferably about 10 to about 40 nucleotides in length.
較佳地,使用於檢測多核苷酸融合物之任一核酸探針、引子或引子對係包括本文所述之可檢測標籤物。Preferably, any nucleic acid probe, primer or pair of primers used to detect polynucleotide fusions includes a detectable label as described herein.
較佳地,於檢測選自VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1之多核苷酸融合物是否存在的測定法中使用任一核酸探針、引子或引子對,尤其是檢測本文所揭露之該等多核苷酸融合物中的融合接合處是否存在的測定法。Preferably, the detection is selected from VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14- or Any nucleic acid probe, primer or primer pair is used in the assay for the presence or absence of polynucleotide fusions of DSCAML1-NRG1, particularly assays for detecting the presence or absence of fusion junctions in the polynucleotide fusions disclosed herein Law.
較佳地,藉由使用所述跨越NRG1與其融合配偶體之間的核酸接合處之探針、引子或引子對致使該融合接合物的擴增及檢測,以檢測本文所提及之任一多核苷酸融合物是否存在。 供使用於檢測 VAPB-NRG1 融合物之測定法中的引子或探針 Preferably, any of the multiples mentioned herein are detected by using said probes, primers or primer pairs spanning the nucleic acid junction between NRG1 and its fusion partner resulting in amplification and detection of the fusion junction. The presence or absence of nucleotide fusions. Primers or probes for use in assays for detecting VAPB-NRG1 fusions
本揭露提供一種用於檢測包含有VAPB核酸序列(或VAPB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部份)融合的多核苷酸融合物之核酸引子、引子對或探針。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 23之多核苷酸(或SEQ ID NO: 23之對偶基因變異體)及根據SEQ ID NO: 138之多核苷酸(或SEQ ID NO: 138之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a polynucleotide fusion comprising a VAPB nucleic acid sequence (or a part of a VAPB nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of an NRG1 nucleic acid sequence) . Preferably, the nucleic acid probe, primer or primer pair is specifically compatible with the polynucleotide according to SEQ ID NO: 23 (or the allele variant of SEQ ID NO: 23) and the multinuclear polynucleotide according to SEQ ID NO: 138 nucleotide (or the allele variant of SEQ ID NO: 138), or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with these sequences and over about 12 to 40 nucleotides in length.
此外,本揭露提供一種用於檢測包含有SEQ ID NO: 3或由其組成的VAPB-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置43及44的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 3之多核苷酸(或SEQ ID NO: 3之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸,且該序列較佳地包括位置43及44的核酸。In addition, the present disclosure provides a nucleic acid primer, primer pair or probe for detecting a VAPB-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 3, and the sequence includes nucleic acids at positions 43 and 44. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 3 (or the allele variant of SEQ ID NO: 3), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to about 40 nucleotides in length, and preferably includes positions 43 and 44 nucleic acid.
較佳地,該用於檢測VAPB與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由VAPB之外顯子1組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子1的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測VAPB與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 17組成的序列(或SEQ ID NO: 17之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of VAPB and NRG1 is specifically associated with a sequence consisting of exon 1 of VAPB (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 1 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of VAPB and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 17 (or the allele variant of SEQ ID NO: 17), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自VAPB之外顯子1係包含SEQ ID NO: 17或其對偶基因變異體,或是由其組成。 供使用於檢測 CADM1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 1 line from VAPB comprises or consists of SEQ ID NO: 17 or its allele variant. Primers or probes for use in assays for detecting CADM1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 7或由其組成的CADM1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置53及54的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 7之多核苷酸(或SEQ ID NO: 7之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置53及54的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a CADM1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 7, and the sequence includes the nucleic acids at positions 53 and 54. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 7 (or the allele variant of SEQ ID NO: 7), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 53 and 54 .
較佳地,該用於檢測CADM1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CADM1之外顯子7組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子7的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CADM1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 57組成的序列(或SEQ ID NO: 57之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of CADM1 and NRG1 is specifically associated with a sequence consisting of exon 7 of CADM1 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 7 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of CADM1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 57 (or the allele variant of SEQ ID NO: 57), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CADM1之外顯子7係包含SEQ ID NO: 39或其對偶基因變異體,或是由其組成。 供使用於檢測 CD44-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 7 line from CADM1 comprises or consists of SEQ ID NO: 39 or its allele variant. Primers or probes for use in assays for detecting CD44-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 11或由其組成的CD44-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置52及53的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 11之多核苷酸(或SEQ ID NO: 11之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置52及53的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a CD44-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 11, and the sequence includes nucleic acids at positions 52 and 53. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 11 (or the allele variant of SEQ ID NO: 11), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 52 and 53 .
較佳地,該用於檢測CD44與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CD44之外顯子5組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子5的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CD44與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 99組成的序列(或SEQ ID NO: 99之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of CD44 and NRG1 is specific to a sequence consisting of exon 5 of CD44 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 5 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair for detecting the fusion of CD44 and NRG1 is specifically compatible with the sequence composed of SEQ ID NO: 99 (or the allele variant of SEQ ID NO: 99), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CD44之外顯子5係包含SEQ ID NO: 65或其對偶基因變異體,或是由其組成。Preferably, the exon 5 line from CD44 comprises or consists of SEQ ID NO: 65 or its allele variant.
本揭露提供一種用於檢測包含有SEQ ID NO: 761或由其組成的CD44-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 761之多核苷酸(或SEQ ID NO: 761之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a CD44-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 761, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 761 (or the allele variant of SEQ ID NO: 761), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測CD44與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CD44之外顯子5組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子5的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CD44與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 99組成的序列(或SEQ ID NO: 99之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of CD44 and NRG1 is specific to a sequence consisting of exon 5 of CD44 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 5 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair for detecting the fusion of CD44 and NRG1 is specifically compatible with the sequence composed of SEQ ID NO: 99 (or the allele variant of SEQ ID NO: 99), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CD44之外顯子5係包含SEQ ID NO: 65或其對偶基因變異體,或是由其組成。 供使用於檢測 SLC3A2 -NRG1 融合物之測定法中的引子或探針 Preferably, the exon 5 line from CD44 comprises or consists of SEQ ID NO: 65 or its allele variant. Primers or probes for use in assays for detecting SLC3A2 -NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 15或由其組成的SLC3A2轉錄本6-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置53及54的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 15之多核苷酸(或SEQ ID NO: 15之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置53及54的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a SLC3A2 transcript 6-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 15, and the sequence includes nucleic acids at positions 53 and 54 . Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 15 (or the allele variant of SEQ ID NO: 15), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 53 and 54 .
較佳地,該用於檢測SLC3A2轉錄本6與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由所述SLC3A2之外顯子1組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子1的5’處之序列及/或一由NRG1之外顯子5組成的序列、或一位於外顯子5的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測所述SLC3A2與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 103組成的序列(或SEQ ID NO: 103之對偶基因變異體)、及/或SEQ ID NO: 157組成的序列(或SEQ ID NO: 157之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or primer pair used to detect the fusion of SLC3A2 transcript 6 and NRG1 is specifically compatible with a sequence consisting of exon 1 of SLC3A2 (such as used for functional gain sequence in the allele assay), a sequence located 5' of exon 1 and/or a sequence consisting of exon 5 of NRG1, or a sequence located 3' of exon 5 ( Gene sequences such as NRG1) are heterozygous (or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and have a length of more than about 12 to about 40 nucleotides). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of SLC3A2 and NRG1 is specifically related to the sequence composed of SEQ ID NO: 103 (or the allele variant of SEQ ID NO: 103 ), and/or a sequence consisting of SEQ ID NO: 157 (or an allele variant of SEQ ID NO: 157), or 95%, 96%, 97%, 98%, 99% of these sequences Or preferably have 100% sequence identity and are over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自SLC3A2轉錄本6之外顯子1係包含SEQ ID NO: 103或其對偶基因變異體,或是由其組成。Preferably, exon 1 from SLC3A2 transcript 6 comprises or consists of SEQ ID NO: 103 or its allele variant.
本揭露提供一種用於檢測包含有SEQ ID NO: 454或由其組成的SLC3A2轉錄本6-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置93及94的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 454之多核苷酸(或SEQ ID NO: 454之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置93及94的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a SLC3A2 transcript 6-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 454, and the sequence includes nucleic acids at positions 93 and 94 . Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 454 (or the allele variant of SEQ ID NO: 454), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 93 and 94 .
較佳地,該用於檢測SLC3A2轉錄本3與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由所述SLC3A2之外顯子2組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子2的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測所述SLC3A2與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 482組成的序列(或SEQ ID NO: 482之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or primer pair for detecting the fusion of SLC3A2 transcript 3 and NRG1 is specifically compatible with a sequence consisting of said SLC3A2 exon 2 (such as used for functional gain sequence in an allele assay), a sequence located 5' to exon 2 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' to exon 6 ( Gene sequences such as NRG1) are heterozygous (or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and have a length of more than about 12 to about 40 nucleotides). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of SLC3A2 and NRG1 is specific to the sequence composed of SEQ ID NO: 482 (or the allele variant of SEQ ID NO: 482 ), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155), or 95%, 96%, 97%, 98%, 99% of these sequences Or preferably have 100% sequence identity and are over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自SLC3A2轉錄本3之外顯子2係包含SEQ ID NO: 457或其對偶基因變異體,或是由其組成。 供使用於檢測 VTCN1-NRG1 融合物之測定法中的引子或探針 Preferably, exon 2 from SLC3A2 transcript 3 comprises or consists of SEQ ID NO: 457 or its allele variant. Primers or probes for use in assays for detecting VTCN1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 166或由其組成的VTCN1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置65及66的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 166之多核苷酸(或SEQ ID NO: 166之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置65及66的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a VTCN1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 166, and the sequence includes nucleic acids at positions 65 and 66. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 166 (or the allele variant of SEQ ID NO: 166), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 65 and 66 .
較佳地,該用於檢測VTCN1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由VTCN1之外顯子2組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子2的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測VTCN1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 181組成的序列(或SEQ ID NO: 181之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of VTCN1 and NRG1 is specifically associated with a sequence consisting of exon 2 of VTCN1 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 2 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of VTCN1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 181 (or the allele variant of SEQ ID NO: 181), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自VTCN1之外顯子2係包含SEQ ID NO: 169或其對偶基因變異體,或是由其組成。 供使用於檢測 CDH1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 2 line from VTCN1 comprises or consists of SEQ ID NO: 169 or its allele variant. Primers or probes for use in assays for detecting CDH1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 186或由其組成的CDH1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置119及120的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 186之多核苷酸(或SEQ ID NO: 186之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置119及120的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a CDH1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 186, and the sequence includes nucleic acids at positions 119 and 120. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 186 (or the allele variant of SEQ ID NO: 186), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 119 and 120 .
較佳地,該用於檢測CDH1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CDH1之外顯子11組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子11的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CDH1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 213組成的序列(或SEQ ID NO: 213之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of CDH1 and NRG1 are specifically associated with a sequence consisting of exon 11 of CDH1 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 11 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of CDH1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 213 (or the allele variant of SEQ ID NO: 213), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CDH1之外顯子11係包含SEQ ID NO: 198或其對偶基因變異體,或是由其組成。 供使用於檢測 CXADR-NRG1 融合物之測定法中的引子或探針 Preferably, the line from exon 11 of CDH1 comprises or consists of SEQ ID NO: 198 or its allele variant. Primers or probes for use in assays for detecting CXADR-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 217或由其組成的CXADR-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置43及44的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 217之多核苷酸(或SEQ ID NO: 217之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置43及44的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a CXADR-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 217, and the sequence includes nucleic acids at positions 43 and 44. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 217 (or the allele variant of SEQ ID NO: 217), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 43 and 44 .
較佳地,該用於檢測CXADR與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CXADR之外顯子1組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子1的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CXADR與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 219組成的序列(或SEQ ID NO: 219之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of CXADR and NRG1 is specifically associated with a sequence consisting of exon 1 of CXADR (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 1 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair for detecting the fusion of CXADR and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 219 (or the allele variant of SEQ ID NO: 219), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CXADR之外顯子1係包含SEQ ID NO: 219或其對偶基因變異體,或是由其組成。 供使用於檢測 GTF2E2-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 1 line from CXADR comprises or consists of SEQ ID NO: 219 or its allele variant. Primers or probes for use in assays for detecting GTF2E2-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 233或由其組成的GTF2E2-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置141及142的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 233之多核苷酸(或SEQ ID NO: 233之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置141及142的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a GTF2E2-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 233, and the sequence includes nucleic acids at positions 141 and 142. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 233 (or the allele variant of SEQ ID NO: 233), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 141 and 142 .
較佳地,該用於檢測GTF2E2與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由GTF2E2之外顯子2組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子2的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測GTF2E2與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 252組成的序列(或SEQ ID NO: 252之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of GTF2E2 and NRG1 is specifically associated with a sequence consisting of exon 2 of GTF2E2 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 2 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of GTF2E2 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 252 (or the allele variant of SEQ ID NO: 252), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自GTF2E2之外顯子2係包含SEQ ID NO: 236或其對偶基因變異體,或是由其組成。 供使用於檢測 CSMD1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 2 line from GTF2E2 comprises or consists of SEQ ID NO: 236 or its allele variant. Primers or probes for use in assays for detecting CSMD1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 255或由其組成的CSMD1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置88及89的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 255之多核苷酸(或SEQ ID NO: 255之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置88及89的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a CSMD1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 255, and the sequence includes nucleic acids at positions 88 and 89. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 255 (or the allele variant of SEQ ID NO: 255), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 88 and 89 .
較佳地,該用於檢測CSMD1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CSMD1之外顯子23組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子23的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CSMD1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 309組成的序列(或SEQ ID NO: 309之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of CSMD1 and NRG1 is specifically associated with a sequence consisting of exon 23 of CSMD1 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 23 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of CSMD1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 309 (or the allele variant of SEQ ID NO: 309), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CSMD1之外顯子23係包含SEQ ID NO: 279或其對偶基因變異體,或是由其組成。 供使用於檢測 PTN-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 23 line from CSMD1 comprises or consists of SEQ ID NO: 279 or its allele variant. Primers or probes for use in assays for detecting PTN-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 313或由其組成的PTN-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置102及103的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 313之多核苷酸(或SEQ ID NO: 313之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置102及103的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a PTN-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 313, and the sequence includes nucleic acids at positions 102 and 103. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 313 (or the allele variant of SEQ ID NO: 313), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 102 and 103 .
較佳地,該用於檢測PTN與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由PTN之外顯子4組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子4的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測PTN與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 326組成的序列(或SEQ ID NO: 326之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of PTN and NRG1 is specifically associated with a sequence consisting of exon 4 of PTN (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 4 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of PTN and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 326 (or the allele variant of SEQ ID NO: 326), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自PTN之外顯子4係包含SEQ ID NO: 318或其對偶基因變異體,或是由其組成。 供使用於檢測 ST14-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 4 line from PTN comprises or consists of SEQ ID NO: 318 or its allele variant. Primers or probes for use in assays for detecting ST14-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 330或由其組成的ST14-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置95及96的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 330之多核苷酸(或SEQ ID NO: 330之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置95及96的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a ST14-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 330, and the sequence includes nucleic acids at positions 95 and 96. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 330 (or the allele variant of SEQ ID NO: 330), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 95 and 96 .
較佳地,該用於檢測ST14與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由ST14之外顯子11組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子11的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測ST14與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 372組成的序列(或SEQ ID NO: 372之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of ST14 and NRG1 is specifically associated with a sequence consisting of exon 11 of ST14 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 11 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of ST14 and NRG1 is specifically related to the sequence composed of SEQ ID NO: 372 (or the allele variant of SEQ ID NO: 372), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自ST14之外顯子11係包含SEQ ID NO: 362或其對偶基因變異體,或是由其組成。 供使用於檢測 THBS1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 11 line from ST14 comprises or consists of SEQ ID NO: 362 or its allele variant. Primers or probes for use in assays for detecting THBS1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 376或由其組成的THBS1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置56及57的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 376之多核苷酸(或SEQ ID NO: 376之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置56及57的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a THBS1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 376, and the sequence includes nucleic acids at positions 56 and 57. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 376 (or the allele variant of SEQ ID NO: 376), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 56 and 57 .
較佳地,該用於檢測THBS1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由THBS1之外顯子9組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子9的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測THBS1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 399組成的序列(或SEQ ID NO: 399之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of THBS1 and NRG1 is specifically associated with a sequence consisting of exon 9 of THBS1 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 9 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of THBS1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 399 (or the allele variant of SEQ ID NO: 399), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自THBS1之外顯子9係包含SEQ ID NO: 386或其對偶基因變異體,或是由其組成。 供使用於檢測 AGRN-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 9 line from THBS1 comprises or consists of SEQ ID NO: 386 or its allele variant. Primers or probes for use in assays for detecting AGRN-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 403或由其組成的AGRN-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置106及107的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 403之多核苷酸(或SEQ ID NO: 403之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置106及107的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting an AGRN-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 403, and the sequence includes the nucleic acids at positions 106 and 107. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 403 (or the allele variant of SEQ ID NO: 403), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 106 and 107 .
較佳地,該用於檢測AGRN與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由AGRN之外顯子12 組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子12 的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測AGRN與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 433組成的序列(或SEQ ID NO: 433之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of AGRN and NRG1 are specifically associated with a sequence consisting of exon 12 of AGRN (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 12 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of AGRN and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 433 (or the allele variant of SEQ ID NO: 433), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自AGRN之外顯子12 係包含SEQ ID NO: 416或其對偶基因變異體,或是由其組成。 供使用於檢測 PVALB-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 12 line from AGRN comprises or consists of SEQ ID NO: 416 or its allele variant. Primers or probes for use in assays for detecting PVALB-NRG1 fusions
本揭露提供一種用於檢測包含有PVALB核酸序列(或PVALB核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部份)融合的多核苷酸融合物之核酸引子、引子對或探針。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 444之多核苷酸(或SEQ ID NO: 444之對偶基因變異體)及根據SEQ ID NO: 138之多核苷酸(或SEQ ID NO: 138之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a polynucleotide fusion comprising a PVALB nucleic acid sequence (or a part of a PVALB nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of an NRG1 nucleic acid sequence) . Preferably, the nucleic acid probe, primer or primer pair is specifically compatible with the polynucleotide according to SEQ ID NO: 444 (or the allele variant of SEQ ID NO: 444) and the multinuclear polynucleotide according to SEQ ID NO: 138 nucleotide (or the allele variant of SEQ ID NO: 138), or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with these sequences and are greater than about 12 to about 40 nucleotides in length.
此外,本揭露提供一種用於檢測包含有SEQ ID NO: 437或由其組成的PVALB-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置102及103的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 437之多核苷酸(或SEQ ID NO: 437之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置102及103的核酸。In addition, the present disclosure provides a nucleic acid primer, primer pair or probe for detecting a PVALB-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 437, and the sequence includes nucleic acids at positions 102 and 103. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 437 (or the allele variant of SEQ ID NO: 437), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 102 and 103 .
較佳地,該用於檢測PVALB與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由PVALB之外顯子4組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子4的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測PVALB與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 450組成的序列(或SEQ ID NO: 450之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of PVALB and NRG1 is specifically associated with a sequence consisting of exon 4 of PVALB (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 4 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of PVALB and NRG1 is specifically related to the sequence composed of SEQ ID NO: 450 (or the allele variant of SEQ ID NO: 450), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自PVALB之外顯子4係包含SEQ ID NO: 442或其對偶基因變異體,或是由其組成。 供使用於檢測 APP-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 4 line from PVALB comprises or consists of SEQ ID NO: 442 or its allele variant. Primers or probes for use in assays for detecting APP-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 486或由其組成的APP-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置54及55的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 486之多核苷酸(或SEQ ID NO: 486之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置54及55的核酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting an APP-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 486, and the sequence includes the nucleic acids at positions 54 and 55. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 486 (or the allele variant of SEQ ID NO: 486), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 54 and 55 .
較佳地,該用於檢測APP與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由APP之外顯子14組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子14的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測APP與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 524組成的序列(或SEQ ID NO: 524之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of APP and NRG1 is specific for a sequence consisting of exon 14 of APP (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 14 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of APP and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 524 (or the allele variant of SEQ ID NO: 524), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自APP之外顯子14係包含SEQ ID NO: 501或其對偶基因變異體,或是由其組成。 供使用於檢測 WRN-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 14 line from APP comprises or consists of SEQ ID NO: 501 or its allele variant. Primers or probes for use in assays for detecting WRN-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 528或由其組成的WRN-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置96及97的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 528之多核苷酸(或SEQ ID NO: 528之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置96及97的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a WRN-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 528, and the sequence includes nucleic acids at positions 96 and 97. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 528 (or the allele variant of SEQ ID NO: 528), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 96 and 97 .
較佳地,該用於檢測WRN與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由WRN之外顯子33組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子33的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測WRN與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 601組成的序列(或SEQ ID NO: 601之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of WRN and NRG1 is specifically associated with a sequence consisting of exon 33 of WRN (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 33 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of WRN and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 601 (or the allele variant of SEQ ID NO: 601), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自WRN之外顯子33係包含SEQ ID NO: 562或其對偶基因變異體,或是由其組成。 供使用於檢測 DAAM1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 33 line from WRN comprises or consists of SEQ ID NO: 562 or its allele variant. Primers or probes for use in assays for detecting DAAM1-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 605或由其組成的DAAM1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 605之多核苷酸(或SEQ ID NO: 605之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a DAAM1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 605, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 605 (or the allele variant of SEQ ID NO: 605), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測DAAM1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由DAAM1之外顯子1組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子1的5’處之序列及/或一由NRG1之外顯子1組成的序列、或一位於外顯子1的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測DAAM1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 606組成的序列(或SEQ ID NO: 606之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of DAAM1 and NRG1 is specifically associated with a sequence consisting of exon 1 of DAAM1 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 1 and/or a sequence consisting of exon 1 of NRG1, or a sequence located 3' of exon 1 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of DAAM1 and NRG1 is specifically related to the sequence composed of SEQ ID NO: 606 (or the allele variant of SEQ ID NO: 606), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自DAAM1之外顯子1係包含SEQ ID NO: 606或其對偶基因變異體,或是由其組成。 供使用於檢測 ASPH-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 1 line from DAAM1 comprises or consists of SEQ ID NO: 606 or its allele variant. Primers or probes for use in assays for detecting ASPH-NRG1 fusions
本揭露提供一種用於檢測包含有ASPH核酸序列(或ASPH核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部份)融合的多核苷酸融合物之核酸引子、引子對或探針。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 662之多核苷酸(或SEQ ID NO: 662之對偶基因變異體)及根據SEQ ID NO: 138之多核苷酸(或SEQ ID NO: 138之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a polynucleotide fusion comprising an ASPH nucleic acid sequence (or a part of an ASPH nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of an NRG1 nucleic acid sequence) . Preferably, the nucleic acid probe, primer or primer pair is specifically compatible with the polynucleotide according to SEQ ID NO: 662 (or the allele variant of SEQ ID NO: 662) and the multinuclear polynucleotide according to SEQ ID NO: 138 nucleotide (or the allele variant of SEQ ID NO: 138), or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with these sequences and are greater than about 12 to about 40 nucleotides in length.
此外,本揭露提供一種用於檢測包含有SEQ ID NO: 635或由其組成的ASPH-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 635之多核苷酸(或SEQ ID NO: 635之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。In addition, the present disclosure provides a nucleic acid primer, primer pair or probe for detecting an ASPH-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 635, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 635 (or the allele variant of SEQ ID NO: 635), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測ASPH與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由ASPH之外顯子22組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子22的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測ASPH與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 689組成的序列(或SEQ ID NO: 689之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of ASPH and NRG1 is specific for a sequence consisting of exon 22 of ASPH (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 22 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of ASPH and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 689 (or the allele variant of SEQ ID NO: 689), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自ASPH之外顯子22係包含SEQ ID NO: 658或其對偶基因變異體,或是由其組成。 供使用於檢測 NOTCH2-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 22 line from ASPH comprises or consists of SEQ ID NO: 658 or its allele variant. Primers or probes for use in assays for detecting NOTCH2-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 693或由其組成的NOTCH2-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 693之多核苷酸(或SEQ ID NO: 693之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a NOTCH2-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 693, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 693 (or the allele variant of SEQ ID NO: 693), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測NOTCH2與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由NOTCH2之外顯子6組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子6的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測NOTCH2與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 713組成的序列(或SEQ ID NO: 713之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of NOTCH2 and NRG1 is specifically associated with a sequence consisting of exon 6 of NOTCH2 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 6 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of NOTCH2 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 713 (or the allele variant of SEQ ID NO: 713), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自NOTCH2之外顯子6係包含SEQ ID NO: 700或其對偶基因變異體,或是由其組成。 供使用於檢測 CD74-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 6 line from NOTCH2 comprises or consists of SEQ ID NO: 700 or its allele variant. Primers or probes for use in assays for detecting CD74-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 717或由其組成的CD74-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 717之多核苷酸(或SEQ ID NO: 717之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a CD74-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 717, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 717 (or the allele variant of SEQ ID NO: 717), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測CD74與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由CD74之外顯子2組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子2的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測CD74與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 739組成的序列(或SEQ ID NO: 739之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers used to detect fusions of CD74 and NRG1 is specific to a sequence consisting of exon 2 of CD74 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 2 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of CD74 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 739 (or the allele variant of SEQ ID NO: 739), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自CD74之外顯子2係包含SEQ ID NO: 720或其對偶基因變異體,或是由其組成。 供使用於檢測 SDC4-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 2 line from CD74 comprises or consists of SEQ ID NO: 720 or its allele variant. Primers or probes for use in assays for detecting SDC4-NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 743或由其組成的SDC4-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 743之多核苷酸(或SEQ ID NO: 743之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a SDC4-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 743, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 743 (or the allele variant of SEQ ID NO: 743), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測SDC4與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由SDC4之外顯子2組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子2的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測SDC4與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 757組成的序列(或SEQ ID NO: 757之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of SDC4 and NRG1 is specifically associated with a sequence consisting of exon 2 of SDC4 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 2 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of SDC4 and NRG1 is specifically related to the sequence composed of SEQ ID NO: 757 (or the allele variant of SEQ ID NO: 757), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自SDC4之外顯子2係包含SEQ ID NO: 746或其對偶基因變異體,或是由其組成。Preferably, the exon 2 line from SDC4 comprises or consists of SEQ ID NO: 746 or its allele variant.
本揭露提供一種用於檢測包含有SEQ ID NO: 824或由其組成的SDC4-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 824之多核苷酸(或SEQ ID NO: 824之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a SDC4-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 824, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 824 (or the allele variant of SEQ ID NO: 824), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測SDC4與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由SDC4之外顯子4組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子4的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測SDC4與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 940組成的序列(或SEQ ID NO: 940之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of SDC4 and NRG1 is specifically associated with a sequence consisting of exon 4 of SDC4 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 4 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of SDC4 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 940 (or the allele variant of SEQ ID NO: 940), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自SDC4之外顯子2係包含SEQ ID NO: 748或其對偶基因變異體,或是由其組成。 供使用於檢測 SLC4A4 -NRG1 融合物之測定法中的引子或探針 Preferably, the exon 2 line from SDC4 comprises or consists of SEQ ID NO: 748 or its allele variant. Primers or probes for use in assays for detecting SLC4A4 -NRG1 fusions
本揭露提供一種用於檢測包含有SEQ ID NO: 765或由其組成的SLC4A4-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 765之多核苷酸(或SEQ ID NO: 765之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a SLC4A4-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 765, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 765 (or the allele variant of SEQ ID NO: 765), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測SLC4A4與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由SLC4A4之外顯子14組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子14的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測SLC4A4與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 820組成的序列(或SEQ ID NO: 820之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of SLC4A4 and NRG1 is specifically associated with a sequence consisting of exon 14 of SLC4A4 (such as used in gain-of-function allele assays sequence), a sequence located 5' of exon 14 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or primer pair used for detecting the fusion of SLC4A4 and NRG1 is specifically related to the sequence composed of SEQ ID NO: 820 (or the allele variant of SEQ ID NO: 820), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自SLC4A4之外顯子14係包含SEQ ID NO: 780或其對偶基因變異體,或是由其組成。 供使用於檢測 ZFAT-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 14 line from SLC4A4 comprises or consists of SEQ ID NO: 780 or its allele variant. Primers or probes for use in assays for detecting ZFAT-NRG1 fusions
本揭露提供一種用於檢測包含有ZFAT核酸序列(或ZFAT核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部份)融合的多核苷酸融合物之核酸引子、引子對或探針。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 846之多核苷酸(或SEQ ID NO: 846之對偶基因變異體)及根據SEQ ID NO: 138之多核苷酸(或SEQ ID NO: 138之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。The present disclosure provides a nucleic acid primer, primer pair or probe for detecting a polynucleotide fusion comprising a ZFAT nucleic acid sequence (or a part of a ZFAT nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of a NRG1 nucleic acid sequence) . Preferably, the nucleic acid probe, primer or primer pair is specifically compatible with the polynucleotide according to SEQ ID NO: 846 (or the allele variant of SEQ ID NO: 846) and the multinuclear polynucleotide according to SEQ ID NO: 138 nucleotide (or the allele variant of SEQ ID NO: 138), or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with these sequences and are greater than about 12 to about 40 nucleotides in length.
此外,本揭露提供一種用於檢測包含有SEQ ID NO: 828或由其組成的ZFAT-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 828之多核苷酸(或SEQ ID NO: 828之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。In addition, the present disclosure provides a nucleic acid primer, primer pair or probe for detecting a ZFAT-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 828, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 828 (or the allele variant of SEQ ID NO: 828), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測ZFAT與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由ZFAT之外顯子12組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子12的5’處之序列及/或一由NRG1之外顯子6組成的序列、或一位於外顯子6的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測ZFAT與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 864組成的序列(或SEQ ID NO: 864之對偶基因變異體)、及/或SEQ ID NO: 155組成的序列(或SEQ ID NO: 155之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting fusions of ZFAT and NRG1 is specifically associated with a sequence consisting of exon 12 of ZFAT (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 12 and/or a sequence consisting of exon 6 of NRG1, or a sequence located 3' of exon 6 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of ZFAT and NRG1 is specifically related to the sequence composed of SEQ ID NO: 864 (or the allele variant of SEQ ID NO: 864), and/or a sequence consisting of SEQ ID NO: 155 (or an allele variant of SEQ ID NO: 155) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自ZFAT之外顯子12係包含SEQ ID NO: 841或其對偶基因變異體,或是由其組成。 供使用於檢測 DSCAML1-NRG1 融合物之測定法中的引子或探針 Preferably, the exon 12 line from ZFAT comprises or consists of SEQ ID NO: 841 or its allele variant. Primers or probes for use in assays for detecting DSCAML1-NRG1 fusions
本揭露提供一種用於檢測包含有DSCAML1核酸序列(或DSCAML1核酸序列的一部分)與NRG1核酸序列(或NRG1核酸序列的一部份)融合的多核苷酸融合物之核酸引子、引子對或探針。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 903之多核苷酸(或SEQ ID NO: 903之對偶基因變異體)及根據SEQ ID NO: 138之多核苷酸(或SEQ ID NO: 138之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。The disclosure provides a nucleic acid primer, primer pair or probe for detecting a polynucleotide fusion comprising a DSCAML1 nucleic acid sequence (or a part of a DSCAML1 nucleic acid sequence) fused to an NRG1 nucleic acid sequence (or a part of an NRG1 nucleic acid sequence) . Preferably, the nucleic acid probe, primer or primer pair is specifically compatible with the polynucleotide according to SEQ ID NO: 903 (or the allele variant of SEQ ID NO: 903) and the polynucleotide according to SEQ ID NO: 138 nucleotide (or the allele variant of SEQ ID NO: 138), or have 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with these sequences and are greater than about 12 to about 40 nucleotides in length.
此外,本揭露提供一種用於檢測包含有SEQ ID NO: 868或由其組成的DSCAML1-NRG1多核苷酸融合物之核酸引子、引子對或探針,且該序列包括位置75及76的核酸。較佳地,該核酸探針、引子或引子對係特異性地與根據SEQ ID NO: 868之多核苷酸(或SEQ ID NO: 868之對偶基因變異體)雜合,或是與該序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至40個核苷酸,且該序列較佳地包括位置75及76的核酸。In addition, the present disclosure provides a nucleic acid primer, primer pair or probe for detecting a DSCAML1-NRG1 polynucleotide fusion comprising or consisting of SEQ ID NO: 868, and the sequence includes nucleic acids at positions 75 and 76. Preferably, the nucleic acid probe, primer or primer pair is specifically hybridized to the polynucleotide according to SEQ ID NO: 868 (or the allele variant of SEQ ID NO: 868), or is related to the sequence 95%, 96%, 97%, 98%, 99%, or preferably 100% sequence identity and over about 12 to 40 nucleotides in length, and the sequence preferably includes the nucleic acids at positions 75 and 76 .
較佳地,該用於檢測DSCAML1與NRG1之融合物的核酸探針、引子或引子對係特異性地與一由DSCAML1之外顯子3組成的序列(諸如使用於功能獲得對偶基因測定法中之序列)、一位於外顯子3的5’處之序列及/或一由NRG1之外顯子2組成的序列、或一位於外顯子2的3’處之序列(諸如NRG1之基因序列)雜合(或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸)。較佳地,該用於檢測DSCAML1與NRG1之融合物的核酸探針、引子或引子對係特異性地與SEQ ID NO: 938組成的序列(或SEQ ID NO: 938之對偶基因變異體)、及/或SEQ ID NO: 153組成的序列(或SEQ ID NO: 153之對偶基因變異體)雜合,或是與該等序列有95%、96%、97%、98%、99%或較佳地有100%的序列一致性且長度超過約12至約40個核苷酸。亦提供了用於檢測涉及所述外顯子任一者之對偶基因變異體的融合物之核酸探針、引子或引子對。Preferably, the nucleic acid probe, primer or pair of primers for detecting a fusion of DSCAML1 and NRG1 is specifically associated with a sequence consisting of exon 3 of DSCAML1 (such as used in a gain-of-function allele assay sequence), a sequence located 5' of exon 3 and/or a sequence consisting of exon 2 of NRG1, or a sequence located 3' of exon 2 (such as the gene sequence of NRG1 ) is heterozygous (or has 95%, 96%, 97%, 98%, 99% or preferably 100% sequence identity with such sequences and is longer than about 12 to about 40 nucleotides in length). Preferably, the nucleic acid probe, primer or pair of primers for detecting the fusion of DSCAML1 and NRG1 is specifically compatible with the sequence consisting of SEQ ID NO: 938 (or the allele variant of SEQ ID NO: 938), and/or a sequence consisting of SEQ ID NO: 153 (or an allele variant of SEQ ID NO: 153) is heterozygous, or is 95%, 96%, 97%, 98%, 99% or more Preferably there is 100% sequence identity and over about 12 to about 40 nucleotides in length. Also provided are nucleic acid probes, primers or primer pairs for detecting fusions involving allogenic variants of any of said exons.
較佳地,來自DSCAML1之外顯子3係包含SEQ ID NO: 872或其對偶基因變異體,或是由其組成。 使用於原位雜交之探針 Preferably, the exon 3 line from DSCAML1 comprises or consists of SEQ ID NO: 872 or its allele variant. Probes for in situ hybridization
亦提供使用於原位雜交(ISH)測定法之探針以檢測涉及VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之任何基因重排。尤其,該等探針係供使用於螢光原位雜交(FISH)或斷裂分離FISH。該基因重排較佳為VAPB、CADM1、CD44、SLC3A2、VTCN1 CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1與NRG1之基因融合物,但因為這些ISH測定法係靶向本文所揭露之融合接合處的5’及3’側,可使用該ISH測定法來檢測涉及該等基因的任何基因重排。Probes for use in in situ hybridization (ISH) assays to detect genes involved in VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, Any gene rearrangement of DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1. In particular, the probes are intended for use in fluorescence in situ hybridization (FISH) or fragmentation FISH. The gene rearrangement is preferably VAPB, CADM1, CD44, SLC3A2, VTCN1 CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, Gene fusions of ZFAT or DSCAML1 and NRG1, but because these ISH assays target the 5' and 3' sides of the fusion junctions disclosed herein, this ISH assay can be used to detect any gene duplication involving these genes. Row.
因此,特別提供一第一核酸探針及一第二核酸探針供使用原位雜交測定法中,以檢測VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之基因重排,其中: - 用於檢測VAPB之基因重排的第一探針特異性地與位於SEQ ID NO: 1之位置42或43之核酸5’處的VAPB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 1之位置42或43之核酸3’處的VAPB序列雜交; - 用於檢測CADM1之基因重排的第一探針特異性地與位於SEQ ID NO: 5之位置53之核酸5’處的CADM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 5之位置53之核酸3’處的CADM1序列雜交; - 用於檢測CD44之基因重排的第一探針特異性地與位於SEQ ID NO: 9之位置52之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 9之位置52之核酸3’處的CD44序列雜交; - 用於檢測CD44之基因重排的第一探針特異性地與位於SEQ ID NO: 759之位置75之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 759之位置75之核酸3’處的CD44序列雜交; - 用於檢測SLC3A2之基因重排的第一探針特異性地與位於SEQ ID NO: 13之位置53之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 13之位置53之核酸3’處的SLC3A2序列雜交; - 用於檢測VTCN1之基因重排的第一探針特異性地與位於SEQ ID NO: 164之位置65之核酸5’處的VTCN1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 164之位置65之核酸3’處的VTCN1序列雜交; - 用於檢測CDH1之基因重排的第一探針特異性地與位於SEQ ID NO: 184之位置119之核酸5’處的CDH1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 184之位置119之核酸3’處的CDH1序列雜交; - 用於檢測CXADR之基因重排的第一探針特異性地與位於SEQ ID NO: 215之位置43之核酸5’處的CXADR序列雜交,且該第二探針特異性地與位於SEQ ID NO: 215之位置43之核酸3’處的CXADR序列雜交; - 用於檢測GTF2E2之基因重排的第一探針特異性地與位於SEQ ID NO: 231之位置141之核酸5’處的GTF2E2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 231之位置141之核酸3’處的GTF2E2序列雜交; - 用於檢測CSMD1之基因重排的第一探針特異性地與位於SEQ ID NO: 253之位置88之核酸5’處的CSMD1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 253之位置88之核酸3’處的CSMD1序列雜交; - 用於檢測PTN之基因重排的第一探針特異性地與位於SEQ ID NO: 311之位置102之核酸5’處的PTN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 311之位置102之核酸3’處的PTN序列雜交; - 用於檢測ST14之基因重排的第一探針特異性地與位於SEQ ID NO: 328之位置95之核酸5’處的ST14序列雜交,且該第二探針特異性地與位於SEQ ID NO: 328之位置95之核酸3’處的ST14序列雜交; - 用於檢測THBS1之基因重排的第一探針特異性地與位於SEQ ID NO: 374之位置56之核酸5’處的THBS1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 374之位置56之核酸3’處的THBS1序列雜交; - 用於檢測AGRN之基因重排的第一探針特異性地與位於SEQ ID NO: 401之位置106之核酸5’處的AGRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 401之位置106之核酸3’處的AGRN序列雜交; - 用於檢測PVALB之基因重排的第一探針特異性地與位於SEQ ID NO: 435之位置102之核酸5’處的PVALB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 435之位置102之核酸3’處的PVALB序列雜交; - 用於檢測SLC3A2之基因重排的第一探針特異性地與位於SEQ ID NO: 452之位置93之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 452之位置93之核酸3’處的SLC3A2序列雜交; - 用於檢測APP之基因重排的第一探針特異性地與位於SEQ ID NO: 484之位置54之核酸5’處的APP序列雜交,且該第二探針特異性地與位於SEQ ID NO: 484之位置54之核酸3’處的APP序列雜交; - 用於檢測WRN之基因重排的第一探針特異性地與位於SEQ ID NO: 526之位置96之核酸5’處的WRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 526之位置96之核酸3’處的WRN序列雜交; - 用於檢測DAAM1之基因重排的第一探針特異性地與位於SEQ ID NO: 603之位置75之核酸5’處的DAAM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 603之位置75之核酸3’處的DAAM1序列雜交; - 用於檢測ASPH之基因重排的第一探針特異性地與位於SEQ ID NO: 633之位置75之核酸5’處的ASPH序列雜交,且該第二探針特異性地與位於SEQ ID NO: 633之位置75之核酸3’處的ASPH序列雜交; - 用於檢測NOTCH2之基因重排的第一探針特異性地與位於SEQ ID NO: 691之位置75之核酸5’處的NOTCH2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 691之位置75之核酸3’處的NOTCH2序列雜交; - 用於檢測CD74之基因重排的第一探針特異性地與位於SEQ ID NO: 715之位置75之核酸5’處的CD74序列雜交,且該第二探針特異性地與位於SEQ ID NO: 715之位置75之核酸3’處的CD74序列雜交; - 用於檢測SDC4之基因重排的第一探針特異性地與位於SEQ ID NO: 741之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 741之位置75之核酸3’處的SDC4序列雜交; - 用於檢測SDC4之基因重排的第一探針特異性地與位於SEQ ID NO: 822之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 822之位置75之核酸3’處的SDC4序列雜交; - 用於檢測SLC4A4之基因重排的第一探針特異性地與位於SEQ ID NO: 763之位置75之核酸5’處的SLC4A4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 763之位置75之核酸3’處的SLC4A4序列雜交; - 用於檢測ZFAT之基因重排的第一探針特異性地與位於SEQ ID NO: 826之位置75之核酸5’處的ZFAT序列雜交,且該第二探針特異性地與位於SEQ ID NO: 826之位置75之核酸3’處的ZFAT序列雜交;或 - 用於檢測DSCAML1之基因重排的第一探針特異性地與位於SEQ ID NO: 866之位置75之核酸5’處的DSCAML1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 866之位置75之核酸3’處的DSCAML1序列雜交。 Accordingly, a first nucleic acid probe and a second nucleic acid probe are specifically provided for use in an in situ hybridization assay to detect VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, Gene rearrangements of THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1, where: - the first probe for detecting the gene rearrangement of VAPB is specifically hybridized with the VAPB sequence at the nucleic acid 5' at position 42 or 43 of SEQ ID NO: 1, and the second probe is specifically hybridized with the sequence at The VAPB sequence hybridization at the nucleic acid 3' of position 42 or 43 of SEQ ID NO: 1; - the first probe for detecting the gene rearrangement of CADM1 is specifically hybridized to the CADM1 sequence located at the nucleic acid 5' of position 53 of SEQ ID NO: 5, and the second probe is specifically hybridized to the sequence located at SEQ ID NO: CADM1 sequence hybridization at the 3' of the nucleic acid at position 53 of 5; - the first probe for detecting gene rearrangement of CD44 specifically hybridizes to the CD44 sequence at the nucleic acid 5' at position 52 of SEQ ID NO: 9, and the second probe specifically hybridizes to the sequence at SEQ ID NO: CD44 sequence hybridization at the 3' position of the nucleic acid at position 52 of 9; - the first probe for detecting gene rearrangement of CD44 specifically hybridizes to the CD44 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 759, and the second probe specifically hybridizes to the sequence at SEQ ID CD44 sequence hybridization at the 3' position of the nucleic acid at position 75 of NO: 759; - the first probe for detecting the gene rearrangement of SLC3A2 specifically hybridizes to the SLC3A2 sequence at the nucleic acid 5' at position 53 of SEQ ID NO: 13, and the second probe specifically hybridizes to the sequence at SEQ ID NO: SLC3A2 sequence hybridization at the 3' of the nucleic acid at position 53 of 13; - the first probe for detecting the gene rearrangement of VTCN1 is specifically hybridized with the VTCN1 sequence at the nucleic acid 5' at position 65 of SEQ ID NO: 164, and the second probe is specifically hybridized with the sequence at SEQ ID NO: VTCN1 sequence hybridization at the 3' position of the nucleic acid at position 65 of 164; - the first probe for detecting gene rearrangement of CDH1 specifically hybridizes to the CDH1 sequence at the nucleic acid 5' at position 119 of SEQ ID NO: 184, and the second probe specifically hybridizes to the sequence at SEQ ID NO: CDH1 sequence hybridization at the 3' position of the nucleic acid at position 119 of 184; - the first probe for detecting the gene rearrangement of CXADR is specifically hybridized with the CXADR sequence at the nucleic acid 5' at position 43 of SEQ ID NO: 215, and the second probe is specifically hybridized with the sequence at SEQ ID NO: CXADR sequence hybridization at the 3' position of the nucleic acid at position 43 of 215; - the first probe for detecting the gene rearrangement of GTF2E2 specifically hybridizes to the GTF2E2 sequence located at the nucleic acid 5' of position 141 of SEQ ID NO: 231, and the second probe specifically hybridizes to the sequence located at SEQ ID NO: GTF2E2 sequence hybridization at the 3' position of the nucleic acid at position 141 of 231; - the first probe for detecting the gene rearrangement of CSMD1 is specifically hybridized to the CSMD1 sequence at the nucleic acid 5' at position 88 of SEQ ID NO: 253, and the second probe is specifically hybridized to the sequence at SEQ ID NO: CSMD1 sequence hybridization at the 3' position of the nucleic acid at position 88 of 253; - the first probe for detecting the gene rearrangement of PTN is specifically hybridized with the PTN sequence at the nucleic acid 5' at position 102 of SEQ ID NO: 311, and the second probe is specifically hybridized with the sequence at the position of SEQ ID NO: PTN sequence hybridization at the 3' position of the nucleic acid at position 102 of 311; - the first probe for detecting the gene rearrangement of ST14 specifically hybridizes to the ST14 sequence at the nucleic acid 5' at position 95 of SEQ ID NO: 328, and the second probe specifically hybridizes to the sequence at SEQ ID NO: ST14 sequence hybridization at the 3' position of the nucleic acid at position 95 of 328; - the first probe for detecting gene rearrangement of THBS1 specifically hybridizes to the THBS1 sequence at the 5' of the nucleic acid at position 56 of SEQ ID NO: 374, and the second probe specifically hybridizes to the sequence at SEQ ID THBS1 sequence hybridization at the 3' position of the nucleic acid at position 56 of NO: 374; - the first probe for detecting the gene rearrangement of AGRN is specifically hybridized with the AGRN sequence at the nucleic acid 5' at position 106 of SEQ ID NO: 401, and the second probe is specifically hybridized with the sequence at the position of SEQ ID NO: AGRN sequence hybridization at the 3' of the nucleic acid at position 106 of 401; - the first probe for detecting the gene rearrangement of PVALB is specifically hybridized with the PVALB sequence located at the nucleic acid 5' of position 102 of SEQ ID NO: 435, and the second probe is specifically hybridized with the sequence located at SEQ ID NO: PVALB sequence hybridization at the 3' position of the nucleic acid at position 102 of 435; - the first probe for detecting gene rearrangement of SLC3A2 specifically hybridizes to the SLC3A2 sequence at the nucleic acid 5' at position 93 of SEQ ID NO: 452, and the second probe specifically hybridizes to the sequence at SEQ ID NO: SLC3A2 sequence hybridization at the 3' position of the nucleic acid at position 93 of 452; - the first probe for detecting gene rearrangement of APP specifically hybridizes to the APP sequence at the nucleic acid 5' at position 54 of SEQ ID NO: 484, and the second probe specifically hybridizes to the sequence at SEQ ID NO: APP sequence hybridization at the 3' position of the nucleic acid at position 54 of 484; - the first probe for detecting the gene rearrangement of WRN specifically hybridizes to the WRN sequence at the nucleic acid 5' at position 96 of SEQ ID NO: 526, and the second probe specifically hybridizes to the sequence at SEQ ID NO: WRN sequence hybridization at the 3' position of the nucleic acid at position 96 of 526; - the first probe for detecting gene rearrangement of DAAM1 specifically hybridizes to the DAAM1 sequence at the 5' of the nucleic acid at position 75 of SEQ ID NO: 603, and the second probe specifically hybridizes to the sequence at SEQ ID NO: DAAM1 sequence hybridization at the 3' position of the nucleic acid at position 75 of 603; - the first probe for detecting the gene rearrangement of ASPH specifically hybridizes to the ASPH sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 633, and the second probe specifically hybridizes to the sequence at SEQ ID ASPH sequence hybridization at the 3' of the nucleic acid at position 75 of NO: 633; - the first probe for detecting the gene rearrangement of NOTCH2 specifically hybridizes to the NOTCH2 sequence at the 5' of the nucleic acid at position 75 of SEQ ID NO: 691, and the second probe specifically hybridizes to the sequence at SEQ ID NO: Hybridization of the NOTCH2 sequence at the 3' of the nucleic acid at position 75 of 691; - the first probe for detecting gene rearrangement of CD74 specifically hybridizes to the CD74 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 715, and the second probe specifically hybridizes to the sequence at SEQ ID CD74 sequence hybridization at the 3' of the nucleic acid at position 75 of NO: 715; - the first probe for detecting the gene rearrangement of SDC4 specifically hybridizes to the SDC4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 741, and the second probe specifically hybridizes to the sequence at SEQ ID NO: SDC4 sequence hybridization at the 3' position of the nucleic acid at position 75 of 741; - the first probe for detecting the gene rearrangement of SDC4 specifically hybridizes to the SDC4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 822, and the second probe specifically hybridizes to the sequence at SEQ ID NO: SDC4 sequence hybridization at the 3' position of the nucleic acid at position 75 of 822; - the first probe for detecting the gene rearrangement of SLC4A4 specifically hybridizes to the SLC4A4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 763, and the second probe specifically hybridizes to the sequence at SEQ ID SLC4A4 sequence hybridization at the 3' position of the nucleic acid at position 75 of NO: 763; - the first probe for detecting the gene rearrangement of ZFAT is specifically hybridized with the ZFAT sequence at the 5' nucleic acid at position 75 of SEQ ID NO: 826, and the second probe is specifically hybridized with the sequence at SEQ ID ZFAT sequence hybridization at the 3' position of the nucleic acid at position 75 of NO: 826; or - the first probe for detecting the gene rearrangement of DSCAML1 specifically hybridizes to the DSCAML1 sequence at the 5' of the nucleic acid at position 75 of SEQ ID NO: 866, and the second probe specifically hybridizes to the sequence at SEQ ID The DSCAML1 sequence at the 3' of the nucleic acid at position 75 of NO: 866 hybridized.
替代性地,提供了一種較佳的ISH測定法,其中係檢測選自VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1之任一多核苷酸融合物。所述測定法制定了一第一探針及一第二探針的用途,其中若該第一探針與該融合接合處5’側的序列雜交,該第二探針則與另一側的序列雜交,或者是若該第一探針與該融合接合處3’側的序列雜交,該第二探針則與另一側的序列雜交,且其中該第一探針係與選自VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1之序列雜交,而該第二探針係與NRG1序列雜交且較佳為諸如本文所提及之EGF樣結構域序列。包括了與EGF樣結構域雜交之探針時,會將所述結構域置於該NRG1 融合配偶體序列的附近。Alternatively, a preferred ISH assay is provided wherein the assay is selected from the group consisting of VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1 , CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4 - any polynucleotide fusion of NRG1, SLC4A4-NRG1, ZFAT-NRG1 or DSCAML1-NRG1. The assay specifies the use of a first probe and a second probe, wherein if the first probe hybridizes to a sequence on the 5' side of the fusion junction, the second probe hybridizes to the sequence on the other side sequence hybridization, or if the first probe hybridizes to the sequence on the 3' side of the fusion junction, the second probe hybridizes to the sequence on the other side, and wherein the first probe is selected from VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1 sequence hybridization, and The second probe hybridizes to the NRG1 sequence and is preferably an EGF-like domain sequence such as mentioned herein. Inclusion of a probe that hybridizes to an EGF-like domain places the domain in the vicinity of the NRG1 fusion partner sequence.
特別提供了一種供使用於原位雜交測定法以檢測VAPB-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 3之位置42或43之核酸5’處的VAPB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 3之位置43或44之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Specifically provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect VAPB-NRG1 fusions, wherein the first probe is specifically at the position of SEQ ID NO: 3 The VAPB sequence at the nucleic acid 5' place of 42 or 43 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' place at position 43 or 44 of SEQ ID NO:3, such as the EGF of the present disclosure The sequence comprised by the structural domain, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CADM1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 7之位置53之核酸5’處的CADM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 7之位置54之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CADM1-NRG1 fusions, wherein the first probe is specifically linked to the sequence located between SEQ ID NO: 7 The CADM1 sequence at the 5' position of the nucleic acid at position 53 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 54 of SEQ ID NO: 7, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CD44-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 11之位置52之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 11之位置53之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CD44-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 11 The CD44 sequence at the 5' position of the nucleic acid at position 52 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 53 of SEQ ID NO: 11, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CD44-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 761之位置75之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 761之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CD44-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 761 The CD44 sequence at the 5' position of the nucleic acid at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 76 of SEQ ID NO: 761, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測SLC3A2-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 15之位置53之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 15之位置54之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect SLC3A2-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 15 The SLC3A2 sequence at the 5' position of the nucleic acid at position 53 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 54 of SEQ ID NO: 15, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測VTCN1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 166之位置65之核酸5’處的VTCN1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 166之位置66之核酸3’處的VTCN1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect VTCN1-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 166 The VTCN1 sequence at the 5' position of the nucleic acid at position 65 hybridizes, and the second probe specifically hybridizes with the VTCN1 sequence at the 3' position of the nucleic acid at position 66 of SEQ ID NO: 166, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CDH1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 186之位置119之核酸5’處的CDH1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 186之位置120之核酸3’處的CDH1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CDH1-NRG1 fusions, wherein the first probe is specifically related to the nucleic acid located between SEQ ID NO: 186 The CDH1 sequence at the 5' position of the nucleic acid at position 119 hybridizes, and the second probe specifically hybridizes with the CDH1 sequence at the 3' position of the nucleic acid at position 120 of SEQ ID NO: 186, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CXADR-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 217之位置43之核酸5’處的CXADR序列雜交,且該第二探針特異性地與位於SEQ ID NO: 217之位置44之核酸3’處的CXADR序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CXADR-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 217 The CXADR sequence at the nucleic acid 5' at position 43 hybridizes, and the second probe specifically hybridizes with the CXADR sequence at the nucleic acid 3' at position 44 of SEQ ID NO: 217, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測GTF2E2-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 233之位置141之核酸5’處的GTF2E2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 233之位置142之核酸3’處的GTF2E2序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect GTF2E2-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 233 The GTF2E2 sequence at the 5' position of the nucleic acid at position 141 hybridizes, and the second probe specifically hybridizes with the GTF2E2 sequence at the 3' position of the nucleic acid at position 142 of SEQ ID NO: 233, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CSMD1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 255之位置88之核酸5’處的CSMD1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 255之位置89之核酸3’處的CSMD1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CSMD1-NRG1 fusions, wherein the first probe is specifically linked to the sequence located between SEQ ID NO: 255 The CSMD1 sequence at the nucleic acid 5' at position 88 hybridizes, and the second probe specifically hybridizes with the CSMD1 sequence at the nucleic acid 3' at position 89 of SEQ ID NO: 255, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測PTN-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 313之位置102之核酸5’處的PTN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 313之位置103之核酸3’處的PTN序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect PTN-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 313 The PTN sequence at the nucleic acid 5' at position 102 hybridizes, and the second probe specifically hybridizes with the PTN sequence at the nucleic acid 3' at position 103 of SEQ ID NO: 313, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測ST14-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 330之位置95之核酸5’處的ST14序列雜交,且該第二探針特異性地與位於SEQ ID NO: 330之位置96之核酸3’處的ST14序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect ST14-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 330 The ST14 sequence at the 5' position of the nucleic acid at position 95 hybridizes, and the second probe specifically hybridizes with the ST14 sequence at the 3' position of the nucleic acid at position 96 of SEQ ID NO: 330, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測THBS1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 376之位置56之核酸5’處的THBS1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 376之位置57之核酸3’處的THBS1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect THBS1-NRG1 fusions, wherein the first probe is specifically associated with the nucleic acid located between SEQ ID NO: 376 The THBS1 sequence at the 5' position of the nucleic acid at position 56 hybridizes, and the second probe specifically hybridizes with the THBS1 sequence at the 3' position of the nucleic acid at position 57 of SEQ ID NO: 376, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測AGRN-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 403之位置106之核酸5’處的AGRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 403之位置107之核酸3’處的AGRN序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect AGRN-NRG1 fusions, wherein the first probe is specifically associated with the sequence located between SEQ ID NO: 403 The AGRN sequence at the nucleic acid 5' at position 106 hybridizes, and the second probe specifically hybridizes with the AGRN sequence at the nucleic acid 3' at position 107 of SEQ ID NO: 403, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測PVALB-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 437之位置102之核酸5’處的PVALB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 437之位置103之核酸3’處的PVALB序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect PVALB-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 437 The PVALB sequence at the nucleic acid 5' at position 102 hybridizes, and the second probe specifically hybridizes with the PVALB sequence at the nucleic acid 3' at position 103 of SEQ ID NO: 437, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測SLC3A2-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 454之位置93之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 454之位置94之核酸3’處的SLC3A2序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect SLC3A2-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 454 The SLC3A2 sequence at the nucleic acid 5' at position 93 hybridizes, and the second probe specifically hybridizes with the SLC3A2 sequence at the nucleic acid 3' at position 94 of SEQ ID NO: 454, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測APP-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 486之位置54之核酸5’處的APP序列雜交,且該第二探針特異性地與位於SEQ ID NO: 486之位置55之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect APP-NRG1 fusions, wherein the first probe is specifically related to the nucleic acid located between SEQ ID NO: 486 The APP sequence at the nucleic acid 5' at position 54 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 55 of SEQ ID NO: 486, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測WRN-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 528之位置96之核酸5’處的WRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 528之位置97之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect WRN-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 528 The WRN sequence at the nucleic acid 5' at position 96 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 97 of SEQ ID NO: 528, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測DAAM1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 605之位置75之核酸5’處的DAAM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 605之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect DAAM1-NRG1 fusions, wherein the first probe is specifically linked to the sequence located between SEQ ID NO: 605 The DAAM1 sequence at the 5' position of the nucleic acid at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 76 of SEQ ID NO: 605, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測ASPH-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 635之位置75之核酸5’處的ASPH序列雜交,且該第二探針特異性地與位於SEQ ID NO: 635之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect ASPH-NRG1 fusions, wherein the first probe is specifically related to the nucleic acid located between SEQ ID NO: 635 The ASPH sequence at the nucleic acid 5' at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 635, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測NOTCH2-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 693之位置75之核酸5’處的NOTCH2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 693之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in an in situ hybridization assay to detect a NOTCH2-NRG1 fusion, wherein the first probe is specifically related to the nucleic acid located between SEQ ID NO: 693 The NOTCH2 sequence at the 5' position of the nucleic acid at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 76 of SEQ ID NO: 693, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測CD74-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 717之位置75之核酸5’處的CD74序列雜交,且該第二探針特異性地與位於SEQ ID NO: 717之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect CD74-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 717 The CD74 sequence at the 5' position of the nucleic acid at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 76 of SEQ ID NO: 717, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測SDC4-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 743之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 743之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect SDC4-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 743 The SDC4 sequence at the nucleic acid 5' at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 743, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測SDC4-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 824之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 824之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect SDC4-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 824 The SDC4 sequence at the nucleic acid 5' at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 824, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測SLC4A4-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 765之位置75之核酸5’處的SLC4A4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 765之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect SLC4A4-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 765 The SLC4A4 sequence at the nucleic acid 5' at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 765, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測ZFAT-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 828之位置75之核酸5’處的ZFAT序列雜交,且該第二探針特異性地與位於SEQ ID NO: 828之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect ZFAT-NRG1 fusions, wherein the first probe is specifically linked to the sequence located between SEQ ID NO: 828 The ZFAT sequence at the nucleic acid 5' at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 828, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
也特別提供了一種供使用於原位雜交測定法以檢測DSCAML1-NRG1融合物的第一核酸探針及第二核酸探針,其中該第一探針特異性地與位於SEQ ID NO: 868之位置75之核酸5’處的DSCAML1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 868之位置76之核酸3’處的NRG1序列雜交,諸如本揭露之EGF樣結構域所包含之序列,特別是SEQ ID NO: 163之序列。Also particularly provided is a first nucleic acid probe and a second nucleic acid probe for use in in situ hybridization assays to detect DSCAML1-NRG1 fusions, wherein the first probe is specifically related to the sequence located between SEQ ID NO: 868 The DSCAML1 sequence at the 5' position of the nucleic acid at position 75 hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the 3' position of the nucleic acid at position 76 of SEQ ID NO: 868, such as the EGF-like domain of the present disclosure. Sequences included, in particular the sequence of SEQ ID NO: 163.
本文所提及之核酸探針、引子或引子對中之任一者係較佳地使用於以下鑑定或檢測方法中,以檢測選自VAPB-NRG1、CADM1-NRG1 CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1之多核苷酸、多肽融合物、其部分或對偶基因變異體中之任一者。 用於檢測 NRG 多核苷酸融合物之測定法 Any one of the nucleic acid probes, primers or primer pairs mentioned herein is preferably used in the following identification or detection methods to detect the selected from VAPB-NRG1, CADM1-NRG1 CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1- Any of polynucleotides, polypeptide fusions, partial or allogenic variants thereof of NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1 . Assays for the Detection of NRG Polynucleotide Fusions
根據本揭露,其提供一種於樣本中鑑定如本文所提及之多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。According to the present disclosure, there is provided a method of identifying in a sample a polynucleotide fusion as referred to herein, or a polypeptide encoded thereby, which method comprises testing a sample obtained from an individual to detect whether This fusion exists.
亦提供一種於樣本中檢測多核苷酸融合物、或由其所編碼之多肽是否存在的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。Also provided is a method of detecting the presence in a sample of a polynucleotide fusion, or a polypeptide encoded thereby, comprising examining a sample obtained from an individual to detect the presence of the fusion in the sample.
亦提供一種確定來自個體之異常細胞是否包含多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自該個體之異常細胞之多核苷酸或多肽內容物是否於樣本中存在該融合物。Also provided is a method of determining whether an abnormal cell from an individual comprises a polynucleotide fusion, or a polypeptide encoded thereby, said method comprising testing whether the polynucleotide or polypeptide content of the abnormal cell obtained from the individual is present in the sample This fusion exists.
亦提供一種鑑定一個體帶有多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。較佳地該檢驗之後為將該樣本中檢測到的融合物與帶有該融合物之個體聯繫在一起的步驟。Also provided is a method of identifying an individual carrying a polynucleotide fusion, or a polypeptide encoded thereby, comprising testing a sample obtained from an individual to detect the presence of the fusion in the sample. Preferably the test is followed by the step of associating the fusion detected in the sample with the individual carrying the fusion.
較佳地,所述檢驗包含利用一結合劑特異性地結合所述多核苷酸,諸如本文所提及之任一核酸探針、引子或引子對,或特異性地與本文所提及之任一多核苷酸融合物所編碼之多肽結合,或是替代性地利用一結合劑與包含有該多核苷酸融合物之多核苷酸結合,來檢測本文所提及之任一多核苷酸或多肽融合物。該結合劑係利用來作為檢測步驟的一部分或是使檢測成為後續步驟。因此,於本揭露之任一方法中的檢驗較佳地包含利用一結合劑特異性地結合任一多核苷酸融合物或由其所編碼之任一多肽,以檢測這類融合物。所述結合劑較佳地包含引子、引子對、探針或抗體或是由其組成。此外,該結合劑較佳地包含一可檢測標籤物。Preferably, the assay comprises the use of a binding agent that specifically binds to the polynucleotide, such as any nucleic acid probe, primer or primer pair mentioned herein, or specifically with any of the nucleic acid probes mentioned herein. Binding of a polypeptide encoded by a polynucleotide fusion, or alternatively utilizing a binding agent to bind to a polynucleotide comprising the polynucleotide fusion, to detect any of the polynucleotides mentioned herein or polypeptide fusions. The binding agent is utilized as part of the detection step or to enable detection as a subsequent step. Accordingly, assaying in any of the methods of the present disclosure preferably comprises the use of a binding agent that specifically binds to any polynucleotide fusion or any polypeptide encoded thereby to detect such fusions. The binding agent preferably comprises or consists of a primer, primer pair, probe or antibody. In addition, the binding agent preferably comprises a detectable label.
替代性地,所述檢驗包含利用一結合劑與包含有該多核苷酸融合物之多核苷酸結合以檢測該融合物。在此替代例中,該結合劑係與位於該多核苷酸接合處5’及/或3’處之多核苷酸結合且該多核苷酸接合處含有NRG1與其融合配偶體之間的實際接合處。該結合劑黏合或雜交的多核苷酸係作為轉接子來連接或以其他方式附接到該多核苷酸融合物,之後檢測該融合物。這類轉接子較佳地含有一個標籤或分子標記碼(barcode),諸如在定序方法學(包括次世代定序)中所用者。較佳地包含有標記碼或標籤之這類轉接子係附接至該多核苷酸融合物,以在擴增之後可檢測所述融合物。尤其,該結合劑可使用在Anchored Multiplex PCR (或AMP)中,其中在從mRNA產生cDNA之後獲得用於NGS之富含目標樣本庫。Alternatively, the assay comprises detecting the fusion using a binding agent that binds to the polynucleotide comprising the polynucleotide fusion. In this alternative, the binding agent binds to a polynucleotide located 5' and/or 3' to the polynucleotide junction and the polynucleotide junction contains the actual junction between NRG1 and its fusion partner . The polynucleotide to which the binding agent binds or hybridizes is ligated or otherwise attached to the polynucleotide fusion as an adapter, after which the fusion is detected. Such adapters preferably contain a tag or molecular barcode, such as used in sequencing methodologies, including next generation sequencing. Such adapters, preferably comprising a marker code or tag, are attached to the polynucleotide fusion so that the fusion is detectable after amplification. In particular, the binder can be used in Anchored Multiplex PCR (or AMP), where a target-enriched library for NGS is obtained after cDNA generation from mRNA.
較佳地,所述結合劑係特異性地與根據SEQ ID NO: 1、SEQ ID NO: 2、SEQ ID NO: 3、SEQ ID NO: 5、SEQ ID NO: 6、SEQ ID NO: 7、SEQ ID NO: 9、SEQ ID NO: 10、SEQ ID NO: 11、SEQ ID NO: 13、SEQ ID NO: 14、SEQ ID NO: 15、SEQ ID NO: 164、SEQ ID NO: 165、SEQ ID NO: 166、SEQ ID NO: 184、SEQ ID NO: 185、SEQ ID NO: 186、SEQ ID NO: 215、SEQ ID NO: 216、SEQ ID NO:217、SEQ ID NO:231、SEQ ID NO:232、SEQ ID NO:233、SEQ ID NO:253、SEQ ID NO:254、SEQ ID NO:255、SEQ ID NO:311、SEQ ID NO:312、SEQ ID NO:313、SEQ ID NO:328、SEQ ID NO:329、SEQ ID NO:330、SEQ ID NO:374、SEQ ID NO:375、SEQ ID NO:376、SEQ ID NO:401、SEQ ID NO:402、SEQ ID NO:403、SEQ ID NO:435、SEQ ID NO:436、SEQ ID NO:437、SEQ ID NO:452、SEQ ID NO:453、SEQ ID NO:454、SEQ ID NO: 484、SEQ ID NO: 485、SEQ ID NO: 486、SEQ ID NO: 526、SEQ ID NO: 527、SEQ ID NO: 528、SEQ ID NO: 603、SEQ ID NO: 604、SEQ ID NO: 605、SEQ ID NO: 633、SEQ ID NO: 634、SEQ ID NO: 635、SEQ ID NO: 691、SEQ ID NO: 692、SEQ ID NO: 693、SEQ ID NO: 715、SEQ ID NO: 716、SEQ ID NO: 717、SEQ ID NO: 741、SEQ ID NO: 742、SEQ ID NO: 743、SEQ ID NO: 759、SEQ ID NO: 760、SEQ ID NO: 761、SEQ ID NO: 763、SEQ ID NO: 764、SEQ ID NO: 765、SEQ ID NO: 822、SEQ ID NO: 823、SEQ ID NO: 824、SEQ ID NO: 826、SEQ ID NO: 827、SEQ ID NO: 828、SEQ ID NO: 866、SEQ ID NO: 867或SEQ ID NO: 868中之任一者的多核苷酸序列結合。這些序列致使在所感興趣的融合接合處緊靠進行特異性雜交,或甚至含有該融合接合處,使其成為被所述結合劑結合的良好候選序列。Preferably, the binding agent is specifically related to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 215, SEQ ID NO: 216, SEQ ID NO: 217, SEQ ID NO: 231, SEQ ID NO: 232. SEQ ID NO:233, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:311, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:374, SEQ ID NO:375, SEQ ID NO:376, SEQ ID NO:401, SEQ ID NO:402, SEQ ID NO:403, SEQ ID NO:435, SEQ ID NO:436, SEQ ID NO:437, SEQ ID NO:452, SEQ ID NO:453, SEQ ID NO:454, SEQ ID NO:484, SEQ ID NO:485, SEQ ID NO: 486, SEQ ID NO: 526, SEQ ID NO: 527, SEQ ID NO: 528, SEQ ID NO: 603, SEQ ID NO: 604, SEQ ID NO: 605, SEQ ID NO: 633, SEQ ID NO: 634, SEQ ID NO: 635, SEQ ID NO: 691, SEQ ID NO: 692, SEQ ID NO: 693, SEQ ID NO: 715, SEQ ID NO: 716, SEQ ID NO: 717, SEQ ID NO: 741, SEQ ID NO: 742, SEQ ID NO: 743, SEQ ID NO: 759, SEQ ID NO: 760, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 764, SEQ ID NO: 765, SEQ ID NO: 822, SEQ ID NO: 823, SEQ ID NO: 824, SEQ ID NO: 826, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID NO: 866, SEQ ID NO: 867 or SEQ ID NO: 868 combination of polynucleotide sequences of any of them. These sequences allow specific hybridization next to, or even contain, the fusion junction of interest, making them good candidates for binding by the binding agent.
替代性地,該結合劑特異性地與包含有根據SEQ ID NO: 1、SEQ ID NO: 2、SEQ ID NO: 3、SEQ ID NO: 5、SEQ ID NO: 6、SEQ ID NO: 7、SEQ ID NO: 9、SEQ ID NO: 10、SEQ ID NO: 11、SEQ ID NO: 13、SEQ ID NO: 14、SEQ ID NO: 15、SEQ ID NO: 164、SEQ ID NO: 165、SEQ ID NO: 166、SEQ ID NO: 184、SEQ ID NO: 185、SEQ ID NO: 186、SEQ ID NO: 215、SEQ ID NO: 216、SEQ ID NO:217、SEQ ID NO:231、SEQ ID NO:232、SEQ ID NO:233、SEQ ID NO:253、SEQ ID NO:254、SEQ ID NO:255、SEQ ID NO:311、SEQ ID NO:312、SEQ ID NO:313、SEQ ID NO:328、SEQ ID NO:329、SEQ ID NO:330、SEQ ID NO:374、SEQ ID NO:375、SEQ ID NO:376、SEQ ID NO:401、SEQ ID NO:402、SEQ ID NO:403、SEQ ID NO:435、SEQ ID NO:436、SEQ ID NO:437、SEQ ID NO:452、SEQ ID NO:453、SEQ ID NO:454、SEQ ID NO: 484、SEQ ID NO: 485、SEQ ID NO: 486、SEQ ID NO: 526、SEQ ID NO: 527、SEQ ID NO: 528、SEQ ID NO: 603、SEQ ID NO: 604、SEQ ID NO: 605、SEQ ID NO: 633、SEQ ID NO: 634、SEQ ID NO: 635、SEQ ID NO: 691、SEQ ID NO: 692、SEQ ID NO: 693、SEQ ID NO: 715、SEQ ID NO: 716、SEQ ID NO: 717、SEQ ID NO: 741、SEQ ID NO: 742、SEQ ID NO: 743、SEQ ID NO: 759、SEQ ID NO: 760、SEQ ID NO: 761、SEQ ID NO: 763、SEQ ID NO: 764、SEQ ID NO: 765、SEQ ID NO: 822、SEQ ID NO: 823、SEQ ID NO: 824、SEQ ID NO: 826、SEQ ID NO: 827、SEQ ID NO: 828、SEQ ID NO: 866、SEQ ID NO: 867或SEQ ID NO: 868中之任一者之序列的多核苷酸序列。在此替代例的一較佳態樣中,該結合劑較佳地黏合或雜交至一轉接子序列來附接(較佳地以連接方式)至該多核苷酸融合序列。較佳地包含有標記碼或標籤之這類轉接子係附接至該多核苷酸融合物,以在擴增之後可檢測所述融合物。另外地或替代性地,該結合劑包含一基因特異性引子、引子對、或探針,其任一者係結合至一位於融合斷點5’或3’處之核苷酸序列,而可擴增NRG1與VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之接合處。Alternatively, the binding agent is specifically associated with a composition comprising according to SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 215, SEQ ID NO: 216, SEQ ID NO: 217, SEQ ID NO: 231, SEQ ID NO: 232. SEQ ID NO:233, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:311, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:374, SEQ ID NO:375, SEQ ID NO:376, SEQ ID NO:401, SEQ ID NO:402, SEQ ID NO:403, SEQ ID NO:435, SEQ ID NO:436, SEQ ID NO:437, SEQ ID NO:452, SEQ ID NO:453, SEQ ID NO:454, SEQ ID NO:484, SEQ ID NO:485, SEQ ID NO: 486, SEQ ID NO: 526, SEQ ID NO: 527, SEQ ID NO: 528, SEQ ID NO: 603, SEQ ID NO: 604, SEQ ID NO: 605, SEQ ID NO: 633, SEQ ID NO: 634, SEQ ID NO: 635, SEQ ID NO: 691, SEQ ID NO: 692, SEQ ID NO: 693, SEQ ID NO: 715, SEQ ID NO: 716, SEQ ID NO: 717, SEQ ID NO: 741, SEQ ID NO: 742, SEQ ID NO: 743, SEQ ID NO: 759, SEQ ID NO: 760, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 764, SEQ ID NO: 765, SEQ ID NO: 822, SEQ ID NO: 823, SEQ ID NO: 824, SEQ ID NO: 826, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID NO: 866, SEQ ID NO: 867 or SEQ ID NO: 868 A polynucleotide sequence of any sequence. In a preferred aspect of this alternative, the binding agent preferably binds or hybridizes to an adapter sequence to attach (preferably ligate) to the polynucleotide fusion sequence. Such adapters, preferably comprising a marker code or tag, are attached to the polynucleotide fusion so that the fusion is detectable after amplification. Additionally or alternatively, the binding agent comprises a gene-specific primer, pair of primers, or probe, either of which binds to a nucleotide sequence located 5' or 3' to the fusion breakpoint, allowing Amplifies NRG1 with VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or Junction of DSCAML1.
此外,檢驗任何本揭露之多核苷酸融合物是否存在係較佳地包含擴增帶有NRG1融合物或其一部分之多核苷酸,且較佳地使用DNA聚合酶,更佳地使用熱穩定DNA聚合酶或Taq聚合酶。Furthermore, testing for the presence of any polynucleotide fusion of the present disclosure preferably comprises amplifying a polynucleotide bearing an NRG1 fusion or a portion thereof, preferably using a DNA polymerase, more preferably using a thermostable DNA polymerase or Taq polymerase.
較佳地,該結合劑包含一第一引子及一第二引子以擴增包含有NRG1序列與VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1序列之核苷酸序列,進而檢測本文所提及之任一多核苷酸融合物。更佳地,提供了會與對於該融合物之NRG1序列部分具特異性的核苷酸序列雜交或黏合的第一引子,以及會與對於選自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之融合配偶體具特異性的核苷酸序列雜交或黏合的第二引子。較佳地,該第一及第二引子係緊靠融合接合處結合,以擴增包含有NRG1序列的一部分與VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1序列的一部分之序列。該第一或第二引子亦可結合至跨越了該融合接合處的位點。一般而言,經擴增的序列包含多達1000、900、800、700、600、500、400或300個鹼基對。一般而言,經擴增的序列會跨越該NRG1融合接合處。Preferably, the binding agent comprises a first primer and a second primer to amplify the sequence containing NRG1 and VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN , PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or the nucleotide sequence of DSCAML1 sequence, and then detect any polynucleotide fusion mentioned herein. More preferably, there is provided a first primer which will hybridize or bind to a nucleotide sequence specific for the NRG1 sequence portion of the fusion, and which will bind to a nucleotide sequence selected from the group consisting of VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, Fusion partners of CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1 have specific nucleotide sequence hybridization or bonding The second primer. Preferably, the first and second primers are combined close to the fusion junction to amplify a part comprising NRG1 sequence and VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14 , THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or a portion of a DSCAML1 sequence. The first or second primer can also bind to a site spanning the fusion junction. Generally, the amplified sequence comprises up to 1000, 900, 800, 700, 600, 500, 400 or 300 base pairs. Generally, the amplified sequence will span the NRG1 fusion junction.
因此亦提供一種結合劑,其為或包含長度為10-40個核苷酸之核酸探針或引子以用於檢測本揭露之VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1與NRG1之多核苷酸融合物,其中該融合物分別包含SEQ ID NO: 3位置43及44的核酸、SEQ ID NO: 7位置53及54的核酸、SEQ ID NO: 11位置52及53的核酸、SEQ ID NO: 15位置53及54的核酸、SEQ ID NO: 166位置65及66的核酸、SEQ ID NO: 186位置119及120的核酸、SEQ ID NO: 217位置43及44的核酸、SEQ ID NO: 233位置141及142的核酸、SEQ ID NO: 255位置88及89的核酸、SEQ ID NO: 313位置102及103的核酸、SEQ ID NO: 330位置95及96的核酸、SEQ ID NO: 376位置56及57的核酸、SEQ ID NO: 403位置106及107的核酸、SEQ ID NO: 437位置102及103的核酸、或SEQ ID NO: 454位置93及94的核酸、SEQ ID NO: 486位置54及55的核酸、SEQ ID NO: 528位置96及97的核酸、SEQ ID NO: 605位置75及76的核酸、SEQ ID NO: 635位置75及76的核酸、SEQ ID NO: 693位置75及76的核酸、SEQ ID NO: 717位置75及76的核酸、SEQ ID NO: 743位置75及76的核酸、SEQ ID NO: 761位置75及76的核酸、SEQ ID NO: 765位置75及76的核酸、SEQ ID NO: 824位置75及76的核酸、SEQ ID NO: 828位置75及76的核酸、SEQ ID NO: 868位置75及76的核酸。這些位置的核酸界定了NRG1與其融合配偶體之間的斷點或融合接合處。Therefore also provide a kind of binding agent, it is or comprises the nucleic acid probe or primer that is 10-40 nucleotides in length to be used for detecting VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1 of the present disclosure , PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or polynucleotide fusions of DSCAML1 and NRG1, wherein the fusions respectively comprise SEQ ID NO: 3 The nucleic acid of position 43 and 44, the nucleic acid of SEQ ID NO: 7 position 53 and 54, the nucleic acid of SEQ ID NO: 11 position 52 and 53, the nucleic acid of SEQ ID NO: 15 position 53 and 54, the nucleic acid of SEQ ID NO: 166 position 65 The nucleic acid of and 66, the nucleic acid of SEQ ID NO: 186 position 119 and 120, the nucleic acid of SEQ ID NO: 217 position 43 and 44, the nucleic acid of SEQ ID NO: 233 position 141 and 142, the nucleic acid of SEQ ID NO: 255 position 88 and 89 The nucleic acid of, the nucleic acid of SEQ ID NO: 313 positions 102 and 103, the nucleic acid of SEQ ID NO: 330 positions 95 and 96, the nucleic acid of SEQ ID NO: 376 positions 56 and 57, the nucleic acid of SEQ ID NO: 403 positions 106 and 107 , the nucleic acid of SEQ ID NO: 437 positions 102 and 103, or the nucleic acid of SEQ ID NO: 454 positions 93 and 94, the nucleic acid of SEQ ID NO: 486 positions 54 and 55, the nucleic acid of SEQ ID NO: 528 positions 96 and 97, The nucleic acid of SEQ ID NO: 605 positions 75 and 76, the nucleic acid of SEQ ID NO: 635 positions 75 and 76, the nucleic acid of SEQ ID NO: 693 positions 75 and 76, the nucleic acid of SEQ ID NO: 717 positions 75 and 76, SEQ ID NO: NO: the nucleic acid of 743 positions 75 and 76, the nucleic acid of SEQ ID NO: 761 positions 75 and 76, the nucleic acid of SEQ ID NO: 765 positions 75 and 76, the nucleic acid of SEQ ID NO: 824 positions 75 and 76, SEQ ID NO: Nucleic acid at positions 75 and 76 of 828, nucleic acid at positions 75 and 76 of SEQ ID NO: 868. Nucleic acids at these positions define breakpoints or fusion junctions between NRG1 and its fusion partner.
較佳地,本文所提及之所述檢驗係包含擴增或檢測一用於區別該多核苷酸融合物或由其所編碼之多肽是否存在的序列。Preferably, the assay mentioned herein comprises amplifying or detecting a sequence for distinguishing the presence or absence of the polynucleotide fusion or the polypeptide encoded thereby.
在一較佳實施例中,該結合劑為多肽且較佳地藉由流式細胞儀(FC)、螢光原位雜交法(FISH)、免疫細胞化學染色法(ICC)、免疫螢光染色法(IF)來檢測其是否存在。熟習該項技術者可採用多種不同技術,使用FC、IHC、FISH、ICC或IF在樣本中檢測本揭露之NRG1融合物。例如,可使用靶向NRG1的第一抗體以及靶向選自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之NRG1融合配偶體的第二抗體來進行ICC。該第一與第二抗體的訊號共定位,或是使用蛋白大小為基礎的區別測定法來檢測具有期望大小之多肽產物是否存在,可顯示出各別NRG1融合產物是否存在。較佳地,該抗NRG1抗體係靶向NRG1之EGF樣結構域,而該第二抗體係靶向諸如VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之融合配偶體的細胞外結構域。Frithiof等人於OncoTargets and therapy第9卷, 7095-7103, 2016年11月16日中提供了FISH實驗方案的一個例子。另外,可使用兩種不同的螢光檢測系統並結合在螢光顯微鏡中使用兩種不同通道來檢測抗體。基於兩個螢光訊號之重疊或未重疊外觀來說明結果。針對已知結構及序列的多肽來開發抗體,這完全屬於熟習該項技術者的工作範圍內。In a preferred embodiment, the binding agent is a polypeptide and is preferably detected by flow cytometry (FC), fluorescence in situ hybridization (FISH), immunocytochemical staining (ICC), immunofluorescent staining method (IF) to detect its presence. Those skilled in the art can employ a variety of different techniques to detect NRG1 fusions of the present disclosure in samples using FC, IHC, FISH, ICC or IF. For example, a primary antibody targeting NRG1 may be used as well as a target antibody selected from the group consisting of VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, Secondary antibodies to ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT, or the NRG1 fusion partner of DSCAML1 were used for ICC. Colocalization of the signals of the primary and secondary antibodies, or the presence or absence of a polypeptide product of the desired size using a protein size-based discrimination assay, can indicate the presence or absence of a respective NRG1 fusion product. Preferably, the anti-NRG1 antibody targets the EGF-like domain of NRG1, while the second antibody targets such as VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1 , AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or the extracellular domain of the fusion partner of DSCAML1. An example of a FISH protocol is provided by Frithiof et al., OncoTargets and therapy Volume 9, 7095-7103, November 16, 2016. Alternatively, antibodies can be detected using two different fluorescent detection systems combined with two different channels in a fluorescent microscope. Results are illustrated based on the overlapping or non-overlapping appearance of the two fluorescent signals. The development of antibodies against polypeptides of known structure and sequence is well within the purview of those skilled in the art.
較佳地,所要檢測之多核苷酸或多肽融合物係獲自於會表現包含有多核苷酸融合物之NRG1 EGF樣結構域的異常細胞。較佳地,本揭露之任何檢測方法係包含從個體獲得樣本的步驟,以及隨後從該樣本分離出該多核苷酸或其所編碼之多肽的步驟。此外,所述方法較佳地包含從該樣本純化或分離出該核苷酸或多肽的步驟。Preferably, the polynucleotide or polypeptide fusion to be detected is obtained from abnormal cells expressing the NRG1 EGF-like domain comprising the polynucleotide fusion. Preferably, any detection method of the present disclosure comprises the steps of obtaining a sample from an individual, and subsequently isolating the polynucleotide or the polypeptide encoded by it from the sample. Furthermore, the method preferably comprises the step of purifying or isolating the nucleotide or polypeptide from the sample.
較佳地,該樣本係獲自罹患癌症或腫瘤、或是懷疑罹患癌症或腫瘤並且帶有異常細胞的個體。該樣本可為液態檢體樣本,或是採自實體癌或實體瘤的樣本。該實體樣本較佳為經福馬林包埋及石蠟固定之含有異常細胞、一部分癌症或一部分腫瘤中之任一者的樣本。從實體癌或實體瘤採取樣本會嚴重破壞皮膚的物理完整性,可能還有內部器官,以觸及被調查的腫瘤。使用諸如選自血液、血清、血漿、胸腔積液、唾液、尿液、精液、痰、陰道液、羊水、腹膜液、腦脊液、骨髓、無細胞灌洗液或另一生物流體之液態檢體,可以有比這些方法更實惠、更快速、更可靠且侵入性更小的採樣。因此,較佳地,該樣本為液態檢體樣本。 檢測或鑑定 NRG1 融合物之基於抗體的測定法 Preferably, the sample is obtained from an individual suffering from, or suspected of having, cancer or tumor with abnormal cells. The sample can be a liquid sample, or a sample from a solid cancer or a solid tumor. The solid sample is preferably a formalin-embedded and paraffin-fixed sample containing any of abnormal cells, a portion of cancer, or a portion of tumor. Taking a sample from a solid cancer or tumor would severely compromise the physical integrity of the skin, and possibly internal organs, to access the tumor under investigation. Using liquid samples such as blood, serum, plasma, pleural effusion, saliva, urine, semen, sputum, vaginal fluid, amniotic fluid, peritoneal fluid, cerebrospinal fluid, bone marrow, acellular lavage fluid or another biological fluid, There can be more affordable, faster, more reliable and less invasive sampling than these methods. Therefore, preferably, the sample is a liquid sample. Antibody-Based Assays for Detecting or Identifying NRG1 Fusions
亦提供一種以多肽為基礎的檢測測定法以及使用於其中的抗體。該檢測測定法的目標在於檢測本揭露之多肽NRG1融合物,尤其是本揭露之VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1融合物。A polypeptide-based detection assay and antibodies for use therein are also provided. The detection assay is aimed at detecting the polypeptide NRG1 fusions of the present disclosure, in particular VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2 of the present disclosure -NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1 , SLC4A4-NRG1 , ZFAT-NRG1 and DSCAML1-NRG1 fusions.
特別提供了一種適於檢測本文所提到之任何多肽融合物(較佳為本文所提到之任何多核苷酸融合物所編碼之任何多肽融合物)的第一抗體或第一與第二抗體組。熟習該項技術者完全有能力設計、研發及生產能夠且適於與本文所揭露之多肽融合物特異性地結合的抗體。In particular provided is a first antibody or a first and a second antibody suitable for detecting any of the polypeptide fusions mentioned herein, preferably any polypeptide fusion encoded by any of the polynucleotide fusions mentioned herein Group. Those skilled in the art are well within the ability to design, develop and produce antibodies capable and suitable for specifically binding to the polypeptide fusions disclosed herein.
在使用第一抗體而不使用第二抗體的情況下,該第一抗體會特異性地檢測NRG1多肽融合物的獨特表位。這類表位包括跨越了介於NRG1與其融合配偶體之間之多肽融合物的表位。替代性地,在使用第一抗體而不使用第二抗體、並且於一檢測測定法中使用該第一抗體(諸如下文進一步討論)的情況下,所述測定法包括了一個辨別出非融合多肽的步驟(例如基於大小或電荷之分離步驟,諸如CIEX或HPLC)。在使用第一及第二抗體的情況下,該第一抗體會檢測該多肽融合物的一部分(即VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1),而該第二抗體會檢測該多肽融合物中包含有NRG1的部分。Where a primary antibody is used without the use of a secondary antibody, the primary antibody will specifically detect a unique epitope of the NRG1 polypeptide fusion. Such epitopes include epitopes that span the polypeptide fusion between NRG1 and its fusion partner. Alternatively, where the primary antibody is used without the secondary antibody, and the primary antibody is used in a detection assay (such as discussed further below), the assay includes a recognition of the non-fusion polypeptide steps (eg size or charge based separation steps such as CIEX or HPLC). Where primary and secondary antibodies are used, the primary antibody will detect a portion of the polypeptide fusion (i.e., VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT, or DSCAML1), and the secondary antibody detects the portion of the polypeptide fusion that includes NRG1.
更尤其是本揭露提供一種第一抗體或第一及第二抗體組供檢測由一多核苷酸融合物所編碼之多肽,且該多核苷酸融合物包含了一編碼VAPB(或VAPB之對偶基因變異體)蛋白序列之核酸與一編碼NRG1(或NRG1之對偶基因變異體)蛋白序列之核酸融合。More particularly, the present disclosure provides a first antibody or set of first and second antibodies for detecting a polypeptide encoded by a polynucleotide fusion comprising a VAPB (or VAPB's counterpart) Gene variant) protein sequence nucleic acid is fused to a nucleic acid encoding NRG1 (or allele variant of NRG1) protein sequence.
較佳地,該第一抗體、或第一與第二抗體組,係用於檢測包含有VAPB外顯子1或外顯子1之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含有用於檢測涉及VAPB及NRG1之多核苷酸融合物所編碼之多肽的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合VAPB-NRG1多肽融合物,而該第一與第二抗體組較佳地係結合VAPB及NRG1。Preferably, the first antibody, or the first and second antibody sets, is used to detect a part of the polypeptide encoded by VAPB exon 1 or an allele variant of exon 1 and NRG1 exon Polypeptide fusions in which a portion of a polypeptide encoded by an allele variant of exon 2 or exon 2 is fused. Also provided is a detection assay comprising a first antibody or a set of first and second antibodies for detecting the presence of a polypeptide encoded by a polynucleotide fusion involving VAPB and NRG1, wherein the first antibody preferably binds VAPB-NRG1 polypeptide fusion, and the first and second antibody sets preferably bind VAPB and NRG1.
亦提供一種第一抗體、或第一與第二抗體組,用於檢測包含有CADM1外顯子7或外顯子7之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CADM1外顯子7或外顯子7之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CADM1-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CADM1及NRG1。Also provided is a first antibody, or a set of first and second antibodies, for detecting a portion of a polypeptide comprising CADM1 exon 7 or an allele variant of exon 7 and NRG1 exon 6 or exon 7 A polypeptide fusion to which a portion of the polypeptide encoded by the allele variant of exon 6 is fused. Also provided is a method for detecting a part of the polypeptide encoded by CADM1 exon 7 or an allele variant of exon 7 and NRG1 exon 6 or an allele variant of exon 6 An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CADM1-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably bind CADM1 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有CD44外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CD44外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CD44-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CD44及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising CD44 exon 5 or an allele variant of exon 5 and NRG1 exon 2 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of the polypeptide encoded by CD44 exon 5 or an allele variant of exon 5 and an allele variant of NRG1 exon 2 or exon 2. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CD44-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds CD44 and NRG1 respectively.
亦提供一種第一抗體、或第一與第二抗體組,用於檢測包含有CD44外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CD44外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CD44-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CD44及NRG1。Also provided is a first antibody, or a set of first and second antibodies, for detecting a portion of a polypeptide comprising CD44 exon 5 or an allele variant of exon 5 and NRG1 exon 6 or exon 5 A polypeptide fusion to which a portion of the polypeptide encoded by the allele variant of exon 6 is fused. Also provided is a method comprising a part of the polypeptide encoded by CD44 exon 5 or the allele variant of exon 5 and NRG1 exon 6 or the allele variant of exon 6. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CD44-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds CD44 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有SLC3A2之轉錄本6之外顯子1或外顯子1之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有所述SLC3A2外顯子1或外顯子1之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子5或外顯子5之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合SLC3A2轉錄本6之SLC3A2-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合所述SLC3A2及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising exon 1 of SLC3A2 transcript 6 or an allele variant of exon 1 and NRG1 exon Polypeptide fusions in which a portion of a polypeptide encoded by exon 5 or an allogenic variant of exon 5 is fused. Also provided is a method comprising a part of the polypeptide encoded by the SLC3A2 exon 1 or the allele variant of exon 1 and the allele variant of NRG1 exon 5 or exon 5. Assay for the detection of the presence of a first antibody or a set of first and second antibody fusions to which a portion of the encoded polypeptide is fused, wherein the first antibody is preferably fused to a SLC3A2-NRG1 polypeptide that binds SLC3A2 transcript 6 The first and second antibody groups preferably bind to said SLC3A2 and NRG1, respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有VTCN1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有VTCN1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合VTCN1-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合VTCN1及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide encoded by an allele variant comprising VTCN1 exon 2 or exon 2 and NRG1 exon 2 or exon 2 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of the polypeptide encoded by VTCN1 exon 2 or an allele variant of exon 2 and an allele variant of NRG1 exon 2 or exon 2. A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the VTCN1-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds to VTCN1 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有CDH1外顯子11或外顯子11之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CDH1外顯子11或外顯子11之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CDH1-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CDH1及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising exon 11 of CDH1 or an allele variant of exon 11 and exon 2 or exon 2 of NRG1 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of the polypeptide encoded by CDH1 exon 11 or an allele variant of exon 11 and an allele variant of NRG1 exon 2 or exon 2. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of a polypeptide is fused, wherein the first antibody preferably binds to the CDH1-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds CDH1 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有CXADR外顯子1或外顯子1之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CXADR外顯子1或外顯子1之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CXADR-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CXADR及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising CXADR exon 1 or an allele variant of exon 1 and NRG1 exon 2 or exon 1 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of a polypeptide encoded by CXADR exon 1 or an allele variant of exon 1 and an allele variant of NRG1 exon 2 or exon 2. A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CXADR-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably binds to CXADR and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有GFT2E2外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有GFT2E2外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合GFT2E2-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合GFT2E2及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising GFT2E2 exon 2 or an allele variant of exon 2 and NRG1 exon 2 or exon 2 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of a polypeptide encoded by GFT2E2 exon 2 or an allele variant of exon 2 and an allele variant of NRG1 exon 2 or exon 2. A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the GFT2E2-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably binds GFT2E2 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有CSMD1外顯子23或外顯子23之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CSMD1外顯子23或外顯子23之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CSMD1-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CSMD1及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising CSMD1 exon 23 or an allogenic variant of exon 23 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by CSMD1 exon 23 or an allele variant of exon 23 and an allele variant of NRG1 exon 6 or exon 6 A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CSMD1-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably bind CSMD1 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有PTN外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有PTN外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合PTN-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合PTN及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising PTN exon 4 or an allele variant of exon 4 and NRG1 exon 2 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of a polypeptide encoded by PTN exon 4 or an allele variant of exon 4 and an allele variant of NRG1 exon 2 or exon 2. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the PTN-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds PTN and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有ST14外顯子11或外顯子11之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有ST14外顯子11或外顯子11之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合ST14-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合ST14及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising ST14 exon 11 or an allogenic variant of exon 11 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by ST14 exon 11 or an allele variant of exon 11 and an allele variant of NRG1 exon 6 or exon 6 An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of a polypeptide is fused, wherein the first antibody preferably binds to the ST14-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably bind ST14 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有THBS1外顯子9或外顯子9之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有THBS1外顯子9或外顯子9之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合THBS1-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合THBS1及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising THBS1 exon 9 or an allele variant of exon 9 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by THBS1 exon 9 or an allele variant encoded by exon 9 and an allele variant encoded by NRG1 exon 6 or exon 6 An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of a polypeptide is fused, wherein the first antibody preferably binds to the THBS1-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds THBS1 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有AGRN外顯子12或外顯子12之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有AGRN外顯子12或外顯子12之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合AGRN-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合AGRN及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising AGRN exon 12 or an allogenic variant of exon 12 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by AGRN exon 12 or an allele variant of exon 12 and an allele variant of NRG1 exon 6 or exon 6 An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the AGRN-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds to AGRN and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有PVALB外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有PVALB外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合PVALB-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合PVALB及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising PVALB exon 4 or an allele variant of exon 4 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by PVALB exon 4 or an allele variant encoded by exon 4 and an allele variant encoded by NRG1 exon 6 or exon 6 A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the PVALB-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds PVALB and NRG1 respectively.
亦提供一種第一抗體、或第一與第二抗體組,用於檢測包含有SLC3A2之轉錄本3之外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有所述SLC3A2外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合轉錄本3之SLC3A2-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合所述SLC3A2及NRG1。Also provided is a first antibody, or a set of first and second antibodies, for detecting a portion of a polypeptide comprising exon 2 of transcript 3 of SLC3A2 or an allele variant of exon 2 and NRG1 exon Polypeptide fusions to which a portion of the polypeptide encoded by exon 6 or an allogenic variant of exon 6 is fused. Also provided is a method for detecting a part of the polypeptide encoded by the SLC3A2 exon 2 or the allele variant of exon 2 and NRG1 exon 6 or the allele variant of exon 6. Assay for the detection of the presence of a primary antibody or a set of primary and secondary antibodies to which a portion of the encoded polypeptide is fused, wherein the primary antibody is preferably a SLC3A2-NRG1 polypeptide fusion that binds transcript 3 , and the first and second antibody sets preferably bind to said SLC3A2 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有APP外顯子14或外顯子14之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有APP外顯子14或外顯子14之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合APP-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合APP及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising APP exon 14 or an allele variant of exon 14 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by exon 14 of APP or an allele variant of exon 14 and an allele variant of NRG1 exon 6 or exon 6. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the APP-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds APP and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有WRN外顯子33或外顯子33之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有WRN外顯子33或外顯子33之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合WRN-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合WRN及NRG1。Also provided is a primary antibody or a set of primary and secondary antibodies for detecting a portion of a polypeptide comprising WRN exon 33 or an allogenic variant of exon 33 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by WRN exon 33 or an allele variant of exon 33 and an allele variant of NRG1 exon 6 or exon 6 A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the WRN-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds to WRN and NRG1 respectively.
本揭露亦提供一種第一抗體或第一及第二抗體組供檢測由一多核苷酸融合物所編碼之多肽,且該多核苷酸融合物包含了一編碼ASPH(或ASPH之對偶基因變異體)蛋白序列之核酸與一編碼NRG1(或NRG1之對偶基因變異體)蛋白序列之核酸融合。The present disclosure also provides a first antibody or a set of first and second antibodies for detecting a polypeptide encoded by a polynucleotide fusion comprising an allele variant encoding ASPH (or ASPH) The nucleic acid of the protein sequence of NRG1 is fused with a nucleic acid encoding the protein sequence of NRG1 (or an allele variant of NRG1).
更特別的是提供了一種用於檢測多肽融合物的第一抗體或第一與第二抗體組,且該多肽融合物包含一部分由ASPH之外顯子22或外顯子22之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子2或外顯子2之對偶基因變異體所編碼的多肽融合。也提供了一種包含有第一抗體或第一與第二抗體組之檢測測定法供檢測是否存在有包含了一部分由ASPH之外顯子22或外顯子22之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子2或外顯子2之對偶基因變異體所編碼的多肽融合之多肽融合物,其中該第一抗體較佳地結合該ASPH-NRG1多肽融合物,而該第一與第二抗體組分別較佳地結合ASPH與NRG1。More particularly, there is provided a first antibody or a set of first and second antibodies for detecting a polypeptide fusion comprising a portion of ASPH exon 22 or an allele variant of exon 22 The encoded polypeptide is fused to a portion of the polypeptide encoded by exon 2 of NRG1 or an allele variant of exon 2. Also provided is a detection assay comprising a first antibody or a set of first and second antibodies for detecting the presence of a polypeptide comprising a portion encoded by ASPH exon 22 or an allele variant of exon 22 A polypeptide fusion fused to a portion of a polypeptide encoded by exon 2 of NRG1 or an allogenic variant of exon 2, wherein the first antibody preferably binds to the ASPH-NRG1 polypeptide fusion, and the first antibody Preferably bind ASPH and NRG1 respectively with the second antibody set.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有NOTCH2外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有NOTCH2外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合NOTCH2-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合NOTCH2及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising exon 6 of NOTCH2 or an allele variant of exon 6 and exon 6 or exon 6 of NRG1 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by exon 6 of NOTCH2 or an allele variant of exon 6 and an allele variant of NRG1 exon 6 or exon 6 An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of a polypeptide is fused, wherein the first antibody preferably binds to the NOTCH2-NRG1 polypeptide fusion, and the first and second antibodies The second set of antibodies preferably bind NOTCH2 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有CD74外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有CD74外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合CD74-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合CD74及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising CD74 exon 2 or an allele variant of exon 2 and NRG1 exon 2 or exon 2 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of a polypeptide encoded by CD74 exon 2 or an allele variant of exon 2 and an allele variant of NRG1 exon 2 or exon 2. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the CD74-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds CD74 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有SDC4外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有SDC4外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合SDC4-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合SDC4及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising SDC4 exon 2 or an allele variant of exon 2 and NRG1 exon 2 or exon 2 A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of a polypeptide encoded by SDC4 exon 2 or an allele variant of exon 2 and an allele variant of NRG1 exon 2 or exon 2. A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the SDC4-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds to SDC4 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有SDC4外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有SDC4外顯子4或外顯子4之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子2或外顯子2之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合SDC4-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合SDC4及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising SDC4 exon 4 or an allele variant of exon 4 and NRG1 exon 2 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of Son 2 is fused. Also provided is a method comprising a part of the polypeptide encoded by SDC4 exon 4 or an allele variant of exon 4 and an allele variant of NRG1 exon 2 or exon 2. A detection assay for the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the SDC4-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds to SDC4 and NRG1 respectively.
亦提供一種第一抗體或第一與第二抗體組,用於檢測包含有SLC4A4外顯子14或外顯子14之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物。亦提供一種包含了用於檢測包含有SLC4A4外顯子14或外顯子14之對偶基因變異體所編碼的多肽的一部分與NRG1外顯子6或外顯子6之對偶基因變異體所編碼的多肽的一部分所融合之多肽融合物的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體較佳地係結合SLC4A4-NRG1多肽融合物,而該第一與第二抗體組較佳地係分別結合SLC4A4及NRG1。Also provided is a first antibody or a set of first and second antibodies for detecting a portion of a polypeptide comprising SLC4A4 exon 14 or an allogenic variant of exon 14 and NRG1 exon 6 or exon A polypeptide fusion in which a portion of the polypeptide encoded by the allele variant of sub6 is fused. Also provided is a method comprising a part of a polypeptide encoded by exon 14 of SLC4A4 or an allele variant of exon 14 and an allele variant of NRG1 exon 6 or exon 6. An assay for the detection of the presence of a first antibody or a set of first and second antibodies to which a portion of the polypeptide is fused, wherein the first antibody preferably binds to the SLC4A4-NRG1 polypeptide fusion, and the first and second antibodies The second antibody set preferably binds SLC4A4 and NRG1 respectively.
本揭露亦提供一種第一抗體或第一及第二抗體組供檢測由一多核苷酸融合物所編碼之多肽,且該多核苷酸融合物包含了一編碼ZFAT(或ZFAT之對偶基因變異體)蛋白序列之核酸與一編碼NRG1(或NRG1之對偶基因變異體)蛋白序列之核酸融合。The disclosure also provides a first antibody or a set of first and second antibodies for detecting a polypeptide encoded by a polynucleotide fusion comprising an allele variant encoding ZFAT (or ZFAT) The nucleic acid of the protein sequence of NRG1 is fused with a nucleic acid encoding the protein sequence of NRG1 (or an allele variant of NRG1).
更特別的是提供了一種用於檢測多肽融合物的第一抗體或第一與第二抗體組,且該多肽融合物包含一部分由ZFAT之外顯子12或外顯子12之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子6或外顯子6之對偶基因變異體所編碼的多肽融合。也提供了一種包含有第一抗體或第一與第二抗體組之檢測測定法供檢測是否存在有包含了一部分由ZFAT之外顯子12或外顯子12之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子6或外顯子6之對偶基因變異體所編碼的多肽融合之多肽融合物,其中該第一抗體較佳地結合該ZFAT-NRG1多肽融合物,而該第一與第二抗體組分別較佳地結合ZFAT與NRG1。More particularly, there is provided a first antibody or a set of first and second antibodies for detecting a polypeptide fusion comprising a portion of ZFAT exon 12 or an allele variant of exon 12 The encoded polypeptide is fused to a portion of the polypeptide encoded by exon 6 of NRG1 or an allele variant of exon 6. Also provided is a detection assay comprising a first antibody or a set of first and second antibodies for detecting the presence of a polypeptide comprising a portion encoded by ZFAT exon 12 or an allele variant of exon 12 A polypeptide fusion fused to a portion of a polypeptide encoded by NRG1 exon 6 or an allogenic variant of exon 6, wherein the first antibody preferably binds to the ZFAT-NRG1 polypeptide fusion, and the first antibody Preferably bind ZFAT and NRG1 respectively with the second antibody set.
本揭露亦提供一種第一抗體或第一及第二抗體組供檢測由一多核苷酸融合物所編碼之多肽,且該多核苷酸融合物包含了一編碼DSCAML1(或DSCAML1之對偶基因變異體)蛋白序列之核酸與一編碼NRG1(或NRG1之對偶基因變異體)蛋白序列之核酸融合。The present disclosure also provides a first antibody or a set of first and second antibodies for detecting a polypeptide encoded by a polynucleotide fusion comprising an allele variant encoding DSCAML1 (or DSCAML1 The nucleic acid of the protein sequence of NRG1 is fused with a nucleic acid encoding the protein sequence of NRG1 (or an allele variant of NRG1).
更特別的是提供了一種用於檢測多肽融合物的第一抗體或第一與第二抗體組,且該多肽融合物包含一部分由DSCAML1之外顯子3或外顯子3之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子或外顯子2之對偶基因變異體所編碼的多肽融合。也提供了一種包含有第一抗體或第一與第二抗體組之檢測測定法供檢測是否存在有包含了一部分由DSCAML1之外顯子3或外顯子3之對偶基因變異體所編碼的多肽與一部分由NRG1之外顯子2或外顯子2之對偶基因變異體所編碼的多肽融合之多肽融合物,其中該第一抗體較佳地結合該DSCAML1-NRG1多肽融合物,而該第一與第二抗體組分別較佳地結合DSCAML1與NRG1。More particularly, there is provided a first antibody or a set of first and second antibodies for detecting a polypeptide fusion comprising a portion of DSCAML1 exon 3 or an allele variant of exon 3 The encoded polypeptide is fused to a portion of the polypeptide encoded by an allele variant of exon or exon 2 of NRG1. Also provided is a detection assay comprising a first antibody or a set of first and second antibodies for detecting the presence of a polypeptide comprising a portion encoded by exon 3 or an allele variant of exon 3 of DSCAML1 A polypeptide fusion fused to a portion of a polypeptide encoded by NRG1 exon 2 or an allogenic variant of exon 2, wherein the first antibody preferably binds to the DSCAML1-NRG1 polypeptide fusion, and the first antibody Preferably bind DSCAML1 and NRG1 respectively with the second antibody panel.
較佳地,該抗NRG1抗體較佳地與NRG1之EGF樣結構域結合。在本揭露之多肽融合物中檢測EGF樣結構域的存在是有好處的,因為其顯示出EGF樣結構域被轉譯及因而與作為融合配偶體之VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1符合讀框的融合部分。Preferably, the anti-NRG1 antibody preferably binds to the EGF-like domain of NRG1. Detection of the presence of the EGF-like domain in the polypeptide fusions of the present disclosure is advantageous because it shows that the EGF-like domain is translated and thus associated with VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, In-frame fusion portions of CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT, or DSCAML1.
本文所提及之抗NRG1、VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1的抗體係可使用在流式細胞儀(FC)、螢光原位雜交法(FISH)、免疫細胞化學染色法(ICC)、免疫組織化學染色法(IHC)或免疫螢光染色法(IF)之中。 治療方法 Anti-NRG1, VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, ASPH, NOTCH2, CD74, SDC4, SLC4A4, Antibodies against ZFAT or DSCAML1 can be used in flow cytometry (FC), fluorescence in situ hybridization (FISH), immunocytochemical staining (ICC), immunohistochemical staining (IHC) or immunofluorescence staining (IF). treatment method
本揭露之融合物存在於具有異常細胞之人類患者中或已於其中鑑定出,尤其是以診斷患有癌症之患者。本揭露之癌症尤其為胰臟癌,更尤其是胰臟導管腺癌、肉瘤、膀胱癌、大腸癌、直腸癌、大腸直腸癌、膽囊癌、頭頸癌、前列腺癌、子宮癌、乳癌、卵巢癌、肝癌、子宮內膜癌、肺癌,尤其是非小細胞肺癌、或浸潤性黏液腺癌。Fusions of the present disclosure are present or have been identified in human patients with abnormal cells, especially patients diagnosed with cancer. The cancer of the present disclosure is especially pancreatic cancer, more particularly pancreatic ductal adenocarcinoma, sarcoma, bladder cancer, colorectal cancer, rectal cancer, colorectal cancer, gallbladder cancer, head and neck cancer, prostate cancer, uterine cancer, breast cancer, ovarian cancer , liver cancer, endometrial cancer, lung cancer, especially non-small cell lung cancer, or invasive mucinous adenocarcinoma.
一般而言,本揭露亦提供治療帶有像癌症、腫瘤之異常細胞之個體的方法,其中所述癌症、腫瘤或異常細胞,其細胞、癌症或腫瘤係包含本揭露之NRG1多核苷酸融合物或表現如本揭露之NRG1多肽融合物。In general, the present disclosure also provides methods of treating individuals with abnormal cells like cancers, tumors, wherein the cancers, tumors or abnormal cells, the cells, cancers or tumors comprise the NRG1 polynucleotide fusions of the present disclosure Or express an NRG1 polypeptide fusion as disclosed herein.
目前的NRG1融合物已確定為一個稱為eNRGy之臨床試驗及早期准入計畫(Early Access Program, 分別為NCT02912949及NCT04100694)(稱為)的一部分。截至2022年4月12日之截止日期,在整個融合配偶體及多種不同癌症類型中,根據RECIST 1.1標準觀察到總共約有70%之已登記且未被排除之經鑑定有NRG1融合物的患者表現出調查員評估反應。在表1的每個類別中都觀察到了調查員評估反應。觀察到反應的適應症包括胰腺導管腺癌、非小細胞肺癌、乳癌及膽管癌。
NRG1 融合配偶體的盛行率
較佳地,該異常細胞、癌(或癌細胞)或腫瘤(或腫瘤細胞)係包含本揭露之多核苷酸融合物,且該多核苷酸融合物進一步包含符合讀框的融合一編碼NRG1之EGF樣結構域的編碼序列。Preferably, the abnormal cell, cancer (or cancer cell) or tumor (or tumor cell) comprises the polynucleotide fusion of the present disclosure, and the polynucleotide fusion further comprises an in-frame fusion-coding NRG1 Coding sequence for the EGF-like domain.
本揭露亦提供一種治療患有包含了本文所提及之NRG1多核苷酸融合物或表現了本文所提及之NRG1多肽融合物的ErbB-2及/或ErbB-3陽性異常細胞、癌細胞或腫瘤細胞之個體的方法,所述方法包含檢測本文所提及之任一多核苷酸或多肽融合物是否存在,隨後向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑,藉以治療該個體之癌症。存在本揭露之NRG1融合物中之任一者時,無論是多核苷酸或由其轉譯的多肽,都表示了癌症。The present disclosure also provides a treatment for ErbB-2 and/or ErbB-3 positive abnormal cells, cancer cells or A method for an individual with tumor cells, said method comprising detecting the presence or absence of any of the polynucleotide or polypeptide fusions mentioned herein, and then administering to the individual an effective amount of ErbB-2 and/or ErbB-3 targets agent to treat the individual's cancer. The presence of any of the NRG1 fusions of the present disclosure, either as polynucleotides or polypeptides translated therefrom, is indicative of cancer.
亦提供一種抑制患有包含了本文所提及之NRG1多核苷酸融合物或表現了本文所提及之NRG1多肽融合物的ErbB-2及ErbB-3陽性癌症之個體病程的方法,所述方法包含檢測本文所提及之任一多核苷酸或多肽融合物是否存在,隨後向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑。Also provided is a method of inhibiting the course of disease in a subject having an ErbB-2 and ErbB-3 positive cancer comprising or expressing an NRG1 polynucleotide fusion mentioned herein, said method It comprises detecting the presence of any one of the polynucleotide or polypeptide fusions mentioned herein, and then administering to the individual an effective amount of an ErbB-2 and/or ErbB-3 targeting agent.
亦提供一種供使用於患有包含了本文所提及之NRG1多核苷酸融合物或表現了本文所提及之NRG1多肽融合物的ErbB-2及ErbB-3陽性癌症之個體的治療中的ErbB-2及/或ErbB-3標靶劑,所述治療包含檢測本文所提及之任一多核苷酸或多肽融合物是否存在,隨後向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑。Also provided is an ErbB for use in the treatment of an individual with an ErbB-2 and ErbB-3 positive cancer comprising an NRG1 polynucleotide fusion mentioned herein or expressing an NRG1 polypeptide fusion mentioned herein -2 and/or an ErbB-3 targeting agent, the treatment comprising detecting the presence of any polynucleotide or polypeptide fusion mentioned herein, then administering to the individual an effective amount of ErbB-2 and/or ErbB-3 targeting agent.
亦提供一種供使用於帶有包含了本文所提及之NRG1多核苷酸融合物或表現了本文所提及之NRG1多肽融合物的ErbB-2及ErbB-3陽性癌症之個體的治療藥物的製造中之ErbB-2及/或ErbB-3標靶劑,所述治療包含檢測本文所提及之任一多核苷酸或多肽融合物是否存在,隨後向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑。Also provided is the manufacture of a therapeutic medicament for use in an individual with an ErbB-2 and ErbB-3 positive cancer comprising an NRG1 polynucleotide fusion mentioned herein or expressing an NRG1 polypeptide fusion mentioned herein An ErbB-2 and/or ErbB-3 targeting agent in which the treatment comprises detecting the presence of any polynucleotide or polypeptide fusion mentioned herein, and then administering an effective amount of ErbB-2 to the individual and/or an ErbB-3 targeting agent.
再進一步提供了一種評估個體是否罹患癌症或是易於罹患癌症的方法,該方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在本文所提及之NRG1多核苷酸融合物或NRG1多肽融合物,且藉由鑑定所述融合物是否存在來評估所述個體罹患所述癌症或是易於罹患所述癌症。該檢驗較佳地包含利用一結合劑特異性地結合所述多核苷酸或是利用一結合劑與包含有所述多核苷酸融合物之多核苷酸結合來檢測該融合物。有關於與包含有所述多核苷酸融合物之多核苷酸結合的結合劑的更多細節詳見下述。Still further provided is a method of assessing whether an individual suffers from cancer or is prone to developing cancer, the method comprising testing a sample obtained from an individual to detect the presence of the NRG1 polynucleotide fusion or NRG1 polypeptide fusion mentioned herein in the sample and assessing that the individual suffers from the cancer or is predisposed to developing the cancer by identifying whether the fusion is present. The assay preferably comprises detecting the fusion using a binding agent that specifically binds to the polynucleotide or using a binding agent that binds to a polynucleotide comprising the polynucleotide fusion. Further details regarding binding agents that bind polynucleotides comprising such polynucleotide fusions are set forth below.
較佳地,本文所提及之診斷方法係包含將樣本中之NRG1融合物的檢測與表現本文所提及之NRG1多肽的癌症聯繫在一起的步驟,以診斷該個體患有表現了所述NRG1多肽融合物的癌症。NRG1融合分子的檢測使得可鑑定一個體患有表現了該NRG1融合多肽的癌症,且可診斷所述個體有這類癌症。對癌症的潛在致癌性驅動因素有所了解使得可選擇及指定改進或更好的特制治療。因此,檢測個體樣本中的融合分子,諸如融合多肽或多核苷酸,使得可改善患有NRG1相關癌症之個體的存活機會。Preferably, the diagnostic methods mentioned herein comprise the step of correlating the detection of NRG1 fusions in a sample with cancers expressing the NRG1 polypeptides mentioned herein, to diagnose that the individual suffers from the expression of said NRG1 Cancer of polypeptide fusions. Detection of NRG1 fusion molecules allows identification of an individual with a cancer expressing the NRG1 fusion polypeptide, and diagnosis of such cancer in the individual. Knowledge of the underlying oncogenic drivers of cancer allows for the selection and prescribing of improved or better tailored treatments. Thus, detection of fusion molecules, such as fusion polypeptides or polynucleotides, in a sample of an individual makes it possible to improve the chances of survival of an individual suffering from NRG1-associated cancer.
本揭露因此提供一種治療癌症的方法。該癌症較佳地係為復發性癌症或轉移性癌症。復發一般係指局部復發,且意指該癌症與原來的癌症在同一個地方或非常接近。當腫瘤已遷移到原來癌症附近的淋巴結或組織,或是擴散到遠離原來癌症更遠的器官或組織時,一般會說該腫瘤是轉移性腫瘤。在這類情況下,兩種表示皆可使用。The present disclosure thus provides a method of treating cancer. The cancer is preferably a recurrent cancer or a metastatic cancer. Recurrence generally refers to a local recurrence and means that the cancer is in the same place or very close to the original cancer. A tumor is generally said to be metastatic when it has migrated to lymph nodes or tissues near the original cancer, or spread to organs or tissues further away from the original cancer. In such cases, both representations can be used.
該癌症尤其為胰臟癌、胰臟腺癌、胰臟導管腺癌、肉瘤、膀胱癌、大腸直腸癌、膽囊癌、頭頸癌、前列腺癌、子宮癌、乳癌、卵巢癌、肝癌、子宮內膜癌、諸如非小細胞肺癌之肺癌,尤其是非小細胞肺癌,更尤其是浸潤性黏液腺癌。較佳地,該腫瘤基因體顯示出在選自由EGFR、KRAS、cKIT-BRCA1-2、MET、ROS、RET、ALK、較佳地KRAS所組成之群組的一或多個基因中是否存在突變。The cancer is especially pancreatic cancer, pancreatic adenocarcinoma, pancreatic ductal adenocarcinoma, sarcoma, bladder cancer, colorectal cancer, gallbladder cancer, head and neck cancer, prostate cancer, uterine cancer, breast cancer, ovarian cancer, liver cancer, endometrial cancer Carcinoma, lung cancer such as non-small cell lung cancer, especially non-small cell lung cancer, more especially invasive mucinous adenocarcinoma. Preferably, the tumor gene body shows the presence or absence of mutations in one or more genes selected from the group consisting of EGFR, KRAS, cKIT-BRCA1-2, MET, ROS, RET, ALK, preferably KRAS .
本揭露制定了ErbB-2及/或ErbB-3標靶劑用以治療會表現NRG1融合多肽的癌症之用途。所述,標靶劑係選自由以下所組成之群組:包含一結合ErbB-2之胞外部分的第一抗原結合位點及一結合ErbB-3之胞外部分的第二抗原結合位點之多特異性抗體、ErbB-2之酪胺酸激酶抑制劑、包含有一結合ErbB-2之胞外部分的抗原結合位點之單特異性雙價抗體、包含有一結合ErbB-3之胞外部分的抗原結合位點之單特異性雙價抗體、或其任何組合。The present disclosure contemplates the use of ErbB-2 and/or ErbB-3 targeting agents for the treatment of cancers expressing NRG1 fusion polypeptides. The targeting agent is selected from the group consisting of a first antigen binding site that binds the extracellular portion of ErbB-2 and a second antigen binding site that binds the extracellular portion of ErbB-3 Multispecific antibody, tyrosine kinase inhibitor of ErbB-2, monospecific bivalent antibody comprising an antigen-binding site binding to the extracellular part of ErbB-2, comprising an extracellular part binding to ErbB-3 A monospecific diabody with an antigen-binding site, or any combination thereof.
還包含投與一或多種選自以下組成之群的化合物以治療癌症:抑制PI3激酶路徑中之組分的抑制劑、抑制MAPK路徑中之組分的抑制劑、微管破壞藥物、及組蛋白去乙醯酶(HDAC)的抑制劑。所述抑制劑較佳地包含酪胺酸激酶抑制劑、PI3Ka抑制劑、Akt抑制劑、mTOR抑制劑或Src抑制劑。所述酪胺酸激酶抑制劑較佳為阿法替尼(afatinib)、拉帕替尼(lapatinib)及/或來那替尼(neratinib)。所述PI3Ka抑制劑較佳為BYL719。在一實施例中,所述Akt抑制劑為MK-2206。在一較佳實施例中,所述mTOR抑制劑為維莫司(everolimus)。在一較佳實施例中,所述Src抑制劑為塞卡替尼(saracatinib)。在揭示之一實施例中,所述微管破壞藥物為紫杉醇。在一較佳實施例中,所述HDAC抑制劑為伏立諾他(vorinostat)。在一實施例中,所述對ErbB 2及ErbB 3具有專一性的結合化合物為MM 111。Also comprising administering one or more compounds selected from the group consisting of inhibitors that inhibit components in the PI3 kinase pathway, inhibitors that inhibit components in the MAPK pathway, microtubule disrupting drugs, and histones for the treatment of cancer Inhibitors of deacetylase (HDAC). The inhibitors preferably comprise tyrosine kinase inhibitors, PI3Ka inhibitors, Akt inhibitors, mTOR inhibitors or Src inhibitors. The tyrosine kinase inhibitor is preferably afatinib, lapatinib and/or neratinib. The PI3Ka inhibitor is preferably BYL719. In one embodiment, the Akt inhibitor is MK-2206. In a preferred embodiment, the mTOR inhibitor is everolimus. In a preferred embodiment, the Src inhibitor is saracatinib. In one disclosed embodiment, the microtubule disrupting drug is paclitaxel. In a preferred embodiment, the HDAC inhibitor is vorinostat. In one embodiment, the binding compound specific to ErbB 2 and ErbB 3 is MM 111 .
該ErbB-2標靶劑較佳為帶有單特異性雙價抗體且其包含有結合ErbB-2的細胞外部分的抗原結合位點。這類抗體較佳為曲妥珠單抗、帕妥珠單抗(pertuzumab)或曲妥珠單抗艾坦辛(trastuzumab-emtansine)。ErbB-2的靶向治療較佳為ErbB-2 TKI。該ErbB-2 TKI較佳為拉帕替尼、卡奈替尼(canertinib)、來那替尼、圖卡替尼(tucatinib)(或厄濱替尼(irbinitinib))、CP-724714、達洛西替尼(tarloxitinib)、穆布里替尼(mubritinib)、阿法替尼、伐利替尼(varlitinib)及達科米替尼(dacomitinib)中之一或多者,較佳為阿法替尼。ErbB-2 TKI也可能影響ErbB-1信號傳導,但與ErbB-1 TKI的不同之處在於它對ErbB-2具有顯著的活性。ErbB-3靶向治療較佳為帶有單特異性雙價抗體且其包含有結合ErbB-3的細胞外部分的抗原結合位點。這類抗體較佳為帕利珠單抗(patritumab)、絲立班單抗(seribantumab)、魯木魯珠單抗(lumretuzumab)、埃爾格木單抗(elgemtumab)、GSK2849330、KTN3379或AV-203。The ErbB-2 targeting agent is preferably a monospecific diabody comprising an antigen binding site that binds the extracellular portion of ErbB-2. Such antibodies are preferably trastuzumab, pertuzumab or trastuzumab-emtansine. ErbB-2 targeted therapy is preferably ErbB-2 TKI. The ErbB-2 TKI is preferably lapatinib, canertinib, neratinib, tucatinib (or irbinitinib), CP-724714, Dalo One or more of tarloxitinib, mubritinib, afatinib, varlitinib and dacomitinib, preferably afatinib Ni. ErbB-2 TKIs may also affect ErbB-1 signaling, but differ from ErbB-1 TKIs in that they have significant activity against ErbB-2. ErbB-3 targeted therapy is preferably with a monospecific diabody comprising an antigen binding site that binds the extracellular portion of ErbB-3. Such antibodies are preferably patritumab, seribantumab, lumretuzumab, elgemtumab, GSK2849330, KTN3379 or AV-203 .
較佳地,該包含有結合ErbB-3的細胞外部分的抗原結合位點之單特異性雙價抗體係包含帕利珠單抗、絲立班單抗、魯木魯珠單抗、埃爾格木單抗、GSK2849330、KTN3379或AV-203。Preferably, the monospecific bivalent antibody system comprising an antigen-binding site that binds to the extracellular part of ErbB-3 comprises palivizumab, seribumab, rumucumab, erg Timumab, GSK2849330, KTN3379, or AV-203.
該酪胺酸激酶抑制劑較佳為阿法替尼、拉帕替尼及/或來那替尼。The tyrosine kinase inhibitor is preferably afatinib, lapatinib and/or neratinib.
更佳地,該ErbB-2及/或ErbB-3標靶劑為多特異性抗體,或更佳地為雙特異性抗體,且其包含了一與ErbB-2之細胞外部分結合的第一抗原結合位點以及一與ErbB-3之細胞外部分結合的第二抗原結合位點。尤其,所述雙特異性抗體為澤妥珠單抗(zenocutuzumab)。More preferably, the ErbB-2 and/or ErbB-3 targeting agent is a multispecific antibody, or more preferably a bispecific antibody, and it comprises a first antibody that binds to the extracellular part of ErbB-2. An antigen binding site and a second antigen binding site that binds to the extracellular portion of ErbB-3. In particular, the bispecific antibody is zenocutuzumab.
該抗體較佳地可在ErbB-2及ErbB-3陽性細胞中降低配體誘導的ErbB-3受體功能。此外,該抗體較佳地可降低ErbB-2及ErbB-3陽性細胞中之配體誘導的生長。該抗體較佳地為包含有一與ErbB-2之細胞外部分結合的第一抗原結合位點以及一與ErbB-3之細胞外部分結合的第二抗原結合位點多特異性或雙特異性抗體。The antibody preferably reduces ligand-induced ErbB-3 receptor function in ErbB-2 and ErbB-3 positive cells. Furthermore, the antibody preferably reduces ligand-induced growth in ErbB-2 and ErbB-3 positive cells. The antibody is preferably a multispecific or bispecific antibody comprising a first antigen binding site that binds to the extracellular portion of ErbB-2 and a second antigen binding site that binds to the extracellular portion of ErbB-3 .
該個體較佳為人類個體。該個體較佳為符合使用ErbB-2特異性抗體(諸如曲妥珠單抗)進行單株抗體療法的受試者。在一較佳實施例中,該個體包含有腫瘤/癌症,其較佳為ErbB-2/ErbB-3陽性癌症,較佳為具ErbB-2治療抗性基因型及/或heregulin抗性基因型之腫瘤/癌症,較佳為具單株抗體抗性基因型之腫瘤/癌症。涉及這類基因型之腫瘤可躲過當前抗HER2治療方案(諸如(但不限於)抗ErbB-2之單株抗體療法)的治療。The individual is preferably a human individual. The individual is preferably a subject eligible for monoclonal antibody therapy with an ErbB-2-specific antibody, such as trastuzumab. In a preferred embodiment, the individual comprises a tumor/cancer, preferably an ErbB-2/ErbB-3 positive cancer, preferably an ErbB-2 treatment-resistant genotype and/or a heregulin-resistant genotype The tumor/cancer is preferably a tumor/cancer with a monoclonal antibody resistant genotype. Tumors involving such genotypes may escape treatment with current anti-HER2 treatment regimens such as, but not limited to, anti-ErbB-2 monoclonal antibody therapy.
尤其,該癌症包含了屬於轉移性腫瘤的腫瘤,較佳地其中該腫瘤已遷移到原來癌症附近的淋巴結或組織,或是擴散到遠離原來癌症更遠的器官或組織。In particular, the cancer comprises tumors that are metastatic tumors, preferably wherein the tumor has migrated to lymph nodes or tissues adjacent to the original cancer, or spread to organs or tissues more distant from the original cancer.
本揭露亦包括活體內模型,諸如表現在其基因體內或是包含有本文所提及之多核苷酸融合物的移植異常細胞之異種移植或基因轉殖動物模型,且/或表現由其所編碼之多肽融合物,以及使用ERB2及/或Erb3標靶劑或其他標靶劑之這類模型的治療以評估這類試劑的治療活性。較佳地,該動物模型為非人類動物模型。The present disclosure also includes in vivo models, such as xenograft or transgenic animal models of transplanted abnormal cells expressing in their genes or comprising the polynucleotide fusions mentioned herein, and/or expressing the genes encoded by them. and treatment of such models with ERB2 and/or Erb3 targeting agents or other targeting agents to assess the therapeutic activity of such agents. Preferably, the animal model is a non-human animal model.
在所揭露的實施例中,所述活體內動物模型係包含如本揭露之多核苷酸融合物及/或表現由其所編碼之多肽融合物,其中由該動物模型所組成之多核苷酸融合物或多肽融合物較佳地係由存在於該動物模型中之移植異常細胞所組成,或是由該動物模型的基因體所組成。In a disclosed embodiment, the in vivo animal model comprises a polynucleotide fusion as disclosed herein and/or expresses a polypeptide fusion encoded thereby, wherein the animal model consists of a polynucleotide fusion The drug or polypeptide fusion is preferably composed of transplanted abnormal cells present in the animal model, or is composed of the gene body of the animal model.
在所揭露之實施例中,本揭露提供一種使用ErbB-2及/或ErbB-3標靶劑之所述活體內動物模型的治療方法,該標靶劑係選自由以下所組成之群組:包含一結合ErbB-2之胞外部分的第一抗原結合位點及一結合ErbB-3之胞外部分的第二抗原結合位點之多特異性抗體、ErbB-2之酪胺酸激酶抑制劑、包含有一結合ErbB-2之胞外部分的抗原結合位點之單特異性雙價抗體、包含有一結合ErbB-3之胞外部分的抗原結合位點之單特異性雙價抗體、或其任何組合,所述方法包含向該動物投與所述Erb2及/或Erb3標靶劑。In disclosed embodiments, the present disclosure provides a method of treatment of said in vivo animal model using an ErbB-2 and/or ErbB-3 targeting agent selected from the group consisting of: Multispecific antibody comprising a first antigen binding site that binds the extracellular portion of ErbB-2 and a second antigen binding site that binds the extracellular portion of ErbB-3, a tyrosine kinase inhibitor of ErbB-2 , a monospecific diabody comprising an antigen-binding site that binds an extracellular portion of ErbB-2, a monospecific diabody comprising an antigen-binding site that binds an extracellular portion of ErbB-3, or any of these In combination, said method comprises administering said Erb2 and/or Erb3 targeting agent to said animal.
在所揭露之實施例中,本揭露提供一種ErbB-2及/或ErbB-3標靶劑,其係選自由以下所組成之群組:包含一結合ErbB-2之胞外部分的第一抗原結合位點及一結合ErbB-3之胞外部分的第二抗原結合位點之多特異性抗體、ErbB-2之酪胺酸激酶抑制劑、包含有一結合ErbB-2之胞外部分的抗原結合位點之單特異性雙價抗體、包含有一結合ErbB-3之胞外部分的抗原結合位點之單特異性雙價抗體、或其任何組合,供使用於所述活體內動物模型的治療中。所述方法較佳地包含向該動物投與所述Erb2及/或Erb3標靶劑。 給藥及投與 In disclosed embodiments, the disclosure provides an ErbB-2 and/or ErbB-3 targeting agent selected from the group consisting of a first antigen that binds an extracellular portion of ErbB-2 Multispecific antibody binding site and a second antigen binding site binding to the extracellular portion of ErbB-3, tyrosine kinase inhibitor of ErbB-2, antigen binding comprising an extracellular portion binding to ErbB-2 Monospecific diabodies of a site, monospecific diabodies comprising an antigen binding site that binds an extracellular portion of ErbB-3, or any combination thereof, for use in therapy of said in vivo animal model . The method preferably comprises administering to the animal said Erb2 and/or Erb3 targeting agent. Administration and administration
本揭露之醫藥組合物中的活性成分或是任何治療方法中所投與之實際劑量含量可變化以獲得有效達成對特定患者、組合物及投藥模式之所需治療反應,而對患者不具有毒性的活性成分之量。所選劑量含量將視多種藥物動力學因素而定,包括所用之本揭露之特定組合物的活性、投藥途徑、投藥時間、所用特定化合物之排出速率、治療持續時間;與所用特定組合物組合使用的其他藥物、化合物及/或物質;所治療患者之年齡、性別、體重、病狀、一般健康狀況及先前醫療病史;及醫學領域中所熟知的類似因素。所述活性成分較佳為本揭露之任一Erb2及/或Erb3標靶劑。任何經批准的活性成分或正在進行臨床試驗的藥物,特別是一旦進入臨床試驗的第二期,可由醫生考慮按照批准的劑量方案或臨床試驗劑量治療方案投藥。Actual dosage levels of the active ingredients in the disclosed pharmaceutical compositions or administered in any method of treatment may be varied to achieve a desired therapeutic response effective for a particular patient, composition, and mode of administration without toxicity to the patient. amount of active ingredient. The selected dosage level will depend on a variety of pharmacokinetic factors, including the activity of the particular composition of the present disclosure used, the route of administration, the time of administration, the rate of excretion of the particular compound used, the duration of treatment; the combination with the particular composition used other drugs, compounds, and/or substances; the age, sex, weight, condition, general health, and prior medical history of the patient being treated; and similar factors well known in the medical field. The active ingredient is preferably any Erb2 and/or Erb3 targeting agent disclosed herein. Any approved active ingredient or drug that is undergoing clinical trials, especially once it enters the second phase of clinical trials, may be considered by a physician for administration according to the approved dosage regimen or clinical trial dosage regimen.
待向如本文中所提及之任一ErbB-2及/或ErbB-3標靶劑的量通常在治療範圍內,其意指使用足夠量來獲得治療效果,同時該量不超過會導致副作用達不可接受程度的臨限值。用來獲得所要治療效果之治療物質所需量越低,治療範圍一般會越大。所選劑量水平將取決於各種因素,包括投予途徑、投予時間、所用特定化合物之排出速率、治療持續期間、組合使用的其他藥物、化合物及/或物質、所治療個體之年齡、性別、體重、病狀、一般健康狀況及先前醫療病史;及醫學領域中所熟知的類似因素。The amount to be administered to any ErbB-2 and/or ErbB-3 targeting agent as mentioned herein is generally within the therapeutic range, which means using a sufficient amount to obtain a therapeutic effect while not exceeding the amount that would cause side effects threshold of unacceptable levels. The lower the amount of therapeutic substance required to achieve the desired therapeutic effect, the larger the therapeutic area will generally be. The selected dosage level will depend on various factors including the route of administration, time of administration, rate of excretion of the particular compound used, duration of treatment, other drugs, compounds and/or substances used in combination, age, sex, Weight, medical conditions, general health, and previous medical history; and similar factors well known in the medical field.
關於澤妥珠單抗,其在相對高劑量下具有良好的安全性概況,因此比其他標靶或具細胞毒性的化學治療劑相比下提供了較大的治療範圍。本發明之雙特異性抗體的給藥係遵循每週、每兩週、或每三週一次750 mg的投與方案,較佳為每兩週一次750 mg的劑量。給藥較佳地係用於患有胰臟癌、NSCLC、或實體瘤的個體,且其包括患有具NRG1融合物之實體瘤的任何個體,其中此類個體已在投予化療照護標準、或ErbB-2或ErbB-3標靶劑、或TKI時有所進展。替代地,隨後的給藥方案包括每週一次400 mg的固定劑量投予,較佳為在單次投予800 mg之後開始。在此替代給藥方案之後,本揭露之雙特異性抗體較佳地係以每週一次400 mg的劑量投予3週,接著1週沒有給藥。接著,遵循一或多個由下列組成之四週期間循環:三次的每週一次400 mg之固定劑量,隨後一週不投予。較佳為遵循此方式直到觀察到治療效果。With regard to zetuzumab, it has a good safety profile at relatively high doses and thus offers a larger therapeutic latitude compared to other targeted or cytotoxic chemotherapeutic agents. The administration of the bispecific antibody of the present invention follows a weekly, biweekly, or triweekly administration regimen of 750 mg, preferably a biweekly dose of 750 mg. Administration is preferably for individuals with pancreatic cancer, NSCLC, or solid tumors, and includes any individual with a solid tumor with an NRG1 fusion, wherein such individuals have been administered chemotherapy standard of care, Or ErbB-2 or ErbB-3 targeting agents, or TKI progressed. Alternatively, the subsequent dosing regimen comprises a weekly fixed dose administration of 400 mg, preferably initiated after a single administration of 800 mg. Following this alternate dosing regimen, the bispecific antibody of the present disclosure is preferably administered at a dose of 400 mg once weekly for 3 weeks, followed by 1 week of no administration. Then, one or more four-week period cycles consisting of three once-weekly fixed doses of 400 mg followed by one week of no administration were followed. It is preferred to follow this pattern until a therapeutic effect is observed.
給藥較佳地涉及以靜脈內注射進行本揭露之雙特異性抗體的兩次輸注以達到完整劑量,其較佳為在給藥> 360 mg抗體時。替代地,針對較低劑量可給予單次輸注的完整劑量,例如在給藥≤ 360 mg抗體時。給藥方案中可包括預備藥物以減緩輸注相關反應。 一般定義 Dosing preferably involves two infusions of the bispecific antibody of the present disclosure by intravenous injection to achieve the full dose, preferably when >360 mg of antibody is administered. Alternatively, a single infusion of the complete dose may be given for lower doses, eg, when administering ≤ 360 mg of antibody. Dosing regimens may include back-up medications to slow down infusion-related reactions. general definition
冠詞「a」及「an」在本文中是指冠詞語法對象中的一個或多個(即,一個或至少一個)。The articles "a" and "an" herein refer to one or more (ie, one or at least one) of the grammatical object of the article.
在整個本說明書及所附申請專利範圍中,詞語「包含」、「包括」及「具有」以及諸如「包含了」、「包含有」、「包括了」及「包括有」的變體將被解釋為具有包容性。也就是說,在上下文允許的情況下,這些詞語意欲表達可能包括未具體列舉的其他元素或整數。Throughout this specification and the appended claims, the words "comprises," "includes," and "has" and variations thereof such as "comprises," "comprising," "includes," and "includes" are to be referred to as Interpreted as inclusive. That is, where the context allows, these words are intended to mean that other elements or integers not specifically listed may be included.
當提及核酸或胺基酸序列時「一致性百分比(%)」的定義為在出於最佳比較目的而比對序列之後與經選擇序列中之殘基具有一致性之候選序列中之殘基百分比。為了使比對最佳化,可在二個序列之間在經比較之二個序列中之任一個中引入空隙。這類比對可在所比較之全長序列上進行。替代性地,比對可在較短長度上進行,例如在約20個、約50個、約100個或更多個核酸/鹼基或胺基酸上進行。序列一致性為經報導之經比對區上方之二個序列之間的一致匹配百分比。"Percent identity (%)" when referring to nucleic acid or amino acid sequences is defined as the residues in a candidate sequence that are identical to residues in the selected sequence after alignment of the sequences for optimal comparison purposes base percentage. To optimize the alignment, a gap can be introduced between the two sequences in either of the two sequences being compared. Such alignments can be performed over the full length of the sequences being compared. Alternatively, the alignment can be performed over shorter lengths, eg, over about 20, about 50, about 100 or more nucleic acids/bases or amino acids. Sequence identity is the reported percentage of identical matches between two sequences over an aligned region.
序列比較及二個序列之間的序列一致性百分比測定可使用數學演算法來實現。熟習該項技術者將瞭解以下事實:數個不同電腦程式可用於比對二個序列且測定二個序列之間的一致性(Kruskal, J. B. (1983) An overview of sequence comparison In D. Sankoff and J. B. Kruskal, (編), Time warps, string edits and macromolecules: the theory and practice of sequence comparison, 第1 -44頁Addison Wesley)。兩個胺基酸序列或核酸序列之間的序列一致性百分比可使用用於二個序列比對之尼-翁氏(Needleman and Wunsch)演算法來測定(Needleman, S. B.及Wunsch, C. D. (1970) J. Mol. Biol. 48, 443-453)。 尼-翁氏演算法已實施於電腦程式NEEDLE中。出於本揭露之目的,使用來自EMBOSS軟體包之NEEDLE程式以測定胺基酸及核酸序列之一致性百分比(2.8.0版或更高版本,EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice, P. LongdenJ.及Bleasby, A. Trends in Genetics 16, (6) 第276- 277頁, http://emboss.bioinformatics.nl/)。對於蛋白質序列,使用EBLOSUM62以用於取代矩陣。對於DNA序列,使用DNAFULL。所用參數為10之空隙開放罰分及0.5之空隙擴展罰分。The comparison of sequences and the determination of percent sequence identity between two sequences can be accomplished using a mathematical algorithm. Those skilled in the art will be aware of the fact that several different computer programs are available for aligning two sequences and determining the identity between the two sequences (Kruskal, J. B. (1983) An overview of sequence comparison In D. Sankoff and J. B. Kruskal, (ed.), Time warps, string edits and macromolecules: the theory and practice of sequence comparison, pp. 1-44 Addison Wesley). The percent sequence identity between two amino acid sequences or nucleic acid sequences can be determined using the Needleman and Wunsch algorithm for the alignment of two sequences (Needleman, S. B. and Wunsch, C. D. (1970) J. Mol. Biol. 48, 443-453). The Nyon-Wong algorithm has been implemented in the computer program NEEDLE. For the purposes of this disclosure, the NEEDLE program from the EMBOSS software package was used to determine percent identity for amino acid and nucleic acid sequences (version 2.8.0 or later, EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice , P. LongdenJ. and Bleasby, A. Trends in Genetics 16, (6) pp. 276-277, http://emboss.bioinformatics.nl/). For protein sequences, use EBLOSUM62 for the substitution matrix. For DNA sequences, use DNAFULL. The parameters used were a gap opening penalty of 10 and a gap extension penalty of 0.5.
在比對之後,藉由如上文所描述之程式NEEDLE如下計算查詢序列與本揭露序列之間的序列一致性百分比:在二個序列中顯示相同胺基酸或相同核苷酸之比對中之對應位置數目除以減除比對中之空隙總數目之後的比對總長度。After the alignment, the percent sequence identity between the query sequence and the sequences of the disclosure is calculated by the program NEEDLE as described above as follows: the number of alignments showing identical amino acids or identical nucleotides in the two sequences The number of corresponding positions is divided by the total length of the alignment after subtracting the total number of gaps in the alignment.
「對偶基因變異體」係指在源自患者的樣本中所鑑定出之特定序列之天然發生的對偶基因,其所鑑定出之序列已被指派SEQ ID NO,且其變異體落入一個確定的變異範圍內。該變異性的定義為與表示其為對偶基因變異體之序列有至少85%序列一致性,較佳為90%一致性,或與其有92%、94%、95%、96%或更佳為至少98%序列一致性。這類變異體被認為是與之比較的序列的一個對偶基因,因此仍有資格作為它的變異體基因。如果在本文中沒有明確提及與一特定對偶基因變異體的關係,這些序列一致性百分比仍適用於所指的對偶基因變異體。"Allogenic variant" means the naturally occurring allele of a specific sequence identified in a patient-derived sample, the identified sequence has been assigned a SEQ ID NO, and the variant falls into a defined within the range of variation. The variability is defined as having at least 85% sequence identity, preferably 90% identity, or 92%, 94%, 95%, 96% or better with the sequence indicating that it is an allogenic variant At least 98% sequence identity. Such a variant is considered an allele of the sequence it is being compared to, and thus still qualifies as its variant gene. If no relationship to a particular allele variant is explicitly mentioned herein, these percent sequence identities still apply to the allele variant referred to.
「雙特異性抗體」係指一抗體有一個抗體可變域結合至一第一抗原,而一個第二抗體可變域係結合至一第二抗原,其中所述第一及第二抗原是不相同的。術語「雙特異性抗體」亦涵蓋二元抗體,其中一個抗體可變域結合至一抗原的第一表位,而一個第二抗體可變域係結合至相同抗原的第二表位。該術語進一步包括其中至少一個VH能夠特異性地辨識出一第一抗原與一VL的抗體,與免疫球蛋白可變域中的至少一個VH配對,以能夠特異性地辨識出一第二抗原。所得VH/VL對將與抗原1或抗原2結合,且稱為「二合一抗體」,例如於WO 2008/027236、WO 2010/108127及Schaefer等人(Cancer Cell 20, 472-486, October 2011)中所描述。如本揭露之雙特異性抗體並不限於任何特定雙特異性型式或是產生其之方法。"Bispecific antibody" refers to an antibody having one antibody variable domain that binds to a first antigen and a second antibody variable domain that binds to a second antigen, wherein the first and second antigens are not identical. The term "bispecific antibody" also encompasses binary antibodies in which one antibody variable domain binds to a first epitope of an antigen and a second antibody variable domain binds to a second epitope of the same antigen. The term further includes antibodies in which at least one VH specifically recognizes a first antigen and a VL is paired with at least one VH in an immunoglobulin variable domain to specifically recognize a second antigen. The resulting VH/VL pair will bind to Antigen 1 or Antigen 2 and is called a "two-in-one antibody", for example in WO 2008/027236, WO 2010/108127 and Schaefer et al. (Cancer Cell 20, 472-486, October 2011 ) described in . Bispecific antibodies as disclosed herein are not limited to any particular bispecific format or method of producing them.
「可檢測標籤」是指化學、生物或其他修飾,包括但不限於螢光、質量、殘基、染料、放射性同位素、標籤或標記修飾等,其致使檢測所感興趣的分子(在本文中較佳為多核苷酸或多肽)是否存在。較佳地,可檢測標籤為可見標籤、螢光染料、淬滅體、紫外光可檢測標籤、顯色標籤、放射性標籤、電化學標籤、標記、當與對酶或分子標記碼(barcode)具特異性之基質接觸時產生可檢測標籤的酶。例示性螢光染料包括水溶性羅丹明染料、螢光素、4,7-二氯螢光素、苯并呫噸染料及能量轉移染料,如以下參考文獻中所揭露者:Handbook of Molecular Probes and Research Reagents, 第8版(2002), Molecular Probes, Eugene, Oreg.; WO 2001/32783; 美國專利公開號US 2002-0081616, US 2002-0086985; 及Lee等人, 1997, Nucleic Acids Research 25:2816-2822。"Detectable label" means a chemical, biological or other modification, including but not limited to fluorescent, mass, residue, dye, radioisotope, tag or label modification, etc., which results in the detection of a molecule of interest (preferred herein is polynucleotide or polypeptide) is present. Preferably, the detectable label is a visible label, a fluorescent dye, a quencher, an ultraviolet light detectable label, a chromogenic label, a radioactive label, an electrochemical label, a label, and when combined with an enzyme or molecular barcode. An enzyme that produces a detectable label upon contact with a specific substrate. Exemplary fluorescent dyes include water-soluble rhodamine dyes, luciferin, 4,7-dichlorofluorescein, benzoxanthene dyes, and energy transfer dyes, as disclosed in the following references: Handbook of Molecular Probes and Research Reagents, 8th Edition (2002), Molecular Probes, Eugene, Oreg.; WO 2001/32783; US Patent Publication Nos. US 2002-0081616, US 2002-0086985; and Lee et al., 1997, Nucleic Acids Research 25:2816 -2822.
「分離的」或「純化的」係指核酸或胺基酸序列被從其天然環境中移出而經分離或經純化。「分離的」核酸分子包括一種從存在於該核酸分子的天然來源中之其他核酸分子分離出者。此外,「分離的」核酸分子(諸如cDNA分子)在藉由重組技術產生時係包括不含其他細胞物質或培養基的狀態,或在化學合成時實質上不含化學前驅物或其他化學品。"Isolated" or "purified" means that a nucleic acid or amino acid sequence has been isolated or purified from its natural environment. An "isolated" nucleic acid molecule includes one that is separated from other nucleic acid molecules present in the natural source of the nucleic acid molecule. Furthermore, an "isolated" nucleic acid molecule, such as a cDNA molecule, includes a state free of other cellular material or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized.
本文中之「樣本」係指獲自個體或患者的樣本。這類樣本為生物樣本或患者樣本,這些術語在本文中可互換使用,因為本文所提及的樣本為獲自患者的生物樣本。術語「樣本」包括液態檢體樣本,採自實體癌或實體瘤的樣本,且在向其應用任何處理之前將含有多核苷酸融合物及/或多肽融合物。尤其,該樣本包含異常細胞,諸如腫瘤細胞或癌細胞。A "sample" herein refers to a sample obtained from an individual or patient. Such samples are biological samples or patient samples, and these terms are used interchangeably herein, as reference herein to a sample is a biological sample obtained from a patient. The term "sample" includes liquid specimen samples, samples taken from solid cancers or solid tumors, and which will contain polynucleotide fusions and/or polypeptide fusions before any processing is applied thereto. In particular, the sample contains abnormal cells, such as tumor cells or cancer cells.
措辭「不含其他細胞物質或培養基」係包括核酸分子製劑,其中將該核酸分子自分離或重組產生核酸分子之細胞的細胞成分分離出。The expression "free of other cellular material or culture medium" includes preparations of nucleic acid molecules in which the nucleic acid molecule is isolated from the cellular components of the cells from which the nucleic acid molecule was isolated or recombinantly produced.
「引子」係指具有足夠長度以特異性地與所感興趣的核苷酸序列雜交的單股核苷酸分子,以致使與一界定了在聚合酶連鎖反應(PCR)中被擴增之所感興趣區域的第二引子一起特異性結合(在本文中也稱為雜交或黏合)至目標或選定核苷酸序列。在本文中,第一及第二引子,或引子對,係專門設計用於擴增跨越NRG1融合接合處的核苷酸區域。尤其,使用了一第一引子與一第二引子的組合,該第一引子係與對於在該融合接合處一側之NRG1序列部分具特異性的核苷酸序列雜交,而該第二引子係與對於在該融合接合處另一側之選自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之融合配偶體具特異性的核苷酸序列雜交。一般而言,引子本身並不包含可檢測標籤物。在NRG1融合多核苷酸之目標序列的全長上,引子通常與目標序列具有至少約80%、至少約85%、至少約90%、至少約95%、至少約97%、至少約98%或至少約99%的序列一致性。多核苷酸引子最佳為與其目標序列互補(在全長上具有100%序列一致性)。在本文中的所有實例中,序列一致性通常在非變異/目標序列的全長上測得。"Primer" means a single-stranded nucleotide molecule of sufficient length to specifically hybridize to a nucleotide sequence of interest such that it binds to a defined nucleotide sequence of interest to be amplified in a polymerase chain reaction (PCR). The second primers of the region together specifically bind (also referred to herein as hybridizing or adhering) to a target or selected nucleotide sequence. Herein, the first and second primers, or pair of primers, are specifically designed to amplify a region of nucleotides spanning the NRG1 fusion junction. In particular, a combination of a first primer that hybridizes to a nucleotide sequence specific for the portion of the NRG1 sequence on one side of the fusion junction and a second primer that hybridizes is used and for the other side of the fusion junction selected from VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2 , CD74, SDC4, SLC4A4, ZFAT or DSCAML1 fusion partners have specific nucleotide sequence hybridization. Generally, the primers themselves do not contain a detectable label. The primer typically shares at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97%, at least about 98%, or at least Approximately 99% sequence identity. A polynucleotide primer is optimally complementary (100% sequence identity over its entire length) to its target sequence. In all instances herein, sequence identity is generally measured over the full length of the non-variant/target sequence.
「探針」或「核酸探針」係指具有足夠長度以特異性地與所感興趣的核苷酸序列雜交的單股核苷酸分子。尤其,在本揭露的上下文中,該探針可檢測發生在VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之中的融合物,尤其是選自VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、VTCN1-NRG1、 CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1- NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1及DSCAML1-NRG1的融合物。探針通常包含一或兩種可檢測標籤物。在NRG1融合多核苷酸之目標序列的全長上,多核苷酸探針通常與目標序列具有至少約80%,至少約85%,至少約90%,至少約95%,至少約97%,至少約98%或至少約99%的序列一致性。該多核苷酸引子最佳為與其目標序列互補(在全長上具有100%序列一致性)。在本文中的所有實例中,序列一致性通常在非變異/目標序列的全長上測得。A "probe" or "nucleic acid probe" refers to a single-stranded nucleotide molecule of sufficient length to specifically hybridize to a nucleotide sequence of interest. In particular, in the context of the present disclosure, the probes can detect genes that occur in VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, Fusions among ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1, especially selected from VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR-NRG1 , GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74 - Fusions of NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 and DSCAML1-NRG1. Probes typically contain one or two detectable labels. On the full length of the target sequence of NRG1 fusion polynucleotide, the polynucleotide probe usually shares at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97%, at least about 98% or at least about 99% sequence identity. The polynucleotide primer is optimally complementary (100% sequence identity over its entire length) to its target sequence. In all instances herein, sequence identity is generally measured over the full length of the non-variant/target sequence.
該多核苷酸探針較佳為DNA探針、TaqMan探針、分子信標、蝎型探針(scorpion probe)或於FISH中使用的探針。DNA探針通常與互補的目標序列雜交,且接著可使用(例如)可檢測標籤物來檢測。TaqMan探針在本領域中是已知的,且為具有附接至5' 端的螢光染料及附接至3' 端的淬滅體之多核苷酸。PCR中使用的聚合酶可裂解經雜交的探針,使螢光染料不受淬滅作用而使得可被檢測到。分子信標探針在本領域是已知的,除了與目標序列雜交而將染料與淬滅體分開(而不是使用裂解將染料與淬滅體分離),其係與TaqMan探針相似。Scorpion探針在本領域中是已知的且與分子信標相似,除了其3'端還包含一個與引子的延伸產物之5'端互補的序列以打開雜交中的探針而使染料可被檢測到。該多核苷酸探針較佳為TaqMan探針。該多核苷酸探針較佳為TaqMan探針並使用於上述任何PCR方法之中。The polynucleotide probe is preferably a DNA probe, a TaqMan probe, a molecular beacon, a scorpion probe or a probe used in FISH. DNA probes typically hybridize to a complementary target sequence and can then be detected using, for example, a detectable label. TaqMan probes are known in the art and are polynucleotides with a fluorescent dye attached to the 5' end and a quencher attached to the 3' end. The polymerase used in PCR cleaves the hybridized probe, rendering the fluorescent dye unquenchable and detectable. Molecular Beacon probes are known in the art and are similar to TaqMan probes except that they hybridize to the target sequence to separate the dye from the quencher (instead of using cleavage to separate the dye from the quencher). Scorpion probes are known in the art and are similar to molecular beacons, except that their 3' end also contains a sequence complementary to the 5' end of the extension product of the primer to open the hybridizing probe to the dye. detected. The polynucleotide probe is preferably a TaqMan probe. The polynucleotide probe is preferably a TaqMan probe and is used in any of the PCR methods described above.
使用於像FISH或斷裂分離FISH之技術中的探針係具有適宜長度且包含單一可檢測標籤物。FISH涉及已與間期染色體或中期染色體雜交之高特異性DNA探針的檢測且使之可見,以便使用螢光顯微鏡進行檢測。斷裂分離FISH係特別適合檢測融合斷點。在這種情況下,所感興趣的基因的兩端係以不同的顏色標記,當發生易位而產生融合物時,兩種單獨的顏色不會共定位,而會觀察到兩種原色。在具有正常複本數之經標記所感興趣基因的正常細胞中,這兩種顏色會共定位而產生融合信號。Probes used in techniques like FISH or Fragmentation FISH are of suitable length and comprise a single detectable label. FISH involves the detection and visualization of highly specific DNA probes that have hybridized to interphase or metaphase chromosomes for detection using a fluorescent microscope. The break-separation FISH line is particularly suitable for detecting fusion breakpoints. In this case, the two ends of the gene of interest are marked with different colors, and when a translocation occurs to create a fusion, the two separate colors do not colocalize, but two primary colors are observed. In normal cells with a normal number of copies of the tagged gene of interest, the two colors will co-localize to produce a fusion signal.
用於鑑定由細胞構成的融合多核苷酸之探針或引子係可使用標準分子生物學技術來分離或純化,或基於本文所述資料庫紀錄中的序列資訊來合成。如本文所述之這類核酸分子係可使用標準雜交及選殖技術(例如,於Sambrook等人編,Molecular Cloning: A Laboratory Manual, 第二版, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989中所述者)來分離。此外,可使用任何適宜標準分子生物技術來分離、純化或合成融合多核苷酸。在一些實施例中,該等引子或探針對於cDNA具特異性。例如,在一些實施例中,該等引子或探針對於cDNA中之外顯子-外顯子接合具特異性。Probes or primers used to identify fusion polynucleotides composed of cells can be isolated or purified using standard molecular biology techniques, or synthesized based on sequence information in database records described herein. Such nucleic acid molecules as described herein can be cloned using standard hybridization and cloning techniques (e.g., in Sambrook et al., eds., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) to separate. In addition, fusion polynucleotides can be isolated, purified or synthesized using any suitable standard molecular biology techniques. In some embodiments, the primers or probes are specific for cDNA. For example, in some embodiments, the primers or probes are specific for exon-exon junctions in cDNA.
「多核苷酸融合物」係指兩個多核苷酸之間的共價鍵結連接,其中在融合接合處的一側係存有來自一個基因的核苷酸序列,而在另一側係存有來自另一個基因的核苷酸序列。在多核苷酸融合物之上下文中,該連接係以允許轉錄成蛋白質之方式而可操作地連接。在融合接合處的其中一側上,所述核苷酸序列較佳為來自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1,且另一側上較佳為來自NRG1。"Polynucleotide fusion" means a covalently bonded linkage between two polynucleotides in which a nucleotide sequence from one gene is present on one side of the fusion junction and a nucleotide sequence is present on the other. There is a nucleotide sequence from another gene. In the context of polynucleotide fusions, the linkage is operably linked in a manner that permits transcription into a protein. On one side of the fusion junction, the nucleotide sequence is preferably from VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1, and preferably from NRG1 on the other side.
「多肽融合物」係指兩個胺基酸之間的共價鍵結連接,其中在融合接合處的一側係存有來自一個多肽的胺基酸序列,而在另一側係存有來自另一個多肽的胺基酸序列。在融合接合處的其中一側上,所述胺基酸序列較佳為來自VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1,且另一側上較佳為來自NRG1。"Polypeptide fusion" means a covalently bonded linkage between two amino acids in which on one side of the fusion junction there is an amino acid sequence from one polypeptide and on the other side an amino acid sequence from The amino acid sequence of another polypeptide. On one side of the fusion junction, the amino acid sequence is preferably from VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, ASPH, NOTCH2, CD74, SDC4, SLC4A4, ZFAT or DSCAML1, and preferably from NRG1 on the other side.
「融合配偶體」係指由於基因重排而已與NRG1基因或多肽發生融合之基因或多肽。在本揭露中,該融合配偶體為VAPB、CADM1、CD44、SLC3A2、VTCN1 、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1中之一者。一般而言,該融合配偶體係位於該NRG1基因的上游處(在有義股的5’端),且符合讀框的融合而導致轉譯為一種包含有該融合配偶體之一部分與一NRG1部分的嵌合蛋白。由於接合兩個不同基因部分與融合多肽所導致的融合基因為來自這類融合基因的轉譯產物。"Fusion partner" refers to a gene or polypeptide that has been fused to an NRG1 gene or polypeptide due to gene rearrangement. In this disclosure, the fusion partners are VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, CD74, One of SDC4, SLC4A4, ZFAT or DSCAML1. Generally, the fusion partner system is located upstream of the NRG1 gene (at the 5' end of the sense strand), and in-frame fusion results in translation into a gene comprising a portion of the fusion partner and an NRG1 portion. chimeric protein. A fusion gene resulting from the joining of two different gene portions with a fusion polypeptide is the translation product from such a fusion gene.
「融合點」係指在胺基酸或核苷酸序列中有一側出現NRG1序列且另一側出現NRG1融合配偶體序列的位置。此為發生基因重排而形成嵌合基因、mRNA及/或蛋白序列的接合處或位置。本文中之術語「融合點」係可與「接合處」或「融合接合處」互換。"Fusion point" refers to the position in an amino acid or nucleotide sequence where the NRG1 sequence occurs on one side and the NRG1 fusion partner sequence occurs on the other side. This is the junction or location where gene rearrangement occurs to form chimeric gene, mRNA and/or protein sequences. The term "fusion point" herein is interchangeable with "junction" or "fusion junction".
「位於3’處的基因序列」係指位於所指基因的下游或3’處且被認為是所述基因的一部分之任何元件(無論是否具調節性質)。本文中之調節元件包括增強子、緘默子、反應元件、插入子、外顯子、內嵌啟動子,且該元件係對所涉基因具特異性,使得可因為靠近該融合接合處而以獨特方式擴增及鑑定該融合物。此上下文中之位於3’處的基因序列亦包括位於可用來獨特地鑑定所涉融合物之任何調節元件之間的任何基因序列。"Gene sequence located 3'" refers to any element (regulatory or not) located downstream or 3' of the gene in question and considered to be part of said gene. Regulatory elements herein include enhancers, silencers, response elements, insertions, exons, embedded promoters, and are specific to the gene in question such that they can be uniquely expressed due to their proximity to the fusion junction. way to amplify and identify the fusion. A gene sequence located 3' in this context also includes any gene sequence located between any regulatory elements that can be used to uniquely identify the fusion in question.
「位於5’處的基因序列」係指位於所指基因的下游或5’處且被認為是所述基因的一部分之任何元件(無論是否具調節性質)。本文中之調節元件包括啟動子、增強子、緘默子、反應元件、插入子、外顯子、內嵌啟動子,且該元件係對所涉基因具特異性,使得可因為靠近該融合接合處而以獨特方式擴增及鑑定該融合物。此上下文中之位於5’處的基因序列亦包括位於可用來獨特地鑑定所涉融合物之任何調節元件之間的任何基因序列。"Gene sequence located 5'" refers to any element (regulatory or not) located downstream or 5' of the gene in question and considered to be part of said gene. Regulatory elements herein include promoters, enhancers, silencers, response elements, inserts, exons, embedded promoters, and the elements are specific to the gene in question so that they can be Instead, the fusion is amplified and identified in a unique manner. A gene sequence located 5' in this context also includes any gene sequence located between any regulatory elements that can be used to uniquely identify the fusion in question.
如本文中所使用,術語「ErbB-2」係指在人類中由ERBB-2基因編碼的蛋白質。該基因或蛋白質之替代名稱包括CD340;HER-2;HER-2/neu;MLN 19;NEU;NGL;TKR1。ERBB-2基因經常被稱為HER2 (出自人類表皮生長因子受體2)。當本文中指稱ErbB-2時,該指稱係指人類ErbB-2。包含有結合ErbB-2的抗原結合部位之抗體係與人類ErbB-2結合。ErbB-2抗原結合部位因在人類與其他哺乳動物異種同源物之間的序列及三級結構相似性而亦可結合此類異種同源物,但不一定如此。人類ErbB-2蛋白及其編碼基因之資料庫存取號為(NP_001005862.1、NP_004439.2、NC_000017.10、NT_010783.15、NC_018928.2)。給予存取號主要是為了提供將ErbB-2識別為目標之進一步方法,結合抗體的ErbB-2蛋白之實際序列可有所變化,例如因為編碼基因中之突變,諸如發生於一些癌症或類似者中的突變。ErbB-2抗原結合部位係與ErbB-2及其各種變異體結合,諸如由一些ErbB-2陽性腫瘤細胞所表現者。結合ErbB-2的抗原結合部位較佳地與ErbB-2之結構域I結合。As used herein, the term "ErbB-2" refers to the protein encoded by the ERBB-2 gene in humans. Alternative names for the gene or protein include CD340; HER-2; HER-2/neu; MLN 19; NEU; NGL; TKR1. The ERBB-2 gene is often referred to as HER2 (from human epidermal growth factor receptor 2). When ErbB-2 is referred to herein, this reference refers to human ErbB-2. Antibodies containing an ErbB-2-binding antigen-binding site bind to human ErbB-2. ErbB-2 antigen binding sites can also bind human and other mammalian xenologs due to sequence and tertiary structural similarities between such xenologs, but not necessarily so. The database access numbers of human ErbB-2 protein and its coding gene are (NP_001005862.1, NP_004439.2, NC_000017.10, NT_010783.15, NC_018928.2). The accession number is given primarily to provide a further means of identifying ErbB-2 as a target, the actual sequence of the ErbB-2 protein that binds the antibody may vary, for example due to mutations in the coding gene, such as occurs in some cancers or the like mutations in . ErbB-2 antigen binding sites bind ErbB-2 and its various variants, such as those expressed by some ErbB-2 positive tumor cells. The antigen binding site that binds ErbB-2 preferably binds to domain I of ErbB-2.
如本文中所使用,術語「ErbB-3」係指在人類中由ERBB3基因編碼的蛋白質。該基因或蛋白質之替代名稱為HER3;LCCS2;MDA-BF-1;c-ErbB-3;c-ErbB3;ErbB3-S;p180-ErbB3;p45-sErbB3;及p85-sErbB3。當本文中指稱ErbB-3時,該指稱係指人類ErbB-3。包含有結合ErbB-3的抗原結合部位之抗體係與人類ErbB-3結合。ErbB-3抗原結合部位因在人類與其他哺乳動物異種同源物之間的序列及三級結構相似性而亦可結合此類異種同源物,但不一定如此。人類ErbB-3蛋白及其編碼基因之資料庫存取號為(NP_001005915.1、NP_001973.2、NC_000012.11、NC_018923.2、NT_029419.12)。給予存取號主要是為了提供將ErbB-3識別為目標之進一步方法,抗體所結合的ErbB-3蛋白之實際序列可有所變化,例如因為編碼基因中之突變,諸如發生於一些癌症或類似者中的突變。ErbB-3抗原結合部位係與ErbB-3及其各種變異體結合,諸如由一些ErbB-3陽性腫瘤細胞所表現者。結合ErbB-3的抗原結合部位較佳地與ErbB-3之結構域III結合。As used herein, the term "ErbB-3" refers to the protein encoded by the ERBB3 gene in humans. Alternative names for the gene or protein are HER3; LCCS2; MDA-BF-1; c-ErbB-3; c-ErbB3; ErbB3-S; When reference is made herein to ErbB-3, this reference refers to human ErbB-3. Antibodies containing an ErbB-3-binding antigen-binding site bind to human ErbB-3. ErbB-3 antigen binding sites can also bind human and other mammalian xenologs due to sequence and tertiary structural similarities between such xenologs, but not necessarily so. The database access numbers of human ErbB-3 protein and its coding gene are (NP_001005915.1, NP_001973.2, NC_000012.11, NC_018923.2, NT_029419.12). The accession number is given primarily to provide a further means of identifying ErbB-3 as a target, the actual sequence of the ErbB-3 protein to which the antibody binds may vary, for example due to mutations in the coding gene, such as occurs in some cancers or similar mutations in those. ErbB-3 antigen binding sites bind ErbB-3 and its various variants, such as those expressed by some ErbB-3 positive tumor cells. The antigen binding site that binds ErbB-3 preferably binds to domain III of ErbB-3.
當指稱ErbB-2或ErbB-3或其替代名稱時,該指稱係指人類ErbB-2或ErbB-3。本文中所指稱之抗體係結合至ErbB-2或ErbB-3及許多突變的ErbB-2或ErbB-3蛋白(如癌症中可發現的)。When referring to ErbB-2 or ErbB-3 or alternative names thereof, the reference refers to human ErbB-2 or ErbB-3. The antibodies referred to herein bind to ErbB-2 or ErbB-3 and to many mutated ErbB-2 or ErbB-3 proteins as found in cancer.
本文所提及之SEQ ID NO之任意範圍的數值係明確包括落入相關範圍內之全部的個別SEQ ID NO並包括其端值。因此,為避免疑慮,若(例如)指稱SEQ ID NO: 17-23,其係意欲在所提及之SEQ ID NO範圍的上下文中提及並揭露SEQ ID NO: 17、SEQ ID NO: 18、SEQ ID NO: 19、SEQ ID NO: 20、SEQ ID NO: 21、SEQ ID NO: 22及SEQ ID NO: 23。 條款 Reference herein to any range of values for a SEQ ID NO expressly includes all individual SEQ ID NOs falling within the relevant range up to and including end values. Therefore, for the avoidance of doubt, if (for example) reference is made to SEQ ID NO: 17-23, it is intended to refer to and disclose SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22 and SEQ ID NO: 23. terms
以下條項係說明例示性實施例。 1. 一種多核苷酸,包含:一VALB核酸序列或該VALB序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合。 2. 如第1項之多核苷酸,其中該VAPB核酸序列包含SEQ ID NO: 17-23中之任一者或由其組成,或包含SEQ ID NO: 17-23中之任一者之對偶基因變異體或由其組成,且該NRG1核酸序列包含SEQ ID NO: 125-138中之任一者或由其組成。 3. 如第1或2項之多核苷酸,其中該VALB核酸序列(或其對偶基因變異體)係位於NRG1核酸序列(或其對偶基因變異體)的5’端。 4. 如前述各項中之任一者之多核苷酸,其中該VAPB核酸序列之對偶基因變異體與SEQ ID NO: 17-23中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性。 5. 如前述各項中之任一者之多核苷酸,其中該VALB核酸與該NRG1核酸之融合物包含來自SEQ ID NO: 3的2至約40個連續核酸,較佳地包括位置102及103處的核酸 6. 如前述各項中之任一者之多核苷酸,其中該編碼NRG1蛋白序列(或其對偶基因變異體)之核酸係包含或編碼NRG1之EGF樣結構域,較佳為如SEQ ID NO: 163之EGF樣結構域。 7. 一種多核苷酸,包含: - 一PVALB核酸序列或該PVALB序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合,或 - 一ASPH核酸序列或該ASPH序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合,或 - 一DAAM1核酸序列或該DAAM1序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合,或 - 一ZFAT核酸序列或該ZFAT序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合,或 - 一DACAML1核酸序列或該DSCAML1序列的對偶基因變異體,其係與一NRG1核酸序列或該NRG1序列的對偶基因變異體融合。 8. 如第7項之多核苷酸,其中 - 該PVALB核酸序列包含SEQ ID NO: 439-444中之任一者或由其組成,或包含SEQ ID NO: 439-444中之任一者之對偶基因變異體或由其組成,且該NRG1核酸序列包含SEQ ID NO: 125-138中之任一者或由其組成; - 該DAAM1核酸序列包含SEQ ID NO: 606-631中之任一者或由其組成,或包含SEQ ID NO: 606-631中之任一者之對偶基因變異體或由其組成,且該NRG1核酸序列包含SEQ ID NO: 125-138中之任一者或由其組成; - 該ZFAT核酸序列包含SEQ ID NO: 830-846中之任一者或由其組成,或包含SEQ ID NO: 830-846中之任一者之對偶基因變異體或由其組成,且該NRG1核酸序列包含SEQ ID NO: 125-138中之任一者或由其組成;或 - 該DSCAML1核酸序列包含SEQ ID NO: 870-903中之任一者或由其組成,或包含SEQ ID NO: 870-903中之任一者之對偶基因變異體或由其組成,且該NRG1核酸序列包含SEQ ID NO: 125-138中之任一者或由其組成。 9. 如第7或8項之多核苷酸,其中該PVALB、DAAM1、ZFAT或DSCAML1核酸序列(或其對偶基因變異體)係位於NRG1核酸序列(或其對偶基因變異體)的5’端。 10. 如第7-9項中任一項之多核苷酸,其中 - 該PVALB核酸序列之對偶基因變異體與SEQ ID NO: 439-444中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性; - 該DAAM1核酸序列之對偶基因變異體與SEQ ID NO: 606-631中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性; - 該ZFAT核酸序列之對偶基因變異體與SEQ ID NO: 830-846中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;或 - 該DSCAML1核酸序列之對偶基因變異體與SEQ ID NO: 870-903中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性;且該NRG1核酸序列之對偶基因變異體與SEQ ID NO: 125-138中之任一者有至少85%的一致性,較佳地至少90%的一致性,更佳地至少95%的序列一致性。 11. 如第7至10項中之任一項之多核苷酸,其中: - 該PVALB核酸與該NRG1核酸之融合物包含來自SEQ ID NO: 437的2至約40個連續核酸,較佳地包括位置102及103處的核酸; - 該DAAM1核酸與該NRG1核酸之融合物包含來自SEQ ID NO: 605的2至約40個連續核酸,較佳地包括位置75及76處的核酸; - 該ZFAT核酸與該NRG1核酸之融合物包含來自SEQ ID NO: 828的2至約40個連續核酸,較佳地包括位置75及76處的核酸;或 - 該DSCAML1核酸與該NRG1核酸之融合物包含來自SEQ ID NO: 868的2至約40個連續核酸,較佳地包括位置75及76處的核酸。 12. 如第7至11項中之任一項之多核苷酸,其中該編碼NRG1蛋白序列(或其對偶基因變異體)之核酸係包含或編碼NRG1之EGF樣結構域,較佳為如SEQ ID NO: 163之EGF樣結構域。 13. 一種多核苷酸,包含: - VAPB之外顯子1或外顯子1之對偶基因變異體的一部分,其與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分融合; - CADM1之外顯子7或外顯子7之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - CD44之外顯子5或外顯子5之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - SLC3A2之轉錄本6之外顯子1或外顯子1之對偶基因變異體的一部分,與NRG1之外顯子5或外顯子5之對偶基因變異體的一部分; - VTCN1之外顯子2或外顯子2之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - CDH1之外顯子11或外顯子11之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - CXADR之外顯子1或外顯子1之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - GTF2E2之外顯子2或外顯子2之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - CSMD1之外顯子23或外顯子23之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - PTN之外顯子4或外顯子4之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - ST14之外顯子11或外顯子11之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - THBS1之外顯子9或外顯子9之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - AGRN之外顯子12或外顯子12之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - PVALB之外顯子4或外顯子4之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - SLC3A2之轉錄本3之外顯子2或外顯子2之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - APP之外顯子14或外顯子14之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - WRN之外顯子33或外顯子33之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - DAAM1之外顯子1或外顯子1之對偶基因變異體的一部分,與NRG1之外顯子1或外顯子1之對偶基因變異體的一部分; - ASPH之外顯子22或外顯子22之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - NOTCH2之外顯子6或外顯子6之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - CD74之外顯子2或外顯子2之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - SDC4之外顯子2或外顯子2之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - CD44之外顯子5或外顯子5之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - SLC4A4之外顯子14或外顯子14之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分; - SDC4之外顯子4或外顯子4之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分; - ZFAT之外顯子12或外顯子12之對偶基因變異體的一部分,與NRG1之外顯子6或外顯子6之對偶基因變異體的一部分;或 - DSCAML1之外顯子3或外顯子3之對偶基因變異體的一部分,與NRG1之外顯子2或外顯子2之對偶基因變異體的一部分。 14. 如第13項之多核苷酸,其中VAPB之外顯子1為SEQ ID NO: 17之外顯子;CADM1之外顯子7為SEQ ID NO: 39之外顯子;CD44之外顯子5為SEQ ID NO: 65之外顯子;SLC3A2之外顯子1為SEQ ID NO: 103之外顯子;VTCN1之外顯子2為SEQ ID NO: 169之外顯子;CDH1之外顯子11為SEQ ID NO: 198之外顯子;CXADR之外顯子1為SEQ ID NO: 219之外顯子;GTF2E2之外顯子2為SEQ ID NO: 236之外顯子;CSMD1之外顯子23為SEQ ID NO: 279之外顯子;PTN之外顯子4為SEQ ID NO: 318之外顯子;ST14之外顯子11為SEQ ID NO: 342之外顯子;THBS1之外顯子9為SEQ ID NO: 386之外顯子;AGRN之外顯子12為SEQ ID NO: 416之外顯子;PVALB之外顯子4為SEQ ID NO: 442之外顯子;SLC3A2之外顯子2為SEQ ID NO: 457之外顯子;APP之外顯子14為SEQ ID NO: 501之外顯子;WRN之外顯子33為SEQ ID NO: 562之外顯子;DAAM1之外顯子1為SEQ ID NO: 606之外顯子;ASPH之外顯子22為SEQ ID NO: 658之外顯子;NOTCH2之外顯子6為SEQ ID NO: 700之外顯子;CD74之外顯子2為SEQ ID NO: 720之外顯子;SDC4之外顯子2為SEQ ID NO: 746之外顯子;CD44之外顯子5為SEQ ID NO: 65之外顯子;SLC4A4之外顯子14為SEQ ID NO: 780之外顯子;SDC4之外顯子4為SEQ ID NO: 748之外顯子;ZFAT之外顯子12為SEQ ID NO: 841之外顯子;DSCAML1之外顯子3為SEQ ID NO: 872之外顯子,且NRG1之外顯子1、2、5及6分別為SEQ ID NO: 125、126、129及130之外顯子。 15. 如第13或14項中之任一項之多核苷酸,其中: - 該VAPB之外顯子1或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該CADM1之外顯子7或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該CD44之外顯子5或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該SLC3A2之外顯子1或其對偶基因變異體部分係位於該NRG1之外顯子5或其對偶基因變異體部分的5’端; - 該VTCN1之外顯子2或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該CDH1之外顯子11或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該CXADR之外顯子1或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該GTF2E2之外顯子2或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該CSMD1之外顯子23或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該PTN之外顯子4或其對偶基因變異體部分係位於該NRG1之外顯子2或其對偶基因變異體部分的5’端; - 該ST14之外顯子11或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該THBS1之外顯子9或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該AGRN之外顯子12或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該PVALB之外顯子4或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該SCL3A2之外顯子2或其對偶基因變異體部分係位於該NRG1之外顯子6或其對偶基因變異體部分的5’端; - 該APP之外顯子14或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端; - 該WRN之外顯子33或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端; - 該DAAM1之外顯子1或其對偶基因變異體部分係位於該NRG1之外顯子1或外顯子1之對偶基因變異體部分的5’端; - 該ASPH之外顯子22或其對偶基因變異體部分係位於該NRG1之外顯子2或外顯子2之對偶基因變異體部分的5’端; - 該NOTCH2之外顯子6或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端; - 該CD74之外顯子2或其對偶基因變異體部分係位於該NRG1之外顯子2或外顯子2之對偶基因變異體部分的5’端; - 該SDC4之外顯子2或其對偶基因變異體部分係位於該NRG1之外顯子2或外顯子2之對偶基因變異體部分的5’端; - 該CD44之外顯子5或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端; - 該SLC4A4之外顯子14或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端; - 該SDC4之外顯子4或其對偶基因變異體部分係位於該NRG1之外顯子2或外顯子2之對偶基因變異體部分的5’端; - 該ZFAT之外顯子12或其對偶基因變異體部分係位於該NRG1之外顯子6或外顯子6之對偶基因變異體部分的5’端;及 - 該DSCAML1之外顯子3或其對偶基因變異體部分係位於該NRG1之外顯子2或外顯子2之對偶基因變異體部分的5’端。 16. 如第13至15項中之任一項之多核苷酸,其中: - 該VAPB之外顯子1之對偶基因變異體與SEQ ID NO: 17有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - CADM1該之外顯子7之對偶基因變異體與SEQ ID NO: 39有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CD44之外顯子5之對偶基因變異體與SEQ ID NO: 65有至少85%的一致性,較佳地有至少90%的一致性,更佳地有至少95%的一致性,或較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該SLC3A2之外顯子1之對偶基因變異體與SEQ ID NO: 103有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - VTCN1該之外顯子2之對偶基因變異體與SEQ ID NO: 169有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CDH1之外顯子11之對偶基因變異體與SEQ ID NO: 198有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該NRG1之外顯子2之對偶基因變異體與SEQ ID NO: 126有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該NRG1之外顯子5之對偶基因變異體與SEQ ID NO: 129有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該NRG1之外顯子6之對偶基因變異體與SEQ ID NO: 130有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CXADR之外顯子1之對偶基因變異體與SEQ ID NO: 219有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該GTF2E2之外顯子2之對偶基因變異體與SEQ ID NO: 236有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CSMD1之外顯子23之對偶基因變異體與SEQ ID NO: 279有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該PTN之外顯子4之對偶基因變異體與SEQ ID NO: 318有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該ST14之外顯子11之對偶基因變異體與SEQ ID NO: 342有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該THBS1之外顯子9之對偶基因變異體與SEQ ID NO: 386有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該AGRN之外顯子12之對偶基因變異體與SEQ ID NO: 416有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該PVALB之外顯子4之對偶基因變異體與SEQ ID NO: 442有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該SCL3A2之外顯子2之對偶基因變異體與SEQ ID NO: 457有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該APP之外顯子14之對偶基因變異體與SEQ ID NO: 501有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該WRN之外顯子33之對偶基因變異體與SEQ ID NO: 562有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該DAAM1之外顯子1之對偶基因變異體與SEQ ID NO: 606有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該NRG1之外顯子1之對偶基因變異體與SEQ ID NO: 125有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該ASPH之外顯子22之對偶基因變異體與SEQ ID NO: 658有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該NOTCH2之外顯子6之對偶基因變異體與SEQ ID NO: 700有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CD74之外顯子2之對偶基因變異體與SEQ ID NO: 720有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該SDC4之外顯子2之對偶基因變異體與SEQ ID NO: 746有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該CD44之外顯子5之對偶基因變異體與SEQ ID NO: 65有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該SLC4A4之外顯子14之對偶基因變異體與SEQ ID NO: 780有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該SDC4之外顯子4之對偶基因變異體與SEQ ID NO: 748有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性; - 該ZFAT之外顯子12之對偶基因變異體與SEQ ID NO: 841有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性;及 - 該DSCAML1之外顯子3之對偶基因變異體與SEQ ID NO: 872有至少85%的一致性,較佳地有至少90%、92%、94%、96%或甚至98%的一致性。 17. 如第13至16項中之任一項之多核苷酸,其中: - 該VAPB與NRG1之融合物包含來自SEQ ID NO: 3的2至約40個連續核酸,包括位置43及44處的核酸; - 該CADM1與NRG1之融合物包含來自SEQ ID NO: 7的2至約40個連續核酸,包括位置53及54處的核酸; - 該CD44與NRG1之融合物包含來自SEQ ID NO: 11的2至約40個連續核酸,包括位置52及53處的核酸; - 該SLC3A2與NRG1之融合物包含來自SEQ ID NO: 15的2至約40個連續核酸,包括位置53及54處的核酸; - 該VTCN1與NRG1之融合物包含來自SEQ ID NO: 166的2至約40個連續核酸,包括位置65及66處的核酸; - 該CDH1與NRG1之融合物包含來自SEQ ID NO: 186的2至約40個連續核酸,包括位置119及120處的核酸; - 該CXADR與NRG1之融合物包含來自SEQ ID NO: 217的2至約40個連續核酸,包括位置43及44處的核酸; - 該GTF2E2與NRG1之融合物包含來自SEQ ID NO: 233的2至約40個連續核酸,包括位置141及142處的核酸; - 該CSMD1與NRG1之融合物包含來自SEQ ID NO: 255的2至約40個連續核酸,包括位置88及89處的核酸; - 該PTN與NRG1之融合物包含來自SEQ ID NO: 313的2至約40個連續核酸,包括位置102及103處的核酸; - 該ST14與NRG1之融合物包含來自SEQ ID NO: 330的2至約40個連續核酸,包括位置95及96處的核酸; - 該THBS1與NRG1之融合物包含來自SEQ ID NO: 376的2至約40個連續核酸,包括位置56及57處的核酸; - 該AGRN與NRG1之融合物包含來自SEQ ID NO: 403的2至約40個連續核酸,包括位置106及107處的核酸; - 該PVALB與NRG1之融合物包含來自SEQ ID NO: 437的2至約40個連續核酸,包括位置102及103處的核酸; - 該SLC3A2與NRG1之融合物包含來自SEQ ID NO: 454的2至約40個連續核酸,包括位置93及94處的核酸; - 該APP與NRG1之融合物包含來自SEQ ID NO: 486的2至約40個連續核酸,包括位置54及55處的核酸; - 該WRN與NRG1之融合物包含來自SEQ ID NO: 528的2至約40個連續核酸,包括位置96及97處的核酸; - 該DAAM1與NRG1之融合物包含來自SEQ ID NO: 605的2至約40個連續核酸,包括位置75及76處的核酸; - 該ASPH與NRG1之融合物包含來自SEQ ID NO: 635的2至約40個連續核酸,包括位置75及76處的核酸; - 該NOTCH2與NRG1之融合物包含來自SEQ ID NO: 693的2至約40個連續核酸,包括位置75及76處的核酸; - 該CD74與NRG1之融合物包含來自SEQ ID NO: 717的2至約40個連續核酸,包括位置75及76處的核酸; - 該SDC4與NRG1之融合物包含來自SEQ ID NO: 743的2至約40個連續核酸,包括位置75及76處的核酸; - 該CD44與NRG1之融合物包含來自SEQ ID NO: 761的2至約40個連續核酸,包括位置75及76處的核酸; - 該SLC4A4與NRG1之融合物包含來自SEQ ID NO: 765的2至約40個連續核酸,包括位置75及76處的核酸; - 該SDC4與NRG1之融合物包含來自SEQ ID NO: 824的2至約40個連續核酸,包括位置75及76處的核酸; - 該ZFAT與NRG1之融合物包含來自SEQ ID NO: 828的2至約40個連續核酸,包括位置75及76處的核酸;以及 - 該DSCAML1與NRG1之融合物包含來自SEQ ID NO: 868的2至約40個連續核酸,包括位置75及76處的核酸。 18. 如第13至17項中之任一項之多核苷酸,其中: - 該VAPB與NRG1之融合物包含SEQ ID NO: 3或其對偶基因變異體; - 該CAD1與NRG1之融合物包含SEQ ID NO: 7或其對偶基因變異體; - 該CD44與NRG1之融合物包含SEQ ID NO: 11或其對偶基因變異體; - 該SLC3A2與NRG1之融合物包含SEQ ID NO: 15或其對偶基因變異體; - 該VTCN1與NRG1之融合物包含SEQ ID NO: 166或其對偶基因變異體; - 該CDH1與NRG1之融合物包含SEQ ID NO: 186或其對偶基因變異體; - 該CXADR與NRG1之融合物包含SEQ ID NO: 217或其對偶基因變異體; - 該GTF2E2與NRG1之融合物包含SEQ ID NO: 233或其對偶基因變異體; - 該CSMD1與NRG1之融合物包含SEQ ID NO: 255或其對偶基因變異體; - 該PTN與NRG1之融合物包含SEQ ID NO: 313或其對偶基因變異體; - 該ST14與NRG1之融合物包含SEQ ID NO: 330或其對偶基因變異體; - 該THBS1與NRG1之融合物包含SEQ ID NO: 376或其對偶基因變異體; - 該AGRN與NRG1之融合物包含SEQ ID NO: 403或其對偶基因變異體; - 該PVALB與NRG1之融合物包含SEQ ID NO: 437或其對偶基因變異體; - 該SLC3A2與NRG1之融合物包含SEQ ID NO: 454或其對偶基因變異體; - 該APP與NRG1之融合物包含SEQ ID NO: 486或其對偶基因變異體; - 該WRN與NRG1之融合物包含SEQ ID NO: 528或其對偶基因變異體; - 該DAAM1與NRG1之融合物包含SEQ ID NO: 605或其對偶基因變異體; - 該ASPH與NRG1之融合物包含SEQ ID NO: 635或其對偶基因變異體; - 該NOTCH2與NRG1之融合物包含SEQ ID NO: 693或其對偶基因變異體; - 該CD74與NRG1之融合物包含SEQ ID NO: 717或其對偶基因變異體; - 該SDC4與NRG1之融合物包含SEQ ID NO: 743或其對偶基因變異體; - 該CD44與NRG1之融合物包含SEQ ID NO: 761或其對偶基因變異體; - 該SLC4A4與NRG1之融合物包含SEQ ID NO: 765或其對偶基因變異體; - 該SDC4與NRG1之融合物包含SEQ ID NO: 824或其對偶基因變異體; - 該ZFAT與NRG1之融合物包含SEQ ID NO: 828或其對偶基因變異體;及 - 該DSCAML1與NRG1之融合物包含SEQ ID NO: 868或其對偶基因變異體。 19. 如第13至18項中之任一項之多核苷酸,其中: - 該VAPB之外顯子1部分為或包含SEQ ID NO: 1或SEQ ID NO: 1之對偶基因變異體; - 該CADM1之外顯子7部分為或包含SEQ ID NO: 5或SEQ ID NO: 5之對偶基因變異體; - 該CD44之外顯子5部分為或包含SEQ ID NO: 9或SEQ ID NO: 9之對偶基因變異體; - 該SLC3A2之外顯子1部分為或包含SEQ ID NO: 13或SEQ ID NO: 13之對偶基因變異體; - 該VTCN1之外顯子2部分為或包含SEQ ID NO: 164或SEQ ID NO: 164之對偶基因變異體; - 該CDH1之外顯子11部分為或包含SEQ ID NO: 184或SEQ ID NO: 184之對偶基因變異體; - 該CXADR之外顯子1部分為或包含SEQ ID NO: 219或SEQ ID NO: 219之對偶基因變異體; - 該GTF2E2之外顯子2部分為或包含SEQ ID NO: 236或SEQ ID NO: 236之對偶基因變異體; - 該CXADR之外顯子23部分為或包含SEQ ID NO: 279或SEQ ID NO: 279之對偶基因變異體; - 該PTN之外顯子4部分為或包含SEQ ID NO: 318或SEQ ID NO: 318之對偶基因變異體; - 該ST14之外顯子11部分為或包含SEQ ID NO: 342或SEQ ID NO: 342之對偶基因變異體; - 該THBS1之外顯子9部分為或包含SEQ ID NO: 385或SEQ ID NO: 386之對偶基因變異體; - 該AGRN之外顯子12部分為或包含SEQ ID NO: 416或SEQ ID NO: 416之對偶基因變異體; - 該PVALB之外顯子4部分為或包含SEQ ID NO: 442或SEQ ID NO: 442之對偶基因變異體; - 該SLC3A2之外顯子 2部分為或包含SEQ ID NO: 457或SEQ ID NO: 457之對偶基因變異體; - 該NRG1之外顯子2部分為或包含SEQ ID NO: 165或SEQ ID NO: 165之對偶基因變異體; - 該NRG1之外顯子5部分為或包含SEQ ID NO: 14或SEQ ID NO: 14之對偶基因變異體; - 該NRG1之外顯子6部分為或包含SEQ ID NO: 6或其對偶基因變異體; - 該APP之外顯子14部分為或包含SEQ ID NO: 484或其對偶基因變異體; - 該WRN之外顯子33部分為或包含SEQ ID NO: 526或其對偶基因變異體; - 該DAAM1之外顯子1部分為或包含SEQ ID NO: 603或其對偶基因變異體; - 該ASPH之外顯子22部分為或包含SEQ ID NO: 633或其對偶基因變異體; - 該NOTCH2之外顯子6部分為或包含SEQ ID NO: 691或其對偶基因變異體; - 該CD74之外顯子2部分為或包含SEQ ID NO: 715或其對偶基因變異體; - 該SDC4之外顯子2部分為或包含SEQ ID NO: 741或其對偶基因變異體; - 該CD44之外顯子5部分為或包含SEQ ID NO: 759或其對偶基因變異體; - 該SLC4A4之外顯子14部分為或包含SEQ ID NO: 763或其對偶基因變異體; - 該SDC4之外顯子4部分為或包含SEQ ID NO: 822或其對偶基因變異體; - 該ZFAT之外顯子12部分為或包含SEQ ID NO: 826或其對偶基因變異體; - 該DSCAML1之外顯子3部分為或包含SEQ ID NO: 866或其對偶基因變異體;及 - 該NRG1之外顯子1部分為或包含SEQ ID NO: 604或其對偶基因變異體。 20. 如第13至18項中之任一項之多核苷酸,其中: - VAPB與NRG1之融合包含一個在VAPB之外顯子1與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 3的VAPB位置43之核酸與NRG1位置44之核酸之間的接合處; - CADM1與NRG1之融合包含一個在CADM1之外顯子7與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 7的CADM1位置53之核酸與NRG1位置54之核酸之間的接合處; - CD44與NRG1之融合包含一個在CD44之外顯子5與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 11的CD44位置52之核酸與NRG1位置53之核酸之間的接合處; - SLC3A2與NRG1之融合包含一個在SLC3A2之外顯子1與NRG1之外顯子5之間的融合接合處,較佳為SEQ ID NO: 15的SLC3A2位置53之核酸與NRG1位置54之核酸之間的接合處; - VTCN1與NRG1之融合包含一個在VTCN1之外顯子2與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 166的VTCN1位置65之核酸與NRG1位置66之核酸之間的接合處; - CDH1與NRG1之融合包含一個在CDH1之外顯子11與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 186的CDH1位置119之核酸與NRG1位置120之核酸之間的接合處; - CXADR與NRG1之融合包含一個在CXADR之外顯子1與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 217的CXADR位置43之核酸與NRG1位置44之核酸之間的接合處; - GTF2E2與NRG1之融合包含一個在GTF2E2之外顯子2與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 233的GTF2E2位置141之核酸與NRG1位置142之核酸之間的接合處; - CSMD1與NRG1之融合包含一個在CSMD1之外顯子23與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 255的CSMD1位置88之核酸與NRG1位置89之核酸之間的接合處; - PTN與NRG1之融合包含一個在PTN之外顯子4與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 313的PTN位置102之核酸與NRG1位置103之核酸之間的接合處; - ST14與NRG1之融合包含一個在ST14之外顯子11與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 330的ST14位置95之核酸與NRG1位置96之核酸之間的接合處; - THBS1與NRG1之融合包含一個在THBS1之外顯子9與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 376的THBS1位置56之核酸與NRG1位置57之核酸之間的接合處; - AGRN與NRG1之融合包含一個在AGRN之外顯子12與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 403的AGRN位置106之核酸與NRG1位置107之核酸之間的接合處; - PVALB與NRG1之融合包含一個在PVALB之外顯子4與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 437的PVALB位置102之核酸與NRG1位置103之核酸之間的接合處; - SLC3A2與NRG1之融合包含一個在SLC3A2之外顯子2與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 454的SLC3A2位置93之核酸與NRG1位置94之核酸之間的接合處; - APP與NRG1之融合包含一個在APP之外顯子14與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 486的APP位置54之核酸與NRG1位置55之核酸之間的接合處; - WRN與NRG1之融合包含一個在WRN之外顯子33與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 528的WRN位置96之核酸與NRG1位置97之核酸之間的接合處; - DAAM1與NRG1之融合包含一個在DAAM1之外顯子1與NRG1之外顯子1之間的融合接合處,較佳為SEQ ID NO: 605的DAAM1位置之核酸75與NRG1位置之核酸76之間的接合處; - ASPH與NRG1之融合包含一個在ASPH之外顯子22與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 635的ASPH位置75之核酸與NRG1位置76之核酸之間的接合處; - NOTCH2與NRG1之融合包含一個在NOTCH2之外顯子6與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 693的NOTCH2位置75之核酸與NRG1位置76之核酸之間的接合處; - CD74與NRG1之融合包含一個在CD74之外顯子2與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 717的CD74位置75之核酸與NRG1位置76之核酸之間的接合處; - SDC4與NRG1之融合包含一個在SDC4之外顯子2與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 743的SDC4位置75之核酸與NRG1位置76之核酸之間的接合處; - CD44與NRG1之融合包含一個在CD44之外顯子5與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 761的VAPB位置75之核酸與NRG1位置76之核酸之間的接合處; - SLC4A4與NRG1之融合包含一個在SLC4A4之外顯子14與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 765的SLC4A4位置75之核酸與NRG1位置76之核酸之間的接合處; - SDC4與NRG1之融合包含一個在SDC4之外顯子4與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 824的SDC4位置75之核酸與NRG1位置76之核酸之間的接合處; - ZFAT與NRG1之融合包含一個在ZFAT之外顯子12與NRG1之外顯子6之間的融合接合處,較佳為SEQ ID NO: 828的ZFAT位置75之核酸與NRG1位置76之核酸之間的接合處;及 - DSCAML1與NRG1之融合包含一個在DSCAML1之外顯子3與NRG1之外顯子2之間的融合接合處,較佳為SEQ ID NO: 868的DSCAML1位置75之核酸與NRG1位置76之核酸之間的接合處。 21. 如前述各項中之任一者之多核苷酸,其中該多核苷酸係經分離的或純化的。 22. 如前述各項中之任一者之多核苷酸,其中該等融合中之任一者為符合讀框的融合。 23. 如前述各項中之任一者之多核苷酸,其中該多核苷酸為哺乳動物多核苷酸,較佳為人類多核苷酸。 24. 一種由如前述各項中之任一者之多核苷酸所編碼的多肽融合物。 25. 一種包含有如第1至23項中任一項之多核苷酸的載體。 26. 一種包含有如第1至23項中任一項之多核苷酸或如第25項之載體的重組宿主細胞。 27. 一種製造如第24項之多肽融合物的方法,包含將如第26項之宿主細胞維持在適合該宿主細胞所包含之多核苷酸表現的條件下,藉以表現該多核苷酸且產生一多肽融合物,隨後分離或純化該多肽融合物。 28. 一種製造重組宿主細胞的方法,包含將如第25項之載體導入一宿主細胞內。 29. 一種檢測測定法,其包含用於檢測如第1至23項中之任一項之多核苷酸融合物的存在之核酸探針、引子或引子對。 30. 一種用於檢測如第1至23項中任一項之多核苷酸融合物的核酸探針、引子或引子對。 31. 如第30項之核酸探針、引子或引子對,其長度為10至40個核苷酸。 32. 如第30或31項之核酸探針、引子或引子對,其中所檢測到的融合物包含: - VAPB與NRG1的融合物,包含有SEQ ID NO: 3或由其組成,且較佳地包括位置43或44的核酸; - CADM1與NRG1的融合物,包含有SEQ ID NO: 7或由其組成,且較佳地包括位置53或54的核酸; - CD44與NRG1的融合物,包含有SEQ ID NO: 11或由其組成,且較佳地包括位置52或53的核酸; - SLC3A2與NRG1的融合物,包含有SEQ ID NO: 15或由其組成,且較佳地包括位置53或54的核酸; - VTCN1與NRG1的融合物,包含有SEQ ID NO: 166或由其組成,且較佳地包括位置65或66的核酸; - CDH1與NRG1的融合物,包含有SEQ ID NO: 186或由其組成,且較佳地包括位置119或120的核酸; - CXADR與NRG1的融合物,包含有SEQ ID NO: 217或由其組成,且較佳地包括位置43或44的核酸; - GTF2E2與NRG1的融合物,包含有SEQ ID NO: 233或由其組成,且較佳地包括位置141或142的核酸; - CSMD1與NRG1的融合物,包含有SEQ ID NO: 255或由其組成,且較佳地包括位置88或89的核酸; - PTN與NRG1的融合物,包含有SEQ ID NO: 313或由其組成,且較佳地包括位置102或103的核酸; - ST14與NRG1的融合物,包含有SEQ ID NO: 330或由其組成,且較佳地包括位置95或96的核酸; - THBS1與NRG1的融合物,包含有SEQ ID NO: 376或由其組成,且較佳地包括位置56或57的核酸; - AGRN與NRG1的融合物,包含有SEQ ID NO: 403或由其組成,且較佳地包括位置106或107的核酸; - PVALB與NRG1的融合物,包含有SEQ ID NO: 437或由其組成,且較佳地包括位置102或103的核酸; - SLC3A2與NRG1的融合物,包含有SEQ ID NO: 454或由其組成,且較佳地包括位置93或94的核酸; - APP與NRG1的融合物,包含有SEQ ID NO: 486或由其組成,且較佳地包括位置54或55的核酸; - WRN與NRG1的融合物,包含有SEQ ID NO: 528或由其組成,且較佳地包括位置96或97的核酸; - DAAM1與NRG1的融合物,包含有SEQ ID NO: 605或由其組成,且較佳地包括位置75或76的核酸; - ASPH與NRG1的融合物,包含有SEQ ID NO: 635或由其組成,且較佳地包括位置75或76的核酸; - NOTCH2與NRG1的融合物,包含有SEQ ID NO: 693或由其組成,且較佳地包括位置75或76的核酸; - CD74與NRG1的融合物,包含有SEQ ID NO: 717或由其組成,且較佳地包括位置75或76的核酸; - SDC4與NRG1的融合物,包含有SEQ ID NO: 743或由其組成,且較佳地包括位置75或76的核酸; - CD44與NRG1的融合物,包含有SEQ ID NO: 761或由其組成,且較佳地包括位置75或76的核酸; - SLC4A4與NRG1的融合物,包含有SEQ ID NO: 765或由其組成,且較佳地包括位置75或76的核酸; - SDC4與NRG1的融合物,包含有SEQ ID NO: 824或由其組成,且較佳地包括位置75或76的核酸; - ZFAT與NRG1的融合物,包含有SEQ ID NO: 828或由其組成,且較佳地包括位置75或76的核酸;以及 - DSCAML1與NRG1的融合物,包含有SEQ ID NO: 868或由其組成,且較佳地包括位置75或76的核酸。 33. 如第30至32中任一項之核酸探針、引子或引子對,其中: - 用以檢測VAPB與NRG1之融合的探針、引子或引子對係特異性地與一由VAPB之外顯子1所組成之序列或位於外顯子1的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CADM1與NRG1之融合的探針、引子或引子對係特異性地與一由CADM1之外顯子7所組成之序列或位於外顯子7的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD44 與NRG1之融合的探針、引子或引子對係特異性地與一由CD44 之外顯子5所組成之序列或位於外顯子5的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC3A2之轉錄本6與NRG1之融合的探針、引子或引子對係特異性地與一由SLC3A2之外顯子1所組成之序列或位於外顯子1的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子5所組成之序列或位於外顯子5的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測VTCN1與NRG1之融合的探針、引子或引子對係特異性地與一由VTCN1之外顯子2所組成之序列或位於外顯子2的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CDH1與NRG1之融合的探針、引子或引子對係特異性地與一由CDH1之外顯子11所組成之序列或位於外顯子11的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CXADR與NRG1之融合的探針、引子或引子對係特異性地與一由CXADR之外顯子1所組成之序列或位於外顯子1的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測GTF2E2與NRG1之融合的探針、引子或引子對係特異性地與一由GTF2E2之外顯子2所組成之序列或位於外顯子2的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CSMD1與NRG1之融合的探針、引子或引子對係特異性地與一由CSMD1之外顯子23所組成之序列或位於外顯子23的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測PTN與NRG1之融合的探針、引子或引子對係特異性地與一由PTN之外顯子4所組成之序列或位於外顯子4的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ST14與NRG1之融合的探針、引子或引子對係特異性地與一由ST14之外顯子11所組成之序列或位於外顯子11的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測THBS1與NRG1之融合的探針、引子或引子對係特異性地與一由THBS1之外顯子9所組成之序列或位於外顯子9的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測AGRN與NRG1之融合的探針、引子或引子對係特異性地與一由AGRN之外顯子12所組成之序列或位於外顯子12的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測PVALB與NRG1之融合的探針、引子或引子對係特異性地與一由PVALB之外顯子4所組成之序列或位於外顯子4的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC3A2之轉錄本3與NRG1之融合的探針、引子或引子對係特異性地與一由SLC3A2之外顯子2所組成之序列或位於外顯子2的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測APP與NRG1之融合的探針、引子或引子對係特異性地與一由APP之外顯子14所組成之序列或位於外顯子14的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測WRN與NRG1之融合的探針、引子或引子對係特異性地與一由WRN之外顯子33所組成之序列或位於外顯子33的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測DAAM1與NRG1之融合的探針、引子或引子對係特異性地與一由DAAM1之外顯子1所組成之序列或位於外顯子1的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子1所組成之序列或位於外顯子1的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ASPH與NRG1之融合的探針、引子或引子對係特異性地與一由ASPH之外顯子22所組成之序列或位於外顯子22的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測NOTCH2與NRG1之融合的探針、引子或引子對係特異性地與一由NOTCH2之外顯子6所組成之序列或位於外顯子6的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD74與NRG1之融合的探針、引子或引子對係特異性地與一由CD74之外顯子2所組成之序列或位於外顯子2的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SDC4與NRG1之融合的探針、引子或引子對係特異性地與一由SDC4之外顯子2所組成之序列或位於外顯子2的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD44與NRG1之融合的探針、引子或引子對係特異性地與一由CD44之外顯子5所組成之序列或位於外顯子5的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC4A4與NRG1之融合的探針、引子或引子對係特異性地與一由SLC4A4之外顯子14所組成之序列或位於外顯子14的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SDC4與NRG1之融合的探針、引子或引子對係特異性地與一由SDC4之外顯子4所組成之序列或位於外顯子4的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ZFAT與NRG1之融合的探針、引子或引子對係特異性地與一由ZFAT之外顯子12所組成之序列或位於外顯子12的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子6所組成之序列或位於外顯子6的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性;或 - 用以檢測DSCAML1與NRG1之融合的探針、引子或引子對係特異性地與一由DSCAML1之外顯子3所組成之序列或位於外顯子3的5’處之序列雜交,及/或特異性地與一由NRG1之外顯子2所組成之序列或位於外顯子2的3’處之序列雜交,或是與該等序列有95%或更高的互補序列一致性。 34. 如第33項之核酸探針、引子或引子對,其中: - 來自VAPB之外顯子1包含SEQ ID NO: 17或其對偶基因變異體,或是由其組成; - 來自CADM1之外顯子7包含SEQ ID NO: 39或其對偶基因變異體,或是由其組成; - 來自CD44之外顯子5包含SEQ ID NO: 65或其對偶基因變異體,或是由其組成; - 來自SLC3A2之外顯子1包含SEQ ID NO: 103或其對偶基因變異體,或是由其組成; - 來自VTCN1之外顯子2包含SEQ ID NO: 169或其對偶基因變異體,或是由其組成; - 來自CDH1之外顯子11包含SEQ ID NO: 198或其對偶基因變異體,或是由其組成; - 來自CXADR之外顯子1包含SEQ ID NO: 219或其對偶基因變異體,或是由其組成; - 來自GTF2E2之外顯子2包含SEQ ID NO: 236或其對偶基因變異體,或是由其組成; - 來自CSMD1之外顯子23包含SEQ ID NO: 279或其對偶基因變異體,或是由其組成; - 來自PTN之外顯子4包含SEQ ID NO: 318或其對偶基因變異體,或是由其組成; - 來自ST14之外顯子11包含SEQ ID NO: 342或其對偶基因變異體,或是由其組成; - 來自THBS1之外顯子9包含SEQ ID NO: 386或其對偶基因變異體,或是由其組成; - 來自AGRN之外顯子12包含SEQ ID NO: 416或其對偶基因變異體,或是由其組成; - 來自PVALB之外顯子4包含SEQ ID NO: 442或其對偶基因變異體,或是由其組成; - 來自SLC3A2之外顯子2包含SEQ ID NO: 457或其對偶基因變異體,或是由其組成; - 來自APP之外顯子14包含SEQ ID NO: 501或其對偶基因變異體,或是由其組成; - 來自WRN之外顯子33包含SEQ ID NO: 562或其對偶基因變異體,或是由其組成; - 來自DAAM1之外顯子1包含SEQ ID NO: 606或其對偶基因變異體,或是由其組成; - 來自ASPH之外顯子22包含SEQ ID NO: 658或其對偶基因變異體,或是由其組成; - 來自NOTCH2之外顯子6包含SEQ ID NO: 700或其對偶基因變異體,或是由其組成; - 來自CD74之外顯子2包含SEQ ID NO: 720或其對偶基因變異體,或是由其組成; - 來自SDC4之外顯子2包含SEQ ID NO: 746或其對偶基因變異體,或是由其組成; - 來自CD44之外顯子5包含SEQ ID NO: 65或其對偶基因變異體,或是由其組成; - 來自SLC4A4之外顯子14包含SEQ ID NO: 780或其對偶基因變異體,或是由其組成; - 來自SDC4之外顯子4包含SEQ ID NO: 748或其對偶基因變異體,或是由其組成; - 來自ZFAT之外顯子12包含SEQ ID NO: 841或其對偶基因變異體,或是由其組成; - 來自DSCAML1之外顯子3包含SEQ ID NO: 872或其對偶基因變異體,或是由其組成;以及 - 來自NRG1之外顯子1、2、5及6分別包含SEQ ID NO: 125、126、129及130或其對偶基因變異體,或是由其組成。 35. 如第33項之核酸探針、引子或引子對,其中: - 用以檢測VAPB與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 17所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CADM1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 57所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD44與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 99所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC3A2 與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 103所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 157所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; -用以檢測VTCN1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 181所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CDH1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 213所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CXADR與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 219所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測GTF2E2與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 252所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CSMD1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 309所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測PTN與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 326所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ST14與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 372所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測THBS1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 399所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測AGRN與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 433所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測PVALB與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 450所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC3A2 與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 482所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測APP與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 524所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測WRN與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 601所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測DAAM1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 606所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 138所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ASPH與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 689所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測NOTCH2與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 713所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD74與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 739所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SDC4與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 757所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測CD44與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 99所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SLC4A4與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 820所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測SDC4與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 940所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性; - 用以檢測ZFAT與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 864所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 155所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性;及 - 用以檢測DSCAML1與NRG1之融合的探針、引子或引子對係特異性地與一由SEQ ID NO: 938所組成之序列雜交或其對偶基因變異體雜交,及/或特異性地與一由SEQ ID NO: 153所組成之序列雜交,或是與該等序列有95%或更高的互補序列一致性。 36. 一種供使用於原位雜交測定法以檢測如第1至23項中任一項之多核苷酸融合物的第一核酸探針及第二核酸探針, - 其中該第一探針特異性地與位於SEQ ID NO: 3之位置43之核酸5’處的VAPB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 3之位置44之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 7之位置53之核酸5’處的CADM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 7之位置54之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 11之位置52之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 11之位置53之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 15之位置53之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 15之位置54之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 166之位置65之核酸5’處的VTCN1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 166之位置66之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 186之位置119之核酸5’處的CDH1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 186之位置120之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 217之位置43之核酸5’處的CXADR序列雜交,且該第二探針特異性地與位於SEQ ID NO: 217之位置44之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 233之位置141之核酸5’處的GTF2E2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 233之位置142之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 255之位置88之核酸5’處的CSMD1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 255之位置89之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 313之位置102之核酸5’處的PTN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 313之位置103之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 330之位置95之核酸5’處的ST14序列雜交,且該第二探針特異性地與位於SEQ ID NO: 330之位置96之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 376之位置56之核酸5’處的THBS1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 376之位置57之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 403之位置106之核酸5’處的AGRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 403之位置107之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 437之位置102之核酸5’處的PVALB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 437之位置103之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 454之位置93之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 454之位置94之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 486之位置54之核酸5’處的APP序列雜交,且該第二探針特異性地與位於SEQ ID NO: 486之位置55之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 528之位置96之核酸5’處的WRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 528之位置97之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 605之位置75之核酸5’處的DAAM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 605之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 635之位置75之核酸5’處的ASPH序列雜交,且該第二探針特異性地與位於SEQ ID NO: 635之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 693之位置75之核酸5’處的NOTCH2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 693之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 717之位置75之核酸5’處的CD74序列雜交,且該第二探針特異性地與位於SEQ ID NO: 717之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 743之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 743之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 761之位置75之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 761之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 765之位置75之核酸5’處的SLC4A4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 765之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 824之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 824之位置76之核酸3’處的NRG1序列雜交; - 其中該第一探針特異性地與位於SEQ ID NO: 828之位置75之核酸5’處的ZFAT序列雜交,且該第二探針特異性地與位於SEQ ID NO: 828之位置76之核酸3’處的NRG1序列雜交;或 - 其中該第一探針特異性地與位於SEQ ID NO: 868之位置75之核酸5’處的DSCAML1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 868之位置76之核酸3’處的NRG1序列雜交。 37. 一種用於檢測如第1至23項中任一項之多核苷酸融合物所編碼的多肽之第一抗體或第一與第二抗體對組。 38. 一種包含有用於檢測如第1至23項中任一項之多核苷酸融合物所編碼之多肽的存在之第一抗體或第一與第二抗體組的檢測測定法,其中該第一抗體或第一與第二抗體組較佳為如第34項之第一抗體或第一與第二抗體組。 39. 如第38項之第一抗體或第一與第二抗體組,或如第32項之檢測測定法,其中該第一抗體係結合選自VAPB-NRG1、CADM1-NRG1、CD44-NRG1、SLC3A2-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SDC4-NRG1、SLC4A4-NRG1、ZFAT-NRG1或DSCAML1-NRG1之多肽融合物,而該第一與第二抗體組係分別結合VAPB與NRG1或CADM1與NRG1、或CD44與NRG1、SLC3A2與NRG1、CDH1與NRG1、CXADR與NRG1、GTF2E2與NRG1、CSMD1與NRG1、PTN與NRG1、ST14與NRG1、THBS1與NRG1、AGRN與NRG1、PVALB與NRG1、APP與NRG1、WRN與NRG1、ASPH與NRG1、NOTCH2與NRG1、CD74與NRG1、SDC4與NRG1、SLC4A4與NRG1、ZFAT與NRG1、或DSCAML1與NRG1。 40. 一種辨識一樣本中之如第1至23項中任一項之多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。 41. 一種於樣本中檢測如第1至23項中任一項之多核苷酸融合物、或由其所編碼之多肽是否存在的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。 42. 一種確定來自個體之異常細胞是否包含如第1至23項中任一項之融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自該個體之異常細胞之多核苷酸或多肽內容物是否於樣本中存在該融合物。 43. 一種辨識一個體帶有如第1至23項中任一項之多核苷酸融合物、或由其所編碼之多肽的方法,所述方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。 44. 如第37至40項中任一項之方法,其中該檢驗包含利用諸如第31至36項中任一項之核酸探針、引子或引子對之結合劑特異性地結合該多核苷酸或由其所編碼之多肽以檢測該融合物,或利用結合一包含有該多核苷酸融合物之多核苷酸的結合劑以檢測該融合物。 45. 如第40至44項中任一項之方法,其中該檢驗包含擴增或檢測一用於區別該多核苷酸融合物或由其所編碼之多肽是否存在的序列。 46. 如第41至45項中任一項之方法,其中該多核苷酸融合物係獲自一會表現包含有NRG1之EGF樣結構域之多核苷酸融合物的異常細胞。 47. 如第40至46項中任一項之方法,其中該方法包含從一個體獲得樣本之步驟,隨後為從該樣本中分離該多核苷酸或由其所編碼之多肽的步驟。 48. 如第40至47項中任一項之方法,其中該方法包含從該樣本純化或分離該多核苷酸的步驟。 49. 如第40至48項中任一項之方法,其中該結合劑係為或包含引子、引子對、探針或抗體。 50. 如第40至49項中任一項之方法,其中該檢驗為一種體外方法,較佳為一種活體外方法。 51. 如第40至50項中任一項之方法,其中該結合劑包含或與一可檢測標籤物相關聯。 52. 如第40至51項中任一項之方法,其中該樣本為液態檢體樣本或固態樣本,諸如經福馬林固定之石蠟包埋組織(FFPE)樣本。 53. 如第40至52項中任一項之方法,其中該樣本包含血液、血清、血漿、胸水、尿液、精液、羊水或腹水。 54. 如第40至53項中任一項之方法,其中該樣本包含變異細胞,諸如腫瘤細胞或癌細胞,或其多核苷酸或多肽內容物。 55. 一種治療患有包含了多核苷酸融合物及/或表現了由其所編碼之融合多肽之ErbB-2及/或ErbB-3陽性癌症或腫瘤之個體的方法,所述方法包含向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑,其中該融合物為如第1至23項中任一項之融合物。 56. 一種抑制患有包含了多核苷酸融合物及/或表現了由其所編碼之融合多肽之ErbB-2及ErbB-3陽性癌症或腫瘤之個體病程的方法,所述方法包含向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑,其中所述融合物為如第1至23項中任一項之融合物。 57. 一種供使用於患有包含了多核苷酸融合物及/或表現了由其所編碼之融合多肽之ErbB-2及ErbB-3陽性癌症或腫瘤之個體的治療中的ErbB-2及/或ErbB-3標靶劑,所述治療包含向該個體投與有效量之ErbB-2及/或ErbB-3標靶劑,其中所述融合物為如第1至23項中任一項之融合物。 58. 一種診斷個體是否有包含如第1至23項中任一項之多核苷酸融合物或由其所編碼之多肽的異常細胞之方法,該方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在該融合物。 59. 如第58項之方法,其中該檢驗包含利用諸如第30至35項中任一項之核酸探針、引子或引子對之結合劑特異性地結合該多核苷酸或由其所編碼之多肽以檢測該融合物,或利用結合一包含有該多核苷酸融合物之多核苷酸的結合劑以檢測該融合物。 60. 一種評估個體是否罹患癌症或腫瘤或是易於罹患癌症或腫瘤的方法,該方法包含檢驗獲自一個體之樣本以檢測該樣本中是否存在如第1至23項中任一項之多核苷酸融合物或由其所編碼之多肽,且藉由辨識所述多核苷酸或多肽融合物是否存在來評估所述個體罹患所述癌症或腫瘤或是易於罹患所述癌症或腫瘤。 61. 如第55至58項中任一項之方法或用途,其中該ErbB-2及/或ErbB-3標靶劑係選自由以下所組成之群組:包含一結合ErbB-2之胞外部分之第一抗原結合位點及一結合ErbB-3之胞外部分之第二抗原結合位點的多特異性抗體、ErbB-2之酪胺酸激酶抑制劑、包含有一結合ErbB-2之胞外部分之抗原結合位點的單特異性雙價抗體、包含有一結合ErbB-3之胞外部分之抗原結合位點的單特異性雙價抗體、或其任何組合。 62. 如第55至58、或61項中任一項之方法或用途,其中該ErbB-2及/或ErbB-3標靶劑為澤妥珠單抗(zenocutuzumab)。 63. 如第41至62項中任一項之方法或用途,其中該異常細胞、癌細胞或腫瘤細胞係包含如第1至23項中任一項之多核苷酸融合物或由其編碼之多肽,且其中該細胞所包含之多核苷酸融合物進一步包含符合讀框的融合一編碼NRG1之EGF樣結構域的編碼序列。 64. 如第41至63項中任一項之方法或用途,其中該異常細胞係來自癌症,尤其所述癌症為腺癌,更尤其是黏液腺癌、胰臟癌,更尤其是胰臟腺癌,更尤其是胰臟導管腺癌、腎細胞癌、肉瘤、膀胱癌、大腸癌、直腸癌、大腸直腸癌、膽囊癌、頭頸癌、前列腺癌、子宮癌、乳癌、卵巢癌、肝癌、子宮內膜癌、肺癌,較佳為非小細胞肺癌,較佳地、更佳地為浸潤性黏液腺癌、或是原發性癌或轉移性癌。 65. 一種包含如第1至23項中任一項之多核苷酸融合物及/或表現由其所編碼之多肽融合物的活體內動物模型,其中由該動物模型所組成之多核苷酸融合物或多肽融合物較佳地係由存在於該動物模型中之移植異常細胞所組成,或是由該動物模型的基因體所組成。 66. 一種以Erb2及/或Erb3標靶劑治療如第66項之活體內動物模型的方法,該標靶劑係選自由以下所組成之群組:包含一結合ErbB-2之胞外部分的第一抗原結合位點及一結合ErbB-3之胞外部分的第二抗原結合位點之多特異性抗體、ErbB-2之酪胺酸激酶抑制劑、包含有一結合ErbB-2之胞外部分的抗原結合位點之單特異性雙價抗體、包含有一結合ErbB-3之胞外部分的抗原結合位點之單特異性雙價抗體、或其任何組合,所述方法包含向該動物投與所述Erb2及/或Erb3標靶劑。 67. 一種供使用於原位雜交測定法以檢測VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1 CXADR、GTF2E2、CSMD1、PTN、ST14、THBS1、AGRN、PVALB、APP、WRN、DAAM1、ASPH、NOTCH2、CD74、SDC4、SLC4A4、ZFAT或DSCAML1之基因重排的第一核酸探針及第二核酸探針,其中: - 用於檢測VAPB之基因重排的第一探針特異性地與SEQ ID NO: 1之位置43之核酸5’處的VAPB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 1之位置43之核酸3’處的VAPB序列雜交; - 用於檢測CADM1之基因重排的第一探針特異性地與SEQ ID NO: 5之位置53之核酸5’處的CADM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 5之位置53之核酸3’處的CADM1序列雜交; - 用於檢測CD44之基因重排的第一探針特異性地與SEQ ID NO: 9之位置52之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 9之位置52之核酸3’處的CD44序列雜交; - 用於檢測SLC3A2之基因重排的第一探針特異性地與SEQ ID NO: 13之位置53之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 13之位置53之核酸3’處的SLC3A2序列雜交; - 用於檢測VTCN1之基因重排的第一探針特異性地與SEQ ID NO: 164之位置65之核酸5’處的VTCN1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 164之位置65之核酸3’處的VTCN1序列雜交; - 用於檢測CDH1之基因重排的第一探針特異性地與SEQ ID NO: 184之位置119之核酸5’處的CDH1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 184之位置119之核酸3’處的CDH1序列雜交; - 用於檢測CXADR之基因重排的第一探針特異性地與SEQ ID NO: 215之位置43之核酸5’處的CXADR序列雜交,且該第二探針特異性地與位於SEQ ID NO: 215之位置43之核酸3’處的CXADR序列雜交; - 用於檢測GTF2E2之基因重排的第一探針特異性地與SEQ ID NO: 231之位置141之核酸5’處的GTF2E2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 231之位置141之核酸3’處的GTF2E2序列雜交; - 用於檢測CSMD1之基因重排的第一探針特異性地與SEQ ID NO: 253之位置88之核酸5’處的CSMD1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 253之位置88之核酸3’處的CSMD1序列雜交; - 用於檢測PTN之基因重排的第一探針特異性地與SEQ ID NO: 311之位置102之核酸5’處的PTN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 311之位置102之核酸3’處的PTN序列雜交; - 用於檢測ST14之基因重排的第一探針特異性地與SEQ ID NO: 328之位置95之核酸5’處的ST14序列雜交,且該第二探針特異性地與位於SEQ ID NO: 328之位置95之核酸3’處的ST14序列雜交; - 用於檢測AGRN之基因重排的第一探針特異性地與SEQ ID NO: 401之位置106之核酸5’處的AGRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 401之位置106之核酸3’處的AGRN序列雜交; - 用於檢測THBS1之基因重排的第一探針特異性地與SEQ ID NO: 374之位置56之核酸5’處的THBS1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 374之位置56之核酸3’處的THBS1序列雜交; - 用於檢測PVALB之基因重排的第一探針特異性地與SEQ ID NO: 435之位置102之核酸5’處的PVALB序列雜交,且該第二探針特異性地與位於SEQ ID NO: 435之位置102之核酸3’處的PVALB序列雜交; - 用於檢測SLC3A2之基因重排的第一探針特異性地與SEQ ID NO: 452之位置93之核酸5’處的SLC3A2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 452之位置93之核酸3’處的SLC3A2序列雜交; - 用於檢測APP之基因重排的第一探針特異性地與SEQ ID NO: 484之位置54之核酸5’處的APP序列雜交,且該第二探針特異性地與位於SEQ ID NO: 484之位置54之核酸3’處的APP序列雜交; - 用於檢測WRN之基因重排的第一探針特異性地與SEQ ID NO: 526之位置96之核酸5’處的WRN序列雜交,且該第二探針特異性地與位於SEQ ID NO: 526之位置96之核酸3’處的WRN序列雜交; - 用於檢測DAAM1之基因重排的第一探針特異性地與SEQ ID NO: 603之位置75之核酸5’處的DAAM1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 603之位置75之核酸3’處的DAAM1序列雜交; - 用於檢測ASPH之基因重排的第一探針特異性地與SEQ ID NO: 633之位置75之核酸5’處的ASPH序列雜交,且該第二探針特異性地與位於SEQ ID NO: 633之位置75之核酸3’處的ASPH序列雜交; - 用於檢測NOTCH2之基因重排的第一探針特異性地與SEQ ID NO: 691之位置75之核酸5’處的NOTCH2序列雜交,且該第二探針特異性地與位於SEQ ID NO: 691之位置75之核酸3’處的NOTCH2序列雜交; - 用於檢測CD74之基因重排的第一探針特異性地與SEQ ID NO: 715之位置75之核酸5’處的CD74序列雜交,且該第二探針特異性地與位於SEQ ID NO: 715之位置75之核酸3’處的CD74序列雜交; - 用於檢測SDC4之基因重排的第一探針特異性地與SEQ ID NO: 741之位置75之核酸5’處的SDC4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 741之位置75之核酸3’處的SDC4序列雜交; - 用於檢測CD44之基因重排的第一探針特異性地與SEQ ID NO: 759之位置75之核酸5’處的CD44序列雜交,且該第二探針特異性地與位於SEQ ID NO: 759之位置75之核酸3’處的CD44序列雜交; - 用於檢測SLC4A4之基因重排的第一探針特異性地與SEQ ID NO: 763之位置75之核酸5’處的SLC4A4序列雜交,且該第二探針特異性地與位於SEQ ID NO: 763之位置75之核酸3’處的SLC4A4序列雜交; - 用於檢測SDC4 之基因重排的第一探針特異性地與SEQ ID NO: 822之位置75之核酸5’處的SDC4 序列雜交,且該第二探針特異性地與位於SEQ ID NO: 822之位置75之核酸3’處的SDC4 序列雜交; - 用於檢測ZFAT之基因重排的第一探針特異性地與SEQ ID NO: 826之位置75之核酸5’處的ZFAT序列雜交,且該第二探針特異性地與位於SEQ ID NO: 826之位置75之核酸3’處的ZFAT序列雜交;或 - 用於檢測DSCAML1之基因重排的第一探針特異性地與SEQ ID NO: 866之位置75之核酸5’處的DSCAML1序列雜交,且該第二探針特異性地與位於SEQ ID NO: 866之位置75之核酸3’處的DSCAML1序列雜交。 實例 實例 1 :檢驗樣本製備 The following items describe exemplary embodiments. 1. A polynucleotide comprising: a VALB nucleic acid sequence or an allele variant of the VALB sequence fused with a NRG1 nucleic acid sequence or an allele variant of the NRG1 sequence. 2. The polynucleotide according to item 1, wherein the VAPB nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 17-23, or a pair comprising any one of SEQ ID NOs: 17-23 A gene variant or consists of it, and the NRG1 nucleic acid sequence comprises or consists of any one of SEQ ID NO: 125-138. 3. The polynucleotide according to item 1 or 2, wherein the VALB nucleic acid sequence (or its allele variant) is located at the 5' end of the NRG1 nucleic acid sequence (or its allele variant). 4. The polynucleotide according to any one of the preceding items, wherein the allele variant of the VAPB nucleic acid sequence is at least 85% identical to any one of SEQ ID NO: 17-23, preferably At least 90% identity, more preferably at least 95% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138, compared Preferably at least 90% identity, more preferably at least 95% sequence identity. 5. The polynucleotide of any one of the preceding items, wherein the fusion of the VALB nucleic acid and the NRG1 nucleic acid comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 3, preferably including positions 102 and Nucleic acid at 103 6. The polynucleotide according to any one of the preceding items, wherein the nucleic acid encoding the NRG1 protein sequence (or its allele variant) comprises or encodes an EGF-like domain of NRG1, preferably An EGF-like domain such as SEQ ID NO: 163. 7. A polynucleotide comprising: - a PVALB nucleic acid sequence or an allele variant of the PVALB sequence fused to a NRG1 nucleic acid sequence or an allele variant of the NRG1 sequence, or - an ASPH nucleic acid sequence or the allele variant An allele variant of an ASPH sequence fused to an NRG1 nucleic acid sequence or an allele variant of the NRG1 sequence, or - a DAAM1 nucleic acid sequence or an allele variant of the DAAM1 sequence fused to an NRG1 nucleic acid sequence or The allogenic variant fusion of the NRG1 sequence, or - a ZFAT nucleic acid sequence or the allogenic variant of the ZFAT sequence fused to a NRG1 nucleic acid sequence or the allogenic variant of the NRG1 sequence, or - a DACAML1 nucleic acid sequence Or the allogeneic variant of the DSCAML1 sequence fused to a NRG1 nucleic acid sequence or the allogeneic variant of the NRG1 sequence. 8. The polynucleotide according to item 7, wherein- the PVALB nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 439-444, or comprises any one of SEQ ID NOs: 439-444 An allele variant or consists of it, and the NRG1 nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 125-138; - the DAAM1 nucleic acid sequence comprises any one of SEQ ID NOs: 606-631 Or consist of, or comprise or consist of any allele variant of any one of SEQ ID NOs: 606-631, and the NRG1 nucleic acid sequence comprises any one of SEQ ID NOs: 125-138 or consist of it Composition; - the ZFAT nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 830-846, or comprises or consists of any allele variant of any one of SEQ ID NOs: 830-846, and The NRG1 nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 125-138; or - the DSCAML1 nucleic acid sequence comprises any one of SEQ ID NOs: 870-903 or consists of it, or comprises SEQ ID The allele variant of any one of NO: 870-903 or consists of it, and the NRG1 nucleic acid sequence comprises or consists of any one of SEQ ID NOs: 125-138. 9. The polynucleotide according to item 7 or 8, wherein the PVALB, DAAM1, ZFAT or DSCAML1 nucleic acid sequence (or its allele variant) is located at the 5' end of the NRG1 nucleic acid sequence (or its allele variant). 10. The polynucleotide according to any one of clauses 7-9, wherein- the allele variant of the PVALB nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 439-444, compared to Preferably at least 90% identity, more preferably at least 95% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 125-138 , preferably at least 90% identity, more preferably at least 95% sequence identity; - the allele variant of the DAAM1 nucleic acid sequence has at least 85% identity with any one of SEQ ID NO: 606-631 Identity, preferably at least 90% identity, more preferably at least 95% sequence identity; and the allele variant of the NRG1 nucleic acid sequence has at least 85% of any one of SEQ ID NO: 125-138 % identity, preferably at least 90% identity, more preferably at least 95% sequence identity; - the allele variant of the ZFAT nucleic acid sequence has any one of them in SEQ ID NO: 830-846 At least 85% identity, preferably at least 90% identity, more preferably at least 95% sequence identity; and the allele variant of the NRG1 nucleic acid sequence is identical to any one of SEQ ID NO: 125-138 There is at least 85% identity, preferably at least 90% identity, more preferably at least 95% sequence identity; or- the allele variant of the DSCAML1 nucleic acid sequence is identical to that of SEQ ID NO: 870-903 Either one has at least 85% identity, preferably at least 90% identity, more preferably at least 95% sequence identity; and the allele variant of the NRG1 nucleic acid sequence is identical to SEQ ID NO: 125- Any of 138 have at least 85% identity, preferably at least 90% identity, more preferably at least 95% sequence identity. 11. The polynucleotide according to any one of clauses 7 to 10, wherein: - the fusion of the PVALB nucleic acid and the NRG1 nucleic acid comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 437, preferably including the nucleic acids at positions 102 and 103; - the fusion of the DAAM1 nucleic acid and the NRG1 nucleic acid comprises from 2 to about 40 contiguous nucleic acids from SEQ ID NO: 605, preferably including the nucleic acids at positions 75 and 76; - the The fusion of the ZFAT nucleic acid and the NRG1 nucleic acid comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 828, preferably including the nucleic acids at positions 75 and 76; or - the fusion of the DSCAML1 nucleic acid and the NRG1 nucleic acid comprises From 2 to about 40 contiguous nucleic acids from SEQ ID NO: 868, preferably including the nucleic acids at positions 75 and 76. 12. The polynucleotide according to any one of items 7 to 11, wherein the nucleic acid encoding the NRG1 protein sequence (or its allele variant) comprises or encodes the EGF-like domain of NRG1, preferably as SEQ EGF-like domain of ID NO: 163. 13. A polynucleotide comprising: - a part of exon 1 or an allogenic variant of exon 1 of VAPB fused to a part of exon 2 or an allogenic variant of exon 2 of NRG1 ; - CADM1 exon 7 or part of an allogenic variant of exon 7, and NRG1 exon 6 or part of an allogenic variant of exon 6; - CD44 exon 5 or part of an allogenic variant Part of the allele variant of exon 5, part of the allele variant of exon 2 or exon 2 of NRG1; - allele of exon 1 or exon 1 of transcript 6 of SLC3A2 Part of a variant, part of an allogenic variant with exon 5 or exon 5 of NRG1; - Part of an allogenic variant of exon 2 or exon 2 of VTCN1, part of an allogenic variant with exon 5 of NRG1 Exon 2 or part of an allele variant of exon 2; - part of an allele variant of exon 11 or exon 11 of CDH1, allele of exon 2 or exon 2 of NRG1 Part of a variant; - part of an allogenic variant of exon 1 or exon 1 of CXADR, part of an allogenic variant of exon 2 or exon 2 of NRG1; - exon 2 of GTF2E2 Exon 2 or part of an allele variant of exon 2, and part of an allele variant of NRG1 exon 2 or exon 2; - CSMD1 exon 23 or part of an allele of exon 23 Part of a variant, with NRG1 exon 6 or part of an allogenic variant of exon 6; - Part of a PTN exon 4 or part of an allogenic variant of exon 4, with NRG1 exon Exon 2 or part of an allele variant of exon 2; - ST14 exon 11 or part of an allele variant of exon 11, and NRG1 exon 6 or part of an allele variant of exon 6 part of a variant; - THBS1 exon 9 or part of an allogenic variant of exon 9, and NRG1 exon 6 or part of an allogenic variant of exon 6; - AGRN exon Exon 12 or part of an allele variant of exon 12, and NRG1 exon 6 or part of an allele variant of exon 6; - PVALB exon 4 or part of an allele variant of exon 4 Part of the variant, with NRG1 exon 6 or part of the allogenic variant of exon 6; - Transcript 3 exon 2 of SLC3A2 or part of the allogenic variant of exon 2, with NRG1 exon 6 or part of an allogenic variant of exon 6; - APP exon 14 or part of an allogenic variant of exon 14, and NRG1 exon 6 or part of an allogenic variant Part of an allogenic variant of exon 6; - WRN exon 33 or part of an allogenic variant of exon 33, and NRG1 exon 6 or part of an allogenic variant of exon 6; - DAAM1 exon 1 or part of an allogenic variant of exon 1, and NRG1 exon 1 or part of an allogenic variant of exon 1; - ASPH exon 22 or part of an allogenic variant Part of an allogenic variant of 22, with NRG1 exon 2 or part of an allogenic variant of exon 2; - NOTCH2 exon 6 or part of an allogenic variant of exon 6, with NRG1 exon 6 or part of an allogenic variant of exon 6; - part of an allogenic variant of CD74 exon 2 or exon 2, and NRG1 exon 2 or part of an allogenic variant part of an allogenic variant of exon 2 of SDC4; part of an allogenic variant of exon 2 or exon 2 of SDC4, and part of an allogenic variant of exon 2 or exon 2 of NRG1;- CD44 exon 5 or part of an allogenic variant of exon 5, and NRG1 exon 6 or part of an allogenic variant of exon 6; - SLC4A4 exon 14 or part of an allogenic variant Part of an allogenic variant of 14, and part of an allogenic variant of exon 6 or exon 6 of NRG1; - part of an allogenic variant of exon 4 or exon 4 of SDC4, and NRG1 exon 2 or part of an allogenic variant of exon 2; - ZFAT exon 12 or part of an allogenic variant of exon 12, with NRG1 exon 6 or part of an allogenic variant part of an allogenic variant of exon 6; or - part of an allogenic variant of exon 3 or exon 3 of DSCAML1 and part of an allogenic variant of exon 2 or exon 2 of NRG1. 14. The polynucleotide as in item 13, wherein exon 1 of VAPB is exon of SEQ ID NO: 17; exon 7 of CADM1 is exon of SEQ ID NO: 39; exon of CD44 Son 5 is the exon of SEQ ID NO: 65; SLC3A2 exon 1 is the exon of SEQ ID NO: 103; VTCN1 exon 2 is the exon of SEQ ID NO: 169; CDH1 Exon 11 is the exon of SEQ ID NO: 198; exon 1 of CXADR is the exon of SEQ ID NO: 219; exon 2 of GTF2E2 is the exon of SEQ ID NO: 236; the exon of CSMD1 Exon 23 is the exon of SEQ ID NO: 279; PTN exon 4 is the exon of SEQ ID NO: 318; ST14 exon 11 is the exon of SEQ ID NO: 342; THBS1 Exon 9 is the exon of SEQ ID NO: 386; AGRN exon 12 is the exon of SEQ ID NO: 416; PVALB exon 4 is the exon of SEQ ID NO: 442; SLC3A2 exon 2 is the exon of SEQ ID NO: 457; APP exon 14 is the exon of SEQ ID NO: 501; WRN exon 33 is the exon of SEQ ID NO: 562 ; DAAM1 exon 1 is the exon of SEQ ID NO: 606; ASPH exon 22 is the exon of SEQ ID NO: 658; NOTCH2 exon 6 is the exon of SEQ ID NO: 700 CD74 exon 2 is SEQ ID NO: 720 exon; SDC4 exon 2 is SEQ ID NO: 746 exon; CD44 exon 5 is SEQ ID NO: 65 outside Exon; SLC4A4 exon 14 is the exon of SEQ ID NO: 780; SDC4 exon 4 is the exon of SEQ ID NO: 748; ZFAT exon 12 is the exon of SEQ ID NO: 841 exons; DSCAML1 exon 3 is the exon of SEQ ID NO: 872, and NRG1 exons 1, 2, 5 and 6 are the exons of SEQ ID NO: 125, 126, 129 and 130, respectively son. 15. The polynucleotide according to any one of clauses 13 or 14, wherein: - the VAPB exon 1 or allogenic variant thereof is partly located in the NRG1 exon 2 or allogenic variant thereof The 5' end of the portion; - the CADM1 exon 7 or its allele variant portion is located at the 5' end of the NRG1 exon 6 or its allele variant portion; - the CD44 exon 5 or its allelic variant part is located at the 5' end of the NRG1 exon 2 or its allelic variant part; - the SLC3A2 exon 1 or its allelic variant part is located at the NRG1 exon The 5' end of exon 5 or its allele variant part; - the VTCN1 exon 2 or its allele variant part is located at the 5' end of the NRG1 exon 2 or its allele variant part; - the CDH1 exon 11 or its allogenic variant part is located at the 5' end of the NRG1 exon 2 or its allogenic variant part; - the CXADR exon 1 or its allogenic variant part The part is located at the 5' end of the NRG1 exon 2 or its allele variant part; - The GTF2E2 exon 2 or its allele variant part is located at the NRG1 exon 2 or its allele The 5' end of the variant part; - the CSMD1 exon 23 or its allele variant part is located at the 5' end of the NRG1 exon 6 or its allegene variant part; - the PTN exon Exon 4 or its allogenic variant part is located at the 5' end of the NRG1 exon 2 or its allogenic variant part; - the ST14 exon 11 or its allogenic variant part is located at the NRG1 5' of exon 6 or its allogenic variant part; - the THBS1 exon 9 or its allogenic variant part is located 5' of the NRG1 exon 6 or its allogenic variant part - the AGRN exon 12 or its allele variant part is located at the 5' end of the NRG1 exon 6 or its allele variant part; - the PVALB exon 4 or its allele The variant part is located at the 5' end of the NRG1 exon 6 or its allele variant part; - the SCL3A2 exon 2 or its allele variant part is located at the NRG1 exon 6 or its the 5' end of the allogenic variant part; - the APP exon 14 or the allogenic variant part thereof is located at the 5' end of the NRG1 exon 6 or the allogenic variant part of exon 6; - the WRN exon 33 or its allogenic variant part is located at the 5' end of the NRG1 exon 6 or the allogenic variant part of exon 6; - the DAAM1 exon 1 or its The allogenic variant part is located at the 5' end of the NRG1 exon 1 or the allogenic variant part of exon 1; - the ASPH exon 22 or its allogenic variant part is located at the NRG1 Exon 2 or the 5' end of the allele variant part of exon 2; - the NOTCH2 exon 6 or the allele variant part is located at the NRG1 exon 6 or exon 6 the 5' end of the allogenic variant part; - the CD74 exon 2 or allogenic variant part thereof is located at the 5' end of the NRG1 exon 2 or the allogenic variant part of exon 2; - the SDC4 exon 2 or its allogenic variant part is located at the 5' end of the NRG1 exon 2 or exon 2 allogenic variant part; - the CD44 exon 5 or its allogenic variant part The allogenic variant part is located at the 5' end of exon 6 of the NRG1 or the allogenic variant part of exon 6; - the SLC4A4 exon 14 or its allogenic variant part is located at the NRG1 Exon 6 or the 5' end of the allele variant part of exon 6; - the SDC4 exon 4 or the allele variant part is located at the NRG1 exon 2 or exon 2 the 5' end of the allogenic variant part; - the ZFAT exon 12 or its allogenic variant part is located at the 5' end of the NRG1 exon 6 or the allogenic variant part of exon 6; and - the DSCAML1 exon 3 or allogenic variant portion thereof is located 5' to the NRG1 exon 2 or exon 2 allele variant portion. 16. The polynucleotide according to any one of clauses 13 to 15, wherein: - the allele variant of exon 1 of VAPB has at least 85% identity to SEQ ID NO: 17, preferably There is at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of CADM1 exon 7 has at least 85% identity to SEQ ID NO: 39, preferably There is at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 5 of CD44 has at least 85% identity with SEQ ID NO: 65, compared with Preferably at least 90% identity, more preferably at least 95% identity, or preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the SLC3A2 The allele variant of exon 1 has at least 85% identity to SEQ ID NO: 103, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - VTCN1 The allele variant of exon 2 has at least 85% identity to SEQ ID NO: 169, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the The allele variant of CDH1 exon 11 has at least 85% identity to SEQ ID NO: 198, preferably at least 90%, 92%, 94%, 96% or even 98% identity;- The allele variant of exon 2 of NRG1 is at least 85% identical to SEQ ID NO: 126, preferably at least 90%, 92%, 94%, 96% or even 98% identical; - the allele variant of exon 5 of NRG1 is at least 85% identical to SEQ ID NO: 129, preferably at least 90%, 92%, 94%, 96% or even 98% identical - the allele variant of exon 6 of NRG1 is at least 85% identical to SEQ ID NO: 130, preferably at least 90%, 92%, 94%, 96% or even 98% identical - the CXADR exon 1 allele variant has at least 85% identity to SEQ ID NO: 219, preferably at least 90%, 92%, 94%, 96% or even 98% identity Identity; - the allele variant of exon 2 of GTF2E2 has at least 85% identity to SEQ ID NO: 236, preferably at least 90%, 92%, 94%, 96% or even 98% - the allele variant of exon 23 of CSMD1 has at least 85% identity to SEQ ID NO: 279, preferably at least 90%, 92%, 94%, 96% or even 98% % identity; - the PTN exon 4 allele variant has at least 85% identity to SEQ ID NO: 318, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the ST14 exon 11 allele variant has at least 85% identity to SEQ ID NO: 342, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 9 of THBS1 has at least 85% identity to SEQ ID NO: 386, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 12 of AGRN has at least 85% identity to SEQ ID NO: 416, preferably at least 90%, 92%, 94%, 96% % or even 98% identity; - the allele variant of exon 4 of PVALB has at least 85% identity to SEQ ID NO: 442, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 2 of SCL3A2 has at least 85% identity, preferably at least 90%, 92%, 94% identity to SEQ ID NO: 457 , 96% or even 98% identity; - the APP exon 14 allele variant has at least 85% identity to SEQ ID NO: 501, preferably at least 90%, 92%, 94% %, 96% or even 98% identity; - the allele variant of exon 33 of the WRN has at least 85% identity to SEQ ID NO: 562, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 1 of DAAM1 has at least 85% identity, preferably at least 90%, 92% identity to SEQ ID NO: 606 , 94%, 96% or even 98% identity; - the allele variant of exon 1 of NRG1 has at least 85% identity, preferably at least 90%, 92% identity to SEQ ID NO: 125 %, 94%, 96% or even 98% identity; - the ASPH exon 22 allele variant has at least 85% identity to SEQ ID NO: 658, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 6 of NOTCH2 has at least 85% identity, preferably at least 90% identity to SEQ ID NO: 700 , 92%, 94%, 96% or even 98% identity; - the CD74 exon 2 allele variant has at least 85% identity to SEQ ID NO: 720, preferably at least 90% identity %, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 2 of SDC4 has at least 85% identity to SEQ ID NO: 746, preferably at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 5 of CD44 has at least 85% identity to SEQ ID NO: 65, preferably has At least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 14 of SLC4A4 has at least 85% identity to SEQ ID NO: 780, preferably There is at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 4 of SDC4 has at least 85% identity to SEQ ID NO: 748, preferably There is at least 90%, 92%, 94%, 96% or even 98% identity; - the allele variant of exon 12 of the ZFAT has at least 85% identity with SEQ ID NO: 841, compared with Preferably at least 90%, 92%, 94%, 96% or even 98% identity; and - the allele variant of exon 3 of DSCAML1 has at least 85% identity to SEQ ID NO: 872 , preferably at least 90%, 92%, 94%, 96% or even 98% concordance. 17. The polynucleotide according to any one of clauses 13 to 16, wherein: - the fusion of VAPB and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 3, including positions 43 and 44 -the fusion of CADM1 and NRG1 comprises from 2 to about 40 contiguous nucleic acids of SEQ ID NO: 7, including the nucleic acids at positions 53 and 54; -the fusion of CD44 and NRG1 comprises from SEQ ID NO: 2 to about 40 contiguous nucleic acids of 11, including those at positions 52 and 53; - the fusion of SLC3A2 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 15, including those at positions 53 and 54 Nucleic acid; - the fusion of VTCN1 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 166, including nucleic acids at positions 65 and 66; - the fusion of CDH1 and NRG1 comprises nucleic acids from SEQ ID NO: 186 2 to about 40 contiguous nucleic acids, including nucleic acids at positions 119 and 120; - the fusion of CXADR and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 217, including nucleic acids at positions 43 and 44 - the fusion of GTF2E2 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 233, including the nucleic acids at positions 141 and 142; - the fusion of CSMD1 and NRG1 comprises nucleic acids from SEQ ID NO: 255 2 to about 40 contiguous nucleic acids, including the nucleic acids at positions 88 and 89; - the fusion of PTN and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 313, including the nucleic acids at positions 102 and 103; - the fusion of ST14 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 330, including the nucleic acids at positions 95 and 96; - the fusion of THBS1 and NRG1 comprises 2 nucleic acids from SEQ ID NO: 376 to about 40 contiguous nucleic acids, including nucleic acids at positions 56 and 57; - the fusion of AGRN and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 403, including nucleic acids at positions 106 and 107; - The fusion of PVALB and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 437, including the nucleic acids at positions 102 and 103; - the fusion of SLC3A2 and NRG1 comprises 2 to about 40 consecutive nucleic acids from SEQ ID NO: 454 About 40 contiguous nucleic acids, including nucleic acids at positions 93 and 94; - the fusion of APP and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 486, including nucleic acids at positions 54 and 55; - the The fusion of WRN and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 528, including the nucleic acids at positions 96 and 97; - the fusion of DAAM1 and NRG1 comprises 2 to about 40 consecutive nucleic acids from SEQ ID NO: 605 40 contiguous nucleic acids, including those at positions 75 and 76; - the fusion of ASPH and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 635, including those at positions 75 and 76; - the NOTCH2 The fusion with NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 693, including the nucleic acids at positions 75 and 76; - the fusion of CD74 and NRG1 comprises 2 to about 40 consecutive nucleic acids from SEQ ID NO: 717 contiguous nucleic acids, including nucleic acids at positions 75 and 76; - the fusion of SDC4 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 743, including nucleic acids at positions 75 and 76; - the CD44 and The fusion of NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 761, including the nucleic acids at positions 75 and 76; - the fusion of SLC4A4 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 765 Contiguous nucleic acids, including nucleic acids at positions 75 and 76; - the fusion of SDC4 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 824, including nucleic acids at positions 75 and 76; - the ZFAT and NRG1 The fusion comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 828, including nucleic acids at positions 75 and 76; and - the fusion of DSCAML1 and NRG1 comprises 2 to about 40 contiguous nucleic acids from SEQ ID NO: 868 Contiguous nucleic acids, including nucleic acids at positions 75 and 76. 18. The polynucleotide according to any one of clauses 13 to 17, wherein: - the fusion of VAPB and NRG1 comprises SEQ ID NO: 3 or an allele variant thereof; - the fusion of CAD1 and NRG1 comprises SEQ ID NO: 7 or its allogeneic variant; - the fusion of CD44 and NRG1 comprising SEQ ID NO: 11 or its allogeneic variant; - the SLC3A2 and NRG1 fusion comprising SEQ ID NO: 15 or its allegiance a genetic variant; - the fusion of VTCN1 and NRG1 comprising SEQ ID NO: 166 or an allogenic variant thereof; - the fusion of CDH1 and NRG1 comprising SEQ ID NO: 186 or an allogenic variant thereof; - the CXADR and The fusion of NRG1 comprises SEQ ID NO: 217 or its allogeneic variant; - the fusion of GTF2E2 and NRG1 comprises SEQ ID NO: 233 or its allogeneic variant; - the fusion of CSMD1 and NRG1 comprises SEQ ID NO : 255 or its allogeneic variant; - the fusion of PTN and NRG1 comprising SEQ ID NO: 313 or its allogeneic variant; - the ST14 and NRG1 fusion comprising SEQ ID NO: 330 or its allogeneic variant - the fusion of THBS1 and NRG1 comprising SEQ ID NO: 376 or an allogeneic variant thereof; - the fusion of AGRN and NRG1 comprising SEQ ID NO: 403 or an allogeneic variant thereof; - the fusion of PVALB and NRG1 - the fusion of SLC3A2 and NRG1 comprises SEQ ID NO: 454 or an allogenic variant thereof; - the fusion of APP and NRG1 comprises SEQ ID NO: 486 or an allele variant thereof; - the fusion of WRN and NRG1 comprising SEQ ID NO: 528 or an allele variant thereof; - the fusion of DAAM1 and NRG1 comprising SEQ ID NO: 605 or an allele variant thereof; - the The fusion of ASPH and NRG1 comprises SEQ ID NO: 635 or its allogeneic variant; - the fusion of NOTCH2 and NRG1 comprises SEQ ID NO: 693 or its allogeneic variant; - the fusion of CD74 and NRG1 comprises SEQ ID NO: 693 or its allogeneic variant; ID NO: 717 or its allogeneic variant; - the fusion of SDC4 and NRG1 comprising SEQ ID NO: 743 or its allogeneic variant; - the CD44 and NRG1 fusion comprising SEQ ID NO: 761 or its allogene Variant; - the fusion of SLC4A4 and NRG1 comprises SEQ ID NO: 765 or an allogenic variant thereof; - the fusion of SDC4 and NRG1 comprises SEQ ID NO: 824 or an allogenic variant thereof; - the ZFAT and NRG1 the fusion of which comprises SEQ ID NO: 828 or an allogeneic variant thereof; and - the fusion of DSCAML1 and NRG1 comprises SEQ ID NO: 868 or an allogeneic variant thereof. 19. The polynucleotide according to any one of clauses 13 to 18, wherein: - the VAPB exon 1 portion is or comprises SEQ ID NO: 1 or an allele variant of SEQ ID NO: 1; - The CADM1 exon 7 portion is or comprises SEQ ID NO: 5 or an allele variant of SEQ ID NO: 5; - the CD44 exon 5 portion is or comprises SEQ ID NO: 9 or SEQ ID NO: The allele variant of 9; - the SLC3A2 exon 1 portion is or comprises SEQ ID NO: 13 or the allele variant of SEQ ID NO: 13; - the VTCN1 exon 2 portion is or comprises SEQ ID NO: 164 or the allele variant of SEQ ID NO: 164; - the CDH1 exon 11 portion is or comprises SEQ ID NO: 184 or the allele variant of SEQ ID NO: 184; - the CXADR exon the exon 1 portion is or comprises SEQ ID NO: 219 or an allele variant of SEQ ID NO: 219; - the GTF2E2 exon 2 portion is or comprises SEQ ID NO: 236 or an allele variant of SEQ ID NO: 236 - the CXADR exon 23 part is or comprises SEQ ID NO: 279 or an allele variant of SEQ ID NO: 279; - the PTN exon 4 part is or comprises SEQ ID NO: 318 or SEQ ID NO: 279 an allele variant of ID NO: 318; - the ST14 exon 11 portion is or comprises SEQ ID NO: 342 or an allele variant of SEQ ID NO: 342; - the THBS1 exon 9 portion is or an allele variant comprising SEQ ID NO: 385 or SEQ ID NO: 386; - the AGRN exon 12 portion is or comprises an allele variant of SEQ ID NO: 416 or SEQ ID NO: 416; - the PVALB the part of exon 4 is or comprises SEQ ID NO: 442 or an allele variant of SEQ ID NO: 442; - the part of exon 2 of SLC3A2 is or comprises SEQ ID NO: 457 or an allele variant of SEQ ID NO: 457 an allele variant; - the part of exon 2 of NRG1 is or comprises SEQ ID NO: 165 or an allele variant of SEQ ID NO: 165; - the part of exon 5 of NRG1 is or comprises SEQ ID NO: 14 or an allele variant of SEQ ID NO: 14; - the NRG1 exon 6 portion is or comprises SEQ ID NO: 6 or an allele variant thereof; - the APP exon 14 portion is or comprises SEQ ID NO: 484 or an allele variant thereof; - the WRN exon 33 portion is or comprises SEQ ID NO: 526 or an allele variant thereof; - the DAAM1 exon 1 portion is or comprises SEQ ID NO : 603 or an allogenic variant thereof; - the ASPH exon 22 portion is or comprises SEQ ID NO: 633 or an allele variant thereof; - the NOTCH2 exon 6 portion is or comprises SEQ ID NO: 691 or an allogenic variant thereof; - the CD74 exon 2 portion is or comprises SEQ ID NO: 715 or an allele variant thereof; - the SDC4 exon 2 portion is or comprises SEQ ID NO: 741 or an allogenic variant thereof; an allele variant; - the CD44 exon 5 portion is or comprises SEQ ID NO: 759 or an allele variant thereof; - the SLC4A4 exon 14 portion is or comprises SEQ ID NO: 763 or an allele variant thereof Variant; - the SDC4 exon 4 portion is or comprises SEQ ID NO: 822 or an allele variant thereof; - the ZFAT exon 12 portion is or comprises SEQ ID NO: 826 or an allele variant thereof ; - the portion of exon 3 of DSCAML1 is or comprises SEQ ID NO: 866 or an allegenic variant thereof; and - the portion of exon 1 of NRG1 is or comprises SEQ ID NO: 604 or an allegenic variant thereof. 20. The polynucleotide according to any one of clauses 13 to 18, wherein: - the fusion of VAPB and NRG1 comprises a fusion junction between exon 1 of VAPB and exon 2 of NRG1, compared to Preferably the junction between the nucleic acid at position 43 of VAPB of SEQ ID NO: 3 and the nucleic acid at position 44 of NRG1; - the fusion of CADM1 and NRG1 comprises a junction between exon 7 of CADM1 and exon 6 of NRG1 A fusion junction, preferably the junction between the nucleic acid at position 53 of CADM1 of SEQ ID NO: 7 and the nucleic acid at position 54 of NRG1; - the fusion of CD44 and NRG1 comprises a CD44 exon 5 and NRG1 exon The fusion junction between exon 2, preferably the junction between the nucleic acid at position 52 of CD44 of SEQ ID NO: 11 and the nucleic acid at position 53 of NRG1; The fusion junction with exon 5 of NRG1, preferably the junction between the nucleic acid at position 53 of SLC3A2 of SEQ ID NO: 15 and the nucleic acid at position 54 of NRG1; - the fusion of VTCN1 and NRG1 comprises a VTCN1 The fusion junction between exon 2 and NRG1 exon 2, preferably the junction between the nucleic acid at position 65 of VTCN1 of SEQ ID NO: 166 and the nucleic acid at position 66 of NRG1; - between CDH1 and NRG1 The fusion comprises a fusion junction between CDH1 exon 11 and NRG1 exon 2, preferably the junction between the nucleic acid at CDH1 position 119 of SEQ ID NO: 186 and the nucleic acid at NRG1 position 120; - the fusion of CXADR and NRG1 comprises a fusion junction between exon 1 of CXADR and exon 2 of NRG1, preferably between the nucleic acid at position 43 of CXADR of SEQ ID NO: 217 and the nucleic acid at position 44 of NRG1 - the fusion of GTF2E2 and NRG1 comprises a fusion junction between exon 2 of GTF2E2 and exon 2 of NRG1, preferably the nucleic acid at position 141 of GTF2E2 of SEQ ID NO: 233 and NRG1 The junction between the nucleic acid at position 142; - the fusion of CSMD1 and NRG1 comprises a fusion junction between CSMD1 exon 23 and NRG1 exon 6, preferably the CSMD1 position of SEQ ID NO: 255 The junction between the nucleic acid at 88 and the nucleic acid at position 89 of NRG1; - the fusion of PTN and NRG1 comprises a fusion junction between PTN exon 4 and NRG1 exon 2, preferably SEQ ID NO : the junction between the nucleic acid at PTN position 102 of 313 and the nucleic acid at position 103 of NRG1; - the fusion of ST14 and NRG1 comprises a fusion junction between ST14 exon 11 and NRG1 exon 6, compared Preferably the junction between the nucleic acid at position 95 of ST14 of SEQ ID NO: 330 and the nucleic acid at position 96 of NRG1; - the fusion of THBS1 and NRG1 comprises a junction between exon 9 of THBS1 and exon 6 of NRG1 The fusion junction, preferably the junction between the nucleic acid at position 56 of THBS1 of SEQ ID NO: 376 and the nucleic acid at position 57 of NRG1; - the fusion of AGRN and NRG1 comprises an exon 12 of AGRN and an exon of NRG1 The fusion junction between exon 6, preferably the junction between the nucleic acid at AGRN position 106 of SEQ ID NO: 403 and the nucleic acid at position 107 of NRG1; - the fusion of PVALB and NRG1 comprises a PVALB exon 4 The fusion junction with exon 6 of NRG1, preferably the junction between the nucleic acid at PVALB position 102 of SEQ ID NO: 437 and the nucleic acid at position 103 of NRG1; The fusion junction between exon 2 and exon 6 of NRG1, preferably the junction between the nucleic acid at position 93 of SLC3A2 of SEQ ID NO: 454 and the nucleic acid at position 94 of NRG1; - between APP and NRG1 The fusion comprises a fusion junction between APP exon 14 and NRG1 exon 6, preferably the junction between the nucleic acid at APP position 54 of SEQ ID NO: 486 and the nucleic acid at NRG1 position 55; - the fusion of WRN and NRG1 comprises a fusion junction between WRN exon 33 and NRG1 exon 6, preferably between the nucleic acid at WRN position 96 of SEQ ID NO: 528 and the nucleic acid at position 97 of NRG1 - the fusion of DAAM1 and NRG1 comprises a fusion junction between exon 1 of DAAM1 and exon 1 of NRG1, preferably nucleic acid 75 at the DAAM1 position of SEQ ID NO: 605 and NRG1 The junction between nucleic acid 76 at position; - the fusion of ASPH and NRG1 comprises a fusion junction between exon 22 of ASPH and exon 2 of NRG1, preferably the ASPH position of SEQ ID NO: 635 The junction between the nucleic acid at 75 and the nucleic acid at position 76 of NRG1; - the fusion of NOTCH2 and NRG1 comprises a fusion junction between NOTCH2 exon 6 and NRG1 exon 6, preferably SEQ ID NO : the junction between the nucleic acid at position 75 of NOTCH2 of 693 and the nucleic acid at position 76 of NRG1; - the fusion of CD74 and NRG1 comprises a fusion junction between exon 2 of CD74 and exon 2 of NRG1 Preferably the junction between the nucleic acid at position 75 of CD74 of SEQ ID NO: 717 and the nucleic acid at position 76 of NRG1; A fusion junction, preferably the junction between the nucleic acid at position 75 of SDC4 of SEQ ID NO: 743 and the nucleic acid at position 76 of NRG1; - the fusion of CD44 and NRG1 comprises a CD44 exon 5 and NRG1 exon The fusion junction between sub6, preferably the junction between the nucleic acid of VAPB position 75 of SEQ ID NO: 761 and the nucleic acid of NRG1 position 76; The fusion junction with exon 6 of NRG1, preferably the junction between the nucleic acid at position 75 of SLC4A4 of SEQ ID NO: 765 and the nucleic acid at position 76 of NRG1; - the fusion of SDC4 and NRG1 comprises a junction in SDC4 The fusion junction between exon 4 and NRG1 exon 2, preferably the junction between the nucleic acid at position 75 of SDC4 of SEQ ID NO: 824 and the nucleic acid at position 76 of NRG1; - between ZFAT and NRG1 The fusion comprises a fusion junction between ZFAT exon 12 and NRG1 exon 6, preferably the junction between the nucleic acid at ZFAT position 75 of SEQ ID NO: 828 and the nucleic acid at position 76 of NRG1; and - the fusion of DSCAML1 and NRG1 comprises a fusion junction between exon 3 of DSCAML1 and exon 2 of NRG1, preferably the nucleic acid at position 75 of DSCAML1 of SEQ ID NO: 868 and the nucleic acid at position 76 of NRG1 the junction between. 21. The polynucleotide according to any one of the preceding items, wherein the polynucleotide is isolated or purified. 22. The polynucleotide of any one of the preceding items, wherein any one of the fusions is an in-frame fusion. 23. The polynucleotide according to any one of the preceding items, wherein the polynucleotide is a mammalian polynucleotide, preferably a human polynucleotide. 24. A polypeptide fusion encoded by the polynucleotide of any one of the preceding items. 25. A vector comprising the polynucleotide according to any one of items 1 to 23. 26. A recombinant host cell comprising the polynucleotide according to any one of items 1 to 23 or the vector according to item 25. 27. A method for producing a polypeptide fusion as in item 24, comprising maintaining the host cell as in item 26 under conditions suitable for the expression of a polynucleotide contained in the host cell, thereby expressing the polynucleotide and producing a The polypeptide fusion, which is subsequently isolated or purified. 28. A method for producing a recombinant host cell, comprising introducing the vector according to item 25 into a host cell. 29. A detection assay comprising a nucleic acid probe, a primer or a pair of primers for detecting the presence of a polynucleotide fusion according to any one of items 1 to 23. 30. A nucleic acid probe, primer or primer pair for detecting the polynucleotide fusion of any one of items 1 to 23. 31. The nucleic acid probe, primer or primer pair according to item 30, which is 10 to 40 nucleotides in length. 32. The nucleic acid probe, primer or primer pair according to item 30 or 31, wherein the detected fusion comprises: - a fusion of VAPB and NRG1 comprising or consisting of SEQ ID NO: 3, and preferably Preferably comprising the nucleic acid at position 43 or 44; - a fusion of CADM1 and NRG1 comprising or consisting of SEQ ID NO: 7, and preferably comprising a nucleic acid at position 53 or 54; - a fusion of CD44 and NRG1 comprising A nucleic acid having or consisting of SEQ ID NO: 11, and preferably comprising position 52 or 53; - a fusion of SLC3A2 and NRG1, comprising or consisting of SEQ ID NO: 15, and preferably comprising position 53 or the nucleic acid of 54; - the fusion of VTCN1 and NRG1, comprising or consisting of SEQ ID NO: 166, and preferably comprising the nucleic acid of position 65 or 66; - the fusion of CDH1 and NRG1, comprising SEQ ID NO : 186 or consisting of it, and preferably comprising the nucleic acid of position 119 or 120; - a fusion of CXADR and NRG1 comprising or consisting of SEQ ID NO: 217, and preferably comprising the nucleic acid of position 43 or 44 - a fusion of GTF2E2 and NRG1 comprising or consisting of SEQ ID NO: 233, and preferably comprising the nucleic acid at position 141 or 142; - a fusion of CSMD1 and NRG1 comprising SEQ ID NO: 255 or consisting of It consists of, and preferably includes the nucleic acid at position 88 or 89; - a fusion of PTN and NRG1, comprising or consisting of SEQ ID NO: 313, and preferably including the nucleic acid at position 102 or 103; - ST14 and A fusion of NRG1 comprising or consisting of SEQ ID NO: 330, and preferably comprising the nucleic acid at position 95 or 96; - a fusion of THBS1 and NRG1 comprising or consisting of SEQ ID NO: 376, and Preferably comprising the nucleic acid at position 56 or 57; - a fusion of AGRN and NRG1 comprising or consisting of SEQ ID NO: 403, and preferably comprising the nucleic acid at position 106 or 107; - a fusion of PVALB and NRG1 , comprising or consisting of SEQ ID NO: 437, and preferably comprising the nucleic acid at position 102 or 103; - a fusion of SLC3A2 and NRG1, comprising or consisting of SEQ ID NO: 454, and preferably comprising The nucleic acid at position 93 or 94; - a fusion of APP and NRG1 comprising or consisting of SEQ ID NO: 486, and preferably comprising the nucleic acid at position 54 or 55; - a fusion of WRN and NRG1 comprising SEQ ID NO: 486 or consisting of ID NO: 528, and preferably comprising a nucleic acid at position 96 or 97; - a fusion of DAAM1 and NRG1 comprising or consisting of SEQ ID NO: 605, and preferably comprising position 75 or 76 -A fusion of ASPH and NRG1 comprising or consisting of SEQ ID NO: 635, and preferably comprising the nucleic acid at position 75 or 76; -A fusion of NOTCH2 and NRG1 comprising SEQ ID NO: 693 or consisting of, and preferably comprising the nucleic acid at position 75 or 76; - a fusion of CD74 and NRG1 comprising or consisting of SEQ ID NO: 717, and preferably comprising the nucleic acid at position 75 or 76; - A fusion of SDC4 and NRG1 comprising or consisting of SEQ ID NO: 743, and preferably comprising the nucleic acid at position 75 or 76; - a fusion of CD44 and NRG1 comprising or consisting of SEQ ID NO: 761 , and preferably comprising the nucleic acid at position 75 or 76; - a fusion of SLC4A4 and NRG1 comprising or consisting of SEQ ID NO: 765, and preferably comprising the nucleic acid at position 75 or 76; - SDC4 and NRG1 A fusion comprising or consisting of SEQ ID NO: 824, and preferably comprising the nucleic acid at position 75 or 76; - a fusion of ZFAT and NRG1 comprising or consisting of SEQ ID NO: 828, and preferably preferably comprising the nucleic acid at position 75 or 76; and - a fusion of DSCAML1 and NRG1 comprising or consisting of SEQ ID NO: 868, and preferably comprising the nucleic acid at position 75 or 76. 33. The nucleic acid probe, primer or primer pair according to any one of clauses 30 to 32, wherein: - the probe, primer or primer pair used to detect the fusion of VAPB and NRG1 is specifically compatible with a A sequence consisting of exon 1 or a sequence located 5' of exon 1 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 2 of NRG1 or located 3' of exon 2 The sequence hybridizes, or has 95% or more complementary sequence identity to these sequences; - The probe, primer or primer pair used to detect the fusion of CADM1 and NRG1 is specifically related to a CADM1 expression A sequence consisting of exon 7 or a sequence located 5' of exon 7 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or a sequence located 3' of exon 6 sequence hybridization, or has 95% or higher complementary sequence identity to such sequences; - the probe, primer or primer pair used to detect the fusion of CD44 and NRG1 is specific for a CD44 exon A sequence consisting of exon 5 or a sequence located 5' to exon 5 hybridizes, and/or specifically to a sequence consisting of exon 2 of NRG1 or a sequence located 3' to exon 2 hybridize, or have 95% or greater complementary sequence identity to such sequences; - the probe, primer or primer pair for detecting the fusion of transcript 6 of SLC3A2 with NRG1 is specifically compatible with a sequence produced by SLC3A2 A sequence consisting of exon 1 or a sequence located 5' of exon 1 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 5 of NRG1 or located 3' of exon 5 The sequence hybridizes thereto, or has 95% or higher complementary sequence identity with such sequences; - The probe, primer or primer pair used to detect the fusion of VTCN1 and NRG1 is specifically compatible with a VTCN1 fusion A sequence consisting of exon 2 or a sequence located 5' of exon 2 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 2 of NRG1 or located 3' of exon 2 or have 95% or greater complementary sequence identity to such sequences; - the probe, primer or primer pair used to detect the fusion of CDH1 and NRG1 is specific for a CDH1-expressed A sequence consisting of exon 11 or a sequence located at 5' of exon 11 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 2 of NRG1 or located at 3' of exon 2 sequence hybridization, or has 95% or higher complementary sequence identity with such sequences; - the probe, primer or primer pair used to detect the fusion of CXADR and NRG1 is specific to a CXADR exon 1, or a sequence located 5' to exon 1, and/or specifically hybridizes to a sequence consisting of NRG1 exon 2, or a sequence located 3' to exon 2 hybridizes, or has 95% or greater complementary sequence identity to such sequences; - the probe, primer or primer pair used to detect the fusion of GTF2E2 and NRG1 is specific to a GTF2E2 exon 2 A sequence consisting of or located 5' to exon 2 hybridizes, and/or specifically hybridizes to a sequence consisting of NRG1 exon 2 or a sequence located 3' to exon 2 , or have 95% or greater complementary sequence identity to such sequences; - the probe, primer or primer pair for detecting the fusion of CSMD1 and NRG1 is specific to a CSMD1 exon 23 A sequence consisting of or a sequence located 5' to exon 23 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or a sequence located 3' to exon 6, or a complementary sequence identity of 95% or greater to such sequences; - the probe, primer or primer pair for detecting the fusion of PTN and NRG1 is specifically bound to a PTN exon 4 or a sequence located 5' to exon 4, and/or specifically hybridizes to a sequence consisting of exon 2 of NRG1 or a sequence located 3' to exon 2, or have 95% or greater complementary sequence identity to the sequences; - the probe, primer or primer pair used to detect the fusion of ST14 and NRG1 is specific to a sequence consisting of exon 11 of ST14 sequence or a sequence located 5' to exon 11, and/or specifically hybridizes to a sequence consisting of NRG1 exon 6 or a sequence located 3' to exon 6, or Complementary sequence identity of 95% or higher to the sequences; - The probe, primer or primer pair for detecting the fusion of THBS1 and NRG1 is specific to a sequence consisting of exon 9 of THBS1 or a sequence at the 5' of exon 9 hybridizes, and/or specifically hybridizes with a sequence consisting of exon 6 of NRG1 or at a sequence at 3' of exon 6, or with The sequences have 95% or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of AGRN and NRG1 is specific to a sequence consisting of exon 12 of AGRN or A sequence located 5' of exon 12 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or to a sequence located 3' of exon 6, or to the The same sequence has 95% or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of PVALB and NRG1 is specific to a sequence consisting of exon 4 of PVALB or located in The sequence at the 5' of exon 4 hybridizes, and/or specifically hybridizes with a sequence consisting of exon 6 of NRG1 or at the 3' of exon 6, or with such The sequence has 95% or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of the transcript 3 of SLC3A2 and NRG1 is specific to an exon 2 composed of SLC3A2 sequence or a sequence located 5' to exon 2, and/or specifically hybridizes to a sequence consisting of NRG1 exon 6 or a sequence located 3' to exon 6, or Complementary sequence identity of 95% or higher to the sequences; - the probe, primer or primer pair used to detect the fusion of APP and NRG1 is specific to a sequence consisting of exon 14 of APP or a sequence located 5' to exon 14, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or to a sequence located 3' to exon 6, or to The sequences have 95% or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of WRN and NRG1 is specific to a sequence consisting of exon 33 of WRN or A sequence located 5' to exon 33 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or to a sequence located 3' to exon 6, or to the The sequence has 95% or higher complementary sequence identity; - The probe, primer or primer pair used to detect the fusion of DAAM1 and NRG1 is specific to a sequence consisting of exon 1 of DAAM1 or located in The sequence at the 5' of exon 1 hybridizes, and/or specifically hybridizes with a sequence consisting of exon 1 of NRG1 or at the 3' of exon 1, or with such The sequence has 95% or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of ASPH and NRG1 is specific to a sequence consisting of exon 22 of ASPH or located in exon The sequence at the 5' of exon 22 hybridizes, and/or specifically hybridizes to a sequence consisting of exon 2 of NRG1 or to a sequence located at the 3' of exon 2, or to these sequences Complementary sequence identity of 95% or higher; - The probe, primer or primer pair used to detect the fusion of NOTCH2 and NRG1 is specific to a sequence consisting of exon 6 of NOTCH2 or located in exon The sequence at the 5' place of exon 6 hybridizes, and/or specifically hybridizes with a sequence consisting of exon 6 of NRG1 or at the sequence at the 3' place of exon 6, or has a sequence with these sequences Complementary sequence identity of 95% or higher; - The probe, primer or primer pair used to detect the fusion of CD74 and NRG1 is specific to a sequence consisting of exon 2 of CD74 or located in exon 2, and/or specifically hybridize to a sequence consisting of exon 2 of NRG1 or to a sequence located at 3' of exon 2, or 95% to these sequences % or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of SDC4 and NRG1 is specific to a sequence consisting of exon 2 of SDC4 or located in exon 2 and/or specifically hybridize to a sequence consisting of exon 2 of NRG1 or a sequence located 3' to exon 2, or 95% of these sequences or higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of CD44 and NRG1 is specific to a sequence consisting of or located in exon 5 of CD44 The sequence at the 5' hybridizes, and/or specifically hybridizes to a sequence consisting of exon 6 of NRG1 or to a sequence at the 3' of exon 6, or 95% or Higher complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of SLC4A4 and NRG1 is specific to a sequence consisting of exon 14 of SLC4A4 or located at exon 5 of exon 14 ', and/or specifically hybridize to a sequence consisting of exon 6 of NRG1 or a sequence located 3' to exon 6, or to 95% or more of these sequences High complementary sequence identity; - The probe, primer or primer pair used to detect the fusion of SDC4 and NRG1 is specific to a sequence consisting of exon 4 of SDC4 or located 5' of exon 4 and/or specifically hybridize to a sequence consisting of exon 2 of NRG1 or a sequence located 3' to exon 2, or to 95% or more of these sequences Complementary sequence identity; - the probe, primer or primer pair used to detect the fusion of ZFAT and NRG1 is specific to a sequence consisting of exon 12 of ZFAT or located at the 5' position of exon 12 and/or specifically hybridize to a sequence consisting of exon 6 of NRG1 or a sequence located 3' to exon 6, or to have 95% or more of these sequences Complementary sequence identity; or - the probe, primer or primer pair used to detect the fusion of DSCAML1 and NRG1 is specific to a sequence consisting of exon 3 of DSCAML1 or located at the 5' position of exon 3 and/or specifically hybridize to a sequence consisting of exon 2 of NRG1 or a sequence located 3' to exon 2, or to a sequence that is 95% or higher Complementary sequence identity. 34. The nucleic acid probe, primer or primer pair according to item 33, wherein: - exon 1 from VAPB comprises or consists of SEQ ID NO: 17 or its allele variant; - is from outside CADM1 Exon 7 comprises or consists of SEQ ID NO: 39 or an allogenic variant thereof; - exon 5 from CD44 comprises or consists of SEQ ID NO: 65 or an allogenic variant thereof; - Exon 1 from SLC3A2 comprising or consisting of SEQ ID NO: 103 or an allogenic variant thereof; - exon 2 from VTCN1 comprising SEQ ID NO: 169 or an allogenic variant thereof, or consisting of It consists of; - from exon 11 of CDH1 comprising or consisting of SEQ ID NO: 198 or an allegenic variant thereof; - from CXADR exon 1 comprising SEQ ID NO: 219 or an allegenic variant thereof , or consists of; - from GTF2E2 exon 2 comprising SEQ ID NO: 236 or an allele variant thereof; - from CSMD1 exon 23 comprising SEQ ID NO: 279 or its An allele variant, or consisting of it; - from PTN exon 4 comprising SEQ ID NO: 318 or an allele variant thereof; - from ST14 exon 11 comprising SEQ ID NO : 342 or its allogenic variant, or consists of it; - from THBS1 exon 9 comprising or consisting of SEQ ID NO: 386 or its allele variant; - from AGRN exon 12 comprising, or consisting of, SEQ ID NO: 416 or an allogenic variant thereof; - from PVALB exon 4 comprising, or consisting of SEQ ID NO: 442 or an allogenic variant thereof; - from SLC3A2 Exon 2 comprises or consists of SEQ ID NO: 457 or an allogenic variant thereof; - from APP exon 14 comprises or consists of SEQ ID NO: 501 or an allogenic variant thereof; - from WRN exon 33 comprising or consisting of SEQ ID NO: 562 or an allogenic variant thereof; - from DAAM1 exon 1 comprising SEQ ID NO: 606 or an allogenic variant thereof, or Consists of; - from ASPH exon 22 comprising SEQ ID NO: 658 or an allegenic variant thereof; - from NOTCH2 exon 6 comprising SEQ ID NO: 700 or an allegenic variant thereof - from CD74 exon 2 comprising SEQ ID NO: 720 or an allele variant thereof, or consisting of it; - from SDC4 exon 2 comprising SEQ ID NO: 746 or An allele variant thereof, or consisting of it; - from exon 5 of CD44 comprising SEQ ID NO: 65 or an allele variant thereof; - from SLC4A4 exon 14 comprising SEQ ID NO: 780 or its allogenic variant, or consists of it; - from SDC4 exon 4 comprising, or consisting of SEQ ID NO: 748 or its allele variant; - from the ZFAT exon 12 comprises or consists of SEQ ID NO: 841 or its allogeneic variant; - from DSCAML1 exon 3 comprises or consists of SEQ ID NO: 872 or its allogeneic variant; and - from NRG1 exons 1, 2, 5 and 6 comprise or consist of SEQ ID NO: 125, 126, 129 and 130, or allelic variants thereof, respectively. 35. The nucleic acid probe, primer or primer pair according to item 33, wherein: - the probe, primer or primer pair used to detect the fusion of VAPB and NRG1 is specifically associated with a sequence consisting of SEQ ID NO: 17 Sequence hybridization or allele variant hybridization thereof, and/or hybridization specifically to a sequence consisting of SEQ ID NO: 153, or having 95% or greater complementary sequence identity to such sequences; - with The probe, primer or pair of primers to detect the fusion of CADM1 and NRG1 is specifically hybridized with a sequence consisting of SEQ ID NO: 57 or its allele variant hybridization, and/or specifically hybridized with a sequence composed of SEQ ID NO: 57 The sequence consisting of ID NO: 155 hybridizes, or has 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair used to detect the fusion of CD44 and NRG1 is specifically compatible with A sequence consisting of SEQ ID NO: 99 hybridizes or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 153, or 95% or higher complementary sequence identity; - the probe, primer or primer pair for detecting the fusion of SLC3A2 and NRG1 specifically hybridizes to a sequence consisting of SEQ ID NO: 103 or its allele variants, And/or specifically hybridize to a sequence consisting of SEQ ID NO: 157, or have 95% or higher complementary sequence identity with these sequences; - a probe for detecting the fusion of VTCN1 and NRG1 , the primer or primer pair specifically hybridizes to a sequence consisting of SEQ ID NO: 181 or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 153, Or there is 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair for detecting the fusion of CDH1 and NRG1 is specifically related to a sequence consisting of SEQ ID NO: 213 Sequence hybridization or allele variant hybridization thereof, and/or hybridization specifically to a sequence consisting of SEQ ID NO: 153, or having 95% or greater complementary sequence identity to such sequences; - with The probe, primer or pair of primers to detect the fusion of CXADR and NRG1 is specifically hybridized with a sequence consisting of SEQ ID NO: 219 or its allele variant hybridization, and/or specifically hybridized with a sequence composed of SEQ ID NO: 219 The sequence consisting of ID NO: 153 hybridizes, or has 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair used to detect the fusion of GTF2E2 and NRG1 is specifically compatible with A sequence consisting of SEQ ID NO: 252 hybridizes or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 153, or 95% or higher complementary sequence identity; - the probe, primer or primer pair for detecting the fusion of CSMD1 and NRG1 specifically hybridizes to a sequence consisting of SEQ ID NO: 309 or its allele variants, and/or specifically hybridize to a sequence consisting of SEQ ID NO: 155, or have 95% or higher complementary sequence identity with such sequences; - a probe for detecting the fusion of PTN and NRG1 , the primer or primer pair specifically hybridizes to a sequence consisting of SEQ ID NO: 326 or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 153, Or there is 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair for detecting the fusion of ST14 and NRG1 is specifically related to a sequence consisting of SEQ ID NO: 372 Sequence hybridization or allogenic variant hybridization thereof, and/or hybridization specifically to a sequence consisting of SEQ ID NO: 155, or having 95% or greater complementary sequence identity to such sequences; - with The probe, primer or pair of primers to detect the fusion of THBS1 and NRG1 is specifically hybridized to a sequence composed of SEQ ID NO: 399 or its allele variant hybridization, and/or specifically hybridized to a sequence composed of SEQ ID NO: 399 The sequence consisting of ID NO: 155 hybridizes, or has 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair used to detect the fusion of AGRN and NRG1 is specifically compatible with A sequence consisting of SEQ ID NO: 433 hybridizes or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 155, or 95% or higher complementary sequence identity; - the probe, primer or pair of primers for detecting the fusion of PVALB and NRG1 specifically hybridizes to a sequence consisting of SEQ ID NO: 450 or its allele variants, and/or specifically hybridize to a sequence consisting of SEQ ID NO: 155, or have 95% or higher complementary sequence identity with such sequences; - a probe for detecting the fusion of SLC3A2 and NRG1 , the primer or primer pair specifically hybridizes to a sequence consisting of SEQ ID NO: 482 or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 155, Or there is 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair for detecting the fusion of APP and NRG1 is specifically related to a sequence consisting of SEQ ID NO: 524 Sequence hybridization or allogenic variant hybridization thereof, and/or hybridization specifically to a sequence consisting of SEQ ID NO: 155, or having 95% or greater complementary sequence identity to such sequences; - with The probe, primer or pair of primers to detect the fusion of WRN and NRG1 is specifically hybridized to a sequence composed of SEQ ID NO: 601 or its allele variant hybridization, and/or specifically hybridized to a sequence composed of SEQ ID NO: 601 The sequence consisting of ID NO: 155 hybridizes, or has 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair used to detect the fusion of DAAM1 and NRG1 is specifically compatible with A sequence consisting of SEQ ID NO: 606 hybridizes or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 138, or 95% or higher complementary sequence identity; - probes, primers or primer pairs for detecting the fusion of ASPH and NRG1 specifically hybridize to a sequence consisting of SEQ ID NO: 689 or its allele variants, and/or specifically hybridize to a sequence consisting of SEQ ID NO: 153, or have 95% or higher complementary sequence identity with such sequences; - a probe for detecting the fusion of NOTCH2 and NRG1 , the primer or primer pair specifically hybridizes to a sequence consisting of SEQ ID NO: 713 or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 155, Or there is 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair for detecting the fusion of CD74 and NRG1 is specifically related to a sequence consisting of SEQ ID NO: 739 Sequence hybridization or allele variant hybridization thereof, and/or hybridization specifically to a sequence consisting of SEQ ID NO: 153, or having 95% or greater complementary sequence identity to such sequences; - with The probe, primer or pair of primers to detect the fusion of SDC4 and NRG1 is specifically hybridized to a sequence consisting of SEQ ID NO: 757 or its allele variant hybridization, and/or specifically hybridized to a sequence composed of SEQ ID NO: 757 The sequence consisting of ID NO: 153 hybridizes, or has 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair used to detect the fusion of CD44 and NRG1 is specifically compatible with A sequence consisting of SEQ ID NO: 99 hybridizes or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 155, or 95% or higher complementary sequence identity; - the probe, primer or primer pair for detecting the fusion of SLC4A4 and NRG1 specifically hybridizes to a sequence consisting of SEQ ID NO: 820 or its allele variants, and/or specifically hybridize to a sequence consisting of SEQ ID NO: 155, or have 95% or higher complementary sequence identity with such sequences; - a probe for detecting the fusion of SDC4 and NRG1 , the primer or primer pair specifically hybridizes to a sequence consisting of SEQ ID NO: 940 or its allele variant hybridizes, and/or specifically hybridizes to a sequence consisting of SEQ ID NO: 153, Or there is 95% or higher complementary sequence identity with these sequences; - the probe, primer or primer pair for detecting the fusion of ZFAT and NRG1 is specifically related to a sequence consisting of SEQ ID NO: 864 Sequences hybridize or allele variants thereof, and/or specifically hybridize to a sequence consisting of SEQ ID NO: 155, or have 95% or greater complementary sequence identity to such sequences; and- The probe, primer or primer pair used to detect the fusion of DSCAML1 and NRG1 is specifically hybridized with a sequence consisting of SEQ ID NO: 938 or its allele variant hybridization, and/or specifically hybridized with a sequence composed of A sequence consisting of SEQ ID NO: 153 hybridizes, or has 95% or greater complementary sequence identity to such sequences. 36. A first nucleic acid probe and a second nucleic acid probe for use in an in situ hybridization assay to detect the polynucleotide fusion of any one of items 1 to 23, - wherein the first probe is specific and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' of position 44 of SEQ ID NO: 3 -wherein the first probe is specifically hybridized to the CADM1 sequence at the nucleic acid 5' at position 53 of SEQ ID NO: 7, and the second probe is specifically at position 54 of SEQ ID NO: 7 -wherein the first probe specifically hybridizes to the CD44 sequence at the nucleic acid 5' at position 52 of SEQ ID NO: 11, and the second probe specifically hybridizes to The NRG1 sequence hybridization at the nucleic acid 3' place of position 53 of SEQ ID NO: 11; - wherein the first probe is specifically hybridized with the SLC3A2 sequence at the nucleic acid 5' position of position 53 of SEQ ID NO: 15, and The second probe is specifically hybridized to the NRG1 sequence at the 3' nucleic acid at position 54 of SEQ ID NO: 15; - wherein the first probe is specifically at the nucleic acid at position 65 of SEQ ID NO: 166 The VTCN1 sequence at the 5' hybridizes, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 66 of SEQ ID NO: 166; ID NO: The CDH1 sequence hybridization at the nucleic acid 5' place of the position 119 of 186, and the NRG1 sequence at the nucleic acid 3' place of the position 120 of the position of SEQ ID NO: 186 is specifically hybridized with the second probe; - wherein the first A probe specifically hybridizes to the CXADR sequence at the nucleic acid 5' at position 43 of SEQ ID NO: 217, and the second probe specifically hybridizes with the nucleic acid 3' at position 44 of SEQ ID NO: 217 NRG1 sequence hybridization; -wherein the first probe specifically hybridizes with the GTF2E2 sequence at the nucleic acid 5' of position 141 of SEQ ID NO: 233, and the second probe specifically hybridizes with the sequence located at SEQ ID NO: The NRG1 sequence hybridization at the nucleic acid 3' place of the position 142 of 233; - wherein the first probe is specifically hybridized with the CSMD1 sequence at the nucleic acid 5' position of the position 88 of SEQ ID NO: 255, and the second probe Specifically hybridize to the NRG1 sequence at the nucleic acid 3' at position 89 of SEQ ID NO: 255; -wherein the first probe is specifically at the PTN at the nucleic acid 5' at position 102 of SEQ ID NO: 313 Sequence hybridization, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 103 of SEQ ID NO: 313; - wherein the first probe is specifically at the position of SEQ ID NO: 330 The ST14 sequence hybridization at the nucleic acid 5' place at position 95, and the NRG1 sequence at the nucleic acid 3' place at position 96 of SEQ ID NO:330 is specifically hybridized with the second probe; -wherein the first probe specificity Hybridized with the THBS1 sequence at the nucleic acid 5' at position 56 of SEQ ID NO:376, and the second probe is specifically hybridized with the NRG1 sequence at the 3' nucleic acid at position 57 of SEQ ID NO:376; -wherein the first probe is specifically hybridized to the AGRN sequence at the nucleic acid 5' at position 106 of SEQ ID NO:403, and the second probe is specifically at position 107 of SEQ ID NO:403 The NRG1 sequence hybridization at the nucleic acid 3'place; -wherein the first probe specifically hybridizes with the PVALB sequence at the nucleic acid 5' position at position 102 of SEQ ID NO: 437, and the second probe specifically hybridizes with the nucleic acid at position 102 The NRG1 sequence hybridization at the nucleic acid 3' place of position 103 of SEQ ID NO: 437; - wherein the first probe is specifically hybridized with the SLC3A2 sequence at the nucleic acid 5' position of position 93 of SEQ ID NO: 454, and the The second probe is specifically hybridized with the NRG1 sequence at the nucleic acid 3' at position 94 of SEQ ID NO: 454; -wherein the first probe is specifically at the nucleic acid 5 at position 54 of SEQ ID NO: 486 ', and the second probe specifically hybridizes to the NRG1 sequence at the nucleic acid 3' at position 55 of SEQ ID NO: 486; -wherein the first probe specifically hybridizes to the sequence at SEQ ID NO: The WRN sequence hybridization at the nucleic acid 5' place of the position 96 of NO: 528, and the NRG1 sequence at the nucleic acid 3' place at the position 97 of SEQ ID NO: 528 is specifically hybridized with the second probe; - wherein the first The probe specifically hybridizes to the DAAM1 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 605, and the second probe specifically hybridizes with the nucleic acid 3' at position 76 of SEQ ID NO: 605. NRG1 sequence hybridization; - wherein the first probe specifically hybridizes to the ASPH sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 635, and the second probe specifically hybridizes to the sequence at SEQ ID NO: 635 The NRG1 sequence hybridization at the nucleic acid 3' place of position 76; -wherein the first probe is specifically hybridized with the NOTCH2 sequence at the nucleic acid 5' position at position 75 of SEQ ID NO: 693, and the second probe is specific Hybridization with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 693; -wherein the first probe is specifically at the CD74 sequence at the 5' nucleic acid at position 75 of SEQ ID NO: 717 Hybridization, and the second probe specifically hybridizes with the NRG1 sequence at the nucleic acid 3' at position 76 of SEQ ID NO: 717; - wherein the first probe is specifically at the position of SEQ ID NO: 743 The SDC4 sequence hybridization at the nucleic acid 5' place of 75, and the NRG1 sequence hybridization at the nucleic acid 3' place at the nucleic acid 3' place of position 76 of SEQ ID NO: 743 with this second probe specifically; - wherein the first probe specifically With the CD44 sequence hybridization at the nucleic acid 5' place of position 75 of SEQ ID NO:761, and the second probe is specifically hybridized with the NRG1 sequence at the nucleic acid 3' position of position 76 of SEQ ID NO:761;- Wherein the first probe specifically hybridizes with the SLC4A4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 765, and the second probe specifically hybridizes with the nucleic acid at position 76 of SEQ ID NO: 765 The NRG1 sequence at the 3'hybridizes; - wherein the first probe specifically hybridizes to the SDC4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 824, and the second probe specifically hybridizes to the sequence at SEQ ID NO: 824 ID NO: the NRG1 sequence hybridization at the nucleic acid 3' place of position 76 of 824; - wherein the first probe is specifically hybridized with the ZFAT sequence at the nucleic acid 5' position of position 75 of SEQ ID NO: 828, and the second The two probes are specifically hybridized to the NRG1 sequence at the 3' nucleic acid at position 76 of SEQ ID NO: 828; or- wherein the first probe is specifically at the nucleic acid 5 at position 75 of SEQ ID NO: 868 ', and the second probe specifically hybridizes to the NRG1 sequence at position 76 of SEQ ID NO: 868 at the nucleic acid 3'. 37. A first antibody or a pair of first and second antibodies for detecting a polypeptide encoded by the polynucleotide fusion of any one of items 1 to 23. 38. A detection assay comprising a first antibody or a set of first and second antibodies for detecting the presence of a polypeptide encoded by the polynucleotide fusion of any one of items 1 to 23, wherein the first The antibody or the first and second antibody group is preferably the first antibody or the first and second antibody group as described in item 34. 39. The first antibody or the set of first and second antibodies according to item 38, or the detection assay according to item 32, wherein the first antibody binds to a group selected from VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, CDH1-NRG1, CXADR-NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, ASPH- Polypeptide fusions of NRG1, NOTCH2-NRG1, CD74-NRG1, SDC4-NRG1, SLC4A4-NRG1, ZFAT-NRG1 or DSCAML1-NRG1, and the first and second antibody sets respectively bind VAPB and NRG1 or CADM1 and NRG1, or CD44 and NRG1, SLC3A2 and NRG1, CDH1 and NRG1, CXADR and NRG1, GTF2E2 and NRG1, CSMD1 and NRG1, PTN and NRG1, ST14 and NRG1, THBS1 and NRG1, AGRN and NRG1, PVALB and NRG1, APP and NRG1, WRN and NRG1, ASPH and NRG1, NOTCH2 and NRG1, CD74 and NRG1, SDC4 and NRG1, SLC4A4 and NRG1, ZFAT and NRG1, or DSCAML1 and NRG1. 40. A method of identifying the polynucleotide fusion of any one of items 1 to 23, or a polypeptide encoded thereby, in a sample, said method comprising testing a sample obtained from an individual to detect the sample Whether the fusion exists in . 41. A method for detecting the presence of a polynucleotide fusion as in any one of items 1 to 23, or a polypeptide encoded by it, in a sample, said method comprising testing a sample obtained from an individual to detect the Whether the fusion is present in the sample. 42. A method for determining whether an abnormal cell from an individual comprises a fusion as in any one of items 1 to 23, or a polypeptide encoded by it, said method comprising examining polynucleosides obtained from the abnormal cell of the individual Whether acid or polypeptide content is present in the sample for the fusion. 43. A method for identifying an individual carrying a polynucleotide fusion as in any one of items 1 to 23, or a polypeptide encoded by it, said method comprising testing a sample obtained from an individual to detect Whether the fusion exists. 44. The method according to any one of items 37 to 40, wherein the test comprises using a nucleic acid probe such as any one of items 31 to 36, a primer or a binding agent of a primer pair to specifically bind the polynucleotide Either the fusion is detected by the polypeptide encoded by it, or the fusion is detected by using a binding agent that binds a polynucleotide comprising the polynucleotide fusion. 45. The method according to any one of items 40 to 44, wherein the test comprises amplifying or detecting a sequence for distinguishing the presence or absence of the polynucleotide fusion or the polypeptide encoded by it. 46. The method according to any one of items 41 to 45, wherein the polynucleotide fusion is obtained from an abnormal cell expressing a polynucleotide fusion comprising the EGF-like domain of NRG1. 47. The method according to any one of items 40 to 46, wherein the method comprises the step of obtaining a sample from an individual, followed by the step of isolating the polynucleotide or the polypeptide encoded thereby from the sample. 48. The method according to any one of items 40 to 47, wherein the method comprises the step of purifying or isolating the polynucleotide from the sample. 49. The method according to any one of items 40 to 48, wherein the binding agent is or comprises a primer, primer pair, probe or antibody. 50. The method according to any one of items 40 to 49, wherein the test is an in vitro method, preferably an in vitro method. 51. The method according to any one of clauses 40 to 50, wherein the binding agent comprises or is associated with a detectable label. 52. The method according to any one of items 40 to 51, wherein the sample is a liquid specimen or a solid sample, such as a formalin-fixed paraffin-embedded tissue (FFPE) sample. 53. The method according to any one of items 40 to 52, wherein the sample comprises blood, serum, plasma, pleural fluid, urine, semen, amniotic fluid or ascites. 54. The method according to any one of items 40 to 53, wherein the sample comprises mutant cells, such as tumor cells or cancer cells, or polynucleotide or polypeptide content thereof. 55. A method of treating an individual suffering from an ErbB-2 and/or ErbB-3 positive cancer or tumor comprising a polynucleotide fusion and/or expressing a fusion polypeptide encoded by it, said method comprising introducing the The subject is administered an effective amount of an ErbB-2 and/or ErbB-3 targeting agent, wherein the fusion is a fusion according to any one of items 1-23. 56. A method of inhibiting the course of an individual suffering from an ErbB-2 and ErbB-3 positive cancer or tumor comprising a polynucleotide fusion and/or expressing a fusion polypeptide encoded thereby, said method comprising administering to the individual Administering an effective amount of an ErbB-2 and/or ErbB-3 targeting agent, wherein the fusion is a fusion according to any one of items 1-23. 57. An ErbB-2 and/or ErbB-2 for use in the treatment of an individual suffering from an ErbB-2 and ErbB-3 positive cancer or tumor comprising a polynucleotide fusion and/or expressing a fusion polypeptide encoded thereby or an ErbB-3 targeting agent, the treatment comprising administering to the individual an effective amount of ErbB-2 and/or an ErbB-3 targeting agent, wherein the fusion is as in any one of items 1 to 23 fusion. 58. A method for diagnosing whether an individual has abnormal cells comprising a polynucleotide fusion as in any one of items 1 to 23 or a polypeptide encoded by it, the method comprising testing a sample obtained from an individual to detect the Whether the fusion is present in the sample. 59. The method according to item 58, wherein the test comprises using a binding agent such as the nucleic acid probe, primer or primer pair of any one of items 30 to 35 to specifically bind to the polynucleotide or the polynucleotide encoded by it A polypeptide is used to detect the fusion, or a binding agent that binds a polynucleotide comprising the polynucleotide fusion is used to detect the fusion. 60. A method of assessing whether an individual suffers from or is susceptible to cancer or tumor, the method comprising testing a sample obtained from an individual to detect the presence of the polynucleoside according to any one of items 1 to 23 in the sample An acid fusion or a polypeptide encoded thereby, and by identifying the presence or absence of the polynucleotide or polypeptide fusion, assessing that the individual suffers from the cancer or tumor or is susceptible to the cancer or tumor. 61. The method or use according to any one of items 55 to 58, wherein the ErbB-2 and/or ErbB-3 targeting agent is selected from the group consisting of: an extracellular protein that binds ErbB-2 A multispecific antibody that binds to the first antigen-binding site of a portion of the extracellular portion of ErbB-3 and a second antigen-binding site that binds to the extracellular portion of ErbB-3, a tyrosine kinase inhibitor of ErbB-2, comprising a cell that binds to ErbB-2 A monospecific diabody having an antigen binding site on the outer portion, a monospecific diabody comprising an antigen binding site that binds the extracellular portion of ErbB-3, or any combination thereof. 62. The method or use according to any one of items 55 to 58, or 61, wherein the ErbB-2 and/or ErbB-3 targeting agent is zenocutuzumab. 63. The method or use according to any one of items 41 to 62, wherein the abnormal cell, cancer cell or tumor cell line comprises or is encoded by a polynucleotide fusion as in any one of items 1 to 23 The polypeptide, and wherein the polynucleotide fusion comprised by the cell further comprises a coding sequence in-frame fused to an EGF-like domain encoding NRG1. 64. The method or use according to any one of items 41 to 63, wherein the abnormal cell line is from cancer, especially said cancer is adenocarcinoma, more especially mucinous adenocarcinoma, pancreatic cancer, more especially pancreatic adenocarcinoma Cancer, more especially pancreatic ductal adenocarcinoma, renal cell carcinoma, sarcoma, bladder cancer, colorectal cancer, rectal cancer, colorectal cancer, gallbladder cancer, head and neck cancer, prostate cancer, uterine cancer, breast cancer, ovarian cancer, liver cancer, uterine Endometrial cancer, lung cancer, preferably non-small cell lung cancer, preferably, more preferably invasive mucinous adenocarcinoma, primary cancer or metastatic cancer. 65. An in vivo animal model comprising the polynucleotide fusion according to any one of items 1 to 23 and/or expressing the polypeptide fusion encoded thereby, wherein the polynucleotide fusion composed of the animal model The drug or polypeptide fusion is preferably composed of transplanted abnormal cells present in the animal model, or is composed of the gene body of the animal model. 66. A method of treating the in vivo animal model of item 66 with an Erb2 and/or Erb3 targeting agent selected from the group consisting of an extracellular moiety that binds ErbB-2 Multispecific antibody with a first antigen-binding site and a second antigen-binding site that binds to the extracellular portion of ErbB-3, a tyrosine kinase inhibitor of ErbB-2, comprising an extracellular portion that binds to ErbB-2 A monospecific diabody comprising an antigen-binding site that binds an extracellular portion of ErbB-3, or any combination thereof, the method comprising administering to the animal The Erb2 and/or Erb3 targeting agent. 67. An in situ hybridization assay for use in the detection of VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1 CXADR, GTF2E2, CSMD1, PTN, ST14, THBS1, AGRN, PVALB, APP, WRN, DAAM1, ASPH, NOTCH2, The first nucleic acid probe and the second nucleic acid probe of the gene rearrangement of CD74, SDC4, SLC4A4, ZFAT or DSCAML1, wherein: - the first probe for detecting the gene rearrangement of VAPB is specifically related to SEQ ID NO: The VAPB sequence at the nucleic acid 5' place of position 43 of 1 hybridizes, and the VAPB sequence at the nucleic acid 3' place of position 43 of SEQ ID NO: 1 is specifically hybridized with the second probe; - for detecting the gene of CADM1 The rearranged first probe specifically hybridizes to the CADM1 sequence at the nucleic acid 5' of position 53 of SEQ ID NO: 5, and the second probe specifically hybridizes to the nucleic acid at position 53 of SEQ ID NO: 5 The CADM1 sequence at the 3'hybridizes; - the first probe for detecting gene rearrangement of CD44 specifically hybridizes to the CD44 sequence at the nucleic acid 5' at position 52 of SEQ ID NO: 9, and the second probe Specifically hybridize to the CD44 sequence at the nucleic acid 3' of position 52 of SEQ ID NO:9; The SLC3A2 sequence at the nucleic acid 5' hybridizes, and the second probe specifically hybridizes with the SLC3A2 sequence at the nucleic acid 3' at position 53 of SEQ ID NO: 13; - the first gene rearrangement for detecting VTCN1 The probe specifically hybridizes with the VTCN1 sequence at the nucleic acid 5' at position 65 of SEQ ID NO: 164, and the second probe specifically hybridizes with the VTCN1 at the nucleic acid 3' at position 65 of SEQ ID NO: 164 Sequence hybridization; - the first probe for detecting gene rearrangement of CDH1 specifically hybridizes to the CDH1 sequence at the 5' position of the nucleic acid at position 119 of SEQ ID NO: 184, and the second probe specifically hybridizes to the sequence at The CDH1 sequence at the 3' position of the nucleic acid at position 119 of SEQ ID NO: 184 hybridizes; - the first probe for detecting the gene rearrangement of CXADR is specifically at the 5' position of the nucleic acid at position 43 of SEQ ID NO: 215 The CXADR sequence hybridizes, and the second probe specifically hybridizes to the CXADR sequence at the nucleic acid 3' at position 43 of SEQ ID NO: 215; - the first probe for detecting the gene rearrangement of GTF2E2 specifically With the GTF2E2 sequence hybridization at the nucleic acid 5' place of the position 141 of SEQ ID NO: 231, and the GTF2E2 sequence at the nucleic acid 3' position of the position 141 position of the second probe is specifically hybridized with SEQ ID NO: 231; - with The first probe for detecting gene rearrangement of CSMD1 specifically hybridizes with the CSMD1 sequence at the nucleic acid 5' of position 88 of SEQ ID NO: 253, and the second probe specifically hybridizes with the sequence located at SEQ ID NO: 253 The CSMD1 sequence hybridization at the 3' position of the nucleic acid at position 88; - the first probe for detecting the gene rearrangement of PTN specifically hybridizes with the PTN sequence at the 5' position of the nucleic acid at position 102 of SEQ ID NO: 311, and The second probe specifically hybridizes to the PTN sequence at the nucleic acid 3' of position 102 of SEQ ID NO: 311; The ST14 sequence hybridization at the nucleic acid 5' place of position 95 of 328, and the ST14 sequence at the nucleic acid 3' place of position 95 of SEQ ID NO: 328 is specifically hybridized with this second probe; - for detecting the gene of AGRN The rearranged first probe specifically hybridizes with the AGRN sequence at the nucleic acid 5' of position 106 of SEQ ID NO:401, and the second probe specifically hybridizes with the nucleic acid at position 106 of SEQ ID NO:401 The AGRN sequence at the 3'hybridizes; - the first probe for detecting the gene rearrangement of THBS1 specifically hybridizes to the THBS1 sequence at the 5' position of the nucleic acid at position 56 of SEQ ID NO: 374, and the second probe Specifically hybridizes to the THBS1 sequence at the 3' position of the nucleic acid at position 56 of SEQ ID NO: 374; - the first probe for detecting the gene rearrangement of PVALB is specifically at position 102 of SEQ ID NO: 435 The PVALB sequence at the 5' place of the nucleic acid hybridizes, and the second probe specifically hybridizes with the PVALB sequence at the 3' place of the nucleic acid at position 102 of SEQ ID NO: 435; - the first gene rearrangement for detecting SLC3A2 The probe specifically hybridizes with the SLC3A2 sequence at the nucleic acid 5' at position 93 of SEQ ID NO: 452, and the second probe specifically hybridizes with the SLC3A2 at the nucleic acid 3' at position 93 of SEQ ID NO: 452 Sequence hybridization; - the first probe for detecting gene rearrangement of APP specifically hybridizes to the APP sequence at the nucleic acid 5' of position 54 of SEQ ID NO: 484, and the second probe specifically hybridizes to the nucleic acid located at The APP sequence at the 3' position of the nucleic acid at position 54 of SEQ ID NO: 484 hybridizes; - the first probe for detecting the gene rearrangement of WRN is specifically at the 5' position of the nucleic acid at position 96 of SEQ ID NO: 526 The WRN sequence hybridizes, and the second probe specifically hybridizes to the WRN sequence at the 3' of the nucleic acid at position 96 of SEQ ID NO: 526; - the first probe for detecting the gene rearrangement of DAAM1 specifically Hybridization with the DAAM1 sequence at the nucleic acid 5' place of the position 75 of SEQ ID NO: 603, and the second probe is specifically hybridized with the DAAM1 sequence at the nucleic acid 3' position of the position 75 of SEQ ID NO: 603; - with The first probe for detecting gene rearrangement of ASPH specifically hybridizes to the ASPH sequence at the nucleic acid 5' of position 75 of SEQ ID NO: 633, and the second probe specifically hybridizes to the sequence located at SEQ ID NO: 633 The ASPH sequence at the 3' position of the nucleic acid at position 75 hybridizes; - the first probe for detecting the gene rearrangement of NOTCH2 specifically hybridizes with the NOTCH2 sequence at the 5' position of the nucleic acid at position 75 of SEQ ID NO: 691, and The second probe specifically hybridizes to the NOTCH2 sequence at the nucleic acid 3' of position 75 of SEQ ID NO:691; The CD74 sequence hybridization at the nucleic acid 5' place of the position 75 of 715, and the CD74 sequence at the nucleic acid 3' place of the position 75 position of SEQ ID NO: 715 is specifically hybridized with this second probe; - for detecting the gene of SDC4 The rearranged first probe specifically hybridizes to the SDC4 sequence at the nucleic acid 5' of position 75 of SEQ ID NO: 741, and the second probe specifically hybridizes to the nucleic acid at position 75 of SEQ ID NO: 741 The SDC4 sequence at the 3'hybridizes; - the first probe for detecting gene rearrangement of CD44 specifically hybridizes to the CD44 sequence at the nucleic acid 5' at position 75 of SEQ ID NO: 759, and the second probe Specifically hybridizes to the CD44 sequence at the 3' position of the nucleic acid at position 75 of SEQ ID NO: 759; - the first probe for detecting gene rearrangements of SLC4A4 is specifically at position 75 of SEQ ID NO: 763 The SLC4A4 sequence at the 5' place of the nucleic acid hybridizes, and the second probe specifically hybridizes with the SLC4A4 sequence at the 3' of the nucleic acid at position 75 of SEQ ID NO: 763; - the first gene rearrangement method for detecting SDC4 The probe specifically hybridizes with the SDC4 sequence at the nucleic acid 5' at position 75 of SEQ ID NO:822, and the second probe specifically hybridizes with the SDC4 at the nucleic acid 3' at position 75 of SEQ ID NO:822 Sequence hybridization; - the first probe for detecting the gene rearrangement of ZFAT is specifically hybridized with the ZFAT sequence at the nucleic acid 5' of position 75 of SEQ ID NO: 826, and the second probe is specifically hybridized with the position at The ZFAT sequence at the 3' position of the nucleic acid at position 75 of SEQ ID NO: 826 hybridizes; or - the first probe for detecting the gene rearrangement of DSCAML1 is specifically at the 5' position of the nucleic acid at position 75 of SEQ ID NO: 866 The DSCAML1 sequence hybridizes, and the second probe specifically hybridizes to the DSCAML1 sequence located 3' of the nucleic acid at position 75 of SEQ ID NO: 866. Example Example 1 : Test sample preparation
液態樣本或檢體包括血液、血清、血漿、胸腔積液、尿液、精液、陰道拭子、羊水、腹膜液、無細胞灌洗液或其他生物流體。Liquid samples or specimens include blood, serum, plasma, pleural effusion, urine, semen, vaginal swab, amniotic fluid, peritoneal fluid, acellular lavage fluid, or other biological fluids.
當使用血液作為起始材料時,可在以細胞穩定劑(諸如無細胞的DNA BCT管(Streck))處理過的管子中收集血液,該細胞穩定劑可防止血球細胞在收集後破裂,並將來自正常細胞的野生型DNA/RNA的汙染降至最低。 實例 2 :從血漿純化循環 DNA When using blood as the starting material, blood can be collected in tubes treated with a cell stabilizer, such as cell-free DNA BCT tubes (Streck), which prevents hemocytes from rupturing after collection and will Contamination with wild-type DNA/RNA from normal cells is minimized. Example 2 : Purification of Circulating DNA from Plasma
雖然此實例的實驗方案描述了從1 ml的血漿樣本中分離cfDNA,但當期望有更高產量及其他樣本類型(如尿液及血清)時,該實驗方案也適合從其他體積中分離。該實驗方案係基於QIAamp®循環核酸套組(CNA) (Qiagen, 產品編號#55114)及日期為10/2019 的手冊(HB-0202-006)。Although the protocol in this example describes the isolation of cfDNA from a 1 ml plasma sample, the protocol is also suitable for isolation from other volumes when higher yields are desired and other sample types such as urine and serum. The protocol is based on the QIAamp® Circulating Nucleic Acid Kit (CNA) (Qiagen, Product #55114) and brochure (HB-0202-006) dated 10/2019.
在開始之前,允許來自人類個體的血漿樣本平衡至室溫,無論是來自於適合冷藏溫度儲存的樣本或是從患者抽取後直接使用。所有的離心步驟都是在室溫下進行。必要時,用磷酸鹽緩衝鹽水(PBS)將體積低於1 ml的樣本調整到總共1 ml。 緩衝液及試劑的製備 Before starting, allow plasma samples from human subjects to equilibrate to room temperature, either from samples suitable for storage at refrigerated temperatures or drawn directly from the patient. All centrifugation steps were performed at room temperature. Sample volumes below 1 ml were adjusted to a total of 1 ml with phosphate-buffered saline (PBS), if necessary. Preparation of buffers and reagents
根據製造商的說明書來製備緩衝液ACB、ACW1及ACW2。簡言之,在使用前,將200 ml異丙醇(100 v/v%)添加到300 ml緩衝液ACB 濃縮物中以得到500 ml緩衝液ACB。在添加異丙醇之後,將所得緩衝液混合均勻。在使用前,將25 ml乙醇96–100 (v/v%)添加到19 ml緩衝液ACW1濃縮物以得到44 ml緩衝液ACW1。在添加乙醇之後將所得液體混合均勻。在使用前,將30 ml乙醇(96–100%)添加到13 ml緩衝液ACW2濃縮物中以得到43 ml緩衝液,隨後將其混合均勻。例如,要處理12個1 ml血漿樣本時,將10.6 ml緩衝液ACL與67.5 μl之含有載體的AVE緩衝液混合在一起。 從血漿分離cfDNA Buffers ACB, ACW1 and ACW2 were prepared according to the manufacturer's instructions. Briefly, 200 ml of isopropanol (100 v/v%) was added to 300 ml of Buffer ACB concentrate before use to obtain 500 ml of Buffer ACB. After the addition of isopropanol, the resulting buffer was mixed well. Before use, add 25 ml of ethanol 96–100 (v/v%) to 19 ml of Buffer ACW1 concentrate to obtain 44 ml of Buffer ACW1. The resulting liquid was mixed well after adding ethanol. Before use, 30 ml of ethanol (96–100%) was added to 13 ml of buffer ACW2 concentrate to obtain 43 ml of buffer, which was then mixed well. For example, to process 12 1 ml plasma samples, mix 10.6 ml buffer ACL with 67.5 μl AVE buffer containing vehicle. Isolation of cfDNA from plasma
流程:將100 µl QIAGEN蛋白酶K抽吸到50 ml離心管中,將1 ml的血漿添加到該50 ml離心管中。添加0.8 ml ACL緩衝液(含有1.0 µg載體RNA),之後關上蓋子並用脈衝渦動混合30秒且同時在管內形成渦流。將樣本及ACL緩衝液徹底混合以產生均質溶液。將樣本在60°C下培育30分鐘使其立即裂解。於離心管內向該溶胞產物添加1.8 ml ACB緩衝液,之後關上蓋子並用脈衝渦動徹底混合15至30秒。將該溶胞產物ACB緩衝混合液於離心管內在冰上培育5分鐘。將QIAamp小型管柱(Mini column)插入QIAvac 24 Plus上之真空接頭(VacConnector)中,並根據製造商的說明書設定。將20 ml延伸管插入敞開的QIAamp小型管柱中。將20 ml延伸管緊緊插入QIAamp小型管柱中。Procedure: 100 µl of QIAGEN Proteinase K was aspirated into a 50 ml centrifuge tube and 1 ml of plasma was added to the 50 ml centrifuge tube. Add 0.8 ml ACL buffer (containing 1.0 µg carrier RNA), then close the cap and mix by pulse vortexing for 30 seconds while vortexing the tube. Thoroughly mix the sample and ACL buffer to produce a homogeneous solution. Samples were immediately lysed by incubating at 60°C for 30 minutes. 1.8 ml of ACB buffer was added to the lysate in a centrifuge tube, then the cap was closed and mixed thoroughly with pulse vortexing for 15 to 30 seconds. The lysate ACB buffer mixture was incubated in a centrifuge tube on ice for 5 minutes. Insert the QIAamp mini column (Mini column) into the vacuum connector (VacConnector) on the QIAvac 24 Plus, and set according to the manufacturer's instructions. Insert the 20 ml extension tube into the open QIAamp Mini Column. Insert the 20 ml extension tubing tightly into the QIAamp Mini Column.
在冰上培育之後,將該溶胞產物 – ACB緩衝混合液施加至QIAamp小型管柱的延伸管中。開啟真空幫浦,且當所有溶解物已完全被抽吸通過管柱時,關閉真空幫浦且使壓力釋放至0毫巴。小心地移出且丟棄延伸管。如果為QIAvac連接系統(Connecting System)的一部分,則可使用真空調節器(Vacuum Regulator)。為了避免交叉污染,不要將延伸管在鄰近的QIAamp小型管柱上方移動。將600 µl ACW1緩衝液施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有ACW1緩衝液已被抽吸通過QIAamp小型管柱之後,關閉真空幫浦且使壓力釋放至0毫巴。將750 µl ACW2緩衝液施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有ACW2緩衝液已被抽吸通過QIAamp小型管柱之後,關閉真空幫浦且使壓力釋放至0毫巴。將750 µl乙醇(96 v/v%)施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有乙醇已被抽吸通過QIAamp離心層析管柱(spin column)之後,關閉真空幫浦且使壓力釋放至0毫巴。將QIAamp小型管柱的蓋子蓋緊,將該管柱自真空歧管中移出且拋棄真空接頭。將QIAamp小型管柱置放於乾淨的2 ml收集管中,且全速(20,000 x g; 14,000 rpm)離心3分鐘。將QIAamp小型管柱置放於新的2 ml收集管中。打開蓋子,且將總成在56°C下培育10分鐘以使膜完全乾燥。將QIAamp小型管柱置放於乾淨的1.5 ml溶離管中並丟棄最後使用的2 ml收集管。將20 µl AVE緩衝液小心地施加至QIAamp小型管柱膜(Mini membrane)之中心。蓋緊蓋子且在室溫下培育3分鐘。將溶離緩衝液AVE平衡至室溫。回收的溶離液體積為約5 µl,小於施加至QIAamp小型管柱的溶離體積。在微型離心機中以全速(20,000 x g; 14,000 rpm)將管柱離心1分鐘以溶離核酸,致使獲得細胞游離DNA。After incubation on ice, the lysate-ACB buffer mixture was applied to the extension tube of the QIAamp Mini Column. The vacuum pump was turned on, and when all lysates had been completely drawn through the column, the vacuum pump was turned off and the pressure was released to 0 mbar. Carefully remove and discard the extension tube. If part of the QIAvac Connecting System, a Vacuum Regulator can be used. To avoid cross-contamination, do not move extension tubing over adjacent QIAamp mini-columns. Apply 600 µl of ACW1 buffer to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all of the ACW1 buffer had been pumped through the QIAamp mini-column, the vacuum pump was turned off and the pressure was released to 0 mbar. Apply 750 µl of ACW2 buffer to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all of the ACW2 buffer had been pumped through the QIAamp mini-column, the vacuum pump was turned off and the pressure was released to 0 mbar. Apply 750 µl of ethanol (96 v/v%) to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all the ethanol had been drawn through the QIAamp spin column, the vacuum pump was turned off and the pressure was released to 0 mbar. Cap the QIAamp mini column tightly, remove the column from the vacuum manifold and discard the vacuum fitting. Place the QIAamp Mini Column in a clean 2 ml collection tube and centrifuge at full speed (20,000 x g; 14,000 rpm) for 3 minutes. Place the QIAamp Mini Column into a new 2 ml collection tube. The lid was removed and the assembly was incubated at 56°C for 10 minutes to completely dry the membrane. Place the QIAamp Mini Column in a clean 1.5 ml elution tube and discard the last used 2 ml collection tube. Carefully apply 20 µl of AVE Buffer to the center of the QIAamp Mini membrane. Cover tightly and incubate at room temperature for 3 minutes. Equilibrate elution buffer AVE to room temperature. The recovered eluent volume was about 5 µl, which is less than the eluent volume applied to the QIAamp Mini Column. Centrifuge the column at full speed (20,000 x g; 14,000 rpm) in a microcentrifuge for 1 minute to elute nucleic acids, resulting in cell-free DNA.
用AVE緩衝液溶離游離的循環游離細胞DNA,即用於擴增反應或儲存在–30°C至–15°C下。經純化的核酸不含蛋白質、核酸酶及其他雜質。Free circulating cell-free DNA was lysed with AVE buffer, i.e. used in amplification reactions or stored at –30°C to –15°C. Purified nucleic acids are free of proteins, nucleases, and other impurities.
接著,核酸濃度係於擴增NRG1融合物及檢測分析之前,根據製造商的建議來測定,諸如定量擴增測定法。Next, nucleic acid concentration is determined according to the manufacturer's recommendations prior to amplification of the NRG1 fusion and detection assays, such as quantitative amplification assays.
用於QIAvac 24 Plus系統之真空處理1–24 QIAGEN離心層析管柱的材料清單。QIAvac 24 Plus:處理1–24 離心層析管柱的真空歧管,包括QIAvac 24 Plus真空歧管、Luer Plugs及快速聯結管(Quick Couplings) (產品編號19413)。真空接頭(500):500型拋棄式接頭,供與魯爾槽(luer slot)或真空閥(VacValve)上的QIAamp小型管柱一起使用(產品編號19407)。真空閥(24):24個閥,供與QIAvac 24 Plus一起使用(產品編號19408)。真空調節器(Vacuum Regulator):供與QIAvac歧管一起使用(產品編號19530)。真空幫浦(230 V, 50 Hz):通用真空幫浦(容量為34 L/分鐘,8 mbar真空絕對值) (產品編號84020)。QIAvac連接系統(Connecting System):用以連接真空歧管與真空幫浦的系統:包括托盤、廢液瓶、管路、聯結管、閥門、壓力計、24個真空閥(產品編號19419)。 實例 3 :循環腫瘤 RNA (cfRNA)從血漿純化循環RNA Bill of Materials for Vacuum 1–24 QIAGEN Spin Chromatography Columns for the QIAvac 24 Plus System. QIAvac 24 Plus: Vacuum manifold for processing 1–24 centrifugal chromatography columns, including QIAvac 24 Plus vacuum manifold, Luer Plugs and Quick Couplings (Product No. 19413). Vacuum Adapter (500): Type 500 Disposable Adapter for use with QIAamp Mini Columns on a luer slot or VacValve (Product No. 19407). Vacuum Valves (24): 24 valves for use with the QIAvac 24 Plus (Product No. 19408). Vacuum Regulator: For use with the QIAvac manifold (Product No. 19530). Vacuum pump (230 V, 50 Hz): Universal vacuum pump (34 L/min capacity, 8 mbar absolute vacuum) (Product No. 84020). QIAvac Connecting System (Connecting System): A system used to connect vacuum manifolds and vacuum pumps: including trays, waste bottles, tubing, connecting tubes, valves, pressure gauges, and 24 vacuum valves (product number 19419). Example 3 : Circulating Tumor RNA (cfRNA) Purification of Circulating RNA from Plasma
雖然此實例的實驗方案描述了從4 ml的血漿樣本中分離循環RNA,但當期望有更高產量及其他樣本類型(如尿液及血清)時,該實驗方案也適合從其他體積中分離。該實驗方案係基於QIAamp®循環核酸套組(CNA) (Qiagen, 產品編號#55114)及日期為10/2019 的手冊(HB-0202-006)。Although the protocol in this example describes the isolation of circulating RNA from a 4 ml plasma sample, the protocol is also suitable for isolation from other volumes when higher yields are desired and other sample types such as urine and serum. The protocol is based on the QIAamp® Circulating Nucleic Acid Kit (CNA) (Qiagen, Product #55114) and brochure (HB-0202-006) dated 10/2019.
在開始之前,允許來自人類個體的血漿樣本平衡至室溫,無論是來自於適合冷藏溫度儲存的樣本或是從患者抽取後直接使用。所有的離心步驟都是在室溫下進行。必要時,用磷酸鹽緩衝鹽水(PBS)將體積低於4 ml的樣本調整到總共4 ml。 緩衝液及試劑的製備。 Before starting, allow plasma samples from human subjects to equilibrate to room temperature, either from samples suitable for storage at refrigerated temperatures or drawn directly from the patient. All centrifugation steps were performed at room temperature. Sample volumes below 4 ml were adjusted to a total of 4 ml with phosphate-buffered saline (PBS), if necessary. Preparation of buffers and reagents.
根據製造商的說明書來製備緩衝液ACB、ACW1及ACW2。簡言之,在使用前,將200 ml異丙醇(100 v/v%)添加到300 ml緩衝液ACB 濃縮物中以得到500 ml緩衝液ACB。在添加異丙醇之後,將所得緩衝液混合均勻。在使用前,將25 ml乙醇96–100 (v/v%)添加到19 ml緩衝液ACW1濃縮物以得到44 ml緩衝液ACW1。在添加乙醇之後將所得液體混合均勻。在使用前,將30 ml乙醇(96–100%)添加到13 ml緩衝液ACW2濃縮物中以得到43 ml緩衝液,隨後將其混合均勻。例如,要處理12個4 ml血漿樣本時,將43 ml緩衝液ACL與67.5 μl之含有載體的AVE緩衝液混合在一起。 從血漿分離cfRNA Buffers ACB, ACW1 and ACW2 were prepared according to the manufacturer's instructions. Briefly, 200 ml of isopropanol (100 v/v%) was added to 300 ml of Buffer ACB concentrate before use to obtain 500 ml of Buffer ACB. After the addition of isopropanol, the resulting buffer was mixed well. Before use, add 25 ml of ethanol 96–100 (v/v%) to 19 ml of Buffer ACW1 concentrate to obtain 44 ml of Buffer ACW1. The resulting liquid was mixed well after adding ethanol. Before use, 30 ml of ethanol (96–100%) was added to 13 ml of buffer ACW2 concentrate to obtain 43 ml of buffer, which was then mixed well. For example, to process twelve 4 ml plasma samples, mix 43 ml of buffer ACL with 67.5 μl of AVE buffer containing vehicle. Isolation of cfRNA from plasma
流程:將400 µl QIAGEN蛋白酶K抽吸到50 ml離心管中,將4 ml的血漿添加到該50 ml離心管中。添加3.2 ml ACL緩衝液(含有1.0 µg載體RNA),之後關上蓋子並用脈衝渦動混合30秒且同時在管內形成渦流。將樣本及ACL緩衝液徹底混合以產生均質溶液。將樣本在60°C下培育30分鐘使其立即裂解。於離心管內向該溶胞產物添加7.2 ml ACB緩衝液,之後關上蓋子並用脈衝渦動徹底混合15至30秒。將該溶胞產物ACB緩衝混合液於離心管內在冰上培育5分鐘。將QIAamp小型管柱(Mini column)插入QIAvac 24 Plus上之真空接頭(VacConnector)中,並根據製造商的說明書設定。將20 ml延伸管插入敞開的QIAamp小型管柱中。將20 ml延伸管緊緊插入QIAamp小型管柱中。Procedure: 400 µl of QIAGEN Proteinase K was aspirated into a 50 ml centrifuge tube and 4 ml of plasma was added to the 50 ml centrifuge tube. Add 3.2 ml ACL buffer (containing 1.0 µg carrier RNA), then close the cap and mix by pulse vortexing for 30 seconds while vortexing the tube. Thoroughly mix the sample and ACL buffer to produce a homogeneous solution. Samples were immediately lysed by incubating at 60°C for 30 minutes. To this lysate was added 7.2 ml of ACB buffer in a centrifuge tube, after which the cap was closed and mixed thoroughly by pulse vortexing for 15 to 30 seconds. The lysate ACB buffer mixture was incubated in a centrifuge tube on ice for 5 minutes. Insert the QIAamp mini column (Mini column) into the vacuum connector (VacConnector) on the QIAvac 24 Plus, and set according to the manufacturer's instructions. Insert the 20 ml extension tube into the open QIAamp Mini Column. Insert the 20 ml extension tubing tightly into the QIAamp Mini Column.
在冰上培育之後,將該溶胞產物 – ACB緩衝混合液施加至QIAamp小型管柱的延伸管中。開啟真空幫浦,且當所有溶解物已完全被抽吸通過管柱時,關閉真空幫浦且使壓力釋放至0毫巴。小心地移出且丟棄延伸管。如果為QIAvac連接系統(Connecting System)的一部分,則可使用真空調節器(Vacuum Regulator)。為了避免交叉污染,不要將延伸管在鄰近的QIAamp小型管柱上方移動。將600 µl ACW1緩衝液施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有ACW1緩衝液已被抽吸通過QIAamp小型管柱之後,關閉真空幫浦且使壓力釋放至0毫巴。將750 µl ACW2緩衝液施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有ACW2緩衝液已被抽吸通過QIAamp小型管柱之後,關閉真空幫浦且使壓力釋放至0毫巴。將750 µl乙醇(96 v/v%)施加至QIAamp小型管柱中。將管柱的蓋子打開且開啟真空幫浦。在所有乙醇已被抽吸通過QIAamp離心層析管柱(spin column)之後,關閉真空幫浦且使壓力釋放至0毫巴。將QIAamp小型管柱的蓋子蓋緊,將該管柱自真空歧管中移出且拋棄真空接頭。將QIAamp小型管柱置放於乾淨的2 ml收集管中,且全速(20,000 x g; 14,000 rpm)離心3分鐘。將QIAamp小型管柱置放於新的2 ml收集管中。打開蓋子,且將總成在56°C下培育10分鐘以使膜完全乾燥。將QIAamp小型管柱置放於乾淨的1.5 ml溶離管中並丟棄最後使用的2 ml收集管。將20 µl AVE緩衝液小心地施加至QIAamp小型管柱膜(Mini membrane)之中心。蓋緊蓋子且在室溫下培育3分鐘。將溶離緩衝液AVE平衡至室溫。回收的溶離液體積為約5 µl,小於施加至QIAamp小型管柱的溶離體積。在微型離心機中以全速(20,000 x g; 14,000 rpm)將管柱離心1分鐘以溶離核酸,致使獲得細胞游離RNA。After incubation on ice, the lysate-ACB buffer mixture was applied to the extension tube of the QIAamp Mini Column. The vacuum pump was turned on, and when all lysates had been completely drawn through the column, the vacuum pump was turned off and the pressure was released to 0 mbar. Carefully remove and discard the extension tube. If part of the QIAvac Connecting System, a Vacuum Regulator can be used. To avoid cross-contamination, do not move extension tubing over adjacent QIAamp mini-columns. Apply 600 µl of ACW1 buffer to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all of the ACW1 buffer had been pumped through the QIAamp mini-column, the vacuum pump was turned off and the pressure was released to 0 mbar. Apply 750 µl of ACW2 buffer to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all of the ACW2 buffer had been pumped through the QIAamp mini-column, the vacuum pump was turned off and the pressure was released to 0 mbar. Apply 750 µl of ethanol (96 v/v%) to the QIAamp Mini Column. Uncap the column and turn on the vacuum pump. After all the ethanol had been drawn through the QIAamp spin column, the vacuum pump was turned off and the pressure was released to 0 mbar. Cap the QIAamp mini column tightly, remove the column from the vacuum manifold and discard the vacuum fitting. Place the QIAamp Mini Column in a clean 2 ml collection tube and centrifuge at full speed (20,000 x g; 14,000 rpm) for 3 minutes. Place the QIAamp Mini Column into a new 2 ml collection tube. The lid was removed and the assembly was incubated at 56°C for 10 minutes to completely dry the membrane. Place the QIAamp Mini Column in a clean 1.5 ml elution tube and discard the last used 2 ml collection tube. Carefully apply 20 µl of AVE Buffer to the center of the QIAamp Mini membrane. Cover tightly and incubate at room temperature for 3 minutes. Equilibrate elution buffer AVE to room temperature. The recovered eluent volume was about 5 µl, which is less than the eluent volume applied to the QIAamp Mini Column. Centrifuge the column at full speed (20,000 x g; 14,000 rpm) in a microcentrifuge for 1 minute to elute nucleic acids, resulting in cell-free RNA.
用AVE緩衝液溶離游離的循環游離細胞RNA,即用於擴增反應或儲存在–30°C至–15°C下。經純化的核酸不含蛋白質、核酸酶及其他雜質。Free circulating cell-free RNA was lysed with AVE buffer, either for amplification reactions or stored at –30°C to –15°C. Purified nucleic acids are free of proteins, nucleases, and other impurities.
接著,核酸濃度係於擴增NRG1融合物及檢測分析之前,根據製造商的建議來測定,諸如定量擴增測定法。Next, nucleic acid concentration is determined according to the manufacturer's recommendations prior to amplification of the NRG1 fusion and detection assays, such as quantitative amplification assays.
用於QIAvac 24 Plus系統之真空處理1–24 QIAGEN離心層析管柱的材料清單。QIAvac 24 Plus:處理1–24 離心層析管柱的真空歧管,包括QIAvac 24 Plus真空歧管、Luer Plugs及快速聯結管(Quick Couplings) (產品編號19413)。真空接頭(500):500型拋棄式接頭,供與魯爾槽(luer slot)或真空閥(VacValve)上的QIAamp小型管柱一起使用(產品編號19407)。真空閥(24):24個閥,供與QIAvac 24 Plus一起使用(產品編號19408)。真空調節器(Vacuum Regulator):供與QIAvac歧管一起使用(產品編號19530)。真空幫浦(230 V, 50 Hz):通用真空幫浦(容量為34 L/分鐘,8 mbar真空絕對值) (產品編號84020)。QIAvac連接系統(Connecting System):用以連接真空歧管與真空幫浦的系統:包括托盤、廢液瓶、管路、聯結管、閥門、壓力計、24個真空閥(產品編號19419)。Bill of Materials for Vacuum 1–24 QIAGEN Spin Chromatography Columns for the QIAvac 24 Plus System. QIAvac 24 Plus: Vacuum manifold for processing 1–24 centrifugal chromatography columns, including QIAvac 24 Plus vacuum manifold, Luer Plugs and Quick Couplings (Product No. 19413). Vacuum Adapter (500): Type 500 disposable adapter for use with QIAamp Mini Columns on a luer slot or VacValve (Product No. 19407). Vacuum Valves (24): 24 valves for use with the QIAvac 24 Plus (Product No. 19408). Vacuum Regulator: For use with the QIAvac manifold (Product No. 19530). Vacuum pump (230 V, 50 Hz): Universal vacuum pump (34 L/min capacity, 8 mbar absolute vacuum) (Product No. 84020). QIAvac Connecting System (Connecting System): A system used to connect vacuum manifolds and vacuum pumps: including trays, waste bottles, tubing, connecting tubes, valves, pressure gauges, and 24 vacuum valves (product number 19419).
為了避免在純化期間及之後損失掉來自生物材料的RNA,首先徹底清潔任何塑料器皿或玻璃器皿,以消除可能的核糖核酸酶(RNase)污染。藉由建立及維護無RNase的環境,可以避免無意中將RNase引入分離流程中。 實例 4 :循環腫瘤細胞的分離 To avoid loss of RNA from biological material during and after purification, first thoroughly clean any plastic or glassware to eliminate possible ribonuclease (RNase) contamination. By establishing and maintaining an RNase-free environment, you can avoid inadvertently introducing RNase into the separation process. Example 4 : Isolation of Circulating Tumor Cells
此實驗方案係描述從全血分離出循環腫瘤細胞,其使用AdnaTest BreastCancerSelect套組(ADNAGEN,產品編號T-1-508)使得可經由上皮及腫瘤相關抗原產生富集免疫磁性的腫瘤細胞。抗上皮及腫瘤相關抗原的抗體與磁珠(Dynabeads)共軛以標記周邊血中的腫瘤細胞。經標記的細胞係以磁粒收集器(AdnaMag-L及AdnaMag-S)萃取且隨後裂解。從所得溶胞產物分離出mRNA且使用於檢測NRG1融合物。This protocol describes the isolation of circulating tumor cells from whole blood using the AdnaTest Breast CancerSelect Kit (ADNAGEN, Cat. No. T-1-508) allowing the generation of tumor cells immunomagnetically enriched for epithelial and tumor-associated antigens. Antibodies against epithelial and tumor-associated antigens were conjugated to magnetic beads (Dynabeads) to label tumor cells in peripheral blood. Labeled cell lines were extracted with magnetic particle harvesters (AdnaMag-L and AdnaMag-S) and subsequently lysed. mRNA was isolated from the resulting lysates and used to detect NRG1 fusions.
將全血(至少5 ml,但也可能是10 ml)收集在含有EDTA(例如,‘S Monovette® Kalium EDTA’, Sarstedt;‘BD Vacutainer® K3EDTA’, Becton Dickinson)的管子中且立即置於冰上,之後在4小時內抽吸並進行處理。Whole blood (at least 5 ml, but possibly 10 ml) is collected in tubes containing EDTA (eg, 'S Monovette® Kalium EDTA', Sarstedt; 'BD Vacutainer® K3EDTA', Becton Dickinson) and immediately placed on ice on, then aspirate and dispose of within 4 hours.
選擇磁珠(Select Bead)的製備:以抽吸方式使BreastSelect Beads(套組中所提供)徹底重新懸浮。計算全部樣本所需的BreastSelect Beads體積且將計算出的體積轉移到1.5 ml反應管。將反應管置於AdnaMag-S磁力收集器(AdnaGen GmbH, 產品編號T-1-800)中。在1分鐘後移除上清液。將磁珠用1 ml PBS (磷酸鹽緩衝鹽水; pH 7.0 – 7.3)洗滌三次。將反應管從AdnaMag-S移出並使其重新懸浮於100 μl PBS中。Preparation of Select Beads: Thoroughly resuspend the BreastSelect Beads (provided in the kit) by aspiration. Calculate the volume of BreastSelect Beads required for the entire sample and transfer the calculated volume to a 1.5 ml reaction tube. The reaction tube was placed in an AdnaMag-S magnetic collector (AdnaGen GmbH, product number T-1-800). The supernatant was removed after 1 min. Wash the beads three times with 1 ml PBS (phosphate-buffered saline; pH 7.0 – 7.3). The reaction tube was removed from the AdnaMag-S and resuspended in 100 μl PBS.
腫瘤細胞的選擇:首先將5 ml的血液樣本抽吸到15 ml管子內。將100 μl之已重新懸浮的BreastSelect Beads添加至各血液樣本中。使管子在可傾斜和旋轉的設備上於室溫下慢速旋轉(大約5 rpm)30分鐘。接著將管子置於未置有磁性滑塊的AdnaMag-L (AdnaGen GmbH, 產品編號T-1-700)內,且向下擺動以釋放蓋子中獲得的血滴。接著將磁性滑塊插入且將管子在室溫下培育3分鐘。在不碰觸到磁珠的情況下用10 ml抽吸管將血液上清液完全移除。首先從AdnaMag-L將磁性滑塊移開,向管子添加5 ml的PBS,輕輕使磁珠/細胞複合物重新懸浮,以完成洗滌三次。將磁性滑塊放回到AdnaMag-L內且將管子在室溫下培育1分鐘,隨後用抽吸管移除上清液。洗滌之後,將磁性滑塊移開,將磁珠/細胞複合物重新懸浮1 ml PBS中並轉移到1.5 ml的反應管。將反應管置於已插入磁性滑塊的AdnaMag-S內。1分鐘之後,將上清液全部移除以進行後續的細胞裂解。Selection of Tumor Cells: First a 5 ml blood sample was drawn into a 15 ml tube. Add 100 μl of resuspended BreastSelect Beads to each blood sample. The tubes were spun slowly (approximately 5 rpm) on a tilt-and-rotate device for 30 minutes at room temperature. The tube was then placed in an AdnaMag-L (AdnaGen GmbH, Product No. T-1-700) without a magnetic slide and swung down to release the blood drop captured in the cap. Magnetic slides were then inserted and tubes were incubated at room temperature for 3 minutes. Remove the blood supernatant completely with a 10 ml suction tube without touching the magnetic beads. First remove the magnetic slide from the AdnaMag-L, add 5 ml of PBS to the tube, and gently resuspend the magnetic bead/cell complex to complete the three washes. The magnetic slide was placed back into the AdnaMag-L and the tube was incubated at room temperature for 1 minute before removing the supernatant with a suction tube. After washing, the magnetic slides were removed and the bead/cell complexes were resuspended in 1 ml PBS and transferred to 1.5 ml reaction tubes. Place the reaction tube inside the AdnaMag-S with the magnetic slide inserted. After 1 minute, all supernatants were removed for subsequent cell lysis.
從AdnaMag-S將磁性滑塊移開,向每個反應管添加200 μl之經室溫平衡的裂解/結合緩衝液(套組中所提供),隨後藉由抽吸五次使重新懸浮。將磁性滑塊插入AdnaMag-S且培育1分鐘。將含有細胞溶胞產物的上清液轉移到新的1.5 ml反應管,且將含有磁珠的管子丟棄。該細胞溶胞產物係適於mRNA分離或儲存在-20°C下。該溶胞產物的mRNA係於反轉錄(RT-PCR)反應中使用寡聚胸腺嘧啶(Oligo (dT))引子來分離並轉錄成cDNA模板。隨後藉由PCR介導擴增來檢測NRG1融合物。 實例 5 :細胞外囊泡及胞泌體 The magnetic slides were removed from the AdnaMag-S, 200 μl of room temperature equilibrated lysis/binding buffer (provided in the kit) was added to each reaction tube, and then resuspended by pipetting five times. Insert the magnetic slide into the AdnaMag-S and incubate for 1 minute. The supernatant containing the cell lysate was transferred to a new 1.5 ml reaction tube, and the tube containing the magnetic beads was discarded. The cell lysate is suitable for mRNA isolation or stored at -20°C. mRNA from this lysate was isolated and transcribed into cDNA template in a reverse transcription (RT-PCR) reaction using an oligo-thymidine (Oligo (dT)) primer. NRG1 fusions were subsequently detected by PCR-mediated amplification. Example 5 : Extracellular Vesicles and Exocytosomes
使用ExoRNeasy Maxi套組(產品編號77023, QIAGEN GmbH, Hilden, 德國)而從血液獲得總囊泡RNA,該套組係基於以膜為基礎的親合性結合步驟而從無細胞的生物液體分離出胞泌體及其他EV。Total vesicular RNA was obtained from blood using the ExoRNeasy Maxi Kit (Cat. No. 77023, QIAGEN GmbH, Hilden, Germany), which is isolated from cell-free biological fluids based on a membrane-based affinity binding procedure Cytosomes and other EVs.
步驟1包括血漿分離與儲存。將全血收集於含有EDTA的BD Vacutainer®靜脈採血管(Venous Blood Collection Tubes) (產品編號367525)(或是Streck公司之PAXgene®血液ccfDNA管[產品編號768115]及無細胞DNA BCT®管,但不是RNA BCT管)中。將管子儲存在室溫或4°C下且在1小時內處理。使用擺動式斗狀轉子將血液樣本在初次採血管中以1900 x g (3000 rpm)及4°C離心10分鐘。小心將上部(黃色)血漿相轉移到新的管子(具錐形底)中且不攪亂中間的血沉棕黃層(含有白細胞和血小板)。將血漿樣本在錐形管中以3000 x g及4°C離心15分鐘(或以16,000 x g離心10分鐘 – 見上文),或是通過0.8 µm過濾器(Sartorius® Minisart® NML [Sartorius產品編號16592])。小心將澄清上清液轉移到新的管子且不攪亂沉澱物。將血漿儲存在2–8°C達6小時並在同一天處理。Step 1 includes plasma separation and storage. Whole blood was collected in BD Vacutainer® Venous Blood Collection Tubes (Product No. 367525) containing EDTA (or Streck’s PAXgene® blood ccfDNA tubes [Product No. 768115] and cell-free DNA BCT® tubes, but not the RNA BCT tube). Store tubes at room temperature or 4°C and process within 1 hour. Blood samples were centrifuged in primary collection tubes at 1900 x g (3000 rpm) for 10 minutes at 4°C using a swinging bucket rotor. Carefully transfer the upper (yellow) plasma phase to a new tube (with conical bottom) without disturbing the middle buffy coat (containing leukocytes and platelets). Centrifuge plasma samples in conical tubes at 3000 x g for 15 min at 4°C (or at 16,000 x g for 10 min – see above) or pass through a 0.8 µm filter (Sartorius® Minisart® NML [Sartorius Prod. ]). Carefully transfer the clear supernatant to a new tube without disturbing the pellet. Plasma was stored at 2–8°C for 6 hours and processed the same day.
步驟2描述從4 ml的血漿獲得及裂解胞泌體及其他細胞外囊泡(EV)。將4 ml的結合緩衝液XBP添加至4 ml的血漿中並立即輕輕倒置管子5次以混合均勻。將血漿/XBP混合物添加至exoEasy離心層析管柱(spin column)(將EV結合至膜上的膜親合性管柱)且以500 x g離心1分鐘。將流出的液體丟棄,且將管柱放回同一個收集管中。將10 ml洗滌緩衝液XWP添加至exoEasy Maxi離心層析管柱,且以5000 x g離心5分鐘。將流出的液體與收集管一起丟棄。將離心層析管柱轉移至新的收集管。將700 µl QIAzol添加至膜以裂解囊泡,以5000 x g離心5分鐘以收集溶胞產物,將完全轉移至2 ml管子。Step 2 describes the extraction and lysis of exosomes and other extracellular vesicles (EVs) from 4 ml of plasma. Add 4 ml of Binding Buffer XBP to 4 ml of plasma and immediately mix well by gently inverting the tube 5 times. The plasma/XBP mixture was added to an exoEasy spin column (a membrane affinity column that binds EVs to the membrane) and centrifuged at 500 x g for 1 min. Discard the flow-through and return the column to the same collection tube. Add 10 ml Wash Buffer XWP to the exoEasy Maxi Spin Column and centrifuge at 5000 x g for 5 minutes. Discard the effluent with the collection tube. Transfer the spin chromatography column to a new collection tube. Add 700 µl of QIAzol to the membrane to lyse the vesicles, centrifuge at 5000 x g for 5 min to collect the lysate, and transfer completely to a 2 ml tube.
步驟3描述從溶胞產物分離出總RNA。根據製造商的說明書製備緩衝液RWT及RPE。將含有溶胞產物的管子短暫地渦動且在室溫下培育5分鐘。接著添加90 µl的氯仿至管子且劇烈搖動15秒。在室溫下培育2至3分鐘之後,在4°C下以12,000 x g將管子離心15分鐘。將上部水相轉移至新的收集管,添加2倍體積的100%乙醇且徹底混合。將700 µl樣本抽吸至在2 ml收集管中的RNeasy MinElute離心層析管柱中,且以≥8000 x g (≥10,000 rpm)離心15秒。丟棄流出的液體之後,使用剩餘的樣本重複此步驟。將700 µl緩衝液RWT添加至RNeasy MinElute離心層析管柱且以≥8000 x g (≥10,000 rpm)離心15秒,隨後添加500 µl緩衝液RPE且以≥8000 x g (≥10,000 rpm)離心15秒。將每個步驟中流出的液體丟棄。最後將500 µl緩衝液RPE添加至RNeasy MinElute離心層析管柱上且以≥8000 x g (≥10,000 rpm)離心2分鐘。將收集管與流出的液體一起丟棄。Step 3 describes the isolation of total RNA from the lysate. Prepare buffers RWT and RPE according to the manufacturer's instructions. Tubes containing lysates were vortexed briefly and incubated at room temperature for 5 minutes. Then add 90 µl of chloroform to the tube and shake vigorously for 15 seconds. After incubation at room temperature for 2 to 3 minutes, the tubes were centrifuged at 12,000 x g for 15 minutes at 4°C. Transfer the upper aqueous phase to a new collection tube, add 2 volumes of 100% ethanol and mix thoroughly. Aspirate 700 µl of sample onto an RNeasy MinElute spin chromatography column in a 2 ml collection tube and centrifuge at ≥8000 x g (≥10,000 rpm) for 15 seconds. After discarding the run-off, repeat this step with the remaining sample. Add 700 µl Buffer RWT to the RNeasy MinElute spin column and centrifuge at ≥8000 x g (≥10,000 rpm) for 15 seconds, followed by 500 µl Buffer RPE and centrifuge at ≥8000 x g (≥10,000 rpm) for 15 seconds. Discard the liquid that escapes from each step. Finally, 500 µl of Buffer RPE was added to the RNeasy MinElute spin column and centrifuged at ≥8000 x g (≥10,000 rpm) for 2 minutes. Discard the collection tube along with the effluent.
將RNeasy MinElute離心層析管柱置入新的2 ml收集管且在蓋子打開下以全速離心5分鐘以使膜乾燥。將收集管與流出的液體一起丟棄。14 µl不含RNase的水來溶離RNA,讓管子靜置1分鐘,且以全速離心1分鐘。 實例 6 :經腫瘤教化的血小板 (TEP) Place the RNeasy MinElute spin chromatography column into a new 2 ml collection tube and centrifuge at full speed for 5 minutes with the cap open to dry the membrane. Discard the collection tube along with the effluent. 14 µl RNase-free water to elute the RNA, let the tube sit for 1 min, and centrifuge at full speed for 1 min. Example 6 : Tumor-Educated Platelets (TEP)
已知腫瘤細胞會將(突變的) RNA轉移到血小板中,隨後用於檢測腫瘤相關的RNA標誌物。此實驗方案將血小板mRNA的降解程度降至最低,且將其他細胞類型(如紅血球及白血球)的污染程度降至最低(Amisten S, Methods Mol Biol. 2012)。接著將獲自血小板RNA用來擴增及檢測NRG1融合物。Tumor cells are known to transfer (mutated) RNA into platelets, which are subsequently used to detect tumor-associated RNA markers. This protocol minimizes platelet mRNA degradation and minimizes contamination with other cell types such as erythrocytes and leukocytes (Amisten S, Methods Mol Biol. 2012). RNA obtained from platelets was then used to amplify and detect NRG1 fusions.
抗體共軛磁珠的製備:將1 ml Dynabead洗滌緩衝液(具0.1% (w/v) BSA的PBS,pH 7.4)與250 μl Dynabead漿料(Dynabeads® Pan Mouse IgG, Invitrogen)一起混合並置於Dynamag磁鐵(DynaMag™-2或DynaMag™-15, Invitrogen)中1分鐘,隨後將液體移除。在另一次洗滌之後,從磁鐵移出磁珠且使其重新懸浮於250 μL洗滌緩衝液中。將15 μl的抗CD235a抗體(小鼠,抗人類CD235a,紅血球表面標誌物,BD Inc, NJ, USA)及15 μl抗CD45抗體(小鼠抗人類CD45,白血球表面標誌物,BD Inc, NJ, USA)添加至磁珠,在溫和傾斜和旋轉下混合且培育至少30分鐘。將管子置於磁鐵中1分鐘以移除上清液,且將磁珠重新懸浮於250 μl洗滌緩衝液,之後儲存在4°C。Preparation of antibody-conjugated magnetic beads: Mix 1 ml Dynabead wash buffer (PBS with 0.1% (w/v) BSA, pH 7.4) with 250 μl Dynabead slurry (Dynabeads® Pan Mouse IgG, Invitrogen) and place Dynamag magnet (DynaMag™-2 or DynaMag™-15, Invitrogen) for 1 min, after which the liquid was removed. After another wash, the beads were removed from the magnet and resuspended in 250 μL of wash buffer. 15 μl of anti-CD235a antibody (mouse, anti-human CD235a, erythrocyte surface marker, BD Inc, NJ, USA) and 15 μl of anti-CD45 antibody (mouse anti-human CD45, leukocyte surface marker, BD Inc, NJ, USA) was added to the magnetic beads, mixed and incubated for at least 30 minutes with gentle tilting and rotation. The supernatant was removed by placing the tube in the magnet for 1 min, and the beads were resuspended in 250 μl wash buffer before storage at 4°C.
收集血球及純化血小板:以18 ml ACD (無菌抗凝劑:檸檬酸葡萄糖溶液, Sigma Aldrich)、12 μl的1 mM PGE1 (Prostaglandin E1, 1 mM於乙醇中的儲備液, Sigma Aldrich)、120 μl的30 mM乙醯柳酸(30 mM於乙醇中的儲備液, Sigma Aldrich)、及480 μl的0.5 M EDTA (Sigma Aldrich)來製備血小板抑制混合物。在進一步含有1.5 ml血小板抑制混合物之15 ml管子的管內收集8.5 ml的血。在室溫下以200g離心20分鐘之後,將上層上面85%之富含血小板(PRP)的血漿轉移到新的15 ml管子。在另一次離心(200g,10分鐘,室溫)移除白血球與紅血球之後,將85%的PRP轉移到50 ml管子。接著以AutoStop™ BC高效過濾器(Pall公司,產品編號ATSBC1E)將PRP過濾以移除白血球,將濾液與抗體共軛磁珠混合培育,同時在室溫下傾斜和旋轉45分鐘。將該PRP/磁珠混合物置於DynaMagTM-2內2分鐘,且將上清液轉移到新的管子以得到耗乏PRP。在重複PRP耗乏步驟之後,將上清液在室溫下以800g離心10分鐘以收取上清液。將血小板沉澱物秤重並以每100 mg沉澱物用1 ml TRIzol (Invitrogen)來溶解,樣本秤重小於100 mg時用最低量1 mL TRIzol。Collect blood cells and purify platelets: 18 ml ACD (sterile anticoagulant: citrate dextrose solution, Sigma Aldrich), 12 μl 1 mM PGE1 (Prostaglandin E1, 1 mM stock solution in ethanol, Sigma Aldrich), 120 μl A platelet inhibitory cocktail was prepared with 30 mM acetylsalicylic acid (30 mM stock solution in ethanol, Sigma Aldrich), and 480 μl of 0.5 M EDTA (Sigma Aldrich). 8.5 ml of blood were collected in tubes of 15 ml tubes further containing 1.5 ml of platelet inhibitory cocktail. After centrifugation at 200g for 20 minutes at room temperature, the upper 85% platelet rich (PRP) plasma was transferred to a new 15 ml tube. After another centrifugation (200g, 10 min, room temperature) to remove white and red blood cells, 85% of the PRP was transferred to a 50 ml tube. Then, PRP was filtered with AutoStop™ BC high-efficiency filter (Pall Company, product number ATSBC1E) to remove leukocytes, and the filtrate was mixed with antibody-conjugated magnetic beads and incubated while tilting and rotating at room temperature for 45 minutes. The PRP/magnetic bead mixture was placed in the DynaMag™-2 for 2 minutes, and the supernatant was transferred to a new tube for depleted PRP. After repeating the PRP depletion step, the supernatant was harvested by centrifugation at 800 g for 10 min at room temperature. Platelet pellets were weighed and dissolved in 1 ml TRIzol (Invitrogen) per 100 mg of pellet, with a minimum of 1 mL TRIzol for samples weighing less than 100 mg.
分離血小板RNA:將溶解於TRIzol中的血小板樣本在室溫下培育5分鐘,且向每管添加200 μl氯仿(Sigma Aldrich)。在劇烈搖動15至30分鐘之後,將管子在室溫下培育2至3分鐘,並在4°C下以12,000g離心15分鐘。將10 μg極純肝醣(UltraPure™ Glycogen, Invitrogen, 產品編號10814–010)添加至新的無RNase管子。將TRIzol/氯仿混合物的水相小心添加至含有肝醣的管子,隨後添加500 μl的冷異丙醇(Sigma Aldrich)。將管子混合並在– 20°C下靜置過夜。在4°C下以12,000g離心15分鐘之後,移除上清液且將RNA沉澱物在1 ml的75 %乙醇中洗滌。將經乾燥的RNA沉澱物溶解於無RNase水中,且可進一步使用於反轉錄為cDNA。 實例 7 : ddPCR 核酸擴增及融合物檢測 Isolation of platelet RNA: Platelet samples dissolved in TRIzol were incubated for 5 minutes at room temperature and 200 μl of chloroform (Sigma Aldrich) was added to each tube. After 15 to 30 minutes of vigorous shaking, the tubes were incubated at room temperature for 2 to 3 minutes and centrifuged at 12,000 g for 15 minutes at 4°C. Add 10 μg Ultra Pure Glycogen (UltraPure™ Glycogen, Invitrogen, Product No. 10814–010) to a new RNase-free tube. The aqueous phase of the TRIzol/chloroform mixture was carefully added to the tube containing glycogen, followed by 500 μl of cold isopropanol (Sigma Aldrich). The tubes were mixed and left overnight at –20°C. After centrifugation at 12,000g for 15 minutes at 4°C, the supernatant was removed and the RNA pellet was washed in 1 ml of 75% ethanol. The dried RNA pellet is dissolved in RNase-free water and can be further used for reverse transcription to cDNA. Example 7 : ddPCR nucleic acid amplification and fusion detection
使用微滴式數位聚合酶連鎖反應(ddPCR)來檢測來自液態檢體cfDNA及cfRNA樣本中的目標核酸融合物。Droplet digital polymerase chain reaction (ddPCR) was used to detect target nucleic acid fusions in cfDNA and cfRNA samples from liquid samples.
使用Bio-Rad’s QX100 TM或QX200 TM微滴式數位PCR系統來進行微滴式數位聚合酶連鎖反應(ddPCR),以致使檢測來自液態檢體DNA及/或RNA (cDNA)樣本中的目標核酸融合物。 Use Bio-Rad's QX100 TM or QX200 TM Droplet Digital PCR System to perform droplet digital polymerase chain reaction (ddPCR) to enable detection of target nucleic acid fusions in DNA and/or RNA (cDNA) samples from liquid specimens things.
引子及探針:引子是設計在融合接合處或斷點附近,以擴增60-200 bp大小的NRG1融合產物。使用Primer3 程式(Whitehead Institute for Biomedical Research, 麻省理工學院)來設計引子使其GC含量為50-60%且T m介於50與65°C之間(在50 mM鹽濃度及300 nM寡核苷酸濃度下),以避免二級結構及引子二聚物。 Primers and probes: Primers are designed near fusion junctions or breakpoints to amplify NRG1 fusion products with a size of 60-200 bp. The Primer3 program (Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology) was used to design primers with a GC content of 50-60% and a Tm between 50 and 65°C (at 50 mM salt concentration and 300 nM oligonucleotide nucleotide concentration) to avoid secondary structures and primer-dimers.
使用具序列特異性且用FAM、HEX或VIC染料螢光標記的TaqMan TM水解探針。探針序列係於擴增子(amplicon)的兩個引子之間選擇。水解探針的T m係比GC含量為30-80%的引子高3-10°C。該等探針的長度小於30個核苷酸。 Sequence-specific TaqMan ™ hydrolysis probes fluorescently labeled with FAM, HEX, or VIC dyes were used. The probe sequence is selected between the two primers of the amplicon. The Tm of the hydrolysis probe is 3-10°C higher than that of the primer with 30-80% GC content. The probes are less than 30 nucleotides in length.
樣本製備:使用A260光譜儀來評估輸入DNA/RNA的濃度,以確認所要裝填之目標DNA/RNA濃度是在檢測動態範圍內。QX100或QX200系統建議的DNA動態範圍為每20 μl反應物120,000個複本數。也根據製造商的建議對基因體DNA使用4切酶或6切酶與每μg DNA 10單位酶來進行限制酶消化切割。經消化切割的DNA係儲存在-20°C下直至進一步使用。Sample preparation: Use the A260 spectrometer to evaluate the concentration of input DNA/RNA to confirm that the target DNA/RNA concentration to be loaded is within the detection dynamic range. The recommended DNA dynamic range for the QX100 or QX200 system is 120,000 replicates per 20 μl reaction. Genomic DNA was also cut by restriction enzyme digestion using 4 Dicer or 6 Dicer with 10 units of enzyme per μg DNA according to the manufacturer's recommendations. Digested and cleaved DNA was stored at -20°C until further use.
當使用cfDNA做為起始材料進行實驗時,該融合序列係包括融合位點加上每一側有足夠長的毗鄰序列以涵蓋PCR擴增子。所得核苷酸序列係介於50-250 ntd。將T7啟動子序列(5'-CAGAGATGCATAATACGACTCACTATAGGGAGA-3')加到目標序列的5'端。雙股去氧核醣核酸(DNA)片段形式之合成序列係從IDT (www.idtdna.com)訂購,且於Tris-EDTA (TE)緩衝液中恢復水分至最終濃度為10 ng/μl。When performing experiments using cfDNA as starting material, the fusion sequence includes the fusion site plus contiguous sequences of sufficient length on each side to encompass the PCR amplicon. The resulting nucleotide sequence is between 50-250 ntd. A T7 promoter sequence (5'-CAGAGATGCATAATACGACTCACTATAGGGAGA-3') was added to the 5' end of the target sequence. Synthetic sequences in the form of double-stranded deoxyribonucleic acid (DNA) fragments were ordered from IDT (www.idtdna.com) and rehydrated in Tris-EDTA (TE) buffer to a final concentration of 10 ng/μl.
當使用RNA做為起始材料時,首先使用RT-PCR來生成cDNA,例如使用針對寡聚胸腺嘧啶(oligo(dT))或基因特異性導引(priming)之Bio-Rad’s iScript TMSelect cDNA合成套組。cDNA濃度係減至相等於約0.2 ng/μl RNA,且每個微滴式數位PCR反應(總體積20 μl)使用5 μl。 When using RNA as starting material, first use RT-PCR to generate cDNA, for example using Bio-Rad's iScript TM Select cDNA synthesis for oligothymine (oligo(dT)) or gene-specific priming set. The cDNA concentration was reduced to an equivalent of approximately 0.2 ng/μl RNA, and 5 μl was used per droplet digital PCR reaction (total volume 20 μl).
為了生成cfRNA之陽性對照組,使用活體外轉錄將60 ng的合成DNA轉成RNA且使用苯酚/胍基試劑將所得RNA轉錄物純化。添加DNase I以移除殘餘模板DNA。以凝膠電泳來評估RNA的品質。基於輸出到檢驗樣本的所要複本數,將所得RNA稀釋至範圍為0.25至2.5 fg的濃度。將供使用於陽性對照組之10 μl的分析RNA等分儲存在– 80°C下。使用無核酸酶的水作為cfDNA及cfRNA的陰性對照組。To generate a positive control for cfRNA, 60 ng of synthetic DNA was converted to RNA using in vitro transcription and the resulting RNA transcript was purified using a phenol/guanidine based reagent. Add DNase I to remove residual template DNA. RNA quality was assessed by gel electrophoresis. The resulting RNA was diluted to concentrations ranging from 0.25 to 2.5 fg based on the desired number of replicates exported to the test sample. Aliquots of 10 μl of analyzed RNA for use in positive controls were stored at –80°C. Nuclease-free water was used as a negative control for cfDNA and cfRNA.
ddPCR反應:製備20 μl PCR混合物,其含有樣本核酸(1 μg的DNA或cDNA,最終濃度為50 ng/μl)、探針用之2x ddPCR supermix ((Bio-Rad, 產品編號1863023;不含2’-去氧尿核苷 5’-三磷酸鹽) - 10 μl、20x融合特異性引子/探針組(450 nmol/L引子,250 nmol/L FAM探針) - 1 μl、20x對照目標引子/探針組(450 nmol/L引子,250 nmol/L HEX探針)- 1 μl及無核酸酶的水。建立稍微超過20 μl (22-25 μl)的初始反應物池,以確保能將20 μl的混合物轉移至DG8卡匣(Bio-Rad, 產品編號1864007)。將該等反應混合物合併並於一個單獨的管子中(而非在微滴產生卡匣中)混合均勻。接著將該等反應混合物轉移至已預載於DG8卡匣支架(Bio-Rad, 產品編號1863051)中之DG8卡匣。ddPCR reaction: Prepare 20 μl PCR mixture containing sample nucleic acid (1 μg DNA or cDNA, final concentration 50 ng/μl), 2x ddPCR supermix ((Bio-Rad, product number 1863023; excluding 2 '-deoxyuridine 5'-triphosphate) - 10 μl, 20x fusion-specific primer/probe set (450 nmol/L primer, 250 nmol/L FAM probe) - 1 μl, 20x control target primer /probe set (450 nmol/L primer, 250 nmol/L HEX probe) - 1 μl and nuclease-free water. Set up an initial reaction pool of slightly more than 20 μl (22-25 μl) to ensure that the 20 μl of the mixture was transferred to a DG8 cassette (Bio-Rad, product number 1864007). The reaction mixtures were combined and mixed well in a separate tube (not in the droplet generation cassette). Then the The reaction mixture was transferred to a DG8 cassette preloaded in a DG8 cassette holder (Bio-Rad, prod. no. 1863051).
裝填20 μl PCR反應物之後,將70 μl的微滴產生油(Bio-Rad, 產品編號1863005)裝填入DG8卡匣各孔底部。在DG8卡匣頂部安裝一個襯墊,將其置入QX200微滴產生儀。該微滴產生儀在約2.5分鐘內對8個樣本產生每樣本約20,000個微油滴。接著輕輕將微油滴抽吸轉移至96孔盤。使用Bio-Rad’s PX1TM PCR孔盤封膜儀(Plate Sealer)及可穿刺熱封箔將PCR孔盤熱封。接著將PCR孔盤置入具有96深孔反應模組之C1000 Touch TM熱循環儀(Thermal Cycler)中進行PCR。熱循環條件如下:在95°C (10分鐘,1循環)活化酵素;變性(94°C, 30秒)及黏合/延長(55°C, 1分鐘) 40個循環;在98°C下酵素失活,10分鐘,1循環。使用每秒2°C的變溫速率。 After loading 20 μl of PCR reaction, 70 μl of droplet generation oil (Bio-Rad, prod. no. 1863005) was loaded into the bottom of each well of the DG8 cassette. Install a gasket on top of the DG8 cassette and place it into the QX200 Droplet Generator. The droplet generator generates about 20,000 oil droplets per sample for 8 samples in about 2.5 minutes. The oil droplets were then gently transferred by suction to a 96-well plate. PCR well plates were heat sealed using Bio-Rad's PX1™ PCR Plate Sealer and a pierceable heat seal foil. Then put the PCR well plate into a C1000 Touch TM thermal cycler (Thermal Cycler) with a 96-deep well reaction module for PCR. Thermal cycling conditions were as follows: Enzyme activation at 95°C (10 minutes, 1 cycle); 40 cycles of denaturation (94°C, 30 seconds) and adhesion/prolongation (55°C, 1 minute); enzyme at 98°C Inactivation, 10 minutes, 1 cycle. A ramp rate of 2°C per second was used.
在PCR擴增微油滴中的核酸目標之後,將含有微油滴的孔盤置入QX100或QX200微滴讀取儀,其中係使用軟體QuantaSoft,使用雙色檢測系統(設定成檢測FAM及HEX)個別分析每個微油滴。含有至少一個複本的目標DNA/RNA分子之陽性微油滴與陰性微油滴相比係展現出螢光增強。其濃度係以每μl最終1x ddPCR反應物中的複本數報告。使用閾值工具正確地指出微油滴群為雙陰性(FAM及HEX皆為陰性)、FAM陽性、HEC陽性、及雙陽性(FAM及HEX皆為陽性)。使用ABS分析,以每微升之複本數及每微滴之複本數得到目標物的絕對量化。 實例 8 :錨定多重 PCR After PCR amplification of the nucleic acid targets in the oil droplets, the plate containing the oil droplets is placed into a QX100 or QX200 droplet reader using QuantaSoft software using a two-color detection system (set to detect FAM and HEX) Each oil droplet is analyzed individually. Positive oil droplets containing at least one copy of the target DNA/RNA molecule exhibit increased fluorescence compared to negative oil droplets. Concentrations are reported as number of replicates per μl final 1x ddPCR reaction. Use the threshold tool to correctly indicate that the oil droplet populations are double negative (both FAM and HEX are negative), FAM positive, HEC positive, and double positive (both FAM and HEX are positive). Absolute quantification of the target was obtained in replicates per microliter and replicates per microdrop using ABS analysis. Example 8 : Anchored Multiplex PCR
使用錨定多重PCR (AMP)來檢測基因融合物及檢測與所感興趣之基因的多重融合。以下實驗方案係基於Archer FusionPlex實體瘤(Solid Tumor)套組(ArcherDX, 產品編號AB0005),且使用獲自例如實例6之液態檢體的RNA來進行。 樣本庫製備: Anchored multiplex PCR (AMP) was used to detect gene fusions and to detect multiple fusions to a gene of interest. The following experimental protocol is based on the Archer FusionPlex Solid Tumor Kit (ArcherDX, Product No. AB0005), and is carried out using RNA obtained from liquid specimens such as Example 6. Sample bank preparation:
包括一個含有至少數個已證實的基因融合之陽性對照組。包括一個非模板對照組作為每次運作時另外的樣本。Include a positive control group containing at least several confirmed gene fusions. A non-template control group was included as an additional sample for each run.
隨機導引、第一與第二股cDNA合成:該測定法所要輸入的是200 ng RNA,其係稀釋於pH8.0之10 mM Tris HCl中。將20 μl經稀釋的RNA添加至預冷的隨機導引(Random Priming)試劑連排反應管中(套組中所提供)。在混合及短暫快速旋轉之後,將混合物轉移到96孔PCR孔盤,以RT膜(USA Scientific, 產品編號2921-7800)密封並在熱循環儀鋁塊上以65°C培育5分鐘(蓋子加熱下)。Random-primed, first and second strand cDNA synthesis: The input to the assay is 200 ng of RNA diluted in 10 mM Tris HCl, pH 8.0. Add 20 μl of diluted RNA to pre-chilled Random Priming Reagent Strips (provided in the kit). After mixing and a brief quick spin, the mixture was transferred to a 96-well PCR well plate, sealed with RT membrane (USA Scientific, prod. no. 2921-7800) and incubated at 65°C for 5 minutes on a thermal cycler aluminum block (lid heated Down).
將隨機導引產物轉移到第一股試劑連排反應管(置於預冷鋁塊中)且混合、短暫快速旋轉,並轉移到96孔PCR孔盤。使用以下熱循環儀程式:25°C 10分鐘,42°C 30分鐘,80°C 20分鐘,保持4°C(蓋子加熱下)。The random priming products were transferred to the first reagent train tube (in a pre-cooled aluminum block) and mixed, spun briefly, and transferred to a 96-well PCR well plate. Use the following thermal cycler program: 25 °C for 10 min, 42 °C for 30 min, 80 °C for 20 min, hold at 4 °C (with lid heated).
於一組新的PCR連排反應管中以無核酸酶的水製作1:10稀釋的第一股產物,以使用於Pre-Seq品質控制(QC)測定法之中。QC測定法主要是用於驗證非模板對照組中沒有cDNA合成。根據製造商的實驗方案進行QC測定法。A 1:10 dilution of the first strand was made in nuclease-free water in a new set of PCR reaction strips for use in the Pre-Seq quality control (QC) assay. The QC assay is primarily used to verify the absence of cDNA synthesis in the non-template control group. QC assays were performed according to the manufacturer's protocol.
對於第二股cDNA合成,添加21 μl無核酸酶的水至其餘的第一股產物,且將40 μl的此產物添加至第二股試劑連排反應管(套組中所提供,置於預冷鋁塊中)。在混合及短暫快速旋轉之後,將混合物轉移到96孔PCR孔盤並密封。將孔盤插入熱循環儀鋁塊上,且使用以下熱循環儀程式:16°C 60分鐘,75°C 20分鐘,保持4°C(蓋子加熱下)。For second-strand cDNA synthesis, add 21 μl of nuclease-free water to the remaining first-strand product, and add 40 μl of this product to the second-strand reagent strip (supplied in the kit, set in the pre- cold aluminum block). After mixing and a brief quick spin, the mixture was transferred to a 96-well PCR well plate and sealed. Insert the well plate into a thermal cycler aluminum block and use the following thermal cycler program: 16 °C for 60 min, 75 °C for 20 min, hold at 4 °C (with lid heated).
末端修復(End Repair):將40 μl的第二股產物轉移到末端修復試劑連排反應管(套組中所提供,置於預冷鋁塊中)。在混合及短暫快速旋轉之後,將連排反應管在熱循環儀鋁塊中以25°C培育30分鐘,隨後保持4°C。在室溫下將該末端修復產物添加至純化珠粒(Agencourt AMPure XP Beads, Bechman Coulter, 產品編號A63881)。在混合之後,將該混合物在室溫下培育5分鐘,隨後在磁鐵(Alpaqua, 產品編號A32782)上培育5分鐘。丟棄上清液,將珠粒用200 μl 70%乙醇以培育30秒及風乾5分鐘進行洗滌兩次。使珠粒重新懸浮於22 μl之pH 8.0的10 mM Tris HCl中,離開磁鐵培育3分鐘,隨後在磁鐵上培育2分鐘。End Repair: Transfer 40 μl of the second-strand product to the End Repair Reagent Strip Reaction Tube (provided in the kit, placed in a pre-cooled aluminum block). After mixing and a brief quick spin, the reaction tube strips were incubated in a thermal cycler aluminum block at 25°C for 30 minutes and then held at 4°C. The end-repair product was added to purification beads (Agencourt AMPure XP Beads, Bechman Coulter, Product No. A63881 ) at room temperature. After mixing, the mixture was incubated at room temperature for 5 minutes, followed by incubation on a magnet (Alpaqua, Product No. A32782) for 5 minutes. The supernatant was discarded, and the beads were washed twice with 200 μl of 70% ethanol, incubated for 30 seconds and air-dried for 5 minutes. Beads were resuspended in 22 μl of 10 mM Tris HCl, pH 8.0, incubated for 3 minutes off the magnet, followed by 2 minutes on the magnet.
接合步驟1:從末端修復珠粒純化孔盤取出20 μl轉移入接合步驟1連排反應管(套組中所提供,置於預冷鋁塊中)。在混合及短暫快速旋轉之後,將混合物於96孔PCR孔盤中在熱循環儀鋁塊內以37°C培育15分鐘,隨後保持4°C(蓋子加熱下)。將全部體積的接合步驟1產物添加至50 μl之經室溫平衡的珠粒。在混合之後,將該混合物在室溫下培育5分鐘,隨後在磁鐵上培育5分鐘。丟棄上清液,將珠粒用200 μl 70%乙醇以培育30秒進行洗滌兩次。在最終洗滌之後,移除全部的70%乙醇且將珠粒風乾5分鐘。將珠粒從磁鐵移開且重新懸浮在42 μl之pH 8.0的10 mM Tris HCl中,離開磁鐵培育3分鐘,隨後在磁鐵上培育2分鐘。Ligation step 1: Transfer 20 μl from the end-repair bead purification plate into ligation step 1 run-through reaction tubes (provided in the kit, placed in a pre-cooled aluminum block). After mixing and a brief quick spin, the mixture was incubated in a 96-well PCR well plate in a thermal cycler aluminum block at 37°C for 15 minutes, then kept at 4°C (with lid heating). The full volume of ligation step 1 product was added to 50 μl of room temperature equilibrated beads. After mixing, the mixture was incubated at room temperature for 5 minutes, followed by incubation on the magnet for 5 minutes. The supernatant was discarded and the beads were washed twice with 200 μl of 70% ethanol with a 30 second incubation. After the final wash, all 70% ethanol was removed and the beads were air dried for 5 minutes. Beads were removed from the magnet and resuspended in 42 μl of 10 mM Tris HCl, pH 8.0, incubated for 3 minutes off the magnet, followed by 2 minutes on the magnet.
接合步驟2:從4°C儲藏庫中移出MBC適配連排反應管試劑(套組中所提供)並正確編號。為了定序目的,還記錄了樣本的特定索引。Ligation Step 2: Remove MBC Adapter Strip Reagents (provided in the kit) from 4°C storage and number them correctly. Specific indices of samples are also recorded for sequencing purposes.
從接合步驟1珠粒純化孔盤中取出40 μl轉移至MBC適配連排反應管(ArcherDX,產品編號AK0016-48,置於預冷鋁塊中)。在混合及短暫快速旋轉之後,將混合物轉移至接合步驟2試劑連排反應管(套組中所提供,也置於預冷鋁塊中)。在混合及短暫快速旋轉之後,將混合物在熱循環儀中以22°C培育5分鐘,隨後保持4°C。Transfer 40 μl from the bead purification well plate of ligation step 1 to MBC adapter series reaction tubes (ArcherDX, product number AK0016-48, placed in a pre-cooled aluminum block). After mixing and a brief quick spin, transfer the mixture to the ligation step 2 reagent strip (supplied in the kit, also placed in a pre-chilled aluminum block). After mixing and a brief quick spin, the mixture was incubated in a thermal cycler at 22°C for 5 minutes, then held at 4°C.
將接合清理(Ligation Cleanup)珠粒(經室溫平衡)渦動且將50 μl添加至新的PCR連排反應管組。在磁鐵上培育1分鐘之後,丟棄上清液。將連排反應管從磁鐵移開且重新懸浮於50 μl的接合清理緩衝液中。Ligation Cleanup beads (room temperature equilibrated) were swirled and 50 μl added to a new PCR strip. After 1 minute incubation on the magnet, the supernatant was discarded. The reaction tube strips were removed from the magnet and resuspended in 50 μl of Ligation Cleanup Buffer.
將接合步驟2產物添加至接合清理珠粒連排反應管,渦動混合且在室溫下培育5分鐘。重複進行混合及培育,之後將樣本在磁鐵上培育1分鐘並丟棄上清液。添加200 μl的接合清理緩衝液使其重新懸浮,且在快速旋轉之後將該混合物置於磁鐵上1分鐘。重複用接合清理緩衝液進行洗滌。使用超純水進行相同的洗滌動作,之後將珠粒重新懸浮於20 μl的5 mM NaOH中且於PCR孔盤中在熱循環儀鋁塊內用以下條件培育:75°C 10分鐘,隨後保持4°C(蓋子加熱下)。將PCR孔盤置於磁鐵上至少3分鐘。The ligation step 2 product was added to the ligated clean-up bead strip reaction tube, vortexed and incubated at room temperature for 5 minutes. Mixing and incubation were repeated, after which the samples were incubated on the magnet for 1 min and the supernatant was discarded. 200 μl of ligation cleanup buffer was added to resuspend and the mixture was placed on the magnet for 1 min after a quick spin. Repeat the wash with ligation cleanup buffer. The same wash was performed with ultrapure water, after which the beads were resuspended in 20 μl of 5 mM NaOH and incubated in a PCR well plate in a thermal cycler aluminum block with the following conditions: 75°C for 10 minutes followed by a hold 4°C (with lid heated). Place the PCR well plate on the magnet for at least 3 minutes.
第一次PCR:將2 μl的NRG1特異性引子(GSP1引子)添加至第一次PCR試劑連排反應管中的每個孔(套組中所提供)並與18 μl的接合步驟2清理產物混合。在短暫快速旋轉之後,將混合物在熱循環儀中用以下程式培育:95°C 3分鐘,15個循環;95°C 30秒 - 65°C 5分鐘(變溫速率100%);72°C 3分鐘,隨後保持4°C(蓋子加熱下)。1st PCR: Add 2 μl of NRG1-specific primers (GSP1 primers) to each well in the 1st PCR Reagent Strip (provided in the kit) and combine with 18 μl of the conjugation step 2 cleanup product mix. After a brief quick spin, the mixture was incubated in a thermocycler with the following program: 95°C for 3 min, 15 cycles; 95°C for 30 sec - 65°C for 5 min (100% ramp rate); 72°C for 3 min min, followed by maintaining at 4 °C (with the lid heated).
將20 μl的第一次PCR產物添加至24 μl之經室溫平衡的珠粒,混合並在室溫下培育5分鐘且在磁鐵上培育2分鐘。之後移除上清液,將珠粒用200 μl 70%乙醇以培育30秒及風乾進行洗滌兩次,並使其重新懸浮於24 μl之pH8.0的10 mM Tris HCl。 第二次PCR及樣本庫量化 20 μl of the first PCR product was added to 24 μl of room temperature equilibrated beads, mixed and incubated for 5 minutes at room temperature and 2 minutes on the magnet. The supernatant was then removed, the beads were washed twice with 200 μl of 70% ethanol incubated for 30 seconds and air-dried, and resuspended in 24 μl of 10 mM Tris HCl, pH 8.0. The second PCR and sample bank quantification
將2 μl之另外的NRG1特異性引子(GSP2引子)添加至在冷鋁塊上之正確標記的第二次PCR試劑連排反應管中的每個孔(套組中所提供),且與18 μl的第一次PCR清理產物混合。在短暫快速旋轉之後,將混合物在熱循環儀中用以下條件培育:95°C 3分鐘,18個循環;95°C 30秒 - 65°C 5分鐘(變溫速率100%);72°C 3分鐘,隨後保持4°C(蓋子加熱下)。Add 2 μl of additional NRG1-specific primers (GSP2 primers) to each well in a properly labeled second PCR reagent strip reaction tube (supplied in the kit) on a cold aluminum block and mix with 18 μl of the first PCR cleanup product mix. After a brief quick spin, the mixture was incubated in a thermal cycler with the following conditions: 95°C for 3 min, 18 cycles; 95°C for 30 sec - 65°C for 5 min (100% ramp rate); 72°C for 3 min min, followed by maintaining at 4 °C (with the lid heated).
將20 μl的第二次PCR產物添加至24 μl之經室溫平衡的珠粒,混合並在室溫下培育5分鐘且在磁鐵上培育2分鐘。之後移除上清液,將珠粒用200 μl 70%乙醇以培育30秒及風乾進行洗滌兩次。在磁鐵上培育2分鐘,將20 μl的第二次PCR產物轉移至新的PCR孔盤並進行定量。20 μl of the second PCR product was added to 24 μl of room temperature equilibrated beads, mixed and incubated for 5 minutes at room temperature and 2 minutes on the magnet. Afterwards, the supernatant was removed, and the beads were washed twice with 200 μl of 70% ethanol, incubated for 30 seconds and air-dried. After 2 minutes of incubation on the magnet, 20 μl of the second PCR product was transferred to a new PCR plate and quantified.
每一樣本庫的濃度係以qPCR使用用於Illumina平台編碼KK4973之Kapa Biosystems樣本庫定量套組(Library Quantification Kit)根據製造商的說明書來測定。用pH8.0之10 mM Tris HCl與0.05% Tween將第二次PCR產物連續稀釋。以每孔6μl之樣本庫定量主混合液(library quantitation master mix) (Kapa Biosystems, 產品編號KK4973)添加至96孔光學反應盤,隨後添加4 μl之合適稀釋品或標準品(Kapa Biosystems, 產品編號KK4906、KK4903)。使用以下qPCR程式:95°C 5分鐘1個循環;95°C 30秒(變溫速率4.4°C/秒) - 60°C 45秒(變溫速率2.2°C/秒) 35個循環。完成樣本庫定量之後,將所有的樣本庫均一化,且將10 μl之每個經均一化的樣本庫合併到一個1.5 ml微量離心管彙集。 樣本庫之定序: The concentration of each library was determined by qPCR using the Kapa Biosystems Library Quantification Kit (Library Quantification Kit) for the Illumina platform code KK4973 according to the manufacturer's instructions. The second PCR product was serially diluted with 10 mM Tris HCl pH 8.0 and 0.05% Tween. Add 6 μl per well of the library quantitation master mix (library quantitation master mix) (Kapa Biosystems, product number KK4973) to the 96-well optical reaction plate, and then add 4 μl of the appropriate dilution or standard (Kapa Biosystems, product number KK4906, KK4903). The following qPCR program was used: 1 cycle of 95°C for 5 minutes; 35 cycles of 95°C for 30 seconds (temperature ramp rate 4.4°C/sec) - 60°C for 45 seconds (temperature ramp rate 2.2°C/sec). After pool quantification was complete, all pools were normalized and 10 μl of each normalized pool pooled into a 1.5 ml microcentrifuge tube. Sequencing of sample banks:
從冷藏庫中移出定序試劑卡匣並置入去離子水直到填充線至少一小時。使定序試劑套組(MiSeq試劑套組v3 – 600循環; Illumina; 產品編號MS-102-3003)與室溫平衡。將10 μl的樣本庫池與10 μl的0.2N NaOH合併以製作變性擴增子樣本庫(DAL)池且在室溫下培育5分鐘。將10 μl之pH 7.0的200 mM Tris HCl添加至DAL,隨後添加970 μl的HT1緩衝液(套組中所提供)。Remove the sequencing reagent cartridge from the freezer and place in deionized water to the fill line for at least one hour. The sequencing reagent set (MiSeq Reagent Set v3 – 600 cycles; Illumina; Product No. MS-102-3003) was equilibrated to room temperature. 10 μl of the library pool was combined with 10 μl of 0.2N NaOH to make a denatured amplicon library (DAL) pool and incubated at room temperature for 5 minutes. 10 μl of 200 mM Tris HCl, pH 7.0 was added to the DAL, followed by 970 μl of HT1 buffer (provided in the kit).
合併300 μl的HT1、25 μl的20 pM PhiX及675 μl的DAL池以製作最終裝填管。在混合及短暫快速旋轉之後,將裝填管的全部體積添加至該定序試劑卡匣的樣本孔,且將該卡匣裝填到定序儀,以2×151 bp讀長與2×8索引測序數據(index reads)運行(MiSeqDx System – Illumina)。Combine 300 μl of HT1, 25 μl of 20 pM PhiX, and 675 μl of the DAL pool to make the final fill tube. After mixing and a brief quick spin, add the full volume of the fill tube to the sample well of the sequencing reagent cartridge, and load the cartridge into the sequencer for sequencing with 2×151 bp reads and 2×8 indexes Data (index reads) were run (MiSeqDx System - Illumina).
RNA融合數據係使用適合的生物資訊數據分析軟體來分析。 實例 9 :次世代定序 RNA fusion data were analyzed using appropriate bioinformatics data analysis software. Example 9 : Next Generation Sequencing
次世代定序係以DNA樣本使用MI-Exome檢驗(Caris Molecular Intelligence®, Caris Life Sciences)來進行,該檢驗係使用DNA且使用定製人類外顯體套組提供腫瘤突變剖析,以檢測包括有單一核苷酸同質多形性、插入/缺失及DNA重排及融合事件等之變異體。Next-generation sequencing was performed on DNA samples using the MI-Exome assay (Caris Molecular Intelligence®, Caris Life Sciences), which uses DNA and provides tumor mutational profiling using custom human exome panels to detect mutations including Variants of single nucleotide polymorphisms, insertions/deletions, and DNA rearrangements and fusion events.
以下的定製外顯體套組(panel)為使用五種套組的混合套組。該套組係使用Illumina NovaSeq 6000儀器來進行驗證。該混合套組包括兩種獲自Agilent的現成套組以及三種由Caris開發且優化的套組: Agilent Human All Exon V7 套組(48 MB) Agilent SNP Backbone套組(具1 MB解析度之3 MB套組) Caris 719-Gene Targeted Clinical套組(1 MB) Caris Intronic Fusion套組(0.1 MB) Caris Pathogenic套組(0.1 MB) The following customized exosome panel (panel) is a mixed panel using five panels. The set was validated using the Illumina NovaSeq 6000 instrument. This hybrid kit consists of two off-the-shelf kits from Agilent and three optimized kits developed by Caris: Agilent Human All Exon V7 Kit (48 MB) Agilent SNP Backbone Kit (3 MB kit with 1 MB resolution) Caris 719-Gene Targeted Clinical Kit (1 MB) Caris Intronic Fusion Package (0.1 MB) Caris Pathogenic Package (0.1 MB)
對於以RNA作為起始材料時,係使用全基因轉錄本測定法來進行次世代定序,例如MI TranscriptomeTM測定組(Caris Molecular Intelligence®, Caris Life Sciences)。當在Illumina NovaSeq儀器上定序時,該測定組係使用Agilent SureSelect Human All Exon套組來檢測基因融合物及剪接變異體。 實例 10 :螢光原位雜交 (FISH) For RNA starting material, next-generation sequencing is performed using whole-genome transcript assays, such as the MI TranscriptomeTM assay set (Caris Molecular Intelligence®, Caris Life Sciences). The assay was used to detect gene fusions and splice variants using the Agilent SureSelect Human All Exon set when sequencing on the Illumina NovaSeq instrument. Example 10 : Fluorescence in situ hybridization (FISH)
對獲自血漿或血液或其他液態檢體樣本上之循環腫瘤細胞(CTC)利用斷裂分離探針進行FISH。例如,循環腫瘤細胞係使用CellSearch® (Menarini silicon biosystems, 義大利)或ISET® (Rarecells Diagnostics, 巴黎,法國)系統而純化自血漿、血清等,並固定在載玻片上。FISH is performed on circulating tumor cells (CTCs) obtained from plasma or blood or other liquid specimens using a fragmentation probe. For example, circulating tumor cell lines are purified from plasma, serum, etc. using the CellSearch® (Menarini silicon biosystems, Italy) or ISET® (Rarecells Diagnostics, Paris, France) system, and fixed on glass slides.
簡言之,將從全血中濃縮的CTC封固在獨立的載玻片上(Frithiof, Henrik等人,OncoTargets and therapy 第9卷, 7095-7103, 2016年11月16日)。將含有CTC的溶液(~900 μL)轉移至1.5 mL Eppendorf管且置於磁性托盤中。培育10分鐘之後,移除未附著的溶劑。使細胞重新懸浮於10 μL的1倍磷酸鹽緩衝鹽水且封固在超凍(superfrost)載玻片(ThermoScientific, 德國)且在37°C下培育30分鐘。將載玻片浸入100%甲醇中5分鐘完成固定。將樣本儲存在−20°C直至進一步評估。Briefly, CTCs concentrated from whole blood were mounted on separate glass slides (Frithiof, Henrik et al., OncoTargets and therapy Vol. 9, 7095-7103, Nov. 16, 2016). The CTC-containing solution (-900 μL) was transferred to a 1.5 mL Eppendorf tube and placed in a magnetic tray. After 10 minutes of incubation, unattached solvent was removed. Cells were resuspended in 10 μL of 1X phosphate buffered saline and mounted on superfrost slides (ThermoScientific, Germany) and incubated at 37°C for 30 minutes. Fix the slides by immersing them in 100% methanol for 5 min. Samples were stored at −20°C until further evaluation.
使用ZytoLight® FISH-組織實施套組(Tissue Implementation Kit) (ZytoVision, 產品編號Z-2028-5)來完成細胞預處理。緩衝液PT1、ES1、WB1、WB2及MT7為套組中所提供。Cell pretreatment was accomplished using the ZytoLight® FISH-Tissue Implementation Kit (ZytoVision, Product No. Z-2028-5). Buffers PT1, ES1, WB1, WB2 and MT7 are provided in the kit.
將載玻片在70 °C下培育10分鐘,隨後在二甲苯中培育2 x 10分鐘。接著使用一連串乙醇的100%、100%、90%、70%乙醇使載玻片再水合各5分鐘,且最後在去離子水或蒸餾水中洗滌2 x 2分鐘。在98 °C下於預加溫之熱預處理檸檬酸液(Heat Pretreatment Solution Citric (PT1))中將載波片培育15分鐘,且立即轉移至去離子水或蒸餾水洗滌2 x 2分鐘。吸乾水分之後,向該樣本施用胃蛋白酶溶液(Pepsin Solution (ES1))且在37 °C之恆濕箱中培育15分鐘,並用洗滌緩衝液SSC (WB1)洗滌5分鐘。將載玻片於去離子水中洗滌1分鐘,在一連串乙醇的70%、90%、100%乙醇中脫水各1分鐘並風乾。Slides were incubated at 70 °C for 10 minutes, followed by 2 x 10 minutes in xylene. Slides were then rehydrated using a series of ethanol 100%, 100%, 90%, 70% ethanol for 5 minutes each, and finally washed 2 x 2 minutes in deionized or distilled water. Slides were incubated in pre-warmed Heat Pretreatment Solution Citric (PT1) for 15 minutes at 98°C and immediately transferred to deionized or distilled water for 2 x 2 minute washes. After the moisture was blotted, the sample was applied with pepsin solution (Pepsin Solution (ES1)) and incubated in a humidity chamber at 37°C for 15 minutes, and washed with washing buffer SSC (WB1) for 5 minutes. Slides were washed in deionized water for 1 min, dehydrated in a series of ethanol 70%, 90%, and 100% ethanol for 1 min each and air dried.
ZytoLight SPEC NRG1雙色斷裂分離探針(PL140)係由標記了綠色螢光染料(ZyGreen)及橙色螢光染料(ZyOrange)的多核苷酸所組成。這些探針係靶向所感興趣的基因(例如VAPB、CADM1、CD44、SLC3A2、VTCN1、CDH1、CXADR、GTF2E2、CSMD1、PTN、ST14、AGRN、THBS1或PVALB)中所發生之映射至斷點的遠端與近端之序列。將10 μl的探針抽吸到各個預處理的樣本上,蓋上22 mm x 22 mm蓋玻片並密封。將載玻片置入75 °C之雜交儀中10分鐘以完成變性,隨後將載玻片轉移至恆濕箱中在37 °C下雜交過夜。ZytoLight SPEC NRG1 dual-color break separation probe (PL140) is composed of polynucleotides labeled with green fluorescent dye (ZyGreen) and orange fluorescent dye (ZyOrange). These probes target the remote location that maps to a breakpoint occurring in a gene of interest such as VAPB, CADM1, CD44, SLC3A2, VTCN1, CDH1, CXADR, GTF2E2, CSMD1, PTN, ST14, AGRN, THBS1, or PVALB. end and proximal sequences. 10 μl of probes were pipetted onto each pretreated sample, covered with a 22 mm x 22 mm coverslip and sealed. Place the slides in a hybridization instrument at 75°C for 10 minutes to complete denaturation, then transfer the slides to a humidified chamber and hybridize overnight at 37°C.
次日,將載玻片浸入37 °C之1 x Wash緩衝液A (WB2) 1至3分鐘以移除蓋玻片,且在37 °C下用洗滌緩衝液A再洗滌2 x 5分鐘。將載玻片在70%、90%及100%乙醇中各培育1分鐘且風乾。將25 μl的DAPI/DuraTect-溶液(MT7)抽吸到載玻片上,蓋上蓋玻片且在黑暗中培育15分鐘。接著以螢光顯微鏡使用合適的ZyGreen (激發波503 nm/發射波528 nm)及ZyOrange (激發波580 nm/發射波599 nm)濾鏡來評估樣本螢光。基於綠色及橙色訊號之重疊或未重疊外觀來說明結果。 實例 11 : 免疫細胞化學染色法 (ICC) The next day, slides were immersed in 1 x Wash Buffer A (WB2) at 37°C for 1 to 3 minutes to remove coverslips, and washed an additional 2 x 5 minutes at 37°C with Wash Buffer A. Slides were incubated in 70%, 90% and 100% ethanol for 1 minute each and air dried. 25 μl of DAPI/DuraTect-solution (MT7) was pipetted onto the slide, covered with a coverslip and incubated for 15 minutes in the dark. Sample fluorescence was then evaluated with a fluorescence microscope using the appropriate ZyGreen (excitation 503 nm/emission 528 nm) and ZyOrange (excitation 580 nm/emission 599 nm) filters. Results are illustrated based on the overlapping or non-overlapping appearance of the green and orange signals. Example 11 : Immunocytochemical staining (ICC)
對獲自血漿或血液或其他液態檢體樣本上之循環腫瘤細胞進行免疫細胞化學染色法,以檢測NRG1融合蛋白的表現。例如,循環腫瘤細胞係使用CellSearch® (Menarini silicon biosystems, 義大利)或ISET® (Rarecells Diagnostics, 巴黎,法國)系統而純化自血漿、血清等,並固定在載玻片上。Perform immunocytochemical staining on circulating tumor cells obtained from plasma or blood or other liquid samples to detect the expression of NRG1 fusion protein. For example, circulating tumor cell lines are purified from plasma, serum, etc. using the CellSearch® (Menarini silicon biosystems, Italy) or ISET® (Rarecells Diagnostics, Paris, France) system, and fixed on glass slides.
簡言之,將從全血中濃縮的CTC封固在獨立的載玻片上(Frithiof, Henrik等人,OncoTargets and therapy 第9卷, 7095-7103, 2016年11月16日)。將含有CTC的溶液(~900 μL)轉移至1.5 mL Eppendorf管且置於磁性托盤中。培育10分鐘之後,移除未附著的溶劑。使細胞重新懸浮於10 μL的1倍磷酸鹽緩衝鹽水且封固在超凍載玻片(ThermoScientific, 德國)且在37°C下培育30分鐘。將載玻片浸入100%甲醇中5分鐘完成固定。將樣本儲存在−20°C直至進一步評估。Briefly, CTCs concentrated from whole blood were mounted on separate glass slides (Frithiof, Henrik et al., OncoTargets and therapy Vol. 9, 7095-7103, Nov. 16, 2016). The CTC-containing solution (-900 μL) was transferred to a 1.5 mL Eppendorf tube and placed in a magnetic tray. After 10 minutes of incubation, unattached solvent was removed. Cells were resuspended in 10 μL of 1X phosphate buffered saline and mounted on ultra-frozen slides (ThermoScientific, Germany) and incubated at 37°C for 30 minutes. Fix the slides by immersing them in 100% methanol for 5 min. Samples were stored at −20°C until further evaluation.
在固定之後,使用合適的免疫染色系統(例如Ventana Discovery自動免疫染色系統, Ventana Medical Systems, Tucson, AZ, USA)來進行載玻片上的蛋白質檢測。簡言之,將封固在超凍載玻片上的細胞在pH 8.0之EDTA基緩衝液中預處理。向本揭露之NRG1融合物施用兩種一級抗體。例如,抗NRG1抗體,其靶向位於融合接合處(諸如NRG1之EGFR結構域)C端側位置處的區域,以及另一種一級抗體,其靶向位於融合接合處N端位置處的CD44表位或結構域。用合適的稀釋劑進行抗體稀釋。使用合適的二級抗體及兩種不同螢光檢測系統來檢測該等抗體。在螢光顯微鏡中使用不同通道將免疫染色可視化。基於兩種螢光訊號之重疊或未重疊外觀來說明結果。 實例 12 :治療實驗方案 Following fixation, protein detection on slides is performed using an appropriate immunostaining system (eg, Ventana Discovery Automated Immunostaining System, Ventana Medical Systems, Tucson, AZ, USA). Briefly, cells mounted on ultra-frozen slides were pretreated in EDTA-based buffer at pH 8.0. Two primary antibodies were administered to the NRG1 fusions of the present disclosure. For example, an anti-NRG1 antibody targeting a region at a position C-terminal to the fusion junction such as the EGFR domain of NRG1, and another primary antibody targeting a CD44 epitope at a position N-terminal to the fusion junction or domain. Antibody dilution is performed with an appropriate diluent. These antibodies were detected using appropriate secondary antibodies and two different fluorescent detection systems. Immunostaining was visualized using different channels in a fluorescent microscope. Results are illustrated based on the overlapping or non-overlapping appearance of the two fluorescent signals. Example 12 : Treatment Experimental Protocol
在經診斷帶有以下融合物中之各者的患者中用澤妥珠單抗進行治療:VAPB-NRG1、CADM1-NRG1、CD44 -NRG1、SLC3A2-NRG1、VTCN1-NRG1、CDH1-NRG1、CXADR-NRG1、GTF2E2-NRG1、CSMD1-NRG1、PTN-NRG1、ST14-NRG1、THBS1-NRG1、AGRN-NRG1、PVALB-NRG1、APP-NRG1、WRN-NRG1、DAAM1-NRG1、ASPH-NRG1、NOTCH2-NRG1、CD74-NRG1、SLC4A4-NRG1、SDC4-NRG1、ZFAT-NRG1及DSCAML1-NRG1,其治療如下。Treatment with zetuzumab in patients diagnosed with each of the following fusions: VAPB-NRG1, CADM1-NRG1, CD44-NRG1, SLC3A2-NRG1, VTCN1-NRG1, CDH1-NRG1, CXADR- NRG1, GTF2E2-NRG1, CSMD1-NRG1, PTN-NRG1, ST14-NRG1, THBS1-NRG1, AGRN-NRG1, PVALB-NRG1, APP-NRG1, WRN-NRG1, DAAM1-NRG1, ASPH-NRG1, NOTCH2-NRG1, CD74-NRG1, SLC4A4-NRG1, SDC4-NRG1, ZFAT-NRG1, and DSCAML1-NRG1, were treated as follows.
按照雙週計劃,第一次輸注750毫克的澤妥珠單抗,用時4小時,隨後每隔一周輸注一次,用時2小時,以4周為一個週期。此外,還包括管理IRR(輸液相關反應)的預備藥物,其中包括所有輸液的解熱劑及抗組織胺。在第1天第1週期用藥之前還包括了皮質類固醇;此後,根據調查員的決定進行管理以管理IRR。 實例 13 :源自患者的 VAPB-NRG1 融合數據 According to the biweekly plan, the first infusion of 750 mg of zetuzumab lasted 4 hours, followed by infusions every other week for 2 hours, with a cycle of 4 weeks. Also included are prep medications for the management of IRR (Infusion-Related Reactions), which include antipyretics and antihistamines for all infusions. Corticosteroids were also included prior to Cycle 1 dosing on Day 1; thereafter, administered at the investigator's discretion to manage the IRR. Example 13 : Patient-derived VAPB-NRG1 fusion data
將獲自經診斷患有肺腺癌之68歲男性病患之右體壁胸膜所得到之具60%腫瘤性細胞之經石蠟固定的腫瘤性材料樣本進行以下分子生物科技分析。A paraffin-fixed tumor material sample with 60% neoplastic cells obtained from the right body wall pleura of a 68-year-old male patient diagnosed with lung adenocarcinoma was subjected to the following molecular biotechnology analysis.
使用可檢測一組14 個基因(ALK、BRAF、EGFR、FGFR1、FGFR2、FGFR3、KRAS、MET、NRG1、NTRK1、NTRK2、NTRK3、RET、ROS1)的Archer FusionPlex TM套組。根據製造商的說明書製備建庫,且在Illumina MiSeq定序儀上進行次世代定序(NGS),並使用Archer Analysis 6.0站點完成樣本分析。 An Archer FusionPlex ™ panel that detects a panel of 14 genes (ALK, BRAF, EGFR, FGFR1, FGFR2, FGFR3, KRAS, MET, NRG1, NTRK1, NTRK2, NTRK3, RET, ROS1) was used. The library was prepared according to the manufacturer's instructions, and next-generation sequencing (NGS) was performed on an Illumina MiSeq sequencer, and sample analysis was completed using the Archer Analysis 6.0 site.
在此套組中檢驗是否為VAPB-NRG1融合物陽性的樣本。對此樣本進行定序以致使如SEQ ID NO: 3之序列的鑑定。其顯示出在VAPB的外顯子1與NRG1的外顯子2之間出現一個融合接合處。帶下劃線的序列(核苷酸1-43)係與編碼VAPB (NM_004738.4)之基因的外顯子1的一部分對應。核苷酸44-94係與編碼NRG1的基因的一部分對應。 CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT (SEQ ID NO: 3)。 實例 14 :源自患者的 CADM1-NRG1 融合數據 Samples positive for the VAPB-NRG1 fusion were tested in this panel. This sample was sequenced to allow the identification of the sequence as SEQ ID NO:3. It shows a fusion junction between exon 1 of VAPB and exon 2 of NRG1. The underlined sequence (nucleotides 1-43) corresponds to part of exon 1 of the gene encoding VAPB (NM_004738.4). Nucleotides 44-94 correspond to a portion of the gene encoding NRG1. CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT (SEQ ID NO: 3). Example 14 : Patient-derived CADM1-NRG1 fusion data
將獲自經診斷罹患轉移性肺腺癌NOS之男性病患之具腫瘤性材料的經福馬林固定石蠟塊進行以下分子生物科技分析。Formalin-fixed paraffin blocks of neoplastic material obtained from a male patient diagnosed with metastatic lung adenocarcinoma NOS were subjected to the following molecular biotechnological analyses.
在以批准的顯微解剖技術收取轉移性腦組織之後進行分子檢驗。在顯微鏡下檢查候選載玻片,並圈選含有腫瘤細胞(及在需要檢驗時圈選出單獨的正常細胞)的區域。實驗室技師使用解剖顯微鏡從標記區域收取標的組織並取出。所標記及取出的區域係於顯微解剖後的載玻片上重新進行顯微鏡檢查,並複查顯微解剖的妥適性。Molecular testing was performed after harvesting of metastatic brain tissue by approved microdissection techniques. Candidate slides are examined under a microscope and the areas containing tumor cells (and, if required, isolated normal cells) are circled. A laboratory technician harvests and removes the target tissue from the marked area using a dissecting microscope. The marked and removed areas were remicroscopically examined on microdissected slides, and the adequacy of the microdissection was rechecked.
對獲自該樣本之cDNA進行RNA-Seq (RNA定序)以致使如SEQ ID NO: 7之序列的鑑定。其顯示出存在了符合讀框的融合且在CADM1基因之外顯子7與NRG1基因之外顯子6之間出現接合處。下劃線序列,即核苷酸1-53,係對應CADM1基因(NM_ 001301045.1)之外顯子7的一部分。核苷酸54-85係對應編碼NRG1之基因(NM_001159999.3)的一部分。 AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGCTACATCTACATCCACCACTGGGACAAGCCAT (SEQ ID NO: 7)。 RNA-Seq (RNA sequencing) was performed on the cDNA obtained from this sample to allow the identification of the sequence as SEQ ID NO:7. It shows the presence of an in-frame fusion with a junction between exon 7 of the CADM1 gene and exon 6 of the NRG1 gene. The underlined sequence, namely nucleotides 1-53, corresponds to a part of exon 7 of CADM1 gene (NM_001301045.1). Nucleotides 54-85 correspond to part of the gene encoding NRG1 (NM_001159999.3). AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGCTACATCTACATCCACCACTGGGACAAGCCAT (SEQ ID NO: 7).
使用Illumina NextSeq平台,基於次世代定序(NGS)分析將從經福馬林固定及石蠟包埋的腫瘤樣本分離出的基因體DNA進行腫瘤突變負荷分析。僅使用先前沒有關於生殖細胞改變的報告之誤義突變來計算腫瘤突變負荷。高突變負荷是免疫療法反應的一個潛在指標(Hellman等人,NEJM, 2018; Le等人,NEJM, 2015; Rizvi等人,Science, 2015; Rosenberg等人,Lancet 2016; Snyder等人,NEJM, 2014)。NSCLC中的截止點係基於一個大型第3期臨床試驗,其顯示出每兆鹼基中的TMB為10或有更多突變之患者在用免疫檢查點抑制劑組合療法治療時,其無進展生存期係比用化療治療時更長(Hellman等人,NEJM, 2018)。高:每兆鹼基大於或等於10個突變;低:每兆鹼基小於或等於9個突變。該患者樣本的TMB評分為9。Genome DNA isolated from formalin-fixed and paraffin-embedded tumor samples was analyzed for tumor mutational burden based on next-generation sequencing (NGS) analysis using the Illumina NextSeq platform. Tumor mutational burden was calculated using only missense mutations that had no previous reports of germline alterations. High mutational burden is a potential indicator of response to immunotherapy (Hellman et al., NEJM, 2018; Le et al., NEJM, 2015; Rizvi et al., Science, 2015; Rosenberg et al., Lancet 2016; Snyder et al., NEJM, 2014 ). The cutoff point in NSCLC was based on a large phase 3 trial that showed improved progression-free survival in patients with mutations in TMB of 10 or more per megabase when treated with immune checkpoint inhibitor combination therapy The period is longer than when treated with chemotherapy (Hellman et al., NEJM, 2018). High: greater than or equal to 10 mutations per megabase; low: less than or equal to 9 mutations per megabase. The patient sample had a TMB score of 9.
以NGS對樣本進行微衛星不穩定性(MSI)分析,其顯示出穩定的MSI特徵。NGS (MSI-NGS)之MSI狀態係藉由直接分析一組592個已知基因中所定序的已知微衛星區域來測得。為了確定臨床閾值,將MSI-NGS結果與來自於用傳統PCR為基礎的方法所分析之超過2,000個匹配臨床病例的結果進行比較。使用如用於突變檢測之相同深度與頻率標準來檢測微衛星基座中之基因體變異體。此測定法中只會考慮到會造成串聯重複序列數量改變的插入及缺失。計數每個樣本中的微衛星變異總數量並分成三個類別:高度、模稜兩可、及穩定。低MSI的結果係於穩定類別中報告。Samples were analyzed for microsatellite instability (MSI) by NGS, which showed stable MSI signatures. MSI status in NGS (MSI-NGS) is determined by direct analysis of known microsatellite regions sequenced in a panel of 592 known genes. To determine clinical thresholds, MSI-NGS results were compared to results from more than 2,000 matched clinical cases analyzed with conventional PCR-based methods. Genomic variants in the microsatellite base were detected using the same depth and frequency criteria as used for mutation detection. Only insertions and deletions that change the number of tandem repeats are considered in this assay. The total number of microsatellite variants in each sample was counted and divided into three categories: high, ambiguous, and stable. Results with low MSI are reported in the stable category.
在樣本上所進行的免疫組織化學染色顯示出ALK及PD-L1的結果為陰性,MLH1、MSH2、MSH6、PMS2及PTEN的結果為陽性。Immunohistochemical staining performed on the samples showed negative results for ALK and PD-L1, and positive results for MLH1, MSH2, MSH6, PMS2, and PTEN.
使用NGS或RNA-Seq分析,在所獲得的樣本中未檢測到以下與癌症類型相關之生物標誌物的融合物或突變:NTRK1、NTRK2、NTRK3、KRAS、ALK、DOR2、ErbB2、ErbB3、ErbB4、PIK3CA、TP53、MET、PD-L1、RET、ROS1、STKI1、TP53。 實例 15 :源自患者的 CD44-NRG1 融合數據 No fusions or mutations of the following cancer type-associated biomarkers were detected in the obtained samples using NGS or RNA-Seq analysis: NTRK1, NTRK2, NTRK3, KRAS, ALK, DOR2, ErbB2, ErbB3, ErbB4, PIK3CA, TP53, MET, PD-L1, RET, ROS1, STKI1, TP53. Example 15 : Patient-derived CD44-NRG1 fusion data
將獲自46歲男性病患肝臟的腫瘤性材料生物樣本進行以下分子生物科技分析。The biological sample of neoplastic material obtained from the liver of a 46-year-old male patient was subjected to the following molecular biotechnology analysis.
使用Archer FusionPlex TMCustom Solid套組來檢測腫瘤樣本中的基因融合物,該套組利用了Anchored Multiplex PCR (AMP TM)技術靶向先前報告過的染色體重排中的62個外顯子。對這62個標的外顯子設計出單向基因特異性引子(GSP)。將GPS與轉接子特異性引子組合來擴增已知的及新穎的融合轉錄物。在Illumina MiSeq定序儀上對富集的擴增子進行定序。 Gene fusions in tumor samples were detected using the Archer FusionPlex TM Custom Solid kit, which utilizes Anchored Multiplex PCR (AMP TM ) technology to target 62 exons in a previously reported chromosomal rearrangement. Unidirectional gene-specific primers (GSP) were designed for these 62 target exons. GPS was combined with adapter-specific primers to amplify known and novel fusion transcripts. The enriched amplicons were sequenced on an Illumina MiSeq sequencer.
於此研究組中之樣本係經CD44-NRG1融合物檢驗為陽性。對該樣本進行定序以致使如SEQ ID NO: 11之序列的鑑定。其顯示出存在了符合讀框的融合且在CD44之外顯子5與NRG1之外顯子2之間出現融合接合處。下劃線序列,即核苷酸1-52,係對應CD44基因(NM_000610)之外顯子5的一部分。核苷酸53-110係對應編碼NRG1之基因之外顯子2的一部分。 GACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCACCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT (SEQ ID NO: 11)。 Samples in this study group tested positive for the CD44-NRG1 fusion. The sample was sequenced to allow identification of the sequence as SEQ ID NO: 11. It shows that there is an in-frame fusion with a fusion junction between exon 5 of CD44 and exon 2 of NRG1. The underlined sequence, nucleotides 1-52, corresponds to a part of exon 5 of the CD44 gene (NM_000610). Nucleotides 53-110 correspond to part of exon 2 of the gene encoding NRG1. GACGAAGACAGTCCCTGGATCACCGACAGCAGCAGACAGAATCCCTGCTACCACCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT (SEQ ID NO: 11).
該樣本在此含有總數62個外顯子標的之臨床驗證套組中係檢驗為陰性,該等標的為:ALK之外顯子20(NM_004304)、ERG之外顯子2,4,8 (NM_004449)、EWSR1之外顯子6,7,8,10,11 (NM_005243)、FGFR3之外顯子17,18 (NM_00142)、FGFR2之外顯子17 (NM_00141.4)、FOXO1之外顯子2 (NM_002015)、NTRK3之外顯子11,12,14,15 (NM_002530)、RET之外顯子12 (NM_020975)、ROS之外顯子34,35 (NM_002944)、SS18之外顯子10 (NM_001007559)、STAT6之外顯子16,17,19 (NM_001178078)、TAF15之外顯子6 (NM_139215)、TFE3之外顯子6 (NM_006521)、BRAF之外顯子7,9,11 (NM_004333)、NTRK1之外顯子12 (NM_002529)、FUS之外顯子5,8 (NM_004960)、CIC之外顯子20 (NM_015125)。 實例 16 :源自患者的 SLC3A2 -NRG1 融合數據 The sample tested negative in this clinical validation panel containing a total of 62 exon targets: ALK exon 20 (NM_004304), ERG exons 2,4,8 (NM_004449 ), EWSR1 exon 6,7,8,10,11 (NM_005243), FGFR3 exon 17,18 (NM_00142), FGFR2 exon 17 (NM_00141.4), FOXO1 exon 2 (NM_002015), NTRK3 exon 11,12,14,15 (NM_002530), RET exon 12 (NM_020975), ROS exon 34,35 (NM_002944), SS18 exon 10 (NM_001007559 ), STAT6 exon 16,17,19 (NM_001178078), TAF15 exon 6 (NM_139215), TFE3 exon 6 (NM_006521), BRAF exon 7,9,11 (NM_004333), NTRK1 exon 12 (NM_002529), FUS exon 5,8 (NM_004960), CIC exon 20 (NM_015125). Example 16 : Patient-derived SLC3A2 -NRG1 fusion data
將獲自經診斷患有肺腺癌之女性病患之左下肺葉切除所得到之顯示出原發性腺癌組織的腫瘤性材料進行以下分子生物科技分析。Neoplastic material showing primary adenocarcinoma tissue obtained from left lower lobectomy of a female patient diagnosed with lung adenocarcinoma was subjected to the following molecular biotechnological analysis.
在藉由以下標的基因的定序收取腫瘤性細胞之後,使用Archer Lung Fusion Plex, Illumina MiniSEQ進行分子檢驗:ALK、BRAF、FGFR1、FGFR2、FGFR3、KRAS、MET、NRG1、NTRK1、NTRK2、NTRK3、RET、ROS1。參考基因體為GRCh37/hg19。Molecular assays were performed using Archer Lung Fusion Plex, Illumina MiniSEQ after harvesting neoplastic cells by sequencing of the following target genes: ALK, BRAF, FGFR1, FGFR2, FGFR3, KRAS, MET, NRG1, NTRK1, NTRK2, NTRK3, RET , ROS1. The reference gene body is GRCh37/hg19.
對獲自該樣本之cDNA進行RNA-Seq (RNA定序)以致使如SEQ ID NO: 15之序列的鑑定。其顯示出存在了出現在SLC3A2基因之外顯子1與NRG1基因之外顯子5之間的符合讀框融合接合處。下劃線序列,即核苷酸1-53,係對應編碼SLC3A2之基因((NM_001013251)之外顯子1的一部分。其餘35個核苷酸,即核苷酸54-88,係對應NRG1基因之外顯子5的一部分。 CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGGCATCTACATCTACATCCACCACTGGGACAAGCCAT (SEQ ID NO: 15)。 實例 17 :源自患者的 VTCN1-NRG1 融合數據 RNA-Seq (RNA sequencing) was performed on the cDNA obtained from this sample to allow the identification of the sequence as SEQ ID NO: 15. It shows the presence of an in-frame fusion junction occurring between exon 1 of the SLC3A2 gene and exon 5 of the NRG1 gene. The underlined sequence, i.e. nucleotides 1-53, corresponds to a part of exon 1 of the gene encoding SLC3A2 ((NM_001013251). The remaining 35 nucleotides, i.e. nucleotides 54-88, correspond to the outside of the NRG1 gene Part of exon 5. CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG CATCTACATCTACATCCACCACTGGGACAAGCCAT (SEQ ID NO: 15). Example 17 : Patient-derived VTCN1-NRG1 fusion data
將獲自經診斷罹患肺腺癌之病患右側胸骨旁部位之包含有腫瘤性細胞的選定組織塊之轉移性腫瘤性檢體材料進行以下分子生物科技分析。Metastatic neoplastic specimen material obtained from the right parasternal site of a patient diagnosed with lung adenocarcinoma was subjected to the following molecular biotechnological analysis of selected tissue blocks containing neoplastic cells.
在藉由定序收取腫瘤性細胞及TNA萃取之後,對所獲得的材料使用Archer Fusion Plex RNA CTL_V6 (Archerdx)套組進行分子檢驗,所述定序係經由Nextseq雙端(pair end)使用流通槽N°HFFJFAFX2以及Sophia DDM版本5.7.4—b127-31a209c、流程(Pipeline) ID ILL1AM1R2 / v5.5.24.1 / GEN1GN1FSQ2來實現。After harvesting of neoplastic cells by sequencing and TNA extraction, molecular assays were performed on the obtained material using the Archer Fusion Plex RNA CTL_V6 (Archerdx) kit using flow cells via Nextseq pair end N°HFFJFAFX2 and Sophia DDM Version 5.7.4—b127-31a209c, Process (Pipeline) ID ILL1AM1R2 / v5.5.24.1 / GEN1GN1FSQ2 to achieve.
所指的套組係可檢測以下目標區域(及括號內的外顯子)中的突變:AKT1 (3)、ALK (22, 23, 24, 25)、AXL (5, 11, 15, 17)、BRAF (11, 15)、CTNNB1 (3)、CYSLTR2 (6)、DDR2 (17)、EGFR (18, 19, 20, 21)、ERBB2 (20)、FGFR1 (2, 8, 9, 10, 17)、FGFR2 (2, 5, 7, 8, 9, 10)、FGFR3 (3, 5, 8, 9, 10)、GNA11 (4,5)、GNAS (8, 9)、GNAQ (4,5)、HRAS (2, 3, 4)、IDH1 (4)、IDH2 (4)、KEAP1 (全部)、KIT (11, 13, 17)、KRAS (2, 3, 4)、MAP2K1 (2, 3)、MET (13 à 19)、NRAS (2, 3, 4)、PIK3CA (9, 20)、POLE (9 à 14)、RAF1 (4, 5, 6, 7, 9, 10, 11, 12 )、RET (11, 13, 14, 15, 16)、ROS1 (38)、STK11 (全部)、TP53 (全部)。檢測以下轉錄融合物:ALK、AXL、BRAF、CCND1、EGFR、FGFR1、FGFR2、FGFR3、MAP2K1、MET、NRG1、NTRK1、NTRK2、NTRK3、PPARG、RAF1、RET、ROS1。並檢測以下表現:ALK、CCND1、EGFR、ERBB2、FGFR1、FGFR2、FGFR3、MET、NTRK1、NTRK2、NTRK3、RET、ROS1。Refers to panels that detect mutations in the following regions of interest (and exons in brackets): AKT1 (3), ALK (22, 23, 24, 25), AXL (5, 11, 15, 17) , BRAF (11, 15), CTNNB1 (3), CYSLTR2 (6), DDR2 (17), EGFR (18, 19, 20, 21), ERBB2 (20), FGFR1 (2, 8, 9, 10, 17 ), FGFR2 (2, 5, 7, 8, 9, 10), FGFR3 (3, 5, 8, 9, 10), GNA11 (4,5), GNAS (8, 9), GNAQ (4,5) , HRAS (2, 3, 4), IDH1 (4), IDH2 (4), KEAP1 (all), KIT (11, 13, 17), KRAS (2, 3, 4), MAP2K1 (2, 3), MET (13 à 19), NRAS (2, 3, 4), PIK3CA (9, 20), POLE (9 à 14), RAF1 (4, 5, 6, 7, 9, 10, 11, 12 ), RET (11, 13, 14, 15, 16), ROS1 (38), STK11 (all), TP53 (all). The following transcriptional fusions were detected: ALK, AXL, BRAF, CCND1, EGFR, FGFR1, FGFR2, FGFR3, MAP2K1, MET, NRG1, NTRK1, NTRK2, NTRK3, PPARG, RAF1, RET, ROS1. And detect the following manifestations: ALK, CCND1, EGFR, ERBB2, FGFR1, FGFR2, FGFR3, MET, NTRK1, NTRK2, NTRK3, RET, ROS1.
在EGFR基因的外顯子18-21未觀察到突變;在KRAS基因的外顯子2及3未檢測到突變;在BRAF基因的外顯子11及15未檢測到突變;在ERBB2基因的外顯子20未檢測到突變;在PIK3CA基因的外顯子9及20未檢測到突變;在MET基因的外顯子14未檢測到突變;在IDH1基因的外顯子4檢測到突變395G>T,導致R132L胺基酸改變且在VTCN1的外顯子2與NRG1的外顯子2之間檢測到轉錄融合物。No mutations were observed in exons 18-21 of the EGFR gene; no mutations were detected in exons 2 and 3 of the KRAS gene; no mutations were detected in exons 11 and 15 of the BRAF gene; No mutation detected in exon 20; no mutation detected in exon 9 and 20 of PIK3CA gene; no mutation detected in exon 14 of MET gene; mutation 395G>T detected in exon 4 of IDH1 gene , resulting in a R132L amino acid change and a transcriptional fusion was detected between exon 2 of VTCN1 and exon 2 of NRG1.
未檢測到涉及ALK、ROS1、RET、NTRK1、NTRK2、NTRK3、FGFR1、FGFR2及FGFR3基因的轉錄融合物。Transcriptional fusions involving the ALK, ROS1, RET, NTRK1, NTRK2, NTRK3, FGFR1, FGFR2, and FGFR3 genes were not detected.
未檢測到ERBB2或MET過度表現。No ERBB2 or MET overexpression was detected.
對獲自該樣本之cDNA進行RNA-Seq (RNA定序)以致使如SEQ ID NO: 166之序列的鑑定。其顯示出存在了出現在VTCN1基因之外顯子2與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-65,係對應編碼VTCN1之基因(存取碼NM_024626.4)之外顯子2的一部分。其餘28個核苷酸,即核苷酸66-93,係對應NRG1之外顯子2的一部分。 CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAGCCTTGCCTCCCCGATTGAAAGAGATGAA (SEQ ID NO: 166)。 實例 18 :源自患者的 CDH1-NRG1 融合數據 RNA-Seq (RNA sequencing) was performed on the cDNA obtained from this sample to allow the identification of the sequence as SEQ ID NO: 166. It shows the presence of an in-frame fusion junction occurring between exon 2 of the VTCN1 gene and exon 2 of the NRG1 gene. The underlined sequence, nucleotides 1-65, corresponds to a part of exon 2 of the gene encoding VTCN1 (access code NM_024626.4). The remaining 28 nucleotides, nucleotides 66-93, correspond to part of exon 2 of NRG1. CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAGCCTTGCCTCCCCGATTGAAAGAGATGAA (SEQ ID NO: 166). Example 18 : Patient-derived CDH1-NRG1 fusion data
將獲自經診斷罹患胰臟腺癌之女性病患的腫瘤性材料進行以下分子生物科技分析。以Maxwell (Promega) FFPE LEV套組進行自動RNA萃取。Tumor material obtained from female patients diagnosed with pancreatic adenocarcinoma was subjected to the following molecular biotechnological analyses. Automated RNA extraction was performed with the Maxwell (Promega) FFPE LEV kit.
使用Archer Fusion Plex NGS進行分子檢驗。在含有161個基因之Oncomine Comprehensive Assay V3 (OCAV3)套組上進行NGS (S5XL – Life Technologies),其中81個被認為是具熱突變點基因(hotspot gene)且48個為全長基因,以及47個拷貝數基因。對獲自該樣本之cDNA進行RNA-Seq (RNA定序)以致使如SEQ ID NO: 186之序列的鑑定。Molecular assays were performed using the Archer Fusion Plex NGS. NGS (S5XL – Life Technologies) was performed on the Oncomine Comprehensive Assay V3 (OCAV3) panel containing 161 genes, of which 81 were considered hotspot genes and 48 were full-length genes, and 47 copy number genes. RNA-Seq (RNA sequencing) was performed on the cDNA obtained from this sample to allow the identification of the sequence as SEQ ID NO: 186.
其顯示出存在了出現在CDH1基因之外顯子11與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-119,係對應編碼CDH1之基因(NM_001317185.2)之外顯子11的一部分。其餘30個核苷酸,即核苷酸120-149,係對應NRG1基因之外顯子2的一部分。此外,NGS顯示出在FGFR3基因之外顯子10存在有突變1172C>T,引起了胺基酸改變Ala391Val。 CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATGCCTTGCCTCCCCGATTGAAAGAGATGAAA (SEQ ID NO: 186)。 CCTTGCCTCCCCGATTGAAAGAGATGAAAA 實例 19 :源自患者的 CXADR-NRG1 融合數據 It shows the presence of an in-frame fusion junction occurring between exon 11 of the CDH1 gene and exon 2 of the NRG1 gene. The underlined sequence, nucleotides 1-119, corresponds to a part of exon 11 of the gene encoding CDH1 (NM_001317185.2). The remaining 30 nucleotides, namely nucleotides 120-149, correspond to a part of exon 2 of the NRG1 gene. In addition, NGS revealed a mutation 1172C>T in exon 10 of the FGFR3 gene, causing an amino acid change of Ala391Val. CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATGCCTTGCCTCCCCGATTGAAAGAGATGAAA (SEQ ID NO: 186). CCTTGCCTCCCCGATTGAAAGAGATGAAAA Example 19 : Patient-derived CXADR-NRG1 fusion data
將獲自經診斷罹患大腸癌之男性病患的腫瘤性材料進行以下分子生物科技分析。所提交組織樣本的腫瘤區域或從周圍組織顯微解剖的載玻片,使用標準方法分離總RNA。對分離出的RNA進行逆轉錄,且建構雙股DNA。對該樣本進行通用標籤及標記碼處理,並使用基因特異性引子及通用標記特異性引子(Archer FusionPlex化學公司)進行半巢式多重PCR。使用Illumina NextSeq v2化學品來製備產物及定序。使用Archer分析軟體v4.1來分析序列資料。分析靈敏性為大約5 RNA分子編碼該融合物並取決於所出現的特異性融合事件。融合物係與所報告的參考轉錄本比較。Tumor material obtained from a male patient diagnosed with colorectal cancer was subjected to the following molecular biotechnological analysis. Total RNA was isolated from tumor regions of submitted tissue samples or microdissected slides from surrounding tissue using standard methods. The isolated RNA is reverse transcribed and double-stranded DNA is constructed. The sample was processed with universal tags and marker codes, and semi-nested multiplex PCR was performed using gene-specific primers and universal marker-specific primers (Archer FusionPlex Chemical Company). Products were prepared and sequenced using Illumina NextSeq v2 chemistry. Sequence data were analyzed using Archer analysis software v4.1. The assay sensitivity is approximately 5 RNA molecules encoding the fusion and depends on the specific fusion event that occurs. Fusions were compared to reported reference transcripts.
分子分析顯示出存在了出現在CXADR基因之外顯子1與NRG1之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-43,係對應編碼CXADR之基因(NM_001207063.2)之外顯子1的一部分。其餘58個核苷酸,即核苷酸44-101,係對應NRG1基因之外顯子2的一部分。 ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT (SEQ ID NO: 217)。 實例 20 :源自患者的 GTF2E2-NRG1 融合數據 Molecular analysis revealed the presence of an in-frame fusion junction between exon 1 of the CXADR gene and exon 2 of NRG1. The underlined sequence, nucleotides 1-43, corresponds to a part of exon 1 of the gene encoding CXADR (NM_001207063.2). The remaining 58 nucleotides, nucleotides 44-101, correspond to a part of exon 2 of the NRG1 gene. ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT (SEQ ID NO: 217). Example 20 : Patient-derived GTF2E2-NRG1 fusion data
將獲自經診斷罹患轉移性乳腺癌NOS之女性病患之具腫瘤性材料的經福馬林固定石蠟塊進行以下分子生物科技分析。Formalin-fixed paraffin blocks of neoplastic material obtained from female patients diagnosed with metastatic breast cancer NOS were subjected to the following molecular biotechnological analyses.
在以批准的顯微解剖技術收取組織之後進行分子檢驗。在顯微鏡下檢查候選載玻片,並圈選含有腫瘤細胞(及在需要檢驗時圈選出單獨的正常細胞)的區域。實驗室技師使用解剖顯微鏡從標記區域收取標的組織並取出。所標記及取出的區域係於顯微解剖後的載玻片上重新進行顯微鏡檢查,並複查顯微解剖的妥適性。Molecular testing was performed after harvesting the tissue with approved microdissection techniques. Candidate slides are examined under a microscope and the areas containing tumor cells (and, if required, isolated normal cells) are circled. A laboratory technician harvests and removes the target tissue from the marked area using a dissecting microscope. The marked and removed areas were remicroscopically examined on microdissected slides, and the adequacy of the microdissection was rechecked.
基因融合物及變異轉錄的偵測係於從經福馬林固定及石蠟包埋的腫瘤樣本分離出的mRNA上進行,其使用Agilent SureSelectXT的低起始量樣本庫製備化學品(Low Input Library prep chemistry),針對FFPE樣本優化,並結合SureSelect Human All Exon V7捕獲探針套組(bait panel) (48.2 Mb)與Illumina NovaSeq。此測定法係設計用來檢測出現在基因內已知的及新的斷點處之融合。此分析致使如SEQ ID NO: 233之序列的鑑定。其顯示出存在了符合讀框的融合且在GTF2E2基因之外顯子2與NRG1基因之外顯子2之間出現接合處。下劃線序列,即核苷酸1-141,係對應GTF2E2基因(NM_002095.6)之外顯子2的一部分。核苷酸142-268係對應編碼NRG1之基因(NM_001159999.3)的一部分。 GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGA (SEQ ID NO: 233)。 Detection of gene fusions and variant transcripts was performed on mRNA isolated from formalin-fixed and paraffin-embedded tumor samples using Agilent SureSelectXT Low Input Library prep chemistry ), optimized for FFPE samples, combined with the SureSelect Human All Exon V7 capture probe set (bait panel) (48.2 Mb) and Illumina NovaSeq. This assay is designed to detect fusions that occur at known and novel breakpoints within the gene. This analysis led to the identification of a sequence such as SEQ ID NO: 233. It shows the presence of an in-frame fusion with a junction between exon 2 of the GTF2E2 gene and exon 2 of the NRG1 gene. The underlined sequence, nucleotides 1-141, corresponds to a part of exon 2 of the GTF2E2 gene (NM_002095.6). Nucleotides 142-268 correspond to part of the gene encoding NRG1 (NM_001159999.3). GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATTCTTCTGAGTCATCATCATCATCGTCAAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCT GAATACTCCTCTCTCCAGATTCAAGTGGTTCAAGAATGGGAATGA (SEQ ID NO: 233).
使用Illumina NextSeq平台,基於次世代定序(NGS)分析將從經福馬林固定及石蠟包埋的腫瘤樣本分離出的基因體DNA進行腫瘤突變負荷分析。僅使用先前沒有關於生殖細胞變異的報告之誤義突變來計算腫瘤突變負荷。使用以下所定義之腫瘤突變負荷閾值水平並建立截止點: ● 高:每兆鹼基大於或等於17個突變(≥17個突變/Mb)。 ● 中:每兆鹼基大於或等於7個突變但小於17個突變(≥7個且<17個突變/Mb)。 ● 低:每兆鹼基小於或等於6個突變(≤6個突變/Mb)。患者樣本指出TMB評分為9。 Genome DNA isolated from formalin-fixed and paraffin-embedded tumor samples was analyzed for tumor mutational burden based on next-generation sequencing (NGS) analysis using the Illumina NextSeq platform. Tumor mutational burden was calculated using only missense mutations that had not been previously reported as germline variants. Threshold levels of tumor mutational burden defined below were used and cut-off points were established: ● High: greater than or equal to 17 mutations per megabase (≥17 mutations/Mb). • Medium: greater than or equal to 7 but less than 17 mutations per megabase (≥7 and <17 mutations/Mb). ● Low: Less than or equal to 6 mutations per megabase (≤6 mutations/Mb). The patient sample indicated a TMB score of 9.
藉由NGS使用微衛星不穩定性(MSI)分析,未檢測到微衛星不穩定性。No microsatellite instability was detected by NGS using microsatellite instability (MSI) analysis.
使用NGS分析,在GATA3的外顯子6檢測到框移突變(c.1305_1327del23,蛋白變異P436fs)及失能變異體NTRK2 (蛋白變異p.Q172 c.514C>T, (NM_006180.4)。 實例 21 :源自患者的 CSMD1-NRG1 融合數據 Using NGS analysis, a frameshift mutation was detected in exon 6 of GATA3 (c.1305_1327del23, protein variant P436fs) and a disabling variant NTRK2 (protein variant p.Q172 c.514C>T, (NM_006180.4). Example 21 : Patient-derived CSMD1-NRG1 fusion data
將獲自經診斷罹患大腸直腸癌之男性病患的腫瘤性材料進行以下分子生物科技分析。Tumor material obtained from male patients diagnosed with colorectal cancer was subjected to the following molecular biotechnological analyses.
使用由596個帶有單核苷酸變異、插入與刪除(indel)、易位之基因所組成的定製腫瘤學檢驗套組進行雜交捕獲次世代定序。使用IDT xGen Exome研究套組v1.0雜交探針使得可從重排基因中無偏地檢測所表現的融合轉錄本而進行全基因體轉錄本RNA定序。Hybrid capture next-generation sequencing was performed using a custom oncology panel consisting of 596 genes with single nucleotide variations, insertions and deletions (indels), and translocations. Whole-genome transcript RNA sequencing was performed using the IDT xGen Exome Research Kit v1.0 Hybridization Probes to allow unbiased detection of expressed fusion transcripts from rearranged genes.
其顯示出存在了出現在CSMD1基因之外顯子23與NRG1之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-88,係對應編碼CSMD1之基因(NM_033225.6)之外顯子23的一部分。其餘的核苷酸,即核苷酸89-150,係對應NRG1基因之外顯子6的一部分。相關的患者報告提及阿法替尼(Afatinib)及厄洛替尼(Erlotinib)為經FDA批准與此CSMD1-NRG1融合物相關的療法,同時提及GSK2849330作為調查性研究。此外,提到了與CSMD1-NRG1融合物相關的臨床試驗TAPUR (NCT02693535)。It shows the presence of an in-frame fusion junction occurring between exon 23 of the CSMD1 gene and exon 6 of the NRG1 gene. The underlined sequence, nucleotides 1-88, corresponds to a part of exon 23 of the gene encoding CSMD1 (NM_033225.6). The remaining nucleotides, nucleotides 89-150, correspond to a portion of exon 6 of the NRG1 gene. Related patient reports mention afatinib and erlotinib as FDA-approved therapies associated with this CSMD1-NRG1 fusion, while mentioning GSK2849330 as an investigational study. In addition, the clinical trial TAPUR (NCT02693535) related to the CSMD1-NRG1 fusion is mentioned.
在患者的材料中未鑑定出致病性生殖細胞突變,在KRAS或NRAS中也未鑑定出致病性單一核苷酸同質多形性。在患者的材料中鑑定出以下四種體細胞基因體變異體:在TP53中的Y220C誤義突變、在APC中的R564*終止突變( stop gain mutation)、在APC中的N1455fs框移突變、及在BRAF中的G469A誤義突變。確定MSI狀態為穩定的且腫瘤突變負荷為4.6m/MB。No pathogenic germline mutations were identified in patient material, nor were pathogenic single nucleotide polymorphisms identified in KRAS or NRAS. The following four somatic gene variants were identified in patient material: Y220C missense mutation in TP53, R564* stop gain mutation in APC, N1455fs frameshift mutation in APC, and G469A missense mutation in BRAF. The MSI status was determined to be stable and the tumor mutational burden was 4.6 m/MB.
鑑定出意義不明的變異體,包括:在APOB (NM_000384)中的c.3332+1G>A剪接區變異體、在GATA3 (NM_001002295)中的c.425C>T p.S142L誤義變異體、在LZTR1 (NM_006767)中的c.352C>T p.R118C誤義變異體、在CTC1 (NM_025099)中的c.227A>T p.H76L誤義變異體、在CYP1B1 (NM_000104)中的c.239G>A p.R80H誤義變異體、在RECQL4 (NM_004260)中的c.1099C>T p.R367W誤義變異體、在SOX2 (NM_003106)中的c.871A>G p.R291G誤義變異體、及在TCF7L2 (NM_001146274)中的c85_87del pE29del框內缺失。Variants of unknown significance were identified, including: c.3332+1G>A splice region variant in APOB (NM_000384), c.425C>T p.S142L missense variant in GATA3 (NM_001002295), c.352C>T p.R118C missense variant in LZTR1 (NM_006767), c.227A>T p.H76L missense variant in CTC1 (NM_025099), c.239G> in CYP1B1 (NM_000104) A p.R80H missense variant, c.1099C>T p.R367W missense variant in RECQL4 (NM_004260), c.871A>G p.R291G missense variant in SOX2 (NM_003106), and In-frame deletion of c85_87del pE29del in TCF7L2 (NM_001146274).
該患者先前的治療包括氟脲嘧啶、菊白葉酸、伊立替康(Irinotecan)、貝伐單抗(Bevacizumab)及奧沙利鉑(Oxaliplatin)。 ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT (SEQ ID NO: 255)。 實例 22 :源自患者的 PTN-NRG1 融合數據 The patient's previous treatment included fluorouracil, leucovorin, irinotecan, bevacizumab, and oxaliplatin. ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT (SEQ ID NO: 255). Example 22 : Patient-derived PTN-NRG1 fusion data
將獲自經診斷罹患轉移性乳房惡性腫瘤NOS之女性病患之具腫瘤性材料的經福馬林固定石蠟塊進行以下分子生物科技分析。Formalin-fixed paraffin blocks of neoplastic material obtained from female patients diagnosed with metastatic breast malignancy NOS were subjected to the following molecular biotechnological analyses.
在以批准的顯微解剖技術收取組織之後進行分子檢驗。在顯微鏡下檢查候選載玻片,並圈選含有腫瘤細胞(及在需要檢驗時圈選出單獨的正常細胞)的區域。實驗室技師使用解剖顯微鏡從標記區域收取標的組織並取出。所標記及取出的區域係於顯微解剖後的載玻片上重新進行顯微鏡檢查,並複查顯微解剖的妥適性。Molecular testing was performed after harvesting the tissue with approved microdissection techniques. Candidate slides are examined under a microscope and the areas containing tumor cells (and, if required, isolated normal cells) are circled. A laboratory technician harvests and removes the target tissue from the marked area using a dissecting microscope. The marked and removed areas were remicroscopically examined on microdissected slides, and the adequacy of the microdissection was rechecked.
基因融合物及變異轉錄的偵測係於從經福馬林固定及石蠟包埋的腫瘤樣本分離出的mRNA上進行,其使用Agilent SureSelectXT的低起始量樣本庫製備化學品(Low Input Library prep chemistry),針對FFPE樣本優化,並結合SureSelect Human All Exon V7捕獲探針套組(bait panel) (48.2 Mb)與Illumina NovaSeq。此測定法係設計用來檢測出現在基因內已知的及新的斷點處之融合。此分析致使如SEQ ID NO: 313之序列的鑑定。其顯示出存在了符合讀框的融合且在PTN基因之外顯子4與NRG1基因之外顯子2之間出現接合處。下劃線序列,即核苷酸1-102,係對應PTN基因(NM_001321386.2)之外顯子2的一部分。核苷酸103-205係對應編碼NRG1之基因(NM_001159999.3)的一部分。 CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATT (SEQ ID NO: 313)。 Detection of gene fusions and variant transcripts was performed on mRNA isolated from formalin-fixed and paraffin-embedded tumor samples using Agilent SureSelectXT Low Input Library prep chemistry ), optimized for FFPE samples, combined with the SureSelect Human All Exon V7 capture probe set (bait panel) (48.2 Mb) and Illumina NovaSeq. This assay is designed to detect fusions that occur at known and novel breakpoints within the gene. This analysis led to the identification of the sequence as SEQ ID NO: 313. It shows the presence of an in-frame fusion with a junction between exon 4 of the PTN gene and exon 2 of the NRG1 gene. The underlined sequence, nucleotides 1-102, corresponds to a part of exon 2 of the PTN gene (NM_001321386.2). Nucleotides 103-205 correspond to part of the gene encoding NRG1 (NM_001159999.3). CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAG CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATT (SEQ ID NO: 31 3).
使用Illumina NextSeq平台,基於次世代定序(NGS)分析將從經福馬林固定及石蠟包埋的腫瘤樣本分離出的基因體DNA進行腫瘤突變負荷分析。僅使用先前沒有關於生殖細胞變異的報告之誤義突變來計算腫瘤突變負荷。使用以下所定義之腫瘤突變負荷閾值水平並建立截止點: ● 高:每兆鹼基大於或等於17個突變(≥17個突變/Mb)。 ● 中:每兆鹼基大於或等於7個突變但小於17個突變(≥7個且<17個突變/Mb)。 ● 低:每兆鹼基小於或等於6個突變(≤6個突變/Mb)。患者樣本顯示出低TMB評分。 Genome DNA isolated from formalin-fixed and paraffin-embedded tumor samples was analyzed for tumor mutational burden based on next-generation sequencing (NGS) analysis using the Illumina NextSeq platform. Tumor mutational burden was calculated using only missense mutations that had not been previously reported as germline variants. Threshold levels of tumor mutational burden defined below were used and cut-off points were established: ● High: greater than or equal to 17 mutations per megabase (≥17 mutations/Mb). • Medium: greater than or equal to 7 but less than 17 mutations per megabase (≥7 and <17 mutations/Mb). ● Low: Less than or equal to 6 mutations per megabase (≤6 mutations/Mb). Patient samples showed low TMB scores.
藉由NGS使用微衛星不穩定性(MSI)分析,未檢測到微衛星不穩定性。No microsatellite instability was detected by NGS using microsatellite instability (MSI) analysis.
使用NGS分析,在TP53的外顯子10檢測到p.E336*, c.1006G>T突變(NM_000546.5)。在NTRK1/2/3、AKT1、BRCA1/2、ERBB2、ESR1、PIK3CA、PTEN中未檢測到變異,且對於PD-L1係得到陰性IHC結果(基於22c3及SP142)。 實例 23 :源自患者的 ST14-NRG1 融合數據 Using NGS analysis, the p.E336*, c.1006G>T mutation (NM_000546.5) was detected in exon 10 of TP53. No mutations were detected in NTRK1/2/3, AKT1, BRCA1/2, ERBB2, ESR1, PIK3CA, PTEN, and negative IHC results were obtained for the PD-L1 lineage (based on 22c3 and SP142). Example 23 : Patient-derived ST14-NRG1 fusion data
從經診斷出罹患非小細胞肺癌的患者所獲得之腫瘤性材料上進行的分子分析顯示出存在了出現在如SEQ ID NO: 330序列所包含之ST14基因之外顯子11與NRG1之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-95,係對應編碼ST14之基因(NM_021978.4)之外顯子11的一部分。其餘87個核苷酸,即核苷酸96-182,係對應NRG1基因之外顯子6的一部分。 CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACCCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT (SEQ ID NO: 330)。 實例 24 :源自患者的 THBS1-NRG1 融合數據 Molecular analysis of tumorous material obtained from patients diagnosed with non-small cell lung cancer revealed the presence of exon 11 and NRG1 of the ST14 gene contained in the sequence SEQ ID NO: 330 In-frame fusion junction between sub6. The underlined sequence, nucleotides 1-95, corresponds to a part of exon 11 of the gene encoding ST14 (NM_021978.4). The remaining 87 nucleotides, namely nucleotides 96-182, correspond to a part of exon 6 of the NRG1 gene. CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTCTCTACGACTCCAGTGACCCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT (SEQ ID NO: 330). Example 24 : Patient-derived THBS1-NRG1 fusion data
從經診斷出罹患胰臟腺癌的患者所獲得之具有腫瘤性材料之新鮮的、冷凍的、或經福馬林固定的石蠟塊中萃取腫瘤DNA,並進行以下分子生物技術分析。Tumor DNA was extracted from fresh, frozen, or formalin-fixed paraffin blocks of neoplastic material obtained from patients diagnosed with pancreatic adenocarcinoma and subjected to the following molecular biotechnological analyses.
調查萃取的DNA是否存有447個癌症基因的外顯子DNA序列且在60個基因中的191個區域進行重排檢測。從含有至少20%腫瘤細胞核的組織中分離出DNA,且使用溶液相Agilent SureSelect雜交捕獲套組及Illumina HiSeq 2500定序儀來進行大規模平行定序分析。Exon DNA sequences of 447 cancer genes were investigated in the extracted DNA, and rearrangement was detected in 191 regions of 60 genes. DNA was isolated from tissues containing at least 20% tumor nuclei and analyzed by massively parallel sequencing using a solution-phase Agilent SureSelect hybrid capture kit and an Illumina HiSeq 2500 sequencer.
分子分析顯示出存在了出現在THBS1基因之外顯子9與NRG1之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-56,係對應編碼THBS1之基因(NM_003246.4)之外顯子9的一部分。核苷酸57-145係對應NRG1基因之外顯子6的一部分。 ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC (SEQ ID NO: 376)。 Molecular analysis revealed the presence of an in-frame fusion junction between exon 9 of the THBS1 gene and exon 6 of the NRG1 gene. The underlined sequence, nucleotides 1-56, corresponds to a part of exon 9 of the gene encoding THBS1 (NM_003246.4). Nucleotides 57-145 correspond to a part of exon 6 of the NRG1 gene. ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC (SEQ ID NO: 376).
進一步的分子分析並未顯露出有ALK、NTRK1、NTRK2、NTRK3、ROS1的融合物,且未檢測到涉及BRCA1、BRCA2或PALB2的致病性突變、複本數改變或結構變異體。已對鑑定以下突變:TP53中的c.783-10_787delinsCCTG、TP53在17p13.1的單複本缺失、c.*178T>A,CDKN1B的外顯子3、在ERCC2外顯子12中的c.1205A>C (p.N402T)、在FANCD2外顯子21中的c.1858T>C (p.C620R)、在KIF1B外顯子5中的c.398A>C (p.N133T)、在NAB2中的c.1092-7T>C、在NOTCH1外顯子20中的c.3245G>A (p.R1082H)、在PTPN14外顯子13中的c.1798C>T (p.R600W)、及在RASA1外顯子9中的c.1273C>A (p.H425N)。 在KRAS或NRAS中也未鑑定出致病性單一核苷酸同質多形性。在患者的材料中鑑定出以下四種體細胞基因體變異體:在TP53中的Y220C誤義突變、在APC中的R564*終止突變( stop gain mutation)、在APC中的N1455fs框移突變、及在BRAF中的G469A誤義突變。確定MSI狀態為穩定的且腫瘤突變負荷為4.6m/MB。 實例 25 :源自患者的 AGRN-NRG1 融合數據 Further molecular analysis did not reveal fusions with ALK, NTRK1, NTRK2, NTRK3, ROS1, and no pathogenic mutations, copy number alterations, or structural variants involving BRCA1, BRCA2, or PALB2 were detected. The following mutations have been identified: c.783-10_787delinsCCTG in TP53, single copy deletion of TP53 at 17p13.1, c.*178T>A, exon 3 of CDKN1B, c.1205A in ERCC2 exon 12 >C (p.N402T), c.1858T>C (p.C620R) in FANCD2 exon 21, c.398A>C (p.N133T) in KIF1B exon 5, in NAB2 c.1092-7T>C, c.3245G>A in NOTCH1 exon 20 (p.R1082H), c.1798C>T in PTPN14 exon 13 (p.R600W), and outside RASA1 c.1273C>A in exon 9 (p.H425N). Pathogenic single nucleotide polymorphisms were also not identified in KRAS or NRAS. The following four somatic gene variants were identified in patient material: Y220C missense mutation in TP53, R564* stop gain mutation in APC, N1455fs frameshift mutation in APC, and G469A missense mutation in BRAF. The MSI status was determined to be stable and the tumor mutational burden was 4.6 m/MB. Example 25 : Patient-derived AGRN-NRG1 fusion data
將獲自經診斷罹患胰臟癌(PDAC)之男性病患的腫瘤性材料進行以下分子生物科技分析。Tumor material obtained from male patients diagnosed with pancreatic cancer (PDAC) was subjected to the following molecular biotechnological analyses.
使用由648個帶有單核苷酸變異、插入與刪除(indel)、易位之基因所組成的定製腫瘤學檢驗套組進行雜交捕獲次世代定序。可從重排基因中無偏地檢測所表現的融合轉錄本來進行全基因體轉錄本RNA定序。該測定法的限度為對於單核苷酸變異體的變異對偶基因分數(VAF)為5%且靈敏度為98.2%,對於插入與刪除的VAF為5%且靈敏度為91.8%,而對於易位的靈敏度為91.7%。Hybrid capture next-generation sequencing was performed using a custom oncology panel consisting of 648 genes with single nucleotide variations, insertions and deletions (indels), and translocations. Whole-genome transcript RNA sequencing can be performed with unbiased detection of expressed fusion transcripts from rearranged genes. The limits of the assay are a variant allele fraction (VAF) of 5% with a sensitivity of 98.2% for single nucleotide variants, a VAF of 5% for insertions and deletions with a sensitivity of 91.8%, and a sensitivity of 91.8% for translocations. The sensitivity is 91.7%.
此分析致使如SEQ ID NO: 403之序列的鑑定。其顯示出存在了符合讀框的融合且在AGRN基因之外顯子12與NRG1基因之外顯子6之間出現接合處。下劃線序列,即核苷酸1-106,係對應AGRN基因(NM_001305275.2)之外顯子12的一部分。核苷酸107-207係對應編碼NRG1之基因(NM_001159999.3)的一部分。 GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC (SEQ ID NO: 403)。 This analysis led to the identification of a sequence such as SEQ ID NO: 403. It shows the presence of an in-frame fusion with a junction between exon 12 of the AGRN gene and exon 6 of the NRG1 gene. The underlined sequence, nucleotides 1-106, corresponds to a part of exon 12 of the AGRN gene (NM_001305275.2). Nucleotides 107-207 correspond to part of the gene encoding NRG1 (NM_001159999.3). GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAG CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC (SE Q ID NO: 403).
此外,也檢測到FGFR1過度表現。 實例 26 :源自患者的 PVALB-NRG1 融合數據 In addition, FGFR1 overexpression was also detected. Example 26 : Patient-derived PVALB-NRG1 fusion data
從經診斷出罹患非小細胞肺癌的患者所獲得之腫瘤性材料上進行的分子分析顯示出存在了出現在如SEQ ID NO: 437序列所包含之PVALB基因之外顯子4與NRG1之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-102,係對應編碼PVALB之基因(NM_002854.3)之外顯子4的一部分。核苷酸103-227係對應NRG1基因之外顯子6的一部分。 TAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGATACTTG (SEQ ID NO: 437)。 實例 27 :源自患者的 SLC3A2-NRG1 融合數據 Molecular analysis of tumorous material obtained from patients diagnosed with non-small cell lung cancer revealed the presence of exon 4 and NRG1 of the PVALB gene contained in the sequence SEQ ID NO: 437 In-frame fusion junction between sub6. The underlined sequence, nucleotides 1-102, corresponds to a part of exon 4 of the gene encoding PVALB (NM_002854.3). Nucleotides 103-227 correspond to a part of exon 6 of the NRG1 gene. TAAAAGGCTTCTCCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTT CAAACCCCTCGAGATACTTG (SEQ ID NO: 437). Example 27 : Patient-derived SLC3A2-NRG1 fusion data
從經診斷出罹患肺癌的患者所獲得之腫瘤性材料上進行以下分子生物技術分析。先後使用Maxwell RSC及ReliaPrep™ FFPE Total RNA Promega套組以及Archer Dx CTL Fusionplex從經福馬林固定的石蠟塊萃取出總RNA。使用Archer Analysis第6.2.7版進行生物資訊分析。The following molecular biotechnological analyzes were performed on tumorous material obtained from patients diagnosed with lung cancer. Total RNA was extracted from formalin-fixed paraffin blocks using Maxwell RSC followed by ReliaPrep™ FFPE Total RNA Promega Kit and Archer Dx CTL Fusionplex. Bioinformatic analysis was performed using Archer Analysis version 6.2.7.
此分析顯示出存在了出現在如SEQ ID NO: 454序列所包含之SLC3A2基因之外顯子2與NRG1之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-93,係對應編碼SLC3A2之基因(NM_002394.6)之外顯子2的一部分。核苷酸94-121係對應NRG1基因之外顯子6的一部分。 AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAGCTACATCTACATCCACCACTGGGACAAG (SEQ ID NO: 454)。 實例 28 :源自患者的 APP-NRG1 融合數據 This analysis revealed the presence of an in-frame fusion junction occurring between exon 2 of the SLC3A2 gene and exon 6 of the NRG1 gene as encompassed by the sequence of SEQ ID NO: 454. The underlined sequence, nucleotides 1-93, corresponds to a part of exon 2 of the gene encoding SLC3A2 (NM_002394.6). Nucleotides 94-121 correspond to a part of exon 6 of the NRG1 gene. AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAGCTACATCTACATCCACCACTGGGACAAG (SEQ ID NO: 454). Example 28 : Patient-derived APP-NRG1 fusion data
從經診斷出罹患胰臟癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 486序列所包含之APP基因之外顯子14與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-54,係對應編碼APP之基因(NM_001136130.3)之外顯子14的一部分。核苷酸55-141係對應NRG1基因之外顯子6的一部分。 TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT (SEQ ID NO: 486)。 實例 29 :源自患者的 WRN-NRG1 融合數據 Molecular biotechnological analysis of tumorous material obtained from a patient diagnosed with pancreatic cancer revealed the presence of exon 14 of the APP gene and the NRG1 gene contained in the sequence of SEQ ID NO: 486 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-54, corresponds to a part of exon 14 of the gene encoding APP (NM_001136130.3). Nucleotides 55-141 correspond to a part of exon 6 of the NRG1 gene. TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT (SEQ ID NO: 486). Example 29 : Patient-derived WRN-NRG1 fusion data
從經診斷出罹患乳癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 528序列所包含之WRN基因之外顯子33與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-96,係對應編碼APP之基因(NM_000553.6)之外顯子33的一部分。核苷酸97-182係對應NRG1基因之外顯子6的一部分。 AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGC (SEQ ID NO: 528)。 實例 30 :源自患者的 DAAM1-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with breast cancer revealed the presence of exon 33 of the WRN gene and outside the NRG1 gene as contained in the sequence of SEQ ID NO: 528 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-96, corresponds to a part of exon 33 of the gene encoding APP (NM_000553.6). Nucleotides 97-182 correspond to a part of exon 6 of the NRG1 gene. AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGC (SEQ ID NO: 528). Example 30 : Patient-derived DAAM1-NRG1 fusion data
從經診斷出罹患乳癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 605序列所包含之DAAM1基因之外顯子1與NRG1基因之外顯子1之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼DAAM1之基因(NM_001270520.2)之外顯子1的一部分。核苷酸76-150係對應NRG1基因之外顯子1的一部分。 GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAGGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA (SEQ ID NO: 605)。 實例 31 :源自患者的 ASPH-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with breast cancer revealed the presence of a gene present outside exon 1 of the DAAM1 gene and NRG1 gene as contained in the sequence of SEQ ID NO: 605 In-frame fusion junction between exon 1. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 1 of the gene encoding DAAM1 (NM_001270520.2). Nucleotides 76-150 correspond to a portion of exon 1 of the NRG1 gene. GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAGGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA (SEQ ID NO: 605). Example 31 : Patient-derived ASPH-NRG1 fusion data
從經診斷出罹患大腸直腸腺癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 635序列所包含之ASPH基因之外顯子22與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼ASPH之基因(NM_001164750.2)之外顯子1的一部分。核苷酸76-150係對應NRG1基因之外顯子2的一部分。 AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 635)。 實例 32 :源自患者的 NOTCH2-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with colorectal adenocarcinoma showed the presence of exon 22 of the ASPH gene contained in the sequence of SEQ ID NO: 635 and NRG1 In-frame fusion junction between exon 2 of the gene. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 1 of the gene encoding ASPH (NM_001164750.2). Nucleotides 76-150 correspond to part of exon 2 of the NRG1 gene. AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 635). Example 32 : Patient-derived NOTCH2-NRG1 fusion data
從經診斷出罹患胰臟癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 693序列所包含之NOTCH2基因之外顯子6與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼NOTCH2之基因(NM_024408.4)之外顯子6的一部分。核苷酸76-150係對應NRG1基因之外顯子6的一部分。 CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 693)。 實例 33 :源自患者的 CD74-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with pancreatic cancer revealed the presence of exon 6 of the NOTCH2 gene and the NRG1 gene as contained in the sequence of SEQ ID NO: 693 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 6 of the gene encoding NOTCH2 (NM_024408.4). Nucleotides 76-150 correspond to a part of exon 6 of the NRG1 gene. CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 693). Example 33 : Patient-derived CD74-NRG1 fusion data
從經診斷出罹患肺癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 717序列所包含之CD74基因之外顯子2與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼CD74之基因(NM_001025159.3)之外顯子2的一部分。核苷酸76-150係對應NRG1基因之外顯子2的一部分。 AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGCCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 717)。 實例 34 :源自患者的 SDC4-NRG1 融合數據 Molecular biotechnological analysis of tumorous material obtained from a patient diagnosed with lung cancer revealed the presence of exon 2 of the CD74 gene and the NRG1 gene as contained in the sequence of SEQ ID NO: 717 In-frame fusion junction between exon 2. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 2 of the gene encoding CD74 (NM_001025159.3). Nucleotides 76-150 correspond to part of exon 2 of the NRG1 gene. AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGCCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 717). Example 34 : Patient-derived SDC4-NRG1 fusion data
從經診斷出罹患肺癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 743序列所包含之SDC4基因之外顯子2與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼SDC4之基因(NM_002999.4)之外顯子2的一部分。核苷酸76-150係對應NRG1基因之外顯子2的一部分。 TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 743)。 實例 35 :源自患者的 CD44-NRG1 融合數據 Molecular biotechnological analysis of tumorous material obtained from a patient diagnosed with lung cancer revealed the presence of exon 2 of the SDC4 gene and outside the NRG1 gene as contained in the sequence of SEQ ID NO: 743 In-frame fusion junction between exon 2. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 2 of the gene encoding SDC4 (NM_002999.4). Nucleotides 76-150 correspond to part of exon 2 of the NRG1 gene. TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 743). Example 35 : Patient-derived CD44-NRG1 fusion data
從經診斷出罹患癌症的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 761序列所包含之CD44基因之外顯子5與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼CD44之基因(NM_000610 .4)之外顯子2的一部分。核苷酸76-150係對應NRG1基因之外顯子6的一部分。 TTTCTACTGTACACCCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 761)。 實例 36 :源自患者的 SLC4A4-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with cancer revealed the presence of exon 5 of the CD44 gene and the NRG1 gene as contained in the sequence of SEQ ID NO: 761 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 2 of the gene encoding CD44 (NM_000610.4). Nucleotides 76-150 correspond to a part of exon 6 of the NRG1 gene. TTTCTACTGTACACCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCCAGACAGAATCCCTGCTACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 761). Example 36 : Patient-derived SLC4A4-NRG1 fusion data
從經診斷出罹患胰臟癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 765序列所包含之SLC4A4基因之外顯子14與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼SLC4A4之基因(NM_001098484.3)之外顯子14的一部分。核苷酸76-150係對應NRG1基因之外顯子6的一部分。 ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 765)。 實例 37 :源自患者的 SDC4-NRG1 融合數據 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with pancreatic cancer revealed the presence of exon 14 of the SLC4A4 gene and the NRG1 gene as contained in the sequence of SEQ ID NO: 765 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 14 of the gene encoding SLC4A4 (NM_001098484.3). Nucleotides 76-150 correspond to a part of exon 6 of the NRG1 gene. ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 765). Example 37 : Patient-derived SDC4-NRG1 fusion data
從經診斷出罹患肺癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 824序列所包含之SDC4基因之外顯子4與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼SDC4之基因(NM_002999.4)之外顯子4的一部分。核苷酸76-150係對應NRG1基因之外顯子2的一部分。 ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 824)。 實例 38 :源自患者的 ZFAT-NRG1 融合數據 Molecular biotechnological analysis of tumorous material obtained from a patient diagnosed with lung cancer revealed the presence of exon 4 of the SDC4 gene as contained in the sequence of SEQ ID NO: 824 and outside of the NRG1 gene In-frame fusion junction between exon 2. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 4 of the gene encoding SDC4 (NM_002999.4). Nucleotides 76-150 correspond to part of exon 2 of the NRG1 gene. ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 824). Example 38 : Patient-derived ZFAT-NRG1 fusion data
從經診斷出罹患肺癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 828序列所包含之ZFAT基因之外顯子12與NRG1基因之外顯子6之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼ZFAT之基因(NM_020863.4)之外顯子12的一部分。核苷酸76-150係對應NRG1基因之外顯子6的一部分。 ACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 828)。 實例 39 :源自患者的 DSCAML1-NRG1 融合數據 Molecular biotechnological analysis of tumorous material obtained from a patient diagnosed with lung cancer revealed the presence of exon 12 of the ZFAT gene and outside of the NRG1 gene as contained in the sequence of SEQ ID NO: 828 In-frame fusion junction between exon 6. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 12 of the gene encoding ZFAT (NM_020863.4). Nucleotides 76-150 correspond to a part of exon 6 of the NRG1 gene. ACAGGAAGCACCCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG (SEQ ID NO: 828). Example 39 : Patient-derived DSCAML1-NRG1 fusion data
從經診斷出罹患胰臟癌的患者所獲得之腫瘤性材料上進行分子生物技術分析,其顯示出存在了出現在如SEQ ID NO: 868序列所包含之DSCAML1基因之外顯子3與NRG1基因之外顯子2之間的符合讀框融合接合處。下劃線序列,即核苷酸1-75,係對應編碼DSCAML1之基因(NM_020693.4)之外顯子3的一部分。核苷酸76-150係對應NRG1基因之外顯子2的一部分。 GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 868)。 實例 40 : PDX 動物模型 Molecular biotechnological analysis of neoplastic material obtained from a patient diagnosed with pancreatic cancer revealed the presence of exon 3 of the DSCAML1 gene contained in the sequence SEQ ID NO: 868 and the NRG1 gene In-frame fusion junction between exon 2. The underlined sequence, nucleotides 1-75, corresponds to a part of exon 3 of the gene encoding DSCAML1 (NM_020693.4). Nucleotides 76-150 correspond to part of exon 2 of the NRG1 gene. GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCACCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA (SEQ ID NO: 868). Example 40 : PDX animal model
製備源自患者的異種移植(PDX)動物模型使其含有包含了多核苷酸融合物且表現了由其所編碼之多肽融合物的異常細胞之植入異常細胞,以使用澤妥珠單抗作為ERB2及/或Erb3標靶劑進行治療來評估其治療活性。Preparation of a patient-derived xenograft (PDX) animal model containing implanted abnormal cells containing abnormal cells comprising a polynucleotide fusion and expressing a polypeptide fusion encoded thereby, using zetuzumab as ERB2 and/or Erb3 targeting agents were treated to assess their therapeutic activity.
可從含有癌症或樣本之樣本產生PDX模型,諸如在此實例中經診斷患有胰臟腺癌之女性患者,以及基本上如Puig等人, A Personalized Preclinical Model to Evaluate the Metastatic Potential of Patient-Derived Colon Cancer Initiating Cells, Clin Cancer Res; 19(24), 6787-6801 (2013)中所述之CDH1-NRG1多核苷酸融合物(見實例18),其全文係併入此申請案中。PDX models can be generated from samples containing cancer or specimens, such as in this example a female patient diagnosed with pancreatic adenocarcinoma, and essentially as described in Puig et al., A Personalized Preclinical Model to Evaluate the Metastatic Potential of Patient-Derived The CDH1-NRG1 polynucleotide fusion described in Colon Cancer Initiating Cells, Clin Cancer Res; 19(24), 6787-6801 (2013) (see Example 18), the entirety of which is incorporated into this application.
對於全部的小鼠研究,使用介於6-8週齡之雌性NOD.CB17/AlhnRj-Prkdcscid/Rj小鼠(Janvier Labs)。For all mouse studies, female NOD.CB17/AlhnRj-Prkdcscid/Rj mice (Janvier Labs) between 6-8 weeks of age were used.
該等模型可以皮下注射,或在免疫缺陷小鼠的胰腺中原位注射。原位模型係於淋巴結、肝臟、肺或癌病中產生局部及遠處轉移,再現PDAC患者的前期疾病。These models can be injected subcutaneously, or orthotopically in the pancreas of immunodeficient mice. Orthotopic models produce local and distant metastases in lymph nodes, liver, lung, or cancer, reproducing predisease in PDAC patients.
在第6週,將所有經載體或澤妥珠單抗治療的小鼠犧牲且分析治療功效。At week 6, all vehicle- or zeltuzumab-treated mice were sacrificed and analyzed for treatment efficacy.
可以對統計學上相關數量的NOD-SCID小鼠原位胰腺注射1*10^6個來自所述PDX模型的腫瘤細胞。從注射後15天開始,每週用CT成像來監測小鼠並檢測胰腺中的原發性腫瘤。在至少80%的動物的胰腺中有原發性腫瘤生長後開始進行治療。A statistically relevant number of NOD-SCID mice can be injected orthotopically with 1*10^6 tumor cells from the PDX model. Beginning 15 days after injection, mice were monitored weekly with CT imaging and primary tumors in the pancreas were detected. Treatment was initiated after at least 80% of the animals had primary tumor growth in the pancreas.
基於不符合以下品質標準的小鼠將其排除在研究之外,包括手術後死亡、沒有原發性腫瘤、腫瘤過大的小鼠、體重不足、及一般疾病跡象。根據實例12進行給藥及治療方案,然而,使用標準程序及計算,注意從人類情況中適當地轉換與小鼠有關的條件。Mice were excluded from the study based on failure to meet the following quality criteria, including death after surgery, absence of primary tumors, mice with excessively large tumors, underweight, and general signs of disease. Dosing and treatment regimens were carried out according to Example 12, however, using standard procedures and calculations, taking care to translate appropriately from human to mouse conditions.
本揭露之 序列資訊 VAPB 序列資訊SEQ ID NO:1 VAPB-NRG1融合物5’處之VAPB外顯子1序列 CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAG SEQ ID NO:2 VAPB-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT SEQ ID NO:3 VAPB-NRG1多核苷酸序列 CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT SEQ ID NO:4 VAPB-NRG1多肽序列 QVLSLEPQHELKFRALPPRLKEMKSQESAAG SEQ ID NO:17 VAPB之外顯子1 AGTCCCCGCCCCTGGAGCCGGCGGCGCAGGGCGCAGCTTCCCGCCGCCAGAGCGGGCCAGCCTGCTGCGTGCGTGCGTGTGTACGACTCTGCGTGCGTGCGTGCGTGCGTGCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCACCGCGTAGACCGACCCCCCCCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCGACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGTGACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAACATGGCGAAGGTGGAGCAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAG SEQ ID NO:18 VAPB之外顯子2 GTCCCTTCACCGATGTTGTCACCACCAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTACAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATTAATGTATCTG SEQ ID NO:19 VAPB之外顯子3 TGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTTTATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTA SEQ ID NO:20 VAPB之外顯子4 TGGAAGGAGGCAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAATGATAAACCA SEQ ID NO:21 VAPB之外顯子5 CATGATGTAGAAATAAATAAAATTATATCCACAACTGCATCAAAGACAGAAACACCAATAGTGTCTAAGTCTCTGAGTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGGAAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG SEQ ID NO:22 VAPB之外顯子6 GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGCCCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTATCGTTGGTGTAATTATTGGGAAGATTGCCTTGTAGAGGTAGCATGCACAGGATGGTAAATTGGATTGGTGGATCCACCATATCATGGGATTTAAATTTATCATAACCATGTGTAAAAAGAAATTAATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCTGCACACACATACACAGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTGATTGAGGGGGAAAAGAATGATCTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCAGGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTGACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATACCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGCACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTTATCATTTTTATTATTTTGCCATTGGAAGGTTAACTTTAAAATGAGCCCTATCACTGAGAAATACGTGTTTCATGATTTAACTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTTTTTTTTGGTTGTCTTCAGCTGACAGTATGAAAAATGAAACTGCTGAAAAAGCTGAGCACCTGGTCACCCTTGGCCTTCCATTGCTTTGGCCTTCAGTAAAAAGCAGCCTCCCTTCTAGGTCAGGGAACCATGCCATTGAGACTAGTAACGGGCGTTCTGGGCACAGTCCCACTGTGCACAGGTTTGAGAGGACAAGTTCATCAGAAGGAAGGCAGTCCTTAGAAGTCACATACGTTGAGCCACGTTGCTCCTAAGCCTGGCTCTGTCAAGCTGGGTCAGGGGCCTTGAAACTGGAGAAGTGGAAGTCTATGGTTGGTCTGAGTAAGTAACTTCCTGTCTTCATGAAAAAAGTTGACTTTGAATCCCAGGTACTCACAGAAATGGTGAACAGACTTAGTTGTTACCCAGGCACCCATGGATTGTGTTGAGTGTGCAGACAGGGAGGCCACCCCAATAGGAATTCGTCTCCAGGATTTTTCCCATGTGTCCCCCAGTACTTATAAAAGGGAGTGAAAAGACCGAGCTGTAAGGCATGTGCCTTCTGCCACCTGACTTTCCGTGAGGGGACTAAAATTTACTAATTGTAGTTGCTGCAGCCAGTTAAGTCCTGTAGCTTCCAGGCCCTCATGTCTTTGATAGGAGAGTGCTTAGGTGGTCCCCAACAGTGCCTAGGGGTACAGTACAGTCCCATTACACTAGAGCAGGGCTCTATTTATTTTTAAAGGATATGGCCGTGTGTTTTGATAAAACTTTATTCACAAAAACAGCCGGGTCATGGGATTTGGCTTGTGAGTCTGTAACAGTTCTTAAAAAGAATATCTGAGAAACTACTCTGTTTTAGACCTTTGAAGGTGATTTAGAGTTTTGTGTACATCTAGGAGAAGGTGTTCAGCTTCTCAGAGGATGTGGACATTTTGGTTGCAGCTAAAAATCAGTCTCTGAAGTCTCTCTCCCTTCTAGAGGTTAGGACTTGGTGAACATGTTTGTGGGCCTTTTGACTGAGTGGCAGAAGGAAACTGCTCAGGAAGAGAAACAGGTGACTGATGGGAAGGTTGATTATTTTCTCAGTCATCCTGGCAGCCAAAAATGTGCCAGGAAAAGAAAGAATGTGGAGCACGCGTGGCTCCTGGAGGACTTGGAGATGCATGCACATTTAGGGTGTTTTCCCTAGAATTACATAATGAAAAAAAGAATAAGGCAAAGAGGAGGTGAATATGGGGCCTGTCACAACGGCCTGCCCTGCCCCAAGAGGGTTAAGAGTCAGATAATCGGGACGAAACTGGCATGGAAAGAGCGAGCCTAGGGAGGATGCCGCTGGGCAGTGTGCATGGGGGAGCTGCTGCCAGGCTGCCCTCCAGTCTGCTCCTGTGGTTACTGGCTCCACAGCACCTCAGAGAGGGCGGCCCTGGCTTCAGAAATGCCAGCCATAGTGCTCACAAATGCAGAAGAGATGGAAGCGGTGACAGAATCCTGAAAGTTTTTATTGATTGAAGTTTTAAATTGGTAACTTAAGCTTCCTTGGCACGATACAAAATACCTCTTAAAGACAGCAGGCTTTTTTATTTGTAGGTGTGAGGAACTGGCTTTAACTTTTTTCTCCTCCTAGTTTGCATGTTTTCCTTCTCTCGTCTTCTGAACTGCTGGCACCAGCAGTAATACATACTGATAAAATCAAAATTGATTTTTACCAGTGGCCAGTTTATGGCTAGAGAGACGACTTATACCTCCATAACACAGAAGGGGGAAAAATGAAGAACCTCCAGTGATCCGTGAAAACCTAAACGCTTTCAAACAAATCCCAGGAACAGAATTGCTATCGAAAGATATCATTGCCCAGTTTGCAGGCTATGTTGAGTCAGATAGAACTGAATGTAGTGAGAGCTCAGAGCTACAGAGCCTTTCAGATGAATTTGAAAACAGACTCTGTGTGTGTGTGCATGTGTGCATGTGTGCATGTGTGGCATATGTGCCGTATGTCAGTAGCTTGACAGTTTTCAAATCGTGCCTATATTTTTTTGCATACACAAATTTTTGTGTTTGCAAACTCAGAATCCATGCCAAAATACAATGTTATATGTCATTTTCAGCTCCTTCTCTAAAGGAATGGCCCATTTCTCATTGTAGTTTGAGAAATACATGTATGAAGAGATAGGGGTCTTGGGCTTCCCAGTGTCACTTTGAACACCTGAATAACATTTAACTCCTGAGACCTTCTCGGTGTAGAGGCCACTGCTTCCCCCTGCTGGAGATGGCATTTCATTGAAGGGCCTCTCGTGGCTTTCCCTGCCCCCGGCTGTCTGGCCTGAAGAAGGAGAAAGAACCAAACTGAACTATGAAAAGTTACCACTCTGAGGAGACCTCTCTTAATTAACACTTGGGGCCATGTTTGCTGTTGTTGAGAAGGAGTGTTCTCAAAGATGAGCTGGAATGGAATTGTATTTAGAAAGGCCCCTGCAAAGTATATAGATGGATGACTCTAGTTCATGACATACAAATCCCATAAGGCCAACGACCACTCTTCTGGAACACCAAGAGCAGCTCTGAGATCATGCTGGCCCTACGCGAATTGAGTTTCTGTGGCCTAATTGGATTTGGAGAACGCCTTCCCTGGCCCCTTTTCCTCAGACAGATCTGCTCTGATAGGAACCTTTTCAAGAAAGTTACTGTTGTTTCAATGCCACTCCTTACCTGTATAGAACATTTCCAATACATTCGCTCATTGAACTTAATCCTTGCAACTGTGACTGGGGGGTAGATGGCTCTGTTTGCATACGAAGAAATAAAGGCTCCAGGAGGTTAAATCGGGCAACTTTTTAGAACTAAATCAGTCTCTGTAAGGCCTACATTGCTAAGATACCATTTCAGCTCTGAAAATCTGCTTCAGGGAAGTGAGTGGATGAGGCCTTCCTGCCTCAGCTACTCTGCCCGTCTGTACATCTTTTGTGTCTGCCTCCGTACCTTATTCAGTTATTTTCACACTAAAGTAAGTAGAATTAAGACTGTAGTTCAGATGCTTTTTCTTTTTCTGTTGGAAACTGAACACACTACAGACAGTGAAAAAAGGTACATATTCCATTTTCTCATTGCCTGAAGATCTCTGCTGATGCTCCTGGAGAATGACTTTGGGGGCTTTAGAAAGAATATTGCCAGTCCGTCTCGGCAAGGAGATGATGGGAGCGCTTTATATGGAGGCTTTACATGACTTGTAAATTAAATGTGAATGAGGGCAGTTGATTAAAATTGGTATTACAGAAGGGCCCTGCTGAGGTTTGAAAACAGCTGAGCTGCTGATGTCTCAGGCCTTTCCCTGAATTAGCACTGCGGTTCTCCAGGATATCAGCAAAGAGGGCAAGTAATAGAAGCCCCTGATAAGGAGCGTCAGCCGACAGGCAAGCTTGGGAGGCTGTGGGAATGGGTCTGCCCCCAGCTTCACAGACCTCTTCCTCCAGCCTCTGAATCCCATTAGCCACAGCCTAGAACATTAGCTGAGCTGCACAAGCTCACCCACCCCTGTGCCAGGGGGCCCTGACCTCCCTCCATGCCATGTTTTTGGCTGTATCTACGGCACTTAACAATAGGGGCTTTTTATTTTCATTACAGAGATATTTTGAAAAATTTAAAAGACATGAACTCACATAAACAGTTATGGATGATAGTTAAAAGAGAAACGGGTGGAGGTGGATGAGAGGTTGTCTTCATGAATATAATTACTTGAGATTTTTTTTTCTTAATGGAATTAGTTTATTAGAAAATGTCTGTGTTAAATCCGTAGAAAAGGAAGAAAAGTGTAGCAACAAAAATGTAGCCATTATCTAACTTGCCATAAATATTTGCAGTTATGATACCTTGGAATGTTGCCACGATATGGATTGCTTTGATTAAAAGATGTCAGTTGAATAAAACAGTACTGTGGGAGAATCGCTTTCTGCTGCTAGATAAATGCTGATGTTTATTTTTAAACCAGGAAACATTGATCCTGTAACAATGCCCGATTACAATTGCTTTATTACACCCCAGGGCTGATGGAGATGTAATCACTTGGCTAATGGATGTGGGTGCAGGACAGATGCTCGCTTGCTGGCCTGCTTTCCTGCTTGCATTCTGATGAGCTGCAGGAGTGCGCCTGGCCTTCTGCAGGTGGAGCTGCTGTCAGAGCTTCGTTTCACTGATACCCAAAGCCATGTCTGACTGAAATAAAACAGGTTCCCTTTTTTTTTCCCTTTGGAAAATGCCAACTAAGGGAGACTAATCAGATATCTTAACACAATTTCATCCAGGCTTAGTGCTAACAAGATTGCGGGGCTTTTTAGGGTTTAAGAAGATGAGAAATGAGTGTGCACGTTTCACACGTTGACTTGCCGGTTTTTCCATGTCATACAAAAAAGTCCTGGCTGTTTCTCCGAACTGGCTGCCTGCATTCCCGTCTTTCTTTTGTTTTTAAGAAATAGACTGAATTCAGCTGTTAATCCTCTAGTACAGTATCCATGTTAAAATGTTTTTCCATTGCATCTTTTATGTGAATTCAAAGGTCAGAATTTATTGTCTGTGATATTGAGACCATGTGTACAAGAACTACTTTTTGCTTTTCATCATTCACTCCTTAGCAAACGTTTCGTAAGTACCCTCTGTCTGTTTGCTACTATATGAGGTGCTGCGAAATTAGTGGGCGTGGCTTTTTATATTTTTCATTCGTGTGTAGCCTAAGTAAGGTGACTCAAGATGATACACCGAGAGAAAAATGCAAAATATATTTGGTTCTCATTTCTGTTGCTGTCGTTTCCTTTTTAAAGACGATTTATCAACTGCTGCCATTTGGAACTTCCTATAAGAAACTAAAAATGATCTATTTCAGTGTTCCTTTCGCCTTTCCTCTGCTTTCTGAATAAATGGTTTCAGTAACCCATGCTGTTCTCTCCCTATTCTACGTCTTTCTCCCTATGTTGAAAAAAGATTCCCACAGTTTCTGATGTGTGTGTTTATAGTCTTCAATGTATGTTAACATGTTAGGAACTGAGTATCTTAAGAGATGTCTTAGAATGCTTTAGTTTTCATAATTTGTCCTTTATGTATTTTTCATTGTATTTGCTGTTTTGACATGGAAGTAATTTAAAAAGTTGGTGCAGGAAAGGACTCTTTACTGTTGCACATTTTGGTTTTCTGATATGTAATAAATTCATGGCTTGGCAGCTGACATGATGTTTCCCAGAGAGAAGGAGATGTATTTCTGCAGGGTCCAGACCAAAAGAGCCATTTACAGCATGTTCTCCCATGTTCCATTATCAGCCTGATGAAACCTGCCCTGCCAAGGCATAAACTTTTGTACTAGCTGTCTCCATATTATGTTCAATAAATTCTGTGCTCTGAATATATTTAAAAAAAAAAAAA SEQ ID NO:23 VAPB之全部6個外顯子1-6按順序排列的多核苷酸序列 SEQ ID NO:24 VAPB之外顯子1的轉譯多肽序列 MAKVEQVLSLEPQHELKFR SEQ ID NO:25 VAPB之外顯子2的轉譯多肽序列 PFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGASINVS SEQ ID NO:26 VAPB之外顯子3的轉譯多肽序列 MLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAV SEQ ID NO:27 VAPB之外顯子4的轉譯多肽序列 WKEAKPEDLMDSKLRCVFELPAENDKP SEQ ID NO:28 VAPB之外顯子5的轉譯多肽序列 HDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENKQFK SEQ ID NO:29 VAPB之外顯子6的轉譯多肽序列 EEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL SEQ ID NO:30 VAPB多肽序列 MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGASINVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVWKEAKPEDLMDSKLRCVFELPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL CADM1 序列資訊SEQ ID NO:5 CADM1-NRG1融合物5’處之CADM1外顯子7序列 AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACG SEQ ID NO:6 CADM1-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:7 CADM1-NRG1多核苷酸序列 AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:8 CADM1-NRG1多肽序列 ASNIVGKAHSDYMLYVYATSTSTTGTSH SEQ ID NO:33 CADM1之外顯子1 GGTTGGGCTCGCGGCGCTGTGATTGGTCTGCCCGGACTCCGCCTCCAGCGCATGTCATTAGCATCTCATTAGCTGTCCGCTCGGGCTCCGGAGGCAGCCAACGCCGCCAGTCTGAGGCAGGTGCCCGACATGGCGAGTGTAGTGCTGCCGAGCGGATCCCAGTGTGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCGGGCTCCGGCTCCGGCTTCTGCTGTTGCTCTTCTCCGCCGCGGCACTGATCCCCACAG SEQ ID NO:34 CADM1之外顯子2 GTGATGGGCAGAATCTGTTTACGAAAGACGTGACAGTGATCGAGGGAGAGGTTGCGACCATCAGTTGCCAAGTCAATAAGAGTGACGACTCTGTGATTCAGCTACTGAATCCCAACAGGCAGACCATTTATTTCAGGGACTTCAGGC SEQ ID NO:35 CADM1之外顯子3 CTTTGAAGGACAGCAGGTTTCAGTTGCTGAATTTTTCTAGCAGTGAACTCAAAGTATCATTGACAAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGCTCTATACCGATCCCCCACAGGAAAGTTACACCACCATCACAGTCCTGG SEQ ID NO:36 CADM1之外顯子4 TCCCACCACGTAATCTGATGATCGATATCCAGAAAGACACTGCGGTGGAAGGTGAGGAGATTGAAGTCAACTGCACTGCTATGGCCAGCAAGCCAGCCACGACTATCAGGTGGTTCAAAGGGAACACAGAGCTAAAAG SEQ ID NO:37 CADM1之外顯子5 GCAAATCGGAGGTGGAAGAGTGGTCAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGAGGACGATGGGGTCCCAGTGATCTGCCAGGTGGAGCACCCTGCGGTCACTGGAAACCTGCAGACCCAGCGGTATCTAGAAGTACAGT SEQ ID NO:38 CADM1之外顯子6 ATAAGCCTCAAGTGCACATTCAGATGACTTATCCTCTACAAGGCTTAACCCGGGAAGGGGACGCGCTTGAGTTAACATGTGAAGCCATCGGGAAGCCCCA SEQ ID NO:39 CADM1之外顯子7 GCCTGTGATGGTAACTTGGGTGAGAGTCGATGATGAAATGCCTCAACACGCCGTACTGTCTGGGCCCAACCTGTTCATCAATAACCTAAACAAAACAGATAATGGTACATACCGCTGTGAAGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACG SEQ ID NO:40 CADM1之外顯子8 ACACAACGGCGACGACAGAACCAGCAGTTCACG SEQ ID NO:41 CADM1之外顯子9 GCCTTACTCAGTTGCCCAATTCCGCAGAAGAACTGGACAGTGAGGACCTCTCAG SEQ ID NO:42 CADM1之外顯子10 ATTCCCGAGCAGGTGAAGAAGGCTCGATCAGGGCAGTGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTTCGCCATGCTGTGCTTGCTCATCATTCTGGGGCGCTATTTTGCCAGACATAAAG SEQ ID NO:43 CADM1之外顯子11 GTACATACTTCACTCATGAAGCCAAAGGAGCCGATGACGCAGCAGACGCAGACACAGCTATAATCAATGCAGAAGGAGGACAGAACAACTCCGAAGAAAAGAAAGAGTACTTCATCTAGATCAGCCTTTTTGTTTCAATGAGGTGTCCAACTGGCCCTATTTAGATGATAAAGAGACAGTGATATTGGAACTTGCGAGAAATTCGTGTGTTTTTTTATGAATGGGTGGAAAGGTGTGAGACTGGGAAGGCTTGGGATTTGCTGTGTAAAAAAAAAAAAAATGTTCTTTGGAAAGTACACTCTGCTGTTTGACACCTCTTTTTTCGTTTGTTTGTTTGTTTAATTTTTATTTCTTCCTACCAAGTCAAACTTGGATACTTGGATTTAGTTTCAGTAGATTGCAGAAAATTCTGTGCCTTGTTTTTTGTTTGTTTGTTGCGTTCCTTTCTTTTCCCCCTTTGTGCACATTTATTTCCTCCCTCTACCCCAATTTCGGATTTTTTCCAAAATCTCCCATTTTGGAATTTGCCTGCTGGGATTCCTTAGACTCTTTTCCTTCCCTTTTCTGTTCTAGTTTTTTACTTTTGTTTATTTTTATGGTAACTGCTTTCTGTTCCAAATTCAGTTTCATAAAAGGAGAACCAGCACAGCTTAGATTTCATAGTTCAGAATTTAGTGTATCCATAATGCATTCTTCTCTGTTGTCGTAAAGATTTGGGTGAACAAACAATGAAAACTCTTTGCTGCTGCCCATGTTTCAAATACTTAGAGCAGTGAAGACTAGAAAATTAGACTGTGATTCAGAAAATGTTCTGTTTGCTGTGGAACTACATTACTGTACAGGGTTATCTGCAAGTGAGGTGTGTCACAATGAGATTGAATTTCACTGTCTTTAATTCTGTATCTGTAGACGGCTCAGTATAGATACCCTACGCTGTCCAGAAAGGTTTGGGGCAGAAAGGACTCCTCCTTTTTCCATGCCCTAAACAGACCTGACAGGTGAGGTCTGTTCCTTTTATATAAGTGGACAAATTTTGAGTTGCCACAGGAGGGGAAGTAGGGAGGGGGGAAATACAGTTCTGCTCTGGTTGTTTCTGTTCCAAATGATTCCATCCACCTTTCCCAATCGGCCTTACTTCTCACTAATTTGTAGGAAAAAGCAAGTTCGTCTGTTGTGCGAATGACTGAATGGGACAGAGTTGATTTTTTTTTTTTTTTCCTTTGTGCTTAGTTAGGAAGGCAGTAGGATGTGGCCTGCATGTACTGTATATTACAGATATTTGTCATGCTGGGATTTCCAACTCGAATCTGTGTGAAACTTTCATTCCTTCAGATTTGGCTTGACAAAGGCAGGAGGTACAAAAGAAGGGCTGGTATTGTTCTCACACTGGTCTGCTGTCGCTCTCAGTTCTCGATAGGTCAGAGCAGAGGTGGAAAAACAGCATGTACGGATTTTCAGTTACTTAATCAAAACTCAAATGTGAGTGTTTTTATCTTTTTACCTTTCATACACTAGCCTTGGCCTCTTTCCTCAGCCTTAAGAACCATCTGCCAAAAATTACTGATCCTCGCATGATGGCAGCCATAGTGCATAGCTACTAAAATCAGTGACCTTGAACATATCTTAGATGGGGAGCCTCGGGAAAAGGTAGAGGAGTCACGTTACCATTTACATGTTTTAAAGAAAGAAGTGTGGGGATTTTCACTGAAACGTCTAGGAAATCTAGAAGTAGTCCTGAAGGACAGAAACTAAACTCTTACCATATGTTTGGTAAGACTCCAGACTCCAGCTAACAGTCCCTATGGAAAGATGGCATCAAAAAAGATAGATCTATATATATATATAAATATATATTCTATTACATTTTCAGTGAGTAATTTTGGATTTTGCAAGGTGCATTTTTACTATTGTTACATTATGTGGAAAACTTATGCTGATTTATTTAAGGGGGAAAAAGTGTCAACTCTTTGTTATTTGAAAACATGTTTATTTTTCTTGTCTTTATTTTAACCTTTGATAGAACCATTGCAATATGGGGGCCTTTTGGGAACGGACTGGTATGTAAAAGAAAATCCATTATCGAGCAGCATTTTATTTACCCCTCCCCTATCCCTAGGCACTTAACCAAGACAAAAAGCCACAATGAACATCCCTTTTTCAATGAATTTTATAATCTGCAGCTCTATTCCGAGCCCTTAGCACCCATTCCGACCATAGTATAATCATATCAAAGGGTGAGAATCATTTAGCATGTTGTTGAAAGGTTTTTTTTCAGTTGTTCTTTTTAGAAAAAAAGAAAAACAAAAACAAAAACAAAAAAAAAAAATCACACCATTGCTCACAGAATTGGCATCTCATTTTTGGGACCTCCCATCTTTCTGTTTTGAAAAGTGTACAGTAGTGCAGTGTTCCTGATGTAACTTTATGGCTTACAATGTTGACATGTCTCAGGTTCATGTGTTGCGATTGGTGTTTTCCGTCTCAGGTAGATTGCAAAGTGTAGGCCCCACACATTGGAAAAAATAATAATAAAACAAAGCAAAAACAGGAAATTATGGATTTTAGTTGTATATTGGTTTATGTATTTTTTCTTAAGTATACAGTGCACTGTTTGAAATGTATTGTTGAGTATTACTTTGTACAGGTTGATCACTTTTTTTAGAGTGAAGAAAGAACAAACTTGTTTTTTGTGTTTTTTAAAGGAATATAAAATAATGAAGGATGTATAATTGATGCCAAATAAGCTTGTTCTTTAGTCACACCGACGTCTTATTTTTCCCTTTAGGCCAGTTCTGTTTTTAAGGTGTACATGGACAATGTTACAGTGTAAGAAACTCCATATCCATATGTTCCCATTCGCATTTTGTATTGGTTCATGTATACCATTTTTACAAAAAAAAAAAGAAAAAAAAGAAGTACTATAAAATATCTGTCTTCTTAATAAAAAAAAATTAATGTTACAAAGTGA SEQ ID NO:44 CADM1之全部11個外顯子1-11按順序排列的多核苷酸序列 SEQ ID NO:45 CADM1之外顯子1的轉譯多肽序列 MASVVLPSGSQCAAAAAAAAPPGLRLRLLLLLFSAAALIPT SEQ ID NO:46 CADM1之外顯子2的轉譯多肽序列 DGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFR SEQ ID NO:47 CADM1之外顯子3的轉譯多肽序列 LKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVL SEQ ID NO:48 CADM1之外顯子4的轉譯多肽序列 PPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELK SEQ ID NO:49 CADM1之外顯子5的轉譯多肽序列 KSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQ SEQ ID NO:50 CADM1之外顯子6的轉譯多肽序列 KPQVHIQMTYPLQGLTREGDALELTCEAIGKP SEQ ID NO:51 CADM1之外顯子7的轉譯多肽序列 PVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVY SEQ ID NO:52 CADM1之外顯子8的轉譯多肽序列 TTATTEPAVH SEQ ID NO:53 CADM1之外顯子9的轉譯多肽序列 LTQLPNSAEELDSEDLS SEQ ID NO:54 CADM1之外顯子10的轉譯多肽序列 SRAGEEGSIRAVDHAVIGGVVAVVVFAMLCLLIILGRYFARHK SEQ ID NO:55 CADM1之外顯子11的轉譯多肽序列 TYFTHEAKGADDAADADTAIINAEGGQNNSEEKKEYFI SEQ ID NO:56 CADM1多肽序列 MASVVLPSGSQCAAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELKGKSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQPVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVYDTTATTEPAVHGLTQLPNSAEELDSEDLSDSRAGEEGSIRAVDHAVIGGVVAVVVFAMLCLLIILGRYFARHKGTYFTHEAKGADDAADADTAIINAEGGQNNSEEKKEYFI SEQ ID NO:57 CADM1之全部7個外顯子1-7按順序排列的多核苷酸序列 SEQ ID NO:58 CADM1之外顯子1-7按順序排列的轉譯多肽序列 MASVVLPSGSQCAAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELKGKSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQPVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVY CD44 序列資訊SEQ ID NO:9 來自CD44-NRG1融合物5’處之CD44外顯子5的序列 GACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCA SEQ ID NO:10 CD44-NRG1融合物3’處之NRG1外顯2子序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO:11 CD44-NRG1多核苷酸序列 GACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCACCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO:12 CD44-NRG1多肽序列 DEDSPWITDSTDRIPATTLPPRLKEMKSQESAAGSK SEQ ID NO:61 CD44之外顯子1 CTCATTGCCCAGCGGACCCCAGCCTCTGCCAGGTTCGGTCCGCCATCCTCGTCCCGTCCTCCGCCGGCCCCTGCCCCGCGCCCAGGGATCCTCCAGCTCCTTTCGCCCGCGCCCTCCGTTCGCTCCGGACACCATGGACAAGTTTTGGTGGCACGCAGCCTGGGGACTCTGCCTCGTGCCGCTGAGCCTGGCGCAGATCG SEQ ID NO:62 CD44之外顯子2 ATTTGAATATAACCTGCCGCTTTGCAGGTGTATTCCACGTGGAGAAAAATGGTCGCTACAGCATCTCTCGGACGGAGGCCGCTGACCTCTGCAAGGCTTTCAATAGCACCTTGCCCACAATGGCCCAGATGGAGAAAGCTCTGAGCATCGGATTTGAGACCTGCAG SEQ ID NO:63 CD44之外顯子3 GTATGGGTTCATAGAAGGGCACGTGGTGATTCCCCGGATCCACCCCAACTCCATCTGTGCAGCAAACAACACAGGGGTGTACATCCTCACATCCAACACCTCCCAGTATGACACATATTGCTTCAATGCTTCAG SEQ ID NO:64 CD44之外顯子4 CTCCACCTGAAGAAGATTGTACATCAGTCACAGACCTGCCCAATGCCTTTGATGGACCAATTACCATAA SEQ ID NO:65 CD44之外顯子5 CTATTGTTAACCGTGATGGCACCCGCTATGTCCAGAAAGGAGAATACAGAACGAATCCTGAAGACATCTACCCCAGCAACCCTACTGATGATGACGTGAGCAGCGGCTCCTCCAGTGAAAGGAGCAGCACTTCAGGAGGTTACATCTTTTACACCTTTTCTACTGTACACCCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCA SEQ ID NO:66 CD44之外顯子6 CTTTGATGAGCACTAGTGCTACAGCAACTGAGACAGCAACCAAGAGGCAAGAAACCTGGGATTGGTTTTCATGGTTGTTTCTACCATCAGAGTCAAAGAATCATCTTCACACAACAACACAAATGGCTG SEQ ID NO:67 CD44之外顯子7 GTACGTCTTCAAATACCATCTCAGCAGGCTGGGAGCCAAATGAAGAAAATGAAGATGAAAGAGACAGACACCTCAGTTTTTCTGGATCAGGCATTGATGATGATGAAGATTTTATCTCCAGCACCA SEQ ID NO:68 CD44之外顯子8 TTTCAACCACACCACGGGCTTTTGACCACACAAAACAGAACCAGGACTGGACCCAGTGGAACCCAAGCCATTCAAATCCGGAAGTGCTACTTCAGACAACCACAAGGATGACTG SEQ ID NO:69 CD44之外顯子9 ATGTAGACAGAAATGGCACCACTGCTTATGAAGGAAACTGGAACCCAGAAGCACACCCTCCCCTCATTCACCATGAGCATCATGAGGAAGAAGAGACCCCACATTCTACAAGCACAA SEQ ID NO:70 CD44之外顯子10 TCCAGGCAACTCCTAGTAGTACAACGGAAGAAACAGCTACCCAGAAGGAACAGTGGTTTGGCAACAGATGGCATGAGGGATATCGCCAAACACCCAAAGAAGACTCCCATTCGACAACAGGGACAGCTG SEQ ID NO:71 CD44之外顯子11 CAGCCTCAGCTCATACCAGCCATCCAATGCAAGGAAGGACAACACCAAGCCCAGAGGACAGTTCCTGGACTGATTTCTTCAACCCAATCTCACACCCCATGGGACGAGGTCATCAAGCAGGAAGAAGGATGG SEQ ID NO:72 CD44之外顯子12 ATATGGACTCCAGTCATAGTATAACGCTTCAGCCTACTGCAAATCCAAACACAGGTTTGGTGGAAGATTTGGACAGGACAGGACCTCTTTCAATGACAACGC SEQ ID NO:73 CD44之外顯子13 AGCAGAGTAATTCTCAGAGCTTCTCTACATCACATGAAGGCTTGGAAGAAGATAAAGACCATCCAACAACTTCTACTCTGACATCAAGCA SEQ ID NO:74 CD44之外顯子14 ATAGGAATGATGTCACAGGTGGAAGAAGAGACCCAAATCATTCTGAAGGCTCAACTACTTTACTGGAAGGTTATACCTCTCATTACCCACACACGAAGGAAAGCAGGACCTTCATCCCAGTGACCTCAGCTAAGACTGGGTCCTTTGGAGTTACTGCAGTTACTGTTGGAGATTCCAACTCTAATGTCAATCGTTCCTTATCAG SEQ ID NO:75 CD44之外顯子15 GAGACCAAGACACATTCCACCCCAGTGGGGGGTCCCATACCACTCATGGATCTGAATCAGATG SEQ ID NO:76 CD44之外顯子16 GACACTCACATGGGAGTCAAGAAGGTGGAGCAAACACAACCTCTGGTCCTATAAGGACACCCCAAATTCCAG SEQ ID NO:77 CD44之外顯子17 AATGGCTGATCATCTTGGCATCCCTCTTGGCCTTGGCTTTGATTCTTGCAGTTTGCATTGCAGTCAACAGTCGAAGAAG SEQ ID NO:78 CD44之外顯子18 GTGTGGGCAGAAGAAAAAGCTAGTGATCAACAGTGGCAATGGAGCTGTGGAGGACAGAAAGCCAAGTGGACTCAACGGAGAGGCCAGCAAGTCTCAGGAAATGGTGCATTTGGTGAACAAGGAGTCGTCAGAAACTCCAGACCAGTTTATGACAGCTGATGAGACAAGGAACCTGCAGAATGTGGACATGAAGATTGGGGTGTAACACCTACACCATTATCTTGGAAAGAAACAACCGTTGGAAACATAACCATTACAGGGAGCTGGGACACTTAACAGATGCAATGTGCTACTGATTGTTTCATTGCGAATCTTTTTTAGCATAAAATTTTCTACTCTTTTTGTTTTTTGTGTTTTGTTCTTTAAAGTCAGGTCCAATTTGTAAAAACAGCATTGCTTTCTGAAATTAGGGCCCAATTAATAATCAGCAAGAATTTGATCGTTCCAGTTCCCACTTGGAGGCCTTTCATCCCTCGGGTGTGCTATGGATGGCTTCTAACAAAAACTACACATATGTATTCCTGATCGCCAACCTTTCCCCCACCAGCTAAGGACATTTCCCAGGGTTAATAGGGCCTGGTCCCTGGGAGGAAATTTGAATGGGTCCATTTTGCCCTTCCATAGCCTAATCCCTGGGCATTGCTTTCCACTGAGGTTGGGGGTTGGGGTGTACTAGTTACACATCTTCAACAGACCCCCTCTAGAAATTTTTCAGATGCTTCTGGGAGACACCCAAAGGGTGAAGCTATTTATCTGTAGTAAACTATTTATCTGTGTTTTTGAAATATTAAACCCTGGATCAGTCCTTTGATCAGTATAATTTTTTAAAGTTACTTTGTCAGAGGCACAAAAGGGTTTAAACTGATTCATAATAAATATCTGTACTTCTTCGATCTTCACCTTTTGTGCTGTGATTCTTCAGTTTCTAAACCAGCACTGTCTGGGTCCCTACAATGTATCAGGAAGAGCTGAGAATGGTAAGGAGACTCTTCTAAGTCTTCATCTCAGAGACCCTGAGTTCCCACTCAGACCCACTCAGCCAAATCTCATGGAAGACCAAGGAGGGCAGCACTGTTTTTGTTTTTTGTTTTTTGTTTTTTTTTTTTGACACTGTCCAAAGGTTTTCCATCCTGTCCTGGAATCAGAGTTGGAAGCTGAGGAGCTTCAGCCTCTTTTATGGTTTAATGGCCACCTGTTCTCTCCTGTGAAAGGCTTTGCAAAGTCACATTAAGTTTGCATGACCTGTTATCCCTGGGGCCCTATTTCATAGAGGCTGGCCCTATTAGTGATTTCCAAAAACAATATGGAAGTGCCTTTTGATGTCTTACAATAAGAGAAGAAGCCAATGGAAATGAAAGAGATTGGCAAAGGGGAAGGATGATGCCATGTAGATCCTGTTTGACATTTTTATGGCTGTATTTGTAAACTTAAACACACCAGTGTCTGTTCTTGATGCAGTTGCTATTTAGGATGAGTTAAGTGCCTGGGGAGTCCCTCAAAAGGTTAAAGGGATTCCCATCATTGGAATCTTATCACCAGATAGGCAAGTTTATGACCAAACAAGAGAGTACTGGCTTTATCCTCTAACCTCATATTTTCTCCCACTTGGCAAGTCCTTTGTGGCATTTATTCATCAGTCAGGGTGTCCGATTGGTCCTAGAACTTCCAAAGGCTGCTTGTCATAGAAGCCATTGCATCTATAAAGCAACGGCTCCTGTTAAATGGTATCTCCTTTCTGAGGCTCCTACTAAAAGTCATTTGTTACCTAAACTTATGTGCTTAACAGGCAATGCTTCTCAGACCACAAAGCAGAAAGAAGAAGAAAAGCTCCTGACTAAATCAGGGCTGGGCTTAGACAGAGTTGATCTGTAGAATATCTTTAAAGGAGAGATGTCAACTTTCTGCACTATTCCCAGCCTCTGCTCCTCCCTGTCTACCCTCTCCCCTCCCTCTCTCCCTCCACTTCACCCCACAATCTTGAAAAACTTCCTTTCTCTTCTGTGAACATCATTGGCCAGATCCATTTTCAGTGGTCTGGATTTCTTTTTATTTTCTTTTCAACTTGAAAGAAACTGGACATTAGGCCACTATGTGTTGTTACTGCCACTAGTGTTCAAGTGCCTCTTGTTTTCCCAGAGATTTCCTGGGTCTGCCAGAGGCCCAGACAGGCTCACTCAAGCTCTTTAACTGAAAAGCAACAAGCCACTCCAGGACAAGGTTCAAAATGGTTACAACAGCCTCTACCTGTCGCCCCAGGGAGAAAGGGGTAGTGATACAAGTCTCATAGCCAGAGATGGTTTTCCACTCCTTCTAGATATTCCCAAAAAGAGGCTGAGACAGGAGGTTATTTTCAATTTTATTTTGGAATTAAATACTTTTTTCCCTTTATTACTGTTGTAGTCCCTCACTTGGATATACCTCTGTTTTCACGATAGAAATAAGGGAGGTCTAGAGCTTCTATTCCTTGGCCATTGTCAACGGAGAGCTGGCCAAGTCTTCACAAACCCTTGCAACATTGCCTGAAGTTTATGGAATAAGATGTATTCTCACTCCCTTGATCTCAAGGGCGTAACTCTGGAAGCACAGCTTGACTACACGTCATTTTTACCAATGATTTTCAGGTGACCTGGGCTAAGTCATTTAAACTGGGTCTTTATAAAAGTAAAAGGCCAACATTTAATTATTTTGCAAAGCAACCTAAGAGCTAAAGATGTAATTTTTCTTGCAATTGTAAATCTTTTGTGTCTCCTGAAGACTTCCCTTAAAATTAGCTCTGAGTGAAAAATCAAAAGAGACAAAAGACATCTTCGAATCCATATTTCAAGCCTGGTAGAATTGGCTTTTCTAGCAGAACCTTTCCAAAAGTTTTATATTGAGATTCATAACAACACCAAGAATTGATTTTGTAGCCAACATTCATTCAATACTGTTATATCAGAGGAGTAGGAGAGAGGAAACATTTGACTTATCTGGAAAAGCAAAATGTACTTAAGAATAAGAATAACATGGTCCATTCACCTTTATGTTATAGATATGTCTTTGTGTAAATCATTTGTTTTGAGTTTTCAAAGAATAGCCCATTGTTCATTCTTGTGCTGTACAATGACCACTGTTATTGTTACTTTGACTTTTCAGAGCACACCCTTCCTCTGGTTTTTGTATATTTATTGATGGATCAATAATAATGAGGAAAGCATGATATGTATATTGCTGAGTTGAAAGCACTTATTGGAAAATATTAAAAGGCTAACATTAAAAGACTAAAGGAAACAGA SEQ ID NO:79 CD44之全部18個外顯子1-18按順序排列的多核苷酸序列 SEQ ID NO:80 CD44之外顯子1的轉譯多肽序列 MDKFWWHAAWGLCLVPLSLAQI SEQ ID NO:81 CD44之外顯子2的轉譯多肽序列 LNITCRFAGVFHVEKNGRYSISRTEAADLCKAFNSTLPTMAQMEKALSIGFETC SEQ ID NO:82 CD44之外顯子3的轉譯多肽序列 YGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNAS SEQ ID NO:83 CD44之外顯子4的轉譯多肽序列 PPEEDCTSVTDLPNAFDGPITI SEQ ID NO:84 CD44之外顯子5的轉譯多肽序列 IVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPAT SEQ ID NO:85 CD44之外顯子6的轉譯多肽序列 LMSTSATATETATKRQETWDWFSWLFLPSESKNHLHTTTQMA SEQ ID NO:86 CD44之外顯子7的轉譯多肽序列 TSSNTISAGWEPNEENEDERDRHLSFSGSGIDDDEDFISST SEQ ID NO:87 CD44之外顯子8的轉譯多肽序列 STTPRAFDHTKQNQDWTQWNPSHSNPEVLLQTTTRMT SEQ ID NO:88 CD44之外顯子9的轉譯多肽序列 VDRNGTTAYEGNWNPEAHPPLIHHEHHEEEETPHSTST SEQ ID NO:89 CD44之外顯子10的轉譯多肽序列 QATPSSTTEETATQKEQWFGNRWHEGYRQTPKEDSHSTTGTA SEQ ID NO:90 CD44之外顯子11的轉譯多肽序列 ASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRRM SEQ ID NO:91 CD44之外顯子12的轉譯多肽序列 MDSSHSITLQPTANPNTGLVEDLDRTGPLSMTT SEQ ID NO:92 CD44之外顯子13的轉譯多肽序列 QSNSQSFSTSHEGLEEDKDHPTTSTLTSS SEQ ID NO:93 CD44之外顯子14的轉譯多肽序列 RNDVTGGRRDPNHSEGSTTLLEGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLS SEQ ID NO:94 CD44之外顯子15的轉譯多肽序列 DQDTFHPSGGSHTTHGSESD SEQ ID NO:95 CD44之外顯子16的轉譯多肽序列 HSHGSQEGGANTTSGPIRTPQIP SEQ ID NO:96 CD44之外顯子17的轉譯多肽序列 WLIILASLLALALILAVCIAVNSRR SEQ ID NO:97 CD44之外顯子18的轉譯多肽序列 CGQKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVDMKIGV SEQ ID NO:98 CD44多肽序列 MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRTEAADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPATTLMSTSATATETATKRQETWDWFSWLFLPSESKNHLHTTTQMAGTSSNTISAGWEPNEENEDERDRHLSFSGSGIDDDEDFISSTISTTPRAFDHTKQNQDWTQWNPSHSNPEVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPLIHHEHHEEEETPHSTSTIQATPSSTTEETATQKEQWFGNRWHEGYRQTPKEDSHSTTGTAAASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRRMDMDSSHSITLQPTANPNTGLVEDLDRTGPLSMTTQQSNSQSFSTSHEGLEEDKDHPTTSTLTSSNRNDVTGGRRDPNHSEGSTTLLEGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLSGDQDTFHPSGGSHTTHGSESDGHSHGSQEGGANTTSGPIRTPQIPEWLIILASLLALALILAVCIAVNSRRRCGQKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVDMKIGV SEQ ID NO:99 CD44之全部5個外顯子1-5按順序排列的多核苷酸序列 SEQ ID NO:100 CD44之外顯子1-5的轉譯多肽序列 MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRTEAADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPAT SLC3A2 轉錄本 6 序列資訊SEQ ID NO:13 SLC3A2-NRG1融合物5’處之SLC3A2外顯子1序列 CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG SEQ ID NO:14 SLC3A2-NRG1融合物3’處之NRG1外顯子5序列 CATCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:15 SLC3A2-NRG1多核苷酸序列 CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGGCATCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:16 SLC3A2-NRG1多肽序列 RIGDLQAFQGHGAGNLAASTSTSTTGTSH SEQ ID NO:103 SLC3A2之外顯子1 AGATGCAGTAGCCGAAACTGCGCGGAGGCACAGAGGCCGGGGAGAGCGTTCTGGGTCCGAGGGTCCAGGTAGGGGTTGAGCCACCATCTGACCGCAAGCTGCGTCGTGTCGCCGGTTCTGCAGGCACCATGAGCCAGGACACCGAGGTGGATATGAAGGAGGTGGAGCTGAATGAGTTAGAGCCCGAGAAGCAGCCGATGAACGCGGCGTCTGGGGCGGCCATGTCCCTGGCGGGAGCCGAGAAGAATGGTCTGGTGAAGATCAAGGTGGCGGAAGACGAGGCGGAGGCGGCAGCCGCGGCTAAGTTCACGGGCCTGTCCAAGGAGGAGCTGCTGAAGGTGGCAGGCAGCCCCGGCTGGGTACGCACCCGCTGGGCACTGCTGCTGCTCTTCTGGCTCGGCTGGCTCGGCATGCTTGCTGGTGCCGTGGTCATAATCGTGCGAGCGCCGCGTTGTCGCGAGCTACCGGCGCAGAAGTGGTGGCACACGGGCGCCCTCTACCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG SEQ ID NO:104 SLC3A2之外顯子2 GTCTGAAGGGGCGTCTCGATTACCTGAGCTCTCTGAAGGTGAAGGGCCTTGTGCTGGGTCCAATTCACAAGAACCAGAAGGATGATGTCGCTCAGACTGACTTGCTGCAGATCGACCCCAATTTTGGCTCCAAGGAAGATTTTGACAGTCTCTTGCAATCGGCTAAAAAAAAGA SEQ ID NO:105 SLC3A2之外顯子3 GCATCCGTGTCATTCTGGACCTTACTCCCAACTACCGGGGTGAGAACTCGTGGTTCTCCACTCAGGTTGACACTGTGGCCACCAAGGTGAA SEQ ID NO:106 SLC3A2之外顯子4 GATGCTCTGGAGTTTTGGCTGCAAGCTGGCGTGGATGGGTTCCAGGTTCGGGACATAGAGAATCTGAAG SEQ ID NO:107 SLC3A2之外顯子5 GATGCATCCTCATTCTTGGCTGAGTGGCAAAATATCACCAAGGGCTTCAGTGAAGACAG SEQ ID NO:108 SLC3A2之外顯子6 GCTCTTGATTGCGGGGACTAACTCCTCCGACCTTCAGCAGATCCTGAGCCTACTCGAATCCAACAAAGACTTGCTGTTGACTAGCTCATACCTGTCTGATTCTGGTTCTACTGGGGAGCATACAAAATCCCTAGTCACACAGTATTTGAATGCCACTGGCAATCGCTGGTGCAGCTGGAGT SEQ ID NO:109 SLC3A2之外顯子7 TTGTCTCAGGCAAGGCTCCTGACTTCCTTCTTGCCGGCTCAACTTCTCCGACTCTACCAGCTGATGCTCTTCACCCTGCCAGGGACCCCTGTTTTCAGCTACGGGGATGAGATTGGCCTGGATGCAGCTGCCCTTCCTGGACAG SEQ ID NO:110 SLC3A2之外顯子8 CCTATGGAGGCTCCAGTCATGCTGTGGGATGAGTCCAGCTTCCCTGACATCCCAGGGGCTGTAAGTGCCAACATGACTGTGAAG SEQ ID NO:111 SLC3A2之外顯子9 GGCCAGAGTGAAGACCCTGGCTCCCTCCTTTCCTTGTTCCGGCGGCTGAGTGACCAGCGGAGTAAGGAGCGCTCCCTACTGCATGGGGACTTCCACGCGTTCTCCGCTGGGCCTGGACTCTTCTCCTATATCCGCCACTGGGACCAGAATGAGCGTTTTCTGGTAGTGCTTAACTTTGGGGATGTGGGCCTCTCGGCTGGACTGCAGGCCTCCGACCTGCCTGCCAGCGCCAGCCTGCCAGCCAAGGCTGACCTCCTGCTCAGCACCCAGCCAGGCCGTGAGGAGGGCTCCCCTCTTGAGCTGGAACGCCTGAAACTGGAGCCTCACGAAGGGCTGCTGCTCCGCTTCCCCTACGCGGCCTGACTTCAGCCTGACATGGACCCACTACCCTTCTCCTTTCCTTCCCAGGCCCTTTGGCTTCTGATTTTTCTCTTTTTTAAAAACAAACAAACAAACTGTTGCAGATTATGAGTGAACCCCCAAATAGGGTGTTTTCTGCCTTCAAATAAAAGTCACCCCTGCATGGTGAA SEQ ID NO:112 SLC3A2之全部9個外顯子1-9按順序排列的多核苷酸序列 SEQ ID NO:113 SLC3A2之外顯子1的轉譯多肽序列 MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLA SEQ ID NO:114 SLC3A2之外顯子2的轉譯多肽序列 LKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKK SEQ ID NO:115 SLC3A2之外顯子3的轉譯多肽序列 IRVILDLTPNYRGENSWFSTQVDTVATKV SEQ ID NO:116 SLC3A2之外顯子4的轉譯多肽序列 DALEFWLQAGVDGFQVRDIENLK SEQ ID NO:117 SLC3A2之外顯子5的轉譯多肽序列 DASSFLAEWQNITKGFSED SEQ ID NO:118 SLC3A2之外顯子6的轉譯多肽序列 LLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWS SEQ ID NO:119 SLC3A2之外顯子7的轉譯多肽序列 LSQARLLTSFLPAQLLRLYQLMLFTLPGTPVFSYGDEIGLDAAALPGQ SEQ ID NO:120 SLC3A2之外顯子8的轉譯多肽序列 PMEAPVMLWDESSFPDIPGAVSANMTVK SEQ ID NO:121 SLC3A2之外顯子9的轉譯多肽序列 GQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGPGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA SEQ ID NO:122 SLC3A2多肽序列 MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKKSIRVILDLTPNYRGENSWFSTQVDTVATKVKDALEFWLQAGVDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSLSQARLLTSFLPAQLLRLYQLMLFTLPGTPVFSYGDEIGLDAAALPGQPMEAPVMLWDESSFPDIPGAVSANMTVKGQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGPGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA NRG1 序列資訊SEQ ID NO:125 NRG1之外顯子1 AGTAAGCCTCCGCAGCCCACTCGGACTGCAGCCTGTTTGCCGCCCGTCCTCCCATTGCAGCACTCGGGGCGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCAGCAGCATGGGGAAAGGACGCGCGGGCCGAGTTGGCACCACAG SEQ ID NO:126 NRG1之外顯子2 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATACAAAAAAAGCCAGG SEQ ID NO:127 NRG1之外顯子3 GAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGAATCAAACG SEQ ID NO:128 NRG1之外顯子4 AGATCATCACTGGTATGCCAGCCTCAACTGAAGGAGCATATGTGTCTTCAG SEQ ID NO:129 NRG1之外顯子5 AGTCTCCCATTAGAATATCAGTATCCACAGAAGGAGCAAATACTTCTTCAT SEQ ID NO:130 NRG1之外顯子6 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGATACTTGTGCAA SEQ ID NO:131 NRG1之外顯子7 GTGCCCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGTAATGGCCAGCTTCTACA SEQ ID NO:132 NRG1之外顯子8 AGCATCTTGGGATTGAATTTATGG SEQ ID NO:133 NRG1之外顯子9 AGGCGGAGGAGCTGTACCAGAAGAGAGTGCTGACCATAACCGGCATCTGCATCGCCCTCCTTGTGGTCGGCATCATGTGTGTGGTGGCCTACTGCAAAACCAA SEQ ID NO:134 NRG1之外顯子10 GAAACAGCGGAAAAAGCTGCATGACCGTCTTCGGCAGAGCCTTCGGTCTGAACGAAACAATATGATGAACATTGCCAATGGGCCTCACCATCCTAACCCACCCCCCGAGAATGTCCAGCTGGTGAAT SEQ ID NO:135 NRG1之外顯子11 CAATACGTATCTAAAAACGTCATCTCCAGTGAGCATATTGTTGAGAGAGAAGCAGAGACATCCTTTTCCACCAGTCACTATACTTCCACAGCCCATCACTCCACTACTGTCACCCAGACTCCTAGCCACAG SEQ ID NO:136 NRG1之外顯子12 CTGGAGCAACGGACACACTGAAAGCATCCTTTCCGAAAGCCACTCTGTAATCGTGATGTCATCCGTAGAAAACAGTAGGCACAGCAGCCCAACTGGGGGCCCAAGAGGACGTCTTAATGGCACAGGAGGCCCTCGTGAATGTAACAGCTTCCTCAGGCATGCCAGAGAAACCCCTGATTCCTACCGAGACTCTCCTCATAGTGAAAG SEQ ID NO:137 NRG1之外顯子13 GTATGTGTCAGCCATGACCACCCCGGCTCGTATGTCACCTGTAGATTTCCACACGCCAAGCTCCCCCAAATCGCCCCCTTCGGAAATGTCTCCACCCGTGTCCAGCATGACGGTGTCCATGCCTTCCATGGCGGTCAGCCCCTTCATGGAAGAAGAGAGACCTCTACTTCTCGTGACACCACCAAGGCTGCGGGAGAAGAAGTTTGACCATCACCCTCAGCAGTTCAGCTCCTTCCACCACAACCCCGCGCATGACAGTAACAGCCTCCCTGCTAGCCCCTTGAGGATAGTGGAGGATGAGGAGTATGAAACGACCCAAGAGTACGAGCCAGCCCAAGAGCCTGTTAAGAAACTCGCCAATAGCCGGCGGGCCAAAAGAACCAAGCCCAATGGCCACATTGCTAACAGATTGGAAGTGGACAGCAACACAAGCTCCCAGAGCAGTAACTCAGAGAGTGAAACAGAAGATGAAAGAGTAGGTGAAGATACGCCTTTCCTGGGCATACAGAACCCCCTGGCAGCCAGTCTTGAGGCAACACCTGCCTTCCGCCTGGCTGACAGCAGGACTAACCCAGCAGGCCGCTTCTCGACACAGGAAGAAATCCAGGCCAGGCTGTCTAGTGTAATTGCTAACCAAGACCCTATTGCTGTATAAAACCTAAATAAACACATAGATTCACCTGTAAAACTTTATTTTATATAATAAAGTATTCCACCTTAAATTAAACAATTTATTTTATTTTAGCAGTTCTGCAAATAGAAAACAGGAAAAAAACTTTTATAAATTAAATATATGTATGTAAAAATGTGTTATGTGCCATATGTAGCAATTTTTTACAGTATTTCAAAACGAGAAAGATATCAATGGTGCCTTTATGTTATGTTATGTCGAGAGCAAGTTTTGTACAGTTACAGTGATTGCTTTTCCACAGTATTTCTGCAAAACCTCTCATAGATTCAGTTTTTGCTGGCTTCTTGTGCATTGCATTATGATGTTGACTGGATGTATGATTTGCAAGACTTGCAACTGTCCCTCTGTTTGCTTGTAGTAGCACCCGATCAGTATGTCTTGTAATGGCACATCCATCCAGATATGCCTCTCTTGTGTATGAAGTTTTCTTTGCTTTCAGAATATGAAATGAGTTGTGTCTACTCTGCCAGCCAAAGGTTTGCCTCATTGGGCTCTGAGATAATAGTAGATCCAACAGCATGCTACTATTAAATACAGCAAGAAACTGCATTAAGTAATGTTAAATATTAGGAAGAAAGTAATACTGTGATTTAAAAAAAACTATATTATTAATCAGAAGACAGCTTGCTCTTACTAAAAGGAGCTCTCATTTACTTTATTTGATTTTATTTTTCTTGACAAAAAGCAACAGTTTTAGGGATAGCTTAGAAAATGGGTTCTGGCTTGCTATCAGGGTAAATCTAACACCTTACAAGAGGACTGAGTGTCACTTTCTCTCTGGGGGAATGATCCAGCAGCTTATCTAGTTGACAATCAAAACACGGCTGATAAAGGTGCAATCATTTCTGACATGTATTTTTCACTGATTTTGAAGCTAGTGATTGGTTGTGTCTTCTTGGCTCAAAAAGAAGCATATTACGGCACAAAAAGCCCAGCCCAGACAGCACATGCAGCATTTTGTCTGAAATACTTCTAGAGTCAAACGTGCCTGCTGTACATAGCGATGACTTGTCATCATAGGGAAGTATTTCCATCGTAGAGTGTTCAGAAGGAGTGACTGTATAGGTGGAGAGAAGCTTAGTGACTCCGTTGAAATTTTAAAATGTGGATGACCACCCCTTTCTCCCCCTTATTTTTCTTTTATCTTTCCATGTTGCCTTGATCAGGTCATAACTATGCATGAACATTTTTTATCAGGAATGGCCGATGTGTATGTGATTTGTAATCACAAGTAATGATTCATCAGGAAATGTCAATCCTGTTGGAAAGATTGCACCTTTACTTGCAGAAGTGACCCCCACCTGTGTCCTGACCTCTCCATTTACAGGCTCTCTCACCCATTTCCCCCACCTCCTTTAATTTTTGCTTTACTGTCATAAAGTAGGACTAAGATTGGTCTAAGCATTGCATGTTCTTTTGTGATGGTAAATCCAAAGGAAGGCCTATAAGTATTAACATTTGAAATAACTGCTAATTCAGGAAAATGGAAGAAAAAAAATTATTTGAAACACAGAACCCATTTCATGGCCTGCCTGATATCTGTGAAATCAGGGCTGGAGCTTTACTTAGGATTCACATGGCCTCCTAGGAACCATGGGACAAATGGGAAACAGGTTATCGGGGGATTCATGAAGTCAGTGAGAGTAATTGCTTCTTTTTTGCGGGTGAACTGAATGTATTTCTTCACCAAATCTTGATGTTAACAATTAAAAAGAAGAAATGACATGCAAGTAGGTCTTAGCAGAAAAATGCAGGCTGGGCATGAGTCATGTTGTTACCCTCCCACATGCTCCTACAATCCACAGAGATGCCTGTCTGCAGGTTCTTGAAGTTATTGTTAGTATTTGGTATCTCAAATTTTTCGTCACTGTTCACATGCCACTTTCTCTGTGCACAGTGGTATCCTCATTTGCTTTTTAACCTACACTGAGGAGTCTTTGTCAGGTTGCACTGATTTTCCAATTCTGCAGTAATGAGTAAGCTCACGGCATGGGGAAGAAGACAGTCAGTCCAATGAAGTTCTCTAAATTATTTTAACATTGCCTTTGAAGGCCTTGACTCATCCTTAGCTATTTCAATGAAGAAATTCCTACCATGAATTTAAAACCCTAAAAATTCTGTTTCAAATTCTTTGGGCATTGGGGTACTCAGATATCCCATTGTGGAAGAATTTTAAGAATAAATAGAAGTTTCTGTTGAGAACCATGAGCAACATGTTTCTTACAATGAGAATTGCTATGCATTTTAAAATTGCAAATATATATGAAAATTGAAGACAAGAGGAAATTGTATTTCTAACTTGATTCTGATCACTCACAGAGGTGGCATATTATTATAGTTGGGACATCCTTTGCACCCTTCATAAAAAAGGCCAGCTGACTGCTCAGCATCACCTGCCAAGGCCACTAGATTTGTGTTTACAGGGGTATCTCTGTGATGCTTGTCACATCACTCTTGACCACCTCTGTTAATAAATTCCGACAGTGCAGTGGCGATCGGAGTGTGAACTTATGTTCCCAGCATATGGAAAGCTATCTTAGGTTTTAAGGTAGTAGAAATTGCCCAGGAGTTTGACAGCAACTTTGTTTCCCGGGTCTAAAATCGTATCCCACTGAGGTGTATGCAGTGGAGCATAATACATGCAAATACATGCAAAACTCCTTTTGTTTCACCTAAGATTCACTTTCTATCTTACTTTCCCTTCCTGCCTAGTGTGACTTTTGCCCCCAAGAGTGCCTGGACAGCATTCTAGTTTCTACAAAATGGTCCTCTGTGTAGGTGAATGTGTCCCAAACCTGCTATCACTTTCTTGTTTCAGTGTGACTGTCTTGTTAGAGGTGAAGTTTATCCAGGGTAACTTGCTCACTAACTATTCCTTTTTATGGCCTGGGGTTAAAGGGCGCATGGCTCACACTGGTGAAAATAAGGAAGGCCTGGTCTTATCTTGTATTAATAATACTGGCTGCATTCCACCAGCCAGAGATTTCTATCTGCGAAGACCTATGAAACACTGAAGAGAAATGTAGGCAGAAGGAAATGGCCACATATCACAAGTTCTATTATATATTCTTTTGTAAATACATATTGTATATTACTTGGATGTTTTCTTATATCATTTACTGTCTTTTTGAGTTAATGTCAGTTTTTACTCTCTCAACTTACTATGTAACATTGTAAATAACATAATGTCCTTTATTATTTATATTTAAGCATCTAACATATAGAGTTGTTTTCATATAAGTTTAAGATAAATGTCAAAAATATATGTTCTTTTGTTTTTCTTTGCTTTAAAATTATGTATCTTTTCCTTTTCTTTTTTTTAAGAATAATTTATTGTTCAGGAGAAAGAATGTATATGTAACTGAAACTATCTGAAGAATGCACATTGAAGGCCGTGAGGTACTGATAAACTAAAGAATTTATTATTCAAAATACTAAGCAATAAGTAATTGTGATTTATTTAAAGTTTTGTCCATTTTCCATGAAAGACATACTGCAATAAAAATGCTACTCTGTGGAGACCTGGGAGTGTTGCTCAGCAGACTACAGCTTCAGTCTGTTAGACCAGCACCTTTCATCTCATTCCCATAGTTATGCTAATTTAGGATTGTGTTCCATGGACCCCATGATCACCTTGTCTATACGTTGCTTCTTGTCTGTCCATTGCTTTTGCCACACCACCTGTTCTCAAATCATCTCCTCCCTACCAATGCTGTTTATCACTTTCTTCCTTGTTGAAGAGGCCACACAACCAGACAGTACTATGCTTCCTTTTTCCTCCATACACAATAACAGAGAGAGAATATTCTAGGGCATGACTGCCTGGATCCTGGCTGTTGCTATCTTTTGTAGTGGCAGTAAGAAACTCCTTCAGACTAATGAAAATGTCAACGTGCCATTCAATCACGAAAGGTAACGAAAAATGCTCTCATGGTTCAAATAGTCCAATGGCCCATAGTGGCCTAAAAGGCAGCCAGTTGACACCTGGCCATGCTAAGCTTCCTTATACCATCCGCTAATGACTTTCCATTGGGCCCACAATTTACGGATTCATAATTTTAAAAGAGGAGAAGGCCAAGTTAGGTTCATTCCCCTTATTCTGTCAATAAAACAAATCAAACTCATGTCTATCTAACTGCTCAGGGAGGAGCCTTTGCATGAGAAAATTCTCATATTCTAAGACTGAGTCATAGAAATGAGGGTATTACTTTTCTTACTGCAATTAACCTAAACAAAAGCCATATTTTAACAAATAGATATTTGCATGGTACCCTTCATATATTCCAAGCATTTCACTCATTATTCCAGGTAGGTTAAGAGCTTCTGAAGTGTATGAAGTAAAGGTCAGCAATCCTTTGGGGTGAACAGTGGCCTCCTTTGGAGTTTGGGGGTAACCTGAGACTTCCCACCAATGTCCACCTCCATCTGTGTACCTAATTCCTATTACCTAGTTATGGCTCCTCTAGGATCATTTCCAAACACTCTGGATGTCCAGGAAATTTAAATTGTAGCTTTTGACTGAGCTAGTTTTTCCTATTTATATTAATAAATTTTCAAAAATGCTTGAAATCTTCACATTTGCAACAACTTTAGTTTTCATGCACATACAAACACAGAGAGACAAAAATTCCAAACAGACACTCTCCAAAAGCCACCACACTCTTTCACTTGCTCTATAGTCATTTAGCCAACCAGCCATGCAGAGAATATTTAAAACTTAAAGATTGAGACATTATTCTCAGTTTTTGCTGAGGCTTTGTAACGAAATTGAACACTATAAGCAGCTATTGTAGTAATTTTGGTTAAAATTGTTTGCCTGGGATATAGTATTTGAGGCAGAAGCACGTGTGTGAAGGAGGTGAGGTGGTTTGGAAAGAGTGAAGACTCGCAGCCAGATTGAATGTCTGGATAATTACTATAATTCTCCCTTCTTGGTTGAAACCATGTTCTCTCTTGATTTTTAAACCCAGGCTGCCTCTGGAAACAAGCAAACCTGAGTCTTTCTAACCTGAGTCTTCCCAATCATTAGATTTCTTTTCTGTCCTAACGATGAATGATAAAAGGACTTGATGTTCACAATTTGGGGTTATAAGGCAGGTCTGAAATCTGGAGACTCAAGATGCTGGAAGGAGTGGAAAGTTTCGATGACTTTATATGAATCACTTTGCACTCTATGTTTGGCTTGTCCTCTTTGAAACTGATTTACTAAAATAAATGTAAGGGAACTATTACTCCAAAAGATTAACTTGGCAGGAAATACCAATACTTTCAGTTTATGAAAGACAAAACTGTCTTGTTGCTACAGGAAGCTGCAATGTTCCTAACCTTTAAGGTTGGTGTTGAATAGGGTGGTCATGCCCTCCCCTGCAGGTATCTTTAGGCTCCTGTTGACCTCCTGGTACTATAACTGTTCGTCTTCTCTGGGTAGCTATTGATTTTGAACTTTAACATGCTTCAAAACTTTATTCATCAGGGAAATAGGAAAAGAGTTTTGTTACCTGGAGGAAATCTATTGTGATCTACCTGAGCTTTTTAAAAACAGACCAGGAGAAGGAAACCAGTAATTTTTAAAGAAGAGACAGAGAATGGGATAATAGTTTCACCCAGGATCTCTTTCTAACCCTTTCCCTTCAAATGAACTTATTGGAACAGAATTGGAAAGAAGAAAGGACATCTCTGCCCACCCCACAGGATGCCAAAAAGGCTAAAGAATTACCTCTGTAGATTTAAACATCTTTAATGGCTTATGTATAGATTTGCTAATACAGAGAGAAATGAACTATTAAATAAAAATCACATTTTATAATATTTTTATGGCTTAAAACATCCTTTATCTCCTTTTTGTTCTCTCTACATGATATGGTAAGTGATGAGGAAAATTTAGGCTCAGGAAGGTTAAAATCTTTCTTGGAGTTACACATCTAAGAGAGCTGCAGAGCTGACACTTGTACCCAGGTTTTCTGACTGCAAATCCAGTTTCTTTCTATTGCGTTCTTCCCCTTTCCCTGCCTCAAGCAGAAACAGGTTTTTTATTTTCAACCTTTATGTATACAGTATGTTATGTTACATCTACAGCTAAGTTTCTTTTTAGAAGAATGTGAGCCCTTCTAGCTTTGGTTTAGAGTGATTCTAGAAGCCAATTTCCTTGGCTTAGTGATTCTATGCACCTTTCCTAAACTTAGCTTTCTAAGGAAATGAAGTGTACGAGTGAGAATGAATTCACAATTTCGACATGTAGGTAGCATCCTAAAGTGAAAAGAGGAGGAAATTTGTGGTCAAAGCACTCTCCCCACCACTTAGAAACTTACTGACTGTGGGCAGCTTCCTCCTCCAAGTTTCCTTCCTGATTTACAAGACCGTGGTGTGGTCAGGATTAAACTTGAATACATGTAAGGAAGCCTGAAAGTGTCTAACACATAGCGAGTATTCAAATGCCACCTTCTATTTGATCCTTCCCCTCCAGTTCCTTAAGTTTTGGAATCTAGGTTTCTCAGTTCCAAATGGATTGACATTTGCATATCCCCATTGCACAATGGATCAAATAAACTTTATGTTATCATTTCTCCAACATAGTGCCAGTAAGCAAATCCTTTTTAATAACAACAGTATGTTGAGAAACATATCACCAAATAATATTTAACTTTGTAGCTTTGATAAGTTCTTTAGGTTTTGGTTTTGGTTTTGTTTTCTGAGACAGGGTCTTGATCTGTCACCCAGACTGGAGTGCAATGGTTCAATTTTAGCTCACTGCAACCTGTAACTCCTGAGCTCAAGTGATCCTCCCACCTCAGCCTCTCAAGTAGCTGGGACTACAGGTGTGCTCCACCATGCCCAGCGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTAGGGACAAGGTCTCGCTATGTTGCCCAGGCTGGTCTTCAATTCCTGGCCTTAAGTGATCCTCCTGCCTCGGCCTCCCAAAGTGCTGGAATTACAGGCATGAGCCACCACCCATAACTTTATGTTTGTTTTTTTGATGCAGTATAAGTTCAGCTTGCTTCTTATGCAGCCATACCATTTCATGTTAACTCTGATTTTTAGCAGCTTATTACATTAGTGTTTTATTATTAATATAATTTTACAGAAATTTACTAAACCATGACTCTGTAGAGTTTTAATAATACTACCTCCAAACATCATTGCAAACATCTAGAAGAATGAACAAAAATGATCTTAGATCGACAGTATATCTGTTTGTCTTAGTTTCTACACAGGATGTTCAGACATATTCCATTTCTTTAAAAAAAAAATATATATATATATATATATATATAGGCCTGGCACGGTGGCTCATGACTGTAATCCCAGCACTTTGGGAGGCTTAGGCAGGCAAATCACCTGAGGTCAGGAGTTTGAGACTGGCCTGACCAACATGGTGAAACCTCGTCTCTATTAAAATTACAAAAATTAGCCGGGCGTGGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGTTGAGATCACGCCATTGCACTCCAGCCTTGGTGACAAGAGTGAAACTCCGTCTCAAAAAAAAAAAAAATATGTATATATATATATATATATATATATATATATATATATATATATATATAAAATCCCACCAAAAGTCTGCAGAGTGACCAAATTAGACGGCTCTGGTTTCAGATTAAATTCTAAATGTGAGAAACCACATAGCTCCCATGATCCATCCAATAATTCCTCACGTCCTCTTCACTCTTTACTCCATGCATAAAACAGAATTTTTTTTTCTCATCCTGGGTATGAAGCAATTAATAATTTACGGATTTAGCCTATTTGGATTCAATCCCTTCAAACTCCATACTACATCCAAGGTGGAAGTGACTTAAACTCTGATATCAATCATCAGGCTTGTAATATAGGCTTTGTTAATGGCAGGAGAGTCTAATAAAACTTTCTGTTCCTTATCCTTCATTTAAATGAAAAACTTTTTATTGAAAACAATCATAACTCTAGCTCATCATAAATATAATTCATGAGGACATTTTATTATTTTTATATTAAAGAAATAATATTATAGATGTAAACTTTGCACCTTTCTAATTATTATCATGAGTTAAGCTAATACTTGTCTTCTGGTCCCTAGATGATGATTCTTTTTTGCCTTACTGGAGGAGCCCTTGTCTTGAAGTGAGTTGCTTCAACAGCAGAGGACTTCTAGTTTCTCCCAGTTGAGCCTAAAGTGAACTTTTCATCTTCTTCAGAGGAAGGGGCTTCCTTGATTTGTACTTTTGTGGCTCCTCAGATAACACAGACAATTTTATCTTGGATCCCAGGTTCTCTTCACCATTAAGAATAAGAAAGAGAGAAAATGCTGTGCATGACAGCCACCTACTCCAAACTACCCAACCCCCTGTAACCAGGTACCTTCCAACAACGAGATGATTCTGCCCTCACTCAAGAGTCTCCCCCACAAAGATTCCATTCTCCCTTTACTTTTTATTTTTTATTTTTTTGCAAAACAAAGGCCTCCTTTAGTACCTCCTTAGTTTATAACCCTTATCTTCCCAGCTCTTCCCTTCACTGATACCTCTGATTTCAAAAGTTCTGAAGTCGGAAGACCACACAATTTCAGACTGTGAACAGAAATTCAGTCAGAAATTATTGGAGTTAGAAAGAATTTAGAGAAATTGTATTGAACTAGAAAGTCCTCTTTGATTAGTGGTGGCCCTTTAGAAAGTTCTAGGGCAGAGTCCCATGGTGTTTCATCTTTTCCATGATTTGCTTGACCAAAAACTTTTCTTTCACAACATGAAAATATGCTAATCCACCACACATTTTGGATTCTGCTCTGTTTGCCTGAGGTGTTAGATCTCTAGCCAGGACTGTGAAGGGAAGGAACTTGAATCCTTCCTATTGAGCTATTAATGCAGAGTCAGTGAGATGAAGGGTTCCACTCGGGGTCAAAATCATGTCAGTTACCAAGCAAAGGAGCAAGTAAGGGGAAACATCTCCTCATCTGGTTAGTGGAGCCACATTTCACCCACTGATCAAGCCAGACCTGAGCAATAGTCTAGATTTCTCCCTCCACATCTAATTGGTGACAATGGTGATTGTACCACTGACAGGTCACACAAGTCCACACCCTCCTTCACAGCCTCACTGCTTCGGCTCTTGTTTATGCTCTTATCAGCATCTGTCACTAGGATCCATTTGTCTCCTGACTCATCTACTTCCTTTCACTCCCCTGGGCCATCCTTCACATCATGCTAGAAGACTGTGTCTAACTTGCAGAACTTATTGTGTCAATCTCCTGGTTACGGCCCCTCCATGGCTCCCCACCTGACATAGGAGGCCCTTCACAATCTGGCTTCTGTCACTCATAACTTGTCTCCAGCCTTCATTCTTCAGTCTGATTTTATGGTTTTCTAGTTCCCCAATACACCACACCAGTGTTTATGAAACCCTAGTCATTGGCCTACGAACTTTATGATTATTGACATATTCATGTACCACCTGTATTATTTTTTGCATAGTGTTTATTTTTAATTGACTACATTTTTAACTACAATAAAGTAATTTCAACTAAAA SEQ ID NO:138 NRG1之全部13個外顯子1-13按順序排列的多核苷酸序列 SEQ ID NO:139 NRG1之外顯子1的轉譯多肽序列 MGKGRAGRVGTT SEQ ID NO:140 NRG1之外顯子2的轉譯多肽序列 LPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKP SEQ ID NO:141 NRG1之外顯子3的轉譯多肽序列 KSELRINKASLADSGEYMCKVISKLGNDSASANITIVESN SEQ ID NO:142 NRG1之外顯子4的轉譯多肽序列 IITGMPASTEGAYVSS SEQ ID NO:143 NRG1之外顯子5的轉譯多肽序列 SPIRISVSTEGANTSS SEQ ID NO:144 NRG1之外顯子6的轉譯多肽序列 TSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLC SEQ ID NO:145 NRG1之外顯子7的轉譯多肽序列 CPNEFTGDRCQNYVMASFY SEQ ID NO:146 NRG1之外顯子8的轉譯多肽序列 HLGIEFM SEQ ID NO:147 NRG1之外顯子9的轉譯多肽序列 AEELYQKRVLTITGICIALLVVGIMCVVAYCKT SEQ ID NO:148 NRG1之外顯子10的轉譯多肽序列 KQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVN SEQ ID NO:149 NRG1之外顯子11的轉譯多肽序列 QYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSH SEQ ID NO:150 NRG1之外顯子12的轉譯多肽序列 WSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSE SEQ ID NO:151 NRG1之外顯子13的轉譯多肽序列 YVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO:152 NRG1多肽序列 MGKGRAGRVGTTALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKPGKSELRINKASLADSGEYMCKVISKLGNDSASANITIVESNEIITGMPASTEGAYVSSESPIRISVSTEGANTSSSTSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIEFMEAEELYQKRVLTITGICIALLVVGIMCVVAYCKTKKQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO:153 NRG1之全部12個外顯子2-13按順序排列的多核苷酸序列 SEQ ID NO:154 NRG1之外顯子2-13的轉譯多肽序列 LPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKPGKSELRINKASLADSGEYMCKVISKLGNDSASANITIVESNEIITGMPASTEGAYVSSESPIRISVSTEGANTSSSTSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIEFMEAEELYQKRVLTITGICIALLVVGIMCVVAYCKTKKQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO:155 NRG1之全部8個外顯子6-13按順序排列的多核苷酸序列 SEQ ID NO:156 NRG1之外顯子6-13的轉譯多肽序列 TSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIEFMEAEELYQKRVLTITGICIALLVVGIMCVVAYCKTKKQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO:157 NRG1之全部8個外顯子6-13按順序排列的多核苷酸序列(不包括外顯子5的核苷酸CAT) SEQ ID NO:158 NRG1之外顯子5+CAT及外顯子6-13的轉譯多肽序列 STSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIEFMEAEELYQKRVLTITGICIALLVVGIMCVVAYCKTKKQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO:161 NRG1之全部5個外顯子1-5按順序排列的多核苷酸序列(不包括外顯子5的核苷酸CAT) AGTAAGCCTCCGCAGCCCACTCGGACTGCAGCCTGTTTGCCGCCCGTCCTCCCATTGCAGCACTCGGGGCGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCAGCAGCATGGGGAAAGGACGCGCGGGCCGAGTTGGCACCACAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATACAAAAAAAGCCAGGGAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGAATCAAACGAGATCATCACTGGTATGCCAGCCTCAACTGAAGGAGCATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAAGGAGCAAATACTTCTT SEQ ID NO:162 NRG1外顯子1-5的轉譯多肽序列,不包括經轉譯外顯子5的最C端胺基酸(S) MGKGRAGRVGTTALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKPGKSELRINKASLADSGEYMCKVISKLGNDSASANITIVESNEIITGMPASTEGAYVSSESPIRISVSTEGANTS SEQ ID NO:163 根據SEQ ID NO: 152之NRG1序列的EGF樣結構域 HLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASF VTCN1 序列資訊SEQ ID NO:164 VTCN1-NRG1融合物5’處之VTCN1外顯子2序列 CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAG SEQ ID NO:165 VTCN1-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAA SEQ ID NO:166 VTCN1-NRG1多核苷酸序列 CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAGCCTTGCCTCCCCGATTGAAAGAGATGAA SEQ ID NO:167 VTCN1-NRG1多肽序列 IISIIIILAGAIALIIGFGISALPPRLKEM SEQ ID NO:168 VTCN1之外顯子1 GTGAGTCACCAAGGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGCTTCCCTGGGGCAGATCCTCTTCTGGAG SEQ ID NO:169 VTCN1之外顯子2 CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAG SEQ ID NO:170 VTCN1之外顯子3 GGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAG SEQ ID NO:171 VTCN1之外顯子4 CCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAG SEQ ID NO:172 VTCN1之外顯子5 AATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTCGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAG SEQ ID NO:173 VTCN1之外顯子6 GGATCTACAGAACTATTTCACCACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCACAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGTCGGCGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAACAAATGCGGGTTTATTTCTCAGATGATGTTCATCCGTGAATGGTCCAGGGAAGGACCTTTCACCTTGTCTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTCCTTTCCATCCTGCGTGGACAGCTAAGACCTCAGTTTTCAATAGCATCTAGAGCAGTGGGACTCAGCTGGGGTGATTTCGCCCCCCATCTCCGGGGGAATGTCTGAAGACAATTTTGGTTACCTCAATGAGGGAGTGGAGGAGGATACAGTGCTACTACCAACTAGTGGATAGAGGCCAGGGATGCTGCTCAACCTCCTACCATGTACAGGACGTCTCCCCATTACAACTACCCAATCCGAAGTGTCAACTGTGTCAGGGCTAAGAAACCCTGGTTTTGAGTAGAAAAGGGCCTGGAAAGAGGGGAGCCAACAAATCTGTCTGCTTCCTCACATTAGTCATTGGCAAATAAGCATTCTGTCTCTTTGGCTGCTGCCTCAGCACAGAGAGCCAGAACTCTATCGGGCACCAGGATAACATCTCTCAGTGAACAGAGTTGACAAGGCCTATGGGAAATGCCTGATGGGATTATCTTCAGCTTGTTGAGCTTCTAAGTTTCTTTCCCTTCATTCTACCCTGCAAGCCAAGTTCTGTAAGAGAAATGCCTGAGTTCTAGCTCAGGTTTTCTTACTCTGAATTTAGATCTCCAGACCCTGCCTGGCCACAATTCAAATTAAGGCAACAAACATATACCTTCCATGAAGCACACACAGACTTTTGAAAGCAAGGACAATGACTGCTTGAATTGAGGCCTTGAGGAATGAAGCTTTGAAGGAAAAGAATACTTTGTTTCCAGCCCCCTTCCCACACTCTTCATGTGTTAACCACTGCCTTCCTGGACCTTGGAGCCACGGTGACTGTATTACATGTTGTTATAGAAAACTGATTTTAGAGTTCTGATCGTTCAAGAGAATGATTAAATATACATTTCCTACACCA SEQ ID NO:174 VTCN1之全部6個外顯子1-6按順序排列的多核苷酸序列 SEQ ID NO:175 VTCN1之外顯子1的轉譯多肽序列 MASLGQILFW SEQ ID NO:176 VTCN1之外顯子2的轉譯多肽序列 IISIIIILAGAIALIIGFGIS SEQ ID NO:177 VTCN1之外顯子3的轉譯多肽序列 RHSITVTTVASAGNIGEDGILSCTFEPDIKLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKCYIITSKGKGNANLEYKTG SEQ ID NO:178 VTCN1之外顯子4的轉譯多肽序列 FSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVSNTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVT SEQ ID NO:179 VTCN1之外顯子5的轉譯多肽序列 SEIKRRSHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK SEQ ID NO:180 VTCN1多肽序列 MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVSNTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK SEQ ID NO:181 VTCN1之外顯子1及2 GTGAGTCACCAAGGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAG SEQ ID NO:182 VTCN1之外顯子1及2的轉譯多肽序列 MASLGQILFWSIISIIIILAGAIALIIGFGIS CDH1 序列資訊SEQ ID NO: 184 CDH1-NRG1融合物5’處之CDH1外顯子11序列 CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATG SEQ ID NO: 185 CDH1 -NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAA SEQ ID NO: 186 CDH1-NRG1多核苷酸序列 CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATGCCTTGCCTCCCCGATTGAAAGAGATGAAA SEQ ID NO: 187 CDH1-NRG1多肽序列 WLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNALPPRLKEMK SEQ ID NO: 188 CDH1之外顯子1 AGTGGCGTCGGAACTGCAAAGCACCTGTGAGCTTGCGGAAGTCAGTTCAGACTCCAGCCCGCTCCAGCCCGGCCCGACCCGACCGCACCCGGCGCCTGCCCTCGCTCGGCGTCCCCGGCCAGCCATGGGCCCTTGGAGCCGCAGCCTCTCGGCGCTGCTGCTGCTGCTGCAG SEQ ID NO: 189 CDH1之外顯子2 GTCTCCTCTTGGCTCTGCCAGGAGCCGGAGCCCTGCCACCCTGGCTTTGACGCCGAGAGCTACACGTTCACGGTGCCCCGGCGCCACCTGGAGAGAGGCCGCGTCCTGGGCAGAG SEQ ID NO: 190 CDH1之外顯子3 TGAATTTTGAAGATTGCACCGGTCGACAAAGGACAGCCTATTTTTCCCTCGACACCCGATTCAAAGTGGGCACAGATGGTGTGATTACAGTCAAAAGGCCTCTACGGTTTCATAACCCACAGATCCATTTCTTGGTCTACGCCTGGGACTCCACCTACAGAAAGTTTTCCACCAAAGTCACGCTGAATACAGTGGGGCACCACCACCGCCCCCCGCCCCATCAG SEQ ID NO: 191 CDH1之外顯子4 GCCTCCGTTTCTGGAATCCAAGCAGAATTGCTCACATTTCCCAACTCCTCTCCTGGCCTCAGAAGACAGAAGAGAGACTGGGTTATTCCTCCCATCAGCTGCCCAGAAAATGAAAAAGGCCCATTTCCTAAAAACCTGGTTCAG SEQ ID NO: 192 CDH1之外顯子5 ATCAAATCCAACAAAGACAAAGAAGGCAAGGTTTTCTACAGCATCACTGGCCAAGGAGCTGACACACCCCCTGTTGGTGTCTTTATTATTGAAAGAGAAACAGGATGGCTGAAGGTGACAGAGCCTCTGGATAGAGAACGCATTGCCACATACACT SEQ ID NO: 193 CDH1之外顯子6 CTCTTCTCTCACGCTGTGTCATCCAACGGGAATGCAGTTGAGGATCCAATGGAGATTTTGATCACGGTAACCGATCAGAATGACAACAAGCCCGAATTCACCCAGGAGGTCTTTAAGGGGTCTGTCATGGAAGGTGCTCTTCCAG SEQ ID NO: 194 CDH1之外顯子7 GAACCTCTGTGATGGAGGTCACAGCCACAGACGCGGACGATGATGTGAACACCTACAATGCCGCCATCGCTTACACCATCCTCAGCCAAGATCCTGAGCTCCCTGACAAAAATATGTTCACCATTAACAGGAACACAGGAGTCATCAGTGTGGTCACCACTGGGCTGGACCGAGAG SEQ ID NO: 195 CDH1之外顯子8 AGTTTCCCTACGTATACCCTGGTGGTTCAAGCTGCTGACCTTCAAGGTGAGGGGTTAAGCACAACAGCAACAGCTGTGATCACAGTCACTGACACCAACGATAATCCTCCGATCTTCAATCCCACCACG SEQ ID NO: 196 CDH1之外顯子9 TACAAGGGTCAGGTGCCTGAGAACGAGGCTAACGTCGTAATCACCACACTGAAAGTGACTGATGCTGATGCCCCCAATACCCCAGCGTGGGAGGCTGTATACACCATATTGAATGATGATGGTGGACAATTTGTCGTCACCACAAATCCAGTGAACAACGATGGCATTTTGAAAACAGCAAAGGTTTGTATGGTACCTGGCAAGATGCAGAAACTGGCATCCTCACAGCTGTTCATACCCTTGTCCCCTG SEQ ID NO: 197 CDH1之外顯子10 GGCTTGGATTTTGAGGCCAAGCAGCAGTACATTCTACACGTAGCAGTGACGAATGTGGTACCTTTTGAGGTCTCTCTCACCACCTCCACAGCCACCGTCACCGTGGATGTGCTGGATGTGAATGAAGCCCCCATCTTTGTGCCTCCTGAAAAGAGAGTGGAAGTGTCCGAGGACTTTGGCGTGGGCCAGGAAATCACATCCTACACTGCCCAGGAGCCAGACACATTTATGGAACAGAAAATAAC SEQ ID NO: 198 CDH1之外顯子11 ATATCGGATTTGGAGAGACACTGCCAACTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATG SEQ ID NO: 199 CDH1之外顯子12 GTTCTCCAGTTGCTACTGGAACAGGGACACTTCTGCTGATCCTGTCTGATGTGAATGACAACGCCCCCATACCAGAACCTCGAACTATATTCTTCTGTGAGAGGAATCCAAAGCCTCAGGTCATAAACATCATTGATGCAGACCTTCCTCCCAATACATCTCCCTTCACAGCAGAACTAACACACGGGGCGAGTGCCAACTGGACCATTCAGTACAACGACCCAA SEQ ID NO: 200 CDH1之外顯子13 CCCAAGAATCTATCATTTTGAAGCCAAAGATGGCCTTAGAGGTGGGTGACTACAAAATCAATCTCAAGCTCATGGATAACCAGAATAAAGACCAAGTGACCACCTTAGAGGTCAGCGTGTGTGACTGTGAAGGGGCCGCTGGCGTCTGTAGGAAGGCACAGCCTGTCGAAGCAGGATTGCAAATTCCTGCCATTCTGGGGATTCTTGGAGGAATTCTTGCTTTGCTAA SEQ ID NO: 201 CDH1之外顯子14 TTCTGATTCTGCTGCTCTTGCTGTTTCTTCGGAGGAGAGCGGTGGTCAAAGAGCCCTTACTGCCCCCAGAGGATGACACCCGGGACAACGTTTATTACTATGATGAAGAAGGAGGCGGAGAAGAGGACCAG SEQ ID NO: 202 CDH1之外顯子15 GACTTTGACTTGAGCCAGCTGCACAGGGGCCTGGACGCTCGGCCTGAAGTGACTCGTAACGACGTTGCACCAACCCTCATGAGTGTCCCCCGGTATCTTCCCCGCCCTGCCAATCCCGATGAAATTGGAAATTTTATTGATGAA SEQ ID NO: 203 CDH1之外顯子16 AATCTGAAAGCGGCTGATACTGACCCCACAGCCCCGCCTTATGATTCTCTGCTCGTGTTTGACTATGAAGGAAGCGGTTCCGAAGCTGCTAGTCTGAGCTCCCTGAACTCCTCAGAGTCAGACAAAGACCAGGACTATGACTACTTGAACGAATGGGGCAATCGCTTCAAGAAGCTGGCTGACATGTACGGAGGCGGCGAGGACGACTAGGGGACTCGAGAGAGGCGGGCCCCAGACCCATGTGCTGGGAAATGCAGAAATCACGTTGCTGGTGGTTTTTCAGCTCCCTTCCCTTGAGATGAGTTTCTGGGGAAAAAAAAGAGACTGGTTAGTGATGCAGTTAGTATAGCTTTATACTCTCTCCACTTTATAGCTCTAATAAGTTTGTGTTAGAAAAGTTTCGACTTATTTCTTAAAGCTTTTTTTTTTTTCCCATCACTCTTTACATGGTGGTGATGTCCAAAAGATACCCAAATTTTAATATTCCAGAAGAACAACTTTAGCATCAGAAGGTTCACCCAGCACCTTGCAGATTTTCTTAAGGAATTTTGTCTCACTTTTAAAAAGAAGGGGAGAAGTCAGCTACTCTAGTTCTGTTGTTTTGTGTATATAATTTTTTAAAAAAAATTTGTGTGCTTCTGCTCATTACTACACTGGTGTGTCCCTCTGCCTTTTTTTTTTTTTTAAGACAGGGTCTCATTCTATCGGCCAGGCTGGAGTGCAGTGGTGCAATCACAGCTCACTGCAGCCTTGTCCTCCCAGGCTCAAGCTATCCTTGCACCTCAGCCTCCCAAGTAGCTGGGACCACAGGCATGCACCACTACGCATGACTAATTTTTTAAATATTTGAGACGGGGTCTCCCTGTGTTACCCAGGCTGGTCTCAAACTCCTGGGCTCAAGTGATCCTCCCATCTTGGCCTCCCAGAGTATTGGGATTACAGACATGAGCCACTGCACCTGCCCAGCTCCCCAACTCCCTGCCATTTTTTAAGAGACAGTTTCGCTCCATCGCCCAGGCCTGGGATGCAGTGATGTGATCATAGCTCACTGTAACCTCAAACTCTGGGGCTCAAGCAGTTCTCCCACCAGCCTCCTTTTTATTTTTTTGTACAGATGGGGTCTTGCTATGTTGCCCAAGCTGGTCTTAAACTCCTGGCCTCAAGCAATCCTTCTGCCTTGGCCCCCCAAAGTGCTGGGATTGTGGGCATGAGCTGCTGTGCCCAGCCTCCATGTTTTAATATCAACTCTCACTCCTGAATTCAGTTGCTTTGCCCAAGATAGGAGTTCTCTGATGCAGAAATTATTGGGCTCTTTTAGGGTAAGAAGTTTGTGTCTTTGTCTGGCCACATCTTGACTAGGTATTGTCTACTCTGAAGACCTTTAATGGCTTCCCTCTTTCATCTCCTGAGTATGTAACTTGCAATGGGCAGCTATCCAGTGACTTGTTCTGAGTAAGTGTGTTCATTAATGTTTATTTAGCTCTGAAGCAAGAGTGATATACTCCAGGACTTAGAATAGTGCCTAAAGTGCTGCAGCCAAAGACAGAGCGGAACTATGAAAAGTGGGCTTGGAGATGGCAGGAGAGCTTGTCATTGAGCCTGGCAATTTAGCAAACTGATGCTGAGGATGATTGAGGTGGGTCTACCTCATCTCTGAAAATTCTGGAAGGAATGGAGGAGTCTCAACATGTGTTTCTGACACAAGATCCGTGGTTTGTACTCAAAGCCCAGAATCCCCAAGTGCCTGCTTTTGATGATGTCTACAGAAAATGCTGGCTGAGCTGAACACATTTGCCCAATTCCAGGTGTGCACAGAAAACCGAGAATATTCAAAATTCCAAATTTTTTTCTTAGGAGCAAGAAGAAAATGTGGCCCTAAAGGGGGTTAGTTGAGGGGTAGGGGGTAGTGAGGATCTTGATTTGGATCTCTTTTTATTTAAATGTGAATTTCAACTTTTGACAATCAAAGAAAAGACTTTTGTTGAAATAGCTTTACTGTTTCTCAAGTGTTTTGGAGAAAAAAATCAACCCTGCAATCACTTTTTGGAATTGTCTTGATTTTTCGGCAGTTCAAGCTATATCGAATATAGTTCTGTGTAGAGAATGTCACTGTAGTTTTGAGTGTATACATGTGTGGGTGCTGATAATTGTGTATTTTCTTTGGGGGTGGAAAAGGAAAACAATTCAAGCTGAGAAAAGTATTCTCAAAGATGCATTTTTATAAATTTTATTAAACAATTTTGTTAAA SEQ ID NO: 204 CDH1之全部16個外顯子1-16按順序排列的多核苷酸序列 SEQ ID NO: 205 CDH1之外顯子10的轉譯多肽序列 MEQKI SEQ ID NO: 206 CDH1之外顯子11的轉譯多肽序列 YRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDN SEQ ID NO: 207 CDH1之外顯子12的轉譯多肽序列 SPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTSPFTAELTHGASANWTIQYNDP SEQ ID NO: 208 CDH1之外顯子13的轉譯多肽序列 QESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALL SEQ ID NO: 209 CDH1之外顯子14的轉譯多肽序列 LILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQ SEQ ID NO: 210 CDH1之外顯子15的轉譯多肽序列 DFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDE SEQ ID NO: 211 CDH1之外顯子16的轉譯多肽序列 NLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD SEQ ID NO: 212 CDH1多肽序列 MEQKITYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNGSPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTSPFTAELTHGASANWTIQYNDPTQESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALLILILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQDFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDENLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD SEQ ID NO: 213 CDH1之全部11個外顯子1-11按順序排列的多核苷酸序列 SEQ ID NO: 214 外顯子1-11的轉譯多肽序列 MEQKIYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDN CXADR 序列資訊SEQ ID NO: 215 CXADR-NRG1融合物5’處之CXADR外顯子1序列 ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGG SEQ ID NO: 216 CXADR-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO: 217 CXADR-NRG1多核苷酸序列 ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO: 218 CXADR-NRG1多肽序列 MALLLCFVLLCGVVALPPRLKEMKSQESAAGSK SEQ ID NO:219 CXADR之外顯子1 AGTCGGGAGCGCGCGAGGCGCGGGGAGCCTGGGACCAGGAGCGAGAGCCGCCTACCTGCAGCCGCCGCCCACGGCACGGCAGCCACCATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGG SEQ ID NO:220 CXADR之外顯子2 ATTTCGCCAGAAGTTTGAGTATCACTACTCCTGAAGAGATGATTGAAAAAGCCAAAGGGGAAACTGCCTATCTGCCATGCAAATTTACGCTTAGTCCCGAAGACCAGGGACCGCTGGACATCGAGTGGCTGATATCACCAGCTGATAATCAGAAGGTGGATCAAGTG SEQ ID NO:221 CXADR之外顯子3 ATTATTTTATATTCTGGAGACAAAATTTATGATGACTACTATCCAGATCTGAAAGGCCGAGTACATTTTACGAGTAATGATCTCAAATCTGGTGATGCATCAATAAATGTAACGAATTTACAACTGTCAGATATTGGCACATATCAGTGCAAAGTGAAAAAAGCTCCTGGTGTTGCAAATAAGAAGATTCATCTGGTAGTTCTTG SEQ ID NO:222 CXADR之外顯子4 TTAAGCCTTCAGGTGCGAGATGTTACGTTGATGGATCTGAAGAAATTGGAAGTGACTTTAAGATAAAATGTGAACCAAAAGAAGGTTCACTTCCATTACAGTATGAGTGGCAAAAATTGTCTGACTCACAGAAAATGCCCACTTCATGGTTAGCAG SEQ ID NO:223 CXADR之外顯子5 GGAAGATGTGCCACCTCCAAAGAGCCGTACGTCCACTGCCAGAAGCTACATCGGCAGTAATCATTCATCCCTGGGGTCCATGTCTCCTTCCAACATGGAAGGATATTCCAAGACTCAGTATAACCAAGTACCAAGTGAAGACTTTGAACGCACTCCTCAGAGTCCGACTCTCCCACCTGCTAAGGTAGCTGCCCCTAATCTAAGTCGAATGGGTGCGATTCCTGTGATGATTCCAGCACAGAGCAAGGATGGGTCTATAGTATAGAGCCTCCATATGTCTCATCTGTGCTCTCCGTGTTCCTTTCCTTTTTTTGATATATGAAAACCTATTCTGGTCTAAATTGTGTTACTAGCCTCAAAATACATCAAAAAATAAGTTAATCAGGAACTGTACGGAATATATTTTTAAAAATTTTTGTTTGGTTATATCGAAATAGTTACAGGCACTAAAGTTAGTAAAGAAAAGTTTACCATCTGAAAAAGCTGGATTTTCTTTAAGAGGTTGATTATAAAGTTTTCTAAATTTATCAGTACCTAAGTAAGATGTAGCGCTTTGAATATGAAATCATAGGTGAAGACATGGGTGAACTTACTTGCATACCAAGTTGATACTTGAATAACCATCTGAAAGTGGTACTTGATCATTTTTACCATTATTTTTAGGATGTGTATTTCATTTATTTATGGCCCACCAGTCTCCCCCAAATTAGTACAGAAATATCCATGACAAAATTACTTACGTATGTTTGTACTTGGTTTTACAGCTCCTTTGAAAACTCTGTGTTTGGAATATCTCTAAAAACATAGAAAACACTACAGTGGTTTAGAAATTACTAATTTTACTTCTAAGTCATTCATAAACCTTGTCTATGAAATGACTTCTTAAATATTTAGTTGATAGACTGCTACAGGTAATAGGGACTTAGCAAGCTCTTTTATATGCTAAAGGAGCATCTATCAGATTAAGTTAGAACATTTGCTGTCAGCCACATATTGAGATGACACTAGGTGCAATAGCAGGGATAGATTTTGTTGGTGAGTAGTCTCATGCCTTGAGATCTGTGGTGGTCTTCAAAATGGTGGCCAGCCAGATCAAGGATGTAGTATCTCATAGTTCCCAGGTGATATTTTTCTTATTAGAAAAATATTATAACTCATTTGTTGTTTGACACTTATAGATTGAAATTTCCTAATTTATTCTAAATTTTAAGTGGTTCTTTGGTTCCAGTGCTTTATGTTGTTGTTGTTTTTGGATGGTGTTACATATTATATGTTCTAGAAACATGTAATCCTAAATTTACCCTCTTGAATATAATCCCTGGATGATATTTTTTATCATAAATGCAGAATAATCAAATACATTTTAAGCAAGTTAAGTGTCCTCCATCAATTCTGTATTCCAGACTTGGGAGGATGTACAGTTGCTGTTGTGTGATCAAACATGTCTCTGTGTAGTTCCAGCAAATCAAGCTGAGCTTTGAAAAAGTTTGTCTTAGTTTTGTGAAGGTGATTTATTCTTAAAAAAAAAAAAGAAAGAAAAAGAAAAAAAGATAAGAAGGAGGAGTAAAGGGACTACTCCTCCTTGCCAAATGTGCTAAATATCATTTTAGGAGAAGAAAGTGGGTTTATTGTATTTCCCTTAAGATTGTGAGGGAGTGTGGATACAGTAGAATGAGCCAACAGTTTCTTTATAATAAATACGGTCTGCAATAAATTATTTCACTAGCTCTAAAACCTTTCCCTAGATTTTAGTAGGGAGTTGGTTTCTGTTAATATCTTTGGGTGCTGTGGTGGTAAATGCTATATTATGAACGGTGGCATGTATTTACAGTTAGAGTATTGTGTGTACACTTTTTAATGGTAAACTTAAGCTGAATGTGTAATGGATTTGTGTATAGTTTTACATATTTGGAAGCATTTTAAAAACAGGTTTTAACCTTATGTAAAATTACTTTTATACTCGTGTTAACATTTTCATCTGTGCCTTTTGGTAATTTAATTTCTATTATGAATTTCTGGTGCCTATGAGCTAGCTATCACCTACCTGAAAGGTGCTTAGAGGTGAAGGTACTGTTTCTAAAAACACATCACTGTGATACCTTTCTATCCTCACATTTTCAAGCTTGCCTCTTTTCTGTTCTTTGTGGATATAACTTAAGCAATTGTGTTATTCATAAAGGTTTAGAAATTTCAATATTCCCAACACTCTATGTTTCTGATTTTATAACAGTAGCCATTTTTGAAAGTCAGATGTTTGGCCTGTTTTATATGAATAAAGTTTATTTATAAAATATTATAAAAAATAAGTAAATAAACAGAACATTAATAATAAAGTTTTGGTTCTTCCTATTCCTGACTTTCATATAATGAAAATTATCCTATTGATCTAAGTAGAAGTTATCATAGAAAGTGGACACGTATAAGACTTTCCTTCCTTTTTTTTTTTAATAACATATGAGGAACAAGACTTCTCTTCCCATATACTTCATATTTTAGAGGACATTGTTTTAAAGGCTTATGTCTCACTGTAAAATTCTGTCAGCCAAATAGTACCAATACGTTTTCAAGTAGTTCTCACTGATAATTTAGTTGAACCAGAGATCAAATATTTGCTCCCGAATTACTACTGGTAATCAAGTAGTTGAACAAAAAATTACTAAAGCATTTCCGTTAGATCAGTCAAGGACAGTACTGCATCTTTTTTTTTTTTTTTTTTTTTAAGACGGAGTCTCGGTCTGTCACCCAGGCTGGAGTGCAGTGGCGGGATCTCGGCTCACTGCAAGCTCCGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGCAGCTGGGACTACAGGCTCCCATCACCACGCTCGGCTAAGTTTTTGTAATTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAACCACTGCACCCGGCCCAGTACTGCATCTTAACAGCAAAGCCATTTTATTCTACTTTATAACTGAGAGACTTGATACCATCCATCTCTTTAGGTTACAGAGGATAATTTGAAGAGAAATGTTACTGTAGAATATATAGTTCTGTACTTTTTTTTTTTTTTTTTAAGAGATAGGGTTTCACTATTGCTCAGGCTGGTCTCAAACTGCTGGGCTCAGGAGATCCTCCTGCCTTGGCCTCCCAAATTGCTGGGATTACAGGTGTGAGCCGCAGCATCCAGCCAGTTCTGTACTTTGAATATGGAGTAGTTTACAGCTATTTTTTTTTCTTACTGGTAATCTTAACTAATATGATTCCCTTGTTAGAGAGCCTCTCACTCCCCCACCCCCAAAAATGTCTACTATTCATGACAGTAACCAATTATTCTGGACAAATTGCTTCTTTTTAATTTGAGCTATCTGCCATGGACTTTCTAAAATGGAAACACAGCCTGAGTGTATCTTAGGGAGAGTTTGATTGAAAAAATCCAAATCACTATCCATATAGATCATGGATATAAAGAGATACCTGATTTTTATTAAAAAGATACTTTTTCAAATTTAAGAGTTAATCTTGGAAATTTGGAACAAGTAAAGGGGCAAGTAAACCTTTTGATGAAATATAAAAGGAACTCATTGCATGAAGTTGACTATCAAATTCTGTGATGTGTGGCTTCTTAAAAATATTCTCAGTGTCTTTTGTGTGCGTGCAGCATGTACATTTGATGTTATGTGAATGTTGAGTTTTTTCTTCTAATTTTCACTTCAGCAGTGTTTAGGGCTTTCAGATGCCTTATTCCAGTGTGAACAGAAAAAGTTCATATTTTATGTGGTTAATGCTTTGATGTGTCACATAAAGAGTAGTTTGTAGAAAATGTTGGCACAATTTTAACTTCTTAGTGGCTTGTGACATTATATATTATATATATATATGTACATATATCTTTATAACATTCCTGTGTTTAGTAGTGTAAATGTTCTGGGCAAGTTTTAATATTTTGAATGCCTTTGGATATTCCAGCAATAAAGGCATCATGTTCTGCAATAGGATTTCTTACTCATTTACCTATTTTAACACTAAAATAGACCACAACTGAGCACAAATTCCTTTTATAAATGTTATAGAAGCAGGGAAGAATAATAAACACATTTGTGAATTGTGGTTCAGTTTATTTATCTTTAGGGAAGGCTGATCATTTATCTTATAGCAGATAACCCCAGCCTCTTATTCATTATGGTTAACTTTTATAATTTATCTTATTTTATAATTTAAGAATATAGTACATATCAGTTGGGTTTGGTTTTGGTCATCGAGACTAAAAGCTCCATCAAAACAGAACTTTGTGTTTTCTGCTAACTTATTTAATGACACAAGTTTTAAGAGAACCACAATTCATTGATTCACTTATTCTTTTCCCTAATTGTGAATTTTAGTGATAAATACACCTGTACTACTGAGGAAAATATTCTGACACTTCACGTGTGCAAAGTATAGAACTGACAGTGTCAGTTTCAGATTTTGTATGTACGATTTCTGGCTTATATATCCAATGGTGCAGATTTTGAAATTTGTAAGAACAAAATTTGTTAAGAAAAACAACTTGCTCTAGTTTTGTGACCTTGTGTACTTTTGAAATAAAATCAAGAAAGCAGTTCTCTGCCTC SEQ ID NO:224 CXADR之全部5個外顯子1-5按順序排列的多核苷酸序列 SEQ ID NO:225 CXADR之外顯子1的轉譯多肽序列 MALLLCFVLLCGVV SEQ ID NO:226 CXADR之外顯子2的轉譯多肽序列 FARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQV SEQ ID NO:227 CXADR之外顯子3的轉譯多肽序列 IILYSGDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVL SEQ ID NO:228 CXADR之外顯子4的轉譯多肽序列 KPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPTSWLA SEQ ID NO:229 CXADR之外顯子5的轉譯多肽序列 KMCHLQRAVRPLPEATSAVIIHPWGPCLLPTWKDIPRLSITKYQVKTLNALLRVRLSHLLR SEQ ID NO:230 CDAXR多肽序列 MALLLCFVLLCGVVDFARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQVIILYSGDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVLVKPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPTSWLAGKMCHLQRAVRPLPEATSAVIIHPWGPCLLPTWKDIPRLSITKYQVKTLNALLRVRLSHLLR GTF2E2 序列資訊SEQ ID NO: 231 GTF2E2-NRG1融合物5’處之GTF2E2外顯子2序列 GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTG SEQ ID NO: 232 GTF2E2-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGA SEQ ID NO: 233 GTF2E2-NRG1多核苷酸序列 GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGA SEQ ID NO: 234 GTF2E2-NRG1多肽序列 ELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNSALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLRFKWFKNGN SEQ ID NO:235 GTF2E2之外顯子1 ACTGAGCTCCTAGCACCCGATCGGGAAGTGGCGGGCGGAGTCCCGGGTCCAGTCGCCGCCTCAGCTACCGCCGCTGCCGCCGCCGCCGCCGCCACCGCCAGTGGTGAGACCCCGACCTGGCGGGTCAGCGCTGGGCGTGCGTGCGGGCAGGCGGGGGCGCTGACGAGAAGCAGGAAGAGGGTGCAGTGCCGGCGTGGGCGGCCGGCCGAGGCGGAGGCGCAGGAAGGGGGCGGCGAGTCGTGCGAGGCTGCCCTTCTCACTCAG SEQ ID NO:236 GTF2E2之外顯子2 CATTATGGATCCAAGCCTGTTGAGAGAAAGGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTG SEQ ID NO:237 GTF2E2之外顯子3 ATCATAGCAATGGATCATTTAACTTGAAAGCTTTGTCAGGAAGCTCTGGATATAAGTTTGGTGTTCTTGCTAAGATTGTGAATTACATGAAG SEQ ID NO:238 GTF2E2之外顯子4 ACACGGCATCAGCGAGGAGATACGCATCCTCTAACCTTAGATGAAATTTTGGATGAAACACAACATTTAGATATTGGACTCAAGCAGAAACAATGGCTAATGACTGAG SEQ ID NO:239 GTF2E2之外顯子5 GCTTTAGTCAACAATCCCAAAATTGAAGTAATAGATGGGAAGTATGCTTTCAAGCCCAAGTACAACGTGAGAGATAAGAAGGCCCTACTTAGGCTCTTAGATCAGCATGACCAGCGAGGATTAGGAGGAATTCTTTTAGAAGACATAGAAGAAGCACTGCCCAATTCCCAGAAAGCTGTCAAG SEQ ID NO:240 GTF2E2之外顯子6 GCTTTGGGGGACCAGATACTATTTGTAAATCGTCCCGATAAGAAGAAAATACTTTTCTTCAATGATAAGAGCTGTCAGTTTTCTGTGGATGAAG SEQ ID NO:241 GTF2E2之外顯子7 AATTTCAGAAACTGTGGAGGAGTGTCACTGTAGATTCCATGGACGAGGAGAAAATTGAAGAATATCTGAAGCGACAGGGTATTTCTTCCATGCAGGAATCTGGACCAAAGAAAGTG SEQ ID NO:242 GTF2E2之外顯子8 GCCCCTATTCAGAGAAGGAAAAAGCCTGCTTCACAGAAAAAGCGACGCTTTAAGACTCATAACGAACACTTGGCTGGAGTGCTGAAGGATTACTCTGACATTACTTCCAGCAAATAGGGAACAGTTTTGCCCTGGAACAGAGTTACAGATACACAATCAAGAGTGTTCTTGCTGATGCTCGGGGTCTGAAGACTGTCTTCCTATCTGCTTCTTGCGGCTGAGGAGAGGAGCAGTTCAGTTTACAAAACAAGTGCAAATTACCAAACTCAAAGCTTATTTGAGTAGAATGGGCTCATGGGCAATGTGATGTTCCCTGTTAACCTTCTGTTACTCCCTGGGAGAAAGGCGCTGAGCGTGGCATGCAGGTGTCTTTGCTGTGTTTTTCTCCACTTCTAAATGGTTCCTGGTTCCTTTCTTCCTCGTTTGTTACTTTAGAGCAAGTTTGCCCATAGTCTTGAATGCAATATTTGTTTATTCCAAAAGAACATATTTATAATAAAATCACTGTAGAAGGATTTTTAAGATGTTAGTGAATTCTGTTTCTTTTCATTCTCGGAAATGGCAGGAAGCAGCTCCAGTCTCTGATTTCCATGGGTCACGTGCTGGGGATGTGATGAAGCCTGCAGTCTGCACTGTGTTGCTGAGCACATGGATTTCACCACTGGAACACAGGTGTGCTGCTTGTTAGCAAGCAGAGCAATAAAGATGTGCTGGATGTCA SEQ ID NO:243 GTF2E2之全部8個外顯子1-8按順序排列的多核苷酸序列 SEQ ID NO:244 GTF2E2之外顯子2的轉譯多肽序列 MDPSLLRERELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNS SEQ ID NO:245 GTF2E2之外顯子3的轉譯多肽序列 HSNGSFNLKALSGSSGYKFGVLAKIVNYMK SEQ ID NO:246 GTF2E2之外顯子4的轉譯多肽序列 TRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTE SEQ ID NO:247 GTF2E2之外顯子5的轉譯多肽序列 ALVNNPKIEVIDGKYAFKPKYNVRDKKALLRLLDQHDQRGLGGILLEDIEEALPNSQKAVK SEQ ID NO:248 GTF2E2之外顯子6的轉譯多肽序列 ALGDQILFVNRPDKKKILFFNDKSCQFSVDE SEQ ID NO:249 GTF2E2之外顯子7的轉譯多肽序列 FQKLWRSVTVDSMDEEKIEEYLKRQGISSMQESGPKKV SEQ ID NO:250 GTF2E2之外顯子8的轉譯多肽序列 APIQRRKKPASQKKRRFKTHNEHLAGVLKDYSDITSSK SEQ ID NO:251 全蛋白質序列 MDPSLLRERELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNSDHSNGSFNLKALSGSSGYKFGVLAKIVNYMKTRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTEALVNNPKIEVIDGKYAFKPKYNVRDKKALLRLLDQHDQRGLGGILLEDIEEALPNSQKAVKALGDQILFVNRPDKKKILFFNDKSCQFSVDEEFQKLWRSVTVDSMDEEKIEEYLKRQGISSMQESGPKKVAPIQRRKKPASQKKRRFKTHNEHLAGVLKDYSDITSSK SEQ ID NO:252 外顯子1-2 ACTGAGCTCCTAGCACCCGATCGGGAAGTGGCGGGCGGAGTCCCGGGTCCAGTCGCCGCCTCAGCTACCGCCGCTGCCGCCGCCGCCGCCGCCACCGCCAGTGGTGAGACCCCGACCTGGCGGGTCAGCGCTGGGCGTGCGTGCGGGCAGGCGGGGGCGCTGACGAGAAGCAGGAAGAGGGTGCAGTGCCGGCGTGGGCGGCCGGCCGAGGCGGAGGCGCAGGAAGGGGGCGGCGAGTCGTGCGAGGCTGCCCTTCTCACTCAGCATTATGGATCCAAGCCTGTTGAGAGAAAGGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTG CSMD1 序列資訊SEQ ID NO: 253 CSMD1-NRG1融合物5’處之CSMD1外顯子23序列 ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCA SEQ ID NO: 254 CSMD1-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT SEQ ID NO: 255 CSMD1-NRG1多核苷酸序列 ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT SEQ ID NO: 256 CSMD1-NRG1多肽序列 ILNSTSNHLWLEFNTNGSDTDQGFQLTYTTTSTSTTGTSHLVKCAEKEKT SEQ ID NO:257 CSMD1之外顯子1 CTCCACGGCAGCGGCTCCTTGTGCCACTAGCAGCCCTTCTTCTGCGCTCTCCGCCTTTTCTCTCTAGACTGGATCTCTCCTCCCCCCCGCGCCCCCCTCCCCGCATCTCCCACTCGCTGGCTCTCTCTCCAGCTGCCTCCTCTCCAGGTCTCTCCTGGCTGCGCGCGCTCCTCTCCCCGCTTCTCCCCCTCCCGCAGCCTCGCCGCCTTGGTGCCTTCCTGCCCGGCTCGGCCGGCGCTCGTCCCCGGCCCCGGCCCCGCCAGCCCGGGTCTCCGCGCTCGGAGCAGCTCAGCCCTGCAGTGGCTCGGGACCCGATGCTATGAGAGGGAAGCGAGCCGGGCGCCCAGACCTTCAGGAGGCGTCGGATGCGCGGCGGGTCTTGGGACCGGGCTCTCTCTCCGGCTCGCCTTGCCCTCGGGTGATTATTTGGCTCCGCTCATAGCCCTGCCTTCCTCGGAGGAGCCATCGGTGTCGCGTGCGTGTGGAGTATCTGCAGACATGACTGCGTGGAGGAGATTCCAGTCGCTGCTCCTGCTTCTCGGGCTGCTGGTGCTGTGCGCGAGGCTCCTCACTGCAGCGAAGG SEQ ID NO:258 CSMD1之外顯子2 GTCAGAACTGTGGAGGCTTAGTCCAGGGTCCCAATGGCACTATTGAGAGCCCAGGGTTTCCTCACGGGTATCCGAACTATGCCAACTGCACCTGGATCATCATCACGGGCGAGCGCAATAGGATACAGTTGTCCTTCCATACCTTTGCTCTTGAAGAAGATTTTGATATTTTATCAGTTTACGATGGACAGCCTCAACAAGGGAATTTAAAAGTGAG SEQ ID NO:259 CSMD1之外顯子3 ATTATCGGGATTTCAGCTGCCCTCCTCTATAGTGAGTACAGGATCTATCCTCACTCTGTGGTTCACGACAGACTTCGCTGTGAGTGCCCAAGGTTTCAAAGCATTATATGAAG SEQ ID NO:260 CSMD1之外顯子4 TTTTACCTAGCCACACTTGTGGAAATCCTGGAGAAATCCTGAAAGGAGTTCTGCATGGAACGAGATTCAACATAGGAGACAAAATCCGGTACAGCTGCCTCCCTGGCTACATCTTGGAAGGCCACGCCATCCTGACCTGCATCGTCAGCCCAGGAAATGGTGCATCGTGGGACTTCCCAGCTCCCTTTTGCAGAG SEQ ID NO:261 CSMD1之外顯子5 CTGAGGGAGCCTGCGGAGGAACCTTACGCGGGACCAGCAGCTCCATCTCCAGCCCGCACTTCCCTTCAGAGTACGAGAACAACGCGGACTGCACCTGGACCATTCTGGCTGAGCCCGGGGACACCATTGCGCTGGTCTTCACTGACTTTCAGCTAGAAGAAGGATATGATTTCTTAGAGATCAGTGGCACGGAAGCTCCATCCATATG SEQ ID NO:262 CSMD1之外顯子6 GCTAACTGGCATGAACCTCCCCTCTCCAGTTATCAGTAGCAAGAATTGGCTACGACTCCATTTCACCTCTGACAGCAACCACCGACGCAAAGGATTTAACGCTCAGTTCCAAG SEQ ID NO:263 CSMD1之外顯子7 TGAAAAAGGCGATTGAGTTGAAGTCAAGAGGAGTCAAGATGCTGCCCAGCAAGGATGGAAGCCATAAAAACTCTGTCT SEQ ID NO:264 CSMD1之外顯子8 TGAGCCAAGGAGGTGTTGCATTGGTCTCTGACATGTGTCCAGATCCTGGGATTCCAGAAAATGGTAGAAGAGCAGGTTCCGACTTCAG SEQ ID NO:265 CSMD1之外顯子9 GGTTGGTGCAAATGTACAGTTTTCATGTGAGGACAATTACGTGCTCCAGGGATCTAAAAGCATCACCTGTCAGAGAGTTACAGAGACGCTCGCTGCTTGGAGTGACCACAGGCCCATCTGCCGAG SEQ ID NO:266 CSMD1之外顯子10 CGAGAACATGTGGATCCAATCTGCGTGGGCCCAGCGGCGTCATTACCTCCCCTAATTATCCGGTTCAGTATGAAGATAATGCACACTGTGTGTGGGTCATCACCACCACCGACCCGGACAAG SEQ ID NO:267 CSMD1之外顯子11 GTCATCAAGCTTGCCTTTGAAGAGTTTGAGCTGGAGCGAGGCTATGACACCCTGACGGTTGGTGATGCTGGGAAGGTGGGAGACACCAGATCGGTCTTGTACGT SEQ ID NO:268 CSMD1之外顯子12 GCTCACGGGATCCAGTGTTCCTGACCTCATTGTGAGCATGAGCAACCAGATGTGGCTACATCTGCAGTCGGATGATAGCATTGGCTCACCTGGGTTTAAAGCTGTTTACCAAG SEQ ID NO:269 CSMD1之外顯子13 AAATTGAAAAGGGAGGGTGTGGGGATCCTGGAATCCCCGCCTATGGGAAGCGGACGGGCAGCAGTTTCCTCCATGGAGATACACTCACCTTTGAATGCCCGGCGGCCTTTGAGCTGGTGGGGGAGAGAGTTATCACCTGTCAGCAGAACAATCAGTGGTCTGGCAACAAGCCCAGCTGTGTAT SEQ ID NO:270 CSMD1之外顯子14 TTTCATGTTTCTTCAACTTTACGGCATCATCTGGGATTATTCTGTCACCAAATTATCCAGAGGAATATGGGAACAACATGAACTGTGTCTGGTTGATTATCTCGGAGCCAGGAAGTCGAATTCACCTAATCTTTAATGATTTTGATGTTGAGCCTCAGTTTGACTTTCTCGCGGTCAAGGATGATGGCATTTCTGACATAACTGTCCTGGGTACTTTTTCTGGCAATGAAGTGCCTTCCCAGCTGGCCAGCAGTGGGCATATAGTTCGCTTGGAATTTCAGTCTGACCATTCCACTACTGGCAGAGGGTTCAACATCACTTACACCA SEQ ID NO:271 CSMD1之外顯子15 CATTTGGTCAGAATGAGTGCCATGATCCTGGCATTCCTATAAACGGACGACGTTTTGGTGACAGGTTTCTACTCGGGAGCTCGGTTTCTTTCCACTGTGATGATGGCTTTGTCAAGACCCAGGGATCCGAGTCCATTACCTGCATACTGCAAGACGGGAACGTGGTCTGGAGCTCCACCGTGCCCCGCTGTGAAG SEQ ID NO:272 CSMD1之外顯子16 CTCCATGTGGTGGACATCTGACAGCGTCCAGCGGAGTCATTTTGCCTCCTGGATGGCCAGGATATTATAAGGATTCTTTACATTGTGAATGGATAATTGAAGCAAAACCAGGCCACTCTATCAAAATAACTTTTGACAG SEQ ID NO:273 CSMD1之外顯子17 ATTTCAGACAGAGGTCAATTATGACACCTTGGAGGTCAGAGATGGGCCAGCCAGTTCGTCCCCACTGATCGGCGAGTACCACGGCACCCAGGCACCCCAGTTCCTCATCAGCACCGGGAACTTCATGTACCTGCTGTTCACCACTGACAACAGCCGCTCCAGCATCGGCTTCCTCATCCACTATGAGA SEQ ID NO:274 CSMD1之外顯子18 GTGTGACGCTTGAGTCGGATTCCTGCCTGGACCCGGGCATCCCTGTGAACGGCCATCGCCACGGTGGAGACTTTGGCATCAGGTCCACAGTGACTTTCAGCTGTGACCCGGGGTACACACTAAGTGACGACGAGCCCCTCGTCTGTGAGAGGAACCACCAGTGGAACCACGCCTTGCCCAGCTGCGACG SEQ ID NO:275 CSMD1之外顯子19 CTCTATGTGGAGGCTACATCCAAGGGAAGAGTGGAACAGTCCTTTCTCCTGGGTTTCCAGATTTTTATCCAAACTCTCTAAACTGCACGTGGACCATTGAAGTGTCTCATGGGAAAG SEQ ID NO:276 CSMD1之外顯子20 GAGTTCAAATGATCTTTCACACCTTTCATCTTGAGAGTTCCCACGACTATTTACTGATCACAGAGGATGGAAGTTTTTCCGAGCCCGTTGCCAGGCTCACCGGGTCGGTGTTGCCTCATACGATCAAGGCAGGCCTGTTTGGAAACTTCACTGCCCAGCTTCGGTTTATATCAGACTTCTCAATTTCGTACGAGGGCTTCAATATCACATTTTCAG SEQ ID NO:277 CSMD1之外顯子21 AATATGACCTGGAGCCATGTGATGATCCTGGAGTCCCTGCCTTCAGCCGAAGAATTGGTTTTCACTTTGGTGTGGGAGACTCTCTGACGTTTTCCTGCTTCCTGGGATATCGTTTAGAAGGTGCCACCAAGCTTACCTGCCTGGGTGGGGGCCGCCGTGTGTGGAGTGCACCTCTGCCAAGGTGTGTGG SEQ ID NO:278 CSMD1之外顯子22 CCGAATGTGGAGCAAGTGTCAAAGGAAATGAAGGAACATTACTGTCTCCAAATTTTCCATCCAATTATGATAATAACCATGAGTGTATCTATAAAATAGAAACAGAAGCCGGCAAGGGCATCCACCTTAGAACACGAAGCTTCCAGCTGTTTGAAGGAGATACTCTAAAG SEQ ID NO:279 CSMD1之外顯子23 GTATATGATGGAAAAGACAGTTCCTCACGTCCACTGGGCACGTTCACTAAAAATGAACTTCTGGGGCTGATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCA SEQ ID NO:280 CSMD1之外顯子24 GTTTTGATCTGGTAAAATGTGAGGATCCGGGCATCCCTAACTACGGCTATAGGATCCGTGATGAAGGCCACTTTACCGACACTGTAGTTCTGTACAGTTGCAACCCGGGGTACGCCATGCATGGCAGCAACACCCTGACCTGTTTGAGTGGAGACAGGAGAGTGTGGGACAAACCACTACCTTCGTGCATAG SEQ ID NO:281 CSMD1之全部47個外顯子24-70按順序排列的多核苷酸序列 SEQ ID NO:282 CSMD1之全部70個外顯子1-70按順序排列的多核苷酸序列 CTCCACGGCAGCGGCTCCTTGTGCCACTAGCAGCCCTTCTTCTGCGCTCTCCGCCTTTTCTCTCTAGACTGGATCTCTCCTCCCCCCCGCGCCCCCCTCCCCGCATCTCCCACTCGCTGGCTCTCTCTCCAGCTGCCTCCTCTCCAGGTCTCTCCTGGCTGCGCGCGCTCCTCTCCCCGCTTCTCCCCCTCCCGCAGCCTCGCCGCCTTGGTGCCTTCCTGCCCGGCTCGGCCGGCGCTCGTCCCCGGCCCCGGCCCCGCCAGCCCGGGTCTCCGCGCTCGGAGCAGCTCAGCCCTGCAGTGGCTCGGGACCCGATGCTATGAGAGGGAAGCGAGCCGGGCGCCCAGACCTTCAGGAGGCGTCGGATGCGCGGCGGGTCTTGGGACCGGGCTCTCTCTCCGGCTCGCCTTGCCCTCGGGTGATTATTTGGCTCCGCTCATAGCCCTGCCTTCCTCGGAGGAGCCATCGGTGTCGCGTGCGTGTGGAGTATCTGCAGACATGACTGCGTGGAGGAGATTCCAGTCGCTGCTCCTGCTTCTCGGGCTGCTGGTGCTGTGCGCGAGGCTCCTCACTGCAGCGAAGGGTCAGAACTGTGGAGGCTTAGTCCAGGGTCCCAATGGCACTATTGAGAGCCCAGGGTTTCCTCACGGGTATCCGAACTATGCCAACTGCACCTGGATCATCATCACGGGCGAGCGCAATAGGATACAGTTGTCCTTCCATACCTTTGCTCTTGAAGAAGATTTTGATATTTTATCAGTTTACGATGGACAGCCTCAACAAGGGAATTTAAAAGTGAGATTATCGGGATTTCAGCTGCCCTCCTCTATAGTGAGTACAGGATCTATCCTCACTCTGTGGTTCACGACAGACTTCGCTGTGAGTGCCCAAGGTTTCAAAGCATTATATGAAGTTTTACCTAGCCACACTTGTGGAAATCCTGGAGAAATCCTGAAAGGAGTTCTGCATGGAACGAGATTCAACATAGGAGACAAAATCCGGTACAGCTGCCTCCCTGGCTACATCTTGGAAGGCCACGCCATCCTGACCTGCATCGTCAGCCCAGGAAATGGTGCATCGTGGGACTTCCCAGCTCCCTTTTGCAGAGCTGAGGGAGCCTGCGGAGGAACCTTACGCGGGACCAGCAGCTCCATCTCCAGCCCGCACTTCCCTTCAGAGTACGAGAACAACGCGGACTGCACCTGGACCATTCTGGCTGAGCCCGGGGACACCATTGCGCTGGTCTTCACTGACTTTCAGCTAGAAGAAGGATATGATTTCTTAGAGATCAGTGGCACGGAAGCTCCATCCATATGGCTAACTGGCATGAACCTCCCCTCTCCAGTTATCAGTAGCAAGAATTGGCTACGACTCCATTTCACCTCTGACAGCAACCACCGACGCAAAGGATTTAACGCTCAGTTCCAAGTGAAAAAGGCGATTGAGTTGAAGTCAAGAGGAGTCAAGATGCTGCCCAGCAAGGATGGAAGCCATAAAAACTCTGTCTTGAGCCAAGGAGGTGTTGCATTGGTCTCTGACATGTGTCCAGATCCTGGGATTCCAGAAAATGGTAGAAGAGCAGGTTCCGACTTCAGGGTTGGTGCAAATGTACAGTTTTCATGTGAGGACAATTACGTGCTCCAGGGATCTAAAAGCATCACCTGTCAGAGAGTTACAGAGACGCTCGCTGCTTGGAGTGACCACAGGCCCATCTGCCGAGCGAGAACATGTGGATCCAATCTGCGTGGGCCCAGCGGCGTCATTACCTCCCCTAATTATCCGGTTCAGTATGAAGATAATGCACACTGTGTGTGGGTCATCACCACCACCGACCCGGACAAGGTCATCAAGCTTGCCTTTGAAGAGTTTGAGCTGGAGCGAGGCTATGACACCCTGACGGTTGGTGATGCTGGGAAGGTGGGAGACACCAGATCGGTCTTGTACGTGCTCACGGGATCCAGTGTTCCTGACCTCATTGTGAGCATGAGCAACCAGATGTGGCTACATCTGCAGTCGGATGATAGCATTGGCTCACCTGGGTTTAAAGCTGTTTACCAAGAAATTGAAAAGGGAGGGTGTGGGGATCCTGGAATCCCCGCCTATGGGAAGCGGACGGGCAGCAGTTTCCTCCATGGAGATACACTCACCTTTGAATGCCCGGCGGCCTTTGAGCTGGTGGGGGAGAGAGTTATCACCTGTCAGCAGAACAATCAGTGGTCTGGCAACAAGCCCAGCTGTGTATTTTCATGTTTCTTCAACTTTACGGCATCATCTGGGATTATTCTGTCACCAAATTATCCAGAGGAATATGGGAACAACATGAACTGTGTCTGGTTGATTATCTCGGAGCCAGGAAGTCGAATTCACCTAATCTTTAATGATTTTGATGTTGAGCCTCAGTTTGACTTTCTCGCGGTCAAGGATGATGGCATTTCTGACATAACTGTCCTGGGTACTTTTTCTGGCAATGAAGTGCCTTCCCAGCTGGCCAGCAGTGGGCATATAGTTCGCTTGGAATTTCAGTCTGACCATTCCACTACTGGCAGAGGGTTCAACATCACTTACACCACATTTGGTCAGAATGAGTGCCATGATCCTGGCATTCCTATAAACGGACGACGTTTTGGTGACAGGTTTCTACTCGGGAGCTCGGTTTCTTTCCACTGTGATGATGGCTTTGTCAAGACCCAGGGATCCGAGTCCATTACCTGCATACTGCAAGACGGGAACGTGGTCTGGAGCTCCACCGTGCCCCGCTGTGAAGCTCCATGTGGTGGACATCTGACAGCGTCCAGCGGAGTCATTTTGCCTCCTGGATGGCCAGGATATTATAAGGATTCTTTACATTGTGAATGGATAATTGAAGCAAAACCAGGCCACTCTATCAAAATAACTTTTGACAGATTTCAGACAGAGGTCAATTATGACACCTTGGAGGTCAGAGATGGGCCAGCCAGTTCGTCCCCACTGATCGGCGAGTACCACGGCACCCAGGCACCCCAGTTCCTCATCAGCACCGGGAACTTCATGTACCTGCTGTTCACCACTGACAACAGCCGCTCCAGCATCGGCTTCCTCATCCACTATGAGAGTGTGACGCTTGAGTCGGATTCCTGCCTGGACCCGGGCATCCCTGTGAACGGCCATCGCCACGGTGGAGACTTTGGCATCAGGTCCACAGTGACTTTCAGCTGTGACCCGGGGTACACACTAAGTGACGACGAGCCCCTCGTCTGTGAGAGGAACCACCAGTGGAACCACGCCTTGCCCAGCTGCGACGCTCTATGTGGAGGCTACATCCAAGGGAAGAGTGGAACAGTCCTTTCTCCTGGGTTTCCAGATTTTTATCCAAACTCTCTAAACTGCACGTGGACCATTGAAGTGTCTCATGGGAAAGGAGTTCAAATGATCTTTCACACCTTTCATCTTGAGAGTTCCCACGACTATTTACTGATCACAGAGGATGGAAGTTTTTCCGAGCCCGTTGCCAGGCTCACCGGGTCGGTGTTGCCTCATACGATCAAGGCAGGCCTGTTTGGAAACTTCACTGCCCAGCTTCGGTTTATATCAGACTTCTCAATTTCGTACGAGGGCTTCAATATCACATTTTCAGAATATGACCTGGAGCCATGTGATGATCCTGGAGTCCCTGCCTTCAGCCGAAGAATTGGTTTTCACTTTGGTGTGGGAGACTCTCTGACGTTTTCCTGCTTCCTGGGATATCGTTTAGAAGGTGCCACCAAGCTTACCTGCCTGGGTGGGGGCCGCCGTGTGTGGAGTGCACCTCTGCCAAGGTGTGTGGCCGAATGTGGAGCAAGTGTCAAAGGAAATGAAGGAACATTACTGTCTCCAAATTTTCCATCCAATTATGATAATAACCATGAGTGTATCTATAAAATAGAAACAGAAGCCGGCAAGGGCATCCACCTTAGAACACGAAGCTTCCAGCTGTTTGAAGGAGATACTCTAAAGGTATATGATGGAAAAGACAGTTCCTCACGTCCACTGGGCACGTTCACTAAAAATGAACTTCTGGGGCTGATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCAGTTTTGATCTGGTAAAATGTGAGGATCCGGGCATCCCTAACTACGGCTATAGGATCCGTGATGAAGGCCACTTTACCGACACTGTAGTTCTGTACAGTTGCAACCCGGGGTACGCCATGCATGGCAGCAACACCCTGACCTGTTTGAGTGGAGACAGGAGAGTGTGGGACAAACCACTACCTTCGTGCATAGCGGAATGTGGTGGTCAGATCCATGCAGCCACATCAGGACGAATATTGTCCCCTGGCTATCCAGCTCCGTATGACAACAACCTCCACTGCACCTGGATTATAGAGGCAGACCCAGGAAAGACCATTAGCCTCCATTTCATTGTTTTCGACACGGAGATGGCTCACGACATCCTCAAGGTCTGGGACGGGCCGGTGGACAGTGACATCCTGCTGAAGGAGTGGAGTGGCTCCGCCCTTCCGGAGGACATCCACAGCACCTTCAACTCACTCACCCTGCAGTTCGACAGCGACTTCTTCATCAGCAAGTCTGGCTTCTCCATCCAGTTCTCCACCTCAATTGCAGCCACCTGTAACGATCCAGGTATGCCCCAAAATGGCACCCGCTATGGAGACAGCAGAGAGGCTGGAGACACCGTCACATTCCAGTGTGACCCTGGCTATCAGCTCCAAGGACAAGCCAAAATCACCTGTGTGCAGCTGAATAACCGGTTCTTTTGGCAACCAGACCCTCCTACATGCATAGCTGCTTGTGGAGGGAATCTGACGGGCCCAGCAGGTGTTATTTTGTCACCCAACTACCCACAGCCGTATCCTCCTGGGAAGGAATGTGACTGGAGAGTAAAAGTGAACCCGGACTTTGTCATCGCCTTGATATTCAAAAGTTTCAACATGGAGCCCAGCTATGACTTCCTACACATCTATGAAGGGGAAGATTCCAACAGCCCCCTCATTGGGAGTTACCAGGGCTCTCAGGCCCCAGAAAGAATAGAGAGTAGCGGAAACAGCCTGTTTCTGGCATTTCGGAGTGATGCCTCCGTGGGCCTTTCAGGGTTCGCCATTGAATTTAAAGAGAAACCACGGGAAGCTTGTTTTGACCCAGGAAATATAATGAATGGGACAAGAGTTGGAACAGACTTCAAGCTTGGCTCCACCATCACCTACCAGTGTGACTCTGGCTATAAGATTCTTGACCCCTCATCCATCACCTGTGTGATTGGGGCTGATGGGAAACCCTCCTGGGACCAAGTGCTGCCCTCCTGCAATGCTCCCTGTGGAGGCCAGTACACGGGATCAGAAGGGGTAGTTTTATCACCAAACTACCCCCATAATTACACAGCTGGTCAAATATGCCTCTATTCCATCACGGTACCAAAGGAATTCGTGGTCTTTGGACAGTTTGCCTATTTCCAGACAGCCCTGAATGATTTGGCAGAATTATTTGATGGAACCCATGCACAGGCCAGACTTCTCAGCTCACTCTCGGGGTCTCACTCAGGGGAAACATTGCCCTTGGCTACGTCAAATCAAATTCTGCTCCGATTCAGTGCAAAGAGCGGTGCCTCTGCCCGCGGCTTCCACTTCGTGTATCAAGCTGTTCCTCGTACCAGTGACACCCAATGCAGCTCTGTCCCCGAGCCCAGATACGGAAGGAGAATTGGTTCTGAGTTTTCTGCCGGCTCCATCGTCCGATTCGAGTGCAACCCGGGATACCTGCTTCAGGGTTCCACGGCGCTCCACTGCCAGTCCGTGCCCAACGCCTTGGCACAGTGGAACGACACGATCCCCAGCTGTGTGGTACCCTGCAGTGGCAATTTCACTCAACGAAGAGGTACAATCCTGTCCCCCGGCTACCCTGAGCCATACGGAAACAACTTGAACTGTATATGGAAGATCATAGTTACGGAGGGCTCGGGAATTCAGATCCAAGTGATCAGTTTTGCCACGGAGCAGAACTGGGACTCCCTTGAGATCCACGATGGTGGGGATGTGACCGCACCCAGACTGGGAAGCTTCTCAGGCACCACAGTACCGGCACTGCTGAACAGTACTTCCAACCAACTCTACCTGCATTTCCAGTCTGACATTAGTGTGGCAGCTGCTGGTTTCCACCTGGAATACAAAACTGTAGGTCTTGCTGCATGCCAAGAACCAGCCCTCCCCAGCAACAGCATCAAAATCGGAGATCGGTACATGGTGAACGACGTGCTCTCCTTCCAGTGCGAGCCCGGGTACACCCTGCAGGGCCGTTCCCACATTTCCTGTATGCCAGGGACCGTTCGCCGTTGGAACTATCCGTCTCCCCTGTGCATTGCAACCTGTGGAGGGACGCTGAGCACCTTGGGTGGTGTGATCCTGAGCCCCGGCTTCCCAGGTTCTTACCCCAACAACTTAGACTGCACCTGGAGGATCTCATTACCCATCGGCTATGGTGCACATATTCAGTTTCTGAATTTTTCTACCGAAGCTAATCATGACTTCCTTGAAATTCAAAATGGACCTTACCACACCAGCCCCATGATTGGACAATTTAGCGGCACGGATCTCCCCGCGGCCCTGCTGAGCACAACGCATGAAACCCTCATCCACTTTTATAGTGACCATTCGCAAAACCGGCAAGGATTTAAACTTGCTTACCAAGCCTATGAATTACAGAACTGTCCAGATCCACCCCCATTTCAGAATGGGTACATGATCAACTCGGATTACAGCGTGGGGCAATCAGTATCTTTCGAGTGTTATCCTGGGTACATTCTAATAGGCCATCCTGTCCTCACTTGTCAGCATGGGATCAACAGAAACTGGAACTACCCTTTTCCAAGATGTGATGCCCCTTGTGGGTACAACGTAACTTCTCAGAACGGCACCATCTACTCCCCTGGCTTTCCTGATGAGTATCCGATCCTGAAGGACTGCATTTGGCTCATCACGGTGCCTCCAGGGCACGGAGTTTACATCAACTTCACCCTGTTACAGACGGAAGCTGTCAACGATTACATTGCTGTTTGGGACGGTCCCGATCAGAACTCACCCCAGCTGGGAGTTTTCAGTGGCAACACAGCCCTCGAAACGGCGTATAGCTCCACCAACCAAGTCCTGCTCAAGTTCCACAGCGACTTTTCAAATGGAGGCTTCTTTGTCCTCAATTTCCACGCATTTCAGCTCAAGAAATGTCAACCTCCCCCAGCGGTTCCACAGGCAGAAATGCTTACTGAGGATGATGATTTCGAAATAGGAGATTTTGTGAAGTACCAGTGCCACCCCGGGTACACCTTGGTGGGGACCGACATTCTGACTTGCAAGCTCAGTTCCCAGTTGCAGTTTGAGGGTTCTCTCCCAACATGTGAAGCACAATGCCCAGCAAATGAAGTCCGGACTGGATCATCGGGAGTCATTCTCAGTCCAGGGTATCCGGGTAATTATTTTAACTCCCAGACTTGCTCTTGGAGTATTAAAGTGGAACCAAACTACAACATTACCATCTTTGTGGACACATTTCAAAGTGAAAAGCAGTTTGATGCACTGGAAGTGTTTGATGGTTCTTCTGGGCAAAGTCCTCTGCTAGTAGTCTTAAGTGGGAATCATACTGAACAATCAAATTTTACAAGCAGGAGTAATCAGTTATATCTCCGCTGGTCCACTGACCATGCCACCAGTAAGAAAGGATTCAAGATTCGCTATGCAGCACCTTACTGCAGTTTGACCCACCCCCTGAAGAATGGGGGTATTCTAAACAGGACTGCAGGAGCGGTTGGAAGCAAAGTGCATTATTTTTGCAAGCCTGGATACCGAATGGTCGGCCACAGCAATGCAACCTGTAGACGAAACCCACTTGGCATGTACCAGTGGGACTCCCTCACGCCACTCTGCCAGGCTGTGTCCTGTGGAATCCCAGAATCCCCAGGAAACGGTTCATTTACCGGGAACGAGTTCACTTTGGACAGTAAAGTGGTCTATGAATGTCATGAGGGCTTCAAGCTTGAATCCAGCCAGCAAGCAACAGCCGTGTGTCAAGAAGATGGGTTGTGGAGTAACAAGGGGAAGCCGCCCACGTGTAAGCCGGTCGCTTGCCCCAGCATTGAAGCTCAGCTCTCAGAACATGTCATCTGGAGGCTGGTTTCAGGATCCTTGAATGAGTACGGTGCTCAAGTATTGCTGAGCTGCAGTCCTGGTTACTACTTAGAAGGCTGGAGGCTCCTGCGGTGCCAGGCCAATGGGACGTGGAACATAGGAGATGAGAGGCCAAGCTGTCGAGTTATCTCGTGTGGAAGCCTTTCCTTTCCCCCAAATGGCAACAAGATTGGAACGTTGACAGTTTATGGGGCCACAGCTATATTTACGTGCAACACCGGCTACACGCTTGTGGGGTCTCATGTCAGAGAGTGCTTGGCAAATGGGCTCTGGAGCGGCAGCGAAACTCGATGTCTGGCTGGCCACTGCGGTTCCCCAGACCCGATTGTGAACGGTCACATTAGTGGAGATGGCTTCAGTTACAGAGACACGGTGGTTTACCAGTGCAATCCTGGTTTCCGGCTTGTGGGAACTTCCGTGAGGATATGCCTGCAAGACCACAAGTGGTCTGGACAAACGCCTGTCTGTGTCCCCATCACATGTGGTCACCCTGGAAACCCTGCCCACGGATTCACTAATGGCAGTGAGTTCAACCTGAATGATGTCGTGAATTTCACCTGCAACACGGGCTATTTGCTGCAGGGCGTGTCTCGAGCCCAGTGTCGGAGCAACGGCCAGTGGAGTAGCCCTCTGCCCACGTGTCGAGTGGTGAACTGTTCTGATCCAGGCTTTGTGGAAAATGCCATTCGTCACGGGCAACAGAACTTCCCTGAGAGTTTTGAGTATGGAATGAGTATCCTGTACCATTGCAAGAAGGGATTTTACTTGCTGGGATCTTCAGCCTTGACCTGTATGGCAAATGGCTTATGGGACCGATCCCTGCCCAAGTGTTTGGCTATATCGTGTGGACACCCAGGGGTCCCTGCCAACGCCGTCCTCACTGGAGAGCTGTTTACCTATGGCGCCGTCGTGCACTACTCCTGCAGAGGGAGCGAGAGCCTCATAGGCAACGACACGAGAGTGTGCCAGGAAGACAGTCACTGGAGCGGGGCACTGCCCCACTGCACAGGAAATAATCCTGGATTCTGTGGTGATCCGGGGACCCCAGCACATGGGTCTCGGCTTGGTGATGACTTTAAGACAAAGAGTCTTCTCCGCTTCTCCTGTGAAATGGGGCACCAGCTGAGGGGCTCCCCTGAACGCACGTGTTTGCTCAATGGGTCATGGTCAGGACTGCAGCCGGTGTGTGAGGCCGTGTCCTGTGGCAACCCTGGCACACCCACCAACGGAATGATTGTCAGTAGTGATGGCATTCTGTTCTCCAGCTCGGTCATCTATGCCTGCTGGGAAGGCTACAAGACCTCAGGGCTCATGACACGGCATTGCACAGCCAATGGGACCTGGACAGGCACTGCTCCCGACTGCACAATTATAAGTTGTGGGGATCCAGGCACACTAGCAAATGGCATCCAGTTTGGGACCGACTTCACCTTCAACAAGACTGTGAGCTATCAGTGTAACCCAGGCTATGTCATGGAAGCAGTCACATCCGCCACTATTCGCTGTACCAAAGACGGCAGGTGGAATCCGAGCAAACCTGTCTGCAAAGCCGTGCTGTGTCCTCAGCCGCCGCCGGTGCAGAATGGAACAGTGGAGGGAAGTGATTTCCGCTGGGGCTCCAGCATAAGTTACAGCTGCATGGACGGTTACCAGCTCTCTCACTCCGCCATCCTCTCCTGTGAAGGTCGCGGGGTGTGGAAAGGAGAGATCCCCCAGTGTCTCCCTGTGTTCTGCGGAGACCCTGGCATCCCCGCAGAAGGGCGACTTAGTGGGAAAAGTTTCACCTATAAGTCCGAAGTCTTCTTCCAGTGCAAATCTCCATTTATACTCGTGGGATCCTCCAGAAGAGTCTGCCAAGCTGACGGCACGTGGAGCGGCATACAACCCACCTGCATTGATCCTGCTCATAACACCTGCCCAGACCCTGGTACGCCACACTTTGGAATACAGAATAGCTCCAGAGGCTATGAGGTTGGAAGCACGGTTTTTTTCAGGTGCAGAAAAGGCTACCATATTCAAGGTTCCACGACTCGCACCTGCCTTGCCAATTTAACATGGAGTGGGATACAGACCGAATGTATACCTCATGCCTGCAGACAGCCAGAAACCCCGGCACACGCGGATGTGAGAGCCATCGATCTTCCTACTTTCGGCTACACCTTAGTGTACACCTGCCATCCAGGCTTTTTCCTCGCAGGGGGATCTGAGCACAGAACATGTAAAGCAGACATGAAATGGACAGGAAAGTCGCCTGTGTGTAAAAGTAAAGGAGTGAGAGAAGTTAATGAAACAGTTACTAAAACTCCAGTTCCTTCAGATGTCTTTTTCGTCAATTCACTGTGGAAGGGGTATTATGAATATTTAGGGAAAAGACAACCCGCCACTCTAACTGTTGACTGGTTCAATGCAACAAGCAGTAAGGTGAATGCCACCTTCAGCGAAGCCTCGCCAGTGGAGCTGAAGTTGACAGGCATTTACAAGAAGGAGGAGGCCCACTTACTCCTGAAAGCTTTTCAAATTAAAGGCCAGGCAGATATTTTTGTAAGCAAGTTCGAAAATGACAACTGGGGACTAGATGGTTATGTGTCATCTGGACTTGAAAGAGGAGGATTTACTTTTCAAGGTGACATTCATGGAAAAGACTTTGGAAAATTTAAGCTAGAAAGGCAAGATCCTTTAAACCCAGATCAAGACTCTTCCAGTCATTACCACGGCACCAGCAGTGGCTCTGTGGCGGCTGCCATTCTGGTTCCTTTCTTTGCTCTAATTTTATCAGGGTTTGCATTTTACCTCTACAAACACAGAACGAGACCAAAAGTTCAATACAATGGCTATGCTGGGCATGAAAACAGCAATGGACAAGCATCGTTTGAAAACCCCATGTATGATACAAACTTAAAACCCACAGAAGCCAAGGCTGTGAGGTTTGACACAACTCTGAACACAGTCTGTACAGTGGTATAGCCCTCAGTGCCCCAACAGGACTGATTCATAGCCATACCTCTGATGGACAAGCAGTGATTCCTTTGGTGCCATATACCACTCTCCCTTCCACTCTGGCTTTACTGCAGCGATCTTCAACCTTGTCTACTGGCATAAGTGCAGCGGGGATCTCTACTCAAATGTGTCAGGGTCTTCTACGGATCAAACTACACATGCGTTTTCATTCCAAAAGTGGGTTCTAAATGCCTGGCTGCATCTGTATGAAATCAAGGCACACTCCAGGAAGACTGCCACGTCGCGCCAACACGTCATACTCAATGCCTCAGACTTTCATATTTCTGTGTTGCTGAGATGCCTTTCAATGCAATCGTCTGGGCTCGTGGATATGTCCCTCAGGTGCGGTGACAGAATGGTGGCACCACGATATGTGTTCTCTTGTGTTGTTTTTCCTTTTTAAACCCCCATGAACACGAATACTCTGAAAAAAATAAAAAGCTTTCTGGAAGAAGACACCTTTCTGATAGAGGCTCACACCTACAAATGCTTCACTCTGTCCTTCCGAGACCTGACAAGCTTTGAGGACCTCACAGCTCCCCTGTGTGTTCATCTCTAGGGATGTTTGCAATTTCCCAGTCAGCTGTTCTGTCGCAGAATGTTTAATGCACAATTTTTTGCACTAGTGTGTTATGAATGACTAAGATTCTGATAAAAAAAATAAATTATTTACACAGGGTTTATACACACTATCCATTGTATATAAGCATTATTTCATATTATCAAGCTAAACATTCCCCCATCAGCTTAGTTGGAGTGTTAGGGAAAAGTATTCCTAGATATGGCACAGATTTTAAAAGGAAATACAGTATTGAAGAGATTTATTTTATTATTGCTTCAATTAGCTCCATTTACGTGTTGAATTCATTGAAGAGGTCCAATGAGAAAAAAACAGAAGCCTCCTTATTTCACACGTTTTCCTCCTTTAGTACCATCCTCATCCAATTACTGTCTCTCTGATACTACTTAATAGCAGGGGGTTTGCAGAAATTTCTGTTTGCCATGTAAAACTGTGAATAGTAATTTATTTTAGATAGTCGATGAACTTGTGGGTTTTAGCTCACAATGCAGCCTTCCCTTTTGCAGTGTTTTTTTTTTGTTTTTTTTTTTTTTTGTCTTTTACTGTGCCATCGATCTTTGATATTGCATTGAAAGACAATATACCACAGTAGCACCTTGAACTCAGTGAAAATTGTTCAGGATCAAAATACCAAGTGTTCTTTTAGAGGGAAGGAAAAAGTACACACACTCTCCTCTCACAATGATATATTTTATACATTCATTTGTTATTTGTTTCATGCTTTATGATTCCAGATGGAAAGGTAATTTCAGTGACTTTTCAAGTTTAAATTCCATTATAGGTAAATGATAAGTTATGATGCAAATAAAATCTATAAGATCCCCAGGGCAAATAAAAATCAAAACATGAAGTAGAAGATGTGGCCGTGAGGTAGTTTATGTAACAAATTCAAAGTGAAAATCATGTTTACTTTTACTTATACTTATTTGATAAAAATATTTTTGAAACGATAGTACTTATTTTATTATTTGATATTTCAGTTCCTATTCAATTGTGGCAGATTTTCTCTGTTTCACATTTTAGATTGGCGTTGGTAATAGAAATGTCAGAATGTTCAAATTGGCCTTCACGTTGTCGGAGTGAACACATTGACACCTAGCTTTAAGACTGATTTATCTGTTGGTGTACTGAAGGTTTCCATGTAGGACTTCAAATGTGGAAAAGGAAAAGCAGTCAGGAAAATGGGGCATTCTTTGGAGAGTCACGCGTTTTGATTCGGACATTTCCGTAGAGCTCGGCTCCCAGTGTTGTGTTCCTCGGTCGAAAGGGTCTCTGCTGTTTGGGGACTCACTGGCCTCTCCTAGGGACTCCTTTGTCTTGTGAACCCCACGCTGTTGGATTCTGTATCATTATGCTGAATTCTCTGCACAGTTTTCCCTGGCCAACCTGCCCACATCCTTGGAGATTTGCTTTGCCAGTGGGAATCCTTACATTGCTGTTTCACAGTAGACGGGACGAGGTCAGCGGGAGTCGTGCTCCTAACACACACATTGAACGAAACAGAAGATGATTGAAAGTGTGAGGAGGCTCGTGTGCAAGGGAGAACAGGGTTACTATACATATTAGTGTATATATATACATACATATATATATATATATATTGTACATATCTAAGTTTGAGTCATTCAAACTAGGTGCAAAATGCTGACTTCAGAGTCTGAATTAACATCTCTGTTCCCATATCCCTGACCTGCTCCCTGGTCAACGATGCTATGAAATCCTGAAATGACAGGACATACATACATACAAGAAACCACATATCAAATTAGATATGATTTTCCTTTGTGTGCAAAGTCAAACTGTCCTAGGGTTGCCAGTTTGAAGCATGTTATTTAAATGAAAAAAAAAATCAGTGAAATTCTCGTGTGAGAATTCTGCCTAGTTTCTTCCTAAGGTTGTGTGCAGTGTTGAACGGCGTCTCCGCAAGGTGTTGGAGGATCTCATTTTAGGGCAGTCAGGAGCTGTGCTTGCTGAGTTAGGTCTAGAAGACTCTTCCCTGAAGGCAACGGGAACACGCGTGAGGGACGCGACCACACACTAACAGAGGACACGTGCTTCAGAGCTGTTTAAAACTGCTGCTTGTTTTACACACACATCTTGCCTTTTTTCAGGCTAGCTGCAATAATTTTTTTCTTCTGTAAAATATTTTGTAAACAACAACAAAAAGCTATTATAAAAAGGGGGTAAAAAAAAGAACGCTGGCATTATGATCAGGAAAACCCATTGTCATCGCCGACCCTCCCTCCCGTCCCACCACACGCTGCTGTCACGACGTAGGTGCGAAAGACCTTTTTGTACAGAGATATATTTTTTATGAAGAATTTGTAAAATTATTAAATATGCTGTAATTTTTTGATTAATGTAGGTACATTGTTAAAAAATAAATGTTTTTACAATACAGAACTGTAATTTTCCCAATAATGTAAAATGTACCATCTCTAGCTGATTTTCAGTTCCAATCCTATTACACATGTATTAATATTAAAGTGGCCTGTTAAAATGAACAGTATCTTTTTTTTGTCAAAAAAATTATAAAGAGGGTGTAATATAGCCTGTGCAATGCCACCAATCTTTAAAGCAAATCAGAGTTCTAATTAAATATTTAATTTTA SEQ ID NO:283 CSMD1之外顯子1的轉譯多肽序列 MTAWRRFQSLLLLLGLLVLCARLLTAAK SEQ ID NO:284 CSMD1之外顯子2的轉譯多肽序列 QNCGGLVQGPNGTIESPGFPHGYPNYANCTWIIITGERNRIQLSFHTFALEEDFDILSVYDGQPQQGNLKV SEQ ID NO:285 CSMD1之外顯子3的轉譯多肽序列 LSGFQLPSSIVSTGSILTLWFTTDFAVSAQGFKALYE SEQ ID NO:286 CSMD1之外顯子4的轉譯多肽序列 LPSHTCGNPGEILKGVLHGTRFNIGDKIRYSCLPGYILEGHAILTCIVSPGNGASWDFPAPFCR SEQ ID NO:287 CSMD1之外顯子5的轉譯多肽序列 EGACGGTLRGTSSSISSPHFPSEYENNADCTWTILAEPGDTIALVFTDFQLEEGYDFLEISGTEAPSI SEQ ID NO:288 CSMD1之外顯子6的轉譯多肽序列 LTGMNLPSPVISSKNWLRLHFTSDSNHRRKGFNAQFQ SEQ ID NO:289 CSMD1之外顯子7的轉譯多肽序列 KKAIELKSRGVKMLPSKDGSHKNSV SEQ ID NO:290 CSMD1之外顯子8的轉譯多肽序列 SQGGVALVSDMCPDPGIPENGRRAGSDF SEQ ID NO:291 CSMD1之外顯子9的轉譯多肽序列 VGANVQFSCEDNYVLQGSKSITCQRVTETLAAWSDHRPICR SEQ ID NO:292 CSMD1之外顯子10的轉譯多肽序列 RTCGSNLRGPSGVITSPNYPVQYEDNAHCVWVITTTDPDK SEQ ID NO:293 CSMD1之外顯子11的轉譯多肽序列 VIKLAFEEFELERGYDTLTVGDAGKVGDTRSVLY SEQ ID NO:294 CSMD1之外顯子12的轉譯多肽序列 LTGSSVPDLIVSMSNQMWLHLQSDDSIGSPGFKAVYQ SEQ ID NO:295 CSMD1之外顯子13的轉譯多肽序列 IEKGGCGDPGIPAYGKRTGSSFLHGDTLTFECPAAFELVGERVITCQQNNQWSGNKPSCV SEQ ID NO:296 CSMD1之外顯子14的轉譯多肽序列 SCFFNFTASSGIILSPNYPEEYGNNMNCVWLIISEPGSRIHLIFNDFDVEPQFDFLAVKDDGISDITVLGTFSGNEVPSQLASSGHIVRLEFQSDHSTTGRGFNITYT SEQ ID NO:297 CSMD1之外顯子15的轉譯多肽序列 FGQNECHDPGIPINGRRFGDRFLLGSSVSFHCDDGFVKTQGSESITCILQDGNVVWSSTVPRCE SEQ ID NO:298 CSMD1之外顯子16的轉譯多肽序列 PCGGHLTASSGVILPPGWPGYYKDSLHCEWIIEAKPGHSIKITFD SEQ ID NO:299 CSMD1之外顯子17的轉譯多肽序列 FQTEVNYDTLEVRDGPASSSPLIGEYHGTQAPQFLISTGNFMYLLFTTDNSRSSIGFLIHYE SEQ ID NO:300 CSMD1之外顯子18的轉譯多肽序列 VTLESDSCLDPGIPVNGHRHGGDFGIRSTVTFSCDPGYTLSDDEPLVCERNHQWNHALPSCD SEQ ID NO:301 CSMD1之外顯子19的轉譯多肽序列 LCGGYIQGKSGTVLSPGFPDFYPNSLNCTWTIEVSHGK SEQ ID NO:302 CSMD1之外顯子20的轉譯多肽序列 VQMIFHTFHLESSHDYLLITEDGSFSEPVARLTGSVLPHTIKAGLFGNFTAQLRFISDFSISYEGFNITFS SEQ ID NO:303 CSMD1之外顯子21的轉譯多肽序列 YDLEPCDDPGVPAFSRRIGFHFGVGDSLTFSCFLGYRLEGATKLTCLGGGRRVWSAPLPRCV SEQ ID NO:304 CSMD1之外顯子22的轉譯多肽序列 ECGASVKGNEGTLLSPNFPSNYDNNHECIYKIETEAGKGIHLRTRSFQLFEGDTLK SEQ ID NO:305 CSMD1之外顯子23的轉譯多肽序列 VYDGKDSSSRPLGTFTKNELLGLILNSTSNHLWLEFNTNGSDTDQGFQLTYT SEQ ID NO:306 CSMD1之外顯子24的轉譯多肽序列 FDLVKCEDPGIPNYGYRIRDEGHFTDTVVLYSCNPGYAMHGSNTLTCLSGDRRVWDKPLPSCI SEQ ID NO:307 來自外顯子24-70之全部47個外顯子按順序排列的轉譯多肽序列 SEQ ID NO:308 全蛋白質序列 MTAWRRFQSLLLLLGLLVLCARLLTAAKGQNCGGLVQGPNGTIESPGFPHGYPNYANCTWIIITGERNRIQLSFHTFALEEDFDILSVYDGQPQQGNLKVRLSGFQLPSSIVSTGSILTLWFTTDFAVSAQGFKALYEVLPSHTCGNPGEILKGVLHGTRFNIGDKIRYSCLPGYILEGHAILTCIVSPGNGASWDFPAPFCRAEGACGGTLRGTSSSISSPHFPSEYENNADCTWTILAEPGDTIALVFTDFQLEEGYDFLEISGTEAPSIWLTGMNLPSPVISSKNWLRLHFTSDSNHRRKGFNAQFQVKKAIELKSRGVKMLPSKDGSHKNSVLSQGGVALVSDMCPDPGIPENGRRAGSDFRVGANVQFSCEDNYVLQGSKSITCQRVTETLAAWSDHRPICRARTCGSNLRGPSGVITSPNYPVQYEDNAHCVWVITTTDPDKVIKLAFEEFELERGYDTLTVGDAGKVGDTRSVLYVLTGSSVPDLIVSMSNQMWLHLQSDDSIGSPGFKAVYQEIEKGGCGDPGIPAYGKRTGSSFLHGDTLTFECPAAFELVGERVITCQQNNQWSGNKPSCVFSCFFNFTASSGIILSPNYPEEYGNNMNCVWLIISEPGSRIHLIFNDFDVEPQFDFLAVKDDGISDITVLGTFSGNEVPSQLASSGHIVRLEFQSDHSTTGRGFNITYTTFGQNECHDPGIPINGRRFGDRFLLGSSVSFHCDDGFVKTQGSESITCILQDGNVVWSSTVPRCEAPCGGHLTASSGVILPPGWPGYYKDSLHCEWIIEAKPGHSIKITFDRFQTEVNYDTLEVRDGPASSSPLIGEYHGTQAPQFLISTGNFMYLLFTTDNSRSSIGFLIHYESVTLESDSCLDPGIPVNGHRHGGDFGIRSTVTFSCDPGYTLSDDEPLVCERNHQWNHALPSCDALCGGYIQGKSGTVLSPGFPDFYPNSLNCTWTIEVSHGKGVQMIFHTFHLESSHDYLLITEDGSFSEPVARLTGSVLPHTIKAGLFGNFTAQLRFISDFSISYEGFNITFSEYDLEPCDDPGVPAFSRRIGFHFGVGDSLTFSCFLGYRLEGATKLTCLGGGRRVWSAPLPRCVAECGASVKGNEGTLLSPNFPSNYDNNHECIYKIETEAGKGIHLRTRSFQLFEGDTLKVYDGKDSSSRPLGTFTKNELLGLILNSTSNHLWLEFNTNGSDTDQGFQLTYTSFDLVKCEDPGIPNYGYRIRDEGHFTDTVVLYSCNPGYAMHGSNTLTCLSGDRRVWDKPLPSCIAECGGQIHAATSGRILSPGYPAPYDNNLHCTWIIEADPGKTISLHFIVFDTEMAHDILKVWDGPVDSDILLKEWSGSALPEDIHSTFNSLTLQFDSDFFISKSGFSIQFSTSIAATCNDPGMPQNGTRYGDSREAGDTVTFQCDPGYQLQGQAKITCVQLNNRFFWQPDPPTCIAACGGNLTGPAGVILSPNYPQPYPPGKECDWRVKVNPDFVIALIFKSFNMEPSYDFLHIYEGEDSNSPLIGSYQGSQAPERIESSGNSLFLAFRSDASVGLSGFAIEFKEKPREACFDPGNIMNGTRVGTDFKLGSTITYQCDSGYKILDPSSITCVIGADGKPSWDQVLPSCNAPCGGQYTGSEGVVLSPNYPHNYTAGQICLYSITVPKEFVVFGQFAYFQTALNDLAELFDGTHAQARLLSSLSGSHSGETLPLATSNQILLRFSAKSGASARGFHFVYQAVPRTSDTQCSSVPEPRYGRRIGSEFSAGSIVRFECNPGYLLQGSTALHCQSVPNALAQWNDTIPSCVVPCSGNFTQRRGTILSPGYPEPYGNNLNCIWKIIVTEGSGIQIQVISFATEQNWDSLEIHDGGDVTAPRLGSFSGTTVPALLNSTSNQLYLHFQSDISVAAAGFHLEYKTVGLAACQEPALPSNSIKIGDRYMVNDVLSFQCEPGYTLQGRSHISCMPGTVRRWNYPSPLCIATCGGTLSTLGGVILSPGFPGSYPNNLDCTWRISLPIGYGAHIQFLNFSTEANHDFLEIQNGPYHTSPMIGQFSGTDLPAALLSTTHETLIHFYSDHSQNRQGFKLAYQAYELQNCPDPPPFQNGYMINSDYSVGQSVSFECYPGYILIGHPVLTCQHGINRNWNYPFPRCDAPCGYNVTSQNGTIYSPGFPDEYPILKDCIWLITVPPGHGVYINFTLLQTEAVNDYIAVWDGPDQNSPQLGVFSGNTALETAYSSTNQVLLKFHSDFSNGGFFVLNFHAFQLKKCQPPPAVPQAEMLTEDDDFEIGDFVKYQCHPGYTLVGTDILTCKLSSQLQFEGSLPTCEAQCPANEVRTGSSGVILSPGYPGNYFNSQTCSWSIKVEPNYNITIFVDTFQSEKQFDALEVFDGSSGQSPLLVVLSGNHTEQSNFTSRSNQLYLRWSTDHATSKKGFKIRYAAPYCSLTHPLKNGGILNRTAGAVGSKVHYFCKPGYRMVGHSNATCRRNPLGMYQWDSLTPLCQAVSCGIPESPGNGSFTGNEFTLDSKVVYECHEGFKLESSQQATAVCQEDGLWSNKGKPPTCKPVACPSIEAQLSEHVIWRLVSGSLNEYGAQVLLSCSPGYYLEGWRLLRCQANGTWNIGDERPSCRVISCGSLSFPPNGNKIGTLTVYGATAIFTCNTGYTLVGSHVRECLANGLWSGSETRCLAGHCGSPDPIVNGHISGDGFSYRDTVVYQCNPGFRLVGTSVRICLQDHKWSGQTPVCVPITCGHPGNPAHGFTNGSEFNLNDVVNFTCNTGYLLQGVSRAQCRSNGQWSSPLPTCRVVNCSDPGFVENAIRHGQQNFPESFEYGMSILYHCKKGFYLLGSSALTCMANGLWDRSLPKCLAISCGHPGVPANAVLTGELFTYGAVVHYSCRGSESLIGNDTRVCQEDSHWSGALPHCTGNNPGFCGDPGTPAHGSRLGDDFKTKSLLRFSCEMGHQLRGSPERTCLLNGSWSGLQPVCEAVSCGNPGTPTNGMIVSSDGILFSSSVIYACWEGYKTSGLMTRHCTANGTWTGTAPDCTIISCGDPGTLANGIQFGTDFTFNKTVSYQCNPGYVMEAVTSATIRCTKDGRWNPSKPVCKAVLCPQPPPVQNGTVEGSDFRWGSSISYSCMDGYQLSHSAILSCEGRGVWKGEIPQCLPVFCGDPGIPAEGRLSGKSFTYKSEVFFQCKSPFILVGSSRRVCQADGTWSGIQPTCIDPAHNTCPDPGTPHFGIQNSSRGYEVGSTVFFRCRKGYHIQGSTTRTCLANLTWSGIQTECIPHACRQPETPAHADVRAIDLPTFGYTLVYTCHPGFFLAGGSEHRTCKADMKWTGKSPVCKSKGVREVNETVTKTPVPSDVFFVNSLWKGYYEYLGKRQPATLTVDWFNATSSKVNATFSEASPVELKLTGIYKKEEAHLLLKAFQIKGQADIFVSKFENDNWGLDGYVSSGLERGGFTFQGDIHGKDFGKFKLERQDPLNPDQDSSSHYHGTSSGSVAAAILVPFFALILSGFAFYLYKHRTRPKVQYNGYAGHENSNGQASFENPMYDTNLKPTEAKAVRFDTTLNTVCTVV- SEQ ID NO:309 CSMD1之全部23個外顯子1-23按順序排列的多核苷酸序列 SEQ ID NO:310 來自CSMD1之全部23個外顯子1-23按順序排列的轉譯多肽序列 PTN 序列資訊SEQ ID NO: 311 PTN-NRG1融合物5’處之PTN外顯子4序列 CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAG SEQ ID NO: 312 PTN-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATT SEQ ID NO: 313 PTN-NRG1多核苷酸序列 CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATT SEQ ID NO: 314 PTN-NRG1多肽序列 RTGSLKRALHNAECQKTVTISKPCGKLTKPKPQALPPRLKEMKSQESAAGSKLVLRCETSSEYSSLR SEQ ID NO:315 PTN之外顯子1 AAGGGGAAATAAGGGACAAGAGAGACCCTCTCATATTGTTTTATATTATTTCATACTCAGAAAAGGAAAGAGAAGCCAAACAAAAGGCAGGTAACCCAGCGCCTAGGAACCAGACCCGAAACCAAGGAACCAGATCTGAAACCAGGCCTGGGCCTGCCTGACCTAAGCCTGGTAGTAAAAATTCCACCCCTGACCTGACCTGGCAACTGTTGTTATCTACAGATTCCAGACATTGTATGGAAGGACACTGTGAAACCTCCCGTTCTGTTCTGTTTCACTCTGACCATCGGTGCTCACAGCCCCTATCACGTACCCCCTGGCTTGCTCAGTCGATCACGACCCTCTCACGTGGACCCCCTTAGAGTTGTTAGCCCTTAAAAGGGACAGAAGTTGAGCACCTGAGGAGCTCAGATTTTAAGACGCTAGGCTGCTGATGCTCCCAGCTGATTAAAGCCACTCCCTTCACTATCTCGGTGTCTCCTGTCCGCGGCTCGTCCTGCTACATTTCTTGGTTCCCTGACCGGCAAGCGAG SEQ ID NO:316 PTN之外顯子2 AATGCAGGCTCAACAGTACCAGCAGCAGCGTCGAAAATTTGCAGCTGCCTTCTTGGCATTCATTTTCATACTGGCAGCTGTGGATACTGCTGAAGCAGGGAAGAAAGAGAAACCAG SEQ ID NO:317 PTN之外顯子3 AAAAAAAAGTGAAGAAGTCTGACTGTGGAGAATGGCAGTGGAGTGTGTGTGTGCCCACCAGTGGAGACTGTGGGCTGGGCACACGGGAGGGCACTCGGACTGGAGCTGAGTGCAAGCAAACCATGAAGACCCAGAGATGTAAGATCCCCTGCAACTGGAAGAAGCAATTTGGCG SEQ ID NO:318 PTN之外顯子4 CGGAGTGCAAATACCAGTTCCAGGCCTGGGGAGAATGTGACCTGAACACAGCCCTGAAGACCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAG SEQ ID NO:319 PTN之外顯子5 CAGAATCTAAGAAGAAGAAAAAGGAAGGCAAGAAACAGGAGAAGATGCTGGATTAAAAGATGTCACCTGTGGAACATAAAAAGGACATCAGCAAACAGGATCAGTTAACTATTGCATTTATATGTACCGTAGGCTTTGTATTCAAAAATTATCTATAGCTAAGTACACAATAAGCAAAAACAAAAAGAAAAGAAAATTTTTGTAGTAGCGTTTTTTAAATGTATACTATAGTACCAGTAGGGGCTTATAATAAAGGACTGTAATCTTATTTAGGAAGTTGACTTATAGTACATGATAAATGATAGACAATTGAGGTAAGTTTTTTGAAATTATGTGACATTTTACATTAAATTTTTTTTACATTTTTTGGGCAGCAATTTAAATGTTATGACTATGTAAACTACTTCTCTTGTTAGGTAATTTTTTTCACCTAGATTTTTTTCCCAATTGAGAAAAATATATACTAAACAAAATAGCAATAAAACATAATCACTCTATTTGAAGAAAATATCTTGTTTTCTGCCAATAGATTTTTTAAAATGTAGTCAGCAAAATGGGGGTGGGGAAGCAGAGCATGTCCTAGTTCAATGTTGACTTTTTTTTTTTTTAAAGAAAAGCATTAAGACATAAAATTCTTTCACTTTGGCAGAAGCATTTGTTTTCTTGATGAAATTATTTTTCCATCTGAGGAAAAAAATACTAGGAAAATAAATCAAGGTGATGCTGAAAAAAAAA SEQ ID NO:320 PTN之全部5個外顯子1-5按順序排列的多核苷酸序列 SEQ ID NO:321 PTN之外顯子2的轉譯多肽序列 MQAQQYQQQRRKFAAAFLAFIFILAAVDTAEAGKKEKP SEQ ID NO:322 PTN之外顯子3的轉譯多肽序列 KKVKKSDCGEWQWSVCVPTSGDCGLGTREGTRTGAECKQTMKTQRCKIPCNWKKQFG SEQ ID NO:323 PTN之外顯子4的轉譯多肽序列 ECKYQFQAWGECDLNTALKTRTGSLKRALHNAECQKTVTISKPCGKLTKPKPQ SEQ ID NO:324 PTN之外顯子5 ESKKKKKEGKKQEKMLD SEQ ID NO:325 全蛋白質序列 MQAQQYQQQRRKFAAAFLAFIFILAAVDTAEAGKKEKPEKKVKKSDCGEWQWSVCVPTSGDCGLGTREGTRTGAECKQTMKTQRCKIPCNWKKQFGAECKYQFQAWGECDLNTALKTRTGSLKRALHNAECQKTVTISKPCGKLTKPKPQAESKKKKKEGKKQEKMLD SEQ ID NO:326 PTN之全部4個外顯子1-4按順序排列的多核苷酸序列 SEQ ID NO:327 PTN之全部3個外顯子2-4按順序排列的轉譯多肽序列 ST14 序列資訊SEQ ID NO: 328 ST14-NRG1融合物5’處之ST14外顯子11序列 CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACC SEQ ID NO: 329 ST14-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO: 330 ST14-NRG1多核苷酸序列 CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACCCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO: 331 ST14-NRG1多肽序列 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPTSTSTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO:332 ST14之外顯子1 GTGAGAGCGGAGCTGCAGCCGGAGAAAGAGGAAGAGGGAGAGAGAGCGCGCCAGGGCGAGGGCACCGCCGCCGGTCGGGCGCGCTGGGCCTGCCCGGAATCCCGCCGCCTGCGCCCCGCGCCCCGCGCCCTGCGGGCCATGGGAGCCGGCCGCCGGCAGGGACGACGCCTGTGAGACCCGCGAGCGGCCTCGGGGACCATGGGGAGCGATCGGGCCCGCAAGGGCGGAGGGGGCCCGAAGGACTTCGGCGCGGGACTCAAGTACAACTCCCGGCACGAG SEQ ID NO:333 ST14之外顯子2 AAAGTGAATGGCTTGGAGGAAGGCGTGGAGTTCCTGCCAGTCAACAACGTCAAGAAGGTGGAAAAGCATGGCCCGGGGCGCTGGGTGGTGCTGGCAGCCGTGCTGATCGGCCTCCTCTTGGTCTTGCTGGGGATCGGCTTCCTGGTGTGGCATTTGCAGT SEQ ID NO:334 ST14之外顯子3 ACCGGGACGTGCGTGTCCAGAAGGTCTTCAATGGCTACATGAGGATCACAAATGAGAATTTTGTGGATGCCTACGAGAACTCCAACTCCACTGAGTTTGTAAGCCTGGCCAGCAAGGTGAAGGACGCG SEQ ID NO:335 ST14之外顯子4 CTGAAGCTGCTGTACAGCGGAGTCCCATTCCTGGGCCCCTACCACAAGGAGTCGGCTGTGACGGCCTTCAG SEQ ID NO:336 ST14之外顯子5 CGAGGGCAGCGTCATCGCCTACTACTGGTCTGAGTTCAGCATCCCGCAGCACCTGGTGGAGGAGGCCGAGCGCGTCATGGCCGAGGAGCGCGTAGTCATGCTGCCCCCGCGGGCGCGCTCCCTGAAGTCCTTTGTGGTCACCTCAGTGGTGGCTTTCC SEQ ID NO:337 ST14之外顯子6 CCACGGACTCCAAAACAGTACAGAGGACCCAGGACA SEQ ID NO:338 ST14之外顯子7 ACAGCTGCAGCTTTGGCCTGCACGCCCGCGGTGTGGAGCTGATGCGCTTCACCACGCCCGGCTTCCCTGACAGCCCCTACCCCGCTCATGCCCGCTGCCAGTGGGCCCTGCGGGGGGACGCCGACTCAGTGCTGAGCCTCACCTTCCGCAGCTTTGACCTTGCGTCCTGCGACGAGCGCGGCAGCGACCTGGTGACGGTGTACAACACCCTGAGCCCCATGGAGCCCCACGCCCTGGTGCA SEQ ID NO:339 ST14之外顯子8 GTTGTGTGGCACCTACCCTCCCTCCTACAACCTGACCTTCCACTCCTCCCAGAACGTCCTGCTCATCACACTGATAACCAACACTGAGCGGCGGCATCCCGGCTTTGAGGCCACCTTCTTCCAGCTGCCTAGGATGAGCA SEQ ID NO:340 ST14之外顯子9 GCTGTGGAGGCCGCTTACGTAAAGCCCAGGGGACATTCAACAGCCCCTACTACCCAGGCCACTACCCACCCAACATTGACTGCACATGGAACATTGAG SEQ ID NO:341 ST14之外顯子10 GTGCCCAACAACCAGCATGTGAAGGTGCGCTTCAAATTCTTCTACCTGCTGGAGCCCGGCGTGCCTGCGGGCACCTGCCCCAAGGACTACGTGGAGATCAACGGGGAGAA SEQ ID NO:342 ST14之外顯子11 ATACTGCGGAGAGAGGTCCCAGTTCGTCGTCACCAGCAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACC SEQ ID NO:343 ST14之外顯子12 CATGCCCGGGGCAGTTCACGTGCCGCACGGGGCGGTGTATCCGGAAGGAGCTGCGCTGTGATGGCTGGGCCGACTGCACCGACCACAGCGATGAGCTCAACTGCA SEQ ID NO:344 ST14之外顯子13 GTTGCGACGCCGGCCACCAGTTCACGTGCAAGAACAAGTTCTGCAAGCCCCTCTTCTGGGTCTGCGACAGTGTGAACGACTGCGGAGACAACAGCGACGAGCAGGGGTGCA SEQ ID NO:345 ST14之外顯子14 GTTGTCCGGCCCAGACCTTCAGGTGTTCCAATGGGAAGTGCCTCTCGAAAAGCCAGCAGTGCAATGGGAAGGACGACTGTGGGGACGGGTCCGACGAGGCCTCCTGCCCCAAGG SEQ ID NO:346 ST14之外顯子15 TGAACGTCGTCACTTGTACCAAACACACCTACCGCTGCCTCAATGGGCTCTGCTTGAGCAAGGGCAACCCTGAGTGTGACGGGAAGGAGGACTGTAGCGACGGCTCAGATGAGAAGGACTGCG SEQ ID NO:347 ST14之外顯子16 ACTGTGGGCTGCGGTCATTCACGAGACAGGCTCGTGTTGTTGGGGGCACGGATGCGGATGAGGGCGAGTGGCCCTGGCAGGTAAGCCTGCATGCTCTGGGCCAGGGCCACATCTGCGGTGCTTCCCTCATCTCTCCCAACTGGCTGGTCTCTGCCGCACACTGCTACATCGATGACAGAGGATTCAG SEQ ID NO:348 ST14之外顯子17 GTACTCAGACCCCACGCAGTGGACGGCCTTCCTGGGCTTGCACGACCAGAGCCAGCGCAGCGCCCCTGGGGTGCAGGAGCGCAGGCTCAAGCGCATCATCTCCCACCCCTTCTTCAATGACTTCACCTTCGACTATGACATCGCGCTGCTGGAGCTGGAGAAACCGGCAGAGTACAGCTCCATGGTGCGGCCCATCTGCCTGCCGGACGCCTCCCATGTCTTCCCTGCCGGCAAGGCCATCTGGGTCACGGGCTGGGGACACACCCAGTATGGAG SEQ ID NO:349 ST14之外顯子18 GCACTGGCGCGCTGATCCTGCAAAAGGGTGAGATCCGCGTCATCAACCAGACCACCTGCGAGAACCTCCTGCCGCAGCAGATCACGCCGCGCATGATGTGCGTGGGCTTCCTCAGCGGCGGCGTGGACTCCTGCCAG SEQ ID NO:350 ST14之外顯子19 GGTGATTCCGGGGGACCCCTGTCCAGCGTGGAGGCGGATGGGCGGATCTTCCAGGCCGGTGTGGTGAGCTGGGGAGACGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACACAAGGCTCCCTCTGTTTCGGGACTGGATCAAAGAGAACACTGGGGTATAGGGGCCGGGGCCACCCAAATGTGTACACCTGCGGGGCCACCCATCGTCCACCCCAGTGTGCACGCCTGCAGGCTGGAGACTGGACCGCTGACTGCACCAGCGCCCCCAGAACATACACTGTGAACTCAATCTCCAGGGCTCCAAATCTGCCTAGAAAACCTCTCGCTTCCTCAGCCTCCAAAGTGGAGCTGGGAGGTAGAAGGGGAGGACACTGGTGGTTCTACTGACCCAACTGGGGGCAAAGGTTTGAAGACACAGCCTCCCCCGCCAGCCCCAAGCTGGGCCGAGGCGCGTTTGTGCATATCTGCCTCCCCTGTCTCTAAGGAGCAGCGGGAACGGAGCTTCGGGGCCTCCTCAGTGAAGGTGGTGGGGCTGCCGGATCTGGGCTGTGGGGCCCTTGGGCCACGCTCTTGAGGAAGCCCAGGCTCGGAGGACCCTGGAAAACAGACGGGTCTGAGACTGAAATTGTTTTACCAGCTCCCAGGGTGGACTTCAGTGTGTGTATTTGTGTAAATGAGTAAAACATTTTATTTCTTTTTA SEQ ID NO:351 全mRNA多核苷酸序列 SEQ ID NO:352 ST14之外顯子1的轉譯多肽序列 MGSDRARKGGGGPKDFGAGLKYNSRHE SEQ ID NO:353 ST14之外顯子2的轉譯多肽序列 KVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLLLVLLGIGFLVWHLQ SEQ ID NO:354 ST14之外顯子3的轉譯多肽序列 RDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDA SEQ ID NO:355 ST14之外顯子4的轉譯多肽序列 LKLLYSGVPFLGPYHKESAVTAF SEQ ID NO:356 ST14之外顯子5的轉譯多肽序列 EGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPRARSLKSFVVTSVVAF SEQ ID NO:357 ST14之外顯子6的轉譯多肽序列 TDSKTVQRTQD SEQ ID NO:358 ST14之外顯子7的轉譯多肽序列 SCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALV SEQ ID NO:359 ST14之外顯子8的轉譯多肽序列 LCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMS SEQ ID NO:360 ST14之外顯子9的轉譯多肽序列 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIE SEQ ID NO:361 ST14之外顯子10的轉譯多肽序列 VPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGE SEQ ID NO:362 ST14之外顯子11的轉譯多肽序列 YCGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSD SEQ ID NO:363 ST14之外顯子12的轉譯多肽序列NM_021978.4 CPGQFTCRTGRCIRKELRCDGWADCTDHSDELNC SEQ ID NO:364 ST14之外顯子13的轉譯多肽序列 CDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC SEQ ID NO:365 ST14之外顯子14的轉譯多肽序列 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPK SEQ ID NO:366 ST14之外顯子15的轉譯多肽序列 NVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC SEQ ID NO:367 ST14之外顯子16的轉譯多肽序列 CGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGF SEQ ID NO:368 ST14之外顯子17的轉譯多肽序列 YSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYG SEQ ID NO:369 ST14之外顯子18的轉譯多肽序列 TGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQ SEQ ID NO:370 ST14之外顯子19的轉譯多肽序列 GDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV SEQ ID NO:371 全蛋白質序列 MGSDRARKGGGGPKDFGAGLKYNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLLLVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPRARSLKSFVVTSVVAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV SEQ ID NO:372 ST14之全部11個外顯子1-11按順序排列的多核苷酸序列 SEQ ID NO:373 ST14之全部11個外顯子1-11按順序排列的轉譯多肽序列 THBS1 序列資訊SEQ ID NO: 374 THBS1-NRG1融合物5’處之THBS1外顯子9序列 ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCA SEQ ID NO: 375 THBS1-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC SEQ ID NO: 376 THBS1-NRG1多核苷酸序列 ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC SEQ ID NO: 377 THBS1-NRG1多肽序列 PCEGEARETKACKKDACPTTSTSTTGTSHLVKCAEKEKTFCVNGGECF SEQ ID NO:378 THBS1之外顯子1 AGCCGCTGCGCCCGAGCTGGCCTGCGAGTTCAGGGCTCCTGTCGCTCTCCAGGAGCAACCTCTACTCCGGACGCACAGGCATTCCCCGCGCCCCTCCAGCCCTCGCCGCCCTCGCCACCGCTCCCGGCCGCCGCGCTCCGGTACACACAG SEQ ID NO:379 THBS1之外顯子2 GATCCCTGCTGGGCACCAACAGCTCCACCATGGGGCTGGCCTGGGGACTAGGCGTCCTGTTCCTGATGCATGTGTGTGGCACCAACCGCATTCCAG SEQ ID NO:380 THBS1之外顯子3 AGTCTGGCGGAGACAACAGCGTGTTTGACATCTTTGAACTCACCGGGGCCGCCCGCAAGGGGTCTGGGCGCCGACTGGTGAAGGGCCCCGACCCTTCCAGCCCAGCTTTCCGCATCGAGGATGCCAACCTGATCCCCCCTGTGCCTGATGACAAGTTCCAAGACCTGGTGGATGCTGTGCGGGCAGAAAAGGGTTTCCTCCTTCTGGCATCCCTGAGGCAGATGAAGAAGACCCGGGGCACGCTGCTGGCCCTGGAGCGGAAAGACCACTCTGGCCAGGTCTTCAGCGTGGTGTCCAATGGCAAGGCGGGCACCCTGGACCTCAGCCTGACCGTCCAAGGAAAGCAGCACGTGGTGTCTGTGGAAGAAGCTCTCCTGGCAACCGGCCAGTGGAAGAGCATCACCCTGTTTGTGCAGGAAGACAGGGCCCAGCTGTACATCGACTGTGAAAAGATGGAGAATGCTGAGTTGGACGTCCCCATCCAAAGCGTCTTCACCAGAGACCTGGCCAGCATCGCCAGACTCCGCATCGCAAAGGGGGGCGTCAATGACAATTTCCAG SEQ ID NO:381 THBS1之外顯子4 GGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAAGGCTGCTCCAGCT SEQ ID NO:382 THBS1之外顯子5 CTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGCACTAACTACATTGGCCACAAGACAAAGGACTTGCAAGCCATCTGCGGCATCTCCTGTGATGAGCTGTCCAGCATGGTCCTGGAACTCAGGGGCCTGCGCACCATTGTGACCACGCTGCAGGACAGCATCCGCAAAGTG SEQ ID NO:383 THBS1之外顯子6 ACTGAAGAGAACAAAGAGTTGGCCAATGAGCTGAGGCGGCCTCCCCTATGCTATCACAACGGAGTTCAGTACAGAAATAACGAGGAATGGACTGTTGATAGCTGCACTGAGTGTCACTGTCAG SEQ ID NO:384 THBS1之外顯子7 AACTCAGTTACCATCTGCAAAAAGGTGTCCTGCCCCATCATGCCCTGCTCCAATGCCACAGTTCCTGATGGAGAATGCTGTCCTCGCTGTTGGC SEQ ID NO:385 THBS1之外顯子8 CCAGCGACTCTGCGGACGATGGCTGGTCTCCATGGTCCGAGTGGACCTCCTGTTCTACGAGCTGTGGCAATGGAATTCAGCAGCGCGGCCGCTCCTGCGATAGCCTCAACAACCGATGTGAGGGCTCCTCGGTCCAGACACGGACCTGCCACATTCAGGAGTGTGACAAGAGAT SEQ ID NO:386 THBS1之外顯子9 TTAAACAGGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGTTCTGTGACATGTGGTGATGGTGTGATCACAAGGATCCGGCTCTGCAACTCTCCCAGCCCCCAGATGAACGGGAAACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCA SEQ ID NO:387 THBS1之全部13個外顯子10-22按順序排列的多核苷酸序列 TTAAACAGGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGTTCTGTGACATGTGGTGATGGTGTGATCACAAGGATCCGGCTCTGCAACTCTCCCAGCCCCCAGATGAACGGGAAACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCATCAATGGAGGCTGGGGTCCTTGGTCACCATGGGACATCTGTTCTGTCACCTGTGGAGGAGGGGTACAGAAACGTAGTCGTCTCTGCAACAACCCCACACCCCAGTTTGGAGGCAAGGACTGCGTTGGTGATGTAACAGAAAACCAGATCTGCAACAAGCAGGACTGTCCAATTGATGGATGCCTGTCCAATCCCTGCTTTGCCGGCGTGAAGTGTACTAGCTACCCTGATGGCAGCTGGAAATGTGGTGCTTGTCCCCCTGGTTACAGTGGAAATGGCATCCAGTGCACAGATGTTGATGAGTGCAAAGAAGTGCCTGATGCCTGCTTCAACCACAATGGAGAGCACCGGTGTGAGAACACGGACCCCGGCTACAACTGCCTGCCCTGCCCCCCACGCTTCACCGGCTCACAGCCCTTCGGCCAGGGTGTCGAACATGCCACGGCCAACAAACAGGTGTGCAAGCCCCGTAACCCCTGCACGGATGGGACCCACGACTGCAACAAGAACGCCAAGTGCAACTACCTGGGCCACTATAGCGACCCCATGTACCGCTGCGAGTGCAAGCCTGGCTACGCTGGCAATGGCATCATCTGCGGGGAGGACACAGACCTGGATGGCTGGCCCAATGAGAACCTGGTGTGCGTGGCCAATGCGACTTACCACTGCAAAAAGGATAATTGCCCCAACCTTCCCAACTCAGGGCAGGAAGACTATGACAAGGATGGAATTGGTGATGCCTGTGATGATGACGATGACAATGATAAAATTCCAGATGACAGGGACAACTGTCCATTCCATTACAACCCAGCTCAGTATGACTATGACAGAGATGATGTGGGAGACCGCTGTGACAACTGTCCCTACAACCACAACCCAGATCAGGCAGACACAGACAACAATGGGGAAGGAGACGCCTGTGCTGCAGACATTGATGGAGACGGTATCCTCAATGAACGGGACAACTGCCAGTACGTCTACAATGTGGACCAGAGAGACACTGATATGGATGGGGTTGGAGATCAGTGTGACAATTGCCCCTTGGAACACAATCCGGATCAGCTGGACTCTGACTCAGACCGCATTGGAGATACCTGTGACAACAATCAGGATATTGATGAAGATGGCCACCAGAACAATCTGGACAACTGTCCCTATGTGCCCAATGCCAACCAGGCTGACCATGACAAAGATGGCAAGGGAGATGCCTGTGACCACGATGATGACAACGATGGCATTCCTGATGACAAGGACAACTGCAGACTCGTGCCCAATCCCGACCAGAAGGACTCTGACGGCGATGGTCGAGGTGATGCCTGCAAAGATGATTTTGACCATGACAGTGTGCCAGACATCGATGACATCTGTCCTGAGAATGTTGACATCAGTGAGACCGATTTCCGCCGATTCCAGATGATTCCTCTGGACCCCAAAGGGACATCCCAAAATGACCCTAACTGGGTTGTACGCCATCAGGGTAAAGAACTCGTCCAGACTGTCAACTGTGATCCTGGACTCGCTGTAGGTTATGATGAGTTTAATGCTGTGGACTTCAGTGGCACCTTCTTCATCAACACCGAAAGGGACGATGACTATGCTGGATTTGTCTTTGGCTACCAGTCCAGCAGCCGCTTTTATGTTGTGATGTGGAAGCAAGTCACCCAGTCCTACTGGGACACCAACCCCACGAGGGCTCAGGGATACTCGGGCCTTTCTGTGAAAGTTGTAAACTCCACCACAGGGCCTGGCGAGCACCTGCGGAACGCCCTGTGGCACACAGGAAACACCCCTGGCCAGGTGCGCACCCTGTGGCATGACCCTCGTCACATAGGCTGGAAAGATTTCACCGCCTACAGATGGCGTCTCAGCCACAGGCCAAAGACGGGTTTCATTAGAGTGGTGATGTATGAAGGGAAGAAAATCATGGCTGACTCAGGACCCATCTATGATAAAACCTATGCTGGTGGTAGACTAGGGTTGTTTGTCTTCTCTCAAGAAATGGTGTTCTTCTCTGACCTGAAATACGAATGTAGAGATCCCTAATCATCAAATTGTTGATTGAAAGACTGATCATAAACCAATGCTGGTATTGCACCTTCTGGAACTATGGGCTTGAGAAAACCCCCAGGATCACTTCTCCTTGGCTTCCTTCTTTTCTGTGCTTGCATCAGTGTGGACTCCTAGAACGTGCGACCTGCCTCAAGAAAATGCAGTTTTCAAAAACAGACTCAGCATTCAGCCTCCAATGAATAAGACATCTTCCAAGCATATAAACAATTGCTTTGGTTTCCTTTTGAAAAAGCATCTACTTGCTTCAGTTGGGAAGGTGCCCATTCCACTCTGCCTTTGTCACAGAGCAGGGTGCTATTGTGAGGCCATCTCTGAGCAGTGGACTCAAAAGCATTTTCAGGCATGTCAGAGAAGGGAGGACTCACTAGAATTAGCAAACAAAACCACCCTGACATCCTCCTTCAGGAACACGGGGAGCAGAGGCCAAAGCACTAAGGGGAGGGCGCATACCCGAGACGATTGTATGAAGAAAATATGGAGGAACTGTTACATGTTCGGTACTAAGTCATTTTCAGGGGATTGAAAGACTATTGCTGGATTTCATGATGCTGACTGGCGTTAGCTGATTAACCCATGTAAATAGGCACTTAAATAGAAGCAGGAAAGGGAGACAAAGACTGGCTTCTGGACTTCCTCCCTGATCCCCACCCTTACTCATCACCTGCAGTGGCCAGAATTAGGGAATCAGAATCAAACCAGTGTAAGGCAGTGCTGGCTGCCATTGCCTGGTCACATTGAAATTGGTGGCTTCATTCTAGATGTAGCTTGTGCAGATGTAGCAGGAAAATAGGAAAACCTACCATCTCAGTGAGCACCAGCTGCCTCCCAAAGGAGGGGCAGCCGTGCTTATATTTTTATGGTTACAATGGCACAAAATTATTATCAACCTAACTAAAACATTCCTTTTCTCTTTTTTCCTGAATTATCATGGAGTTTTCTAATTCTCTCTTTTGGAATGTAGATTTTTTTTAAATGCTTTACGATGTAAAATATTTATTTTTTACTTATTCTGGAAGATCTGGCTGAAGGATTATTCATGGAACAGGAAGAAGCGTAAAGACTATCCATGTCATCTTTGTTGAGAGTCTTCGTGACTGTAAGATTGTAAATACAGATTATTTATTAACTCTGTTCTGCCTGGAAATTTAGGCTTCATACGGAAAGTGTTTGAGAGCAAGTAGTTGACATTTATCAGCAAATCTCTTGCAAGAACAGCACAAGGAAAATCAGTCTAATAAGCTGCTCTGCCCCTTGTGCTCAGAGTGGATGTTATGGGATTCTTTTTTTCTCTGTTTTATCTTTTCAAGTGGAATTAGTTGGTTATCCATTTGCAAATGTTTTAAATTGCAAAGAAAGCCATGAGGTCTTCAATACTGTTTTACCCCATCCCTTGTGCATATTTCCAGGGAGAAGGAAAGCATATACACTTTTTTCTTTCATTTTTCCAAAAGAGAAAAAAATGACAAAAGGTGAAACTTACATACAAATATTACCTCATTTGTTGTGTGACTGAGTAAAGAATTTTTGGATCAAGCGGAAAGAGTTTAAGTGTCTAACAAACTTAAAGCTACTGTAGTACCTAAAAAGTCAGTGTTGTACATAGCATAAAAACTCTGCAGAGAAGTATTCCCAATAAGGAAATAGCATTGAAATGTTAAATACAATTTCTGAAAGTTATGTTTTTTTTCTATCATCTGGTATACCATTGCTTTATTTTTATAAATTATTTTCTCATTGCCATTGGAATAGATATCTCAGATTGTGTAGATATGCTATTTAAATAATTTATCAGGAAATACTGCCTGTAGAGTTAGTATTTCTATTTTTATATAATGTTTGCACACTGAATTGAAGAATTGTTGGTTTTTTCTTTTTTTTGTTTTGTTTTTTTTTTTTTTTTTTTTTGCTTTTGACCTCCCATTTTTACTATTTGCCAATACCTTTTTCTAGGAATGTGCTTTTTTTTGTACACATTTTTATCCATTTTACATTCTAAAGCAGTGTAAGTTGTATATTACTGTTTCTTATGTACAAGGAACAACAATAAATCATATGGAAATTTATATTTATACTTACTGTATCCATGCTTATTTGTTCTCTACTGGCTTTATGTCATGAAGTATATGCGTAAATACCATTCATAAATCAATATAGCATATACAAAAATAAATTACAGTAAGTCATAGCAACATTCACAGTTTGTATGTGATTGAGAAAGACTGAGTTGCTCAGGCCTAGGCTTAGAATTTGCTGCGTTTGTGGAATAAAAGAACAAAATGATACATTAGCCTGCCATATCAA SEQ ID NO:388 全mRNA多核苷酸序列 SEQ ID NO:389 THBS1之外顯子2的轉譯多肽序列 MGLAWGLGVLFLMHVCGTNRIP SEQ ID NO:390 THBS1之外顯子3的轉譯多肽序列 SGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQ SEQ ID NO:391 THBS1之外顯子4的轉譯多肽序列 GVLQNVRFVFGTTPEDILRNKGCSS SEQ ID NO:392 THBS1之外顯子5的轉譯多肽序列 TSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRKV SEQ ID NO:393 THBS1之外顯子6的轉譯多肽序列 TEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQ SEQ ID NO:394 THBS1之外顯子7的轉譯多肽序列 NSVTICKKVSCPIMPCSNATVPDGECCPRCW SEQ ID NO:395 THBS1之外顯子8的轉譯多肽序列 SDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKR SEQ ID NO:396 THBS1之外顯子9的轉譯多肽序列 KQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACP SEQ ID NO:397 THBS1之全部13個外顯子10-22按順序排列的轉譯多肽序列 KQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYNHNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP SEQ ID NO:398 全蛋白質序列 MGLAWGLGVLFLMHVCGTNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSSTSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRKVTEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQNSVTICKKVSCPIMPCSNATVPDGECCPRCWPSDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYNHNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP SEQ ID NO:399 THBS1之全部9個外顯子1-9按順序排列的多核苷酸序列 SEQ ID NO:400 THBS1之全部8個外顯子2-9按順序排列的轉譯多肽序列 AGRN 序列資訊SEQ ID NO: 401 AGRN-NRG1融合物5’處之AGRN外顯子12序列 GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAG SEQ ID NO: 402 AGRN-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC SEQ ID NO:403 AGRN-NRG1多核苷酸序列 GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC SEQ ID NO:404 AGRN-NRG1多肽序列 VCGSDGVTYSTECELKKARCESQRGLYVAAQGACRATSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKD SEQ ID NO:405 AGRN之外顯子1 AGTCCCGTCCCCGGCGCGGCCCGCGCGCTCCTCCGCCGCCTCTCGCCTGCGCCATGGCCGGCCGGTCCCACCCGGGCCCGCTGCGGCCGCTGCTGCCGCTCCTTGTGGTGGCCGCGTGCGTCCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAGGCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGCACACGTACTCCTGCAAG SEQ ID NO:406 AGRN之外顯子2 GTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGCCCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCCCTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTGCACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCATGCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAG SEQ ID NO:407 AGRN之外顯子3 ATAAACCCGGGACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATG SEQ ID NO:408 AGRN之外顯子4 CGTGCCGGGGAATGCTGTGCGGCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTGCAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACCTACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCCTGCTCAGCCGCGGGCCGTGCG SEQ ID NO:409 AGRN之外顯子5 GCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTTCGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCGACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCGGCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTTCGACGGCCCTTGTG SEQ ID NO:410 AGRN之外顯子6 ACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGTGTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCCGGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGGCCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAGGGTCGAG SEQ ID NO:411 AGRN之外顯子7 ACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTGGCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTGTGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGGCAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTG SEQ ID NO:412 AGRN之外顯子8 ACCAGGCCCCGTCCCCATGCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGCGTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGCGTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGATCCAGGTGGCGCGCAAAGGACCCTGTG SEQ ID NO:413 AGRN之外顯子9 ACCGCTGCGGGCAGTGCCGCTTTGGAGCCCTGTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAGCCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACGCCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTG SEQ ID NO:414 AGRN之外顯子10 AGACCTGTGGAGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGGTGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTGCCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGCAGGGCCGTGCGAGCAGG SEQ ID NO:415 AGRN之外顯子11 CCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGTGACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGGACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCG SEQ ID NO:416 AGRN之外顯子12 GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAG SEQ ID NO:417 AGRN之全部27個外顯子13-39按順序排列的多核苷酸序列 GCCCCACCTTCGCCCCGCTGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAATATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAACCCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCCCAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGGCATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCCGTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTGGACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGCTGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGCGTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCAACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAAGACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAGGAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAGGGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAGCAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCACGACCTCGGACCACTGCCAGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAGCGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAGCCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCTACAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTGGACGCAGAGGGCTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAGGGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGGAGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAAGAAGGATTTTCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCCATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGGCCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCTGCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCCACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGCCGTCCTCTGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACACAAGCCAGCCCGTTGCCAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCCGTGTGCCCGGACGTCGGCCCCCGGCCCCCCAGCAGCCTCCAAAGCCCTGTGACTCACAGCCCTGCTTCCACGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTGCCCGGCAGGCAGGGGAGGCGCCGTCTGTGAGAAGGTGCTTGGCGCCCCTGTGCCGGCCTTCGAGGGCCGCTCCTTCCTGGCCTTCCCCACTCTCCGCGCCTACCACACGCTGCGCCTGGCACTGGAATTCCGGGCGCTGGAGCCTCAGGGGCTGCTGCTGTACAATGGCAACGCCCGGGGCAAGGACTTCCTGGCATTGGCGCTGCTAGATGGCCGCGTGCAGCTCAGGTTTGACACAGGTTCGGGGCCGGCGGTGCTGACCAGTGCCGTGCCGGTAGAGCCGGGCCAGTGGCACCGCCTGGAGCTGTCCCGGCACTGGCGCCGGGGCACCCTCTCGGTGGATGGTGAGACCCCTGTTCTGGGCGAGAGTCCCAGTGGCACCGACGGCCTCAACCTGGACACAGACCTCTTTGTGGGCGGCGTACCCGAGGACCAGGCTGCCGTGGCGCTGGAGCGGACCTTCGTGGGCGCCGGCCTGAGGGGGTGCATCCGTTTGCTGGACGTCAACAACCAGCGCCTGGAGCTTGGCATTGGGCCGGGGGCTGCCACCCGAGGCTCTGGCGTGGGCGAGTGCGGGGACCACCCCTGCCTGCCCAACCCCTGCCATGGCGGGGCCCCATGCCAGAACCTGGAGGCTGGAAGGTTCCATTGCCAGTGCCCGCCCGGCCGCGTCGGACCAACCTGTGCCGATGAGAAGAGCCCCTGCCAGCCCAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGAGGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCTCGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGAGCTGAGAGGCCTGCACACCTTTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGTCGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACGGCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCCGCTACGACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTGGACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGCCCCCGTGTGTTGGGGGAGTCCCCGAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTCTGGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTGCCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACGCTGTGACCGAGAGCGAACTGGCCAATGAGATCCCCGTCCCCGAAACTCTGGATTCCGGGGCCCTTCACAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTTGCGGGACGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCTGTAATTATTTTCTATTTTTGTAAACTTGTTGCTTTTTGATATGATTTTCTTGCCTGAGTGTTGGCCGGAGGGACTGCTGGCCCGGCCTCCCTTCCGTCCAGGCAGCCGTGCTGCAGACAGACCTAGTGCCGAGGGATGGACAGGCGAGGTGGCAGCGTGGAGGGCTCGGCGTGGATGGCAGCCTCAGGACACACACCCCTGCCTCAAGGTGCTGAGCCCCCGCCTTGCACTGCGCCTGCCCCACGGTGTCCCCGCCGGGAAGCAGCCCCGGCTCCTGAATCACCCTCGCTCCGTCAGGCGGGACTCGTGTCCCAGAGAGGAAGGGGCTGCTGAGGTCTGATGGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGCCCCCATTTGAGCTGCTCCTTCCTGTGTGTGCTCTGGGCCCTGCCTCGGCCTCCTGCGCCAATACTGTGACTTCCAAACAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCTGCCTGCGCCAGCCCCAGGCTGCTGAGGAGCAGAGGCCAGACCAGGGCCGATCTGGGTGTCCTGACCCTCAGCTGGCCCTGCCCAGCCACCCTGGACGTGACCGTATCCCTCTGCCACACCCCAGGCCCTGCGAGGGGCTATCGAGAGGAGCTCACTGTGGGATGGGGTTGACCTCTGCCGCCTGCCTGGGTATCTGGGCCTGGCCATGGCTGTGTTCTTCATGTGTTGATTTTATTTGACCCCTGGAGTGGTGGGTCTCATCTTTCCCATCTCGCCTGAGAGCGGCTGAGGGCTGCCTCACTGCAAATCCTCCCCACAGCGTCAGTGAAAGTCGTCCTTGTCTCAGAATGACCAGGGGCCAGCCAGTGTCTGACCAAGGTCAAGGGGCAGGTGCAGAGGTGGCAGGGATGGCTCCGAAGCCAGAAATGCCTTAAACTGCAACGTCCCGTCCCTTCCCCACCCCCATCCCATCCCCACCCCCAGCCCCAGCCCAGTCCTCCTAGGAGCAGGACCCGATGAAGCGGGCGGCGGTGGGGCTGGGTGCCGTGTTACTAACTCTAGTATGTTTCTGTGTCAATCGCTGTGAAATAAAGTCTGAAAACTTTAAAA SEQ ID NO:418 全mRNA多核苷酸序列 SEQ ID NO:419 AGRN之外顯子1的轉譯多肽序列 MAGRSHPGPLRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEILNVDPVQHTYSCK SEQ ID NO:420 AGRN之外顯子2的轉譯多肽序列 VRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDTRIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVE SEQ ID NO:421 AGRN之外顯子3的轉譯多肽序列 KPGTHFTPVPPTPPD SEQ ID NO:422 AGRN之外顯子4的轉譯多肽序列 CRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPC SEQ ID NO:423 AGRN之外顯子5的轉譯多肽序列 SRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPC SEQ ID NO:424 AGRN之外顯子6的轉譯多肽序列 PCQGALPDPSRSCRVNPRTRRPEMLLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGR SEQ ID NO:425 AGRN之外顯子7的轉譯多肽序列 QCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGPC SEQ ID NO:426 AGRN之外顯子8的轉譯多肽序列 QAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEATACTLGREIQVARKGPC SEQ ID NO:427 AGRN之外顯子9的轉譯多肽序列 RCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPC SEQ ID NO:428 AGRN之外顯子10的轉譯多肽序列 TCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQ SEQ ID NO:429 AGRN之外顯子11的轉譯多肽序列 ECGSGGSGSGEDGDCEQELCRQRGGIWDEDSEDGPCVCDFSCQSVPGSP SEQ ID NO:430 AGRN之外顯子12的轉譯多肽序列 VCGSDGVTYSTECELKKARCESQRGLYVAAQGACR SEQ ID NO:431 AGRN之全部27個外顯子12-39按順序排列的轉譯多肽序列 VCGSDGVTYSTECELKKARCESQRGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPATGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCPMLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGPPVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMADPKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAFRAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATARATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPPKPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTLRAYHTLRLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSAVPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAAVALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECGDHPCLPNPCHGGAPCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESELANEIPVPETLDSGALHSEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP SEQ ID NO:432 全蛋白質序列 MAGRSHPGPLRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEILNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDTRIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPDACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGPCDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEATACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGGIWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPATGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCPMLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGPPVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMADPKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAFRAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATARATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPPKPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTLRAYHTLRLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSAVPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAAVALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECGDHPCLPNPCHGGAPCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESELANEIPVPETLDSGALHSEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP SEQ ID NO:433 AGRN之全部12個外顯子1-12按順序排列的多核苷酸序列 SEQ ID NO:434 AGRN之全部12個外顯子1-12按順序排列的轉譯多肽序列 PVALB 序列資訊SEQ ID NO: 435 PVALB-NRG1融合物5’處之PVAPB外顯子4序列 TAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACG SEQ ID NO: 436 PVALB-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGATACTTG SEQ ID NO:437 PVALB-NRG1多核苷酸序列 TAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGATACTTG SEQ ID NO:438 PVALB-NRG1多肽序列 KGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDATSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYL SEQ ID NO:439 PVALB之外顯子1 ACTTCCCGACAGGACTTCCCACCAGCCCAGCCTTTCAGTGCAGGCTCCAGCCCTCCACCCCCACCCGAG SEQ ID NO:440 PVALB之外顯子2 TTGCAGGATGTCGATGACAGACTTGCTGAACGCTGAGGACATCAAGAAGGCGGTGGGAGCCTTTAGCG SEQ ID NO:441 PVALB之外顯子3 CTACCGACTCCTTCGACCACAAAAAGTTCTTCCAAATGGTCGGCCTGAAGAAAAAGAGTGCGGATGATGTGAAGAAGGTGTTTCACATGCTGGACAAGGACAAAAGTGGCTTCATCGAGGAGGATGAGCTGGG SEQ ID NO:442 PVALB之外顯子4 ATTCATCCTAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACG SEQ ID NO:443 PVALB之外顯子5 AATTCTCCACTCTGGTGGCTGAAAGCTAAGAAGCACTGACTGCCCCTGGTCTTCCACCTCTCTGCCCTGAACACCCAATCTCGGCCCCTCTCGCCACCCTCCTGCATTTCTGTTCAGTTCGTTTATGTTATTTTTTACTCCCCCATCCCCTGTGGCCCTCTAATGACACCATTCTTCTGGAAAATGCTGGAGAAGCAATAAAGGTTGTACCAGTCA SEQ ID NO:444 全mRNA多核苷酸序列 ACTTCCCGACAGGACTTCCCACCAGCCCAGCCTTTCAGTGCAGGCTCCAGCCCTCCACCCCCACCCGAGTTGCAGGATGTCGATGACAGACTTGCTGAACGCTGAGGACATCAAGAAGGCGGTGGGAGCCTTTAGCGCTACCGACTCCTTCGACCACAAAAAGTTCTTCCAAATGGTCGGCCTGAAGAAAAAGAGTGCGGATGATGTGAAGAAGGTGTTTCACATGCTGGACAAGGACAAAAGTGGCTTCATCGAGGAGGATGAGCTGGGATTCATCCTAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGAATTCTCCACTCTGGTGGCTGAAAGCTAAGAAGCACTGACTGCCCCTGGTCTTCCACCTCTCTGCCCTGAACACCCAATCTCGGCCCCTCTCGCCACCCTCCTGCATTTCTGTTCAGTTCGTTTATGTTATTTTTTACTCCCCCATCCCCTGTGGCCCTCTAATGACACCATTCTTCTGGAAAATGCTGGAGAAGCAATAAAGGTTGTACCAGTCA SEQ ID NO:445 PVALB之外顯子2的轉譯多肽序列 MSMTDLLNAEDIKKAVGAFS SEQ ID NO:446 PVALB之外顯子3的轉譯多肽序列 TDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGFIEEDEL SEQ ID NO:447 PVALB之外顯子4的轉譯多肽序列 FILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVD SEQ ID NO:448 PVALB之外顯子5的轉譯多肽序列 FSTLVAES SEQ ID NO:449 全蛋白質序列 MSMTDLLNAEDIKKAVGAFSATDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGFIEEDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDEFSTLVAES SEQ ID NO:450 PVALB之全部4個外顯子1-4按順序排列的多核苷酸序列 SEQ ID NO:451 PVALB之全部3個外顯子2-4按順序排列的轉譯多肽序列 SLC3A2 轉錄本 3 序列資訊SEQ ID NO:452 SLC3A2-NRG1融合物5’處之SLC3A2外顯子2序列 AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAG SEQ ID NO:453 SLC3A2-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAG SEQ ID NO:454 SLC3A2-NRG1多核苷酸序列 AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAGCTACATCTACATCCACCACTGGGACAAG SEQ ID NO:455 SLC3A2-NRG1多肽序列 LGSHCVAQTGLELLASGDPLPSASQNAEMIATSTSTTGT SEQ ID NO:456 SLC3A2之外顯子1 GCATTGCGGCTTGGTTTTCTCACCCAGTGCATGTGGCAGGAGCGGTGAGATCACTGCCTCACGGCGATCCTGGACTGACGGTCACGACTGCCTACCCTCTAACCCTGTTCTGAGCTGCCCCTTGCCCACACACCCCAAACCTGTGTGCAGGATCCGCCTCCATGGAGCTACAGCCTCCTGAAGCCTCGATCGCCGTCGTGTCGATTCCGCGCCAGTTGCCTGGCTCACATTCGGAGGCTGGTGTCCAGGGTCTCAGCGCGGGGGACGACTCAG SEQ ID NO:457 SLC3A2之外顯子2 AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAG SEQ ID NO:458 SLC3A2之外顯子3 AGACGGGGTCTGACTGTGTTACCCAGGCTGGTCTTCAACTCTTGGCCTCAAGTGATCCTCCTGCCTTAGCTTCCAAGAATGCTGAGGTTACAG SEQ ID NO:459 SLC3A2之外顯子4 GCACCATGAGCCAGGACACCGAGGTGGATATGAAGGAGGTGGAGCTGAATGAGTTAGAGCCCGAGAAGCAGCCGATGAACGCGGCGTCTGGGGCGGCCATGTCCCTGGCGGGAGCCGAGAAGAATGGTCTGGTGAAGATCAAGGTGGCGGAAGACGAGGCGGAGGCGGCAGCCGCGGCTAAGTTCACGGGCCTGTCCAAGGAGGAGCTGCTGAAGGTGGCAGGCAGCCCCGGCTGGGTACGCACCCGCTGGGCACTGCTGCTGCTCTTCTGGCTCGGCTGGCTCGGCATGCTTGCTGGTGCCGTGGTCATAATCGTGCGAGCGCCGCGTTGTCGCGAGCTACCGGCGCAGAAGTGGTGGCACACGGGCGCCCTCTACCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG SEQ ID NO:460 SLC3A2之外顯子5 GTCTGAAGGGGCGTCTCGATTACCTGAGCTCTCTGAAGGTGAAGGGCCTTGTGCTGGGTCCAATTCACAAGAACCAGAAGGATGATGTCGCTCAGACTGACTTGCTGCAGATCGACCCCAATTTTGGCTCCAAGGAAGATTTTGACAGTCTCTTGCAATCGGCTAAAAAAAAGA SEQ ID NO:461 SLC3A2之外顯子6 GCATCCGTGTCATTCTGGACCTTACTCCCAACTACCGGGGTGAGAACTCGTGGTTCTCCACTCAGGTTGACACTGTGGCCACCAAGGTGAAG SEQ ID NO:462 SLC3A2之外顯子7 GATGCTCTGGAGTTTTGGCTGCAAGCTGGCGTGGATGGGTTCCAGGTTCGGGACATAGAGAATCTGAAG SEQ ID NO:463 SLC3A2之外顯子8 GATGCATCCTCATTCTTGGCTGAGTGGCAAAATATCACCAAGGGCTTCAGTGAAGACAG SEQ ID NO:464 SLC3A2之外顯子9 GCTCTTGATTGCGGGGACTAACTCCTCCGACCTTCAGCAGATCCTGAGCCTACTCGAATCCAACAAAGACTTGCTGTTGACTAGCTCATACCTGTCTGATTCTGGTTCTACTGGGGAGCATACAAAATCCCTAGTCACACAGTATTTGAATGCCACTGGCAATCGCTGGTGCAGCTGGAGT SEQ ID NO:465 SLC3A2之外顯子10 TTGTCTCAGGCAAGGCTCCTGACTTCCTTCTTGCCGGCTCAACTTCTCCGACTCTACCAGCTGATGCTCTTCACCCTGCCAGGGACCCCTGTTTTCAGCTACGGGGATGAGATTGGCCTGGATGCAGCTGCCCTTCCTGGACAG SEQ ID NO:466 SLC3A2之外顯子11 CCTATGGAGGCTCCAGTCATGCTGTGGGATGAGTCCAGCTTCCCTGACATCCCAGGGGCTGTAAGTGCCAACATGACTGTGAAG SEQ ID NO:467 SLC3A2之外顯子12 GGCCAGAGTGAAGACCCTGGCTCCCTCCTTTCCTTGTTCCGGCGGCTGAGTGACCAGCGGAGTAAGGAGCGCTCCCTACTGCATGGGGACTTCCACGCGTTCTCCGCTGGGCCTGGACTCTTCTCCTATATCCGCCACTGGGACCAGAATGAGCGTTTTCTGGTAGTGCTTAACTTTGGGGATGTGGGCCTCTCGGCTGGACTGCAGGCCTCCGACCTGCCTGCCAGCGCCAGCCTGCCAGCCAAGGCTGACCTCCTGCTCAGCACCCAGCCAGGCCGTGAGGAGGGCTCCCCTCTTGAGCTGGAACGCCTGAAACTGGAGCCTCACGAAGGGCTGCTGCTCCGCTTCCCCTACGCGGCCTGACTTCAGCCTGACATGGACCCACTACCCTTCTCCTTTCCTTCCCAGGCCCTTTGGCTTCTGATTTTTCTCTTTTTTAAAAACAAACAAACAAACTGTTGCAGATTATGAGTGAACCCCCAAATAGGGTGTTTTCTGCCTTCAAATAAAAGTCACCCCTGCATGGTGAA SEQ ID NO:468 SLC3A2之全mRNA多核苷酸序列 SEQ ID NO:469 SLC3A2之外顯子1的轉譯多肽序列 MELQPPEASIAVVSIPRQLPGSHSEAGVQGLSAGDDS SEQ ID NO:470 SLC3A2之外顯子2的轉譯多肽序列 LGSHCVAQTGLELLASGDPLPSASQNAEMI SEQ ID NO:471 SLC3A2之外顯子3的轉譯多肽序列 TGSDCVTQAGLQLLASSDPPALASKNAEVT SEQ ID NO:472 SLC3A2之外顯子4的轉譯多肽序列 TMSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLA SEQ ID NO:473 SLC3A2之外顯子5的轉譯多肽序列 LKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKK SEQ ID NO:474 SLC3A2之外顯子6的轉譯多肽序列 IRVILDLTPNYRGENSWFSTQVDTVATKVK SEQ ID NO:475 SLC3A2之外顯子7的轉譯多肽序列 DALEFWLQAGVDGFQVRDIENLK SEQ ID NO:476 SLC3A2之外顯子8的轉譯多肽序列 DASSFLAEWQNITKGFSED SEQ ID NO:477 SLC3A2之外顯子9的轉譯多肽序列 LLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWS SEQ ID NO:478 SLC3A2之外顯子10的轉譯多肽序列 LSQARLLTSFLPAQLLRLYQLMLFTLPGTPVFSYGDEIGLDAAALPGQ SEQ ID NO:479 SLC3A2之外顯子11的轉譯多肽序列 PMEAPVMLWDESSFPDIPGAVSANMTVK SEQ ID NO:480 SLC3A2之外顯子12的轉譯多肽序列 GQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGPGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA SEQ ID NO:481 SLC3A2之全蛋白質序列 MELQPPEASIAVVSIPRQLPGSHSEAGVQGLSAGDDSELGSHCVAQTGLELLASGDPLPSASQNAEMIETGSDCVTQAGLQLLASSDPPALASKNAEVTGTMSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKKSIRVILDLTPNYRGENSWFSTQVDTVATKVKDALEFWLQAGVDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSLSQARLLTSFLPAQLLRLYQLMLFTLPGTPVFSYGDEIGLDAAALPGQPMEAPVMLWDESSFPDIPGAVSANMTVKGQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGPGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA SEQ ID NO:482 SLC3A2之外顯子1及2按順序排列的多核苷酸序列 SEQ ID NO:483 SLC3A2之外顯子1-2按順序排列的轉譯多肽序列 APP 序列資訊SEQ ID NO:484 來自APP-NRG1融合物5’處之APP外顯子14的序列 TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAG SEQ ID NO:485 APP-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO:486 APP-NRG1多核苷酸序列 TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO:487 APP-NRG1多肽序列 EPVDARPAADRGLTTRPATSTSTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO:488 APP之外顯子1 GTCAGTTTCCTCGGCAGCGGTAGGCGAGAGCACGCGGAGGAGCGTGCGCGGGGGCCCCGGGAGACGGCGGCGGTGGCGGCGCGGGCAGAGCAAGGACGCGGCGGATCCCACTCGCACAGCAGCGCACTCGGTGCCCCGCGCAGGGTCGCGATGCTGCCCGGTTTGGCACTGCTCCTGCTGGCCGCCTGGACGGCTCGGGCGCTGGAG SEQ ID NO:489 APP之外顯子2 GTCTACCCTGAACTGCAGATCACCAATGTGGTAGAAGCCAACCAACCAGTGACCATCCAGAACTGGTGCAAGCGGGGCCGCAAGCAGTGCAAGACCCATCCCCACTTTGTGATTCCCTACCGCTGCTTAG SEQ ID NO:490 APP之外顯子3 TTGGTGAGTTTGTAAGTGATGCCCTTCTCGTTCCTGACAAGTGCAAATTCTTACACCAGGAGAGGATGGATGTTTGCGAAACTCATCTTCACTGGCACACCGTCGCCAAAGAG SEQ ID NO:491 APP之外顯子4 ACATGCAGTGAGAAGAGTACCAACTTGCATGACTACGGCATGTTGCTGCCCTGCGGAATTGACAAGTTCCGAGGGGTAGAGTTTGTGTGTTGCCCACTGGCTGAAGAAAGTGACAATGTGGATTCTGCTGATGCGGAGGAGGATGACTCGGATGTCTGGTGGGGCGGAGCAGACACAGACTATGCAGATGGGAG SEQ ID NO:492 APP之外顯子5 TGAAGACAAAGTAGTAGAAGTAGCAGAGGAGGAAGAAGTGGCTGAGGTGGAAGAAGAAGAAGCCGATGATGACGAGGACGATGAGGATGGTGATGAGGTAGAGGAAGAGGCTGAGGAACCCTACGAAGAAGCCACAGAGAGAACCACCAGCATTGCCACCACCACCACCACCACCACAGAGTCTGTGGAAGAGGTGGTTCGAG SEQ ID NO:493 APP之外顯子6 AGGTGTGCTCTGAACAAGCCGAGACGGGGCCGTGCCGAGCAATGATCTCCCGCTGGTACTTTGATGTGACTGAAGGGAAGTGTGCCCCATTCTTTTACGGCGGATGTGGCGGCAACCGGAACAACTTTGACACAGAAGAGTACTGCATGGCCGTGTGTGGCAGCGCCA SEQ ID NO:494 APP之外顯子7 TGTCCCAAAGTTTACTCAAGACTACCCAGGAACCTCTTGCCCGAGATCCTGTTAAAC SEQ ID NO:495 APP之外顯子8 TTCCTACAACAGCAGCCAGTACCCCTGATGCCGTTGACAAGTATCTCGAGACACCTGGGGATGAGAATGAACATGCCCATTTCCAGAAAGCCAAAGAGAGGCTTGAGGCCAAGCACCGAGAGAGAATGTCCCAG SEQ ID NO:496 APP之外顯子9 GTCATGAGAGAATGGGAAGAGGCAGAACGTCAAGCAAAGAACTTGCCTAAAGCTGATAAGAAGGCAGTTATCCAG SEQ ID NO:497 APP之外顯子10 CATTTCCAGGAGAAAGTGGAATCTTTGGAACAGGAAGCAGCCAACGAGAGACAGCAGCTGGTGGAGACACACATGGCCAGAGTGGAAGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGGAGAACTACATCACCGCTCTGCAGGCTGTTCCTCCTCGG SEQ ID NO:498 APP之外顯子11 CCTCGTCACGTGTTCAATATGCTAAAGAAGTATGTCCGCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTTCGAGCATGTGCGCATGGTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCCAG SEQ ID NO:499 APP之外顯子12 GTTATGACACACCTCCGTGTGATTTATGAGCGCATGAATCAGTCTCTCTCCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAGTTG SEQ ID NO:500 APP之外顯子13 ATGAGCTGCTTCAGAAAGAGCAAAACTATTCAGATGACGTCTTGGCCAACATGATTAGTGAACCAAGGATCAGTTACGGAAACGATGCTCTCATGCCATCTTTGACCGAAACGAAAACCACCGTGGAGCTCCTTCCCGTGAATGGAGAGTTCAGCCTGGACGATCTCCAGCCGTGGCATTCTTTTGGGGCTGACTCTGTGCCAGCCAACACAGAAAACGAAG SEQ ID NO:501 APP之外顯子14 TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAG SEQ ID NO:502 APP之外顯子15 GTTCTGGGTTGACAAATATCAAGACGGAGGAGATCTCTGAAGTGAAGATGGATGCAGAATTCCGACATGACTCAGGATATGAAGTTCATCATCAAAAATTG SEQ ID NO:503 APP之外顯子16 GTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCAATCATTGGACTCATGGTGGGCGGTGTTGTCATAGCGACAGTGATCGTCATCACCTTGGTGATGCTGAAGAAGAAACAGTACACATCCATTCATCATGGTGTGGTGGAG SEQ ID NO:504 APP之外顯子17 GTTGACGCCGCTGTCACCCCAGAGGAGCGCCACCTGTCCAAGATGCAGCAGAACGGCTACGAAAATCCAACCTACAAGTTCTTTGAGCAGATGCAGAACTAGACCCCCGCCACAGCAGCCTCTGAAGTTGGACAGCAAAACCATTGCTTCACTACCCATCGGTGTCCATTTATAGAATAATGTGGGAAGAAACAAACCCGTTTTATGATTTACTCATTATCGCCTTTTGACAGCTGTGCTGTAACACAAGTAGATGCCTGAACTTGAATTAATCCACACATCAGTAATGTATTCTATCTCTCTTTACATTTTGGTCTCTATACTACATTATTAATGGGTTTTGTGTACTGTAAAGAATTTAGCTGTATCAAACTAGTGCATGAATAGATTCTCTCCTGATTATTTATCACATAGCCCCTTAGCCAGTTGTATATTATTCTTGTGGTTTGTGACCCAATTAAGTCCTACTTTACATATGCTTTAAGAATCGATGGGGGATGCTTCATGTGAACGTGGGAGTTCAGCTGCTTCTCTTGCCTAAGTATTCCTTTCCTGATCACTATGCATTTTAAAGTTAAACATTTTTAAGTATTTCAGATGCTTTAGAGAGATTTTTTTTCCATGACTGCATTTTACTGTACAGATTGCTGCTTCTGCTATATTTGTGATATAGGAATTAAGAGGATACACACGTTTGTTTCTTCGTGCCTGTTTTATGTGCACACATTAGGCATTGAGACTTCAAGCTTTTCTTTTTTTGTCCACGTATCTTTGGGTCTTTGATAAAGAAAAGAATCCCTGTTCATTGTAAGCACTTTTACGGGGCGGGTGGGGAGGGGTGCTCTGCTGGTCTTCAATTACCAAGAATTCTCCAAAACAATTTTCTGCAGGATGATTGTACAGAATCATTGCTTATGACATGATCGCTTTCTACACTGTATTACATAAATAAATTAAATAAAATAACCCCGGGCAAGACTTTTCTTTGAAGGATGACTACAGACATTAAATAATCGAAGTAATTTTGGGTGGGGAGAAGAGGCAGATTCAATTTTCTTTAACCAGTCTGAAGTTTCATTTATGATACAAAAGAAGATGAAAATGGAAGTGGCAATATAAGGGGATGAGGAAGGCATGCCTGGACAAACCCTTCTTTTAAGATGTGTCTTCAATTTGTATAAAATGGTGTTTTCATGTAAATAAATACATTCTTGGAGGAGCA SEQ ID NO:505 APP之全mRNA多核苷酸序列 GTCAGTTTCCTCGGCAGCGGTAGGCGAGAGCACGCGGAGGAGCGTGCGCGGGGGCCCCGGGAGACGGCGGCGGTGGCGGCGCGGGCAGAGCAAGGACGCGGCGGATCCCACTCGCACAGCAGCGCACTCGGTGCCCCGCGCAGGGTCGCGATGCTGCCCGGTTTGGCACTGCTCCTGCTGGCCGCCTGGACGGCTCGGGCGCTGGAGGTCTACCCTGAACTGCAGATCACCAATGTGGTAGAAGCCAACCAACCAGTGACCATCCAGAACTGGTGCAAGCGGGGCCGCAAGCAGTGCAAGACCCATCCCCACTTTGTGATTCCCTACCGCTGCTTAGTTGGTGAGTTTGTAAGTGATGCCCTTCTCGTTCCTGACAAGTGCAAATTCTTACACCAGGAGAGGATGGATGTTTGCGAAACTCATCTTCACTGGCACACCGTCGCCAAAGAGACATGCAGTGAGAAGAGTACCAACTTGCATGACTACGGCATGTTGCTGCCCTGCGGAATTGACAAGTTCCGAGGGGTAGAGTTTGTGTGTTGCCCACTGGCTGAAGAAAGTGACAATGTGGATTCTGCTGATGCGGAGGAGGATGACTCGGATGTCTGGTGGGGCGGAGCAGACACAGACTATGCAGATGGGAGTGAAGACAAAGTAGTAGAAGTAGCAGAGGAGGAAGAAGTGGCTGAGGTGGAAGAAGAAGAAGCCGATGATGACGAGGACGATGAGGATGGTGATGAGGTAGAGGAAGAGGCTGAGGAACCCTACGAAGAAGCCACAGAGAGAACCACCAGCATTGCCACCACCACCACCACCACCACAGAGTCTGTGGAAGAGGTGGTTCGAGAGGTGTGCTCTGAACAAGCCGAGACGGGGCCGTGCCGAGCAATGATCTCCCGCTGGTACTTTGATGTGACTGAAGGGAAGTGTGCCCCATTCTTTTACGGCGGATGTGGCGGCAACCGGAACAACTTTGACACAGAAGAGTACTGCATGGCCGTGTGTGGCAGCGCCATGTCCCAAAGTTTACTCAAGACTACCCAGGAACCTCTTGCCCGAGATCCTGTTAAACTTCCTACAACAGCAGCCAGTACCCCTGATGCCGTTGACAAGTATCTCGAGACACCTGGGGATGAGAATGAACATGCCCATTTCCAGAAAGCCAAAGAGAGGCTTGAGGCCAAGCACCGAGAGAGAATGTCCCAGGTCATGAGAGAATGGGAAGAGGCAGAACGTCAAGCAAAGAACTTGCCTAAAGCTGATAAGAAGGCAGTTATCCAGCATTTCCAGGAGAAAGTGGAATCTTTGGAACAGGAAGCAGCCAACGAGAGACAGCAGCTGGTGGAGACACACATGGCCAGAGTGGAAGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGGAGAACTACATCACCGCTCTGCAGGCTGTTCCTCCTCGGCCTCGTCACGTGTTCAATATGCTAAAGAAGTATGTCCGCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTTCGAGCATGTGCGCATGGTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCCAGGTTATGACACACCTCCGTGTGATTTATGAGCGCATGAATCAGTCTCTCTCCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAGTTGATGAGCTGCTTCAGAAAGAGCAAAACTATTCAGATGACGTCTTGGCCAACATGATTAGTGAACCAAGGATCAGTTACGGAAACGATGCTCTCATGCCATCTTTGACCGAAACGAAAACCACCGTGGAGCTCCTTCCCGTGAATGGAGAGTTCAGCCTGGACGATCTCCAGCCGTGGCATTCTTTTGGGGCTGACTCTGTGCCAGCCAACACAGAAAACGAAGTTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGGTTCTGGGTTGACAAATATCAAGACGGAGGAGATCTCTGAAGTGAAGATGGATGCAGAATTCCGACATGACTCAGGATATGAAGTTCATCATCAAAAATTGGTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCAATCATTGGACTCATGGTGGGCGGTGTTGTCATAGCGACAGTGATCGTCATCACCTTGGTGATGCTGAAGAAGAAACAGTACACATCCATTCATCATGGTGTGGTGGAGGTTGACGCCGCTGTCACCCCAGAGGAGCGCCACCTGTCCAAGATGCAGCAGAACGGCTACGAAAATCCAACCTACAAGTTCTTTGAGCAGATGCAGAACTAGACCCCCGCCACAGCAGCCTCTGAAGTTGGACAGCAAAACCATTGCTTCACTACCCATCGGTGTCCATTTATAGAATAATGTGGGAAGAAACAAACCCGTTTTATGATTTACTCATTATCGCCTTTTGACAGCTGTGCTGTAACACAAGTAGATGCCTGAACTTGAATTAATCCACACATCAGTAATGTATTCTATCTCTCTTTACATTTTGGTCTCTATACTACATTATTAATGGGTTTTGTGTACTGTAAAGAATTTAGCTGTATCAAACTAGTGCATGAATAGATTCTCTCCTGATTATTTATCACATAGCCCCTTAGCCAGTTGTATATTATTCTTGTGGTTTGTGACCCAATTAAGTCCTACTTTACATATGCTTTAAGAATCGATGGGGGATGCTTCATGTGAACGTGGGAGTTCAGCTGCTTCTCTTGCCTAAGTATTCCTTTCCTGATCACTATGCATTTTAAAGTTAAACATTTTTAAGTATTTCAGATGCTTTAGAGAGATTTTTTTTCCATGACTGCATTTTACTGTACAGATTGCTGCTTCTGCTATATTTGTGATATAGGAATTAAGAGGATACACACGTTTGTTTCTTCGTGCCTGTTTTATGTGCACACATTAGGCATTGAGACTTCAAGCTTTTCTTTTTTTGTCCACGTATCTTTGGGTCTTTGATAAAGAAAAGAATCCCTGTTCATTGTAAGCACTTTTACGGGGCGGGTGGGGAGGGGTGCTCTGCTGGTCTTCAATTACCAAGAATTCTCCAAAACAATTTTCTGCAGGATGATTGTACAGAATCATTGCTTATGACATGATCGCTTTCTACACTGTATTACATAAATAAATTAAATAAAATAACCCCGGGCAAGACTTTTCTTTGAAGGATGACTACAGACATTAAATAATCGAAGTAATTTTGGGTGGGGAGAAGAGGCAGATTCAATTTTCTTTAACCAGTCTGAAGTTTCATTTATGATACAAAAGAAGATGAAAATGGAAGTGGCAATATAAGGGGATGAGGAAGGCATGCCTGGACAAACCCTTCTTTTAAGATGTGTCTTCAATTTGTATAAAATGGTGTTTTCATGTAAATAAATACATTCTTGGAGGAGCA SEQ ID NO:506 APP之外顯子1的轉譯多肽序列 MLPGLALLLLAAWTARALE SEQ ID NO:507 APP之外顯子2的轉譯多肽序列 VYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCL SEQ ID NO:508 APP之外顯子3的轉譯多肽序列 GEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKE SEQ ID NO:509 APP之外顯子4的轉譯多肽序列 TCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADG SEQ ID NO:510 APP之外顯子5的轉譯多肽序列 EDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVR SEQ ID NO:511 APP之外顯子6的轉譯多肽序列 VCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSA SEQ ID NO:512 APP之外顯子7的轉譯多肽序列 SQSLLKTTQEPLARDPVK SEQ ID NO:513 APP之外顯子8的轉譯多肽序列 PTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQ SEQ ID NO:514 APP之外顯子9的轉譯多肽序列 VMREWEEAERQAKNLPKADKKAVIQ SEQ ID NO:515 APP之外顯子10的轉譯多肽序列 HFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPR SEQ ID NO:516 APP之外顯子11的轉譯多肽序列 PRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQ SEQ ID NO:517 APP之外顯子12的轉譯多肽序列 VMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEV SEQ ID NO:518 APP之外顯子13的轉譯多肽序列 ELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENE SEQ ID NO:519 APP之外顯子14的轉譯多肽序列 EPVDARPAADRGLTTRP SEQ ID NO:520 APP之外顯子15的轉譯多肽序列 SGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKL SEQ ID NO:521 APP之外顯子16的轉譯多肽序列 VFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVE SEQ ID NO:522 APP之外顯子17的轉譯多肽序列 VDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN SEQ ID NO:523 APP之全蛋白質序列 MLPGLALLLLAAWTARALEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN SEQ ID NO:524 APP之全部外顯子1-14按順序排列的多核苷酸序列 SEQ ID NO:525 APP之全部外顯子1-14按順序排列的轉譯多肽序列 WRN 序列資訊SEQ ID NO:526 來自WRN-NRG1融合物5’處之WRN外顯子33的序列 AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAG SEQ ID NO:527 WRN-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO:528 WRN-NRG1多核苷酸序列 AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGC SEQ ID NO:529 WRN-NRG1多肽序列 AGCPLDLERAGLTPEVQKIIADVIRNPPVNSATSTSTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO:530 WRN之外顯子1 GTGTACTGTGTGCGCCGGGGAGGCGCCGGCTTGTACTCGGCAGCGCGGGAATAAAGTTTGCTGATTTGGTGTCTAGCCTGGATGCCTGGGTTGCAGGCCCTGCTTGTGGTGGCGCTCCACAGTCATCCGGCTGAAGAAGACCTGTTGGACTGGATCTTCTCGGG SEQ ID NO:531 WRN之外顯子2 TTTTCTTTCAGATATTGTTTTGTATTTACCCATGAAGACATTGTTTTTTGGACTCTGCAAATAGGACATTTCAAAGATGAGTGAAAAAAAATTGGAAACAACTGCACAGCAGCGGAAATGTCCTGAATGGATGAATGTGCAGAATAAAAGATGTGCTGTAGAAGAAAGAAAG SEQ ID NO:532 WRN之外顯子3 GCATGTGTTCGGAAGAGTGTTTTTGAAGATGACCTCCCCTTCTTAGAATTCACTGGATCCATTGTGTATAGTTACGATGCTAGTGATTGCTCTTTCCTGTCAGAAGATATTAG SEQ ID NO:533 WRN之外顯子4 CATGAGTCTATCAGATGGGGATGTGGTGGGATTTGACATGGAGTGGCCACCATTATACAATAGAGGGAAACTTGGCAAAGTTGCACTAATTCAGTTGTGTGTTTCTGAGAGCAAATGTTACTTGTTCCACGTTTCTTCCATGTCAG SEQ ID NO:534 WRN之外顯子5 TTTTTCCCCAGGGATTAAAAATGTTGCTTGAAAATAAAGCAGTTAAAAAGGCAGGTGTAGGAATTGAAGGAGATCAGTGGAAACTTCTACGTGACTTTGATATCAAATTGAAGAATTTTGTGGAGTTGACAGATGTTGCCAATAAAAAG SEQ ID NO:535 WRN之外顯子6 CTGAAATGCACAGAGACCTGGAGCCTTAACAGTCTGGTTAAACACCTCTTAGGTAAACAGCTCCTGAAAGACAAGTCTATCCGCTGTAGCAATTGGAGTAAATTTCCTCTCACTGAGGACCAGAAACTGTATGCAGCCACTGATGCTTAT SEQ ID NO:536 WRN之外顯子7 GCTGGTTTTATTATTTACCGAAATTTAGAGATTTTGGATGATACTGTGCAAAGGTTTGCTATAAATAAAG SEQ ID NO:537 WRN之外顯子8 AGGAAGAAATCCTACTTAGCGACATGAACAAACAGTTGACTTCAATCTCTGAGGAAGTGATGGATCTGGCTAAGCATCTTCCTCATGCTTTCAGTAAATTGGAAAACCCACGGAG SEQ ID NO:538 WRN之外顯子9 GGTTTCTATCTTACTAAAGGATATTTCAGAAAATCTATATTCACTGAGGAGGATGATAATTGGGTCTACTAACATTGAGACTGAACTGAGGCCCAGCAATAATTTAAACTTATTATCCTTTGAAGATTCAACTACTGGGGGAGTACAACAGAAACAAATTAGAGAACATGAAGTTTTAATTCACGTTGAAGATGAAACATGGGACCCAACACTTGATCATTTAGCTAAACATGATGGAGAAGATGTACTTGGAAATAAAGTGGAACGAAAAGAAGATGGATTTGAAGATGGAGTAGAAGACAACAAATTGAAAGAGAATATGGAAAGAGCTTGTTTGATGTCGTTAGATATTACAGAACATGAACTCCAAATTTTGGAACAGCAGTCTCAGGAAGAATATCTTAGTGATATTGCTTATAAATCTACTGAG SEQ ID NO:539 WRN之外顯子10 CATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGAGATGCTTAAG SEQ ID NO:540 WRN之外顯子11 CATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGAGATGCTTAAG SEQ ID NO:541 WRN之外顯子12 TCTTTAGAAAACCTCAATAGTGGCACGGTAGAACCAACTCATTCTAAATGCTTAAAAATGGAAAGAAATCTGGGTCTTCCTACTAAAGAAGAAGAAGAAGATGATGAAAATGAAGCTAATGAAGGGGAAGAAGATGATGATAAGG SEQ ID NO:542 WRN之外顯子13 ACTTTTTGTGGCCAGCACCCAATGAAGAGCAAGTTACTTGCCTCAAGATGTACTTTGGCCATTCCAGTTTTAAACC SEQ ID NO:543 WRN之外顯子14 AGTTCAGTGGAAAGTGATTCATTCAGTATTAGAAGAAAGAAGAGATAATGTTGCTGTCATGGCAACTG SEQ ID NO:544 WRN之外顯子15 GATATGGAAAGAGTTTGTGCTTCCAGTATCCACCTGTTTATGTAGGCAAGATTGGCCTTGTTATCTCTCCCCTTATTTCTCTGATGGAAGACCAAGTGCTACAGCTTAA SEQ ID NO:545 WRN之外顯子16 AATGTCCAACATCCCAGCTTGCTTCCTTGGATCAGCACAGTCAGAAAATGTTCTAACAGATATTAAATT SEQ ID NO:546 WRN之外顯子17 AGGTAAATACCGGATTGTATACGTAACTCCAGAATACTGTTCAGGTAACATGGGCCTGCTCCAGCAACTTGAGGCTGATATTG SEQ ID NO:547 WRN之外顯子18 GTATCACGCTCATTGCTGTGGATGAGGCTCACTGTATTTCTGAGTGGGGGCATGATTTTAGGGATTCATTCAGGAAGTTGGGCTCCCTAAAGACAGCACTGCCAATG SEQ ID NO:548 WRN之外顯子19 GTTCCAATCGTTGCACTTACTGCTACTGCAAGTTCTTCAATCCGGGAAGACATTGTACGTTGCTTAAATCTGAGAAATCCTCAGATCACCTGTACTGGTTTTGATCGACCAAACCTGTATTTAGAAGTTAGGCGAAAAACAGGGAATATCCTTCAGGATCTGCAGCCATTTCTTGTCAAAACAAG SEQ ID NO:549 WRN之外顯子20 TTCCCACTGGGAATTTGAAGGTCCAACAATCATCTACTGTCCTTCTAGAAAAATGACACAACAAGTTACAGGTGAACTTAGGAAACTGAATCTATCCTGTGGAACATACCATGCGGGCATGAGTTTTAGCACAAGGAAAGACATTCATCATAGGTTTGTAAGAGATGAAATTCAG SEQ ID NO:550 WRN之外顯子21 TGTGTCATAGCTACCATAGCTTTTGGAATGGGCATTAATAAAGCTGACATTCGCCAAGTCATTCATTACGGTGCTCCTAAGGACATGGAATCATATTATCAGGAGATTGGTAGAGCTGGTCGTGATGGACTTCAAAGTTCTTGTCACGTCCTCTGGGCTCCTGCAGACATTAACTTAAATAG SEQ ID NO:551 WRN之外顯子22 GCACCTTCTTACTGAGATACGTAATGAGAAGTTTCGATTATACAAATTAAAGATGATGGCAAAGATGGAAAAATATCTTCATTCTAGCAGATGTAGGAGACA SEQ ID NO:552 WRN之外顯子23 AATCATCTTGTCTCATTTTGAGGACAAACAAGTACAAAAAGCCTCCTTGGGAATTATGGGAACTGAAAAATGCTGTGATAATTGCAGGTCCAG SEQ ID NO:553 WRN之外顯子24 ATTGGATCATTGCTATTCCATGGATGACTCAGAGGATACATCCTGGGACTTTGGTCCACAAGCATTTAAGCTTTTGTCTGCTGTGGACATCTTAGGCGAAAAATTTGGAATTGGGCTTCCAATTTTATTTCTCCGAGGATCT SEQ ID NO:554 WRN之外顯子25 AATTCTCAGCGTCTTGCCGATCAATATCGCAGGCACAGTTTATTTGGCACTGGCAAGGATCAAACAGAGAGTTGGTGGAAGGCTTTTTCCCGTCAGCTGATCACTGAGGGATTCTTGGTAGAAGTTTCTCGGTATAACAAATTTATGAAGATTTGCGCCCTTACGAAAAAG SEQ ID NO:555 WRN之外顯子26 GGTAGAAATTGGCTTCATAAAGCTAATACAGAATCTCAGAGCCTCATCCTTCAAGCTAATGAAGAATTGTGTCCAAAGAAGTTGCTTCTGCCTAG SEQ ID NO:556 WRN之外顯子27 TTCGAAAACTGTATCTTCGGGCACCAAAGAGCATTGTTATAATCAAGTACCAGTTGAATTAAGTACAGAGAAGAAG SEQ ID NO:557 WRN之外顯子28 TCTAACTTGGAGAAGTTATATTCTTATAAACCATGTGATAAGATTTCTTCTGGGAGTAACATTTCTAAAAAAAG SEQ ID NO:558 WRN之外顯子29 TATCATGGTACAGTCACCAGAAAAAGCTTACAGTTCCTCACAGCCTGTTATTTCGGCACAAGAGCAGGAGACTCAG SEQ ID NO:559 WRN之外顯子30 ATTGTGTTATATGGCAAATTGGTAGAAGCTAGGCAGAAACATGCCAATAAAATGGATGTTCCCCCAGCTATTCTGGCAACAAACAAGATACTGGTGGATATGGCCAAAATGAG SEQ ID NO:560 WRN之外顯子31 ACCAACTACGGTTGAAAACGTAAAAAGGATTGATGGTGTTTCTGAAGGCAAAGCTGCCATGTTGGCCCCTCTGTTGGAAGTCATCAAACATTTCTGCCAAACAAATAGTGTTCAG SEQ ID NO:561 WRN之外顯子32 ACAGACCTCTTTTCAAGTACAAAACCTCAAGAAGAACAGAAGACGAGTCTGGTAGCAAAAAATAAAATATGCACACTTTCACAGTCTATGGCCATCACATACTCTTTATTCCAAGAAAAGAAGATGCCTTTG SEQ ID NO:562 WRN之外顯子33 AAGAGCATAGCTGAGAGCAGGATTCTGCCTCTCATGACAATTGGCATGCACTTATCCCAAGCGGTGAAAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAG SEQ ID NO:563 WRN之外顯子34 ATATGAGTAAAATTAGCCTAATCAGAATGTTAGTTCCTGAAAACATTGACACGTACCTTATCCACATGGCAATTGAGATCCTTAAACATGGTCCTGACAGCGGACTTCAACCTTCATGTGATGTCAACAAAAGGAGATGTTTTCCCGGTTCTGAAGAGATCTGTTCAAGTTCTAAGAGAAGCAAGGAAGAAGTAGGCATCAATACTGAG SEQ ID NO:564 WRN之外顯子35 ACTTCATCTGCAGAGAGAAAGAGACGATTACCTGTGTGGTTTGCCAAAGGAAGTGATACCAGCAAGAAATTAATGGACAAAACGAAAAGGGGAGGTCTTTTTAGTTAAGCTGGCAATTACCAGAACAATTATGTTTCTTGCTGTATTATAAGAGGATAGCTATATTTTATTTCTGAAGAGTAAGGAGTAGTATTTTGGCTTAAAAATCATTCTAATTACAAAGTTCACTGTTTATTGAAGAACTGGCATCTTAAATCAGCCTTCCGCAATTCATGTAGTTTCTGGGTCTTCTGGGAGCCTACGTGAGTACATCACCTAACAGAATATTAAATTAGACTTCCTGTAAGATTGCTTTAAGAAACTGTTACTGTCCTGTTTTCTAATCTCTTTATTAAAACAGTGTATTTGGAAAATGTTATGTGCTCTGATTTGATATAGATAACAGATTAGTAGTTACATGGTAATTATGTGATATAAAATATTCATATATTATCAAAATTCTGTTTTGTAAATGTAAGAAAGCATAGTTATTTTACAAATTGTTTTTACTGTCTTTTGAAGAAGTTCTTAAATACGTTGTTAAATGGTATTAGTTGACCAGGGCAGTGAAAATGAAACCGCATTTTGGGTGCCATTAAATAGGGAAAAAACATGTAAAAAATGTAAAATGGAGACCAATTGCACTAGGCAAGTGTATATTTTGTATTTTATATACAATTTCTATTATTTTTCAAGTAATAAAACAATGTTTTTCATACTGAATATTATATATATATTTTTTAGCTTTCATTTACTTAATTATTTTAAGTACCTTTATTTTTCCAGGATGTCAGAATTTGATTCTAATCTCTCTTATGTAGCACATGTGACTTAATTTAAAACCTATACTGTGACACAGAGTTGGGTAAACGATGATTATTTAACTTTAAGCAGTTCACCATCCATTTCAAAGCCTTTGATTGGCTTTTTTGTAAATAAAAATAACTTGTTAAGAAACAAATATATCTGTCATAGAAGAACTAGAAAATCCAGGGAAGTGAGAAAAATGAAAATAAAAATCATTCATAGTTTTACTAGTAGCTAATCACAGTCAACCTCTTTTGTGTATCCCACCAGACTTTTTTATATTCATTTGTTTTTAGTTAAAATATAAAAGTCTCGTATATTCCCATTTTTCTGCATTGCATTACCAGAAGGTAGTGGCGCCTATTAAATATGTGATATGTTGTTGTCCAGCCATGGCTTCTGCATTTGCATGCTTTTGTGTGTGCATCTGCAATACCCTGTGAATATCCTGTGTGATGGAGTGGCAAGTACGCACAGACACGTCTGCTGCATGCCTAGGTACGAGGCTGTCTCCAGGAGAAGCACTTGTTTGATTATTTGAGTTGCCAATTGAATTTGCTGCTTTTTTTCATGGCTTGCCATTTTCACTGAAAAGAATGACTAATGAAAAACGATGATTGGTTATTAGATTTGGATGTTTGGCAGACATTTTCTCAAAATTGAACTAAGTTGGCCTCTTCACGGAAAACAACTGGTATTTGTTGTGCCAATGATAAAATTGGAGATTTCTAGCAAAATGTATAATTTTGGAAAAGTTGTGTTCCTCCACTGGAAGCTTGACAGCTTTCCTTAACATAAAGACTTCTCTTTCTCTTCGCTTTCACTACTACTACTACTAATTCTTCTTCTGATTCTTCTTCTTCTCCTTCTTCCTTCTTCCTTCCTTCCTCCTCCTCCTCCTTCTTCTTCCTCTTCCTCTTCTTCTTTCTCTCTTTCCTTCCTTCCCTTCCCTTCCCCTTCCTTCCTTCCTTCCTTCCTTCCTCCCTCCCTCCCTCCCTCCCTCCCTCCCTCCTTTCTTTTTCTTTCTCTTTCTTTCTTTCTTTCTCTCTCTCTCTCTCTTTCTTTCTTTTTCTTTCTCTTTTTCTTTCTTTCAAGCAGTCCTCCCGCCTCAGTCCCCCAAAATAGTGGGATTACAGGTGTGAGCCACCATGCACAGCCTTACATAAAGCCTTTTCTAATGAGATGGATAGTAATTAACAAATGTGAGTTTTTGATATTATATAAAGATTTTTTCTGTGTTTCGAAGATCCGTATAACTCAGTGAATCAGTATGTTCTGGATGACTAATATGTGATGTTAAGAAATCATGACTGAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGATCAGGAGATCGAGACCACCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGTTGGTGCGTGCCTATAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACTCAGGAGGCGGAGATTGCAGTGAGCTGAGACTGCGCCACTGCACCCCAGCCTGGCGACAGAGCAAGACTCCGTCTCAAAAATAAAAAAAGAAATCATGACTGGGTAAAAGATCTGTTCAGAGTACAAGATGGACCAATGGATTTGATATATTTGAATATAACAGAGTATGAAAAAGTTTATTGATATAGTTTCAGATTACACACTGCAACTAATCTTTAAGAAACTATTACTTGTCCACTTTTTGGTAAAATTTCAGAGAACAATGTCCACCATTATCTGAACAGGCTATTAAAATACTCTTCTCTTTTCCAACTACGTGCCTGTGCAAAGTCAGATTTTTTTCATATACTTCAGCCAAAACAGCATATCAAAATGGATTGAATGCAGAAGTAGATCTGAGAATACAGCCACTTTTGTTAAGCCAGACAATGAGATTTGCAAAATGTAAACAATGCTGCTGTTCTCAGTTTTTAAAAATATGTTTTTTAAAAGTATTTATGTTAATGTGTACTTGGTTTACTACTGCTATTTTTAAATAAAACAAGAAACATTTTTAAATGTCTGTTTTAATTTCTAAAGTGGTAGTGATAGATATAACCCATATTAATAAAAGCTCTTTGGGGTCCTCAGTGATTTTTTTTTAAGAGTATGGAAGGGTTCTCAGACCTAAGAGATTGAGAAATGCTGATGTAATGTTTTATTATAAAGGTGTACCATGAATTATGTACCTTACTTCATATTGTTGGACATTAAAGTTGCTTTCAGTTTTTTTGTTTTAAA SEQ ID NO:565 WRN之全mRNA多核苷酸序列 GTGTACTGTGTGCGCCGGGGAGGCGCCGGCTTGTACTCGGCAGCGCGGGAATAAAGTTTGCTGATTTGGTGTCTAGCCTGGATGCCTGGGTTGCAGGCCCTGCTTGTGGTGGCGCTCCACAGTCATCCGGCTGAAGAAGACCTGTTGGACTGGATCTTCTCGGGTTTTCTTTCAGATATTGTTTTGTATTTACCCATGAAGACATTGTTTTTTGGACTCTGCAAATAGGACATTTCAAAGATGAGTGAAAAAAAATTGGAAACAACTGCACAGCAGCGGAAATGTCCTGAATGGATGAATGTGCAGAATAAAAGATGTGCTGTAGAAGAAAGAAAGGCATGTGTTCGGAAGAGTGTTTTTGAAGATGACCTCCCCTTCTTAGAATTCACTGGATCCATTGTGTATAGTTACGATGCTAGTGATTGCTCTTTCCTGTCAGAAGATATTAGCATGAGTCTATCAGATGGGGATGTGGTGGGATTTGACATGGAGTGGCCACCATTATACAATAGAGGGAAACTTGGCAAAGTTGCACTAATTCAGTTGTGTGTTTCTGAGAGCAAATGTTACTTGTTCCACGTTTCTTCCATGTCAGTTTTTCCCCAGGGATTAAAAATGTTGCTTGAAAATAAAGCAGTTAAAAAGGCAGGTGTAGGAATTGAAGGAGATCAGTGGAAACTTCTACGTGACTTTGATATCAAATTGAAGAATTTTGTGGAGTTGACAGATGTTGCCAATAAAAAGCTGAAATGCACAGAGACCTGGAGCCTTAACAGTCTGGTTAAACACCTCTTAGGTAAACAGCTCCTGAAAGACAAGTCTATCCGCTGTAGCAATTGGAGTAAATTTCCTCTCACTGAGGACCAGAAACTGTATGCAGCCACTGATGCTTATGCTGGTTTTATTATTTACCGAAATTTAGAGATTTTGGATGATACTGTGCAAAGGTTTGCTATAAATAAAGAGGAAGAAATCCTACTTAGCGACATGAACAAACAGTTGACTTCAATCTCTGAGGAAGTGATGGATCTGGCTAAGCATCTTCCTCATGCTTTCAGTAAATTGGAAAACCCACGGAGGGTTTCTATCTTACTAAAGGATATTTCAGAAAATCTATATTCACTGAGGAGGATGATAATTGGGTCTACTAACATTGAGACTGAACTGAGGCCCAGCAATAATTTAAACTTATTATCCTTTGAAGATTCAACTACTGGGGGAGTACAACAGAAACAAATTAGAGAACATGAAGTTTTAATTCACGTTGAAGATGAAACATGGGACCCAACACTTGATCATTTAGCTAAACATGATGGAGAAGATGTACTTGGAAATAAAGTGGAACGAAAAGAAGATGGATTTGAAGATGGAGTAGAAGACAACAAATTGAAAGAGAATATGGAAAGAGCTTGTTTGATGTCGTTAGATATTACAGAACATGAACTCCAAATTTTGGAACAGCAGTCTCAGGAAGAATATCTTAGTGATATTGCTTATAAATCTACTGAGCATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGAGATGCTTAAGCATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGAGATGCTTAAGTCTTTAGAAAACCTCAATAGTGGCACGGTAGAACCAACTCATTCTAAATGCTTAAAAATGGAAAGAAATCTGGGTCTTCCTACTAAAGAAGAAGAAGAAGATGATGAAAATGAAGCTAATGAAGGGGAAGAAGATGATGATAAGGACTTTTTGTGGCCAGCACCCAATGAAGAGCAAGTTACTTGCCTCAAGATGTACTTTGGCCATTCCAGTTTTAAACCAGTTCAGTGGAAAGTGATTCATTCAGTATTAGAAGAAAGAAGAGATAATGTTGCTGTCATGGCAACTGGATATGGAAAGAGTTTGTGCTTCCAGTATCCACCTGTTTATGTAGGCAAGATTGGCCTTGTTATCTCTCCCCTTATTTCTCTGATGGAAGACCAAGTGCTACAGCTTAAAATGTCCAACATCCCAGCTTGCTTCCTTGGATCAGCACAGTCAGAAAATGTTCTAACAGATATTAAATTAGGTAAATACCGGATTGTATACGTAACTCCAGAATACTGTTCAGGTAACATGGGCCTGCTCCAGCAACTTGAGGCTGATATTGGTATCACGCTCATTGCTGTGGATGAGGCTCACTGTATTTCTGAGTGGGGGCATGATTTTAGGGATTCATTCAGGAAGTTGGGCTCCCTAAAGACAGCACTGCCAATGGTTCCAATCGTTGCACTTACTGCTACTGCAAGTTCTTCAATCCGGGAAGACATTGTACGTTGCTTAAATCTGAGAAATCCTCAGATCACCTGTACTGGTTTTGATCGACCAAACCTGTATTTAGAAGTTAGGCGAAAAACAGGGAATATCCTTCAGGATCTGCAGCCATTTCTTGTCAAAACAAGTTCCCACTGGGAATTTGAAGGTCCAACAATCATCTACTGTCCTTCTAGAAAAATGACACAACAAGTTACAGGTGAACTTAGGAAACTGAATCTATCCTGTGGAACATACCATGCGGGCATGAGTTTTAGCACAAGGAAAGACATTCATCATAGGTTTGTAAGAGATGAAATTCAGTGTGTCATAGCTACCATAGCTTTTGGAATGGGCATTAATAAAGCTGACATTCGCCAAGTCATTCATTACGGTGCTCCTAAGGACATGGAATCATATTATCAGGAGATTGGTAGAGCTGGTCGTGATGGACTTCAAAGTTCTTGTCACGTCCTCTGGGCTCCTGCAGACATTAACTTAAATAGGCACCTTCTTACTGAGATACGTAATGAGAAGTTTCGATTATACAAATTAAAGATGATGGCAAAGATGGAAAAATATCTTCATTCTAGCAGATGTAGGAGACAAATCATCTTGTCTCATTTTGAGGACAAACAAGTACAAAAAGCCTCCTTGGGAATTATGGGAACTGAAAAATGCTGTGATAATTGCAGGTCCAGATTGGATCATTGCTATTCCATGGATGACTCAGAGGATACATCCTGGGACTTTGGTCCACAAGCATTTAAGCTTTTGTCTGCTGTGGACATCTTAGGCGAAAAATTTGGAATTGGGCTTCCAATTTTATTTCTCCGAGGATCTAATTCTCAGCGTCTTGCCGATCAATATCGCAGGCACAGTTTATTTGGCACTGGCAAGGATCAAACAGAGAGTTGGTGGAAGGCTTTTTCCCGTCAGCTGATCACTGAGGGATTCTTGGTAGAAGTTTCTCGGTATAACAAATTTATGAAGATTTGCGCCCTTACGAAAAAGGGTAGAAATTGGCTTCATAAAGCTAATACAGAATCTCAGAGCCTCATCCTTCAAGCTAATGAAGAATTGTGTCCAAAGAAGTTGCTTCTGCCTAGTTCGAAAACTGTATCTTCGGGCACCAAAGAGCATTGTTATAATCAAGTACCAGTTGAATTAAGTACAGAGAAGAAGTCTAACTTGGAGAAGTTATATTCTTATAAACCATGTGATAAGATTTCTTCTGGGAGTAACATTTCTAAAAAAAGTATCATGGTACAGTCACCAGAAAAAGCTTACAGTTCCTCACAGCCTGTTATTTCGGCACAAGAGCAGGAGACTCAGATTGTGTTATATGGCAAATTGGTAGAAGCTAGGCAGAAACATGCCAATAAAATGGATGTTCCCCCAGCTATTCTGGCAACAAACAAGATACTGGTGGATATGGCCAAAATGAGACCAACTACGGTTGAAAACGTAAAAAGGATTGATGGTGTTTCTGAAGGCAAAGCTGCCATGTTGGCCCCTCTGTTGGAAGTCATCAAACATTTCTGCCAAACAAATAGTGTTCAGACAGACCTCTTTTCAAGTACAAAACCTCAAGAAGAACAGAAGACGAGTCTGGTAGCAAAAAATAAAATATGCACACTTTCACAGTCTATGGCCATCACATACTCTTTATTCCAAGAAAAGAAGATGCCTTTGAAGAGCATAGCTGAGAGCAGGATTCTGCCTCTCATGACAATTGGCATGCACTTATCCCAAGCGGTGAAAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAGATATGAGTAAAATTAGCCTAATCAGAATGTTAGTTCCTGAAAACATTGACACGTACCTTATCCACATGGCAATTGAGATCCTTAAACATGGTCCTGACAGCGGACTTCAACCTTCATGTGATGTCAACAAAAGGAGATGTTTTCCCGGTTCTGAAGAGATCTGTTCAAGTTCTAAGAGAAGCAAGGAAGAAGTAGGCATCAATACTGAGACTTCATCTGCAGAGAGAAAGAGACGATTACCTGTGTGGTTTGCCAAAGGAAGTGATACCAGCAAGAAATTAATGGACAAAACGAAAAGGGGAGGTCTTTTTAGTTAAGCTGGCAATTACCAGAACAATTATGTTTCTTGCTGTATTATAAGAGGATAGCTATATTTTATTTCTGAAGAGTAAGGAGTAGTATTTTGGCTTAAAAATCATTCTAATTACAAAGTTCACTGTTTATTGAAGAACTGGCATCTTAAATCAGCCTTCCGCAATTCATGTAGTTTCTGGGTCTTCTGGGAGCCTACGTGAGTACATCACCTAACAGAATATTAAATTAGACTTCCTGTAAGATTGCTTTAAGAAACTGTTACTGTCCTGTTTTCTAATCTCTTTATTAAAACAGTGTATTTGGAAAATGTTATGTGCTCTGATTTGATATAGATAACAGATTAGTAGTTACATGGTAATTATGTGATATAAAATATTCATATATTATCAAAATTCTGTTTTGTAAATGTAAGAAAGCATAGTTATTTTACAAATTGTTTTTACTGTCTTTTGAAGAAGTTCTTAAATACGTTGTTAAATGGTATTAGTTGACCAGGGCAGTGAAAATGAAACCGCATTTTGGGTGCCATTAAATAGGGAAAAAACATGTAAAAAATGTAAAATGGAGACCAATTGCACTAGGCAAGTGTATATTTTGTATTTTATATACAATTTCTATTATTTTTCAAGTAATAAAACAATGTTTTTCATACTGAATATTATATATATATTTTTTAGCTTTCATTTACTTAATTATTTTAAGTACCTTTATTTTTCCAGGATGTCAGAATTTGATTCTAATCTCTCTTATGTAGCACATGTGACTTAATTTAAAACCTATACTGTGACACAGAGTTGGGTAAACGATGATTATTTAACTTTAAGCAGTTCACCATCCATTTCAAAGCCTTTGATTGGCTTTTTTGTAAATAAAAATAACTTGTTAAGAAACAAATATATCTGTCATAGAAGAACTAGAAAATCCAGGGAAGTGAGAAAAATGAAAATAAAAATCATTCATAGTTTTACTAGTAGCTAATCACAGTCAACCTCTTTTGTGTATCCCACCAGACTTTTTTATATTCATTTGTTTTTAGTTAAAATATAAAAGTCTCGTATATTCCCATTTTTCTGCATTGCATTACCAGAAGGTAGTGGCGCCTATTAAATATGTGATATGTTGTTGTCCAGCCATGGCTTCTGCATTTGCATGCTTTTGTGTGTGCATCTGCAATACCCTGTGAATATCCTGTGTGATGGAGTGGCAAGTACGCACAGACACGTCTGCTGCATGCCTAGGTACGAGGCTGTCTCCAGGAGAAGCACTTGTTTGATTATTTGAGTTGCCAATTGAATTTGCTGCTTTTTTTCATGGCTTGCCATTTTCACTGAAAAGAATGACTAATGAAAAACGATGATTGGTTATTAGATTTGGATGTTTGGCAGACATTTTCTCAAAATTGAACTAAGTTGGCCTCTTCACGGAAAACAACTGGTATTTGTTGTGCCAATGATAAAATTGGAGATTTCTAGCAAAATGTATAATTTTGGAAAAGTTGTGTTCCTCCACTGGAAGCTTGACAGCTTTCCTTAACATAAAGACTTCTCTTTCTCTTCGCTTTCACTACTACTACTACTAATTCTTCTTCTGATTCTTCTTCTTCTCCTTCTTCCTTCTTCCTTCCTTCCTCCTCCTCCTCCTTCTTCTTCCTCTTCCTCTTCTTCTTTCTCTCTTTCCTTCCTTCCCTTCCCTTCCCCTTCCTTCCTTCCTTCCTTCCTTCCTCCCTCCCTCCCTCCCTCCCTCCCTCCCTCCTTTCTTTTTCTTTCTCTTTCTTTCTTTCTTTCTCTCTCTCTCTCTCTTTCTTTCTTTTTCTTTCTCTTTTTCTTTCTTTCAAGCAGTCCTCCCGCCTCAGTCCCCCAAAATAGTGGGATTACAGGTGTGAGCCACCATGCACAGCCTTACATAAAGCCTTTTCTAATGAGATGGATAGTAATTAACAAATGTGAGTTTTTGATATTATATAAAGATTTTTTCTGTGTTTCGAAGATCCGTATAACTCAGTGAATCAGTATGTTCTGGATGACTAATATGTGATGTTAAGAAATCATGACTGAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGATCAGGAGATCGAGACCACCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGTTGGTGCGTGCCTATAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACTCAGGAGGCGGAGATTGCAGTGAGCTGAGACTGCGCCACTGCACCCCAGCCTGGCGACAGAGCAAGACTCCGTCTCAAAAATAAAAAAAGAAATCATGACTGGGTAAAAGATCTGTTCAGAGTACAAGATGGACCAATGGATTTGATATATTTGAATATAACAGAGTATGAAAAAGTTTATTGATATAGTTTCAGATTACACACTGCAACTAATCTTTAAGAAACTATTACTTGTCCACTTTTTGGTAAAATTTCAGAGAACAATGTCCACCATTATCTGAACAGGCTATTAAAATACTCTTCTCTTTTCCAACTACGTGCCTGTGCAAAGTCAGATTTTTTTCATATACTTCAGCCAAAACAGCATATCAAAATGGATTGAATGCAGAAGTAGATCTGAGAATACAGCCACTTTTGTTAAGCCAGACAATGAGATTTGCAAAATGTAAACAATGCTGCTGTTCTCAGTTTTTAAAAATATGTTTTTTAAAAGTATTTATGTTAATGTGTACTTGGTTTACTACTGCTATTTTTAAATAAAACAAGAAACATTTTTAAATGTCTGTTTTAATTTCTAAAGTGGTAGTGATAGATATAACCCATATTAATAAAAGCTCTTTGGGGTCCTCAGTGATTTTTTTTTAAGAGTATGGAAGGGTTCTCAGACCTAAGAGATTGAGAAATGCTGATGTAATGTTTTATTATAAAGGTGTACCATGAATTATGTACCTTACTTCATATTGTTGGACATTAAAGTTGCTTTCAGTTTTTTTGTTTTAAA SEQ ID NO:566 WRN之外顯子2的轉譯多肽序列 MSEKKLETTAQQRKCPEWMNVQNKRCAVEERK SEQ ID NO:567 WRN之外顯子3的轉譯多肽序列 ACVRKSVFEDDLPFLEFTGSIVYSYDASDCSFLSEDI SEQ ID NO:568 WRN之外顯子4的轉譯多肽序列 MSLSDGDVVGFDMEWPPLYNRGKLGKVALIQLCVSESKCYLFHVSSMS SEQ ID NO:569 WRN之外顯子5的轉譯多肽序列 FPQGLKMLLENKAVKKAGVGIEGDQWKLLRDFDIKLKNFVELTDVANKK SEQ ID NO:570 WRN之外顯子6的轉譯多肽序列 LKCTETWSLNSLVKHLLGKQLLKDKSIRCSNWSKFPLTEDQKLYAATDAY SEQ ID NO:571 WRN之外顯子7的轉譯多肽序列 AGFIIYRNLEILDDTVQRFAINK SEQ ID NO:572 WRN之外顯子8的轉譯多肽序列 EEILLSDMNKQLTSISEEVMDLAKHLPHAFSKLENPR SEQ ID NO:573 WRN之外顯子9的轉譯多肽序列 VSILLKDISENLYSLRRMIIGSTNIETELRPSNNLNLLSFEDSTTGGVQQKQIREHEVLIHVEDETWDPTLDHLAKHDGEDVLGNKVERKEDGFEDGVEDNKLKENMERACLMSLDITEHELQILEQQSQEEYLSDIAYKSTE SEQ ID NO:574 WRN之外顯子10的轉譯多肽序列 HLSPNDNENDTSYVIESDEDLEMEMLK SEQ ID NO:575 WRN之外顯子11的轉譯多肽序列 HLSPNDNENDTSYVIESDEDLEMEMLK SEQ ID NO:576 WRN之外顯子12的轉譯多肽序列 SLENLNSGTVEPTHSKCLKMERNLGLPTKEEEEDDENEANEGEEDDDK SEQ ID NO:577 WRN之外顯子13的轉譯多肽序列 FLWPAPNEEQVTCLKMYFGHSSFK SEQ ID NO:578 WRN之外顯子14的轉譯多肽序列 VQWKVIHSVLEERRDNVAVMAT SEQ ID NO:579 WRN之外顯子15的轉譯多肽序列 YGKSLCFQYPPVYVGKIGLVISPLISLMEDQVLQL SEQ ID NO:580 WRN之外顯子16的轉譯多肽序列 MSNIPACFLGSAQSENVLTDIK SEQ ID NO:581 WRN之外顯子17的轉譯多肽序列 GKYRIVYVTPEYCSGNMGLLQQLEADI SEQ ID NO:582 WRN之外顯子18的轉譯多肽序列 ITLIAVDEAHCISEWGHDFRDSFRKLGSLKTALPM SEQ ID NO:583 WRN之外顯子19的轉譯多肽序列 VPIVALTATASSSIREDIVRCLNLRNPQITCTGFDRPNLYLEVRRKTGNILQDLQPFLVKT SEQ ID NO:584 WRN之外顯子20的轉譯多肽序列 SHWEFEGPTIIYCPSRKMTQQVTGELRKLNLSCGTYHAGMSFSTRKDIHHRFVRDEIQ SEQ ID NO:585 WRN之外顯子21的轉譯多肽序列 CVIATIAFGMGINKADIRQVIHYGAPKDMESYYQEIGRAGRDGLQSSCHVLWAPADINLN SEQ ID NO:586 WRN之外顯子22的轉譯多肽序列 HLLTEIRNEKFRLYKLKMMAKMEKYLHSSRCRR SEQ ID NO:587 WRN之外顯子23的轉譯多肽序列 IILSHFEDKQVQKASLGIMGTEKCCDNCRS SEQ ID NO:588 WRN之外顯子24的轉譯多肽序列 LDHCYSMDDSEDTSWDFGPQAFKLLSAVDILGEKFGIGLPILFLRGS SEQ ID NO:589 WRN之外顯子25的轉譯多肽序列 NSQRLADQYRRHSLFGTGKDQTESWWKAFSRQLITEGFLVEVSRYNKFMKICALTKK SEQ ID NO:590 WRN之外顯子26的轉譯多肽序列 GRNWLHKANTESQSLILQANEELCPKKLLLP SEQ ID NO:591 WRN之外顯子27的轉譯多肽序列 SKTVSSGTKEHCYNQVPVELSTEKK SEQ ID NO:592 WRN之外顯子28的轉譯多肽序列 SNLEKLYSYKPCDKISSGSNISKK SEQ ID NO:593 WRN之外顯子29的轉譯多肽序列 IMVQSPEKAYSSSQPVISAQEQETQ SEQ ID NO:594 WRN之外顯子30的轉譯多肽序列 IVLYGKLVEARQKHANKMDVPPAILATNKILVDMAKM SEQ ID NO:595 WRN之外顯子31的轉譯多肽序列 PTTVENVKRIDGVSEGKAAMLAPLLEVIKHFCQTNSVQ SEQ ID NO:596 WRN之外顯子32的轉譯多肽序列 TDLFSSTKPQEEQKTSLVAKNKICTLSQSMAITYSLFQEKKMPL SEQ ID NO:597 WRN之外顯子33的轉譯多肽序列 KSIAESRILPLMTIGMHLSQAVKAGCPLDLERAGLTPEVQKIIADVIRNPPVNS SEQ ID NO:598 WRN之外顯子34的轉譯多肽序列 MSKISLIRMLVPENIDTYLIHMAIEILKHGPDSGLQPSCDVNKRRCFPGSEEICSSSKRSKEEVGINTE SEQ ID NO:599 WRN之外顯子35的轉譯多肽序列 TSSAERKRRLPVWFAKGSDTSKKLMDKTKRGGLFS SEQ ID NO:600 WRN之全蛋白質序列 MSEKKLETTAQQRKCPEWMNVQNKRCAVEERKACVRKSVFEDDLPFLEFTGSIVYSYDASDCSFLSEDISMSLSDGDVVGFDMEWPPLYNRGKLGKVALIQLCVSESKCYLFHVSSMSVFPQGLKMLLENKAVKKAGVGIEGDQWKLLRDFDIKLKNFVELTDVANKKLKCTETWSLNSLVKHLLGKQLLKDKSIRCSNWSKFPLTEDQKLYAATDAYAGFIIYRNLEILDDTVQRFAINKEEEILLSDMNKQLTSISEEVMDLAKHLPHAFSKLENPRRVSILLKDISENLYSLRRMIIGSTNIETELRPSNNLNLLSFEDSTTGGVQQKQIREHEVLIHVEDETWDPTLDHLAKHDGEDVLGNKVERKEDGFEDGVEDNKLKENMERACLMSLDITEHELQILEQQSQEEYLSDIAYKSTEHLSPNDNENDTSYVIESDEDLEMEMLKHLSPNDNENDTSYVIESDEDLEMEMLKSLENLNSGTVEPTHSKCLKMERNLGLPTKEEEEDDENEANEGEEDDDKDFLWPAPNEEQVTCLKMYFGHSSFKPVQWKVIHSVLEERRDNVAVMATGYGKSLCFQYPPVYVGKIGLVISPLISLMEDQVLQLKMSNIPACFLGSAQSENVLTDIKLGKYRIVYVTPEYCSGNMGLLQQLEADIGITLIAVDEAHCISEWGHDFRDSFRKLGSLKTALPMVPIVALTATASSSIREDIVRCLNLRNPQITCTGFDRPNLYLEVRRKTGNILQDLQPFLVKTSSHWEFEGPTIIYCPSRKMTQQVTGELRKLNLSCGTYHAGMSFSTRKDIHHRFVRDEIQCVIATIAFGMGINKADIRQVIHYGAPKDMESYYQEIGRAGRDGLQSSCHVLWAPADINLNRHLLTEIRNEKFRLYKLKMMAKMEKYLHSSRCRRQIILSHFEDKQVQKASLGIMGTEKCCDNCRSRLDHCYSMDDSEDTSWDFGPQAFKLLSAVDILGEKFGIGLPILFLRGSNSQRLADQYRRHSLFGTGKDQTESWWKAFSRQLITEGFLVEVSRYNKFMKICALTKKGRNWLHKANTESQSLILQANEELCPKKLLLPSSKTVSSGTKEHCYNQVPVELSTEKKSNLEKLYSYKPCDKISSGSNISKKSIMVQSPEKAYSSSQPVISAQEQETQIVLYGKLVEARQKHANKMDVPPAILATNKILVDMAKMRPTTVENVKRIDGVSEGKAAMLAPLLEVIKHFCQTNSVQTDLFSSTKPQEEQKTSLVAKNKICTLSQSMAITYSLFQEKKMPLKSIAESRILPLMTIGMHLSQAVKAGCPLDLERAGLTPEVQKIIADVIRNPPVNSDMSKISLIRMLVPENIDTYLIHMAIEILKHGPDSGLQPSCDVNKRRCFPGSEEICSSSKRSKEEVGINTETSSAERKRRLPVWFAKGSDTSKKLMDKTKRGGLFS SEQ ID NO:601 WRN之全部外顯子1-33按順序排列的多核苷酸序列 SEQ ID NO:602 WRN之全部外顯子1-33按順序排列的轉譯多肽序列 DAAM1 序列資訊SEQ ID NO:603 來自DAAM1-NRG1融合物5’處之DAAM1之5'UTR/外顯子1的序列GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAG SEQ ID NO:604 DAAM1-NRG1融合物3’處之NRG1的5’UTR/外顯子1序列 GACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA SEQ ID NO:605 DAAM1-NRG1多核苷酸序列 GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAGGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA SEQ ID NO:606 DAAM1之外顯子1 GCGAGTTAGTAACCTACGAGCGGCTGTGAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAG SEQ ID NO:607 DAAM1之外顯子2 CGTTTAGTCACATCAAGAAATAGAACAGAATTCAGCCATGGCCCCAAGAAAGAGAGGTGGACGAGGTATTTCATTCATCTTTTGCTGTTTCCGAAATAATGATCACCCAGAAATCACGTATCGGCTGCGAAATGATAGCAACTTTGCGCTTCAGACCATGGAACCAGCATTGCCCATGCCCCCTGTGGAGGAGCTGGATGTCATGTTCAGTGAACTGGTG SEQ ID NO:608 DAAM1之外顯子3 GATGAACTGGACCTCACAGACAAACACAGAGAAGCCATGTTTGCACTTCCAGCTGAGAAAAAATGGCAAATATACTGTAGCAAGAAAAAG SEQ ID NO:609 DAAM1之外顯子4 GACCAGGAAGAAAACAAGGGAGCTACAAGTTGGCCTGAATTCTACATTGATCAGCTCAATTCCATGGCTGCT SEQ ID NO:610 DAAM1之外顯子5 AGAAAATCTCTGCTGGCTTTAGAGAAGGAAGAAGAAGAAGAAAGAAGTAAAACTATAGAGAGTTTAAAGACAGCACTGAGGACAAAACCAATGAG SEQ ID NO:611 DAAM1之外顯子6 GTTTGTAACCAGATTCATCGACTTGGATGGCCTATCATGTATCCTCAACTTTCTAAAGACCATGGACTACGAGACCTCAGAGTCTCGAATACATACTTCTCTCATTGGCTGTATAAAGGCGTTAATGAACAACTCTCAAGGCCGGGCTCACGTCCTGGCTCATTCTGAGAGTATTAATGTAATTGCTCAGAGTCTGAGCACAGAGAACATTAAAACGAAGGTGGCCGTGCTGGAAATCTTGGGCGCCGTGTGCCTGGTTCCCGGGGGCCACAAGAAGGTTCTGCAGGCCATGCTGCACTACCAGAAGTATGCCAGCGAAAGGACCCGCTTTCAG SEQ ID NO:612 DAAM1之外顯子7 ACATTAATTAACGACTTGGATAAAAGCACTGGGCGGTATCGAGATGAAGTGAGTCTCAAGACTGCCATCATGTCCTTCATTAATGCAGTGCTCAGCCAAGGTGCAGGAGTG SEQ ID NO:613 DAAM1之外顯子8 GAGAGTTTGGACTTTAGACTTCATCTTCGCTATGAATTTCTGATGTTAGGAATTCAACCTGTAATAGATAAATTAAGGGAACACGAAAATTCAACATTAGATAG SEQ ID NO:614 DAAM1之外顯子9 GCATTTAGACTTTTTTGAAATGCTCCGAAATGAAGATGAACTAGAATTTGCCAAAAGATTTGAACTG SEQ ID NO:615 DAAM1之外顯子10 GTTCACATAGACACAAAAAGTGCAACTCAGATGTTTGAGCTGACCAGGAAGAGGCTGACACATAGTGAAGCTTACCCGCATTTCATGTCCATCCTGCACCACTGCCTCCAAATGCCTT SEQ ID NO:616 DAAM1之外顯子11 ACAAGAGGAGTGGCAACACTGTTCAGTACTGGCTACTACTAGATAGAATTATACAGCAGATAGTTATCCAGAATGACAAAGGACAGGACCCTGACTCCACACCTTTGGAAAACTTTAATATTAAGAATGTCGTACGAAT SEQ ID NO:617 DAAM1之外顯子12 GTTGGTTAATGAAAATGAAGTTAAGCAGTGGAAAGAACAAGCGGAAAAAATGAGAAAAG SEQ ID NO:618 DAAM1之外顯子13 AGCACAATGAGCTACAACAGAAACTGGAAAAGAAAGAACGAGAATGTGATGCTAAGACTCAAGAGAAGGAAGAGATGATGCAGACCTTAAATAAAATGAAAGAGAAACTTGAAAAGGAGACTACTGAGCATAAGCAAGTCAAGCAGCAGGTGGCGGACCTCACAGCACAGCTCCATGAGCTCAGCAGG SEQ ID NO:619 DAAM1之外顯子14 AGGGCCGTCTGTGCTTCAATCCCAGGTGGACCCTCGCCTGGAGCACCAGGAGGGCCCTTTCCTTCCTCTGTGCCTGGATCTCTCCTTCCTCCCCCACCACCCCCACCTCTACCAGGTGGGATGCTTCCCCCTCCACCGCCTCCCCTCCCTCCAGGTGGCCCTCCTCCTCCCCCAGGGCCTCCTCCCTTAGGGGCAATCATGCCACCTCCTGGTGCTCCAATGGGCCTAGCACTGAAGAAGAAAAGCATTCCTCAGCCCACAAATGCCCTGAAATCCTTCAACTGGTCTAAACTGCCCGAG SEQ ID NO:620 DAAM1之外顯子15 AACAAACTGGAAGGAACAGTATGGACCGAAATTGATGATACAAAAGTCTTCAAAATTCTAGATCTTGAAGACCTGGAAAGAACCTTCTCTGCCTATCAAAGACAGCAG SEQ ID NO:621 DAAM1之外顯子16 AAAGAAGCAGATGCCATTGATGACACTCTGAGTTCCAAACTTAAAGTTAAAGAGCTTTCGGTGATTGATGGTCGGAGAGCTCAGAATTGCAACATCCTTCTATCGAG SEQ ID NO:622 DAAM1之外顯子17 GTTGAAATTATCCAATGACGAAATCAAACGGGCAATTCTAACAATGGACGAACAGGAAGATCTGCCCAAGGACATGTTGGAACAG SEQ ID NO:623 DAAM1之外顯子18 CTCTTGAAATTTGTTCCTGAAAAAAGTGACATTGACCTATTGGAGGAACATAAACACGAACTGGATCGGATGGCCAAGGCTGATAGGTTCCTTTTTGAGATGAGCCG SEQ ID NO:624 DAAM1之外顯子19 AATTAATCACTATCAGCAAAGGTTGCAATCGCTGTACTTCAAAAAGAAGTTTGCAGAGCGTGTGGCAGAAGTGAAACCTAAAGTGGAAG SEQ ID NO:625 DAAM1之外顯子20 CAATTCGTTCTGGCTCAGAAGAGGTGTTTAGGAGTGGTGCCCTCAAGCAGTTGCTGGAGGTGGTTTTGGCATTTGGAAATTATATGAATAAAGGTCAAAGAGGGAATGCATATGGATTCAAGATATCTAGCCTAAACAAAATTGCTGACACAAAATCCAGCATCGACAA SEQ ID NO:626 DAAM1之外顯子21 AAACATTACCCTTTTGCACTATCTCATCACTATTGTGGAAAATAAGTACCCCAGTGTTCTCAATCTAAATGAAGAATTGCGAGATATTCCTCAAGCTGCGAAAGTAAA SEQ ID NO:627 DAAM1之外顯子22 CATGACTGAGCTGGACAAAGAAATAAGTACCTTGAGAAGTGGCTTGAAAGCAGTAGAGACA SEQ ID NO:628 DAAM1之外顯子23 GAGCTGGAATATCAGAAGTCTCAGCCCCCACAGCCCGGAGATAAGTTTGTGTCTGTTGTCAGCCAGTTCATCACAGTAGCCAGCTTCAGCTTCTCTGATGTTGAAGACCTTCTAGCAGAAGCTAAAGACCTG SEQ ID NO:629 DAAM1之外顯子24 TTTACTAAAGCAGTGAAGCACTTTGGGGAAGAGGCTGGCAAAATACAACCAGATGAGTTCTTTGGCATTTTTGATCAATTTCTTCAAGCTGTGTCAGAAGCCAAACAAGAAAACGAAAATATGAGAAAGAAAAAGGAGGAAGAAGAACGTCGAGCTCGCATGGAAGCTCAG SEQ ID NO:630 DAAM1之外顯子25 CTCAAAGAACAACGTGAAAGGGAACGTAAAATGAGAAAAGCTAAAGAGAATAGTGAAGAAAGCGGAGAGTTTGATGACCTTGTTTCAGCTTTACGCTCAGGAGAAGTGTTTGACAAAGACCTTTCTAAATTGAAACGGAATCGCAAACGTATTACCAACCAGATGACTGACAGCAGCAGAGAGAGACCAATCACAAAACTTAATTTCTAATTTTCCATGAATACTTTTTTTTAGAAAGCTCATTAGCAGCCCTCTAAAGTGACTAGAACGTTTCATTACACTGCCTTGCAATCCAAACAGTGGCAATTTTTTCCTTCATCTGTGAGTGAATGTGTGAACGTGTGTATGTAAATGTATGTGTGTATATATTAAAAAATGTATATAGATGTCTGAGTGTTGTCTGGAGACCTATACGTATGGTTAAAAAGATTTATGTTAATGTATGTGCTCCAAAACCTTTCGTGTATGCATTCACATTGAGTGTGGCTCATTTTCTTTCCCCGAACGCCATGACTGTTCAGAAGCACAATACTATCTCCTGAAAGAGATAAGAGACATTCCCTAGATTCAAAGGCAAAACAGAAGAAACAAACAAACAAACAAAAAAAGCTTGCAAAATATTTTATGGTTTCCAAGCTTGATATCCTTTAAAATTATTTTCATTGATGGAACTGGAGTTGTTGGAAAAACATAGATTTAAAATGATTTTTGATAGCTGACATTGTGATGTTGATGTATCACATCAGTAATAGGACCAGCTTTGAATTTCTGACATTGGTGTGGGGATACAGTCTGTAAATGTTTATTGAGAACATCTTGCACACAATTTGAATTATGTAGAATGTCAATCAAGTTTTTGTATATTTAAAAGTTGGACATCAATTTTTTCCCCTGATTTCATCAAGTTATCTCTGCCAAGTGCTCTTGATAATTTCTTCAGATTTTTGGAAAAAAACACTATATAAATGCAATCCATGCTTTTTTTAAAGAACAACATTGCCAGAGTATGCTTGTTCTAACAATATAGATATATAAACCTTAAAAATAATAAAATATCTCACCCAAGACTTAAAGGAAGAATTCTCTGAAGGGATAAAGATTACTAAAAAAAAAAAAAAAAAAAAAAAATTAATGGGGTGCCTTTTTGTTATAGTTTCTATTTTCTGTTTTGTAGGACAAGCTGCATTTTCTGTAAATATAGGTCTGGACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTATTTATCTCTAAGTATATTTGAAGTGATTGCTGTTTATATGTTGTCATTTTAAAATTGTGTGTCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAATAAAGCTCTCAGGGAGACATGAATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCTTAATATCCTTAAATTATGTGATATATAAAGAGGACTGTTACTTTTTTACTTTTTTTTTTTTTTTTTTTTTTTTTTGGCTTTGCTTTATTTATTTGTAGTTGGGGGCTAACGTTTTCTCTTTTCTTTCTATTGATCCTGTTGTGGTTGGGTTTCCTGTGGAGAGAGTAGTTTGTCCTGTTGCACTAGAACATTATTTACTCACTAAATTGAGTTTTTCAGTCAATTAACAAATATTTATTAAGTTCCTACTATGTACCAGGCATAGTAGGGCTACAATGGTAAGCAAGACAGAGTCCCTGCCCCCAAAGAGCTTATATTCTAATGGGGATATTAAGGGATGAATAGAATACCAATGTGTGCACTGTACAGGAATCATACTATTTAAAAATAATTTGTATAAACTATAATGCTTAGCACAGATGGCGAGTTATCTGTGCTATGTGAAAGCTGTGAAATAGTGCTCTAAGAGTTGTCAAGAGCTGTGGTTTTACATCTTTTCCTCATTGCAAATTTAGTGACTTTCTACACACTATATGGAAATAAATGACTAGCAAATAAAACAGTCATAAATACAAAGCAGAGGTTGCACTCCCCCAATCCCGAGTTAACCCAGGTCTGCAATAACACCATGTTAAAGGTGCAGATAGAGACTTGGCTCAAAAAGGCTTGGAGATGAGGAAGATTGGAATATAATGATGGTTTTGTCTGTTCCTTAACTAAAGTGCCTCTATGTATATTCTTTTCTATTTGTAGCAGGATAAATTTGGTCTGGCTCAGTTTTGGAACTGTATTTTGAAAATGGCTTTGTCTTACAGTTTAAGGAATAGACAGGTGGAGGGAAAGTCACATAAAGGAGCAAGTTTGTGTAGCTGTCCCTCTTGCCCCTTTTAATCATCCTCCTTTGATATGGCCATCCTGGTGGGCCTCCTTTGCCATTTCCATTTTTGGTTTCTTTCCCTGAAAACTGTGTGCAGGTAATTCCATGTGCCATTGTTGAAAAGAAAAAAAAAAAACAAAAAAAAAACCTACTTTTTAGATTGGTGCTGGTGTAAGTAGCCACTTTTCTCTCTTGGGTGTGTATTTTAAACTTTTTTTGTTTTTTTAAATTAATGCCAAAAAGAAAATGCATAATTTGTAAACTTAATTATATGTCTTATATCTTATTAGCTTAGTAGTTGGAACCACTTAGTCTTTAGGTGCAAGACTGTTGTTAGATAGTACTGAGAAAAAAAAAGTATGTGTTATGAGACTGTACATGTTTTTTTAAAAATAGCAATATGCAATAAAGAGATGAATTCATTGGGTGTA SEQ ID NO:631 DAAM1之全mRNA多核苷酸序列 GCGAGTTAGTAACCTACGAGCGGCTGTGAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAGCGTTTAGTCACATCAAGAAATAGAACAGAATTCAGCCATGGCCCCAAGAAAGAGAGGTGGACGAGGTATTTCATTCATCTTTTGCTGTTTCCGAAATAATGATCACCCAGAAATCACGTATCGGCTGCGAAATGATAGCAACTTTGCGCTTCAGACCATGGAACCAGCATTGCCCATGCCCCCTGTGGAGGAGCTGGATGTCATGTTCAGTGAACTGGTGGATGAACTGGACCTCACAGACAAACACAGAGAAGCCATGTTTGCACTTCCAGCTGAGAAAAAATGGCAAATATACTGTAGCAAGAAAAAGGACCAGGAAGAAAACAAGGGAGCTACAAGTTGGCCTGAATTCTACATTGATCAGCTCAATTCCATGGCTGCTAGAAAATCTCTGCTGGCTTTAGAGAAGGAAGAAGAAGAAGAAAGAAGTAAAACTATAGAGAGTTTAAAGACAGCACTGAGGACAAAACCAATGAGGTTTGTAACCAGATTCATCGACTTGGATGGCCTATCATGTATCCTCAACTTTCTAAAGACCATGGACTACGAGACCTCAGAGTCTCGAATACATACTTCTCTCATTGGCTGTATAAAGGCGTTAATGAACAACTCTCAAGGCCGGGCTCACGTCCTGGCTCATTCTGAGAGTATTAATGTAATTGCTCAGAGTCTGAGCACAGAGAACATTAAAACGAAGGTGGCCGTGCTGGAAATCTTGGGCGCCGTGTGCCTGGTTCCCGGGGGCCACAAGAAGGTTCTGCAGGCCATGCTGCACTACCAGAAGTATGCCAGCGAAAGGACCCGCTTTCAGACATTAATTAACGACTTGGATAAAAGCACTGGGCGGTATCGAGATGAAGTGAGTCTCAAGACTGCCATCATGTCCTTCATTAATGCAGTGCTCAGCCAAGGTGCAGGAGTGGAGAGTTTGGACTTTAGACTTCATCTTCGCTATGAATTTCTGATGTTAGGAATTCAACCTGTAATAGATAAATTAAGGGAACACGAAAATTCAACATTAGATAGGCATTTAGACTTTTTTGAAATGCTCCGAAATGAAGATGAACTAGAATTTGCCAAAAGATTTGAACTGGTTCACATAGACACAAAAAGTGCAACTCAGATGTTTGAGCTGACCAGGAAGAGGCTGACACATAGTGAAGCTTACCCGCATTTCATGTCCATCCTGCACCACTGCCTCCAAATGCCTTACAAGAGGAGTGGCAACACTGTTCAGTACTGGCTACTACTAGATAGAATTATACAGCAGATAGTTATCCAGAATGACAAAGGACAGGACCCTGACTCCACACCTTTGGAAAACTTTAATATTAAGAATGTCGTACGAATGTTGGTTAATGAAAATGAAGTTAAGCAGTGGAAAGAACAAGCGGAAAAAATGAGAAAAGAGCACAATGAGCTACAACAGAAACTGGAAAAGAAAGAACGAGAATGTGATGCTAAGACTCAAGAGAAGGAAGAGATGATGCAGACCTTAAATAAAATGAAAGAGAAACTTGAAAAGGAGACTACTGAGCATAAGCAAGTCAAGCAGCAGGTGGCGGACCTCACAGCACAGCTCCATGAGCTCAGCAGGAGGGCCGTCTGTGCTTCAATCCCAGGTGGACCCTCGCCTGGAGCACCAGGAGGGCCCTTTCCTTCCTCTGTGCCTGGATCTCTCCTTCCTCCCCCACCACCCCCACCTCTACCAGGTGGGATGCTTCCCCCTCCACCGCCTCCCCTCCCTCCAGGTGGCCCTCCTCCTCCCCCAGGGCCTCCTCCCTTAGGGGCAATCATGCCACCTCCTGGTGCTCCAATGGGCCTAGCACTGAAGAAGAAAAGCATTCCTCAGCCCACAAATGCCCTGAAATCCTTCAACTGGTCTAAACTGCCCGAGAACAAACTGGAAGGAACAGTATGGACCGAAATTGATGATACAAAAGTCTTCAAAATTCTAGATCTTGAAGACCTGGAAAGAACCTTCTCTGCCTATCAAAGACAGCAGAAAGAAGCAGATGCCATTGATGACACTCTGAGTTCCAAACTTAAAGTTAAAGAGCTTTCGGTGATTGATGGTCGGAGAGCTCAGAATTGCAACATCCTTCTATCGAGGTTGAAATTATCCAATGACGAAATCAAACGGGCAATTCTAACAATGGACGAACAGGAAGATCTGCCCAAGGACATGTTGGAACAGCTCTTGAAATTTGTTCCTGAAAAAAGTGACATTGACCTATTGGAGGAACATAAACACGAACTGGATCGGATGGCCAAGGCTGATAGGTTCCTTTTTGAGATGAGCCGAATTAATCACTATCAGCAAAGGTTGCAATCGCTGTACTTCAAAAAGAAGTTTGCAGAGCGTGTGGCAGAAGTGAAACCTAAAGTGGAAGCAATTCGTTCTGGCTCAGAAGAGGTGTTTAGGAGTGGTGCCCTCAAGCAGTTGCTGGAGGTGGTTTTGGCATTTGGAAATTATATGAATAAAGGTCAAAGAGGGAATGCATATGGATTCAAGATATCTAGCCTAAACAAAATTGCTGACACAAAATCCAGCATCGACAAAAACATTACCCTTTTGCACTATCTCATCACTATTGTGGAAAATAAGTACCCCAGTGTTCTCAATCTAAATGAAGAATTGCGAGATATTCCTCAAGCTGCGAAAGTAAACATGACTGAGCTGGACAAAGAAATAAGTACCTTGAGAAGTGGCTTGAAAGCAGTAGAGACAGAGCTGGAATATCAGAAGTCTCAGCCCCCACAGCCCGGAGATAAGTTTGTGTCTGTTGTCAGCCAGTTCATCACAGTAGCCAGCTTCAGCTTCTCTGATGTTGAAGACCTTCTAGCAGAAGCTAAAGACCTGTTTACTAAAGCAGTGAAGCACTTTGGGGAAGAGGCTGGCAAAATACAACCAGATGAGTTCTTTGGCATTTTTGATCAATTTCTTCAAGCTGTGTCAGAAGCCAAACAAGAAAACGAAAATATGAGAAAGAAAAAGGAGGAAGAAGAACGTCGAGCTCGCATGGAAGCTCAGCTCAAAGAACAACGTGAAAGGGAACGTAAAATGAGAAAAGCTAAAGAGAATAGTGAAGAAAGCGGAGAGTTTGATGACCTTGTTTCAGCTTTACGCTCAGGAGAAGTGTTTGACAAAGACCTTTCTAAATTGAAACGGAATCGCAAACGTATTACCAACCAGATGACTGACAGCAGCAGAGAGAGACCAATCACAAAACTTAATTTCTAATTTTCCATGAATACTTTTTTTTAGAAAGCTCATTAGCAGCCCTCTAAAGTGACTAGAACGTTTCATTACACTGCCTTGCAATCCAAACAGTGGCAATTTTTTCCTTCATCTGTGAGTGAATGTGTGAACGTGTGTATGTAAATGTATGTGTGTATATATTAAAAAATGTATATAGATGTCTGAGTGTTGTCTGGAGACCTATACGTATGGTTAAAAAGATTTATGTTAATGTATGTGCTCCAAAACCTTTCGTGTATGCATTCACATTGAGTGTGGCTCATTTTCTTTCCCCGAACGCCATGACTGTTCAGAAGCACAATACTATCTCCTGAAAGAGATAAGAGACATTCCCTAGATTCAAAGGCAAAACAGAAGAAACAAACAAACAAACAAAAAAAGCTTGCAAAATATTTTATGGTTTCCAAGCTTGATATCCTTTAAAATTATTTTCATTGATGGAACTGGAGTTGTTGGAAAAACATAGATTTAAAATGATTTTTGATAGCTGACATTGTGATGTTGATGTATCACATCAGTAATAGGACCAGCTTTGAATTTCTGACATTGGTGTGGGGATACAGTCTGTAAATGTTTATTGAGAACATCTTGCACACAATTTGAATTATGTAGAATGTCAATCAAGTTTTTGTATATTTAAAAGTTGGACATCAATTTTTTCCCCTGATTTCATCAAGTTATCTCTGCCAAGTGCTCTTGATAATTTCTTCAGATTTTTGGAAAAAAACACTATATAAATGCAATCCATGCTTTTTTTAAAGAACAACATTGCCAGAGTATGCTTGTTCTAACAATATAGATATATAAACCTTAAAAATAATAAAATATCTCACCCAAGACTTAAAGGAAGAATTCTCTGAAGGGATAAAGATTACTAAAAAAAAAAAAAAAAAAAAAAAATTAATGGGGTGCCTTTTTGTTATAGTTTCTATTTTCTGTTTTGTAGGACAAGCTGCATTTTCTGTAAATATAGGTCTGGACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTATTTATCTCTAAGTATATTTGAAGTGATTGCTGTTTATATGTTGTCATTTTAAAATTGTGTGTCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAATAAAGCTCTCAGGGAGACATGAATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCTTAATATCCTTAAATTATGTGATATATAAAGAGGACTGTTACTTTTTTACTTTTTTTTTTTTTTTTTTTTTTTTTTGGCTTTGCTTTATTTATTTGTAGTTGGGGGCTAACGTTTTCTCTTTTCTTTCTATTGATCCTGTTGTGGTTGGGTTTCCTGTGGAGAGAGTAGTTTGTCCTGTTGCACTAGAACATTATTTACTCACTAAATTGAGTTTTTCAGTCAATTAACAAATATTTATTAAGTTCCTACTATGTACCAGGCATAGTAGGGCTACAATGGTAAGCAAGACAGAGTCCCTGCCCCCAAAGAGCTTATATTCTAATGGGGATATTAAGGGATGAATAGAATACCAATGTGTGCACTGTACAGGAATCATACTATTTAAAAATAATTTGTATAAACTATAATGCTTAGCACAGATGGCGAGTTATCTGTGCTATGTGAAAGCTGTGAAATAGTGCTCTAAGAGTTGTCAAGAGCTGTGGTTTTACATCTTTTCCTCATTGCAAATTTAGTGACTTTCTACACACTATATGGAAATAAATGACTAGCAAATAAAACAGTCATAAATACAAAGCAGAGGTTGCACTCCCCCAATCCCGAGTTAACCCAGGTCTGCAATAACACCATGTTAAAGGTGCAGATAGAGACTTGGCTCAAAAAGGCTTGGAGATGAGGAAGATTGGAATATAATGATGGTTTTGTCTGTTCCTTAACTAAAGTGCCTCTATGTATATTCTTTTCTATTTGTAGCAGGATAAATTTGGTCTGGCTCAGTTTTGGAACTGTATTTTGAAAATGGCTTTGTCTTACAGTTTAAGGAATAGACAGGTGGAGGGAAAGTCACATAAAGGAGCAAGTTTGTGTAGCTGTCCCTCTTGCCCCTTTTAATCATCCTCCTTTGATATGGCCATCCTGGTGGGCCTCCTTTGCCATTTCCATTTTTGGTTTCTTTCCCTGAAAACTGTGTGCAGGTAATTCCATGTGCCATTGTTGAAAAGAAAAAAAAAAAACAAAAAAAAAACCTACTTTTTAGATTGGTGCTGGTGTAAGTAGCCACTTTTCTCTCTTGGGTGTGTATTTTAAACTTTTTTTGTTTTTTTAAATTAATGCCAAAAAGAAAATGCATAATTTGTAAACTTAATTATATGTCTTATATCTTATTAGCTTAGTAGTTGGAACCACTTAGTCTTTAGGTGCAAGACTGTTGTTAGATAGTACTGAGAAAAAAAAAGTATGTGTTATGAGACTGTACATGTTTTTTTAAAAATAGCAATATGCAATAAAGAGATGAATTCATTGGGTGTA SEQ ID NO:632 DAAM1之全蛋白質序列 MAPRKRGGRGISFIFCCFRNNDHPEITYRLRNDSNFALQTMEPALPMPPVEELDVMFSELVDELDLTDKHREAMFALPAEKKWQIYCSKKKDQEENKGATSWPEFYIDQLNSMAARKSLLALEKEEEEERSKTIESLKTALRTKPMRFVTRFIDLDGLSCILNFLKTMDYETSESRIHTSLIGCIKALMNNSQGRAHVLAHSESINVIAQSLSTENIKTKVAVLEILGAVCLVPGGHKKVLQAMLHYQKYASERTRFQTLINDLDKSTGRYRDEVSLKTAIMSFINAVLSQGAGVESLDFRLHLRYEFLMLGIQPVIDKLREHENSTLDRHLDFFEMLRNEDELEFAKRFELVHIDTKSATQMFELTRKRLTHSEAYPHFMSILHHCLQMPYKRSGNTVQYWLLLDRIIQQIVIQNDKGQDPDSTPLENFNIKNVVRMLVNENEVKQWKEQAEKMRKEHNELQQKLEKKERECDAKTQEKEEMMQTLNKMKEKLEKETTEHKQVKQQVADLTAQLHELSRRAVCASIPGGPSPGAPGGPFPSSVPGSLLPPPPPPPLPGGMLPPPPPPLPPGGPPPPPGPPPLGAIMPPPGAPMGLALKKKSIPQPTNALKSFNWSKLPENKLEGTVWTEIDDTKVFKILDLEDLERTFSAYQRQQKEADAIDDTLSSKLKVKELSVIDGRRAQNCNILLSRLKLSNDEIKRAILTMDEQEDLPKDMLEQLLKFVPEKSDIDLLEEHKHELDRMAKADRFLFEMSRINHYQQRLQSLYFKKKFAERVAEVKPKVEAIRSGSEEVFRSGALKQLLEVVLAFGNYMNKGQRGNAYGFKISSLNKIADTKSSIDKNITLLHYLITIVENKYPSVLNLNEELRDIPQAAKVNMTELDKEISTLRSGLKAVETELEYQKSQPPQPGDKFVSVVSQFITVASFSFSDVEDLLAEAKDLFTKAVKHFGEEAGKIQPDEFFGIFDQFLQAVSEAKQENENMRKKKEEEERRARMEAQLKEQRERERKMRKAKENSEESGEFDDLVSALRSGEVFDKDLSKLKRNRKRITNQMTDSSRERPITKLNF ASPH 序列資訊SEQ ID NO:633 來自ASPH-NRG1融合物5’處之ASPH外顯子22的序列 AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAG SEQ ID NO:634 ASPH-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:635 ASPH-NRG1多核苷酸序列 AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:636 ASPH-NRG1多肽序列 GLFLPEDENLREKGDWSQFTLWQQALPPRLKEMKSQESAAGSKLVLRCE SEQ ID NO:637 ASPH之外顯子1 AGTCTCAAGCTCTGGTAGGCAAGTGCATGCAGTGTGCCTAAAACCTGCCAGCAGTACTTTTGAGTTTTTTTTTTTGTTTTGTTTTACTTTAGCATTTATTATTCATGGATTGAAGAAATCAAAATGGCTGAAGATAAAG SEQ ID NO:638 ASPH之外顯子2 AGACAAAGCATGGAGGACACAAGAATGGGAGGAAAGGCGGACTCTCAGGAACTTCATTCTTCACGTGGTTTATGGTGATTGCATTGCTGGGCGTCTGGACATCTGTAGCTGTCGTTTGGTTTGATCTTGTTGACTATGAGGAAGTTCTAG SEQ ID NO:639 ASPH之外顯子3 GAAAACTAGGAATCTATGATGCTGATGGTGATGGAGATTTTGATGTGGATGATGCCAAAGTTTTATTAG SEQ ID NO:640 ASPH之外顯子4 GACTTAAAGAGAGATCTACTTCAGAGCCAGCAGTCCCGCCAGAAGAGGCTGAGCCACACACTGAGCCCGAGGAGCAGGTTCCTGTGGAGGCAG SEQ ID NO:641 ASPH之外顯子5 AACCCCAGAATATCGAAGATGAAGCAAAAGAACAAATTCAGTCCCTTCTCCATGAAATGGTACACGCAGAACATG SEQ ID NO:642 ASPH之外顯子6 TTGAGGGAGAAGACTTGCAACAAGAAGATGGACCCACAGGAGAACCACAACAAGAGGATGATGAGTTTCTTATGGCGACTGATGTAGATGATAGATTTGAGACCCTGGAACCTGAAGTATCTCATGAAG SEQ ID NO:643 ASPH之外顯子7 AAACCGAGCATAGTTACCACGTGGAAGAGACAG SEQ ID NO:644 ASPH之外顯子8 TTTCACAAGACTGTAATCAGGATATGGAAGAGATGATGTCTGAGCAGGAAAATCCAG SEQ ID NO:645 ASPH之外顯子9 ATTCCAGTGAACCAGTAGTAGAAGATGAAAGATTGCACCATGATACAG SEQ ID NO:646 ASPH之外顯子10 ATGATGTAACATACCAAGTCTATGAGGAACAAG SEQ ID NO:647 ASPH之外顯子11 CAGTATATGAACCTCTAGAAAATGAAGGGATAGAAATCACAG SEQ ID NO:648 ASPH之外顯子12 AAGTAACTGCTCCCCCTGAGGATAATCCTGTAGAAGATTCACAGGTAATTGTAGAAG SEQ ID NO:649 ASPH之外顯子13 AAGTAAGCATTTTTCCTGTGGAAGAACAGCAGGAAGTACCACCAG SEQ ID NO:650 ASPH之外顯子14 AAACAAATAGAAAAACAGATGATCCAGAACAAAAAGCAAAAG SEQ ID NO:651 ASPH之外顯子15 TTAAGAAAAAGAAGCCTAAACTTTTAAATAAATTTGATAAGACTATTAAAGCTGAACTTGATGCTGCAGAAAAACTCCGTAAAAGG SEQ ID NO:652 ASPH之外顯子16 GGAAAAATTGAGGAAGCAGTGAATGCATTTAAAGAACTAGTACGCAAATACCCTCAGAGTCCACGAGCAAGATATGGGAAGGCGCAG SEQ ID NO:653 ASPH之外顯子17 TGTGAGGATGATTTGGCTGAGAAGAGGAGAAGTAATGAGGTGCTACGTGGAGCCATCGAGACCTACCAAGAGGTGGCCAGCCTACCTGATGTCCCTGCAGACCTGCTGAAGCTGAGTTTGAAGCGTCGCTCAGACAGGCAACAATTTCTAG SEQ ID NO:654 ASPH之外顯子18 GTCATATGAGAGGTTCCCTGCTTACCCTGCAGAGATTAGTTCAACTATTTCCCAATGATACTTCCTTAAAAAATGACCTTGGCGTGGGATACCTCTTGATAGGAGATAATGACAATGCAAAGAAAGTTTATGAAGAG SEQ ID NO:655 ASPH之外顯子19 GTGCTGAGTGTGACACCTAATGATGGCTTTGCTAAAGTCCATTATGGCTTCATCCTGAAGGCACAGAACAAAATTGCTGAGAGCATCCCATATTTAAAG SEQ ID NO:656 ASPH之外顯子20 GAAGGAATAGAATCCGGAGATCCTGGCACTGATGATGGGAGATTTTATTTCCACCTGGGGGATGCCATGCAGAGGGTTGGGAACAAAGAG SEQ ID NO:657 ASPH之外顯子21 GCATATAAGTGGTATGAGCTTGGGCACAAGAGAGGACACTTTGCATCTGTCTGGCAACGCTCACTCTACAATGTGAATGGACTGAAAGCACAGCCTTGGTGGACCCCAAAAGAAACGGGCTACACAGAGTTAGTAAAG SEQ ID NO:658 ASPH之外顯子22 TCTTTAGAAAGAAACTGGAAGTTAATCCGAGATGAAGGCCTTGCAGTGATGGATAAAGCCAAAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAG SEQ ID NO:659 ASPH之外顯子23 GAAGAAGAAATGAAAATGCCTGCAAAGGAGCTCCTAAAACCTGTACCTTACTAGAAAAGTTCCCCGAGACAACAGGATGCAGAAGAGGACAG SEQ ID NO:660 ASPH之外顯子24 ATCAAATATTCCATCATGCACCCCGGGACTCACGTGTGGCCGCACACAGGGCCCACAAACTGCAGGCTCCGAATGCACCTGGGCTTGGTGATTCCCAAGGAAGGCTGCAAGATTCGATGTGCCAACGAGACCAA SEQ ID NO:661 ASPH之外顯子25 GACCTGGGAGGAAGGCAAGGTGCTCATCTTTGATGACTCCTTTGAGCACGAGGTATGGCAGGATGCCTCATCTTTCCGGCTGATATTCATCGTGGATGTGTGGCATCCGGAACTGACACCACAGCAGAGACGCAGCCTTCCAGCAATTTAGCATGAATTCATGCAAGCTTGGGAAACTCTGGAGAGAGGCTGCCTTTCTGGTTCCATCTCCTTGGGTGTGAGGATAGAATTTCGAACACCAAGAGTCAATTCCCTTGACTTGCAGCCCGAGTAATTCAAAGCCTCCTCCTAGGGTCAGAAGACACTAAAGGGAATATTTGCCTCGCTGCAATTCATTTAGGAAACACCCTGCTGTGTGTCATCTCATGACAGCACTGGTCTTCTGCCAGTATTTAAGGTGAACATTTGATAGCTTCTACCTTACCAGCCAAAGATATTTTTTCCACATAGAATAGGTCTAATTCAATGTATAATGAGAACATATGTAGAAACTGTGAATGGATTGCTTTAGTTTGTAATTTTTCTATGCAGTTATATTTTTCTAGTGTAGCTAGACTATTTTGTCATCATGTACCACTACATTTTTGTTTATTTTAATGACAAGCTGTATAAATGCTTTACTTCTAGCTATTTAATGGTAGCATTACTGGGGAACTCAGACTTCCCTCTTTTAATTCTTCTTAGTAAAAGATACTCATGAAAAAAGCAGTTTTATTTTCCTAACAAAAAAGAAAGAGCTCATTATGTCAGTGTCTATGAACTGTACCCATCCCAACTCTCAAATCGTTTGGTTTTTTTTATCTTGATTGAGATCCTCTTCTCACTATGCTAGTGGTGGAGATATTGACAAAATCCTATTTCTTTCAAAGAGGAACTTTTCACACCGAAAAAAGAGCATGGAATTATTTTATATTGTTATAAAAATCCCAGATGCAAATTTTTTTAATGCCAATTATTAGAGCTTCTGGGGAAAAAGTATAGTTCACGGAAATAAAACTATGTTCTTTCAGGGTTGGGTGGATAGGTGGCTGCTAGGGTGTCTGGCTCCTGGCGGCTTTGCCATCCATGAGGCAAGGGCTGGGAACACAGTGTCTTTGCCTATGGTAGATCCATGTGAATGTCAGGAAGCCAGCTCTTCAGTCTTGGAGATGATTTCTGCTACAATTCTGTAGAAAGATTAAGGATGGCAGAGTAAAAGGTTACCAAGAATGCCAGGATGTTTTTCTTGGGCGTAGGAGGTCCAGATTACTTTCCTTTTTGATGAAAGAGTTTGGAAGACTGTCCCATCTCTCTGGCTTGAGAAATCTCTGCCATTTTAAACATCACTGTGAAATAGCAATTATTATCATCTGTATTTAGTTTTAACATTACCCACAACATAGAAATAATAGGTAAAAATCGTCTTGCCTACTCATTCCAAAGATGATCAAGTCATTAATCTAGCAAAGTATTCATGTATCAGATTTTGTATATTTTGAATCAAAGCTAACTAGGAATGTTAGATATAAGAATGTAATGATATTCATGCACTGAATTCTAAGCCAATATGAACAAAAATGCTGCATGAATGGCACATATAGGTCACCAAAGTTCATTCACAGGTAGAAAAAACTTGTGCTTTCTTTTCCATCTAAAAACAAAAGGAGACTTTCTTTATCTCATTTAAAGAACAGCTCTTTGAAATTGAAATTGACCCTTTTTGCTTGACCTTAAGGAGATTAGCTTCCAGTAGATGAGTTTGCAAAATACTTTTCCTGTTCTTTTGTTTTGCTGGTATTGAAAACATCCCACTAAATCAGATGAAGAGGCATGGGAGGAAAAATATCCAAATTAATTACTAAAATCGAGAAGAGAAGGCAAACTCTTGAAAAGTAAAAAGGTGTTTGTGACCTTCAGTATTTATTGAACAGAGGAAATAACTGACAAGGGCAATACAATTCAATGTTCATGTAGTAACATTCATGTCACTTGTTGAATTTGGTTCTCATATGTATATTGCATACACATAAATTCAAACTATAAGTCGTCATTTTTGAGCCATCATCTTACATTCATGTAATGAAATTATGGAAGAGAGTAAAAACTAGCTCTTAACTTAGTAAATATAATATGGTATTTAAAATCAGGTCACTACAGTAAGGTTCTAAGTATTGCCAATTGAAAAGCTAGAAATGGTATTACTGTTGCAAAGTGTTGTCAATAATTGACTCCAATAGCATTGTAAATACTTGTATCCCACAACTATTTTAAACCCAAGCAATAAAATGGATTTTCTAATTCACTTCAATTCTTTATTTCTCTTACCTATCTATGTCTTGGTACATAAGAAAAGTGTTTCCATCACTGCATTGGTAGAATTATTTCAGTGTTATTATTTTTGTGATTTCGTATGTCTACACAAAAATGAATTAGTTTAATTTATATTGTGATACAGTTGTTTGAGAAATATTTTTAATTCTGTTTCAACTTTATTTCTCCAAGTGTATAATATAAAAATTACTTCTGTTATTGTTCTCTACCAATAGGGGAAAAAATTAAAACATTAGATCCATTGAGAAAGAGATGATGTAATAAATAATTAAGATTAGTAATAATATTATCAGGGGTGATTATGACCAGTTGAATAATCTCTTTCCCTTGAATTATTTAGCTAACAAATTAACTCTCCCAAATATTTAAAATAATGTAAAATCATATTTTACTGCCCATTATTAACTAAAATATTTTTGTTTGACTTTGAGCACCAACTGGTAATACTAATAAATACCCATGTCATGCAGATGGCTGGGCGAATAAGAGATGTCTAAAAATATGCACTGGTCTTGGAAAACATGGCACAAGTAAGGATATCATATATGATGTCTGTTTATTTTATGTCTGATTTCTTTTGAATGAGTAGTTGGGGACTCCATTTCTAAGGAGACTAGGTAAATAAAATGACCTTTGACATTTCA SEQ ID NO:662 ASPH之全mRNA多核苷酸序列 AGTCTCAAGCTCTGGTAGGCAAGTGCATGCAGTGTGCCTAAAACCTGCCAGCAGTACTTTTGAGTTTTTTTTTTTGTTTTGTTTTACTTTAGCATTTATTATTCATGGATTGAAGAAATCAAAATGGCTGAAGATAAAGAGACAAAGCATGGAGGACACAAGAATGGGAGGAAAGGCGGACTCTCAGGAACTTCATTCTTCACGTGGTTTATGGTGATTGCATTGCTGGGCGTCTGGACATCTGTAGCTGTCGTTTGGTTTGATCTTGTTGACTATGAGGAAGTTCTAGGAAAACTAGGAATCTATGATGCTGATGGTGATGGAGATTTTGATGTGGATGATGCCAAAGTTTTATTAGGACTTAAAGAGAGATCTACTTCAGAGCCAGCAGTCCCGCCAGAAGAGGCTGAGCCACACACTGAGCCCGAGGAGCAGGTTCCTGTGGAGGCAGAACCCCAGAATATCGAAGATGAAGCAAAAGAACAAATTCAGTCCCTTCTCCATGAAATGGTACACGCAGAACATGTTGAGGGAGAAGACTTGCAACAAGAAGATGGACCCACAGGAGAACCACAACAAGAGGATGATGAGTTTCTTATGGCGACTGATGTAGATGATAGATTTGAGACCCTGGAACCTGAAGTATCTCATGAAGAAACCGAGCATAGTTACCACGTGGAAGAGACAGTTTCACAAGACTGTAATCAGGATATGGAAGAGATGATGTCTGAGCAGGAAAATCCAGATTCCAGTGAACCAGTAGTAGAAGATGAAAGATTGCACCATGATACAGATGATGTAACATACCAAGTCTATGAGGAACAAGCAGTATATGAACCTCTAGAAAATGAAGGGATAGAAATCACAGAAGTAACTGCTCCCCCTGAGGATAATCCTGTAGAAGATTCACAGGTAATTGTAGAAGAAGTAAGCATTTTTCCTGTGGAAGAACAGCAGGAAGTACCACCAGAAACAAATAGAAAAACAGATGATCCAGAACAAAAAGCAAAAGTTAAGAAAAAGAAGCCTAAACTTTTAAATAAATTTGATAAGACTATTAAAGCTGAACTTGATGCTGCAGAAAAACTCCGTAAAAGGGGAAAAATTGAGGAAGCAGTGAATGCATTTAAAGAACTAGTACGCAAATACCCTCAGAGTCCACGAGCAAGATATGGGAAGGCGCAGTGTGAGGATGATTTGGCTGAGAAGAGGAGAAGTAATGAGGTGCTACGTGGAGCCATCGAGACCTACCAAGAGGTGGCCAGCCTACCTGATGTCCCTGCAGACCTGCTGAAGCTGAGTTTGAAGCGTCGCTCAGACAGGCAACAATTTCTAGGTCATATGAGAGGTTCCCTGCTTACCCTGCAGAGATTAGTTCAACTATTTCCCAATGATACTTCCTTAAAAAATGACCTTGGCGTGGGATACCTCTTGATAGGAGATAATGACAATGCAAAGAAAGTTTATGAAGAGGTGCTGAGTGTGACACCTAATGATGGCTTTGCTAAAGTCCATTATGGCTTCATCCTGAAGGCACAGAACAAAATTGCTGAGAGCATCCCATATTTAAAGGAAGGAATAGAATCCGGAGATCCTGGCACTGATGATGGGAGATTTTATTTCCACCTGGGGGATGCCATGCAGAGGGTTGGGAACAAAGAGGCATATAAGTGGTATGAGCTTGGGCACAAGAGAGGACACTTTGCATCTGTCTGGCAACGCTCACTCTACAATGTGAATGGACTGAAAGCACAGCCTTGGTGGACCCCAAAAGAAACGGGCTACACAGAGTTAGTAAAGTCTTTAGAAAGAAACTGGAAGTTAATCCGAGATGAAGGCCTTGCAGTGATGGATAAAGCCAAAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGGAAGAAGAAATGAAAATGCCTGCAAAGGAGCTCCTAAAACCTGTACCTTACTAGAAAAGTTCCCCGAGACAACAGGATGCAGAAGAGGACAGATCAAATATTCCATCATGCACCCCGGGACTCACGTGTGGCCGCACACAGGGCCCACAAACTGCAGGCTCCGAATGCACCTGGGCTTGGTGATTCCCAAGGAAGGCTGCAAGATTCGATGTGCCAACGAGACCAAGACCTGGGAGGAAGGCAAGGTGCTCATCTTTGATGACTCCTTTGAGCACGAGGTATGGCAGGATGCCTCATCTTTCCGGCTGATATTCATCGTGGATGTGTGGCATCCGGAACTGACACCACAGCAGAGACGCAGCCTTCCAGCAATTTAGCATGAATTCATGCAAGCTTGGGAAACTCTGGAGAGAGGCTGCCTTTCTGGTTCCATCTCCTTGGGTGTGAGGATAGAATTTCGAACACCAAGAGTCAATTCCCTTGACTTGCAGCCCGAGTAATTCAAAGCCTCCTCCTAGGGTCAGAAGACACTAAAGGGAATATTTGCCTCGCTGCAATTCATTTAGGAAACACCCTGCTGTGTGTCATCTCATGACAGCACTGGTCTTCTGCCAGTATTTAAGGTGAACATTTGATAGCTTCTACCTTACCAGCCAAAGATATTTTTTCCACATAGAATAGGTCTAATTCAATGTATAATGAGAACATATGTAGAAACTGTGAATGGATTGCTTTAGTTTGTAATTTTTCTATGCAGTTATATTTTTCTAGTGTAGCTAGACTATTTTGTCATCATGTACCACTACATTTTTGTTTATTTTAATGACAAGCTGTATAAATGCTTTACTTCTAGCTATTTAATGGTAGCATTACTGGGGAACTCAGACTTCCCTCTTTTAATTCTTCTTAGTAAAAGATACTCATGAAAAAAGCAGTTTTATTTTCCTAACAAAAAAGAAAGAGCTCATTATGTCAGTGTCTATGAACTGTACCCATCCCAACTCTCAAATCGTTTGGTTTTTTTTATCTTGATTGAGATCCTCTTCTCACTATGCTAGTGGTGGAGATATTGACAAAATCCTATTTCTTTCAAAGAGGAACTTTTCACACCGAAAAAAGAGCATGGAATTATTTTATATTGTTATAAAAATCCCAGATGCAAATTTTTTTAATGCCAATTATTAGAGCTTCTGGGGAAAAAGTATAGTTCACGGAAATAAAACTATGTTCTTTCAGGGTTGGGTGGATAGGTGGCTGCTAGGGTGTCTGGCTCCTGGCGGCTTTGCCATCCATGAGGCAAGGGCTGGGAACACAGTGTCTTTGCCTATGGTAGATCCATGTGAATGTCAGGAAGCCAGCTCTTCAGTCTTGGAGATGATTTCTGCTACAATTCTGTAGAAAGATTAAGGATGGCAGAGTAAAAGGTTACCAAGAATGCCAGGATGTTTTTCTTGGGCGTAGGAGGTCCAGATTACTTTCCTTTTTGATGAAAGAGTTTGGAAGACTGTCCCATCTCTCTGGCTTGAGAAATCTCTGCCATTTTAAACATCACTGTGAAATAGCAATTATTATCATCTGTATTTAGTTTTAACATTACCCACAACATAGAAATAATAGGTAAAAATCGTCTTGCCTACTCATTCCAAAGATGATCAAGTCATTAATCTAGCAAAGTATTCATGTATCAGATTTTGTATATTTTGAATCAAAGCTAACTAGGAATGTTAGATATAAGAATGTAATGATATTCATGCACTGAATTCTAAGCCAATATGAACAAAAATGCTGCATGAATGGCACATATAGGTCACCAAAGTTCATTCACAGGTAGAAAAAACTTGTGCTTTCTTTTCCATCTAAAAACAAAAGGAGACTTTCTTTATCTCATTTAAAGAACAGCTCTTTGAAATTGAAATTGACCCTTTTTGCTTGACCTTAAGGAGATTAGCTTCCAGTAGATGAGTTTGCAAAATACTTTTCCTGTTCTTTTGTTTTGCTGGTATTGAAAACATCCCACTAAATCAGATGAAGAGGCATGGGAGGAAAAATATCCAAATTAATTACTAAAATCGAGAAGAGAAGGCAAACTCTTGAAAAGTAAAAAGGTGTTTGTGACCTTCAGTATTTATTGAACAGAGGAAATAACTGACAAGGGCAATACAATTCAATGTTCATGTAGTAACATTCATGTCACTTGTTGAATTTGGTTCTCATATGTATATTGCATACACATAAATTCAAACTATAAGTCGTCATTTTTGAGCCATCATCTTACATTCATGTAATGAAATTATGGAAGAGAGTAAAAACTAGCTCTTAACTTAGTAAATATAATATGGTATTTAAAATCAGGTCACTACAGTAAGGTTCTAAGTATTGCCAATTGAAAAGCTAGAAATGGTATTACTGTTGCAAAGTGTTGTCAATAATTGACTCCAATAGCATTGTAAATACTTGTATCCCACAACTATTTTAAACCCAAGCAATAAAATGGATTTTCTAATTCACTTCAATTCTTTATTTCTCTTACCTATCTATGTCTTGGTACATAAGAAAAGTGTTTCCATCACTGCATTGGTAGAATTATTTCAGTGTTATTATTTTTGTGATTTCGTATGTCTACACAAAAATGAATTAGTTTAATTTATATTGTGATACAGTTGTTTGAGAAATATTTTTAATTCTGTTTCAACTTTATTTCTCCAAGTGTATAATATAAAAATTACTTCTGTTATTGTTCTCTACCAATAGGGGAAAAAATTAAAACATTAGATCCATTGAGAAAGAGATGATGTAATAAATAATTAAGATTAGTAATAATATTATCAGGGGTGATTATGACCAGTTGAATAATCTCTTTCCCTTGAATTATTTAGCTAACAAATTAACTCTCCCAAATATTTAAAATAATGTAAAATCATATTTTACTGCCCATTATTAACTAAAATATTTTTGTTTGACTTTGAGCACCAACTGGTAATACTAATAAATACCCATGTCATGCAGATGGCTGGGCGAATAAGAGATGTCTAAAAATATGCACTGGTCTTGGAAAACATGGCACAAGTAAGGATATCATATATGATGTCTGTTTATTTTATGTCTGATTTCTTTTGAATGAGTAGTTGGGGACTCCATTTCTAAGGAGACTAGGTAAATAAAATGACCTTTGACATTTCA SEQ ID NO:663 ASPH之外顯子1的轉譯多肽序列 MAEDK SEQ ID NO:664 ASPH之外顯子2的轉譯多肽序列 TKHGGHKNGRKGGLSGTSFFTWFMVIALLGVWTSVAVVWFDLVDYEEVL SEQ ID NO:665 ASPH之外顯子3的轉譯多肽序列 KLGIYDADGDGDFDVDDAKVLL SEQ ID NO:666 ASPH之外顯子4的轉譯多肽序列 LKERSTSEPAVPPEEAEPHTEPEEQVPVEA SEQ ID NO:667 ASPH之外顯子5的轉譯多肽序列 PQNIEDEAKEQIQSLLHEMVHAEH SEQ ID NO:668 ASPH之外顯子6的轉譯多肽序列 EGEDLQQEDGPTGEPQQEDDEFLMATDVDDRFETLEPEVSHE SEQ ID NO:669 ASPH之外顯子7的轉譯多肽序列 TEHSYHVEET SEQ ID NO:670 ASPH之外顯子8的轉譯多肽序列 SQDCNQDMEEMMSEQENP SEQ ID NO:671 ASPH之外顯子9的轉譯多肽序列 SSEPVVEDERLHHDT SEQ ID NO:672 ASPH之外顯子10的轉譯多肽序列 DVTYQVYEEQ SEQ ID NO:673 ASPH之外顯子11的轉譯多肽序列 VYEPLENEGIEIT SEQ ID NO:674 ASPH之外顯子12的轉譯多肽序列 VTAPPEDNPVEDSQVIVE SEQ ID NO:675 ASPH之外顯子13的轉譯多肽序列 VSIFPVEEQQEVPP SEQ ID NO:676 ASPH之外顯子14的轉譯多肽序列 TNRKTDDPEQKAK SEQ ID NO:677 ASPH之外顯子15的轉譯多肽序列 KKKKPKLLNKFDKTIKAELDAAEKLRKR SEQ ID NO:678 ASPH之外顯子16的轉譯多肽序列 GKIEEAVNAFKELVRKYPQSPRARYGKAQ SEQ ID NO:679 ASPH之外顯子17的轉譯多肽序列 CEDDLAEKRRSNEVLRGAIETYQEVASLPDVPADLLKLSLKRRSDRQQFL SEQ ID NO:680 ASPH之外顯子18的轉譯多肽序列 HMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNAKKVYEE SEQ ID NO:681 ASPH之外顯子19的轉譯多肽序列 VLSVTPNDGFAKVHYGFILKAQNKIAESIPYLK SEQ ID NO:682 ASPH之外顯子20的轉譯多肽序列 EGIESGDPGTDDGRFYFHLGDAMQRVGNKE SEQ ID NO:683 ASPH之外顯子21的轉譯多肽序列 AYKWYELGHKRGHFASVWQRSLYNVNGLKAQPWWTPKETGYTELVK SEQ ID NO:684 ASPH之外顯子22的轉譯多肽序列 SLERNWKLIRDEGLAVMDKAKGLFLPEDENLREKGDWSQFTLWQQ SEQ ID NO:685 ASPH之外顯子23的轉譯多肽序列 RRNENACKGAPKTCTLLEKFPETTGCRRGQ SEQ ID NO:686 ASPH之外顯子24的轉譯多肽序列 IKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANET SEQ ID NO:687 ASPH之外顯子25的轉譯多肽序列 TWEEGKVLIFDDSFEHEVWQDASSFRLIFIVDVWHPELTPQQRRSLPAI SEQ ID NO:688 ASPH之全蛋白質序列 MAEDKETKHGGHKNGRKGGLSGTSFFTWFMVIALLGVWTSVAVVWFDLVDYEEVLGKLGIYDADGDGDFDVDDAKVLLGLKERSTSEPAVPPEEAEPHTEPEEQVPVEAEPQNIEDEAKEQIQSLLHEMVHAEHVEGEDLQQEDGPTGEPQQEDDEFLMATDVDDRFETLEPEVSHEETEHSYHVEETVSQDCNQDMEEMMSEQENPDSSEPVVEDERLHHDTDDVTYQVYEEQAVYEPLENEGIEITEVTAPPEDNPVEDSQVIVEEVSIFPVEEQQEVPPETNRKTDDPEQKAKVKKKKPKLLNKFDKTIKAELDAAEKLRKRGKIEEAVNAFKELVRKYPQSPRARYGKAQCEDDLAEKRRSNEVLRGAIETYQEVASLPDVPADLLKLSLKRRSDRQQFLGHMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNAKKVYEEVLSVTPNDGFAKVHYGFILKAQNKIAESIPYLKEGIESGDPGTDDGRFYFHLGDAMQRVGNKEAYKWYELGHKRGHFASVWQRSLYNVNGLKAQPWWTPKETGYTELVKSLERNWKLIRDEGLAVMDKAKGLFLPEDENLREKGDWSQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRRGQIKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANETKTWEEGKVLIFDDSFEHEVWQDASSFRLIFIVDVWHPELTPQQRRSLPAI SEQ ID NO:689 ASPH之全部外顯子1-22按順序排列的多核苷酸序列 SEQ ID NO:690 ASPH之全部外顯子1-22按順序排列的轉譯多肽序列 NOTCH2 序列資訊SEQ ID NO:691 來自NOTCH2-NRG1融合物5’處之NOTCH2外顯子6的序列 CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAG SEQ ID NO:692 NOTCH2-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:693 Notch2-NRG1多核苷酸序列 CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:694 Notch2-NRG1多肽序列 SCTPGSTCIDRVASFSCMCPEGKAATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:695 NOTCH2之外顯子1 AGGCTGCTTCGTTGCACACCCGAGAAAGTTTCAGCCAAACTTCGGGCGGCGGCTGAGGCGGCGGCCGAGGAGCGGCGGACTCGGGGCGCGGGGAGTCGAGGCATTTGCGCCTGGGCTTCGGAGCGTAGCGCCAGGGCCTGAGCCTTTGAAGCAGGAGGAGGGGAGGAGAGAGTGGGGCTCCTCTATCGGGACCCCCTCCCCATGTGGATCTGCCCAGGCGGCGGCGGCGGCGGCGGAGGAGGAGGCGACCGAGAAGATGCCCGCCCTGCGCCCCGCTCTGCTGTGGGCGCTGCTGGCGCTCTGGCTGTGCTGCGCGGCCCCCGCGCATG SEQ ID NO:696 NOTCH2之外顯子2 CATTGCAGTGTCGAGATGGCTATGAACCCTGTGTAAATGAAGGAATGTGTGTTACCTACCACAATGGCACAGGATACTGCAA SEQ ID NO:697 NOTCH2之外顯子3 ATGTCCAGAAGGCTTCTTGGGGGAATATTGTCAACATCGAGACCCCTGTGAGAAGAACCGCTGCCAGAATGGTGGGACTTGTGTGGCCCAGGCCATGCTGGGGAAAGCCACGTGCCGATGTGCCTCAGGGTTTACAGGAGAGGACTGCCAGTACTCAACATCTCATCCATGCTTTGTGTCTCGACCCTGCCTGAATGGCGGCACATGCCATATGCTCAGCCGGGATACCTATGAGTGCACCTGTCAAGTCGGGTTTACAG SEQ ID NO:698 NOTCH2之外顯子4 GTAAGGAGTGCCAATGGACGGATGCCTGCCTGTCTCATCCCTGTGCAAATGGAAGTACCTGTACCACTGTGGCCAACCAGTTCTCCTGCAAATGCCTCACAGGCTTCACAGGGCAGAAATGTGAGACTGATGTCAATGAGTGTGACATTCCAGGACACTGCCAGCATGGTGGCACCTGCCTCAACCTGCCTGGTTCCTACCAGTGCCAGTGCCCTCAGGGCTTCACAGGCCAGTACTGTGACAGCCTGTATGTGCCCTGTGCACCCTCACCTTGTGTCAATGGAGGCACCTGTCGGCAGACTGGTGACTTCACTTTTGAGTGCAACTGCCTTCCAG SEQ ID NO:699 NOTCH2之外顯子5 GTTTTGAAGGGAGCACCTGTGAGAGGAATATTGATGACTGCCCTAACCACAGGTGTCAGAATGGAGGGGTTTGTGTGGATGGGGTCAACACTTACAACTGCCGCTGTCCCCCACAATGGACAG SEQ ID NO:700 NOTCH2之外顯子6 GACAGTTCTGCACAGAGGATGTGGATGAATGCCTGCTGCAGCCCAATGCCTGTCAAAATGGGGGCACCTGTGCCAACCGCAATGGAGGCTATGGCTGTGTATGTGTCAACGGCTGGAGTGGAGATGACTGCAGTGAGAACATTGATGATTGTGCCTTCGCCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAG SEQ ID NO:701 NOTCH2之外顯子7 GTCTCCTGTGTCATCTGGATGATGCATGCATCAGCAATCCTTGCCACAAGGGGGCACTGTGTGACACCAACCCCCTAAATGGGCAATATATTTGCACCTGCCCACAAGGCTACAAAGGGGCTGACTGCACAGAAGATGTGGATGAATGTGCCATGG SEQ ID NO:702 NOTCH2之外顯子8-34 CCAATAGCAATCCTTGTGAGCATGCAGGAAAATGTGTGAACACGGATGGCGCCTTCCACTGTGAGTGTCTGAAGGGTTATGCAGGACCTCGTTGTGAGATGGACATCAATGAGTGCCATTCAGACCCCTGCCAGAATGATGCTACCTGTCTGGATAAGATTGGAGGCTTCACATGTCTGTGCATGCCAGGTTTCAAAGGTGTGCATTGTGAATTAGAAATAAATGAATGTCAGAGCAACCCTTGTGTGAACAATGGGCAGTGTGTGGATAAAGTCAATCGTTTCCAGTGCCTGTGTCCTCCTGGTTTCACTGGGCCAGTTTGCCAGATTGATATTGATGACTGTTCCAGTACTCCGTGTCTGAATGGGGCAAAGTGTATCGATCACCCGAATGGCTATGAATGCCAGTGTGCCACAGGTTTCACTGGTGTGTTGTGTGAGGAGAACATTGACAACTGTGACCCCGATCCTTGCCACCATGGTCAGTGTCAGGATGGTATTGATTCCTACACCTGCATCTGCAATCCCGGGTACATGGGCGCCATCTGCAGTGACCAGATTGATGAATGTTACAGCAGCCCTTGCCTGAACGATGGTCGCTGCATTGACCTGGTCAATGGCTACCAGTGCAACTGCCAGCCAGGCACGTCAGGGGTTAATTGTGAAATTAATTTTGATGACTGTGCAAGTAACCCTTGTATCCATGGAATCTGTATGGATGGCATTAATCGCTACAGTTGTGTCTGCTCACCAGGATTCACAGGGCAGAGATGTAACATTGACATTGATGAGTGTGCCTCCAATCCCTGTCGCAAGGGTGCAACATGTATCAACGGTGTGAATGGTTTCCGCTGTATATGCCCCGAGGGACCCCATCACCCCAGCTGCTACTCACAGGTGAACGAATGCCTGAGCAATCCCTGCATCCATGGAAACTGTACTGGAGGTCTCAGTGGATATAAGTGTCTCTGTGATGCAGGCTGGGTTGGCATCAACTGTGAAGTGGACAAAAATGAATGCCTTTCGAATCCATGCCAGAATGGAGGAACTTGTGACAATCTGGTGAATGGATACAGGTGTACTTGCAAGAAGGGCTTTAAAGGCTATAACTGCCAGGTGAATATTGATGAATGTGCCTCAAATCCATGCCTGAACCAAGGAACCTGCTTTGATGACATAAGTGGCTACACTTGCCACTGTGTGCTGCCATACACAGGCAAGAATTGTCAGACAGTATTGGCTCCCTGTTCCCCAAACCCTTGTGAGAATGCTGCTGTTTGCAAAGAGTCACCAAATTTTGAGAGTTATACTTGCTTGTGTGCTCCTGGCTGGCAAGGTCAGCGGTGTACCATTGACATTGACGAGTGTATCTCCAAGCCCTGCATGAACCATGGTCTCTGCCATAACACCCAGGGCAGCTACATGTGTGAATGTCCACCAGGCTTCAGTGGTATGGACTGTGAGGAGGACATTGATGACTGCCTTGCCAATCCTTGCCAGAATGGAGGTTCCTGTATGGATGGAGTGAATACTTTCTCCTGCCTCTGCCTTCCGGGTTTCACTGGGGATAAGTGCCAGACAGACATGAATGAGTGTCTGAGTGAACCCTGTAAGAATGGAGGGACCTGCTCTGACTACGTCAACAGTTACACTTGCAAGTGCCAGGCAGGATTTGATGGAGTCCATTGTGAGAACAACATCAATGAGTGCACTGAGAGCTCCTGTTTCAATGGTGGCACATGTGTTGATGGGATTAACTCCTTCTCTTGCTTGTGCCCTGTGGGTTTCACTGGATCCTTCTGCCTCCATGAGATCAATGAATGCAGCTCTCATCCATGCCTGAATGAGGGAACGTGTGTTGATGGCCTGGGTACCTACCGCTGCAGCTGCCCCCTGGGCTACACTGGGAAAAACTGTCAGACCCTGGTGAATCTCTGCAGTCGGTCTCCATGTAAAAACAAAGGTACTTGCGTTCAGAAAAAAGCAGAGTCCCAGTGCCTATGTCCATCTGGATGGGCTGGTGCCTATTGTGACGTGCCCAATGTCTCTTGTGACATAGCAGCCTCCAGGAGAGGTGTGCTTGTTGAACACTTGTGCCAGCACTCAGGTGTCTGCATCAATGCTGGCAACACGCATTACTGTCAGTGCCCCCTGGGCTATACTGGGAGCTACTGTGAGGAGCAACTCGATGAGTGTGCGTCCAACCCCTGCCAGCACGGGGCAACATGCAGTGACTTCATTGGTGGATACAGATGCGAGTGTGTCCCAGGCTATCAGGGTGTCAACTGTGAGTATGAAGTGGATGAGTGCCAGAATCAGCCCTGCCAGAATGGAGGCACCTGTATTGACCTTGTGAACCATTTCAAGTGCTCTTGCCCACCAGGCACTCGGGGCCTACTCTGTGAAGAGAACATTGATGACTGTGCCCGGGGTCCCCATTGCCTTAATGGTGGTCAGTGCATGGATAGGATTGGAGGCTACAGTTGTCGCTGCTTGCCTGGCTTTGCTGGGGAGCGTTGTGAGGGAGACATCAACGAGTGCCTCTCCAACCCCTGCAGCTCTGAGGGCAGCCTGGACTGTATACAGCTCACCAATGACTACCTGTGTGTTTGCCGTAGTGCCTTTACTGGCCGGCACTGTGAAACCTTCGTCGATGTGTGTCCCCAGATGCCCTGCCTGAATGGAGGGACTTGTGCTGTGGCCAGTAACATGCCTGATGGTTTCATTTGCCGTTGTCCCCCGGGATTTTCCGGGGCAAGGTGCCAGAGCAGCTGTGGACAAGTGAAATGTAGGAAGGGGGAGCAGTGTGTGCACACCGCCTCTGGACCCCGCTGCTTCTGCCCCAGTCCCCGGGACTGCGAGTCAGGCTGTGCCAGTAGCCCCTGCCAGCACGGGGGCAGCTGCCACCCTCAGCGCCAGCCTCCTTATTACTCCTGCCAGTGTGCCCCACCATTCTCGGGTAGCCGCTGTGAACTCTACACGGCACCCCCCAGCACCCCTCCTGCCACCTGTCTGAGCCAGTATTGTGCCGACAAAGCTCGGGATGGCGTCTGTGATGAGGCCTGCAACAGCCATGCCTGCCAGTGGGATGGGGGTGACTGTTCTCTCACCATGGAGAACCCCTGGGCCAACTGCTCCTCCCCACTTCCCTGCTGGGATTATATCAACAACCAGTGTGATGAGCTGTGCAACACGGTCGAGTGCCTGTTTGACAACTTTGAATGCCAGGGGAACAGCAAGACATGCAAGTATGACAAATACTGTGCAGACCACTTCAAAGACAACCACTGTGACCAGGGGTGCAACAGTGAGGAGTGTGGTTGGGATGGGCTGGACTGTGCTGCTGACCAACCTGAGAACCTGGCAGAAGGTACCCTGGTTATTGTGGTATTGATGCCACCTGAACAACTGCTCCAGGATGCTCGCAGCTTCTTGCGGGCACTGGGTACCCTGCTCCACACCAACCTGCGCATTAAGCGGGACTCCCAGGGGGAACTCATGGTGTACCCCTATTATGGTGAGAAGTCAGCTGCTATGAAGAAACAGAGGATGACACGCAGATCCCTTCCTGGTGAACAAGAACAGGAGGTGGCTGGCTCTAAAGTCTTTCTGGAAATTGACAACCGCCAGTGTGTTCAAGACTCAGACCACTGCTTCAAGAACACGGATGCAGCAGCAGCTCTCCTGGCCTCTCACGCCATACAGGGGACCCTGTCATACCCTCTTGTGTCTGTCGTCAGTGAATCCCTGACTCCAGAACGCACTCAGCTCCTCTATCTCCTTGCTGTTGCTGTTGTCATCATTCTGTTTATTATTCTGCTGGGGGTAATCATGGCAAAACGAAAGCGTAAGCATGGCTCTCTCTGGCTGCCTGAAGGTTTCACTCTTCGCCGAGATGCAAGCAATCACAAGCGTCGTGAGCCAGTGGGACAGGATGCTGTGGGGCTGAAAAATCTCTCAGTGCAAGTCTCAGAAGCTAACCTAATTGGTACTGGAACAAGTGAACACTGGGTCGATGATGAAGGGCCCCAGCCAAAGAAAGTAAAGGCTGAAGATGAGGCCTTACTCTCAGAAGAAGATGACCCCATTGATCGACGGCCATGGACACAGCAGCACCTTGAAGCTGCAGACATCCGTAGGACACCATCGCTGGCTCTCACCCCTCCTCAGGCAGAGCAGGAGGTGGATGTGTTAGATGTGAATGTCCGTGGCCCAGATGGCTGCACCCCATTGATGTTGGCTTCTCTCCGAGGAGGCAGCTCAGATTTGAGTGATGAAGATGAAGATGCAGAGGACTCTTCTGCTAACATCATCACAGACTTGGTCTACCAGGGTGCCAGCCTCCAGGCCCAGACAGACCGGACTGGTGAGATGGCCCTGCACCTTGCAGCCCGCTACTCACGGGCTGATGCTGCCAAGCGTCTCCTGGATGCAGGTGCAGATGCCAATGCCCAGGACAACATGGGCCGCTGTCCACTCCATGCTGCAGTGGCAGCTGATGCCCAAGGTGTCTTCCAGATTCTGATTCGCAACCGAGTAACTGATCTAGATGCCAGGATGAATGATGGTACTACACCCCTGATCCTGGCTGCCCGCCTGGCTGTGGAGGGAATGGTGGCAGAACTGATCAACTGCCAAGCGGATGTGAATGCAGTGGATGACCATGGAAAATCTGCTCTTCACTGGGCAGCTGCTGTCAATAATGTGGAGGCAACTCTTTTGTTGTTGAAAAATGGGGCCAACCGAGACATGCAGGACAACAAGGAAGAGACACCTCTGTTTCTTGCTGCCCGGGAGGGGAGCTATGAAGCAGCCAAGATCCTGTTAGACCATTTTGCCAATCGAGACATCACAGACCATATGGATCGTCTTCCCCGGGATGTGGCTCGGGATCGCATGCACCATGACATTGTGCGCCTTCTGGATGAATACAATGTGACCCCAAGCCCTCCAGGCACCGTGTTGACTTCTGCTCTCTCACCTGTCATCTGTGGGCCCAACAGATCTTTCCTCAGCCTGAAGCACACCCCAATGGGCAAGAAGTCTAGACGGCCCAGTGCCAAGAGTACCATGCCTACTAGCCTCCCTAACCTTGCCAAGGAGGCAAAGGATGCCAAGGGTAGTAGGAGGAAGAAGTCTCTGAGTGAGAAGGTCCAACTGTCTGAGAGTTCAGTAACTTTATCCCCTGTTGATTCCCTAGAATCTCCTCACACGTATGTTTCCGACACCACATCCTCTCCAATGATTACATCCCCTGGGATCTTACAGGCCTCACCCAACCCTATGTTGGCCACTGCCGCCCCTCCTGCCCCAGTCCATGCCCAGCATGCACTATCTTTTTCTAACCTTCATGAAATGCAGCCTTTGGCACATGGGGCCAGCACTGTGCTTCCCTCAGTGAGCCAGTTGCTATCCCACCACCACATTGTGTCTCCAGGCAGTGGCAGTGCTGGAAGCTTGAGTAGGCTCCATCCAGTCCCAGTCCCAGCAGATTGGATGAACCGCATGGAGGTGAATGAGACCCAGTACAATGAGATGTTTGGTATGGTCCTGGCTCCAGCTGAGGGCACCCATCCTGGCATAGCTCCCCAGAGCAGGCCACCTGAAGGGAAGCACATAACCACCCCTCGGGAGCCCTTGCCCCCCATTGTGACTTTCCAGCTCATCCCTAAAGGCAGTATTGCCCAACCAGCGGGGGCTCCCCAGCCTCAGTCCACCTGCCCTCCAGCTGTTGCGGGCCCCCTGCCCACCATGTACCAGATTCCAGAAATGGCCCGTTTGCCCAGTGTGGCTTTCCCCACTGCCATGATGCCCCAGCAGGACGGGCAGGTAGCTCAGACCATTCTCCCAGCCTATCATCCTTTCCCAGCCTCTGTGGGCAAGTACCCCACACCCCCTTCACAGCACAGTTATGCTTCCTCAAATGCTGCTGAGCGAACACCCAGTCACAGTGGTCACCTCCAGGGTGAGCATCCCTACCTGACACCATCCCCAGAGTCTCCTGACCAGTGGTCAAGTTCATCACCCCACTCTGCTTCTGACTGGTCAGATGTGACCACCAGCCCTACCCCTGGGGGTGCTGGAGGAGGTCAGCGGGGACCTGGGACACACATGTCTGAGCCACCACACAACAACATGCAGGTTTATGCGTGAGAGAGTCCACCTCCAGTGTAGAGACATAACTGACTTTTGTAAATGCTGCTGAGGAACAAATGAAGGTCATCCGGGAGAGAAATGAAGAAATCTCTGGAGCCAGCTTCTAGAGGTAGGAAAGAGAAGATGTTCTTATTCAGATAATGCAAGAGAAGCAATTCGTCAGTTTCACTGGGTATCTGCAAGGCTTATTGATTATTCTAATCTAATAAGACAAGTTTGTGGAAATGCAAGATGAATACAAGCCTTGGGTCCATGTTTACTCTCTTCTATTTGGAGAATAAGATGGATGCTTATTGAAGCCCAGACATTCTTGCAGCTTGGACTGCATTTTAAGCCCTGCAGGCTTCTGCCATATCCATGAGAAGATTCTACACTAGCGTCCTGTTGGGAATTATGCCCTGGAATTCTGCCTGAATTGACCTACGCATCTCCTCCTCCTTGGACATTCTTTTGTCTTCATTTGGTGCTTTTGGTTTTGCACCTCTCCGTGATTGTAGCCCTACCAGCATGTTATAGGGCAAGACCTTTGTGCTTTTGATCATTCTGGCCCATGAAAGCAACTTTGGTCTCCTTTCCCCTCCTGTCTTCCCGGTATCCCTTGGAGTCTCACAAGGTTTACTTTGGTATGGTTCTCAGCACAAACCTTTCAAGTATGTTGTTTCTTTGGAAAATGGACATACTGTATTGTGTTCTCCTGCATATATCATTCCTGGAGAGAGAAGGGGAGAAGAATACTTTTCTTCAACAAATTTTGGGGGCAGGAGATCCCTTCAAGAGGCTGCACCTTAATTTTTCTTGTCTGTGTGCAGGTCTTCATATAAACTTTACCAGGAAGAAGGGTGTGAGTTTGTTGTTTTTCTGTGTATGGGCCTGGTCAGTGTAAAGTTTTATCCTTGATAGTCTAGTTACTATGACCCTCCCCACTTTTTTAAAACCAGAAAAAGGTTTGGAATGTTGGAATGACCAAGAGACAAGTTAACTCGTGCAAGAGCCAGTTACCCACCCACAGGTCCCCCTACTTCCTGCCAAGCATTCCATTGACTGCCTGTATGGAACACATTTGTCCCAGATCTGAGCATTCTAGGCCTGTTTCACTCACTCACCCAGCATATGAAACTAGTCTTAACTGTTGAGCCTTTCCTTTCATATCCACAGAAGACACTGTCTCAAATGTTGTACCCTTGCCATTTAGGACTGAACTTTCCTTAGCCCAAGGGACCCAGTGACAGTTGTCTTCCGTTTGTCAGATGATCAGTCTCTACTGATTATCTTGCTGCTTAAAGGCCTGCTCACCAATCTTTCTTTCACACCGTGTGGTCCGTGTTACTGGTATACCCAGTATGTTCTCACTGAAGACATGGACTTTATATGTTCAAGTGCAGGAATTGGAAAGTTGGACTTGTTTTCTATGATCCAAAACAGCCCTATAAGAAGGTTGGAAAAGGAGGAACTATATAGCAGCCTTTGCTATTTTCTGCTACCATTTCTTTTCCTCTGAAGCGGCCATGACATTCCCTTTGGCAACTAACGTAGAAACTCAACAGAACATTTTCCTTTCCTAGAGTCACCTTTTAGATGATAATGGACAACTATAGACTTGCTCATTGTTCAGACTGATTGCCCCTCACCTGAATCCACTCTCTGTATTCATGCTCTTGGCAATTTCTTTGACTTTCTTTTAAGGGCAGAAGCATTTTAGTTAATTGTAGATAAAGAATAGTTTTCTTCCTCTTCTCCTTGGGCCAGTTAATAATTGGTCCATGGCTACACTGCAACTTCCGTCCAGTGCTGTGATGCCCATGACACCTGCAAAATAAGTTCTGCCTGGGCATTTTGTAGATATTAACAGGTGAATTCCCGACTCTTTTGGTTTGAATGACAGTTCTCATTCCTTCTATGGCTGCAAGTATGCATCAGTGCTTCCCACTTACCTGATTTGTCTGTCGGTGGCCCCATATGGAAACCCTGCGTGTCTGTTGGCATAATAGTTTACAAATGGTTTTTTCAGTCCTATCCAAATTTATTGAACCAACAAAAATAATTACTTCTGCCCTGAGATAAGCAGATTAAGTTTGTTCATTCTCTGCTTTATTCTCTCCATGTGGCAACATTCTGTCAGCCTCTTTCATAGTGTGCAAACATTTTATCATTCTAAATGGTGACTCTCTGCCCTTGGACCCATTTATTATTCACAGATGGGGAGAACCTATCTGCATGGACCTCTGTGGACCACAGCGTACCTGCCCCTTTCTGCCCTCCTGCTCCAGCCCCACTTCTGAAAGTATCAGCTACTGATCCAGCCACTGGATATTTTATATCCTCCCTTTTCCTTAAGCACAATGTCAGACCAAATTGCTTGTTTCTTTTTCTTGGACTACTTTAATTTGGATCCTTTGGGTTTGGAGAAAGGGAATGTGAAAGCTGTCATTACAGACAACAGGTTTCAGTGATGAGGAGGACAACACTGCCTTTCAAACTTTTTACTGATCTCTTAGATTTTAAGAACTCTTGAATTGTGTGGTATCTAATAAAAGGGAAGGTAAGATGGATAATCACTTTCTCATTTGGGTTCTGAATTGGAGACTCAGTTTTTATGAGACACATCTTTTATGCCATGTATAGATCCTCCCCTGCTATTTTTGGTTTATTTTTATTGTTATAAATGCTTTCTTTCTTTGACTCCTCTTCTGCCTGCCTTTGGGGATAGGTTTTTTTGTTTGTTTATTTGCTTCCTCTGTTTTGTTTTAAGCATCATTTTCTTATGTGAGGTGGGGAAGGGAAAGGTATGAGGGAAAGAGAGTCTGAGAATTAAAATATTTTAGTATAAGCAATTGGCTGTGATGCTCAAATCCATTGCATCCTCTTATTGAATTTGCCAATTTGTAATTTTTGCATAATAAAGAACCAAAGGTGTAATGTTTTGTTGAGAGGTGGTTTAGGGATTTTGGCCCTAACCAATACATTGAATGTATGATGACTATTTGGGAGGACACATTTATGTACCCAGAGGCCCCCACTAATAAGTGGTACTATGGTTACTTCCTTGTGTACATTTCTCTTAAAAGTGATATTATATCTGTTTGTATGAGAAACCCAGTAACCAATAAAATGACCGCATATTCCTGACTAAACGTAGTAAGGAAAATGCACACTTTGTTTTTACTTTTCCGTTTCATTCTAAAGGTAGTTAAGATGAAATTTATATGAAAGCATTTTTATCACAAAATAAAAAAGGTTTGCCAAGCTCAGTGGTGTTGTATTTTTTATTTTCCAATACTGCATCCATGGCCTGGCAGTGTTACCTCATGATGTCATAATTTGCTGAGAGAGCAAATTTTCTTTTCTTTCTGAATCCCACAAAGCCTAGCACCAAACTTCTTTTTTTCTTCCTTTAATTAGATCATAAATAAATGATCCTGGGGAAAAAGCATCTGTCAAATAGGAAACATCACAAAACTGAGCACTCTTCTGTGCACTAGCCATAGCTGGTGACAAACAGATGGTTGCTCAGGGACAAGGTGCCTTCCAATGGAAATGCGAAGTAGTTGCTATAGCAAGAATTGGGAACTGGGATATAAGTCATAATATTAATTATGCTGTTATGTAAATGATTGGTTTGTAACATTCCTTAAGTGAAATTTGTGTAGAACTTAATATACAGGATTATAAAATAATATTTTGTGTATAAATTTGTTATAAGTTCACATTCATACATTTATTTATAAAGTCAGTGAGATATTTGAACATGAA SEQ ID NO:703 NOTCH2之全mRNA多核苷酸序列 AGGCTGCTTCGTTGCACACCCGAGAAAGTTTCAGCCAAACTTCGGGCGGCGGCTGAGGCGGCGGCCGAGGAGCGGCGGACTCGGGGCGCGGGGAGTCGAGGCATTTGCGCCTGGGCTTCGGAGCGTAGCGCCAGGGCCTGAGCCTTTGAAGCAGGAGGAGGGGAGGAGAGAGTGGGGCTCCTCTATCGGGACCCCCTCCCCATGTGGATCTGCCCAGGCGGCGGCGGCGGCGGCGGAGGAGGAGGCGACCGAGAAGATGCCCGCCCTGCGCCCCGCTCTGCTGTGGGCGCTGCTGGCGCTCTGGCTGTGCTGCGCGGCCCCCGCGCATGCATTGCAGTGTCGAGATGGCTATGAACCCTGTGTAAATGAAGGAATGTGTGTTACCTACCACAATGGCACAGGATACTGCAAATGTCCAGAAGGCTTCTTGGGGGAATATTGTCAACATCGAGACCCCTGTGAGAAGAACCGCTGCCAGAATGGTGGGACTTGTGTGGCCCAGGCCATGCTGGGGAAAGCCACGTGCCGATGTGCCTCAGGGTTTACAGGAGAGGACTGCCAGTACTCAACATCTCATCCATGCTTTGTGTCTCGACCCTGCCTGAATGGCGGCACATGCCATATGCTCAGCCGGGATACCTATGAGTGCACCTGTCAAGTCGGGTTTACAGGTAAGGAGTGCCAATGGACGGATGCCTGCCTGTCTCATCCCTGTGCAAATGGAAGTACCTGTACCACTGTGGCCAACCAGTTCTCCTGCAAATGCCTCACAGGCTTCACAGGGCAGAAATGTGAGACTGATGTCAATGAGTGTGACATTCCAGGACACTGCCAGCATGGTGGCACCTGCCTCAACCTGCCTGGTTCCTACCAGTGCCAGTGCCCTCAGGGCTTCACAGGCCAGTACTGTGACAGCCTGTATGTGCCCTGTGCACCCTCACCTTGTGTCAATGGAGGCACCTGTCGGCAGACTGGTGACTTCACTTTTGAGTGCAACTGCCTTCCAGGTTTTGAAGGGAGCACCTGTGAGAGGAATATTGATGACTGCCCTAACCACAGGTGTCAGAATGGAGGGGTTTGTGTGGATGGGGTCAACACTTACAACTGCCGCTGTCCCCCACAATGGACAGGACAGTTCTGCACAGAGGATGTGGATGAATGCCTGCTGCAGCCCAATGCCTGTCAAAATGGGGGCACCTGTGCCAACCGCAATGGAGGCTATGGCTGTGTATGTGTCAACGGCTGGAGTGGAGATGACTGCAGTGAGAACATTGATGATTGTGCCTTCGCCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAGGTCTCCTGTGTCATCTGGATGATGCATGCATCAGCAATCCTTGCCACAAGGGGGCACTGTGTGACACCAACCCCCTAAATGGGCAATATATTTGCACCTGCCCACAAGGCTACAAAGGGGCTGACTGCACAGAAGATGTGGATGAATGTGCCATGGCCAATAGCAATCCTTGTGAGCATGCAGGAAAATGTGTGAACACGGATGGCGCCTTCCACTGTGAGTGTCTGAAGGGTTATGCAGGACCTCGTTGTGAGATGGACATCAATGAGTGCCATTCAGACCCCTGCCAGAATGATGCTACCTGTCTGGATAAGATTGGAGGCTTCACATGTCTGTGCATGCCAGGTTTCAAAGGTGTGCATTGTGAATTAGAAATAAATGAATGTCAGAGCAACCCTTGTGTGAACAATGGGCAGTGTGTGGATAAAGTCAATCGTTTCCAGTGCCTGTGTCCTCCTGGTTTCACTGGGCCAGTTTGCCAGATTGATATTGATGACTGTTCCAGTACTCCGTGTCTGAATGGGGCAAAGTGTATCGATCACCCGAATGGCTATGAATGCCAGTGTGCCACAGGTTTCACTGGTGTGTTGTGTGAGGAGAACATTGACAACTGTGACCCCGATCCTTGCCACCATGGTCAGTGTCAGGATGGTATTGATTCCTACACCTGCATCTGCAATCCCGGGTACATGGGCGCCATCTGCAGTGACCAGATTGATGAATGTTACAGCAGCCCTTGCCTGAACGATGGTCGCTGCATTGACCTGGTCAATGGCTACCAGTGCAACTGCCAGCCAGGCACGTCAGGGGTTAATTGTGAAATTAATTTTGATGACTGTGCAAGTAACCCTTGTATCCATGGAATCTGTATGGATGGCATTAATCGCTACAGTTGTGTCTGCTCACCAGGATTCACAGGGCAGAGATGTAACATTGACATTGATGAGTGTGCCTCCAATCCCTGTCGCAAGGGTGCAACATGTATCAACGGTGTGAATGGTTTCCGCTGTATATGCCCCGAGGGACCCCATCACCCCAGCTGCTACTCACAGGTGAACGAATGCCTGAGCAATCCCTGCATCCATGGAAACTGTACTGGAGGTCTCAGTGGATATAAGTGTCTCTGTGATGCAGGCTGGGTTGGCATCAACTGTGAAGTGGACAAAAATGAATGCCTTTCGAATCCATGCCAGAATGGAGGAACTTGTGACAATCTGGTGAATGGATACAGGTGTACTTGCAAGAAGGGCTTTAAAGGCTATAACTGCCAGGTGAATATTGATGAATGTGCCTCAAATCCATGCCTGAACCAAGGAACCTGCTTTGATGACATAAGTGGCTACACTTGCCACTGTGTGCTGCCATACACAGGCAAGAATTGTCAGACAGTATTGGCTCCCTGTTCCCCAAACCCTTGTGAGAATGCTGCTGTTTGCAAAGAGTCACCAAATTTTGAGAGTTATACTTGCTTGTGTGCTCCTGGCTGGCAAGGTCAGCGGTGTACCATTGACATTGACGAGTGTATCTCCAAGCCCTGCATGAACCATGGTCTCTGCCATAACACCCAGGGCAGCTACATGTGTGAATGTCCACCAGGCTTCAGTGGTATGGACTGTGAGGAGGACATTGATGACTGCCTTGCCAATCCTTGCCAGAATGGAGGTTCCTGTATGGATGGAGTGAATACTTTCTCCTGCCTCTGCCTTCCGGGTTTCACTGGGGATAAGTGCCAGACAGACATGAATGAGTGTCTGAGTGAACCCTGTAAGAATGGAGGGACCTGCTCTGACTACGTCAACAGTTACACTTGCAAGTGCCAGGCAGGATTTGATGGAGTCCATTGTGAGAACAACATCAATGAGTGCACTGAGAGCTCCTGTTTCAATGGTGGCACATGTGTTGATGGGATTAACTCCTTCTCTTGCTTGTGCCCTGTGGGTTTCACTGGATCCTTCTGCCTCCATGAGATCAATGAATGCAGCTCTCATCCATGCCTGAATGAGGGAACGTGTGTTGATGGCCTGGGTACCTACCGCTGCAGCTGCCCCCTGGGCTACACTGGGAAAAACTGTCAGACCCTGGTGAATCTCTGCAGTCGGTCTCCATGTAAAAACAAAGGTACTTGCGTTCAGAAAAAAGCAGAGTCCCAGTGCCTATGTCCATCTGGATGGGCTGGTGCCTATTGTGACGTGCCCAATGTCTCTTGTGACATAGCAGCCTCCAGGAGAGGTGTGCTTGTTGAACACTTGTGCCAGCACTCAGGTGTCTGCATCAATGCTGGCAACACGCATTACTGTCAGTGCCCCCTGGGCTATACTGGGAGCTACTGTGAGGAGCAACTCGATGAGTGTGCGTCCAACCCCTGCCAGCACGGGGCAACATGCAGTGACTTCATTGGTGGATACAGATGCGAGTGTGTCCCAGGCTATCAGGGTGTCAACTGTGAGTATGAAGTGGATGAGTGCCAGAATCAGCCCTGCCAGAATGGAGGCACCTGTATTGACCTTGTGAACCATTTCAAGTGCTCTTGCCCACCAGGCACTCGGGGCCTACTCTGTGAAGAGAACATTGATGACTGTGCCCGGGGTCCCCATTGCCTTAATGGTGGTCAGTGCATGGATAGGATTGGAGGCTACAGTTGTCGCTGCTTGCCTGGCTTTGCTGGGGAGCGTTGTGAGGGAGACATCAACGAGTGCCTCTCCAACCCCTGCAGCTCTGAGGGCAGCCTGGACTGTATACAGCTCACCAATGACTACCTGTGTGTTTGCCGTAGTGCCTTTACTGGCCGGCACTGTGAAACCTTCGTCGATGTGTGTCCCCAGATGCCCTGCCTGAATGGAGGGACTTGTGCTGTGGCCAGTAACATGCCTGATGGTTTCATTTGCCGTTGTCCCCCGGGATTTTCCGGGGCAAGGTGCCAGAGCAGCTGTGGACAAGTGAAATGTAGGAAGGGGGAGCAGTGTGTGCACACCGCCTCTGGACCCCGCTGCTTCTGCCCCAGTCCCCGGGACTGCGAGTCAGGCTGTGCCAGTAGCCCCTGCCAGCACGGGGGCAGCTGCCACCCTCAGCGCCAGCCTCCTTATTACTCCTGCCAGTGTGCCCCACCATTCTCGGGTAGCCGCTGTGAACTCTACACGGCACCCCCCAGCACCCCTCCTGCCACCTGTCTGAGCCAGTATTGTGCCGACAAAGCTCGGGATGGCGTCTGTGATGAGGCCTGCAACAGCCATGCCTGCCAGTGGGATGGGGGTGACTGTTCTCTCACCATGGAGAACCCCTGGGCCAACTGCTCCTCCCCACTTCCCTGCTGGGATTATATCAACAACCAGTGTGATGAGCTGTGCAACACGGTCGAGTGCCTGTTTGACAACTTTGAATGCCAGGGGAACAGCAAGACATGCAAGTATGACAAATACTGTGCAGACCACTTCAAAGACAACCACTGTGACCAGGGGTGCAACAGTGAGGAGTGTGGTTGGGATGGGCTGGACTGTGCTGCTGACCAACCTGAGAACCTGGCAGAAGGTACCCTGGTTATTGTGGTATTGATGCCACCTGAACAACTGCTCCAGGATGCTCGCAGCTTCTTGCGGGCACTGGGTACCCTGCTCCACACCAACCTGCGCATTAAGCGGGACTCCCAGGGGGAACTCATGGTGTACCCCTATTATGGTGAGAAGTCAGCTGCTATGAAGAAACAGAGGATGACACGCAGATCCCTTCCTGGTGAACAAGAACAGGAGGTGGCTGGCTCTAAAGTCTTTCTGGAAATTGACAACCGCCAGTGTGTTCAAGACTCAGACCACTGCTTCAAGAACACGGATGCAGCAGCAGCTCTCCTGGCCTCTCACGCCATACAGGGGACCCTGTCATACCCTCTTGTGTCTGTCGTCAGTGAATCCCTGACTCCAGAACGCACTCAGCTCCTCTATCTCCTTGCTGTTGCTGTTGTCATCATTCTGTTTATTATTCTGCTGGGGGTAATCATGGCAAAACGAAAGCGTAAGCATGGCTCTCTCTGGCTGCCTGAAGGTTTCACTCTTCGCCGAGATGCAAGCAATCACAAGCGTCGTGAGCCAGTGGGACAGGATGCTGTGGGGCTGAAAAATCTCTCAGTGCAAGTCTCAGAAGCTAACCTAATTGGTACTGGAACAAGTGAACACTGGGTCGATGATGAAGGGCCCCAGCCAAAGAAAGTAAAGGCTGAAGATGAGGCCTTACTCTCAGAAGAAGATGACCCCATTGATCGACGGCCATGGACACAGCAGCACCTTGAAGCTGCAGACATCCGTAGGACACCATCGCTGGCTCTCACCCCTCCTCAGGCAGAGCAGGAGGTGGATGTGTTAGATGTGAATGTCCGTGGCCCAGATGGCTGCACCCCATTGATGTTGGCTTCTCTCCGAGGAGGCAGCTCAGATTTGAGTGATGAAGATGAAGATGCAGAGGACTCTTCTGCTAACATCATCACAGACTTGGTCTACCAGGGTGCCAGCCTCCAGGCCCAGACAGACCGGACTGGTGAGATGGCCCTGCACCTTGCAGCCCGCTACTCACGGGCTGATGCTGCCAAGCGTCTCCTGGATGCAGGTGCAGATGCCAATGCCCAGGACAACATGGGCCGCTGTCCACTCCATGCTGCAGTGGCAGCTGATGCCCAAGGTGTCTTCCAGATTCTGATTCGCAACCGAGTAACTGATCTAGATGCCAGGATGAATGATGGTACTACACCCCTGATCCTGGCTGCCCGCCTGGCTGTGGAGGGAATGGTGGCAGAACTGATCAACTGCCAAGCGGATGTGAATGCAGTGGATGACCATGGAAAATCTGCTCTTCACTGGGCAGCTGCTGTCAATAATGTGGAGGCAACTCTTTTGTTGTTGAAAAATGGGGCCAACCGAGACATGCAGGACAACAAGGAAGAGACACCTCTGTTTCTTGCTGCCCGGGAGGGGAGCTATGAAGCAGCCAAGATCCTGTTAGACCATTTTGCCAATCGAGACATCACAGACCATATGGATCGTCTTCCCCGGGATGTGGCTCGGGATCGCATGCACCATGACATTGTGCGCCTTCTGGATGAATACAATGTGACCCCAAGCCCTCCAGGCACCGTGTTGACTTCTGCTCTCTCACCTGTCATCTGTGGGCCCAACAGATCTTTCCTCAGCCTGAAGCACACCCCAATGGGCAAGAAGTCTAGACGGCCCAGTGCCAAGAGTACCATGCCTACTAGCCTCCCTAACCTTGCCAAGGAGGCAAAGGATGCCAAGGGTAGTAGGAGGAAGAAGTCTCTGAGTGAGAAGGTCCAACTGTCTGAGAGTTCAGTAACTTTATCCCCTGTTGATTCCCTAGAATCTCCTCACACGTATGTTTCCGACACCACATCCTCTCCAATGATTACATCCCCTGGGATCTTACAGGCCTCACCCAACCCTATGTTGGCCACTGCCGCCCCTCCTGCCCCAGTCCATGCCCAGCATGCACTATCTTTTTCTAACCTTCATGAAATGCAGCCTTTGGCACATGGGGCCAGCACTGTGCTTCCCTCAGTGAGCCAGTTGCTATCCCACCACCACATTGTGTCTCCAGGCAGTGGCAGTGCTGGAAGCTTGAGTAGGCTCCATCCAGTCCCAGTCCCAGCAGATTGGATGAACCGCATGGAGGTGAATGAGACCCAGTACAATGAGATGTTTGGTATGGTCCTGGCTCCAGCTGAGGGCACCCATCCTGGCATAGCTCCCCAGAGCAGGCCACCTGAAGGGAAGCACATAACCACCCCTCGGGAGCCCTTGCCCCCCATTGTGACTTTCCAGCTCATCCCTAAAGGCAGTATTGCCCAACCAGCGGGGGCTCCCCAGCCTCAGTCCACCTGCCCTCCAGCTGTTGCGGGCCCCCTGCCCACCATGTACCAGATTCCAGAAATGGCCCGTTTGCCCAGTGTGGCTTTCCCCACTGCCATGATGCCCCAGCAGGACGGGCAGGTAGCTCAGACCATTCTCCCAGCCTATCATCCTTTCCCAGCCTCTGTGGGCAAGTACCCCACACCCCCTTCACAGCACAGTTATGCTTCCTCAAATGCTGCTGAGCGAACACCCAGTCACAGTGGTCACCTCCAGGGTGAGCATCCCTACCTGACACCATCCCCAGAGTCTCCTGACCAGTGGTCAAGTTCATCACCCCACTCTGCTTCTGACTGGTCAGATGTGACCACCAGCCCTACCCCTGGGGGTGCTGGAGGAGGTCAGCGGGGACCTGGGACACACATGTCTGAGCCACCACACAACAACATGCAGGTTTATGCGTGAGAGAGTCCACCTCCAGTGTAGAGACATAACTGACTTTTGTAAATGCTGCTGAGGAACAAATGAAGGTCATCCGGGAGAGAAATGAAGAAATCTCTGGAGCCAGCTTCTAGAGGTAGGAAAGAGAAGATGTTCTTATTCAGATAATGCAAGAGAAGCAATTCGTCAGTTTCACTGGGTATCTGCAAGGCTTATTGATTATTCTAATCTAATAAGACAAGTTTGTGGAAATGCAAGATGAATACAAGCCTTGGGTCCATGTTTACTCTCTTCTATTTGGAGAATAAGATGGATGCTTATTGAAGCCCAGACATTCTTGCAGCTTGGACTGCATTTTAAGCCCTGCAGGCTTCTGCCATATCCATGAGAAGATTCTACACTAGCGTCCTGTTGGGAATTATGCCCTGGAATTCTGCCTGAATTGACCTACGCATCTCCTCCTCCTTGGACATTCTTTTGTCTTCATTTGGTGCTTTTGGTTTTGCACCTCTCCGTGATTGTAGCCCTACCAGCATGTTATAGGGCAAGACCTTTGTGCTTTTGATCATTCTGGCCCATGAAAGCAACTTTGGTCTCCTTTCCCCTCCTGTCTTCCCGGTATCCCTTGGAGTCTCACAAGGTTTACTTTGGTATGGTTCTCAGCACAAACCTTTCAAGTATGTTGTTTCTTTGGAAAATGGACATACTGTATTGTGTTCTCCTGCATATATCATTCCTGGAGAGAGAAGGGGAGAAGAATACTTTTCTTCAACAAATTTTGGGGGCAGGAGATCCCTTCAAGAGGCTGCACCTTAATTTTTCTTGTCTGTGTGCAGGTCTTCATATAAACTTTACCAGGAAGAAGGGTGTGAGTTTGTTGTTTTTCTGTGTATGGGCCTGGTCAGTGTAAAGTTTTATCCTTGATAGTCTAGTTACTATGACCCTCCCCACTTTTTTAAAACCAGAAAAAGGTTTGGAATGTTGGAATGACCAAGAGACAAGTTAACTCGTGCAAGAGCCAGTTACCCACCCACAGGTCCCCCTACTTCCTGCCAAGCATTCCATTGACTGCCTGTATGGAACACATTTGTCCCAGATCTGAGCATTCTAGGCCTGTTTCACTCACTCACCCAGCATATGAAACTAGTCTTAACTGTTGAGCCTTTCCTTTCATATCCACAGAAGACACTGTCTCAAATGTTGTACCCTTGCCATTTAGGACTGAACTTTCCTTAGCCCAAGGGACCCAGTGACAGTTGTCTTCCGTTTGTCAGATGATCAGTCTCTACTGATTATCTTGCTGCTTAAAGGCCTGCTCACCAATCTTTCTTTCACACCGTGTGGTCCGTGTTACTGGTATACCCAGTATGTTCTCACTGAAGACATGGACTTTATATGTTCAAGTGCAGGAATTGGAAAGTTGGACTTGTTTTCTATGATCCAAAACAGCCCTATAAGAAGGTTGGAAAAGGAGGAACTATATAGCAGCCTTTGCTATTTTCTGCTACCATTTCTTTTCCTCTGAAGCGGCCATGACATTCCCTTTGGCAACTAACGTAGAAACTCAACAGAACATTTTCCTTTCCTAGAGTCACCTTTTAGATGATAATGGACAACTATAGACTTGCTCATTGTTCAGACTGATTGCCCCTCACCTGAATCCACTCTCTGTATTCATGCTCTTGGCAATTTCTTTGACTTTCTTTTAAGGGCAGAAGCATTTTAGTTAATTGTAGATAAAGAATAGTTTTCTTCCTCTTCTCCTTGGGCCAGTTAATAATTGGTCCATGGCTACACTGCAACTTCCGTCCAGTGCTGTGATGCCCATGACACCTGCAAAATAAGTTCTGCCTGGGCATTTTGTAGATATTAACAGGTGAATTCCCGACTCTTTTGGTTTGAATGACAGTTCTCATTCCTTCTATGGCTGCAAGTATGCATCAGTGCTTCCCACTTACCTGATTTGTCTGTCGGTGGCCCCATATGGAAACCCTGCGTGTCTGTTGGCATAATAGTTTACAAATGGTTTTTTCAGTCCTATCCAAATTTATTGAACCAACAAAAATAATTACTTCTGCCCTGAGATAAGCAGATTAAGTTTGTTCATTCTCTGCTTTATTCTCTCCATGTGGCAACATTCTGTCAGCCTCTTTCATAGTGTGCAAACATTTTATCATTCTAAATGGTGACTCTCTGCCCTTGGACCCATTTATTATTCACAGATGGGGAGAACCTATCTGCATGGACCTCTGTGGACCACAGCGTACCTGCCCCTTTCTGCCCTCCTGCTCCAGCCCCACTTCTGAAAGTATCAGCTACTGATCCAGCCACTGGATATTTTATATCCTCCCTTTTCCTTAAGCACAATGTCAGACCAAATTGCTTGTTTCTTTTTCTTGGACTACTTTAATTTGGATCCTTTGGGTTTGGAGAAAGGGAATGTGAAAGCTGTCATTACAGACAACAGGTTTCAGTGATGAGGAGGACAACACTGCCTTTCAAACTTTTTACTGATCTCTTAGATTTTAAGAACTCTTGAATTGTGTGGTATCTAATAAAAGGGAAGGTAAGATGGATAATCACTTTCTCATTTGGGTTCTGAATTGGAGACTCAGTTTTTATGAGACACATCTTTTATGCCATGTATAGATCCTCCCCTGCTATTTTTGGTTTATTTTTATTGTTATAAATGCTTTCTTTCTTTGACTCCTCTTCTGCCTGCCTTTGGGGATAGGTTTTTTTGTTTGTTTATTTGCTTCCTCTGTTTTGTTTTAAGCATCATTTTCTTATGTGAGGTGGGGAAGGGAAAGGTATGAGGGAAAGAGAGTCTGAGAATTAAAATATTTTAGTATAAGCAATTGGCTGTGATGCTCAAATCCATTGCATCCTCTTATTGAATTTGCCAATTTGTAATTTTTGCATAATAAAGAACCAAAGGTGTAATGTTTTGTTGAGAGGTGGTTTAGGGATTTTGGCCCTAACCAATACATTGAATGTATGATGACTATTTGGGAGGACACATTTATGTACCCAGAGGCCCCCACTAATAAGTGGTACTATGGTTACTTCCTTGTGTACATTTCTCTTAAAAGTGATATTATATCTGTTTGTATGAGAAACCCAGTAACCAATAAAATGACCGCATATTCCTGACTAAACGTAGTAAGGAAAATGCACACTTTGTTTTTACTTTTCCGTTTCATTCTAAAGGTAGTTAAGATGAAATTTATATGAAAGCATTTTTATCACAAAATAAAAAAGGTTTGCCAAGCTCAGTGGTGTTGTATTTTTTATTTTCCAATACTGCATCCATGGCCTGGCAGTGTTACCTCATGATGTCATAATTTGCTGAGAGAGCAAATTTTCTTTTCTTTCTGAATCCCACAAAGCCTAGCACCAAACTTCTTTTTTTCTTCCTTTAATTAGATCATAAATAAATGATCCTGGGGAAAAAGCATCTGTCAAATAGGAAACATCACAAAACTGAGCACTCTTCTGTGCACTAGCCATAGCTGGTGACAAACAGATGGTTGCTCAGGGACAAGGTGCCTTCCAATGGAAATGCGAAGTAGTTGCTATAGCAAGAATTGGGAACTGGGATATAAGTCATAATATTAATTATGCTGTTATGTAAATGATTGGTTTGTAACATTCCTTAAGTGAAATTTGTGTAGAACTTAATATACAGGATTATAAAATAATATTTTGTGTATAAATTTGTTATAAGTTCACATTCATACATTTATTTATAAAGTCAGTGAGATATTTGAACATGAA SEQ ID NO:704 NOTCH2之外顯子1的轉譯多肽序列 MPALRPALLWALLALWLCCAAPAH SEQ ID NO:705 NOTCH2之外顯子2的轉譯多肽序列 LQCRDGYEPCVNEGMCVTYHNGTGYC SEQ ID NO:706 NOTCH2之外顯子3的轉譯多肽序列 CPEGFLGEYCQHRDPCEKNRCQNGGTCVAQAMLGKATCRCASGFTGEDCQYSTSHPCFVSRPCLNGGTCHMLSRDTYECTCQVGFT SEQ ID NO:707 NOTCH2之外顯子4的轉譯多肽序列 KECQWTDACLSHPCANGSTCTTVANQFSCKCLTGFTGQKCETDVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYCDSLYVPCAPSPCVNGGTCRQTGDFTFECNCLP SEQ ID NO:708 NOTCH2之外顯子5的轉譯多肽序列 FEGSTCERNIDDCPNHRCQNGGVCVDGVNTYNCRCPPQWT SEQ ID NO:709 NOTCH2之外顯子6的轉譯多肽序列 QFCTEDVDECLLQPNACQNGGTCANRNGGYGCVCVNGWSGDDCSENIDDCAFASCTPGSTCIDRVASFSCMCPEGKA SEQ ID NO:710 NOTCH2之外顯子7的轉譯多肽序列 LLCHLDDACISNPCHKGALCDTNPLNGQYICTCPQGYKGADCTEDVDECAM SEQ ID NO:711 NOTCH2之外顯子8-34的轉譯多肽序列 NSNPCEHAGKCVNTDGAFHCECLKGYAGPRCEMDINECHSDPCQNDATCLDKIGGFTCLCMPGFKGVHCELEINECQSNPCVNNGQCVDKVNRFQCLCPPGFTGPVCQIDIDDCSSTPCLNGAKCIDHPNGYECQCATGFTGVLCEENIDNCDPDPCHHGQCQDGIDSYTCICNPGYMGAICSDQIDECYSSPCLNDGRCIDLVNGYQCNCQPGTSGVNCEINFDDCASNPCIHGICMDGINRYSCVCSPGFTGQRCNIDIDECASNPCRKGATCINGVNGFRCICPEGPHHPSCYSQVNECLSNPCIHGNCTGGLSGYKCLCDAGWVGINCEVDKNECLSNPCQNGGTCDNLVNGYRCTCKKGFKGYNCQVNIDECASNPCLNQGTCFDDISGYTCHCVLPYTGKNCQTVLAPCSPNPCENAAVCKESPNFESYTCLCAPGWQGQRCTIDIDECISKPCMNHGLCHNTQGSYMCECPPGFSGMDCEEDIDDCLANPCQNGGSCMDGVNTFSCLCLPGFTGDKCQTDMNECLSEPCKNGGTCSDYVNSYTCKCQAGFDGVHCENNINECTESSCFNGGTCVDGINSFSCLCPVGFTGSFCLHEINECSSHPCLNEGTCVDGLGTYRCSCPLGYTGKNCQTLVNLCSRSPCKNKGTCVQKKAESQCLCPSGWAGAYCDVPNVSCDIAASRRGVLVEHLCQHSGVCINAGNTHYCQCPLGYTGSYCEEQLDECASNPCQHGATCSDFIGGYRCECVPGYQGVNCEYEVDECQNQPCQNGGTCIDLVNHFKCSCPPGTRGLLCEENIDDCARGPHCLNGGQCMDRIGGYSCRCLPGFAGERCEGDINECLSNPCSSEGSLDCIQLTNDYLCVCRSAFTGRHCETFVDVCPQMPCLNGGTCAVASNMPDGFICRCPPGFSGARCQSSCGQVKCRKGEQCVHTASGPRCFCPSPRDCESGCASSPCQHGGSCHPQRQPPYYSCQCAPPFSGSRCELYTAPPSTPPATCLSQYCADKARDGVCDEACNSHACQWDGGDCSLTMENPWANCSSPLPCWDYINNQCDELCNTVECLFDNFECQGNSKTCKYDKYCADHFKDNHCDQGCNSEECGWDGLDCAADQPENLAEGTLVIVVLMPPEQLLQDARSFLRALGTLLHTNLRIKRDSQGELMVYPYYGEKSAAMKKQRMTRRSLPGEQEQEVAGSKVFLEIDNRQCVQDSDHCFKNTDAAAALLASHAIQGTLSYPLVSVVSESLTPERTQLLYLLAVAVVIILFIILLGVIMAKRKRKHGSLWLPEGFTLRRDASNHKRREPVGQDAVGLKNLSVQVSEANLIGTGTSEHWVDDEGPQPKKVKAEDEALLSEEDDPIDRRPWTQQHLEAADIRRTPSLALTPPQAEQEVDVLDVNVRGPDGCTPLMLASLRGGSSDLSDEDEDAEDSSANIITDLVYQGASLQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVTDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLLDEYNVTPSPPGTVLTSALSPVICGPNRSFLSLKHTPMGKKSRRPSAKSTMPTSLPNLAKEAKDAKGSRRKKSLSEKVQLSESSVTLSPVDSLESPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPVHAQHALSFSNLHEMQPLAHGASTVLPSVSQLLSHHHIVSPGSGSAGSLSRLHPVPVPADWMNRMEVNETQYNEMFGMVLAPAEGTHPGIAPQSRPPEGKHITTPREPLPPIVTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMARLPSVAFPTAMMPQQDGQVAQTILPAYHPFPASVGKYPTPPSQHSYASSNAAERTPSHSGHLQGEHPYLTPSPESPDQWSSSSPHSASDWSDVTTSPTPGGAGGGQRGPGTHMSEPPHNNMQVYA SEQ ID NO:712 NOTCH2之全蛋白質序列 MPALRPALLWALLALWLCCAAPAHALQCRDGYEPCVNEGMCVTYHNGTGYCKCPEGFLGEYCQHRDPCEKNRCQNGGTCVAQAMLGKATCRCASGFTGEDCQYSTSHPCFVSRPCLNGGTCHMLSRDTYECTCQVGFTGKECQWTDACLSHPCANGSTCTTVANQFSCKCLTGFTGQKCETDVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYCDSLYVPCAPSPCVNGGTCRQTGDFTFECNCLPGFEGSTCERNIDDCPNHRCQNGGVCVDGVNTYNCRCPPQWTGQFCTEDVDECLLQPNACQNGGTCANRNGGYGCVCVNGWSGDDCSENIDDCAFASCTPGSTCIDRVASFSCMCPEGKAGLLCHLDDACISNPCHKGALCDTNPLNGQYICTCPQGYKGADCTEDVDECAMANSNPCEHAGKCVNTDGAFHCECLKGYAGPRCEMDINECHSDPCQNDATCLDKIGGFTCLCMPGFKGVHCELEINECQSNPCVNNGQCVDKVNRFQCLCPPGFTGPVCQIDIDDCSSTPCLNGAKCIDHPNGYECQCATGFTGVLCEENIDNCDPDPCHHGQCQDGIDSYTCICNPGYMGAICSDQIDECYSSPCLNDGRCIDLVNGYQCNCQPGTSGVNCEINFDDCASNPCIHGICMDGINRYSCVCSPGFTGQRCNIDIDECASNPCRKGATCINGVNGFRCICPEGPHHPSCYSQVNECLSNPCIHGNCTGGLSGYKCLCDAGWVGINCEVDKNECLSNPCQNGGTCDNLVNGYRCTCKKGFKGYNCQVNIDECASNPCLNQGTCFDDISGYTCHCVLPYTGKNCQTVLAPCSPNPCENAAVCKESPNFESYTCLCAPGWQGQRCTIDIDECISKPCMNHGLCHNTQGSYMCECPPGFSGMDCEEDIDDCLANPCQNGGSCMDGVNTFSCLCLPGFTGDKCQTDMNECLSEPCKNGGTCSDYVNSYTCKCQAGFDGVHCENNINECTESSCFNGGTCVDGINSFSCLCPVGFTGSFCLHEINECSSHPCLNEGTCVDGLGTYRCSCPLGYTGKNCQTLVNLCSRSPCKNKGTCVQKKAESQCLCPSGWAGAYCDVPNVSCDIAASRRGVLVEHLCQHSGVCINAGNTHYCQCPLGYTGSYCEEQLDECASNPCQHGATCSDFIGGYRCECVPGYQGVNCEYEVDECQNQPCQNGGTCIDLVNHFKCSCPPGTRGLLCEENIDDCARGPHCLNGGQCMDRIGGYSCRCLPGFAGERCEGDINECLSNPCSSEGSLDCIQLTNDYLCVCRSAFTGRHCETFVDVCPQMPCLNGGTCAVASNMPDGFICRCPPGFSGARCQSSCGQVKCRKGEQCVHTASGPRCFCPSPRDCESGCASSPCQHGGSCHPQRQPPYYSCQCAPPFSGSRCELYTAPPSTPPATCLSQYCADKARDGVCDEACNSHACQWDGGDCSLTMENPWANCSSPLPCWDYINNQCDELCNTVECLFDNFECQGNSKTCKYDKYCADHFKDNHCDQGCNSEECGWDGLDCAADQPENLAEGTLVIVVLMPPEQLLQDARSFLRALGTLLHTNLRIKRDSQGELMVYPYYGEKSAAMKKQRMTRRSLPGEQEQEVAGSKVFLEIDNRQCVQDSDHCFKNTDAAAALLASHAIQGTLSYPLVSVVSESLTPERTQLLYLLAVAVVIILFIILLGVIMAKRKRKHGSLWLPEGFTLRRDASNHKRREPVGQDAVGLKNLSVQVSEANLIGTGTSEHWVDDEGPQPKKVKAEDEALLSEEDDPIDRRPWTQQHLEAADIRRTPSLALTPPQAEQEVDVLDVNVRGPDGCTPLMLASLRGGSSDLSDEDEDAEDSSANIITDLVYQGASLQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVTDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLLDEYNVTPSPPGTVLTSALSPVICGPNRSFLSLKHTPMGKKSRRPSAKSTMPTSLPNLAKEAKDAKGSRRKKSLSEKVQLSESSVTLSPVDSLESPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPVHAQHALSFSNLHEMQPLAHGASTVLPSVSQLLSHHHIVSPGSGSAGSLSRLHPVPVPADWMNRMEVNETQYNEMFGMVLAPAEGTHPGIAPQSRPPEGKHITTPREPLPPIVTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMARLPSVAFPTAMMPQQDGQVAQTILPAYHPFPASVGKYPTPPSQHSYASSNAAERTPSHSGHLQGEHPYLTPSPESPDQWSSSSPHSASDWSDVTTSPTPGGAGGGQRGPGTHMSEPPHNNMQVYA SEQ ID NO:713 NOTCH2之全部外顯子1-6按順序排列的多核苷酸序列 SEQ ID NO:714 NOTCH2之全部外顯子1-6按順序排列的轉譯多肽序列 CD74 序列資訊SEQ ID NO:715 來自CD74-NRG1融合物5’處之CD74外顯子2的序列 AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGC SEQ ID NO:716 CD74-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:717 CD74-NRG1多核苷酸序列 AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGCCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:718 CD74-NRG1多肽序列 GRLDKLTVTSQNLQLENLRMKLPKPLPPRLKEMKSQESAAGSKLVLRCE SEQ ID NO:719 CD74之外顯子1 ATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATCTACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAG SEQ ID NO:720 CD74之外顯子2 CAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGC SEQ ID NO:721 CD74之外顯子3 CTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATGGGAGCCCTGCCCCAGGGG SEQ ID NO:722 CD74之外顯子4 CCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAG SEQ ID NO:723 CD74之外顯子5 AATGCTGACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAG SEQ ID NO:724 CD74之外顯子6 GTCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAG SEQ ID NO:725 CD74之外顯子7 TACTGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGCCACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATAACTGCAGTG SEQ ID NO:726 CD74之外顯子8 AGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAG SEQ ID NO:727 CD74之外顯子9 TCCCCATGTGAGAGCAGCAGAGGCGGTCTTCAACATCCTGCCAGCCCCACACAGCTACAGCTTTCTTGCTCCCTTCAGCCCCCAGCCCCTCCCCCATCTCCCACCCTGTACCTCATCCCATGAGACCCTGGTGCCTGGCTCTTTCGTCACCCTTGGACAAGACAAACCAAGTCGGAACAGCAGATAACAATGCAGCAAGGCCCTGCTGCCCAATCTCCATCTGTCAACAGGGGCGTGAGGTCCCAGGAAGTGGCCAAAAGCTAGACAGATCCCCGTTCCTGACATCACAGCAGCCTCCAACACAAGGCTCCAAGACCTAGGCTCATGGACGAGATGGGAAGGCACAGGGAGAAGGGATAACCCTACACCCAGACCCCAGGCTGGACATGCTGACTGTCCTCTCCCCTCCAGCCTTTGGCCTTGGCTTTTCTAGCCTATTTACCTGCAGGCTGAGCCACTCTCTTCCCTTTCCCCAGCATCACTCCCCAAGGAAGAGCCAATGTTTTCCACCCATAATCCTTTCTGCCGACCCCTAGTTCCCTCTGCTCAGCCAAGCTTGTTATCAGCTTTCAGGGCCATGGTTCACATTAGAATAAAAGGTAGTAATTAGAA SEQ ID NO:728 CD74之全mRNA多核苷酸序列 ATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCACATCTACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCAGGGTCCCAGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATGGATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAGTACTGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGCCACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATAACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGTCCCCATGTGAGAGCAGCAGAGGCGGTCTTCAACATCCTGCCAGCCCCACACAGCTACAGCTTTCTTGCTCCCTTCAGCCCCCAGCCCCTCCCCCATCTCCCACCCTGTACCTCATCCCATGAGACCCTGGTGCCTGGCTCTTTCGTCACCCTTGGACAAGACAAACCAAGTCGGAACAGCAGATAACAATGCAGCAAGGCCCTGCTGCCCAATCTCCATCTGTCAACAGGGGCGTGAGGTCCCAGGAAGTGGCCAAAAGCTAGACAGATCCCCGTTCCTGACATCACAGCAGCCTCCAACACAAGGCTCCAAGACCTAGGCTCATGGACGAGATGGGAAGGCACAGGGAGAAGGGATAACCCTACACCCAGACCCCAGGCTGGACATGCTGACTGTCCTCTCCCCTCCAGCCTTTGGCCTTGGCTTTTCTAGCCTATTTACCTGCAGGCTGAGCCACTCTCTTCCCTTTCCCCAGCATCACTCCCCAAGGAAGAGCCAATGTTTTCCACCCATAATCCTTTCTGCCGACCCCTAGTTCCCTCTGCTCAGCCAAGCTTGTTATCAGCTTTCAGGGCCATGGTTCACATTAGAATAAAAGGTAGTAATTAGAA SEQ ID NO:729 CD74之外顯子1的轉譯多肽序列 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPE SEQ ID NO:730 CD74之外顯子2的轉譯多肽序列 KCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPK SEQ ID NO:731 CD74之外顯子3的轉譯多肽序列 PKPVSKMRMATPLLMQALPMGALPQG SEQ ID NO:732 CD74之外顯子4的轉譯多肽序列 PMQNATKYGNMTEDHVMHLLQ SEQ ID NO:733 CD74之外顯子5的轉譯多肽序列 NADPLKVYPPLKGSFPENLRHLKNTMETIDWK SEQ ID NO:734 CD74之外顯子6的轉譯多肽序列 VFESWMHHWLLFEMSRHSLEQKPTDAPPK SEQ ID NO:735 CD74之外顯子7的轉譯多肽序列 LTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHNCS SEQ ID NO:736 CD74之外顯子8的轉譯多肽序列 SLELEDPSSGLGVTKQDLGP SEQ ID NO:737 CD74之外顯子9的轉譯多肽序列 PM SEQ ID NO:738 CD74之全蛋白質序列 MHRRRSRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKVFESWMHHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHNCSESLELEDPSSGLGVTKQDLGPVPM SEQ ID NO:739 CD74之全部外顯子1-2按順序排列的多核苷酸序列 SEQ ID NO:740 CD74之全部外顯子1-2按順序排列的轉譯多肽序列 SDC4 序列資訊SEQ ID NO:741 來自SDC4-NRG1融合物5’處之SDC4外顯子2的序列 TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGG SEQ ID NO:742 SDC4-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:743 SDC4-NRG1多核苷酸序列 TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:744 SDC4-NRG1多肽序列 PDDEDVVGPGQESDDFELSGSGDLALPPRLKEMKSQESAAGSKLVLRCE SEQ ID NO:745 SDC4之外顯子1 ACTCGCCGCAGCCTGCGCGCCTTCTCCAGTCCGCGGTGCCATGGCCCCCGCCCGTCTGTTCGCGCTGCTGCTGTTCTTCGTAGGCGGAGTCGCCGAGTCG SEQ ID NO:746 SDC4之外顯子2 ATCCGAGAGACTGAGGTCATCGACCCCCAGGACCTCCTAGAAGGCCGATACTTCTCCGGAGCCCTACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGG SEQ ID NO:747 SDC4之外顯子3 ATGACTTGGAAGACTCCATGATCGGCCCTGAAGTTGTCCATCCCTTG SEQ ID NO:748 SDC4之外顯子4 GTGCCTCTAGATAACCATATCCCTGAGAGGGCAGGGTCTGGGAGCCAAGTCCCCACCGAACCCAAGAAACTAGAGGAGAATGAGGTTATCCCCAAGAGAATCTCACCCGTTGAAGAGAGTGAGGATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAG SEQ ID NO:749 SDC4之外顯子5 CTCTGATTGTGGGTGGCATCGTGGGCATCCTCTTTGCCGTCTTCCTGATCCTACTGCTCATGTACCGTATGAAGAAGAAGGATGAAGGCAGCTATGACCTGGGCAAGAAACCCATCTACAAGAAAGCCCCCACCAATGAGTTCTACGCGTGAAGCTTGCTTGTGGGCACTGGCTTGGACTTTAGCGGGGAGGGAAGCCAGGGGATTTTGAAGGGTGGACATTAGGGTAGGGTGAGGTCAACCTAATACTGACTTGTCAGTATCTCCAGCTCTGATTACCTTTGAAGTGTTCAGAAGAGACATTGTCTTCTACTGTTCTGCCAGGTTCTTCTTGAGCTTTGGGCCTCAGTTGCCCTGGCAGAAAAATGGATTCAACTTGGCCTTTCTGAAGGCAAGACTGGGATTGGATCACTTCTTAAACTTCCAGTTAAGAATCTAGGTCCGCCCTCAAGCCCATACTGACCATGCCTCATCCAGAGCTCCTCTGAAGCCAGGGGGCTAACGGATGTTGTGTGGAGTCCTGGCTGGAGGTCCTCCCCCAGTGGCCTTCCTCCCTTCCTTTCACAGCCGGTCTCTCTGCCAGGAAATGGGGGAAGGAACTAGAACCACCTGCACCTTGAGATGTTTCTGTAAATGGGTACTTGTGATCACACTACGGGAATCTCTGTGGTATATACCTGGGGCCATTCTAGGCTCTTTCAAGTGACTTTTGGAAATCAACCTTTTTTATTTGGGGGGGAGGATGGGGAAAAGAGCTGAGAGTTTATGCTGAAATGGATTTATAGAATATTTGTAAATCTATTTTTAGTGTTTGTTCGTTTTTTTAACTGTTCATTCCTTTGTGCAGAGTGTATATCTCTGCCTGGGCAAGAGTGTGGAGGTGCCGAGGTGTCTTCATTCTCTCGCACATTTCCACAGCACCTGCTAAGTTTGTATTTAATGGTTTTTGTTTTTGTTTTTGTTTGTTTCTTGAAAATGAGAGAAGAGCCGGAGAGATGATTTTTATTAATTTTTTTTTTTTTTTTTTTTTTTTACTATTTATAGCTTTAGATAGGGCCTCCCTTCCCCTCTTCTTTCTTTGTTCTCTTTCATTAAACCCCTTCCCCAGTTTTTTTTTTATACTTTAAACCCCGCTCCTCATGGCCTTGGCCCTTTCTGAAGCTGCTTCCTCTTATAAAATAGCTTTTGCCGAAACATAGTTTTTTTTTAGCAGATCCCAAAATATAATGAAGGGGATGGTGGGATATTTGTGTCTGTGTTCTTATAATATATTATTATTCTTCCTTGGTTCTAGAAAAATAGATAAATATATTTTTTTCAGGAAATAGTGTGGTGTTTCCAGTTTGATGTTGCTGGGTGGTTGAGTGAGTGAATTTTCATGTGGCTGGGTGGGTTTTTGCCTTTTTCTCTTGCCCTGTTCCTGGTGCCTTCTGATGGGGCTGGAATAGTTGAGGTGGATGGTTCTACCCTTTCTGCCTTCTGTTTGGGACCCAGCTGGTGTTCTTTGGTTTGCTTTCTTCAGGCTCTAGGGCTGTGCTATCCAATACAGTAACCACATGCGGCTGTTTAAAGTTAAGCCAATTAAAATCACATAAGATTAAAAATTCCTTCCTCAGTTGCACTAACCACGTTTCTAGAGGCGTCACTGTATGTAGTTCATGGCTACTGTACTGACAGCGAGAGCATGTCCATCTGTTGGACAGCACTATTCTAGAGAACTAAACTGGCTTAACGAGTCACAGCCTCAGCTGTGCTGGGACGACCCTTGTCTCCCTGGGTAGGGGGGGGGGAATGGGGGAGGGCTGATGAGGCCCCAGCTGGGGCCTGTTGTCTGGGACCCTCCCTCTCCTGAGAGGGGAGGCCTGGTGGCTTAGCCTGGGCAGGTCGTGTCTCCTCCTGACCCCAGTGGCTGCGGTGAGGGGAACCACCCTCCCTTGCTGCACCAGTGGCCATTAGCTCCCGTCACCACTGCAACCCAGGGTCCCAGCTGGCTGGGTCCTCTTCTGCCCCCAGTGCCCTTCCCCTTGGGCTGTGTTGGAGTGAGCACCTCCTCTGTAGGCACCTCTCACACTGTTGTCTGTTACTGATTTTTTTTGATAAAAAGATAATAAAACCTGGTACTTTCTAAA SEQ ID NO:750 SDC4之全mRNA多核苷酸序列 ACTCGCCGCAGCCTGCGCGCCTTCTCCAGTCCGCGGTGCCATGGCCCCCGCCCGTCTGTTCGCGCTGCTGCTGTTCTTCGTAGGCGGAGTCGCCGAGTCGATCCGAGAGACTGAGGTCATCGACCCCCAGGACCTCCTAGAAGGCCGATACTTCTCCGGAGCCCTACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGGATGACTTGGAAGACTCCATGATCGGCCCTGAAGTTGTCCATCCCTTGGTGCCTCTAGATAACCATATCCCTGAGAGGGCAGGGTCTGGGAGCCAAGTCCCCACCGAACCCAAGAAACTAGAGGAGAATGAGGTTATCCCCAAGAGAATCTCACCCGTTGAAGAGAGTGAGGATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCTCTGATTGTGGGTGGCATCGTGGGCATCCTCTTTGCCGTCTTCCTGATCCTACTGCTCATGTACCGTATGAAGAAGAAGGATGAAGGCAGCTATGACCTGGGCAAGAAACCCATCTACAAGAAAGCCCCCACCAATGAGTTCTACGCGTGAAGCTTGCTTGTGGGCACTGGCTTGGACTTTAGCGGGGAGGGAAGCCAGGGGATTTTGAAGGGTGGACATTAGGGTAGGGTGAGGTCAACCTAATACTGACTTGTCAGTATCTCCAGCTCTGATTACCTTTGAAGTGTTCAGAAGAGACATTGTCTTCTACTGTTCTGCCAGGTTCTTCTTGAGCTTTGGGCCTCAGTTGCCCTGGCAGAAAAATGGATTCAACTTGGCCTTTCTGAAGGCAAGACTGGGATTGGATCACTTCTTAAACTTCCAGTTAAGAATCTAGGTCCGCCCTCAAGCCCATACTGACCATGCCTCATCCAGAGCTCCTCTGAAGCCAGGGGGCTAACGGATGTTGTGTGGAGTCCTGGCTGGAGGTCCTCCCCCAGTGGCCTTCCTCCCTTCCTTTCACAGCCGGTCTCTCTGCCAGGAAATGGGGGAAGGAACTAGAACCACCTGCACCTTGAGATGTTTCTGTAAATGGGTACTTGTGATCACACTACGGGAATCTCTGTGGTATATACCTGGGGCCATTCTAGGCTCTTTCAAGTGACTTTTGGAAATCAACCTTTTTTATTTGGGGGGGAGGATGGGGAAAAGAGCTGAGAGTTTATGCTGAAATGGATTTATAGAATATTTGTAAATCTATTTTTAGTGTTTGTTCGTTTTTTTAACTGTTCATTCCTTTGTGCAGAGTGTATATCTCTGCCTGGGCAAGAGTGTGGAGGTGCCGAGGTGTCTTCATTCTCTCGCACATTTCCACAGCACCTGCTAAGTTTGTATTTAATGGTTTTTGTTTTTGTTTTTGTTTGTTTCTTGAAAATGAGAGAAGAGCCGGAGAGATGATTTTTATTAATTTTTTTTTTTTTTTTTTTTTTTTACTATTTATAGCTTTAGATAGGGCCTCCCTTCCCCTCTTCTTTCTTTGTTCTCTTTCATTAAACCCCTTCCCCAGTTTTTTTTTTATACTTTAAACCCCGCTCCTCATGGCCTTGGCCCTTTCTGAAGCTGCTTCCTCTTATAAAATAGCTTTTGCCGAAACATAGTTTTTTTTTAGCAGATCCCAAAATATAATGAAGGGGATGGTGGGATATTTGTGTCTGTGTTCTTATAATATATTATTATTCTTCCTTGGTTCTAGAAAAATAGATAAATATATTTTTTTCAGGAAATAGTGTGGTGTTTCCAGTTTGATGTTGCTGGGTGGTTGAGTGAGTGAATTTTCATGTGGCTGGGTGGGTTTTTGCCTTTTTCTCTTGCCCTGTTCCTGGTGCCTTCTGATGGGGCTGGAATAGTTGAGGTGGATGGTTCTACCCTTTCTGCCTTCTGTTTGGGACCCAGCTGGTGTTCTTTGGTTTGCTTTCTTCAGGCTCTAGGGCTGTGCTATCCAATACAGTAACCACATGCGGCTGTTTAAAGTTAAGCCAATTAAAATCACATAAGATTAAAAATTCCTTCCTCAGTTGCACTAACCACGTTTCTAGAGGCGTCACTGTATGTAGTTCATGGCTACTGTACTGACAGCGAGAGCATGTCCATCTGTTGGACAGCACTATTCTAGAGAACTAAACTGGCTTAACGAGTCACAGCCTCAGCTGTGCTGGGACGACCCTTGTCTCCCTGGGTAGGGGGGGGGGAATGGGGGAGGGCTGATGAGGCCCCAGCTGGGGCCTGTTGTCTGGGACCCTCCCTCTCCTGAGAGGGGAGGCCTGGTGGCTTAGCCTGGGCAGGTCGTGTCTCCTCCTGACCCCAGTGGCTGCGGTGAGGGGAACCACCCTCCCTTGCTGCACCAGTGGCCATTAGCTCCCGTCACCACTGCAACCCAGGGTCCCAGCTGGCTGGGTCCTCTTCTGCCCCCAGTGCCCTTCCCCTTGGGCTGTGTTGGAGTGAGCACCTCCTCTGTAGGCACCTCTCACACTGTTGTCTGTTACTGATTTTTTTTGATAAAAAGATAATAAAACCTGGTACTTTCTAAA SEQ ID NO:751 SDC4之外顯子1的轉譯多肽序列 MAPARLFALLLFFVGGVAES SEQ ID NO:752 SDC4之外顯子2的轉譯多肽序列 IRETEVIDPQDLLEGRYFSGALPDDEDVVGPGQESDDFELSGSGDL SEQ ID NO:753 SDC4之外顯子3的轉譯多肽序列 DLEDSMIGPEVVHPL SEQ ID NO:754 SDC4之外顯子4的轉譯多肽序列 VPLDNHIPERAGSGSQVPTEPKKLEENEVIPKRISPVEESEDVSNKVSMSSTVQGSNIFERTEVLA SEQ ID NO:755 SDC4之外顯子5的轉譯多肽序列 LIVGGIVGILFAVFLILLLMYRMKKKDEGSYDLGKKPIYKKAPTNEFYA SEQ ID NO:756 SDC4之全蛋白質序列 MAPARLFALLLFFVGGVAESIRETEVIDPQDLLEGRYFSGALPDDEDVVGPGQESDDFELSGSGDLDDLEDSMIGPEVVHPLVPLDNHIPERAGSGSQVPTEPKKLEENEVIPKRISPVEESEDVSNKVSMSSTVQGSNIFERTEVLAALIVGGIVGILFAVFLILLLMYRMKKKDEGSYDLGKKPIYKKAPTNEFYA SEQ ID NO:757 SDC4之全部外顯子1-2按順序排列的多核苷酸序列 SEQ ID NO:758 SDC4之全部外顯子1-2按順序排列的轉譯多肽序列 CD44 (2) 序列資訊SEQ ID NO:759 來自CD44-NRG1融合物5’處之CD44外顯子5的序列 TTTCTACTGTACACCCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCA SEQ ID NO:760 CD44-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:761 CD44-NRG1多核苷酸序列 TTTCTACTGTACACCCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:762 CD44-NRG1多肽序列 STVHPIPDEDSPWITDSTDRIPATTTSTSTTGTSHLVKCAEKEKTFCVN SLC4A4 序列資訊SEQ ID NO:763 來自SLC4A4-NRG1融合物5’處之SLC4A4外顯子14的序列 ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAG SEQ ID NO:764 SLC4A4-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:765 SCL4A4-NRG1多核苷酸序列 ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:766 SCL4A4-NRG1多肽序列 YPINSNFKVGYNTLFSCTCVPPDPATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:767 SLC4A4之外顯子1 GGCGGCGCGCCGGGCAGCGCTTCGGTGGCGGCGGCGGCCGCGGTGGCAGCGAAGGCGGCGGCGGCGGCGGCAGTGGCAGTGGCCGCTGCAGCCCCACACTCCGCCGCCAAACTGGAGGAGCGACGGAAGCCAGACCCCAGGAG SEQ ID NO:768 SLC4A4之外顯子2 GATGGAGGATGAAGCTGTCCTGGACAGAGGGGCTTCCTTCCTCAAGCATGTGTGTGATGAAGAAGAAGTAGAAG SEQ ID NO:769 SLC4A4之外顯子3 GCCACCATACCATTTACATCGGAGTCCATGTGCCGAAGAGTTACAGGAGAAGGAGACGTCACAAGAGAAAGACAGGGCACAAAGAAAAGAAGGAAAAGGAGAGAATCTCTGAGAACTACTCTGACAAATCAGATATTGAAAATGCTGATGAATCCAGCAGCAGCATCCTAAAACCTCTCA SEQ ID NO:770 SLC4A4之外顯子4 TCTCTCCTGCTGCAGAACGCATCCGATTCATCTTGGGAGAGGAGGATGACAGCCCAGCTCCCCCTCAGCTCTTCACGGAACTGGATGAGCTGCTGGCCGTGGATGGGCAGGAGATGGAGTGGAAGGAAACAGCCAG SEQ ID NO:771 SLC4A4之外顯子5 GTGGATCAAGTTTGAAGAAAAAGTGGAACAGGGTGGGGAAAGATGGAGCAAGCCCCATGTGGCCACATTGTCCCTTCATAGTTTATTTGAGCTGAGGACATGTATGGAGAAAGGATCCATCATGCTTGATCGGGAGGCTTCTTCTCTCCCACAGTTGGTGG SEQ ID NO:772 SLC4A4之外顯子6 AGATGATTGTTGACCATCAGATTGAGACAGGCCTATTGAAACCTGAACTTAAGGATAAGGTGACCTATACTTTGCTCCGGAAGCACCGGCATCAAACCAAGAAATCCAACCTTCGGTCCCTGGCTGACATTGGGAAGACAGTCTCCAGTGCAAGTAGGATGTTTACCAACCCTGATAATG SEQ ID NO:773 SLC4A4之外顯子7 GTAGCCCAGCCATGACCCATAGGAATCTGACTTCCTCCAGTCTGAATGACATTTCTGATAAACCGGAGAAGGACCAG SEQ ID NO:774 SLC4A4之外顯子8 CTGAAGAATAAGTTCATGAAAAAATTGCCACGTGATGCAGAAGCTTCCAACGTGCTTGTTGGGGAGGTTGACTTTTTGGATACTCCTTTCATTGCCTTTGTTAGGCTACAGCAGGCTGTCATGCTGGGTGCCCTGACTGAAGTTCCTGTGCCCACAAG SEQ ID NO:775 SLC4A4之外顯子9 GTTCTTGTTCATTCTCTTAGGTCCTAAGGGGAAAGCCAAGTCCTACCACGAGATTGGCAGAGCCATTGCCACCCTGATGTCTGATGAG SEQ ID NO:776 SLC4A4之外顯子10 GTGTTCCATGACATTGCTTATAAAGCAAAAGACAGGCACGACCTGATTGCTGGTATTGATGAGTTCCTAGATGAAGTCATCGTCCTTCCACCTGGGGAATGGGATCCAGCAATTAGGATAGAGCCTCCTAAGAGTCTTCCATCCTCTGACAAAAG SEQ ID NO:777 SLC4A4之外顯子11 AAAGAATATGTACTCAGGTGGAGAGAATGTTCAGATGAATGGGGATACGCCCCATGATGGAGGTCACGGAGGAGGAGGACATGGGGATTGTGAAGAATTGCAGCGAACTGGACG SEQ ID NO:778 SLC4A4之外顯子12 GTTCTGTGGTGGACTAATTAAAGACATAAAGAGGAAAGCGCCATTTTTTGCCAGTGATTTTTATGATGCTTTAAATATTCAAGCTCTTTCGGCAATTCTCTTCATTTATCTGGCAACTGTAACTAATGCTATCACTTTTGGAGGACTGCTTGGGGATGCCACTGACAACATGCAG SEQ ID NO:779 SLC4A4之外顯子13 GGCGTGTTGGAGAGTTTCCTGGGCACTGCTGTCTCTGGAGCCATCTTTTGCCTTTTTGCTGGTCAACCACTCACTATTCTGAGCAGCACCGGACCTGTCCTAGTTTTTGAGAGGCTTCTATTTAATTTCAGCAA SEQ ID NO:780 SLC4A4之外顯子14 GGACAATAATTTTGACTATTTGGAGTTTCGCCTTTGGATTGGCCTGTGGTCCGCCTTCCTATGTCTCATTTTGGTAGCCACTGATGCCAGCTTCTTGGTTCAATACTTCACACGTTTCACGGAGGAGGGCTTTTCCTCTCTGATTAGCTTCATCTTTATCTATGATGCTTTCAAGAAGATGATCAAGCTTGCAGATTACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAG SEQ ID NO:781 SLC4A4之外顯子15 CTAATATCTCAATATCTAATGACACCACACTGGCCCCAGAGTATTTGCCAACTATGTCTTCTACTGACATG SEQ ID NO:782 SLC4A4之外顯子16 TACCATAATACTACCTTTGACTGGGCATTTTTGTCGAAGAAGGAGTGTTCAAAATACGGAGGAAACCTCGTCGGGAACAACTGTAATTTTGTTCCTGATATCACACTCATGTCTTTTATCCTCTTCTTGGGAACCTACACCTCTTCCATGGCTCTGAAAAAATTCAAAACTAGTCCTTATTTTCCAACCACA SEQ ID NO:783 SLC4A4之外顯子17 GCAAGAAAACTGATCAGTGATTTTGCCATTATCTTGTCCATTCTCATCTTTTGTGTAATAGATGCCCTAGTAGGCGTGGACACCCCAAAACTAATTGTGCCAAGTGAGTTCAAG SEQ ID NO:784 SLC4A4之外顯子18 CCAACAAGTCCAAACCGAGGTTGGTTCGTTCCACCGTTTGGAGAAAACCCCTGGTGGGTGTGCCTTGCTGCTGCTATCCCGGCTTTGTTGGTCACTATACTGATTTTCATGGACCAACAAATTACAGCTGTGATTGTAAACAGGAAAGAACATAAACTCAAG SEQ ID NO:785 SLC4A4之外顯子19 AAAGGAGCAGGGTATCACTTGGATCTCTTTTGGGTGGCCATCCTCATGGTTATATGCTCCCTCATGGCTCTTCCGTGGTATGTAGCTGCTACGGTCATCTCCATTGCTCACATCGACAGTTTGAAGATGGAGACAGAGACTTCTGCACCTGGAGAACAACCAAAGTTTCTAGGAGTGAG SEQ ID NO:786 SLC4A4之外顯子20 GGAACAAAGAGTCACTGGAACCCTTGTGTTTATTCTGACTGGTCTGTCAGTCTTTATGGCTCCCATCTTGAAG SEQ ID NO:787 SLC4A4之外顯子21 GTTTATACCCATGCCTGTACTCTATGGTGTGTTCCTGTATATGGGAGTAGCATCCCTTAATGGTGTGCAG SEQ ID NO:788 SLC4A4之外顯子22 TTCATGGATCGTCTGAAGCTGCTTCTGATGCCTCTGAAGCATCAGCCTGACTTCATCTACCTGCGTCATGTTCCTCTGCGCAGAGTCCACCTGTTCACTTTCCTGCAGGTGTTGTGTCTGGCCCTGCTTTGGATCCTCAAGTCAACGGTGGCTGCTATCATTTTTCCAGTAATG SEQ ID NO:789 SLC4A4之外顯子23 ATCTTGGCACTTGTAGCTGTCAGAAAAGGCATGGACTACCTCTTCTCCCAGCACGACCTCAGCTTCCTGGATGATGTCATTCCAGAAAAGGACAAGAAAAAGAAGGAGGATGAGAAGAAAAAGAAAAAGAAGAAGGGAAGTCTGGACAGTGACAATGATGAT SEQ ID NO:790 SLC4A4之外顯子24 TCTGACTGCCCATACTCAGAAAAAGTTCCAAGTATTAAAATTCCAATGGACATCATGGAACAGCAACCTTTCCTAAGCGATAGCAAACCTTCTGACA SEQ ID NO:791 SLC4A4之外顯子25 GAGAAAGATCACCAACATTCCTTGAACGCCACACATCATGCTGATAAAATTCCTTTCCTTCAGTCACTCGGTATGCCAAG SEQ ID NO:792 SLC4A4之外顯子26 TCCTCCTAGAACTCCAGTAAAAGTTGTGCCTCAAATTAGAATAGAACTTGAACCTGAAGACAATGATTATTTCTGGAGGAGCAAGGGAACAGAAACTACATTGTAACCTGTTTGTCTTTCTTAAAACTGACATTTGTTGTTAATGTCATTTGTTTTTGTTTGGCTGTTTGTTTATTTTTTAACTTTTATTTCGTCTCAGTTTTTGGTCACAGGCCAAATAATACAGCGCTCTCTCTGCTTCTCTCTTGCATAGACACAATCAAGACAATAGTGCACCGTTCCTTAAAAACAGCATCTGAGGAATCCCCCTTTTGTTCTTAAACTTTCAGATGTGTCCTTTGATAACCAAATTCTGTCACTCAAGACACAGACACGCACAGACCCTGTCCTTTGCCTCTATTAAGCAGAGGATGGAAGTATTAAGGATTTTGTAACACCTTTTATGAAAATGTTGAAGGAACTTAAAACTTTAGCTTTGGAGCTGTGCTTACTGGCTTGTCTTTGTCTGGTAGAACAAACCTTGACCTCCAGACAGAGTCCCTTCTCACTTATAGAGCTCTCCAGGACTGGAAAAAGTGCTGCTATTTTAACTTGCTCTTGCTTGTAAATCCTAATCTTAGAGTTATCAAAAGAAGAAAAAACTGAAGGTACTTTACTCCCTATAGAGAAACCATTGCCATCATTGTAGCAAGTGCTGGAATGTCCCTTTTTTCCTATGCAACTTTTTTTAACCCTTTAATGAACTTATCTGTTGAGTACATTGAAGAATATTTTTCTTCCTAGATTTTGTTGTTTAAATTATGGGGCCTAACCTGCCACTTATTTTTTGTCAATTTTTAAAACTTTTTTTTAATTACTGTAAAGAAAATGAATTTTTTCCTGCAGCAGGAAACATAGTTTTGAGTAGTTCTACCTCTTATTTGTAGCTGCCAGGCTTTCTGTAAAAATTGTATTGTATATAATGTGATTTTTACACATACATACACACACAAATACACAATCTCTAGGGTAAGCCAGAAGGCAAGATCAGATTAAAAACACCATGTTTCTAAGCATCCATTTTTCCCTTTCTTTAAAAGAAACTTAACTGTTCTATGAAGGAGATTGAGGGAGAAGAGACAAACTCCTATGTCATGAGAATAACCGATGTTCTGATAATAGTAGCATCTAGGTACAGATGCTGGTTGTATTACCACGTCAATGTCCTATGCAGTATTGTTAGACATTTTCTCATTTTGAAATATTTGTGTGTTTGTGTATGTGCTCTGTGCCATGGCTGGTGTATATATGTGCAATGTTAGAAGGCAAAAGAGTGATGGTAGGCAGAGGGCAAAGTCATTGAATCTCTTATGCCAGTTTTCATAAAACCCAAACCACATATGAAAAAATCCATTAAGGGTCCAAGAAGTCTGTCCATATGAAAATGAGGGTAAATATAGTTTATTTCCCAGGTATCAGTCATTATAATTGATATAATAGCTCTAACATGCAATATAAAATTCATAGGAGTATTAATAGCCCATTTACACATCTATAAAATGTAATGGGATTGCAGAGCTGCAGAGTACAGTGTAACAGTACTCTCATGCAATTTTTTTCAGGATGCAAAGGCAATTATTCTTTGTAAGCGGGACATTTAGAATATATTTGTGTACATATTATATGTATGTATATTTCAAAGTACCACACTGAAAATTAGACATTTATTAACCAAATTTAACGTGGTATTTAAAGGTAATATTTTTAATATGATACATTACATATTGTGAATGTATACTAAAAAAACATTTTAAATGTTAAAATTATAATTTCAGATTCATATAACCACAACTGTGATATATCCTAACTATAACCAGTTGTTGAGGGGTATACTAGAAGCAGAATGAAACCACATTTTTTGGTTTGATAATATGCACTTATTGACTCCCACTCATTGTTATGTTAATTAAGTTATTATTCTGTCTCCTTGTAATTTTGATTACAAAAATTTTATTATCCTGAGTTAGCTGTTACTTTTACAGTACCTGATACTCCTAAAACTTTTAACTTATACAAATTAGTCAATAATGACCCCAATTTTTTCATTAAAATAATAGTGGTGAATTATATGTTATTGTGTTAAAACCTCACTTGCCAAATTCTGGCTTCACATTTGTATTTAGGGCTATCCTTAAAATGATGAGTCTATATTATCTAGCTTTCTATTACCCTAATATAAACTGGTATAAGAAGACTTTCCTTTTTTCTTTATGCATGGAAGCATCAATAAATTGTTTAAAAACCATGTATAGTAAATTCAGCTTAACCCGTGATCTTCTTAAGTTAAAGGTACTTTTGTTTTATAAAAGCTCTAGATAAAACTTTCTTTTCTGATCATGAATCAAGTATCTGTGGTTTCATGCCCCTCTCTATACCTTTCAAAGAACTCCTGAAGCAACTTAACTCATCATTTCAGCCTCTGAGTAGAGGTAAAACCTATGTGTACTTCTGTTTATGATCCATATTGATATTTATGACATGAACACAGAATAGTACCTTACATTTGCTAAACAGACAGTTAATATCAAATCCTTTCAATATTCTGGGAACCCAGGGAAGTTTTTAAAAATGTCATTACTTTCAAAGGAACAGAAGTAGTTAACCAAACTAACAAGCAAAACCTGAGGTTTACCTAGTGACACCAAATTATCGGTATTTTAACTGAATTTACCCATTGACTAAGAATGAACCAGATTTGGTGGTGGTTTTGTTTCTATGCAAACTGGACACAAATTACAACAGTAAATTTTTTTATAAGTGCTTCTCCCTTCTCCATGATGTGACTTCCGGAGATAAAGGATTCAAAAGATAAAGACAAAGTACGCTCAGAGTTGTTAACCAGAAAGTCCTGGCTGTGGTTGCAGAAACACTGTTGGAAGAAAAGAGATGACTAAGTCAAGTGTCTGCCTTATCAAAAGAGCAAAAATGCCTCTGGTTTTGTGTTTGGGAGAAAAGTATCTTGGACGCACTGTTTTCCTTGATAAAAGTCATCTTCTCTACTGTGTGAAATGAATACTTGGAATTCTAATTGTTTTGTGTGCCAGGGGCAGTAATGTCCCTGCCTCTTCTCCCAATCAAGGTTGAGGAGTGGGGCTGGGGAGAGGACTTAACTGACTTAAGAAGTAGGAAAACAAAAACCTCTCTCCTCAGCCTTCCACCTCCAAGAGAGGAGGAAAAACAGTTGTCTGCTGTCTGTAATTCAGTTTGCGTGTATTTTATGCTCATGCACCAACCCATACAGAGTAAATCTTTTATCAACTGTATACTGGTGTTTAATAGAGAATGATTGTCTTCCGAGTTTTTTGGTTCCTTTTTTAACTGTGTTAAAGTACTTGAAATGTATTGACTGCTGACTATATTTTAAAAACAAAATGAAATAATTTGAGTTGTATTACAGAGGTTGACATTGTTCAGGGATGGGACAAAGCCTTCTTCAATCCTTTTCATACTACTTAATGATTTTGGTGCAGGAACCTGAGATTTTCTGATTTATATTTCATGATATTTCACATTTGCTCTTCACAGCATGAGCATGAAGCCCAGTGGCACCAAATGGCTGGGTACAATCAAGTGATATTTTGTAGCACCTCACTATCTGAAAGGCCATGAGTTTTCAGATGATTTCATTGAGCTTCATTGCAGCCTGAAATTTTAAAAAAGTTGTGTAATACGCCAACCAGTCAAGTTGTGTTTTGGCCAGAGATTTAGATATGTCCAATTTCCTGGCTCATTTCATTGTGCTCTATGGGTACGTATAAAAAGCAAGAATTCTGTTTCCTAGGCAAACATTGCAACTCAGGGCTAAAGTCATCCAGTGAAACTTTTAGAGCCAGAAGTAACTTTGTCCCAGTCCTACAATGTGAAAAGAGTGAATAGTTGCCTCTTTTTAGCCATTTTCATGGCTGGTACATATTCGTACGCATTACTTTTCAGAATCAATACGCACTTTCAGATATTCTTATTTTTATTCTCTTAAGTCTTTATTAACTTTGGAGAGAGAAATGATGCATCTTTTTATTTTAAATGAAGTAGATCAACATGGTGGAACAAAATGATAAAGAACAGAAAACATTTCAATATATTACTAATAACTTTTTCCAATATAAATCCTAAAATTCCTATAACATAGTATTTTACAGTTTTATGAAGCTTTCTATTGTGACTTTTATGGAATTAAGAGATGAAGAAGATGAGATATTTTAGCATTTATATTTTTCAAAATTATATGTATACTTAAAAATAAAGTAACTTTATGCA SEQ ID NO:793 SLC4A4之全mRNA多核苷酸序列 GGCGGCGCGCCGGGCAGCGCTTCGGTGGCGGCGGCGGCCGCGGTGGCAGCGAAGGCGGCGGCGGCGGCGGCAGTGGCAGTGGCCGCTGCAGCCCCACACTCCGCCGCCAAACTGGAGGAGCGACGGAAGCCAGACCCCAGGAGGATGGAGGATGAAGCTGTCCTGGACAGAGGGGCTTCCTTCCTCAAGCATGTGTGTGATGAAGAAGAAGTAGAAGGCCACCATACCATTTACATCGGAGTCCATGTGCCGAAGAGTTACAGGAGAAGGAGACGTCACAAGAGAAAGACAGGGCACAAAGAAAAGAAGGAAAAGGAGAGAATCTCTGAGAACTACTCTGACAAATCAGATATTGAAAATGCTGATGAATCCAGCAGCAGCATCCTAAAACCTCTCATCTCTCCTGCTGCAGAACGCATCCGATTCATCTTGGGAGAGGAGGATGACAGCCCAGCTCCCCCTCAGCTCTTCACGGAACTGGATGAGCTGCTGGCCGTGGATGGGCAGGAGATGGAGTGGAAGGAAACAGCCAGGTGGATCAAGTTTGAAGAAAAAGTGGAACAGGGTGGGGAAAGATGGAGCAAGCCCCATGTGGCCACATTGTCCCTTCATAGTTTATTTGAGCTGAGGACATGTATGGAGAAAGGATCCATCATGCTTGATCGGGAGGCTTCTTCTCTCCCACAGTTGGTGGAGATGATTGTTGACCATCAGATTGAGACAGGCCTATTGAAACCTGAACTTAAGGATAAGGTGACCTATACTTTGCTCCGGAAGCACCGGCATCAAACCAAGAAATCCAACCTTCGGTCCCTGGCTGACATTGGGAAGACAGTCTCCAGTGCAAGTAGGATGTTTACCAACCCTGATAATGGTAGCCCAGCCATGACCCATAGGAATCTGACTTCCTCCAGTCTGAATGACATTTCTGATAAACCGGAGAAGGACCAGCTGAAGAATAAGTTCATGAAAAAATTGCCACGTGATGCAGAAGCTTCCAACGTGCTTGTTGGGGAGGTTGACTTTTTGGATACTCCTTTCATTGCCTTTGTTAGGCTACAGCAGGCTGTCATGCTGGGTGCCCTGACTGAAGTTCCTGTGCCCACAAGGTTCTTGTTCATTCTCTTAGGTCCTAAGGGGAAAGCCAAGTCCTACCACGAGATTGGCAGAGCCATTGCCACCCTGATGTCTGATGAGGTGTTCCATGACATTGCTTATAAAGCAAAAGACAGGCACGACCTGATTGCTGGTATTGATGAGTTCCTAGATGAAGTCATCGTCCTTCCACCTGGGGAATGGGATCCAGCAATTAGGATAGAGCCTCCTAAGAGTCTTCCATCCTCTGACAAAAGAAAGAATATGTACTCAGGTGGAGAGAATGTTCAGATGAATGGGGATACGCCCCATGATGGAGGTCACGGAGGAGGAGGACATGGGGATTGTGAAGAATTGCAGCGAACTGGACGGTTCTGTGGTGGACTAATTAAAGACATAAAGAGGAAAGCGCCATTTTTTGCCAGTGATTTTTATGATGCTTTAAATATTCAAGCTCTTTCGGCAATTCTCTTCATTTATCTGGCAACTGTAACTAATGCTATCACTTTTGGAGGACTGCTTGGGGATGCCACTGACAACATGCAGGGCGTGTTGGAGAGTTTCCTGGGCACTGCTGTCTCTGGAGCCATCTTTTGCCTTTTTGCTGGTCAACCACTCACTATTCTGAGCAGCACCGGACCTGTCCTAGTTTTTGAGAGGCTTCTATTTAATTTCAGCAAGGACAATAATTTTGACTATTTGGAGTTTCGCCTTTGGATTGGCCTGTGGTCCGCCTTCCTATGTCTCATTTTGGTAGCCACTGATGCCAGCTTCTTGGTTCAATACTTCACACGTTTCACGGAGGAGGGCTTTTCCTCTCTGATTAGCTTCATCTTTATCTATGATGCTTTCAAGAAGATGATCAAGCTTGCAGATTACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAGCTAATATCTCAATATCTAATGACACCACACTGGCCCCAGAGTATTTGCCAACTATGTCTTCTACTGACATGTACCATAATACTACCTTTGACTGGGCATTTTTGTCGAAGAAGGAGTGTTCAAAATACGGAGGAAACCTCGTCGGGAACAACTGTAATTTTGTTCCTGATATCACACTCATGTCTTTTATCCTCTTCTTGGGAACCTACACCTCTTCCATGGCTCTGAAAAAATTCAAAACTAGTCCTTATTTTCCAACCACAGCAAGAAAACTGATCAGTGATTTTGCCATTATCTTGTCCATTCTCATCTTTTGTGTAATAGATGCCCTAGTAGGCGTGGACACCCCAAAACTAATTGTGCCAAGTGAGTTCAAGCCAACAAGTCCAAACCGAGGTTGGTTCGTTCCACCGTTTGGAGAAAACCCCTGGTGGGTGTGCCTTGCTGCTGCTATCCCGGCTTTGTTGGTCACTATACTGATTTTCATGGACCAACAAATTACAGCTGTGATTGTAAACAGGAAAGAACATAAACTCAAGAAAGGAGCAGGGTATCACTTGGATCTCTTTTGGGTGGCCATCCTCATGGTTATATGCTCCCTCATGGCTCTTCCGTGGTATGTAGCTGCTACGGTCATCTCCATTGCTCACATCGACAGTTTGAAGATGGAGACAGAGACTTCTGCACCTGGAGAACAACCAAAGTTTCTAGGAGTGAGGGAACAAAGAGTCACTGGAACCCTTGTGTTTATTCTGACTGGTCTGTCAGTCTTTATGGCTCCCATCTTGAAGTTTATACCCATGCCTGTACTCTATGGTGTGTTCCTGTATATGGGAGTAGCATCCCTTAATGGTGTGCAGTTCATGGATCGTCTGAAGCTGCTTCTGATGCCTCTGAAGCATCAGCCTGACTTCATCTACCTGCGTCATGTTCCTCTGCGCAGAGTCCACCTGTTCACTTTCCTGCAGGTGTTGTGTCTGGCCCTGCTTTGGATCCTCAAGTCAACGGTGGCTGCTATCATTTTTCCAGTAATGATCTTGGCACTTGTAGCTGTCAGAAAAGGCATGGACTACCTCTTCTCCCAGCACGACCTCAGCTTCCTGGATGATGTCATTCCAGAAAAGGACAAGAAAAAGAAGGAGGATGAGAAGAAAAAGAAAAAGAAGAAGGGAAGTCTGGACAGTGACAATGATGATTCTGACTGCCCATACTCAGAAAAAGTTCCAAGTATTAAAATTCCAATGGACATCATGGAACAGCAACCTTTCCTAAGCGATAGCAAACCTTCTGACAGAGAAAGATCACCAACATTCCTTGAACGCCACACATCATGCTGATAAAATTCCTTTCCTTCAGTCACTCGGTATGCCAAGTCCTCCTAGAACTCCAGTAAAAGTTGTGCCTCAAATTAGAATAGAACTTGAACCTGAAGACAATGATTATTTCTGGAGGAGCAAGGGAACAGAAACTACATTGTAACCTGTTTGTCTTTCTTAAAACTGACATTTGTTGTTAATGTCATTTGTTTTTGTTTGGCTGTTTGTTTATTTTTTAACTTTTATTTCGTCTCAGTTTTTGGTCACAGGCCAAATAATACAGCGCTCTCTCTGCTTCTCTCTTGCATAGACACAATCAAGACAATAGTGCACCGTTCCTTAAAAACAGCATCTGAGGAATCCCCCTTTTGTTCTTAAACTTTCAGATGTGTCCTTTGATAACCAAATTCTGTCACTCAAGACACAGACACGCACAGACCCTGTCCTTTGCCTCTATTAAGCAGAGGATGGAAGTATTAAGGATTTTGTAACACCTTTTATGAAAATGTTGAAGGAACTTAAAACTTTAGCTTTGGAGCTGTGCTTACTGGCTTGTCTTTGTCTGGTAGAACAAACCTTGACCTCCAGACAGAGTCCCTTCTCACTTATAGAGCTCTCCAGGACTGGAAAAAGTGCTGCTATTTTAACTTGCTCTTGCTTGTAAATCCTAATCTTAGAGTTATCAAAAGAAGAAAAAACTGAAGGTACTTTACTCCCTATAGAGAAACCATTGCCATCATTGTAGCAAGTGCTGGAATGTCCCTTTTTTCCTATGCAACTTTTTTTAACCCTTTAATGAACTTATCTGTTGAGTACATTGAAGAATATTTTTCTTCCTAGATTTTGTTGTTTAAATTATGGGGCCTAACCTGCCACTTATTTTTTGTCAATTTTTAAAACTTTTTTTTAATTACTGTAAAGAAAATGAATTTTTTCCTGCAGCAGGAAACATAGTTTTGAGTAGTTCTACCTCTTATTTGTAGCTGCCAGGCTTTCTGTAAAAATTGTATTGTATATAATGTGATTTTTACACATACATACACACACAAATACACAATCTCTAGGGTAAGCCAGAAGGCAAGATCAGATTAAAAACACCATGTTTCTAAGCATCCATTTTTCCCTTTCTTTAAAAGAAACTTAACTGTTCTATGAAGGAGATTGAGGGAGAAGAGACAAACTCCTATGTCATGAGAATAACCGATGTTCTGATAATAGTAGCATCTAGGTACAGATGCTGGTTGTATTACCACGTCAATGTCCTATGCAGTATTGTTAGACATTTTCTCATTTTGAAATATTTGTGTGTTTGTGTATGTGCTCTGTGCCATGGCTGGTGTATATATGTGCAATGTTAGAAGGCAAAAGAGTGATGGTAGGCAGAGGGCAAAGTCATTGAATCTCTTATGCCAGTTTTCATAAAACCCAAACCACATATGAAAAAATCCATTAAGGGTCCAAGAAGTCTGTCCATATGAAAATGAGGGTAAATATAGTTTATTTCCCAGGTATCAGTCATTATAATTGATATAATAGCTCTAACATGCAATATAAAATTCATAGGAGTATTAATAGCCCATTTACACATCTATAAAATGTAATGGGATTGCAGAGCTGCAGAGTACAGTGTAACAGTACTCTCATGCAATTTTTTTCAGGATGCAAAGGCAATTATTCTTTGTAAGCGGGACATTTAGAATATATTTGTGTACATATTATATGTATGTATATTTCAAAGTACCACACTGAAAATTAGACATTTATTAACCAAATTTAACGTGGTATTTAAAGGTAATATTTTTAATATGATACATTACATATTGTGAATGTATACTAAAAAAACATTTTAAATGTTAAAATTATAATTTCAGATTCATATAACCACAACTGTGATATATCCTAACTATAACCAGTTGTTGAGGGGTATACTAGAAGCAGAATGAAACCACATTTTTTGGTTTGATAATATGCACTTATTGACTCCCACTCATTGTTATGTTAATTAAGTTATTATTCTGTCTCCTTGTAATTTTGATTACAAAAATTTTATTATCCTGAGTTAGCTGTTACTTTTACAGTACCTGATACTCCTAAAACTTTTAACTTATACAAATTAGTCAATAATGACCCCAATTTTTTCATTAAAATAATAGTGGTGAATTATATGTTATTGTGTTAAAACCTCACTTGCCAAATTCTGGCTTCACATTTGTATTTAGGGCTATCCTTAAAATGATGAGTCTATATTATCTAGCTTTCTATTACCCTAATATAAACTGGTATAAGAAGACTTTCCTTTTTTCTTTATGCATGGAAGCATCAATAAATTGTTTAAAAACCATGTATAGTAAATTCAGCTTAACCCGTGATCTTCTTAAGTTAAAGGTACTTTTGTTTTATAAAAGCTCTAGATAAAACTTTCTTTTCTGATCATGAATCAAGTATCTGTGGTTTCATGCCCCTCTCTATACCTTTCAAAGAACTCCTGAAGCAACTTAACTCATCATTTCAGCCTCTGAGTAGAGGTAAAACCTATGTGTACTTCTGTTTATGATCCATATTGATATTTATGACATGAACACAGAATAGTACCTTACATTTGCTAAACAGACAGTTAATATCAAATCCTTTCAATATTCTGGGAACCCAGGGAAGTTTTTAAAAATGTCATTACTTTCAAAGGAACAGAAGTAGTTAACCAAACTAACAAGCAAAACCTGAGGTTTACCTAGTGACACCAAATTATCGGTATTTTAACTGAATTTACCCATTGACTAAGAATGAACCAGATTTGGTGGTGGTTTTGTTTCTATGCAAACTGGACACAAATTACAACAGTAAATTTTTTTATAAGTGCTTCTCCCTTCTCCATGATGTGACTTCCGGAGATAAAGGATTCAAAAGATAAAGACAAAGTACGCTCAGAGTTGTTAACCAGAAAGTCCTGGCTGTGGTTGCAGAAACACTGTTGGAAGAAAAGAGATGACTAAGTCAAGTGTCTGCCTTATCAAAAGAGCAAAAATGCCTCTGGTTTTGTGTTTGGGAGAAAAGTATCTTGGACGCACTGTTTTCCTTGATAAAAGTCATCTTCTCTACTGTGTGAAATGAATACTTGGAATTCTAATTGTTTTGTGTGCCAGGGGCAGTAATGTCCCTGCCTCTTCTCCCAATCAAGGTTGAGGAGTGGGGCTGGGGAGAGGACTTAACTGACTTAAGAAGTAGGAAAACAAAAACCTCTCTCCTCAGCCTTCCACCTCCAAGAGAGGAGGAAAAACAGTTGTCTGCTGTCTGTAATTCAGTTTGCGTGTATTTTATGCTCATGCACCAACCCATACAGAGTAAATCTTTTATCAACTGTATACTGGTGTTTAATAGAGAATGATTGTCTTCCGAGTTTTTTGGTTCCTTTTTTAACTGTGTTAAAGTACTTGAAATGTATTGACTGCTGACTATATTTTAAAAACAAAATGAAATAATTTGAGTTGTATTACAGAGGTTGACATTGTTCAGGGATGGGACAAAGCCTTCTTCAATCCTTTTCATACTACTTAATGATTTTGGTGCAGGAACCTGAGATTTTCTGATTTATATTTCATGATATTTCACATTTGCTCTTCACAGCATGAGCATGAAGCCCAGTGGCACCAAATGGCTGGGTACAATCAAGTGATATTTTGTAGCACCTCACTATCTGAAAGGCCATGAGTTTTCAGATGATTTCATTGAGCTTCATTGCAGCCTGAAATTTTAAAAAAGTTGTGTAATACGCCAACCAGTCAAGTTGTGTTTTGGCCAGAGATTTAGATATGTCCAATTTCCTGGCTCATTTCATTGTGCTCTATGGGTACGTATAAAAAGCAAGAATTCTGTTTCCTAGGCAAACATTGCAACTCAGGGCTAAAGTCATCCAGTGAAACTTTTAGAGCCAGAAGTAACTTTGTCCCAGTCCTACAATGTGAAAAGAGTGAATAGTTGCCTCTTTTTAGCCATTTTCATGGCTGGTACATATTCGTACGCATTACTTTTCAGAATCAATACGCACTTTCAGATATTCTTATTTTTATTCTCTTAAGTCTTTATTAACTTTGGAGAGAGAAATGATGCATCTTTTTATTTTAAATGAAGTAGATCAACATGGTGGAACAAAATGATAAAGAACAGAAAACATTTCAATATATTACTAATAACTTTTTCCAATATAAATCCTAAAATTCCTATAACATAGTATTTTACAGTTTTATGAAGCTTTCTATTGTGACTTTTATGGAATTAAGAGATGAAGAAGATGAGATATTTTAGCATTTATATTTTTCAAAATTATATGTATACTTAAAAATAAAGTAACTTTATGCA SEQ ID NO:794 SLC4A4之外顯子2的轉譯多肽序列 MEDEAVLDRGASFLKHVCDEEEVE SEQ ID NO:795 SLC4A4之外顯子3的轉譯多肽序列 HHTIYIGVHVPKSYRRRRRHKRKTGHKEKKEKERISENYSDKSDIENADESSSSILKPL SEQ ID NO:796 SLC4A4之外顯子4的轉譯多肽序列 SPAAERIRFILGEEDDSPAPPQLFTELDELLAVDGQEMEWKETA SEQ ID NO:797 SLC4A4之外顯子5的轉譯多肽序列 WIKFEEKVEQGGERWSKPHVATLSLHSLFELRTCMEKGSIMLDREASSLPQLV SEQ ID NO:798 SLC4A4之外顯子6的轉譯多肽序列 MIVDHQIETGLLKPELKDKVTYTLLRKHRHQTKKSNLRSLADIGKTVSSASRMFTNPDN SEQ ID NO:799 SLC4A4之外顯子7的轉譯多肽序列 SPAMTHRNLTSSSLNDISDKPEKDQ SEQ ID NO:800 SLC4A4之外顯子8的轉譯多肽序列 LKNKFMKKLPRDAEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVPT SEQ ID NO:801 SLC4A4之外顯子9的轉譯多肽序列 FLFILLGPKGKAKSYHEIGRAIATLMSDE SEQ ID NO:802 SLC4A4之外顯子10的轉譯多肽序列 VFHDIAYKAKDRHDLIAGIDEFLDEVIVLPPGEWDPAIRIEPPKSLPSSDK SEQ ID NO:803 SLC4A4之外顯子11的轉譯多肽序列 KNMYSGGENVQMNGDTPHDGGHGGGGHGDCEELQRTG SEQ ID NO:804 SLC4A4之外顯子12的轉譯多肽序列 FCGGLIKDIKRKAPFFASDFYDALNIQALSAILFIYLATVTNAITFGGLLGDATDNMQ SEQ ID NO:805 SLC4A4之外顯子13的轉譯多肽序列 GVLESFLGTAVSGAIFCLFAGQPLTILSSTGPVLVFERLLFNFS SEQ ID NO:806 SLC4A4之外顯子14的轉譯多肽序列 DNNFDYLEFRLWIGLWSAFLCLILVATDASFLVQYFTRFTEEGFSSLISFIFIYDAFKKMIKLADYYPINSNFKVGYNTLFSCTCVPPDP SEQ ID NO:807 SLC4A4之外顯子15的轉譯多肽序列 NISISNDTTLAPEYLPTMSSTDM SEQ ID NO:808 SLC4A4之外顯子16的轉譯多肽序列 YHNTTFDWAFLSKKECSKYGGNLVGNNCNFVPDITLMSFILFLGTYTSSMALKKFKTSPYFPTT SEQ ID NO:809 SLC4A4之外顯子17的轉譯多肽序列 ARKLISDFAIILSILIFCVIDALVGVDTPKLIVPSEFK SEQ ID NO:810 SLC4A4之外顯子18的轉譯多肽序列 PTSPNRGWFVPPFGENPWWVCLAAAIPALLVTILIFMDQQITAVIVNRKEHKLK SEQ ID NO:811 SLC4A4之外顯子19的轉譯多肽序列 KGAGYHLDLFWVAILMVICSLMALPWYVAATVISIAHIDSLKMETETSAPGEQPKFLGV SEQ ID NO:812 SLC4A4之外顯子20的轉譯多肽序列 EQRVTGTLVFILTGLSVFMAPILK SEQ ID NO:813 SLC4A4之外顯子21的轉譯多肽序列 FIPMPVLYGVFLYMGVASLNGVQ SEQ ID NO:814 SLC4A4之外顯子22的轉譯多肽序列 FMDRLKLLLMPLKHQPDFIYLRHVPLRRVHLFTFLQVLCLALLWILKSTVAAIIFPVM SEQ ID NO:815 SLC4A4之外顯子23的轉譯多肽序列 ILALVAVRKGMDYLFSQHDLSFLDDVIPEKDKKKKEDEKKKKKKKGSLDSDNDD SEQ ID NO:816 SLC4A4之外顯子24的轉譯多肽序列 SDCPYSEKVPSIKIPMDIMEQQPFLSDSKPSD SEQ ID NO:817 SLC4A4之外顯子25的轉譯多肽序列 ERSPTFLERHTSC SEQ ID NO:819 SLC4A4之全蛋白質序列 MEDEAVLDRGASFLKHVCDEEEVEGHHTIYIGVHVPKSYRRRRRHKRKTGHKEKKEKERISENYSDKSDIENADESSSSILKPLISPAAERIRFILGEEDDSPAPPQLFTELDELLAVDGQEMEWKETARWIKFEEKVEQGGERWSKPHVATLSLHSLFELRTCMEKGSIMLDREASSLPQLVEMIVDHQIETGLLKPELKDKVTYTLLRKHRHQTKKSNLRSLADIGKTVSSASRMFTNPDNGSPAMTHRNLTSSSLNDISDKPEKDQLKNKFMKKLPRDAEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVPTRFLFILLGPKGKAKSYHEIGRAIATLMSDEVFHDIAYKAKDRHDLIAGIDEFLDEVIVLPPGEWDPAIRIEPPKSLPSSDKRKNMYSGGENVQMNGDTPHDGGHGGGGHGDCEELQRTGRFCGGLIKDIKRKAPFFASDFYDALNIQALSAILFIYLATVTNAITFGGLLGDATDNMQGVLESFLGTAVSGAIFCLFAGQPLTILSSTGPVLVFERLLFNFSKDNNFDYLEFRLWIGLWSAFLCLILVATDASFLVQYFTRFTEEGFSSLISFIFIYDAFKKMIKLADYYPINSNFKVGYNTLFSCTCVPPDPANISISNDTTLAPEYLPTMSSTDMYHNTTFDWAFLSKKECSKYGGNLVGNNCNFVPDITLMSFILFLGTYTSSMALKKFKTSPYFPTTARKLISDFAIILSILIFCVIDALVGVDTPKLIVPSEFKPTSPNRGWFVPPFGENPWWVCLAAAIPALLVTILIFMDQQITAVIVNRKEHKLKKGAGYHLDLFWVAILMVICSLMALPWYVAATVISIAHIDSLKMETETSAPGEQPKFLGVREQRVTGTLVFILTGLSVFMAPILKFIPMPVLYGVFLYMGVASLNGVQFMDRLKLLLMPLKHQPDFIYLRHVPLRRVHLFTFLQVLCLALLWILKSTVAAIIFPVMILALVAVRKGMDYLFSQHDLSFLDDVIPEKDKKKKEDEKKKKKKKGSLDSDNDDSDCPYSEKVPSIKIPMDIMEQQPFLSDSKPSDRERSPTFLERHTSC SEQ ID NO:820 SLC4A4之全部外顯子1-14按順序排列的多核苷酸序列 SEQ ID NO:821 SLC4A4之全部外顯子1-14按順序排列的轉譯多肽序列 SDC4 (2) 序列資訊SEQ ID NO:822 來自SDC4-NRG1融合物5’處之SDC4外顯子4的序列 ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAG SEQ ID NO:823 SDC4-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:824 SDC4-NRG1多核苷酸序列 ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:825 SDC4-NRG1多肽序列 VSNKVSMSSTVQGSNIFERTEVLAALPPRLKEMKSQESAAGSKLVLRCE SEQ ID NO:940 SDC4之全部外顯子1-4按順序排列的多核苷酸序列 SEQ ID NO:941 SDC4之全部外顯子1-4按順序排列的轉譯多肽序列 ZFAT 序列資訊SEQ ID NO:826 來自ZFAT-NRG1融合物5’處之ZFAT外顯子12的序列 ACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAG SEQ ID NO:827 ZFAT-NRG1融合物3’處之NRG1外顯子6序列 CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:828 ZFAT-NRG1多核苷酸序列 ACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:829 ZFAT-NRG1多肽序列 RKHPNEEYANVGTGELAAEVLIQQATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:830 ZFAT之外顯子1 ATCCGCCATGTTGGATGCCGCAGATTCGCCATAACCTCGCCGGCTCTTTTCTTAAAAAAATAAAAATAAAAAGCGAAGCGTCAGCAGGGCGCCCCGCCCCCTCGGTCGGCACGGGAGGGGGCCCGGAAGAGCCCGAGGCTTTTTTTTCCTCCGCGGTGGGGCGTTGCCATGGAGACGCGGGCGGCAG SEQ ID NO:831 ZFAT之外顯子2 AAAACACGGCCATCTTTATGTGTAAATGTTGTAACCTCTTCTCACCAAATCAGTCGGAACTCCTCTCCCACGTTTCAGAGAAGCACATGGAAGAAGGGGTTAATGTTGATGAGATTATTATTCCCCTTAGGCCTCTGAGTACACCTGAACCCCCCAACTCAAGCAAAACCGGAGATG SEQ ID NO:832 ZFAT之外顯子3 AGTTTTTGGTCATGAAGAGGAAGAGAGGCAGGCCTAAGGGGTCCACGAAGAAGTCCAGCACAGAAGAGGAGCTGGCAGAAAACATCGTGAGTCCGACTGAGGACAGCCCGCTGGCTCCGGAGGAAGGGAACAGCCTGCCTCCAAGCAGCTTGGAGTGTAGCAAGTGCTGTCGGAAGTTCTCCAACACGCGCCAGCTGCGGAAGCACATCTGCATTATCGTGCTGAATTTGGGTGAGGAGGAAGGAGAAGCAG SEQ ID NO:833 ZFAT之外顯子4 GTAACGAGTCTGACCTTGAACTAGAAAAGAAGTGTAAGGAAGATGATCGGGAAAAAGCCTCGAAAAGACCACGGTCACAGAAAACAGAGAAAGTCCAGAAGATCTCAGGAAAGGAGGCCAGACAGCTTTCTGGGGCGAAGAAACCCATCATAAGTGTGGTTTTAACTGCACACGAAGCAATTCCAG SEQ ID NO:834 ZFAT之外顯子5 GTGCTACCAAGATTGTGCCAGTGGAGGCTGGGCCCCCTGAAACAGGAGCTACAAATTCTGAGACCACTTCAGCAGACCTGGTGCCTCGGAGAGGCTACCAGGAATACGCCATTCAGCAGACACCTTATGAGCAACCAATGAAGTCAAGCAG SEQ ID NO:835 ZFAT之外顯子6 GCTAGGTCCCACTCAGCTCAAAATCTTCACTTGTGAATACTGCAACAAGGTCTTCAAGTTCAAGCACTCGCTGCAGGCCCACCTGAGGATCCACACCAATGAAAAGCCATACAAGTGCCCCCAGTGCAGCTATGCCAGTGCCATCAAGGCCAACCTCAATGTGCACCTGCGCAAGCACACTGGAGAGAAGTTCGCCTGCGACTATTGCTCGTTCACCTGCCTGAGCAAGGGCCACCTCAAGGTGCACATCGAGCGAGTGCACAAGAAGATCAAGCAGCACTGCCGCTTCTGCAAGAAGAAGTACTCTGACGTCAAGAACCTCATCAAGCACATCCGAGACGCGCATGACCCACAGGACAAGAAGGTCAAAGAGGCCTTGGACGAGCTCTGCCTGATGACGAGGGAGGGCAAGCGGCAGCTGCTCTATGACTGCCACATCTGTGAGCGCAAGTTCAAGAACGAGCTGGACCGTGACCGCCATATGCTGGTCCACGGAGACAAGTGGCCTTTTGCCTGTGAGCTCTGTGGCCATGGGGCCACCAAGTACCAGGCGCTGGAACTGCATGTCAGGAAGCACCCCTTCGTGTACGTCTGTGCCGTCTGCCGCAAGAAGTTCGTCAGCTCCATCAGGCTGCGCACCCACATCAAAGAGGTGCACGGGGCTGCCCAGGAGGCCTTGGTCTTCACCAGTTCCATCAACCAGAGCTTCTGCCTCCTGGAACCTGGTGGGGACATCCAGCAAGAAGCTCTGGGGGACCAGCTACAGCTGGTGGAAGAGGAGTTTGCCCTCCAGGGCGTGAATGCACTCAAGGAAGAGGCCTGTCCTGGGGACACTCAGCTGGAGGAGGGCCGGAAGGAGCCGGAGGCCCCTGGGGAAATGCCTGCCCCAGCTGTGCACCTGGCCTCCCCGCAGGCCGAAAGCACAGCCCTGCCACCCTGTGAGCTGGAAACCACCGTGGTCTCCTCCTCAGACCTGCATTCTCAAGAGGTGGTTTCAGATGATTTTTTGTTGAAAAATGATACCTCCTCCGCAGAGGCTCATGCTGCTCCTGAGAAGCCCCCAGACATGCAGCACAGAAGCTCAGTCCAGACGCAAGGTGAAGTGATCACACTACTGCTGTCCAAGGCCCAGAGTGCTGGGTCAGATCAGGAAAGCCATGGCGCCCAGAGCCCCCTAGGGGAAGGGCAGAACATGGCTGTGCTTTCAGCTGGTGACCCAGATCCCAGCAGGTGTCTCAGGTCAAACCCAGCTGAGGCCTCAGACCTCCTCCCTCCAGTAGCTGGTGGTGGGGACACCATCACACATCAGCCTGACTCTTGCAAAGCTGCCCCTGAGCACCGGTCAGGCATCACCGCTTTCATGAAGGTCCTGAACAGTTTACAGAAGAAGCAAATGAACACCAGCTTGTGTGAGCGGATCCGGAAGGTTTATGGAGACCTGGAGTGTGAATACTGTG SEQ ID NO:836 ZFAT之外顯子7 GCAAACTTTTTTGGTACCAAGTGCATTTTGATATGCATGTCCGCACCCACACCCGGGAACATCTGTATTATTGCTCTCAGTGTCATTATTCTTCCATCACCAAAAACTGCCTTAAACGCCACGTAATTCAGAAACACAGTAACATCTTGCTGAAGTGTCCCACCGATGGCTGTGACTACTCAACTCCAGATAAATATAAGCTACAGGCACATCTTAAAGTTCACACAGCACTG SEQ ID NO:837 ZFAT之外顯子8 GACAAAAGGAGTTATTCTTGTCCTGTTTGTGAAAAGTCTTTTTCAGAGGATCGATTGATAAAGTCACATATCAAGACCAACCATCCTG SEQ ID NO:838 ZFAT之外顯子9 AGGTCTCCATGAGCACCATTTCTGAGGTTCTCGGGAGGAGGGTTCAGCTGAAAGGGCTAATTGGAAAGAGAGCCATGAAATGCCCATATTGTGACTTTTATTTCATGAAGAATGGCTCAGACCTTCAGCGTCATATTTGGGCTCATGAAG SEQ ID NO:839 ZFAT之外顯子10 GTGTGAAGCCCTTCAAGTGTTCTTTGTGTGAGTATGCAACTCGTAGCAAGAGTAACCTCAAGGCTCATATGAATCGTCACAGCACTGAGAAAACCCACCTATGTGACATGTGTGGCAAGAAATTCAAATCAAAAGGGACACTGAAAAGTCACAAACTCCTTCACACTGCAGATG SEQ ID NO:840 ZFAT之外顯子11 GGAAGCAGTTTAAGTGCACGGTGTGTGACTACACAGCGGCCCAGAAGCCACAGCTGCTGCGGCACATGGAACAGCATGTCTCCTTCAAG SEQ ID NO:841 ZFAT之外顯子12 CCTTTCCGCTGTGCCCATTGCCATTACTCCTGCAACATATCTGGCTCTCTGAAGCGGCACTACAACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAG SEQ ID NO:842 ZFAT之外顯子13 GTGGTTTGAAGTGTCCTGTTTGCAGCTTTGTATATGGCACCAAATGGGAGTTCAATAGGCACTTGAAGAACAAACATGGCTTGAAGGTGGTGGAAATTGATGGAGACCCCAAGTGGGAG SEQ ID NO:843 ZFAT之外顯子14 ACAGCAACAGAAGCTCCTGAGGAGCCCTCCACCCAGTATCTCCACATCACAGAGGCCGAAGAAGACGTTCAAGGGACACAGGCAGCGGTGGCCGCGCTCCAGGACCTGAGATACACCTCTGAGAGTG SEQ ID NO:844 ZFAT之外顯子15 GCGACCGACTGGACCCCACGGCCGTGAACATCCTGCAGCAGATCATTGAGCTGGGCGCCGAGACCCATGACGCCACTGCCCTTGCCTCGGTGGTTGCCATGGCACCAGGGACGGTGACTGTGGTTAAGCAG SEQ ID NO:845 ZFAT之外顯子16 GTCACCGAGGAGGAGCCCAGCTCCAACCACACGGTCATGATCCAGGAGACGGTCCAGCAAGCGTCCGTGGAGCTTGCCGAGCAGCACCACCTGGTGGTGTCCTCCGACGACGTGGAGGGCATTGAGACGGTGACTGTCTACACGCAGGGCGGGGAGGCCTCGGAGTTCATCGTCTACGTGCAGGAGGCCATGCAGCCTGTGGAGGAGCAGGCTGTGGAGCAGCCGGCCCAGGAACTCTAGAGGACATGTGGCATCGGATGGCCACAGGGCGGGGCTGCCAGGCTCTGCAGGCACCCAGGGTGGGGAGGCCACCCTTCCTGCCCTACCCGCAGAATGGTGCTCTCCTTTGCCCTCCCTGCCCAGCAGCCTGATAGGACTCTCCTAGTCCAACTTGGGGTGGGCAAGGCAGTCAGCATCACCAGCAATACCACAGGACCCTCACCCCAGCATAGACACACACCCCCTGACCCTTACCATCTGCTTCCTGAAAGACTTCAGTGTCAGCTCCCCTACACACACCCCACACCTTCACCCCTTGCTTCAAGATTCAAACAGAGACTCCCAGTCCCCCTCAGCATCTTCCCTGAATCACAGCCCCAGCTCCTTGACCCCCATCTAGGTGCCAAATGTTCATCTGCAACCGCTATGCAGTCTGGTGAGAGGGAGACAGCCATCACATAGAAAGTGACCGTACGGGTTTTTAATCACTGCTGGGTGGGGTGGGGGTAGGGGGATTGTCCTGGCTTTGTCGACAAAGTCCCACTTCCCCGAGTATTAAGGGCCCTTGGTATCAAGTGAGGTAAATTCACCCATCACAGGGTCTCGCCCTACCATCCTGGAATTATTTCACTTTTAAGATAAATGCACTATTTCACTGTTCGCCTCCCATTCTAAGGAGGTGAGGTGGTTGGAATAAAAACAGTTCCTGTCTGAA SEQ ID NO:846 ZFAT之全mRNA多核苷酸序列 ATCCGCCATGTTGGATGCCGCAGATTCGCCATAACCTCGCCGGCTCTTTTCTTAAAAAAATAAAAATAAAAAGCGAAGCGTCAGCAGGGCGCCCCGCCCCCTCGGTCGGCACGGGAGGGGGCCCGGAAGAGCCCGAGGCTTTTTTTTCCTCCGCGGTGGGGCGTTGCCATGGAGACGCGGGCGGCAGAAAACACGGCCATCTTTATGTGTAAATGTTGTAACCTCTTCTCACCAAATCAGTCGGAACTCCTCTCCCACGTTTCAGAGAAGCACATGGAAGAAGGGGTTAATGTTGATGAGATTATTATTCCCCTTAGGCCTCTGAGTACACCTGAACCCCCCAACTCAAGCAAAACCGGAGATGAGTTTTTGGTCATGAAGAGGAAGAGAGGCAGGCCTAAGGGGTCCACGAAGAAGTCCAGCACAGAAGAGGAGCTGGCAGAAAACATCGTGAGTCCGACTGAGGACAGCCCGCTGGCTCCGGAGGAAGGGAACAGCCTGCCTCCAAGCAGCTTGGAGTGTAGCAAGTGCTGTCGGAAGTTCTCCAACACGCGCCAGCTGCGGAAGCACATCTGCATTATCGTGCTGAATTTGGGTGAGGAGGAAGGAGAAGCAGGTAACGAGTCTGACCTTGAACTAGAAAAGAAGTGTAAGGAAGATGATCGGGAAAAAGCCTCGAAAAGACCACGGTCACAGAAAACAGAGAAAGTCCAGAAGATCTCAGGAAAGGAGGCCAGACAGCTTTCTGGGGCGAAGAAACCCATCATAAGTGTGGTTTTAACTGCACACGAAGCAATTCCAGGTGCTACCAAGATTGTGCCAGTGGAGGCTGGGCCCCCTGAAACAGGAGCTACAAATTCTGAGACCACTTCAGCAGACCTGGTGCCTCGGAGAGGCTACCAGGAATACGCCATTCAGCAGACACCTTATGAGCAACCAATGAAGTCAAGCAGGCTAGGTCCCACTCAGCTCAAAATCTTCACTTGTGAATACTGCAACAAGGTCTTCAAGTTCAAGCACTCGCTGCAGGCCCACCTGAGGATCCACACCAATGAAAAGCCATACAAGTGCCCCCAGTGCAGCTATGCCAGTGCCATCAAGGCCAACCTCAATGTGCACCTGCGCAAGCACACTGGAGAGAAGTTCGCCTGCGACTATTGCTCGTTCACCTGCCTGAGCAAGGGCCACCTCAAGGTGCACATCGAGCGAGTGCACAAGAAGATCAAGCAGCACTGCCGCTTCTGCAAGAAGAAGTACTCTGACGTCAAGAACCTCATCAAGCACATCCGAGACGCGCATGACCCACAGGACAAGAAGGTCAAAGAGGCCTTGGACGAGCTCTGCCTGATGACGAGGGAGGGCAAGCGGCAGCTGCTCTATGACTGCCACATCTGTGAGCGCAAGTTCAAGAACGAGCTGGACCGTGACCGCCATATGCTGGTCCACGGAGACAAGTGGCCTTTTGCCTGTGAGCTCTGTGGCCATGGGGCCACCAAGTACCAGGCGCTGGAACTGCATGTCAGGAAGCACCCCTTCGTGTACGTCTGTGCCGTCTGCCGCAAGAAGTTCGTCAGCTCCATCAGGCTGCGCACCCACATCAAAGAGGTGCACGGGGCTGCCCAGGAGGCCTTGGTCTTCACCAGTTCCATCAACCAGAGCTTCTGCCTCCTGGAACCTGGTGGGGACATCCAGCAAGAAGCTCTGGGGGACCAGCTACAGCTGGTGGAAGAGGAGTTTGCCCTCCAGGGCGTGAATGCACTCAAGGAAGAGGCCTGTCCTGGGGACACTCAGCTGGAGGAGGGCCGGAAGGAGCCGGAGGCCCCTGGGGAAATGCCTGCCCCAGCTGTGCACCTGGCCTCCCCGCAGGCCGAAAGCACAGCCCTGCCACCCTGTGAGCTGGAAACCACCGTGGTCTCCTCCTCAGACCTGCATTCTCAAGAGGTGGTTTCAGATGATTTTTTGTTGAAAAATGATACCTCCTCCGCAGAGGCTCATGCTGCTCCTGAGAAGCCCCCAGACATGCAGCACAGAAGCTCAGTCCAGACGCAAGGTGAAGTGATCACACTACTGCTGTCCAAGGCCCAGAGTGCTGGGTCAGATCAGGAAAGCCATGGCGCCCAGAGCCCCCTAGGGGAAGGGCAGAACATGGCTGTGCTTTCAGCTGGTGACCCAGATCCCAGCAGGTGTCTCAGGTCAAACCCAGCTGAGGCCTCAGACCTCCTCCCTCCAGTAGCTGGTGGTGGGGACACCATCACACATCAGCCTGACTCTTGCAAAGCTGCCCCTGAGCACCGGTCAGGCATCACCGCTTTCATGAAGGTCCTGAACAGTTTACAGAAGAAGCAAATGAACACCAGCTTGTGTGAGCGGATCCGGAAGGTTTATGGAGACCTGGAGTGTGAATACTGTGGCAAACTTTTTTGGTACCAAGTGCATTTTGATATGCATGTCCGCACCCACACCCGGGAACATCTGTATTATTGCTCTCAGTGTCATTATTCTTCCATCACCAAAAACTGCCTTAAACGCCACGTAATTCAGAAACACAGTAACATCTTGCTGAAGTGTCCCACCGATGGCTGTGACTACTCAACTCCAGATAAATATAAGCTACAGGCACATCTTAAAGTTCACACAGCACTGGACAAAAGGAGTTATTCTTGTCCTGTTTGTGAAAAGTCTTTTTCAGAGGATCGATTGATAAAGTCACATATCAAGACCAACCATCCTGAGGTCTCCATGAGCACCATTTCTGAGGTTCTCGGGAGGAGGGTTCAGCTGAAAGGGCTAATTGGAAAGAGAGCCATGAAATGCCCATATTGTGACTTTTATTTCATGAAGAATGGCTCAGACCTTCAGCGTCATATTTGGGCTCATGAAGGTGTGAAGCCCTTCAAGTGTTCTTTGTGTGAGTATGCAACTCGTAGCAAGAGTAACCTCAAGGCTCATATGAATCGTCACAGCACTGAGAAAACCCACCTATGTGACATGTGTGGCAAGAAATTCAAATCAAAAGGGACACTGAAAAGTCACAAACTCCTTCACACTGCAGATGGGAAGCAGTTTAAGTGCACGGTGTGTGACTACACAGCGGCCCAGAAGCCACAGCTGCTGCGGCACATGGAACAGCATGTCTCCTTCAAGCCTTTCCGCTGTGCCCATTGCCATTACTCCTGCAACATATCTGGCTCTCTGAAGCGGCACTACAACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAGGTGGTTTGAAGTGTCCTGTTTGCAGCTTTGTATATGGCACCAAATGGGAGTTCAATAGGCACTTGAAGAACAAACATGGCTTGAAGGTGGTGGAAATTGATGGAGACCCCAAGTGGGAGACAGCAACAGAAGCTCCTGAGGAGCCCTCCACCCAGTATCTCCACATCACAGAGGCCGAAGAAGACGTTCAAGGGACACAGGCAGCGGTGGCCGCGCTCCAGGACCTGAGATACACCTCTGAGAGTGGCGACCGACTGGACCCCACGGCCGTGAACATCCTGCAGCAGATCATTGAGCTGGGCGCCGAGACCCATGACGCCACTGCCCTTGCCTCGGTGGTTGCCATGGCACCAGGGACGGTGACTGTGGTTAAGCAGGTCACCGAGGAGGAGCCCAGCTCCAACCACACGGTCATGATCCAGGAGACGGTCCAGCAAGCGTCCGTGGAGCTTGCCGAGCAGCACCACCTGGTGGTGTCCTCCGACGACGTGGAGGGCATTGAGACGGTGACTGTCTACACGCAGGGCGGGGAGGCCTCGGAGTTCATCGTCTACGTGCAGGAGGCCATGCAGCCTGTGGAGGAGCAGGCTGTGGAGCAGCCGGCCCAGGAACTCTAGAGGACATGTGGCATCGGATGGCCACAGGGCGGGGCTGCCAGGCTCTGCAGGCACCCAGGGTGGGGAGGCCACCCTTCCTGCCCTACCCGCAGAATGGTGCTCTCCTTTGCCCTCCCTGCCCAGCAGCCTGATAGGACTCTCCTAGTCCAACTTGGGGTGGGCAAGGCAGTCAGCATCACCAGCAATACCACAGGACCCTCACCCCAGCATAGACACACACCCCCTGACCCTTACCATCTGCTTCCTGAAAGACTTCAGTGTCAGCTCCCCTACACACACCCCACACCTTCACCCCTTGCTTCAAGATTCAAACAGAGACTCCCAGTCCCCCTCAGCATCTTCCCTGAATCACAGCCCCAGCTCCTTGACCCCCATCTAGGTGCCAAATGTTCATCTGCAACCGCTATGCAGTCTGGTGAGAGGGAGACAGCCATCACATAGAAAGTGACCGTACGGGTTTTTAATCACTGCTGGGTGGGGTGGGGGTAGGGGGATTGTCCTGGCTTTGTCGACAAAGTCCCACTTCCCCGAGTATTAAGGGCCCTTGGTATCAAGTGAGGTAAATTCACCCATCACAGGGTCTCGCCCTACCATCCTGGAATTATTTCACTTTTAAGATAAATGCACTATTTCACTGTTCGCCTCCCATTCTAAGGAGGTGAGGTGGTTGGAATAAAAACAGTTCCTGTCTGAA SEQ ID NO:847 ZFAT之外顯子1的轉譯多肽序列 METRA SEQ ID NO:848 ZFAT之外顯子2的轉譯多肽序列 NTAIFMCKCCNLFSPNQSELLSHVSEKHMEEGVNVDEIIIPLRPLSTPEPPNSSKTGD SEQ ID NO:849 ZFAT之外顯子3的轉譯多肽序列 FLVMKRKRGRPKGSTKKSSTEEELAENIVSPTEDSPLAPEEGNSLPPSSLECSKCCRKFSNTRQLRKHICIIVLNLGEEEGEA SEQ ID NO:850 ZFAT之外顯子4的轉譯多肽序列 NESDLELEKKCKEDDREKASKRPRSQKTEKVQKISGKEARQLSGAKKPIISVVLTAHEAIP SEQ ID NO:851 ZFAT之外顯子5的轉譯多肽序列 ATKIVPVEAGPPETGATNSETTSADLVPRRGYQEYAIQQTPYEQPMKSS SEQ ID NO:852 ZFAT之外顯子6的轉譯多肽序列 LGPTQLKIFTCEYCNKVFKFKHSLQAHLRIHTNEKPYKCPQCSYASAIKANLNVHLRKHTGEKFACDYCSFTCLSKGHLKVHIERVHKKIKQHCRFCKKKYSDVKNLIKHIRDAHDPQDKKVKEALDELCLMTREGKRQLLYDCHICERKFKNELDRDRHMLVHGDKWPFACELCGHGATKYQALELHVRKHPFVYVCAVCRKKFVSSIRLRTHIKEVHGAAQEALVFTSSINQSFCLLEPGGDIQQEALGDQLQLVEEEFALQGVNALKEEACPGDTQLEEGRKEPEAPGEMPAPAVHLASPQAESTALPPCELETTVVSSSDLHSQEVVSDDFLLKNDTSSAEAHAAPEKPPDMQHRSSVQTQGEVITLLLSKAQSAGSDQESHGAQSPLGEGQNMAVLSAGDPDPSRCLRSNPAEASDLLPPVAGGGDTITHQPDSCKAAPEHRSGITAFMKVLNSLQKKQMNTSLCERIRKVYGDLECEYC SEQ ID NO:853 ZFAT之外顯子7的轉譯多肽序列 KLFWYQVHFDMHVRTHTREHLYYCSQCHYSSITKNCLKRHVIQKHSNILLKCPTDGCDYSTPDKYKLQAHLKVHTAL SEQ ID NO:854 ZFAT之外顯子8的轉譯多肽序列 DKRSYSCPVCEKSFSEDRLIKSHIKTNHP SEQ ID NO:855 ZFAT之外顯子9的轉譯多肽序列 VSMSTISEVLGRRVQLKGLIGKRAMKCPYCDFYFMKNGSDLQRHIWAHE SEQ ID NO:856 ZFAT之外顯子10的轉譯多肽序列 VKPFKCSLCEYATRSKSNLKAHMNRHSTEKTHLCDMCGKKFKSKGTLKSHKLLHTAD SEQ ID NO:857 ZFAT之外顯子11的轉譯多肽序列 KQFKCTVCDYTAAQKPQLLRHMEQHVSFK SEQ ID NO:858 ZFAT之外顯子12的轉譯多肽序列 PFRCAHCHYSCNISGSLKRHYNRKHPNEEYANVGTGELAAEVLIQQ SEQ ID NO:859 ZFAT之外顯子13的轉譯多肽序列 GLKCPVCSFVYGTKWEFNRHLKNKHGLKVVEIDGDPKWE SEQ ID NO:860 ZFAT之外顯子14的轉譯多肽序列 TATEAPEEPSTQYLHITEAEEDVQGTQAAVAALQDLRYTSES SEQ ID NO:861 ZFAT之外顯子15的轉譯多肽序列 DRLDPTAVNILQQIIELGAETHDATALASVVAMAPGTVTVVKQ SEQ ID NO:862 ZFAT之外顯子16的轉譯多肽序列 VTEEEPSSNHTVMIQETVQQASVELAEQHHLVVSSDDVEGIETVTVYTQGGEASEFIVYVQEAMQPVEEQAVEQPAQEL SEQ ID NO:863 ZFAT之全蛋白質序列 METRAAENTAIFMCKCCNLFSPNQSELLSHVSEKHMEEGVNVDEIIIPLRPLSTPEPPNSSKTGDEFLVMKRKRGRPKGSTKKSSTEEELAENIVSPTEDSPLAPEEGNSLPPSSLECSKCCRKFSNTRQLRKHICIIVLNLGEEEGEAGNESDLELEKKCKEDDREKASKRPRSQKTEKVQKISGKEARQLSGAKKPIISVVLTAHEAIPGATKIVPVEAGPPETGATNSETTSADLVPRRGYQEYAIQQTPYEQPMKSSRLGPTQLKIFTCEYCNKVFKFKHSLQAHLRIHTNEKPYKCPQCSYASAIKANLNVHLRKHTGEKFACDYCSFTCLSKGHLKVHIERVHKKIKQHCRFCKKKYSDVKNLIKHIRDAHDPQDKKVKEALDELCLMTREGKRQLLYDCHICERKFKNELDRDRHMLVHGDKWPFACELCGHGATKYQALELHVRKHPFVYVCAVCRKKFVSSIRLRTHIKEVHGAAQEALVFTSSINQSFCLLEPGGDIQQEALGDQLQLVEEEFALQGVNALKEEACPGDTQLEEGRKEPEAPGEMPAPAVHLASPQAESTALPPCELETTVVSSSDLHSQEVVSDDFLLKNDTSSAEAHAAPEKPPDMQHRSSVQTQGEVITLLLSKAQSAGSDQESHGAQSPLGEGQNMAVLSAGDPDPSRCLRSNPAEASDLLPPVAGGGDTITHQPDSCKAAPEHRSGITAFMKVLNSLQKKQMNTSLCERIRKVYGDLECEYCGKLFWYQVHFDMHVRTHTREHLYYCSQCHYSSITKNCLKRHVIQKHSNILLKCPTDGCDYSTPDKYKLQAHLKVHTALDKRSYSCPVCEKSFSEDRLIKSHIKTNHPEVSMSTISEVLGRRVQLKGLIGKRAMKCPYCDFYFMKNGSDLQRHIWAHEGVKPFKCSLCEYATRSKSNLKAHMNRHSTEKTHLCDMCGKKFKSKGTLKSHKLLHTADGKQFKCTVCDYTAAQKPQLLRHMEQHVSFKPFRCAHCHYSCNISGSLKRHYNRKHPNEEYANVGTGELAAEVLIQQGGLKCPVCSFVYGTKWEFNRHLKNKHGLKVVEIDGDPKWETATEAPEEPSTQYLHITEAEEDVQGTQAAVAALQDLRYTSESGDRLDPTAVNILQQIIELGAETHDATALASVVAMAPGTVTVVKQVTEEEPSSNHTVMIQETVQQASVELAEQHHLVVSSDDVEGIETVTVYTQGGEASEFIVYVQEAMQPVEEQAVEQPAQEL SEQ ID NO:864 ZFAT之全部外顯子1-12按順序排列的多核苷酸序列 SEQ ID NO:865 ZFAT之全部外顯子1-12按順序排列的轉譯多肽序列 DSCAML1 序列資訊SEQ ID NO:866 來自DSCAML1-NRG1融合物5’處之DSCAML1外顯子3的序列 GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAG SEQ ID NO:867 DSCAML1-NRG1融合物3’處之NRG1外顯子2序列 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:868 DSCAML1-NRG1多核苷酸序列 GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:869 DSCAML1-NRG1多肽序列 LIPSSVQEYVSVVSWEKDTVSIIPALPPRLKEMKSQESAAGSKLVLRCE SEQ ID NO:870 DSCAML1之外顯子1 AGCCGAGCGCTGGGCTGAGGAGCAGAGAGAGCGGGGCGCCGAGTGCGGGCGGCTGGGAGCGCGCTGAGCGGGGGAGAGGCGCTGCCGCACGGCCGGCCACAGGACCACCTCCCCGGAGAATAGGGCCTCTTTATGGCATGTGGCTGGTAACTTTCCTCCTGCTCCTGGACTCTTTACACAAAG SEQ ID NO:871 DSCAML1之外顯子2 CCCGCCCTGAAGATGTTGGCACCAGCCTCTACTTTGTAAATGACTCCTTGCAGCAGGTGACCTTTTCCAGCTCCGTGGGGGTGGTGGTGCCCTGCCCGGCCGCGGGCTCCCCCAGCGCGGCCCTTCGATGGTACCTGGCCACAGGGGACGACATCTACGACGTGCCGCACATCCGGCACGTCCACGCCAACGGGACGCTGCAGCTCTACCCCTTCTCCCCCTCCGCCTTCAATAGCTTTATCCACGACAATGACTACTTCTGCACCGCGGAGAACGCTGCCGGCAAGATCCGGAGCCCCAACATCCGCGTCAAAGCAG SEQ ID NO:872 DSCAML1之外顯子3 TTTTCAGGGAACCCTACACCGTCCGGGTGGAGGATCAAAGGTCAATGCGTGGCAACGTGGCCGTCTTCAAGTGCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAG SEQ ID NO:873 DSCAML1之外顯子4 AACACAGGTTTTTTATTACCTACCACGGCGGGCTGTACATCTCTGACGTACAGAAGGAGGACGCCCTCTCCACCTATCGCTGCATCACCAAGCACAAGTATAGCGGGGAGACCCGGCAGAGCAATGGGGCACGCCTCTCTGTGACAG SEQ ID NO:874 DSCAML1之外顯子5 ACCCTGCTGAGTCGATCCCCACCATCCTGGATGGCTTCCACTCCCAGGAAGTGTGGGCCGGCCACACCGTGGAGCTGCCCTGCACCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGCTCAAGGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCTGACCATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGTCACCAACACCTTCGGTTCGGCAGAGGCCACAGGCATCCTCATGGTCATTG SEQ ID NO:875 DSCAML1之外顯子6 ATCCCCTTCATGTGACCCTGACACCAAAGAAGCTGAAGACCGGCATTGGCAGCACGGTCATCCTCTCCTGTGCCCTGACGGGCTCCCCAGAGTTCACCATCCGCTGGTATCGCAACACGGAGCTGGTGCTGCCTGACGAGGCCATCTCCATCCGCGGGCTCAGCAACGAGACGCTGCTCATCACCTCGGCCCAGAAGAGCCATTCCGGGGCCTACCAGTGCTTCGCTACCCGCAAGGCCCAGACCGCCCAGGACTTTGCCATCATTGCACTTGAGG SEQ ID NO:876 DSCAML1之外顯子7 ATGGCACGCCCCGCATCGTCTCGTCCTTCAGCGAGAAGGTGGTCAACCCCGGGGAGCAGTTCTCACTGATGTGTGCGGCCAAGGGCGCCCCGCCCCCCACGGTCACCTGGGCCCTCGACGATGAGCCCATCGTGCGGGATGGCAGCCACCGCACCAACCAGTACACCATGTCGGACGGCACCACCATCAGCCACATGAACGTCACAGGCCCCCAGATCCGCGACGGGGGCGTGTACCGGTGCACAGCGCGGAACTTGGTGGGCAGTGCTGAATATCAGGCGCGAATAAACGTAAGAG SEQ ID NO:877 DSCAML1之外顯子8 GCCCACCCAGCATCCGGGCTATGCGGAACATCACAGCAGTCGCCGGGCGGGACACCCTTATCAACTGCAGGGTCATCGGCTATCCCTACTACTCCATCAAGTGGTACAAGGATGCCCTGCTGCTGCCAGACAACCACCGCCAGGTGGTGTTTGAGAATGGGACCCTCAAGCTGACTGACGTGCAGAAGGGCATGGATGAGGGGGAGTACCTGTGCAGTGTCCTCATCCAGCCCCAGCTCTCCATCAGCCAGAGCGTTCACGTAGCCGTCAAAG SEQ ID NO:878 DSCAML1之外顯子9 TGCCCCCTCTGATCCAGCCCTTCGAATTCCCACCCGCCTCCATCGGCCAGCTGCTCTACATTCCCTGTGTGGTGTCCTCGGGGGACATGCCCATCCGTATCACCTGGAGGAAGGACGGACAGGTGATCATCTCAGGCTCGGGCGTGACCATCGAGAGCAAGGAATTCATGAGCTCCCTGCAGATCTCTAGCGTCTCCCTCAAGCACAACGGCAACTATACATGCATCGCCAGCAACGCAGCCGCCACCGTGAGCCGGGAGCGCCAGCTCATCGTGCGTG SEQ ID NO:879 DSCAML1之外顯子10 TGCCCCCTCGATTTGTGGTGCAACCCAACAACCAGGATGGCATCTACGGCAAAGCTGGTGTGCTCAACTGCTCGGTGGACGGCTACCCCCCACCCAAGGTCATGTGGAAGCATGCCAAGG SEQ ID NO:880 DSCAML1之外顯子11 GGAGCGGGAACCCCCAGCAGTACCACCCTGTGCCCCTCACTGGCCGCATCCAGATCCTGCCCAACAGCTCGCTGCTGATCCGCCACGTCCTAGAAGAGGACATCGGCTACTACCTCTGCCAGGCCAGCAACGGCGTAGGCACCGACATCAGCAAGTCCATGTTCCTCACAGTCAAGA SEQ ID NO:881 DSCAML1之外顯子12 TCCCGGCCATGATCACTTCCCACCCCAACACCACCATCGCCATCAAGGGCCATGCGAAGGAGCTAAACTGCACGGCACGGGGTGAGCGGCCCATCATCATCCGCTGGGAGAAGGGGGACACAGTCATCGACCCTGACCGCGTCATGCGGTATGCCATCGCCACCAAGGACAACGGCGACGAGGTCGTCTCCACACTGAAG SEQ ID NO:882 DSCAML1之外顯子13 CTCAAGCCCGCTGACCGTGGGGACTCTGTGTTCTTCAGCTGCCATGCCATCAACTCGTATGGGGAGGACCGGGGCTTGATCCAACTCACTGTGCAAG SEQ ID NO:883 DSCAML1之外顯子14 AGCCCCCCGACCCCCCAGAGCTGGAGATCCGGGAGGTGAAGGCCCGGAGCATGAACCTGCGCTGGACCCAGCGATTCGACGGGAACAGCATCATCACGGGCTTCGACATTGAATACAAGAACAAATCAG SEQ ID NO:884 DSCAML1之外顯子15 ATTCCTGGGACTTCAAGCAGTCCACACGCAACATCTCCCCCACCATCAACCAGGCCAACATTGTGGACTTGCACCCGGCATCTGTGTACAGCATCCGCATGTACTCTTTCAACAAGATTGGCCGCAGTGAACCAAGCAAGGAGCTCACCATCAGCACTGAGGAGGCCG SEQ ID NO:885 DSCAML1之外顯子16 CTCCCGATGGGCCCCCCATGGATGTTACCTTGCAGCCAGTGACCTCACAGAGCATCCAGGTGACCTGGAAG SEQ ID NO:886 DSCAML1之外顯子17 GCACCCAAGAAGGAGCTGCAGAACGGTGTCATCCGGGGCTACCAGATTGGCTACAGAGAGAACAGCCCCGGCAGCAACGGGCAGTACAGCATCGTGGAGATGAAGGCCACGGGGGACAGCGAGGTCTACACCCTGGACAACCTCAAGAAGTTCGCCCAGTATGGGGTGGTGGTCCAAGCCTTCAATCGGGCTGGCACGGGGCCCTCTTCCAGCGAGATCAATGCCACCACTCTGGAGGATG SEQ ID NO:887 DSCAML1之外顯子18 TGCCCAGCCAGCCCCCTGAGAACGTCCGGGCCCTGTCCATCACTTCTGACGTGGCCGTCATCTCCTGGTCAGAGCCCCCGCGCAGCACCCTCAATGGCGTCCTCAAAGGCTATCGGGTCATCTTCTGGTCCCTCTATGTTGATGGGG SEQ ID NO:888 DSCAML1之外顯子19 AGTGGGGCGAGATGCAGAACATCACCACCACGCGGGAGCGGGTGGAGCTGCGGGGCATGGAGAAGTTCACCAACTACAGCGTCCAGGTGCTGGCCTACACCCAGGCTGGGGACGGCGTACGCAGCAGTGTGCTCTACATCCAGACCAAGGAGGACG SEQ ID NO:889 DSCAML1之外顯子20 TTCCAGGTCCCCCTGCTGGCATCAAAGCTGTCCCTTCATCAGCTAGCAGTGTGGTTGTGTCTTGGCTCCCCCCTACCAAGCCCAACGGGGTGATCCGCAAGTACACCATCTTCTGTTCCAGCCCCGGGTCTGGCCAGCCG SEQ ID NO:890 DSCAML1之外顯子21 GCTCCCAGCGAGTACGAGACGAGTCCAGAGCAGCTCTTCTACCGGATCGCCCACCTAAACCGCGGTCAGCAGTATCTGCTGTGGGTGGCCGCCGTCACCTCTGCCGGCCGGGGCAACAGCAGCGAGAAGGTGACCATCGAGCCTGCTGGCAAGG SEQ ID NO:891 DSCAML1之外顯子22 CCCCAGCAAAGATCATCTCCTTTGGGGGCACCGTGACAACACCTTGGATGAAAGATGTTCGGCTGCCTTGCAATTCAGTGGGAGATCCAGCCCCTGCTGTGAAGTGGACCAAGGACAG SEQ ID NO:892 DSCAML1之外顯子23 TGAAGACTCGGCCATTCCAGTGTCCATGGATGGGCACCGGCTCATCCACACCAATGGCACACTGCTGCTGCGTGCAGTGAAGGCTGAGGACTCTGGCTACTACACGTGCACGGCCACCAACACTGGTGGCTTTGACACCATCATCGTCAACCTTCTGGTGCAAG SEQ ID NO:893 DSCAML1之外顯子24 TTCCCCCGGACCAGCCCCGCCTCACTGTCTCCAAAACCTCAGCTTCGTCCATCACCCTGACCTGGATTCCAGGTGACAATGGGGGCAGCTCCATCCGAG SEQ ID NO:894 DSCAML1之外顯子25 GCTTCGTGCTACAGTACTCGGTGGACAACAGCGAGGAGTGGAAGGATGTGTTCATCAGCTCCAGCGAGCGCTCCTTCAAGCTGGACAGCCTCAAGTGTGGCACGTGGTACAAGGTGAAGCTGGCAGCCAAGAACAGCGTGGGCTCTGGGCGCATCAGCGAGATCATCGAGGCCAAGACCCACGGGCGGG SEQ ID NO:895 DSCAML1之外顯子26 AGCCCTCCTTCAGCAAAGACCAACACCTCTTCACCCACATCAACTCCACGCATGCTCGGCTTAACCTGCAGGGCTGGAACAATGGGGGCTGCCCTATCACAGCCATCGTTCTGGAGTACCGGCCCAAGGGGACCTGGGCCTGGCAGGGCCTCCGGGCCAACAGCTCCGGGGAGGTGTTTCTGACGGAACTGCGAGAGGCCACGTGGTACGAGCTGCGCATGAGGGCTTGCAACAGTGCGGGCTGCGGCAATGAAACAGCCCAGTTCGCCACCCTGGACTACGATGGCA SEQ ID NO:896 DSCAML1之外顯子27 GCACCATTCCACCCATCAAGTCTGCTCAAGGTGAAGGGGATGATGTGAAGAAGCTGTTCACCATCGGCTGCCCTGTCATCCTGGCCACACTGGGGGTGGCACTGCTCTTCATCGTACGCAAGAAGAGGAAGGAGAAACGGCTGAAGCGACTCCGAG SEQ ID NO:897 DSCAML1之外顯子28 ATGCAAAGAGTTTGGCAGAAATGTTGATAAG SEQ ID NO:898 DSCAML1之外顯子29 CAAGAACAATAGAAGCTTTGACACCCCTGTGAAAGGGCCACCCCAGGGCCCACGGCTACACATTGACATCCCCAGGGTCCAGCTGCTCATCGAGGACAAAGAAGGCATCAAGCAACTGG SEQ ID NO:899 DSCAML1之外顯子30 GAGATGACAAGGCCACCATCCCTGTGACAGATGCTGAGTTCAGCCAAGCTGTCAACCCACAGAGCTTCTGTACTGGCGTCTCCTTGCACCACCCAACCCTCATCCAGAGCACAGGACCCCTCATCGACATGTCTGACATCCGGCCAGGAACCA SEQ ID NO:900 DSCAML1之外顯子31 ATCCAGTGTCCAGGAAGAATGTGAAGTCAGCCCACAGCACCCGGAACCGGTACTCAAGCCAGTGGACCCTGACCAAGTGCCAGGCCTCCACACCTGCCCGCACCCTCACCTCCGACTGGCGCACCGTGGGCTCCCAGCATGGTGTCACGGTCACTGAGAGTGACAGCTACAGTGCCAGCCTGTCCCAGGACACAG SEQ ID NO:901 DSCAML1之外顯子32 ACAAAGGAAGGAACAGCATGGTGTCCACTGAGAGTGCCTCTTCCACCTACGAGGAGCTGGCCCGGGCCTATGAGCATGCCAAGCTGGAGGAGCAGCTGCAGCACGCCAAGTTTGAGATCACCGAGTGCTTCATCTCTGACAGTTCCTCTGACCAGATGACCACAGGCACCAACGAGAACGCCGACAGCATGACATCCATGAGCACACCCTCAGAGCCTGGCATCTGCCGCTTTACCGCCTCACCACCCAAGCCCCAGGATGCGGACCGGGGCAAAAACGTGGCTGTGCCCATCCCTCACCGGGCCAACAAGA SEQ ID NO:902 DSCAML1之外顯子33 GTGACTACTGCAACCTGCCCCTGTATGCCAAGTCAGAGGCCTTCTTTCGAAAGGCAGATGGACGTGAGCCCTGCCCCGTGGTCCCACCCCGTGAGGCCTCCATCCGGAACCTGGCTCGAACCTACCACACCCAGGCTCGCCACCTGACCCTGGACCCTGCCAGCAAGTCCTTGGGCCTTCCCCACCCAGGGGCCCCCGCTGCCGCCTCCACAGCCACCTTACCTCAGAGGACTCTGGCCATGCCAGCCCCCCCAGCCGGCACAGCCCCCCCAGCCCCCGGCCCCACCCCTGCTGAGCCACCCACCGCCCCCAGCGCTGCCCCTCCGGCCCCCAGCACCGAGCCTCCACGAGCCGGGGGCCCACACACCAAAATGGGGGGCTCCAGGGACTCGCTTCTCGAGATGAGCACATCGGGGGTAGGGAGGTCTCAGAAGCAGGGGGCCGGGGCCTACTCCAAATCCTACACCCTGGTGTAGGGCCCGCAGGAAGAGCAGCCACGCCTGGACCGCGCCGCGCCGCAGCCCCACACGCCAGCTCGGCTGTTTTTCTGCATTATTTATATTCAACTGACAGACAAAAACCAACCAACGACAAAACAAAAACCCCCAATCATGAACGCCTGTACATAGAACTCTTTTGTACAAATGAAACTATTTTCTTCTTCTCCATGAAGCCAGGGCACAAAGAATTTGACAGTACAAGTCAAATCCCCCACCCCACAAAATATGTGTGGAGATATATATACATATATAGACAGACAGGAACGCGTCCACGAGCTATATATCTATATATTTCTCTCACCCTATTTTGAGACAGAGGCACAAAGACTCAGCAATTTTTTTCCCTCCTCCTCACCTTCCCCCCAGTCTAGGTGGTTTTGACAAAGACCAAAATCCCAACTCAGAGACACTGCATGCGATTTTACTGTTCCAAGAAAACCAGGAGTTGCTTCAATTTGCAGATGCTTATGTGTTAATACCTTTTTCTATGAAAAAAGACCCAGCGCCGTGTGCAATAAAGGTTATGTTTCTA SEQ ID NO:903 DSCAML1之全mRNA多核苷酸序列 AGCCGAGCGCTGGGCTGAGGAGCAGAGAGAGCGGGGCGCCGAGTGCGGGCGGCTGGGAGCGCGCTGAGCGGGGGAGAGGCGCTGCCGCACGGCCGGCCACAGGACCACCTCCCCGGAGAATAGGGCCTCTTTATGGCATGTGGCTGGTAACTTTCCTCCTGCTCCTGGACTCTTTACACAAAGCCCGCCCTGAAGATGTTGGCACCAGCCTCTACTTTGTAAATGACTCCTTGCAGCAGGTGACCTTTTCCAGCTCCGTGGGGGTGGTGGTGCCCTGCCCGGCCGCGGGCTCCCCCAGCGCGGCCCTTCGATGGTACCTGGCCACAGGGGACGACATCTACGACGTGCCGCACATCCGGCACGTCCACGCCAACGGGACGCTGCAGCTCTACCCCTTCTCCCCCTCCGCCTTCAATAGCTTTATCCACGACAATGACTACTTCTGCACCGCGGAGAACGCTGCCGGCAAGATCCGGAGCCCCAACATCCGCGTCAAAGCAGTTTTCAGGGAACCCTACACCGTCCGGGTGGAGGATCAAAGGTCAATGCGTGGCAACGTGGCCGTCTTCAAGTGCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAGAACACAGGTTTTTTATTACCTACCACGGCGGGCTGTACATCTCTGACGTACAGAAGGAGGACGCCCTCTCCACCTATCGCTGCATCACCAAGCACAAGTATAGCGGGGAGACCCGGCAGAGCAATGGGGCACGCCTCTCTGTGACAGACCCTGCTGAGTCGATCCCCACCATCCTGGATGGCTTCCACTCCCAGGAAGTGTGGGCCGGCCACACCGTGGAGCTGCCCTGCACCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGCTCAAGGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCTGACCATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGTCACCAACACCTTCGGTTCGGCAGAGGCCACAGGCATCCTCATGGTCATTGATCCCCTTCATGTGACCCTGACACCAAAGAAGCTGAAGACCGGCATTGGCAGCACGGTCATCCTCTCCTGTGCCCTGACGGGCTCCCCAGAGTTCACCATCCGCTGGTATCGCAACACGGAGCTGGTGCTGCCTGACGAGGCCATCTCCATCCGCGGGCTCAGCAACGAGACGCTGCTCATCACCTCGGCCCAGAAGAGCCATTCCGGGGCCTACCAGTGCTTCGCTACCCGCAAGGCCCAGACCGCCCAGGACTTTGCCATCATTGCACTTGAGGATGGCACGCCCCGCATCGTCTCGTCCTTCAGCGAGAAGGTGGTCAACCCCGGGGAGCAGTTCTCACTGATGTGTGCGGCCAAGGGCGCCCCGCCCCCCACGGTCACCTGGGCCCTCGACGATGAGCCCATCGTGCGGGATGGCAGCCACCGCACCAACCAGTACACCATGTCGGACGGCACCACCATCAGCCACATGAACGTCACAGGCCCCCAGATCCGCGACGGGGGCGTGTACCGGTGCACAGCGCGGAACTTGGTGGGCAGTGCTGAATATCAGGCGCGAATAAACGTAAGAGGCCCACCCAGCATCCGGGCTATGCGGAACATCACAGCAGTCGCCGGGCGGGACACCCTTATCAACTGCAGGGTCATCGGCTATCCCTACTACTCCATCAAGTGGTACAAGGATGCCCTGCTGCTGCCAGACAACCACCGCCAGGTGGTGTTTGAGAATGGGACCCTCAAGCTGACTGACGTGCAGAAGGGCATGGATGAGGGGGAGTACCTGTGCAGTGTCCTCATCCAGCCCCAGCTCTCCATCAGCCAGAGCGTTCACGTAGCCGTCAAAGTGCCCCCTCTGATCCAGCCCTTCGAATTCCCACCCGCCTCCATCGGCCAGCTGCTCTACATTCCCTGTGTGGTGTCCTCGGGGGACATGCCCATCCGTATCACCTGGAGGAAGGACGGACAGGTGATCATCTCAGGCTCGGGCGTGACCATCGAGAGCAAGGAATTCATGAGCTCCCTGCAGATCTCTAGCGTCTCCCTCAAGCACAACGGCAACTATACATGCATCGCCAGCAACGCAGCCGCCACCGTGAGCCGGGAGCGCCAGCTCATCGTGCGTGTGCCCCCTCGATTTGTGGTGCAACCCAACAACCAGGATGGCATCTACGGCAAAGCTGGTGTGCTCAACTGCTCGGTGGACGGCTACCCCCCACCCAAGGTCATGTGGAAGCATGCCAAGGGGAGCGGGAACCCCCAGCAGTACCACCCTGTGCCCCTCACTGGCCGCATCCAGATCCTGCCCAACAGCTCGCTGCTGATCCGCCACGTCCTAGAAGAGGACATCGGCTACTACCTCTGCCAGGCCAGCAACGGCGTAGGCACCGACATCAGCAAGTCCATGTTCCTCACAGTCAAGATCCCGGCCATGATCACTTCCCACCCCAACACCACCATCGCCATCAAGGGCCATGCGAAGGAGCTAAACTGCACGGCACGGGGTGAGCGGCCCATCATCATCCGCTGGGAGAAGGGGGACACAGTCATCGACCCTGACCGCGTCATGCGGTATGCCATCGCCACCAAGGACAACGGCGACGAGGTCGTCTCCACACTGAAGCTCAAGCCCGCTGACCGTGGGGACTCTGTGTTCTTCAGCTGCCATGCCATCAACTCGTATGGGGAGGACCGGGGCTTGATCCAACTCACTGTGCAAGAGCCCCCCGACCCCCCAGAGCTGGAGATCCGGGAGGTGAAGGCCCGGAGCATGAACCTGCGCTGGACCCAGCGATTCGACGGGAACAGCATCATCACGGGCTTCGACATTGAATACAAGAACAAATCAGATTCCTGGGACTTCAAGCAGTCCACACGCAACATCTCCCCCACCATCAACCAGGCCAACATTGTGGACTTGCACCCGGCATCTGTGTACAGCATCCGCATGTACTCTTTCAACAAGATTGGCCGCAGTGAACCAAGCAAGGAGCTCACCATCAGCACTGAGGAGGCCGCTCCCGATGGGCCCCCCATGGATGTTACCTTGCAGCCAGTGACCTCACAGAGCATCCAGGTGACCTGGAAGGCACCCAAGAAGGAGCTGCAGAACGGTGTCATCCGGGGCTACCAGATTGGCTACAGAGAGAACAGCCCCGGCAGCAACGGGCAGTACAGCATCGTGGAGATGAAGGCCACGGGGGACAGCGAGGTCTACACCCTGGACAACCTCAAGAAGTTCGCCCAGTATGGGGTGGTGGTCCAAGCCTTCAATCGGGCTGGCACGGGGCCCTCTTCCAGCGAGATCAATGCCACCACTCTGGAGGATGTGCCCAGCCAGCCCCCTGAGAACGTCCGGGCCCTGTCCATCACTTCTGACGTGGCCGTCATCTCCTGGTCAGAGCCCCCGCGCAGCACCCTCAATGGCGTCCTCAAAGGCTATCGGGTCATCTTCTGGTCCCTCTATGTTGATGGGGAGTGGGGCGAGATGCAGAACATCACCACCACGCGGGAGCGGGTGGAGCTGCGGGGCATGGAGAAGTTCACCAACTACAGCGTCCAGGTGCTGGCCTACACCCAGGCTGGGGACGGCGTACGCAGCAGTGTGCTCTACATCCAGACCAAGGAGGACGTTCCAGGTCCCCCTGCTGGCATCAAAGCTGTCCCTTCATCAGCTAGCAGTGTGGTTGTGTCTTGGCTCCCCCCTACCAAGCCCAACGGGGTGATCCGCAAGTACACCATCTTCTGTTCCAGCCCCGGGTCTGGCCAGCCGGCTCCCAGCGAGTACGAGACGAGTCCAGAGCAGCTCTTCTACCGGATCGCCCACCTAAACCGCGGTCAGCAGTATCTGCTGTGGGTGGCCGCCGTCACCTCTGCCGGCCGGGGCAACAGCAGCGAGAAGGTGACCATCGAGCCTGCTGGCAAGGCCCCAGCAAAGATCATCTCCTTTGGGGGCACCGTGACAACACCTTGGATGAAAGATGTTCGGCTGCCTTGCAATTCAGTGGGAGATCCAGCCCCTGCTGTGAAGTGGACCAAGGACAGTGAAGACTCGGCCATTCCAGTGTCCATGGATGGGCACCGGCTCATCCACACCAATGGCACACTGCTGCTGCGTGCAGTGAAGGCTGAGGACTCTGGCTACTACACGTGCACGGCCACCAACACTGGTGGCTTTGACACCATCATCGTCAACCTTCTGGTGCAAGTTCCCCCGGACCAGCCCCGCCTCACTGTCTCCAAAACCTCAGCTTCGTCCATCACCCTGACCTGGATTCCAGGTGACAATGGGGGCAGCTCCATCCGAGGCTTCGTGCTACAGTACTCGGTGGACAACAGCGAGGAGTGGAAGGATGTGTTCATCAGCTCCAGCGAGCGCTCCTTCAAGCTGGACAGCCTCAAGTGTGGCACGTGGTACAAGGTGAAGCTGGCAGCCAAGAACAGCGTGGGCTCTGGGCGCATCAGCGAGATCATCGAGGCCAAGACCCACGGGCGGGAGCCCTCCTTCAGCAAAGACCAACACCTCTTCACCCACATCAACTCCACGCATGCTCGGCTTAACCTGCAGGGCTGGAACAATGGGGGCTGCCCTATCACAGCCATCGTTCTGGAGTACCGGCCCAAGGGGACCTGGGCCTGGCAGGGCCTCCGGGCCAACAGCTCCGGGGAGGTGTTTCTGACGGAACTGCGAGAGGCCACGTGGTACGAGCTGCGCATGAGGGCTTGCAACAGTGCGGGCTGCGGCAATGAAACAGCCCAGTTCGCCACCCTGGACTACGATGGCAGCACCATTCCACCCATCAAGTCTGCTCAAGGTGAAGGGGATGATGTGAAGAAGCTGTTCACCATCGGCTGCCCTGTCATCCTGGCCACACTGGGGGTGGCACTGCTCTTCATCGTACGCAAGAAGAGGAAGGAGAAACGGCTGAAGCGACTCCGAGATGCAAAGAGTTTGGCAGAAATGTTGATAAGCAAGAACAATAGAAGCTTTGACACCCCTGTGAAAGGGCCACCCCAGGGCCCACGGCTACACATTGACATCCCCAGGGTCCAGCTGCTCATCGAGGACAAAGAAGGCATCAAGCAACTGGGAGATGACAAGGCCACCATCCCTGTGACAGATGCTGAGTTCAGCCAAGCTGTCAACCCACAGAGCTTCTGTACTGGCGTCTCCTTGCACCACCCAACCCTCATCCAGAGCACAGGACCCCTCATCGACATGTCTGACATCCGGCCAGGAACCAATCCAGTGTCCAGGAAGAATGTGAAGTCAGCCCACAGCACCCGGAACCGGTACTCAAGCCAGTGGACCCTGACCAAGTGCCAGGCCTCCACACCTGCCCGCACCCTCACCTCCGACTGGCGCACCGTGGGCTCCCAGCATGGTGTCACGGTCACTGAGAGTGACAGCTACAGTGCCAGCCTGTCCCAGGACACAGACAAAGGAAGGAACAGCATGGTGTCCACTGAGAGTGCCTCTTCCACCTACGAGGAGCTGGCCCGGGCCTATGAGCATGCCAAGCTGGAGGAGCAGCTGCAGCACGCCAAGTTTGAGATCACCGAGTGCTTCATCTCTGACAGTTCCTCTGACCAGATGACCACAGGCACCAACGAGAACGCCGACAGCATGACATCCATGAGCACACCCTCAGAGCCTGGCATCTGCCGCTTTACCGCCTCACCACCCAAGCCCCAGGATGCGGACCGGGGCAAAAACGTGGCTGTGCCCATCCCTCACCGGGCCAACAAGAGTGACTACTGCAACCTGCCCCTGTATGCCAAGTCAGAGGCCTTCTTTCGAAAGGCAGATGGACGTGAGCCCTGCCCCGTGGTCCCACCCCGTGAGGCCTCCATCCGGAACCTGGCTCGAACCTACCACACCCAGGCTCGCCACCTGACCCTGGACCCTGCCAGCAAGTCCTTGGGCCTTCCCCACCCAGGGGCCCCCGCTGCCGCCTCCACAGCCACCTTACCTCAGAGGACTCTGGCCATGCCAGCCCCCCCAGCCGGCACAGCCCCCCCAGCCCCCGGCCCCACCCCTGCTGAGCCACCCACCGCCCCCAGCGCTGCCCCTCCGGCCCCCAGCACCGAGCCTCCACGAGCCGGGGGCCCACACACCAAAATGGGGGGCTCCAGGGACTCGCTTCTCGAGATGAGCACATCGGGGGTAGGGAGGTCTCAGAAGCAGGGGGCCGGGGCCTACTCCAAATCCTACACCCTGGTGTAGGGCCCGCAGGAAGAGCAGCCACGCCTGGACCGCGCCGCGCCGCAGCCCCACACGCCAGCTCGGCTGTTTTTCTGCATTATTTATATTCAACTGACAGACAAAAACCAACCAACGACAAAACAAAAACCCCCAATCATGAACGCCTGTACATAGAACTCTTTTGTACAAATGAAACTATTTTCTTCTTCTCCATGAAGCCAGGGCACAAAGAATTTGACAGTACAAGTCAAATCCCCCACCCCACAAAATATGTGTGGAGATATATATACATATATAGACAGACAGGAACGCGTCCACGAGCTATATATCTATATATTTCTCTCACCCTATTTTGAGACAGAGGCACAAAGACTCAGCAATTTTTTTCCCTCCTCCTCACCTTCCCCCCAGTCTAGGTGGTTTTGACAAAGACCAAAATCCCAACTCAGAGACACTGCATGCGATTTTACTGTTCCAAGAAAACCAGGAGTTGCTTCAATTTGCAGATGCTTATGTGTTAATACCTTTTTCTATGAAAAAAGACCCAGCGCCGTGTGCAATAAAGGTTATGTTTCTA SEQ ID NO:904 DSCAML1之外顯子1的轉譯多肽序列 MWLVTFLLLLDSLHK SEQ ID NO:905 DSCAML1之外顯子2的轉譯多肽序列 RPEDVGTSLYFVNDSLQQVTFSSSVGVVVPCPAAGSPSAALRWYLATGDDIYDVPHIRHVHANGTLQLYPFSPSAFNSFIHDNDYFCTAENAAGKIRSPNIRVKA SEQ ID NO:906 DSCAML1之外顯子3的轉譯多肽序列 FREPYTVRVEDQRSMRGNVAVFKCLIPSSVQEYVSVVSWEKDTVSIIP SEQ ID NO:907 DSCAML1之外顯子4的轉譯多肽序列 HRFFITYHGGLYISDVQKEDALSTYRCITKHKYSGETRQSNGARLSVT SEQ ID NO:908 DSCAML1之外顯子5的轉譯多肽序列 PAESIPTILDGFHSQEVWAGHTVELPCTASGYPIPAIRWLKDGRPLPADSRWTKRITGLTISDLRTEDSGTYICEVTNTFGSAEATGILMVI SEQ ID NO:909 DSCAML1之外顯子6的轉譯多肽序列 PLHVTLTPKKLKTGIGSTVILSCALTGSPEFTIRWYRNTELVLPDEAISIRGLSNETLLITSAQKSHSGAYQCFATRKAQTAQDFAIIALE SEQ ID NO:910 DSCAML1之外顯子7的轉譯多肽序列 GTPRIVSSFSEKVVNPGEQFSLMCAAKGAPPPTVTWALDDEPIVRDGSHRTNQYTMSDGTTISHMNVTGPQIRDGGVYRCTARNLVGSAEYQARINVR SEQ ID NO:911 DSCAML1之外顯子8的轉譯多肽序列 PPSIRAMRNITAVAGRDTLINCRVIGYPYYSIKWYKDALLLPDNHRQVVFENGTLKLTDVQKGMDEGEYLCSVLIQPQLSISQSVHVAVK SEQ ID NO:912 DSCAML1之外顯子9的轉譯多肽序列 PPLIQPFEFPPASIGQLLYIPCVVSSGDMPIRITWRKDGQVIISGSGVTIESKEFMSSLQISSVSLKHNGNYTCIASNAAATVSRERQLIVR SEQ ID NO:913 DSCAML1之外顯子10的轉譯多肽序列 PPRFVVQPNNQDGIYGKAGVLNCSVDGYPPPKVMWKHAK SEQ ID NO:914 DSCAML1之外顯子11的轉譯多肽序列 SGNPQQYHPVPLTGRIQILPNSSLLIRHVLEEDIGYYLCQASNGVGTDISKSMFLTVK SEQ ID NO:915 DSCAML1之外顯子12的轉譯多肽序列 PAMITSHPNTTIAIKGHAKELNCTARGERPIIIRWEKGDTVIDPDRVMRYAIATKDNGDEVVSTLK SEQ ID NO:916 DSCAML1之外顯子13的轉譯多肽序列 LKPADRGDSVFFSCHAINSYGEDRGLIQLTVQ SEQ ID NO:917 DSCAML1之外顯子14的轉譯多肽序列 PPDPPELEIREVKARSMNLRWTQRFDGNSIITGFDIEYKNKS SEQ ID NO:918 DSCAML1之外顯子15的轉譯多肽序列 SWDFKQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIGRSEPSKELTISTEEA SEQ ID NO:919 DSCAML1之外顯子16的轉譯多肽序列 PDGPPMDVTLQPVTSQSIQVTWK SEQ ID NO:920 DSCAML1之外顯子17的轉譯多肽序列 APKKELQNGVIRGYQIGYRENSPGSNGQYSIVEMKATGDSEVYTLDNLKKFAQYGVVVQAFNRAGTGPSSSEINATTLED SEQ ID NO:921 DSCAML1之外顯子18的轉譯多肽序列 PSQPPENVRALSITSDVAVISWSEPPRSTLNGVLKGYRVIFWSLYVDG SEQ ID NO:922 DSCAML1之外顯子19的轉譯多肽序列 WGEMQNITTTRERVELRGMEKFTNYSVQVLAYTQAGDGVRSSVLYIQTKED SEQ ID NO:923 DSCAML1之外顯子20的轉譯多肽序列 PGPPAGIKAVPSSASSVVVSWLPPTKPNGVIRKYTIFCSSPGSGQP SEQ ID NO:924 DSCAML1之外顯子21的轉譯多肽序列 APSEYETSPEQLFYRIAHLNRGQQYLLWVAAVTSAGRGNSSEKVTIEPAGK SEQ ID NO:925 DSCAML1之外顯子22的轉譯多肽序列 PAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKD SEQ ID NO:926 DSCAML1之外顯子23的轉譯多肽序列 EDSAIPVSMDGHRLIHTNGTLLLRAVKAEDSGYYTCTATNTGGFDTIIVNLLVQ SEQ ID NO:927 DSCAML1之外顯子24的轉譯多肽序列 PPDQPRLTVSKTSASSITLTWIPGDNGGSSIR SEQ ID NO:928 DSCAML1之外顯子25的轉譯多肽序列 FVLQYSVDNSEEWKDVFISSSERSFKLDSLKCGTWYKVKLAAKNSVGSGRISEIIEAKTHGR SEQ ID NO:929 DSCAML1之外顯子26的轉譯多肽序列 PSFSKDQHLFTHINSTHARLNLQGWNNGGCPITAIVLEYRPKGTWAWQGLRANSSGEVFLTELREATWYELRMRACNSAGCGNETAQFATLDYDG SEQ ID NO:930 DSCAML1之外顯子27的轉譯多肽序列 TIPPIKSAQGEGDDVKKLFTIGCPVILATLGVALLFIVRKKRKEKRLKRLR SEQ ID NO:931 DSCAML1之外顯子28的轉譯多肽序列 AKSLAEMLI SEQ ID NO:932 DSCAML1之外顯子29的轉譯多肽序列 KNNRSFDTPVKGPPQGPRLHIDIPRVQLLIEDKEGIKQL SEQ ID NO:933 DSCAML1之外顯子30的轉譯多肽序列 DDKATIPVTDAEFSQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGT SEQ ID NO:934 DSCAML1之外顯子31的轉譯多肽序列 PVSRKNVKSAHSTRNRYSSQWTLTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDT SEQ ID NO:935 DSCAML1之外顯子32的轉譯多肽序列 KGRNSMVSTESASSTYEELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNENADSMTSMSTPSEPGICRFTASPPKPQDADRGKNVAVPIPHRANK SEQ ID NO:936 DSCAML1之外顯子33的轉譯多肽序列 DYCNLPLYAKSEAFFRKADGREPCPVVPPREASIRNLARTYHTQARHLTLDPASKSLGLPHPGAPAAASTATLPQRTLAMPAPPAGTAPPAPGPTPAEPPTAPSAAPPAPSTEPPRAGGPHTKMGGSRDSLLEMSTSGVGRSQKQGAGAYSKSYTLV SEQ ID NO:937 DSCAML1之全蛋白質序列 MWLVTFLLLLDSLHKARPEDVGTSLYFVNDSLQQVTFSSSVGVVVPCPAAGSPSAALRWYLATGDDIYDVPHIRHVHANGTLQLYPFSPSAFNSFIHDNDYFCTAENAAGKIRSPNIRVKAVFREPYTVRVEDQRSMRGNVAVFKCLIPSSVQEYVSVVSWEKDTVSIIPEHRFFITYHGGLYISDVQKEDALSTYRCITKHKYSGETRQSNGARLSVTDPAESIPTILDGFHSQEVWAGHTVELPCTASGYPIPAIRWLKDGRPLPADSRWTKRITGLTISDLRTEDSGTYICEVTNTFGSAEATGILMVIDPLHVTLTPKKLKTGIGSTVILSCALTGSPEFTIRWYRNTELVLPDEAISIRGLSNETLLITSAQKSHSGAYQCFATRKAQTAQDFAIIALEDGTPRIVSSFSEKVVNPGEQFSLMCAAKGAPPPTVTWALDDEPIVRDGSHRTNQYTMSDGTTISHMNVTGPQIRDGGVYRCTARNLVGSAEYQARINVRGPPSIRAMRNITAVAGRDTLINCRVIGYPYYSIKWYKDALLLPDNHRQVVFENGTLKLTDVQKGMDEGEYLCSVLIQPQLSISQSVHVAVKVPPLIQPFEFPPASIGQLLYIPCVVSSGDMPIRITWRKDGQVIISGSGVTIESKEFMSSLQISSVSLKHNGNYTCIASNAAATVSRERQLIVRVPPRFVVQPNNQDGIYGKAGVLNCSVDGYPPPKVMWKHAKGSGNPQQYHPVPLTGRIQILPNSSLLIRHVLEEDIGYYLCQASNGVGTDISKSMFLTVKIPAMITSHPNTTIAIKGHAKELNCTARGERPIIIRWEKGDTVIDPDRVMRYAIATKDNGDEVVSTLKLKPADRGDSVFFSCHAINSYGEDRGLIQLTVQEPPDPPELEIREVKARSMNLRWTQRFDGNSIITGFDIEYKNKSDSWDFKQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIGRSEPSKELTISTEEAAPDGPPMDVTLQPVTSQSIQVTWKAPKKELQNGVIRGYQIGYRENSPGSNGQYSIVEMKATGDSEVYTLDNLKKFAQYGVVVQAFNRAGTGPSSSEINATTLEDVPSQPPENVRALSITSDVAVISWSEPPRSTLNGVLKGYRVIFWSLYVDGEWGEMQNITTTRERVELRGMEKFTNYSVQVLAYTQAGDGVRSSVLYIQTKEDVPGPPAGIKAVPSSASSVVVSWLPPTKPNGVIRKYTIFCSSPGSGQPAPSEYETSPEQLFYRIAHLNRGQQYLLWVAAVTSAGRGNSSEKVTIEPAGKAPAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKDSEDSAIPVSMDGHRLIHTNGTLLLRAVKAEDSGYYTCTATNTGGFDTIIVNLLVQVPPDQPRLTVSKTSASSITLTWIPGDNGGSSIRGFVLQYSVDNSEEWKDVFISSSERSFKLDSLKCGTWYKVKLAAKNSVGSGRISEIIEAKTHGREPSFSKDQHLFTHINSTHARLNLQGWNNGGCPITAIVLEYRPKGTWAWQGLRANSSGEVFLTELREATWYELRMRACNSAGCGNETAQFATLDYDGSTIPPIKSAQGEGDDVKKLFTIGCPVILATLGVALLFIVRKKRKEKRLKRLRDAKSLAEMLISKNNRSFDTPVKGPPQGPRLHIDIPRVQLLIEDKEGIKQLGDDKATIPVTDAEFSQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGTNPVSRKNVKSAHSTRNRYSSQWTLTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDTDKGRNSMVSTESASSTYEELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNENADSMTSMSTPSEPGICRFTASPPKPQDADRGKNVAVPIPHRANKSDYCNLPLYAKSEAFFRKADGREPCPVVPPREASIRNLARTYHTQARHLTLDPASKSLGLPHPGAPAAASTATLPQRTLAMPAPPAGTAPPAPGPTPAEPPTAPSAAPPAPSTEPPRAGGPHTKMGGSRDSLLEMSTSGVGRSQKQGAGAYSKSYTLV SEQ ID NO:938 DSCAML1之全部外顯子1-3按順序排列的多核苷酸序列 SEQ ID NO:939 DSCAML1之全部外顯子1-3按順序排列的轉譯多肽序列 of this disclosure sequence information VAPB sequence informationSEQ ID NO:1 VAPB exon 1 sequence at 5' of VAPB-NRG1 fusion CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAG SEQ ID NO:2 NRG1 exon 2 sequence at 3' of VAPB-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT SEQ ID NO:3 VAPB-NRG1 polynucleotide sequence CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTT SEQ ID NO:4 VAPB-NRG1 polypeptide sequence QVLSLEPQHELKFRALPPRLKEMKSQESAAG SEQ ID NO:17 VAPB exon 1 AGTCCCCCGCCCCTGGAGCCGGCGGCGCAGGGCGCAGCTTCCCGCCGCCAGAGCGGGCCAGCCTGCTGCGTGCGTGCGTGTGTACGACTCTGCGTGCGTGCGTGCGTGCGTGCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCACCGCGTAGACCGACCCCCCCCCAGCGCGCCCACCCGGTAGAGGACCCCGCGC CCGTGCCCCGACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGTGACCTCTCAGGGGTCTCCCCCGCCAAAGGTGCTCCGCCGCTAAGGAACATGGCGAAGGTGGAGCAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAG SEQ ID NO:18 VAPB exon 2 GTCCTTCACCGATGTTGTCACCACCAAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTACAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATTAATGTATCTG SEQ ID NO:19 VAPB exon 3 TGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTTTTATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTA SEQ ID NO:20 VAPB exon 4 TGGAAGGAGGCAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAATGATAAACCA SEQ ID NO:21 VAPB exon 5 CATGATGTAGAAATAAATAAAATTATATCCACAACTGCATCAAAGACAGAAACACCAATAGTGTCTAAGTCTCTGAGTTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGGAAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG SEQ ID NO:22 VAPB exon 6 GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGCCCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTATCGTTGGTGTAATTATTGGGAAGATTGCCTTGTAGAGGTAGCATGCACAGGATGGTAAATTGGATTGGTGGATCCACCATATCATGG GATTTAAAATTTATCATAACCATGTGTAAAAAAGAAATTAATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCCTGCACACACATACACAGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTGATTGAGGGGGAAAAGAATGATCTTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAATGGCATATT GTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCATCAGCTCCTT GGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAAT GTATTTCATCTCATGTTTTTCTTATTTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCTGTCCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCAGGAAGTGTTTTTTCTGGGTCAGTAAATAACAACT CATAGGGAGGGAAATTCTCAGTAGTGACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATACCACCCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCTTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGCACCAGCAGTTGTGT GGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTTAAGAATTCTTTTGTCATTTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTTATCATTTTTATTATTTTGC CATTGGAAGGTTAACTTTAAATGAGCCCTATCACTGAGAAATACGTGTTTCATGATTTAACTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTTTTTTTTGGTTGTCTTCAGCTGACAGTATGAAAAATGAAACTGCTGAAAAAAGCTGAGCACCTGGTCACCCTTGGCCTTCCATTGCTTTGGCCTTCAGTAAAAAGCAGCCTCCCTTTCTAGGTCAG GGAACCATGCCATTGAGACTAGTAACGGGCGTTCTGGGCACAGTCCCACTGTGCACAGGTTTGAGAGGACAAGTTCATCAGAAGGAAGGCAGTCCTTAGAAGTCACATACGTTGAGCCACGTTGCTCCTAAGCCTGGCTCTGTCCAAGCTGGGTCAGGGGCCTTGAAACTGGAGAAGTGGAAGTCTATGGTTGGTCTGAGTAAGTAACTTCCTGTCTTC ATGAAAAAAGTTGACTTTGAATCCCAGGTACTCACAGAAATGGTGAACAGACTTAGTTGTTACCCAGGCACCCATGGATTGTGTTGAGTGTGCAGACAGGGAGGCCACCCCAATAGGAATTCGTCTCCCAGGATTTTTCCCATGTGTCCCCAGTACTTATAAAAGGGAGTGAAAAGACCGAGCTGTAAGGCATGTGCCTTCTGCCACCTGACTTTCCGTGAGGGGACT AAAATTTACTAATTGTAGTTGCTGCCAGCCAGTTAAGTCCTGTAGCTTCCAGGCCCTCATGTCTTTGATAGGAGAGTGCTTAGGTGGTCCCAACAGTGCCTAGGGGTACAGTACAGTCCCATTACACTAGAGCAGGGCTCTATTTATTTTTAAAGGATATGGCCGTGTGTTTTGATAAAACTTTATTCACAAAACAGCCGGGTCATGGGATTTGGCTTGTGTGAGT CTGTAACAGTTCTTAAAAAGAATATCTGAGAAACTACTCTGTTTTGAGACCTTTGAAGGTGATTTAGAGTTTTGTGTACATCTAGGAGAAGGTGTTCAGCTTCTCAGAGGATGTGGACATTTTGGTTGCAGCTAAAAATCAGTCCTGAAGTCTCTCCCTTCTAGAGGTTAGGACTTGGTGAACATGTTTGTGGGCCTTTTGACTGAGTGGCAGAAGGAAACTGCTCAG GAAGAGAAACAGGTGACTGATGGGAAGGTTGATTATTTTCTCAGTCATCCTGGCAGCCAAAAATGTGCCAGGAAAAGAAAGAATGTGGAGCACGCGTGGCTCCTGGAGGACTTGGAGATGCATGCACATTTAGGGTGTTTTCCCTAGAATTACATAATGAAAAAAAGAATAAGGCAAAGAGGAGGTGAATATGGGGCCTGTCACAACGGCCTGCCCTGCCCCA AGAGGGTTAAGAGTCAGATAATCGGGACGAAACTGGCATGGAAAGAGCGAGCCTAGGGAGGATGCCGCTGGGCAGTGTGCATGGGGGAGCTGCTGCCAGGCTGCCCTCCAGTCTGCTCCTGTGGTTACTGGCTCCACAGCACCTCAGAGAGGGCGGCCCTGGCTTCAGAAATGCCAGCCATAGTGCTCACAAATGCAGAAGAGATGGAAGCG GTGACAGAATCCTGAAAGTTTTTATTGATTGAAGTTTTAAAATTGGTAACTTAAGCTTCCTGGCACGATACAAAATACCTCTTAAGACAGCAGGCTTTTTTATTTGTAGGTGTGAGGAACTGGCTTTAACTTTTTTCTCCTCCTAGTTTGCATGTTTTTCCTTCTCGTCTTCTGAACTGCTGGCACCAGCAGTAATACATACTGATAAAATCAAAATTGATTTTTAC CAGTGGCCAGTTTTATGGCTAGAGAGACGACTTATACCTCCATAACACAGAAGGGGGAAAAATGAAGACTCCAGTGATCCGTGAAAACCTAAACGCTTTCAAACAAATCCCAGGAACAGAATTGCTATCGAAAGATATCATTGCCCAGTTTGCAGGCTATGTTGAGTCAGATAGAACTGAATGTAGTGAGAGCTCAGAGCTACAGAGCCTTTCAGATGAATTTGAAAAC AGACTCTGTGTGTGTGTGCATGTGTGCATGTGTGCATGTGTGGCATATGTGCCGTATGTCAGTAGCTTGACAGTTTTTCAAATCGTGCCTATATTTTTTTTGCATACACAAAATTTTTGTGTTTGCAAACTCAGAATCCATGCCAAAATACAATGTTATATGTCATTTTCAGCTCCTTTCTAAAGGAATGGCCCATTTTCTCATTGTAGTTTGAGAAATACATGTATGAAGAGATAG GGGTCTTGGGCTTCCCAGTGTCACTTTGAACACCTGAATAACATTTAACTCCTGAGACCTTCTCGGTGTAGAGGCCACTGCTTCCCCCTGCTGGAGATGGCATTTCATTGAAGGGCCTCTCGTGGCTTTCCCTGCCCCCGGCTGTCTGGCCTGAAGAAGGAGAAAGAACCAAACTGAACTATGAAAAGTTACCACTCTGAGGAGACCCTCTCTTAATTAACACTT GGGGCCATGTTTGCTGTTGTTGAGAAGGAGTGTTCTCAAAGATGAGCTGGAATGGAATTGTATTTAGAAAGGCCCCTGCAAAGTATATAGATGGATGACTCTAGTTCATGACATACAAAATCCATAAGGCCAACGACCACTCTTCTGGAACACCAAGAGCAGCTCTGAGATCATGCTGGCCCTACGCGAATTGAGTTTCTGTGGCCTAATTGGATTTGGA GAACGCCTTCCCTGGCCCCTTTTCCTCAGACAGATCTGCTCTGATAGGAACCTTTTCAAGAAAGTTACTGTTGTTTCAATGCCACTCCTTACCTGTATAGAACATTTCCAAATACATTCGCTCATTGAACTTAATCCTTGCAACTGTGACTGGGGGGTAGATGGCTCTGTTTGCATACGAAGAAATAAAGGCTCCAGGAGGTTAAATCGGGCAACTTTTTAGAACTAA ATCAGTCTCTGTAAGGCCTACATTGCTAAGATACCATTTCAGCTCTGAAAATCTGCTTCAGGGAAGTGAGTGGATGAGGCCTTCCTGCCTCAGCTACTCTGCCCGTCTGTACATCTTTTGTGTCTGCCTCCGTACCTTATTTTTTCACACTAAAGTAAGTAGAATTAAGACTGTAGTTCAGATGCTTTTTCTTTTTCTGTTGGAAACTGAACACACTACAGACAGT GAAAAAAGGTACATATTCCATTTTCTCATTGCCTGAAGATCTCTGCTGATGCTCCTGGAGAATGACTTTGGGGGCTTTAGAAAGAATATTGCCAGTCCGTCTCGGCAAGGAGATGATGGGAGCGCTTTATATGGAGGCTTTACATGACTTGTAAATTAAATGTGAATGAGGGCAGTTGATTAAAATTGGTATTACAGAAGGGCCCTGCTGAGGTTTGA AAACAGCTGAGCTGCTGATGTCTCAGGCCTTTCCCTGAATTAGCACTGCGGTTTCCAGGATATCAGCAAAGAGGGCAAGTAATAGAAGCCCCTGATAAGGAGCGTCAGCCGACAGGCAAGCTTGGGAGGCTGTGGGAATGGGTCTGCCCCCAGCTTCACAGACCTCTTCCTCCAGCCCTCTGAATCCCATTAGCCACAGCCTAGAACATTAGCTGAGCTGCAC AAGCTCACCCACCCCTGTGCCAGGGGGCCCTGACCCTCCCTCCATGCCATGTTTTTGGCTGTATCTACGGCACTTAACAATAGGGGCTTTTTATTTTCATTACAGAGATATTTTGAAAAATTTAAAAGACATGAACTCACATAAACAGTTATGGATGATAGTTAAAAGAGAAACGGGTGGAGGTGGATGAGAGGTTGTCTTCATGAATAATTACTTGAGAT TTTTTTTTTCTTAATGGAATTAGTTTTATTAGAAAATGTCTGTGTTAAATCCGTAGAAAAGGAAGAAAAGTGTAGCAACAAAAATGTAGCCATTATCTAACTTGCCATAAATATTTGCAGTTATGATACCTTGGAATGTTGCCACGATATGGATTGCTTTGATTAAAAGATGTCAGTTGAATAAAACAGTACTGTGGGAGAATCGCTTTCTGCTGCTAGATAAATGCT GATGTTTATTTTTAAACCAGGAAACATTGATCCTGTAACAATGCCCGATTACAATTGCTTTATTACACCCCAGGGCTGATGGAGATGTAATCACTTGGCTAATGGATGTGGGTGCAGGACAGATGCTCGCTTGCTGGCCTGCTTTCCTGCTTGCATTCTGATGAGCTGCAGGAGTGCGCCTGGCCTTCTGCAGGTGGAGCTGCTGTCAGAGCTTCGTTTCA CTGATACCCAAAGCCATGTCTGACTGAAATAAAACAGGTTCCCTTTTTTTTTCCCTTTGGAAAATGCCAACTAAGGGAGACTAATCAGATATCTTAACACAATTTCATCCAGGCTTAGTGCTAACAAGATTGCGGGGCTTTTTAGGGTTTAAGAAGATGAGAAATGAGTGTGCACGTTTCACACGTTGACTTGCCGGTTTTTCCATGTCATACAAAAAAGTCCTGGCT GTTTCTCCGAACTGGCTGCCTGCATTCCCGTCTTTCTTTTGTTTTTAAGAAATAGACTGAATTCAGCTGTTAATCCTCTAGTACAGTATCCATGTTAAAATGTTTTTCCATTGCATCTTTTATGTGAATTCAAAGGTCAGAATTTATTGTCTGTGATATTGAGACCATGTGTACAAGAACTACTTTTTGTTTTCATCATTCACTCCTTAGCAAACGTTTCGTAAGTACCCTGT CTGTTTGCTACTATATGAGGTGCTGCGAAATTAGTGGGCGTGGCTTTTTATATTTTTCATTCGTGTGTAGCCTAAGTAAGGTGACTCAAGATGATACACCGAGAGAAAAATGCAAAATATTTGGTTCTCATTTCTGTTGCTGTCGTTTCCTTTTTTAAAGACGATTTATCAACTGCTGCCATTTGGAACTTCCTATAAGAAACTAAAAATGATCTATTTCAGTGTTC CTTTCGCCTTTCCTCTGCTTTCTGAATAAATGGTTTCAGTAACCCATGCTGTTTCTCCCTATTCTACGTCTTTCTCCTATGTTGAAAAAAGATTCCCACAGTTTCTGATGTGTGTGTTTATAGTCTTCAATGTATGTTAACATGTTAGGAACTGAGTATCTTAAGAGATGTCTTAGAATGCTTTAGTTTCATAATTTGTCCTTTATGTATTTTTCATTGTATTTGCTGT TTTGACATGGAAGTAATTTAAAAAGTTGGTGCAGGAAAGGACTCTTTACTGTTGCACATTTTGGTTTTCTGATATGTAATAAATTCATGGCTTGGCAGCTGACATGATGTTTCCCCAGAGAGAAGGAGATGTATTTCTGCAGGGTCCAGACCAAAAAGAGCCATTTACAGCATGTTCTCCCATGTTCCATTATCAGCTGATGAAACCTGCCCTGCCAAGGCATAA ACTTTTGTACTAGCTGTCTCCATATTATGTTCAATAAATTCTGTGCTCTGAATATATTTAAAAAAAAAAAAA SEQ ID NO:23 The polynucleotide sequence of all 6 exons 1-6 of VAPB arranged in order SEQ ID NO:24 Translated polypeptide sequence of VAPB exon 1 MAKVEQVLSLEPQHELKFR SEQ ID NO:25 Translated polypeptide sequence of VAPB exon 2 PFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGASINVS SEQ ID NO:26 Translated polypeptide sequence of exon 3 of VAPB MLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAV SEQ ID NO:27 Translated polypeptide sequence of exon 4 of VAPB WKEAKPEDLMDSKLRCVFELPAENDKP SEQ ID NO:28 Translated polypeptide sequence of exon 5 of VAPB HDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENKQFK SEQ ID NO:29 Translated polypeptide sequence of exon 6 of VAPB EEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL SEQ ID NO:30 VAPB polypeptide sequence MAKVEQVLSLEPQHELKFRGPFTDVVTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIIDAGASINVSVMLQPFDYDPNEKSKHKFMVQSMFAPTDTSDMEAVWKEAKPEDLMDSKLRCVFELPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREENK QFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALVVLFFIVGVIIGKIAL CADM1 sequence informationSEQ ID NO:5 CADM1 exon 7 sequence at 5' of CADM1-NRG1 fusion AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACG SEQ ID NO:6 NRG1 exon 6 sequence at 3' of CADM1-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:7 CADM1-NRG1 polynucleotide sequence AGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACGCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:8 CADM1-NRG1 polypeptide sequence ASNIVGKAHSDYMLYVYATSTSTTGTSH SEQ ID NO:33 CADM1 exon 1 GGTTGGGCTCGCGGCGCTGTGATTGGTCTGCCCGGACTCCGCCTCCAGCGCATGTCATTAGCATCTCATTAGCTGTCCGCTCGGGCTCCGGAGGCAGCCAACGCCGCCAGTCTGAGGCAGGTGCCCGACATGGCGAGTGTAGTGCTGCCGAGCGGATCCCAGTGTGCGGCGGCAGCGGCGGCGGCGGCGCCTCCCGGGCTCCGGCT CCGGCTTCTGCTGTTGCTCTTCTCCGCCGCGGCACTGATCCCCACAG SEQ ID NO:34 CADM1 exon 2 GTGATGGGCAGAATCTGTTTACGAAAGACGTGACAGTGATCGAGGGAGAGGTTGCGACCATCAGTTGCCAAGTCAATAAGAGTGACGACTCTGTGATTCAGCTACTGAATCCCAACAGGCAGACCATTTATTTCAGGGACTTCAGGC SEQ ID NO:35 CADM1 exon 3 CTTTGAAGGACAGCAGGTTTCAGTTGCTGAATTTTTTCTAGCAGTGAACTCAAAGTATCATTGACAAACGTCTCAATTTCTGATGAAGGAAGATACTTTTGCCAGTCCTATACCGATCCCCCACAGGAAAGTTACACCATCACAGTCCTGG SEQ ID NO:36 CADM1 exon 4 TCCCACCACGTAATCTGATGATCGATATCCAGAAAGACACTGCGGTGGAAGGTGAGGAGATTGAAGTCAACTGCACTGCTATGGCCAGCAAGCCAGCCACGACTATCAGGTGGTTCAAAGGGAACACAGAGCTAAAAG SEQ ID NO:37 CADM1 exon 5 GCAAATCGGAGGTGGAAGAGTGGTCAGACATGTACACTGTGACCAGTCAGCTGATGCTGAAGGTGCACAAGGAGGACGATGGGGTCCCAGTGATCTGCCAGGTGGAGCACCCTGCGGTCACTGGAAACCTGCAGACCCAGCGGTATCTAGAAGTACAGT SEQ ID NO:38 CADM1 exon 6 ATAAGCCTCAAGTGCACATTCAGATGACTTATCCTCTACAAGGCTTAACCCGGGAAGGGGACGCGCTTGAGTTAACATGTGAAGCCATCGGGAAGCCCCA SEQ ID NO:39 CADM1 exon 7 GCCTGTGATGGTAACTTGGGTGAGAGTCGATGATGAAATGCCTCAACACGCCGTACTGTCTGGGCCCAACCTGTTCATCAATAACCTAAACAAACAGATAATGGTACATACCGCTGTGAAGCTTCAAACATAGTGGGGAAAGCTCACTCGGATTATATGCTGTATGTATACG SEQ ID NO:40 Exon 8 of CADM1 ACACAACGGCGACGACAGAACCAGCAGTTCACG SEQ ID NO:41 CADM1 exon 9 GCCTTACTCAGTTGCCCAATTCCGCAGAAGAACTGGACAGTGAGGACCTCTCAG SEQ ID NO:42 Exon 10 of CADM1 ATTCCCGAGCAGGTGAAGAAGGCTCGATCAGGGCAGTGGATCATGCCGTGATCGGTGGCGTCGTGGCGGTGGTGGTGTTCGCCATGCTGTGCTTGCTCATCATTCTGGGGCGCTATTTTGCCAGACATAAAG SEQ ID NO:43 CADM1 exon 11 GTACATACTTCACTCATGAAGCCAAAGGAGCCGATGACGCAGCAGACGCAGACACAGCTATAATCAATGCAGAAGGAGGACAGAACAACTCCGAAGAAAAGAAAGAGTACTTCATCTAGATCAGCCTTTTTGTTTCAATGAGGTGTCCAACTGGCCCTATTTAGATGATAAAGAGACAGTGATATTGGAACTTGCGAGAAATTCGTGTGTTTTTTTATGAATGGGTG GAAAGGTGTGAGACTGGGAAGGCTTGGGATTTGCTGTGTAAAAAAAAAAAAAATGTTCTTTGGAAAGTACACTCTGCTGTTTGACACCTCTTTTTTCGTTTGTTTGTTTGTTTAATTTTTATTTCTTCCTACCAAGTCAAACTTGGATACTTGGATTTAGTTTCAGTAGATTGCAAAAATTCTGTGCCTTGTTTTTTGTTTGTTTGTTGCGTTCCTTTCTTC CCCCTTTGTGCACATTTATTTCCTCCCTCTACCCCAATTTCGGATTTTTTCCAAAATCTCCCATTTTGGAATTTGCCTGCTGGGATTCCTTAGACTCTTTTCCTTCCCTTTTCTGTTCTAGTTTTTTACTTTTGTTTTATTTTTATGGTAACTGCTTTCTGTTCCAAATTCAGTTTCATAAAAGGAGAACCAGCACAGCTTAGATTTCATAGTTCAGAATTTAGTGTATCCATA ATGCATTCTTCTCTGTTGTCGTAAAGATTTGGGTGAACAAACAATGAAAACTCTTTGCTGCTGCCCATGTTTCAAATACTTAGAGCAGTGAAGACTAGAAAATTAGACTGTGATTCAGAAAATGTTCTGTTTGCTGTGGAACTACATTACTGTACAGGGTTATCTGCAAGTGAGGTGTGTCACAATGAGATTGAATTTCACTGTCTTTAATTCTGTATCTGTAGACGGC TCAGTATAGATACCCTACGCTGTCCAGAAAGGTTTGGGGCAGAAAGGACTCCTCCTTTTTCCATGCCCTAAACAGACCTGACAGGTGAGGTCTGTTCCTTTTATAAGTGGACAAATTTTGAGTTGCCACAGGAGGGGAAGTAGGGAGGGGGGAAATACAGTTCTGCTCTGGTTGTTTCTGTTCCAAATGATTCCATCCACCTTTCCCAATCGGCCT TACTTCTCACTAATTTGTAGGAAAAAGCAAGTTCGTCTGTTGTGCGAATGACTGAATGGGACAGAGTTGATTTTTTTTTTTTTTCCTTTGTGCTTAGTTAGGAAGGCAGTAGGATGTGGCCTGCATGTACTGTATATTACAGATATTTGTCATGCTGGGATTTCCAACTCGAATCTGTGTGAAACTTTCATTCCTTCAGATTTGGCTTGACAAAGGCAGGAGGTACAA AAGAAGGGCTGGTATTGTTCTCACACTGGTCTGCTGTCGCTCTCAGTTCTCGATAGGTCAAGAGCAGAGGTGGAAAAACAGCATGTACGGATTTTCAGTTACTTAATCAAAACTCAAATGTGAGTGTTTTTTATCTTTTTACCTTTTCATACACTAGCCTTGCCTCTTTCCTCAGCCTTAAGAACCATCTGCCAAAAATTACTGATCCTCGCATGATGGCAGCCATAGT GCATAGCTACTAAAATCAGTGACCTTGAACATATCTTAGATGGGGAGCCTCGGGAAAAGGTAGAGGAGTCACGTTACCATTTACATGTTTTAAAAGAAAGAAGTGTGGGGATTTTCACTGAAACGTCTAGGAAATCTAGAAGTAGTCCTGAAGGACAGAAACTAAACTCTTACCATATGTTTGGTAAGACTCCAGACTCCAGCTAACAGTCCCTATGGAAAGATGGC ATCAAAAAAGATAGATCTATATATATATAAATATATTTCTATTACATTTTCAGTGAGTAATTTTGGATTTTGCAAGGTGCATTTTTACTATTGTTACATTATGTGGAAAACTTATGCTGATTTATTTAAGGGGGAAAAAGTGTCAACTCTTTGTTATTTGAAAACATGTTTATTTTTCTTGTCTTTTATTTTAACCTTTGATAGAACCATTGCAATATGGGGGCCTT TTGGGAACGGACTGGTATGTAAAAAGAAAATCCATTATCGAGCAGCATTTTATTTACCCCTCCCCTATCCCTAGGCACTTAACCAAAGACAAAAAGCCACAATGAACATCCCTTTTTCAATGAATTTTATAATCTGCAGCTCTATTCCGAGCCCTTAGCACCCATTCCGACCATAGTATAATCATATCAAAGGGTGAGAATCATTTAGCATGTTGTTGAAAGGTTTTTTTTTCAGTTGT TCTTTTTAGAAAAAAAGAAAAACAAAAAAAAAAACAAAAAAAAAAAATCACACCATTGCTCACAGAATTGGCATCTCATTTTTGGGACCTCCCATCTTTCTGTTTTGAAAAGTGTACAGTAGTGCAGTGTTCCTGATGTAACTTTATGGCTTACAATGTTGACATGTCTCAGGTTCATGTGTTGCGATTGGTGTTTTCCGTCTCAGGTAGATTGCAAAGTGTAGGC CCCACACATTGGAAAAAATAATAATAAAAAAAGCAAAAACAGGAAATTATGGATTTTAGTTGTATATTGGTTTATGTATTTTTTCTTAAGTATACAGTGCACTGTTTGAAATGTATTGTTGAGTATTACTTTGTACAGGTTGATCACTTTTTTAGAGTGAAGAAAGAACAAACTTGTTTTTTGTGTTTTTTTAAAGGAATATAAAATAATGAAGGATGTATAATTGATGC CAAATAAGCTTGTTCTTTTAGTCACACCGACGTCTTTATTTTTCCCTTTAGGCCAGTTCTGTTTTTAAGGTGTACATGGACAATGTTACAGTGTAAGAAACTCCATATCCATATGTTCCCATTCGCATTTTGTATTGGTTCATGTATACCATTTTTCAAAAAAAAAAAAAAAAAAAAGTAAGTACTATAAAATATCTGTCTTCTTAATAAAAAAAAAATTAATGTTACAAAGTGA SEQ ID NO:44 The polynucleotide sequence of all 11 exons 1-11 of CADM1 arranged in order SEQ ID NO:45 Translated polypeptide sequence of exon 1 of CADM1 MASVVLPSGSQCAAAAAAAPPGLRLRLLLLLFSAAALIPT SEQ ID NO:46 Translated polypeptide sequence of exon 2 of CADM1 DGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFR SEQ ID NO:47 Translated polypeptide sequence of exon 3 of CADM1 LKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVL SEQ ID NO:48 Translated polypeptide sequence of exon 4 of CADM1 PPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELK SEQ ID NO:49 Translated polypeptide sequence of exon 5 of CADM1 KSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQ SEQ ID NO:50 Translated polypeptide sequence of exon 6 of CADM1 KPQVHIQMTYPLQGLTREGDALELTCEAIGKP SEQ ID NO:51 Translated polypeptide sequence of exon 7 of CADM1 PVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVY SEQ ID NO:52 Translated polypeptide sequence of exon 8 of CADM1 TTATTEPAVH SEQ ID NO:53 Translated polypeptide sequence of exon 9 of CADM1 LTQLPNSAEELDSEDLS SEQ ID NO:54 Translated polypeptide sequence of exon 10 of CADM1 SRAGEEGSIRAVDHAVIGGVVAVVVFAMLCLIILGRYFARHK SEQ ID NO:55 Translated polypeptide sequence of exon 11 of CADM1 TYFTHEAKGADDAADADTAIINAEGGQNNSEEKKEYFI SEQ ID NO:56 CADM1 polypeptide sequence MASVVLPSGSQCAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELKG KSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQPVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVYDTTATTEPAVHGLTQLPNSAEELDSEDLSDSRAGEEGSIR AVDHAVIGGVVAVVVFAMLCLIILGRYFARHKGTYFTHEAKGADDAADADTAIINAEGGQNNSEEKKEYFI SEQ ID NO:57 The polynucleotide sequence of all seven exons 1-7 of CADM1 arranged in order SEQ ID NO:58 Translated polypeptide sequence of CADM1 exon 1-7 arranged in order MASVVLPSGSQCAAAAAAAPPGLRLRLLLLLFSAAALIPTGDGQNLFTKDVTVIEGEVATISCQVNKSDDSVIQLLNPNRQTIYFRDFRPLKDSRFQLLNFSSSELKVSLTNVSISDEGRYFCQLYTDPPQESYTTITVLVPPRNLMIDIQKDTAVEGEEIEVNCTAMASKPATTIRWFKGNTELKG KSEVEEWSDMYTVTSQLMLKVHKEDDGVPVICQVEHPAVTGNLQTQRYLEVQYKPQVHIQMTYPLQGLTREGDALELTCEAIGKPQPVMVTWVRVDDEMPQHAVLSGPNLFINNLNKTDNGTYRCEASNIVGKAHSDYMLYVY CD44 sequence informationSEQ ID NO:9 Sequence from CD44 exon 5 at 5' of CD44-NRG1 fusion GACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCA SEQ ID NO: 10 NRG1 exon 2 subsequence at CD44-NRG1 fusion 3' CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO:11 CD44-NRG1 polynucleotide sequence GACGAAGACAGTCCCTGGATCACCGACAGCAGCAGACAGAATCCCTGCTACCACCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO:12 CD44-NRG1 polypeptide sequence DEDSPWITDSTDRIPATTLPPRLKEMKSQESAAGSK SEQ ID NO:61 CD44 exon 1 CTCATTGCCCAGCGGACCCAGCCTCTGCCAGGTTCGGTCCGCCATCCTCGTCCCGTCCTCCGCCGGCCCCTGCCCCGCGCCCAGGGATCCTCCAGCTCCTTTCGCCCGCGCCCTCCGTTCGCTCCGGACACCATGGACAAGTTTTGGTGGCACGCAGCCTGGGGACTCTGCCTCGTGCCGCTGAGCCTGGCGCAGATCG SEQ ID NO:62 CD44 exon 2 ATTTGAATATAACCTGCCGCTTTGCAGGTGTATTCCACGTGGAGAAAAATGGTCGCTACAGCATCTCTCGGACGGAGGCCGCTGACCTCTGCAAGGCTTTCAATAGCACCTTGCCCACAATGGCCCAGATGGAGAAAGCTCTGAGCATCGGATTTGAGACCTGCAG SEQ ID NO:63 CD44 exon 3 GTATGGGTTCATAGAAGGGCACGTGGTGATTCCCCGGATCCACCCCAACTCCATCTGTGCAGCAAACAACACAGGGGTGTACATCCTCACATCCAACACCTCCAGTATGACACATATTGCTTCAATGCTTCAG SEQ ID NO:64 CD44 exon 4 CTCCACCTGAAGAAGATTGTACATCAGTCACAGACCTGCCCAATGCCTTTGATGGACCAATTACCATAA SEQ ID NO:65 CD44 exon 5 CTATTGTTAACCGTGATGGCACCCGCTATGTCCAGAAAGGAGAATACAGAACGAATCCTGAAGACATCTACCCCAGCAAACCCTACTGATGATGACGTGAGCAGCGGCTCTCCAGTGAAAGGAGCAGCACTTCAGGAGGTTACATTCTTTTACACCTTTTTACTGTACCACCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCCAGACAGAATCCCT GCTACCA SEQ ID NO:66 CD44 exon 6 CTTTGATGAGCACTAGTGCTACAGCAACTGAGACAGCAACCAAGAGGCAAGAAACCTGGGATTGGTTTTCATGGTTGTTTCTACCATCATCAGAGTCAAAGAATCATCTTCACACAACAACACAAATGGCTG SEQ ID NO:67 CD44 exon 7 GTACGTCTTCAAATACCATTCTCAGCAGGCTGGGAGCCAAATGAAGAAAATGAAGATGAAAGAGACAGACCTCCAGTTTTTCTGGATCAGGCATTGATGATGATGAAGATTTTATCTCCAGCACCA SEQ ID NO:68 CD44 exon 8 TTTCAACCACACCACGGGCTTTTGACCACACAAAACAGAACCAGGACTGGACCCAGTGGAACCCAAGCCATTCAAATCCGGAAGTGCTACTTCAGACAACCACAAGGATGACTG SEQ ID NO:69 CD44 exon 9 ATGTAGACAGAAATGGCACCACTGCTTATGAAGGAAACTGGAACCCAGAAGCACACCCTCCCTCATTCACCATGAGCATCATGAGGAAGAAGAGACCCCACATTCTACAAGCACAA SEQ ID NO:70 CD44 exon 10 TCCAGGCAACTCCTAGTAGTACAACGGAAGAAACAGCTACCCAGAAGGAACAGTGGTTTGGCAACAGATGGCATGAGGGATATCGCCAAACACCCAAAGAAGACTCCCATTCGACAACAGGGACAGCTG SEQ ID NO:71 CD44 exon 11 CAGCCTCAGCTCATACCAGCATCCAATGCAAGGAAGGACAACACCAAGCCCAGAGGACAGTTCCTGGACTGATTTCTTCAACCCAATCTCACACCCCATGGGACGAGGTCATCAAGCAGGAAGAAGGATGG SEQ ID NO:72 CD44 exon 12 ATATGGACTCCAGTCATAGTATAACGCTTCAGCCTACTGCAAATCCAAACACAGGTTTGGTGGAAGATTTGGACAGGACAGGACCTCTTTCAATGACAACGC SEQ ID NO:73 CD44 exon 13 AGCAGAGTAATTCTCAGAGCTTCTCTACATCACATGAAGGCTTGGAAGAAGATAAAGACCATCCAAACTTCTACTCTGACATCAAGCA SEQ ID NO:74 CD44 exon 14 ATAGGAATGATGTCACAGGTGGAAGAAGAGACCCAAAATCATTCTGAAGGCTCAACTACTTTACTGGAAGGTTATACCTCTCATTACCCACACACGAAGGAAAGCAGGACCTTCATCCCAGTGACCTCAGCTAAGACTGGGTCCTTTGGAGTTACTGCAGTTACTGTTGGAGATTCCAACTTCTAATGTCAATCGTTCCTTATCAG SEQ ID NO:75 CD44 exon 15 GAGACCAAGACACATTCCACCCCAGTGGGGGGTCCCATACCACTCATGGATCTGAATCAGATG SEQ ID NO:76 CD44 exon 16 GACACTCACATGGGAGTCAAGAAGGTGGAGCAAACACAACCTCTGGTCCTATAAGGACACCCCAAATTCCAG SEQ ID NO:77 CD44 exon 17 AATGGCTGATCATCTTGGCATCCCCTTGGCCTTGGCTTTGATTCTTGCAGTTTGCATTGCAGTCAACAGTCGAAGAAG SEQ ID NO:78 CD44 exon 18 GTGTGGGCAGAAGAAAAAGCTAGTGATCAACAGTGGCAATGGAGCTGTGGAGGACAGAAAGCCAAGTGGACTCAACGGAGAGGCCAGCAAGTCTCAGGAAATGGTGCATTTGGTGAACAAGGAGTCGTCAGAAACTCCAGACCAGTTTATGACAGCTTATGACAGCTGATGAGACAAGGAACCTGCAGAATGTGGACATGAAGATTGGGGTGTAACACCCTACACC ATTATCTTGGAAAGAAACAACCGTTGGAAACATAACCATTACAGGGAGCTGGGACACTTAACAGATGCAATGTGCTACTGATTGTTTCATTGCGAATCTTTTTTAGCATAAAATTTTCTACTCTTTTTGTTTTTTGTGTTTTGTTCTTTAAAAGTCAGGTCCAATTTGTAAAAACAGCATTGCTTTCTGAAATTAGGGCCCAATTAATAATCAGCAAGAATTTGATCGTTC CAGTTCCCCACTTGGAGGCCTTTCATCCCTCGGGTGTGCTATGGATGGCTTCTAACAAAAACTACACATATGTATTCCTGATCGCCAACCTTTCCCCACCAGCTAAGGACATTTCCCAGGGTTAATAGGGCCTGGTCCCTGGGAGGAAATTTGAATGGGTCCATTTTGCCCTTCCATAGCCTAATCCCTGGGCATTGCTTTCCACTGAGGTTGGGGGT TGGGGTGTACTAGTTACACATTCTTCAACAGACCCCCTCTAGAAATTTTTCAGATGCTTCTGGGAGACACCCAAAGGGTGAAGCTATTTATCTGTAGTAAACTATTTATCTGTGTTTTTGAAATATTAAAACCCTGGATCAGTCCTTTGATCAGTATAATTTTTTAAAGTTACTTTGTCAGAGGCACAAAAGGGTTTAAACTGATTCATAATAAATATCTGTACTTCTTCGATC TTCACCCTTTTGTGCTGTGATTCTTCAGTTTCTAAACCAGCACTGTCTGGGTCCCTACAATGTATCAGGAAGAGCTGAGAATGGTAAGGAGACTCTTCTAAGTCTTCATCTCAGAGACCCTGAGTTCCCACTCAGACCCACTCAGCCAAATCTCATGGAAGACCAAGGAGGGCAGCACTGTTTTTTGTTTTTTGTTTTTTGTTTTTTTTTTTTGACACTGTCCAAAGGTTTTC CATCCTGTCCTGGAATCAGAGTTGGAAGCTGAGGAGCTTCAGCCTCTTTTATGGTTTAATGGCCACCTGTTCTCTCCTGTGAAAGGCTTTGCAAAGTCACATTAAGTTTGCATGACCTGTTATCCCTGGGGCCCTATTTCATAGAGGCTGGCCCTATTAGTGATTTCCAAAAACAATATGGAAGTGCCTTTTGATGTCTTACAATAAGAAGAAGCCAAT GGAAATGAAAGAGATTGGCAAAGGGGAAGGATGATGCCATGTAGATCCTGTTTGACATTTTTATGGCTGTATTTGTAAACTTAAACACCCAGTGTCTGTTCTTGATGCAGTTGCTATTTAGGATGAGTTAAGTGCCTGGGGAGTCCCTCAAAAAGGTTAAAGGGATTCCCATCATTGGAATCTTATCACCAGATAGGCAAGTTTATGACCAACAAGAGAG TACTGGCTTTATCCTCTAACCTCATATTTCTCCCACTTGGCAAGTCCTTTGTGGCATTTATTCATCAGTCAGGGTGTCCGATTGGTCCTAGAACTTCCAAAGGCTGCTTGTCATAGAAGCCATTGCATCTATAAAAGCAACGGCTCCTGTTAAATGGTATCTCCTTTCTGAGGCTCCTACTAAAAGTCATTTGTTACCTAAACTTATGTGCTTAACAGGCAATGCTTCT CAGACCACAAAGCAGAAAGAAGAAGAAAAGCTCCTGACTAAATCAGGGCTGGGCTTAGACAGAGTTGATCTGTAGAATATCTTTAAAGGAGAGATGTCCAACTTTCTGCACTATTCCCCAGCCTCTGCTCCTCCCTGTCTACCTCTCCCCTCCCTCTCCCTCCACTTCACCCCCACAATCTTGAAAAACTTCCTTTTCTCTTCTGTGAACATCATTGGCCAGATCCATTTCAGT GGTCTGGATTTCTTTTTTATTTTCTTTTCAACTTGAAAGAAACTGGACATTAGGCCACTATGTGTTGTTACTGCCACTAGTGTTCAAGTGCCTCTTGTTTTTCCCAGAGAGATTTCCTGGGTCTGCCAGAGGCCCAGACAGGCTCACTCAAGCTTTAACTGAAAAGCAACAAGCCACTCCAGGACAAGGTTCAAAATGGTTACAACAGCCTCTACCTGTCGCCCCAGGGA GAAAGGGGTAGTGATACAAGTCTCATAGCCAGAGATGGTTTTCCACTCCTTCTAGATATTCCAAAAAGAGGCTGAGACAGGAGGTTATTTTCCAATTTTATTTTGGAATTAAATACTTTTTTCCTTTATTTACTGTTGTAGTCCCTCACTTGGATATACCCTGTTTTTCACGATAGAAATAAGGGAGGTCTAGAGCTTCTATTCCTTGGCCATTGTCAACGGAGAGCT GGCCAAGTCTTCACAAACCCTTGCAACATTGCCTGAAGTTTATGGAATAAGATGTATTTCTCACTCCTTGATCTCAAGGGCGTAACTCTGGAAGCACAGCTTGACTACACGTCATTTTTACCAATGATTTTCAGGTGACCTGGGCTAAGTCATTTTAAACTGGGTCTTTAAACTGGGTCTTTAAAAGTAAAAGGCCAACATTTAATTATTTTGCAAAGCAACCTAAGAGCTAAAGATGT AATTTTTTCTTGCAATTGTAAATCTTTTGTGTCTCCTGAAGACTTCCCCTTAAAATTAGCTCTGAGTGAAAAATCAAAAGAGACAAAAGACATCTTCGAATCCATATTTCAAGCCTGGTAGAATTGGCTTTTCTAGCAGAACCTTTCCAAAAGTTTTATATTGAGATTCATAACACACCAAGAATTGATTTTGTAGCCAACATTCATTCAATACTGTTATATCAGAGGAGTAGGAGAGA GGAAACATTTGACTTATCTGGAAAAGCAAAATGTACTTAAGAATAAGAATAACATGGTCCATTCACCTTTATGTTATAGATATGTCTTTGTGTAAATCATTTGTTTTGAGTTTTCAAAGAATAGCCCATTGTTCATTCTTGTGCTGTACAATGACCACTGTTATTGTTACTTTGACTTTCAGACACACCCTTCCTCTGGTTTTTTGTATATTTATTGATGGATCAAT AATAATGAGGAAAGCATGATATGTATATTGCTGAGTTGAAAGCACTTATTGGAAAATATTAAAAGGCTAACATTAAAAGACTAAAGGAAACAGA SEQ ID NO:79 The polynucleotide sequence of all 18 exons 1-18 of CD44 arranged in order SEQ ID NO:80 Translated polypeptide sequence of exon 1 of CD44 MDKFWWHAAWGLCLVPLSLAQI SEQ ID NO:81 Translated polypeptide sequence of exon 2 of CD44 LNITCRFAGVFHVEKNGRYSISRTEADLCKAFNSTLPTMAQMEKALSIGFETC SEQ ID NO:82 Translated polypeptide sequence of exon 3 of CD44 YGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNAS SEQ ID NO:83 Translated polypeptide sequence of exon 4 of CD44 PPEEDCTSVTDLPNAFDGPITI SEQ ID NO:84 Translated polypeptide sequence of exon 5 of CD44 IVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSTSGGYIFYTFSTVHPIPDEDSPWITDSTDRIPAT SEQ ID NO:85 Translated polypeptide sequence of exon 6 of CD44 LMSTSATATETATKRQETWDWFSWLFLPSESKNHLHTTTQMA SEQ ID NO:86 Translated polypeptide sequence of exon 7 of CD44 TSSNTISAGWEPNEENEDERDRHLSFSGSGIDDDDEDFISST SEQ ID NO:87 Translated polypeptide sequence of exon 8 of CD44 STTPRAFDHTKQNQDWTQWNPSHSNPEVLLQTTTRMT SEQ ID NO:88 Translated polypeptide sequence of exon 9 of CD44 VDRNGTTAYEGNWNPEAHPPLIHHEHHEEEEETPHSTST SEQ ID NO:89 Translated polypeptide sequence of exon 10 of CD44 QATPSSTTEETATQKEQWFGNRWHEGYRQTPKEDSHSTTGTA SEQ ID NO:90 Translated polypeptide sequence of exon 11 of CD44 ASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRRM SEQ ID NO:91 Translated polypeptide sequence of exon 12 of CD44 MDSSHSITLQPTANPNTGLVEDLDRTGPLSMTT SEQ ID NO:92 Translated polypeptide sequence of exon 13 of CD44 QSNSQSFSTSHEGLEEDKDHPTTSTLTSS SEQ ID NO:93 Translated polypeptide sequence of exon 14 of CD44 RNDVTGGRRDPNHSEGSTTLLEGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTVGDSNSNVNRSLS SEQ ID NO:94 Translated polypeptide sequence of exon 15 of CD44 DQDTFHPSGGSHTTHGSESD SEQ ID NO:95 Translated polypeptide sequence of exon 16 of CD44 HSHGSQEGGANTTSGPIRTPQIP SEQ ID NO:96 Translated polypeptide sequence of exon 17 of CD44 WLIILASLLALALILAVCIAVNSRR SEQ ID NO:97 Translated polypeptide sequence of exon 18 of CD44 CGQKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVDMKIGV SEQ ID NO:98 CD44 polypeptide sequence MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRTEADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFY TFSTVHPIPDEDSPWITDSTDRIPATTLMSTSATATETATKRQETWDWFSWLFLPSESKNHLHTTTQMAGTSSNTISAGWEPNEENEDERDRHLSFSGSGIDDDEDFISSSTISTTPRAFDHTKQNQDWTQWNPSHSNPEVLLQTTTRMTDVDRNGTTAYEGNWNPEAHPPLIHHEHEEEETPHSTSTIQATPSSTTEETATQK EQWFGNRWHEGYRQTPKEDSHSTTGTAAASAHTSHPMQGRTTPSPEDSSWTDFFNPISHPMGRGHQAGRRMDMDSSHSITLQPTANPNTGLVEDLDRTGPLSMTTQQSNSQSFSTSHEGLEEDKDHPTTSTLTSSNRNDVTGGRRDPNHSEGSTTLEGYTSHYPHTKESRTFIPVTSAKTGSFGVTAVTV GDSNSNVNRSLSGDQDTFHPSGGSHTTHGSESDGHSHGSQEGGANTTSGPIRTPQIPEWLIILASLLALALILAVCIAVNSRRRCGQKKKLVINSGNGAVEDRKPSGLNGEASKSQEMVHLVNKESSETPDQFMTADETRNLQNVDMKIGV SEQ ID NO:99 The polynucleotide sequence of all five exons 1-5 of CD44 arranged in order SEQ ID NO:100 Translated polypeptide sequence of exon 1-5 of CD44 MDKFWWHAAWGLCLVPLSLAQIDLNITCRFAGVFHVEKNGRYSISRTEADLCKAFNSTLPTMAQMEKALSIGFETCRYGFIEGHVVIPRIHPNSICAANNTGVYILTSNTSQYDTYCFNASAPPEEDCTSVTDLPNAFDGPITITIVNRDGTRYVQKGEYRTNPEDIYPSNPTDDDVSSGSSSERSSTSGGYIFY TFSTVHPIPDEDSPWITDSTDRIPAT SLC3A2 transcript 6 sequence informationSEQ ID NO:13 SLC3A2 exon 1 sequence at 5' of SLC3A2-NRG1 fusion CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG SEQ ID NO:14 NRG1 exon 5 sequence at 3' of SLC3A2-NRG1 fusion CATCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:15 SLC3A2-NRG1 polynucleotide sequence CCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGGCATCTACATCTACATCCACCACTGGGACAAGCCAT SEQ ID NO:16 SLC3A2-NRG1 polypeptide sequence RIGDLQAFQGHGAGNLAASTSTSTTGTSH SEQ ID NO:103 Exon 1 of SLC3A2 AGATGCAGTAGCCGAAACTGCGCGGAGGCACAGAGGCCGGGGAGAGCGTTCTGGGTCCGAGGGTCCAGGTAGGGGTTGAGCCACCATCTGACCGCAAGCTGCGTCGTGTCGCCGGTTCTGCAGGCACCATGAGCCAGGACACCGAGGTGGATATGAAGGAGGTGGAGCTGAATGAGTTAGAGCCCGAGAAGCAGCCGATGAACGCGGCGT CTGGGGCGGCCATGTCCCTGGCGGGAGCCGAGAAGAATGGTCTGGTGAAGATCAAGGTGGCGGAAGACGAGGCGGAGGCGGCAGCCGCGGCTAAGTTCACGGGCCTGTCCAAGGAGGAGCTGCTGAAGGTGGCAGGCAGCCCCGGCTGGGTACGCACCCGCTGGGCACTGCTGCTGCTCTTCTGGCTCGGCTGGCTCGGCATGC TTGCTGGTGCCGTGGTCATAATCGTGCGAGCGCCGCGTTGTCGCGAGCTACCGGCGCAGAAGTGGTGGCACACGGGCGCCCTCTACCGCATCGGCGACCTTCAGGCCTTCCAGGGCCACGGCGCGGGCAACCTGGCGG SEQ ID NO:104 Exon 2 of SLC3A2 GTCTGAAGGGGCGTCTCGATTACCTGAGCTCTCTGAAGGTGAAGGGCCTTGTGCTGGGTCCAATTCACAAGAACCAGAAGGATGATGTCGCTCCAGACTGACTTGCTGCAGATCGACCCCAATTTTGGCTCCAAGGAAGATTTTGACAGTCTCTTGCAATCGGCTAAAAAAAAAGA SEQ ID NO:105 Exon 3 of SLC3A2 GCATCCGTGTCATTCTGGACCTTACTCCCAACTACCGGGGTGAGAACTCGTGGTTTCCCACTCAGGTTGACACTGTGGCCACCAAGGTGAA SEQ ID NO:106 Exon 4 of SLC3A2 GATGCTCTGGAGTTTTGGCTGCAAGCTGGCGTGGATGGGTTCCAGGTTCGGGACATAGAGAATCTGAAG SEQ ID NO:107 Exon 5 of SLC3A2 GATGCATCCTCATTCTTGGCTGAGTGGCAAAATATCACCAAGGGCTTCAGTGAAGACAG SEQ ID NO:108 Exon 6 of SLC3A2 GCTCTTGATTGCGGGGACTAACTCCTCCGACCTTCAGCAGATCCTGAGCCTACTCGAATCCAACAAAGACTTGCTGTTGACTAGCTCATACCTGTCTGATTCTGGTTCTACTGGGGAGCATACAAAATCCTAGTCACACAGTTTGAATGCCACTGGCAATCGCTGGTGCAGCTGGAGT SEQ ID NO:109 Exon 7 of SLC3A2 TTGTCTCAGGCAAGGCTCCTGACTTCCTTCTTGCCGGCTCAACTTCTCCGACTCTACCAGCTGATGCTCTTCACCCTGCCAGGGACCCCTGTTTTTCAGCTACGGGGATGAGATTGGCCTGGATGCAGCTGCCCTTCCTGGACAG SEQ ID NO:110 Exon 8 of SLC3A2 CCTATGGAGGCTCCAGTCATGCTGTGGGATGAGTCCAGCTTCCCTGACATCCCAGGGGCTGTAAGTGCCAACATGACTGTGAAG SEQ ID NO:111 Exon 9 of SLC3A2 GGCCAGAGTGAAGACCCTGGCTCCCTCCTTTCCTTGTTCCGGCGGCTGAGTGACCAGCGGAGTAAGGAGCGCTCCCTACTGCATGGGGACTTCCACGCGTTCTCCGCTGGGCCTGGACTCTTCTCCTATATCCGCCACTGGGACCAGAATGAGCGTTTTCTGGTAGTGCTTAACTTTGGGGATGTGGGCCTCTCGGCTGGACTGCAGGC CTCCGACCTGCCTGCCAGCGCCAGCCTGCCAGCCAAGGCTGACCTCCTGCTCAGCACCCAGCCAGGCCGTGAGGAGGGCTCCCCTCTTGAGCTGGAACGCCTGAAACTGGAGCCTCACGAAGGGCTGCTGCTCCGCTTCCCCTACGCGGCCTGACTTCAGCCTGACATGGACCCCACTACCCTTCTCCTTTCCTTCCCAGGCCCTTTGGCTTCTGATTTTTCT CTTTTTTAAAAACAAACAAACAAACTGTTGCAGATTATGAGTGAACCCCCAAATAGGGTGTTTTCTGCCTTCAAATAAAAGTCACCCCTGCATGGTGAA SEQ ID NO:112 The polynucleotide sequence of all nine exons 1-9 of SLC3A2 arranged in order SEQ ID NO:113 Translated polypeptide sequence of exon 1 of SLC3A2 MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLA SEQ ID NO:114 Translated polypeptide sequence of exon 2 of SLC3A2 LKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKK SEQ ID NO:115 Translated polypeptide sequence of exon 3 of SLC3A2 IRVILDLTPNYRGENSWFSTQVDTVATKV SEQ ID NO:116 Translated polypeptide sequence of exon 4 of SLC3A2 DALEFWLQAGVDGFQVRDIENLK SEQ ID NO:117 Translated polypeptide sequence of exon 5 of SLC3A2 DASSFLAEWQNITKGFSED SEQ ID NO:118 Translated polypeptide sequence of exon 6 of SLC3A2 LLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWS SEQ ID NO:119 Translated polypeptide sequence of exon 7 of SLC3A2 LSQARLLTSFLPAQLLRLYQLMLFLTLPGTPVFSYGDEIGLDAAALPGQ SEQ ID NO:120 Translated polypeptide sequence of exon 8 of SLC3A2 PMEAPVMLWDESSFPDIPGAVSANMTVK SEQ ID NO:121 Translated polypeptide sequence of exon 9 of SLC3A2 GQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA SEQ ID NO:122 SLC3A2 polypeptide sequence MSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLL QSAKKKSIRVILLDLTPNYRGENSWFSTQVDTVATKVKDALEFWLQAGVDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSLSQARLLTSFLPAQLLRLYQLMLFLTLPGTPVFSYGDEIGLDAAALPGQPM EAPVMLWDESSFPDIPGAVSANMTVKGQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA NRG1 sequence informationSEQ ID NO: 125 NRG1 exon 1 AGTAAGCCTCCGCAGCCCACTCGGACTGCAGCCTGTTTGCCGCCCGTCCTCCCATTGCAGCACTCGGGGCGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCAGCAGCATGGGGAAAGGACGCGCGGG CCGAGTTGGCACCACAG SEQ ID NO: 126 Exon 2 of NRG1 CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCAAAATATCAAGATACAAAAAAGCCAG G SEQ ID NO: 127 NRG1 exon 3 GAAGTCAGAACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGAATCAAACG SEQ ID NO: 128 NRG1 exon 4 AGATCATCACTGGTATGCCAGCTCCAACTGAAGGA GCATATGTGTCTTCAG SEQ ID NO: 129 NRG1 exon 5 AGTCTCCCATTAGAATATCAGTATCCACAGAAGGAGCAAATACTTCTTCAT CCTCGAGATACTTGTGCAA SEQ ID NO: 131 NRG1 exon 7 GTGCCCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGTAATGGCCAGCTTCTACA SEQ ID NO: 132 NRG1 exon 8 AGCATCTTGGGATTGAATTTATGG SEQ ID NO: 133 NRG1 exon 9 AGGCGGAGGAGCTGTACCAGAAAGAGAGTGCTGACCATAACCGGCATCTGCATCGCCCTCCTTGTGGTCGGCATCATGTGTGTGGTGGCCTACTGCAAAACCAA SEQ ID NO: 134 NRG1 exon 10 GAAACAGCGGAAAAAGCTGCATGACCGTCTTCGGCAGAGCCTTCGGTCTGAACGA AACAATATGATGAACATTGCCAATGGGCCTCACCATCCTAACCCACCCCCGAGAATGTCCAGCTGGTGAAT SEQ ID NO: 135 NRG1 exon 11 CAATACGTATCTAAAAAACGTCATTCCAGTGAGCATATTGTTGAGAGAAGCAGAGACATCCTTTTCCACCAGTCACTATACTTCCACAGCCATCACTCCACTACTGTCACCCAGACTCCTAG CCACAG SEQ ID NO: 136 NRG1 exon 12 CTGGAGCAACGGACACACTGAAAGCATCCTTTCCGAAAGCCACTCTGTAATCGTGATGTCATCCGTAAAACAGTAGGCACAGCAGCCCAACTGGGGGCCCAAGAGGACGTCTTAATGGCACAGGAGGCCCTCGTGAATGTAACAGCTTCTCAGGCATGCCAGAGAAACCCCCTGATTCCTACC GAGACTCTCCTCATAGTGAAAG SEQ ID NO: 137 NRG1 Exon 13 GTATGTGTCAGCCATGACCACCCCCGGCTCGTATGTCACCTGTAGATTTCCACACGCCAAGCTCCCCCAAATCGCCCCCTTCGGAAATGTCTCCACCACCCGTGTCCAGCATGACGGTGTCCCATGCCTTCCATGGCGGTCAGCCCTTCATGGAAGAAGAGAGACCCTACTTCTCGTGACACCACCAAGGCTGCGGGAGAAGAAGTTTGACCATCA CCCTCAGCAGTTCAGCTCCTTCCACCACAACCCCGCGCATGACAGTAACAGCTCCCTGCTAGCCCCTTGAGGATAGTGGAGGATGAGGAGTATGAAACGACCCAAGAGTACGAGCCAGCCCAAGAGCCTGTTAAAGAAACTCGCCAATAGCCGGCGGGCCAAAAGAACCAAGCCCAATGGCCACATTGCTAACAGATTGGAAGTGGACAGCAACACAAGCTCCCAGAGC AGTAACTCAGAGAGTGAAACAGAAGATGAAAGAGTAGGTGAAGATACGCCTTTCCTGGGCATACAGAAACCCCCTGGCAGCCAGTCTTGAGGCAACACCTGCCTTCCGCCTGGCTGACAGCAGGACTAACCCAGCAGGCCGCTTCTCGACACAGGAAGAAATCCAGGCCAGGCTGTCTAGTGTAATTGCTAACCAAGACCCTATTGCTGTATAAAACCTAAATAAACAC ATAGATTCACCTGTAAAACTTTATTTATATAATAAAGTATTCCACCTTAAATTAAACAATTTATTTTATTTTAGCAGTTCTGCAAATAGAAAACAGGAAAAAACTTTTTATAAATTAAATATATGTATGTAAAAAATGTGTTATGTGCCATATGTAGCAATTTTTACAGTATTTCAAACGAGAAAGATATCAATGGTGCCTTTATGTTATGTTATGTCGAGAGCAAGTTTTGTA CAGTTACAGTGATTGCTTTTCCACAGTTTCTGCAAAACCTCTCATAGATTCAGTTTTTGCTGGCTTCTTGTGCATTGCATTATGATGTTGACTGGATGTATGATTTGCAAGACTTGCAACTGTCCCCTCTGTTTGCTTGTAGTAGCACCCGATCAGTATGTCTTGTAATGGCACATCCATCCAGATATGCCTCTCTTGTGTATGAAGTTTTCTTTGCTTTCAGAAT ATGAAATGAGTTGTGTCTACTCTGCCAGCCAAAGGTTTGCCTCATTGGGCTCTGAGATAATAGTAGATCCAACAGCATGCTACTATTAAATACAGCAAGAAACTGCATTAAGTAATGTTAAATATTAGGAAGAAAGTAATACTGTGATTTAAAAAAAACTATATTATTAATCAGAAGACAGCTTGCTCTTACTAAAAAGGAGCTCTCATTTACTTTATTTGATTTTTATTTTTCTT GACAAAAAGCAACAGTTTTAGGGATAGCTTAGAAAATGGGTTCTGGCTTGCTATCAGGGTAAATCTAACACCTTACAAGAGGACTGAGTGTCACTTTCTCTCTGGGGGAATGATCCAGCAGCTTATCTAGTTGACAATCAAAACACGGCTGATAAAGGTGCAATCATTTCTGACATGTATTTTTCACTGATTTTGAAGCTAGTGATTGGTTGTGTCTTCTTGG CTCAAAAAGAAGCATATTACGGCACAAAAAGCCCAGCCCAGACAGCACATGCAGCATTTTGTCTGAAATACTTCTAGAGTCAAACGTGCCTGCTGTACATAGCGATGACTTGTCATCATAGGGAAGTATTTCCATCGTAGAGTGTTCAGAAGGAGTGACTGTATAGGTGGAGAAGCTTAGTGACTCCGTTGAAATTTTAAAATGTGGATGACCACCCCTTTCTCCCCCT TATTTTTCTTTTATCTTTCCATGTTGCCTTGATCAGGTCATAACTATGCATGAACATTTTTTATCAGGAATGGCCGATGTGTATGTGATTTGTAATCACAAGTAATGATTCATCAGGAAATGTCAAATCCTGTTGGAAAGATTGCACCTTTACTTGCAGAAGTGACCCCCACCTGTGTCCTGACCTCTCATTTTCACAGGCTCTCACCCATTCCCCACCTCCTTTAAT TTTTGCTTTACTGTCATAAAGTAGGACTAAGATTGGTCTAAGCATTGCATGTTCTTTTGTGATGGTAAATCCAAAGGAAGGCCTATAAGTTAACATTTGAAATAACTGCTAATTCAGGAAAATGGAAGAAAAAAAATTATTTGAAACACAGAACCCATTTCATGGCCTGCCTGATATCTGTGAAATCAGGGCTGGAGCTTTACTTAGGATTCACATGGCCTCCTAGGA ACCATGGGACAAATGGGAAACAGGTTATCGGGGGATTCATGAAGTCAGTGAGAGTAATTGCTTCTTTTTTGCGGGTGAACTGAATGTATTTTCTTCACCAAATCTTGATGTTAACAATTAAAAAGAAGAAATGACATGCAAGTAGGTCTTAGCAGAAAAATGCAGGCTGGGCATGAGTCATGTTGTTACCCTCCCACATGCTCCTACAATCCACAGAGAT GCCTGTCTGCAGGTTCTTGAAGTTATTGTTAGTATTTGGTATCTCAATTTTTCGTCACTGTTCACATGCCACTTTCTCTGTGCACAGTGGTATCCTCATTTGCTTTTTAACCTACACTGAGGAGTCTTTGTCAGGTTGCACTGATTTTCCAATTCTGCAGTAATGAGTAAGCTCACGGCATGGGGAAGAAGACAGTCAGTCCAATGAAGTTCCTAAATTATTTAACATT GCCTTTGAAGGCCTTGACTCATCCTTAGCTATTTCAATGAAGAAATTCCTACCATGAATTTAAAACCCTAAAAATTCTGTTTCAAATTCTTTGGGCATTGGGGTACTCAGATATCCCATTGTGGAAGAATTTTAAGAATAAATAGAAGTTTCTGTTGAGAACCATGAGCAACATGTTTCTACAATGAGAATTGCTATGCATTTTAAAATTGCAAATATATATGAAAAT TGAAAGACAAGAGGAAATTGTATTTCTAACTTGATTCTGATCACTCACAGAGGTGGCATATTATATTATAGTTGGGACATCCTTTGCACCCTTCATAAAAAAGGCCAGCTGACTGCTCAGCATCACCTGCCAAGGCCACTAGATTTGTGTTTACAAGGGGTATCTCTGTGATGCTTGTCACATCACTCTTGACCACCTCTGTTAATAAATTCCGACAGTGCAGTGGCGATCGGAGT GTGAACTTATGTTCCCAGCATATGGAAAGCTATCTTAGGTTTTAAAGGTAGTAGAAATTGCCCAGGAGTTTGACAGCAACTTTGTTTCCCGGGTCTAAAAATCGTATCCCACTGAGGTGTATGCAGTGGAGCATAATACATGCAAATACATGCAAAACTCCTTTTGTTTCTACCTAAGATTCACTTTCTTCTTACTTTCCCTTCCTGCCTAGTGTGACTTTTGCCCCAAGAGTGC CTGGACAGCATTCTAGTTTCTACAAAATGGTCCTCTGTGTAGGTGAATGTGTCCCAAACCTGCTATCACTTTCTTGTTTCAGTGTGACTGTCTTGTTAGAGGTGAAGTTTATCCAGGGTAACTTGCTCACTAACTATTCCTTTTTATGGCCTGGGGTTAAAGGGCGCATGGCTCACACTGGTGAAAATAAGGAAGGCCTGGTCTTATCTTGTATTAATAATACT GGCTGCATTCCACCAGCCAGAGATTTCTATCTGCGAAGACCTATGAAACACTGAAGAGAAATGTAGGCAGAAGGAAATGGCCACATATCACACAGTTTCTATTATATTTCTTTTGTAAATACATATTGTATATTACTTGGATGTTTTTCTTATATCATTTACTGTCTTTTGAGTTAATGTCAGTTTTACTTCTCCAACTTACTATGTAACATTGTAAATAACATAATGTCCTTTAT TATTTATATTTAAGCATCTAACATATAGAGTTGTTTTTCATAAGTTTAAGATAAATGTCAAAAATATGTCAAAAATATATGTTCTTTTGTTTTTCTTTGCTTTAAAATTATGTATCTTTTCCTTTTCTTTTTTTAAGAATAATTTATTGTTCAGGAGAAAGAATGTATATGTAACTGAAACTATCTGAAGAATGCACATTGAAGGCCGTGAGGTACTGATAAACTAAAGAATTTATTATTCAAAAT ACTAAGCAATAAGTAATTGTGATTTATTTAAAGTTTTGTCCATTTTCCATGAAAGACATACTGCAATAAAAATGCTACTCTGTGGAGACCTGGGAGTGTTGCTCAGCAGACTACAGCTTCAGTCTGTTAGACCAGCACCTTTCATCTCATTCCCATAGTTATGCTAATTTAGGATTGTTGTTCCATGGACCCCATGATCACCTTGTCTATACGTTGCTTCTTGTCTGTCCAT TGCTTTTGCCCACCACCTGTTCTCAAATCATCTCCTCCCTACCAATGCTGTTTATCACTTTCTTCCTTTGTTGAAGAGGCCACAACCAGACAGTACTATGCTTCCTTTTTCCTCATACACAATAACAGAGAGAGAATATTTCTAGGGCATGACTGCCTGGATCCTGGCTGTTGCTATCTTTTGTAGTGGCAGTAAGAAACTCCTTCAGACTAATGAAAATGTCAACGTG CCATTCAATCACGAAAGGTAACGAAAAATGCTCTCATGGTTCAAATAGTCCAATGGCCCATAGTGGCCTAAAAGGCAGCCAGTTGACACCTGGCCATGCTAAGCTTCCTTACCATCCGCTAATGACTTTCCATTGGGCCCACAATTTACGGATTCATAATTTTAAAAGAGGAGAAGGCCAAGTTAGGTTCATTCCCCTTATTCTGTCAATAAACAAATCAAACTCAT GTCTATCTAACTGCTCAGGGAGGAGCCTTTGCATGAGAAAATTCTCATATTTCTAAGACTGAGTCATAGAAATGAGGGTATTACTTTTCTTACTGCAATTAACCTAAACAAAAGCCATATTTAACAAATAGATTTGCATGGTACCCTTCATATTCCAAAGCATTTCACTCATTATTCCAGGTAGGTTAAGAGCTTCTGAAGTGTATGAAGTAAAGGTCAGCAATCCTTG GGGTGAACAGTGGCCTCCTTTGGAGTTTGGGGGTAACCTGAGACTTCCCACCAATGTCCACCATCCATCTGTGTACCTAATTCCTATTACCTAGTTATGGCTCCTCTAGGATCATTTTCCAAACACTCTGGATGTCCAGGAAATTAATTGTAGCTTTTGACTGAGCTAGTTTTTCCTATTTATATTAATAAATTTTCAAAAATGCTTGAAAATCTTCACATTTGCAACAC ACTTTAGTTTTTCATGCACATACAAACACAGAGAGACAAAAATTCCAAACAGACACTCTCCAAAAGCCACCACACTTCTTTCACTTGTCCTATAGTCATTTAGCCAACCAGCCATGCAGAGAATATTTAAAACTTAAAGATTGAGACATTTTCTCAGTTTTTGCTGAGGCTTTGTAACGAAATTGAACACTATAAGCAGCTATTGTAGTAATTTTGGTTAAAATTGTTTGCCTGG GATATAGTATTTGAGGCAGAAGCACGTGTGTGAAGGAGGTGAGGTGGTTTGGAAAGAGTGAAGACTCGCAGCCAGATTGAATGTCTGGATAATTACTATAATTCTCCCCTTCTTGGTTGAAACCATGTTCTCTCTTGATTTTTAAACCCAGGCTGCCTCTGGAAACAAGCAAACCTGAGTCTTTCTAACCTGAGTCTCCCAATCATTAGATTTCTTTTCTGTCCTAACGAT GAATGATAAAAGGACTTGATGTTCACAATTTGGGGTTATAAGGCAGGTCTGAAATCTGGAGACTCAAGATGCTGGAAGGAGTGGAAAGTTTCGATGACTTTATATGAATCACTTTGCACTCTATGTTTGGCTTGTCCTCTTTGAAACTGATTTACTAAAATAAATGTAAGGGAACTATTACTCAAAAGATTAACTTGGCAGGAAATACCAATACTTTCAGTTTAT GAAAGACAAAACTGTCTTGTTGCTACAGGAAGCTGCAATGTTCCTAACCTTAAGGTTGGTGTTGAATAGGGTGGTCATGCCCTCCCCTGCAGGTATCTTTAGGCTCCTGTTGACCTCCTGGTACTATAACTGTTCGTCTTCTCTGGGTAGCTATTGATTTTGAACTTTAACATGCTTCAAAACTTTTATTCATCAGGGAAATAGGAAAAGAGTTTTGTTACCTG GAGGAAATCTATTGTGATCTACCTGAGCTTTTTAAAAACAGACCAGGAGAAGGAAACCAGTAATTTTTAAAGAAGAGACAGAGAATGGGATAATAGTTTCACCCAGGATCTCTTTCTAACCCTTTCCCTTCAAATGAACTTATTGGAACAGAATTGGAAAGAAGAAAGGACATCTCTGCCCACCCCACAGGATGCCAAAAAGGCTAAAGAATTACCCTCTGTAGATTTAAACAT CTTTAATGGCTTATGTATAGATTTGCTAATACAGAGAGAAATGAACTATTAAATAAAAATCACATTTTAATATTTTTTATGGCTTAAAACATCCTTTATCTCCTTTTTGTTCTCTCTACATGATATGGTAAGTGATGAGGAAAATTTAGGCTCAGGAAGGTTAAAATCTTTCTTGGAGTTACACATCTAAGAGAGCTGCAGAGCTGACACTTGTACCCCAGGTTTTCTGACT GCAAATCCAGTTTCTTTCTATTGCGTTCTTCCCCTTTCCCTGCCTCAAGCAGAAACAGGTTTTTATTTTCAACCTTTATGTATACAGTATGTTATGTTACATTCTACAGCTAAGTTTCTTTTTAGAAGAATGTGAGCCCTTCTAGCTTTGGTTTAGAGTGATTCTAGAAGCCAATTTCCTTGGCTTAGTGATTCTATGCACCTTTCCTAAACTTAGCTTTCTAAGGAAATGAAGT GTACGAGTGAGAATGAATTCACAATTTCGACATGTAGGTAGCATCCTAAAGTGAAAAGAGGAGGAAATTTGTGGTCAAAGCACTCTCCCACCACTTAGAAACTTACTGACTGTGGGCAGCTTCCTCCTCCAAGTTTCCTTCCTGATTTACAAGACCGTGGTGTGGTCAGGATTAAACTTGAATACATGTAAGGAAGCCTGAAAGTGTCTAACACATAGCGAGTATTCAAAT GCCACCCTTCTATTTGATCCTTCCCCTCCAGTTCCTTAAGTTTTGGAATCTAGGTTTCTCAGTTCCAAATGGATTGACATTTGCATATCCCCATTGCACAATGGATCAAATAAACTTTATGTTATCATTCTCTCAACATAGTGCCAGTAAGCAAATCCTTTTAATAACAACAGTATGTTGAGAAACATATCACCAAATAATATTTAACTTTGTAGCTTTGATAAGTTCTTTAGGT TTTGGTTTTGGTTTTGTTTTCTGAGACAGGGTCTTGATCTGTCACCCAGACTGGAGTGCAATGGTTCAATTTTAGCTCACTGCAACCTGTAACTCCTGAGCTCAAGTGATCCTCCCACCTCAGCCTCTCAAGTAGCTGGGACTACAGGTGTGCTCCACCATGCCCAGCGATTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTAGGGACAAGGTCTCGCTATGT TGCCCAGGCTGGTCTTCAATTCCTGGCCTTAAGTGATCCTCCTGCCTCGGCCTCCCAAAGTGCTGGAATTACAGGCATGAGCCACCACCCATAACTTTATGTTTGTTTTTTTTGATGCAGTATAAGTTCAGCTTGCTTCTTATGCAGCCATACCATCTGATTTTTAGCAGCTTATTACATTAGTGTTTACAGAAATTTACTA AACCATGACTCTGTAGAGTTTTTAATAATACTACCTCCAAACATCATTGCAAACATCTAGAAGAATGAACAAAAATGATCTTAGATCGACAGTATATCTGTTTGTCTTAGTTTCTACACACAGGATGTTCAGACATATTCCCATTTCTTTAAAAAAAAAATATATATATATATATATATAGGCCTGGCACGGTGGCTCATGACTGTAATCCCAGCACTTTGGGAGGCTTAGGCAGG CAAATCACCTGAGGTCAGGAGTTTGAGACTGGCCTGACCCAACATGGTGAAACCTCGTCTCTATTAAAATTACAAAAATTAGCCGGGCGTGGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGTTGAGATCACGCCATTGCACTCCAGCCTTGGTGACAAGGTGAAACTCC GTCTCAAAAAAAAAAAAAAATATGTATATATATATATATATATATATATATATATATATATATATATAAAATCCCACCAAAAGTCTGCAGAGTGACCAAATTAGACGGCTCTGGTTTCAGATTAAATTCTAAATGTGAGAAACCACATAGCTCCCATGATCCATCCAATAATTCCTCACGTCCTCTTCACTCTTTACTCCATGCATAAACAGAATTTTTTTTCTCCTG GGTATGAAGCAATTAATAATTTACGGATTTAGCCTATTTGGATTCAATCCCTTCAAACTCCATACTACATCCAAGGTGGAAGTGACTTAAACTCTGATATCAATCATCAGGCTTGTAATAGGCTTTGTTAATGGCAGGAGAGTCTAATAAAACTTTCTGTTCCTTTATCCTTCATTTAAATGAAAAAACTTTTTTATTGAAAACAATCATAACTCTAGCTCATCATAAATAAATA ATTCATGAGGACATTTTATTATTTTTATATTAAGAAATAATATTATAGATGTAAACTTTGCACCTTTTCAATTATTATCATGAGTTAAGCTAATACTTGTCTTCTGGTCCCTAGATGATGATTCTTTTTTGCCTTACTGGAGGAGCCCTTGTCTTGAAGTGAGTTGCTTCAACAGCAGAGGACTTCTAGTTTCTCCCAGTTGAGCCTAAAGTGAACTTTTCATCTCT TCAGAGGAAGGGGCTTCCTTGATTTGTACTTTTGTGGCTCCTCAGATAACACAGACAATTTTATCTTGGATCCCAGGTTTCTTCACCATTAAGAATAAGAAAGAGAAAATGCTGTGCATGACAGCCCCCTACTCCAAACTACCCCAACCCCCTGTAACCTCAACGAGATGATTCTGCCCTCACTCAAGAGTCCCCCACAAAGATTCATTCTCCTTT ACTTTTTTATTTTTTATTTTTTTGCAAAACAAAGGCCTCCTTTAGTACCTCCTTAGTTTAACCCTTATCTTCCCAGCTCTTCCCTTCACTGATACCCTCTGATTTCAAAAAGTTCTGAAGTCGGAAGACCACACAATTTCAGACTGTGAACAGAAATTCAGTCAGAAATTATTGGAGTTAGAAAGAATTTAGAGAAATTGTATTGAACTAGAAAGTCCTCTTTGATTAGTGGTGGC CCTTTAGAAAGTTCTAGGGCAGAGTCCCATGGTGTTTCATTCTTTTCCATGATTTGCTTGACCAAAAACTTTTCTTTCACAACATGAAAATATGCTAATCCACCACACATTTTGGATTCTGCTCTGTTTGCCTGAGGTGTTAGATCTCTAGCCAGGACTGTGAAGGGAAGGAACTTGAATCCTTCCTATTGAGCTATTAATGCAGAGTCAGTGAGATGAAGGGTTCCACTCG GGGTCAAAATCATGTCAGTTACCCAAGCAAAGGAGCAAGTAAGGGGAAACATCTCCTCATCTGGTTAGTGGAGCCACATTTCACCCACTGATCAAGCCAGACCTGAGCAATAGTCTAGATTTCTCCCCTCCACATCTAATTGGTGACAATGGTGATTGTACCACTGACAGGTCACACCAAGTCCACACCCTCCTTCACAGCTCACTGCTTCGGCTCTTGTTTATGCTCTTAT CAGCATCTGTCACTAGGATCCATTTGTCTCCTGACTCATCTACTTCCTTTCACTCCCCTGGGCCATCCTTCACATCATGCTAGAAGACTGTGTCTAACTTGCAGAACTTATTGTGTCAATCTCCTGGTTACGGCCCCTCCATGGCTCCCCACCTGACATAGGAGGCCCTTCACAATCTGGCTTCTGTCACTCATAACTTGTCTCCAGTCTGATT TTATGGTTTTTCTAGTTCCCCAAATACCACACCAGTGTTTATGAAACCTAGTCATTGGCCTACGAACTTTATGATTATTGACATATTCATGTACCACCTGTATTTATTTTTTGCATAGTGTTTTATTTTAATTGACTACATTTTTAACTACAATAAAGTAATTTCAACTAAAA SEQ ID NO: 138 All 13 exons 1-13 of NRG1 in sequence Nucleotide sequence SEQ ID NO: 139 Translated polypeptide sequence of exon 1 of NRG1 MGKGRAGRVGTT SEQ ID NO: 140 Translated polypeptide sequence of exon 2 of NRG1 LPPRLKEMKSQESAAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKP SEQ ID NO: 141 Translated polypeptide sequence of exon 3 of NRG1 KSELRINKASLADSGEYMCKVISKLGNDSASANITIVESN SEQ ID NO: 142 Exon of NRG1 The translated polypeptide sequence of 4 IITGMPASTEGAYVSS SEQ ID NO: 143 NRG1 Translated polypeptide sequence of exon 5 SPIRISVSTEGANTSS SEQ ID NO: 144 Translated polypeptide sequence of exon 6 of NRG1 TSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLC SEQ ID NO: 145 Translated polypeptide sequence of exon 7 of NRG1 CPNEFTGDRCQNYVMASFY SEQ ID NO: 146 Outside of NRG1 Translated polypeptide sequence of exon 8 HLGIEFM SEQ ID NO: 147 Translated polypeptide sequence of exon 9 of NRG1 AEELYQKRVLTITGICALLVVGIMCVVAYCKT SEQ ID NO: 148 Translated polypeptide sequence of exon 10 of NRG1 KQRKKLHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVN SEQ ID NO: 149 N RG1 exon The translated polypeptide sequence of 11 QYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSH SEQ ID NO:150 The translated polypeptide sequence of NRG1 exon 12 WSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSE SEQ ID NO:151 The translation of NRG1 exon 13 Peptide sequence YVSAMTTPARMSPVDFHTPSSPKSPPSEMSPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQAR LSSVIANQDPIAV SEQ ID NO:152 NRG1 polypeptide sequence SEQ ID NO: 153 The sequenced polynucleotide sequence of all 12 exons 2-13 of NRG1 SEQ ID NO: 154 The translated polypeptide sequence of NRG1 exons 2-13 SEQ ID NO: 155 All 8 exons of NRG1 6-13 polynucleotide sequence in sequence SEQ ID NO: 156 Translated polypeptide sequence of NRG1 exon 6-13 TSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIFMEAEELYQKRVLTITGICIALVVGIMCVVAYCKTKKQRKKLHDRLRQSLRSERNNMMNIAN GPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHVIVMSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPPARMSPVDFHTPSSPKSPPSEMSPVSSMTVSMPSMAVSPFMEEERPLLLVTPPRLREKKFDHHPQQF SSFHHNPAHDSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO: 157 All 8 exons 6-13 of NRG1 sequenced multinucleated Nucleotide sequence (excluding the nucleotide CAT of exon 5) SEQ ID NO:158 Translated polypeptide sequence of exon 5+CAT and exon 6-13 of NRG1 STSTSTTGTSHLVKCAEKEKFFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFYKHLGIFMEAEELYQKRVLTITGICIALVVGIMCVVAYCKTKKQRKK LHDRLRQSLRSERNNMMNIANGPHHPNPPPENVQLVNQYVSKNVISSEHIVEREAETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHTESILSESHSVIVMSSVENSRHSSPTGGPRGRLNGTGGPRECNSFLRHARETPDSYRDSPHSERYVSAMTTPARMSPVDFHTPSSPKSPPSEMSPVSSMTVSMPSMAVSPFMEEERP LLLVTPPRLREKKFDHHPQQFSSFHHNPAHDSSNSLPASPLRIVEDEEYETTQEYEPAQEPVKKLANSRRAKRTKPNGHIANRLEVDSNTSSQSSNSESETEDERVGEDTPFLGIQNPLAASLEATPAFRLADSRTNPAGRFSTQEEIQARLSSVIANQDPIAV SEQ ID NO: 161 All 5 of NRG1 Exons 1-5 sequenced polynucleotide sequence (excluding the nucleotide CAT of exon 5) AGTAAGCCTCCGCAGCCCACTCGGACTGCAGCCTGTTTGCCGCCCGTCCTCCCATTGCAGCACTCGGGGCGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCAGCAGCATGGGGAA AGGACGCGCGGGCCGAGTTGGCACCACAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCGAATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCGAAAAAACAAACCACAAAATATCAAGATACAAAAAAAAGCCAGGGAAGTCAGAACTTCGCATTA ACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGCAAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCATCGTGGAATCAAACGAGATCATCACTGGTATGCCAGCCTCAACTGAAGGAGCATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAAGGAGCAAATACTTCTT SEQ ID NO: 162 NRG1 explicit Translated polypeptide sequences of subs 1-5, excluding translated The most C-terminal amino acid of exon 5 (S) MGKGRAGRVGTTALPPRLKEMKSQESAAAGSKLVLRCETSSEYSSLRFKWFKNGNELNRKNKPQNIKIQKKPGKSELRINKASLADSGEYMCKVISKLGNDSASANITIVESNEIITGMPASTEGAYVSSESPIRISVSTEGANTS SEQ ID NO: 163 According to SEQ ID NO: 163 EGF-like domain of NRG1 sequence of ID NO: 152 HLVKCAEKEKTFCVNGGECFMVKDLSNPSRYLCKCPNEFTGDRCQNYVMASFVTCN1 sequence informationSEQ ID NO:164 VTCN1 exon 2 sequence at 5' of VTCN1-NRG1 fusion CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAG SEQ ID NO:165 NRG1 exon 2 sequence at 3' of VTCN1-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAA SEQ ID NO:166 VTCN1-NRG1 polynucleotide sequence CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAGCCTTGCCTCCCCGATTGAAAGAGATGAA SEQ ID NO:167 VTCN1-NRG1 polypeptide sequence IISIIIILAGAIALIIGFGISALPPRLKEM SEQ ID NO:168 VTCN1 exon 1 GTGAGTCACCAAGGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGCTTCCCTGGGGCAGATCCTCTTCTGGAG SEQ ID NO:169 Exon 2 of VTCN1 CATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATCATTGGCTTTGGTATTTCAG SEQ ID NO:170 Exon 3 of VTCN1 GGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCAAAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGTGTTT GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAG SEQ ID NO:171 Exon 4 of VTCN1 CCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACAC ATACTCCTGTATGATTGAAAATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAG SEQ ID NO:172 Exon 5 of VTCN1 AATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTCGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAG SEQ ID NO:173 Exon 6 of VTCN1 GGATCTACAGAACTATTTTCACCACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAAACAAGAGCAAGAAACAAAAAAAGCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAAT GCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCCTGCCTGTCACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGCCTCCAACATATCCACATCTTATATTCCACAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACT GCCACTTCGCAACTTCAGGGGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAAACAGAGCAGTCGGCGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAACAAATGCGGGTTTATTTCCAGAT GATGTTCATCCGTGAATGGTCCAGGGAAGGACCTTTCACCTTTGTCTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTCCTTTCCATCCTGCGTGGACAGCTAAGACCTCAGTTTTCAATAGCATCTAGAGCAGTGGGACTCAGCTGGGGTGATTTCGCCCCCCATCTCCGGGGGAATGTCTGAAGACAATTTTGGTTACCTCAATGAGGGAGTGGA GGAGGATACAGTGCTACTACCAACTAGTGGATAGAGGCCAGGGATGCTGCTCAACCTCCTACCATGTACAGGACGTCTCCCCCATTACAACTACCCAATCCGAAGTGTCAACTGTGTCAGGGCTAAGAAACCCTGGTTTTGAGTAGAAAAGGGCCTGGAAAGAGGGGAGCCAACAAATCTGTCTGCTTCCTCACATTAGTCATTGGCAAATAAGCATTCTGT CTCTTTGGCTGCTGCCTCAGCACAGAGAGCCAGAACTCTATCGGGCACCAGGATAACATCTCTCAGTGAACAGAGTTGACAAGGCCTATGGGAAATGCCTGATGGGATTATCTTCAGCTTGTTGAGCTTCTAAGTTTCTTCCCTTCATTCTACCCTGCAAGCCAAGTTCTGTAAGAGAAATGCCTGAGTTCTAGCTCAGGTTTCTTACTCTGAATTTAGATCTCCA GACCCTGCCTGGCCACAATTCAAATTAAGGCAACAAACATATACCTTCCATGAAGCACACACAGACTTTTGAAAGCAAGGACAATGACTGCTTGAATTGAGGCCTTGAGGAATGAAGCTTTGAAGGAAAAGAATACTTTGTTTCCAGCCCCCTTCCCACACTCTTCATGTGTTAACCACTGCCTTCCTGGACCTTGGAGCCACGGTGACTGTATTACATGTTGTTATAGAAA ACTGATTTTAGAGTTCTGATCGTTCAAGAGAATGATTAAATATACATTTCCTACACCA SEQ ID NO:174 The polynucleotide sequence of all 6 exons 1-6 of VTCN1 arranged in order SEQ ID NO:175 Translated polypeptide sequence of exon 1 of VTCN1 MASLGQILFW SEQ ID NO:176 Translated polypeptide sequence of exon 2 of VTCN1 IISIIIILAGAIALIIGFGIS SEQ ID NO:177 Translated polypeptide sequence of exon 3 of VTCN1 RHSITVTTVASAGNIGEDGILSCTFEPDIKLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKCYIITSKGKGNANLEYKTG SEQ ID NO:178 Translated polypeptide sequence of exon 4 of VTCN1 FSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVSNTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVT SEQ ID NO:179 Translated polypeptide sequence of exon 5 of VTCN1 SEIKRRSHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK SEQ ID NO:180 VTCN1 polypeptide sequence MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQ GANFSEVSNTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK SEQ ID NO:181 VTCN1 exons 1 and 2 GTGAGTCACCAAGGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTGCACTCATTGGCTTTGGTATTTCAG SEQ ID NO:182 Translated polypeptide sequence of exons 1 and 2 of VTCN1 MASLGQILFWSIISIIIILAGAIALIIGFGIS CDH1 sequence informationSEQ ID NO: 184 CDH1 exon 11 sequence at 5' of CDH1-NRG1 fusion CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATG SEQ ID NO: 185 NRG1 exon 2 sequence at 3' of CDH1-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAA SEQ ID NO: 186 CDH1-NRG1 polynucleotide sequence CTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATGCCTTGCCTCCCCGATTGAAAGAGATGAAA SEQ ID NO: 187 CDH1-NRG1 polypeptide sequence WLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNALPPRLKEMK SEQ ID NO: 188 CDH1 exon 1 AGTGGCGTCGGAACTGCAAAGCACCTGTGAGCTTGCGGAAGTCAGTTCAGACTCCAGCCGCTCCAGCCCGGCCCGACCCGACCGCACCCGGCGCCTGCCCTCGCTCGGCGTCCCCGCCAGCCATGGGCCCTTGGAGCCGCAGCCTCTCGGCGCTGCTGCTGCTGCTGCAG SEQ ID NO: 189 CDH1 exon 2 GTCTCCTCTTGGCTCTGCCAGGAGCCGGAGCCCTGCCACCCTGGCTTTGACGCCGAGAGCTACACGTTCACGGTGCCCCGGCGCCACCTGGAGAGAGGCCGCGTCCTGGGCAGAG SEQ ID NO: 190 CDH1 exon 3 TGAATTTTGAAGATTGCACCGGTCGACAAAGGACAGCCTATTTTTCCCTCGACACCCGATTCAAAGTGGGCACAGATGGTGTGATTACAGTCAAAAGGCCTCTACGGTTTCATAACCCACAGATCCATTTCTTGGTCTACGCCTGGGACTCCACCCTACAGAAAGTTTTCACCAAAGTCACGCTGAATACAGTGGGGCACCACCACCGCCCCCCGCCC CATCAG SEQ ID NO: 191 Exon 4 of CDH1 GCCTCCGTTTCTGGAATCCAAGCAGAATTGCTCACATTTCCCAACTCCTCTCCTGGCCTCAGAAGACAGAAGAGAGACTGGGTTATTCCTCCCATCAGCTGCCCAGAAAATGAAAAAGGCCCATTTCCTAAAAACCTGGTTCAG SEQ ID NO: 192 CDH1 exon 5 ATCAAATCCAACAAAGACAAAGAAGGCAAGGTTTTTCTACAGCATCACTGGCCAAGGAGCTGACACACCCCCTGTTGGTGTCTTTTATTATTGAAAGAGAAACAGGATGGCTGAAGGTGACAGAGCCTCTGGATAGAGAACGCATTGCCACATACACT SEQ ID NO: 193 Exon 6 of CDH1 CTCTTCTCTCACGCTGTGTCATCCAACGGGAATGCAGTTGAGGATCCAATGGAGATTTTGATCACGGTAACCGATCAGAATGACAACAAGCCCGAATTCACCCAGGAGGTCTTTAAGGGGTCTTGTCATGGAAGGTGCTCTTCCAG SEQ ID NO: 194 CDH1 exon 7 GAACCTCTGTGATGGAGGTCACAGCCACAGACGCGGACGATGATGTGAACACCTACAATGCCGCCATCGCTTACACCATCCTCAGCCAAGATCCTGAGCTCCCTGACAAAAATATGTTCACCATTAACAGGAACACAGGAGTCATCAGTGTGGTCACCACTGGGCTGGACCGAGAG SEQ ID NO: 195 Exon 8 of CDH1 AGTTTCCCCTACGTATACCCTGGTGGTTCAAGCTGCTGACCTTCAAGGTGAGGGGTTAAGCACAACAGCAACAGCTGTGATCACAGTCACTGACACCAACGATAATCCTCCGATCTTCAATCCCACCACG SEQ ID NO: 196 CDH1 exon 9 TACAAGGGTCAGGTGCCTGAGAACGAGGCTAACGTCGTAATCACCACACTGAAAGTGACTGATGCTGATGCCCCCAATACCCAGCGTGGGAGGCTGTATACCATATTGAATGATGGTGGACAATTTGTCGTCACCACAAAATCCAGTGAACAACGATGGCATTTTGAAAACAGCAAAGGTTTGTATGGTACCTGGCAAGATGCAGAAACT GGCATCCTCACAGCTGTTCATACCCTTGTCCCCTG SEQ ID NO: 197 Exon 10 of CDH1 GGCTTGGATTTTGAGGCCAAGCAGCAGTACATTCTACACGTAGCAGTGACGAATGTGGTACCTTTTGAGGTCTCTCTCACCACCTCCACAGCCACCGTCACCGTGGATGTGCTGGATGTGAATGAAGCCCCATCTTTGTGCCTCCTGAAAAGGAGTGGAAGTGTCCGAGGACTTTGGCGTGGGCCAGGAAATCACTCCTACACTGCCCAGGAGCCAG ACACATTTATGGAACAGAAAATAAC SEQ ID NO: 198 Exon 11 of CDH1 ATATCGGATTTGGAGAGACACTGCCAACTGGCTGGAGATTAATCCGGACACTGGTGCCATTTCCACTCGGGCTGAGCTGGACAGGGAGGATTTTGAGCACGTGAAGAACAGCACGTACACAGCCCTAATCATAGCTACAGACAATG SEQ ID NO: 199 Exon 12 of CDH1 GTTCTCCAGTTGCTACTGGAACAGGGACACTTCTGCTGATCCTGTCTGATGTGAATGACAACGCCCCCATACCAGAACCTCGAACTATATTTCTTCTGTGAGAGGAATCCAAAGCCTCAGGTCATAAACATCATTGATGCAGACCTTCCTCCCAATACATCTCCCCTTCACAGCAGAACTAACACACGGGGCGAGTGCCAACTGGACCATTCAGTACAACGACCCAA SEQ ID NO: 200 Exon 13 of CDH1 CCCAAGAATCTATCATTTTGAAGCCAAAGATGGCCTTAGAGGTGGGTGACTACAAAATCAATCTCAAGCTCATGGATAACCAGAATAAAGACCAAAGTGACCACCTTAGAGGTCAGCGTGTGTGACTGTGAAGGGGCCGCTGGCGTCTGTAGGAAGGCCAGCCTGTCGAAGCAGGATTGCAAATTCCTGCCATTCTGGGGATTCTTGGAGGAATTCTTGCT TTGCTAA SEQ ID NO: 201 CDH1 exon 14 TTCTGATTCTGCTGCTCTTGCTGTTTCTTCGGAGGAGAGCGGTGGTCAAAGAGCCCTACTGCCCCCAGAGGATGACACCCGGGACAACGTTTATTACTATGATGAAGAAGGAGGCGGAGAAGAGGACCAG SEQ ID NO: 202 Exon 15 of CDH1 GACTTTGACTTGAGCCAGCTGCACAGGGGCCTGGACGCTCGGCCTGAAGTGACTCGTAACGACGTTGCACCAACCCTCATGAGTGTCCCCCGGTATCTTCCCGCCCTGCCAATCCCGATGAAATTGGAAATTTTATTGATGAA SEQ ID NO: 203 Exon 16 of CDH1 AatctgaAgCGGCGATGACCCCCCCCCCCCCCCCCCCCCCCCCCTTCTCTCTGTTGAAGGGAGGGTCCGCTGCTGAgCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCTCCCTCTTTCTCTCTCTCTCTCTCTCTCTCTCTCTTCCCCCC. CCAGGACTACTGACTACTGAACGAATGGGGGGGGCAATCGCGCTCAAGGGGGGGGCGCGCGCGCGACGACGGGGGGGGGCCCCCCCCCCCCCCCCTGGGAAAAAAATGCAGCAG AAATCACGTTGTGTGTGTGTGTGTTTTTTCAGCTCCCCCTTGAGAGAGAGAGGGGGGGGGGGGGGGAAAGAGAGGGTGTGCAGCAGTAGTACTCTTTTTTTTTTTTTTTTTTTTTTTA TGTTAGAAAAGTTTCGACGACGACGACGACTTTTTTTTTTTTTTTTTTTTTTCCCCCACTTTTTGTGTGTGTGTCCCCAAACCAACCAACAACAAGCAGAGTCACACCCCC CAGCACACACTTGCAGCAGCAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCAGCACTCTCTAGTCTAGTTTTTTTTTTTTTTTTGTGTGTCTCTCTCAC TaccTGGTGTGTGTGTGTGTCTCTGCTGCTTTTTTTTTTTTTTTTTTTTTCAGGTCTCTCGGCCCCGGGGCAGCATCATCTCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTAGCTCTCTCCTCCTCTCCTCCTCAT TGCACCTCAGCCTCCCCAAGCTGGGGGACCACAGCACCCACCACTATGCATTTTTTTTTTTTTTTTGGGGGGGGTCCCCCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCAT CCTCCCATCTTGCTCCCCAGGGGGGGGGGGGACACACAGAGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCTGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC GATCATAGCTCTCTGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGCAGCCCCCCCCCCTCTCTCTTTTTTTTTTTTTTTTCCCCCCCCCCCTCTCTCTCTCTCTCTCAAG is CAATCCTCTCTGCCCCCCCCCCCCCCCAAAGCTGGGGGGGCAGCTGCCCCCCCTCCTCTTCTCTCTCTCTCTCTGAAGCCCCCCCCCCCCCAGGGGAgggggggggggggggggggggAgCCCCCCCCCCCCCCCCCCCCCCCCCTCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC AgaAAAATTTTATTGGGGGCTTTTTTTTTTTTTTTTTTTTAGTTTTTGTTGTTTTGTGCCCACACACTGTACTCTGTGACTTTTTTCTCTCTGTGTGTGTGTGTACTGCAATGGGGG GCAGCTAGTCAGTGAGAGTGTTGAgTGTGTGTGTGTCATTTTTTTTTGAGAGAGAGAGACCCCCCCCCCCCCTGCTGCCCAGCAgCgcggaacttatg AAAAGGGGGGCTGAGAGGCAGGGGAGAGAGCTGTGAgCTGCAATTGCAACTGAGAGAGAGAGAGAGAGAGGTGTCTCTCTGAAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGgggggggggggggggggggggggga GTTTTCTGACACAAGAAGAAGAAGAAGTGTGTTTGTCCCCCCCCCCCCCCCCCCCCAAGCTGCTTGTGTGTACTGGGGGGGGGGGGGACCCCCCCCCCCCACACACACACACACACACACCCCCCCCCCCCCCCCCCCCCCCCCCTGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC GagaatttcaaaAAAAAAAAATTTTTTTTTTTTTTTTTTTTTTTAGAGAGAAGAAAAAAAATGCCTGCTAAGGGGGGGGGGGGGGGGGGGGTGGGAGGAGAGATTTTTTTTTGAAATTTCAAC TTTTGACAATCAAAAGAGACTTTTTTTGAAAATAGTTTTTTTCTCTCAAGGGGGGGGGGAAAAAACCCAATCAATCAATCAATTTTTTTTTCAGGCAGGCAGGAGAATC AGTTTTCTGTGTGTGTGTGTAGAATGTCTGTGTTTTTTTGGTGTACATGTGTGTGTGTGTGTGTTTTTTTGGGGGGGGGGGAAAACAAGAGAGAGAAGAAGAAGAAGAAGAGAAGAGAAGAAGAAGCAAGAAGAAGAAGAAGAAAGAAAGAAAGAAAAAAAAAAGAAAGAAGAAAAACAAAAC AAATTTTTTTTTTAAAAATTTTTTTAAAAAA SEQ ID NO: 204 The polynucleotide sequence of all 16 exons 1-16 of CDH1 arranged in order SEQ ID NO: 205 Translated polypeptide sequence of exon 10 of CDH1 MEQKI SEQ ID NO: 206 Translated polypeptide sequence of exon 11 of CDH1 YRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDN SEQ ID NO: 207 Translated polypeptide sequence of exon 12 of CDH1 SPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADDLPPNTSPFTAELTHGASANWTIQYNDP SEQ ID NO: 208 Translated polypeptide sequence of exon 13 of CDH1 QESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCDCEGAAGVCRKAQPVEAGLQIPAILGILGGILALL SEQ ID NO: 209 Translated polypeptide sequence of exon 14 of CDH1 LILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQ SEQ ID NO: 210 Translated polypeptide sequence of exon 15 of CDH1 DFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDE SEQ ID NO: 211 Translated polypeptide sequence of exon 16 of CDH1 NLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD SEQ ID NO: 212 CDH1 polypeptide sequence MEQKITYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDNGSPVATGTGTLLLILSDVNDNAPIPEPRTIFFCERNPKPQVINIIDADLPPNTSPFTAELTHGASANWTIQYNDPTQESIILKPKMALEVGDYKINLKLMDNQNKDQVTTLEVSVCCDCEGAAGVCKAQPVEAGLQ IPAILGILGGILALLILILLLLLFLRRRAVVKEPLLPPEDDTRDNVYYYDEEGGGEEDQDFDLSQLHRGLDARPEVTRNDVAPTLMSVPRYLPRPANPDEIGNFIDENLKAADTDPTAPPYDSLLVFDYEGSGSEAASLSSLNSSESDKDQDYDYLNEWGNRFKKLADMYGGGEDD SEQ ID NO: 213 The polynucleotide sequence of all 11 exons 1-11 of CDH1 arranged in order SEQ ID NO: 214 Translated polypeptide sequence of exons 1-11 MEQKIYRIWRDTANWLEINPDTGAISTRAELDREDFEHVKNSTYTALIIATDN CXADR sequence informationSEQ ID NO: 215 CXADR exon 1 sequence at 5' of CXADR-NRG1 fusion ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGG SEQ ID NO: 216 NRG1 exon 2 sequence at 3' of CXADR-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO: 217 CXADR-NRG1 polynucleotide sequence ATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACT SEQ ID NO: 218 CXADR-NRG1 polypeptide sequence MALLLCFVLLCGVVALPPRLKEMKSQESAAGSK SEQ ID NO:219 CXADR exon 1 AGTCGGGAGCGCGCGAGGCGCGGGGAGCCTGGGACCAGGAGCGAGAGCCGCCTACCTGCAGCCGCCGCCCACGGCACGGCAGCACCATGGCGCTCCTGCTGTGCTTCGTGCTCCTGTGCGGAGTAGTGG SEQ ID NO:220 CXADR exon 2 ATTTCGCCAGAAGTTTTGAGTATCACTACTCCTGAAGAGATGATTGAAAAAAGCCAAAGGGGAAACTGCCTATCTGCCATGCAAATTTACGCTTAGTCCCGAAGACCAGGGACCGCTGGACATCGAGTGGCTGATATCACCAGCTGATAATCAGAAGGTGGATCAAGTG SEQ ID NO:221 CXADR exon 3 ATTATTTTATATTCTGGAGACAAAATTTATGATGACTACTATCCAGATCTGAAAGGCCGAGTACATTTTACGAGTAATGATCTCAAATCTGGTGATGCATCAATAAATGTAACGAATTTACAACTGTCAGATATTGGCACATATCAGTGCAAAGTGAAAAAAGCTCCTGGTGTTGCAAATAAGAAGATTCATCTGGTAGTTCTTG SEQ ID NO:222 CXADR exon 4 TTAAGCCTTCAGGTGCGAGATGTTACGTTGATGGATCTGAAGAAATTGGAAGTGACTTTAAGATAAAATGTGAACCAAAGAAGGTTCACTTCCATTACAGTATGAGTGGCAAAAATTGTCTGACTCACAGAAAATGCCCACTTCATGGTTAGCAG SEQ ID NO:223 CXADR exon 5 GGAAGATGTGCCACCTCCAAAGAGCCGTACGTCCACTGCCAGAAGCTACATCGGCAGTAATCATTCATCCCTGGGGTCCATGTCTCCTTCCAACATGGAAGGATATTCCAAAGACTCAGTATAACCAAGTACCAAGTGAAGACTTTGAACGCACTCCTCAGGTCCCACCTGCTAAGGTAGCTGCCCCTAATCTAAGTCGAATGGGTGCGATTCCTGTG ATGATTCCCAGCACAGAGCAAGGATGGGTCTATAGTATAGAGCCTCCATATGTCTCATCTGTGCTCTCCGTGTTCCTTTCCTTTTTTTGATATATGAAAACCTATTCTGGTCTAAATTGTGTTACTAGCCTCAAAATACATCAAAAAATAAGTTAATCAGGAACTTTTTAAAAATTTTTGTTTGGTTATATCGAAATAGTTACAGGCACTAAAGTTA GTAAAGAAAAGTTTACCATCTGAAAAAGCTGGATTTTCTTTAAGAGGTTGATTATAAAGTTTTTCTAAATTTATCAGTACCTAAGTAAGATGTAGCGCTTTGAATATGAAATCATAGGTGAAGACATGGGTGAACTTACTTGCATACCAAGTTGATACTTGAATAACCATCTGAAAGTGGTACTTGATCATTTTTACCATTATTTTTAGGATGTGTATTTCATTTATTTATG GCCCACCAGTCTCCCCCAAATTAGTACAGAAATATCCATGACAAAATTACTTACGTATGTTTGTACTTGGTTTTCAGCTCCTTTGAAAACTCTGTGTTTGGAATATCCTAAAAACATAGAAAACACTACAGTGGTTTAGAAAATTACTAATTTTAACTTCTAAGTCATTCATAAACCTTGTCTATGAAATGACTTCTTAAATATTTAGTTGATAGACTGCTACAGGTAATAGG GACTTAGCAAGCTCTTTTATATGCTAAAGGAGCATCTATCAGATTAAGTTAGAACATTTGCTGTCAGCCACATATTGAGATGACACTAGGTGCAATAGCAGGGATAGATTTTGTTGGTGAGTAGTCTCATGCCTTGAGATCTGTGGTGGTCTTCAAAATGGTGGCCAGCCAGATCAAGGATGTAGTATCTCATAGTTCCCAAGGTGATATTTTTCTTATTAGAAAAATAT TATAACTCATTTGTTGTTGTTTGACACTTATAGATTGAAATTTCCTAATTTATTCTAAATTTTAAGTGGTTCTTTGGTTCCAGTGCTTTATGTTGTTGTTGTTTTTTGGATGGTGTTACATATTATATGTTCTAGAAACATGTAATCCTAAATTTACCCTCTTGAATATAATTCCCTTGGATGATATTTTTTTATCATAAATGCAGAATAATCAAATACATTTTAAGCAAGTTAA GTGTCCTCCAATTCTGTATTCCAGACTTGGGAGGATGTACAGTTGCTGTTGTGTGATCAAACATGTCTCTGTGTAGTTCAGCAAATCAAGCTGAGCTTTGAAAAAGTTTGTCTTAGTTTTGTGAAGGTGATTTATTCTTAAAAAAAAAAAAGAAAAAAAAAAAAAGATAAGAAGGAGGAGTAAAGGGACTACTCCTCCTTGCCAAATGTGCTAAATATCATTTTA GGAGAAAGAAAGTGGGTTTATTGTATTTCCCCTTAAGATTGTGAGGGAGTGTGGATACAGTAGAATGAGCCAACAGTTTCTTTAATAAATACGGTCTGCAATAAATTATTTCACTAGCTCTAAAACCTTTCCCTAGATTTAGTAGGGAGTTGGTTTCTGTTAATATCTTGGGTGCTGTGGTGGTAAATGCTATATTATGAACGGTGGCATGTATTTACAGTT AGAGTATTGTGTGTACACTTTTTTAATGGTAAACTTAAGCTGAATGTGTAATGGATTTGTGTATAGTTTTACATATTTGGAAGCATTTTAAAAACAGGTTTTAAACCTTTATGTAAAATTACTTTATACTCGTGTTAACATTTTCATCTGTGCCTTTTGGTAATTTAATTCTATTATGAATTTCTGGTGCCTATGAGCTAGCTATCACCTACCTGAAAGGTGCTTAGAGGTGAAG GTACTGTTTCTAAAAAACACATCACTGTGATACCTTTTCTATCCTCACATTTTCAAGCTTGCCTCTTTTCTGTTCTTTGTGGATATAACTTAAGCAATTGTGTTATTCATAAAGGTTTAGAAATTTCAATATTTCCCAACACTCTATGTTTCTGATTTTATAACAGTAGCCATTTTTGAAAGTCAGATGTTTGGCCTGTTTTATATGAATAAAGTTTTATAAAATATTATAAAAAAT AAGTAAATAAACAGAACATTAATAATAAAAGTTTTGGTTCTTCCTATTCCTGACTTTCATATAATGAAAATTATCCTATTGATCTAAGTAGAAGTTATCATAGAAAGTGGACACGTATAAGACTTTTCCTTCCTTTTTTTTTTAATAACATATGAGGAACAAGACTTCTCTTCCCATACTTCATATTTTGAGGACATTGTTTTAAAGGCTTATGTCTCACTGTAAAATCTGTCACT GCCAAATAGTACCAATACGTTTTCAAGTAGTTCTCACTGATAATTTAGTTGAACCAGAGATCAAATATTTGCTCCCGAATTACTACTGGTAATCAAGTAGTTGAACAAAAAATTACTAAAGCATTTCCGTTAGATCAGTCAAGGACAGTACTGCATCTTTTTTTTTTTTTTTTTTTTTTAAGACGGAGTCTCGGTCTGTCACCCAGGCTGGAGTGCAGTGGCGGGATCTCG GCTCACTGCAAGCTCCGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGCAGCTGGGACTACAGGCTCCCATCACCACGCTCGGCTAAGTTTTTTGTAATTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCTCCCAAAGTGCTGGGATTACAGGCGTGAACCACTGCACCC GGCCCAGTACTGCATCTTAACAGCAAAGCCATTTTATTCTACTTTATAACTGAGAGACTTGATACCATCCATCTCTTTAGGTTACAGAGGATAATTTGAAGAGAAATGTTACTGTAGAATATAGTTCTGTACTTTTTTTTTTTTTTAAGAGATAGGGTTTCACTATTGCTCAGGCTGGTCTCAAACTGCTGGGCTCAGGAGATCCCTCCTGCCTTGGCCTCCCAATT GCTGGGATTACAGGTGTGAGCCGCAGCATCCAGCCAGTTCTGTACTTTGAATATGGAGTAGTTTACAGCTATTTTTTTTCTTACTGGTAATCTTAACTAATATGATTCCCTTGTTAGAGCCTCTCACTCCCCACCCCCAAAAATGTCTACTATTCATGACAGTAACCAATTATTCTGGACAAATTGCTTCTTTTTAATTTGAGCTATCTGCCATGGACTTTC TAAAATGGAAACACAGCCTGAGTGTATCTTAGGGAGAGTTTGATTGAAAAAATCCAAATCACTATCCATATAGATCATGGATATAAAGAGATACCTGATTTTTATTAAAAAGATACTTTTTTCAAATTTAAGAGTTAATCTTGGAAATTTGGAACAAGTAAAAGGGGCAAGTAAACCTTTTGATGAAATATAAAAGGAACTCATTGCATGAAGTTGACTATCAAATTCTGTG ATGTGTGGCTTCTTAAAAATATTCTCAGTGTCTTTTGTGTGCGTGCAGCATGTACATTTGATGTTATGTGAATGTTGAGTTTTTCTTTCTAATTTTCACTTCAGCAGTGTTTAGGCTTTCAGATGCCTTATTCCAGTGTGAACAGAAAAAGTTCATATTTTATGTGGTTAATGCTTTGATGTGTCACATAAAGAGTAGTTTTGTAGAAAATGTTGGCACAATTTTAACTT CTTAGTGGCTTGTGACATTATATATTATATATATATGTACATATCTTTAACATTCCTGTGTTTAGTAGTGTAAATGTTCTGGGCAAGTTTTTAATATTTTGAATGCCTTTGGATATTCCAGCAATAAAGGCATCATGTTCTGCAATAGGATTTCTTACTCATTTACCTATTTTAACACTAAAATAGACCAAACTGAGCACAAATTCCTTTATAAATGTTATAGAAG CAGGGAAGAATAATAAACACATTTGTGAATTGTGGTTCAGTTTTTATCTTTAGGGAAGGCTGATCATTTATCTTTATAGCAGATAACCCCAGCCTCTTATTCATTATGGTTAACTTTTATAAATTTATCTTTTTTATAATTTAAGAATATAGTACATATCAGTTGGGTTTGGTTTTGGTCATCGAGACTAAAAGCTCCATCAAAACAGAACTTTGTGTTTTCTGCTAAC TTATTTAATGACACAAGTTTTAAGAGAACCACAATTCATTGATTCACTTATTCTTTTCCCTAATTGTGAATTTTAGTGATAAATACACCTGTACTACTGAGGAAAATATTCTGACACTTCACGTGTGCAAAGTATAGAACTGACAGTGTCAGTTTCAGATTTTGTATGTACGATTTCTGGCTTATATCCAATGGTGCAGATTTTGAAATTTGTAAGAACAAAATTTGTTA AGAAAAACAACTTGCTCTAGTTTTGTGACCTTGTGTACTTTTGAAATAAAATCAAGAAAGCAGTTCTCTGCCTC SEQ ID NO:224 The polynucleotide sequence of all five exons 1-5 of CXADR arranged in order SEQ ID NO:225 Translated polypeptide sequence of exon 1 of CXADR MALLLCFVLLCGVV SEQ ID NO:226 Translated polypeptide sequence of exon 2 of CXADR FARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQV SEQ ID NO:227 Translated polypeptide sequence of exon 3 of CXADR IILYSGDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVL SEQ ID NO:228 Translated polypeptide sequence of exon 4 of CXADR KPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQKLSDSQKMPTSWLA SEQ ID NO:229 Translated polypeptide sequence of exon 5 of CXADR KMCHLQRAVRPLPEATSAVIIHPWGPCLLPTWKDIPRLSITKYQVKTLNALLRVRLSHLLR SEQ ID NO:230 CDAXR polypeptide sequence MALLLCFVLLCGVVDFARSLSITTPEEMIEKAKGETAYLPCKFTLSPEDQGPLDIEWLISPADNQKVDQVIILYSGDKIYDDYYPDLKGRVHFTSNDLKSGDASINVTNLQLSDIGTYQCKVKKAPGVANKKIHLVVLVKPSGARCYVDGSEEIGSDFKIKCEPKEGSLPLQYEWQ KLSDSQKMPTSWLAGKMCHLQRAVRPLPEATSAVIIHPWGPCLLPTWKDIPRLSITKYQVKTLNALLRVRLSHLLR GTF2E2 sequence informationSEQ ID NO: 231 GTF2E2 exon 2 sequence at 5' of GTF2E2-NRG1 fusion GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATTCTTCTGAGTCATCATCATCATCGTCAAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTG SEQ ID NO: 232 NRG1 exon 2 sequence at 3' of GTF2E2-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCGAATTCAAGTGGTTCAAGAATGGGAATGA SEQ ID NO: 233 GTF2E2-NRG1 polynucleotide sequence GGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATTCTTCTGAGTCATCATCATCATCGTCAAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGA ATACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGA SEQ ID NO: 234 GTF2E2-NRG1 polypeptide sequence ELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNSALPPRLKEMKSQESAAAGSKLVLRCETSSEYSSLRFKWFKNGN SEQ ID NO:235 Exon 1 of GTF2E2 ACTGAGCTCCTAGCACCCGATCGGGAAGTGGCGGGCGGAGTCCCGGGTCCAGTCGCCGCCTCAGCTACCGCCGCTGCCGCCGCCGCCGCCGCCACCGCCAGTGGTGAGACCCCGACCTGGCGGGTCAGCGCTGGGCGTGCGTGCGGGCAGGCGGGGGCGCTGACGAGAAGCAGGAAGAGGGTGCAGTGCCGGCGTGGGCG GCCGGCCGAGGCGGAGGCGCAGGAAGGGGGCGGCGAGTCGTGCGAGGCTGCCCTTCTCACTCAG SEQ ID NO:236 Exon 2 of GTF2E2 CATTATGGATCCAAGCCTGTTGAGAGAAAGGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCTGTAGTAGAAAAACGTTCAGCATCCTTCTGAGTCATCATCATCATCGTCAAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGCTCTAAACAAAATTCTG SEQ ID NO:237 Exon 3 of GTF2E2 ATCATAGCAATGGATCATTTAACTTGAAAGCTTTGTCAGGAAGCTCTGGATATAAGTTTGGTGTTCTTGCTAAGATTGTGAATTACATGAAG SEQ ID NO:238 Exon 4 of GTF2E2 ACACGGCATCAGCGAGGAGATACGCATCCTCTAACCTTAGATGAAATTTTGGATGAAACACAACATTTAGATATTGGACTCAAGCAGAAACAATGGCTAATGACTGAG SEQ ID NO:239 Exon 5 of GTF2E2 GCTTTAGTCAAACAATCCCAAAATTGAAGTAATAGATGGGAAGTATGCTTTCAAGCCCAAGTACAACGTGAGAGATAAGAAGGCCCTACTTAGGCTCTTAGATCAGCATGACCAGCGAGGATTAGGAGGAATTCTTTTAGAAGACATAGAAGAAGCACTGCCCAATTCCCAGAAAGCTGTCAAG SEQ ID NO:240 Exon 6 of GTF2E2 GCTTTGGGGGACCAGATACTATTTGTAAATCGTCCCGATAAGAAGAAAATACTTTTCTTCAATGATAAGAGCTGTCAGTTTTCTGTGGATGAAG SEQ ID NO:241 Exon 7 of GTF2E2 AATTTCAGAAACTGTGGAGGAGTGTCACTGTAGATTCCATGGACGAGGAGAAAATTGAAGAATATCTGAAGCGACAGGGTATTTCTTCCATGCAGGAATCTGGACCAAAAGAAAGTG SEQ ID NO:242 Exon 8 of GTF2E2 GCCCCTATTCAGAGAAGGAAAAAGCCTGCTTCACAGAAAAAGCGACGCTTTAAGACTCATAACGAACACTTGGCTGGAGTGCTGAAGGATTACTCTGACATTACTTCCAGCAAATAGGGAACAGTTTTGCCCTGGAACAGAGTTACAGATACACAATCAAGGTGTTCTTGCTGATGCTCGGGGTCTGAAGACTGTCTTCCTATCTGCTTCTTGCGGCTGAG GAGAGGAGCAGTTCAGTTTACAAAACAAGTGCAAATTACCAAACTCAAAGCTTATTTGAGTAGAATGGGCTCATGGGCAATGTGATGTTCCCTGTTAAACCTTCTGTTACTCCCTGGGAGAAAGGCGCTGAGCGTGGCATGCAGGTGTCTTTGCTGTGTTTTTCTCACTTCTAAATGGTTCCTGGTTCCTTTCTTCCTCGTTTGTTACTTTAGAGCAAGTTTGC CCATAGTCTTGAATGCAATATTTGTTTATTCCAAAAGAACATATTTATAATAAAATCACTGTAGAAGGATTTTTAAGATGTTAGTGAATTCTGTTTCTTTTCATTCTCGGAAATGGCAGGAAGCAGCTCCAGTCTCTGGGTCACGTGCTGGGGATGTGATGAAGCCTGCAGTCTGCACTGTGTTGCTGAGCACATGGATTTCACCACTGGAAC ACAGGTGTGCTGCTTGTTAGCAAGCAGAGCAATAAAGATGTGCTGGATGTCA SEQ ID NO:243 The polynucleotide sequence of all eight exons 1-8 of GTF2E2 arranged in order SEQ ID NO:244 Translated polypeptide sequence of exon 2 of GTF2E2 MDPSLLRERELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNS SEQ ID NO:245 Translated polypeptide sequence of exon 3 of GTF2E2 HSNGSFNLKALSGSSGYKFGVLAKIVNYMK SEQ ID NO:246 Translated polypeptide sequence of exon 4 of GTF2E2 TRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTE SEQ ID NO:247 Translated polypeptide sequence of exon 5 of GTF2E2 ALVNNPKIEVIDGKYAFKPKYNVRDKKALLRLLDQHDQRGLGGILLEDIEEALPNSQKAVK SEQ ID NO:248 Translated polypeptide sequence of exon 6 of GTF2E2 ALGDQILFVNRPDKKKILFFNDKSCQFSVDE SEQ ID NO:249 Translated polypeptide sequence of exon 7 of GTF2E2 FQKLWRSVTVDSMDEEKIEEYLKRQGISSMQESGPKKV SEQ ID NO:250 Translated polypeptide sequence of exon 8 of GTF2E2 APIQRRKKPASQKKRRFKTHNEHLAGVLKDYSDITSSK SEQ ID NO:251 Full protein sequence MDPSLLRERELFKKRALSTPVVEKRSASSESSSSSSKKKKTKVEHGGSSGSKQNSDHSNGSFNLKALSGSSGYKFGVLAKIVNYMKTRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTEALVNNPKIEVIDGKYAFKPKYNVRDKKALLRLLDQHDQRGLGGILLEDIE EALPNSQKAVKALGDQILFVNRPDKKKILFFNDKSCQFSVDEEFQKLWRSVTVDSMDEEKIEEYLKRQGISSMQESGPKKVAPIQRRKKPASQKKRRFKTHNEHLAGVLKDYSDITSSK SEQ ID NO:252 exons 1-2 ACTGAGCTCCTAGCACCCGATCGGGAAGTGGCGGGCGGAGTCCCGGGTCCAGTCGCCGCCTCAGCTACCGCCGCTGCCGCCGCCGCCGCCGCCACCGCCAGTGGTGAGACCCCGACCTGGCGGGTCAGCGCTGGGCGTGCGTGCGGGCAGGCGGGGGCGCTGACGAGAAGCAGGAAGAGGGTGCAGTGCCGGCGTGGGCG GCCGGCCGAGGCGGAGGCGCAGGAAGGGGGCGGCGAGTCGTGCGAGGCTGCCCTTCTCACTCAGCATTATGGATCCAAGCCTGTTGAGAGAAAGGGAGCTGTTCAAAAAACGAGCTCTTTCTACTCCCTGTAGTAGAAAAACGTTCAGCATCTTCTGAGTCATCATCATCATCGTCAAAAGAAGAAGAAAACAAAGGTAGAACATGGAGGATCGTCAGGC TCTAAACAAAATTCTG CSMD1 sequence informationSEQ ID NO: 253 CSMD1 exon 23 sequence ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCCAAGGTTTTCAACTCACCTATACCA at 5' of CSMD1-NRG1 fusion 5' SEQ ID NO: 254 NRG1 exon 6 sequence CTACATCTAC at 3' of CSMD1-NRG1 fusion ATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT SEQ ID NO: 255 CSMD1-NRG1 Polynucleotide sequence ATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACT SEQ ID NO: 256 CSMD1-NRG1 polypeptide sequence ILNSTSNHLWLEFNTNGSDTDQGF QLTYTTTSTSTTGTSHLVKCAEKEKT SEQ ID NO:257 CSMD1 Exon 1 CTCCACGGCAGCGGCTCCTTGTGCCACTAGCAGCCCTTCTTCTGCGCTCTCCGCCTTTTCTCTCTAGACTGGATCTCTCCCTCCCCCCCGCGCCCCCCTCCCCGCATCTCCCACTCGCTGGCTCTCTCTCCAGCTGCCTCCTCTCCAGGTCTCTCCTGG CTGCGCGCGCTCCTCTCCCCGCTTCTCCCCCTCCCGCAGCCTCGCCGCCTTGGTGCCTTCCTGCCCGGCTCGGCCGGCGCTCGTCCCCGGCCCCGGCCCCGCCAGCCCGGGTCTCCGCGCTCGGAGCAGCTCAGCCCTGCAGTGGCTCGGGACCCGATGCTATGAGAGGGAAGCGAGCCGGGCGCCCAGACCTTCAGGAGGCGTCGGATGCGC GGCGGGTCTTGGGACCGGGCTCCTCTCTCCGGCTCGCCTTGCCCTCGGGTGATTATTTGGCTCCGCTCATAGCCCTGCCTTCCTCGGAGGAGCCATCGGTGTCGCGTGCGTGTGGAGTATCTGCAGACATGACTGCGTGGAGGAGATTCCAGTCGCTGCTCCTGCTTCTCGGGCTGCTGGTGCTGTGCGCGAGGCTCCTCACTGCAGCGAAGG SEQ ID NO: 258 CSMD1 exon 2 GTCAGAACTGTGGAGGCTTAGTCCAGGGTCCCAATGGCACTATTGAGAGCCCAGGGTTTCCTCACGGGTATCCGAACTATGCCAACTGCACCTGGATCATCATCACGGGCGAGCGCAATAGGATAGTTCAGTTGTCCTTCCATACCTTTGCTCTTGAAGAAGATTTTGATATTTTATCATTTACGATGGACAGC CTCAACAAGGGAATTTAAAAGTGAG SEQ ID NO: 259 CSMD1 exons 3 ATTATCGGGATTTCAGCTGCCCTCCTCTATAGTGAGTACCAGGATCTATCTCACTCTGTGGTTCACGACAGACTTCGCTGTGAGTGCCCAAGGTTTCAAAGCATTATATGAAG SEQ ID NO: 260 CSMD1 Exon 4 TTTTACCTAGCCACACTTGTGGAAATCCTGGAGAAATCCTGAAAGGAGTTCTGCATGGAACGAGATTCAACATAG GAGACAAAATCCGGTACAGCTGCCTCCCTGGCTACATCTTGGAAGGCCACGCCATCCTGACCTGCATCGTCAGCCCAGGAAATGGTGCATCGTGGGACTTCCCAGCTCCCTTTTTGCAGAG SEQ ID NO: 261 CSMD1 exon 5 CTGAGGGAGCCTGCGGAGGAACCTTACGCGGGACCAGCAGCTCCATCCTCCAGCCCGCACTTCCCTTCA GAGTACGAGAACAACGCGGACTGCACCTGGACCATTCTGGCTGAGCCCGGGGACACCATTGCGCTGGTCTTCACTGACTTTCAGCTAGAAGAAGGATATGATTTCTTAGAGATCAGTGGCACGGAAGCTCCATCCATATG SEQ ID NO: 262 CSMD1 Exon 6 GCTAACTGGCATGAACCTCCCCTCTCCAGTTATTCAGTAGCAAGAATTG GCTACGACTCCATTTCACCTCTGACAGCAACCACCGACGCAAAGGATTTAACGCTCAGTTCCAAG SEQ ID NO: 263 CSMD1 exon 7 TGAAAAAGGCGATTGAGTTGAAGTCAAGAGGAGTCAAGATGCTGCCCAGCAAGGATGGAAGCCATAAAAACTCTGTCT SEQ ID NO: 264 CSMD1 exon 8 TGAGCCAAGGAGGTGTTGCATTGGTCTCTGACATGTGTCCAGATCCTGGGATTCCAGAAAATGGTAGAAGAGCAGGTTCCGACTTCAG SEQ ID NO: 265 CSMD1 exon 9 GGTTGGTGCAAATGTACAGTTTTTCATGTGAGGACAATTACGTGCTCCAGGGATCTAAAAGCATC ACCTGTCAGAGAGTTACAGACGCTCGCTGCTTGGAGTGACCACAGGCCCATCTGCCGAG SEQ ID NO: 266 CSMD1 exon 10 CGAGAACATGTGGATCCAATCTGCGTGGGCCCAGCGGCGTCATTACCTCCCCCTAATTATCCGGTTCAGTATGAAGATAATGCACACTGTGTGTGGGTCATCACCACCACCGACCCGGACAAG SE Q ID NO: 267 CSMD1 Exon 11 GTCATCAAGCTTGCCTTTGAAGAGTTTGAGCTGGAGCGAGGCTATGACACCCTGACGGTTGGTGATGCTGGGAAGGTGGGAGACACCCAGATCGGTCTTGTACGT SEQ ID NO:268 CSMD1 exon 12 GCTCACGGGATCCAGTGTTCCTGACCTCATTGTGAGCATGAGCAACCAGATGTGGCTACATCTGCAGTCGGATGATAGCATTGGCTCACCTGGGTTTAAAGCTGTTTACCAAG SEQ ID NO:269 CSMD1 exon 13 AAATTGAAAAGGGAGGGTGTGGGGATCCTGGAATCCCCGCCT ATGGGAAGCGGACGGGCAGCAGTTTCCTCCATGGAGATACACTCACCTTTGAATGCCCGGCGGCCTTTGAGCTGGTGGGGGAGAGAGTTATCACCTGTCAGCAGAACAATCAGTGGTCTGGCAACAAGCCCAGCTGTGTAT SEQ ID NO: 270 CSMD1 exon 14 TTTCATGTTTCTTCAACTTTACGGCATCATCTGGGATTA TTCTGTCACCCAAATTATCCAGAGGAATATGGGAACAACATGAACTGTGTCTGGTTGATTATCTCGGAGCCAGGAAGTCGAATTCACCTAATCTTTAATGATTTTGATGTTGAGCCTCAGTTTGACTTTCTCGCGGTCAAGGATGATGGCATTTCTGACATAACTGTCCTGGGTACTTTTTCTGGCAATGAAGTGCCTTCCCAGCTGGCCAGCAGTGG GCATATAGTTCGCTTGGAATTTCAGTCTGACCATTCCACTACTGGCAGAGGGTTCAACATCACTTACACCA SEQ ID NO: 271 CSMD1 exon 15 CATTTGGTCAGAATGAGTGCCATGATCCTGGCATTCCTATAAACGGACGACGTTTTGGTGACAGGTTTCTACTCGGGAGCTCGGTTCTTTCCACTGTGATGATGGCTTTGTCA AGACCCAGGGATCCGAGTCCATTACCTGCATACTGCAAGACGGGAACGTGGTCTGGAGCTCCACCGTGCCCCGCTGTGAAG SEQ ID NO: 272 CSMD1 exon 16 CTCCATGTGGTGGACATCTGACAGCGTCCAGCGGAGTCATTTTGCCTCCTGGATGGCCAGGATATTATAAGGATTCTTTACATTGTGAATGGATAATTGAAGCAAAACCAGGCCACTCTATCAAAATAACTTTTGACAG SEQ ID NO: 273 CSMD1 exon 17 ATTTCAGACAGAGGTCAATT ATGACACCTTGGAGGTCAGAGATGGGCCAGCCAGTTCGTCCCCACTGATCGGCGAGTACCACGGCACCCAGGCACCCAGTTCCTCATCAGCACCGGGAACTTCATGTACCTGCTGTTCACCACTGACAACAGCCGCTCCAGCATCGGCTTCCTCATCCACTATGAGA SEQ ID NO: 274 CSMD1 exon 18 GTGTGACGCTTGAGTCGG ATTCCTGCCTGGACCGGGCATCCCTGTGAACGGCCATCGCCACGGTGGAGACTTTGGCATCAGGTCCACAGTGACTTTCAGCTGTGACCCGGGGTACACACTAAGTGACGACGAGCCCCTCGTCTGTGAGAGGAACCACCAGTGGAACCACGCCTTGCCCAGCTGCGACG SEQ ID NO: 275 CSMD1 exon 19 CTCTATGT GGAGGCTACATCCAAGGGAAGAGTGGAACAGTCCTTTCTCCTGGGTTTCCAGATTTTTTATCCAAACTTCTCTAAACTGCACGTGGACCATTGAAGTGTCTCATGGGAAAG SEQ ID NO: 276 CSMD1 Exon 20 GAGTTCAAATGATCTTTCACACCTTTCATCTTGAGAGTTCCCACGACTATTTACTGATCACAGAGGATGGAAGTTTTTCCGAGCCCGTTGCCAGGCTCACCGGGTCGGTGTTGCCTCATACGATCAAGGCAGGCCTGTTTGGAAACTTCACTGCCCAGCTTCGGTTTATATCAGACTTCTCAATTTCGTACGAGGGCTCAATATCACAT TTTCAG SEQ ID NO: 277 CSMD1 exon 21 AATATGACCTGGAGCCATGTGATGATCCTGGAGTCCCTGCCTTCAGCCGAAGAATTGGTTTTCACTTTGGTGTGGGAGACTCTCTGACGTTTTCCTGCTTCCTGGGATATCGTTTAGAAGGTGCCACCAAGCTTACCTGCCTGGGTGGGGCCGCCGTGTGTGGAGTGCACCTGC CAAGGTGTGTGG SEQ ID NO: 278 CSMD1 exon 22 CCGAATGTGGAGCAAGTGTCAAAGGAAATGAAGGAACATTACTGTCTCCAAATTTTCCATCCAAATTATGATAATAACCATGAGTGTATCTATAAAATAGAAACAGAAGCCGGCAAGGGCATCCACCTTAGAACACGAAGCTTCCAGCTGTTTGAAGGAGATACTCTAAAG SEQ ID NO: 279 CSMD1 exon 23 GTATATGATGGAAAAGACAGTTCCTCACGTCCACTGGGCACGTTCACTAAAAATGAACTTCTGGGGCTGATCCTAAACAGCACATCCAATCACCTGTGGCTAGAGTTCAACACCAATGGATCTGACACCGACCAAGGTTTTCAACTCACCTATACCA SEQ ID NO: 280 CSMD1 exon Exon 24 GTTTTGATCTGGTAAAATGTGAGGATCCGGGCATCCCTAACTACGGCTATAGGATCCGTGATGAAGGCCACTTTACCGACACTGTAGTTCTGTACAGTTGCAACCCGGGGTACGCCATGCATGGCAGCAACACCCCTGACCTGTTTGAGTGGAGACAGGAGAGTGTGGGACAAACCACTACCTTCGTGCATAG SEQ ID NO: 2 81 The polynucleotide sequence of all 47 exons 24-70 of CSMD1 in sequence SEQ ID NO: 282 The polynucleotide sequence of all 70 exons 1-70 of CSMD1 in sequence Acid sequence SEQ ID NO: 283 Translated polypeptide sequence of exon 1 of CSMD1 MTAWRRFQSLLLLLGLLVLCARLLTAAK SEQ ID NO: 284 Translated polypeptide sequence of exon 2 of CSMD1 QNCGGLVQGPNGTIESPGFPHGYPNYANCTWIIITGERNRIQLSFHTFALEEDFDILSVYDGQPQQGNLKV SEQ ID NO: 285 The translated polypeptide sequence of CSMD1 exon 3 LSGFQLPSSIVSTGSILTLWFTTDFAVSAQGFKALYE SEQ ID NO:286 Translated polypeptide sequence of exon 4 of CSMD1 LPSHTCGNPGEILKGVLHGTRFNIGDKIRYSCLPGYILEGHAILTCIVSPGNGASWDFPAPFCR SEQ ID NO:287 Translated polypeptide sequence of exon 5 of CSMD1 EGACGGTLRGTSSSISPHFPSEYENNADCTWTILAEPGDTIALVFTDFQLEEGYDF LEISGTEAPSI SEQ ID NO:288 Translated polypeptide sequence of exon 6 of CSMD1 LTGMNLPSPVISSKNWLRLHFTSDSNHRRKGFNAQFQ SEQ ID NO: 289 Translated polypeptide sequence KKAIELKSRGVKMLPSKDGSHKNSV of exon 7 of CSMD1 LAAWSDHRPICR SEQ ID NO: 292 Translated polypeptide sequence of exon 10 of CSMD1 RTCGSNLRGPSGVITSPNYPVQYEDNAHCVWVITTTPDDK SEQ ID NO: 293 Translated polypeptide sequence of exon 11 of CSMD1 VIKLAFEEFELERGYDTLTVGDAGKVGDTRSVLY SEQ ID NO: 294 Translated polypeptide sequence of exon 12 of CSMD1 LTGSSVPDLIVSMSN QMWLHLQSDDSIGSPGFKAVYQ SEQ ID NO:295 CSMD1 Translated polypeptide sequence of exon 13 IEKGGCGDPGIPAYGKRTGSSFLHGDTTLTFECPAAFELVGERVITCQQNNQWSGNKPSCV SEQ ID NO: 296 Translated polypeptide sequence of CSMD1 exon 14 SCFFNFTASSGIILSPNYPEEYGNNMNCVWLIISEPGSRIHLIFNDFDVEPQFDFLAVKDDGISDITVLGTFSGNEV PSQLASSGHIVRLEFQSDHSTTGRGFNITYT SEQ ID NO: 297 Translated polypeptide sequence of CSMD1 exon 15 FGQNECHDPGIPINGRRFGDRFLLGSSVSFHCDDGFVKTQGSESITCILQDGNVVWSSTVPRCE SEQ ID NO: 298 CSMD1 Translated polypeptide sequence of exon 16 PCGGHLTASSGVILPPGWPGYYKDSLHCEWIIEAKPGHSIKITFD SEQ ID NO:299 Translated polypeptide sequence of exon 17 of CSMD1 FQTEVNYDTLEVRDGPASSSPLIGEYHGTQAPQFLISTGNFMYLLFTTDNRSRSSIGFLIHYE SEQ ID NO:300 Exon 18 of CSMD1 Translated polypeptide sequence VTLESDSCLDPGIPVNGHRHGGDFGIRSTVTFSCDPGYTLSDDEPLVCERNHQWNHALPSCD SEQ ID NO: 301 CSMD1 exon Translated polypeptide sequence of 19 LCGGYIQGKSGTVLSPGPDFYPNSLNCTWTIEVSHGK SEQ ID NO: 302 Translated polypeptide sequence of CSMD1 exon 20 VQMIFHTFHLESSHDYLLITEDGSFSEPVARLTGSVLPHTIKAGLFGNFTAQLRFIDFSISYEGFNITFS SEQ ID NO: 303 CSMD1 exon 2 The translated polypeptide sequence of 1 YDLEPCDDPGVPAFSRRIGFHFGVGDSLTFSCFLGYRLEGATKLTCLGGGRRVWSAPLPRCV SEQ ID NO: 304 CSMD1 exon 22 Translated polypeptide sequence ECGASVKGNEGTLLLSPNFPSNYDNNHECIYKIETEAGKGIHLRTRSFQLFEGDTLK SEQ ID NO: 305 Translated polypeptide sequence of CSMD1 exon 23 VYDGKDSSSRPLGTFTKNELLGLILNSTSNHLWLEFNTNGSDTDQGFQLTYT SEQ ID NO: 306 Translated polypeptide sequence of CSMD1 exon 24 FDLVKCEDPGIPNYGYRIRDEGHFTDTVVLYSCNPGYAMHGSNTTLTCLSGDRRVWDKPLPSCI SEQ ID NO: 307 All from exons 24-70 Translated polypeptide sequence of 47 exons sequenced SEQ ID NO:308 Whole protein sequence-SEQ ID NO:309 Polynucleotide sequence sequenced sequence of all 23 exons 1-23 of CSMD1 SEQ ID NO:310 Sequential translated polypeptide sequence from all 23 exons 1-23 of CSMD1PTN sequence informationSEQ ID NO: 311 PTN exon 4 sequence at 5' of PTN-NRG1 fusion CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAG SEQ ID NO: 312 NRG1 exon 2 sequence at 3' of PTN-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCCAGATT SEQ ID NO: 313 PTN-NRG1 polynucleotide sequence CCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAATACTCCTCTCTCGATT SEQ ID NO: 314 PTN-NRG1 polypeptide sequence RTGSLKRALHNAECQKTVTISKPCGKLTKPKPQALPPRLKEMKSQESAAAGSKLVLRCETSSEYSSLR SEQ ID NO:315 PTN exon 1 AAGGGGAAATAAGGGACAAGAGAGACCCTCTCATATTGTTTTATATTTTTCATACTCAGAAAAGGAAAGAGAAGCCAAACAAAAGGCAGGTAACCCGCGCCTAGGAACCAGACCCGAAACCAAGGAACCAGATCTGAAACCAGGCCTGGGCCTGCCTGACCTAAGCCTGGTAGTAAAAATTCCACCCCTGACCTGACCTGGCAACTGTTGTTATTCAGATTC CAGACATTGTATGGAAGGACACTGTGAAACCTCCCGTTCTGTTCTGTTTCACTCTGACCATCGGTGCTCACAGCCCCTATCACGTACCCCCTGGCTTGCTCAGTCGATCACGACCCTCTCACGTGGACCCCCTTAGAGTTGTTAGCCCTTAAAAGGGACAGAAGTTGAGCACCTGAGGAGCTCAGATTTTAAGACGCTAGGCTGCTGATGCTCCCAGCTGATTAAAGC CACTCCCTTCACTATCTCGGTGTCTCCTGTCCGCGGCTCGTCCTGCTACATTTCTTGGTTCCCTGACCGGCAAGCGAG SEQ ID NO:316 PTN exon 2 AATGCAGGCTCAACAGTACCAGCAGCGTCGAAAATTTGCAGCTGCCTTCTTGGCATTCATTTTCATACTGGCAGCTGTGGATACTGCTGAAGCAGGGAAGAAAGAGAAACCAG SEQ ID NO:317 PTN exon 3 AAAAAAAAGTGAAGAAGTCTGACTGTGGAGAATGGCAGTGGAGTGTGTGTGCCCACCAGTGGAGACTGTGGGCTGGGCACACGGGAGGGCACTCGGACTGGAGCTGAGTGCAAGCAAACCATGAAGACCCAGAGATGTAAGATCCCCCTGCAACTGGAAGAAGCAATTTGGCG SEQ ID NO:318 PTN exon 4 CGGAGTGCAAATACCAGTTCCAGGCCTGGGGAGAATGTGACCTGAACACAGCCCTGAAGACCAGAACTGGAAGTCTGAAGCGAGCCCTGCACAATGCCGAATGCCAGAAGACTGTCACCATCTCCAAGCCCTGTGGCAAACTGACCAAGCCCAAACCTCAAG SEQ ID NO:319 Exon 5 of PTN CAGAATCTAAGAAGAAGAAAAAGGAAGGCAAGAAACAGGAGAAGATGCTGGATTAAAAGATGTCACCTGTGGAACATAAAAAGGACATCAGCAAACAGGATCAGTTAACTATTGCATTTATATGTACCGTAGGCTTTGTATTCAAAAATTATCTATAGCTAAGTACACAATAAGCAAAAAACAAAAAGAAAAGAAAATTTTTGTAGTAGCGTTTTTTAAATGTATACTATAGT ACCAGTAGGGGCTTATAATAAAGGACTGTAATTCTTATTTAGGAAGTTGACTTATAGTACATGATAAATGATAGACAATTGAGGTAAGTTTTTTGAAATTATGTGACATTTTACATTAAATTTTTTTTACATTTTTTGGGCAGCAATTTAAATGTTATGACTATGTAAACTACTTCTCTTGTTAGGTAATTTTTTTCACCTAGATTTTTTTCCCAATTGAGAAAAAATATAC TAAACAAAATAGCAATAAAACATAATCACTCTATTTGAAGAAAATATCTTGTTTTCTGCCAATAGATTTTTTAAAATGTAGTCAGCAAAATGGGGGTGGGGAAGCAGAGCATGTCCTAGTTCAATGTTGACTTTTTTTTTTTAAAGAAAAGCATTAAGACATAAAATTCTTTCACTTTGGCAGAAGCATTTGTTTTCTTGATGAAATTATTTTTCCATCTGAGGAAAAAAA TACTAGGAAAATAAATCAAGGTGATGCTGAAAAAAAAA SEQ ID NO:320 The polynucleotide sequence of all five exons 1-5 of PTN arranged in order SEQ ID NO:321 Translated polypeptide sequence of PTN exon 2 MQAQQYQQQRRKFAAAFLAFIFILAAVDTAEAGKKEKP SEQ ID NO:322 Translated polypeptide sequence of PTN exon 3 KKVKKSDCGEWQWSVCVPTSGDCGLGTREGTRTGAECKQTMKTQRCKIPCNWKKQFG SEQ ID NO:323 Translated polypeptide sequence of PTN exon 4 ECKYQFQAWGECDLNTALKTRTGSLKRALHNAECQKTVTISKPCGKLTKPKPQ SEQ ID NO:324 Exon 5 of PTN ESKKKKKEGKKQEKMLD SEQ ID NO:325 full protein sequence MQAQQYQQQRRKFAAAFLAFIFILAAVDTAEAGKKEKPEKKVKKSDCGEWQWSVCVPTSGDCGLGTREGTRTGAECKQTMKTQRCKIPCNWKKQFGAECKYQFQAWGECDLNTALKTRTGSLKRALHNAECQKTVTISKPCGKLTKPKPQAESKKKKKEGKKQEKMLD SEQ ID NO:326 The polynucleotide sequence of all four exons 1-4 of PTN arranged in order SEQ ID NO:327 Translated polypeptide sequence of all three exons 2-4 of PTN arranged in order ST14 sequence informationSEQ ID NO: 328 ST14 exon 11 sequence at 5' of ST14-NRG1 fusion CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACC SEQ ID NO: 329 NRG1 exon 6 sequence at 3' of ST14-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO: 330 ST14-NRG1 polynucleotide sequence CAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACCCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO: 331 ST14-NRG1 polypeptide sequence NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPTTSTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO:332 Exon 1 of ST14 GTGAGAGCGGAGCTGCAGCCGGAGAAAGAGGAAGAGGGAGAGAGAGCGCGCCAGGGCGAGGGCACCGCCGCCGGTCGGGCGCGCTGGGCCTGCCCGGAATCCCGCCGCCTGCGCCCCGCGCCCCGCGCCCTGCGGGCCATGGGAGCCGCCGCCGGCAGGGACGACGCCTGTGAGACCCGCGAGCGGCCTCGGGGACC ATGGGGAGCGATCGGGCCCGCAAGGGCGGAGGGGGCCCGAAGGACTTCGGCGCGGGACTCAAGTACAACTCCCGGCACGAG SEQ ID NO:333 ST14 exon 2 AAAGTGAATGGCTTGGAGGAAGGCGTGGAGTTCCTGCCAGTCAACAACGTCAAGAAGGTGGAAAAGCATGGCCCGGGGCGCTGGGTGGTGCTGGCAGCCGTGCTGATCGGCCTCCTCTTGGTCTTGCTGGGGATCGGCTTCCTGGTGTGGCATTTGCAGT SEQ ID NO:334 ST14 exon 3 ACCGGGACGTGCGTGTCCAGAAGGTCTTCAATGGCTACATGAGGATCACAAATGAGAATTTTGTGGATGCCTACGAGAACTCCAACTCCACTGAGTTTGTAAGCCTGGCCAGCAAGGTGAAGGACGCG SEQ ID NO:335 ST14 exon 4 CTGAAGCTGCTGTACAGCGGAGTCCCATTCCTGGGCCCCTACCACAAGGAGTCGGCTGTGACGGCCTTCAG SEQ ID NO:336 ST14 exon 5 CGAGGGCAGCGTCATCGCCTACTACTGGTCTGAGTTCAGCATCCCGCAGCACCTGGTGGAGGAGGCCGAGCGCGTCATGGCCGAGGAGCGCGTAGTCATGCTGCCCCCGCGGGCGCGCTCCCTGAAGTCCTTTGTGGTCACCTCAGTGGTGGCTTTCC SEQ ID NO:337 ST14 exon 6 CCACGGACTCCAAAACAGTACAGAGGACCCAGGACA SEQ ID NO:338 ST14 exon 7 ACAGCTGCAGCTTTGGCCTGCACGCCCGCGGTGTGGAGCTGATGCGCTTCACCACGCCCGGCTTCCCTGACAGCCCCTACCCCGCTCATGCCCGCTGCCAGTGGGCCCTGCGGGGGGACGCCGACTCAGTGCTGAGCCTCACCTTCCGCAGCTTTGACCTTGCGTCCTGCGACGAGCGCGCAGCGACCCTGGTGACGGTGTACAACCCCTGAG CCCCATGGAGCCCCACGCCCTGGTGCA SEQ ID NO:339 Exon 8 of ST14 GTTGTGTGGCACCTACCCCTCCCTCCTACAACCTGACCTTCCACTCCTCCCCAGAACGTCCTGCTCATCACACTGATAACCAACACTGAGCGGCGGCATCCCGGCTTTGAGGCCACCTTCTTCCAGCTGCCTAGGATGAGCA SEQ ID NO:340 Exon 9 of ST14 GCTGTGGAGGCCGCTTACGTAAAGCCCAGGGGACATTCAACAGCCCTACTACCCAGGCCACTACCCACCCAACATTGACTGCACATGGAACATTGAG SEQ ID NO:341 Exon 10 of ST14 GTGCCCAACAACCAGCATGTGAAGGTGCGCTTCAAATTCTTCTACCTGCTGGAGCCCGGCGTGCCTGCGGGCACCTGCCCCAAGGACTACGTGGAGATCAACGGGGAGAA SEQ ID NO:342 Exon 11 of ST14 ATACTGCGGAGAGAGGTCCCAGTTCGTCGTCACCAGCAACAGCAACAAGATCACAGTTCGCTTCCACTCAGATCAGTCCTACACCGACACCGGCTTCTTAGCTGAATACCTCTCCTACGACTCCAGTGACC SEQ ID NO:343 Exon 12 of ST14 CATGCCCGGGGCAGTTCACGTGCCGCACGGGGCGGTGTATCCGGAAGGAGCTGCGCTGTGATGGCTGGGCCGACTGCACCGACCCACAGCGATGAGCTCAACTGCA SEQ ID NO:344 Exon 13 of ST14 GTTGCGACGCCGGCCACCAGTTCACGTGCAAGAACAAGTTCTGCAAGCCCCTCTTCTGGGTCTGCGACAGTGTGAACGACTGCGGAGACAACAGCGACGAGCAGGGGTGCA SEQ ID NO:345 Exon 14 of ST14 GTTGTCCGGCCCAGACCTTCAGGTGTTCCAATGGGAAGTGCCTCTCGAAAAGCCAGCAGTGCAATGGGAAGGACGACTGTGGGGACGGGTCCGACGAGGCCTCCTGCCCCAAGG SEQ ID NO:346 Exon 15 of ST14 TGAACGTCGTCACTTGTACCAAACACCCTACCGCTGCCTCAATGGGCTCTGCTTGAGCAAGGGCAACCCTGAGTGTGACGGGAAGGAGGACTGTAGCGACGGCTCAGATGAGAAGGACTGCG SEQ ID NO:347 Exon 16 of ST14 ACTGTGGGCTGCGGTCATTCACGAGACAGGCTCGTGTTGTTGGGGGCACGGATGCGGATGAGGGCGAGTGGCCCTGGCAGGTAAGCCTGCATGCTCTGGGCCAGGGCCACATCTGCGGTGCTTCCCTCATCTCTCCCAACTGGCTGGTCTCTGCCGCACACTGCTACATCGATGACAGAGGATTCAG SEQ ID NO:348 Exon 17 of ST14 GTACTCAGACCCCACGCAGTGGACGGCCTTCCTGGGCTTGCACGACCAGGCCAGCGCAGCGCCCCTGGGGTGCAGGAGCGCAGGCTCAAGCGCATCATCTCCCACCCCTTCTTCAATGACTTCACCTTCGACTATGACATCGCGCTGCTGGAGCTGGAGAAACCGGCAGAGTACAGCTCCATGGTGCGGCCCATCTGCCTGCCGGACGCCTCCCATGT CTTCCCTGCCGGCAAGGCCATCTGGGTCACGGGCTGGGGACACACCCAGTATGGAG SEQ ID NO:349 Exon 18 of ST14 GCACTGGCGCGCTGATCCTGCAAAAGGGTGAGATCCGCGTCATCAACCAGACCACCTGCGAGAACCTCCTGCCGCAGCAGATCACGCCGCGCATGATGTGCGTGGGCTTCCTCAGCGGCGGCGTGGACTCCTGCCAG SEQ ID NO:350 Exon 19 of ST14 GGTGATTCCGGGGGACCCCCTGTCCAGCGTGGAGGCGGATGGGCGGATCTTCCAGGCCGGTGTGGTGAGCTGGGGAGACGGCTGCGCTCAGAGGAACAAGCCAGGCGTGTACACAAGGCTCCCTCTGTTTCGGGACTGGATCAAAGAGAACACTGGGGTATAGGGGCCGGGGCCACCCAAATGTGTACACCTGCGGGGCCA CCCATCGTCCACCCCAGTGTGCACGCCTGCAGGCTGGAGACTGGACCGCTGACTGCACCAGCGCCCCAGAACATACACTGTGAACTCAATCTCCAGGGCTCAAATCTGCCTAGAAAACCTCTCGCTTCCTCAGCCTCCAAAGTGGAGCTGGGAGGTAGAAGGGGAGGACACTGGTGGTTCTACTGACCCAACTGGGGGCAAAGGTTTGAAGACACAGTC CCCCGCCAGCCCCAAGCTGGGCCGAGGCGCGTTTGTGCATATCTGCCTCCCCTGTCTCTAAGGAGCAGCGGGAACGGAGCTTCGGGGCTCTCTCAGTGAAGGTGGTGGGGCTGCCGGATCTGGGCTGTGGGGCCCTTGGGCCACGCTCTTGAGGAAGCCCAGGCTCGGAGGACCCTGGAAAACAGACGGGTCTGAGACTGAAATTGT TTTACCAGTCCCAGGGTGGACTTCAGTGTGTGTATTTGTGTAAATGAGTAAAACATTTTATTTCTTTTTA SEQ ID NO:351 full mRNA polynucleotide sequence SEQ ID NO:352 Translated polypeptide sequence of exon 1 of ST14 MGSDRARKGGGGPKDFGAGLKYNSRHE SEQ ID NO:353 Translated polypeptide sequence of ST14 exon 2 KVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLLLVLLGIGFLVWHLQ SEQ ID NO:354 Translated polypeptide sequence of exon 3 of ST14 RDVRVQKVFNGYMRITNENFVDAYENSNNSTEFVSLASKVKDA SEQ ID NO:355 Translated polypeptide sequence of exon 4 of ST14 LKLLYSGVPFLGPYHKESAVTAF SEQ ID NO:356 Translated polypeptide sequence of exon 5 of ST14 EGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPRARSLKSFVVTSVVAF SEQ ID NO:357 Translated polypeptide sequence of exon 6 of ST14 TDSKTVQRTQD SEQ ID NO:358 Translated polypeptide sequence of exon 7 of ST14 SCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALV SEQ ID NO:359 Translated polypeptide sequence of exon 8 of ST14 LCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMS SEQ ID NO:360 Translated polypeptide sequence of exon 9 of ST14 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIE SEQ ID NO:361 Translated polypeptide sequence of exon 10 of ST14 VPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGE SEQ ID NO:362 Translated polypeptide sequence of exon 11 of ST14 YCGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSD SEQ ID NO:363 Translated polypeptide sequence of exon 12 of ST14 NM_021978.4 CPGQFTCRTGRCIRKELRCDGWADCTDHSDELNC SEQ ID NO:364 Translated polypeptide sequence of exon 13 of ST14 CDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC SEQ ID NO:365 Translated polypeptide sequence of exon 14 of ST14 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPK SEQ ID NO:366 Translated polypeptide sequence of exon 15 of ST14 NVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDC SEQ ID NO:367 Translated polypeptide sequence of exon 16 of ST14 CGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGF SEQ ID NO:368 Translated polypeptide sequence of exon 17 of ST14 YSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYG SEQ ID NO:369 Translated polypeptide sequence of exon 18 of ST14 TGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQ SEQ ID NO:370 Translated polypeptide sequence of exon 19 of ST14 GDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV SEQ ID NO:371 Full protein sequence MGSDRARKGGGGPKDFGAGLKYNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLLLVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPR ARSLKSFVVTSVVAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQ HVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVN VVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWG HTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV SEQ ID NO:372 The polynucleotide sequence of all 11 exons 1-11 of ST14 arranged in order SEQ ID NO:373 Translated polypeptide sequence of all 11 exons 1-11 of ST14 arranged in order THBS1 sequence informationSEQ ID NO: 374 THBS1 exon 9 sequence at 5' of THBS1-NRG1 fusion ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAAGCCTGCAAGAAAGACGCCTGCCCCA SEQ ID NO: 375 NRG1 exon 6 sequence at 3' of THBS1-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC SEQ ID NO: 376 THBS1-NRG1 polynucleotide sequence ACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTC SEQ ID NO: 377 THBS1-NRG1 polypeptide sequence PCEGEARETKACKKDACPTTTSTTGTSHLVKCAEKEKTFCVNGGECF SEQ ID NO:378 Exon 1 of THBS1 AGCCGCTGCGCCCGAGCTGGCCTGCGAGTTCAGGGCTCCTGTCGCTCTCCAGGAGCAACCTCTACTCCGGACGCACAGGCATTCCCGCGCCCCCTCCAGCCCTCGCCGCCCTCGCCACCGCTCCCGGCCGCCGCGCTCCGGTACACACAG SEQ ID NO:379 Exon 2 of THBS1 GATCCCTGCTGGGCACCAACAGCTCCACCATGGGGCTGGCCTGGGGACTAGGCGTCCTGTTCCTGATGCATGTGTGTGGCACCAACCGCATTCCAG SEQ ID NO:380 Exon 3 of THBS1 AGTCTGGCGGAGACAACAGCGTGTTTGACATCTTTGAACTCACCGGGGCCGCCCGCAAGGGGTCTGGGCGCCGACTGGTGAAGGGCCCCGACCCTTCCAGCCCAGCTTTCCGCATCGAGGATGCCAACCTGATCCCCCCTGTGCCTGATGACAAGTTCCAAAGACCTGGTGGATGCTGTGCGGGCAGAAAAGGGTTTCCTCCTTCTGGCAT CCCTGAGGCAGATGAAGAAGACCCGGGGCACGCTGCTGGCCCTGGAAGACCACTCTGGCCAGGTCTTCAGCGTGGTGTCCAATGGCAAGGCGGGCACCCTGGACCCTCAGCCTGACCGTCCAAGGAAAAGCAGCACGTGGTGTCTGTGGAAGAAGCTCTCCTGGCAACCGGCCAGTGGAAGAGCATCCACCCTGTTTGTGCAGGAAGACAG GGCCCAGCTGTACATCGACTGTGAAAAGATGGAGAATGCTGAGTTGGACGTCCCCATCCAAAGCGTCTTCACCAGAGACCTGGCCAGCATCGCCAGACTCCGCATCGCAAAGGGGGGCGTCAATGACAATTTCCAG SEQ ID NO:381 Exon 4 of THBS1 GGGGTGCTGCAGAATGTGAGGTTTGTCTTTGGAACCACACCAGAAGACATCCTCAGGAACAAAGGCTGCTCCAGCT SEQ ID NO:382 Exon 5 of THBS1 CTACCAGTGTCCTCCTCACCCTTGACAACAACGTGGTGAATGGTTCCAGCCCTGCCATCCGCACTAACTACATTGGCCACAAGACAAAGGACTTGCAAGCCATCTGCGGCATCTCCTGTGATGAGCTGTCCAGCATGGTCCTGGAACTCAGGGGCCTGCGCACCATTGTGACCACGCTGCAGGACAGCATCCGCAAAGTG SEQ ID NO:383 Exon 6 of THBS1 ACTGAAGAGAACAAAGAGTTGGCCAATGAGCTGAGGCGGCCTCCCCTATGCTATCACAACGGAGTTCAGTACAGAAATAACGAGGAATGGACTGTTGATAGCTGCACTGAGTGTCACTGTCAG SEQ ID NO:384 Exon 7 of THBS1 AACTCAGTTACCATCTGCAAAAAGGTGTCCTGCCCCATCATGCCCTGCTCCAATGCCACAGTTCCTGATGGAGAATGCTGTCCTCGCTGTTGGC SEQ ID NO:385 Exon 8 of THBS1 CCAGCGACTCTGCGGACGATGGCTGGTCTCCATGGTCCGAGTGGACCTCCTGTTCTACGAGCTGTGGCAATGGAATTCAGCAGCGCGGCCGCTCCTGCGATAGCCTCAACAACCGATGTGAGGGCTCCTCGGTCCAGACACGGACCTGCCACATTCAGGAGTGTGACAAGAGAT SEQ ID NO:386 Exon 9 of THBS1 TTAAACAGGATGGTGGCTGGAGCCACTGGTCCCGTGGTCATCTTGTTCTGTGACATGTGGTGATGGTGTGATCACAAGGATCCGGCTCTGCAACTCTCCCAGCCCCCAGATGAACGGGAAACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCCTGCAAGAAAGACGCCTGCCCCA SEQ ID NO:387 The polynucleotide sequence of all 13 exons 10-22 of THBS1 arranged in order TTAAACAGGATGGTGGCTGGAGCCACTGGTCCCCGTGGTCATCTTGTTCTGTGACATGTGGTGATGGTGTGATCACAAGGATCCGGCTCTGCAACTCTCCCAGCCCCCAGATGAACGGGAAACCCTGTGAAGGCGAAGCGCGGGAGACCAAAGCTGCAAGAAAGACGCCTGCCCCATCAATGGAGGCTGGGGTCCTTGGTCACCATGG GACATCTGTTCTGTCACCTGTGGAGGAGGGGTACAGAAACGTAGTCGTCTCTGCAACAACCCCACACCCAGTTTGGAGGCAAGGACTGCGTTGGTGATGTAACAGAAAACCAGATCTGCAACAAGCAGGACTGTCCAATTGATGGATGCCTGTCCAATCCCTGCTTTGCCGGCGTGAAGTGTACTAGCTACCCTGATGGCAGCTGGAAATGTGGT GCTTGTCCCCCCTGGTTACAGTGGAAATGGCATCCAGTGCACAGATGTTGATGAGTGCAAAGAAGTGCCTGATGCCTGCTTCAACCACAATGGAGAGCACCGGTGTGAGAACACGGACCCCGGCTACAACTGCCTGCCCTGCCCCCCACGCTTCACCGGCTCACAGCCCTTCGGCCAGGGTGTCGAACATGCCACGGCCAACAAACAGGTGTGCAAGCCCCG TAACCCCTGCACGGATGGGACCCACGACTGCAACAAGAACGCCAAGTGCAACTACCTGGGCCACTATAGCGACCCCATGTACCGCTGCGAGTGCAAGCCTGGCTACGCTGGCAATGGCATCATCTGCGGGGAGGACACAGACCTGGATGGCTGCCCAATGAGAACCTGGTGTGCGTGGCCAATGCGACTTACCACTGCAAAAAGGATAATT GCCCCAACCTTCCCAACTCAGGGCAGGAAGACTATGACAAGGATGGAATTGGTGATGCCTGTGATGATGACGATGACAATGATAAAATTCCAGATGACAGGGACAACTGTCCATTCCATTACAACCCAGCTCAGTATGACTATGACAGAGATGATGTGGGAGACCGCTGTGACAACTGTCCCTACAACCACACCCAGATCAGGCAGACACAGACAACAATG GGGAAGGAGACGCCTGTGCTGCAGACATTGATGGAGACGGTATCCTCAATGAACGGGACAACTGCCAGTACGTCTACAATGTGGACCAGAGAGACACTGATATGGATGGGGTTGGAGATCAGTGTGACAATTGCCCCTTGGAACACAATCCGGATCAGCTGGACTCTCAGACCGCATTGGAGATACCTGTGACAACAATCAGGATATT GATGAAGATGGCCACCAGAACAATCTGGACAACTGTCCCTATGTGCCCAATGCCAACCAGGCTGACCATGACAAAGATGGCAAGGGAGATGCCTGTGACCACGATGATGACAACGATGGCATTCCTGATGACAAGGACAACTGCAGACTCGTGCCCAATCCCGACCAGAAGGACTCTGACGGCGATGGTCGAGGTGATGCCTGCAAAGATGATTTTGACC ATGACAGTGTGCCAGACATCGATGACATCTGTCCTGAGAATGTTGACATCAGTGAGACCGATTTCCGCCGATTCCAGATGATTCCTCTGGACCCCAAAGGGACATCCCCAAAATGACCTAACTGGGTTGTACGCCATCAGGGTAAAGAACTCGTCCAGACTGTCAACTGTGATCCTGGACTCGCTGTAGGTTATGATGAGTTTAATGCTGTGGACTTCA GTGGCACCTTCTTCATCAACACCGAAAGGGACGATGACTATGCTGGATTTGTCTTTGGCTACCAGTCCAGCAGCCGCTTTTATGTTGTGTGATGTGGAAGCAAGTCACCCAGTCCTACTGGGACACCAACCCCACGAGGGCTCAGGGATACTCGGGCCTTTCTGTGAAAGTTGTAAACTCCACACAGGGCCTGGCGAGCACCTGCGGAACGCCCTGTGGC ACACAGGAAACACCCCTGGCCAGGTGCGCACCCTGTGGCATGACCCTCGTCACATAGGCTGGAAAGATTTCACCGCCTACAGATGGCGTCTCAGCCACAGGCCAAAGACGGGTTTCATTAGAGTGGTGATGTATGAAGGGAAGAAAATCATGGCTGACTCAGGACCCATCTATGATAAAACCTATGCTGGTGGTAGACTAGGGTTGTTTGTCTCTC TCAAGAAATGGTGTTTCTTCTCTGACCTGAAATACGAATGTAGAGATCCCTAATCATCAAATTGTTGATTGAAAGACTGATCATAAACCAATGCTGGTATTGCACCTTCTGGAACTATGGGCTTGAGAAAACCCCCAGGATCACTTCTCTTGGCTTCCTTCTTTTCTGTGCTTGCATCAGTGTGGACTCCTAGAACGTGCGACCTGCCTCAAGAAAATGCAGTTTCA AAAACAGACTCAGCATTCAGCCTCCAATGAATAAGACATTCTCCAAGCATAAACAATTGCTTTGGTTCCTTTTGAAAAAGCATCTACTTGCTTCAGTTGGGAAGGTGCCCATTCCACTCTGCCTTTGTCACAGAGCAGGGTGCTATTGTGAGGCCATCTCTGAGCAGTGGACTCAAAAGCATTTTCAGGCATGTCAGAGAAGGGAGGACTCACTAGAATTAGCAAACAA ACCACCCTGACATCCTCCTTCAGGAACACGGGGAGCAGAGGCCAAAGCACTAAGGGGAGGGCGCATACCCGAGACGATTGTATGAAGAAAATATGGAGGAACTGTTACATGTTCGGTACTAAGTCATTTTCAGGGGATTGAAAGACTATTGCTGGATTTCATGATGCTGACTGGCGTTAGCTGATTAACCCATGTAAATAGGCACTTAAATAGAAGCAGGA AAGGGAGACAAAAGACTGGCTTCTGGACTTCCTCCCTGATCCCCACCTTACTCATCACCTGCAGTGGCCAGAATTAGGGAATCAGAATCAAACCAGTGTAAGGCAGTGCTGGCTGCCATTGCCTGGTCACATTGAAATTGGTGGCTTCATTCTAGATGTAGCTTGTGCAGATGTAGCAGGAAAATAGGAAAACCTACCATCTCAGTGAGCACCAGCTGCCTCCCA AAGGAGGGGCAGCCGTGCTTATATTTTTTATGGTTACAATGGCACAAAATTATTATCAACCTAACTAAAACATTCCTTTTCTCTTTTTTTCCTGAATTATCATGGAGTTTTTCTAATTCTCTCTTTTGGAATGTAGATTTTTTTTAAATGCTTTACGATGTAAAATATTTTTTTTACTTATTCTGGAAGATCTGGCTGAAGGATTATTCATGGAACAGGAAGAAGCGTA AAGACTATCCATGTCATCTTTGTTGAGAGTCTTCGTGACTGTAAGATTGTAAATACAGATTATTTATTAACTCTGTTCTGCCTGGAAATTTAGGCTTCATACGGAAAGTGTTTGAGAGCAAGTAGTTGACATTTATCAGCAAATCTCTTGCAAGAACAGCACAAGGAAAATCAGTCTAATAAGCTGCTCTGCCCCTTGTGCTCAGAGTGGATGTTATGGGATTCTTTTTTT CTCTGTTTTATTCTTTTCAAGTGGAATTAGTTGGTTATCCATTTGCAAATGTTTTAAAATTGCAAAGAAAGCCATGAGGTCTTCAATACTGTTTTACCCCATCCCTTGTGCATATTTCCAGGGAGAAGGAAAGCATATACACTTTTTTTTCTTCATTTTTCCAAAAGAAGAAAAAAAATGACAAAAGGTGAAACTTACATACAAATATTACTCATTTGTTGTGTGACTGAGTAAAGAAT TTTTGGATCAAGCGGAAAGAGTTTAAGTGTCTAACAAACTTAAAGCTACTGTAGTACCTAAAAAGTCAGTGTTGTACATAGCATAAAAACTCTGCAGAGAAGTATTCCCAATAAGGAAATAGCATTGAAATGTTAAATACAATTTCTGAAAGTTATGTTTTTTTTTCTATCATCTGGTATACCATTGCTTTATTTTTATAAATTTTTCTCATTGCCATTGGAATAGATATCTCAGA TTGTGTAGATATGCTATTTAAATAATTTATCAGGAAATACTGCCTGTAGAGTTAGTATTTCTATTTTATATAATGTTTGCACACTGAATTGAAGAATTGTTGGTTTTTCTTTTTTTTGTTTTGTTTTTTTTTTTTTTTTTTTTGCTTTTGACCTCCCATTTTTACTATTTGCCAATACCTTTTTCTAGGAATGTGCTTTTTTTTGTACACATTTTTATCCATTTACATTCTAAG CAGTGTAAGTTGTATATTACTGTTTCTTATGTACAAGGAACAACAATAAATCATATGGAAATTTATATTTTACTTACTGTATCCATGCTTATTTGTTCTCTACTGGCTTTATGTCATGAAGTATATGCGTAAATACCATTCATAAAATCAATATAGCATATACAAAAAATAAATTACAGTAAGTCATAGCAACATTCACAGTTTGTATGTGATTGAGAAAGACTGAGTTGCTCAG GCCTAGGCTTAGAATTTGCTGCGTTTGTGGAATAAAAGAACAAAATGATACATTAGCCTGCCATATCAA SEQ ID NO:388 full mRNA polynucleotide sequence SEQ ID NO:389 Translated polypeptide sequence of exon 2 of THBS1 MGLAWGLGVLFLMHVCGTNRIP SEQ ID NO:390 Translated polypeptide sequence of exon 3 of THBS1 SGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQSVFTRDLASIARLRIAKGGVNDNFQ SEQ ID NO:391 Translated polypeptide sequence of exon 4 of THBS1 GVLQNVRFVFGTTPEDILRNKGCSS SEQ ID NO:392 Translated polypeptide sequence of exon 5 of THBS1 TSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRKV SEQ ID NO:393 Translated polypeptide sequence of exon 6 of THBS1 TEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQ SEQ ID NO:394 Translated polypeptide sequence of exon 7 of THBS1 NSVTICKKVSCPIMPCSNATVPDGECCPRCW SEQ ID NO:395 Translated polypeptide sequence of exon 8 of THBS1 SDSADDGWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKR SEQ ID NO:396 Translated polypeptide sequence of exon 9 of THBS1 KQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACP SEQ ID NO:397 The translated polypeptide sequence of all 13 exons 10-22 of THBS1 arranged in order KQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNNTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTSYPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGY NCLCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPFHYNPAQYDYDRDDVGDRCDNCPYNHNPDQADTDNNGEGDACAADIDGD GILNERDNCQYVYNVDQRDTMDDGVGDQCDNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHDSVPDIDDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVRHQGKELVQ TVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPKTGFIRVVMYEGKKIMADSGPIYDKTYAGGRGLLFVFSQEMVFFSDL KYECRDP SEQ ID NO:398 Full protein sequence MGLAWGLGVLFLMHVCGTNRIPESGGDNSVFDIFELTGAARKGSGRRLVKGPDPSSPAFRIEDANLIPPVPDDKFQDLVDAVRAEKGFLLLASLRQMKKTRGTLLALERKDHSGQVFSVVSNGKAGTLDLSLTVQGKQHVVSVEEALLATGQWKSITLFVQEDRAQLYIDCEKMENAELDVPIQ SVFTRDLASIARLRIAKGGVNDNFQGVLQNVRFVFGTTPEDILRNKGCSSTSVLLTLDNNVVNGSSPAIRTNYIGHKTKDLQAICGISCDELSSMVLELRGLRTIVTTLQDSIRKVTEENKELANELRRPPLCYHNGVQYRNNEEWTVDSCTECHCQNSVTICKKVSCPIMPCSNATVPDGECCPRCWPSDSADD GWSPWSEWTSCSTSCGNGIQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQDGGWSHWSPWSSCSVTCGDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGGVQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDCPIDGCLSNPCFAGVKCTS YPDGSWKCGACPPGYSGNGIQCTDVDECKEVPDACFNHNGEHRCENTDPGYNCLPCPPRFTGSQPFGQGVEHATANKQVCKPRNPCTDGTHDCNKNAKCNYLGHYSDPMYRCECKPGYAGNGIICGEDTDLDGWPNENLVCVANATYHCKKDNCPNLPNSGQEDYDKDGIGDACDDDDDNDKIPDDRDNCPF HYNPAQYDYDRDDDVGDRCDNCPYNHNPDQADTDNNGEGDACAADIDGDGILNERDNCQYVYNVDQRDTDMDGVGDQCDNCPLEHNPDQLDSDSDRIGDTCDNNQDIDEDGHQNNLDNCPYVPNANQADHDKDGKGDACDHDDDNDGIPDDKDNCRLVPNPDQKDSDGDGRGDACKDDFDHD SVPDIDICPENVDISETDFRRFQMIPLDPKGTSQNDPNWVVRHQGKELVQTVNCDPGLAVGYDEFNAVDFSGTFFINTERDDDYAGFVFGYQSSSRFYVVMWKQVTQSYWDTNPTRAQGYSGLSVKVVNSTTGPGEHLRNALWHTGNTPGQVRTLWHDPRHIGWKDFTAYRWRLSHRPK TGFIRVVMYEGKKIMADSGPIYDKTYAGGRLGLFVFSQEMVFFSDLKYECRDP SEQ ID NO:399 The polynucleotide sequence of all nine exons 1-9 of THBS1 arranged in order SEQ ID NO:400 Translated polypeptide sequence of all 8 exons 2-9 of THBS1 arranged in order AGRN sequence informationSEQ ID NO: 401 AGRN exon 12 sequence at 5' of the AGRN-NRG1 fusion GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAG SEQ ID NO: 402 NRG1 exon 6 sequence at 3' of AGRN-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC SEQ ID NO:403 AGRN-NRG1 polynucleotide sequence GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGAC SEQ ID NO:404 AGRN-NRG1 polypeptide sequence VCGSDGVTYSTECELKKARCESQRGLYVAAQGACRATSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKD SEQ ID NO:405 AGRN exon 1 AGTCCCGTCCCCGGCGCGGCCCGCGCGCTCCTCCGCCGCCTCTCGCCTGCGCCATGGCCGGCCGGTCCCACCCGGGCCCGCTGCGGCCGCTGCTGCCGCTCCTTGTGGTGGCCGCGTGCGTCCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAGGCGAACGTGGTGCTCACCGGG ACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGCACACGTACTCCTGCAAG SEQ ID NO:406 AGRN exon 2 GTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGCCCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCCCTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCCTGCACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTC ATGCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAG SEQ ID NO:407 AGRN exon 3 ATAAACCCGGGACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATG SEQ ID NO:408 AGRN exon 4 CGTGCCGGGGAATGCTGTGCGGCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTGCAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACCTACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCCTGCTCAGCCGCG GGCCGTGCG SEQ ID NO:409 AGRN exon 5 GCTCGCGGGACCCCCTGCTCCAACGTGACCTGCAGCTTCGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCGACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCGGCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTAAGAAGT TCGACGGCCCTTGTG SEQ ID NO:410 AGRN exon 6 ACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGTGTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCCGGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGGCCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCG GCCAGTGCCAGGGTCGAG SEQ ID NO:411 AGRN exon 7 ACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTGGCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTGTGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGGCAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTG SEQ ID NO:412 AGRN exon 8 ACCAGGCCCCGTCCCCATGCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGCGTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGCGTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGATCCAGGTGGCGCGCAAAG GACCCTGTG SEQ ID NO:413 AGRN exon 9 ACCGCTGCGGGCAGTGCCGCTTTGGAGCCCTGTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGCCCAGCCCGTGTGTGGCTCCGACGGGCACACGTACCCAGCGAGTGCATGCTGCACGTGCACGCCTGCACACACCAGATCAGCTGCACGTGGCCTCAGCTGGACCCTGTG SEQ ID NO:414 AGRN exon 10 AGACCTGTGGAGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGGTGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTGCCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGCAGGGCCGTGCGAGCAGG SEQ ID NO:415 AGRN exon 11 CCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGTGACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGGACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCG SEQ ID NO:416 AGRN exon 12 GTGTGCGGCTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCACAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAG SEQ ID NO:417 The polynucleotide sequence of all 27 exons 13-39 of AGRN arranged in order GCCCCACCTTCGCCCCGCTGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAATATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAACCCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCCCAGGTGTGGGGGGCCTCAG GTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGGCATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCCGTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTGGACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCT GTGAAGCTGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGCGTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCAACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAAGACCATCGCCTGCCGCCAGGGCCTGCAAATTCTATCCAGA GCCTGGGCCCGTGCCAGGAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAGGGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAGCAGTACCGCCACAGCCAGACCACCCCTCCGCCCTCATCACGACCTCGGACCACTGCCAGCGTCCCCAGGACCACGTGTGGCCCGT GCTGACGGTGCCCCCCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAGCGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAGCCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCTA CAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTGGACGCAGAGGGCTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAGGGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGGAGACAGCCAGGAGCATTGAGAGCACCCTGG ACGACCTCTTCCGGAATTCAGACGTCAAGAAGGATTTTCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCCATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGGCCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCTGCAGGAGCACGT GCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCCACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGCCGTCCTCTGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACACAAGCCAGCCGTTGCCAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCC GTGTGCCCGGACGTCGGCCCCCGGCCCCCCAGCAGCCTCCAAAGCCCTGTGACTCACAGCCCTGCTTCCACGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTGCCCGGCAGGCAGGGGAGGCGCCGTCTGTGAGAAGGTGCTTGGCGCCCCTGTGCCGGCCTTCGAGGGCCGCTCCTTCCTGGCCTTCCCCA CTCTCCGCGCCTACCACACGCTGCGCCTGGCACTGGAATTCCGGGCGCTGGAGCCTCAGGGGCTGCTGCTGTACAATGGCAACGCCCGGGGCAAGGACTTCCTGGCATTGGCGCTGCTAGATGGCCGCGTGCAGCTCAGGTTTGACACAGGTTCGGGGCCGGCGGTGCTGACCAGTGCCGTGCCGGTAGAGCCGGGCCAGT GGCACCGCCTGGAGCTGTCCCGGCACTGGCGCCGGGGCACCCTCTCGGTGGATGGTGAGACCCCTGTTCTGGGCGAGAGTCCCAGTGGCACCGACGGCCTCAACCTGGACACAGACCTTTGTGGGCGGCGTACCCGAGGACCAGGCTGCCGTGGCGCTGGAGCGGACCTTCGTGGGCGCCGGCCTGAGGGGGTGCATCC GTTTGCTGGACGTCAACAACCAGCGCCTGGAGCTTGGCATTGGGCCGGGGGGCTGCCACCCGAGGCTCTGGCGTGGGCGAGTGCGGGGACCACCCCTGCCTGCCCAACCCCTGCCATGGCGGGGCCCCATGCCAGAACCTGGAGGCTGGAAGGTTCCATTGCCAGTGCCCGCCCGGCCGCGTCGGACCAACCTGTGCCGATGAGA AGAGCCCCTGCCAGCCCAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGAGGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCTCGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGAGCTGAGAGGCCTGCACACCTTTGCACGGG ACCTGGGGGAGAAGATGGCGCTGGAGGTCGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACGGCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCCGCTACGACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCT GGACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGCCCCCGTGTGTTGGGGGAGTCCCCCGAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTCTGGCTTCGACGGTGCCATCCAG CTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTGCCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGA GAAGTCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTGTCGAGTACCTCAACGCTGTGACCGAGAGCGAACTGGCCAATGAGATCCCCGTCCCCGAAACTCTGGATTCCGGGGCCCTTCACAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTG GCAAGGCCACGGAGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGCTCCTCCCCGCT GGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTTGCGGGACGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTG GCACCAGAGCCCCGCGCCCGCTGTAATTATTTTCTATTTTTGTAAACTTGTTGCTTTTTGATATGATTTTCTTGCCTGAGTGTTGGCCGGAGGGACTGCTGGCCCGGCCTCCCTTCCGTCCAGGCAGCCGTGCTGCAGACAGACCTAGTGCCGAGGGATGGACAGGCGAGGTGGCAGCGTGGAGGGCTCGGCGTGGATGGCAGCCTCAGG ACACACACCCCTGCCTCAAGGTGCTGAGCCCCCGCCTTGCACTGCGCCTGCCCCACGGTGTCCCCGCCGGGAAGCAGCCCCGGCTCCTGAATCACCCTCGCTCCGTCAGGCGGGACTCGTGTCCCAGAGAGGAAGGGGCTGCTGAGGTCTGATGGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGCCCCCATTTGAGCTGCTCTCTC CTGTGTGTGCTCTGGGCCCTGCCTCGGCCTCCTGCGCCAATACTGTGACTTCCAAACAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCTGCCTGCGCCAGCCCCAGGCTGCTGAGGAGCAGAGGCCAGACCAGGCCGATCTGGGTGTCCTGACCCTCAGCTGGCCCTGCCCAGCCACCCTGGACGTGACCGTATCCCTCTGCCACACC CCAGGCCCTGCGAGGGGCTATCGAGAGGAGCTCACTGTGGGATGGGGTTGACCCTCTGCCGCCTGCCTGGGTATCTGGGCCTGGCCATGGCTGTGTTCTTCATGTGTTGATTTTATTTGACCCCTGGAGTGGTGGGTCTCATCTTTCCCATCTCGCCTGAGAGCGGCTGAGGGCTGCCTCACTGCAAATCCTCCCCACAGCGTCAGTGAA AGTCGTCCTTGTCTCCAGAATGACCAGGGGCCAGCCAGTGTCTGACCAAGGTCAAGGGGCAGGTGCAGAGGTGGCAGGGATGGCTCCGAAGCCAGAAATGCCTTAAACTGCAACGTCCCGTCCCTCCCCACCCCCATCCCATCCCCACCCCCAGCCCCAGCCCAGTCTCCTAGGAGCAGGACCCGATGAAGCGGGCGGCGGTGGGGCTGGGTGCCGTGT TACTAACTCTAGTATGTTTCTGTGTCAATCGCTGTGAAATAAAGTCTGAAAACTTTAAAA SEQ ID NO:418 full mRNA polynucleotide sequence SEQ ID NO:419 Translated polypeptide sequence of exon 1 of AGRN MAGRSHGPPLRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEILNVDPVQHTYSCK SEQ ID NO:420 Translated polypeptide sequence of exon 2 of AGRN VRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDTRIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVE SEQ ID NO:421 Translated polypeptide sequence of exon 3 of AGRN KPGTHFTPVPPTPPD SEQ ID NO:422 Translated polypeptide sequence of exon 4 of AGRN CRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPC SEQ ID NO:423 Translated polypeptide sequence of exon 5 of AGRN SRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPC SEQ ID NO:424 Translated polypeptide sequence of exon 6 of AGRN PCQGALPPDPSRSCRVNPRTRRPEMLLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGR SEQ ID NO:425 Translated polypeptide sequence of exon 7 of AGRN QCPEPCRFNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGPC SEQ ID NO:426 Translated polypeptide sequence of exon 8 of AGRN QAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEATACTLGREIQVARKGPC SEQ ID NO:427 Translated polypeptide sequence of exon 9 of AGRN RCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGPC SEQ ID NO:428 Translated polypeptide sequence of exon 10 of AGRN TCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQ SEQ ID NO:429 Translated polypeptide sequence of exon 11 of AGRN ECGSGGSGSGEDGDCEQELCRQRGGIWDEDSEDGPVCDFSCQSVPGSP SEQ ID NO:430 Translated polypeptide sequence of exon 12 of AGRN VCGSDGVTYSTECELKKARCESQRGLYVAAQGACR SEQ ID NO:431 The translated polypeptide sequence of all 27 exons 12-39 of AGRN arranged in order VCGSDGVTYSTECELKKARCESQRGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPATGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVE ESGSAHCVCPMLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGPPVERASCYNSALGCCSDGKTPSLDA EGSNCPATKVFQGVLELEGVEGQELFYTPEMADPKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLPGGKSVRAIVDVHFDPTTAFRAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATARATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTAPSRVPGRRPP APQQPPKPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTLRAYHTRLLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSAVPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAAVALERTFV GAGLRGCIRLLDVNNQRLELEGPGGAATRGSGVGECGDHPCLPNPCHGGAPCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEFRYDLG KGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESELANEIPV PETLDSGALHSEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP SEQ ID NO:432 Full protein sequence MAGRSHGPPLRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEILNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDTRIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPTPDACRGMLCGFGAVCEPNAEGPGRASCVCK KSPCPSVVAPVCGSDASTYSNECELQRAQCSQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCGSDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPRSCRVNPRTRRPEMLLRPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCR FNAVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGPCDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEATLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPSECMLHVHACTHQISLHVASAGP CETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGSDGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGGIWDEDSEDGPVCDFSCQSVPPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGACRGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCN PHGSYGGTCDPATGQCSCRPGVGGLRCDRPGVGGLRCDRPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLCSCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCPMLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSASVTVTTPGLLLSQALPAPP GALPLAPSSTAHSQTTPPPSSSRPRTTASVPRTTVWPVLTVPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGPPVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMADPKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRRLGPGKSVRAIVDV HFDPTTAFRAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATARATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPAPQQPPKPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTLRAYHTLRLALEFRALEPQGLLL YNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSAVPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAAVALERTFVGAGLRGCIRLLDVNNQRLELEGPGGAATRGSGVGECGDHPCLPNPCHGGAPCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNP CHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFD GAIQLVSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESELANEIPVPETLDSGALHSEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWL RVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP SEQ ID NO:433 The polynucleotide sequence of all 12 exons 1-12 of AGRN arranged in order SEQ ID NO:434 Translated polypeptide sequence of all 12 exons 1-12 of AGRN arranged in order PVALB sequence informationSEQ ID NO: 435 PVAPB exon 4 sequence at 5' of PVALB-NRG1 fusion TAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACG SEQ ID NO: 436 NRG1 exon 6 sequence at 3' of PVALB-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTCAAACCCCTCGAGATACTTG SEQ ID NO:437 PVALB-NRG1 polynucleotide sequence TAAAAGGCTTCTCCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCTTCATGGTGAAAGACCTTCA AACCCCTCGAGATACTTG SEQ ID NO:438 PVALB-NRG1 polypeptide sequence KGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDATSTSTTGTSHLVKCAEKEKTFCVNGGECFMVKDLSNPSRYL SEQ ID NO:439 PVALB exon 1 ACTTCCCGACAGGACTTCCCACCAGCCCCAGCCTTTCAGTGCAGGCTCCAGCCCTCCACCCCCACCCGAG SEQ ID NO:440 PVALB exon 2 TTGCAGGATGTCGATGACAGACTTGCTGAACGCTGAGGACATCAAGAAGGCGGTGGGAGCCTTTAGCG SEQ ID NO:441 PVALB exon 3 CTACCGACTCCTTCGACCACAAAAAGTTCTTCCAAATGGTCGGCCTGAAGAAAAAGAGTGCGGATGATGTGAAGAAGGTGTTTCACATGCTGGACAAGGACAAAAGTGGCTTCATCGAGGAGGATGAGCTGGG SEQ ID NO:442 PVALB exon 4 ATTCATCCTAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACG SEQ ID NO:443 PVALB exon 5 AATTCTCCACTCTGGTGGCTGAAAGCTAAGAAGCACTGACTGCCCCTGGTCTTCCACCTCTCTGCCCTGAACACCCAATCTCGGCCCCTCTCGCCACCCTCCTGCATTTCTGTTCAGTTCGTTTTATGTTTTTTTTACTCCCCCATCCCCTGTGGCCCTCTCTAATGACACCATTCTTCTGGAAAATGCTGGAGAAGCAATAAAGGTTGTACCAGTCA SEQ ID NO:444 full mRNA polynucleotide sequence ACTTCCCGACAGGACTTCCCACCAGCCCAGCCTTTCAGTGCAGGCTCCAGCCCTCCACCCCCACCCGAGTTGCAGGATGTCGATGACAGACTTGCTGAACGCTGAGGACATCAAGAAGGCGGTGGGAGCCTTTAGCGCTACCGACTCCTTCGACCACAAAAAGTTCTTCCAAATGGTCGGCCTGAAGAAAAGAGTGCGGATGATGTGAAGAAGGTGT TTCACATGCTGGACAAGGACAAAAGTGGCTTCATCGAGGAGGATGAGCTGGGATTCATCCTAAAAGGCTTCTCCCCAGATGCCAGAGACCTGTCTGCTAAAGAAACCAAGATGCTGATGGCTGCTGGAGACAAAGATGGGGACGGCAAAATTGGGGTTGACGAATTCTCCACTCTGGTGGCTGAAAGCTAAAGAAGCACTGACTGCCCCTG GTCTTCCACCTCTCTGCCCTGAACACCCAATCTCGGCCCCTCTCGCCACCCTCCTGCATTTCTGTTCAGTTCGTTTATGTTTTTTACTCCCCCATCCCCTGTGGCCCTCTAATGACACCATTCTTCTGGAAAATGCTGGAGAAGCAATAAAGGTTGTACCAGTCA SEQ ID NO:445 Translated polypeptide sequence of exon 2 of PVALB MSMTDLLNAEDIKKAVGAFS SEQ ID NO:446 Translated polypeptide sequence of exon 3 of PVALB TDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGFIEEDEL SEQ ID NO:447 Translated polypeptide sequence of exon 4 of PVALB FILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVD SEQ ID NO:448 Translated polypeptide sequence of exon 5 of PVALB FSTLVAES SEQ ID NO:449 Full protein sequence MSMTDLLNAEDIKKAVGAFSATDSFDHKKFFQMVGLKKKSADDVKKVFHMLDKDKSGFIEEDELGFILKGFSPDARDLSAKETKMLMAAGDKDGDGKIGVDEFSTLVAES SEQ ID NO:450 The polynucleotide sequence of all four exons 1-4 of PVALB arranged in order SEQ ID NO:451 The translated polypeptide sequence of all three exons 2-4 of PVALB arranged in order SLC3A2 transcript 3 sequence informationSEQ ID NO:452 SLC3A2 exon 2 sequence at 5' of SLC3A2-NRG1 fusion AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTTCTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAG SEQ ID NO:453 NRG1 exon 6 sequence at 3' of SLC3A2-NRG1 fusion CTACATCTACATCCACCACTGGGACAAG SEQ ID NO:454 SLC3A2-NRG1 polynucleotide sequence AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTCTTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAGCTACATCTACATCCACCACTGGGACAAG SEQ ID NO:455 SLC3A2-NRG1 polypeptide sequence LGSHCVAQTGLELLASGDPLPSASQNAEMIATSTSTTGT SEQ ID NO:456 Exon 1 of SLC3A2 GCATTGCGGCTTGGTTTTTCTCACCCAGTGCATGTGGCAGGAGCGGTGAGATCACTGCCTCACGGCGATCCTGGACTGACGGTCACGACTGCCTACCCTCTAACCCTGTTCTGAGCTGCCCCTTGCCCACACACCCCAAACCTGTGTGCAGGATCCGCCTCCATGGAGCTACAGCCTCCTGAAGCCTCGATCGCCGTCGTGTCGATTCCGCGCCAGTTGC CTGGCTCACATTCGGAGGCTGGTGTCCAGGGTCTCAGCGCGGGGGACGACTCAG SEQ ID NO:457 Exon 2 of SLC3A2 AGTTGGGGTCTCACTGTGTTGCCCAGACTGGTCTCGAACTTCTGGCCTCAGGTGATCCTCTTCCCTCAGCTTCCCAGAATGCCGAGATGATAG SEQ ID NO:458 Exon 3 of SLC3A2 AGACGGGGTCTGACTGTGTTACCCAGGCTGGTCTTCAACTCTTGGCCTCAAGTGATCCTCCTGCCTTAGCTTCCAAGAATGCTGAGGTTACAG SEQ ID NO:459 Exon 4 of SLC3A2 GCACCATGAGCCAGGACACCGAGGTGGATATGAAGGAGGTGGAGCTGAATGAGTTAGAGCCCGAGAAGCAGCCGATGAACGCGGCGTCTGGGGCGGCCATGTCCCTGGCGGGAGCCGAGAAGAATGGTCTGGTGAAGATCAAGGTGGCGGAAGACGAGGCGGAGGCGGCAGCCGCGGCTAAGTTCACGGGCCTGTCCAAGGAGGAGCT GCTGAAGGTGGCAGGCAGCCCCGGCTGGGTACGCACCCGCTGGGCACTGCTGCTGCTCTTCTGGCTCGGCTGGCTCGGCATGCTTGCTGGTGCCGTGGTCATAATCGTGCGAGCGCCGCGTTGTCGCGAGCTACCGGCGCAGAAGTGGTGGCACACGGGCGCCCTCTACCGCATCGGCGACCTTCAGGCCTTCCAGGCCACGGCGCG GGCAACCTGGCGG SEQ ID NO:460 Exon 5 of SLC3A2 GTCTGAAGGGGCGTCTCGATTACCTGAGCTCTCTGAAGGTGAAGGGCCTTGTGCTGGGTCCAATTCACAAGAACCAGAAGGATGATGTCGCTCCAGACTGACTTGCTGCAGATCGACCCCAATTTTGGCTCCAAGGAAGATTTTGACAGTCTCTTGCAATCGGCTAAAAAAAAAGA SEQ ID NO:461 Exon 6 of SLC3A2 GCATCCGTGTCATTCTGGACCTTACTCCCAACTACCGGGGTGAGAACTCGTGGTTTCCCACTCAGGTTGACACTGTGGCCACCAAGGTGAAG SEQ ID NO:462 Exon 7 of SLC3A2 GATGCTCTGGAGTTTTGGCTGCAAGCTGGCGTGGATGGGTTCCAGGTTCGGGACATAGAGAATCTGAAG SEQ ID NO:463 Exon 8 of SLC3A2 GATGCATCCTCATTCTTGGCTGAGTGGCAAAATATCACCAAGGGCTTCAGTGAAGACAG SEQ ID NO:464 Exon 9 of SLC3A2 GCTCTTGATTGCGGGGACTAACTCCTCCGACCTTCAGCAGATCCTGAGCCTACTCGAATCCAACAAAGACTTGCTGTTGACTAGCTCATACCTGTCTGATTCTGGTTCTACTGGGGAGCATACAAAATCCTAGTCACACAGTTTGAATGCCACTGGCAATCGCTGGTGCAGCTGGAGT SEQ ID NO:465 Exon 10 of SLC3A2 TTGTCTCAGGCAAGGCTCCTGACTTCCTTCTTGCCGGCTCAACTTCTCCGACTCTACCAGCTGATGCTCTTCACCCTGCCAGGGACCCCTGTTTTTCAGCTACGGGGATGAGATTGGCCTGGATGCAGCTGCCCTTCCTGGACAG SEQ ID NO:466 Exon 11 of SLC3A2 CCTATGGAGGCTCCAGTCATGCTGTGGGATGAGTCCAGCTTCCCTGACATCCCAGGGGCTGTAAGTGCCAACATGACTGTGAAG SEQ ID NO:467 Exon 12 of SLC3A2 GGCCAGAGTGAAGACCCTGGCTCCCTCCTTTCCTTGTTCCGGCGGCTGAGTGACCAGCGGAGTAAGGAGCGCTCCCTACTGCATGGGGACTTCCACGCGTTCTCCGCTGGGCCTGGACTCTTCTCCTATATCCGCCACTGGGACCAGAATGAGCGTTTTCTGGTAGTGCTTAACTTTGGGGATGTGGGCCTCTCGGCTGGACTGCAGGC CTCCGACCTGCCTGCCAGCGCCAGCCTGCCAGCCAAGGCTGACCTCCTGCTCAGCACCCAGCCAGGCCGTGAGGAGGGCTCCCCTCTTGAGCTGGAACGCCTGAAACTGGAGCCTCACGAAGGGCTGCTGCTCCGCTTCCCCTACGCGGCCTGACTTCAGCCTGACATGGACCCCACTACCCTTCTCCTTTCCTTCCCAGGCCCTTTGGCTTCTGATTTTTCT CTTTTTTAAAAACAAACAAACAAACTGTTGCAGATTATGAGTGAACCCCCAAATAGGGTGTTTTCTGCCTTCAAATAAAAGTCACCCCTGCATGGTGAA SEQ ID NO:468 The complete mRNA polynucleotide sequence of SLC3A2 SEQ ID NO:469 Translated polypeptide sequence of exon 1 of SLC3A2 MELQPPEASIAVVSIPRQLPGSHSEAGVQGLSAGDDS SEQ ID NO:470 Translated polypeptide sequence of exon 2 of SLC3A2 LGSHCVAQTGLELLASGDPLPSASQNAEMI SEQ ID NO:471 Translated polypeptide sequence of exon 3 of SLC3A2 TGSDCVTQAGLQLLASSDPPALASKNAEVT SEQ ID NO:472 Translated polypeptide sequence of exon 4 of SLC3A2 TMSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIVRAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLA SEQ ID NO:473 Translated polypeptide sequence of exon 5 of SLC3A2 LKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKK SEQ ID NO:474 Translated polypeptide sequence of exon 6 of SLC3A2 IRVILDLTPNYRGENSWFSTQVDTVATKVK SEQ ID NO:475 Translated polypeptide sequence of exon 7 of SLC3A2 DALEFWLQAGVDGFQVRDIENLK SEQ ID NO:476 Translated polypeptide sequence of exon 8 of SLC3A2 DASSFLAEWQNITKGFSED SEQ ID NO:477 Translated polypeptide sequence of exon 9 of SLC3A2 LLIAGTNSSDLQQILSLLESNKDLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWS SEQ ID NO:478 Translated polypeptide sequence of exon 10 of SLC3A2 LSQARLLTSFLPAQLLRLYQLMLFLTLPGTPVFSYGDEIGLDAAALPGQ SEQ ID NO:479 Translated polypeptide sequence of exon 11 of SLC3A2 PMEAPVMLWDESSFPDIPGAVSANMTVK SEQ ID NO:480 Translated polypeptide sequence of exon 12 of SLC3A2 GQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDLPASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA The complete protein sequence of SEQ ID NO:481 SLC3A2 MELQPPEASIAVVSIPRQLPGSHSEAGVQGLSAGDDSELGSHCVAQTGLELLASGDPLPSASQNAEMIETGSDCVTQAGLQLLASSDPPALASKNAEVTGTMSQDTEVDMKEVELNELEPEKQPMNAASGAAMSLAGAEKNGLVKIKVAEDEAEAAAAAKFTGLSKEELLKVAGSPGWVRTRWALLLLFWLGWLGMLAGAVVIIV RAPRCRELPAQKWWHTGALYRIGDLQAFQGHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKKSIRVILDLTPNYRGENSWFSTQVDTVATKVKDALEFWLQAGVDGFQVRDIENLKDASSFLAEWQNITKGFSEDRLLIAGTNSSDLQQIL SLLESNKDLLLLTSSYLSDSGSTGEHTKSLVTQYLNATGNRWCSWSLSQARLLTSFLPAQLLRLYQLMMLFTLPGTPVFSYGDEIGLDAAALPGQPMEAPVMLWDESSFPDIPGAVSANMTVKGQSEDPGSLLSLFRRLSDQRSKERSLLHGDFHAFSAGGLFSYIRHWDQNERFLVVLNFGDVGLSAGLQASDL PASASLPAKADLLLSTQPGREEGSPLELERLKLEPHEGLLLRFPYAA SEQ ID NO:482 The polynucleotide sequence of exons 1 and 2 of SLC3A2 arranged in sequence SEQ ID NO:483 SLC3A2 exon 1-2 translated polypeptide sequence arranged in order app sequence informationSEQ ID NO:484 Sequence from APP exon 14 at 5' of the APP-NRG1 fusion TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAG SEQ ID NO:485 NRG1 exon 6 sequence at 3' of APP-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO:486 APP-NRG1 polynucleotide sequence TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO:487 APP-NRG1 polypeptide sequence EPVDARPAADRGLTTRPATSTSTTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO:488 APP exon 1 GTCAGTTTCCTCGGCAGCGGTAGGCGAGAGCACGCGGAGGAGCGTGCGCGGGGGCCCCGGGAGACGGCGGCGGTGGCGGCGCGGGCAGAGCAAGGACGCGGCGGATCCCACTCGCACAGCAGCGCACTCGGTGCCCCGCGCAGGGTCGCGATGCTGCCCGGTTTGGCACTGCTCCTGCTGGCCGCCTGGACGGCTCGGGCG CTGGAG SEQ ID NO:489 APP exon 2 GTCTACCCTGAACTGCAGATCACCAATGTGGTAGAAGCCAACCAACCAGTGACCATCCAGAACTGGTGCAAGCGGGGCCGCAAGCAGTGCAAGACCCATCCCACTTTGTGATTCCCTACCGCTGCTTAG SEQ ID NO:490 APP exon 3 TTGGTGAGTTTGTAAGTGATGCCCTTCTCGTTCCTGACAAGTGCAAATTCTTACACCAGGAGAGGATGGATGTTTGCGAAACTCATTCTCACTGGCACACCGTCGCCAAAGAG SEQ ID NO:491 APP Exon 4 ACATGCAGTGAGAAGAGTACCAACTTGCATGACTACGGCATGTTGCTGCCCTGCGGAATTGACAAGTTCCGAGGGGTAGAGTTTGTGTGTTGCCCACTGGCTGAAGAAAGTGACAATGTGGATTCTGCTGATGCGGAGGAGGATGACTCGGATGTCTGGTGGGGCGGAGCAGACACAGACTATGCAGATGGGAG SEQ ID NO:492 APP exon 5 TGAAGACAAAGTAGTAGAAGTAGCAGAGGAGGAAGAAGTGGCTGAGGTGGAAGAAGAAGAAGCCGATGATGACGAGGACGATGAGGATGGTGATGAGGTAGAGGAAGAGGCTGAGGAACCCTACGAAGAAGCCAGAGAGAGAACCACCAGCATTGCCACCACCACCACCACCAGAGTCTGTGGAAGAGGTGGTTCGAG SEQ ID NO:493 APP exon 6 AGGTGTGCTCTGAACAAGCCGAGACGGGGCCGTGCCGAGCAATGATCTCCCGCTGGTACTTTGATGTGACTGAAGGGAAGTGTGCCCCATTCTTTTACGGCGGATGTGGCGGCAACCGGAACAACTTTGACACAGAAGAGTACTGCATGGCCGTGTGTGGCAGCGCCA SEQ ID NO:494 APP exon 7 TGTCCCAAAGTTTTACTCAAGACTACCCAGGAACCTCTTGCCCGAGATCCTGTTAAAC SEQ ID NO:495 APP Exon 8 TTCCTACAACAGCAGCCAGTACCCCTGATGCCGTTGACAAGTATCTCGAGACACCTGGGGATGAGAATGAACATGCCCATTTCCAGAAAGCCAAAGAGAGGCTTGAGGCCAAGCACCGAGAGAGAATGTCCCAG SEQ ID NO:496 APP Exon 9 GTCATGAGAGAATGGGAAGAGGCAGAACGTCAAGCAAAGAACTTGCCTAAAGCTGATAAGAAGGCAGTTATCCAG SEQ ID NO:497 APP Exon 10 CATTTCCAGGAGAAAGTGGAATCTTTGGAACAGGAAGCAGCCAACGAGAGACAGCAGCTGGTGGAGACACACATGGCCAGAGTGGAAGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGGAGAACTACATCACCGCTCTGCAGGCTGTTCCTCCTCGG SEQ ID NO:498 APP Exon 11 CCTCGTCACGTGTTCAATATGCTAAAGAAGTATGTCCGCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTTCGAGCATGTGCGCATGGTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCCAG SEQ ID NO:499 APP Exon 12 GTTATGACACACCTCCGTGTGATTTATGAGCGCATGAATCAGTCTCTCTCCCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAGTTG SEQ ID NO:500 APP Exon 13 ATGAGCTGCTTCAGAAAGAGCAAAACTATTCAGATGACGTCTTGGCCAACATGATTAGTGAACCAAGGATCAGTTACGGAAACGATGCTCTCATGCCATCTTTGACCGAAACGAAAACCACCGTGGAGCTCCTTCCCGTGAATGGAGAGTTCAGCCTGGACGATCTCCAGCCGTGGCATTCTTTTGGGGCTGACTCTGTGCCAGCCAACAGAAAACGA AG SEQ ID NO:501 APP Exon 14 TTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAG SEQ ID NO:502 APP Exon 15 GTTCTGGGTTGACAAATATCAAGACGGAGGAGATCTCTGAAGTGAAGATGGATGCAGAATTCCGACATGACTCAGGATATGAAGTTCATCATCAAAAATTG SEQ ID NO:503 APP exon 16 GTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCAATCATTGGACTCATGGTGGGCGGTGTTGTCATAGCGACAGTGATCGTCATCACCTTGGTGATGCTGAAGAAGAAACAGTACACATCCATTCATCATGGTGTGGTGGAG SEQ ID NO:504 APP exon 17 GTTGACGCCGCTGTCACCCCCAGAGGAGCGCCACCTGTCCAAGATGCAGCAGAACGGCTACGAAAATCCAACCTACAAGTTCTTTGAGCAGATGCAGAACTAGACCCCCGCCACAGCAGCCTCTGAAGTTGGACAGCAAAACCATTGCTTCACTACCCCATCGGTGTCCATTTATAGAATAATGTGGGAAGAAACAAACCCGTTTTATGATTTACTCATTATCGCCTT TTGACAGCTGTGCTGTAACACAAGTAGATGCCTGAACTTGAATTAATCCACACACATCAGTAATGTATTTCTATCTCTCTTTACATTTTGGTCTCTATACTACATTATTAATGGGTTTTGTGTACTGTAAAGAATTTAGCTGTATCAAACTAGTGCATGAATAGATTCTCTCCTGATTATTTATCACATAGCCCCTTAGCCAGTTGTATATTATTCTTGTGGTTTGTGACCCAAT TAAGTCCTACTTTACATATGCTTTAAGAATCGATGGGGGATGCTTCATGTGAACGTGGGAGTTCAGCTGCTTCTCTTGCCTAAGTATTCCTTTCCTGATCACTATGCATTTTAAAGTTAACATTTTTAAGTATTTCAGATGCTTTAGAGAGATTTTTTTTCCATGACTGCATTTACTGTACAGATTGCTGCTTCTGCTATATTTGTGATATAGGAATTAAGAGGAT ACACACGTTTGTTTCTTCGTGCCTGTTTTATGTGCACACATTAGGCATTGAGACTTCAAGCTTTTCTTTTTTTGTCCACGTATCTTTGGGTCTTTGATAAAGAAAAGAATCCCTGTTCATTGTAAGCACTTTTACGGGGCGGGTGGGGAGGGGTGCTCTGCTGGTCTTCAATTACCAAGAATTCTCCAAAACAATTTTCTGCAGGATGATTGTACAGAATCA TTGCTTATGACATGATCGCTTTCTACACTGTATTACATAAATAAATTAAATAAAATAACCCCGGGCAAGACTTTTCTTGAAGGATGACTACAGACATTAAATAATCGAAGTAATTTTGGGTGGGGAGAAGAGGCAGATTCAATTTTCTTTAACCAGTCTGAAGTTTCATTTATGATACAAAAAGAAGATGAAAATGGAAGTGGCAATATAAGGGGATGAGAAG GCATGCCTGGACAAACCCTTCTTTTAAGATGTGTCTTCAATTTGTATAAAATGGTGTTTTCATGTAAATAAATACATTCTTGGAGGAGCA The complete mRNA polynucleotide sequence of SEQ ID NO:505 APP GTCAGTTTCCTCGGCAGCGGTAGGCGAGAGCACGCGGAGGAGCGTGCGCGGGGGCCCCGGGAGACGGCGGCGGTGGCGGCGCGGGCAGAGCAAGGACGCGGCGGATCCCACTCGCACAGCAGCGCACTCGGTGCCCCGCGCAGGGTCGCGATGCTGCCCGGTTTGGCACTGCTCCTGCTGGCCGCCTGGACGGCTCGGGCG CTGGAGGTCTACCCTGAACTGCAGATCACCAATGTGGTAGAAGCCAAACCAACCAGTGACCATCCAGAACTGGTGCAAGCGGGGCCGCAAGCAGTGCAAGACCCATCCCACTTTGTGATTCCCTACCGCTGCTTAGTTGGTGAGTTTGTAAGTGATGCCCTTCTCGTTCCTGACAAGTGCAAATTCTTTACACCAGGAGAGGATGGATGTTTGCGAAACTCATC TTCACTGGCACACCGTCGCCAAAGAGACATGCAGTGAGAAGAGTACCAACTTGCATGACTACGGCATGTTGCTGCCCTGCGGAATTGACAAGTTCCGAGGGGTAGAGTTTGTGTGTTGCCCACTGGCTGAAGAAAGTGACAATGTGGATTCTGCTGATGCGGAGGAGGATGACTCGGATGTCTGGTGGGGCGGAGCAGACAGACTAT GCAGATGGGAGTGAAGACAAAGTAGTAGAAGTAGCAGAGGAGGAAGAAGTGGCTGAGGTGGAAGAAGAAGAAGCCGATGATGACGAGGACGATGAGGATGGTGATGAGGTAGAGGAAGAGGCTGAGGAACCCTACGAAGAAGCCACAGAGAGAACCACCAGCATTGCCACCACCACCACCACCAGAGTCTGTGGAAGAGGTGGTTCGAGAGGTGT GCTCTGAACAAGCCGAGACGGGGCCGTGCCGAGCAATGATCTCCCGCTGGTACTTTGATGTGACTGAAGGGAAGTGTGCCCCATTCTTTTACGGCGGATGTGGCGGCAACCGGAACAACTTTGACACAGAAGAGTACTGCATGGCCGTGTGTGGCAGCGCCATGTCCCAAAGTTTACTCAAGACTACCCAGGAACCTCTTGCCCGAGATCCTGTTAA ACTTCCCTACAACAGCAGCCAGTACCCTGATGCCGTTGACAAGTATCTCGAGACACCTGGGGATGAATGAACATGCCCATTTCCAGAAAGCCAAAGAGAGGCTTGAGGCCAAGCACCGAGAGAGAATGTCCCAGGTCATGAGAGAATGGGAAGAGGCAGAACGTCCAAGCAAAGAACTTGCCTAAAAGCTGATAAGAAGGCAGTTATTCCAGCATTTCCAGGAGAAAGTG GAATCTTTGGAACAGGAAGCAGCCAACGAGAGACAGCAGCTGGTGGAGACACACATGGCCAGAGTGGAAGCCATGCTCAATGACCGCCGCCGCCTGGCCCTGGAGAACTACATCACCGCTCTGCAGGCTGTTCCTCCTCGGCCTCGTCACGTGTTCAATATGCTAAAGAAGTATGTCCGCAGAACAGAAGGACAGACAGCACACCCTAAAGCATTTCGAGCAT GTGCGCATGGTGGATCCCAAGAAAGCCGCTCAGATCCGGTCCCAGGTTATGACACACCTCCGTGTGATTTATGAGCGCATGAATCAGTCTCTCTCCTGCTCTACAACGTGCCTGCAGTGGCCGAGGAGATTCAGGATGAAGTTGATGAGCTGCTTCAGAAAGAGCAAAACTATTCAGATGACGTCTTGGCCAACATGATTAGTGAACCAAGGATCAGTTACGGA AACGATGCTCTCATGCCATCTTTGACCGAAACGAAAACCACCGTGGAGCTCCTTCCCGTGAATGGAGAGTTCAGCCTGGACGATCTCCAGCCGTGGCATTCTTTTGGGGCTGACTCTGTGCCAGCCAACACAGAAAACGAAGTTGAGCCTGTTGATGCCCGCCCTGCTGCCGACCGAGGACTGACCACTCGACCAGGTTCTGGGTTGACAAATATCAAG ACGGAGGAGATCTCTGAAGTGAAGATGGATGCAGAATTCCGACATGACTCAGGATATGAAGTTCATCATCAAAAATTGGTGTTCTTTGCAGAAGATGTGGGTTCAAACAAAGGTGCAATCATTGGACTCATGGTGGGCGGTGTTGTCATAGCGACAGTGATCGTCATCACCTTGGTGATGCTGAAGAAGAAACAGTACACATCCATTCATCATGGTGT GGTGGAGGTTGACGCCGCTGTCACCCCCAGAGGAGCGCCACCTGTCCAAGATGCAGCAGAACGGCTACGAAAATCCAACCTACAAGTTCTTTGAGCAGATGCAGAACTAGACCCCCGCCACAGCAGCCTCTGAAGTTGGACAGCAAAACCATTGCTTCACTACCCCATCGGTGTCCATTTATAGAATAATGTGGGAAGAAACAAACCCGTTTTATGATTTACTCATTA TCGCCTTTTGACAGCTGTGCTGTAACACAAGTAGATGCCTGAACTTGAATTAATCCACACATCAGTAATGTATTTCTATCTCTCTTTACATTTTGGTCTCTATACTACATTATTAATGGGTTTTGTGTACTGTAAAGAATTTAGCTGTATCAAACTAGTGCATGAATAGATTCTCTCCTGATTATTTATCACATAGCCCCTTAGCCAGTTGTATATTATTCTTGTGGTTTGT GACCCAATTAAGTCCTACTTTACATATGCTTTAAGAATCGATGGGGGATGCTTCATGTGAACGTGGGAGTTCAGCTGCTTCTCTTGCCTAAGTATTCCTTTCCTGATCACTATGCATTTTAAAGTTAAAACATTTTTAAGTATTTCAGATGCTTTAGAGAGATTTTTTTTCCATGACTGCATTTTTACTGTACAGATTGCTGCTTCTGCTATATTTGTGATATAGGAATTA AGAGGATACACACGTTTGTTTCTTCGTGCCTGTTTTATGTGCACACATTAGGCATTGAGACTTCAAGCTTTTCTTTTTTTGTCCACGTATCTTTGGGTCTTTGATAAAGAAAAGAATCCCTGTTCATTGTAAGCACTTTTACGGGGCGGGTGGGAGGGGTGCTCTGCTGGTCTTCAATTACCAAGAATTCTCCAAAACAATTTTCTGCAGGATGATTGTA CAGAATCATTGCTTATGACATGATCGCTTTCTACACTGTATTACATAAATAAATTAAATAAAATAACCCCGGGCAAGACTTTTCTTTGAAGGATGACTACAGACATTAAATAATCGAAGTAATTTTGGGTGGGGAGAAGAGGCAGATTCAATTTTCTTTAACCAGTCTGAAGTTTCATTTATGATACAAAAAGAAGATGAAAATGGAAGTGGCAATATAAGGGGAT GAGGAAGGCATGCCTGGACAAAACCCTTCTTTTAAGATGTGTCTTCAATTTGTATAAAATGGTGTTTTCATGTAAATAAATACATTCTTGGAGGAGCA SEQ ID NO:506 Translated polypeptide sequence of APP exon 1 MLPGLALLLLAAWTARALE SEQ ID NO:507 Translated polypeptide sequence of APP exon 2 VYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCL SEQ ID NO:508 Translated polypeptide sequence of APP exon 3 GEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKE SEQ ID NO:509 Translated polypeptide sequence of APP exon 4 TCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADG SEQ ID NO:510 Translated polypeptide sequence of APP exon 5 EDKVVEVAEEEEVAEVEEEEADDDEDDEDGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVR SEQ ID NO:511 Translated polypeptide sequence of APP exon 6 VCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSA SEQ ID NO:512 Translated polypeptide sequence of APP exon 7 SQSLLKTTQEPLARDPVK SEQ ID NO:513 Translated polypeptide sequence of APP exon 8 PTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQ SEQ ID NO:514 Translated polypeptide sequence of APP exon 9 VMREWEEAERQAKNLPKADKKAVIQ SEQ ID NO:515 Translated polypeptide sequence of APP exon 10 HFQEKVESLEQEAANERQQLVETHMARVEAMLNDRRRRLALENYITALQAVPPR SEQ ID NO:516 Translated polypeptide sequence of APP exon 11 PRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQ SEQ ID NO:517 Translated polypeptide sequence of APP exon 12 VMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEV SEQ ID NO:518 Translated polypeptide sequence of APP exon 13 ELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADSVPANTENE SEQ ID NO:519 Translated polypeptide sequence of APP exon 14 EPVDARPAADRGLTTRP SEQ ID NO:520 Translated polypeptide sequence of APP exon 15 SGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKL SEQ ID NO:521 Translated polypeptide sequence of APP exon 16 VFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVE SEQ ID NO:522 Translated polypeptide sequence of APP exon 17 VDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN The complete protein sequence of SEQ ID NO:523 APP MLPGLALLLLAAWTARALEVYPELQITNVVEANQPVTIQNWCKRGRKQCKTHPHFVIPYRCLVGEFVSDALLVPDKCKFLHQERMDVCETHLHWHTVAKETCSEKSTNLHDYGMLLPCGIDKFRGVEFVCCPLAEESDNVDSADAEEDDSDVWWGGADTDYADGSEDKVVEVAEEEEVAEVEEEEADDDEDDE DGDEVEEEAEEPYEEATERTTSIATTTTTTTESVEEVVREVCSEQAETGPCRAMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSAMSQSLLKTTQEPLARDPVKLPTTAASTPDAVDKYLETPGDENEHAHFQKAKERLEAKHRERMSQVMREWEEAERQAKNLPKADKKAVIQHFQEKV ESLEQEAANERQQLVETHMARVEAMLNDRRRLALENYITALQAVPPRPRHVFNMLKKYVRAEQKDRQHTLKHFEHVRMVDPKKAAQIRSQVMTHLRVIYERMNQSLSLLYNVPAVAEEIQDEVDELLQKEQNYSDDVLANMISEPRISYGNDALMPSLTETKTTVELLPVNGEFSLDDLQPWHSFGADS VPANTENEVEPVDARPAADRGLTTRPGSGLTNIKTEEISEVKMDAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIATVIVITLVMLKKKQYTSIHHGVVEVDAAVTPEERHLSKMQQNGYENPTYKFFEQMQN SEQ ID NO:524 The polynucleotide sequence of all exons 1-14 of APP arranged in order SEQ ID NO:525 The translated polypeptide sequence of all exons 1-14 of APP arranged in order WRN sequence informationSEQ ID NO:526 Sequence from WRN exon 33 at 5' of WRN-NRG1 fusion AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAG SEQ ID NO:527 Sequence of NRG1 exon 6 at 3' of WRN-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGCT SEQ ID NO: 528 WRN - NRG1 polynucleotide sequence AAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATGGAGGGGAGTGC SEQ ID NO: 529 WRN - NRG1 polypeptide sequence AGCPLDLERAGLTPEVQKIIADVIRNPPVNSATSTSTTGTSHLVKCAEKEKTFCVNGGEC SEQ ID NO: 530 WRN exon 1 GTGTACTGTGTGCGCCGGGGAGGCGCCGGCTTGTACTCGGCAGCGCGGGAATAAAGTTTGCTGATTTGGTGTCTAGCCTGGATGCCTGGGTTGCAGGCCCTGC TTGTGGTGGCGCTCCACAGTCATCCGGCTGAAGAAGACCTGTTGGACTGGATCTTCTCGGG SEQ ID NO: 531 WRN exon 2 TTTTCTTTCAGATATTGTTTTGTTTTACCCATGAAGACATTGTTTTTTGGACTCTGCAAATAGGACATTTCAAAGATGAGTGAAAAAAATTGGAAACAACTGCACAGCAGCGGAAATGTCCTGA ATGGATGAATGTGCAGAATAAAAGATGTGCTGTAGAAGAAAGAAAG SEQ ID NO:532 other than WRN Exon 3 GCATGTGTTCGGAAGAGTGTTTTTGAAGATGACCTCCCCTTCTTAGAATTCACTGGATCCATTGTGTATAGTTACGATGCTAGTGATTGCTCTTTCCTGTCAGAAGATATTAG SEQ ID NO: 533 WRN Exon 4 CATGAGTCTATCAGATGGGGATGTGGTGGGATTTGACATGGAGTGGCCACCATTATACAATAGAGGGA AACTTGGCAAAGTTGCACTAATTCAGTTGTGTGTTTCTGAGAGCAAATGTTACTTGTTCCACGTTTCTTCCATGTCAG SEQ ID NO: 534 WRN Exon 5 TTTTTCCCCAGGGATTAAAAATGTTGCTTGAAAATAAAAGCAGTTAAAAAGGCAGGTGTAGGAATTGAAGGAGATCAGTGGAAACTTCTACGTGACTTTGATATCAAATTGAAGAATTTT GTGGAGTTGACAGATGTTGCCAATAAAAAG SEQ ID NO:535 WRN exon 6 CTGAAATGCACAGAGACCTGGAGCCTTAACAGTCTGGTTAAACACCCTCTTAGGTAAACAGCTCCTGAAAGACAAGTCTATCCGCTGTAGCAATTGGAGTAAGCAATTGGAGTAAGAAACTGTATGCAGCCACTGATGCTTAT SEQ ID NO :536 WRN exons 7 GCTGGTTTTATTATTTACCGAAATTTAGAGATTTTGGATGATACTGTGCAAAGGTTTGCTATAAATAAAG SEQ ID NO: 537 WRN exon 8 AGGAAGAAATCCTACTTAGCGACATGAACAAACAGTTGACTTCAATCTCTGAGGAAGTGATGGATCTGGCTAAGCATCTTCCTCATGCTTTCAGTAAATTGGAAAACCCACGGAG SEQ ID NO: 538 WRN exon 9 GGTTTCTATCTTACTAAAGGATATTTCAGAAATCTATATTCACTGAGGAGGATGATAATTGGGTCTACTAACATTGAGACTGAACTGAGGCCCAGCAATAATTTAAACTTATTTATCCTTTGAAGATTCAACTACTGGGGGAGTACAACAGAAACAAATTAGAGAACATGAAGTTTTAATTCACGTTGAAGATGA AACATGGGACCCAACACTTGATCATTTAGCTAAACATGATGGAGAAGATGTACTTGGAAATAAAGTGGAACGAAAAGAAGATGGATTTGAAGATGGAGTAGAAGACAACAAATTGAAAGAGAATATGGAAAGAGCTTGTTTGATGTCGTTAGATATCAGAACATGAACTCCAATTTTGGAACAGCAGTCTCAGGAAGAATATCTTAGTGATATTGCTTATAAA TCTACTGAG SEQ ID NO: 539 WRN exon 10 CATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGAGATGCTTAAG SEQ ID NO: 540 WRN exon 11 CATTTATCTCCCAATGATAATGAAAACGATACGTCCTATGTAATTGAGAGTGATGAAGATTTAGAAATGGA GATGCTTAAG SEQ ID NO: 541 WRN exon 12 TCTTTAGAAAACCTCAATAGTGGCACGGTAGAACCAACTTCTAAATGCTTAAAAATGGAAAGAAATCTGGGTCTTCCTACTAAAGAAGAAGAAGAAGATGATGAAAATGAAGCTAATGAAGGGGAAGAAGATGATGATAAGG SEQ ID NO: 542 WRN exon 13 ACTTTTT GTGGCCAGCACCCCAATGAAGAGCAAGTTACTTGCCTCAAGATGTACTTTGGCCATTCCAGTTTAAACC SEQ ID NO: 543 WRN exon 14 AGTTCAGTGGAAAGTGATTCATTCAGTATTAGAAGAAAGAAGAGATAATGTTGCTGTCATGGCAACTG SEQ ID NO: 544 WRN exon 15 GATATGGAAAGAGTTTGTGCT TCCAGTATCCACCTGTTTATGTAGGCAAGATTGGCCTTGTTATTCTCCCCCTTATTTCTCTGATGGAAGACCAAGTGCTACAGCTTAA SEQ ID NO: 545 WRN exon 16 AATGTCCAACATCCCAGCTTGCTTCCTTGGATCAGCACAGTCAGAAAATGTTCTAACAGATATTAATT SEQ ID NO: 546 WRN exon 17 AGGTAAATACCGGATTGTATACGTAACTCCAGAATACTGTTCAGGTAACATGGGCCTGCTCCAGCAACTTGAGGCTGATATTG SEQ ID NO: 547 WRN exon 18 GTATCACGCTCATTGCTGTGGATGAGGCTCACTGTATTTCTGAGTGGGGGCATGATTTTAGGGATTCATTCAGGAAGTTGGGCTCCCTAAAGACAGCACTGCCAATG SEQ ID NO: 548 WRN exon 19 GTTCCAATCGTTGCACTTACTGCTACTGCAAGTTTCTTCAATCCGGGAAGACATTGTAC GTTGCTTAAATCTGAGAAATCCTCAGATCACCTGTACTGGTTTTGATCGACCAAACCTGTATTTAGAAGTTAGGCGAAAAACAGGGAATATCCTTCAGGATCTGCAGCCATTTCTTGTCAAAACAAG SEQ ID NO: 549 WRN exon 20 TTCCCACTGGGAATTTGAAGGTCCAACATCATCTACTGTCCTTCTAGAAAAATGACACAACAAGTTACAGGTGAACTTAGGAAACTGAATCTATCCTGTGGAACATACCATGCGGGCATGAGTTTTAGCACAAGGAAAGACATTCATCATAGGTTTGTAAGAGATGAAATTCAG SEQ ID NO: 550 WRN exon 21 TGTGTCATAGCTACCATAGCTTTTGGAATGGGCATTAATAAAGCTGACATTCGCCAAGTCATTCATTACGGTGCTCCTAAGGACATGGAATCATATTATTCAGGAGATTGGTAGAGCTGGTCGTGATGGACTTCAAAGTTCTTGTCACGTCCTCTGGGCTCCTGCAGACATTAACTTAAATAG SEQ ID NO: 551 WRN exon 22 G CACCTTCTTACTGAGATACGTAATGAGAAGTTTCGATTATACAAATTAAAGATGATGGCAAAGATGGAAAAATATCTTCATTCTAGCAGATGTAGGAGACA SEQ ID NO: 552 WRN exon 23 AATCATCTTGTCTCATTTTGAGGACAAACAAGTACAAAAAGCCTCCTTGGGAATTATGGGAACTGAAAAATGCTGTGATAATTGCAGGTCC AG SEQ ID NO:553 WRN Exon 24 ATTGGATCATTGCTATTCCATGGATGACTCAGAGGATACATCCTGGGACTTTGGTCCACAAGCATTTAAGCTTTTGTCTGCTGTGGACATCTTAGGCGAAAAATTTGGAATTGGGCTTCCAATTTTATTTCTCCGAGGATCT SEQ ID NO: 554 WRN Exon 25 AATTCTCAGCGTCTTGCCGATCAATATCGCAG GCACAGTTTTATTTGGCACTGGCAAGGATCAAACAGAGAGTTGGTGGAAGGCTTTTTCCCGTCAGCTGATCACTGAGGGATTCTTGGTAGAAGTTTCTCGGTATAACAAATTTATGAAGATTTGCGCCCTTACGAAAAAG SEQ ID NO: 555 WRN exon 26 GGTAGAAATTGGCTTCATAAAGCTAATACAGAATCTCAGAGCTCATCCT TCAAGCTAATGAAGAATTGTGTCCAAAGAAGTTGCTTCTGCCTAG SEQ ID NO: 556 Outside of WRN Exon 27 TTCGAAAACTGTATCTTCGGGCACCAAAGAGCATTGTTATAATCAAGTACCAGTTGAATTAAGTACAGAGAAGAAG SEQ ID NO: 557 Outside of WRN Exon 28 TCTAACTTGGAGAAGTTATATTTCTTATAAACCATGTGATAAGATTTCTTCTGGGAGTAACATTTCTAAAAAAAG SEQ ID NO: 558 WRN exon 29 TATCATGGTACAGTCACCAGAAAAAAGCTTACAGTTCCTCACAGCTGTTATTTCGGCACAAGAGCAGGAGACTCAG SEQ ID NO: 559 WRN exon 3 0 ATTGTGTTATATGGCAAATTGGTAGAAGCTAGGCAGAAACATGCCAATAAAATGGATGTTCCCCCAGCTATTCTGGCAACAAACAAGATACTGGTGGATATGGCCAAAATGAG SEQ ID NO: 560 WRN exon 31 ACCAACTACGGTTGAAAACGTAAAAAGGATTGATGGTGTTTCTGAAGGCAAAGCTGCCATGTTGGCCCCT CTGTTGGAAGTCATCAAACATTTCTGCCAAACAAATAGTGTTCAG SEQ ID NO: 561 WRN exon 32 ACAGACCCTTTTCAAGTACAAACCTCAAGAAGAACAGAAGACGAGTCTGGTAGCAAAAAATAAAATATGCACACTTTCACAGTCTATGGCCATCACATACTCTTTATTCCAAGAAAAGAAGATGCCTTTG SEQ ID NO: 562 WRN exon 33 AAGAGCATAGCTGAGAGCAGGATTCTGCCTCTCATGACAATTGGCATGCACTTATC CCAAGCGGTGAAAGCTGGCTGCCCCCTTGATTTGGAGCGAGCAGGCCTGACTCCAGAGGTTCAGAAGATTATTGCTGATGTTATCCGAAACCCTCCCGTCAACTCAG SEQ ID NO: 563 WRN exon 34 ATATGAGTAAAAATTAGCCTAATCAGAATGTTAGTTCCTGAAAACATTGACACGTACCTTATCCACATGGCAATTGAGATCCTTAA ACATGGTCCTGACAGCGGACTTCAACCTTCATGTGATGTCAACAAAAGGAGATGTTTTCCCGGTTCTGAAGAGATCTGTTCAAGTTTCTAAGAGAAGCAAGGAAGAAGTAGGCATCAATACTGAG SEQ ID NO: 564 WRN exon 35 ACTTCATCTGCAGAGAGAAAAGAGACGATTACCTGTGTGGTTTGCCAAAGGAAGTGATACCAGCAAGAAAT TAATGGACAAAACGAAAAGGGGAGGTCTTTTTTTAGTTAAGCTGGCAATTACCAGAACAATTATGTTTCTTGCTGTATTATAAGAGGATAGCTATATTTTATTTCTGAAGAGTAAGGAGTAGTATTTTGGCTTAAAAATCATTCTAATTACAAAAGTTCACTGTTTATTGAAGAACTGGCATCTTAAATCAGCCTTCCGCAATTCATGTAGTTTCTGGGTCTTCTGGGA GCCTACGTGAGTACATCACCCTAACAGAATATTAAATTAGACTTCCTGTAAGATTGCTTTAAGAAACTGTTACTGTCCTGTTTTTCTAATCTCTTTATTAAAACAGTGTATTTGGAAAATGTTATGTGCTCTGATTTGATATAGATAACAGATTAGTAGTTACATGGTAATTATGTGATATAAAATATTTCATATATTTCAAAATTCTGTTTTGTAAATGTAAGAAGCATAGTTATTT TACAAATTGTTTTTACTGTCTTTTGAAGAAGTTCTTAAATACGTTGTTAAATGGTATTAGTTGACCAGGGCAGTGAAAATGAAACCGCATTTTGGGTGCCATTAAATAGGGAAAAAACATGTAAAAAATGTAAAATGGAGACCAATTGCACTAGGCAAGTGTATATTTTGTATTTTATACAATTTCTATTTTTTTCAAGTAATAAACAATGTTTTTCACTGA ATATTATATATATATTTTTTAGCTTTTCATTTACTTAATTATTTTAAGTACCTTTTATTTTTCCAGGATGTCAGAATTTGATTCAATCTCTCTTATGTAGCACATGTGACTTAATTTAAACCTATACTGTGACACAGAGTTGGGTAAACGATGATTATTTAACTTTAAGCAGTTCACCATCCATTTCAAAGCCTTTGATTGGCTTTTTTGTAAATAAAAATAACTTGTTAAGAAAC AAATATATCTGTCATAGAAGAACTAGAAAATCCAGGGAAGTGAGAAAAATGAAAATAAAATCATTCATAGTTTTACTAGTAGCTAATCACAGTCAAACCTCTTTGTGTATCCACCAGACTTTTTTTATATTCATTTGTTTTTAGTTAAAAATAAAAGTCTCGTATATTCCCCATTTTTCTGCATTGCATTACCCAGAAGGTAGTGGCGCCTATTAAATATGTGATATGTTGTT GTCCAGCCATGGCTTCTGCATTTGCATGCTTTTGTGTGTGCATCTGCAATACCCTGTGAATACCCTGTGTGATGGAGTGGCAAGTACGCACAGACACGTCTGCTGCATGCCTAGGTACGAGGCTGTCTCCAGGAGAAGCACTTGTTTGATTATTTGAGTTGCCAATTGAATTTGCTGCTTTTTTTCATGGCTTGCCATTTTCACTGAAAAGAATGACTAATGAAAAACG ATGATTGGTTATTAGATTTGGATGTTTGGCAGACATTTTCTCAAAATTGAACTAAGTTGGCCTCTTCACGGAAAACAACTGGTATTTGTTGTGCCAATGATAAAATTGGAGATTTCTAGCAAAATGTATAATTTTGGAAAAGTTGTGTTCCTCCACTGGAAGCTTGACAGCTTTCCTTAACATAAAGACTTCTCTTTCTCTCGCTTTCACTACTACTACTACTAATTCT TCTTCTGATTCTTCTTCTTCTCCTTCTTCCTTCTTCCTTCCTTCCTCCTCCTCCTCCTTCTTCTTCCTCTTCCTCTTCTTTCTCTCTCTTTCCTTCCTTCCCTTCCCTTCCCCTTCCTTCCTTCCTTCCTCCCTCCCTCCCTCCCTCCCTCCTCCTCCTCCTTCCTTTTCCTCCTCCTTCTTTCTTTCTCTCTCCTTCTTCTTCTTTCTCTCTCCTCTCTTCTTCTTTTCTTTCTCTTCTTTTTTTTT CTTTCAAGCAGTCCTCCCGCCTCAGTCCCCCAAAAATAGTGGGATTACAGGTGTGAGCCACCATGCACAGCCTTACATAAAGCCTTTTCTAATGAGATGGATAGTAATTAACAAATGTGAGTTTTTGATATTATATAAAGATTTTTTCTGTGTTTCGAAGATCCGTATAACTCAGTGAATCAGTATGTTCTGGATGACTAATATGTGATGTTAAGAAATCATGACTGAGG CCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGATCAGGAGATCGAGACCACCCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGTTGGTGCGTGCCTATAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACTCAGGAGGCGGAGATTG CAGTGAGCTGAGACTGCGCCACTGCACCCCAGCCTGGCGACAGAGCAAGACTCCGTCTCAAAAAAAAAAAGAAATCATGACTGGGTAAAAGATCTGTTCAGAGTACAAGATGGACCAATGGATTTGATATATTTGAATAACAGAGTGAAAAAAGTTTATTGATATAGTTTCACACTGCAACTAATCTTTAAGAAACTATTACTTGTCCACTTTTTGGTAA AATTTCAGAGAACAATGTCCACCATTATCTGAACAGGCTATTAAAATCTTCTCTTTTCCAACTACGTGCCTGTGCAAAGTCAGATTTTTTCATACTTCAGCCAAAACAGCATATCAAAATGGATTGAATGCAGAAGTAGATCTGAGAATACAGCCACTTTTGTTAAAGCCAGACAATGAGATTTGCAAAATGTAAACAATGCTGCTGTTCTCAGTTTTTAAAAAATATGTTTTT TAAAAGTATTTATGTTAATGTGTACTTGGTTTACTACTGCTATTTTTAAATAAAACAAGAAACATTTTTAAATGTCTGTTTTATTTTCTAAAGTGGTAGTGATAGATATAACCCATATTAATAAAAGCTCTTTGGGGTCCTCAGTGATTTTTTTTTAAGAGTATGGAAGGGTTCTCAGACCTAAGAGATTGAGAAATGCTGATGTAATGTTTTTATTATAAAGGTGTA CCATGAATTATGTACCTTACTTCATATTGTTGGACATTAAAGTTGCTTTCAGTTTTTTTTGTTTTAAAA SEQ ID NO:565 WRN full mRNA polynucleotide Sequence SEQ ID NO:566 Translated polypeptide sequence of WRN exon 2 MSEKKLETTAQQRKCPEWMNVQNKRCAVEERK SEQ ID NO:567 Translated polypeptide sequence of WRN exon 3 ACVRKSVFEDDLPFLEFTGSIVYSYDASDCSFLSEDI SEQ ID NO:568 Translated polypeptide sequence of WRN exon 4 MSLSDGDVVGFDME WPPLYNRGKLGKVALIQLCVSESKCYLFHVSSMS SEQ ID NO:569 The translated polypeptide sequence of exon 5 of WRN FPQGLKMLLENKAVKKAGVGIEGDQWKLLRDFDIKLKNFVELTDVANKK SEQ ID NO:570 The translated polypeptide sequence of exon 6 of WRN LKCTETWSLNSLVKHLLGKQLLKDKSIRCSNWSKFPLTEDQKLYAATDAY SEQ ID NO:571 WRN Translated polypeptide sequence of exon 7 AGFIIYRNLEILDDTVQRFAINK SEQ ID NO :572 Translated polypeptide sequence of exon 8 of WRN EEILLSDMNKQLTSISEEEVMDLAKHLPHAFSKLENPR SEQ ID NO:573 Translated polypeptide sequence of exon 9 of WRN VSILLKDISENLYSLRRMIIGSTNIETELRPSNNLNLLSFEDSTTGGVQQKQIREHEVLIHVEDETWDPTLDHLAKHDGEDVLGNKVERKED GFEDGVEDNKLKENMERACLMSLDITEHELQILEQQSQEEYLSDIAYKSTE SEQ ID NO:574 Translated polypeptide sequence of WRN exon 10 HLSPNDNENDTSYVIESDEDLEMEMLK SEQ ID NO:575 The translated polypeptide sequence of WRN exon 11 HLSPNDNENDTSYVIESDEDLEMEMLK SEQ ID NO:576 The translated polypeptide sequence of WRN exon 12 SLENLNSGTVEPTHSKCLKMERNLGLPTKEEEEDDENEANEGEEDDDK SEQ ID NO:577 The translated polypeptide sequence of WRN exon 13 FLWPAPNEEQVTCLKMYFGHSSFK SEQ ID NO:578 of WRN Translated polypeptide sequence of exon 14 VQWKVIHSVLEERRDNVAVMAT SEQ ID NO: 579 Translated polypeptide sequence of exon 15 of WRN YGKSLCFQYPPVYVGKIGLVISPLISLMEDQVLQL SEQ ID NO: 580 Translated polypeptide sequence of exon 16 of WRN MSNIPACFLGSAQSENVLTDIK SEQ ID NO: 581 WR N explicit Translated polypeptide sequence of exon 17 GKYRIVYVTPEYCSGNMGLLQQLEADI SEQ ID NO: 582 Translated polypeptide sequence of exon 18 of WRN ITLIAVDEAHCISEWGHDFRDSFRKLGSLKTALPM SEQ ID NO: 583 Translated polypeptide sequence of exon 19 of WRN VPIVALTATASSSIREDIVRCLNLRNPQITCTGFDRPNLYLE VRRKTGNILQDLQPFLVKT SEQ ID NO: 584 WRN exon 20 Translated polypeptide sequence SHWEFEGPTIIYCPSRKMTQQVTGELRKLNLSCGTYHAGMFSSTRKDIHHRFVRDEIQ SEQ ID NO:585 WRN exon 21 translated polypeptide sequence CVIATIAFGMGINKADIRQVIHYGAPKDMESYYQEIGRAGRDGLQSSCHVLWAPADINLN SEQ ID NO:586 WRN exon Translated polypeptide sequence of exon 22 HLLTEIRNEKFRLYKLKMMAKMEKYLHSSRCRR SEQ ID NO:587 Translation of WRN exon 23 Peptide sequence IILSHFEDKQVQKASLGIMGTEKCCDNCRS SEQ ID NO: 588 Translated polypeptide sequence of WRN exon 24 LDHCYSMDDSEDTSWDFGPQAFKLLSAVDILGEKFGIGLPILFLRGS SEQ ID NO: 589 Translated polypeptide sequence of WRN exon 25 NSQRLADQYRRHSLFGTGKDQSWTEWKAFSR QLITEGFLVEVSRYNKFMKICALTKK SEQ ID NO:590 Translated polypeptide sequence of exon 26 of WRN GRNWLHKANTESQSLILQANEELCPKKLLLP SEQ ID NO:591 The translated polypeptide sequence of WRN exon 27 SKTVSSGTKEHCYNQVPVELSTEKK SEQ ID NO:592 The translated polypeptide sequence of WRN exon 28 SNLEKLYSYKPCDKISSGSNISKK SEQ ID NO:593 The translation of WRN exon 29 Translate polypeptide sequence IMVQSPEKAYSSSQPVISAQEQETQ SEQ ID NO: 594 Translated polypeptide sequence of exon 30 of WRN IVLYGKLVEARQKHANKMDVPPAILATNKILVDMAKM SEQ ID NO: 595 Translated polypeptide sequence of exon 31 of WRN PTTVENVKRIDGVSEGKAAMLAPLLEVIKHFCQTNSVQ SEQ ID NO: 596 Translated polypeptide sequence of exon 32 of WRN TDLFSSTK PQEEQKTSLVAKNKICTLSQSMAITYSLFQEKKMPL SEQ ID NO : 597 The translated polypeptide sequence of WRN exon 33 KSIAESRILPLMTIGMHLSQAVKAGCPLDLERAGLTPEVQKIIADVIRNPPVNS SEQ ID NO: 598 The translated polypeptide sequence of WRN exon 34 MSKISLIRMLVPENIDTYLIHMAIEILKHGPDSGLQPSCDVNKRRCFPGSEEICSSSKRSKEEVGINTE SEQ ID NO: Translated polypeptide sequence of exon 35 of 599 WRN TSSAERKRRLPVWFAKGSDTSKKLMDKTKRGGLFS SEQ ID NO:600 Whole protein sequence of WRN SEQ ID NO:601 Polynucleotide sequence of all exons 1-33 of WRN in sequence SEQ ID NO:602 Translated polypeptide sequence of all exons 1-33 of WRN in sequenceDAAM1 sequence informationSEQ ID NO:603 Sequence from 5' UTR/exon 1 of DAAM1 at 5' of DAAM1-NRG1 fusion GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAG SEQ ID NO:604 5' UTR/exon 1 sequence of NRG1 at 3' of DAAM1-NRG1 fusion GACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA SEQ ID NO:605 DAAM1-NRG1 polynucleotide sequence GAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAGGACAGAGAGGGAGGAGGCGCGCGGGGACGGGGACGCCCAGGAGGACCCACTCGCGGGTCCCGCTCCGCTCCGGCA SEQ ID NO:606 DAAM1 exon 1 GCGAGTTAGTAACCTACGAGCGGCTGTGAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCAG SEQ ID NO:607 DAAM1 exon 2 CGTTTTTAGTCACACACAAGAAAATACACACATCAGCCCCCCCCCCCCCCCCCAAGGGGGGGGGGGGGGGTCATTCATTTTTCCCCCCCCCCCCCCCGGCGGCGCGCGAAATGATA is GcaacttttgCTCAGACACACACACACAGCACCCCCCCCCCCCCCCTGGGGGGGGGTCATGTCAGTGAACTGGGTGTG SEQ ID NO:608 DAAM1 exon 3 GATGAACTGGACCTCCACAGACAAACACAGAGAAGCCATGTTTGCACTTCCAGCTGAGAAAAAATGGCAAATATACTGTAGCAAGAAAAAAG SEQ ID NO:609 DAAM1 exon 4 GACCAGGAAGAAAACAAGGGAGCTACAAGTTGGCCTGAATTCTACATTGATCAGCTCAATTCCATGGCTGCT SEQ ID NO:610 DAAM1 exon 5 AGAAAATCTCTGCTGGCTTTAGAGAAGGAAGAAGAAGAAGAAAGAAGTAAAACTATAGAGAGTTTAAAGACAGCACTGAGGACAAAACCAATGAG SEQ ID NO:611 DAAM1 exon 6 GTTTGTAACCAGATTCATCGACTTGGATGGCCTATCATGTATCCTCAACTTTTCTAAAGACCATGGACTACGAGACCTCAGAGTCTCGAATACATACTTCTCTCATTGGCTGTATAAAGGCGTTAATGAACAACTCTCAAGGCCGGGCTCACGTCCTGGCTCATTCTGAGAGTATTAATGTAATTGCTCAGAGTCTGAGCACAGAGAACATTAAAACGAAGGTG GCCGTGCTGGAAATCTTGGGCGCCGTGTGCCTGGTTCCCGGGGGCCACAAGAAGGTTCTGCAGGCCATGCTGCACTACCAGAAGTATGCCAGCGAAAGGACCCGCTTTCAG SEQ ID NO:612 DAAM1 exon 7 ACATTAATTAACGACTTGGATAAAAGCACTGGGCGGTATCGAGATGAAGTGAGTCTCAAGACTGCCATCATGTCCTTCATTAATGCAGTGCTCCAGCCAAGGTGCAGGAGTG SEQ ID NO:613 DAAM1 exon 8 GAGAGTTTGGACTTTAGACTTCATCTTCGCTATGAATTTCTGATGTTAGGAATTCAACCTGTAATAGATAAATTAAGGGAACACGAAAATTCAACATTAGATAG SEQ ID NO:614 DAAM1 exon 9 GCATTTAGACTTTTTTGAAATGCTCCGAAATGAAGATGAACTAGAATTTGCCAAAAGATTTGAACTG SEQ ID NO:615 DAAM1 exon 10 GTTCACATAGACACAAAAAAGTGCAACTCAGATGTTTGAGCTGACCAGGAAGAGGCTGACACATAGTGAAGCTTACCCGCATTTCATGTCCATCCTGCACCACTGCCTCCAAATGCCTT SEQ ID NO:616 DAAM1 exon 11 ACAAGAGGAGTGGCAACACTGTTCAGTACTGGCTACTACTAGATAGAATTATACAGCAGATAGTTATTCCAGAATGACAAAGGACAGGACCCTGACTCCACACCTTTGGAAAACTTTAATATTAAGAATGTCGTACGAAT SEQ ID NO:617 DAAM1 exon 12 GTTGGTTAATGAAAATGAAGTTAAGCAGTGGAAAGAACAAGCGGAAAAAATGAGAAAAG SEQ ID NO:618 DAAM1 exon 13 AGCACAATGAGCTACAACAGAAACTGGAAAAGAAAGAACGAGAATGTGATGCTAAGACTCAAGAGAAGGAAGAGATGATGCAGACCTTAAATAAAATGAAAGAGAAACTTGAAAAGGAGACTACTGAGCATAAGCAAGTCAAGCAGCAGGTGGCGGACCTCACAGCACAGCTCCATGAGCTCAGTCAGG SEQ ID NO:619 DAAM1 exon 14 AGGGCCGTCTGTGCTTCAATCCCAGGTGGACCCTCGCCTGGAGCACCAGGAGGGCCCTTTCCTTCCTCTGTGCCTGGATCTCTCCTTCCTCCCACCACCCCCACCTCTACCAGGTGGGATGCTTCCCCCTCCACCGCCTCCCCTCCCTCCAGGTGGCCCTCCTCCTCCCCCAGGGCCTCCTCCCTTAGGGGCAATCATGCCACCTCCTGGTGCTCCAAT GGGCCTAGCACTGAAGAAGAAAAGCATTCCTCCAGCCCACAAATGCCCTGAAATCCTTCAACTGGTCTAAACTGCCCGAG SEQ ID NO:620 DAAM1 exon 15 AACAAACTGGAAGGAACAGTATGGACCGAAATTGATGATACAAAAGTCTTCAAAATTCTAGATCTTGAAGACCTGGAAAGAACCTTCTCTGCCTATCAAAGACAGCAG SEQ ID NO:621 DAAM1 exon 16 AAAGAAGCAGATGCCATTGATGACACTCTGAGTTCCAAACTTAAAGTTAAAGAGCTTTCGGTGATTGATGGTCGGAGAGCTCAGAATTGCAACATCCTTCTATCGAG SEQ ID NO:622 DAAM1 exon 17 GTTGAAATTATCCAATGACGAAATCAAACGGGCAATTCTAACAATGGACGAACAGGAAGATCTGCCCAAGGACATGTTGGAACAG SEQ ID NO:623 DAAM1 exon 18 CTCTTGAAATTTGTTCCTGAAAAAAGTGACATTGACCTATTGGAGGAACATAAACACGAACTGGATCGGATGGCCAAGGCTGATAGGTTCCTTTTTGAGATGAGCCG SEQ ID NO:624 DAAM1 exon 19 AATTAATCACTATCAGCAAAGGTTGCAATCGCTGTACTTCAAAAAGAGTTTGCAGAGCGTGTGGCAGAAGTGAAACCTAAAGTGGAAG SEQ ID NO:625 DAAM1 exon 20 CAATTCGTTCTGGCTCAGAAGAGGTGTTTAGGAGTGGTGCCCTCAAGCAGTTGCTGGAGGTGGTTTTGGCATTTGGAAATTATATGAATAAAGGTCAAAGAGGGAATGCATATGGATTCAAGATATCTAGCCTAAACAAAATTGCTGACACAAAATCCAGCATCGACAA SEQ ID NO:626 DAAM1 exon 21 AAACATTACCCTTTTGCACTATCTCATCACTATTGTGGAAAATAAGTACCCCAGTGTTCCAAATCTAAATGAAGAATTGCGAGATATTCCTCAAAGCTGCGAAAGTAAA SEQ ID NO:627 DAAM1 exon 22 CATGACTGAGCTGGACAAAGAAATAAGTACCTTGAGAAGTGGCTTGAAAGCAGTAGAGACA SEQ ID NO:628 DAAM1 exon 23 GAGCTGGAATATCAGAAGTCTCAGCCCCCACAGCCCGGAGATAAGTTTGTGTCTGTTGTCAGCCAGTTCATCACAGTAGCCAGCTTCAGCTTCCTGATGTTGAAGACCTTCTAGCAGAAGCTAAAGACCTG SEQ ID NO:629 DAAM1 exon 24 TTTACTAAAGCAGTGAAGCACTTTGGGGAAGAGGCTGGCAAAATACAACCAGATGAGTTCTTTGGCATTTTTGATCAATTTCTTCAAGCTGTGTCAGAAGCCAAACAAGAAAACGAAAATATGAGAAAGAAAAAGGAGGAAGAAGAACGTCGAGCTCGCATGGAAGCTCAG SEQ ID NO:630 DAAM1 exon 25 CTCAAAGAACAACGTGAAAGGGAACGTAAAATGAGAAAAGCTAAAGAATAGTGAAGAAAGCGGAGAGTTTGATGACCTTGTTTCAGCTTTACGCTCAGGAGAAGTGTTTGACAAAGACCTTTCTAAATTGAAACGGAATCGCAAACGTATTACCAACCAGATGACTGACAGCAGCAGAGAGAGACCAATCACAAACTTAATTTCTAATTTTTTTTTTT AGAAAAGCTCATTAGCAGCCCTCTAAAGTGACTAGAACGTTTCATTACACTGCCTTGCAATCCAAACAGTGGCAATTTTTTCCTTCATCTGTGAGTGAATGTGTGAACGTGTGTATGTAAATGTATGTGTGTATATATTAAAAAATGTATAGATGTCTGAGTGTTGTCTGGAGACCTATACGTATGGTTAAAAAAGATTTATGTTAATGTATGTGCTCCAAAACCTTTCGTGTAT GCATTCACATTGAGTGTGGCCTTCATTTCTTTCCCCGAACGCCATGACTGTTCAGAAGCACAATACTATTCTCCTGAAAGAGATAAGAGACATTCCCTAGATTCAAAGGCAAAACAGAAGAAACAAACAAACAAAAAAAAAAGCTTGCAAAATATTTTATGGTTTCCAAGCTTGATATCCTTTTAAAATTTTTCATTGATGGAACTGGAGTTGTTGGAAAAACATAGATT TAAAATGATTTTTGATAGCTGACATTGTGATGTTGATGTATCACATCAGTAATAGGACCAGCTTTGAATTTCTGACATTGGTGTGGGGATACAGTCTGTAAATGTTTATTGAGAACATCTTGCACACAATTTGAATTATGTAGAATGTCAATCAAGTTTTTGTATATTTAAAAGTTGGACATCAATTTTTCCCCTGATTTCATCAAGTTATCTCTGCCAAGTGC TTGATAATTTCTTCAGATTTTTGGAAAAAAACACTATAAATGCAATCCATGCTTTTTTTAAAGAACAACATTGCCAGAGTATGCTTGTTTCTAACAATATAGATATATAAACCTTAAAAATAAAATATCTCCACCCAAGACTTAAAGGAAGAATTCTCTGAAGGGATAAAGATTACTAAAAAAAAAAAAAAAAAAAAAATTAATGGGGTGCCTTTTTTTGTTATAGTTTCTATTTT CTGTTTTGTAGGACAAGCTGCATTTTCTGTAAATATAGGTCTGGACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTATTTTCTAAGTATTTGAAGTGATTGCTGTTTATATGTTGTCATTTAAAATTGTGTGTCAGTAAAGCTACCTGTAAATTTCAGTCCAAAAAATAAAGCTCTCAGGGAGACATGAATAAAATCAATGA ACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCTTAATATCCTTAAATTATGTGATATAAAGAGGACTGTTACTTTTTACTTTTTTTTTTTTTTTTTTTTTTTTTTTGGCTTTGCTTTATTTATTTGTAGTTGGGGGCTAACGTTTTTCTTTTTCTTCTTATTGATCCTGTTGTGGTTGGGTTTCCTGTGGAGAGAGTAGTTTGTCCTGT TGCACTAGAACATTTATTTACTCACTAAATTGAGTTTTCAGTCAATTAACAAATATTTTATTAAGTTCCTACTATGTACCAGGCATAGTAGGGCTACAATGGTAAGCAAGACAGGTCCCTGCCCCCAAAAGAGCTTATATTCTAATGGGGATATTAGGGATGAATAGAATACCAATGTGTGCACTGTACAGGAATCATACTATTTAAAAATAATTTGTATAAACTATAAT GCTTAGCACAGATGGCGAGTTATCTGTGCTATGTGAAAGCTGTGAAATAGTGCTCTAAGAGTTGTCAAGAGCTGTGGTTTTACATCTTTTCCTCATTGCAAATTTAGTGACTTTTCACACACTATATGGAAATAAATGACTAGCAAATAAAACAGTCATAAATACAAAGCAGAGGTTGCACTCCCCAATCCCGAGTTAACCCAGGTCTGCAATAACCCATGTTAAAGGT GCAGATAGAGACTTGGCTCAAAAAGGCTTGGAGATGAGGAAGATTGGAATATAATGATGGTTTTGTCTGTTCCTTAACTAAAGTGCCTCTATGTATATTTCTTTTCTATTTGTAGCAGGATAAATTTGGTCTGGCTCAGTTTTGGAACTGTATTTTGAAAATGGCTTTGTCTTACAGTTTAAGGAATAGACAGGTGGAGGGAAAGTCACATAAAGGAGCAAGT TTGTGTAGCTGTCCCTCTTGCCCCTTTTAATCATCCTCCTTTGATATGGCCATCCTGGTGGGCCTCCTTTGCCATTTCCATTTTTGGTTTCTTCCCTGAAAACTGTGTGCAGGTAATTCCATGTGCCATTGTTGAAAAGAAAAAAAAAAAACAAAAAAAACCTACTTTTTGATTGGTGCTGGTGTAAGTAGCCACTTTTCTCTCTTGGGTGTGTATTTAAACT TTTTTTGTTTTTTTAAATTAATGCCAAAAAGAAAATGCATAATTTGTAAACTTAATTATATGTCTTATATCTTATTAGCTTAGTAGTTGGAACCACTTAGTCTTTAGGTGCAAGACTGTTGTTGTTAGATAGTACTGAGAAAAAAAAGTATGTGTTATGAGACTGTACATGTTTTTTAAAAATAGCAATATGCAATAAAGAGATGAATTCATTGGGTGTA SEQ ID NO:631 The complete mRNA polynucleotide sequence of DAAM1 GCGAGTTAGTAACCTACGAGCGGCTGTGAAGGAAACTGTTTAACCGGATCCCATTGTACCCAGAGTGCAGAGCCGCCTTTCCAGCATGCAGGGGCTGCTCCAGCGTTTAGTCACATCAAGAAATAGAACAGAATTCAGCCATGGCCCCAAGAAAGAGAGGTGGACGAGGTATTTCATTCATTTTTGCTGTTTCCAGAAATAATGATCACCCGAAATCACGTATC GGCTGCGAAATGATAGCAACTTTGCGCTTCAGACCATGGAACCAGCATTGCCCATGCCCCCTGTGGAGGAGCTGGATGTCATGTTCAGTGAACTGGTGGATGAACTGGACCTCCACAGACAAACACAGAGAAGCCATGTTTGCACTTCCAGCTGAGAAAAAATGGCAAATATACTGTAGCAAAAAAAGGACCAGGAAGAAAACAAGGGAGCTACAAGTTGGC CTGAATTCTACATTGATCAGCTCAATTCCATGGCTGCTAGAAAATCTCTGCTGGCTTTAGAGAAGGAAGAAGAAGAAGAAAGAAGTAAAACTATAGAGAGTTTAAAGACAGCACTGAGGACAAAACCAATGAGGTTTGTAACCAGATTCATCGACTTGGATGGCCTATCATGTATCCTCAACTTTTCTAAAGACCATGGACTACGAGACCTCAGAGTCGAATACATACTT CTCTCATTGGCTGTATAAAGGCGTTAATGAACAACTCTCAAGGCCGGGCTCACGTCCTGGCTCATTCTGAGAGTATTAATGTAATTGCTCAGAGTCTGAGCACAGAGAACATTAAAACGAAGGTGGCCGTGCTGGAAATCTTGGGCGCCGTGTGCCTGGTTCCCGGGGGCCACAAGAAGGTTCTGCAGGCCATGCTGCACTACCAGAGAGTATGCCAGCGA AAGGACCCGCTTTCAGACATTAATTAACGACTTGGATAAAAGCACTGGGCGGTATCGAGATGAAGTGAGTCTCAAGACTGCCATCATGTCCTTCATTAATGCAGTGCTCCAGCCAAGGTGCAGGAGTGGAGAGTTTGGACTTTAGACTTCATCTTCGCTATGAATTTCTGATGTTAGGAATTCAACCTGTAATAGATAAATTAAGGGAACACGAAAATTCAACAC TTAGATAGGCATTTAGACTTTTTTTGAAATGCTCCGAAATGAAGATGAACTAGAATTTGCCAAAAGATTTGAACTGGTTCACATAGACACAAAAAGTGCAACTCAGATGTTTGAGCTGACCAGGAAGAGGCTGACACATAGTGAAGCTTACCCCGCATTTCATGTCCATCCTGCACCACTGCCTCCAAATGCCTTACAAGAGGAGTGGCAACACTGTTCAGTACTGGCTACTAC TAGATAGAATTATACAGCAGATAGTTATTCCAGAATGACAAAGGACAGGACCCTGACTCCACACCTTTGGAAAACTTTAATATTAAGAATGTCGTACGAATGTTGGTTAATGAAAATGAAGTTAAAGCAGTGGAAAGAACAAGCGGAAAAAATGAGAAAAGAGCACAATGAGCTACAACAGAAACTGGAAAAGAAAGAACGAGAATGTGATGCTAAGACTCAAGAGAAGGAAG AGATGATGCAGACCTTAAATAAAATGAAAGAGAAACTTGAAAAGGAGACTACTGAGCATAAGCAAGTCAAGCAGCAGGTGGCGGACCTCACAGCACAGCTCCATGAGCTCAGCAGGAGGGCCGTCTGTGCTTCAATCCCAGGTGGACCCTCGCCTGGAGCACCAGGAGGGCCCTTTCCTTCCTCTGTGCCTGGATCTCTCCTTCCTCCCCACCCACCCACCTACTAC CAGGTGGGATGCTTCCCCCTCCACCGCCTCCCCTCCCTCCAGGTGGCCCTCCTCCTCCCCCAGGGCCTCCTCCCTTAGGGGCAATCATGCCACCTCCTGGTGCTCCAATGGGCCTAGCACTGAAGAAGAAAAGCATTCCTCAGCCCACAAATGCCCTGAAATCCTTCAACTGGTCTAAACTGCCCGAGAACAAACTGGAAGGAACAGTATGGACCGAAATTGAT GATACAAAAGTCTTCAAAATTCTAGATCTTGAAGACCTGGAAAGAACCTTCTCTGCCTATCAAAGACAGCAGAAAGAAGCAGATGCCATTGATGACACTCTGAGTTCCAAACTTAAAGTTAAAGAGCTTTCGGTGATTGATGGTCGGAGAGCTCAGAATTGCAACATCCTTCTATCGAGGTTGAAATTATCCAATGACGAAATCAAACGGGCAATTCTAACAATGGACGA ACAGGAAGATCTGCCCAAGGACATGTTGGAACAGCTCTTGAAATTTGTTCCTGAAAAAAGTGACATTGACCTATTGGAGGAACATAAACACGAACTGGATCGGATGGCCAAGGCTGATAGGTTCCTTTTTGAGATGAGCCGAATTAATCACTATCAGCAAAGGTTGCAATCGCTGTACTTCAAAAAGTAAGTTTGCAGAGCGTGTGGCAGAAGTGAAACCTAAA GTGGAAGCAATTCGTTCTGGCTCAGAAGAGGTGTTTAGGAGTGGTGCCCTCAAGCAGTTGCTGGAGGTGGTTTTGGCATTTGGAAATTATATGAATAAAGGTCAAAGGGGAATGCATATGGATTCAAGATATCTAGCCTAAACAAAATTGCTGACACAAAATCCAGCATCGACAAAAACATTACCCTTTTGCACTATCTCATCACTATTGTGGAAAATAAGTACCC CAGTGTTCTCAATCTAAATGAAGAATTGCGAGATATTCCTCCAAGCTGCGAAAGTAAACATGACTGAGCTGGACAAAGAAATAAGTACCTTGAGAAGTGGCTTGAAAGCAGTAGAGACAGAGCTGGAATATCAGAAGTCTCAGCCCCCACAGCCCGGAGATAAGTTTGTGTCTGTTGTCAGCCAGTTCATCACAGTAGCCAGCTTCAGCTTCTCTGATGTTGAAGACCTTCTA GCAGAAGCTAAAGACCTGTTTACTAAAGCAGTGAAGCACTTTGGGGAAGAGGCTGGCAAAATACAACCAGATGAGTTCTTTGGCATTTTTGATCAATTTCTTCAAGCTGTGTCAGAAGCCAAACAAGAAAACGAAAATATGAGAAAGAAAAAAGGAGGAAGAAGAACGTCGAGCTCGCATGGAAGCTCAGCTCAAAGAACAACGTGAAAGGGAACGTAAAATGAGAAAAG CTAAAGGAATAGTGAAGAAAGCGGAGAGTTTGATGACCTTGTTTCAGCTTTACGCTCAGGAGAAGTGTTTGACAAAGACCTTTCTAAATTGAAACGGAATCGCAAACGTATTACCAACCAGATGACTGACAGCAGCAGAGAGAGACCAATCACAAACTTAATTTCTAATTTTCCATGAATACTTTTTTTAAAGCTCATTAGCAGCCCTCTAAAGTGACTAGAACGTTTCAT TACACTGCCTTGCAATCCAAACAGTGGCAATTTTTTCCTTCATCTGTGAGTGAATGTGTGAACGTGTGTATGTAAATGTATGTGTATATTAAAAAATGTATATAGATGTCTGAGTGTTGTCTGGAGACCTATACGTATGGTTAAAAAAGATTTATGTTAATGTGCTCCAAAACCTTTCGTGTATGCATTCACATTGAGTGTGGCTCATTTTCTTCCCCGAACGCC ATGACTGTTCAGAAGCACAATACTATTCTCCTGAAAGAGATAAGAGACATTCCCTAGATTCAAAGGCAAAACAGAAGAAACAAACAAACAAACAAAAAAAGCTTGCAAAATATTTTATGGTTTCCAAGCTTGATATCCTTTAAAATTTTTCATTGATGGAACTGGAGTTGTTGGAAAACATAGATTTAAAATGATTTTTGATAGCTGACATTGTGATGTTGATGTAT CACATCAGTAATAGGACCAGCTTTGAATTTCTGACATTGGTGTGGGGATACAGTCTGTAAATGTTTATTGAGAACATCTTGCACACAATTTGAATTATGTAGAATGTCAATCAAGTTTTTTGTATATTTAAAAGTTGGACATCAATTTTTTCCCCTGATTTCATCAAGTTATCTCTGCCAAGTGCTCTTGATAATTTCTTCAGATTTTTGGAAAAAAACACTATATAAATG CAATCCATGCTTTTTTTAAAGAACAACATTGCCAGAGTATGCTTGTTCTAACAATATAGATATATAAACCTTAAAAATAATAAAATATCTCACCCCAAGACTTAAAGGAAGAATTCTCTGAAGGGATAAAGATTACTAAAAAAAAAAAAAAAAAAAAAAAAAATGGGTGCCTTTTTGTTTAGTTTCTATTTTCTGTTTTGTAGGACAAGCTGCATTTTCTGTAAATAGGTCTG GACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTATTTTCTCTAAGTATTTGAAGTGATTGCTGTTTATATGTTGTCATTTTAAAAATTGTGTGTCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAAATAAAGCTCTCAGGGAGACATGAATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAAC TCTTAATATCCTTAAATTATGTGATATATAAAGAGGACTGTTACTTTTTACTTTTTTTTTTTTTTTTTTTTTTTTTGGCTTTGCTTTATTTATTTGTAGTTGGGGGCTAACGTTTTTCTCTTTTCTTCTATTGATCCTGTTGTGGTTGGGTTTCCTGTGGAGAGAGTAGTTTGTCCTGTTGCACTAGAACATTTACTCACTAAATTGAGTTTTCAGTC ATTAACAAATATTTTATTAAGTTCCTACTATGTACCAGGCATAGTAGGGCTACAATGGTAAGCAAGACAGGTCCCTGCCCCCAAAGAGCTTATATTCTAATGGGGATATTAAGGGATGAATAGAATACCAGTGTGCACTGTACAGGAATCATACTATTTAAAAATAATTTGTATAAAAACTATAATGCTTAGCACAGATGGCGAGTTATCTGTGCTATGTGAAAGCT GTGAAATAGTGCTCTAAGAGTTGTCCAAGAGCTGTGGTTTTACATCTTTTCCTCATTGCAAATTTAGTGACTTTTCACACACTATATGGAAATAAATGACTAGCAAATAAAACAGTCATAAATACAAAGCAGAGGTTGCACTCCCCAATCCCGAGTTAACCCAGGTCTGCAATAACACCATGTTAAAGGTGCAGATAGAGACTTGGCTCAAAAAGGCTTGGAGATGAGGAAG ATTGGAATATAATGATGGTTTTGTCTGTTCCTTAACTAAAAGTGCCTCTATGTATATTTCTTTTCTATTTGTAGCAGGATAAATTTGGTCTGGCTCAGTTTTGGAACTGTATTTTGAAAATGGCTTTGTCTTACAGTTTAAGGAATAGACAGGTGGAGGGAAAGTCACATAAAGGAGCAAGTTTGTGTAGCTGTCCCTCTTGCCCCTTTTAATCATCCTCCTTTG ATATGGCCATCCTGGTGGGCCTCCTTTGCCATTTCCATTTTTGGTTTCTTTTCCCTGAAAACTGTGTGCAGGTAATTCCATGTGCCATTGTTGAAAAGAAAAAAAAAAAAAAAAAAAAACCTACTTTTTGATTGGTGCTGGTGTAAGTAGCCACTTTTCTCTCTTGGGTGTGTATTTAAACTTTTTTTGTTTTTTAAATTAATGCCAAAAAAAAATGCATAATTT GTAAACTTAATTATATGTCTTATATCTTATTAGCTTAGTAGTTGGAACCACTTAGTCTTTAGGTGCAAGACTGTTGTTAGATAGTACTGAGAAAAAAAAGTATGTGTTATGAGACTGTACATGTTTTTTAAAAATAGCAATATGCAATAAAGAGATGAATTCATTGGGTGTA The complete protein sequence of SEQ ID NO:632 DAAM1 MAPRKRGGRGISFIFCCFRNNDHPEITYRLRNDSNFALQTMEPALPPMPPVEELDVMFSELVDELDLTDKHREAMFALPAEKKWQIYCSKKKDQEENKGATSWPEFYIDQLNSMAARKSLLALEKEEEEERSKTIESLKTALRTKPMRFVTRFIDLDGLSCILNFLKTMDYETSESRIHTSLIGCIKAL MNNSQGRAHVLAHSESINVIAQSLSTENIKTKVAVLEILGAVCLVPGGHKKVLQAMLHYQKYASERTRFQTLINDLDKSTGRYRDEVSLKTAIMSFINAVLSQGAGVESLDFRLHLRYEFLMLGIQPVIDKLREHENSTLDRHLDFEMLRNEDELEFAKRFELVHIDTKSATQMFELTRKRLT HSEAYPHFMSILHHCLQMPYKRSGNTVQYWLLLDRIIQQIVIQNDKGQDPDSTPLENFNIKNVVRMLVNENEVKQWKEQAEKMRKEHNELQQKLEKKERECDAKTQEKEEMMQTLNKMKEKLEKETTEHKQVKQQVADLTAQLHELSRRAVCASIPGGPSPGAPGGPFPSSVPGSLLPPPPPP PLPGGMLPPPPPLPPGGPPPPPGPPPLGAIMPPPPGAPMGLALKKKSIPQPTNALKSFNWSKLPENKLEGTVWTEIDDTKVFKILDLEDLERTFSAYQRQQKEADAIDDTLSSKLKVKELSVIDGRRAQNCNILLSRLKLSNDEIKRAILTMDEQEDLPKDMLEQLLKFVPEKSDIDLLEEHKHELDRMAKADRFL FEMSRINHYQQRLQSLYFKKKFAERVAEVKPKVEAIRSGSEEVFRSGALKQLLEVVLAFGNYMNKGQRGNAYGFKISSLNKIADTKSSIDKNITLLHYLITIVENKYPSVLNLNEELRDIPQAAKVNMTELDKEISTLRSGLKAVETELEYQKSQPPQPGDKFVSVVSQFITVASFSF SDVEDLLAEAKDLFTKAVKHFGEEAGKIQPDEFFGIFDQFLQAVSEAKQENENMRKKKEEEERRARMEAQLKEQRERERKMRKAKENSEESGEFDDLVSALRSGEVFDKDLSKLKRNRKRITNQMTDSSRERPITKLNF ASPH sequence informationSEQ ID NO:633 Sequence from ASPH exon 22 at 5' of the ASPH-NRG1 fusion AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAG SEQ ID NO:634 NRG1 exon 2 sequence at 3' of the ASPH-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:635 ASPH-NRG1 polynucleotide sequence AAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:636 ASPH-NRG1 polypeptide sequence GLFLPEDENLREKGDWSQFTLWQQALPPRLKEMKSQESAAAGSKLVLRCE SEQ ID NO:637 ASPH exon 1 AGTCTCAAAGCTCTGGTAGGCAAGTGCATGCAGTGTGCCTAAAACCTGCCAGCAGTACTTTTTGAGTTTTTTTTTTTTGTTTTGTTTTACTTTAGCATTTATTATTCATGGATTGAAGAAATCAAAATGGCTGAAGATAAAG SEQ ID NO:638 ASPH exon 2 AGACAAAGCATGGAGGACACAAGAATGGGAGGAAAGGCGGACTCTCAGGAACTTCATTCTTCACGTGGTTTATGGTGATTGCATTGCTGGGCGTCTGGACATCTGTAGCTGTCGTTTGGTTTGATCTTGTTGACTATGAGGAAGTTCTAG SEQ ID NO:639 ASPH exon 3 GAAAACTAGGAATCTATGATGCTGATGGTGATGGAGATTTTGATGTGGATGATGCCAAAGTTTTTATTAG SEQ ID NO:640 ASPH exon 4 GACTTAAAGAGAGATCTACTTCAGAGCCAGCAGTCCCGCCAGAAGAGGCTGAGCCACACACTGAGCCCGAGGAGCAGGTTCCTGTGGAGGCAG SEQ ID NO:641 ASPH exon 5 AACCCCAGAATATCGAAGATGAAGCAAAAAGAACAAATTCAGTCCCTTCTCCATGAAATGGTACACGCAGAACATG SEQ ID NO:642 ASPH exon 6 TTGAGGGAGAAGACTTGCAACAAGAAGATGGACCCACAGGAGAACCACAACAAGAGGATGATGAGTTTCTTATGGCGACTGATGTAGATGATAGATTTGAGACCCTGGAACCTGAAGTATCTCATGAAG SEQ ID NO:643 ASPH exon 7 AAACCGAGCATAGTTACCACGTGGAAGAGACAG SEQ ID NO:644 ASPH exon 8 TTTCACAAGACTGTAATCAGGATATGGAAGAGATGATGTCTGAGCAGGAAAATCCAG SEQ ID NO:645 ASPH exon 9 ATTCCAGTGAACCAGTAGTAGAAGATGAAAGATTGCACCATGATACAG SEQ ID NO:646 ASPH exon 10 ATGATGTAACATACCAAGTCTATGAGGAACAAG SEQ ID NO:647 ASPH exon 11 CAGTATATGAACCTCTAGAAAATGAAGGGATAGAAATCACAG SEQ ID NO:648 ASPH exon 12 AAGTAACTGCTCCCCCTGAGGATAATCCTGTAGAAGATTCACAGGTAATTGTAGAAG SEQ ID NO:649 ASPH exon 13 AAGTAAGCATTTTTCCTGTGGAAGAACAGCAGGAAGTACCACCAG SEQ ID NO:650 ASPH exon 14 AAACAAATAGAAAAACAGATGATCCAGAACAAAAAGCAAAAG SEQ ID NO:651 ASPH exon 15 TTAAGAAAAAAGAAGCCTAAACTTTTAAATAAATTTGATAAGACTATTAAAGCTGAACTTGATGCTGCAGAAAAACTCCGTAAAAGG SEQ ID NO:652 ASPH exon 16 GGAAAAATTGAGGAAGCAGTGAATGCATTTAAAGAACTAGTACGCAAATACCCTCAGAGTCCACGAGCAAGATATGGGAAGGCGCAG SEQ ID NO:653 ASPH exon 17 TGTGAGGATGATTTGGCTGAGAAGAGGAGAAGTAATGAGGTGCTACGTGGAGCCATCGAGACCTACCAAGAGGTGGCCAGCCTACCTGATGTCCCCTGCAGACCTGCTGAAGCTGAGTTTGAAGCGTCGCTCAGACAGGCAACAATTTCTAG SEQ ID NO:654 ASPH exon 18 GTCATATGAGAGGTTCCCTGCTTACCCTGCAGAGATTAGTTCAACTATTTCCCAATGATACTTCCTTAAAAAATGACCTTGGCGTGGGATACCTCTTGATAGGAGATAATGACAATGCAAAGAAAGTTTTATGAAGAG SEQ ID NO:655 ASPH exon 19 GTGCTGAGTGTGACACCTAATGATGGCTTTGCTAAAGTCCATTATGGCTTCATCCTGAAGGCACAGAACAAAATTGCTGAGAGCATCCATATTTAAAG SEQ ID NO:656 ASPH exon 20 GAAGGAATAGAATCCGGAGATCCTGGCACTGATGATGGGAGATTTTATTTCCACCTGGGGGATGCCATGCAGAGGGTTGGGAACAAAGAG SEQ ID NO:657 ASPH exon 21 GCATATAAGTGGTATGAGCTTGGGCACAAGAGAGGACACTTTGCATCTGTCTGGCAACGCTCACTCTACAATGTGAATGGACTGAAAGCACAGCCTTGGTGGACCCCAAAAAGAAACGGGCTACACAGAGTTAGTAAAG SEQ ID NO:658 ASPH exon 22 TCTTTAGAAAGAAACTGGAAGTTAATCCGAGATGAAGGCCTTGCAGTGATGGATAAAGCCAAAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAG SEQ ID NO:659 ASPH exon 23 GAAGAAGAAATGAAAATGCCTGCAAAGGAGCTCCTAAAACCTGTACCTTACTAGAAAAGTTCCCCGAGACAACAGGATGCAGAAGAGGACAG SEQ ID NO:660 ASPH exon 24 ATCAAATATTCCATCATGCACCCCGGGACTCACGTGTGGCCGCACACAGGGCCCACAAACTGCAGGCTCCGAATGCACCTGGGCTTGGTGATTCCCAAGGAAGGCTGCAAGATTCGATGTGCCAACGAGACCAA SEQ ID NO:661 ASPH exon 25 GACCTGGGAGGAAGGCAAGGTGCTCATCTTTGATGACTCCTTTGAGCACGAGGTATGGCAGGATGCCTCATCTTTCCGGCTGATATTCATCGTGGATGTGTGGCATCCGGAACTGACACCACAGCAGAGACGCAGCCTTCCAGCAATTTAGCATGAATTCATGCAAGCTTGGGAAACTCTGGAGAGAGGCTGCCTTTCTGGTTCCATTCTCTTGG GTGTGAGGATAGAATTTCGAACACCAAGAGTCAATTCCCTTGACTTGCAGCCCGAGTAATTCAAAGCCTCTCCTAGGGTCAGAAGACACTAAAGGGAATATTGCCTCGCTGCAATTCATTTAGGAAACACCCTGCTGTGTGTCATCTCATGACAGCACTGGTCTTCTGCCAGTATTTAAGGTGAACATTTGATAGCTTCTACCTTACCAGCCAAAGATATTTTTCCA CATAGAATAGGTCTAATTCAATGTATAATGAGAACATATGTAGAAACTGTGAATGGATTGCTTTAGTTTGTAATTTTTCTATGCAGTTATATTTTTCTAGTGTAGCTAGACTATTTTTGTCATCATGTACCACTACATTTTTGTTTATTTTAATGACAAGCTGTATAAATGCTTTACTTCTAGCTATTTAATGGTAGCATTACTGGGGAACTCAGACTTCCCCTTTTAATTCTTCT TAGTAAAAGATACTCATGAAAAAAGCAGTTTTTATTTTCCTAACAAAAAAGAAAGAGCTCATTATGTCAGTGTCTATGAACTGTACCCATCCCAACTTCTCAAATCGTTTGGTTTTTTTTTATCTTGATTGAGATCCTTCTCACTATGCTAGTGGTGGAGATATTGACAAAATCCTATTTCTTTCAAAGAGGAACTTTCACACCGAAAAAAGGAGCATGGAATTATTTTATATTGT TATAAAAATCCCAGATGCAAATTTTTTTAATGCCAATTATTAGAGCTTCTGGGGAAAAAGTATAGTTCACGGAAATAAAACTATGTTCTTTCAGGGTTGGGTGGATAGGTGGCTGCTAGGGTGTCTGGCTCCTGGCGGCTTTGCCATCCATGAGGCAAGGGCTGGGAACACAGTGTCTTTGCCTATGGTAGATCCATGTGAATGTCAGGAAGCC AGCTCTTCAGTCTTGGAGATGATTTCTGCTACAATTCTGTAGAAAGATTAAGGATGGCAGAGTAAAAGGTTACCAAGAATGCCAGGATGTTTTTCTTGGGCGTAGGAGGTCCAGATTACTTTTCCTTTTTGATGAAAGAGTTTGGAAGACTGTCCCATCTCTCTGGCTTGAAAATCTCTGCCATTTTAAACATCACTGTGAAATAGCAATTATTATCATCTGTATT TAGTTTTTAACATTACCCACAACATAGAAATAATAGGTAAAAATCGTCTTGCCTACTCATTCCAAAGATGATCAAGTCATTAATCTAGCAAAGTATTCATGTATCAGATTTTGTATATTTTGAATCAAAGCTAACTAGGAATGTTAGATATAAGAATGTAATGATATTCATGCACTGAATTCTAAGCCAATATGAACAAAAATGCTGCATGAATGGCACATAGGTCACCA AAGTTCATTCACAGGTAGAAAAAACTTGTGCTTTCTTTTCCATCTAAAAACAAAAGGAGACTTTCTTTATCTCATTTAAAGAACAGCTCTTTGAAATTGAAATTGACCCTTTTTGCTTGACCTTAAGGAGATTAGCTTCCAGTAGATGAGTTTGCAAAATACTTTTCCTGTTCTTTTGTTTTGCTGGTATTGAAAACTCCCACTAATCAGATGAAGAGGCATGGGAGGAAAA ATATCCAAATTAATTACTAAAATCGAGAAGAGAAGGCAAACTCTTGAAAAGTAAAAAGGTGTTTGTGACCTTCAGTATTTATTGAACAGAGGAAATAACTGACAAGGGCAATACAATTCAATGTTCATGTAGTAACATTCATGTCACTTGTTGAATTTGGTTCTCATATGTATATTGCATACACATAAATTCAAACTATAAGTCGTCATTTTTGAGCCATCATCTTACATTCAT GTAATGAAATTATGGAAGAGAGTAAAAACTAGCTCTTAACTTAGTAAATAATATGGTATTTAAAATCAGGTCACTACAGTAAGGTTCTAAGTATTGCCAATTGAAAAGCTAGAAATGGTATTACTGTTGCAAAGTGTTGTCAATAATTGACTCCAATAGCATTGTAAATACTTGTATCCCAACTATTTAAACCCAAGCAATAAAATGGATTTTCTAATTCACTTCAATTC TTTATTTTCTCTTACCTATCTATGTCTTGGTACATAAGAAAAGTGTTTCCATCACTGCATTGGTAGAATTATTTCAGTGTTATTTATTTTTGTGATTTCGTATGTCTACACAAAAATGAATTAGTTTAATTTATATTGTGATACAGTTGTTTGAGAAATATTTTTAATTCTGTTTCAACTTTTATTTCTCCAAGTGTATATATAAAAATTACTTCTGTTATTGTTCTCTACCAATAGG GGAAAAAATTAAAACATTAGATCCATTGAGAAAGAGATGATGTAATAAATAATTAAGATTAGTAATAATATTATCAGGGGTGATTATGACCAGTTGAATAATCTCTTTCCCTTGAATTATTTAGCTAACAAATTAACTTCCCAAATATTTAAAATAATGTAAAATCTATATTTTACTGCCCATTATTAACTAAAATATTTTGTTTGACTTTGAGCACCAACTGGTAATACTAATAA ATACCCATGTCATGCAGATGGCTGGGCGAATAAGAGATGTCTAAAAATATGCACTGGTCTTGGAAAACATGGCACAAGTAAGGATATCATATATGATGTCTGTTTATTTTATGTCTGATTTCTTTTGAATGAGTAGTTGGGGACTCCATTTCTAAGGAGACTAGGTAAATAAAATGACCTTTGACATTTCA SEQ ID NO:662 The complete mRNA polynucleotide sequence of ASPH AGTCTCAAAGCTCTGGTAGGCAAGTGCATGCAGTGTGCCTAAAACCTGCCAGTCAGTTTTTTTGAGTTTTTTTTTTTTGTTTTGTTTTACTTTAGCATTTATTATTCATGGATTGAAGAAATCAAAATGGCTGAAGATAAAGAGACAAAGCATGGAGGACACAAGAATGGGAGGAAAGGCGGACTCTCAGGAACTTCATTCTTCACGTGGTTTATGGTGATTGCAT TGCTGGGCGTCTGGACATCTGTAGCTGTCGTTTGGTTTGATCTTGTTGACTATGAGGAAGTTCTAGGAAAACTAGGAATCTATGATGCTGATGGTGATGGAGATTTTGATGTGGATGATGCCAAAGTTTTTATTAGGACTTAAAGAGAGATCTACTTCAGAGCCAGCAGTCCCGCCAGAAGAGGCTGAGCCACACACTGAGCCCGAGGAGCAGGTTCCTGTG GAGGCAGAACCCCAGAATATCGAAGATGAAGCAAAAAGAACAAATTCAGTCCCTTCTCCATGAAATGGTACACGCAGAACATGTTGAGGGAGAAGACTTGCAACAAGAAGATGGACCACAGGAGAACCACAACAAGAGGATGATGAGTTTCTTATGGCGACTGATGTAGATGATAGATTTGAGACCCTGGAACCTGAAGTATCTCATGAAGAAACCGAGCATAGT TACCACGTGGAAGAGACAGTTTCACAAGACTGTAATCAGGATATGGAAGAGATGATGTCTGAGCAGGAAAATCCAGATTCCAGTGAACCAGTAGTAGAAGATGAAAGATTGCACCATGATACAGATGATGTAACATACCAAGTCTATGAGGAACAAGCAGTATATGAACCTCTAGAAAATGAAGGGATAGAAATCACAGAAGTAACTGCTCCCCCTGAGGATAATCCTGTA GAAGATTCACAGGTAATTGTAGAAGAAGTAAGCATTTTTCCTGTGGAAGAACAGCAGGAAGTACCACCAGAAACAAATAGAAAAACAGATGATCCAGAACAAAAAGCAAAAGTTAAGAAAAAGAAGCCTAAACTTTTAAATAAATTTGATAAGACTATTAAAGCTGAACTTGATGCTGCAGAAAAACTCCGTAAAAGGGGAAAAATTGAGGAAGCAGTGAATGCATTTAAGAAC TAGTACGCAAATACCCTCAGAGTCCACGAGCAAGATATGGGAAGGCGCAGTGTGAGGATGATTTGGCTGAGAAGAGGAGAAGTAATGAGGTGCTACGTGGAGCCATCGAGACCTACCAAAGAGGTGGCCAGCCTACCTGATGTCCCCTGCAGACCTGCTGAAGCTGAGTTTGAAGCGTCGCTCAGACAGGCAACAATTTCTTAGGTCATATGAGAGGTTCCCTGCT TACCCTGCAGAGATTAGTTCAACTATTTCCCAATGATACTTCCTTAAAAAATGACCTTGGCGTGGGATACCTCTTGATAGGAGATAATGACAATGCAAAGAAAGTTTTATGAAGAGGTGCTGAGTGTGACACCTAATGATGGCTTTGCTAAAGTCCATTATGGCTTCATCCTGAAGGCACAGAACAAAATTGCTGAGAGCATCCCATATTTAAAGGAAGGAATAGAATCCG GAGATCCTGGCACTGATGATGGGAGATTTTATTTCCACCTGGGGGATGCCATGCAGAGGGTTGGGAACAAAGAGGCATATAAGTGGTATGAGCTTGGGCACAAGAGAGGACACTTTGCATCTGTCTGGCAACGCTCACTCTACAATGTGAATGGACTGAAAGCACAGCCTTGGTGGACCCCAAAAAGAAACGGGCTACACAGAGTTAGTAAAGTCTTTA GAAAGAAACTGGAAGTTAATCCGAGATGAAGGCCTTGCAGTGATGGATAAAGCCAAAGGTCTCTTCCTGCCTGAGGATGAAAACCTGAGGGAAAAAAGGGGACTGGAGCCAGTTCACGCTGTGGCAGCAAGGAAGAAGAAATGAAAATGCAAAGGAGCTCCTAAACCTGTACCTTACTAGAAAAGTTCCCCGAGACAACAGGATGCAGAAGAGGACAGA TCAAATATTCCATCATGCACCCCGGGACTCACGTGTGGCCGCACACAGGGCCCACAAACTGCAGGCTCCGAATGCACCTGGGCTTGGTGATTCCCAAGGAAGGCTGCAAGATTCGATGTGCCAACGAGACCAAGACCTGGGAGGAAGGCAAGGTGCTCATCTTTGATGACTCCTTTGAGCACGAGGTATGGCAGGATGCCTCATCTTTCCGGCTGATA TTCATCGTGGATGTGTGGCATCCGGAACTGACACCACAGCAGAGACGCAGCCTTCCAGCAATTTAGCATGAATTCATGCAAGCTTGGGAAACTCTGGAGAGAGGCTGCCTTTCTGGTTCCATCCTTGGGTGTGAGGATAGAATTTCGAACACCAAGAGTCAATTCCCTTGACTTGCAGCCCGAGTAATTCAAAGCCTCTCCTAGGGTCAGAAGACACTAA AGGGAATATTTGCCTCGCTGCAATTCATTTAGGAAACACCCTGCTGTGTCATCTCATGACAGCACTGGTCTTCTGCCAGTATTTAAGGTGAACATTTGATAGCTTCTACCTTACCAGCCAAAGATATTTTTTCCACATAGAATAGGTCTAATTCAATGTATAATGAGAACATATGTAGAAACTGTGAATGGATTGCTTTAGTTTGTAATTTTTCTATGCAGTTATATTTT TCTAGTGTAGCTAGACTATTTTTGTCATCATGTACCACTACATTTTTGTTTTATTTTAATGACAAGCTGTATAAATGCTTTACTTCTAGCTATTTAATGGTAGCATTACTGGGGAACTCAGACTTCCCCTTTTTTAATTCTTCTTAGTAAAAGATACTCATGAAAAAAGCAGTTTTTTTTCCTAACAAAAAAGAAAGAGCTCATTATGTCAGTGTCTATGAACTGTACCCATCCCAACTC TCAAATCGTTTGGTTTTTTTTTATCTTGATTGAGATCCTTCTCACTATGCTAGTGGTGGAGATATTGACAAAATCCTATTTCTTTCAAAGAGGAACTTTCACACCGAAAAAAGAGCATGGAATTATTTTATATTGTTATAAAAATCCCAGATGCAAATTTTTTTTAATGCCAATTATTAGAGCTTCTGGGGAAAAAAGTATAGTTCACGGAAATAAAACTATGTTCTTTC AGGGTTGGGTGGATAGGTGGCTGCTAGGGTGTCTGGCTCCTGGCGGCTTTGCCATCCATGAGGCAAGGGCTGGGAACACAGTGTCTTTGCCTATGGTAGATCCATGTGAATGTCAGGAAGCCAGCTCTTCAGTCTTGGAGATGATTTCTGCTACAATTCTGTAGAAAGATTAAGGATGGCAGAGTAAAAGGTTACCAAGAATGCCAGGATGTTT TTCTTGGGCGTAGGAGGTCCAGATTACTTTCCTTTTTGATGAAAGAGTTTGGAAGACTGTCCCATCTCTCTGGCTTGAGAAATCTCTGCCATTTTAAACATCACTGTGAAATAGCAATTATTATCATCTGTATTTAGTTTTTAACATTACCCACAACATAGAAATAATAGGTAAAAATCGTCTTGCCTACTCATTCCAAAGATGATCAAGTCATTAATCTAGCAAAGTAT TCATGTATCAGATTTTGTATATTTTGAATCAAAGCTAACTAGGAATGTTAGATATAAGAATGTAATGATATTCATGCACTGAATTCTAAGCCAATATGAACAAAAATGCTGCATGAATGGCACATATAGGTCACCAAAGTTCATTCACAGGTAGAAAAAACTTGTGCTTTCTTTTCCATCTAAAAACAAAAGGAGACTTTCTTTATCTCATTTAAAGAACAGCTCTTTGAAATT GAAATTGACCCTTTTTGCTTGACCTTAAGGAGATTAGCTTCCAGTAGATGAGTTTGCAAAATACTTTTCCTGTTCTTTTGTTTTGCTGGTATTGAAAACATCCACTAAATCAGATGAAGAGGCATGGGAGGAAAAAATATCCAATTAATTACTAAAATCGAGAAGAGAAGGCAAACTCTTGAAAAGTAAAAAAGGTGTTTGTGACCTTCAGTATTTATTGAACAGAGGAAAT AACTGACAAGGGCAATACAATTCAATGTTCATGTAGTAACATTCATGTCACTTGTTGAATTTGGTTCTCATATGTATATTGCATACACATAAATTCAAACTATAAGTCGTCATTTTTGAGCCATCATCTTACATTCATGTAATGAAATTATGGAAGAGTAAAAACTAGCTCTTAACTTAGTAAATAATATGGTATTAAAATCAGGTCACTACAGTAAGGTTCTAAGTATT GCCAATTGAAAAGCTAGAAATGGTATTACTGTTGCAAAGTGTTGTCAATAATTGACTCCAATAGCATTGTAAATACTTGTATCCCACAACTATTTAAACCCAAGCAATAAAATGGATTTTCTAATTCACTTCAATTCTTTATTTCTCTTACCTATCTATGTCTTGGTACAATAAGAAAGTGTTTCCATCACTGCATTGGTAGAATTTTCAGTGTTATTATTTTTGTGATT TCGTATGTCTAACACAAAAATGAATTAGTTTAATTTATATTGTGATACAGTTGTTTGAGAAATATTTTTAATTCTGTTTCAACTTTATTTTCTCCAAGTGTATAATATAAAAATTACTTCTGTTATTGTTCTCTACCAATAGGGGAAAAAATTAAAACATTAGATCCATTGAGAAAGAGATGATGTAATAAATAATTAAGATTAGTAATAATATTATTCAGGGGTGATTATGACCAGTT GAATAATCTCTTTCCCTTGAATTATTTAGCTAACAAATTAACTTCCCAAATATTTAAAATAATGTAAAATCATATTTTACTGCCCATTATTAACTAAAATATTTTGTTTGACTTTGAGCACCAACTGGTAATACTAATAAATACCCATGTCATGCAGATGGCTGGGCGAATAAGAGATGTCTAAAAATATGCACTGGTCTTGGAAAACATGGCACAAGTAAGGATATC ATATATGATGTCTGTTTATTTTATGTCTGATTTCTTTTGAATGAGTAGTTGGGGACTCCATTTCTAAGGAGACTAGGTAAATAAAATGACCTTTGACATTTCA SEQ ID NO:663 Translated polypeptide sequence of ASPH exon 1 MAEDK SEQ ID NO:664 Translated polypeptide sequence of ASPH exon 2 TKHGGHKNGRKGGLSGTSFFTWFMVIALLGVWTSVAVVWFDLVDYEEVL SEQ ID NO:665 Translated polypeptide sequence of ASPH exon 3 KLGIYDADGDGDFDVDDAKVLL SEQ ID NO:666 Translated polypeptide sequence of ASPH exon 4 LKERSTSEPAVPPEEAEPHTEPEEQVPVEA SEQ ID NO:667 Translated polypeptide sequence of exon 5 of ASPH PQNIEDEAKEQIQSLLHEMVHAEH SEQ ID NO:668 Translated polypeptide sequence of exon 6 of ASPH EGEDLQQEDGPTGEPQQEDDEFLMATDVDDRFETLEPEVSHE SEQ ID NO:669 Translated polypeptide sequence of exon 7 of ASPH TEHSYHVEET SEQ ID NO:670 Translated polypeptide sequence of exon 8 of ASPH SQDCNQDMEEMMSEQENP SEQ ID NO:671 Translated polypeptide sequence of ASPH exon 9 SSEPVVEDERLHHDT SEQ ID NO:672 Translated polypeptide sequence of exon 10 of ASPH DVTYQVYEEQ SEQ ID NO:673 Translated polypeptide sequence of ASPH exon 11 VYEPLENEGIEIT SEQ ID NO:674 Translated polypeptide sequence of ASPH exon 12 VTAPPEDNPVEDSQVIVE SEQ ID NO:675 Translated polypeptide sequence of ASPH exon 13 VSIFPVEEQQEVPP SEQ ID NO:676 Translated polypeptide sequence of ASPH exon 14 TNRKTDDPEQKAK SEQ ID NO:677 Translated polypeptide sequence of exon 15 of ASPH KKKKPKLLNKFDKTIKAELDAAEKLRKR SEQ ID NO:678 Translated polypeptide sequence of exon 16 of ASPH GKIEEAVNAFKELVRKYPQSPRARYGKAQ SEQ ID NO:679 Translated polypeptide sequence of ASPH exon 17 CEDDLAEKRRSNEVLRGAIETYQEVASLPDVPADLLKLSLKRRSDRQQFL SEQ ID NO:680 Translated polypeptide sequence of ASPH exon 18 HMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNAKKVYEE SEQ ID NO:681 Translated polypeptide sequence of ASPH exon 19 VLSVTPNDGFAKVHYGFILKAQNKIAESIPYLK SEQ ID NO:682 Translated polypeptide sequence of ASPH exon 20 EGIESGDPGTDDGRFYFHLGDAMQRVGNKE SEQ ID NO:683 Translated polypeptide sequence of ASPH exon 21 AYKWYELGHKRGHFASVWQRSLYNVNGLKAQPWWTPKETGYTELVK SEQ ID NO:684 Translated polypeptide sequence of ASPH exon 22 SLERNWKLIRDEGLAVMDKAKGLFLPPEDENLREKGDWSQFTLWQQ SEQ ID NO:685 Translated polypeptide sequence of ASPH exon 23 RRNENACKGAPKTCTLLEKFPETTGCRRGQ SEQ ID NO:686 Translated polypeptide sequence of ASPH exon 24 IKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANET SEQ ID NO:687 Translated polypeptide sequence of ASPH exon 25 TWEEGKVLIFDDSFEHEVWQDASSFRLIFIVDVWHPELTPQQRRSLPAI The complete protein sequence of SEQ ID NO:688 ASPH MAEDKETKHGGHKNGRKGGLSGTSFFTWFMVIALLGVWTSVAVVWFDLVDYEEVLGKLGIYDADGDGDFDVDDAKVLLGLKERSEPAVPPEEAEPHTEPEEQVPVEAEPQNIEDEAKEQIQSLLHEMVHAEHVEGEDLQQEDGPTGEPQQEDDEFLMATDVDDRFETLEPEVSHEETEHSYHVEETVS QDCNQDMEEMMSEQENPDSSEPVVEDERLHHDTDDVTYQVYEEQAVYEPLENEGIEITEVTAPPEDNPVEDSQVIVEEVSIFPVEEQQEVPPETNRKTDDPEQKAKVKKKKPKLLNKFDKTIKAELDAAEKLRKRGKIEEAVNAFKELVRKYPQSPRARYGKAQCEDDLAEKRRSNEVL RGAIETYQEVASLPDVPADLLKLSLKRRSDRQQFLGHMRGSLLTLQRLVQLFPNDTSLKNDLGVGYLLIGDNDNAKKVYEEVLSVTPNDGFAKVHYGFILKAQNKIAESIPYLKEGIESGDPGTDDGRFYFHLGDAMQRVGNKEAYKWYELGHKRGHFASVWQRSLYNVNGLKAQP WWTPKETGYTELVKSLERNWKLIRDEGLAVMDKAKGLAFLPEDENLREKGDWSQFTLWQQGRRNENACKGAPKTCTLLEKFPETTGCRRGQIKYSIMHPGTHVWPHTGPTNCRLRMHLGLVIPKEGCKIRCANETKTWEEGKVLIFDDSFEHEVWQDASSFRLIFIVDVWHPELTPQQRRSLPAI SEQ ID NO:689 The polynucleotide sequence of all exons 1-22 of ASPH arranged in order SEQ ID NO:690 Translated polypeptide sequence of all exons 1-22 of ASPH arranged in order NOTCH2 sequence informationSEQ ID NO:691 Sequence CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAG Sequence from NOTCH2 exon 6 at 5' of NOTCH2-NRG1 fusion GGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:693 Notch2 - NRG1 polynucleotide sequence CCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO: 694 Notch2-NRG1 polypeptide sequence SCTPGSTCIDRVASFSCMCP EGKAATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:695 NOTCH2 Exon 1 AGGCTGCTTCGTTGCACACCCGAGAAAGTTTCAGCCAAACTTCGGGCGGCGGCTGAGGCGGCGGCCGAGGAGCGGCGGACTCGGGGCGCGGGGAGTCGAGGCATTTGCGCCTGGGCTTCGGAGCGTAGCGCCAGGGCCTGA GCCTTTGAAGCAGGAGGAGGGGAGGAGAGAGTGGGGCTCCTCTATCGGGACCCCCTCCCCATGTGGATCTGCCCAGGCGGCGGCGGCGGCGGCGGAGGAGGAGGCGACCGAGAAGATGCCCGCCCTGCGCCCCGCTCTGCTGTGGGCGCTGCTGGCGCTCTGGCTGTGCTGCGCGGCCCCCGCGCATG SEQ ID NO: 696 NOTCH2 Exon 2 CATTGCAGTGTCGAGATGGCTATGAACCCTGTGTAAATGAAGGAATGTGTGTTACCTACCACAATGGCACAGGATACTGCAA SEQ ID NO:697 NOTCH2 Exon Exon 3 ATGTCCAGAAGGCTTCTTGGGGGAATATTGTCAACATCGAGACCCCTGTGAGAAGAACCGCTGCCAGAATGGTGGGACTTGTGTGGCCCAGGCCATGCTGGGGAAAGCCACGTGCCGATGTGCCTCAGGGTTTACAGGAGAGGACTGCCAGTACTCAACATCATCCATGCTTTGTGTCTCGACCCTGCCTGAATGGCGGCAC ATGCCATATGCTCAGCCGGGATACCTATGAGTGCACCTGTCAAGTCGGGTTTACAG SEQ ID NO:698 NOTCH2 exon 4 GTAAGGAGTGCCAATGGACGGATGCCTGCCTGTCTCATCCCTGTGCAAATGGAAGTACCTGTACCACTGTGGCCAACCAGTTCTCCTGCAAATGCCTCACAGGCTTCACAGGGCAGAAATGTGAGACTGATGTCA ATGAGTGTGACATTCCAGGACACTGCCAGCATGGTGGCACCTGCCTCAACCTGCCTGGTTCCTACCAGTGCCAGTGCCCTCAGGGCTTCACAGGCCAGTACTGTGACAGCCTGTATGTGCCCTGTGCACCCCTCACCTTGTGTCAATGGAGGCACCTGTCGGCAGACTGGTGACTTCACTTTTGAGTGCAACTGCCTTCCAG Other than 699 NOTCH2 Exon 5 GTTTTGAAGGGAGCACCTGTGAGAGGAATATTGATGACTGCCCTAACCACAGGTGTCAGAATGGAGGGGTTTGTGTGGATGGGGTCCAACACTTACAACTGCCGCTGTCCCCCCACAATGGACAG SEQ ID NO: 700 NOTCH2 exon 6 GACAGTTCTGCACAGAGGATGTGGATGAATGCCTGCTGCAGCCCAATGCCTGTCAAA ATGGGGGCACCTGTGCCAACCGCAATGGAGGCTATGGCTGTGTATGTGTCAACGGCTGGAGTGGAGATGACTGCAGTGAGAACATTGATGATTGTGCCTTCGCCTCCTGTACTCCAGGCTCCACCTGCATCGACCGTGTGGCCTCCTTCTCTTGCATGTGCCCAGAGGGGAAGGCAG 7 GTCTCCTGTGTCATCTGGATGATGCATGCATCAGCAATCCTTGCCACAAGGGGGCACTGTGTGACACCAACCCCTAAATGGGCAATATATTTGCACCTGCCCACAAGGCTACAAAGGGGCTGACTGCACAGAAGATGTGGATGAATGTGCCATGG SEQ ID NO: 702 NOTCH2 exon 8-34 CCAATAGCAATCCTTGT GAGCATGCAGGAAAATGTGTGAACACGGATGGCGCCTTCCACTGTGAGTGTCTGAAGGGTTATGCAGGACCTCGTTGTGAGATGGACATCAATGAGTGCCATTCAGACCCCTGCCAGAATGATGCTACCTGTCTGGATAAGATTGGAGGCTTCACATGTCTGTGCATGCCAGGTTTCAAAGGTGTGCATTGTGAATTAGAAATAAATGAATGTCAGAGCAACCC TTGTGTGAACAATGGGCAGTGTGTGGATAAAGTCAATCGTTTCCAGTGCCTGTGTCCTCCTGGTTTCACTGGGCCAGTTTGCCAGATTGATATTGATGACTGTTCCAGTACTCCGTGTCTGAATGGGGCAAAGTGTATCGATCACCCGAATGGCTATGAATGCCAGTGTGCCACAGGTTTCACTGGTGTGTTGTGTGAGGAGAACATTGACAACTGTGACC CCGATCCTTGCCACCATGGTCAGTGTCAGGATGGTATTGATTCCTACACCTGCATCTGCAATCCCGGGTACATGGGCGCCATCTGCAGTGACCAGATTGATGAATGTTACAGCAGCCCTTGCCTGAACGATGGTCGCTGCATTGACCTGGTCAATGGCTACCAGTGCAACTGCCAGCCAGGCACGTCAGGGGTTAATTGTGAAATTAATTTTGAT GACTGTGCAAGTAACCCTTGTATCCATGGAATCTGTATGGATGGCATTAATCGCTACAGTTGTGTCTGCTCACCAGGATTCACAGGGCAGAGATGTAACATTGACATTGATGAGTGTGCCTCCAATCCCTGTCGCAAGGGTGCAACATGTATCAACGGTGTGAATGGTTTCCGCTGTATATGCCCCGAGGGACCCCATCACCCCAGCTGCTACTCACAGGTGA ACGAATGCCTGAGCAATCCCTGCATCCATGGAAACTGTACTGGAGGTCTCAGTGGATATAAGTGTCTCTGTGATGCAGGCTGGGTTGGCATCAACTGTGAAGTGGACAAAAATGAATGCCTTTCGAATCCATGCCAGAATGGAGGAACTTGTGACAATCTGGTGAATGGATACAGGTGTACTTGCAAGAAGGGCTTTAAAGGCTATAACTGCCAGGTGA ATATTGATGAATGTGCCTCAAATCCATGCCTGAACCAAGGAACCTGCTTTGATGACATAAGTGGCTACACTTGCCACTGTGTGCTGCCATACAAGGCAAGAATTGTCAGACAGTATTGGCTCCCTGTTCCCCAAACCCTTGTGAGAATGCTGCTGTTTGCAAAGAGTCACCAAATTTTGAGAGTTATACTTGCTTGTGTGCTCCTGGCTGGCAAGGTCAGCG GTGTACCATTGACATTGACGAGTGTATCTCCAAGCCCTGCATGAACCATGGTCTCTGCCATAACACCCAGGGCAGCTACATGTGTGAATGTCCACCAGGCTTCAGTGGTATGGACTGTGAGGAGGACATTGATGACTGCCTTGCCAATCCTTGCCAGAATGGAGGTTCCTGTATGGATGGAGTGAATACTTTCCTGCCTCTGCCTTCCGGGTTTCACTGGG GATAAGTGCCAGACAGACATGAATGAGTGTCTGAGTGAACCCTGTAAGAATGGAGGGACCTGCTCTGACTACGTCAACAGTTACACTTGCAAGTGCCAGGCAGGATTTGATGGAGTCCATTGTGAGAACAACATCAATGAGTGCACTGAGAGCTCCTGTTTCAATGGTGGCACATGTGTTGATGGGATTAACTCCTTCTCTTGCTTGTGCCCTGTGGGTTT CACTGGATCCTTCTGCCTCCATGAGATCAATGAATGCAGCTCTCATCCATGCCTGAATGAGGGAACGTGTGTTGATGGCCTGGGTACCTACCGCTGCAGCTGCCCCCTGGGCTACACTGGGAAAAACTGTCAGACCCTGGTGAATCTCTGCAGTCGGTCTCCATGTAAAACAAAGGTACTTGCGTTCAGAAAAAAGCAGAGTCCCAGTGCCTATGTCCATCTG GATGGGCTGGTGCCTATTGTGACGTGCCCAATGTCTCTTGTGACATAGCAGCCTTCCAGGAGAGGTGTGCTTGTTGAACACTTGTGCCAGCACTCAGGTGTCTGCATCAATGCTGGCAACACGCATTACTGTCAGTGCCCCCTGGGCTATACTGGGAGCTACTGTGAGGAGCAACTCGATGAGTGTGCGTCCAACCCCTGCCAGCACGGGCAACATG CAGTGACTTCATTGGTGGATACAGATGCGAGTGTGTCCCAGGCTATCAGGGTGTCAACTGTGAGTATGAAGTGGATGAGTGCCAGAATCAGCCCTGCCAGAATGGAGGCACCTGTATTGACCTTGTGAACCATTTCAAGTGCTCTTGCCCACCAGGCACTCGGGGCCTACTCTGTGAAGAGAACATTGATGACTGTGCCCGGGGTCCCCATTGCCTTAATG GTGGTCAGTGCATGGATAGGATTGGAGGCTACAGTTGTCGCTGCTTGCCTGGCTTTGCTGGGGAGCGTTGTGAGGGAGACATCAACGAGTGCCTCTCCAACCCCTGCAGCTCTGAGGGCAGCCTGGACTGTATACAGCTCACCAATGACTACCTGTGTGTTTGCCGTAGTGCCTTTACTGGCCGGCACTGTGAAACCTTCGTCGATGTGTGT CCCCAGATGCCCTGCCTGAATGGAGGGACTTGTGCTGTGGCCAGTAACATGCCTGATGGTTTCATTTGCCGTTGTCCCCCGGGATTTTCCGGGGCAAGGTGCCAGAGCAGCTGTGGACAAGTGAAATGTAGGAAGGGGGAGCAGTGTGTGCACACCGCCTCTGGACCCCGCTGCTTCTGCCCCAGTCCCGGGACTGCGAGTCAGGCTGTG CCAGTAGCCCCTGCCAGCACGGGGGCAGCTGCCACCCTCAGCGCCAGCCTCCTTATTACTCCTGCCAGTGTCCCCACCATTCTCGGGTAGCCGCTGTGAACTCTACACGGCACCCCCCAGCACCCCTCCTGCCACCTGTCTGAGCCAGTATTGTGCCGACAAAGCTCGGGATGGCGTCTGTGATGAGGCCTGCAACAGCCATGCCTGCCAGTGGGATGG GGGTGACTGTTTCTCTCACCATGGAGAACCCCTGGGCCAACTGCTCCTCCCCACTTCCCTGCTGGGATTATATCAACAACCAGTGTGATGAGCTGTGCAACACGGTCGAGTGCCTGTTTGACAACTTTGAATGCCAGGGGAACAGCAAGACATGCAAGTATGACAAATACTGTGCAGACCACTTCAAAAGACAACCAGTGAGGA GTGTGGTTGGGATGGGCTGGACTGTGCTGCTGACCAACCTGAGAACCTGGCAGAAGGTACCCTGGTTATTGTGGTATTGATGCCACCTGAACAACTGCTCCAGGATGCTCGCAGCTTCTTGCGGGCACTGGGTACCCTGCTCCACACCAACCTGCGCATTAAGCGGGACTCCCAGGGGGAACTCATGGTGTACCCCTATTATGGTGA GAAGTCAGCTGCTATGAAGAAACAGAGGATGACACGCAGATCCCTTCCTGGTGAACAAGAACAGGAGGTGGCTGGCTCTAAAGTCTTTCTGGAAATTGACAACCGCCAGTGTGTTCAAGACTCAGACCACTGCTTCAAGAACACGGATGCAGCAGCAGCTCTCCTGGCCTCTCACGCCATACAGGGGACCCTGTCATACCCCTTGTGTCTGTCGTCAGTGAAT CCCTGACTCCAGAACGCACTCAGCTCCTCTATCTCCTTGCTGTTGCTGTTGTCATCATTCTGTTTATTATTCTGCTGGGGGTAATCATGGCAAAACGAAAGCGTAAGCATGGCTCTCTCTGGCTGCCTGAAGGTTTCACTCTTCGCCGAGATGCAAGCAATCACAAGCGTCGTGAGCCAGTGGGACAGGATGCTGTGGGGCTGAAAAATCTCTCA GTGCAAGTCTCAGAAGCTAACCTAATTGGTACTGGAACAAGTGAACACTGGGTCGATGATGAAGGGCCCCAGCCAAAGAAAGTAAAGGCTGAAGATGAGGCCTTACTCTCAGAAGAAGATGACCCCATTGATCGACGGCCATGGACACAGCAGCACCTTGAAGCTGCAGACATCCGTAGGACACCATCGCTGGCTCTCACCCCTCCTCAGGCAGAGCAGGAG GTGGATGTGTTAGATGTGAATGTCCGTGGCCCAGATGGCTGCACCCCATTGATGTTGGCTTCTCTCCGAGGAGGCAGCTCAGATTTGAGTGATGAAGATGAAGATGCAGAGGACTCTTCTGCTAACATCATCACAGACTTGGTCTACCAGGGTGCCAGCCTCCAGGCCCAGACAGACCGGACTGGTGAGATGGCCCTGCACCTTGCAGCCCGCTACTCAC GGGCTGATGCTGCCAAGCGTCTCCTGGATGCAGGTGCAGATGCCAATGCCCAGGACAACATGGGCCGCTGTCCACTCCATGCTGCAGTGGCAGCTGATGCCCAAGGTGTCTTCCAGATTCTGATTCGCAACCGAGTAACTGATCTAGATGCCAGGATGAATGATGGTACTACACCCCTGATCCTGGCTGCCCGCCTGGCTGTGGAGGGAATG GTGGCAGAACTGATCAACTGCCAAGCGGATGTGAATGCAGTGGATGACCATGGAAAATCTGCTCTTCACTGGGCAGCTGCTGTCAATAATGTGGAGGCAACTCTTTTGTTGTTGAAAAATGGGGCCAACCGAGACATGCAGGACAACAAGGAAGAGACCTCTGTTTCTTGCTGCCCGGGAGGGGAGCTATGAAGCAGCCAAGATCCTGTTA GACCATTTTTGCCCAATCGAGACATCCACAGACCATATGGATCGTCTTCCCCGGGATGTGGCTCGGGATCGCATGCACCATGACATTGTGCGCCTTCTGGATGAATACAATGTGACCCCAAAGCCCTCCAGGCACCGTGTTGACTTCTGCTCTCTCACCTGTCATCTGTGGGCCCAACAGATCTTTCCTCAGCCTGAAGCACCACCCCAATGGGCAAGAGTCT AGACGGCCCAGTGCCAAGAGTACCATGCCTACTAGCCTCCCTAACCTTGCCAAGGAGGCAAAGGATGCCAAGGGTAGTAGGAGGAAGAAGTCTCTGAGTGAGAAGGTCCAACTGTCTGAGAGTTCAGTAACTTTTCCCCTGTTGATTCCCTAGAATCTCCTCACACGTATGTTTCCGACACCACATCCTCTCCAATGATTACATCCCCCTGGGATCTTACAGGCCTCACCCAAC CCTATGTTGGCCACTGCCGCCCCTCCTGCCCCAGTCCATGCCCAGCATGCACTATTCTTTTTCTAACCTTCATGAAATGCAGCCTTTGGCACATGGGGCCAGCACTGTGCTTCCCTCAGTGAGCCAGTTGCTATCCCACCACCATTGTGTCTCCAGGCAGTGGCAGTGCTGGAAGCTTGAGTAGGCTCCATCCAGTCCCAGTCCCAGCAGATTGGATGAAC CGCATGGAGGTGAATGAGACCCAGTACAATGAGATGTTTGGTATGGTCCTGGCTCCAGCTGAGGGCACCCATCCTGGCATAGCTCCCAGAGCAGGCCACCTGAAGGGAAGCACATAACCACCCCTCGGGAGCCCTTGCCCCCCATTGTGACTTTCCAGCTCATCCCTAAAGGCAGTATTGCCCAACCAGCGGGGGCCTCCCCAGCCTCAGTCCACCTGCC CTCCAGCTGTTGCGGGCCCCCTGCCCACCATGTACCAGATTCCAGAAATGGCCCGTTTGCCCAGTGTGGCTTTCCCACTGCCATGATGCCCCAGCAGGACGGGCAGGTAGCTCAGCCATTCTCCCAGCCTATCATCCTTTCCCAGCCCTCTGTGGGCAAGTACCCCACCCCCTTCAGCACAGTTATGCTTCCTCAAATGCTGCTGAGCGAACACCCAGTCA CAGTGGTCACCTCCAGGGTGAGCATCCCTACCTGACACCATCCCCAGAGTCTCCTGACCAGTGGTCAAGTTCATCACCCACTCTGCTTCTGACTGGTCAGATGTGACCACCAGCCCTACCCCTGGGGGTGCTGGAGGAGGTCACGCGGGGACCTGGGACACACATGTCTGAGCCACCACACAACAACATGCAGGTTTATGCGTGAGAGAGTCCACCTCAGT GTAGAGACATAACTGACTTTTGTAAATGCTGCTGAGGAACAAATGAAGGTCATCCGGGAGAGAAATGAAGAAATCTCTGGAGCCAGCTTCTAGAGGTAGGAAAGAGAAGATGTTTCTTATTCAGATAATGCAAGAGAAGCAATTCGTCAGTTTCACTGGGTATCTGCAAGGCTTATTGATTATTCTAATCTAATAAGACAAGTTTGTGGAAATGCAAGATGAATACAAG CCTTGGGTCCATGTTTACTTCTCTTCTATTTGGAGAATAAGATGGATGCTTATTGAAGCCCAGACATTCTTGCAGCTTGGACTGCATTTTAAGCCCTGCAGGCTTCTGCCATATCCATGAGAAGATTCTACACTAGCGTCCTGTTGGGAATTATGCCCTGGAATTCTGCCTGAATTGACCTACGCATCTCCTCCTCCTTGGACATTCTTTTGTCTTCATTTG GTGCTTTTGGTTTTGCACCTCTCCGTGATTGTAGCCCTACCAGCATGTTATAGGGCAAGACCTTTGTGCTTTTGATCATTCTGGCCCATGAAAGCAACTTTGGTCTCCTTTCCCCTCCTGTCTTCCCGGTATCCCTTGGAGTCTCCAAGGTTTTACTTTGGTATGGTTCTCAGCACAAACCTTTCAAGTATGTTGTTTCTTTGGAAAATGGACATACTGTATTGTGT TCTCCTGCATATATCATTCCTGGAGAGAGAAGGGGAGAAGAATACTTTTCTTCAACAAATTTTGGGGGCAGGAGATCCCTTCAAGAGGCTGCACCTTAATTTTCTTGTCTGTGTGCAGGTCTTCATATAAACTTTACCAGGAAGAAGGGTGTGAGTTGTTGTTTTTCTGTGTATGGGCCTGGTCAGTGTAAAGTTTTATCCTTGATAGTCTAGTTACTATGACCCT CCCCACTTTTTTAAAACCAGAAAAAGGTTTGGAATGTTGGAATGACCAAGAGACAAGTTAACTCGTGCAAGAGCCAGTTACCCACCCACAGGTCCCCTACTTCCTGCCAAGCATTCCATTGACTGCCTGTATGGAACACATTTGTCCCAGATCTGAGCATTCTAGGCCTGTTTCACTCACTCACCCAGCATATGAAACTAGTCTTAACTGTTGAGCCTTTCCTTTCATATCCAC AGAAGACACTGTCTCAAATGTTGTACCCCTTGCCATTTAGGACTGAACTTTCCTTAGCCCAAGGGACCCAGTGACAGTTGTCTTCCGTTTGTCAGATGATCAGTCTCTACTGATTATCTTGCTGCTTAAAGGCCTGCTCACCAATCTTTTCTTCACACCGTGTGGTCCGTGTTACTGGTATACCCAGTATGTTCTCACTGAAGACATGGACTTTATATGTTCAAGTGCAGGA ATTGGAAAGTTGGACTTGTTTTCTATGATCCAAAACAGCCCTATAAGAAGGTTGGAAAAGGAGGAACTATATAGCAGCCTTTGCTATTTTCTGCTACCATTTCTTTTCCTCTGAAGCGGCCATGACATTCCCTTTGGCAACTAACGTAGAAACTCAACGAACATTTTCCTTTCCTAGAGTCACCTTTTAGATGATAATGGACAACTATAGACTTGCTCATTGTTCA GACTGATTGCCCCTCACCTGAATCCACTCTCTGTATTCATGCTCTTGGCAATTTCTTTGACTTTCTTTTAAGGGCAGAAGCATTTTAGTTAATTGTAGATAAAGAATAGTTTTTCTTCCTCTTCTCCTTGGGCCAGTTAATAATTGGTCCATGGCTACACTGCAACTTCCGTCCAGTGCTGTGATGCCCATGACACCTGCAAAATAAGTTCTGCCTGGGCATTTTGTAG ATATTAACAGGTGAATTCCCGACTCTTTTGGTTTGAATGACAGTTCTCATTCCTTCTATGGCTGCAAGTATGCATCAGTGCTTCCCACTTACCTGATTTGTCTGTCGGTGGCCCCATATGGAAACCCTGCGTGTCTGTTGGCATAATAGTTTTACAAATGGTTTTTCAGTCCTATCCAAATTTATTGAACCAACAAAAATAATTACTTCTGCCCTGAGATAAGCAGATT AAGTTTTGTTCATTCTCTGCTTTATTCTCTCCATGTGGCAACATTCTGTCAGCCTCTTTCATAGTGTGCAAACATTTTATCATTCTAAATGGTGACTCTCTGCCCTTGGACCCATTTATTATTCACAGATGGGGAGAACCTATCTGCATGGACCCTGTGGACCACAGCGTACCTGCCCCTTTCTGCCCTCCTGCTCCAGCCCCACTTCTGAAAGTATCAGCTACTGATCCAGC CACTGGATATTTTATATCCTCCCTTTTCCTTAAGCACAATGTCAGACCAAATTGCTTGTTTCTTTTTCTTGGACTACTTTTAATTTGGATCCTTTGGGTTTGGAGAAAGGGAATGTGAAAGCTGTCATTACAGACAACAGGTTTCAGTGATGAGGAGGACAACACTGCCTTTCAAACTTTTTACTGATCTCTTAGATTTTAAGAACTCTTGAATTGTGTGGTATCT AATAAAAGGGAAGGTAAGATGGATAATCACTTTCTCATTTGGGTTCTGAATTGGAGACTCAGTTTTTTATGAGACACATCTTTTATGCCATGTATAGATCCTCCCCTGCTATTTTTGGTTTATTTTTTATTGTTATAAATGCTTTCTTTCTTTGACTCCTCTTCTGCCTGCCTTTGGGGATAGGTTTTTTTTGTTTGTTTTATTTGCTTCCTCTGTTTTGTTTTAGCA TCATTTTCTTATGTGAGGTGGGGAAGGGAAAGGTATGAGGGAAAGAGAGTCTGAGAATTAAAATATTTAGTATAAGCAATTGGCTGTGATGCTCAAATCCATTGCATCCTCTTATTGAATTTGCCAATTTGTAATTTTTGCATAATAAAGAACCAAAGGTGTAATGTTTTGTTGAGAGGTGGTTTAGGGATTTTGGCCCTAACCAATACATTGAATGTATG ATGACTATTTGGGAGGACACATTTATGTACCCAGAGGCCCCCACTAATAAGTGGTACTATGGTTACTTCCTTGTGTACATTTCTCTAAAAGTGATATTATATCTGTTTGTATGAGAAACCCAGTAACCAATAAAATGACCGCATATTCCTGACTAAACGTAGTAAGGAAAATGCACACTTTGTTTTACTTTCCGTTTCATTCTAAAGGTAGTTAAGATGAAATTTATATGA AAGCATTTTTATCACAAATAAAAAAGGTTTGCCAAGCTCAGTGGTGTTGTATTTTTTTATTTCCAATACTGCATCCATGGCCTGGCAGTGTTACCTCATGATGTCATAATTTGCTGAGAGAGCAAATTTTCTTTTCTTTCTGAATCCCACAAAAGCCTAGCACCAAACTTCTTTTTTTCTTCCTTTAATTAGATCATAAATAAATGATCCTGGGGAAAAAGCATCTGTCAAAT AGGAAACATCACAAAACTGAGCACTTCTTCTGTGCACTAGCCATAGCTGGTGACAAACAGATGGTTGCTCAGGGACAAGGTGCCTTCCAATGGAAATGCGAAGTAGTTGCTATAGCAAGAATTGGGAACTGGGATATAAGTCATAATATTAATTATGCTGTTATGTAAATGATTGGTTTGTAACATTCCTTAAGTGAAATTTGTGTAGAACTTAATATACAGGATTA TAAAATAATATTTTGTGTATAAATTTGTTATAAGTTCACATTCATACATTTATTTATAAAGTCAGTGAGATATTTGAACATGAA SEQ ID NO:703 The complete mRNA polynucleotide sequence of NOTCH2 SEQ ID NO:704 The translated polypeptide sequence of NOTCH2 exon 1 MPALRPALLWALLALWLCCAAPAH SEQ ID NO:705 The translated polypeptide sequence of exon 2 of NOTCH2 LQCRDGYEPCVNEGMCVTYHNGTGYC SEQ ID NO:706 The translated polypeptide sequence of exon 3 of NOTCH2 CPEGFLGEYCQHRDPCEKNRCQNGGTCVAQAMLGKATCRCASGFTGEDCQYSTSHPCFVSRPCLNGGTCHMLSRDTYECTCQVGFT SEQ ID NO:707 NOTCH The translated polypeptide sequence of exon 4 KECQWTDACLSHPCANGSTCTTVANQFSCKCLTGFTGQKCETDVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYCDSLYVPCAPSPCVNGGTCRQTGDFTFECNCLP SEQ ID NO: 708 of NOTCH2 Translated polypeptide sequence of exon 5 FEGSTCERNIDDCPNHRCQNGGVCVDGVNTYNCRCPPQWT SEQ ID NO: 709 Translated polypeptide sequence of exon 6 of NOTCH2 QFCTEDVDECLLQPNACQNGGTCANNRNGGYGCVCVNGWSGDDCSENIDDCAFASCTPGSTCIDRVASFSCMCPEGKA SEQ ID NO: 710 of exon 7 of NOTCH2 Translated polypeptide sequence LLCHLDDACISNPCHKGALCDTNPLNGQYICTCPQGYKGADCTEDVDECAM SEQ ID NO:711 NOTCH2 external expression The translated polypeptide sequence of sub-8-34 SEQ ID NO:712 The whole protein sequence of NOTCH2 SEQ ID NO:713 The whole exon 1-6 of NOTCH2 The polynucleotide sequence SEQ ID NO:714 The whole exon of NOTCH2 Translated polypeptide sequences of sub-1-6 in sequenceCD74 sequence informationSEQ ID NO:715 Sequence from CD74 exon 2 at 5' of CD74-NRG1 fusion AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGC SEQ ID NO:716 NRG1 exon 2 sequence at 3' of CD74-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:717 CD74-NRG1 polynucleotide sequence AGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGCCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:718 CD74-NRG1 polypeptide sequence GRLDKLTVTSQNLQLENLRMKLPKPLPPRLKEMKSQESAAAGSKLVLRCE SEQ ID NO:719 CD74 exon 1 ATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCATCTACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCCAGGGTCCCAGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATG GATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAG SEQ ID NO:720 CD74 exon 2 CAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAGCTGGAGAACCTGCGCATGAAGCTTCCCAAGC SEQ ID NO:721 CD74 exon 3 CTCCCAAGCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATGGGAGCCCTGCCCCAGGGG SEQ ID NO:722 CD74 exon 4 CCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAG SEQ ID NO:723 CD74 exon 5 AATGCTGACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCGGAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAG SEQ ID NO:724 CD74 exon 6 GTCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCACTGACGCTCCACCGAAAG SEQ ID NO:725 CD74 exon 7 TACTGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTATCTGCCACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATAACTGCAGTG SEQ ID NO:726 CD74 exon 8 AGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAG SEQ ID NO:727 CD74 exon 9 TCCCCATGTGAGAGCAGCAGAGGCGGTCTTCAACATCCTGCCAGCCCCACAGCTACAGCTTTCTTGCTCCCTTCAGCCCCCAGCCCCTCCCCCATCTCCCACCCTGTACCTCATCCCATGAGACCCTGGTGCCTGGCTCTTTCGTCACCCTTGGACAAGACAAACCAAGTCGGAACAGCAGATAACAATGCAGCAAGGCCCTGCTGCCCAATCTCCATCTGTCAACAGG GGCGTGAGGTCCCAGGAAGTGGCCAAAAGCTAGACAGATCCCCGTTCCTGACATCACAGCAGCCTCCAACACAAGGCTCCAAGACCTAGGCTCATGGACGAGATGGGAAGGCACAGGGAGAAGGGATAACCCTACACCCAGACCCCAGGCTGGACATGCTGTCCTCTCCCCTCCAGCCTTTGGCCTTGGCTTTTCTAGCCTATTTACCTGCAGGCTGAGC CACTCTTCTTCCCTTTCCCCAGCATCACTCCCCCAAGGAAGAGCCAATGTTTTCCACCCATAATCCTTTCTGCCGACCCCTAGTTCCCTCTGCTCAGCCAAGCTTGTTATCAGCTTTCAGGGCCATGGTTCACATTAGAATAAAAGGTAGTAATTAGAA SEQ ID NO:728 The complete mRNA polynucleotide sequence of CD74 ATCCTGCCCCGCAAAAGGCAGCTTCACCAAAGTGGGGTATTTCCAGCCTTTGTAGCTTTCACTTCCATCTACCAAGTGGGCGGAGTGGCCTTCTGTGGACGAATCAGATTCCTCTCCAGCACCGACTTTAAGAGGCGAGCCGGGGGGTCCAGGGTCCCAGATGCACAGGAGGAGAAGCAGGAGCTGTCGGGAAGATCAGAAGCCAGTCATG GATGACCAGCGCGACCTTATCTCCAACAATGAGCAACTGCCCATGCTGGGCCGGCGCCCTGGGGCCCCGGAGAGCAAGTGCAGCCGCGGAGCCCTGTACACAGGCTTTTCCATCCTGGTGACTCTGCTCCTCGCTGGCCAGGCCACCACCGCCTACTTCCTGTACCAGCAGCAGGGCCGGCTGGACAAACTGACAGTCACCTCCCAGAACCTGCAG CTGGAGAACCTGCGCATGAAGCTTCCCAAGCCTCCCAAGCCCTGTGAGCAAGATGCGCATGGCCACCCCGCTGCTGATGCAGGCGCTGCCCATGGGAGCCCTGCCCCAGGGGCCCATGCAGAATGCCACCAAGTATGGCAACATGACAGAGGACCATGTGATGCACCTGCTCCAGAATGCTGACCCCCTGAAGGTGTACCCGCCACTGAAGGGGAGCTTCCCG GAGAACCTGAGACACCTTAAGAACACCATGGAGACCATAGACTGGAAGGTCTTTGAGAGCTGGATGCACCATTGGCTCCTGTTTGAAATGAGCAGGCACTCCTTGGAGCAAAAGCCCCACTGACGCTCCACCGAAAGTACTGACCAAGTGCCAGGAAGAGGTCAGCCACATCCCTGCTGTCCACCCGGGTTCATTCAGGCCCAAGTGCGACGAGAACGGCAACTAT CTGCCACTCCAGTGCTATGGGAGCATCGGCTACTGCTGGTGTGTCTTCCCCAACGGCACGGAGGTCCCCAACACCAGAAGCCGCGGGCACCATAACTGCAGTGAGTCACTGGAACTGGAGGACCCGTCTTCTGGGCTGGGTGTGACCAAGCAGGATCTGGGCCCAGTCCCATGTGAGAGCAGCAGAGGCGGTCTTCACATCCTGCCAGCCC CACACAGCTACAGCTTTCTTGCTCCCTTCAGCCCCCAGCCCCTCCCCCATCTCCCACCCTGTACCTCATCCCATGAGACCCTGGTGCCTGGCTCTTTCGTCACCCTTGGACAAGACAAACCAAGTCGGAACAGCAGATAACAATGCAGCAAGGCCCTGCTGCCCAATCTCCATCTGTCCAACAGGGGCGTGAGGTCCCAGGAAGTGGCCAAAAGCTAGACAGATCCCCGTTC CTGACATCACAGCAGCCTCCAACACAAGGCTCCAAGACCTAGGCTCATGGACGAGATGGGAAGGCACAGGGAGAAGGGATAACCCTACACCCAGACCCCAGGCTGGACATGCTGACTGTCCTCTCCCCTCCAGCCTTTGGCCTTGGCTTTTCTAGCCTATTTACCTGCAGGCTGAGCCACTCTCTTCCCTTTCCCCAGCATCACTCCCAAGGAAGAGCCAATGTTT TCCACCCATAATCCTTTTCTGCCGACCCCTAGTTCCCTCTGCTCAGCCAAGCTTGTTATCAGCTTTCAGGGCCATGGTTCACATTAGAATAAAAGGTAGTAATTAGAA SEQ ID NO:729 Translated polypeptide sequence of exon 1 of CD74 MHRRRRSSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPE SEQ ID NO:730 Translated polypeptide sequence of exon 2 of CD74 KCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPK SEQ ID NO:731 Translated polypeptide sequence of exon 3 of CD74 PKPVSKMRMATPLLMQALPMGALPQG SEQ ID NO:732 Translated polypeptide sequence of exon 4 of CD74 PMQNATKYGNMTEDHVMHLLQ SEQ ID NO:733 Translated polypeptide sequence of exon 5 of CD74 NADPLKVYPPLKGSFPENLRHLKNTMETIDWK SEQ ID NO:734 Translated polypeptide sequence of exon 6 of CD74 VFESWMHHWLLFEMSRHSLEQKPTDAPPK SEQ ID NO:735 Translated polypeptide sequence of exon 7 of CD74 LTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTRSRGHHNCS SEQ ID NO:736 Translated polypeptide sequence of exon 8 of CD74 SLELEDPSSGLGVTKQDLGP SEQ ID NO:737 Translated polypeptide sequence of exon 9 of CD74 PM SEQ ID NO:738 Full protein sequence of CD74 MHRRRRSCREDQKPVMDDQRDLISNNEQLPMLGRRPGAPESKCSRGALYTGFSILVTLLLAGQATTAYFLYQQQGRLDKLTVTSQNLQLENLRMKLPKPPKPVSKMRMATPLLMQALPMGALPQGPMQNATKYGNMTEDHVMHLLQNADPLKVYPPLKGSFPENLRHLKNTMETIDWKVFESWM HHWLLFEMSRHSLEQKPTDAPPKVLTKCQEEVSHIPAVHPGSFRPKCDENGNYLPLQCYGSIGYCWCVFPNGTEVPNTSRSRGHHNCSESLELEDPSSGLGVTKQDLGPVPM SEQ ID NO:739 The polynucleotide sequence of all exons 1-2 of CD74 arranged in order SEQ ID NO:740 Translated polypeptide sequence of all exons 1-2 of CD74 arranged in order SDC4 sequence informationSEQ ID NO:741 Sequence from SDC4 exon 2 at 5' of SDC4-NRG1 fusion TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGG SEQ ID NO:742 NRG1 exon 2 sequence at 3' of SDC4-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:743 SDC4-NRG1 polynucleotide sequence TACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:744 SDC4-NRG1 polypeptide sequence PDDEDVVGPGQESDDFELSGSGDLALPPRLKEMKSQESAAAGSKLVLRCE SEQ ID NO:745 Exon 1 of SDC4 ACTCGCCGCAGCCTGCGCGCCTTCTCCAGTCCGCGGTGCCATGGCCCCCGCCCGTCTGTTCGCGCTGCTGCTGTTCTTCGTAGGCGGAGTCGCCGAGTCG SEQ ID NO:746 Exon 2 of SDC4 ATCCGAGAGACTGAGGTCATCGACCCCCAGGACCTCCTAGAAGGCCGATACTTCTCCGGAGCCCTACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACTTTGAGCTGTCTGGCTCTGGAGATCTGG SEQ ID NO:747 Exon 3 of SDC4 ATGACTTGGAAGACTCCATGATCGGCCCTGAAGTTGTCCATCCCTTG SEQ ID NO:748 Exon 4 of SDC4 GTGCCTCTAGATAACCATATCCCCTGAGAGGGCAGGGTCTGGGAGCCAAGTCCCCACCGAACCCAAGAAACTAGAGGAGAATGAGGTTATTCCCCAAGAGAATCTCACCCGTTGAAGAGAGTGAGGATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAG SEQ ID NO:749 Exon 5 of SDC4 CTCTGATTGTGGGTGGCATCGTGGGCATCCTCTTTGCCGTCCTTCCTGATCCTACTGCTCATGTACCGTATGAAGAAGAAGGATGAAGGCAGCTATGACCTGGGCAAGAAACCCATCTACAAGAAAGCCCCCACCAATGAGTTCTACGCGTGAAGCTTGCTTGTGGGCACTGGCTTGGACTTTAGCGGGGAGGGAAGCCAGGGGATTTTGAAG GGTGGACATTAGGGTAGGGTGAGGTCAACCTAATACTGACTTGTCAGTATCTCCAGCTCTGATTACCTTTGAAGTGTTCAGAAGAGACATTGTCTTCTACTGTTCTGCCAGGTTCTTCTTGAGCTTTGGGCCTCAGTTGCCCTGGCAGAAAAATGGATTCAACTTGGCCTTTCTGAAGGCAAGACTGGGATTGGATCACTTCTTAAACTTCCAGTTAAGAATCT AGGTCCGCCCTCCAAGCCCATACTGACCATGCCTCATCCAGAGCTCCTCTGAAGCCAGGGGGCTAACGGATGTTGTGTGGAGTCCTGGCTGGAGGTCCTCCCCAGTGGCCTTCCTCCCTTCCTTCACAGCCGGTCTCCTGCCAGGAAATGGGGGAAGGAACTAGAACCACCTGCACCTTGAGATGTTTCTGTAAATGGGTACTTGTGATCACACT ACGGGAATCTCTGTGGTATATACCTGGGGCCATTCTAGGCTCTTTCAAGTGACTTTTGGAAATCAACCTTTTTTATTTGGGGGGGAGGATGGGGAAAAGAGCTGAGAGTTTATGCTGAAATGGATTTATAGAATATTTGTAAATCTATTTTTAGTGTGTTCGTTTTTTTAACTGTTCATTCCTTTGTGCAGAGTGTATATCTGCCTGGGCAAGAGT GTGGAGGTGCCGAGGTGTCTTCATTCTCTCGCACATTTTCCACAGCACCTGCTAAGTTTGTATTTAATGGTTTTTTGTTTTGTTTTTTGTTGTTTCTTGAAAATGAGAGAAGAGCCGGAGAGATGATTTTTATTAATTTTTTTTTTTTTTTTTTTTTACTATTTATAGCTTTAGATAGGGCCTCCCTTCCCCTTCTTTTCTTTGTTCTCTTTCATTAAACCCCTTCCCC AGTTTTTTTTTTATACTTTAAAACCCCGCTCCTCATGGCCTTGGCCCTTTCTGAAGCTGCTTCCTCTTATAAAATAGCTTTTGCCGAAACATAGTTTTTTTTTAGCAGATCCCAAAAATATAATGAAGGGGATGGTGGGATATTTGTGTCTGTGTTCTTATAATATATTATTTATTCTTCCTTGGTTCTAGAAAAATAGATAAATAATATTTTTTTCAGGAAATAGTGTGGTGT TTCCAGTTTGATGTTGCTGGGTGGTTGAGTGAGTGAATTTTCATGTGGCTGGGTGGGTTTTTTGCCTTTTTCTCTTGCCCTGTTCCTGGTGCCTTCTGATGGGGCTGGAATAGTTGAGGTGGATGGTTCTACCCTTTCTGCCTTCTGTTTGGGACCCAGCTGGTGTTCTTTGGTTTGCTTTCTTCAGGCTCTAGGGCTGTGCTATCCA ATACAGTAACCACATGCGGCTGTTTAAAAGTTAAGCCAATTAAAATCACATAAGATTAAAAATTCCTTCCTCAGTTGCACTAACCACGTTTCTAGAGGCGTCACTGTATGTAGTTCATGGCTACTGTACTGACAGCGAGAGCATGTCCATCTGTTGGACAGCACTATTCTAGAGAACTAAACTGGCTTAACGAGTCACAGCCTCAGCTGTGCTGGGACGACCTTGTCT CCCTGGGTAGGGGGGGGGAATGGGGGAGGGCTGATGAGGCCCCAGCTGGGGCCTGTTGTCTGGGACCTCCCTCTCCTGAGAGGGGAGGCCTGGTGGCTTAGCCTGGGCAGGTCGTGTCTCCTCCTGACCCCAGTGGCTGCGGTGAGGGGAACCACCTCCCTTGCTGCACCAGTGGCCATTAGCTCCCGTCACCA CTGCAACCCAGGGTCCCAGCTGGCTGGGTCCCTCTTCTGCCCCCAGTGCCCTTCCCCTTGGGCTGTGTTGGAGTGAGCACCTCCTCTGTAGGCACCTCTCACACTGTTGTCTGTTACTGATTTTTTTTGATAAAAAGATAATAAAACCTGGTACTTTCTAAA The complete mRNA polynucleotide sequence of SEQ ID NO:750 SDC4 ACTCGCCGCAGCCTGCGCGCCTTCTCCAGTCCGCGGTGCCATGGCCCCCGCCCGTCTGTTCGCGCTGCTGCTGTTCTTCGTAGGCGGAGTCGCCGAGTCGATCCGAGAGACTGAGGTCATCGACCCCCAGGACCTCCTAGAAGGCCGATACTTCTCCGGAGCCCTACCAGACGATGAGGATGTAGTGGGGCCCGGGCAGGAATCTGATGACT TTGAGCTGTCTGGCTCTGGAGATCTGGATGACTTGGAAGACTCCATGATCGGCCCTGAAGTTGTCCATCCCTTGGTGCCTCTAGATAACCATATCCCTGAGAGGGCAGGGTCTGGGAGCCAAGTCCCCACCGAACCCAAGAAACTAGAGGAGAATGAGGTTATTCCCCAAGAGAATCACCCGTTGAAGAGAGTGAGGATGTGTCCAACAAGGTGTCAATGT CCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCTCTGATTGTGGGTGGCATCGTGGGCATCCTCTTTGCCGTCTTCCTGATCCTACTGCTCATGTACCGTATGAAGAAGAAGGATGAAGGCAGCTATGACCTGGGCAAGAAACCCATCTACAAGAAAGCCCCCACCAATGAGTTTCTACGCGTGAAGCTTGCTTGTGGGCACT GGCTTGGACTTTAGCGGGGAGGGAAGCCAGGGGATTTTGAAGGGTGGACATTAGGGTAGGGTGAGGTCAACCTAACTGACTTGTCAGTATCTCAGCTCTGATTACCTTTGAAGTGTTCAGAAAGAGACATTGTCTTACTGTTCTGCCAGGTTCTTCTTGAGCTTTGGGCCTCAGTTGCCCTGGCAGAAAAATGGATTCAACTTGGCCTTTCTGA AGGCAAGACTGGGATTGGATCACTTCTCTAAACTTCCAGTTAAGAATCTAGGTCCGCCCTCCAAGCCCATACTGACCATGCCTCATCCAGAGCTCCTCTGAAGCCAGGGGGCTAACGGATGTTGTGTGGAGTCCTGGCTGGAGGTCCTCCCCAGTGGCCTTCCTCCCTTCCTTCACAGCCGGTCTCTCTGCCAGGAAATGGGGGAAGGAACTAGAACC ACCTGCACCTTGAGATGTTTCTGTAAATGGGTACTTGTGATCACACTACGGGAATCTCTGTGGTATATACCTGGGGCCATTCTAGGCTCTTTCAAGTGACTTTTGGAAATCAACCTTTTTTATTTGGGGGGAGGATGGGGAAAAGAGCTGAGAGTTTTATGCTGAAATGGATTTATAGAATATTTGTAAATCTATTTTGTGTTTGTTCGTTTTTTT AACTGTTCATTCCTTTGTGCAGAGTGTATATCCTCTGCCTGGGCAAGAGTGTGGAGGTGCCGAGGTGTCTTCATTCTCTCGCACATTTTCCACAGCACCTGCTAAGTTTGTATTTAATGGTTTTTGTTTTTGTTTTGTTTTGTTTCTTGAAAATGAGAGAAGAGCCGGAGAGATGATTTTTTTTTTTTTTTTTACTATTTATAGCTTTTAGATAGG GCCTCCCTTCCCCTCTTCTTTCTTTGTTCTCTTTCATTAAACCCCTTCCCCAGTTTTTTTTTTATACTTTAAACCCGCTCCTCATGGCCTTGGCCCTTTCTGAAGCTGCTTCCTCTTATAAAATAGCTTTTGCCGAAACATAGTTTTTTTAGCAGATCCCAAAAATAATGAAGGGGATGGTGGGATATTTGTGTCTGTGTTCTTATAATATTATTTATTCTTCCTT GGTTCTAGAAAAATAGATAAATATATTTTTTTCAGGAAATAGTGTGGTGTTTCCAGTTTGATGTTGCTGGGTGGTTGAGTGAGTGAATTTTCATGTGGCTGGGTGGGTTTTTTGCCTTTTTCTCTTGCCCTGTTCCTGGTGCCTTCTGATGGGGCTGGAATAGTTGAGGTGGATGGTTTCTACCCTTTCTGCCTTCTGTTTGGGACCCAGCTG GTGTTCTTTGGTTTGCTTTCTTCAGGCTCTAGGGCTGTGCTATCCAATACAGTAACCACATGCGGCTGTTTAAAAGTTAAGCCAATTAAAATCACATAAGATTAAAAATTCCTTCCTCAGTTGCACTAACCACGTTTCTAAGAGGCGTCACTGTATGTAGTACTGTACTGACAGCGAGAGCATGTCCATCTAGAGAACTAAACTG GCTTAACGAGTCACAGCCTCAGCTGTGCTGGGACGACCCTTTGTCTCCCCTGGGTAGGGGGGGGGAATGGGGGAGGGCTGATGAGGCCCCAGCTGGGGCCTGTTGTCTCGGGACCTCCCTCTCCTGAGAGGGGAGGCCTGGTGGCTTAGCCTGGGCAGGTCGTGTCTCCTCCTGACCCCAGTGGCTGCGGTGAGGG GAACCACCCTCCCTTGCTGCACCAGTGGCCATTAGCTCCCGTCACCCACTGCAACCCAGGGTCCCAGCTGGCTGGGTCCCTCTTCTGCCCCCAGTGCCCTTCCCCTTGGGCTGTGTTGGAGTGAGCACCTCCTCTGTAGGCACCTCTCACACTGTTGTCTGTTACTGATTTTTTTTGATAAAAAGATAATAAAACCTGGTACTTTTCTAAA SEQ ID NO:751 Translated polypeptide sequence of exon 1 of SDC4 MAPARLFALLLLFFVGGVAES SEQ ID NO:752 Translated polypeptide sequence of exon 2 of SDC4 IRETEVIDPQDLLEGRYFSGALPDDEDVVGPGQESDDFELSGSGDL SEQ ID NO:753 Translated polypeptide sequence of exon 3 of SDC4 DLEDSMIGPEVVHPL SEQ ID NO:754 Translated polypeptide sequence of exon 4 of SDC4 VPLDNHIPERAGSGSQVPTEPKKLEENEVIPKRISPVEESEDVSNKVSMSSTVQGSNIFERTEVLA SEQ ID NO:755 Translated polypeptide sequence of exon 5 of SDC4 LIVGGIVGILFAVFLILLLMYRMKKKDEGSYDLGKKPIYKKAPTNEFYA The complete protein sequence of SEQ ID NO:756 SDC4 MAPARLFALLLFFVGGVAESIRETEVIDPQDLLEGRYFSGALPDDEDVVGPGQESDDFELSGSGDLDDLEDSMIGPEVVHPLVPLDNHIPERAGSGSQVPTEPKKLEENEVIPKRISPVEESEDVSNKVSMSSTVQGSNIFERTEVLAALIVGGIVGILFAVFLILLLMYRMKKKDEGSYDLGKKPIYKKAPTNEFYA SEQ ID NO:757 The polynucleotide sequence of all exons 1-2 of SDC4 arranged in order SEQ ID NO:758 Translated polypeptide sequence of all exons 1-2 of SDC4 arranged in order CD44 (2) sequence informationSEQ ID NO:759 Sequence from CD44 exon 5 at 5' of CD44-NRG1 fusion TTTCTACTGTACACCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCA SEQ ID NO:760 NRG1 exon 6 sequence at 3' of CD44-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:761 CD44-NRG1 polynucleotide sequence TTTCTACTGTACACCCATCCCAGACGAAGACAGTCCCTGGATCACCGACAGCACAGACAGAATCCCTGCTACCACTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:762 CD44-NRG1 polypeptide sequence STVHPIPDEDSPWITDSTDRIPATTTSTSTTGTSHLVKCAEKEKTFCVN SLC4A4 sequence informationSEQ ID NO:763 Sequence ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAG Sequence ACTACCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAG Sequence from SLC4A4-NRG1 fusion 5' of SLC4A4-NRG1 fusion 5' of SLC4A4 exon 14 SEQ ID NO:764 SLC4A4-NRG1 fusion 3' of NRG1 exon 6 sequence CTACATCTACATCC ACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO: 765 SCL4A4 - NRG1 polynucleotide sequence ACTACCCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCTGTGTGCCACCTGACCCAGCCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO: 766 SCL4A4-NRG1 polypeptide sequence YPINSNFKVGYNTLFS CTCVPPDPATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:767 SLC4A4 exon 1 GGCGGCGCGCCGGGCAGCGCTTCGGTGGCGGCGGCGGCCGCGGTGGCAGCGAAGGCGGCGGCGGCGGCGGCAGTGGCAGTGGCCGCTGCAGCCCCACACTCCGCCGCCAAACTGGAGGAGCGACGGAAGCCAGAC CCCAGGAG SEQ ID NO: 768 SLC4A4 exon 2 GATGGAGGATGAAGCTGTCCTGGACAGAGGGGCTTCCTTCCTCAAGCATGTGTGTGATGAAGAAGAAGTAGAAG SEQ ID NO: 769 SLC4A4 exon Exon 3 GCCACCATACCATTTTACATCGGAGTCCATGTGCCGAAGAGTTACAGGAGAAGGAGACGTCACAAGAGAAAGACAGGGCACAAAGAAAAGAAGGAAAAGGAGAGAATCTCTGAGAACTACTCTGACAAAATCAGATATTGAAAATGCTGATGAATCCAGCAGCAGCATCCTAAAACCTCTCA SEQ ID NO: 770 SLC4A4 exon 4 TCTCTC CTGCTGCAGAACGCATCCGATTCATCTTGGGAGAGGAGGATGACAGCCCAGCTCCCCTCAGCTCTTCACGGAACTGGATGAGCTGCTGGCCGTGGATGGGCAGGAGATGGAGTGGAAGGAAACAGCCAG SEQ ID NO: 771 SLC4A4 exon 5 GTGGATCAAGTTTGAAGAAAAAGTGGAACAGGGTGGGGAAAGAT GGAGCAAGCCCCATGTGGCCACATTGTCCCTTCATAGTTTATTTGAGCTGAGGACATGTATGGAGAAAGGATCCATCATGCTTGATCGGGAGGCTTCTTCTCTCCCACAGTTGGTGG SEQ ID NO: 772 SLC4A4 exon 6 AGATGATTGTTGACCATCAGATTGAGACAGGCCTATTGAAACCTGAACTTAAGGATAAGGTGACCTATA CTTTGCTCCGGAAGCACCGGCATCAAACCAAGAAATCCAACCTTCGGTCCCTGGCTGACATTGGGAAGACAGTCTCCAGTGCAAGTAGGATGTTTACCAACCCTGATAATG SEQ ID NO: 773 Exon of SLC4A4 7 GTAGCCCAGCCATGACCCATAGGAATCTGACTTCCTCCAGTCTGAATGACATTTCTGATAAACCGGAGAAGGACCAG SEQ ID NO: 774 SLC4A4 exon 8 CTGAAGAATAAGTTCATGAAAAAATTGCCACGTGATGCAGAAGCTTCCAACGTGCTTGTTGGGGAGGTTGACTTTTTGGATACTCCTTTCATTGCCTTTGTTAG GCTACAGCAGGCTGTCATGCTGGGTGCCCTGACTGAAGTTCCTGTGCCCACAAG SEQ ID NO: 775 SLC4A4 exon 9 GTTCTTGTTCATTCTCTTAGGTCCTAAGGGGAAAGCCAAGTCCTACCACGAGATTGGCAGAGCCATTGCCACCCTGATGTCTGATGAG SEQ ID NO: 776 SLC4A4 exon 10 GTGTTC CATGACATTGCTTATAAAGCAAAAGACAGGCACGACCTGATTGCTGGTATTGATGAGTTCCTAGATGAAGTCATCGTCCTTCCACCTGGGGAATGGGATCCAGCAATTAGGATAGAGCCTCCTAAGAGTCTTCCATCCTCTGACAAAAG SEQ ID NO: 777 SLC4A4 exon 11 AAAGAATATGTACTCAGGTGGAGAGAATGTTCAGATGAATG GGGATACGCCCCATGATGGAGGTCACGGAGGAGGAGGACATGGGGATTGTGAAGAATTGCAGCGAACTGGACG SEQ ID NO: 778 SLC4A4 exon 12 GTTCTGTGGTGGACTAATTAAAGACATAAAGAGGAAAGCGCCATTTTTTGCCAGTGATTTTTATGATGCTTTAAATATTCAAGCTCTTTCGGCAATTCCTTCATTTATCTGGCAACTGTAACTAATGCTATCACTTTTGGAGGACTGCTTGGGGATGCCACTGACAACATGCAG SEQ ID NO: 779 SLC4A4 exon 13 GGCGTGTTGGAGAGTTTCCTGGGCACTGCTGTCTCTGGAGCCATCTTTTGCCTTTTTGCTGGTCAACCACTCACTATTCTGAGCAGCACCGGACCTGTCCTAGTTTTGAGAGGCTTCTATTTAATTTCAGCAA SEQ ID NO: 780 SLC4A4 exon 14 GGACAATAATTTTGACTATTTGGAGTT TCGCCTTTGGATTGGCCTGTGGTCCGCCTTCCTATGTCTCATTTTTGGTAGCCACTGATGCCAGCTTCTTGGTTCAATACTTCACACGTTTCACGGAGGAGGGCTTTTCCTCTCTGATTAGCTTCATCTTTATCTATGATGCTTTCAAGAAGATGATCAAGCTTGCAGATTACTACCCCCATCAACTCCAACTTCAAAGTGGGCTACAACACTCTCTTTTCCTGTACCT GTGTGCCACCTGACCCAG SEQ ID NO:781 SLC4A4 Exon 15 CTAATATCTCAATATCTAATGACACCACACTGGCCCCAGAGTATTTGCCAACTATGTCTTCTACTGACATG SEQ ID NO:782 SLC4A4 exon 16 TACCATAATACTACCTTTGACTGGGCATTTTTGTCGAAGAAGGAGTGTTCAAAATACGGAGGAAACCTCGTCGGGAACAACTGTAATTTTGTTCCTGATATCACACTCATGTCTTTTTATCCTCTTCTTGGGAACCTACACCTCTCCATGGCTCTGAAAAAATTCAAAACTAGTCCTTATTTCCAACCACA SEQ ID NO: 783 SLC4A4 exon 17 GCAAGAAAACTGATCAGTGATTTTGCCATTATCTTGTCCATTCTCATCTTTTGTGTAATAGATGCCCTAGTAGGCGTGGACACCCCAAAAACTAATTGTGCCAAGTGAGTTCAAG SEQ ID NO: 784 SLC4A4 exon 18 CCAACAAGTCCAAACCGAGGTTGGTTCGTTCCACCGTTTG GAGAAAACCCCTGGTGGGTGTGCCTTGCTGCTGCTATCCCGGCTTTGTTGGTCACTATACTGATTTTCATGGACCAACAAATTACAGCTGTGATTGTAAACAGGAAAGAACATAAACTCAAG SEQ ID NO: 785 SLC4A4 exon 19 AAAGGAGCAGGGTATCACTTGGATCTCTTTTGGGTGGCCATCCTCATGGTTATATGCTC CCTCATGGCTCTTCCGTGGTATGTAGCTGCTACGGTCATTCCATTGCTCACATCGACAGTTTGAAGATGGAGACAGAGACTTCTGCACCTGGAGAACAACCAAAGTTTCTAGGAGTGAG SEQ ID NO: 786 SLC4A4 exon 20 GGAACAAAAGAGTCACTGGAACCCTTGTGTTTATTCTGACTGGTCTGTCAGTCTTTATGGCTCCCATCTTGAAG SEQ ID NO: 787 SLC4A4 exon 21 GTTTATACCCATGCCTGTACTCTATGGTGTGTTCCTGTATATGGGAGTAGCATCCCTTAATGGTGTGCAG SEQ ID NO: 78 8 SLC4A4 exon 22 TTCATGGATCGTCTGAAGCTGCTTCTGATGCCTCTGAAGCATCAGCTGACTTCATCTACCTGCGTCATGTTCCTCTGCGCAGAGTCCACCTGTTCACTTTCCTGCAGGTGTTGTGTCTGGCCCTGCTTTGGATCCTCAAGTCAACGGTGGCTGCTATCATTTTTCCAGTAATG SEQ ID NO: 789 SLC4A4 exon Sub 23 ATCTTGGCACTTGTAGCTGTCAGAAAAGGCATGGACTACCTCTTCTCCCAGCACGACCTCAGCTTCCTGGATGATGTCATTCCAGAAAAGGACAAGAAAAAGAAGGAGGATGAGAAGAAAAAGAAAAAGAAGAAGGGAAGTCTGGACAGTGACAATGATGAT SEQ ID NO: 790 SLC4A4 In addition, Xianzi 24 TCTGACTGCCCCATACAGAAAAGTTTCCAAGTAAAAAAAAATTCCAACAACACACACACACTTTTTCTTCTTCGAACTCACTCACTGACTGACACTGACACTGACACTGACACAACAACTTCACAACAACAACAACAACAACAACAACAACAACAACACAACACACACACACATAT AAGAGACACAACACAACATTGACCACCACACACACACATCTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAOAAAAAAOOAAAAAAAAAAAAAOEation CaaAATTATAGAAAAACTGAACTGAACAATGAATGAATTTTTTTGGGGGGGGGGGAACAACAACTACTGTGTGTTTTTTTTTTTTGACACATGATTTCATTTCATTTTTTTT CTGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCGTCTTTTTTTTTTTCAGCCAAATACAGCGCTCTCTCTCTTCTTGCAATCAATCACACCAAAACAGC AtcttgagaatcccccccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCCTTTTTTTTTTTTTTTTTCAGTGTGTTGAACCAAAACTCAGACACACACACACACACACCCCCTTCTTCTTAGAGAGGAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTCACACACACATCTCAACCAAC GATTTTTTGTGTGTGTTTTTTTTTTTTTTTTGAAGGAAGGAACTTTAACTTTTTTTTTTTTGGTGTGTGGGGGGTTTTGTGGGGGGAACCCCCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCTCCTCCCTCCCTCCCTCT AtagagCTCTCTCCATCTGGGAAAAAGCTGCTGCTTTTTTTTTTGCTTTGTTGTAAATCTTTCAAAGAAAAAACTGTGTGTCCTCTCTCTCTCCCAACCAACCAACCAACCAACCAACCCCCCATC AttgtagcaagTGGGGAATGTCCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTTGTGTGAAGAATTTTTTTTTTTTTTTTTTTTTTTTTTTT CCATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTGTGTGAAAAAATGAATTTTTTGCAGCAGAACAACAACATTTTTTCTTTTTGCAGGCTTTTGTAAAAAAAAA AATTGTGTGTGTATATAATTGATTTTTTTTTTTTTTTACACACACACACAAATACAATCTAGGTAGCCAGCAAGCAGCAGAGAGAGAAAAAAACCCAAGCAgCAgCAgCAgCAgCAgCAgCAGCAGCAGCAGCAGCAGCAGCCAGCCAGCCAGCAGCCAGCAGCAGCAGCAGCAGCAGCAGCAGCACCACCACCCCAAGCCACCCACCCCACCCACCCACCCACCCCACAC TTACTGTGTTCTTCTAGGAGAGAGAGGGGGAGAGAGAGAGACAACTCTCTCATGAATGATGATGTGTGATGCAGTAGTAGTGTGTGTCAATCAATCAATCATCAATCATCAAT AgACACATTTTTTCATTTGAAAAATTTTGTGTGTTGTGTGTGTGTGCCATGGGGGGTGTGTGTGTGCAAGCAAAGGCAGGCAAGGCAAGGAAGAAGTCTTCTTATTCTTCTAT TTTCAAAAAAAAAACCCAACCACACATGAAAAAAAAAAAAAATCAAGGGTCCAAGTCAAGTGTGTCAAAAAAAAAAATGTAAAAAAATATTTTTCCCCCAGTCAACTAACTGCAAAAAAATC AtaggagtatttaatgccccccattTACACAAAAAAATGGGGGGGGGCAGCAGCGCAGCAGTGTGTGTCTCTCTCAATTTTTTTTTTTTTTTTTTTTTTTTTTCAGCAAAAGCAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAT AATATTTGTGTGTGTGTGTGTGTACATTATGTTCACACACACACACAAAAAACACACCAACCAAACCAACGTTGTTTTTTTTTTTTTTTTTTAATACATGAAAACACAAACACAACACAAACATT TTAAAAAATGTTTAAAAAATAATTTTCAGATTCATACACACAACAACAACTGTGAACTACTACCAGTTGGGGGGGGTAGAGAAGAACCACACACACACACATTTTGCACTCCCCCAC TCATTGTTTTTTTTAATAAGTTTTTTTTTTCTCTCTTGTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATTTGAGTTTTTTTACTCTCTCTTTTTTTTTTAAAATCAATAATAAT GACCCCCAATTTTTTTTTTTTTTTTTTTTTTAAAATAATGTGTGAATTATTTGTTGTTGTCTCTCCAAAAAAAAAATTGGGGGGGGTTTTCTTCTTCTAGTAGTTCTTTCTTTCTTACCCCC CTAATATAACTGGTGTGTGTGTGTGAAGAGACTTTTTTTTTTTTTTTTTTTTTTTTTTGGGAAGCAATTTTTTTTTTTTTTTTTAACCAAAAAAAAACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCTT AAGCTCTAGAAAAAAAAAAAAAAAAAAAACTTTTTTTTTGATGAATCAAGTCTGTGTCATGCCTCTCTTCTTCTCTCTCTGAACTCAGAGAGAGTGAGTAGTAGTAGTAGTAGTAGTAGAGTAGAGAGAGTCAACTCTCTCTCTCTCTCTCTCCCCCCCCTCTCTCAAGAAT AtGTGTGTGTCTCTGTTTTTTTTTCCCATTGACACACACACACATACATACTACTTGCTACAGACAGTAAAAAAATCCAACCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCAACAAC TTCAAAGAACAGAGTAGTTTACAACTAACAACAAAAAAAAAAAACTGACTGACTGACCAAAAAAAAAATCGGGTTGAATGACCCCCCAACCAGACCAGAGAGAGAGACAGAGAGACACCAACCCCCAACCACCACCACCTGACTGGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACTGACAT GTTTTCTATGCAACAACACACAAAAACAACAACAACAGTTTTTTTTTTTTTTTCTCTCTCCCATGTCCCCCGGGAAAGAAGAAGCAGTCTCTCTCGTCTCGTCGTGTGTTGTCTCTCTCTCTCCCCTCTCCTCCTCCATGATGATGAT GaaAgTCTCTGGTGTGTGTGTGCAGACACACTGTGTGGAAGAGAGAGAGAGAAGTCAAGTGTGTGCTAAAGCAAAAAAAAAAAAAAAATGCTGGGGGGGGGGGGCACA CtgtttttttttttgataaaAgtcartTCTCTCTCTGTGAATGAATGAATTGAATTTTGCCAGGGGGGGGGGGGGGTCCTCCCCAATCAAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG AgaggACTTACTACTTTTAAGAAGTAGAAAAAAAAAAAAAAAAAACCTCTCTCCCCCCCCCCCCCCAGAGGAAAAAAACAGTTGTGTGTGTTGTTTTTTTTTTTCTCTCTCTCTCTCTC CaccaaccatacAggtaAAAAAATTTTTTTTTTTTTGTGTGTGTTTTTTTAATGAATGATGTCCGAGAGAGAGAGGGGGGTTTTTTGTTTGAATGTGACTGACTGACTGACTGACTGATGACTGACTGATGATGACTGATGATGATGATGATGAATTGAATTTT TTAAAAAAAAAAAAAATGATTTTGAGAGTTGTTACAGGTGACACAGGGGGGGGGGGGGGGGGGGCAAGCCAATCATTTCATTTTTTTGCACTGACTGACTGATGATTGATTTGATTGATTGATTTTTTTT CatgatttttcACATTTTTTCACAGCAGCAgCCCCCCCCCCAAAAAAATGGGGGGGGGGGGGGGTGTGTTGTCTCTCTCTGCCCCCAGCCAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTTC is AttgCAGCTGAAAAAAAAAAAAAGTGTGTGTGTGTGTGCCAACCAGTCAGTCAGTGTTTTTTTGCCAGAGAGTCCAATCCTCTCTCTCTACTAAAGCAGAAGAAGAATTT CTGTTTTTTTTTCTCTAGCAACAACAACTCAACTCGGGGGGGTCAAGTCCCAACCAACCCCCCCCCCCCCAACCCCCCCCCCTAAAAGAAGAAGAAGAAGAATGCTTTTTTTTTTTTTGCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC AtgctggtacatatATCGTACGCATTTTTTTTTTTTTTTCAATCATACGCATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT AtggtggaacaaaaaaaaaAgaacagaAAAAAAACATTTCAATTAATAATAATTTTTTTTTTTTTCCTCTCTCTACTACTACATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT AtgaagAgAgATGAGAGAGAGCATTTTTTTTTTTTTTTTTTTAAAAAAAAACTTTTTTTTTTT NO: 793 SLC4A4 The whole mRNA polynucleotide sequence SEQ ID NO: 794 SLC4A4 of SLC4A4 Translated polypeptide sequence of exon 2 MEDEAVLDRGASFLKHVCDEEEVE SEQ ID NO:795 Translated polypeptide sequence of exon 3 of SLC4A4 HHTIYIGVHVPKSYRRRRHKRKRKTGHKEKKEKERISENYSDKSDIENADESSSSILKPL SEQ ID NO:796 Translated polypeptide sequence of exon 4 of SLC4A4 SPAAERIRFILG EEDDSPAPPQLFTELDELLAVDGQEMEWKETA SEQ ID NO:797 SLC4A4 External display Translated polypeptide sequence of exon 5 WIKFEEKVEQGGERWSKPHVATLSLHSLFELRTCMEKGSIMLDREASSLPQLV SEQ ID NO: 798 SLC4A4 Translated polypeptide sequence of exon 6 MIVDHQIETGLLKPELKDKVTYTLLRKHRHQTKKSNLRSLADIGKTVSSASRMFTNPDN SEQ ID NO: 799 SLC4A4 Translated polypeptide sequence of exon 7 SPAMTHRNLTSSSLNDISDKPEKDQ SEQ ID NO: 800 SLC4A4 exon 8 Translated polypeptide sequence LKNKFMKKLPRDAEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVPT SEQ ID NO: 801 Translated polypeptide sequence of exon 9 of SLC4A4 FLFILLGPKGKAKSYHEIGRAIATLMSDE SEQ ID NO: 802 Translated polypeptide sequence of exon 10 of SLC4A4 VFHDIAYKAKDRHDLIAGIDEFLDEVIVLPP GEWDPAIRIEPPKSLPSSDK SEQ ID NO:803 Translation of exon 11 of SLC4A4 Peptide sequence KNMYSGGENVQMNGDTPHDGGHGGGGHGDCEELQRTG SEQ ID NO: 804 Translated polypeptide sequence of exon 12 of SLC4A4 FCGGLIKDIKRKAPFFASDFYDALNIQALSAILFIYLATVTNAITFGGLLGDATDNMQ SEQ ID NO: 805 Translated polypeptide sequence of exon 13 of SLC4A4 G VLESFLGTAVSGAIFCLFAGQPLTILSSTGPVLVFERLLFNFS SEQ ID NO: 806 Translated polypeptide sequence of exon 14 of SLC4A4 DNNFDYLEFRLWIGLWSAFLCLILVATDASFLVQYFTRFTEEGFSSLISFIFIYDAFKKMIKLADYYPINSNFKVGYNTLFSCTCVPPDP SEQ ID NO: 807 Translated polypeptide sequence of exon 15 of SLC4A4 NISISNDTTLAPEYLPTMSSTDM SEQ ID NO: 808 Translated polypeptide sequence of exon 16 of SLC4A4 YHNT TFDWAFLSKKECSKYGGNLVGNNCNFVPDITLMSFILFLGTYTSSMALKKFKTSPYFPTT SEQ ID NO:809 Translated polypeptide sequence of exon 17 of SLC4A4 ARKLISDFAIILSILIFCVIDALVGVDTPKLIVPSEFK SEQ ID NO: 810 Translated polypeptide sequence of exon 18 of SLC4A4 PTSPNRGWFVPPFGENPWWVCLAAAIPALLVTILIFMDQQITAVIVNRKEHKLK SEQ ID NO: 811 Translated polypeptide sequence of exon 19 of SLC4A4 KGAGYHLLDLFWVAILMVICSLMALPWYVAATVISIAHIDSLKMETETSAPAGEQPKFLGV SEQ ID NO:812 Translated polypeptide sequence of exon 20 of SLC4A4 EQRVTGTLVFILTGLSVFMAPILK SEQ ID NO :813 The translated polypeptide sequence of exon 21 of SLC4A4 FIPMPVLYGVFLYMGVASLNGVQ SEQ ID NO:814 The translated polypeptide sequence of exon 22 of SLC4A4 FMDRLKLLLMPLKHQPDFIYLRHVPLRRVHLFTFLQVLCLALLWILKSTVAAIFPVM SEQ ID NO:815 The translated polypeptide sequence of exon 23 of SLC4A4 Translated polypeptide sequence ILALVAVRKGMDYLFSQHDLSFLDDVIPEKDKKKKEDEKKKKKKKGSLDSDNDD SEQ ID NO: 816 The translated polypeptide sequence of exon 24 of SLC4A4 SDCPYSEKVPSIKIPMDIMEQQPFLSDSKPSD SEQ ID NO:817 The translated polypeptide sequence of exon 25 of SLC4A4 ERSPTFLERHTSC SEQ ID NO:819 The complete protein sequence of SLC4A4 SEQ ID NO:820 The entire exon 1- of SLC4A4 14 Sequentially arranged polynucleotide sequence SEQ ID NO: 821 SLC4A4 all exons 1-14 Sequentially arranged translated polypeptide sequenceSDC4 (2) sequence informationSEQ ID NO:822 Sequence from SDC4 exon 4 at 5' of SDC4-NRG1 fusion ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAG SEQ ID NO:823 NRG1 exon 2 sequence at 3' of SDC4-NRG1 fusion CCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:824 SDC4-NRG1 polynucleotide sequence ATGTGTCCAACAAGGTGTCAATGTCCAGCACTGTGCAGGGCAGCAACATCTTTGAGAGAACGGAGGTCCTGGCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO:825 SDC4-NRG1 polypeptide sequence VSNKVSMSSTVQGSNIFERTEVLAALPPRLKEMKSQESAAAGSKLVLRCE SEQ ID NO:940 The polynucleotide sequence of all exons 1-4 of SDC4 arranged in order SEQ ID NO:941 The translated polypeptide sequence of all exons 1-4 of SDC4 arranged in order ZFAT sequence informationSEQ ID NO:826 Sequence from ZFAT exon 12 at 5' of ZFAT-NRG1 fusion ACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAG SEQ ID NO:827 NRG1 exon 6 sequence at 3' of ZFAT-NRG1 fusion CTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:828 ZFAT-NRG1 polynucleotide sequence ACAGGAAGCACCCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAGCTACATCTACATCCACCACTGGGACAAGCCATCTTGTAAAATGTGCGGAGAAGGAGAAAACTTTCTGTGTGAATG SEQ ID NO:829 ZFAT-NRG1 polypeptide sequence RKHPNEEYANVGTGELAAEVLIQQATSTSTTGTSHLVKCAEKEKTFCVN SEQ ID NO:830 ZFAT exon 1 ATCCGCCATGTTGGATGCCGCAGATTCGCCATAACCTCGCCGGCTCTTTTCTTAAAAAAATAAAAATAAAAAGCGAAGCGTCAGCAGGGCGCCCCGCCCCCTCGGTCGGCACGGGAGGGGGCCCGGAAGAGCCCGAGGCTTTTTTTTCCTCCGCGGTGGGGCGTTGCCATGGAGACGCGGGCGGCAG SEQ ID NO:831 ZFAT exon 2 AAAACACGGCCATCTTTATGTGTAAATGTTGTAACCTCTTCTCACCAAATCAGTCGGAACTCCTCTCCCACGTTTCAGAGAAGCACATGGAAGAAGGGGTTAATGTTGATGAGATTATTATTCCCCTTAGGCCTCTGAGTACACCTGAACCCCCCAACTCAAGCAAAAACCGGAGATG SEQ ID NO:832 ZFAT exon 3 AGTTTTTGGTCATGAAGAGGAAGAGAGGCAGGCCTAAGGGGTCCACGAAGAAGTCCAGCACAGAAGAGGAGCTGGCAGAAAACATCGTGAGTCCGACTGAGGACAGCCCGCTGGCTCCGGAGGAAGGGAACAGCCTGCCTCCAAGCAGCTTGGAGTGTAGCAAGTGCTGTCGGAAGTTCTCCAACACGCGCCAGCTGCGGAAGCACATCTGCATTATC GTGCTGAATTTGGGTGAGGAGGAAGGAGAAGCAG SEQ ID NO:833 ZFAT exon 4 GTAACGAGTCTGACCTTGAACTAGAAAAGAAGTGTAAGGAAGATGATCGGGAAAAAGCCTCGAAAAGACCACGGTCACAGAAAACAGAGAAAGTCCAGAAGATCTCAGGAAAGGAGGCCAGACAGCTTTCTGGGGCGAAGAAACCCATCATAAGTGTGGTTTTTAACTGCACACGAAGCAATTCCAG SEQ ID NO:834 ZFAT exon 5 GTGCTACCAAGATTGTGCCAGTGGAGGCTGGGCCCCCTGAAACAGGAGCTACAAATTCTGAGACCACTTCAAGCAGACCTGGTGCCTCGGAGAGGCTACCAGGAATACGCCATTCAGCAGACACCTTATGAGCAACCAATGAAGTCAAGCAG SEQ ID NO:835 ZFAT exon 6 GCTAGGTCCCACTCAGCTCAAAAATCTTCACTTGTGAATACTGCAACAAGGTCTTCAAGTTCAAGCACTCGCTGCAGGCCCACCTGAGGATCCACACCAATGAAAAGCCATACAAGTGCCCCAGTGCAGCTATGCCAGTGCCATCAAGGCCAACCTCAATGTGCACCTGCGCAAGCACACTGGAGAGAAGTTCGCCTGCGACTATTGCTCGTTCACCTGCCTGAGCAAGG GCCACCTCAAGGTGCACATCGAGCGAGTGCACAAGAAGATCAAGCAGCACTGCCGCTTCTGCAAGAAGAAGTACTCTGACGTCAAGAACCTCATCAAGCACATCCGAGACGCGCATGACCCACAGGACAAGAAGGTCAAAGGGCCTTGGACGAGCTCTGCCTGATGACGAGGGAGGGCAAGCGGCAGCTGCTCTATGACTGCCACATCTGTGAGCGCAAGT TCAAGAACGAGCTGGACCGTGACCGCCATATGCTGGTCCACGGAGACAAGTGGCCTTTTGCCTGTGAGCTCTGTGGCCATGGGGCCACCAAGTACCAGGCGCTGGAACTGCATGTCAGGAAGCACCCCCTTCGTGTACGTCTGTGCCGTCTGCCGCAAGAAGTTCGTCAGTCCATCAGGCTGCGCACCCACATCAAAGAGGTGCACGGGGCTGCCC AGGAGGCCTTGGTCTTCACCAGTTCCATCAACCAGAGCTTCTGCCTCCTGGAACCTGGTGGGGACATCCAGCAAGAAGCTCTGGGGGACCAGCTACAGCTGGTGGAAGAGGAGTTTGCCCTCCAGGGCGTGAATGCACTCAAGGAAGAGGCCTGTCCTGGGGACACTCAGCTGGAGGAGGGCCGGAAGGAGCCGGAGGCCCCTGGGGAA ATGCCTGCCCCAGCTGTGCACCTGGCCTCCCCGCAGGCCGAAAGCACAGCCCTGCCACCCTGTGAGCTGGAAACCACCGTGGTCTCTCTCTCCAGACCTGCATTCTCCAAGAGGTGGTTTCAGATGATTTTTTGTTGAAAAATGATACCTCCTCCGCAGAGGCTCATGCTGCTCCTGAGAAGCCCCCAGACATGCAGCACAGAAGCTCAGTCCAGACGCAAGGTGAAGTGA TCACACTACTGCTGTCCAAGGCCCAGAGTGCTGGGTCAGATCAGGAAAGCCATGGCGCCCAGAGCCCCCTAGGGGAAGGGCAGAACATGGCTGTGCTTTCAGCTGGTGACCCAGATCCCAGCAGGTGTCTCAGGTCAAACCCAGCTGAGGCCTCAGACCTCCTCCCTCCAGTAGCTGGTGGTGGGGACCCATCACACATCAGCCTGACTCTTGCAAAG CTGCCCCTGAGCACCGGTCAGGCATCACCGCTTTCATGAAGGTCCTGAACAGTTTACAGAAGAAGCAAATGAACACCAGCTTGTGTGAGCGGATCCGGAAGGTTTATGGAGACCTGGAGTGTGAATACTGTG SEQ ID NO:836 ZFAT exon 7 GCAAACTTTTTGGTACCAAGTGCATTTTGATATGCATGTCCGCACCCACACCCGGGAACATCTGTATTGCTCTCAGTGTCATTATTCTTCCATCACCAAAACTGCCTTAAACGCCACGTAATTCAGAAACACAGTAACATCTTGCTGAAGTGTCCCACCGATGGCTGTGACTACTCAACTCCGATAAATATAAGCTACAGGCACATCTTAAAGTTCAC ACAGCACTG SEQ ID NO:837 ZFAT exon 8 GACAAAAGGAGTTATTCTTGTCCTGTTTGTGAAAAGTCTTTTTCAGAGGATCGATTGATAAAGTCACATATCAAGACCAACCATCCTG SEQ ID NO:838 ZFAT exon 9 AGGTCTCCATGAGCACCATTTCTGAGGTTCTCGGGAGGAGGGTTCAGCTGAAAGGGCTAATTGGAAAGAGAGCCATGAAATGCCCATATTGTGACTTTTATTTCATGAAGAATGGCTCAGACCTTCAGCGTCATATTTGGGCTCATGAAG SEQ ID NO:839 ZFAT exon 10 GTGTGAAGCCCTTCAAGTGTTCTTTGTGTGAGTATGCAACTCGTAGCAAGAGTAACCTCAAGGCTCATATGAATCGTCACAGCACTGAGAAAACCCACCTATGTGACATGTGTGGCAAGAAATTCAAATCAAAAGGGACACTGAAAAGTCACAAACTCCTTCACACTGCAGATG SEQ ID NO:840 ZFAT exon 11 GGAAGCAGTTTAAGTGCACGGTGTGTGACTACACACAGCGGCCCAGAAGCCACAGCTGCTGCGGCACATGGAACAGCATGTCTCCTTCAAG SEQ ID NO:841 ZFAT exon 12 CCTTTCCGCTGTGCCCATTGCCATTACTCCTGCAACATATCTGGCTCTCTGAAGCGGCACTACAACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGGCAGCGGAGGTGCTCATCCAGCAAG SEQ ID NO:842 ZFAT exon 13 GTGGTTTGAAGTGTCCTGTTTGCAGCTTTGTATATGGCACCAAATGGGAGTTCAATAGGCACTTGAAGAACAAACATGGCTTGAAGGTGGTGGAAATTGATGGAGACCCCAAGTGGGAG SEQ ID NO:843 ZFAT exon 14 ACAGCAACAGAAGCTCCTGAGGAGCCCTCCACCCAGTATCCTCCACATCACAGAGGCCGAAGAAGACGTTCAAGGGACACAGGCAGCGGTGGCCGCGCTCCAGGACCTGAGATACACCCTCTGAGAGTG SEQ ID NO:844 ZFAT exon 15 GCGACCGACTGGACCCCACGGCCGTGAACATCCTGCAGCAGATCATTGAGCTGGGCGCCGAGACCCATGACGCCACTGCCCTTGCCTCGGTGGTTGCCATGGCACCAGGGACGGTGACTGTGGTTAAGCAG SEQ ID NO:845 ZFAT exon 16 GTCACCGAGGAGGAGCCCAGCTCCAACCACACGGTCATGATCCAGGAGACGGTCCAGCAAGCGTCCGTGGAGCTTGCCGAGCAGCACCACCTGGTGGTGTCCTCCGACGACGTGGAGGGCATTGAGACGGTGACTGTCTACACGCAGGGCGGGGAGGCCTCGGAGTTCATCGTCTACGTGCAGGAGGCCATGCAGCCTGTGGAGGAGC AGGCTGTGGAGCAGCCGGCCCAGGAACTCTAGAGGACATGTGGCATCGGATGGCCACAGGGCGGGGCTGCCAGGCTCTGCAGGCACCCAGGGTGGGGAGGCCACCCTTCCTGCCCTACCCGCAGAATGGTGCTCTCCTTTGCCCTCCCTGCCCAGCAGCCTGATAGGACTCTCCTAGTCCAACTTGGGGTGGGCAAGGCAGTCAGCATCAC CAGCAATACCACAGGACCCTCACCCCAGCATAGACACACACCCCCTGACCCTTACCATCTGCTTCCTGAAAGACTTCAGTGTCAGTCCCCTACACACACACCCACCCTTCACCCCTTGCTTCAAGATTCAAACAGAGACTCCAGTCCCCCTCAGCATCTTCCCTGAATCACAGCCCCAGCTCCTTGACCCCCATCTAGGTGCCAAATGTTCATCTGCAACCGCTATGCAGTCTGGT GAGAGGGAGACAGCCATCACATAGAAAGTGACCGTACGGGTTTTTAATCACTGCTGGGTGGGGTGGGGGTAGGGGGATTGTCCTGGCTTTGTCGACAAAGTCCCACTTCCCCGAGTATTAAGGGCCCTTGGTATCAAGTGAGGTAAATTCACCATCAGGGTCTCGCCCTACCATCCTGGAATTATTTCACTTTTAAGATAAATGCACT ATTTCACTGTTCGCCTCCCATTCTAAGGAGGTGAGGTGGTTGGAATAAAAAACAGTTCCTGTCTGAA SEQ ID NO:846 The complete mRNA polynucleotide sequence of ZFAT ATCCGCCATGTTGGATGCCGCAGATTCGCCATAACCTCGCCGGCTCTTTTCTTAAAAAAATAAAAATAAAAAGCGAAGCGTCAGCAGGGCGCCCCGCCCCCTCGGTCGGCACGGGAGGGGGCCCGGAAGAGCCCGAGGCTTTTTTTTCCTCCGCGGTGGGGCGTTGCCATGGAGACGCGGGCGGCAGAAAACACGGCCATCTTT ATGTGTAAAATGTTGTAACCTCTTCTCACCAAATCAGTCGGAACTCCTCTCCCACGTTTCCAGAGAAGCACATGGAAGAAGGGGTTAATGTTGATGAGATTATTATTCCCCTTAGGCCTCTGAGTATCACCTGAACCCCCCAACTCAAGCAAAAACCGGAGATGAGTTTTTGGTCATGAAGAGGAAGAGAGGCAGGCCTAAGGGGTCCACGAAGAAGTCCAGCAGCAGAAGAG GAGCTGGCAGAAAACATCGTGAGTCCGACTGAGGACAGCCCGCTGGCTCCGGAGGAAGGGAACAGCCTGCCTCCAAGCAGCTTGGAGTGTAGCAAGTGCTGTCGGAAGTTCTCCAACACGCGCCAGCTGCGGAAGCACATCTGCATTATCGTGCTGAATTTGGGTGAGGAGGAAGGAAGCAGGTAACGAGTCTGACCTTGAACTAGAAAAGAGTG TAAGGAAGATGATCGGGAAAAAAGCCTCGAAAGACCACGGTCACAGAAAACAGAGAAAGTCCAGAAGATCTCAGGAAAGGAGGCCAGACAGCTTTCTGGGGCGAAGAAACCCATCATAAGTGTGGTTTTTAACTGCACACGAAGCAATTCCAGGTGCTACCAAGATTGTGCCAGTGGAGGCTGGGCCCCCTGAAACAGGAGCTACAAATTCTGAGACCACTTCA GCAGACCTGGTGCCTCGGAGAGGCTACCAGGAATACGCCATTCAGCAGACACCTTATGAGCAACCAATGAAGTCAAGCAGGCTAGGTCCCACTCAGCTCAAAATCTTCACTTGTGAATACTGCAACAAGGTCTTCAAGTTCAAGCACTCGCTGCAGGCCCACCTGAGGATCCACACCAATGAAAAGCCATACAAGTGCCCCCAGTGCAGCTATGCCAGTGCCATCAAGGCC ACCTCAATGTGCACCTGCGCCAAGCACACTGGAGAGAAGTTCGCCTGCGACTATTGCTCGTTCACCTGCCTGAGCAAGGGCCACCTCAAGGTGCACATCGAGCGAGTGCACAAGAAGATCAAGCAGCACTGCCGCTTCTGCAAGAAGAAGTACCTCATCAAGCACATCCGAGACGCGCATGACCCACAGGACAAGAAGGTCAAAGAGGCCTTG GACGAGCTCTGCCTGATGACGAGGGAGGGCAAGCGGCAGCTGCTCTATGACTGCCACATCTGTGAGCGCAAGTTCAAGAACGAGCTGGACCGTGACCGCCATATGCTGGTCCACGGAGACAAGTGGCCTTTTGCCTGTGAGCTCTGTGGCCATGGGGCCACCAAGTACCAGGCGCTGGAACTGCATGTCAGGAAGCCACCCCTTCGTGTACGTCT GTGCCGTCTGCCGCAAGAAGTTCGTCAGTCCATCAGGCTGCGCACCCACATCAAAGAGGTGCACGGGGCTGCCCAGGAGGCCTTGGTCTTCACCAGTTCCATCAACCAGAGCTTCTGCCTCCTGGAACCTGGTGGGGACATCCAGCAAGAAGCTCTGGGGGACCAGCTACAGCTGGTGGAAGAGGAGTTTGCCCTCCAGGGCGTGAATGCACTCAAG GAAGAGGCCTGTCCTGGGGACACTCAGCTGGAGGAGGGCCGGAAGGAGCCGGAGGCCCCTGGGGAAATGCCTGCCCCAGCTGTGCACCTGGCCTCCCCGCAGGCCGAAAGCACAGCCCTGCCACCCTGTGAGCTGGAAACCCACCGTGGTCTCTCCTCAGACCTGCATTCTCCAAGAGGTGGTTTCAGATGATTTTTTGTTGAAAAATGATACCTC CTCCGCAGAGGCTCATGCTGCTCCTGAGAAGCCCCCAGACATGCAGCACAGAAGCTCAGTCCAGACGCAAGGTGAAGTGATCACACTACTGCTGTCCAAGGCCCAGAGTGCTGGGTCCAGATCAGGAAAGCCATGGCGCCCAGAGCCCCCTAGGGGAAGGGCAGAACATGGCTGTGCTTTCAGCTGGTGACCCAGATCCCAGCAGGTGTCTCAGGTCAAACCCAG CTGAGGCCTCAGACCTCCTCCCTCCAGTAGCTGGTGGTGGGGAACACCATCACACATCAGCCTGACTCTTGCAAAGCTGCCCCTGAGCACCGGTCAGGCATCACCGCTTTCATGAAGGTCCTGAACAGTTTACAGAAGAAGCAAATGAACACCAGCTTGTGTGAGCGGATCCGGAAGGTTTATGGAGACCTGGAGTGTGAATACTGTGGCAAACTTTTTTGGTAC CAAGTGCATTTTGATATGCATGTCCGCACCCACACCCGGGAACATCTGTATTATTGCTCTCAGTGTCATTATTCTTCCATCACCAAAACTGCCTTAAACGCCACGTAATTCAGAAACACAGTAACATCTTGCTGAAGTGTCCCACCGATGGCTGTGACTACTCAACTCCGATAAATATAAGCTACAGGCACATCTTAAAGTTCACACAGCACTGGACAAAAGGAGT TATTCTTGTCCTGTTTGTGAAAAGTCTTTTTCAGAGGATCGATTGATAAAGTCACATATCAAGACCAACCATCCTGAGGTCTCCATGAGCACCATTTCTGAGGTTCTCGGGAGGAGGGTTCAGCTGAAAGGGCTAATTGGAAAGAGAGCCATGAAATGCCCATATTGTGACTTTTATTTCATGAAGAATGGCTCAGACCTTCAGCGTCATATTTGGGCTCAT GAAGGTGTGAAGCCCTTCAAGTGTTCTTTGTGTGAGTATGCAACTCGTAGCAAGAGTAACCTCAAGGCTCATATGAATCGTCACAGCACTGAGAAAACCCACCTATGTGACATGTGTGGCAAGAAATTCAAATCAAAAGGGACACTGAAAAGTCACAAACTCCTTCACACTGCAGATGGGAAGCAGTTTAAGTGCACGGTGTGTGACTACACACAGCGCCCAGAAGCCACAG CTGCTGCGGCACATGGAACAGCATGTCTCTCTTCAAGCCTTTTCCGCTGTGCCCATTGCCATTACTCCTGCAACATATCTGGCTCTCTGAAGCGGCACTACAACAGGAAGCACCCTAATGAGGAGTATGCCAACGTGGGCACCGGGGAGCTGCAGCGGAGGTGCTCATCCAGCAAGGTGGTTTGAAGTGTCCTGTTTGCAGCTTTGTATATGGCACCA AATGGGAGTTCAATAGGCACTTGAAGAACAAACATGGCTTGAAGGTGGTGGAAATTGATGGAGACCCCAAGTGGGAGACCCAAGCAACAGAAGCTCCTGAGGAGCCCTCACCCAGTATCCTCCACATCACAGAGGCCGAAGAAGACGTTCAAGGGACACAGGCAGCGGTGGCCGCGCTCCAGGACCTGAGATACCACCTCTGAGAGTGGCGACCGACTGGA CCCCACGGCCGTGAACATCCTGCAGCAGATCATTGAGCTGGGCGCCGAGACCCATGACGCCACTGCCCTTGCCTCGGTGGTTGCCATGGCACCAGGGACGGTGACTGTGGTTAAGCAGGTCACCGAGGAGGAGCCCAGCTCCAACCACACGGTCATGATCCAGGAGACGGTCCAGCAAGCGTCCGTGGAGCTTGCCGAGCAGCACCACCTGGT GGTGTCCTCCGACGACGTGGAGGGCATTGAGACGGTGACTGTCTACACGCAGGGCGGGAGGCCTCGGAGTTCATCGTCTACGTGCAGGAGGCCATGCAGCCTGTGGAGGAGCAGGCTGTGGAGCAGCCGGCCCAGGAACTCTAGAGGACATGTGGCATCGGATGGCCACAGGGCGGGGCTGCCAGGCTCTGCAGGCACCCAGG GTGGGGAGGCCACCCTTCCTGCCCTACCCGCAGAATGGTGCTCTCCTTTGCCCTCCCTGCCCAGCAGCCTGATAGGACTCTCCTAGTCCAACTTGGGGTGGCAAGGCAGTCAGCATCACCAGCAATACCACAGGACCCTCACCCCAGCATAGACACACACCCCCTGACCCTTACCATCTGCTTCCTGAAAGACTTCAGTGTCAGTCCCCTACACACACCCCACC TTCACCCCCTTGCTTCAAGATTCAAACAGAGACTCCCAAGTCCCCCTCAGCATCTTCCCTGAATCACAGCCCCAGCTCCTTGACCCCCATCTAGGTGCCAAATGTTCATCTGCAACCGCTATGCAGTCTGGTGAGAGGGAGACAGCCATCACATAGAAAGTGACCGTACGGGTTTTTAATCACTGCTGGGTGGGGTGGGGGTAGGGGGATTGTCCTGGCT TTGTCGACAAAGTCCCACTTCCCCGAGTATTAAGGGCCCTTGGTATCAAGTGAGGTAAATTCACCCATCACAGGGTCTCGCCCTACCATCCTGGAATTATTTCACTTTTAAGATAAATGCACTATTTCACTGTTCGCCTCCCATTCTAAGGAGGTGAGGTGGTTGGAATAAAAACAGTTCCTGTCTGAA SEQ ID NO:847 Translated polypeptide sequence of ZFAT exon 1 METRA SEQ ID NO:848 Translated polypeptide sequence of ZFAT exon 2 NTAIFMCKCCNLFSPNQSELLSHVSEKHMEEGVNVDEIIIPLRPLSTPEPPNSSKTGD SEQ ID NO:849 Translated polypeptide sequence of ZFAT exon 3 FLVMKRKRGRPKGSTKKSSTEEELAENIVSPTEDSPLAPEEGNSLPPSSLECSKCCRKFSNTRQLRKHICIIVLNLGEEEGEA SEQ ID NO:850 Translated polypeptide sequence of ZFAT exon 4 NESDLELEKKCKEDDREKASKRPRSQKTEKVQKISGKEARQLSGAKKPIISVVLTAHEAIP SEQ ID NO:851 Translated polypeptide sequence of ZFAT exon 5 ATKIVPVEAGPPETGATNSETTSADLVPRRGYQEYAIQQTPYEQPMKSS SEQ ID NO:852 Translated polypeptide sequence of ZFAT exon 6 LGPTQLKIFTCEYCNKVFKFKHSLQAHLRIHTNEKPYKCPQCSYASAIKANLNVHLRKHTGEKFACDYCSFTCLSKGHLKVHIERVHKKIKQHCRFCKKKYSDVKNLIKHIRDAHDPQDKKVKEALDELCLMTREGKRQLLYDCHICERKFKNELDRDRHMLVHGDK WPFACELCGHGATKYQALELHVRKHPFVYVCAVCRKKFVSSIRLRTHIKEVHGAAQEALVFTSSINQSFCLLEPGGDIQQEALGDQLQLVEEEFALQGVNALKEEACPGDTQLEEGRKEPEAPGEMPAPAVHLASPQAESTALPPCELETTVVSSSDLHSQEVVSDDFLLKNDTSSAEAHAAPEKPPDMQHRSSV QTQGEVITLLLSKAQSAGSDQESHGAQSPLGEGQNMAVLSAGDPDPSRCLRSNPEAEASDLLPPVAGGGDTITHQPDSCKAAPEHRSGITAFMKVLNSLQKKQMNTSLCERIRKVYGDLECEYC SEQ ID NO:853 Translated polypeptide sequence of ZFAT exon 7 KLFWYQVHFDMHVRTHTREHLYYCSQCHYSSITKNCLKRHVIQKHSNILLKCPTDGCDYSTPDKYKLQAHLKVHTAL SEQ ID NO:854 Translated polypeptide sequence of ZFAT exon 8 DKRSYSCPCEKSFSEDRLIKSHIKTNHP SEQ ID NO:855 Translated polypeptide sequence of ZFAT exon 9 VSMSTISEVLGRRVQLKGLIGKRAMKCPYCDFYFMKNGSDLQRHIWAHE SEQ ID NO:856 Translated polypeptide sequence of ZFAT exon 10 VKPFKCSLCEYATRSKSNLKAHMNRHSTEKTHLCDMCGKKFKSKGTLKSHKLLHTAD SEQ ID NO:857 Translated polypeptide sequence of ZFAT exon 11 KQFKCTVCDYTAAQKPQLLRHMEQHVSFK SEQ ID NO:858 Translated polypeptide sequence of ZFAT exon 12 PFRCAHCHYSCNISGSLKRHYNRKHPNEEYANVGTGELAAEVLIQQ SEQ ID NO:859 Translated polypeptide sequence of ZFAT exon 13 GLKCPVCSFVYGTKWEFNRHLKNKHGLKVVEIDGDPKWE SEQ ID NO:860 Translated polypeptide sequence of ZFAT exon 14 TATEAPEEPSTQYLHITEAEEDVQGTQAAVAALQDLRYTSES SEQ ID NO:861 Translated polypeptide sequence of ZFAT exon 15 DRLDPTAVNILQQIIELGAETHDATALASVVAMAPGTVTVVKQ SEQ ID NO:862 Translated polypeptide sequence of ZFAT exon 16 VTEEEPSSNHTVMIQETVQQASVELAEQHHLVVSSDDVEGIETVTVYTQGGEASEFIVYVQEAMQPVEEQAVEQPAQEL SEQ ID NO:863 Full protein sequence of ZFAT METRAAENTAIFMCKCCNLFSPNQSELLSHVSEKHMEEGVNVDEIIIPLRPPLSTPEPPNSSKTGDEFLVMKRKRGRPKGSTKKSSTEEELAENIVSPTEDSPLAPEEGNSLPPSSLECSKCCRKFSNTRQLRKHICIIVLNLGEEEGEAGNESDLELEKKCKEDDREKASKRPRSQKTEKVQKISGKEARQLSGA KKPIISVVLTAHEAIPGATKIVPVEAGPPETGATNSETTSADLVPRRGYQEYAIQQTPYEQPMKSSRLGPTQLKIFTCEYCNKVFKFKHSLQAHLRIHTNEKPYKCPQCSYASAIKANLNVHLRKHTGEKFACDYCSFTCLSKGHLKVHIERVHKKIKQHCRFCKKKYSDVKNLIK HIRDAHDPQDKKVKEALDELCLMTREGKRQLLYDCHICERKFKNELDRDRHMLVHGDKWPFACELCGHGATKYQALELHVRKHPFVYVCAVCRKKFVSSIRLRTHIKEVHGAAQEALVFTSSINQSFCLLEPGGDIQQEALGDQLQLVEEEFALQGVNALKEEACPGDTQLEEGRKEAP GEMPAPAVHLASPQAESTALPPCELETTVVSSSDLHSQEVVSDDFLLKNDTSSAEAHAAPEKPPDMQHRSSVQTQGEVITLLLSKAQSAGSDQESHGAQSPLGEGQNMAVLSAGDPDPSRCLRSNPEAEASDLLPPVAGGGDTITHQPDSCKAAPHRSGITAFMKVLNSLQKKQMNTSLCERIRKVYGDLECEYCGK LFWYQVHFDMHVRTHTREHLYYCSQCHYSSITKNCLKRHVIQKHSNILLKCPTDGCDYSTPDKYKLQAHLKVHTALDKRSYSCPCEKSFSEDRLIKSHIKTNHPEVSMSTISEVLGRRVQLKGLIGKRAMKCPYCDFYFMKNGSDLQRHIWAHEGVKPFKCSLCEYATRSKSNL KAHMNRHSTEKTHLCDMCGKKFKSKGTLKSHKLLHTADGKQFKCTVCDYTAAQKPQLLRHMEQHVSFKPFRCAHCHYSCNISGSLKRHYNRKHPNEEYANVGTGELAAEVLIQQGGLKCPVCSFVYGTKWEFNRHLKNKHGLKVVEIDGDPKWETATEAPEEEPSTQYLHITEAE EDVQGTQAAVAALQDLRYTSESGDRLDPTAVNILQQIIELGAETHDATALASVVAMAPGTVTVVKQVTEEEPSSNHTVMIQETVQQQASVELAEQHHLVVSSDDVEGIETVTVYTQGGEASEFIVYVQEAMQPVEEQAVEQPAQEL SEQ ID NO:864 The polynucleotide sequence of all exons 1-12 of ZFAT arranged in order SEQ ID NO:865 The translated polypeptide sequence of all exons 1-12 of ZFAT arranged in order DSCAML1 sequence informationSEQ ID NO:866 Sequence from DSCAML1 exon 3 at 5' of DSCAML1-NRG1 fusion GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCCAG SEQ ID NO:867 Sequence of NRG1 exon 2 at 3' of DSCAML1-NRG1 fusion CCTTGCCTCCCCGATTGAAA GAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO: 868 DSCAML1 - NRG1 polynucleotide sequence GCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCAGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGGAATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAA SEQ ID NO: 869 DSCAML1-NRG1 polypeptide sequence LIPSSVQEYVSVVSWEKD TVSIIPALPPRLKEMKSQESAAAGSKLVLRCE SEQ ID NO:870 DSCAML1 exon 1 AGCCGAGCGCTGGGCTGAGGAGCAGAGAGAGCGGGGCGCCGAGTGCGGGCGGCTGGGAGCGCGCTGAGCGGGGGAGAGGCGCTGCCGCACGGCCGGCCACAGGACCACCTCCCCGGAGAATAGGGCCTCTTTATG GCATGTGGCTGGTAACTTTCCTCCTGCTCCTGGACTCTTTACACAAAG SEQ ID NO:871 DSCAML1 exon 2 CCCGCCCTGAAGATGTTGGCACCAGCCCTACTTTGTAAATGACTCCTTGCAGCAGGTGACCTTTTCCAGCTCCGTGGGGGTGGTGGTGCCCTGCCCGGCCGCGGGCTCCCCCAGCGCGGCCCTTCGATGGTAC DSCAML1 Exon 3 TTTTCAGGGAACCCTACACCGTCCGGGTGGAGGATCAAAGGTCAATGCGTGGCAACGTGGCCGTCTTTCAAGTGCCTCATCCCCTCTTCAGTGCAGGAATATGTTAGCGTTGTATCTTGGGAGAAAGACACAGTCTCCATCATCCAG SEQ ID NO: 873 DSCAML1 exon 4 AACACAGGTTTTTTATTACCTACCACGGCGG GCTGTACATCCTCTGACGTACAGAAGGAGGACGCCCTCTCCACCTATCGCTGCATCACCAAGCACAAGTATAGCGGGGAGACCCGGCAGAGCAATGGGCACGCCTCTCTGTGACAG SEQ ID NO: 874 DSCAML1 Exon 5 ACCCTGCTGAGTCGATCCCCACCATCCTGGATGGCTTCCACTCCCAGGAAGTGTGGGCCGGCCA CACCGTGGAGCTGCCCTGCACCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGCTCAAGGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCTGACCATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGTCACCAACACCTTCGGTTCGGCAGAGGCCACAGGCATCCTCATGGTCA TTG SEQ ID NO: 875 DSCAML1 exon 6 ATCCCCTTCATGTGACCCTGACACCAAAGAAGCTGAAGACCGGCATTGGCAGCACGGTCATCCTCTCCTGTGCCCTGACGGGCTCCCCAGAGTTCACCATCCGCTGGTATCGCAACACGGAGCTGGTGCTGCCTGACGAGGCCATCTCCATCCGCGGGCTCAGCAACGAGACGCT GCTCATCACCTCGGCCCAGAAGAGCCATTCCGGGGCCTACCAGTGCTTCGCTACCCGCAAGGCCCAGACCGCCCAGGACTTTGCCATCATTGCACTTGAGG SEQ ID NO: 876 Exon of DSCAML1 7 ATGGCACGCCCCGCATCGTCTCGTCCTTCAGCGAGAAGGTGGTCAACCCCGGGGAGCAGTTCTCACTGATGTGTGCGGCCAAGGGCGCCCCGCCCCCCACGGTCACCTGGGCCCTCGACGATGAGCCCATCGTGCGGGATGGCACCACCGCACCAACCAGTACACCATGTCGGACGGCACCACCATCAGCCACATGAACGTCACA GGCCCCCAGATCCGCGACGGGGGCGTGTACCGGTGCACAGCGCGGAACTTGGTGGGCAGTGCTGAATATCAGGCGCGAATAAACGTAAGAG SEQ ID NO: 877 DSCAML1 exon 8 GCCCACCCAGCATCCGGGCTATGCGGAACATCACAGCAGTCGCCGGGCGGGACACCCTTATCAACTGCAGGGTCATCGGCTATCCCTA CTACTCCATCAAGTGGTACAAGGATGCCCTGCTGCTGCCAGACAACCACCGCCAGGTGGTGTTTGAGAATGGGACCCTCAAGCTGACTGACGTGCAGAAGGGCATGGATGAGGGGGAGTACCTGTGCAGTGTCCTCATCCAGCCCCAGCTCTCCATCAGCCAGAGCGTTCACGTAGCCGTCAAAG SEQ ID NO: 878 DSCAML1 exon 9 T GCCCCCTCTGATCCAGCCCTTCGAATTCCCACCCGCCTCCATCGGCCAGCTGCTCTACATTCCCTGTGTGGTGTCCTCGGGGGACATGCCCATCCGTATCACCTGGAGGAAGGACGGACAGGTGATCATCTCAGGCTCGGGCGTGACCATCGAGAGCAAGGAATTCATGAGCTCCCTGCAGATCTCTAGCGTCTCCCTCAAGCACAACGGCAACTATACAT GCATCGCCAGCAACGCAGCCGCCACCGTGAGCCGGGAGCGCCAGCTCCATCGTGCGTG SEQ ID NO: 879 DSCAML1 exon 10 TGCCCCCTCGATTTGTGGTGCAACCCAACAACCAGGATGGCATCTACGGCAAAGCTGGTGTGTCCAACTGCTCGGTGGACGCTACCCCCCACCCAAGGTCATGTGGAAGCATGCCAAGG SEQ ID NO: 880 DSCAML1 exon 11 GGAGCGGGAACCCCCAGCAGTACCACCCCTGTGCCCCTCACTGGCCGCATCCAGATCCTGCCCAACAGCTCGCTGCTGATCCGCCACGTCCTAGAAGAGGACATCGGCTACTACCTCTGCCAGGCCAGCAACGGCGTAGGCACCGACATCAGCAAGTCCATGTTCCTCACAGTCAAGA SEQ ID NO: 881 DSCAML1 exon 12 TCCCGGCCATGATCACTTCCCACCCCCAACACCACCATCGCCATCAAGGGCCATGCGAAGGAGCTAAACTGCACGGCACGGGGTGAGCGGCCCATCATCATCCGCTGGGAGAAGGGGGACACAGTCATCGACCCTGACCGCGTCATGCGGTATGCCATCGCCACCAAGGACAACGGCGACGA GGTCGTCTCCACACTGAAG SEQ ID NO:882 DSCAML1 exon 13 CTCAAGCCCGCTGACCGTGGGGACTCTGTGTTTCTCAGCTGCCATGCCATCAACTCGTATGGGGAGGACCGGGGCTTGATCCAACTCACTGTGCAAG SEQ ID NO:883 DSCAML1 exon 14 AGCCCCCCGACCCCCCAGAGCTGGAG ATCCGGGAGGTGAAGGCCCGGAGCATGAACCTGCGCTGGACCCAGCGATTCGACGGGAACAGCATCATCACGGGCTTCGACATTGAATACAAGAACAAATCAG SEQ ID NO: 884 DSCAML1 Exon 15 ATTCCTGGGACTTCAAGCAGTCCACACGCAACATCTCCCCCACCATCAACCAGGCCAACATTGTGGACTTGCACCCGGC ATCTGTGTACAGCATCCGCATGTACTCTTTCAACAAGATTGGCCGCAGTGAACCAAGCAAGGAGCTCACCATCAGCACTGAGGAGGCCG SEQ ID NO: 885 DSCAML1 exon 16 CTCCCGATGGGCCCCCCATGGATGTTACCTTGCAGCCAGTGACCTCACAGAGCATCCAGGTGACCTGGAAG GGGCAGTACAGCATCGTGGAGATGAAGGCCACGGGGGACAGCGAGGTCTACACCCTGGACAACCTCAAGAAGTTCGCCCAGTATGGGGTGGTGGTCCAAGCCTTCAATCGGGCTGGCACGGGGCCCTCTTCCAGCGAGATCAATGCCACCACTCTGGAGGATG SEQ ID NO: 887 DSCAML1 exon 18 TGCCCAGCC AGCCCCCTGAGAACGTCCGGGCCCTGTCCATCACTTCTGACGTGGCCGTCATCTCCTGGTCAGAGCCCCCGCGCAGCACCCTCAATGGCGTCCTCAAAGGCTATCGGGTCATCTTCTGGTCCCCTCTATGTTGATGGGG SEQ ID NO: 888 DSCAML1 Exon 19 AGTGGGGCGAGATGCAGAACATCACCACCCACGCGGGAGCGG GTGGAGCTGCGGGGCATGGAGAAGTTCCACCAACTACAGCGTCCAGGTGCTGGCCTACACCCAGGCTGGGGACGGCGTACGCAGCAGTGTGCTCTACATCCAGACCAAGGAGGACG SEQ ID NO: 889 DSCAML1 exon 20 TTCCAGGTCCCCCCTGCTGGCATCAAAGCTGTCCCCTTCATCAGCTAGCAGTGTGGTTGTGTCTTGGCTCCCCCCTACCAAGCCCAACGGGGTGATCCGAAGTACACCATCCTTCTGTTCCAGCCCCGGGTCTGGCCAGCCG SEQ ID NO: 890 DSCAML1 exon 21 GCTCCCAGCGAGTAC GAGACGAGTCCAGAGCAGCTCTTCTACCGGATCGCCCACCTAAACCGCGGTCAGCAGTATCTGCTGTGGGTGGCCGCCGTCACCTCTGCCGGCCGGGGCAACAGCAGCGAGAAGGTGACCATCGAGCCTGCTGGCAAGG SEQ ID NO: 891 DSCAML1 Exon 22 CCCCAGCAAAGATCATCTCCTTTGGGGGCACCGTGA CAACACCTTGGATGAAAGATGTTCGGCTGCCTTGCAATTCAGTGGGAGATCCAGCCCCTGCTGTGAAGTGGACCAAGGACAG SEQ ID NO: 892 DSCAML1 exon 23 TGAAGACTCGGCCATTCCAGTGTCCATGGATGGGCACCGGCTCATCCACACCAATGGCACACTGCTGCTGCGTGCAGTGAAGGCTGAGGACTCT GGCTACTACACGTGCACGGCCACCAACACTGGTGGCTTTGACACCATCATCGTCAAACCTTCTGGTGCAAG SEQ ID NO: 893 DSCAML1 Exon 24 TTCCCCCGGACCAGCCCCGCCTCACTGTCTCCAAAAACCTCAGCTTCGTCCATCACCCTGACCTGGATTCCAGGTGACAATGGGGGCAGCTCCATCCGAG SEQ ID NO: 894 DSCAML1 exon 25 GCTTCGTGCTACAGTACTCGGTGGACAACAGCGAGGAGTGGAAGGATGTGTTCATCAGCTCCAGCGAGCGCTCCTT CAAGCTGGACAGCCTCAAGTGTGGCACGTGGTACAAGGTGAAGCTGGCAGCCAAGAACAGCGTGGGCTCTGGGCGCATCAGCGAGATCATCGAGGCCAAGACCCACGGGCGGG SEQ ID NO: 895 DSCAML1 Exon 26 AGCCCCTCCTTCAGCAAAGACCAACACCTCTTCACCCACATCAACTCCACGCATGCTCGGCTTAACCTGC AGGGCTGGAACAATGGGGGCTGCCCTATCACAGCCATCGTTCTGGAGTACCGGCCCAAGGGGACCTGGGCCTGGCAGGGCCTCCGGGCCAACAGCTCCGGGGAGGTGTTTCTGACGGAACTGCGAGAGGCCACGTGGTACGAGCTGCGCATGAGGGCTTGCAACAGTGCGGGCTGCGGCAATGAAACAGCCCAGTTCGCCACC CTGGACTACGATGGCA SEQ ID NO: 896 DSCAML1 exon 27 GCACCATTCCACCCATCAAGTCTGCTCAAGGTGAAGGGGATGATGTGAAGAAGCTGTTCACCATCGGCTGCCCTGTCATCCTGGCCACACTGGGGGTGGCACTGCTCTTCATCGTACGCAAGAAGAGGAAGGAGAAACGGCTGAAGCGACTCCGAG SEQ ID NO:897 other than DSCAML1 Exon 28 ATGCAAAGAGTTTGGCAGAAATGTTGATAAG SEQ ID NO: 898 DSCAML1 exon 29 CAAGAACAATAGAAGCTTTGACACCCCTGTGAAAGGGCCACCCCAGGGCCCACGGCTACACATTGACATCCCCAGGGTCCAGCTGCTCATCGAGGACAAAGAAGGCATCAAGCAACTGG SEQ ID NO: 899 DSCA ML1 exon 30 GAGATGACAAGGCCACCATCCCTGTGACAGATGCTGAGTTCAGCCAAGCTGTCCAACCAGAGCTTCTGTACTGGCGTCTCCTTGCACCACCCAACCCTCATCCAGAGCACAGGACCCCTCATCGACATGTCTGACATCCGGCCAGGAACCA SEQ ID NO: 900 DSCAML1 exon 31 ATCCAGTGTCCAGGAAGAATGTGA AGTCAGCCCACAGCACCCGGAACCGGTACTCAAGCCAGTGGACCCTGACCAAGTGCCAGGCCTCCACACCTGCCCGCACCCTCACCTCCGACTGGCGCACCGTGGGCTCCCAGCATGGTGTCACGGTCACTGAGAGTGACAGCTACAGTGCCAGCCTGTCCCAGGACACAG SEQ ID NO: 901 exon of DSCAML1 32 ACAAAGGAAGGAACAGCATGGTGTCCACTGAGAGTGCCTCTTCCACCTACGAGGAGCTGGCCCGGGCCTATGAGCATGCCAAGCTGGAGGAGCAGCTGCAGCACGCCAAGTTTGAGATCACCGAGTGCTTCATCTCTGACAGTTCCCTGACCAGATGACCACAGGCACCAACGAGAACGCCGACAGCATGACATCCATGAGCACACCCTCCAGAGCCT GGCATCTGCCGCTTTACCGCCTCACCACCCAAGCCCCAGGATGCGGACCGGGGCAAAAACGTGGCTGTGCCCATCCCTCACCGGGCCAACAAGA SEQ ID NO: 902 DSCAML1 Exon 33 GTGACTACTGCAACCTGCCCCTGTATGCCAAGTCAGAGGCCTTCTTTCGAAAGGCAGATGGACGTGAGCCCTGCCCCGTGGTCCCACCCCC GTGAGGCCTCCATCCGGAACCTGGCTCGAACCTACCACCCAGGCTCGCCACCTGACCCTGGACCCTGCCAGCAAGTCCTTGGGCCTTCCCCCACCCGGGGCCCCCGCTGCCGCCTCCACAGCCACCTTACCTCAGAGGACTCTGGCCATGCCAGCCCCCCCAGCCGGCACAGCCCCCCCAGCCCCCGGCCCCACCCCTGCTGAGCCACCCCACCGCCCCCAGCG CTGCCCCTCCGGCCCCCAGCACCGAGCCTCCACGAGCCGGGGGCCCACACACCAAAATGGGGGGCTCCAGGGACTCGCTTCTCGAGATGAGCACATCGGGGGTAGGGAGGTCTCCAGAAGCAGGGGGCCGGGGCCTACTCCAAATCCTACACCCTGGTGTAGGGCCCGCAGGAAGAGCAGCCACGCCTGGACCGCGCCGCGCCGCAGCC CCACACGCCAGCTCGGCTGTTTTTTCTGCATTATTATATTCAACTGACAGACAAAAAACCAACCAACGACAAAACAAAAACCCCCAATCATGAACGCCTGTACATAGAACTCTTTTGTACAAATGAAACTATTTTTCTTCTTCTCCATGAAGCCAGGCACAAAGAATTTGACAGTACAAGTCAATCCCCCACCCCAAAATATGTGTGGAGATATATATACATATAGACAGAC AGGAACGCGTCCACGAGCTATATATCTATATATTTCTCTCACCCTATTTTGAGACAGAGGCACAAAGACTCAGCAATTTTTTTTCCCTCCTCCTCACCTTCCCCCAGTCTAGGTGGTTTTGACAAAGACCAAAATCCCAACTCAGAGACACTGCATGCGATTTTACTGTTCCAAAAACCAGGAGTTGCTTCAATTTGCAGATGCTTATGTGTTAATACCTTTTTTCTATGAAAAAAGAC CCAGCGCCGTGTGCAATAAAGGTTATGTTTCTA SEQ ID NO:903 The complete mRNA polynucleotide sequence of DSCAML1 SEQ ID NO:904 The translated polypeptide sequence of exon 1 of DSCAML1 MWLVTFLLLLDSLHK SEQ ID NO:905 The translated polypeptide sequence of DSCAML1 The translated polypeptide sequence of exon 2 RPEDVGTSLYFVNDSLQQVTFSSSVGVVVPCPAAGSPSAALRWYLATGDDIYDVPHIRHVHANGTLQLYPFSPSAFNSFIHDNDYFCTAENAAGKIRSPNIRVKA SEQ ID NO: 906 The translated polypeptide sequence of exon 3 of DSCAML1 FREPYTVRVEDQRSMRGNVAVFKCLIPSSV QEYVSVVSWEKDTVSIIP SEQ ID NO:907 Translated polypeptide sequence of exon 4 of DSCAML1 HRFFITYHGGLYISDVQKEDALSTYRCITKHKYSGETRQSNGARLSVT SEQ ID NO:908 Exon of DSCAML1 Translated polypeptide sequence of exon 5 PAESIPTILDGFHSQEVWAGHHTVELPCTASGYPIPAIRWLKDGRPLPADSRWTKRITGLTISDLRTEDSGTYICEVTNTFGSAEATGILMVI SEQ ID NO: 909 Translated polypeptide sequence of exon 6 of DSCAML1 PLHVTLTPKKLKTGIGSTVILSCALTGSPEFTIRWYRNTELVLPDEAISIRG LSNETLLITSAQKSHSGAYQCFATRKAQTAQDFAIIALE SEQ ID NO:910 Translated polypeptide sequence of exon 7 of DSCAML1 GTPRIVSSFSEKVVNPGEQFSLMCAAKGAPPPPTVTWALDDEPIVRDGSHRTNQYTMSDGTTISHMNVTGPQIRDGGVYRCTARNLVGSAEYQARINVR SEQ ID NO:911 DSCAML1 exon 8 Translated polypeptide sequence PPSIRAMRNITAVAGRDTLINCRVIGYPYYSIKWYKDALLLPDNHRQVVFENGTLKLTDVQKGMDEGEYLCSVLIQPQLSISQSVHVAVK SEQ ID NO: 912 Translated polypeptide sequence PPLIQPFEFPPASIGQLLYIPCVVSSGDMPIRITWRKDGQVIISGSGVTI ESKEFMSSLQISSVSLKHNGNYTCIASNAAATVSRERQLIVR SEQ ID NO:913 Translated polypeptide sequence of exon 10 of DSCAML1 PPRFVVQPNNQDGIYGKAGVLNCSVDGYPPPKVMWKHAK SEQ ID NO:914 Translation of exon 11 of DSCAML1 Peptide sequence SGNPQQYHPVPLTGRIQILPNSSLLIRHVLEEDIGYYLCQASNGVGTDISKSMFLTVK SEQ ID NO:915 Translated polypeptide sequence of exon 12 of DSCAML1 PAMITSHPNTTIAIKGHAKELNCTARGERPIIIRWEKGDTVIDPDRVMRYAIATKDNGDEVVSTLK SEQ ID NO:916 DSCAML1 exon Translated polypeptide sequence of exon 13 LKPADRGDSVFFSCHAINSYGEDRGLIQLTVQ SEQ ID NO: 917 Translated polypeptide sequence of exon 14 of DSCAML1 PPDPPELEIREVKARSMNLRWTQRFDGNSIITGFDIEYKNKS SEQ ID NO:918 Translated polypeptide sequence of exon 15 of DSCAML1 SWDFKQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIGRSEPSKELTISTEEA SEQ ID NO:919 Translated polypeptide sequence of exon 16 of DSCAML1 PDGPPMDVTLQ PVTSQSIQVTWK SEQ ID NO:920 Translated polypeptide sequence of exon 17 of DSCAML1 APKKELQNGVIRGYQIGYRENSPGSNGQYSIVEMKATGDSEVYTLDNLKKFAQYGVVVQAFNRAGTGPSSSEINATTLED SEQ ID NO:921 Translated polypeptide sequence of exon 18 of DSCAML1 PSQPPENVRALSITSDVAVISWSEPPRSTLNGVLKGYRVIFWSLYVDG SEQ ID NO:922 Translated polypeptide sequence of exon 19 of DSCAML1 WGEMQNITTTRERVELRGMEKFTNYSVQVLAYTQAGDGVRSSVLYIQTKED SEQ ID NO:9 23 Translated polypeptide sequence of exon 20 of DSCAML1 PGPPAGIKAVPSSASSVVVSWLPPTKPNGVIRKYTIFCSSPGSGQP SEQ ID NO :924 The translated polypeptide sequence of exon 21 of DSCAML1 APSEYETSPEQLFYRIAHLNRGQQYLLWVAAVTSAGRGNSSEKVTIEPAGK SEQ ID NO:925 The translated polypeptide sequence of exon 22 of DSCAML1 PAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKD SEQ ID NO:926 Outside of DSCAML1 Translated polypeptide sequence of exon 23 EDSAIPVSMDGHRLIHTNGTLLLRAVKAEDSGYYTCTATNTGGFDTIIVNLLVQ SEQ ID NO:927 Translated polypeptide sequence of exon 24 of DSCAML1 PPDQPRLTVSKTSASSITLTWIPGDNGGSSIR SEQ ID NO: 928 Translated polypeptide sequence of exon 25 of DSCAML1 FVLQYSVDNSEEWKDVFISSSERSFKLDSLKCGTWYKVKLAAKNSVGSGRISEIIEAKTHGR SEQ ID NO: 929 Translated polypeptide sequence PSFSKDQHLFTHINSTHARLNLQGWNNGGCPITAIVLEYRPKGTWAWQGLRANSSGEVFLTELREATWYELRMRACNSAGCGNETAQFATLDYDG SEQ ID NO: 930 of DSCAML1 Translated polypeptide sequence of exon 27 TIPPIKSAQGEGDDVKKLFTIGCPVILATLGVALLFIVRKKRKEKRLKRLR SEQ ID NO:931 Translated polypeptide sequence of exon 28 of DSCAML1 AKSLAEMLI SEQ ID NO:932 Translated polypeptide sequence of exon 29 of DSCAML1 KNNRSFDTPVKGPPQGPRLHIDIPRVQ LLIEDKEGIKQL SEQ ID NO:933 DSCAML1 epitope Translated polypeptide sequence of exon 30 DDKATIPVTDAEFSQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGT SEQ ID NO:934 Translated polypeptide sequence of exon 31 of DSCAML1 PVSRKNVKSAHSTRNRYSSQWTLTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDT SEQ ID NO:935 DS Translated polypeptide sequence of CAML1 exon 32 KGRNSMVSTESASSTYEELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNENADSMTSMSTPSEPGICRFTASPPKPQDADRGKNVAVPIPHRANK SEQ ID NO: 936 DSCAML1 exon 33 The translated polypeptide sequence of DYCNLPLYAKSEAFFRKADGREPCPVVPPREASIRNLARTYHTQARHLTLDPASKSLGLPGAPAAASTATLPQRTLAMPAPPAGTAPPAPGPTPAEPPTAPSAAPPAPSTEPPRAGGPHTKMGGSRDSLLEMSTSGVGRSQKQGAGAYSKSYTLV SEQ ID NO:937 The complete protein sequence of DSCAML1 SEQ ID NO:9 38 Polynucleotide sequence of all exons 1-3 of DSCAML1 arranged in order SEQ ID NO: 939 All exons 1-3 of DSCAML1 3 Translated polypeptide sequences in sequence
<![CDATA[<110> 荷蘭商美勒斯公司 (MERUS N.V.)]]>
<![CDATA[<120> 新NRG1融合物、融合接合處及檢測彼等之方法]]>
<![CDATA[<140> TW 111120470]]>
<![CDATA[<141> 2022-06-01]]>
<![CDATA[<150> NL 2028384]]>
<![CDATA[<151> 2021-06-03]]>
<![CDATA[<150> NL 2030006]]>
<![CDATA[<151> 2021-12-03]]>
<![CDATA[<160> 942 ]]>
<![CDATA[<170> PatentIn version 3.5]]>
<![CDATA[<210> 1]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 1]]>
caggtcctga gcctcgagcc gcagcacgag ctcaaattcc gag 43
<![CDATA[<210> 2]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 2]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt t 51
<![CDATA[<210> 3]]>
<![CDATA[<211> 94]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 3]]>
caggtcctga gcctcgagcc gcagcacgag ctcaaattcc gagccttgcc tccccgattg 60
aaagagatga aaagccagga atcggctgca ggtt 94
<![CDATA[<210> 4]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 4]]>
Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu Lys Phe Arg Ala Leu
1 5 10 15
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
20 25 30
<![CDATA[<210> 5]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 5]]>
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acg 53
<![CDATA[<210> 6]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 6]]>
ctacatctac atccaccact gggacaagcc at 32
<![CDATA[<210> 7]]>
<![CDATA[<211> 85]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 7]]>
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acgctacatc 60
tacatccacc actgggacaa gccat 85
<![CDATA[<210> 8]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 8]]>
Ala Ser Asn Ile Val Gly Lys Ala His Ser Asp Tyr Met Leu Tyr Val
1 5 10 15
Tyr Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His
20 25
<![CDATA[<210> 9]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 9]]>
gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac ca 52
<![CDATA[<210> 10]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 10]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaact 58
<![CDATA[<210> 11]]>
<![CDATA[<211> 110]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 11]]>
gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac caccttgcct 60
ccccgattga aagagatgaa aagccaggaa tcggctgcag gttccaaact 110
<![CDATA[<210> 12]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 12]]>
Asp Glu Asp Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala
1 5 10 15
Thr Thr Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala
20 25 30
Ala Gly Ser Lys
35
<![CDATA[<210> 13]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 13]]>
ccgcatcggc gaccttcagg ccttccaggg ccacggcgcg ggcaacctgg cgg 53
<![CDATA[<210> 14]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 14]]>
catctacatc tacatccacc actgggacaa gccat 35
<![CDATA[<210> 15]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 15]]>
ccgcatcggc gaccttcagg ccttccaggg ccacggcgcg ggcaacctgg cggcatctac 60
atctacatcc accactggga caagccat 88
<![CDATA[<210> 16]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 16]]>
Arg Ile Gly Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu
1 5 10 15
Ala Ala Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His
20 25
<![CDATA[<210> 17]]>
<![CDATA[<211> 399]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 17]]>
agtccccgcc cctggagccg gcggcgcagg gcgcagcttc ccgccgccag agcgggccag 60
cctgctgcgt gcgtgcgtgt gtacgactct gcgtgcgtgc gtgcgtgcgt gcgtgccgtc 120
agctcgccgg gcaccgcggc ctcgccctcg ccctccgccc ctgcgcctgc accgcgtaga 180
ccgacccccc cccagcgcgc ccacccggta gaggaccccc gcccgtgccc cgaccggtcc 240
ccgccttttt gtaaaactta aagcgggcgc agcattaacg cttcccgccc cggtgacctc 300
tcaggggtct ccccgccaaa ggtgctccgc cgctaaggaa catggcgaag gtggagcagg 360
tcctgagcct cgagccgcag cacgagctca aattccgag 399
<![CDATA[<210> 18]]>
<![CDATA[<211> 153]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 18]]>
gtcccttcac cgatgttgtc accaccaacc taaagcttgg caacccgaca gaccgaaatg 60
tgtgttttaa ggtgaagact acagcaccac gtaggtactg tgtgaggccc aacagcggaa 120
tcatcgatgc aggggcctca attaatgtat ctg 153
<![CDATA[<210> 19]]>
<![CDATA[<211> 104]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 19]]>
tgatgttaca gcctttcgat tatgatccca atgagaaaag taaacacaag tttatggttc 60
agtctatgtt tgctccaact gacacttcag atatggaagc agta 104
<![CDATA[<210> 20]]>
<![CDATA[<211> 81]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 20]]>
tggaaggagg caaaaccgga agaccttatg gattcaaaac ttagatgtgt gtttgaattg 60
ccagcagaga atgataaacc a 81
<![CDATA[<210> 21]]>
<![CDATA[<211> 177]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 21]]>
catgatgtag aaataaataa aattatatcc acaactgcat caaagacaga aacaccaata 60
gtgtctaagt ctctgagttc ttctttggat gacaccgaag ttaagaaggt tatggaagaa 120
tgtaagaggc tgcaaggtga agttcagagg ctacgggagg agaacaagca gttcaag 177
<![CDATA[<210> 22]]>
<![CDATA[<211> 7035]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 22]]>
gaagaagatg gactgcggat gaggaagaca gtgcagagca acagccccat ttcagcatta 60
gccccaactg ggaaggaaga aggccttagc acccggctct tggctctggt ggttttgttc 120
tttatcgttg gtgtaattat tgggaagatt gccttgtaga ggtagcatgc acaggatggt 180
aaattggatt ggtggatcca ccatatcatg ggatttaaat ttatcataac catgtgtaaa 240
aagaaattaa tgtatgatga catctcacag gtcttgcctt taaattaccc ctccctgcac 300
acacatacac agatacacac acacaaatat aatgtaacga tcttttagaa agttaaaaat 360
gtatagtaac tgattgaggg ggaaaagaat gatctttatt aatgacaagg gaaaccatga 420
gtaatgccac aatggcatat tgtaaatgtc attttaaaca ttggtaggcc ttggtacatg 480
atgctggatt acctctctta aaatgacacc cttcctcgcc tgttggtgct ggcccttggg 540
gagctggagc ccagcatgct ggggagtgcg gtcagctcca cacagtagtc cccacgtggc 600
ccactcccgg cccaggctgc tttccgtgtc ttcagttctg tccaagccat cagctccttg 660
ggactgatga acagagtcag aagcccaaag gaattgcact gtggcagcat cagacgtact 720
cgtcataagt gagaggcgtg tgttgactga ttgacccagc gctttggaaa taaatggcag 780
tgctttgttc acttaaaggg accaagctaa atttgtattg gttcatgtag tgaagtcaaa 840
ctgttattca gagatgttta atgcatattt aacttattta atgtatttca tctcatgttt 900
tcttattgtc acaagagtac agttaatgct gcgtgctgct gaactctgtt gggtgaactg 960
gtattgctgc tggagggctg tgggctcctc tgtctctgga gagtctggtc atgtggaggt 1020
ggggtttatt gggatgctgg agaagagctg ccaggaagtg ttttttctgg gtcagtaaat 1080
aacaactgtc atagggaggg aaattctcag tagtgacagt caactctagg ttaccttttt 1140
taatgaagag tagtcagtct tctagattgt tcttatacca cctctcaacc attactcaca 1200
cttccagcgc ccaggtccaa gtctgagcct gacctcccct tggggaccta gcctggagtc 1260
aggacaaatg gatcgggctg cagagggtta gaagcgaggg caccagcagt tgtgggtggg 1320
gagcaaggga agagagaaac tcttcagcga atccttctag tactagttga gagtttgact 1380
gtgaattaat tttatgccat aaaagaccaa cccagttctg tttgactatg tagcatcttg 1440
aaaagaaaaa ttataataaa gccccaaaat taagaattct tttgtcattt tgtcacattt 1500
gctctatggg gggaattatt attttatcat ttttattatt ttgccattgg aaggttaact 1560
ttaaaatgag ccctatcact gagaaatacg tgtttcatga tttaactctg tgtgtgtgtg 1620
tgtgtgtgtg tgtgtgtgtg tgtgtatttt ttttttggtt gtcttcagct gacagtatga 1680
aaaatgaaac tgctgaaaaa gctgagcacc tggtcaccct tggccttcca ttgctttggc 1740
cttcagtaaa aagcagcctc ccttctaggt cagggaacca tgccattgag actagtaacg 1800
ggcgttctgg gcacagtccc actgtgcaca ggtttgagag gacaagttca tcagaaggaa 1860
ggcagtcctt agaagtcaca tacgttgagc cacgttgctc ctaagcctgg ctctgtcaag 1920
ctgggtcagg ggccttgaaa ctggagaagt ggaagtctat ggttggtctg agtaagtaac 1980
ttcctgtctt catgaaaaaa gttgactttg aatcccaggt actcacagaa atggtgaaca 2040
gacttagttg ttacccaggc acccatggat tgtgttgagt gtgcagacag ggaggccacc 2100
ccaataggaa ttcgtctcca ggatttttcc catgtgtccc ccagtactta taaaagggag 2160
tgaaaagacc gagctgtaag gcatgtgcct tctgccacct gactttccgt gaggggacta 2220
aaatttacta attgtagttg ctgcagccag ttaagtcctg tagcttccag gccctcatgt 2280
ctttgatagg agagtgctta ggtggtcccc aacagtgcct aggggtacag tacagtccca 2340
ttacactaga gcagggctct atttattttt aaaggatatg gccgtgtgtt ttgataaaac 2400
tttattcaca aaaacagccg ggtcatggga tttggcttgt gagtctgtaa cagttcttaa 2460
aaagaatatc tgagaaacta ctctgtttta gacctttgaa ggtgatttag agttttgtgt 2520
acatctagga gaaggtgttc agcttctcag aggatgtgga cattttggtt gcagctaaaa 2580
atcagtctct gaagtctctc tcccttctag aggttaggac ttggtgaaca tgtttgtggg 2640
ccttttgact gagtggcaga aggaaactgc tcaggaagag aaacaggtga ctgatgggaa 2700
ggttgattat tttctcagtc atcctggcag ccaaaaatgt gccaggaaaa gaaagaatgt 2760
ggagcacgcg tggctcctgg aggacttgga gatgcatgca catttagggt gttttcccta 2820
gaattacata atgaaaaaaa gaataaggca aagaggaggt gaatatgggg cctgtcacaa 2880
cggcctgccc tgccccaaga gggttaagag tcagataatc gggacgaaac tggcatggaa 2940
agagcgagcc tagggaggat gccgctgggc agtgtgcatg ggggagctgc tgccaggctg 3000
ccctccagtc tgctcctgtg gttactggct ccacagcacc tcagagaggg cggccctggc 3060
ttcagaaatg ccagccatag tgctcacaaa tgcagaagag atggaagcgg tgacagaatc 3120
ctgaaagttt ttattgattg aagttttaaa ttggtaactt aagcttcctt ggcacgatac 3180
aaaatacctc ttaaagacag caggcttttt tatttgtagg tgtgaggaac tggctttaac 3240
ttttttctcc tcctagtttg catgttttcc ttctctcgtc ttctgaactg ctggcaccag 3300
cagtaataca tactgataaa atcaaaattg atttttacca gtggccagtt tatggctaga 3360
gagacgactt atacctccat aacacagaag ggggaaaaat gaagaacctc cagtgatccg 3420
tgaaaaccta aacgctttca aacaaatccc aggaacagaa ttgctatcga aagatatcat 3480
tgcccagttt gcaggctatg ttgagtcaga tagaactgaa tgtagtgaga gctcagagct 3540
acagagcctt tcagatgaat ttgaaaacag actctgtgtg tgtgtgcatg tgtgcatgtg 3600
tgcatgtgtg gcatatgtgc cgtatgtcag tagcttgaca gttttcaaat cgtgcctata 3660
tttttttgca tacacaaatt tttgtgtttg caaactcaga atccatgcca aaatacaatg 3720
ttatatgtca ttttcagctc cttctctaaa ggaatggccc atttctcatt gtagtttgag 3780
aaatacatgt atgaagagat aggggtcttg ggcttcccag tgtcactttg aacacctgaa 3840
taacatttaa ctcctgagac cttctcggtg tagaggccac tgcttccccc tgctggagat 3900
ggcatttcat tgaagggcct ctcgtggctt tccctgcccc cggctgtctg gcctgaagaa 3960
ggagaaagaa ccaaactgaa ctatgaaaag ttaccactct gaggagacct ctcttaatta 4020
acacttgggg ccatgtttgc tgttgttgag aaggagtgtt ctcaaagatg agctggaatg 4080
gaattgtatt tagaaaggcc cctgcaaagt atatagatgg atgactctag ttcatgacat 4140
acaaatccca taaggccaac gaccactctt ctggaacacc aagagcagct ctgagatcat 4200
gctggcccta cgcgaattga gtttctgtgg cctaattgga tttggagaac gccttccctg 4260
gccccttttc ctcagacaga tctgctctga taggaacctt ttcaagaaag ttactgttgt 4320
ttcaatgcca ctccttacct gtatagaaca tttccaatac attcgctcat tgaacttaat 4380
ccttgcaact gtgactgggg ggtagatggc tctgtttgca tacgaagaaa taaaggctcc 4440
aggaggttaa atcgggcaac tttttagaac taaatcagtc tctgtaaggc ctacattgct 4500
aagataccat ttcagctctg aaaatctgct tcagggaagt gagtggatga ggccttcctg 4560
cctcagctac tctgcccgtc tgtacatctt ttgtgtctgc ctccgtacct tattcagtta 4620
ttttcacact aaagtaagta gaattaagac tgtagttcag atgctttttc tttttctgtt 4680
ggaaactgaa cacactacag acagtgaaaa aaggtacata ttccattttc tcattgcctg 4740
aagatctctg ctgatgctcc tggagaatga ctttgggggc tttagaaaga atattgccag 4800
tccgtctcgg caaggagatg atgggagcgc tttatatgga ggctttacat gacttgtaaa 4860
ttaaatgtga atgagggcag ttgattaaaa ttggtattac agaagggccc tgctgaggtt 4920
tgaaaacagc tgagctgctg atgtctcagg cctttccctg aattagcact gcggttctcc 4980
aggatatcag caaagagggc aagtaataga agcccctgat aaggagcgtc agccgacagg 5040
caagcttggg aggctgtggg aatgggtctg cccccagctt cacagacctc ttcctccagc 5100
ctctgaatcc cattagccac agcctagaac attagctgag ctgcacaagc tcacccaccc 5160
ctgtgccagg gggccctgac ctccctccat gccatgtttt tggctgtatc tacggcactt 5220
aacaataggg gctttttatt ttcattacag agatattttg aaaaatttaa aagacatgaa 5280
ctcacataaa cagttatgga tgatagttaa aagagaaacg ggtggaggtg gatgagaggt 5340
tgtcttcatg aatataatta cttgagattt ttttttctta atggaattag tttattagaa 5400
aatgtctgtg ttaaatccgt agaaaaggaa gaaaagtgta gcaacaaaaa tgtagccatt 5460
atctaacttg ccataaatat ttgcagttat gataccttgg aatgttgcca cgatatggat 5520
tgctttgatt aaaagatgtc agttgaataa aacagtactg tgggagaatc gctttctgct 5580
gctagataaa tgctgatgtt tatttttaaa ccaggaaaca ttgatcctgt aacaatgccc 5640
gattacaatt gctttattac accccagggc tgatggagat gtaatcactt ggctaatgga 5700
tgtgggtgca ggacagatgc tcgcttgctg gcctgctttc ctgcttgcat tctgatgagc 5760
tgcaggagtg cgcctggcct tctgcaggtg gagctgctgt cagagcttcg tttcactgat 5820
acccaaagcc atgtctgact gaaataaaac aggttccctt tttttttccc tttggaaaat 5880
gccaactaag ggagactaat cagatatctt aacacaattt catccaggct tagtgctaac 5940
aagattgcgg ggctttttag ggtttaagaa gatgagaaat gagtgtgcac gtttcacacg 6000
ttgacttgcc ggtttttcca tgtcatacaa aaaagtcctg gctgtttctc cgaactggct 6060
gcctgcattc ccgtctttct tttgttttta agaaatagac tgaattcagc tgttaatcct 6120
ctagtacagt atccatgtta aaatgttttt ccattgcatc ttttatgtga attcaaaggt 6180
cagaatttat tgtctgtgat attgagacca tgtgtacaag aactactttt tgcttttcat 6240
cattcactcc ttagcaaacg tttcgtaagt accctctgtc tgtttgctac tatatgaggt 6300
gctgcgaaat tagtgggcgt ggctttttat atttttcatt cgtgtgtagc ctaagtaagg 6360
tgactcaaga tgatacaccg agagaaaaat gcaaaatata tttggttctc atttctgttg 6420
ctgtcgtttc ctttttaaag acgatttatc aactgctgcc atttggaact tcctataaga 6480
aactaaaaat gatctatttc agtgttcctt tcgcctttcc tctgctttct gaataaatgg 6540
tttcagtaac ccatgctgtt ctctccctat tctacgtctt tctccctatg ttgaaaaaag 6600
attcccacag tttctgatgt gtgtgtttat agtcttcaat gtatgttaac atgttaggaa 6660
ctgagtatct taagagatgt cttagaatgc tttagttttc ataatttgtc ctttatgtat 6720
ttttcattgt atttgctgtt ttgacatgga agtaatttaa aaagttggtg caggaaagga 6780
ctctttactg ttgcacattt tggttttctg atatgtaata aattcatggc ttggcagctg 6840
acatgatgtt tcccagagag aaggagatgt atttctgcag ggtccagacc aaaagagcca 6900
tttacagcat gttctcccat gttccattat cagcctgatg aaacctgccc tgccaaggca 6960
taaacttttg tactagctgt ctccatatta tgttcaataa attctgtgct ctgaatatat 7020
ttaaaaaaaa aaaaa 7035
<![CDATA[<210> 23]]>
<![CDATA[<211> 7949]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 23]]>
agtccccgcc cctggagccg gcggcgcagg gcgcagcttc ccgccgccag agcgggccag 60
cctgctgcgt gcgtgcgtgt gtacgactct gcgtgcgtgc gtgcgtgcgt gcgtgccgtc 120
agctcgccgg gcaccgcggc ctcgccctcg ccctccgccc ctgcgcctgc accgcgtaga 180
ccgacccccc cccagcgcgc ccacccggta gaggaccccc gcccgtgccc cgaccggtcc 240
ccgccttttt gtaaaactta aagcgggcgc agcattaacg cttcccgccc cggtgacctc 300
tcaggggtct ccccgccaaa ggtgctccgc cgctaaggaa catggcgaag gtggagcagg 360
tcctgagcct cgagccgcag cacgagctca aattccgagg tcccttcacc gatgttgtca 420
ccaccaacct aaagcttggc aacccgacag accgaaatgt gtgttttaag gtgaagacta 480
cagcaccacg taggtactgt gtgaggccca acagcggaat catcgatgca ggggcctcaa 540
ttaatgtatc tgtgatgtta cagcctttcg attatgatcc caatgagaaa agtaaacaca 600
agtttatggt tcagtctatg tttgctccaa ctgacacttc agatatggaa gcagtatgga 660
aggaggcaaa accggaagac cttatggatt caaaacttag atgtgtgttt gaattgccag 720
cagagaatga taaaccacat gatgtagaaa taaataaaat tatatccaca actgcatcaa 780
agacagaaac accaatagtg tctaagtctc tgagttcttc tttggatgac accgaagtta 840
agaaggttat ggaagaatgt aagaggctgc aaggtgaagt tcagaggcta cgggaggaga 900
acaagcagtt caaggaagaa gatggactgc ggatgaggaa gacagtgcag agcaacagcc 960
ccatttcagc attagcccca actgggaagg aagaaggcct tagcacccgg ctcttggctc 1020
tggtggtttt gttctttatc gttggtgtaa ttattgggaa gattgccttg tagaggtagc 1080
atgcacagga tggtaaattg gattggtgga tccaccatat catgggattt aaatttatca 1140
taaccatgtg taaaaagaaa ttaatgtatg atgacatctc acaggtcttg cctttaaatt 1200
acccctccct gcacacacat acacagatac acacacacaa atataatgta acgatctttt 1260
agaaagttaa aaatgtatag taactgattg agggggaaaa gaatgatctt tattaatgac 1320
aagggaaacc atgagtaatg ccacaatggc atattgtaaa tgtcatttta aacattggta 1380
ggccttggta catgatgctg gattacctct cttaaaatga cacccttcct cgcctgttgg 1440
tgctggccct tggggagctg gagcccagca tgctggggag tgcggtcagc tccacacagt 1500
agtccccacg tggcccactc ccggcccagg ctgctttccg tgtcttcagt tctgtccaag 1560
ccatcagctc cttgggactg atgaacagag tcagaagccc aaaggaattg cactgtggca 1620
gcatcagacg tactcgtcat aagtgagagg cgtgtgttga ctgattgacc cagcgctttg 1680
gaaataaatg gcagtgcttt gttcacttaa agggaccaag ctaaatttgt attggttcat 1740
gtagtgaagt caaactgtta ttcagagatg tttaatgcat atttaactta tttaatgtat 1800
ttcatctcat gttttcttat tgtcacaaga gtacagttaa tgctgcgtgc tgctgaactc 1860
tgttgggtga actggtattg ctgctggagg gctgtgggct cctctgtctc tggagagtct 1920
ggtcatgtgg aggtggggtt tattgggatg ctggagaaga gctgccagga agtgtttttt 1980
ctgggtcagt aaataacaac tgtcataggg agggaaattc tcagtagtga cagtcaactc 2040
taggttacct tttttaatga agagtagtca gtcttctaga ttgttcttat accacctctc 2100
aaccattact cacacttcca gcgcccaggt ccaagtctga gcctgacctc cccttgggga 2160
cctagcctgg agtcaggaca aatggatcgg gctgcagagg gttagaagcg agggcaccag 2220
cagttgtggg tggggagcaa gggaagagag aaactcttca gcgaatcctt ctagtactag 2280
ttgagagttt gactgtgaat taattttatg ccataaaaga ccaacccagt tctgtttgac 2340
tatgtagcat cttgaaaaga aaaattataa taaagcccca aaattaagaa ttcttttgtc 2400
attttgtcac atttgctcta tggggggaat tattatttta tcatttttat tattttgcca 2460
ttggaaggtt aactttaaaa tgagccctat cactgagaaa tacgtgtttc atgatttaac 2520
tctgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgta tttttttttt ggttgtcttc 2580
agctgacagt atgaaaaatg aaactgctga aaaagctgag cacctggtca cccttggcct 2640
tccattgctt tggccttcag taaaaagcag cctcccttct aggtcaggga accatgccat 2700
tgagactagt aacgggcgtt ctgggcacag tcccactgtg cacaggtttg agaggacaag 2760
ttcatcagaa ggaaggcagt ccttagaagt cacatacgtt gagccacgtt gctcctaagc 2820
ctggctctgt caagctgggt caggggcctt gaaactggag aagtggaagt ctatggttgg 2880
tctgagtaag taacttcctg tcttcatgaa aaaagttgac tttgaatccc aggtactcac 2940
agaaatggtg aacagactta gttgttaccc aggcacccat ggattgtgtt gagtgtgcag 3000
acagggaggc caccccaata ggaattcgtc tccaggattt ttcccatgtg tcccccagta 3060
cttataaaag ggagtgaaaa gaccgagctg taaggcatgt gccttctgcc acctgacttt 3120
ccgtgagggg actaaaattt actaattgta gttgctgcag ccagttaagt cctgtagctt 3180
ccaggccctc atgtctttga taggagagtg cttaggtggt ccccaacagt gcctaggggt 3240
acagtacagt cccattacac tagagcaggg ctctatttat ttttaaagga tatggccgtg 3300
tgttttgata aaactttatt cacaaaaaca gccgggtcat gggatttggc ttgtgagtct 3360
gtaacagttc ttaaaaagaa tatctgagaa actactctgt tttagacctt tgaaggtgat 3420
ttagagtttt gtgtacatct aggagaaggt gttcagcttc tcagaggatg tggacatttt 3480
ggttgcagct aaaaatcagt ctctgaagtc tctctccctt ctagaggtta ggacttggtg 3540
aacatgtttg tgggcctttt gactgagtgg cagaaggaaa ctgctcagga agagaaacag 3600
gtgactgatg ggaaggttga ttattttctc agtcatcctg gcagccaaaa atgtgccagg 3660
aaaagaaaga atgtggagca cgcgtggctc ctggaggact tggagatgca tgcacattta 3720
gggtgttttc cctagaatta cataatgaaa aaaagaataa ggcaaagagg aggtgaatat 3780
ggggcctgtc acaacggcct gccctgcccc aagagggtta agagtcagat aatcgggacg 3840
aaactggcat ggaaagagcg agcctaggga ggatgccgct gggcagtgtg catgggggag 3900
ctgctgccag gctgccctcc agtctgctcc tgtggttact ggctccacag cacctcagag 3960
agggcggccc tggcttcaga aatgccagcc atagtgctca caaatgcaga agagatggaa 4020
gcggtgacag aatcctgaaa gtttttattg attgaagttt taaattggta acttaagctt 4080
ccttggcacg atacaaaata cctcttaaag acagcaggct tttttatttg taggtgtgag 4140
gaactggctt taactttttt ctcctcctag tttgcatgtt ttccttctct cgtcttctga 4200
actgctggca ccagcagtaa tacatactga taaaatcaaa attgattttt accagtggcc 4260
agtttatggc tagagagacg acttatacct ccataacaca gaagggggaa aaatgaagaa 4320
cctccagtga tccgtgaaaa cctaaacgct ttcaaacaaa tcccaggaac agaattgcta 4380
tcgaaagata tcattgccca gtttgcaggc tatgttgagt cagatagaac tgaatgtagt 4440
gagagctcag agctacagag cctttcagat gaatttgaaa acagactctg tgtgtgtgtg 4500
catgtgtgca tgtgtgcatg tgtggcatat gtgccgtatg tcagtagctt gacagttttc 4560
aaatcgtgcc tatatttttt tgcatacaca aatttttgtg tttgcaaact cagaatccat 4620
gccaaaatac aatgttatat gtcattttca gctccttctc taaaggaatg gcccatttct 4680
cattgtagtt tgagaaatac atgtatgaag agataggggt cttgggcttc ccagtgtcac 4740
tttgaacacc tgaataacat ttaactcctg agaccttctc ggtgtagagg ccactgcttc 4800
cccctgctgg agatggcatt tcattgaagg gcctctcgtg gctttccctg cccccggctg 4860
tctggcctga agaaggagaa agaaccaaac tgaactatga aaagttacca ctctgaggag 4920
acctctctta attaacactt ggggccatgt ttgctgttgt tgagaaggag tgttctcaaa 4980
gatgagctgg aatggaattg tatttagaaa ggcccctgca aagtatatag atggatgact 5040
ctagttcatg acatacaaat cccataaggc caacgaccac tcttctggaa caccaagagc 5100
agctctgaga tcatgctggc cctacgcgaa ttgagtttct gtggcctaat tggatttgga 5160
gaacgccttc cctggcccct tttcctcaga cagatctgct ctgataggaa ccttttcaag 5220
aaagttactg ttgtttcaat gccactcctt acctgtatag aacatttcca atacattcgc 5280
tcattgaact taatccttgc aactgtgact ggggggtaga tggctctgtt tgcatacgaa 5340
gaaataaagg ctccaggagg ttaaatcggg caacttttta gaactaaatc agtctctgta 5400
aggcctacat tgctaagata ccatttcagc tctgaaaatc tgcttcaggg aagtgagtgg 5460
atgaggcctt cctgcctcag ctactctgcc cgtctgtaca tcttttgtgt ctgcctccgt 5520
accttattca gttattttca cactaaagta agtagaatta agactgtagt tcagatgctt 5580
tttctttttc tgttggaaac tgaacacact acagacagtg aaaaaaggta catattccat 5640
tttctcattg cctgaagatc tctgctgatg ctcctggaga atgactttgg gggctttaga 5700
aagaatattg ccagtccgtc tcggcaagga gatgatggga gcgctttata tggaggcttt 5760
acatgacttg taaattaaat gtgaatgagg gcagttgatt aaaattggta ttacagaagg 5820
gccctgctga ggtttgaaaa cagctgagct gctgatgtct caggcctttc cctgaattag 5880
cactgcggtt ctccaggata tcagcaaaga gggcaagtaa tagaagcccc tgataaggag 5940
cgtcagccga caggcaagct tgggaggctg tgggaatggg tctgccccca gcttcacaga 6000
cctcttcctc cagcctctga atcccattag ccacagccta gaacattagc tgagctgcac 6060
aagctcaccc acccctgtgc cagggggccc tgacctccct ccatgccatg tttttggctg 6120
tatctacggc acttaacaat aggggctttt tattttcatt acagagatat tttgaaaaat 6180
ttaaaagaca tgaactcaca taaacagtta tggatgatag ttaaaagaga aacgggtgga 6240
ggtggatgag aggttgtctt catgaatata attacttgag attttttttt cttaatggaa 6300
ttagtttatt agaaaatgtc tgtgttaaat ccgtagaaaa ggaagaaaag tgtagcaaca 6360
aaaatgtagc cattatctaa cttgccataa atatttgcag ttatgatacc ttggaatgtt 6420
gccacgatat ggattgcttt gattaaaaga tgtcagttga ataaaacagt actgtgggag 6480
aatcgctttc tgctgctaga taaatgctga tgtttatttt taaaccagga aacattgatc 6540
ctgtaacaat gcccgattac aattgcttta ttacacccca gggctgatgg agatgtaatc 6600
acttggctaa tggatgtggg tgcaggacag atgctcgctt gctggcctgc tttcctgctt 6660
gcattctgat gagctgcagg agtgcgcctg gccttctgca ggtggagctg ctgtcagagc 6720
ttcgtttcac tgatacccaa agccatgtct gactgaaata aaacaggttc cctttttttt 6780
tccctttgga aaatgccaac taagggagac taatcagata tcttaacaca atttcatcca 6840
ggcttagtgc taacaagatt gcggggcttt ttagggttta agaagatgag aaatgagtgt 6900
gcacgtttca cacgttgact tgccggtttt tccatgtcat acaaaaaagt cctggctgtt 6960
tctccgaact ggctgcctgc attcccgtct ttcttttgtt tttaagaaat agactgaatt 7020
cagctgttaa tcctctagta cagtatccat gttaaaatgt ttttccattg catcttttat 7080
gtgaattcaa aggtcagaat ttattgtctg tgatattgag accatgtgta caagaactac 7140
tttttgcttt tcatcattca ctccttagca aacgtttcgt aagtaccctc tgtctgtttg 7200
ctactatatg aggtgctgcg aaattagtgg gcgtggcttt ttatattttt cattcgtgtg 7260
tagcctaagt aaggtgactc aagatgatac accgagagaa aaatgcaaaa tatatttggt 7320
tctcatttct gttgctgtcg tttccttttt aaagacgatt tatcaactgc tgccatttgg 7380
aacttcctat aagaaactaa aaatgatcta tttcagtgtt cctttcgcct ttcctctgct 7440
ttctgaataa atggtttcag taacccatgc tgttctctcc ctattctacg tctttctccc 7500
tatgttgaaa aaagattccc acagtttctg atgtgtgtgt ttatagtctt caatgtatgt 7560
taacatgtta ggaactgagt atcttaagag atgtcttaga atgctttagt tttcataatt 7620
tgtcctttat gtatttttca ttgtatttgc tgttttgaca tggaagtaat ttaaaaagtt 7680
ggtgcaggaa aggactcttt actgttgcac attttggttt tctgatatgt aataaattca 7740
tggcttggca gctgacatga tgtttcccag agagaaggag atgtatttct gcagggtcca 7800
gaccaaaaga gccatttaca gcatgttctc ccatgttcca ttatcagcct gatgaaacct 7860
gccctgccaa ggcataaact tttgtactag ctgtctccat attatgttca ataaattctg 7920
tgctctgaat atatttaaaa aaaaaaaaa 7949
<![CDATA[<210> 24]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 24]]>
Met Ala Lys Val Glu Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu
1 5 10 15
Lys Phe Arg
<![CDATA[<210> 25]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 25]]>
Pro Phe Thr Asp Val Val Thr Thr Asn Leu Lys Leu Gly Asn Pro Thr
1 5 10 15
Asp Arg Asn Val Cys Phe Lys Val Lys Thr Thr Ala Pro Arg Arg Tyr
20 25 30
Cys Val Arg Pro Asn Ser Gly Ile Ile Asp Ala Gly Ala Ser Ile Asn
35 40 45
Val Ser
50
<![CDATA[<210> 26]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 26]]>
Met Leu Gln Pro Phe Asp Tyr Asp Pro Asn Glu Lys Ser Lys His Lys
1 5 10 15
Phe Met Val Gln Ser Met Phe Ala Pro Thr Asp Thr Ser Asp Met Glu
20 25 30
Ala Val
<![CDATA[<210> 27]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 27]]>
Trp Lys Glu Ala Lys Pro Glu Asp Leu Met Asp Ser Lys Leu Arg Cys
1 5 10 15
Val Phe Glu Leu Pro Ala Glu Asn Asp Lys Pro
20 25
<![CDATA[<210> 28]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 28]]>
His Asp Val Glu Ile Asn Lys Ile Ile Ser Thr Thr Ala Ser Lys Thr
1 5 10 15
Glu Thr Pro Ile Val Ser Lys Ser Leu Ser Ser Ser Leu Asp Asp Thr
20 25 30
Glu Val Lys Lys Val Met Glu Glu Cys Lys Arg Leu Gln Gly Glu Val
35 40 45
Gln Arg Leu Arg Glu Glu Asn Lys Gln Phe Lys
50 55
<![CDATA[<210> 29]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 29]]>
Glu Glu Asp Gly Leu Arg Met Arg Lys Thr Val Gln Ser Asn Ser Pro
1 5 10 15
Ile Ser Ala Leu Ala Pro Thr Gly Lys Glu Glu Gly Leu Ser Thr Arg
20 25 30
Leu Leu Ala Leu Val Val Leu Phe Phe Ile Val Gly Val Ile Ile Gly
35 40 45
Lys Ile Ala Leu
50
<![CDATA[<210> 30]]>
<![CDATA[<211> 243]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 30]]>
Met Ala Lys Val Glu Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu
1 5 10 15
Lys Phe Arg Gly Pro Phe Thr Asp Val Val Thr Thr Asn Leu Lys Leu
20 25 30
Gly Asn Pro Thr Asp Arg Asn Val Cys Phe Lys Val Lys Thr Thr Ala
35 40 45
Pro Arg Arg Tyr Cys Val Arg Pro Asn Ser Gly Ile Ile Asp Ala Gly
50 55 60
Ala Ser Ile Asn Val Ser Val Met Leu Gln Pro Phe Asp Tyr Asp Pro
65 70 75 80
Asn Glu Lys Ser Lys His Lys Phe Met Val Gln Ser Met Phe Ala Pro
85 90 95
Thr Asp Thr Ser Asp Met Glu Ala Val Trp Lys Glu Ala Lys Pro Glu
100 105 110
Asp Leu Met Asp Ser Lys Leu Arg Cys Val Phe Glu Leu Pro Ala Glu
115 120 125
Asn Asp Lys Pro His Asp Val Glu Ile Asn Lys Ile Ile Ser Thr Thr
130 135 140
Ala Ser Lys Thr Glu Thr Pro Ile Val Ser Lys Ser Leu Ser Ser Ser
145 150 155 160
Leu Asp Asp Thr Glu Val Lys Lys Val Met Glu Glu Cys Lys Arg Leu
165 170 175
Gln Gly Glu Val Gln Arg Leu Arg Glu Glu Asn Lys Gln Phe Lys Glu
180 185 190
Glu Asp Gly Leu Arg Met Arg Lys Thr Val Gln Ser Asn Ser Pro Ile
195 200 205
Ser Ala Leu Ala Pro Thr Gly Lys Glu Glu Gly Leu Ser Thr Arg Leu
210 215 220
Leu Ala Leu Val Val Leu Phe Phe Ile Val Gly Val Ile Ile Gly Lys
225 230 235 240
Ile Ala Leu
<![CDATA[<210> 31]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 31]]>
000
<![CDATA[<210> 32]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 32]]>
000
<![CDATA[<210> 33]]>
<![CDATA[<211> 253]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 33]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca cag 253
<![CDATA[<210> 34]]>
<![CDATA[<211> 147]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 34]]>
gtgatgggca gaatctgttt acgaaagacg tgacagtgat cgagggagag gttgcgacca 60
tcagttgcca agtcaataag agtgacgact ctgtgattca gctactgaat cccaacaggc 120
agaccattta tttcagggac ttcaggc 147
<![CDATA[<210> 35]]>
<![CDATA[<211> 153]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 35]]>
ctttgaagga cagcaggttt cagttgctga atttttctag cagtgaactc aaagtatcat 60
tgacaaacgt ctcaatttct gatgaaggaa gatacttttg ccagctctat accgatcccc 120
cacaggaaag ttacaccacc atcacagtcc tgg 153
<![CDATA[<210> 36]]>
<![CDATA[<211> 138]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 36]]>
tcccaccacg taatctgatg atcgatatcc agaaagacac tgcggtggaa ggtgaggaga 60
ttgaagtcaa ctgcactgct atggccagca agccagccac gactatcagg tggttcaaag 120
ggaacacaga gctaaaag 138
<![CDATA[<210> 37]]>
<![CDATA[<211> 159]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 37]]>
gcaaatcgga ggtggaagag tggtcagaca tgtacactgt gaccagtcag ctgatgctga 60
aggtgcacaa ggaggacgat ggggtcccag tgatctgcca ggtggagcac cctgcggtca 120
ctggaaacct gcagacccag cggtatctag aagtacagt 159
<![CDATA[<210> 38]]>
<![CDATA[<211> 100]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 38]]>
ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc cgggaagggg 60
acgcgcttga gttaacatgt gaagccatcg ggaagcccca 100
<![CDATA[<210> 39]]>
<![CDATA[<211> 173]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 39]]>
gcctgtgatg gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc 60
tgggcccaac ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga 120
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acg 173
<![CDATA[<210> 40]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 40]]>
acacaacggc gacgacagaa ccagcagttc acg 33
<![CDATA[<210> 41]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 41]]>
gccttactca gttgcccaat tccgcagaag aactggacag tgaggacctc tcag 54
<![CDATA[<210> 42]]>
<![CDATA[<211> 132]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 42]]>
attcccgagc aggtgaagaa ggctcgatca gggcagtgga tcatgccgtg atcggtggcg 60
tcgtggcggt ggtggtgttc gccatgctgt gcttgctcat cattctgggg cgctattttg 120
ccagacataa ag 132
<![CDATA[<210> 43]]>
<![CDATA[<211> 2968]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 43]]>
gtacatactt cactcatgaa gccaaaggag ccgatgacgc agcagacgca gacacagcta 60
taatcaatgc agaaggagga cagaacaact ccgaagaaaa gaaagagtac ttcatctaga 120
tcagcctttt tgtttcaatg aggtgtccaa ctggccctat ttagatgata aagagacagt 180
gatattggaa cttgcgagaa attcgtgtgt ttttttatga atgggtggaa aggtgtgaga 240
ctgggaaggc ttgggatttg ctgtgtaaaa aaaaaaaaaa tgttctttgg aaagtacact 300
ctgctgtttg acacctcttt tttcgtttgt ttgtttgttt aatttttatt tcttcctacc 360
aagtcaaact tggatacttg gatttagttt cagtagattg cagaaaattc tgtgccttgt 420
tttttgtttg tttgttgcgt tcctttcttt tccccctttg tgcacattta tttcctccct 480
ctaccccaat ttcggatttt ttccaaaatc tcccattttg gaatttgcct gctgggattc 540
cttagactct tttccttccc ttttctgttc tagtttttta cttttgttta tttttatggt 600
aactgctttc tgttccaaat tcagtttcat aaaaggagaa ccagcacagc ttagatttca 660
tagttcagaa tttagtgtat ccataatgca ttcttctctg ttgtcgtaaa gatttgggtg 720
aacaaacaat gaaaactctt tgctgctgcc catgtttcaa atacttagag cagtgaagac 780
tagaaaatta gactgtgatt cagaaaatgt tctgtttgct gtggaactac attactgtac 840
agggttatct gcaagtgagg tgtgtcacaa tgagattgaa tttcactgtc tttaattctg 900
tatctgtaga cggctcagta tagataccct acgctgtcca gaaaggtttg gggcagaaag 960
gactcctcct ttttccatgc cctaaacaga cctgacaggt gaggtctgtt ccttttatat 1020
aagtggacaa attttgagtt gccacaggag gggaagtagg gaggggggaa atacagttct 1080
gctctggttg tttctgttcc aaatgattcc atccaccttt cccaatcggc cttacttctc 1140
actaatttgt aggaaaaagc aagttcgtct gttgtgcgaa tgactgaatg ggacagagtt 1200
gatttttttt tttttttcct ttgtgcttag ttaggaaggc agtaggatgt ggcctgcatg 1260
tactgtatat tacagatatt tgtcatgctg ggatttccaa ctcgaatctg tgtgaaactt 1320
tcattccttc agatttggct tgacaaaggc aggaggtaca aaagaagggc tggtattgtt 1380
ctcacactgg tctgctgtcg ctctcagttc tcgataggtc agagcagagg tggaaaaaca 1440
gcatgtacgg attttcagtt acttaatcaa aactcaaatg tgagtgtttt tatcttttta 1500
cctttcatac actagccttg gcctctttcc tcagccttaa gaaccatctg ccaaaaatta 1560
ctgatcctcg catgatggca gccatagtgc atagctacta aaatcagtga ccttgaacat 1620
atcttagatg gggagcctcg ggaaaaggta gaggagtcac gttaccattt acatgtttta 1680
aagaaagaag tgtggggatt ttcactgaaa cgtctaggaa atctagaagt agtcctgaag 1740
gacagaaact aaactcttac catatgtttg gtaagactcc agactccagc taacagtccc 1800
tatggaaaga tggcatcaaa aaagatagat ctatatatat atataaatat atattctatt 1860
acattttcag tgagtaattt tggattttgc aaggtgcatt tttactattg ttacattatg 1920
tggaaaactt atgctgattt atttaagggg gaaaaagtgt caactctttg ttatttgaaa 1980
acatgtttat ttttcttgtc tttattttaa cctttgatag aaccattgca atatgggggc 2040
cttttgggaa cggactggta tgtaaaagaa aatccattat cgagcagcat tttatttacc 2100
cctcccctat ccctaggcac ttaaccaaga caaaaagcca caatgaacat ccctttttca 2160
atgaatttta taatctgcag ctctattccg agcccttagc acccattccg accatagtat 2220
aatcatatca aagggtgaga atcatttagc atgttgttga aaggtttttt ttcagttgtt 2280
ctttttagaa aaaaagaaaa acaaaaacaa aaacaaaaaa aaaaaatcac accattgctc 2340
acagaattgg catctcattt ttgggacctc ccatctttct gttttgaaaa gtgtacagta 2400
gtgcagtgtt cctgatgtaa ctttatggct tacaatgttg acatgtctca ggttcatgtg 2460
ttgcgattgg tgttttccgt ctcaggtaga ttgcaaagtg taggccccac acattggaaa 2520
aaataataat aaaacaaagc aaaaacagga aattatggat tttagttgta tattggttta 2580
tgtatttttt cttaagtata cagtgcactg tttgaaatgt attgttgagt attactttgt 2640
acaggttgat cacttttttt agagtgaaga aagaacaaac ttgttttttg tgttttttaa 2700
aggaatataa aataatgaag gatgtataat tgatgccaaa taagcttgtt ctttagtcac 2760
accgacgtct tatttttccc tttaggccag ttctgttttt aaggtgtaca tggacaatgt 2820
tacagtgtaa gaaactccat atccatatgt tcccattcgc attttgtatt ggttcatgta 2880
taccattttt acaaaaaaaa aaagaaaaaa aagaagtact ataaaatatc tgtcttctta 2940
ataaaaaaaa attaatgtta caaagtga 2968
<![CDATA[<210> 44]]>
<![CDATA[<211> 4310]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 44]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca caggtgatgg gcagaatctg tttacgaaag acgtgacagt gatcgaggga 300
gaggttgcga ccatcagttg ccaagtcaat aagagtgacg actctgtgat tcagctactg 360
aatcccaaca ggcagaccat ttatttcagg gacttcaggc ctttgaagga cagcaggttt 420
cagttgctga atttttctag cagtgaactc aaagtatcat tgacaaacgt ctcaatttct 480
gatgaaggaa gatacttttg ccagctctat accgatcccc cacaggaaag ttacaccacc 540
atcacagtcc tggtcccacc acgtaatctg atgatcgata tccagaaaga cactgcggtg 600
gaaggtgagg agattgaagt caactgcact gctatggcca gcaagccagc cacgactatc 660
aggtggttca aagggaacac agagctaaaa ggcaaatcgg aggtggaaga gtggtcagac 720
atgtacactg tgaccagtca gctgatgctg aaggtgcaca aggaggacga tggggtccca 780
gtgatctgcc aggtggagca ccctgcggtc actggaaacc tgcagaccca gcggtatcta 840
gaagtacagt ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc 900
cgggaagggg acgcgcttga gttaacatgt gaagccatcg ggaagcccca gcctgtgatg 960
gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc tgggcccaac 1020
ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga agcttcaaac 1080
atagtgggga aagctcactc ggattatatg ctgtatgtat acgacacaac ggcgacgaca 1140
gaaccagcag ttcacggcct tactcagttg cccaattccg cagaagaact ggacagtgag 1200
gacctctcag attcccgagc aggtgaagaa ggctcgatca gggcagtgga tcatgccgtg 1260
atcggtggcg tcgtggcggt ggtggtgttc gccatgctgt gcttgctcat cattctgggg 1320
cgctattttg ccagacataa aggtacatac ttcactcatg aagccaaagg agccgatgac 1380
gcagcagacg cagacacagc tataatcaat gcagaaggag gacagaacaa ctccgaagaa 1440
aagaaagagt acttcatcta gatcagcctt tttgtttcaa tgaggtgtcc aactggccct 1500
atttagatga taaagagaca gtgatattgg aacttgcgag aaattcgtgt gtttttttat 1560
gaatgggtgg aaaggtgtga gactgggaag gcttgggatt tgctgtgtaa aaaaaaaaaa 1620
aatgttcttt ggaaagtaca ctctgctgtt tgacacctct tttttcgttt gtttgtttgt 1680
ttaattttta tttcttccta ccaagtcaaa cttggatact tggatttagt ttcagtagat 1740
tgcagaaaat tctgtgcctt gttttttgtt tgtttgttgc gttcctttct tttccccctt 1800
tgtgcacatt tatttcctcc ctctacccca atttcggatt ttttccaaaa tctcccattt 1860
tggaatttgc ctgctgggat tccttagact cttttccttc ccttttctgt tctagttttt 1920
tacttttgtt tatttttatg gtaactgctt tctgttccaa attcagtttc ataaaaggag 1980
aaccagcaca gcttagattt catagttcag aatttagtgt atccataatg cattcttctc 2040
tgttgtcgta aagatttggg tgaacaaaca atgaaaactc tttgctgctg cccatgtttc 2100
aaatacttag agcagtgaag actagaaaat tagactgtga ttcagaaaat gttctgtttg 2160
ctgtggaact acattactgt acagggttat ctgcaagtga ggtgtgtcac aatgagattg 2220
aatttcactg tctttaattc tgtatctgta gacggctcag tatagatacc ctacgctgtc 2280
cagaaaggtt tggggcagaa aggactcctc ctttttccat gccctaaaca gacctgacag 2340
gtgaggtctg ttccttttat ataagtggac aaattttgag ttgccacagg aggggaagta 2400
gggagggggg aaatacagtt ctgctctggt tgtttctgtt ccaaatgatt ccatccacct 2460
ttcccaatcg gccttacttc tcactaattt gtaggaaaaa gcaagttcgt ctgttgtgcg 2520
aatgactgaa tgggacagag ttgatttttt tttttttttc ctttgtgctt agttaggaag 2580
gcagtaggat gtggcctgca tgtactgtat attacagata tttgtcatgc tgggatttcc 2640
aactcgaatc tgtgtgaaac tttcattcct tcagatttgg cttgacaaag gcaggaggta 2700
caaaagaagg gctggtattg ttctcacact ggtctgctgt cgctctcagt tctcgatagg 2760
tcagagcaga ggtggaaaaa cagcatgtac ggattttcag ttacttaatc aaaactcaaa 2820
tgtgagtgtt tttatctttt tacctttcat acactagcct tggcctcttt cctcagcctt 2880
aagaaccatc tgccaaaaat tactgatcct cgcatgatgg cagccatagt gcatagctac 2940
taaaatcagt gaccttgaac atatcttaga tggggagcct cgggaaaagg tagaggagtc 3000
acgttaccat ttacatgttt taaagaaaga agtgtgggga ttttcactga aacgtctagg 3060
aaatctagaa gtagtcctga aggacagaaa ctaaactctt accatatgtt tggtaagact 3120
ccagactcca gctaacagtc cctatggaaa gatggcatca aaaaagatag atctatatat 3180
atatataaat atatattcta ttacattttc agtgagtaat tttggatttt gcaaggtgca 3240
tttttactat tgttacatta tgtggaaaac ttatgctgat ttatttaagg gggaaaaagt 3300
gtcaactctt tgttatttga aaacatgttt atttttcttg tctttatttt aacctttgat 3360
agaaccattg caatatgggg gccttttggg aacggactgg tatgtaaaag aaaatccatt 3420
atcgagcagc attttattta cccctcccct atccctaggc acttaaccaa gacaaaaagc 3480
cacaatgaac atcccttttt caatgaattt tataatctgc agctctattc cgagccctta 3540
gcacccattc cgaccatagt ataatcatat caaagggtga gaatcattta gcatgttgtt 3600
gaaaggtttt ttttcagttg ttctttttag aaaaaaagaa aaacaaaaac aaaaacaaaa 3660
aaaaaaaatc acaccattgc tcacagaatt ggcatctcat ttttgggacc tcccatcttt 3720
ctgttttgaa aagtgtacag tagtgcagtg ttcctgatgt aactttatgg cttacaatgt 3780
tgacatgtct caggttcatg tgttgcgatt ggtgttttcc gtctcaggta gattgcaaag 3840
tgtaggcccc acacattgga aaaaataata ataaaacaaa gcaaaaacag gaaattatgg 3900
attttagttg tatattggtt tatgtatttt ttcttaagta tacagtgcac tgtttgaaat 3960
gtattgttga gtattacttt gtacaggttg atcacttttt ttagagtgaa gaaagaacaa 4020
acttgttttt tgtgtttttt aaaggaatat aaaataatga aggatgtata attgatgcca 4080
aataagcttg ttctttagtc acaccgacgt cttatttttc cctttaggcc agttctgttt 4140
ttaaggtgta catggacaat gttacagtgt aagaaactcc atatccatat gttcccattc 4200
gcattttgta ttggttcatg tataccattt ttacaaaaaa aaaaagaaaa aaaagaagta 4260
ctataaaata tctgtcttct taataaaaaa aaattaatgt tacaaagtga 4310
<![CDATA[<210> 45]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 45]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Leu Ile Pro Thr
35 40
<![CDATA[<210> 46]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 46]]>
Asp Gly Gln Asn Leu Phe Thr Lys Asp Val Thr Val Ile Glu Gly Glu
1 5 10 15
Val Ala Thr Ile Ser Cys Gln Val Asn Lys Ser Asp Asp Ser Val Ile
20 25 30
Gln Leu Leu Asn Pro Asn Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg
35 40 45
<![CDATA[<210> 47]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 47]]>
Leu Lys Asp Ser Arg Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu
1 5 10 15
Lys Val Ser Leu Thr Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe
20 25 30
Cys Gln Leu Tyr Thr Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr
35 40 45
Val Leu
50
<![CDATA[<210> 48]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 48]]>
Pro Pro Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu
1 5 10 15
Gly Glu Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala
20 25 30
Thr Thr Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys
35 40 45
<![CDATA[<210> 49]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 49]]>
Lys Ser Glu Val Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln
1 5 10 15
Leu Met Leu Lys Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys
20 25 30
Gln Val Glu His Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr
35 40 45
Leu Glu Val Gln
50
<![CDATA[<210> 50]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 50]]>
Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu Thr
1 5 10 15
Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys Pro
20 25 30
<![CDATA[<210> 51]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 51]]>
Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro Gln His
1 5 10 15
Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn Lys Thr
20 25 30
Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly Lys Ala
35 40 45
His Ser Asp Tyr Met Leu Tyr Val Tyr
50 55
<![CDATA[<210> 52]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 52]]>
Thr Thr Ala Thr Thr Glu Pro Ala Val His
1 5 10
<![CDATA[<210> 53]]>
<![CDATA[<211> 17]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 53]]>
Leu Thr Gln Leu Pro Asn Ser Ala Glu Glu Leu Asp Ser Glu Asp Leu
1 5 10 15
Ser
<![CDATA[<210> 54]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 54]]>
Ser Arg Ala Gly Glu Glu Gly Ser Ile Arg Ala Val Asp His Ala Val
1 5 10 15
Ile Gly Gly Val Val Ala Val Val Val Phe Ala Met Leu Cys Leu Leu
20 25 30
Ile Ile Leu Gly Arg Tyr Phe Ala Arg His Lys
35 40
<![CDATA[<210> 55]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 55]]>
Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp Asp Ala Ala Asp Ala
1 5 10 15
Asp Thr Ala Ile Ile Asn Ala Glu Gly Gly Gln Asn Asn Ser Glu Glu
20 25 30
Lys Lys Glu Tyr Phe Ile
35
<![CDATA[<210> 56]]>
<![CDATA[<211> 443]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 56]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe
35 40 45
Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys
50 55 60
Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn
65 70 75 80
Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg
85 90 95
Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr
100 105 110
Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr
115 120 125
Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro
130 135 140
Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu
145 150 155 160
Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr
165 170 175
Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val
180 185 190
Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys
195 200 205
Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His
210 215 220
Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln
225 230 235 240
Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu
245 250 255
Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys
260 265 270
Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro
275 280 285
Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn
290 295 300
Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly
305 310 315 320
Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Thr Thr Ala Thr
325 330 335
Thr Glu Pro Ala Val His Gly Leu Thr Gln Leu Pro Asn Ser Ala Glu
340 345 350
Glu Leu Asp Ser Glu Asp Leu Ser Asp Ser Arg Ala Gly Glu Glu Gly
355 360 365
Ser Ile Arg Ala Val Asp His Ala Val Ile Gly Gly Val Val Ala Val
370 375 380
Val Val Phe Ala Met Leu Cys Leu Leu Ile Ile Leu Gly Arg Tyr Phe
385 390 395 400
Ala Arg His Lys Gly Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp
405 410 415
Asp Ala Ala Asp Ala Asp Thr Ala Ile Ile Asn Ala Glu Gly Gly Gln
420 425 430
Asn Asn Ser Glu Glu Lys Lys Glu Tyr Phe Ile
435 440
<![CDATA[<210> 57]]>
<![CDATA[<211> 1123]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 57]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca caggtgatgg gcagaatctg tttacgaaag acgtgacagt gatcgaggga 300
gaggttgcga ccatcagttg ccaagtcaat aagagtgacg actctgtgat tcagctactg 360
aatcccaaca ggcagaccat ttatttcagg gacttcaggc ctttgaagga cagcaggttt 420
cagttgctga atttttctag cagtgaactc aaagtatcat tgacaaacgt ctcaatttct 480
gatgaaggaa gatacttttg ccagctctat accgatcccc cacaggaaag ttacaccacc 540
atcacagtcc tggtcccacc acgtaatctg atgatcgata tccagaaaga cactgcggtg 600
gaaggtgagg agattgaagt caactgcact gctatggcca gcaagccagc cacgactatc 660
aggtggttca aagggaacac agagctaaaa ggcaaatcgg aggtggaaga gtggtcagac 720
atgtacactg tgaccagtca gctgatgctg aaggtgcaca aggaggacga tggggtccca 780
gtgatctgcc aggtggagca ccctgcggtc actggaaacc tgcagaccca gcggtatcta 840
gaagtacagt ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc 900
cgggaagggg acgcgcttga gttaacatgt gaagccatcg ggaagcccca gcctgtgatg 960
gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc tgggcccaac 1020
ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga agcttcaaac 1080
atagtgggga aagctcactc ggattatatg ctgtatgtat acg 1123
<![CDATA[<210> 58]]>
<![CDATA[<211> 331]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 58]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe
35 40 45
Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys
50 55 60
Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn
65 70 75 80
Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg
85 90 95
Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr
100 105 110
Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr
115 120 125
Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro
130 135 140
Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu
145 150 155 160
Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr
165 170 175
Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val
180 185 190
Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys
195 200 205
Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His
210 215 220
Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln
225 230 235 240
Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu
245 250 255
Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys
260 265 270
Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro
275 280 285
Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn
290 295 300
Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly
305 310 315 320
Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr
325 330
<![CDATA[<210> 59]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 59]]>
000
<![CDATA[<210> 60]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 60]]>
000
<![CDATA[<210> 61]]>
<![CDATA[<211> 200]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 61]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg 200
<![CDATA[<210> 62]]>
<![CDATA[<211> 166]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 62]]>
atttgaatat aacctgccgc tttgcaggtg tattccacgt ggagaaaaat ggtcgctaca 60
gcatctctcg gacggaggcc gctgacctct gcaaggcttt caatagcacc ttgcccacaa 120
tggcccagat ggagaaagct ctgagcatcg gatttgagac ctgcag 166
<![CDATA[<210> 63]]>
<![CDATA[<211> 134]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 63]]>
gtatgggttc atagaagggc acgtggtgat tccccggatc caccccaact ccatctgtgc 60
agcaaacaac acaggggtgt acatcctcac atccaacacc tcccagtatg acacatattg 120
cttcaatgct tcag 134
<![CDATA[<210> 64]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 64]]>
ctccacctga agaagattgt acatcagtca cagacctgcc caatgccttt gatggaccaa 60
ttaccataa 69
<![CDATA[<210> 65]]>
<![CDATA[<211> 231]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 65]]>
ctattgttaa ccgtgatggc acccgctatg tccagaaagg agaatacaga acgaatcctg 60
aagacatcta ccccagcaac cctactgatg atgacgtgag cagcggctcc tccagtgaaa 120
ggagcagcac ttcaggaggt tacatctttt acaccttttc tactgtacac cccatcccag 180
acgaagacag tccctggatc accgacagca cagacagaat ccctgctacc a 231
<![CDATA[<210> 66]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 66]]>
ctttgatgag cactagtgct acagcaactg agacagcaac caagaggcaa gaaacctggg 60
attggttttc atggttgttt ctaccatcag agtcaaagaa tcatcttcac acaacaacac 120
aaatggctg 129
<![CDATA[<210> 67]]>
<![CDATA[<211> 126]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 67]]>
gtacgtcttc aaataccatc tcagcaggct gggagccaaa tgaagaaaat gaagatgaaa 60
gagacagaca cctcagtttt tctggatcag gcattgatga tgatgaagat tttatctcca 120
gcacca 126
<![CDATA[<210> 68]]>
<![CDATA[<211> 114]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 68]]>
tttcaaccac accacgggct tttgaccaca caaaacagaa ccaggactgg acccagtgga 60
acccaagcca ttcaaatccg gaagtgctac ttcagacaac cacaaggatg actg 114
<![CDATA[<210> 69]]>
<![CDATA[<211> 117]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 69]]>
atgtagacag aaatggcacc actgcttatg aaggaaactg gaacccagaa gcacaccctc 60
ccctcattca ccatgagcat catgaggaag aagagacccc acattctaca agcacaa 117
<![CDATA[<210> 70]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 70]]>
tccaggcaac tcctagtagt acaacggaag aaacagctac ccagaaggaa cagtggtttg 60
gcaacagatg gcatgaggga tatcgccaaa cacccaaaga agactcccat tcgacaacag 120
ggacagctg 129
<![CDATA[<210> 71]]>
<![CDATA[<211> 132]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 71]]>
cagcctcagc tcataccagc catccaatgc aaggaaggac aacaccaagc ccagaggaca 60
gttcctggac tgatttcttc aacccaatct cacaccccat gggacgaggt catcaagcag 120
gaagaaggat gg 132
<![CDATA[<210> 72]]>
<![CDATA[<211> 102]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 72]]>
atatggactc cagtcatagt ataacgcttc agcctactgc aaatccaaac acaggtttgg 60
tggaagattt ggacaggaca ggacctcttt caatgacaac gc 102
<![CDATA[<210> 73]]>
<![CDATA[<211> 90]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 73]]>
agcagagtaa ttctcagagc ttctctacat cacatgaagg cttggaagaa gataaagacc 60
atccaacaac ttctactctg acatcaagca 90
<![CDATA[<210> 74]]>
<![CDATA[<211> 204]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 74]]>
ataggaatga tgtcacaggt ggaagaagag acccaaatca ttctgaaggc tcaactactt 60
tactggaagg ttatacctct cattacccac acacgaagga aagcaggacc ttcatcccag 120
tgacctcagc taagactggg tcctttggag ttactgcagt tactgttgga gattccaact 180
ctaatgtcaa tcgttcctta tcag 204
<![CDATA[<210> 75]]>
<![CDATA[<211> 63]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 75]]>
gagaccaaga cacattccac cccagtgggg ggtcccatac cactcatgga tctgaatcag 60
atg 63
<![CDATA[<210> 76]]>
<![CDATA[<211> 72]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400>]]> 76
gacactcaca tgggagtcaa gaaggtggag caaacacaac ctctggtcct ataaggacac 60
cccaaattcc ag 72
<![CDATA[<210> 77]]>
<![CDATA[<211> 79]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 77]]>
aatggctgat catcttggca tccctcttgg ccttggcttt gattcttgca gtttgcattg 60
cagtcaacag tcgaagaag 79
<![CDATA[<210> 78]]>
<![CDATA[<211> 3274]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 78]]>
gtgtgggcag aagaaaaagc tagtgatcaa cagtggcaat ggagctgtgg aggacagaaa 60
gccaagtgga ctcaacggag aggccagcaa gtctcaggaa atggtgcatt tggtgaacaa 120
ggagtcgtca gaaactccag accagtttat gacagctgat gagacaagga acctgcagaa 180
tgtggacatg aagattgggg tgtaacacct acaccattat cttggaaaga aacaaccgtt 240
ggaaacataa ccattacagg gagctgggac acttaacaga tgcaatgtgc tactgattgt 300
ttcattgcga atctttttta gcataaaatt ttctactctt tttgtttttt gtgttttgtt 360
ctttaaagtc aggtccaatt tgtaaaaaca gcattgcttt ctgaaattag ggcccaatta 420
ataatcagca agaatttgat cgttccagtt cccacttgga ggcctttcat ccctcgggtg 480
tgctatggat ggcttctaac aaaaactaca catatgtatt cctgatcgcc aacctttccc 540
ccaccagcta aggacatttc ccagggttaa tagggcctgg tccctgggag gaaatttgaa 600
tgggtccatt ttgcccttcc atagcctaat ccctgggcat tgctttccac tgaggttggg 660
ggttggggtg tactagttac acatcttcaa cagaccccct ctagaaattt ttcagatgct 720
tctgggagac acccaaaggg tgaagctatt tatctgtagt aaactattta tctgtgtttt 780
tgaaatatta aaccctggat cagtcctttg atcagtataa ttttttaaag ttactttgtc 840
agaggcacaa aagggtttaa actgattcat aataaatatc tgtacttctt cgatcttcac 900
cttttgtgct gtgattcttc agtttctaaa ccagcactgt ctgggtccct acaatgtatc 960
aggaagagct gagaatggta aggagactct tctaagtctt catctcagag accctgagtt 1020
cccactcaga cccactcagc caaatctcat ggaagaccaa ggagggcagc actgtttttg 1080
ttttttgttt tttgtttttt ttttttgaca ctgtccaaag gttttccatc ctgtcctgga 1140
atcagagttg gaagctgagg agcttcagcc tcttttatgg tttaatggcc acctgttctc 1200
tcctgtgaaa ggctttgcaa agtcacatta agtttgcatg acctgttatc cctggggccc 1260
tatttcatag aggctggccc tattagtgat ttccaaaaac aatatggaag tgccttttga 1320
tgtcttacaa taagagaaga agccaatgga aatgaaagag attggcaaag gggaaggatg 1380
atgccatgta gatcctgttt gacattttta tggctgtatt tgtaaactta aacacaccag 1440
tgtctgttct tgatgcagtt gctatttagg atgagttaag tgcctgggga gtccctcaaa 1500
aggttaaagg gattcccatc attggaatct tatcaccaga taggcaagtt tatgaccaaa 1560
caagagagta ctggctttat cctctaacct catattttct cccacttggc aagtcctttg 1620
tggcatttat tcatcagtca gggtgtccga ttggtcctag aacttccaaa ggctgcttgt 1680
catagaagcc attgcatcta taaagcaacg gctcctgtta aatggtatct cctttctgag 1740
gctcctacta aaagtcattt gttacctaaa cttatgtgct taacaggcaa tgcttctcag 1800
accacaaagc agaaagaaga agaaaagctc ctgactaaat cagggctggg cttagacaga 1860
gttgatctgt agaatatctt taaaggagag atgtcaactt tctgcactat tcccagcctc 1920
tgctcctccc tgtctaccct ctcccctccc tctctccctc cacttcaccc cacaatcttg 1980
aaaaacttcc tttctcttct gtgaacatca ttggccagat ccattttcag tggtctggat 2040
ttctttttat tttcttttca acttgaaaga aactggacat taggccacta tgtgttgtta 2100
ctgccactag tgttcaagtg cctcttgttt tcccagagat ttcctgggtc tgccagaggc 2160
ccagacaggc tcactcaagc tctttaactg aaaagcaaca agccactcca ggacaaggtt 2220
caaaatggtt acaacagcct ctacctgtcg ccccagggag aaaggggtag tgatacaagt 2280
ctcatagcca gagatggttt tccactcctt ctagatattc ccaaaaagag gctgagacag 2340
gaggttattt tcaattttat tttggaatta aatacttttt tccctttatt actgttgtag 2400
tccctcactt ggatatacct ctgttttcac gatagaaata agggaggtct agagcttcta 2460
ttccttggcc attgtcaacg gagagctggc caagtcttca caaacccttg caacattgcc 2520
tgaagtttat ggaataagat gtattctcac tcccttgatc tcaagggcgt aactctggaa 2580
gcacagcttg actacacgtc atttttacca atgattttca ggtgacctgg gctaagtcat 2640
ttaaactggg tctttataaa agtaaaaggc caacatttaa ttattttgca aagcaaccta 2700
agagctaaag atgtaatttt tcttgcaatt gtaaatcttt tgtgtctcct gaagacttcc 2760
cttaaaatta gctctgagtg aaaaatcaaa agagacaaaa gacatcttcg aatccatatt 2820
tcaagcctgg tagaattggc ttttctagca gaacctttcc aaaagtttta tattgagatt 2880
cataacaaca ccaagaattg attttgtagc caacattcat tcaatactgt tatatcagag 2940
gagtaggaga gaggaaacat ttgacttatc tggaaaagca aaatgtactt aagaataaga 3000
ataacatggt ccattcacct ttatgttata gatatgtctt tgtgtaaatc atttgttttg 3060
agttttcaaa gaatagccca ttgttcattc ttgtgctgta caatgaccac tgttattgtt 3120
actttgactt ttcagagcac acccttcctc tggtttttgt atatttattg atggatcaat 3180
aataatgagg aaagcatgat atgtatattg ctgagttgaa agcacttatt ggaaaatatt 3240
aaaaggctaa cattaaaaga ctaaaggaaa caga 3274
<![CDATA[<210> 79]]>
<![CDATA[<211> 5431]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 79]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg atttgaatat aacctgccgc tttgcaggtg tattccacgt 240
ggagaaaaat ggtcgctaca gcatctctcg gacggaggcc gctgacctct gcaaggcttt 300
caatagcacc ttgcccacaa tggcccagat ggagaaagct ctgagcatcg gatttgagac 360
ctgcaggtat gggttcatag aagggcacgt ggtgattccc cggatccacc ccaactccat 420
ctgtgcagca aacaacacag gggtgtacat cctcacatcc aacacctccc agtatgacac 480
atattgcttc aatgcttcag ctccacctga agaagattgt acatcagtca cagacctgcc 540
caatgccttt gatggaccaa ttaccataac tattgttaac cgtgatggca cccgctatgt 600
ccagaaagga gaatacagaa cgaatcctga agacatctac cccagcaacc ctactgatga 660
tgacgtgagc agcggctcct ccagtgaaag gagcagcact tcaggaggtt acatctttta 720
caccttttct actgtacacc ccatcccaga cgaagacagt ccctggatca ccgacagcac 780
agacagaatc cctgctacca ctttgatgag cactagtgct acagcaactg agacagcaac 840
caagaggcaa gaaacctggg attggttttc atggttgttt ctaccatcag agtcaaagaa 900
tcatcttcac acaacaacac aaatggctgg tacgtcttca aataccatct cagcaggctg 960
ggagccaaat gaagaaaatg aagatgaaag agacagacac ctcagttttt ctggatcagg 1020
cattgatgat gatgaagatt ttatctccag caccatttca accacaccac gggcttttga 1080
ccacacaaaa cagaaccagg actggaccca gtggaaccca agccattcaa atccggaagt 1140
gctacttcag acaaccacaa ggatgactga tgtagacaga aatggcacca ctgcttatga 1200
aggaaactgg aacccagaag cacaccctcc cctcattcac catgagcatc atgaggaaga 1260
agagacccca cattctacaa gcacaatcca ggcaactcct agtagtacaa cggaagaaac 1320
agctacccag aaggaacagt ggtttggcaa cagatggcat gagggatatc gccaaacacc 1380
caaagaagac tcccattcga caacagggac agctgcagcc tcagctcata ccagccatcc 1440
aatgcaagga aggacaacac caagcccaga ggacagttcc tggactgatt tcttcaaccc 1500
aatctcacac cccatgggac gaggtcatca agcaggaaga aggatggata tggactccag 1560
tcatagtata acgcttcagc ctactgcaaa tccaaacaca ggtttggtgg aagatttgga 1620
caggacagga cctctttcaa tgacaacgca gcagagtaat tctcagagct tctctacatc 1680
acatgaaggc ttggaagaag ataaagacca tccaacaact tctactctga catcaagcaa 1740
taggaatgat gtcacaggtg gaagaagaga cccaaatcat tctgaaggct caactacttt 1800
actggaaggt tatacctctc attacccaca cacgaaggaa agcaggacct tcatcccagt 1860
gacctcagct aagactgggt cctttggagt tactgcagtt actgttggag attccaactc 1920
taatgtcaat cgttccttat caggagacca agacacattc caccccagtg gggggtccca 1980
taccactcat ggatctgaat cagatggaca ctcacatggg agtcaagaag gtggagcaaa 2040
cacaacctct ggtcctataa ggacacccca aattccagaa tggctgatca tcttggcatc 2100
cctcttggcc ttggctttga ttcttgcagt ttgcattgca gtcaacagtc gaagaaggtg 2160
tgggcagaag aaaaagctag tgatcaacag tggcaatgga gctgtggagg acagaaagcc 2220
aagtggactc aacggagagg ccagcaagtc tcaggaaatg gtgcatttgg tgaacaagga 2280
gtcgtcagaa actccagacc agtttatgac agctgatgag acaaggaacc tgcagaatgt 2340
ggacatgaag attggggtgt aacacctaca ccattatctt ggaaagaaac aaccgttgga 2400
aacataacca ttacagggag ctgggacact taacagatgc aatgtgctac tgattgtttc 2460
attgcgaatc ttttttagca taaaattttc tactcttttt gttttttgtg ttttgttctt 2520
taaagtcagg tccaatttgt aaaaacagca ttgctttctg aaattagggc ccaattaata 2580
atcagcaaga atttgatcgt tccagttccc acttggaggc ctttcatccc tcgggtgtgc 2640
tatggatggc ttctaacaaa aactacacat atgtattcct gatcgccaac ctttccccca 2700
ccagctaagg acatttccca gggttaatag ggcctggtcc ctgggaggaa atttgaatgg 2760
gtccattttg cccttccata gcctaatccc tgggcattgc tttccactga ggttgggggt 2820
tggggtgtac tagttacaca tcttcaacag accccctcta gaaatttttc agatgcttct 2880
gggagacacc caaagggtga agctatttat ctgtagtaaa ctatttatct gtgtttttga 2940
aatattaaac cctggatcag tcctttgatc agtataattt tttaaagtta ctttgtcaga 3000
ggcacaaaag ggtttaaact gattcataat aaatatctgt acttcttcga tcttcacctt 3060
ttgtgctgtg attcttcagt ttctaaacca gcactgtctg ggtccctaca atgtatcagg 3120
aagagctgag aatggtaagg agactcttct aagtcttcat ctcagagacc ctgagttccc 3180
actcagaccc actcagccaa atctcatgga agaccaagga gggcagcact gtttttgttt 3240
tttgtttttt gttttttttt tttgacactg tccaaaggtt ttccatcctg tcctggaatc 3300
agagttggaa gctgaggagc ttcagcctct tttatggttt aatggccacc tgttctctcc 3360
tgtgaaaggc tttgcaaagt cacattaagt ttgcatgacc tgttatccct ggggccctat 3420
ttcatagagg ctggccctat tagtgatttc caaaaacaat atggaagtgc cttttgatgt 3480
cttacaataa gagaagaagc caatggaaat gaaagagatt ggcaaagggg aaggatgatg 3540
ccatgtagat cctgtttgac atttttatgg ctgtatttgt aaacttaaac acaccagtgt 3600
ctgttcttga tgcagttgct atttaggatg agttaagtgc ctggggagtc cctcaaaagg 3660
ttaaagggat tcccatcatt ggaatcttat caccagatag gcaagtttat gaccaaacaa 3720
gagagtactg gctttatcct ctaacctcat attttctccc acttggcaag tcctttgtgg 3780
catttattca tcagtcaggg tgtccgattg gtcctagaac ttccaaaggc tgcttgtcat 3840
agaagccatt gcatctataa agcaacggct cctgttaaat ggtatctcct ttctgaggct 3900
cctactaaaa gtcatttgtt acctaaactt atgtgcttaa caggcaatgc ttctcagacc 3960
acaaagcaga aagaagaaga aaagctcctg actaaatcag ggctgggctt agacagagtt 4020
gatctgtaga atatctttaa aggagagatg tcaactttct gcactattcc cagcctctgc 4080
tcctccctgt ctaccctctc ccctccctct ctccctccac ttcaccccac aatcttgaaa 4140
aacttccttt ctcttctgtg aacatcattg gccagatcca ttttcagtgg tctggatttc 4200
tttttatttt cttttcaact tgaaagaaac tggacattag gccactatgt gttgttactg 4260
ccactagtgt tcaagtgcct cttgttttcc cagagatttc ctgggtctgc cagaggccca 4320
gacaggctca ctcaagctct ttaactgaaa agcaacaagc cactccagga caaggttcaa 4380
aatggttaca acagcctcta cctgtcgccc cagggagaaa ggggtagtga tacaagtctc 4440
atagccagag atggttttcc actccttcta gatattccca aaaagaggct gagacaggag 4500
gttattttca attttatttt ggaattaaat acttttttcc ctttattact gttgtagtcc 4560
ctcacttgga tatacctctg ttttcacgat agaaataagg gaggtctaga gcttctattc 4620
cttggccatt gtcaacggag agctggccaa gtcttcacaa acccttgcaa cattgcctga 4680
agtttatgga ataagatgta ttctcactcc cttgatctca agggcgtaac tctggaagca 4740
cagcttgact acacgtcatt tttaccaatg attttcaggt gacctgggct aagtcattta 4800
aactgggtct ttataaaagt aaaaggccaa catttaatta ttttgcaaag caacctaaga 4860
gctaaagatg taatttttct tgcaattgta aatcttttgt gtctcctgaa gacttccctt 4920
aaaattagct ctgagtgaaa aatcaaaaga gacaaaagac atcttcgaat ccatatttca 4980
agcctggtag aattggcttt tctagcagaa cctttccaaa agttttatat tgagattcat 5040
aacaacacca agaattgatt ttgtagccaa cattcattca atactgttat atcagaggag 5100
taggagagag gaaacatttg acttatctgg aaaagcaaaa tgtacttaag aataagaata 5160
acatggtcca ttcaccttta tgttatagat atgtctttgt gtaaatcatt tgttttgagt 5220
tttcaaagaa tagcccattg ttcattcttg tgctgtacaa tgaccactgt tattgttact 5280
ttgacttttc agagcacacc cttcctctgg tttttgtata tttattgatg gatcaataat 5340
aatgaggaaa gcatgatatg tatattgctg agttgaaagc acttattgga aaatattaaa 5400
aggctaacat taaaagacta aaggaaacag a 5431
<![CDATA[<210> 80]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 80]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile
20
<![CDATA[<210> 81]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 81]]>
Leu Asn Ile Thr Cys Arg Phe Ala Gly Val Phe His Val Glu Lys Asn
1 5 10 15
Gly Arg Tyr Ser Ile Ser Arg Thr Glu Ala Ala Asp Leu Cys Lys Ala
20 25 30
Phe Asn Ser Thr Leu Pro Thr Met Ala Gln Met Glu Lys Ala Leu Ser
35 40 45
Ile Gly Phe Glu Thr Cys
50
<![CDATA[<210> 82]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 82]]>
Tyr Gly Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn
1 5 10 15
Ser Ile Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn
20 25 30
Thr Ser Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser
35 40
<![CDATA[<210> 83]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 83]]>
Pro Pro Glu Glu Asp Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe
1 5 10 15
Asp Gly Pro Ile Thr Ile
20
<![CDATA[<210> 84]]>
<![CDATA[<211> 76]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 84]]>
Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu Tyr Arg
1 5 10 15
Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp Asp Val
20 25 30
Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly Tyr Ile
35 40 45
Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp Ser Pro
50 55 60
Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr
65 70 75
<![CDATA[<210> 85]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 85]]>
Leu Met Ser Thr Ser Ala Thr Ala Thr Glu Thr Ala Thr Lys Arg Gln
1 5 10 15
Glu Thr Trp Asp Trp Phe Ser Trp Leu Phe Leu Pro Ser Glu Ser Lys
20 25 30
Asn His Leu His Thr Thr Thr Gln Met Ala
35 40
<![CDATA[<210> 86]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 86]]>
Thr Ser Ser Asn Thr Ile Ser Ala Gly Trp Glu Pro Asn Glu Glu Asn
1 5 10 15
Glu Asp Glu Arg Asp Arg His Leu Ser Phe Ser Gly Ser Gly Ile Asp
20 25 30
Asp Asp Glu Asp Phe Ile Ser Ser Thr
35 40
<![CDATA[<210> 87]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 87]]>
Ser Thr Thr Pro Arg Ala Phe Asp His Thr Lys Gln Asn Gln Asp Trp
1 5 10 15
Thr Gln Trp Asn Pro Ser His Ser Asn Pro Glu Val Leu Leu Gln Thr
20 25 30
Thr Thr Arg Met Thr
35
<![CDATA[<210> 88]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 88]]>
Val Asp Arg Asn Gly Thr Thr Ala Tyr Glu Gly Asn Trp Asn Pro Glu
1 5 10 15
Ala His Pro Pro Leu Ile His His Glu His His Glu Glu Glu Glu Thr
20 25 30
Pro His Ser Thr Ser Thr
35
<![CDATA[<210> 89]]>
<![CDATA[<211> 4]]>2
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 89]]>
Gln Ala Thr Pro Ser Ser Thr Thr Glu Glu Thr Ala Thr Gln Lys Glu
1 5 10 15
Gln Trp Phe Gly Asn Arg Trp His Glu Gly Tyr Arg Gln Thr Pro Lys
20 25 30
Glu Asp Ser His Ser Thr Thr Gly Thr Ala
35 40
<![CDATA[<210> 90]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 90]]>
Ala Ser Ala His Thr Ser His Pro Met Gln Gly Arg Thr Thr Pro Ser
1 5 10 15
Pro Glu Asp Ser Ser Trp Thr Asp Phe Phe Asn Pro Ile Ser His Pro
20 25 30
Met Gly Arg Gly His Gln Ala Gly Arg Arg Met
35 40
<![CDATA[<210> 91]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 91]]>
Met Asp Ser Ser His Ser Ile Thr Leu Gln Pro Thr Ala Asn Pro Asn
1 5 10 15
Thr Gly Leu Val Glu Asp Leu Asp Arg Thr Gly Pro Leu Ser Met Thr
20 25 30
Thr
<![CDATA[<210> 92]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 92]]>
Gln Ser Asn Ser Gln Ser Phe Ser Thr Ser His Glu Gly Leu Glu Glu
1 5 10 15
Asp Lys Asp His Pro Thr Thr Ser Thr Leu Thr Ser Ser
20 25
<![CDATA[<210> 93]]>
<![CDATA[<211> ]]> 67
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 93]]>
Arg Asn Asp Val Thr Gly Gly Arg Arg Asp Pro Asn His Ser Glu Gly
1 5 10 15
Ser Thr Thr Leu Leu Glu Gly Tyr Thr Ser His Tyr Pro His Thr Lys
20 25 30
Glu Ser Arg Thr Phe Ile Pro Val Thr Ser Ala Lys Thr Gly Ser Phe
35 40 45
Gly Val Thr Ala Val Thr Val Gly Asp Ser Asn Ser Asn Val Asn Arg
50 55 60
Ser Leu Ser
65
<![CDATA[<210> 94]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 94]]>
Asp Gln Asp Thr Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly
1 5 10 15
Ser Glu Ser Asp
20
<![CDATA[<210> 95]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 95]]>
His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr Thr Ser Gly Pro
1 5 10 15
Ile Arg Thr Pro Gln Ile Pro
20
<![CDATA[<210> 96]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 96]]>
Trp Leu Ile Ile Leu Ala Ser Leu Leu Ala Leu Ala Leu Ile Leu Ala
1 5 10 15
Val Cys Ile Ala Val Asn Ser Arg Arg
20 25
<![CDATA[<210> 97]]>
<![CDATA[<211> 67]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 97]]>
Cys Gly Gln Lys Lys Lys Leu Val Ile Asn Ser Gly Asn Gly Ala Val
1 5 10 15
Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu Ala Ser Lys Ser Gln
20 25 30
Glu Met Val His Leu Val Asn Lys Glu Ser Ser Glu Thr Pro Asp Gln
35 40 45
Phe Met Thr Ala Asp Glu Thr Arg Asn Leu Gln Asn Val Asp Met Lys
50 55 60
Ile Gly Val
65
<![CDATA[<210> 98]]>
<![CDATA[<211> 742]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 98]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30
Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45
Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met Ala
50 55 60
Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly
65 70 75 80
Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser Ile
85 90 95
Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110
Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125
Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro Ile Thr
130 135 140
Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu
145 150 155 160
Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175
Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190
Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205
Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr Thr Leu
210 215 220
Met Ser Thr Ser Ala Thr Ala Thr Glu Thr Ala Thr Lys Arg Gln Glu
225 230 235 240
Thr Trp Asp Trp Phe Ser Trp Leu Phe Leu Pro Ser Glu Ser Lys Asn
245 250 255
His Leu His Thr Thr Thr Gln Met Ala Gly Thr Ser Ser Asn Thr Ile
260 265 270
Ser Ala Gly Trp Glu Pro Asn Glu Glu Asn Glu Asp Glu Arg Asp Arg
275 280 285
His Leu Ser Phe Ser Gly Ser Gly Ile Asp Asp Asp Glu Asp Phe Ile
290 295 300
Ser Ser Thr Ile Ser Thr Thr Pro Arg Ala Phe Asp His Thr Lys Gln
305 310 315 320
Asn Gln Asp Trp Thr Gln Trp Asn Pro Ser His Ser Asn Pro Glu Val
325 330 335
Leu Leu Gln Thr Thr Thr Arg Met Thr Asp Val Asp Arg Asn Gly Thr
340 345 350
Thr Ala Tyr Glu Gly Asn Trp Asn Pro Glu Ala His Pro Pro Leu Ile
355 360 365
His His Glu His His Glu Glu Glu Glu Thr Pro His Ser Thr Ser Thr
370 375 380
Ile Gln Ala Thr Pro Ser Ser Thr Thr Glu Glu Thr Ala Thr Gln Lys
385 390 395 400
Glu Gln Trp Phe Gly Asn Arg Trp His Glu Gly Tyr Arg Gln Thr Pro
405 410 415
Lys Glu Asp Ser His Ser Thr Thr Gly Thr Ala Ala Ala Ser Ala His
420 425 430
Thr Ser His Pro Met Gln Gly Arg Thr Thr Pro Ser Pro Glu Asp Ser
435 440 445
Ser Trp Thr Asp Phe Phe Asn Pro Ile Ser His Pro Met Gly Arg Gly
450 455 460
His Gln Ala Gly Arg Arg Met Asp Met Asp Ser Ser His Ser Ile Thr
465 470 475 480
Leu Gln Pro Thr Ala Asn Pro Asn Thr Gly Leu Val Glu Asp Leu Asp
485 490 495
Arg Thr Gly Pro Leu Ser Met Thr Thr Gln Gln Ser Asn Ser Gln Ser
500 505 510
Phe Ser Thr Ser His Glu Gly Leu Glu Glu Asp Lys Asp His Pro Thr
515 520 525
Thr Ser Thr Leu Thr Ser Ser Asn Arg Asn Asp Val Thr Gly Gly Arg
530 535 540
Arg Asp Pro Asn His Ser Glu Gly Ser Thr Thr Leu Leu Glu Gly Tyr
545 550 555 560
Thr Ser His Tyr Pro His Thr Lys Glu Ser Arg Thr Phe Ile Pro Val
565 570 575
Thr Ser Ala Lys Thr Gly Ser Phe Gly Val Thr Ala Val Thr Val Gly
580 585 590
Asp Ser Asn Ser Asn Val Asn Arg Ser Leu Ser Gly Asp Gln Asp Thr
595 600 605
Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly Ser Glu Ser Asp
610 615 620
Gly His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr Thr Ser Gly
625 630 635 640
Pro Ile Arg Thr Pro Gln Ile Pro Glu Trp Leu Ile Ile Leu Ala Ser
645 650 655
Leu Leu Ala Leu Ala Leu Ile Leu Ala Val Cys Ile Ala Val Asn Ser
660 665 670
Arg Arg Arg Cys Gly Gln Lys Lys Lys Leu Val Ile Asn Ser Gly Asn
675 680 685
Gly Ala Val Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu Ala Ser
690 695 700
Lys Ser Gln Glu Met Val His Leu Val Asn Lys Glu Ser Ser Glu Thr
705 710 715 720
Pro Asp Gln Phe Met Thr Ala Asp Glu Thr Arg Asn Leu Gln Asn Val
725 730 735
Asp Met Lys Ile Gly Val
740
<![CDATA[<210> 99]]>
<![CDATA[<211> 800]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 99]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg atttgaatat aacctgccgc tttgcaggtg tattccacgt 240
ggagaaaaat ggtcgctaca gcatctctcg gacggaggcc gctgacctct gcaaggcttt 300
caatagcacc ttgcccacaa tggcccagat ggagaaagct ctgagcatcg gatttgagac 360
ctgcaggtat gggttcatag aagggcacgt ggtgattccc cggatccacc ccaactccat 420
ctgtgcagca aacaacacag gggtgtacat cctcacatcc aacacctccc agtatgacac 480
atattgcttc aatgcttcag ctccacctga agaagattgt acatcagtca cagacctgcc 540
caatgccttt gatggaccaa ttaccataac tattgttaac cgtgatggca cccgctatgt 600
ccagaaagga gaatacagaa cgaatcctga agacatctac cccagcaacc ctactgatga 660
tgacgtgagc agcggctcct ccagtgaaag gagcagcact tcaggaggtt acatctttta 720
caccttttct actgtacacc ccatcccaga cgaagacagt ccctggatca ccgacagcac 780
agacagaatc cctgctacca 800
<![CDATA[<210> 100]]>
<![CDATA[<211> 222]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 100]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30
Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45
Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met Ala
50 55 60
Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly
65 70 75 80
Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser Ile
85 90 95
Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110
Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125
Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro Ile Thr
130 135 140
Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu
145 150 155 160
Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175
Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190
Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205
Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr
210 215 220
<![CDATA[<210> 101]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 101]]>
000
<![CDATA[<210> 102]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 102]]>
000
<![CDATA[<210> 103]]>
<![CDATA[<211> 552]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 103]]>
agatgcagta gccgaaactg cgcggaggca cagaggccgg ggagagcgtt ctgggtccga 60
gggtccaggt aggggttgag ccaccatctg accgcaagct gcgtcgtgtc gccggttctg 120
caggcaccat gagccaggac accgaggtgg atatgaagga ggtggagctg aatgagttag 180
agcccgagaa gcagccgatg aacgcggcgt ctggggcggc catgtccctg gcgggagccg 240
agaagaatgg tctggtgaag atcaaggtgg cggaagacga ggcggaggcg gcagccgcgg 300
ctaagttcac gggcctgtcc aaggaggagc tgctgaaggt ggcaggcagc cccggctggg 360
tacgcacccg ctgggcactg ctgctgctct tctggctcgg ctggctcggc atgcttgctg 420
gtgccgtggt cataatcgtg cgagcgccgc gttgtcgcga gctaccggcg cagaagtggt 480
ggcacacggg cgccctctac cgcatcggcg accttcaggc cttccagggc cacggcgcgg 540
gcaacctggc gg 552
<![CDATA[<210> 104]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 104]]>
gtctgaaggg gcgtctcgat tacctgagct ctctgaaggt gaagggcctt gtgctgggtc 60
caattcacaa gaaccagaag gatgatgtcg ctcagactga cttgctgcag atcgacccca 120
attttggctc caaggaagat tttgacagtc tcttgcaatc ggctaaaaaa aaga 174
<![CDATA[<210> 105]]>
<![CDATA[<211> 91]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 105]]>
gcatccgtgt cattctggac cttactccca actaccgggg tgagaactcg tggttctcca 60
ctcaggttga cactgtggcc accaaggtga a 91
<![CDATA[<210> 106]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 106]]>
gatgctctgg agttttggct gcaagctggc gtggatgggt tccaggttcg ggacatagag 60
aatctgaag 69
<![CDATA[<210> 107]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 107]]>
gatgcatcct cattcttggc tgagtggcaa aatatcacca agggcttcag tgaagacag 59
<![CDATA[<210> 108]]>
<![CDATA[<211> 181]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 108]]>
gctcttgatt gcggggacta actcctccga ccttcagcag atcctgagcc tactcgaatc 60
caacaaagac ttgctgttga ctagctcata cctgtctgat tctggttcta ctggggagca 120
tacaaaatcc ctagtcacac agtatttgaa tgccactggc aatcgctggt gcagctggag 180
t 181
<![CDATA[<210> 109]]>
<![CDATA[<211> 144]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 109]]>
ttgtctcagg caaggctcct gacttccttc ttgccggctc aacttctccg actctaccag 60
ctgatgctct tcaccctgcc agggacccct gttttcagct acggggatga gattggcctg 120
gatgcagctg cccttcctgg acag 144
<![CDATA[<210> 110]]>
<![CDATA[<211> 84]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 110]]>
cctatggagg ctccagtcat gctgtgggat gagtccagct tccctgacat cccaggggct 60
gtaagtgcca acatgactgt gaag 84
<![CDATA[<210> 111]]>
<![CDATA[<211> 530]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 111]]>
ggccagagtg aagaccctgg ctccctcctt tccttgttcc ggcggctgag tgaccagcgg 60
agtaaggagc gctccctact gcatggggac ttccacgcgt tctccgctgg gcctggactc 120
ttctcctata tccgccactg ggaccagaat gagcgttttc tggtagtgct taactttggg 180
gatgtgggcc tctcggctgg actgcaggcc tccgacctgc ctgccagcgc cagcctgcca 240
gccaaggctg acctcctgct cagcacccag ccaggccgtg aggagggctc ccctcttgag 300
ctggaacgcc tgaaactgga gcctcacgaa gggctgctgc tccgcttccc ctacgcggcc 360
tgacttcagc ctgacatgga cccactaccc ttctcctttc cttcccaggc cctttggctt 420
ctgatttttc tcttttttaa aaacaaacaa acaaactgtt gcagattatg agtgaacccc 480
caaatagggt gttttctgcc ttcaaataaa agtcacccct gcatggtgaa 530
<![CDATA[<210> 112]]>
<![CDATA[<211> 1884]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 112]]>
agatgcagta gccgaaactg cgcggaggca cagaggccgg ggagagcgtt ctgggtccga 60
gggtccaggt aggggttgag ccaccatctg accgcaagct gcgtcgtgtc gccggttctg 120
caggcaccat gagccaggac accgaggtgg atatgaagga ggtggagctg aatgagttag 180
agcccgagaa gcagccgatg aacgcggcgt ctggggcggc catgtccctg gcgggagccg 240
agaagaatgg tctggtgaag atcaaggtgg cggaagacga ggcggaggcg gcagccgcgg 300
ctaagttcac gggcctgtcc aaggaggagc tgctgaaggt ggcaggcagc cccggctggg 360
tacgcacccg ctgggcactg ctgctgctct tctggctcgg ctggctcggc atgcttgctg 420
gtgccgtggt cataatcgtg cgagcgccgc gttgtcgcga gctaccggcg cagaagtggt 480
ggcacacggg cgccctctac cgcatcggcg accttcaggc cttccagggc cacggcgcgg 540
gcaacctggc gggtctgaag gggcgtctcg attacctgag ctctctgaag gtgaagggcc 600
ttgtgctggg tccaattcac aagaaccaga aggatgatgt cgctcagact gacttgctgc 660
agatcgaccc caattttggc tccaaggaag attttgacag tctcttgcaa tcggctaaaa 720
aaaagagcat ccgtgtcatt ctggacctta ctcccaacta ccggggtgag aactcgtggt 780
tctccactca ggttgacact gtggccacca aggtgaagat gctctggagt tttggctgca 840
agctggcgtg gatgggttcc aggttcggga catagagaat ctgaaggatg catcctcatt 900
cttggctgag tggcaaaata tcaccaaggg cttcagtgaa gacaggctct tgattgcggg 960
gactaactcc tccgaccttc agcagatcct gagcctactc gaatccaaca aagacttgct 1020
gttgactagc tcatacctgt ctgattctgg ttctactggg gagcatacaa aatccctagt 1080
cacacagtat ttgaatgcca ctggcaatcg ctggtgcagc tggagtttgt ctcaggcaag 1140
gctcctgact tccttcttgc cggctcaact tctccgactc taccagctga tgctcttcac 1200
cctgccaggg acccctgttt tcagctacgg ggatgagatt ggcctggatg cagctgccct 1260
tcctggacag cctatggagg ctccagtcat gctgtgggat gagtccagct tccctgacat 1320
cccaggggct gtaagtgcca acatgactgt gaagggccag agtgaagacc ctggctccct 1380
cctttccttg ttccggcggc tgagtgacca gcggagtaag gagcgctccc tactgcatgg 1440
ggacttccac gcgttctccg ctgggcctgg actcttctcc tatatccgcc actgggacca 1500
gaatgagcgt tttctggtag tgcttaactt tggggatgtg ggcctctcgg ctggactgca 1560
ggcctccgac ctgcctgcca gcgccagcct gccagccaag gctgacctcc tgctcagcac 1620
ccagccaggc cgtgaggagg gctcccctct tgagctggaa cgcctgaaac tggagcctca 1680
cgaagggctg ctgctccgct tcccctacgc ggcctgactt cagcctgaca tggacccact 1740
acccttctcc tttccttccc aggccctttg gcttctgatt tttctctttt ttaaaaacaa 1800
acaaacaaac tgttgcagat tatgagtgaa cccccaaata gggtgttttc tgccttcaaa 1860
taaaagtcac ccctgcatgg tgaa 1884
<![CDATA[<210> 113]]>
<![CDATA[<211> 141]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 113]]>
Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn Glu
1 5 10 15
Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala Met
20 25 30
Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val Ala
35 40 45
Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu Ser
50 55 60
Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg Thr
65 70 75 80
Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met Leu
85 90 95
Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu Leu
100 105 110
Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly Asp
115 120 125
Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala
130 135 140
<![CDATA[<210> 114]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 114]]>
Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu
1 5 10 15
Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr
20 25 30
Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp
35 40 45
Ser Leu Leu Gln Ser Ala Lys Lys Lys
50 55
<![CDATA[<210> 115]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 115]]>
Ile Arg Val Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser
1 5 10 15
Trp Phe Ser Thr Gln Val Asp Thr Val Ala Thr Lys Val
20 25
<![CDATA[<210> 116]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 116]]>
Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val
1 5 10 15
Arg Asp Ile Glu Asn Leu Lys
20
<![CDATA[<210> 117]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 117]]>
Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe
1 5 10 15
Ser Glu Asp
<![CDATA[<210> 118]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 118]]>
Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu Ser
1 5 10 15
Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu Ser
20 25 30
Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln Tyr
35 40 45
Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser
50 55 60
<![CDATA[<210> 119]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 119]]>
Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu
1 5 10 15
Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe
20 25 30
Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln
35 40 45
<![CDATA[<210> 120]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 120]]>
Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp
1 5 10 15
Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys
20 25
<![CDATA[<210> 121]]>
<![CDATA[<211> 120]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 121]]>
Gly Gln Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu
1 5 10 15
Ser Asp Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His
20 25 30
Ala Phe Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp
35 40 45
Gln Asn Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu
50 55 60
Ser Ala Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro
65 70 75 80
Ala Lys Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly
85 90 95
Ser Pro Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu
100 105 110
Leu Leu Arg Phe Pro Tyr Ala Ala
115 120
<![CDATA[<210> 122]]>
<![CDATA[<211> 529]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 122]]>
Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn Glu
1 5 10 15
Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala Met
20 25 30
Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val Ala
35 40 45
Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu Ser
50 55 60
Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg Thr
65 70 75 80
Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met Leu
85 90 95
Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu Leu
100 105 110
Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly Asp
115 120 125
Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala Gly Leu Lys
130 135 140
Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu Val Leu
145 150 155 160
Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr Asp Leu
165 170 175
Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp Ser Leu
180 185 190
Leu Gln Ser Ala Lys Lys Lys Ser Ile Arg Val Ile Leu Asp Leu Thr
195 200 205
Pro Asn Tyr Arg Gly Glu Asn Ser Trp Phe Ser Thr Gln Val Asp Thr
210 215 220
Val Ala Thr Lys Val Lys Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly
225 230 235 240
Val Asp Gly Phe Gln Val Arg Asp Ile Glu Asn Leu Lys Asp Ala Ser
245 250 255
Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe Ser Glu Asp
260 265 270
Arg Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu
275 280 285
Ser Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu
290 295 300
Ser Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln
305 310 315 320
Tyr Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser Leu Ser Gln
325 330 335
Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu Arg Leu Tyr
340 345 350
Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe Ser Tyr Gly
355 360 365
Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln Pro Met Glu
370 375 380
Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp Ile Pro Gly
385 390 395 400
Ala Val Ser Ala Asn Met Thr Val Lys Gly Gln Ser Glu Asp Pro Gly
405 410 415
Ser Leu Leu Ser Leu Phe Arg Arg Leu Ser Asp Gln Arg Ser Lys Glu
420 425 430
Arg Ser Leu Leu His Gly Asp Phe His Ala Phe Ser Ala Gly Pro Gly
435 440 445
Leu Phe Ser Tyr Ile Arg His Trp Asp Gln Asn Glu Arg Phe Leu Val
450 455 460
Val Leu Asn Phe Gly Asp Val Gly Leu Ser Ala Gly Leu Gln Ala Ser
465 470 475 480
Asp Leu Pro Ala Ser Ala Ser Leu Pro Ala Lys Ala Asp Leu Leu Leu
485 490 495
Ser Thr Gln Pro Gly Arg Glu Glu Gly Ser Pro Leu Glu Leu Glu Arg
500 505 510
Leu Lys Leu Glu Pro His Glu Gly Leu Leu Leu Arg Phe Pro Tyr Ala
515 520 525
Ala
<![CDATA[<210> 123]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 123]]>
000
<![CDATA[<210> 124]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 124]]>
000
<![CDATA[<210> 125]]>
<![CDATA[<211> 187]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 125]]>
agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60
cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120
tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180
accacag 187
<![CDATA[<210> 126]]>
<![CDATA[<211> 178]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 126]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120
ggaatgaatt gaatcgaaaa aacaaaccac aaaatatcaa gatacaaaaa aagccagg 178
<![CDATA[<210> 127]]>
<![CDATA[<211> 122]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 127]]>
gaagtcagaa cttcgcatta acaaagcatc actggctgat tctggagagt atatgtgcaa 60
agtgatcagc aaattaggaa atgacagtgc ctctgccaat atcaccatcg tggaatcaaa 120
cg 122
<![CDATA[<210> 128]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 128]]>
agatcatcac tggtatgcca gcctcaactg aaggagcata tgtgtcttca g 51
<![CDATA[<210> 129]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 129]]>
agtctcccat tagaatatca gtatccacag aaggagcaaa tacttcttca t 51
<![CDATA[<210> 130]]>
<![CDATA[<211> 1]]>30
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 130]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttgtgcaa 130
<![CDATA[<210> 131]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 131]]>
gtgcccaaat gagtttactg gtgatcgctg ccaaaactac gtaatggcca gcttctaca 59
<![CDATA[<210> 132]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 132]]>
agcatcttgg gattgaattt atgg 24
<![CDATA[<210> 133]]>
<![CDATA[<211> 103]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 133]]>
aggcggagga gctgtaccag aagagagtgc tgaccataac cggcatctgc atcgccctcc 60
ttgtggtcgg catcatgtgt gtggtggcct actgcaaaac caa 103
<![CDATA[<210> 134]]>
<![CDATA[<211> 127]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 134]]>
gaaacagcgg aaaaagctgc atgaccgtct tcggcagagc cttcggtctg aacgaaacaa 60
tatgatgaac attgccaatg ggcctcacca tcctaaccca ccccccgaga atgtccagct 120
ggtgaat 127
<![CDATA[<210> 135]]>
<![CDATA[<211> 131]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 135]]>
caatacgtat ctaaaaacgt catctccagt gagcatattg ttgagagaga agcagagaca 60
tccttttcca ccagtcacta tacttccaca gcccatcact ccactactgt cacccagact 120
cctagccaca g 131
<![CDATA[<210> 136]]>
<![CDATA[<211> 207]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 136]]>
ctggagcaac ggacacactg aaagcatcct ttccgaaagc cactctgtaa tcgtgatgtc 60
atccgtagaa aacagtaggc acagcagccc aactgggggc ccaagaggac gtcttaatgg 120
cacaggaggc cctcgtgaat gtaacagctt cctcaggcat gccagagaaa cccctgattc 180
ctaccgagac tctcctcata gtgaaag 207
<![CDATA[<210> 137]]>
<![CDATA[<211> 10299]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 137]]>
gtatgtgtca gccatgacca ccccggctcg tatgtcacct gtagatttcc acacgccaag 60
ctcccccaaa tcgccccctt cggaaatgtc tccacccgtg tccagcatga cggtgtccat 120
gccttccatg gcggtcagcc ccttcatgga agaagagaga cctctacttc tcgtgacacc 180
accaaggctg cgggagaaga agtttgacca tcaccctcag cagttcagct ccttccacca 240
caaccccgcg catgacagta acagcctccc tgctagcccc ttgaggatag tggaggatga 300
ggagtatgaa acgacccaag agtacgagcc agcccaagag cctgttaaga aactcgccaa 360
tagccggcgg gccaaaagaa ccaagcccaa tggccacatt gctaacagat tggaagtgga 420
cagcaacaca agctcccaga gcagtaactc agagagtgaa acagaagatg aaagagtagg 480
tgaagatacg cctttcctgg gcatacagaa ccccctggca gccagtcttg aggcaacacc 540
tgccttccgc ctggctgaca gcaggactaa cccagcaggc cgcttctcga cacaggaaga 600
aatccaggcc aggctgtcta gtgtaattgc taaccaagac cctattgctg tataaaacct 660
aaataaacac atagattcac ctgtaaaact ttattttata taataaagta ttccacctta 720
aattaaacaa tttattttat tttagcagtt ctgcaaatag aaaacaggaa aaaaactttt 780
ataaattaaa tatatgtatg taaaaatgtg ttatgtgcca tatgtagcaa ttttttacag 840
tatttcaaaa cgagaaagat atcaatggtg cctttatgtt atgttatgtc gagagcaagt 900
tttgtacagt tacagtgatt gcttttccac agtatttctg caaaacctct catagattca 960
gtttttgctg gcttcttgtg cattgcatta tgatgttgac tggatgtatg atttgcaaga 1020
cttgcaactg tccctctgtt tgcttgtagt agcacccgat cagtatgtct tgtaatggca 1080
catccatcca gatatgcctc tcttgtgtat gaagttttct ttgctttcag aatatgaaat 1140
gagttgtgtc tactctgcca gccaaaggtt tgcctcattg ggctctgaga taatagtaga 1200
tccaacagca tgctactatt aaatacagca agaaactgca ttaagtaatg ttaaatatta 1260
ggaagaaagt aatactgtga tttaaaaaaa actatattat taatcagaag acagcttgct 1320
cttactaaaa ggagctctca tttactttat ttgattttat ttttcttgac aaaaagcaac 1380
agttttaggg atagcttaga aaatgggttc tggcttgcta tcagggtaaa tctaacacct 1440
tacaagagga ctgagtgtca ctttctctct gggggaatga tccagcagct tatctagttg 1500
acaatcaaaa cacggctgat aaaggtgcaa tcatttctga catgtatttt tcactgattt 1560
tgaagctagt gattggttgt gtcttcttgg ctcaaaaaga agcatattac ggcacaaaaa 1620
gcccagccca gacagcacat gcagcatttt gtctgaaata cttctagagt caaacgtgcc 1680
tgctgtacat agcgatgact tgtcatcata gggaagtatt tccatcgtag agtgttcaga 1740
aggagtgact gtataggtgg agagaagctt agtgactccg ttgaaatttt aaaatgtgga 1800
tgaccacccc tttctccccc ttatttttct tttatctttc catgttgcct tgatcaggtc 1860
ataactatgc atgaacattt tttatcagga atggccgatg tgtatgtgat ttgtaatcac 1920
aagtaatgat tcatcaggaa atgtcaatcc tgttggaaag attgcacctt tacttgcaga 1980
agtgaccccc acctgtgtcc tgacctctcc atttacaggc tctctcaccc atttccccca 2040
cctcctttaa tttttgcttt actgtcataa agtaggacta agattggtct aagcattgca 2100
tgttcttttg tgatggtaaa tccaaaggaa ggcctataag tattaacatt tgaaataact 2160
gctaattcag gaaaatggaa gaaaaaaaat tatttgaaac acagaaccca tttcatggcc 2220
tgcctgatat ctgtgaaatc agggctggag ctttacttag gattcacatg gcctcctagg 2280
aaccatggga caaatgggaa acaggttatc gggggattca tgaagtcagt gagagtaatt 2340
gcttcttttt tgcgggtgaa ctgaatgtat ttcttcacca aatcttgatg ttaacaatta 2400
aaaagaagaa atgacatgca agtaggtctt agcagaaaaa tgcaggctgg gcatgagtca 2460
tgttgttacc ctcccacatg ctcctacaat ccacagagat gcctgtctgc aggttcttga 2520
agttattgtt agtatttggt atctcaaatt tttcgtcact gttcacatgc cactttctct 2580
gtgcacagtg gtatcctcat ttgcttttta acctacactg aggagtcttt gtcaggttgc 2640
actgattttc caattctgca gtaatgagta agctcacggc atggggaaga agacagtcag 2700
tccaatgaag ttctctaaat tattttaaca ttgcctttga aggccttgac tcatccttag 2760
ctatttcaat gaagaaattc ctaccatgaa tttaaaaccc taaaaattct gtttcaaatt 2820
ctttgggcat tggggtactc agatatccca ttgtggaaga attttaagaa taaatagaag 2880
tttctgttga gaaccatgag caacatgttt cttacaatga gaattgctat gcattttaaa 2940
attgcaaata tatatgaaaa ttgaagacaa gaggaaattg tatttctaac ttgattctga 3000
tcactcacag aggtggcata ttattatagt tgggacatcc tttgcaccct tcataaaaaa 3060
ggccagctga ctgctcagca tcacctgcca aggccactag atttgtgttt acaggggtat 3120
ctctgtgatg cttgtcacat cactcttgac cacctctgtt aataaattcc gacagtgcag 3180
tggcgatcgg agtgtgaact tatgttccca gcatatggaa agctatctta ggttttaagg 3240
tagtagaaat tgcccaggag tttgacagca actttgtttc ccgggtctaa aatcgtatcc 3300
cactgaggtg tatgcagtgg agcataatac atgcaaatac atgcaaaact ccttttgttt 3360
cacctaagat tcactttcta tcttactttc ccttcctgcc tagtgtgact tttgccccca 3420
agagtgcctg gacagcattc tagtttctac aaaatggtcc tctgtgtagg tgaatgtgtc 3480
ccaaacctgc tatcactttc ttgtttcagt gtgactgtct tgttagaggt gaagtttatc 3540
cagggtaact tgctcactaa ctattccttt ttatggcctg gggttaaagg gcgcatggct 3600
cacactggtg aaaataagga aggcctggtc ttatcttgta ttaataatac tggctgcatt 3660
ccaccagcca gagatttcta tctgcgaaga cctatgaaac actgaagaga aatgtaggca 3720
gaaggaaatg gccacatatc acaagttcta ttatatattc ttttgtaaat acatattgta 3780
tattacttgg atgttttctt atatcattta ctgtcttttt gagttaatgt cagtttttac 3840
tctctcaact tactatgtaa cattgtaaat aacataatgt cctttattat ttatatttaa 3900
gcatctaaca tatagagttg ttttcatata agtttaagat aaatgtcaaa aatatatgtt 3960
cttttgtttt tctttgcttt aaaattatgt atcttttcct tttctttttt ttaagaataa 4020
tttattgttc aggagaaaga atgtatatgt aactgaaact atctgaagaa tgcacattga 4080
aggccgtgag gtactgataa actaaagaat ttattattca aaatactaag caataagtaa 4140
ttgtgattta tttaaagttt tgtccatttt ccatgaaaga catactgcaa taaaaatgct 4200
actctgtgga gacctgggag tgttgctcag cagactacag cttcagtctg ttagaccagc 4260
acctttcatc tcattcccat agttatgcta atttaggatt gtgttccatg gaccccatga 4320
tcaccttgtc tatacgttgc ttcttgtctg tccattgctt ttgccacacc acctgttctc 4380
aaatcatctc ctccctacca atgctgttta tcactttctt ccttgttgaa gaggccacac 4440
aaccagacag tactatgctt cctttttcct ccatacacaa taacagagag agaatattct 4500
agggcatgac tgcctggatc ctggctgttg ctatcttttg tagtggcagt aagaaactcc 4560
ttcagactaa tgaaaatgtc aacgtgccat tcaatcacga aaggtaacga aaaatgctct 4620
catggttcaa atagtccaat ggcccatagt ggcctaaaag gcagccagtt gacacctggc 4680
catgctaagc ttccttatac catccgctaa tgactttcca ttgggcccac aatttacgga 4740
ttcataattt taaaagagga gaaggccaag ttaggttcat tccccttatt ctgtcaataa 4800
aacaaatcaa actcatgtct atctaactgc tcagggagga gcctttgcat gagaaaattc 4860
tcatattcta agactgagtc atagaaatga gggtattact tttcttactg caattaacct 4920
aaacaaaagc catattttaa caaatagata tttgcatggt acccttcata tattccaagc 4980
atttcactca ttattccagg taggttaaga gcttctgaag tgtatgaagt aaaggtcagc 5040
aatcctttgg ggtgaacagt ggcctccttt ggagtttggg ggtaacctga gacttcccac 5100
caatgtccac ctccatctgt gtacctaatt cctattacct agttatggct cctctaggat 5160
catttccaaa cactctggat gtccaggaaa tttaaattgt agcttttgac tgagctagtt 5220
tttcctattt atattaataa attttcaaaa atgcttgaaa tcttcacatt tgcaacaact 5280
ttagttttca tgcacataca aacacagaga gacaaaaatt ccaaacagac actctccaaa 5340
agccaccaca ctctttcact tgctctatag tcatttagcc aaccagccat gcagagaata 5400
tttaaaactt aaagattgag acattattct cagtttttgc tgaggctttg taacgaaatt 5460
gaacactata agcagctatt gtagtaattt tggttaaaat tgtttgcctg ggatatagta 5520
tttgaggcag aagcacgtgt gtgaaggagg tgaggtggtt tggaaagagt gaagactcgc 5580
agccagattg aatgtctgga taattactat aattctccct tcttggttga aaccatgttc 5640
tctcttgatt tttaaaccca ggctgcctct ggaaacaagc aaacctgagt ctttctaacc 5700
tgagtcttcc caatcattag atttcttttc tgtcctaacg atgaatgata aaaggacttg 5760
atgttcacaa tttggggtta taaggcaggt ctgaaatctg gagactcaag atgctggaag 5820
gagtggaaag tttcgatgac tttatatgaa tcactttgca ctctatgttt ggcttgtcct 5880
ctttgaaact gatttactaa aataaatgta agggaactat tactccaaaa gattaacttg 5940
gcaggaaata ccaatacttt cagtttatga aagacaaaac tgtcttgttg ctacaggaag 6000
ctgcaatgtt cctaaccttt aaggttggtg ttgaataggg tggtcatgcc ctcccctgca 6060
ggtatcttta ggctcctgtt gacctcctgg tactataact gttcgtcttc tctgggtagc 6120
tattgatttt gaactttaac atgcttcaaa actttattca tcagggaaat aggaaaagag 6180
ttttgttacc tggaggaaat ctattgtgat ctacctgagc tttttaaaaa cagaccagga 6240
gaaggaaacc agtaattttt aaagaagaga cagagaatgg gataatagtt tcacccagga 6300
tctctttcta accctttccc ttcaaatgaa cttattggaa cagaattgga aagaagaaag 6360
gacatctctg cccaccccac aggatgccaa aaaggctaaa gaattacctc tgtagattta 6420
aacatcttta atggcttatg tatagatttg ctaatacaga gagaaatgaa ctattaaata 6480
aaaatcacat tttataatat ttttatggct taaaacatcc tttatctcct ttttgttctc 6540
tctacatgat atggtaagtg atgaggaaaa tttaggctca ggaaggttaa aatctttctt 6600
ggagttacac atctaagaga gctgcagagc tgacacttgt acccaggttt tctgactgca 6660
aatccagttt ctttctattg cgttcttccc ctttccctgc ctcaagcaga aacaggtttt 6720
ttattttcaa cctttatgta tacagtatgt tatgttacat ctacagctaa gtttcttttt 6780
agaagaatgt gagcccttct agctttggtt tagagtgatt ctagaagcca atttccttgg 6840
cttagtgatt ctatgcacct ttcctaaact tagctttcta aggaaatgaa gtgtacgagt 6900
gagaatgaat tcacaatttc gacatgtagg tagcatccta aagtgaaaag aggaggaaat 6960
ttgtggtcaa agcactctcc ccaccactta gaaacttact gactgtgggc agcttcctcc 7020
tccaagtttc cttcctgatt tacaagaccg tggtgtggtc aggattaaac ttgaatacat 7080
gtaaggaagc ctgaaagtgt ctaacacata gcgagtattc aaatgccacc ttctatttga 7140
tccttcccct ccagttcctt aagttttgga atctaggttt ctcagttcca aatggattga 7200
catttgcata tccccattgc acaatggatc aaataaactt tatgttatca tttctccaac 7260
atagtgccag taagcaaatc ctttttaata acaacagtat gttgagaaac atatcaccaa 7320
ataatattta actttgtagc tttgataagt tctttaggtt ttggttttgg ttttgttttc 7380
tgagacaggg tcttgatctg tcacccagac tggagtgcaa tggttcaatt ttagctcact 7440
gcaacctgta actcctgagc tcaagtgatc ctcccacctc agcctctcaa gtagctggga 7500
ctacaggtgt gctccaccat gcccagcgat tttttttttt tttttttttt ttttttttgg 7560
tagggacaag gtctcgctat gttgcccagg ctggtcttca attcctggcc ttaagtgatc 7620
ctcctgcctc ggcctcccaa agtgctggaa ttacaggcat gagccaccac ccataacttt 7680
atgtttgttt ttttgatgca gtataagttc agcttgcttc ttatgcagcc ataccatttc 7740
atgttaactc tgatttttag cagcttatta cattagtgtt ttattattaa tataatttta 7800
cagaaattta ctaaaccatg actctgtaga gttttaataa tactacctcc aaacatcatt 7860
gcaaacatct agaagaatga acaaaaatga tcttagatcg acagtatatc tgtttgtctt 7920
agtttctaca caggatgttc agacatattc catttcttta aaaaaaaaat atatatatat 7980
atatatatat ataggcctgg cacggtggct catgactgta atcccagcac tttgggaggc 8040
ttaggcaggc aaatcacctg aggtcaggag tttgagactg gcctgaccaa catggtgaaa 8100
cctcgtctct attaaaatta caaaaattag ccgggcgtgg tggcacatgc ctgtaatccc 8160
agctactcgg gaggctgagg caggagaatc acttgaacct gggaggcaga ggttgcagtg 8220
agttgagatc acgccattgc actccagcct tggtgacaag agtgaaactc cgtctcaaaa 8280
aaaaaaaaaa tatgtatata tatatatata tatatatata tatatatata tatatatata 8340
taaaatccca ccaaaagtct gcagagtgac caaattagac ggctctggtt tcagattaaa 8400
ttctaaatgt gagaaaccac atagctccca tgatccatcc aataattcct cacgtcctct 8460
tcactcttta ctccatgcat aaaacagaat ttttttttct catcctgggt atgaagcaat 8520
taataattta cggatttagc ctatttggat tcaatccctt caaactccat actacatcca 8580
aggtggaagt gacttaaact ctgatatcaa tcatcaggct tgtaatatag gctttgttaa 8640
tggcaggaga gtctaataaa actttctgtt ccttatcctt catttaaatg aaaaactttt 8700
tattgaaaac aatcataact ctagctcatc ataaatataa ttcatgagga cattttatta 8760
tttttatatt aaagaaataa tattatagat gtaaactttg cacctttcta attattatca 8820
tgagttaagc taatacttgt cttctggtcc ctagatgatg attctttttt gccttactgg 8880
aggagccctt gtcttgaagt gagttgcttc aacagcagag gacttctagt ttctcccagt 8940
tgagcctaaa gtgaactttt catcttcttc agaggaaggg gcttccttga tttgtacttt 9000
tgtggctcct cagataacac agacaatttt atcttggatc ccaggttctc ttcaccatta 9060
agaataagaa agagagaaaa tgctgtgcat gacagccacc tactccaaac tacccaaccc 9120
cctgtaacca ggtaccttcc aacaacgaga tgattctgcc ctcactcaag agtctccccc 9180
acaaagattc cattctccct ttacttttta ttttttattt ttttgcaaaa caaaggcctc 9240
ctttagtacc tccttagttt ataaccctta tcttcccagc tcttcccttc actgatacct 9300
ctgatttcaa aagttctgaa gtcggaagac cacacaattt cagactgtga acagaaattc 9360
agtcagaaat tattggagtt agaaagaatt tagagaaatt gtattgaact agaaagtcct 9420
ctttgattag tggtggccct ttagaaagtt ctagggcaga gtcccatggt gtttcatctt 9480
ttccatgatt tgcttgacca aaaacttttc tttcacaaca tgaaaatatg ctaatccacc 9540
acacattttg gattctgctc tgtttgcctg aggtgttaga tctctagcca ggactgtgaa 9600
gggaaggaac ttgaatcctt cctattgagc tattaatgca gagtcagtga gatgaagggt 9660
tccactcggg gtcaaaatca tgtcagttac caagcaaagg agcaagtaag gggaaacatc 9720
tcctcatctg gttagtggag ccacatttca cccactgatc aagccagacc tgagcaatag 9780
tctagatttc tccctccaca tctaattggt gacaatggtg attgtaccac tgacaggtca 9840
cacaagtcca caccctcctt cacagcctca ctgcttcggc tcttgtttat gctcttatca 9900
gcatctgtca ctaggatcca tttgtctcct gactcatcta cttcctttca ctcccctggg 9960
ccatccttca catcatgcta gaagactgtg tctaacttgc agaacttatt gtgtcaatct 10020
cctggttacg gcccctccat ggctccccac ctgacatagg aggcccttca caatctggct 10080
tctgtcactc ataacttgtc tccagccttc attcttcagt ctgattttat ggttttctag 10140
ttccccaata caccacacca gtgtttatga aaccctagtc attggcctac gaactttatg 10200
attattgaca tattcatgta ccacctgtat tattttttgc atagtgttta tttttaattg 10260
actacatttt taactacaat aaagtaattt caactaaaa 10299
<![CDATA[<210> 138]]>
<![CDATA[<211> 11669]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 138]]>
agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60
cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120
tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180
accacagcct tgcctccccg attgaaagag atgaaaagcc aggaatcggc tgcaggttcc 240
aaactagtcc ttcggtgtga aaccagttct gaatactcct ctctcagatt caagtggttc 300
aagaatggga atgaattgaa tcgaaaaaac aaaccacaaa atatcaagat acaaaaaaag 360
ccagggaagt cagaacttcg cattaacaaa gcatcactgg ctgattctgg agagtatatg 420
tgcaaagtga tcagcaaatt aggaaatgac agtgcctctg ccaatatcac catcgtggaa 480
tcaaacgaga tcatcactgg tatgccagcc tcaactgaag gagcatatgt gtcttcagag 540
tctcccatta gaatatcagt atccacagaa ggagcaaata cttcttcatc tacatctaca 600
tccaccactg ggacaagcca tcttgtaaaa tgtgcggaga aggagaaaac tttctgtgtg 660
aatggagggg agtgcttcat ggtgaaagac ctttcaaacc cctcgagata cttgtgcaag 720
tgcccaaatg agtttactgg tgatcgctgc caaaactacg taatggccag cttctacaag 780
catcttggga ttgaatttat ggaggcggag gagctgtacc agaagagagt gctgaccata 840
accggcatct gcatcgccct ccttgtggtc ggcatcatgt gtgtggtggc ctactgcaaa 900
accaagaaac agcggaaaaa gctgcatgac cgtcttcggc agagccttcg gtctgaacga 960
aacaatatga tgaacattgc caatgggcct caccatccta acccaccccc cgagaatgtc 1020
cagctggtga atcaatacgt atctaaaaac gtcatctcca gtgagcatat tgttgagaga 1080
gaagcagaga catccttttc caccagtcac tatacttcca cagcccatca ctccactact 1140
gtcacccaga ctcctagcca cagctggagc aacggacaca ctgaaagcat cctttccgaa 1200
agccactctg taatcgtgat gtcatccgta gaaaacagta ggcacagcag cccaactggg 1260
ggcccaagag gacgtcttaa tggcacagga ggccctcgtg aatgtaacag cttcctcagg 1320
catgccagag aaacccctga ttcctaccga gactctcctc atagtgaaag gtatgtgtca 1380
gccatgacca ccccggctcg tatgtcacct gtagatttcc acacgccaag ctcccccaaa 1440
tcgccccctt cggaaatgtc tccacccgtg tccagcatga cggtgtccat gccttccatg 1500
gcggtcagcc ccttcatgga agaagagaga cctctacttc tcgtgacacc accaaggctg 1560
cgggagaaga agtttgacca tcaccctcag cagttcagct ccttccacca caaccccgcg 1620
catgacagta acagcctccc tgctagcccc ttgaggatag tggaggatga ggagtatgaa 1680
acgacccaag agtacgagcc agcccaagag cctgttaaga aactcgccaa tagccggcgg 1740
gccaaaagaa ccaagcccaa tggccacatt gctaacagat tggaagtgga cagcaacaca 1800
agctcccaga gcagtaactc agagagtgaa acagaagatg aaagagtagg tgaagatacg 1860
cctttcctgg gcatacagaa ccccctggca gccagtcttg aggcaacacc tgccttccgc 1920
ctggctgaca gcaggactaa cccagcaggc cgcttctcga cacaggaaga aatccaggcc 1980
aggctgtcta gtgtaattgc taaccaagac cctattgctg tataaaacct aaataaacac 2040
atagattcac ctgtaaaact ttattttata taataaagta ttccacctta aattaaacaa 2100
tttattttat tttagcagtt ctgcaaatag aaaacaggaa aaaaactttt ataaattaaa 2160
tatatgtatg taaaaatgtg ttatgtgcca tatgtagcaa ttttttacag tatttcaaaa 2220
cgagaaagat atcaatggtg cctttatgtt atgttatgtc gagagcaagt tttgtacagt 2280
tacagtgatt gcttttccac agtatttctg caaaacctct catagattca gtttttgctg 2340
gcttcttgtg cattgcatta tgatgttgac tggatgtatg atttgcaaga cttgcaactg 2400
tccctctgtt tgcttgtagt agcacccgat cagtatgtct tgtaatggca catccatcca 2460
gatatgcctc tcttgtgtat gaagttttct ttgctttcag aatatgaaat gagttgtgtc 2520
tactctgcca gccaaaggtt tgcctcattg ggctctgaga taatagtaga tccaacagca 2580
tgctactatt aaatacagca agaaactgca ttaagtaatg ttaaatatta ggaagaaagt 2640
aatactgtga tttaaaaaaa actatattat taatcagaag acagcttgct cttactaaaa 2700
ggagctctca tttactttat ttgattttat ttttcttgac aaaaagcaac agttttaggg 2760
atagcttaga aaatgggttc tggcttgcta tcagggtaaa tctaacacct tacaagagga 2820
ctgagtgtca ctttctctct gggggaatga tccagcagct tatctagttg acaatcaaaa 2880
cacggctgat aaaggtgcaa tcatttctga catgtatttt tcactgattt tgaagctagt 2940
gattggttgt gtcttcttgg ctcaaaaaga agcatattac ggcacaaaaa gcccagccca 3000
gacagcacat gcagcatttt gtctgaaata cttctagagt caaacgtgcc tgctgtacat 3060
agcgatgact tgtcatcata gggaagtatt tccatcgtag agtgttcaga aggagtgact 3120
gtataggtgg agagaagctt agtgactccg ttgaaatttt aaaatgtgga tgaccacccc 3180
tttctccccc ttatttttct tttatctttc catgttgcct tgatcaggtc ataactatgc 3240
atgaacattt tttatcagga atggccgatg tgtatgtgat ttgtaatcac aagtaatgat 3300
tcatcaggaa atgtcaatcc tgttggaaag attgcacctt tacttgcaga agtgaccccc 3360
acctgtgtcc tgacctctcc atttacaggc tctctcaccc atttccccca cctcctttaa 3420
tttttgcttt actgtcataa agtaggacta agattggtct aagcattgca tgttcttttg 3480
tgatggtaaa tccaaaggaa ggcctataag tattaacatt tgaaataact gctaattcag 3540
gaaaatggaa gaaaaaaaat tatttgaaac acagaaccca tttcatggcc tgcctgatat 3600
ctgtgaaatc agggctggag ctttacttag gattcacatg gcctcctagg aaccatggga 3660
caaatgggaa acaggttatc gggggattca tgaagtcagt gagagtaatt gcttcttttt 3720
tgcgggtgaa ctgaatgtat ttcttcacca aatcttgatg ttaacaatta aaaagaagaa 3780
atgacatgca agtaggtctt agcagaaaaa tgcaggctgg gcatgagtca tgttgttacc 3840
ctcccacatg ctcctacaat ccacagagat gcctgtctgc aggttcttga agttattgtt 3900
agtatttggt atctcaaatt tttcgtcact gttcacatgc cactttctct gtgcacagtg 3960
gtatcctcat ttgcttttta acctacactg aggagtcttt gtcaggttgc actgattttc 4020
caattctgca gtaatgagta agctcacggc atggggaaga agacagtcag tccaatgaag 4080
ttctctaaat tattttaaca ttgcctttga aggccttgac tcatccttag ctatttcaat 4140
gaagaaattc ctaccatgaa tttaaaaccc taaaaattct gtttcaaatt ctttgggcat 4200
tggggtactc agatatccca ttgtggaaga attttaagaa taaatagaag tttctgttga 4260
gaaccatgag caacatgttt cttacaatga gaattgctat gcattttaaa attgcaaata 4320
tatatgaaaa ttgaagacaa gaggaaattg tatttctaac ttgattctga tcactcacag 4380
aggtggcata ttattatagt tgggacatcc tttgcaccct tcataaaaaa ggccagctga 4440
ctgctcagca tcacctgcca aggccactag atttgtgttt acaggggtat ctctgtgatg 4500
cttgtcacat cactcttgac cacctctgtt aataaattcc gacagtgcag tggcgatcgg 4560
agtgtgaact tatgttccca gcatatggaa agctatctta ggttttaagg tagtagaaat 4620
tgcccaggag tttgacagca actttgtttc ccgggtctaa aatcgtatcc cactgaggtg 4680
tatgcagtgg agcataatac atgcaaatac atgcaaaact ccttttgttt cacctaagat 4740
tcactttcta tcttactttc ccttcctgcc tagtgtgact tttgccccca agagtgcctg 4800
gacagcattc tagtttctac aaaatggtcc tctgtgtagg tgaatgtgtc ccaaacctgc 4860
tatcactttc ttgtttcagt gtgactgtct tgttagaggt gaagtttatc cagggtaact 4920
tgctcactaa ctattccttt ttatggcctg gggttaaagg gcgcatggct cacactggtg 4980
aaaataagga aggcctggtc ttatcttgta ttaataatac tggctgcatt ccaccagcca 5040
gagatttcta tctgcgaaga cctatgaaac actgaagaga aatgtaggca gaaggaaatg 5100
gccacatatc acaagttcta ttatatattc ttttgtaaat acatattgta tattacttgg 5160
atgttttctt atatcattta ctgtcttttt gagttaatgt cagtttttac tctctcaact 5220
tactatgtaa cattgtaaat aacataatgt cctttattat ttatatttaa gcatctaaca 5280
tatagagttg ttttcatata agtttaagat aaatgtcaaa aatatatgtt cttttgtttt 5340
tctttgcttt aaaattatgt atcttttcct tttctttttt ttaagaataa tttattgttc 5400
aggagaaaga atgtatatgt aactgaaact atctgaagaa tgcacattga aggccgtgag 5460
gtactgataa actaaagaat ttattattca aaatactaag caataagtaa ttgtgattta 5520
tttaaagttt tgtccatttt ccatgaaaga catactgcaa taaaaatgct actctgtgga 5580
gacctgggag tgttgctcag cagactacag cttcagtctg ttagaccagc acctttcatc 5640
tcattcccat agttatgcta atttaggatt gtgttccatg gaccccatga tcaccttgtc 5700
tatacgttgc ttcttgtctg tccattgctt ttgccacacc acctgttctc aaatcatctc 5760
ctccctacca atgctgttta tcactttctt ccttgttgaa gaggccacac aaccagacag 5820
tactatgctt cctttttcct ccatacacaa taacagagag agaatattct agggcatgac 5880
tgcctggatc ctggctgttg ctatcttttg tagtggcagt aagaaactcc ttcagactaa 5940
tgaaaatgtc aacgtgccat tcaatcacga aaggtaacga aaaatgctct catggttcaa 6000
atagtccaat ggcccatagt ggcctaaaag gcagccagtt gacacctggc catgctaagc 6060
ttccttatac catccgctaa tgactttcca ttgggcccac aatttacgga ttcataattt 6120
taaaagagga gaaggccaag ttaggttcat tccccttatt ctgtcaataa aacaaatcaa 6180
actcatgtct atctaactgc tcagggagga gcctttgcat gagaaaattc tcatattcta 6240
agactgagtc atagaaatga gggtattact tttcttactg caattaacct aaacaaaagc 6300
catattttaa caaatagata tttgcatggt acccttcata tattccaagc atttcactca 6360
ttattccagg taggttaaga gcttctgaag tgtatgaagt aaaggtcagc aatcctttgg 6420
ggtgaacagt ggcctccttt ggagtttggg ggtaacctga gacttcccac caatgtccac 6480
ctccatctgt gtacctaatt cctattacct agttatggct cctctaggat catttccaaa 6540
cactctggat gtccaggaaa tttaaattgt agcttttgac tgagctagtt tttcctattt 6600
atattaataa attttcaaaa atgcttgaaa tcttcacatt tgcaacaact ttagttttca 6660
tgcacataca aacacagaga gacaaaaatt ccaaacagac actctccaaa agccaccaca 6720
ctctttcact tgctctatag tcatttagcc aaccagccat gcagagaata tttaaaactt 6780
aaagattgag acattattct cagtttttgc tgaggctttg taacgaaatt gaacactata 6840
agcagctatt gtagtaattt tggttaaaat tgtttgcctg ggatatagta tttgaggcag 6900
aagcacgtgt gtgaaggagg tgaggtggtt tggaaagagt gaagactcgc agccagattg 6960
aatgtctgga taattactat aattctccct tcttggttga aaccatgttc tctcttgatt 7020
tttaaaccca ggctgcctct ggaaacaagc aaacctgagt ctttctaacc tgagtcttcc 7080
caatcattag atttcttttc tgtcctaacg atgaatgata aaaggacttg atgttcacaa 7140
tttggggtta taaggcaggt ctgaaatctg gagactcaag atgctggaag gagtggaaag 7200
tttcgatgac tttatatgaa tcactttgca ctctatgttt ggcttgtcct ctttgaaact 7260
gatttactaa aataaatgta agggaactat tactccaaaa gattaacttg gcaggaaata 7320
ccaatacttt cagtttatga aagacaaaac tgtcttgttg ctacaggaag ctgcaatgtt 7380
cctaaccttt aaggttggtg ttgaataggg tggtcatgcc ctcccctgca ggtatcttta 7440
ggctcctgtt gacctcctgg tactataact gttcgtcttc tctgggtagc tattgatttt 7500
gaactttaac atgcttcaaa actttattca tcagggaaat aggaaaagag ttttgttacc 7560
tggaggaaat ctattgtgat ctacctgagc tttttaaaaa cagaccagga gaaggaaacc 7620
agtaattttt aaagaagaga cagagaatgg gataatagtt tcacccagga tctctttcta 7680
accctttccc ttcaaatgaa cttattggaa cagaattgga aagaagaaag gacatctctg 7740
cccaccccac aggatgccaa aaaggctaaa gaattacctc tgtagattta aacatcttta 7800
atggcttatg tatagatttg ctaatacaga gagaaatgaa ctattaaata aaaatcacat 7860
tttataatat ttttatggct taaaacatcc tttatctcct ttttgttctc tctacatgat 7920
atggtaagtg atgaggaaaa tttaggctca ggaaggttaa aatctttctt ggagttacac 7980
atctaagaga gctgcagagc tgacacttgt acccaggttt tctgactgca aatccagttt 8040
ctttctattg cgttcttccc ctttccctgc ctcaagcaga aacaggtttt ttattttcaa 8100
cctttatgta tacagtatgt tatgttacat ctacagctaa gtttcttttt agaagaatgt 8160
gagcccttct agctttggtt tagagtgatt ctagaagcca atttccttgg cttagtgatt 8220
ctatgcacct ttcctaaact tagctttcta aggaaatgaa gtgtacgagt gagaatgaat 8280
tcacaatttc gacatgtagg tagcatccta aagtgaaaag aggaggaaat ttgtggtcaa 8340
agcactctcc ccaccactta gaaacttact gactgtgggc agcttcctcc tccaagtttc 8400
cttcctgatt tacaagaccg tggtgtggtc aggattaaac ttgaatacat gtaaggaagc 8460
ctgaaagtgt ctaacacata gcgagtattc aaatgccacc ttctatttga tccttcccct 8520
ccagttcctt aagttttgga atctaggttt ctcagttcca aatggattga catttgcata 8580
tccccattgc acaatggatc aaataaactt tatgttatca tttctccaac atagtgccag 8640
taagcaaatc ctttttaata acaacagtat gttgagaaac atatcaccaa ataatattta 8700
actttgtagc tttgataagt tctttaggtt ttggttttgg ttttgttttc tgagacaggg 8760
tcttgatctg tcacccagac tggagtgcaa tggttcaatt ttagctcact gcaacctgta 8820
actcctgagc tcaagtgatc ctcccacctc agcctctcaa gtagctggga ctacaggtgt 8880
gctccaccat gcccagcgat tttttttttt tttttttttt ttttttttgg tagggacaag 8940
gtctcgctat gttgcccagg ctggtcttca attcctggcc ttaagtgatc ctcctgcctc 9000
ggcctcccaa agtgctggaa ttacaggcat gagccaccac ccataacttt atgtttgttt 9060
ttttgatgca gtataagttc agcttgcttc ttatgcagcc ataccatttc atgttaactc 9120
tgatttttag cagcttatta cattagtgtt ttattattaa tataatttta cagaaattta 9180
ctaaaccatg actctgtaga gttttaataa tactacctcc aaacatcatt gcaaacatct 9240
agaagaatga acaaaaatga tcttagatcg acagtatatc tgtttgtctt agtttctaca 9300
caggatgttc agacatattc catttcttta aaaaaaaaat atatatatat atatatatat 9360
ataggcctgg cacggtggct catgactgta atcccagcac tttgggaggc ttaggcaggc 9420
aaatcacctg aggtcaggag tttgagactg gcctgaccaa catggtgaaa cctcgtctct 9480
attaaaatta caaaaattag ccgggcgtgg tggcacatgc ctgtaatccc agctactcgg 9540
gaggctgagg caggagaatc acttgaacct gggaggcaga ggttgcagtg agttgagatc 9600
acgccattgc actccagcct tggtgacaag agtgaaactc cgtctcaaaa aaaaaaaaaa 9660
tatgtatata tatatatata tatatatata tatatatata tatatatata taaaatccca 9720
ccaaaagtct gcagagtgac caaattagac ggctctggtt tcagattaaa ttctaaatgt 9780
gagaaaccac atagctccca tgatccatcc aataattcct cacgtcctct tcactcttta 9840
ctccatgcat aaaacagaat ttttttttct catcctgggt atgaagcaat taataattta 9900
cggatttagc ctatttggat tcaatccctt caaactccat actacatcca aggtggaagt 9960
gacttaaact ctgatatcaa tcatcaggct tgtaatatag gctttgttaa tggcaggaga 10020
gtctaataaa actttctgtt ccttatcctt catttaaatg aaaaactttt tattgaaaac 10080
aatcataact ctagctcatc ataaatataa ttcatgagga cattttatta tttttatatt 10140
aaagaaataa tattatagat gtaaactttg cacctttcta attattatca tgagttaagc 10200
taatacttgt cttctggtcc ctagatgatg attctttttt gccttactgg aggagccctt 10260
gtcttgaagt gagttgcttc aacagcagag gacttctagt ttctcccagt tgagcctaaa 10320
gtgaactttt catcttcttc agaggaaggg gcttccttga tttgtacttt tgtggctcct 10380
cagataacac agacaatttt atcttggatc ccaggttctc ttcaccatta agaataagaa 10440
agagagaaaa tgctgtgcat gacagccacc tactccaaac tacccaaccc cctgtaacca 10500
ggtaccttcc aacaacgaga tgattctgcc ctcactcaag agtctccccc acaaagattc 10560
cattctccct ttacttttta ttttttattt ttttgcaaaa caaaggcctc ctttagtacc 10620
tccttagttt ataaccctta tcttcccagc tcttcccttc actgatacct ctgatttcaa 10680
aagttctgaa gtcggaagac cacacaattt cagactgtga acagaaattc agtcagaaat 10740
tattggagtt agaaagaatt tagagaaatt gtattgaact agaaagtcct ctttgattag 10800
tggtggccct ttagaaagtt ctagggcaga gtcccatggt gtttcatctt ttccatgatt 10860
tgcttgacca aaaacttttc tttcacaaca tgaaaatatg ctaatccacc acacattttg 10920
gattctgctc tgtttgcctg aggtgttaga tctctagcca ggactgtgaa gggaaggaac 10980
ttgaatcctt cctattgagc tattaatgca gagtcagtga gatgaagggt tccactcggg 11040
gtcaaaatca tgtcagttac caagcaaagg agcaagtaag gggaaacatc tcctcatctg 11100
gttagtggag ccacatttca cccactgatc aagccagacc tgagcaatag tctagatttc 11160
tccctccaca tctaattggt gacaatggtg attgtaccac tgacaggtca cacaagtcca 11220
caccctcctt cacagcctca ctgcttcggc tcttgtttat gctcttatca gcatctgtca 11280
ctaggatcca tttgtctcct gactcatcta cttcctttca ctcccctggg ccatccttca 11340
catcatgcta gaagactgtg tctaacttgc agaacttatt gtgtcaatct cctggttacg 11400
gcccctccat ggctccccac ctgacatagg aggcccttca caatctggct tctgtcactc 11460
ataacttgtc tccagccttc attcttcagt ctgattttat ggttttctag ttccccaata 11520
caccacacca gtgtttatga aaccctagtc attggcctac gaactttatg attattgaca 11580
tattcatgta ccacctgtat tattttttgc atagtgttta tttttaattg actacatttt 11640
taactacaat aaagtaattt caactaaaa 11669
<![CDATA[<210> 139]]>
<![CDATA[<211> 12]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 139]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr
1 5 10
<![CDATA[<210> 140]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 140]]>
Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
1 5 10 15
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
20 25 30
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys
35 40 45
Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro
50 55
<![CDATA[<210> 141]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 141]]>
Lys Ser Glu Leu Arg Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu
1 5 10 15
Tyr Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala
20 25 30
Asn Ile Thr Ile Val Glu Ser Asn
35 40
<![CDATA[<210> 142]]>
<![CDATA[<211> 16]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 142]]>
Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser
1 5 10 15
<![CDATA[<210> 143]]>
<![CDATA[<211> 16]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 143]]>
Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr Ser Ser
1 5 10 15
<![CDATA[<210> 144]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 144]]>
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
1 5 10 15
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys
20 25 30
Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys
35 40
<![CDATA[<210> 145]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 145]]>
Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala
1 5 10 15
Ser Phe Tyr
<![CDATA[<210> 146]]>
<![CDATA[<211> 7]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 146]]>
His Leu Gly Ile Glu Phe Met
1 5
<![CDATA[<210> 147]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 147]]>
Ala Glu Glu Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys
1 5 10 15
Ile Ala Leu Leu Val Val Gly Ile Met Cys Val Val Ala Tyr Cys Lys
20 25 30
Thr
<![CDATA[<210> 148]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 148]]>
Lys Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser
1 5 10 15
Glu Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn
20 25 30
Pro Pro Pro Glu Asn Val Gln Leu Val Asn
35 40
<![CDATA[<210> 149]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 149]]>
Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Glu His Ile Val Glu Arg
1 5 10 15
Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His
20 25 30
His Ser Thr Thr Val Thr Gln Thr Pro Ser His
35 40
<![CDATA[<210> 150]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 150]]>
Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Val
1 5 10 15
Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly
20 25 30
Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn
35 40 45
Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser
50 55 60
Pro His Ser Glu
65
<![CDATA[<210> 151]]>
<![CDATA[<211> 217]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 151]]>
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe
1 5 10 15
His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro
20 25 30
Val Ser Ser Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe
35 40 45
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg
50 55 60
Glu Lys Lys Phe Asp His His Pro Gln Gln Phe Ser Ser Phe His His
65 70 75 80
Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile
85 90 95
Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln
100 105 110
Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys
115 120 125
Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser
130 135 140
Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly
145 150 155 160
Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu
165 170 175
Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala
180 185 190
Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala Arg Leu Ser Ser Val
195 200 205
Ile Ala Asn Gln Asp Pro Ile Ala Val
210 215
<![CDATA[<210> 152]]>
<![CDATA[<211> 624]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 152]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr Ala Leu Pro Pro
1 5 10 15
Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu
20 25 30
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys
35 40 45
Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gln Asn
50 55 60
Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg Ile Asn Lys
65 70 75 80
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys
85 90 95
Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn
100 105 110
Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser
115 120 125
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr
130 135 140
Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys
145 150 155 160
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe
165 170 175
Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro
180 185 190
Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe
195 200 205
Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln
210 215 220
Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val
225 230 235 240
Gly Ile Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys
245 250 255
Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn
260 265 270
Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu
275 280 285
Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser
290 295 300
Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His
305 310 315 320
Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser
325 330 335
His Ser Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His
340 345 350
Ser Val Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro
355 360 365
Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu
370 375 380
Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg
385 390 395 400
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala
405 410 415
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro
420 425 430
Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro
435 440 445
Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu
450 455 460
Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln
465 470 475 480
Gln Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu
485 490 495
Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr
500 505 510
Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser
515 520 525
Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu
530 535 540
Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu
545 550 555 560
Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln
565 570 575
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala
580 585 590
Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile
595 600 605
Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
610 615 620
<![CDATA[<210> 153]]>
<![CDATA[<211> 11482]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 153]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120
ggaatgaatt gaatcgaaaa aacaaaccac aaaatatcaa gatacaaaaa aagccaggga 180
agtcagaact tcgcattaac aaagcatcac tggctgattc tggagagtat atgtgcaaag 240
tgatcagcaa attaggaaat gacagtgcct ctgccaatat caccatcgtg gaatcaaacg 300
agatcatcac tggtatgcca gcctcaactg aaggagcata tgtgtcttca gagtctccca 360
ttagaatatc agtatccaca gaaggagcaa atacttcttc atctacatct acatccacca 420
ctgggacaag ccatcttgta aaatgtgcgg agaaggagaa aactttctgt gtgaatggag 480
gggagtgctt catggtgaaa gacctttcaa acccctcgag atacttgtgc aagtgcccaa 540
atgagtttac tggtgatcgc tgccaaaact acgtaatggc cagcttctac aagcatcttg 600
ggattgaatt tatggaggcg gaggagctgt accagaagag agtgctgacc ataaccggca 660
tctgcatcgc cctccttgtg gtcggcatca tgtgtgtggt ggcctactgc aaaaccaaga 720
aacagcggaa aaagctgcat gaccgtcttc ggcagagcct tcggtctgaa cgaaacaata 780
tgatgaacat tgccaatggg cctcaccatc ctaacccacc ccccgagaat gtccagctgg 840
tgaatcaata cgtatctaaa aacgtcatct ccagtgagca tattgttgag agagaagcag 900
agacatcctt ttccaccagt cactatactt ccacagccca tcactccact actgtcaccc 960
agactcctag ccacagctgg agcaacggac acactgaaag catcctttcc gaaagccact 1020
ctgtaatcgt gatgtcatcc gtagaaaaca gtaggcacag cagcccaact gggggcccaa 1080
gaggacgtct taatggcaca ggaggccctc gtgaatgtaa cagcttcctc aggcatgcca 1140
gagaaacccc tgattcctac cgagactctc ctcatagtga aaggtatgtg tcagccatga 1200
ccaccccggc tcgtatgtca cctgtagatt tccacacgcc aagctccccc aaatcgcccc 1260
cttcggaaat gtctccaccc gtgtccagca tgacggtgtc catgccttcc atggcggtca 1320
gccccttcat ggaagaagag agacctctac ttctcgtgac accaccaagg ctgcgggaga 1380
agaagtttga ccatcaccct cagcagttca gctccttcca ccacaacccc gcgcatgaca 1440
gtaacagcct ccctgctagc cccttgagga tagtggagga tgaggagtat gaaacgaccc 1500
aagagtacga gccagcccaa gagcctgtta agaaactcgc caatagccgg cgggccaaaa 1560
gaaccaagcc caatggccac attgctaaca gattggaagt ggacagcaac acaagctccc 1620
agagcagtaa ctcagagagt gaaacagaag atgaaagagt aggtgaagat acgcctttcc 1680
tgggcataca gaaccccctg gcagccagtc ttgaggcaac acctgccttc cgcctggctg 1740
acagcaggac taacccagca ggccgcttct cgacacagga agaaatccag gccaggctgt 1800
ctagtgtaat tgctaaccaa gaccctattg ctgtataaaa cctaaataaa cacatagatt 1860
cacctgtaaa actttatttt atataataaa gtattccacc ttaaattaaa caatttattt 1920
tattttagca gttctgcaaa tagaaaacag gaaaaaaact tttataaatt aaatatatgt 1980
atgtaaaaat gtgttatgtg ccatatgtag caatttttta cagtatttca aaacgagaaa 2040
gatatcaatg gtgcctttat gttatgttat gtcgagagca agttttgtac agttacagtg 2100
attgcttttc cacagtattt ctgcaaaacc tctcatagat tcagtttttg ctggcttctt 2160
gtgcattgca ttatgatgtt gactggatgt atgatttgca agacttgcaa ctgtccctct 2220
gtttgcttgt agtagcaccc gatcagtatg tcttgtaatg gcacatccat ccagatatgc 2280
ctctcttgtg tatgaagttt tctttgcttt cagaatatga aatgagttgt gtctactctg 2340
ccagccaaag gtttgcctca ttgggctctg agataatagt agatccaaca gcatgctact 2400
attaaataca gcaagaaact gcattaagta atgttaaata ttaggaagaa agtaatactg 2460
tgatttaaaa aaaactatat tattaatcag aagacagctt gctcttacta aaaggagctc 2520
tcatttactt tatttgattt tatttttctt gacaaaaagc aacagtttta gggatagctt 2580
agaaaatggg ttctggcttg ctatcagggt aaatctaaca ccttacaaga ggactgagtg 2640
tcactttctc tctgggggaa tgatccagca gcttatctag ttgacaatca aaacacggct 2700
gataaaggtg caatcatttc tgacatgtat ttttcactga ttttgaagct agtgattggt 2760
tgtgtcttct tggctcaaaa agaagcatat tacggcacaa aaagcccagc ccagacagca 2820
catgcagcat tttgtctgaa atacttctag agtcaaacgt gcctgctgta catagcgatg 2880
acttgtcatc atagggaagt atttccatcg tagagtgttc agaaggagtg actgtatagg 2940
tggagagaag cttagtgact ccgttgaaat tttaaaatgt ggatgaccac ccctttctcc 3000
cccttatttt tcttttatct ttccatgttg ccttgatcag gtcataacta tgcatgaaca 3060
ttttttatca ggaatggccg atgtgtatgt gatttgtaat cacaagtaat gattcatcag 3120
gaaatgtcaa tcctgttgga aagattgcac ctttacttgc agaagtgacc cccacctgtg 3180
tcctgacctc tccatttaca ggctctctca cccatttccc ccacctcctt taatttttgc 3240
tttactgtca taaagtagga ctaagattgg tctaagcatt gcatgttctt ttgtgatggt 3300
aaatccaaag gaaggcctat aagtattaac atttgaaata actgctaatt caggaaaatg 3360
gaagaaaaaa aattatttga aacacagaac ccatttcatg gcctgcctga tatctgtgaa 3420
atcagggctg gagctttact taggattcac atggcctcct aggaaccatg ggacaaatgg 3480
gaaacaggtt atcgggggat tcatgaagtc agtgagagta attgcttctt ttttgcgggt 3540
gaactgaatg tatttcttca ccaaatcttg atgttaacaa ttaaaaagaa gaaatgacat 3600
gcaagtaggt cttagcagaa aaatgcaggc tgggcatgag tcatgttgtt accctcccac 3660
atgctcctac aatccacaga gatgcctgtc tgcaggttct tgaagttatt gttagtattt 3720
ggtatctcaa atttttcgtc actgttcaca tgccactttc tctgtgcaca gtggtatcct 3780
catttgcttt ttaacctaca ctgaggagtc tttgtcaggt tgcactgatt ttccaattct 3840
gcagtaatga gtaagctcac ggcatgggga agaagacagt cagtccaatg aagttctcta 3900
aattatttta acattgcctt tgaaggcctt gactcatcct tagctatttc aatgaagaaa 3960
ttcctaccat gaatttaaaa ccctaaaaat tctgtttcaa attctttggg cattggggta 4020
ctcagatatc ccattgtgga agaattttaa gaataaatag aagtttctgt tgagaaccat 4080
gagcaacatg tttcttacaa tgagaattgc tatgcatttt aaaattgcaa atatatatga 4140
aaattgaaga caagaggaaa ttgtatttct aacttgattc tgatcactca cagaggtggc 4200
atattattat agttgggaca tcctttgcac ccttcataaa aaaggccagc tgactgctca 4260
gcatcacctg ccaaggccac tagatttgtg tttacagggg tatctctgtg atgcttgtca 4320
catcactctt gaccacctct gttaataaat tccgacagtg cagtggcgat cggagtgtga 4380
acttatgttc ccagcatatg gaaagctatc ttaggtttta aggtagtaga aattgcccag 4440
gagtttgaca gcaactttgt ttcccgggtc taaaatcgta tcccactgag gtgtatgcag 4500
tggagcataa tacatgcaaa tacatgcaaa actccttttg tttcacctaa gattcacttt 4560
ctatcttact ttcccttcct gcctagtgtg acttttgccc ccaagagtgc ctggacagca 4620
ttctagtttc tacaaaatgg tcctctgtgt aggtgaatgt gtcccaaacc tgctatcact 4680
ttcttgtttc agtgtgactg tcttgttaga ggtgaagttt atccagggta acttgctcac 4740
taactattcc tttttatggc ctggggttaa agggcgcatg gctcacactg gtgaaaataa 4800
ggaaggcctg gtcttatctt gtattaataa tactggctgc attccaccag ccagagattt 4860
ctatctgcga agacctatga aacactgaag agaaatgtag gcagaaggaa atggccacat 4920
atcacaagtt ctattatata ttcttttgta aatacatatt gtatattact tggatgtttt 4980
cttatatcat ttactgtctt tttgagttaa tgtcagtttt tactctctca acttactatg 5040
taacattgta aataacataa tgtcctttat tatttatatt taagcatcta acatatagag 5100
ttgttttcat ataagtttaa gataaatgtc aaaaatatat gttcttttgt ttttctttgc 5160
tttaaaatta tgtatctttt ccttttcttt tttttaagaa taatttattg ttcaggagaa 5220
agaatgtata tgtaactgaa actatctgaa gaatgcacat tgaaggccgt gaggtactga 5280
taaactaaag aatttattat tcaaaatact aagcaataag taattgtgat ttatttaaag 5340
ttttgtccat tttccatgaa agacatactg caataaaaat gctactctgt ggagacctgg 5400
gagtgttgct cagcagacta cagcttcagt ctgttagacc agcacctttc atctcattcc 5460
catagttatg ctaatttagg attgtgttcc atggacccca tgatcacctt gtctatacgt 5520
tgcttcttgt ctgtccattg cttttgccac accacctgtt ctcaaatcat ctcctcccta 5580
ccaatgctgt ttatcacttt cttccttgtt gaagaggcca cacaaccaga cagtactatg 5640
cttccttttt cctccataca caataacaga gagagaatat tctagggcat gactgcctgg 5700
atcctggctg ttgctatctt ttgtagtggc agtaagaaac tccttcagac taatgaaaat 5760
gtcaacgtgc cattcaatca cgaaaggtaa cgaaaaatgc tctcatggtt caaatagtcc 5820
aatggcccat agtggcctaa aaggcagcca gttgacacct ggccatgcta agcttcctta 5880
taccatccgc taatgacttt ccattgggcc cacaatttac ggattcataa ttttaaaaga 5940
ggagaaggcc aagttaggtt cattcccctt attctgtcaa taaaacaaat caaactcatg 6000
tctatctaac tgctcaggga ggagcctttg catgagaaaa ttctcatatt ctaagactga 6060
gtcatagaaa tgagggtatt acttttctta ctgcaattaa cctaaacaaa agccatattt 6120
taacaaatag atatttgcat ggtacccttc atatattcca agcatttcac tcattattcc 6180
aggtaggtta agagcttctg aagtgtatga agtaaaggtc agcaatcctt tggggtgaac 6240
agtggcctcc tttggagttt gggggtaacc tgagacttcc caccaatgtc cacctccatc 6300
tgtgtaccta attcctatta cctagttatg gctcctctag gatcatttcc aaacactctg 6360
gatgtccagg aaatttaaat tgtagctttt gactgagcta gtttttccta tttatattaa 6420
taaattttca aaaatgcttg aaatcttcac atttgcaaca actttagttt tcatgcacat 6480
acaaacacag agagacaaaa attccaaaca gacactctcc aaaagccacc acactctttc 6540
acttgctcta tagtcattta gccaaccagc catgcagaga atatttaaaa cttaaagatt 6600
gagacattat tctcagtttt tgctgaggct ttgtaacgaa attgaacact ataagcagct 6660
attgtagtaa ttttggttaa aattgtttgc ctgggatata gtatttgagg cagaagcacg 6720
tgtgtgaagg aggtgaggtg gtttggaaag agtgaagact cgcagccaga ttgaatgtct 6780
ggataattac tataattctc ccttcttggt tgaaaccatg ttctctcttg atttttaaac 6840
ccaggctgcc tctggaaaca agcaaacctg agtctttcta acctgagtct tcccaatcat 6900
tagatttctt ttctgtccta acgatgaatg ataaaaggac ttgatgttca caatttgggg 6960
ttataaggca ggtctgaaat ctggagactc aagatgctgg aaggagtgga aagtttcgat 7020
gactttatat gaatcacttt gcactctatg tttggcttgt cctctttgaa actgatttac 7080
taaaataaat gtaagggaac tattactcca aaagattaac ttggcaggaa ataccaatac 7140
tttcagttta tgaaagacaa aactgtcttg ttgctacagg aagctgcaat gttcctaacc 7200
tttaaggttg gtgttgaata gggtggtcat gccctcccct gcaggtatct ttaggctcct 7260
gttgacctcc tggtactata actgttcgtc ttctctgggt agctattgat tttgaacttt 7320
aacatgcttc aaaactttat tcatcaggga aataggaaaa gagttttgtt acctggagga 7380
aatctattgt gatctacctg agctttttaa aaacagacca ggagaaggaa accagtaatt 7440
tttaaagaag agacagagaa tgggataata gtttcaccca ggatctcttt ctaacccttt 7500
cccttcaaat gaacttattg gaacagaatt ggaaagaaga aaggacatct ctgcccaccc 7560
cacaggatgc caaaaaggct aaagaattac ctctgtagat ttaaacatct ttaatggctt 7620
atgtatagat ttgctaatac agagagaaat gaactattaa ataaaaatca cattttataa 7680
tatttttatg gcttaaaaca tcctttatct cctttttgtt ctctctacat gatatggtaa 7740
gtgatgagga aaatttaggc tcaggaaggt taaaatcttt cttggagtta cacatctaag 7800
agagctgcag agctgacact tgtacccagg ttttctgact gcaaatccag tttctttcta 7860
ttgcgttctt cccctttccc tgcctcaagc agaaacaggt tttttatttt caacctttat 7920
gtatacagta tgttatgtta catctacagc taagtttctt tttagaagaa tgtgagccct 7980
tctagctttg gtttagagtg attctagaag ccaatttcct tggcttagtg attctatgca 8040
cctttcctaa acttagcttt ctaaggaaat gaagtgtacg agtgagaatg aattcacaat 8100
ttcgacatgt aggtagcatc ctaaagtgaa aagaggagga aatttgtggt caaagcactc 8160
tccccaccac ttagaaactt actgactgtg ggcagcttcc tcctccaagt ttccttcctg 8220
atttacaaga ccgtggtgtg gtcaggatta aacttgaata catgtaagga agcctgaaag 8280
tgtctaacac atagcgagta ttcaaatgcc accttctatt tgatccttcc cctccagttc 8340
cttaagtttt ggaatctagg tttctcagtt ccaaatggat tgacatttgc atatccccat 8400
tgcacaatgg atcaaataaa ctttatgtta tcatttctcc aacatagtgc cagtaagcaa 8460
atccttttta ataacaacag tatgttgaga aacatatcac caaataatat ttaactttgt 8520
agctttgata agttctttag gttttggttt tggttttgtt ttctgagaca gggtcttgat 8580
ctgtcaccca gactggagtg caatggttca attttagctc actgcaacct gtaactcctg 8640
agctcaagtg atcctcccac ctcagcctct caagtagctg ggactacagg tgtgctccac 8700
catgcccagc gatttttttt tttttttttt tttttttttt tggtagggac aaggtctcgc 8760
tatgttgccc aggctggtct tcaattcctg gccttaagtg atcctcctgc ctcggcctcc 8820
caaagtgctg gaattacagg catgagccac cacccataac tttatgtttg tttttttgat 8880
gcagtataag ttcagcttgc ttcttatgca gccataccat ttcatgttaa ctctgatttt 8940
tagcagctta ttacattagt gttttattat taatataatt ttacagaaat ttactaaacc 9000
atgactctgt agagttttaa taatactacc tccaaacatc attgcaaaca tctagaagaa 9060
tgaacaaaaa tgatcttaga tcgacagtat atctgtttgt cttagtttct acacaggatg 9120
ttcagacata ttccatttct ttaaaaaaaa aatatatata tatatatata tatataggcc 9180
tggcacggtg gctcatgact gtaatcccag cactttggga ggcttaggca ggcaaatcac 9240
ctgaggtcag gagtttgaga ctggcctgac caacatggtg aaacctcgtc tctattaaaa 9300
ttacaaaaat tagccgggcg tggtggcaca tgcctgtaat cccagctact cgggaggctg 9360
aggcaggaga atcacttgaa cctgggaggc agaggttgca gtgagttgag atcacgccat 9420
tgcactccag ccttggtgac aagagtgaaa ctccgtctca aaaaaaaaaa aaatatgtat 9480
atatatatat atatatatat atatatatat atatatatat atataaaatc ccaccaaaag 9540
tctgcagagt gaccaaatta gacggctctg gtttcagatt aaattctaaa tgtgagaaac 9600
cacatagctc ccatgatcca tccaataatt cctcacgtcc tcttcactct ttactccatg 9660
cataaaacag aatttttttt tctcatcctg ggtatgaagc aattaataat ttacggattt 9720
agcctatttg gattcaatcc cttcaaactc catactacat ccaaggtgga agtgacttaa 9780
actctgatat caatcatcag gcttgtaata taggctttgt taatggcagg agagtctaat 9840
aaaactttct gttccttatc cttcatttaa atgaaaaact ttttattgaa aacaatcata 9900
actctagctc atcataaata taattcatga ggacatttta ttatttttat attaaagaaa 9960
taatattata gatgtaaact ttgcaccttt ctaattatta tcatgagtta agctaatact 10020
tgtcttctgg tccctagatg atgattcttt tttgccttac tggaggagcc cttgtcttga 10080
agtgagttgc ttcaacagca gaggacttct agtttctccc agttgagcct aaagtgaact 10140
tttcatcttc ttcagaggaa ggggcttcct tgatttgtac ttttgtggct cctcagataa 10200
cacagacaat tttatcttgg atcccaggtt ctcttcacca ttaagaataa gaaagagaga 10260
aaatgctgtg catgacagcc acctactcca aactacccaa ccccctgtaa ccaggtacct 10320
tccaacaacg agatgattct gccctcactc aagagtctcc cccacaaaga ttccattctc 10380
cctttacttt ttatttttta tttttttgca aaacaaaggc ctcctttagt acctccttag 10440
tttataaccc ttatcttccc agctcttccc ttcactgata cctctgattt caaaagttct 10500
gaagtcggaa gaccacacaa tttcagactg tgaacagaaa ttcagtcaga aattattgga 10560
gttagaaaga atttagagaa attgtattga actagaaagt cctctttgat tagtggtggc 10620
cctttagaaa gttctagggc agagtcccat ggtgtttcat cttttccatg atttgcttga 10680
ccaaaaactt ttctttcaca acatgaaaat atgctaatcc accacacatt ttggattctg 10740
ctctgtttgc ctgaggtgtt agatctctag ccaggactgt gaagggaagg aacttgaatc 10800
cttcctattg agctattaat gcagagtcag tgagatgaag ggttccactc ggggtcaaaa 10860
tcatgtcagt taccaagcaa aggagcaagt aaggggaaac atctcctcat ctggttagtg 10920
gagccacatt tcacccactg atcaagccag acctgagcaa tagtctagat ttctccctcc 10980
acatctaatt ggtgacaatg gtgattgtac cactgacagg tcacacaagt ccacaccctc 11040
cttcacagcc tcactgcttc ggctcttgtt tatgctctta tcagcatctg tcactaggat 11100
ccatttgtct cctgactcat ctacttcctt tcactcccct gggccatcct tcacatcatg 11160
ctagaagact gtgtctaact tgcagaactt attgtgtcaa tctcctggtt acggcccctc 11220
catggctccc cacctgacat aggaggccct tcacaatctg gcttctgtca ctcataactt 11280
gtctccagcc ttcattcttc agtctgattt tatggttttc tagttcccca atacaccaca 11340
ccagtgttta tgaaacccta gtcattggcc tacgaacttt atgattattg acatattcat 11400
gtaccacctg tattattttt tgcatagtgt ttatttttaa ttgactacat ttttaactac 11460
aataaagtaa tttcaactaa aa 11482
<![CDATA[<210> 154]]>
<![CDATA[<211> 611]]>
<![CDATA[<212>]]> PRT
<![CDATA[<213> 人類]]>
<![CDATA[<400> 154]]>
Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
1 5 10 15
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
20 25 30
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys
35 40 45
Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg
50 55 60
Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val
65 70 75 80
Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val
85 90 95
Glu Ser Asn Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala
100 105 110
Tyr Val Ser Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly
115 120 125
Ala Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His
130 135 140
Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly
145 150 155 160
Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys
165 170 175
Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met
180 185 190
Ala Ser Phe Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu
195 200 205
Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu
210 215 220
Leu Val Val Gly Ile Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys
225 230 235 240
Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu
245 250 255
Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn Pro
260 265 270
Pro Pro Glu Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val
275 280 285
Ile Ser Ser Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser
290 295 300
Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gln
305 310 315 320
Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser
325 330 335
Glu Ser His Ser Val Ile Val Met Ser Ser Val Glu Asn Ser Arg His
340 345 350
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly
355 360 365
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp
370 375 380
Ser Tyr Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr
385 390 395 400
Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro
405 410 415
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val
420 425 430
Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro
435 440 445
Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His
450 455 460
His Pro Gln Gln Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser
465 470 475 480
Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr
485 490 495
Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu
500 505 510
Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala
515 520 525
Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser
530 535 540
Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu
545 550 555 560
Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe
565 570 575
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln
580 585 590
Glu Glu Ile Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro
595 600 605
Ile Ala Val
610
<![CDATA[<210> 155]]>
<![CDATA[<211> 11080]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 155]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttgtgcaa gtgcccaaat gagtttactg gtgatcgctg ccaaaactac gtaatggcca 180
gcttctacaa gcatcttggg attgaattta tggaggcgga ggagctgtac cagaagagag 240
tgctgaccat aaccggcatc tgcatcgccc tccttgtggt cggcatcatg tgtgtggtgg 300
cctactgcaa aaccaagaaa cagcggaaaa agctgcatga ccgtcttcgg cagagccttc 360
ggtctgaacg aaacaatatg atgaacattg ccaatgggcc tcaccatcct aacccacccc 420
ccgagaatgt ccagctggtg aatcaatacg tatctaaaaa cgtcatctcc agtgagcata 480
ttgttgagag agaagcagag acatcctttt ccaccagtca ctatacttcc acagcccatc 540
actccactac tgtcacccag actcctagcc acagctggag caacggacac actgaaagca 600
tcctttccga aagccactct gtaatcgtga tgtcatccgt agaaaacagt aggcacagca 660
gcccaactgg gggcccaaga ggacgtctta atggcacagg aggccctcgt gaatgtaaca 720
gcttcctcag gcatgccaga gaaacccctg attcctaccg agactctcct catagtgaaa 780
ggtatgtgtc agccatgacc accccggctc gtatgtcacc tgtagatttc cacacgccaa 840
gctcccccaa atcgccccct tcggaaatgt ctccacccgt gtccagcatg acggtgtcca 900
tgccttccat ggcggtcagc cccttcatgg aagaagagag acctctactt ctcgtgacac 960
caccaaggct gcgggagaag aagtttgacc atcaccctca gcagttcagc tccttccacc 1020
acaaccccgc gcatgacagt aacagcctcc ctgctagccc cttgaggata gtggaggatg 1080
aggagtatga aacgacccaa gagtacgagc cagcccaaga gcctgttaag aaactcgcca 1140
atagccggcg ggccaaaaga accaagccca atggccacat tgctaacaga ttggaagtgg 1200
acagcaacac aagctcccag agcagtaact cagagagtga aacagaagat gaaagagtag 1260
gtgaagatac gcctttcctg ggcatacaga accccctggc agccagtctt gaggcaacac 1320
ctgccttccg cctggctgac agcaggacta acccagcagg ccgcttctcg acacaggaag 1380
aaatccaggc caggctgtct agtgtaattg ctaaccaaga ccctattgct gtataaaacc 1440
taaataaaca catagattca cctgtaaaac tttattttat ataataaagt attccacctt 1500
aaattaaaca atttatttta ttttagcagt tctgcaaata gaaaacagga aaaaaacttt 1560
tataaattaa atatatgtat gtaaaaatgt gttatgtgcc atatgtagca attttttaca 1620
gtatttcaaa acgagaaaga tatcaatggt gcctttatgt tatgttatgt cgagagcaag 1680
ttttgtacag ttacagtgat tgcttttcca cagtatttct gcaaaacctc tcatagattc 1740
agtttttgct ggcttcttgt gcattgcatt atgatgttga ctggatgtat gatttgcaag 1800
acttgcaact gtccctctgt ttgcttgtag tagcacccga tcagtatgtc ttgtaatggc 1860
acatccatcc agatatgcct ctcttgtgta tgaagttttc tttgctttca gaatatgaaa 1920
tgagttgtgt ctactctgcc agccaaaggt ttgcctcatt gggctctgag ataatagtag 1980
atccaacagc atgctactat taaatacagc aagaaactgc attaagtaat gttaaatatt 2040
aggaagaaag taatactgtg atttaaaaaa aactatatta ttaatcagaa gacagcttgc 2100
tcttactaaa aggagctctc atttacttta tttgatttta tttttcttga caaaaagcaa 2160
cagttttagg gatagcttag aaaatgggtt ctggcttgct atcagggtaa atctaacacc 2220
ttacaagagg actgagtgtc actttctctc tgggggaatg atccagcagc ttatctagtt 2280
gacaatcaaa acacggctga taaaggtgca atcatttctg acatgtattt ttcactgatt 2340
ttgaagctag tgattggttg tgtcttcttg gctcaaaaag aagcatatta cggcacaaaa 2400
agcccagccc agacagcaca tgcagcattt tgtctgaaat acttctagag tcaaacgtgc 2460
ctgctgtaca tagcgatgac ttgtcatcat agggaagtat ttccatcgta gagtgttcag 2520
aaggagtgac tgtataggtg gagagaagct tagtgactcc gttgaaattt taaaatgtgg 2580
atgaccaccc ctttctcccc cttatttttc ttttatcttt ccatgttgcc ttgatcaggt 2640
cataactatg catgaacatt ttttatcagg aatggccgat gtgtatgtga tttgtaatca 2700
caagtaatga ttcatcagga aatgtcaatc ctgttggaaa gattgcacct ttacttgcag 2760
aagtgacccc cacctgtgtc ctgacctctc catttacagg ctctctcacc catttccccc 2820
acctccttta atttttgctt tactgtcata aagtaggact aagattggtc taagcattgc 2880
atgttctttt gtgatggtaa atccaaagga aggcctataa gtattaacat ttgaaataac 2940
tgctaattca ggaaaatgga agaaaaaaaa ttatttgaaa cacagaaccc atttcatggc 3000
ctgcctgata tctgtgaaat cagggctgga gctttactta ggattcacat ggcctcctag 3060
gaaccatggg acaaatggga aacaggttat cgggggattc atgaagtcag tgagagtaat 3120
tgcttctttt ttgcgggtga actgaatgta tttcttcacc aaatcttgat gttaacaatt 3180
aaaaagaaga aatgacatgc aagtaggtct tagcagaaaa atgcaggctg ggcatgagtc 3240
atgttgttac cctcccacat gctcctacaa tccacagaga tgcctgtctg caggttcttg 3300
aagttattgt tagtatttgg tatctcaaat ttttcgtcac tgttcacatg ccactttctc 3360
tgtgcacagt ggtatcctca tttgcttttt aacctacact gaggagtctt tgtcaggttg 3420
cactgatttt ccaattctgc agtaatgagt aagctcacgg catggggaag aagacagtca 3480
gtccaatgaa gttctctaaa ttattttaac attgcctttg aaggccttga ctcatcctta 3540
gctatttcaa tgaagaaatt cctaccatga atttaaaacc ctaaaaattc tgtttcaaat 3600
tctttgggca ttggggtact cagatatccc attgtggaag aattttaaga ataaatagaa 3660
gtttctgttg agaaccatga gcaacatgtt tcttacaatg agaattgcta tgcattttaa 3720
aattgcaaat atatatgaaa attgaagaca agaggaaatt gtatttctaa cttgattctg 3780
atcactcaca gaggtggcat attattatag ttgggacatc ctttgcaccc ttcataaaaa 3840
aggccagctg actgctcagc atcacctgcc aaggccacta gatttgtgtt tacaggggta 3900
tctctgtgat gcttgtcaca tcactcttga ccacctctgt taataaattc cgacagtgca 3960
gtggcgatcg gagtgtgaac ttatgttccc agcatatgga aagctatctt aggttttaag 4020
gtagtagaaa ttgcccagga gtttgacagc aactttgttt cccgggtcta aaatcgtatc 4080
ccactgaggt gtatgcagtg gagcataata catgcaaata catgcaaaac tccttttgtt 4140
tcacctaaga ttcactttct atcttacttt cccttcctgc ctagtgtgac ttttgccccc 4200
aagagtgcct ggacagcatt ctagtttcta caaaatggtc ctctgtgtag gtgaatgtgt 4260
cccaaacctg ctatcacttt cttgtttcag tgtgactgtc ttgttagagg tgaagtttat 4320
ccagggtaac ttgctcacta actattcctt tttatggcct ggggttaaag ggcgcatggc 4380
tcacactggt gaaaataagg aaggcctggt cttatcttgt attaataata ctggctgcat 4440
tccaccagcc agagatttct atctgcgaag acctatgaaa cactgaagag aaatgtaggc 4500
agaaggaaat ggccacatat cacaagttct attatatatt cttttgtaaa tacatattgt 4560
atattacttg gatgttttct tatatcattt actgtctttt tgagttaatg tcagttttta 4620
ctctctcaac ttactatgta acattgtaaa taacataatg tcctttatta tttatattta 4680
agcatctaac atatagagtt gttttcatat aagtttaaga taaatgtcaa aaatatatgt 4740
tcttttgttt ttctttgctt taaaattatg tatcttttcc ttttcttttt tttaagaata 4800
atttattgtt caggagaaag aatgtatatg taactgaaac tatctgaaga atgcacattg 4860
aaggccgtga ggtactgata aactaaagaa tttattattc aaaatactaa gcaataagta 4920
attgtgattt atttaaagtt ttgtccattt tccatgaaag acatactgca ataaaaatgc 4980
tactctgtgg agacctggga gtgttgctca gcagactaca gcttcagtct gttagaccag 5040
cacctttcat ctcattccca tagttatgct aatttaggat tgtgttccat ggaccccatg 5100
atcaccttgt ctatacgttg cttcttgtct gtccattgct tttgccacac cacctgttct 5160
caaatcatct cctccctacc aatgctgttt atcactttct tccttgttga agaggccaca 5220
caaccagaca gtactatgct tcctttttcc tccatacaca ataacagaga gagaatattc 5280
tagggcatga ctgcctggat cctggctgtt gctatctttt gtagtggcag taagaaactc 5340
cttcagacta atgaaaatgt caacgtgcca ttcaatcacg aaaggtaacg aaaaatgctc 5400
tcatggttca aatagtccaa tggcccatag tggcctaaaa ggcagccagt tgacacctgg 5460
ccatgctaag cttccttata ccatccgcta atgactttcc attgggccca caatttacgg 5520
attcataatt ttaaaagagg agaaggccaa gttaggttca ttccccttat tctgtcaata 5580
aaacaaatca aactcatgtc tatctaactg ctcagggagg agcctttgca tgagaaaatt 5640
ctcatattct aagactgagt catagaaatg agggtattac ttttcttact gcaattaacc 5700
taaacaaaag ccatatttta acaaatagat atttgcatgg tacccttcat atattccaag 5760
catttcactc attattccag gtaggttaag agcttctgaa gtgtatgaag taaaggtcag 5820
caatcctttg gggtgaacag tggcctcctt tggagtttgg gggtaacctg agacttccca 5880
ccaatgtcca cctccatctg tgtacctaat tcctattacc tagttatggc tcctctagga 5940
tcatttccaa acactctgga tgtccaggaa atttaaattg tagcttttga ctgagctagt 6000
ttttcctatt tatattaata aattttcaaa aatgcttgaa atcttcacat ttgcaacaac 6060
tttagttttc atgcacatac aaacacagag agacaaaaat tccaaacaga cactctccaa 6120
aagccaccac actctttcac ttgctctata gtcatttagc caaccagcca tgcagagaat 6180
atttaaaact taaagattga gacattattc tcagtttttg ctgaggcttt gtaacgaaat 6240
tgaacactat aagcagctat tgtagtaatt ttggttaaaa ttgtttgcct gggatatagt 6300
atttgaggca gaagcacgtg tgtgaaggag gtgaggtggt ttggaaagag tgaagactcg 6360
cagccagatt gaatgtctgg ataattacta taattctccc ttcttggttg aaaccatgtt 6420
ctctcttgat ttttaaaccc aggctgcctc tggaaacaag caaacctgag tctttctaac 6480
ctgagtcttc ccaatcatta gatttctttt ctgtcctaac gatgaatgat aaaaggactt 6540
gatgttcaca atttggggtt ataaggcagg tctgaaatct ggagactcaa gatgctggaa 6600
ggagtggaaa gtttcgatga ctttatatga atcactttgc actctatgtt tggcttgtcc 6660
tctttgaaac tgatttacta aaataaatgt aagggaacta ttactccaaa agattaactt 6720
ggcaggaaat accaatactt tcagtttatg aaagacaaaa ctgtcttgtt gctacaggaa 6780
gctgcaatgt tcctaacctt taaggttggt gttgaatagg gtggtcatgc cctcccctgc 6840
aggtatcttt aggctcctgt tgacctcctg gtactataac tgttcgtctt ctctgggtag 6900
ctattgattt tgaactttaa catgcttcaa aactttattc atcagggaaa taggaaaaga 6960
gttttgttac ctggaggaaa tctattgtga tctacctgag ctttttaaaa acagaccagg 7020
agaaggaaac cagtaatttt taaagaagag acagagaatg ggataatagt ttcacccagg 7080
atctctttct aaccctttcc cttcaaatga acttattgga acagaattgg aaagaagaaa 7140
ggacatctct gcccacccca caggatgcca aaaaggctaa agaattacct ctgtagattt 7200
aaacatcttt aatggcttat gtatagattt gctaatacag agagaaatga actattaaat 7260
aaaaatcaca ttttataata tttttatggc ttaaaacatc ctttatctcc tttttgttct 7320
ctctacatga tatggtaagt gatgaggaaa atttaggctc aggaaggtta aaatctttct 7380
tggagttaca catctaagag agctgcagag ctgacacttg tacccaggtt ttctgactgc 7440
aaatccagtt tctttctatt gcgttcttcc cctttccctg cctcaagcag aaacaggttt 7500
tttattttca acctttatgt atacagtatg ttatgttaca tctacagcta agtttctttt 7560
tagaagaatg tgagcccttc tagctttggt ttagagtgat tctagaagcc aatttccttg 7620
gcttagtgat tctatgcacc tttcctaaac ttagctttct aaggaaatga agtgtacgag 7680
tgagaatgaa ttcacaattt cgacatgtag gtagcatcct aaagtgaaaa gaggaggaaa 7740
tttgtggtca aagcactctc cccaccactt agaaacttac tgactgtggg cagcttcctc 7800
ctccaagttt ccttcctgat ttacaagacc gtggtgtggt caggattaaa cttgaataca 7860
tgtaaggaag cctgaaagtg tctaacacat agcgagtatt caaatgccac cttctatttg 7920
atccttcccc tccagttcct taagttttgg aatctaggtt tctcagttcc aaatggattg 7980
acatttgcat atccccattg cacaatggat caaataaact ttatgttatc atttctccaa 8040
catagtgcca gtaagcaaat cctttttaat aacaacagta tgttgagaaa catatcacca 8100
aataatattt aactttgtag ctttgataag ttctttaggt tttggttttg gttttgtttt 8160
ctgagacagg gtcttgatct gtcacccaga ctggagtgca atggttcaat tttagctcac 8220
tgcaacctgt aactcctgag ctcaagtgat cctcccacct cagcctctca agtagctggg 8280
actacaggtg tgctccacca tgcccagcga tttttttttt tttttttttt tttttttttg 8340
gtagggacaa ggtctcgcta tgttgcccag gctggtcttc aattcctggc cttaagtgat 8400
cctcctgcct cggcctccca aagtgctgga attacaggca tgagccacca cccataactt 8460
tatgtttgtt tttttgatgc agtataagtt cagcttgctt cttatgcagc cataccattt 8520
catgttaact ctgattttta gcagcttatt acattagtgt tttattatta atataatttt 8580
acagaaattt actaaaccat gactctgtag agttttaata atactacctc caaacatcat 8640
tgcaaacatc tagaagaatg aacaaaaatg atcttagatc gacagtatat ctgtttgtct 8700
tagtttctac acaggatgtt cagacatatt ccatttcttt aaaaaaaaaa tatatatata 8760
tatatatata tataggcctg gcacggtggc tcatgactgt aatcccagca ctttgggagg 8820
cttaggcagg caaatcacct gaggtcagga gtttgagact ggcctgacca acatggtgaa 8880
acctcgtctc tattaaaatt acaaaaatta gccgggcgtg gtggcacatg cctgtaatcc 8940
cagctactcg ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt 9000
gagttgagat cacgccattg cactccagcc ttggtgacaa gagtgaaact ccgtctcaaa 9060
aaaaaaaaaa atatgtatat atatatatat atatatatat atatatatat atatatatat 9120
ataaaatccc accaaaagtc tgcagagtga ccaaattaga cggctctggt ttcagattaa 9180
attctaaatg tgagaaacca catagctccc atgatccatc caataattcc tcacgtcctc 9240
ttcactcttt actccatgca taaaacagaa tttttttttc tcatcctggg tatgaagcaa 9300
ttaataattt acggatttag cctatttgga ttcaatccct tcaaactcca tactacatcc 9360
aaggtggaag tgacttaaac tctgatatca atcatcaggc ttgtaatata ggctttgtta 9420
atggcaggag agtctaataa aactttctgt tccttatcct tcatttaaat gaaaaacttt 9480
ttattgaaaa caatcataac tctagctcat cataaatata attcatgagg acattttatt 9540
atttttatat taaagaaata atattataga tgtaaacttt gcacctttct aattattatc 9600
atgagttaag ctaatacttg tcttctggtc cctagatgat gattcttttt tgccttactg 9660
gaggagccct tgtcttgaag tgagttgctt caacagcaga ggacttctag tttctcccag 9720
ttgagcctaa agtgaacttt tcatcttctt cagaggaagg ggcttccttg atttgtactt 9780
ttgtggctcc tcagataaca cagacaattt tatcttggat cccaggttct cttcaccatt 9840
aagaataaga aagagagaaa atgctgtgca tgacagccac ctactccaaa ctacccaacc 9900
ccctgtaacc aggtaccttc caacaacgag atgattctgc cctcactcaa gagtctcccc 9960
cacaaagatt ccattctccc tttacttttt attttttatt tttttgcaaa acaaaggcct 10020
cctttagtac ctccttagtt tataaccctt atcttcccag ctcttccctt cactgatacc 10080
tctgatttca aaagttctga agtcggaaga ccacacaatt tcagactgtg aacagaaatt 10140
cagtcagaaa ttattggagt tagaaagaat ttagagaaat tgtattgaac tagaaagtcc 10200
tctttgatta gtggtggccc tttagaaagt tctagggcag agtcccatgg tgtttcatct 10260
tttccatgat ttgcttgacc aaaaactttt ctttcacaac atgaaaatat gctaatccac 10320
cacacatttt ggattctgct ctgtttgcct gaggtgttag atctctagcc aggactgtga 10380
agggaaggaa cttgaatcct tcctattgag ctattaatgc agagtcagtg agatgaaggg 10440
ttccactcgg ggtcaaaatc atgtcagtta ccaagcaaag gagcaagtaa ggggaaacat 10500
ctcctcatct ggttagtgga gccacatttc acccactgat caagccagac ctgagcaata 10560
gtctagattt ctccctccac atctaattgg tgacaatggt gattgtacca ctgacaggtc 10620
acacaagtcc acaccctcct tcacagcctc actgcttcgg ctcttgttta tgctcttatc 10680
agcatctgtc actaggatcc atttgtctcc tgactcatct acttcctttc actcccctgg 10740
gccatccttc acatcatgct agaagactgt gtctaacttg cagaacttat tgtgtcaatc 10800
tcctggttac ggcccctcca tggctcccca cctgacatag gaggcccttc acaatctggc 10860
ttctgtcact cataacttgt ctccagcctt cattcttcag tctgatttta tggttttcta 10920
gttccccaat acaccacacc agtgtttatg aaaccctagt cattggccta cgaactttat 10980
gattattgac atattcatgt accacctgta ttattttttg catagtgttt atttttaatt 11040
gactacattt ttaactacaa taaagtaatt tcaactaaaa 11080
<![CDATA[<210> 156]]>
<![CDATA[<211> 477]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人]]>類
<![CDATA[<400> 156]]>
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
1 5 10 15
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys
20 25 30
Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe
35 40 45
Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Lys His
50 55 60
Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln Lys Arg Val
65 70 75 80
Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val Gly Ile Met
85 90 95
Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys Lys Leu His
100 105 110
Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn
115 120 125
Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val Gln
130 135 140
Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Glu His Ile
145 150 155 160
Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser
165 170 175
Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser His Ser Trp
180 185 190
Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Val Ile
195 200 205
Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly Gly
210 215 220
Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser
225 230 235 240
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro
245 250 255
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser
260 265 270
Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu
275 280 285
Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met Ala
290 295 300
Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro
305 310 315 320
Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln Gln Phe Ser
325 330 335
Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala Ser
340 345 350
Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr
355 360 365
Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala
370 375 380
Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val Asp
385 390 395 400
Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp
405 410 415
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro Leu
420 425 430
Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg
435 440 445
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala Arg
450 455 460
Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
465 470 475
<![CDATA[<210> 157]]>
<![CDATA[<211> 11083]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 157]]>
catctacatc tacatccacc actgggacaa gccatcttgt aaaatgtgcg gagaaggaga 60
aaactttctg tgtgaatgga ggggagtgct tcatggtgaa agacctttca aacccctcga 120
gatacttgtg caagtgccca aatgagttta ctggtgatcg ctgccaaaac tacgtaatgg 180
ccagcttcta caagcatctt gggattgaat ttatggaggc ggaggagctg taccagaaga 240
gagtgctgac cataaccggc atctgcatcg ccctccttgt ggtcggcatc atgtgtgtgg 300
tggcctactg caaaaccaag aaacagcgga aaaagctgca tgaccgtctt cggcagagcc 360
ttcggtctga acgaaacaat atgatgaaca ttgccaatgg gcctcaccat cctaacccac 420
cccccgagaa tgtccagctg gtgaatcaat acgtatctaa aaacgtcatc tccagtgagc 480
atattgttga gagagaagca gagacatcct tttccaccag tcactatact tccacagccc 540
atcactccac tactgtcacc cagactccta gccacagctg gagcaacgga cacactgaaa 600
gcatcctttc cgaaagccac tctgtaatcg tgatgtcatc cgtagaaaac agtaggcaca 660
gcagcccaac tgggggccca agaggacgtc ttaatggcac aggaggccct cgtgaatgta 720
acagcttcct caggcatgcc agagaaaccc ctgattccta ccgagactct cctcatagtg 780
aaaggtatgt gtcagccatg accaccccgg ctcgtatgtc acctgtagat ttccacacgc 840
caagctcccc caaatcgccc ccttcggaaa tgtctccacc cgtgtccagc atgacggtgt 900
ccatgccttc catggcggtc agccccttca tggaagaaga gagacctcta cttctcgtga 960
caccaccaag gctgcgggag aagaagtttg accatcaccc tcagcagttc agctccttcc 1020
accacaaccc cgcgcatgac agtaacagcc tccctgctag ccccttgagg atagtggagg 1080
atgaggagta tgaaacgacc caagagtacg agccagccca agagcctgtt aagaaactcg 1140
ccaatagccg gcgggccaaa agaaccaagc ccaatggcca cattgctaac agattggaag 1200
tggacagcaa cacaagctcc cagagcagta actcagagag tgaaacagaa gatgaaagag 1260
taggtgaaga tacgcctttc ctgggcatac agaaccccct ggcagccagt cttgaggcaa 1320
cacctgcctt ccgcctggct gacagcagga ctaacccagc aggccgcttc tcgacacagg 1380
aagaaatcca ggccaggctg tctagtgtaa ttgctaacca agaccctatt gctgtataaa 1440
acctaaataa acacatagat tcacctgtaa aactttattt tatataataa agtattccac 1500
cttaaattaa acaatttatt ttattttagc agttctgcaa atagaaaaca ggaaaaaaac 1560
ttttataaat taaatatatg tatgtaaaaa tgtgttatgt gccatatgta gcaatttttt 1620
acagtatttc aaaacgagaa agatatcaat ggtgccttta tgttatgtta tgtcgagagc 1680
aagttttgta cagttacagt gattgctttt ccacagtatt tctgcaaaac ctctcataga 1740
ttcagttttt gctggcttct tgtgcattgc attatgatgt tgactggatg tatgatttgc 1800
aagacttgca actgtccctc tgtttgcttg tagtagcacc cgatcagtat gtcttgtaat 1860
ggcacatcca tccagatatg cctctcttgt gtatgaagtt ttctttgctt tcagaatatg 1920
aaatgagttg tgtctactct gccagccaaa ggtttgcctc attgggctct gagataatag 1980
tagatccaac agcatgctac tattaaatac agcaagaaac tgcattaagt aatgttaaat 2040
attaggaaga aagtaatact gtgatttaaa aaaaactata ttattaatca gaagacagct 2100
tgctcttact aaaaggagct ctcatttact ttatttgatt ttatttttct tgacaaaaag 2160
caacagtttt agggatagct tagaaaatgg gttctggctt gctatcaggg taaatctaac 2220
accttacaag aggactgagt gtcactttct ctctggggga atgatccagc agcttatcta 2280
gttgacaatc aaaacacggc tgataaaggt gcaatcattt ctgacatgta tttttcactg 2340
attttgaagc tagtgattgg ttgtgtcttc ttggctcaaa aagaagcata ttacggcaca 2400
aaaagcccag cccagacagc acatgcagca ttttgtctga aatacttcta gagtcaaacg 2460
tgcctgctgt acatagcgat gacttgtcat catagggaag tatttccatc gtagagtgtt 2520
cagaaggagt gactgtatag gtggagagaa gcttagtgac tccgttgaaa ttttaaaatg 2580
tggatgacca cccctttctc ccccttattt ttcttttatc tttccatgtt gccttgatca 2640
ggtcataact atgcatgaac attttttatc aggaatggcc gatgtgtatg tgatttgtaa 2700
tcacaagtaa tgattcatca ggaaatgtca atcctgttgg aaagattgca cctttacttg 2760
cagaagtgac ccccacctgt gtcctgacct ctccatttac aggctctctc acccatttcc 2820
cccacctcct ttaatttttg ctttactgtc ataaagtagg actaagattg gtctaagcat 2880
tgcatgttct tttgtgatgg taaatccaaa ggaaggccta taagtattaa catttgaaat 2940
aactgctaat tcaggaaaat ggaagaaaaa aaattatttg aaacacagaa cccatttcat 3000
ggcctgcctg atatctgtga aatcagggct ggagctttac ttaggattca catggcctcc 3060
taggaaccat gggacaaatg ggaaacaggt tatcggggga ttcatgaagt cagtgagagt 3120
aattgcttct tttttgcggg tgaactgaat gtatttcttc accaaatctt gatgttaaca 3180
attaaaaaga agaaatgaca tgcaagtagg tcttagcaga aaaatgcagg ctgggcatga 3240
gtcatgttgt taccctccca catgctccta caatccacag agatgcctgt ctgcaggttc 3300
ttgaagttat tgttagtatt tggtatctca aatttttcgt cactgttcac atgccacttt 3360
ctctgtgcac agtggtatcc tcatttgctt tttaacctac actgaggagt ctttgtcagg 3420
ttgcactgat tttccaattc tgcagtaatg agtaagctca cggcatgggg aagaagacag 3480
tcagtccaat gaagttctct aaattatttt aacattgcct ttgaaggcct tgactcatcc 3540
ttagctattt caatgaagaa attcctacca tgaatttaaa accctaaaaa ttctgtttca 3600
aattctttgg gcattggggt actcagatat cccattgtgg aagaatttta agaataaata 3660
gaagtttctg ttgagaacca tgagcaacat gtttcttaca atgagaattg ctatgcattt 3720
taaaattgca aatatatatg aaaattgaag acaagaggaa attgtatttc taacttgatt 3780
ctgatcactc acagaggtgg catattatta tagttgggac atcctttgca cccttcataa 3840
aaaaggccag ctgactgctc agcatcacct gccaaggcca ctagatttgt gtttacaggg 3900
gtatctctgt gatgcttgtc acatcactct tgaccacctc tgttaataaa ttccgacagt 3960
gcagtggcga tcggagtgtg aacttatgtt cccagcatat ggaaagctat cttaggtttt 4020
aaggtagtag aaattgccca ggagtttgac agcaactttg tttcccgggt ctaaaatcgt 4080
atcccactga ggtgtatgca gtggagcata atacatgcaa atacatgcaa aactcctttt 4140
gtttcaccta agattcactt tctatcttac tttcccttcc tgcctagtgt gacttttgcc 4200
cccaagagtg cctggacagc attctagttt ctacaaaatg gtcctctgtg taggtgaatg 4260
tgtcccaaac ctgctatcac tttcttgttt cagtgtgact gtcttgttag aggtgaagtt 4320
tatccagggt aacttgctca ctaactattc ctttttatgg cctggggtta aagggcgcat 4380
ggctcacact ggtgaaaata aggaaggcct ggtcttatct tgtattaata atactggctg 4440
cattccacca gccagagatt tctatctgcg aagacctatg aaacactgaa gagaaatgta 4500
ggcagaagga aatggccaca tatcacaagt tctattatat attcttttgt aaatacatat 4560
tgtatattac ttggatgttt tcttatatca tttactgtct ttttgagtta atgtcagttt 4620
ttactctctc aacttactat gtaacattgt aaataacata atgtccttta ttatttatat 4680
ttaagcatct aacatataga gttgttttca tataagttta agataaatgt caaaaatata 4740
tgttcttttg tttttctttg ctttaaaatt atgtatcttt tccttttctt ttttttaaga 4800
ataatttatt gttcaggaga aagaatgtat atgtaactga aactatctga agaatgcaca 4860
ttgaaggccg tgaggtactg ataaactaaa gaatttatta ttcaaaatac taagcaataa 4920
gtaattgtga tttatttaaa gttttgtcca ttttccatga aagacatact gcaataaaaa 4980
tgctactctg tggagacctg ggagtgttgc tcagcagact acagcttcag tctgttagac 5040
cagcaccttt catctcattc ccatagttat gctaatttag gattgtgttc catggacccc 5100
atgatcacct tgtctatacg ttgcttcttg tctgtccatt gcttttgcca caccacctgt 5160
tctcaaatca tctcctccct accaatgctg tttatcactt tcttccttgt tgaagaggcc 5220
acacaaccag acagtactat gcttcctttt tcctccatac acaataacag agagagaata 5280
ttctagggca tgactgcctg gatcctggct gttgctatct tttgtagtgg cagtaagaaa 5340
ctccttcaga ctaatgaaaa tgtcaacgtg ccattcaatc acgaaaggta acgaaaaatg 5400
ctctcatggt tcaaatagtc caatggccca tagtggccta aaaggcagcc agttgacacc 5460
tggccatgct aagcttcctt ataccatccg ctaatgactt tccattgggc ccacaattta 5520
cggattcata attttaaaag aggagaaggc caagttaggt tcattcccct tattctgtca 5580
ataaaacaaa tcaaactcat gtctatctaa ctgctcaggg aggagccttt gcatgagaaa 5640
attctcatat tctaagactg agtcatagaa atgagggtat tacttttctt actgcaatta 5700
acctaaacaa aagccatatt ttaacaaata gatatttgca tggtaccctt catatattcc 5760
aagcatttca ctcattattc caggtaggtt aagagcttct gaagtgtatg aagtaaaggt 5820
cagcaatcct ttggggtgaa cagtggcctc ctttggagtt tgggggtaac ctgagacttc 5880
ccaccaatgt ccacctccat ctgtgtacct aattcctatt acctagttat ggctcctcta 5940
ggatcatttc caaacactct ggatgtccag gaaatttaaa ttgtagcttt tgactgagct 6000
agtttttcct atttatatta ataaattttc aaaaatgctt gaaatcttca catttgcaac 6060
aactttagtt ttcatgcaca tacaaacaca gagagacaaa aattccaaac agacactctc 6120
caaaagccac cacactcttt cacttgctct atagtcattt agccaaccag ccatgcagag 6180
aatatttaaa acttaaagat tgagacatta ttctcagttt ttgctgaggc tttgtaacga 6240
aattgaacac tataagcagc tattgtagta attttggtta aaattgtttg cctgggatat 6300
agtatttgag gcagaagcac gtgtgtgaag gaggtgaggt ggtttggaaa gagtgaagac 6360
tcgcagccag attgaatgtc tggataatta ctataattct cccttcttgg ttgaaaccat 6420
gttctctctt gatttttaaa cccaggctgc ctctggaaac aagcaaacct gagtctttct 6480
aacctgagtc ttcccaatca ttagatttct tttctgtcct aacgatgaat gataaaagga 6540
cttgatgttc acaatttggg gttataaggc aggtctgaaa tctggagact caagatgctg 6600
gaaggagtgg aaagtttcga tgactttata tgaatcactt tgcactctat gtttggcttg 6660
tcctctttga aactgattta ctaaaataaa tgtaagggaa ctattactcc aaaagattaa 6720
cttggcagga aataccaata ctttcagttt atgaaagaca aaactgtctt gttgctacag 6780
gaagctgcaa tgttcctaac ctttaaggtt ggtgttgaat agggtggtca tgccctcccc 6840
tgcaggtatc tttaggctcc tgttgacctc ctggtactat aactgttcgt cttctctggg 6900
tagctattga ttttgaactt taacatgctt caaaacttta ttcatcaggg aaataggaaa 6960
agagttttgt tacctggagg aaatctattg tgatctacct gagcttttta aaaacagacc 7020
aggagaagga aaccagtaat ttttaaagaa gagacagaga atgggataat agtttcaccc 7080
aggatctctt tctaaccctt tcccttcaaa tgaacttatt ggaacagaat tggaaagaag 7140
aaaggacatc tctgcccacc ccacaggatg ccaaaaaggc taaagaatta cctctgtaga 7200
tttaaacatc tttaatggct tatgtataga tttgctaata cagagagaaa tgaactatta 7260
aataaaaatc acattttata atatttttat ggcttaaaac atcctttatc tcctttttgt 7320
tctctctaca tgatatggta agtgatgagg aaaatttagg ctcaggaagg ttaaaatctt 7380
tcttggagtt acacatctaa gagagctgca gagctgacac ttgtacccag gttttctgac 7440
tgcaaatcca gtttctttct attgcgttct tcccctttcc ctgcctcaag cagaaacagg 7500
ttttttattt tcaaccttta tgtatacagt atgttatgtt acatctacag ctaagtttct 7560
ttttagaaga atgtgagccc ttctagcttt ggtttagagt gattctagaa gccaatttcc 7620
ttggcttagt gattctatgc acctttccta aacttagctt tctaaggaaa tgaagtgtac 7680
gagtgagaat gaattcacaa tttcgacatg taggtagcat cctaaagtga aaagaggagg 7740
aaatttgtgg tcaaagcact ctccccacca cttagaaact tactgactgt gggcagcttc 7800
ctcctccaag tttccttcct gatttacaag accgtggtgt ggtcaggatt aaacttgaat 7860
acatgtaagg aagcctgaaa gtgtctaaca catagcgagt attcaaatgc caccttctat 7920
ttgatccttc ccctccagtt ccttaagttt tggaatctag gtttctcagt tccaaatgga 7980
ttgacatttg catatcccca ttgcacaatg gatcaaataa actttatgtt atcatttctc 8040
caacatagtg ccagtaagca aatccttttt aataacaaca gtatgttgag aaacatatca 8100
ccaaataata tttaactttg tagctttgat aagttcttta ggttttggtt ttggttttgt 8160
tttctgagac agggtcttga tctgtcaccc agactggagt gcaatggttc aattttagct 8220
cactgcaacc tgtaactcct gagctcaagt gatcctccca cctcagcctc tcaagtagct 8280
gggactacag gtgtgctcca ccatgcccag cgattttttt tttttttttt tttttttttt 8340
ttggtaggga caaggtctcg ctatgttgcc caggctggtc ttcaattcct ggccttaagt 8400
gatcctcctg cctcggcctc ccaaagtgct ggaattacag gcatgagcca ccacccataa 8460
ctttatgttt gtttttttga tgcagtataa gttcagcttg cttcttatgc agccatacca 8520
tttcatgtta actctgattt ttagcagctt attacattag tgttttatta ttaatataat 8580
tttacagaaa tttactaaac catgactctg tagagtttta ataatactac ctccaaacat 8640
cattgcaaac atctagaaga atgaacaaaa atgatcttag atcgacagta tatctgtttg 8700
tcttagtttc tacacaggat gttcagacat attccatttc tttaaaaaaa aaatatatat 8760
atatatatat atatataggc ctggcacggt ggctcatgac tgtaatccca gcactttggg 8820
aggcttaggc aggcaaatca cctgaggtca ggagtttgag actggcctga ccaacatggt 8880
gaaacctcgt ctctattaaa attacaaaaa ttagccgggc gtggtggcac atgcctgtaa 8940
tcccagctac tcgggaggct gaggcaggag aatcacttga acctgggagg cagaggttgc 9000
agtgagttga gatcacgcca ttgcactcca gccttggtga caagagtgaa actccgtctc 9060
aaaaaaaaaa aaaatatgta tatatatata tatatatata tatatatata tatatatata 9120
tatataaaat cccaccaaaa gtctgcagag tgaccaaatt agacggctct ggtttcagat 9180
taaattctaa atgtgagaaa ccacatagct cccatgatcc atccaataat tcctcacgtc 9240
ctcttcactc tttactccat gcataaaaca gaattttttt ttctcatcct gggtatgaag 9300
caattaataa tttacggatt tagcctattt ggattcaatc ccttcaaact ccatactaca 9360
tccaaggtgg aagtgactta aactctgata tcaatcatca ggcttgtaat ataggctttg 9420
ttaatggcag gagagtctaa taaaactttc tgttccttat ccttcattta aatgaaaaac 9480
tttttattga aaacaatcat aactctagct catcataaat ataattcatg aggacatttt 9540
attattttta tattaaagaa ataatattat agatgtaaac tttgcacctt tctaattatt 9600
atcatgagtt aagctaatac ttgtcttctg gtccctagat gatgattctt ttttgcctta 9660
ctggaggagc ccttgtcttg aagtgagttg cttcaacagc agaggacttc tagtttctcc 9720
cagttgagcc taaagtgaac ttttcatctt cttcagagga aggggcttcc ttgatttgta 9780
cttttgtggc tcctcagata acacagacaa ttttatcttg gatcccaggt tctcttcacc 9840
attaagaata agaaagagag aaaatgctgt gcatgacagc cacctactcc aaactaccca 9900
accccctgta accaggtacc ttccaacaac gagatgattc tgccctcact caagagtctc 9960
ccccacaaag attccattct ccctttactt tttatttttt atttttttgc aaaacaaagg 10020
cctcctttag tacctcctta gtttataacc cttatcttcc cagctcttcc cttcactgat 10080
acctctgatt tcaaaagttc tgaagtcgga agaccacaca atttcagact gtgaacagaa 10140
attcagtcag aaattattgg agttagaaag aatttagaga aattgtattg aactagaaag 10200
tcctctttga ttagtggtgg ccctttagaa agttctaggg cagagtccca tggtgtttca 10260
tcttttccat gatttgcttg accaaaaact tttctttcac aacatgaaaa tatgctaatc 10320
caccacacat tttggattct gctctgtttg cctgaggtgt tagatctcta gccaggactg 10380
tgaagggaag gaacttgaat ccttcctatt gagctattaa tgcagagtca gtgagatgaa 10440
gggttccact cggggtcaaa atcatgtcag ttaccaagca aaggagcaag taaggggaaa 10500
catctcctca tctggttagt ggagccacat ttcacccact gatcaagcca gacctgagca 10560
atagtctaga tttctccctc cacatctaat tggtgacaat ggtgattgta ccactgacag 10620
gtcacacaag tccacaccct ccttcacagc ctcactgctt cggctcttgt ttatgctctt 10680
atcagcatct gtcactagga tccatttgtc tcctgactca tctacttcct ttcactcccc 10740
tgggccatcc ttcacatcat gctagaagac tgtgtctaac ttgcagaact tattgtgtca 10800
atctcctggt tacggcccct ccatggctcc ccacctgaca taggaggccc ttcacaatct 10860
ggcttctgtc actcataact tgtctccagc cttcattctt cagtctgatt ttatggtttt 10920
ctagttcccc aatacaccac accagtgttt atgaaaccct agtcattggc ctacgaactt 10980
tatgattatt gacatattca tgtaccacct gtattatttt ttgcatagtg tttattttta 11040
attgactaca tttttaacta caataaagta atttcaacta aaa 11083
<![CDATA[<210> 158]]>
<![CDATA[<211> 478]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 158]]>
Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala
1 5 10 15
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val
20 25 30
Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu
35 40 45
Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Lys
50 55 60
His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln Lys Arg
65 70 75 80
Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val Gly Ile
85 90 95
Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys Lys Leu
100 105 110
His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn Met Met
115 120 125
Asn Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val
130 135 140
Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Glu His
145 150 155 160
Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr
165 170 175
Ser Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser His Ser
180 185 190
Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Val
195 200 205
Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly
210 215 220
Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn
225 230 235 240
Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser
245 250 255
Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met
260 265 270
Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser
275 280 285
Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met
290 295 300
Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr
305 310 315 320
Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln Gln Phe
325 330 335
Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala
340 345 350
Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu
355 360 365
Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg
370 375 380
Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val
385 390 395 400
Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu
405 410 415
Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro
420 425 430
Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser
435 440 445
Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala
450 455 460
Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
465 470 475
<![CDATA[<210> 159]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 159]]>
000
<![CDATA[<210> 160]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 160]]>
000
<![CDATA[<210> 161]]>
<![CDATA[<211> 586]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 161]]>
agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60
cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120
tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180
accacagcct tgcctccccg attgaaagag atgaaaagcc aggaatcggc tgcaggttcc 240
aaactagtcc ttcggtgtga aaccagttct gaatactcct ctctcagatt caagtggttc 300
aagaatggga atgaattgaa tcgaaaaaac aaaccacaaa atatcaagat acaaaaaaag 360
ccagggaagt cagaacttcg cattaacaaa gcatcactgg ctgattctgg agagtatatg 420
tgcaaagtga tcagcaaatt aggaaatgac agtgcctctg ccaatatcac catcgtggaa 480
tcaaacgaga tcatcactgg tatgccagcc tcaactgaag gagcatatgt gtcttcagag 540
tctcccatta gaatatcagt atccacagaa ggagcaaata cttctt 586
<![CDATA[<210> 162]]>
<![CDATA[<211> 145]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 162]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr Ala Leu Pro Pro
1 5 10 15
Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu
20 25 30
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys
35 40 45
Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gln Asn
50 55 60
Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg Ile Asn Lys
65 70 75 80
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys
85 90 95
Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn
100 105 110
Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser
115 120 125
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr
130 135 140
Ser
145
<![CDATA[<210> 163]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 163]]>
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly
1 5 10 15
Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu
20 25 30
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val
35 40 45
Met Ala Ser Phe
50
<![CDATA[<210> 164]]>
<![CDATA[<211> 65]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 164]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcag 65
<![CDATA[<210> 165]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 165]]>
ccttgcctcc ccgattgaaa gagatgaa 28
<![CDATA[<210> 166]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 166]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcagccttg cctccccgat tgaaagagat gaa 93
<![CDATA[<210> ]]>167
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 167]]>
Ile Ile Ser Ile Ile Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile
1 5 10 15
Gly Phe Gly Ile Ser Ala Leu Pro Pro Arg Leu Lys Glu Met
20 25 30
<![CDATA[<210> 168]]>
<![CDATA[<211> 104]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 168]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggag 104
<![CDATA[<210> 169]]>
<![CDATA[<211> 65]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 169]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcag 65
<![CDATA[<210> 170]]>
<![CDATA[<211> 348]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 170]]>
ggagacactc catcacagtc actactgtcg cctcagctgg gaacattggg gaggatggaa 60
tcctgagctg cacttttgaa cctgacatca aactttctga tatcgtgata caatggctga 120
aggaaggtgt tttaggcttg gtccatgagt tcaaagaagg caaagatgag ctgtcggagc 180
aggatgaaat gttcagaggc cggacagcag tgtttgctga tcaagtgata gttggcaatg 240
cctctttgcg gctgaaaaac gtgcaactca cagatgctgg cacctacaaa tgttatatca 300
tcacttctaa aggcaagggg aatgctaacc ttgagtataa aactggag 348
<![CDATA[<210> 171]]>
<![CDATA[<211> 279]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 171]]>
ccttcagcat gccggaagtg aatgtggact ataatgccag ctcagagacc ttgcggtgtg 60
aggctccccg atggttcccc cagcccacag tggtctgggc atcccaagtt gaccagggag 120
ccaacttctc ggaagtctcc aataccagct ttgagctgaa ctctgagaat gtgaccatga 180
aggttgtgtc tgtgctctac aatgttacga tcaacaacac atactcctgt atgattgaaa 240
atgacattgc caaagcaaca ggggatatca aagtgacag 279
<![CDATA[<210> 172]]>
<![CDATA[<211> 170]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 172]]>
aatcggagat caaaaggcgg agtcacctac agctgctaaa ctcaaaggct tctctgtgtg 60
tctcttcttt ctttgccatc agctgggcac ttctgcctct cagcccttac ctgatgctaa 120
aataatgtgc ctcggccaca aaaaagcatg caaagtcatt gttacaacag 170
<![CDATA[<210> 173]]>
<![CDATA[<211> 1639]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 173]]>
ggatctacag aactatttca ccaccagata tgacctagtt ttatatttct gggaggaaat 60
gaattcatat ctagaagtct ggagtgagca aacaagagca agaaacaaaa agaagccaaa 120
agcagaaggc tccaatatga acaagataaa tctatcttca aagacatatt agaagttggg 180
aaaataattc atgtgaacta gacaagtgtg ttaagagtga taagtaaaat gcacgtggag 240
acaagtgcat ccccagatct cagggacctc cccctgcctg tcacctgggg agtgagagga 300
caggatagtg catgttcttt gtctctgaat ttttagttat atgtgctgta atgttgctct 360
gaggaagccc ctggaaagtc tatcccaaca tatccacatc ttatattcca caaattaagc 420
tgtagtatgt accctaagac gctgctaatt gactgccact tcgcaactca ggggcggctg 480
cattttagta atgggtcaaa tgattcactt tttatgatgc ttccaaaggt gccttggctt 540
ctcttcccaa ctgacaaatg ccaaagttga gaaaaatgat cataatttta gcataaacag 600
agcagtcggc gacaccgatt ttataaataa actgagcacc ttctttttaa acaaacaaat 660
gcgggtttat ttctcagatg atgttcatcc gtgaatggtc cagggaagga cctttcacct 720
tgtctatatg gcattatgtc atcacaagct ctgaggcttc tcctttccat cctgcgtgga 780
cagctaagac ctcagttttc aatagcatct agagcagtgg gactcagctg gggtgatttc 840
gccccccatc tccgggggaa tgtctgaaga caattttggt tacctcaatg agggagtgga 900
ggaggataca gtgctactac caactagtgg atagaggcca gggatgctgc tcaacctcct 960
accatgtaca ggacgtctcc ccattacaac tacccaatcc gaagtgtcaa ctgtgtcagg 1020
gctaagaaac cctggttttg agtagaaaag ggcctggaaa gaggggagcc aacaaatctg 1080
tctgcttcct cacattagtc attggcaaat aagcattctg tctctttggc tgctgcctca 1140
gcacagagag ccagaactct atcgggcacc aggataacat ctctcagtga acagagttga 1200
caaggcctat gggaaatgcc tgatgggatt atcttcagct tgttgagctt ctaagtttct 1260
ttcccttcat tctaccctgc aagccaagtt ctgtaagaga aatgcctgag ttctagctca 1320
ggttttctta ctctgaattt agatctccag accctgcctg gccacaattc aaattaaggc 1380
aacaaacata taccttccat gaagcacaca cagacttttg aaagcaagga caatgactgc 1440
ttgaattgag gccttgagga atgaagcttt gaaggaaaag aatactttgt ttccagcccc 1500
cttcccacac tcttcatgtg ttaaccactg ccttcctgga ccttggagcc acggtgactg 1560
tattacatgt tgttatagaa aactgatttt agagttctga tcgttcaaga gaatgattaa 1620
atatacattt cctacacca 1639
<![CDATA[<210> 174]]>
<![CDATA[<211> 2605]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 174]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggagcataat tagcatcatc 120
attattctgg ctggagcaat tgcactcatc attggctttg gtatttcagg gagacactcc 180
atcacagtca ctactgtcgc ctcagctggg aacattgggg aggatggaat cctgagctgc 240
acttttgaac ctgacatcaa actttctgat atcgtgatac aatggctgaa ggaaggtgtt 300
ttaggcttgg tccatgagtt caaagaaggc aaagatgagc tgtcggagca ggatgaaatg 360
ttcagaggcc ggacagcagt gtttgctgat caagtgatag ttggcaatgc ctctttgcgg 420
ctgaaaaacg tgcaactcac agatgctggc acctacaaat gttatatcat cacttctaaa 480
ggcaagggga atgctaacct tgagtataaa actggagcct tcagcatgcc ggaagtgaat 540
gtggactata atgccagctc agagaccttg cggtgtgagg ctccccgatg gttcccccag 600
cccacagtgg tctgggcatc ccaagttgac cagggagcca acttctcgga agtctccaat 660
accagctttg agctgaactc tgagaatgtg accatgaagg ttgtgtctgt gctctacaat 720
gttacgatca acaacacata ctcctgtatg attgaaaatg acattgccaa agcaacaggg 780
gatatcaaag tgacagaatc ggagatcaaa aggcggagtc acctacagct gctaaactca 840
aaggcttctc tgtgtgtctc ttctttcttt gccatcagct gggcacttct gcctctcagc 900
ccttacctga tgctaaaata atgtgcctcg gccacaaaaa agcatgcaaa gtcattgtta 960
caacagggat ctacagaact atttcaccac cagatatgac ctagttttat atttctggga 1020
ggaaatgaat tcatatctag aagtctggag tgagcaaaca agagcaagaa acaaaaagaa 1080
gccaaaagca gaaggctcca atatgaacaa gataaatcta tcttcaaaga catattagaa 1140
gttgggaaaa taattcatgt gaactagaca agtgtgttaa gagtgataag taaaatgcac 1200
gtggagacaa gtgcatcccc agatctcagg gacctccccc tgcctgtcac ctggggagtg 1260
agaggacagg atagtgcatg ttctttgtct ctgaattttt agttatatgt gctgtaatgt 1320
tgctctgagg aagcccctgg aaagtctatc ccaacatatc cacatcttat attccacaaa 1380
ttaagctgta gtatgtaccc taagacgctg ctaattgact gccacttcgc aactcagggg 1440
cggctgcatt ttagtaatgg gtcaaatgat tcacttttta tgatgcttcc aaaggtgcct 1500
tggcttctct tcccaactga caaatgccaa agttgagaaa aatgatcata attttagcat 1560
aaacagagca gtcggcgaca ccgattttat aaataaactg agcaccttct ttttaaacaa 1620
acaaatgcgg gtttatttct cagatgatgt tcatccgtga atggtccagg gaaggacctt 1680
tcaccttgtc tatatggcat tatgtcatca caagctctga ggcttctcct ttccatcctg 1740
cgtggacagc taagacctca gttttcaata gcatctagag cagtgggact cagctggggt 1800
gatttcgccc cccatctccg ggggaatgtc tgaagacaat tttggttacc tcaatgaggg 1860
agtggaggag gatacagtgc tactaccaac tagtggatag aggccaggga tgctgctcaa 1920
cctcctacca tgtacaggac gtctccccat tacaactacc caatccgaag tgtcaactgt 1980
gtcagggcta agaaaccctg gttttgagta gaaaagggcc tggaaagagg ggagccaaca 2040
aatctgtctg cttcctcaca ttagtcattg gcaaataagc attctgtctc tttggctgct 2100
gcctcagcac agagagccag aactctatcg ggcaccagga taacatctct cagtgaacag 2160
agttgacaag gcctatggga aatgcctgat gggattatct tcagcttgtt gagcttctaa 2220
gtttctttcc cttcattcta ccctgcaagc caagttctgt aagagaaatg cctgagttct 2280
agctcaggtt ttcttactct gaatttagat ctccagaccc tgcctggcca caattcaaat 2340
taaggcaaca aacatatacc ttccatgaag cacacacaga cttttgaaag caaggacaat 2400
gactgcttga attgaggcct tgaggaatga agctttgaag gaaaagaata ctttgtttcc 2460
agcccccttc ccacactctt catgtgttaa ccactgcctt cctggacctt ggagccacgg 2520
tgactgtatt acatgttgtt atagaaaact gattttagag ttctgatcgt tcaagagaat 2580
gattaaatat acatttccta cacca 2605
<![CDATA[<210> 175]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 175]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp
1 5 10
<![CDATA[<210> 176]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 176]]>
Ile Ile Ser Ile Ile Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile
1 5 10 15
Gly Phe Gly Ile Ser
20
<![CDATA[<210> 177]]>
<![CDATA[<211> 115]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 177]]>
Arg His Ser Ile Thr Val Thr Thr Val Ala Ser Ala Gly Asn Ile Gly
1 5 10 15
Glu Asp Gly Ile Leu Ser Cys Thr Phe Glu Pro Asp Ile Lys Leu Ser
20 25 30
Asp Ile Val Ile Gln Trp Leu Lys Glu Gly Val Leu Gly Leu Val His
35 40 45
Glu Phe Lys Glu Gly Lys Asp Glu Leu Ser Glu Gln Asp Glu Met Phe
50 55 60
Arg Gly Arg Thr Ala Val Phe Ala Asp Gln Val Ile Val Gly Asn Ala
65 70 75 80
Ser Leu Arg Leu Lys Asn Val Gln Leu Thr Asp Ala Gly Thr Tyr Lys
85 90 95
Cys Tyr Ile Ile Thr Ser Lys Gly Lys Gly Asn Ala Asn Leu Glu Tyr
100 105 110
Lys Thr Gly
115
<![CDATA[<210> 178]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 178]]>
Phe Ser Met Pro Glu Val Asn Val Asp Tyr Asn Ala Ser Ser Glu Thr
1 5 10 15
Leu Arg Cys Glu Ala Pro Arg Trp Phe Pro Gln Pro Thr Val Val Trp
20 25 30
Ala Ser Gln Val Asp Gln Gly Ala Asn Phe Ser Glu Val Ser Asn Thr
35 40 45
Ser Phe Glu Leu Asn Ser Glu Asn Val Thr Met Lys Val Val Ser Val
50 55 60
Leu Tyr Asn Val Thr Ile Asn Asn Thr Tyr Ser Cys Met Ile Glu Asn
65 70 75 80
Asp Ile Ala Lys Ala Thr Gly Asp Ile Lys Val Thr
85 90
<![CDATA[<210> 179]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 179]]>
Ser Glu Ile Lys Arg Arg Ser His Leu Gln Leu Leu Asn Ser Lys Ala
1 5 10 15
Ser Leu Cys Val Ser Ser Phe Phe Ala Ile Ser Trp Ala Leu Leu Pro
20 25 30
Leu Ser Pro Tyr Leu Met Leu Lys
35 40
<![CDATA[<210> 180]]>
<![CDATA[<211> 282]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 180]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp Ser Ile Ile Ser Ile Ile
1 5 10 15
Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile Gly Phe Gly Ile Ser
20 25 30
Gly Arg His Ser Ile Thr Val Thr Thr Val Ala Ser Ala Gly Asn Ile
35 40 45
Gly Glu Asp Gly Ile Leu Ser Cys Thr Phe Glu Pro Asp Ile Lys Leu
50 55 60
Ser Asp Ile Val Ile Gln Trp Leu Lys Glu Gly Val Leu Gly Leu Val
65 70 75 80
His Glu Phe Lys Glu Gly Lys Asp Glu Leu Ser Glu Gln Asp Glu Met
85 90 95
Phe Arg Gly Arg Thr Ala Val Phe Ala Asp Gln Val Ile Val Gly Asn
100 105 110
Ala Ser Leu Arg Leu Lys Asn Val Gln Leu Thr Asp Ala Gly Thr Tyr
115 120 125
Lys Cys Tyr Ile Ile Thr Ser Lys Gly Lys Gly Asn Ala Asn Leu Glu
130 135 140
Tyr Lys Thr Gly Ala Phe Ser Met Pro Glu Val Asn Val Asp Tyr Asn
145 150 155 160
Ala Ser Ser Glu Thr Leu Arg Cys Glu Ala Pro Arg Trp Phe Pro Gln
165 170 175
Pro Thr Val Val Trp Ala Ser Gln Val Asp Gln Gly Ala Asn Phe Ser
180 185 190
Glu Val Ser Asn Thr Ser Phe Glu Leu Asn Ser Glu Asn Val Thr Met
195 200 205
Lys Val Val Ser Val Leu Tyr Asn Val Thr Ile Asn Asn Thr Tyr Ser
210 215 220
Cys Met Ile Glu Asn Asp Ile Ala Lys Ala Thr Gly Asp Ile Lys Val
225 230 235 240
Thr Glu Ser Glu Ile Lys Arg Arg Ser His Leu Gln Leu Leu Asn Ser
245 250 255
Lys Ala Ser Leu Cys Val Ser Ser Phe Phe Ala Ile Ser Trp Ala Leu
260 265 270
Leu Pro Leu Ser Pro Tyr Leu Met Leu Lys
275 280
<![CDATA[<210> 181]]>
<![CDATA[<211> 169]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 181]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggagcataat tagcatcatc 120
attattctgg ctggagcaat tgcactcatc attggctttg gtatttcag 169
<![CDATA[<210> 182]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 182]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp Ser Ile Ile Ser Ile Ile
1 5 10 15
Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile Gly Phe Gly Ile Ser
20 25 30
<![CDATA[<210> 183]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 183]]>
000
<![CDATA[<210> 184]]>
<![CDATA[<211> 119]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 184]]>
ctggctggag attaatccgg acactggtgc catttccact cgggctgagc tggacaggga 60
ggattttgag cacgtgaaga acagcacgta cacagcccta atcatagcta cagacaatg 119
<![CDATA[<210> 185]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 185]]>
ccttgcctcc ccgattgaaa gagatgaaaa 30
<![CDATA[<210> 186]]>
<![CDATA[<211> 148]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 186]]>
ctggctggag attaatccgg acactggtgc catttccact cgggctgagc tggacaggga 60
ggattttgag cacgtgaaga acagcacgta cacagcccta atcatagcta cagacaatgc 120
cttgcctccc cgattgaaag agatgaaa 148
<![CDATA[<210> 187]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 187]]>
Trp Leu Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu
1 5 10 15
Leu Asp Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala
20 25 30
Leu Ile Ile Ala Thr Asp Asn Ala Leu Pro Pro Arg Leu Lys Glu Met
35 40 45
Lys
<![CDATA[<210> 188]]>
<![CDATA[<211> 172]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 188]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc ag 172
<![CDATA[<210> 189]]>
<![CDATA[<211> 115]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 189]]>
gtctcctctt ggctctgcca ggagccggag ccctgccacc ctggctttga cgccgagagc 60
tacacgttca cggtgccccg gcgccacctg gagagaggcc gcgtcctggg cagag 115
<![CDATA[<210> 190]]>
<![CDATA[<211> 224]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 190]]>
tgaattttga agattgcacc ggtcgacaaa ggacagccta tttttccctc gacacccgat 60
tcaaagtggg cacagatggt gtgattacag tcaaaaggcc tctacggttt cataacccac 120
agatccattt cttggtctac gcctgggact ccacctacag aaagttttcc accaaagtca 180
cgctgaatac agtggggcac caccaccgcc ccccgcccca tcag 224
<![CDATA[<210> 191]]>
<![CDATA[<211> 144]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 191]]>
gcctccgttt ctggaatcca agcagaattg ctcacatttc ccaactcctc tcctggcctc 60
agaagacaga agagagactg ggttattcct cccatcagct gcccagaaaa tgaaaaaggc 120
ccatttccta aaaacctggt tcag 144
<![CDATA[<210> 192]]>
<![CDATA[<211> 156]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 192]]>
atcaaatcca acaaagacaa agaaggcaag gttttctaca gcatcactgg ccaaggagct 60
gacacacccc ctgttggtgt ctttattatt gaaagagaaa caggatggct gaaggtgaca 120
gagcctctgg atagagaacg cattgccaca tacact 156
<![CDATA[<210> 193]]>
<![CDATA[<211> 145]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 193]]>
ctcttctctc acgctgtgtc atccaacggg aatgcagttg aggatccaat ggagattttg 60
atcacggtaa ccgatcagaa tgacaacaag cccgaattca cccaggaggt ctttaagggg 120
tctgtcatgg aaggtgctct tccag 145
<![CDATA[<210> 194]]>
<![CDATA[<211> 176]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 194]]>
gaacctctgt gatggaggtc acagccacag acgcggacga tgatgtgaac acctacaatg 60
ccgccatcgc ttacaccatc ctcagccaag atcctgagct ccctgacaaa aatatgttca 120
ccattaacag gaacacagga gtcatcagtg tggtcaccac tgggctggac cgagag 176
<![CDATA[<210> 195]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 195]]>
agtttcccta cgtataccct ggtggttcaa gctgctgacc ttcaaggtga ggggttaagc 60
acaacagcaa cagctgtgat cacagtcact gacaccaacg ataatcctcc gatcttcaat 120
cccaccacg 129
<![CDATA[<210> 196]]>
<![CDATA[<211> 250]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 196]]>
tacaagggtc aggtgcctga gaacgaggct aacgtcgtaa tcaccacact gaaagtgact 60
gatgctgatg cccccaatac cccagcgtgg gaggctgtat acaccatatt gaatgatgat 120
ggtggacaat ttgtcgtcac cacaaatcca gtgaacaacg atggcatttt gaaaacagca 180
aaggtttgta tggtacctgg caagatgcag aaactggcat cctcacagct gttcataccc 240
ttgtcccctg 250
<![CDATA[<210> 197]]>
<![CDATA[<211> 245]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 197]]>
ggcttggatt ttgaggccaa gcagcagtac attctacacg tagcagtgac gaatgtggta 60
ccttttgagg tctctctcac cacctccaca gccaccgtca ccgtggatgt gctggatgtg 120
aatgaagccc ccatctttgt gcctcctgaa aagagagtgg aagtgtccga ggactttggc 180
gtgggccagg aaatcacatc ctacactgcc caggagccag acacatttat ggaacagaaa 240
ataac 245
<![CDATA[<210> 198]]>
<![CDATA[<211> 146]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 198]]>
atatcggatt tggagagaca ctgccaactg gctggagatt aatccggaca ctggtgccat 60
ttccactcgg gctgagctgg acagggagga ttttgagcac gtgaagaaca gcacgtacac 120
agccctaatc atagctacag acaatg 146
<![CDATA[<210> 199]]>
<![CDATA[<211> 225]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 199]]>
gttctccagt tgctactgga acagggacac ttctgctgat cctgtctgat gtgaatgaca 60
acgcccccat accagaacct cgaactatat tcttctgtga gaggaatcca aagcctcagg 120
tcataaacat cattgatgca gaccttcctc ccaatacatc tcccttcaca gcagaactaa 180
cacacggggc gagtgccaac tggaccattc agtacaacga cccaa 225
<![CDATA[<210> 200]]>
<![CDATA[<211> 228]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 200]]>
cccaagaatc tatcattttg aagccaaaga tggccttaga ggtgggtgac tacaaaatca 60
atctcaagct catggataac cagaataaag accaagtgac caccttagag gtcagcgtgt 120
gtgactgtga aggggccgct ggcgtctgta ggaaggcaca gcctgtcgaa gcaggattgc 180
aaattcctgc cattctgggg attcttggag gaattcttgc tttgctaa 228
<![CDATA[<210> 201]]>
<![CDATA[<211> 131]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 201]]>
ttctgattct gctgctcttg ctgtttcttc ggaggagagc ggtggtcaaa gagcccttac 60
tgcccccaga ggatgacacc cgggacaacg tttattacta tgatgaagaa ggaggcggag 120
aagaggacca g 131
<![CDATA[<210> 202]]>
<![CDATA[<211> 144]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 202]]>
gactttgact tgagccagct gcacaggggc ctggacgctc ggcctgaagt gactcgtaac 60
gacgttgcac caaccctcat gagtgtcccc cggtatcttc cccgccctgc caatcccgat 120
gaaattggaa attttattga tgaa 144
<![CDATA[<210> 203]]>
<![CDATA[<211> 2248]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 203]]>
aatctgaaag cggctgatac tgaccccaca gccccgcctt atgattctct gctcgtgttt 60
gactatgaag gaagcggttc cgaagctgct agtctgagct ccctgaactc ctcagagtca 120
gacaaagacc aggactatga ctacttgaac gaatggggca atcgcttcaa gaagctggct 180
gacatgtacg gaggcggcga ggacgactag gggactcgag agaggcgggc cccagaccca 240
tgtgctggga aatgcagaaa tcacgttgct ggtggttttt cagctccctt cccttgagat 300
gagtttctgg ggaaaaaaaa gagactggtt agtgatgcag ttagtatagc tttatactct 360
ctccacttta tagctctaat aagtttgtgt tagaaaagtt tcgacttatt tcttaaagct 420
tttttttttt tcccatcact ctttacatgg tggtgatgtc caaaagatac ccaaatttta 480
atattccaga agaacaactt tagcatcaga aggttcaccc agcaccttgc agattttctt 540
aaggaatttt gtctcacttt taaaaagaag gggagaagtc agctactcta gttctgttgt 600
tttgtgtata taatttttta aaaaaaattt gtgtgcttct gctcattact acactggtgt 660
gtccctctgc cttttttttt tttttaagac agggtctcat tctatcggcc aggctggagt 720
gcagtggtgc aatcacagct cactgcagcc ttgtcctccc aggctcaagc tatccttgca 780
cctcagcctc ccaagtagct gggaccacag gcatgcacca ctacgcatga ctaatttttt 840
aaatatttga gacggggtct ccctgtgtta cccaggctgg tctcaaactc ctgggctcaa 900
gtgatcctcc catcttggcc tcccagagta ttgggattac agacatgagc cactgcacct 960
gcccagctcc ccaactccct gccatttttt aagagacagt ttcgctccat cgcccaggcc 1020
tgggatgcag tgatgtgatc atagctcact gtaacctcaa actctggggc tcaagcagtt 1080
ctcccaccag cctccttttt atttttttgt acagatgggg tcttgctatg ttgcccaagc 1140
tggtcttaaa ctcctggcct caagcaatcc ttctgccttg gccccccaaa gtgctgggat 1200
tgtgggcatg agctgctgtg cccagcctcc atgttttaat atcaactctc actcctgaat 1260
tcagttgctt tgcccaagat aggagttctc tgatgcagaa attattgggc tcttttaggg 1320
taagaagttt gtgtctttgt ctggccacat cttgactagg tattgtctac tctgaagacc 1380
tttaatggct tccctctttc atctcctgag tatgtaactt gcaatgggca gctatccagt 1440
gacttgttct gagtaagtgt gttcattaat gtttatttag ctctgaagca agagtgatat 1500
actccaggac ttagaatagt gcctaaagtg ctgcagccaa agacagagcg gaactatgaa 1560
aagtgggctt ggagatggca ggagagcttg tcattgagcc tggcaattta gcaaactgat 1620
gctgaggatg attgaggtgg gtctacctca tctctgaaaa ttctggaagg aatggaggag 1680
tctcaacatg tgtttctgac acaagatccg tggtttgtac tcaaagccca gaatccccaa 1740
gtgcctgctt ttgatgatgt ctacagaaaa tgctggctga gctgaacaca tttgcccaat 1800
tccaggtgtg cacagaaaac cgagaatatt caaaattcca aatttttttc ttaggagcaa 1860
gaagaaaatg tggccctaaa gggggttagt tgaggggtag ggggtagtga ggatcttgat 1920
ttggatctct ttttatttaa atgtgaattt caacttttga caatcaaaga aaagactttt 1980
gttgaaatag ctttactgtt tctcaagtgt tttggagaaa aaaatcaacc ctgcaatcac 2040
tttttggaat tgtcttgatt tttcggcagt tcaagctata tcgaatatag ttctgtgtag 2100
agaatgtcac tgtagttttg agtgtataca tgtgtgggtg ctgataattg tgtattttct 2160
ttgggggtgg aaaaggaaaa caattcaagc tgagaaaagt attctcaaag atgcattttt 2220
ataaatttta ttaaacaatt ttgttaaa 2248
<![CDATA[<210> 204]]>
<![CDATA[<211> 4878]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 204]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc aggtctcctc 180
ttggctctgc caggagccgg agccctgcca ccctggcttt gacgccgaga gctacacgtt 240
cacggtgccc cggcgccacc tggagagagg ccgcgtcctg ggcagagtga attttgaaga 300
ttgcaccggt cgacaaagga cagcctattt ttccctcgac acccgattca aagtgggcac 360
agatggtgtg attacagtca aaaggcctct acggtttcat aacccacaga tccatttctt 420
ggtctacgcc tgggactcca cctacagaaa gttttccacc aaagtcacgc tgaatacagt 480
ggggcaccac caccgccccc cgccccatca ggcctccgtt tctggaatcc aagcagaatt 540
gctcacattt cccaactcct ctcctggcct cagaagacag aagagagact gggttattcc 600
tcccatcagc tgcccagaaa atgaaaaagg cccatttcct aaaaacctgg ttcagatcaa 660
atccaacaaa gacaaagaag gcaaggtttt ctacagcatc actggccaag gagctgacac 720
accccctgtt ggtgtcttta ttattgaaag agaaacagga tggctgaagg tgacagagcc 780
tctggataga gaacgcattg ccacatacac tctcttctct cacgctgtgt catccaacgg 840
gaatgcagtt gaggatccaa tggagatttt gatcacggta accgatcaga atgacaacaa 900
gcccgaattc acccaggagg tctttaaggg gtctgtcatg gaaggtgctc ttccaggaac 960
ctctgtgatg gaggtcacag ccacagacgc ggacgatgat gtgaacacct acaatgccgc 1020
catcgcttac accatcctca gccaagatcc tgagctccct gacaaaaata tgttcaccat 1080
taacaggaac acaggagtca tcagtgtggt caccactggg ctggaccgag agagtttccc 1140
tacgtatacc ctggtggttc aagctgctga ccttcaaggt gaggggttaa gcacaacagc 1200
aacagctgtg atcacagtca ctgacaccaa cgataatcct ccgatcttca atcccaccac 1260
gtacaagggt caggtgcctg agaacgaggc taacgtcgta atcaccacac tgaaagtgac 1320
tgatgctgat gcccccaata ccccagcgtg ggaggctgta tacaccatat tgaatgatga 1380
tggtggacaa tttgtcgtca ccacaaatcc agtgaacaac gatggcattt tgaaaacagc 1440
aaaggtttgt atggtacctg gcaagatgca gaaactggca tcctcacagc tgttcatacc 1500
cttgtcccct gggcttggat tttgaggcca agcagcagta cattctacac gtagcagtga 1560
cgaatgtggt accttttgag gtctctctca ccacctccac agccaccgtc accgtggatg 1620
tgctggatgt gaatgaagcc cccatctttg tgcctcctga aaagagagtg gaagtgtccg 1680
aggactttgg cgtgggccag gaaatcacat cctacactgc ccaggagcca gacacattta 1740
tggaacagaa aataacatat cggatttgga gagacactgc caactggctg gagattaatc 1800
cggacactgg tgccatttcc actcgggctg agctggacag ggaggatttt gagcacgtga 1860
agaacagcac gtacacagcc ctaatcatag ctacagacaa tggttctcca gttgctactg 1920
gaacagggac acttctgctg atcctgtctg atgtgaatga caacgccccc ataccagaac 1980
ctcgaactat attcttctgt gagaggaatc caaagcctca ggtcataaac atcattgatg 2040
cagaccttcc tcccaataca tctcccttca cagcagaact aacacacggg gcgagtgcca 2100
actggaccat tcagtacaac gacccaaccc aagaatctat cattttgaag ccaaagatgg 2160
ccttagaggt gggtgactac aaaatcaatc tcaagctcat ggataaccag aataaagacc 2220
aagtgaccac cttagaggtc agcgtgtgtg actgtgaagg ggccgctggc gtctgtagga 2280
aggcacagcc tgtcgaagca ggattgcaaa ttcctgccat tctggggatt cttggaggaa 2340
ttcttgcttt gctaattctg attctgctgc tcttgctgtt tcttcggagg agagcggtgg 2400
tcaaagagcc cttactgccc ccagaggatg acacccggga caacgtttat tactatgatg 2460
aagaaggagg cggagaagag gaccaggact ttgacttgag ccagctgcac aggggcctgg 2520
acgctcggcc tgaagtgact cgtaacgacg ttgcaccaac cctcatgagt gtcccccggt 2580
atcttccccg ccctgccaat cccgatgaaa ttggaaattt tattgatgaa aatctgaaag 2640
cggctgatac tgaccccaca gccccgcctt atgattctct gctcgtgttt gactatgaag 2700
gaagcggttc cgaagctgct agtctgagct ccctgaactc ctcagagtca gacaaagacc 2760
aggactatga ctacttgaac gaatggggca atcgcttcaa gaagctggct gacatgtacg 2820
gaggcggcga ggacgactag gggactcgag agaggcgggc cccagaccca tgtgctggga 2880
aatgcagaaa tcacgttgct ggtggttttt cagctccctt cccttgagat gagtttctgg 2940
ggaaaaaaaa gagactggtt agtgatgcag ttagtatagc tttatactct ctccacttta 3000
tagctctaat aagtttgtgt tagaaaagtt tcgacttatt tcttaaagct tttttttttt 3060
tcccatcact ctttacatgg tggtgatgtc caaaagatac ccaaatttta atattccaga 3120
agaacaactt tagcatcaga aggttcaccc agcaccttgc agattttctt aaggaatttt 3180
gtctcacttt taaaaagaag gggagaagtc agctactcta gttctgttgt tttgtgtata 3240
taatttttta aaaaaaattt gtgtgcttct gctcattact acactggtgt gtccctctgc 3300
cttttttttt tttttaagac agggtctcat tctatcggcc aggctggagt gcagtggtgc 3360
aatcacagct cactgcagcc ttgtcctccc aggctcaagc tatccttgca cctcagcctc 3420
ccaagtagct gggaccacag gcatgcacca ctacgcatga ctaatttttt aaatatttga 3480
gacggggtct ccctgtgtta cccaggctgg tctcaaactc ctgggctcaa gtgatcctcc 3540
catcttggcc tcccagagta ttgggattac agacatgagc cactgcacct gcccagctcc 3600
ccaactccct gccatttttt aagagacagt ttcgctccat cgcccaggcc tgggatgcag 3660
tgatgtgatc atagctcact gtaacctcaa actctggggc tcaagcagtt ctcccaccag 3720
cctccttttt atttttttgt acagatgggg tcttgctatg ttgcccaagc tggtcttaaa 3780
ctcctggcct caagcaatcc ttctgccttg gccccccaaa gtgctgggat tgtgggcatg 3840
agctgctgtg cccagcctcc atgttttaat atcaactctc actcctgaat tcagttgctt 3900
tgcccaagat aggagttctc tgatgcagaa attattgggc tcttttaggg taagaagttt 3960
gtgtctttgt ctggccacat cttgactagg tattgtctac tctgaagacc tttaatggct 4020
tccctctttc atctcctgag tatgtaactt gcaatgggca gctatccagt gacttgttct 4080
gagtaagtgt gttcattaat gtttatttag ctctgaagca agagtgatat actccaggac 4140
ttagaatagt gcctaaagtg ctgcagccaa agacagagcg gaactatgaa aagtgggctt 4200
ggagatggca ggagagcttg tcattgagcc tggcaattta gcaaactgat gctgaggatg 4260
attgaggtgg gtctacctca tctctgaaaa ttctggaagg aatggaggag tctcaacatg 4320
tgtttctgac acaagatccg tggtttgtac tcaaagccca gaatccccaa gtgcctgctt 4380
ttgatgatgt ctacagaaaa tgctggctga gctgaacaca tttgcccaat tccaggtgtg 4440
cacagaaaac cgagaatatt caaaattcca aatttttttc ttaggagcaa gaagaaaatg 4500
tggccctaaa gggggttagt tgaggggtag ggggtagtga ggatcttgat ttggatctct 4560
ttttatttaa atgtgaattt caacttttga caatcaaaga aaagactttt gttgaaatag 4620
ctttactgtt tctcaagtgt tttggagaaa aaaatcaacc ctgcaatcac tttttggaat 4680
tgtcttgatt tttcggcagt tcaagctata tcgaatatag ttctgtgtag agaatgtcac 4740
tgtagttttg agtgtataca tgtgtgggtg ctgataattg tgtattttct ttgggggtgg 4800
aaaaggaaaa caattcaagc tgagaaaagt attctcaaag atgcattttt ataaatttta 4860
ttaaacaatt ttgttaaa 4878
<![CDATA[<210> 205]]>
<![CDATA[<211> 5]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 205]]>
Met Glu Gln Lys Ile
1 5
<![CDATA[<210> 206]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 206]]>
Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp Leu Glu Ile Asn Pro Asp
1 5 10 15
Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu Asp Arg Glu Asp Phe Glu
20 25 30
His Val Lys Asn Ser Thr Tyr Thr Ala Leu Ile Ile Ala Thr Asp Asn
35 40 45
<![CDATA[<210> 207]]>
<![CDATA[<211> 74]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 207]]>
Ser Pro Val Ala Thr Gly Thr Gly Thr Leu Leu Leu Ile Leu Ser Asp
1 5 10 15
Val Asn Asp Asn Ala Pro Ile Pro Glu Pro Arg Thr Ile Phe Phe Cys
20 25 30
Glu Arg Asn Pro Lys Pro Gln Val Ile Asn Ile Ile Asp Ala Asp Leu
35 40 45
Pro Pro Asn Thr Ser Pro Phe Thr Ala Glu Leu Thr His Gly Ala Ser
50 55 60
Ala Asn Trp Thr Ile Gln Tyr Asn Asp Pro
65 70
<![CDATA[<210> 208]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 208]]>
Gln Glu Ser Ile Ile Leu Lys Pro Lys Met Ala Leu Glu Val Gly Asp
1 5 10 15
Tyr Lys Ile Asn Leu Lys Leu Met Asp Asn Gln Asn Lys Asp Gln Val
20 25 30
Thr Thr Leu Glu Val Ser Val Cys Asp Cys Glu Gly Ala Ala Gly Val
35 40 45
Cys Arg Lys Ala Gln Pro Val Glu Ala Gly Leu Gln Ile Pro Ala Ile
50 55 60
Leu Gly Ile Leu Gly Gly Ile Leu Ala Leu Leu
65 70 75
<![CDATA[<210> 209]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 209]]>
Leu Ile Leu Leu Leu Leu Leu Phe Leu Arg Arg Arg Ala Val Val Lys
1 5 10 15
Glu Pro Leu Leu Pro Pro Glu Asp Asp Thr Arg Asp Asn Val Tyr Tyr
20 25 30
Tyr Asp Glu Glu Gly Gly Gly Glu Glu Asp Gln
35 40
<![CDATA[<210> 210]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 210]]>
Asp Phe Asp Leu Ser Gln Leu His Arg Gly Leu Asp Ala Arg Pro Glu
1 5 10 15
Val Thr Arg Asn Asp Val Ala Pro Thr Leu Met Ser Val Pro Arg Tyr
20 25 30
Leu Pro Arg Pro Ala Asn Pro Asp Glu Ile Gly Asn Phe Ile Asp Glu
35 40 45
<![CDATA[<210> 211]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 211]]>
Asn Leu Lys Ala Ala Asp Thr Asp Pro Thr Ala Pro Pro Tyr Asp Ser
1 5 10 15
Leu Leu Val Phe Asp Tyr Glu Gly Ser Gly Ser Glu Ala Ala Ser Leu
20 25 30
Ser Ser Leu Asn Ser Ser Glu Ser Asp Lys Asp Gln Asp Tyr Asp Tyr
35 40 45
Leu Asn Glu Trp Gly Asn Arg Phe Lys Lys Leu Ala Asp Met Tyr Gly
50 55 60
Gly Gly Glu Asp Asp
65
<![CDATA[<210> 212]]>
<![CDATA[<211> 366]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 212]]>
Met Glu Gln Lys Ile Thr Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp
1 5 10 15
Leu Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu
20 25 30
Asp Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala Leu
35 40 45
Ile Ile Ala Thr Asp Asn Gly Ser Pro Val Ala Thr Gly Thr Gly Thr
50 55 60
Leu Leu Leu Ile Leu Ser Asp Val Asn Asp Asn Ala Pro Ile Pro Glu
65 70 75 80
Pro Arg Thr Ile Phe Phe Cys Glu Arg Asn Pro Lys Pro Gln Val Ile
85 90 95
Asn Ile Ile Asp Ala Asp Leu Pro Pro Asn Thr Ser Pro Phe Thr Ala
100 105 110
Glu Leu Thr His Gly Ala Ser Ala Asn Trp Thr Ile Gln Tyr Asn Asp
115 120 125
Pro Thr Gln Glu Ser Ile Ile Leu Lys Pro Lys Met Ala Leu Glu Val
130 135 140
Gly Asp Tyr Lys Ile Asn Leu Lys Leu Met Asp Asn Gln Asn Lys Asp
145 150 155 160
Gln Val Thr Thr Leu Glu Val Ser Val Cys Asp Cys Glu Gly Ala Ala
165 170 175
Gly Val Cys Arg Lys Ala Gln Pro Val Glu Ala Gly Leu Gln Ile Pro
180 185 190
Ala Ile Leu Gly Ile Leu Gly Gly Ile Leu Ala Leu Leu Ile Leu Ile
195 200 205
Leu Leu Leu Leu Leu Phe Leu Arg Arg Arg Ala Val Val Lys Glu Pro
210 215 220
Leu Leu Pro Pro Glu Asp Asp Thr Arg Asp Asn Val Tyr Tyr Tyr Asp
225 230 235 240
Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp Phe Asp Leu Ser Gln Leu
245 250 255
His Arg Gly Leu Asp Ala Arg Pro Glu Val Thr Arg Asn Asp Val Ala
260 265 270
Pro Thr Leu Met Ser Val Pro Arg Tyr Leu Pro Arg Pro Ala Asn Pro
275 280 285
Asp Glu Ile Gly Asn Phe Ile Asp Glu Asn Leu Lys Ala Ala Asp Thr
290 295 300
Asp Pro Thr Ala Pro Pro Tyr Asp Ser Leu Leu Val Phe Asp Tyr Glu
305 310 315 320
Gly Ser Gly Ser Glu Ala Ala Ser Leu Ser Ser Leu Asn Ser Ser Glu
325 330 335
Ser Asp Lys Asp Gln Asp Tyr Asp Tyr Leu Asn Glu Trp Gly Asn Arg
340 345 350
Phe Lys Lys Leu Ala Asp Met Tyr Gly Gly Gly Glu Asp Asp
355 360 365
<![CDATA[<210> 213]]>
<![CDATA[<211> 1902]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 213]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc aggtctcctc 180
ttggctctgc caggagccgg agccctgcca ccctggcttt gacgccgaga gctacacgtt 240
cacggtgccc cggcgccacc tggagagagg ccgcgtcctg ggcagagtga attttgaaga 300
ttgcaccggt cgacaaagga cagcctattt ttccctcgac acccgattca aagtgggcac 360
agatggtgtg attacagtca aaaggcctct acggtttcat aacccacaga tccatttctt 420
ggtctacgcc tgggactcca cctacagaaa gttttccacc aaagtcacgc tgaatacagt 480
ggggcaccac caccgccccc cgccccatca ggcctccgtt tctggaatcc aagcagaatt 540
gctcacattt cccaactcct ctcctggcct cagaagacag aagagagact gggttattcc 600
tcccatcagc tgcccagaaa atgaaaaagg cccatttcct aaaaacctgg ttcagatcaa 660
atccaacaaa gacaaagaag gcaaggtttt ctacagcatc actggccaag gagctgacac 720
accccctgtt ggtgtcttta ttattgaaag agaaacagga tggctgaagg tgacagagcc 780
tctggataga gaacgcattg ccacatacac tctcttctct cacgctgtgt catccaacgg 840
gaatgcagtt gaggatccaa tggagatttt gatcacggta accgatcaga atgacaacaa 900
gcccgaattc acccaggagg tctttaaggg gtctgtcatg gaaggtgctc ttccaggaac 960
ctctgtgatg gaggtcacag ccacagacgc ggacgatgat gtgaacacct acaatgccgc 1020
catcgcttac accatcctca gccaagatcc tgagctccct gacaaaaata tgttcaccat 1080
taacaggaac acaggagtca tcagtgtggt caccactggg ctggaccgag agagtttccc 1140
tacgtatacc ctggtggttc aagctgctga ccttcaaggt gaggggttaa gcacaacagc 1200
aacagctgtg atcacagtca ctgacaccaa cgataatcct ccgatcttca atcccaccac 1260
gtacaagggt caggtgcctg agaacgaggc taacgtcgta atcaccacac tgaaagtgac 1320
tgatgctgat gcccccaata ccccagcgtg ggaggctgta tacaccatat tgaatgatga 1380
tggtggacaa tttgtcgtca ccacaaatcc agtgaacaac gatggcattt tgaaaacagc 1440
aaaggtttgt atggtacctg gcaagatgca gaaactggca tcctcacagc tgttcatacc 1500
cttgtcccct gggcttggat tttgaggcca agcagcagta cattctacac gtagcagtga 1560
cgaatgtggt accttttgag gtctctctca ccacctccac agccaccgtc accgtggatg 1620
tgctggatgt gaatgaagcc cccatctttg tgcctcctga aaagagagtg gaagtgtccg 1680
aggactttgg cgtgggccag gaaatcacat cctacactgc ccaggagcca gacacattta 1740
tggaacagaa aataacatat cggatttgga gagacactgc caactggctg gagattaatc 1800
cggacactgg tgccatttcc actcgggctg agctggacag ggaggatttt gagcacgtga 1860
agaacagcac gtacacagcc ctaatcatag ctacagacaa tg 1902
<![CDATA[<210> 214]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 214]]>
Met Glu Gln Lys Ile Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp Leu
1 5 10 15
Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu Asp
20 25 30
Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala Leu Ile
35 40 45
Ile Ala Thr Asp Asn
50
<![CDATA[<210> 215]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 215]]>
atggcgctcc tgctgtgctt cgtgctcctg tgcggagtag tgg 43
<![CDATA[<210> 216]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 216]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaact 58
<![CDATA[<210> 217]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 217]]>
atggcgctcc tgctgtgctt cgtgctcctg tgcggagtag tggccttgcc tccccgattg 60
aaagagatga aaagccagga atcggctgca ggttccaaac t 101
<![CDATA[<210> 218]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 218]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val Ala Leu
1 5 10 15
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser
20 25 30
Lys
<![CDATA[<210> 219]]>
<![CDATA[<211> 130]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 219]]>
agtcgggagc gcgcgaggcg cggggagcct gggaccagga gcgagagccg cctacctgca 60
gccgccgccc acggcacggc agccaccatg gcgctcctgc tgtgcttcgt gctcctgtgc 120
ggagtagtgg 130
<![CDATA[<210> 220]]>
<![CDATA[<211> 167]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 220]]>
atttcgccag aagtttgagt atcactactc ctgaagagat gattgaaaaa gccaaagggg 60
aaactgccta tctgccatgc aaatttacgc ttagtcccga agaccaggga ccgctggaca 120
tcgagtggct gatatcacca gctgataatc agaaggtgga tcaagtg 167
<![CDATA[<210> 221]]>
<![CDATA[<211> 205]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 221]]>
attattttat attctggaga caaaatttat gatgactact atccagatct gaaaggccga 60
gtacatttta cgagtaatga tctcaaatct ggtgatgcat caataaatgt aacgaattta 120
caactgtcag atattggcac atatcagtgc aaagtgaaaa aagctcctgg tgttgcaaat 180
aagaagattc atctggtagt tcttg 205
<![CDATA[<210> 222]]>
<![CDATA[<211> 156]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 222]]>
ttaagccttc aggtgcgaga tgttacgttg atggatctga agaaattgga agtgacttta 60
agataaaatg tgaaccaaaa gaaggttcac ttccattaca gtatgagtgg caaaaattgt 120
ctgactcaca gaaaatgccc acttcatggt tagcag 156
<![CDATA[<210> 223]]>
<![CDATA[<211> 4683]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 223]]>
ggaagatgtg ccacctccaa agagccgtac gtccactgcc agaagctaca tcggcagtaa 60
tcattcatcc ctggggtcca tgtctccttc caacatggaa ggatattcca agactcagta 120
taaccaagta ccaagtgaag actttgaacg cactcctcag agtccgactc tcccacctgc 180
taaggtagct gcccctaatc taagtcgaat gggtgcgatt cctgtgatga ttccagcaca 240
gagcaaggat gggtctatag tatagagcct ccatatgtct catctgtgct ctccgtgttc 300
ctttcctttt tttgatatat gaaaacctat tctggtctaa attgtgttac tagcctcaaa 360
atacatcaaa aaataagtta atcaggaact gtacggaata tatttttaaa aatttttgtt 420
tggttatatc gaaatagtta caggcactaa agttagtaaa gaaaagttta ccatctgaaa 480
aagctggatt ttctttaaga ggttgattat aaagttttct aaatttatca gtacctaagt 540
aagatgtagc gctttgaata tgaaatcata ggtgaagaca tgggtgaact tacttgcata 600
ccaagttgat acttgaataa ccatctgaaa gtggtacttg atcattttta ccattatttt 660
taggatgtgt atttcattta tttatggccc accagtctcc cccaaattag tacagaaata 720
tccatgacaa aattacttac gtatgtttgt acttggtttt acagctcctt tgaaaactct 780
gtgtttggaa tatctctaaa aacatagaaa acactacagt ggtttagaaa ttactaattt 840
tacttctaag tcattcataa accttgtcta tgaaatgact tcttaaatat ttagttgata 900
gactgctaca ggtaataggg acttagcaag ctcttttata tgctaaagga gcatctatca 960
gattaagtta gaacatttgc tgtcagccac atattgagat gacactaggt gcaatagcag 1020
ggatagattt tgttggtgag tagtctcatg ccttgagatc tgtggtggtc ttcaaaatgg 1080
tggccagcca gatcaaggat gtagtatctc atagttccca ggtgatattt ttcttattag 1140
aaaaatatta taactcattt gttgtttgac acttatagat tgaaatttcc taatttattc 1200
taaattttaa gtggttcttt ggttccagtg ctttatgttg ttgttgtttt tggatggtgt 1260
tacatattat atgttctaga aacatgtaat cctaaattta ccctcttgaa tataatccct 1320
ggatgatatt ttttatcata aatgcagaat aatcaaatac attttaagca agttaagtgt 1380
cctccatcaa ttctgtattc cagacttggg aggatgtaca gttgctgttg tgtgatcaaa 1440
catgtctctg tgtagttcca gcaaatcaag ctgagctttg aaaaagtttg tcttagtttt 1500
gtgaaggtga tttattctta aaaaaaaaaa agaaagaaaa agaaaaaaag ataagaagga 1560
ggagtaaagg gactactcct ccttgccaaa tgtgctaaat atcattttag gagaagaaag 1620
tgggtttatt gtatttccct taagattgtg agggagtgtg gatacagtag aatgagccaa 1680
cagtttcttt ataataaata cggtctgcaa taaattattt cactagctct aaaacctttc 1740
cctagatttt agtagggagt tggtttctgt taatatcttt gggtgctgtg gtggtaaatg 1800
ctatattatg aacggtggca tgtatttaca gttagagtat tgtgtgtaca ctttttaatg 1860
gtaaacttaa gctgaatgtg taatggattt gtgtatagtt ttacatattt ggaagcattt 1920
taaaaacagg ttttaacctt atgtaaaatt acttttatac tcgtgttaac attttcatct 1980
gtgccttttg gtaatttaat ttctattatg aatttctggt gcctatgagc tagctatcac 2040
ctacctgaaa ggtgcttaga ggtgaaggta ctgtttctaa aaacacatca ctgtgatacc 2100
tttctatcct cacattttca agcttgcctc ttttctgttc tttgtggata taacttaagc 2160
aattgtgtta ttcataaagg tttagaaatt tcaatattcc caacactcta tgtttctgat 2220
tttataacag tagccatttt tgaaagtcag atgtttggcc tgttttatat gaataaagtt 2280
tatttataaa atattataaa aaataagtaa ataaacagaa cattaataat aaagttttgg 2340
ttcttcctat tcctgacttt catataatga aaattatcct attgatctaa gtagaagtta 2400
tcatagaaag tggacacgta taagactttc cttccttttt ttttttaata acatatgagg 2460
aacaagactt ctcttcccat atacttcata ttttagagga cattgtttta aaggcttatg 2520
tctcactgta aaattctgtc agccaaatag taccaatacg ttttcaagta gttctcactg 2580
ataatttagt tgaaccagag atcaaatatt tgctcccgaa ttactactgg taatcaagta 2640
gttgaacaaa aaattactaa agcatttccg ttagatcagt caaggacagt actgcatctt 2700
tttttttttt ttttttttta agacggagtc tcggtctgtc acccaggctg gagtgcagtg 2760
gcgggatctc ggctcactgc aagctccgcc tcccaggttc acgccattct cctgcctcag 2820
cctcccgagc agctgggact acaggctccc atcaccacgc tcggctaagt ttttgtaatt 2880
ttagtagaga cagggtttca ccgtgttagc caggatggtc tcgatctcct gacctcgtga 2940
tctgcctgcc tcggcctccc aaagtgctgg gattacaggc gtgaaccact gcacccggcc 3000
cagtactgca tcttaacagc aaagccattt tattctactt tataactgag agacttgata 3060
ccatccatct ctttaggtta cagaggataa tttgaagaga aatgttactg tagaatatat 3120
agttctgtac tttttttttt tttttttaag agatagggtt tcactattgc tcaggctggt 3180
ctcaaactgc tgggctcagg agatcctcct gccttggcct cccaaattgc tgggattaca 3240
ggtgtgagcc gcagcatcca gccagttctg tactttgaat atggagtagt ttacagctat 3300
ttttttttct tactggtaat cttaactaat atgattccct tgttagagag cctctcactc 3360
ccccaccccc aaaaatgtct actattcatg acagtaacca attattctgg acaaattgct 3420
tctttttaat ttgagctatc tgccatggac tttctaaaat ggaaacacag cctgagtgta 3480
tcttagggag agtttgattg aaaaaatcca aatcactatc catatagatc atggatataa 3540
agagatacct gatttttatt aaaaagatac tttttcaaat ttaagagtta atcttggaaa 3600
tttggaacaa gtaaaggggc aagtaaacct tttgatgaaa tataaaagga actcattgca 3660
tgaagttgac tatcaaattc tgtgatgtgt ggcttcttaa aaatattctc agtgtctttt 3720
gtgtgcgtgc agcatgtaca tttgatgtta tgtgaatgtt gagttttttc ttctaatttt 3780
cacttcagca gtgtttaggg ctttcagatg ccttattcca gtgtgaacag aaaaagttca 3840
tattttatgt ggttaatgct ttgatgtgtc acataaagag tagtttgtag aaaatgttgg 3900
cacaatttta acttcttagt ggcttgtgac attatatatt atatatatat atgtacatat 3960
atctttataa cattcctgtg tttagtagtg taaatgttct gggcaagttt taatattttg 4020
aatgcctttg gatattccag caataaaggc atcatgttct gcaataggat ttcttactca 4080
tttacctatt ttaacactaa aatagaccac aactgagcac aaattccttt tataaatgtt 4140
atagaagcag ggaagaataa taaacacatt tgtgaattgt ggttcagttt atttatcttt 4200
agggaaggct gatcatttat cttatagcag ataaccccag cctcttattc attatggtta 4260
acttttataa tttatcttat tttataattt aagaatatag tacatatcag ttgggtttgg 4320
ttttggtcat cgagactaaa agctccatca aaacagaact ttgtgttttc tgctaactta 4380
tttaatgaca caagttttaa gagaaccaca attcattgat tcacttattc ttttccctaa 4440
ttgtgaattt tagtgataaa tacacctgta ctactgagga aaatattctg acacttcacg 4500
tgtgcaaagt atagaactga cagtgtcagt ttcagatttt gtatgtacga tttctggctt 4560
atatatccaa tggtgcagat tttgaaattt gtaagaacaa aatttgttaa gaaaaacaac 4620
ttgctctagt tttgtgacct tgtgtacttt tgaaataaaa tcaagaaagc agttctctgc 4680
ctc 4683
<![CDATA[<210> 224]]>
<![CDATA[<211> 5341]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 224]]>
agtcgggagc gcgcgaggcg cggggagcct gggaccagga gcgagagccg cctacctgca 60
gccgccgccc acggcacggc agccaccatg gcgctcctgc tgtgcttcgt gctcctgtgc 120
ggagtagtgg atttcgccag aagtttgagt atcactactc ctgaagagat gattgaaaaa 180
gccaaagggg aaactgccta tctgccatgc aaatttacgc ttagtcccga agaccaggga 240
ccgctggaca tcgagtggct gatatcacca gctgataatc agaaggtgga tcaagtgatt 300
attttatatt ctggagacaa aatttatgat gactactatc cagatctgaa aggccgagta 360
cattttacga gtaatgatct caaatctggt gatgcatcaa taaatgtaac gaatttacaa 420
ctgtcagata ttggcacata tcagtgcaaa gtgaaaaaag ctcctggtgt tgcaaataag 480
aagattcatc tggtagttct tgttaagcct tcaggtgcga gatgttacgt tgatggatct 540
gaagaaattg gaagtgactt taagataaaa tgtgaaccaa aagaaggttc acttccatta 600
cagtatgagt ggcaaaaatt gtctgactca cagaaaatgc ccacttcatg gttagcaggg 660
aagatgtgcc acctccaaag agccgtacgt ccactgccag aagctacatc ggcagtaatc 720
attcatccct ggggtccatg tctccttcca acatggaagg atattccaag actcagtata 780
accaagtacc aagtgaagac tttgaacgca ctcctcagag tccgactctc ccacctgcta 840
aggtagctgc ccctaatcta agtcgaatgg gtgcgattcc tgtgatgatt ccagcacaga 900
gcaaggatgg gtctatagta tagagcctcc atatgtctca tctgtgctct ccgtgttcct 960
ttcctttttt tgatatatga aaacctattc tggtctaaat tgtgttacta gcctcaaaat 1020
acatcaaaaa ataagttaat caggaactgt acggaatata tttttaaaaa tttttgtttg 1080
gttatatcga aatagttaca ggcactaaag ttagtaaaga aaagtttacc atctgaaaaa 1140
gctggatttt ctttaagagg ttgattataa agttttctaa atttatcagt acctaagtaa 1200
gatgtagcgc tttgaatatg aaatcatagg tgaagacatg ggtgaactta cttgcatacc 1260
aagttgatac ttgaataacc atctgaaagt ggtacttgat catttttacc attattttta 1320
ggatgtgtat ttcatttatt tatggcccac cagtctcccc caaattagta cagaaatatc 1380
catgacaaaa ttacttacgt atgtttgtac ttggttttac agctcctttg aaaactctgt 1440
gtttggaata tctctaaaaa catagaaaac actacagtgg tttagaaatt actaatttta 1500
cttctaagtc attcataaac cttgtctatg aaatgacttc ttaaatattt agttgataga 1560
ctgctacagg taatagggac ttagcaagct cttttatatg ctaaaggagc atctatcaga 1620
ttaagttaga acatttgctg tcagccacat attgagatga cactaggtgc aatagcaggg 1680
atagattttg ttggtgagta gtctcatgcc ttgagatctg tggtggtctt caaaatggtg 1740
gccagccaga tcaaggatgt agtatctcat agttcccagg tgatattttt cttattagaa 1800
aaatattata actcatttgt tgtttgacac ttatagattg aaatttccta atttattcta 1860
aattttaagt ggttctttgg ttccagtgct ttatgttgtt gttgtttttg gatggtgtta 1920
catattatat gttctagaaa catgtaatcc taaatttacc ctcttgaata taatccctgg 1980
atgatatttt ttatcataaa tgcagaataa tcaaatacat tttaagcaag ttaagtgtcc 2040
tccatcaatt ctgtattcca gacttgggag gatgtacagt tgctgttgtg tgatcaaaca 2100
tgtctctgtg tagttccagc aaatcaagct gagctttgaa aaagtttgtc ttagttttgt 2160
gaaggtgatt tattcttaaa aaaaaaaaag aaagaaaaag aaaaaaagat aagaaggagg 2220
agtaaaggga ctactcctcc ttgccaaatg tgctaaatat cattttagga gaagaaagtg 2280
ggtttattgt atttccctta agattgtgag ggagtgtgga tacagtagaa tgagccaaca 2340
gtttctttat aataaatacg gtctgcaata aattatttca ctagctctaa aacctttccc 2400
tagattttag tagggagttg gtttctgtta atatctttgg gtgctgtggt ggtaaatgct 2460
atattatgaa cggtggcatg tatttacagt tagagtattg tgtgtacact ttttaatggt 2520
aaacttaagc tgaatgtgta atggatttgt gtatagtttt acatatttgg aagcatttta 2580
aaaacaggtt ttaaccttat gtaaaattac ttttatactc gtgttaacat tttcatctgt 2640
gccttttggt aatttaattt ctattatgaa tttctggtgc ctatgagcta gctatcacct 2700
acctgaaagg tgcttagagg tgaaggtact gtttctaaaa acacatcact gtgatacctt 2760
tctatcctca cattttcaag cttgcctctt ttctgttctt tgtggatata acttaagcaa 2820
ttgtgttatt cataaaggtt tagaaatttc aatattccca acactctatg tttctgattt 2880
tataacagta gccatttttg aaagtcagat gtttggcctg ttttatatga ataaagttta 2940
tttataaaat attataaaaa ataagtaaat aaacagaaca ttaataataa agttttggtt 3000
cttcctattc ctgactttca tataatgaaa attatcctat tgatctaagt agaagttatc 3060
atagaaagtg gacacgtata agactttcct tccttttttt ttttaataac atatgaggaa 3120
caagacttct cttcccatat acttcatatt ttagaggaca ttgttttaaa ggcttatgtc 3180
tcactgtaaa attctgtcag ccaaatagta ccaatacgtt ttcaagtagt tctcactgat 3240
aatttagttg aaccagagat caaatatttg ctcccgaatt actactggta atcaagtagt 3300
tgaacaaaaa attactaaag catttccgtt agatcagtca aggacagtac tgcatctttt 3360
tttttttttt tttttttaag acggagtctc ggtctgtcac ccaggctgga gtgcagtggc 3420
gggatctcgg ctcactgcaa gctccgcctc ccaggttcac gccattctcc tgcctcagcc 3480
tcccgagcag ctgggactac aggctcccat caccacgctc ggctaagttt ttgtaatttt 3540
agtagagaca gggtttcacc gtgttagcca ggatggtctc gatctcctga cctcgtgatc 3600
tgcctgcctc ggcctcccaa agtgctggga ttacaggcgt gaaccactgc acccggccca 3660
gtactgcatc ttaacagcaa agccatttta ttctacttta taactgagag acttgatacc 3720
atccatctct ttaggttaca gaggataatt tgaagagaaa tgttactgta gaatatatag 3780
ttctgtactt tttttttttt tttttaagag atagggtttc actattgctc aggctggtct 3840
caaactgctg ggctcaggag atcctcctgc cttggcctcc caaattgctg ggattacagg 3900
tgtgagccgc agcatccagc cagttctgta ctttgaatat ggagtagttt acagctattt 3960
ttttttctta ctggtaatct taactaatat gattcccttg ttagagagcc tctcactccc 4020
ccacccccaa aaatgtctac tattcatgac agtaaccaat tattctggac aaattgcttc 4080
tttttaattt gagctatctg ccatggactt tctaaaatgg aaacacagcc tgagtgtatc 4140
ttagggagag tttgattgaa aaaatccaaa tcactatcca tatagatcat ggatataaag 4200
agatacctga tttttattaa aaagatactt tttcaaattt aagagttaat cttggaaatt 4260
tggaacaagt aaaggggcaa gtaaaccttt tgatgaaata taaaaggaac tcattgcatg 4320
aagttgacta tcaaattctg tgatgtgtgg cttcttaaaa atattctcag tgtcttttgt 4380
gtgcgtgcag catgtacatt tgatgttatg tgaatgttga gttttttctt ctaattttca 4440
cttcagcagt gtttagggct ttcagatgcc ttattccagt gtgaacagaa aaagttcata 4500
ttttatgtgg ttaatgcttt gatgtgtcac ataaagagta gtttgtagaa aatgttggca 4560
caattttaac ttcttagtgg cttgtgacat tatatattat atatatatat gtacatatat 4620
ctttataaca ttcctgtgtt tagtagtgta aatgttctgg gcaagtttta atattttgaa 4680
tgcctttgga tattccagca ataaaggcat catgttctgc aataggattt cttactcatt 4740
tacctatttt aacactaaaa tagaccacaa ctgagcacaa attcctttta taaatgttat 4800
agaagcaggg aagaataata aacacatttg tgaattgtgg ttcagtttat ttatctttag 4860
ggaaggctga tcatttatct tatagcagat aaccccagcc tcttattcat tatggttaac 4920
ttttataatt tatcttattt tataatttaa gaatatagta catatcagtt gggtttggtt 4980
ttggtcatcg agactaaaag ctccatcaaa acagaacttt gtgttttctg ctaacttatt 5040
taatgacaca agttttaaga gaaccacaat tcattgattc acttattctt ttccctaatt 5100
gtgaatttta gtgataaata cacctgtact actgaggaaa atattctgac acttcacgtg 5160
tgcaaagtat agaactgaca gtgtcagttt cagattttgt atgtacgatt tctggcttat 5220
atatccaatg gtgcagattt tgaaatttgt aagaacaaaa tttgttaaga aaaacaactt 5280
gctctagttt tgtgaccttg tgtacttttg aaataaaatc aagaaagcag ttctctgcct 5340
c 5341
<![CDATA[<210> 225]]>
<![CDATA[<211> 14]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 225]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val
1 5 10
<![CDATA[<210> 226]]>
<![CDATA[<211> 55]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 226]]>
Phe Ala Arg Ser Leu Ser Ile Thr Thr Pro Glu Glu Met Ile Glu Lys
1 5 10 15
Ala Lys Gly Glu Thr Ala Tyr Leu Pro Cys Lys Phe Thr Leu Ser Pro
20 25 30
Glu Asp Gln Gly Pro Leu Asp Ile Glu Trp Leu Ile Ser Pro Ala Asp
35 40 45
Asn Gln Lys Val Asp Gln Val
50 55
<![CDATA[<210> 227]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 227]]>
Ile Ile Leu Tyr Ser Gly Asp Lys Ile Tyr Asp Asp Tyr Tyr Pro Asp
1 5 10 15
Leu Lys Gly Arg Val His Phe Thr Ser Asn Asp Leu Lys Ser Gly Asp
20 25 30
Ala Ser Ile Asn Val Thr Asn Leu Gln Leu Ser Asp Ile Gly Thr Tyr
35 40 45
Gln Cys Lys Val Lys Lys Ala Pro Gly Val Ala Asn Lys Lys Ile His
50 55 60
Leu Val Val Leu
65
<![CDATA[<210> 228]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 228]]>
Lys Pro Ser Gly Ala Arg Cys Tyr Val Asp Gly Ser Glu Glu Ile Gly
1 5 10 15
Ser Asp Phe Lys Ile Lys Cys Glu Pro Lys Glu Gly Ser Leu Pro Leu
20 25 30
Gln Tyr Glu Trp Gln Lys Leu Ser Asp Ser Gln Lys Met Pro Thr Ser
35 40 45
Trp Leu Ala
50
<![CDATA[<210> 229]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 229]]>
Lys Met Cys His Leu Gln Arg Ala Val Arg Pro Leu Pro Glu Ala Thr
1 5 10 15
Ser Ala Val Ile Ile His Pro Trp Gly Pro Cys Leu Leu Pro Thr Trp
20 25 30
Lys Asp Ile Pro Arg Leu Ser Ile Thr Lys Tyr Gln Val Lys Thr Leu
35 40 45
Asn Ala Leu Leu Arg Val Arg Leu Ser His Leu Leu Arg
50 55 60
<![CDATA[<210> 230]]>
<![CDATA[<211> 252]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 230]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val Asp Phe
1 5 10 15
Ala Arg Ser Leu Ser Ile Thr Thr Pro Glu Glu Met Ile Glu Lys Ala
20 25 30
Lys Gly Glu Thr Ala Tyr Leu Pro Cys Lys Phe Thr Leu Ser Pro Glu
35 40 45
Asp Gln Gly Pro Leu Asp Ile Glu Trp Leu Ile Ser Pro Ala Asp Asn
50 55 60
Gln Lys Val Asp Gln Val Ile Ile Leu Tyr Ser Gly Asp Lys Ile Tyr
65 70 75 80
Asp Asp Tyr Tyr Pro Asp Leu Lys Gly Arg Val His Phe Thr Ser Asn
85 90 95
Asp Leu Lys Ser Gly Asp Ala Ser Ile Asn Val Thr Asn Leu Gln Leu
100 105 110
Ser Asp Ile Gly Thr Tyr Gln Cys Lys Val Lys Lys Ala Pro Gly Val
115 120 125
Ala Asn Lys Lys Ile His Leu Val Val Leu Val Lys Pro Ser Gly Ala
130 135 140
Arg Cys Tyr Val Asp Gly Ser Glu Glu Ile Gly Ser Asp Phe Lys Ile
145 150 155 160
Lys Cys Glu Pro Lys Glu Gly Ser Leu Pro Leu Gln Tyr Glu Trp Gln
165 170 175
Lys Leu Ser Asp Ser Gln Lys Met Pro Thr Ser Trp Leu Ala Gly Lys
180 185 190
Met Cys His Leu Gln Arg Ala Val Arg Pro Leu Pro Glu Ala Thr Ser
195 200 205
Ala Val Ile Ile His Pro Trp Gly Pro Cys Leu Leu Pro Thr Trp Lys
210 215 220
Asp Ile Pro Arg Leu Ser Ile Thr Lys Tyr Gln Val Lys Thr Leu Asn
225 230 235 240
Ala Leu Leu Arg Val Arg Leu Ser His Leu Leu Arg
245 250
<![CDATA[<210> 231]]>
<![CDATA[<211> 141]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 231]]>
gggagctgtt caaaaaacga gctctttcta ctcctgtagt agaaaaacgt tcagcatctt 60
ctgagtcatc atcatcatcg tcaaagaaga agaaaacaaa ggtagaacat ggaggatcgt 120
caggctctaa acaaaattct g 141
<![CDATA[<210> 232]]>
<![CDATA[<211> 127]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 232]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120
ggaatga 127
<![CDATA[<210> 233]]>
<![CDATA[<211> 268]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 233]]>
gggagctgtt caaaaaacga gctctttcta ctcctgtagt agaaaaacgt tcagcatctt 60
ctgagtcatc atcatcatcg tcaaagaaga agaaaacaaa ggtagaacat ggaggatcgt 120
caggctctaa acaaaattct gccttgcctc cccgattgaa agagatgaaa agccaggaat 180
cggctgcagg ttccaaacta gtccttcggt gtgaaaccag ttctgaatac tcctctctca 240
gattcaagtg gttcaagaat gggaatga 268
<![CDATA[<210> 234]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 234]]>
Glu Leu Phe Lys Lys Arg Ala Leu Ser Thr Pro Val Val Glu Lys Arg
1 5 10 15
Ser Ala Ser Ser Glu Ser Ser Ser Ser Ser Ser Lys Lys Lys Lys Thr
20 25 30
Lys Val Glu His Gly Gly Ser Ser Gly Ser Lys Gln Asn Ser Ala Leu
35 40 45
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser
50 55 60
Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg
65 70 75 80
Phe Lys Trp Phe Lys Asn Gly Asn
85
<![CDATA[<210> 235]]>
<![CDATA[<211> 264]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 2]]>35
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcag 264
<![CDATA[<210> 236]]>
<![CDATA[<211> 170]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 236]]>
cattatggat ccaagcctgt tgagagaaag ggagctgttc aaaaaacgag ctctttctac 60
tcctgtagta gaaaaacgtt cagcatcttc tgagtcatca tcatcatcgt caaagaagaa 120
gaaaacaaag gtagaacatg gaggatcgtc aggctctaaa caaaattctg 170
<![CDATA[<210> 237]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 237]]>
atcatagcaa tggatcattt aacttgaaag ctttgtcagg aagctctgga tataagtttg 60
gtgttcttgc taagattgtg aattacatga ag 92
<![CDATA[<210> 238]]>
<![CDATA[<211> 108]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 238]]>
acacggcatc agcgaggaga tacgcatcct ctaaccttag atgaaatttt ggatgaaaca 60
caacatttag atattggact caagcagaaa caatggctaa tgactgag 108
<![CDATA[<210> 239]]>
<![CDATA[<211> 183]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 239]]>
gctttagtca acaatcccaa aattgaagta atagatggga agtatgcttt caagcccaag 60
tacaacgtga gagataagaa ggccctactt aggctcttag atcagcatga ccagcgagga 120
ttaggaggaa ttcttttaga agacatagaa gaagcactgc ccaattccca gaaagctgtc 180
aag 183
<![CDATA[<210> 240]]>
<![CDATA[<211> 94]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 240]]>
gctttggggg accagatact atttgtaaat cgtcccgata agaagaaaat acttttcttc 60
aatgataaga gctgtcagtt ttctgtggat gaag 94
<![CDATA[<210> 241]]>
<![CDATA[<211> 116]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 241]]>
aatttcagaa actgtggagg agtgtcactg tagattccat ggacgaggag aaaattgaag 60
aatatctgaa gcgacagggt atttcttcca tgcaggaatc tggaccaaag aaagtg 116
<![CDATA[<210> 242]]>
<![CDATA[<211> 720]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 242]]>
gcccctattc agagaaggaa aaagcctgct tcacagaaaa agcgacgctt taagactcat 60
aacgaacact tggctggagt gctgaaggat tactctgaca ttacttccag caaataggga 120
acagttttgc cctggaacag agttacagat acacaatcaa gagtgttctt gctgatgctc 180
ggggtctgaa gactgtcttc ctatctgctt cttgcggctg aggagaggag cagttcagtt 240
tacaaaacaa gtgcaaatta ccaaactcaa agcttatttg agtagaatgg gctcatgggc 300
aatgtgatgt tccctgttaa ccttctgtta ctccctggga gaaaggcgct gagcgtggca 360
tgcaggtgtc tttgctgtgt ttttctccac ttctaaatgg ttcctggttc ctttcttcct 420
cgtttgttac tttagagcaa gtttgcccat agtcttgaat gcaatatttg tttattccaa 480
aagaacatat ttataataaa atcactgtag aaggattttt aagatgttag tgaattctgt 540
ttcttttcat tctcggaaat ggcaggaagc agctccagtc tctgatttcc atgggtcacg 600
tgctggggat gtgatgaagc ctgcagtctg cactgtgttg ctgagcacat ggatttcacc 660
actggaacac aggtgtgctg cttgttagca agcagagcaa taaagatgtg ctggatgtca 720
<![CDATA[<210> 243]]>
<![CDATA[<211> 1747]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 243]]>
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcagcattat ggatccaagc ctgttgagag aaagggagct 300
gttcaaaaaa cgagctcttt ctactcctgt agtagaaaaa cgttcagcat cttctgagtc 360
atcatcatca tcgtcaaaga agaagaaaac aaaggtagaa catggaggat cgtcaggctc 420
taaacaaaat tctgatcata gcaatggatc atttaacttg aaagctttgt caggaagctc 480
tggatataag tttggtgttc ttgctaagat tgtgaattac atgaagacac ggcatcagcg 540
aggagatacg catcctctaa ccttagatga aattttggat gaaacacaac atttagatat 600
tggactcaag cagaaacaat ggctaatgac tgaggcttta gtcaacaatc ccaaaattga 660
agtaatagat gggaagtatg ctttcaagcc caagtacaac gtgagagata agaaggccct 720
acttaggctc ttagatcagc atgaccagcg aggattagga ggaattcttt tagaagacat 780
agaagaagca ctgcccaatt cccagaaagc tgtcaaggct ttgggggacc agatactatt 840
tgtaaatcgt cccgataaga agaaaatact tttcttcaat gataagagct gtcagttttc 900
tgtggatgaa gaatttcaga aactgtggag gagtgtcact gtagattcca tggacgagga 960
gaaaattgaa gaatatctga agcgacaggg tatttcttcc atgcaggaat ctggaccaaa 1020
gaaagtggcc cctattcaga gaaggaaaaa gcctgcttca cagaaaaagc gacgctttaa 1080
gactcataac gaacacttgg ctggagtgct gaaggattac tctgacatta cttccagcaa 1140
atagggaaca gttttgccct ggaacagagt tacagataca caatcaagag tgttcttgct 1200
gatgctcggg gtctgaagac tgtcttccta tctgcttctt gcggctgagg agaggagcag 1260
ttcagtttac aaaacaagtg caaattacca aactcaaagc ttatttgagt agaatgggct 1320
catgggcaat gtgatgttcc ctgttaacct tctgttactc cctgggagaa aggcgctgag 1380
cgtggcatgc aggtgtcttt gctgtgtttt tctccacttc taaatggttc ctggttcctt 1440
tcttcctcgt ttgttacttt agagcaagtt tgcccatagt cttgaatgca atatttgttt 1500
attccaaaag aacatattta taataaaatc actgtagaag gatttttaag atgttagtga 1560
attctgtttc ttttcattct cggaaatggc aggaagcagc tccagtctct gatttccatg 1620
ggtcacgtgc tggggatgtg atgaagcctg cagtctgcac tgtgttgctg agcacatgga 1680
tttcaccact ggaacacagg tgtgctgctt gttagcaagc agagcaataa agatgtgctg 1740
gatgtca 1747
<![CDATA[<210> 244]]>
<![CDATA[<211> 55]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 244]]>
Met Asp Pro Ser Leu Leu Arg Glu Arg Glu Leu Phe Lys Lys Arg Ala
1 5 10 15
Leu Ser Thr Pro Val Val Glu Lys Arg Ser Ala Ser Ser Glu Ser Ser
20 25 30
Ser Ser Ser Ser Lys Lys Lys Lys Thr Lys Val Glu His Gly Gly Ser
35 40 45
Ser Gly Ser Lys Gln Asn Ser
50 55
<![CDATA[<210> 245]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 245]]>
His Ser Asn Gly Ser Phe Asn Leu Lys Ala Leu Ser Gly Ser Ser Gly
1 5 10 15
Tyr Lys Phe Gly Val Leu Ala Lys Ile Val Asn Tyr Met Lys
20 25 30
<![CDATA[<210> 246]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 246]]>
Thr Arg His Gln Arg Gly Asp Thr His Pro Leu Thr Leu Asp Glu Ile
1 5 10 15
Leu Asp Glu Thr Gln His Leu Asp Ile Gly Leu Lys Gln Lys Gln Trp
20 25 30
Leu Met Thr Glu
35
<![CDATA[<210> 247]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 247]]>
Ala Leu Val Asn Asn Pro Lys Ile Glu Val Ile Asp Gly Lys Tyr Ala
1 5 10 15
Phe Lys Pro Lys Tyr Asn Val Arg Asp Lys Lys Ala Leu Leu Arg Leu
20 25 30
Leu Asp Gln His Asp Gln Arg Gly Leu Gly Gly Ile Leu Leu Glu Asp
35 40 45
Ile Glu Glu Ala Leu Pro Asn Ser Gln Lys Ala Val Lys
50 55 60
<![CDATA[<210> 248]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 248]]>
Ala Leu Gly Asp Gln Ile Leu Phe Val Asn Arg Pro Asp Lys Lys Lys
1 5 10 15
Ile Leu Phe Phe Asn Asp Lys Ser Cys Gln Phe Ser Val Asp Glu
20 25 30
<![CDATA[<210> 249]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 249]]>
Phe Gln Lys Leu Trp Arg Ser Val Thr Val Asp Ser Met Asp Glu Glu
1 5 10 15
Lys Ile Glu Glu Tyr Leu Lys Arg Gln Gly Ile Ser Ser Met Gln Glu
20 25 30
Ser Gly Pro Lys Lys Val
35
<![CDATA[<210> 250]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 250]]>
Ala Pro Ile Gln Arg Arg Lys Lys Pro Ala Ser Gln Lys Lys Arg Arg
1 5 10 15
Phe Lys Thr His Asn Glu His Leu Ala Gly Val Leu Lys Asp Tyr Ser
20 25 30
Asp Ile Thr Ser Ser Lys
35
<![CDATA[<210> 251]]>
<![CDATA[<211> 291]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 251]]>
Met Asp Pro Ser Leu Leu Arg Glu Arg Glu Leu Phe Lys Lys Arg Ala
1 5 10 15
Leu Ser Thr Pro Val Val Glu Lys Arg Ser Ala Ser Ser Glu Ser Ser
20 25 30
Ser Ser Ser Ser Lys Lys Lys Lys Thr Lys Val Glu His Gly Gly Ser
35 40 45
Ser Gly Ser Lys Gln Asn Ser Asp His Ser Asn Gly Ser Phe Asn Leu
50 55 60
Lys Ala Leu Ser Gly Ser Ser Gly Tyr Lys Phe Gly Val Leu Ala Lys
65 70 75 80
Ile Val Asn Tyr Met Lys Thr Arg His Gln Arg Gly Asp Thr His Pro
85 90 95
Leu Thr Leu Asp Glu Ile Leu Asp Glu Thr Gln His Leu Asp Ile Gly
100 105 110
Leu Lys Gln Lys Gln Trp Leu Met Thr Glu Ala Leu Val Asn Asn Pro
115 120 125
Lys Ile Glu Val Ile Asp Gly Lys Tyr Ala Phe Lys Pro Lys Tyr Asn
130 135 140
Val Arg Asp Lys Lys Ala Leu Leu Arg Leu Leu Asp Gln His Asp Gln
145 150 155 160
Arg Gly Leu Gly Gly Ile Leu Leu Glu Asp Ile Glu Glu Ala Leu Pro
165 170 175
Asn Ser Gln Lys Ala Val Lys Ala Leu Gly Asp Gln Ile Leu Phe Val
180 185 190
Asn Arg Pro Asp Lys Lys Lys Ile Leu Phe Phe Asn Asp Lys Ser Cys
195 200 205
Gln Phe Ser Val Asp Glu Glu Phe Gln Lys Leu Trp Arg Ser Val Thr
210 215 220
Val Asp Ser Met Asp Glu Glu Lys Ile Glu Glu Tyr Leu Lys Arg Gln
225 230 235 240
Gly Ile Ser Ser Met Gln Glu Ser Gly Pro Lys Lys Val Ala Pro Ile
245 250 255
Gln Arg Arg Lys Lys Pro Ala Ser Gln Lys Lys Arg Arg Phe Lys Thr
260 265 270
His Asn Glu His Leu Ala Gly Val Leu Lys Asp Tyr Ser Asp Ile Thr
275 280 285
Ser Ser Lys
290
<![CDATA[<210> 252]]>
<![CDATA[<211> 434]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 252]]>
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcagcattat ggatccaagc ctgttgagag aaagggagct 300
gttcaaaaaa cgagctcttt ctactcctgt agtagaaaaa cgttcagcat cttctgagtc 360
atcatcatca tcgtcaaaga agaagaaaac aaaggtagaa catggaggat cgtcaggctc 420
taaacaaaat tctg 434
<![CDATA[<210> 253]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 253]]>
atcctaaaca gcacatccaa tcacctgtgg ctagagttca acaccaatgg atctgacacc 60
gaccaaggtt ttcaactcac ctatacca 88
<![CDATA[<210> 254]]>
<![CDATA[<211> 62]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 254]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ct 62
<![CDATA[<210> 255]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 255]]>
atcctaaaca gcacatccaa tcacctgtgg ctagagttca acaccaatgg atctgacacc 60
gaccaaggtt ttcaactcac ctataccact acatctacat ccaccactgg gacaagccat 120
cttgtaaaat gtgcggagaa ggagaaaact 150
<![CDATA[<210> 256]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 256]]>
Ile Leu Asn Ser Thr Ser Asn His Leu Trp Leu Glu Phe Asn Thr Asn
1 5 10 15
Gly Ser Asp Thr Asp Gln Gly Phe Gln Leu Thr Tyr Thr Thr Thr Ser
20 25 30
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu
35 40 45
Lys Thr
50
<![CDATA[<210> 257]]>
<![CDATA[<211> 583]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 257]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agg 583
<![CDATA[<210> 258]]>
<![CDATA[<211> 217]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 258]]>
gtcagaactg tggaggctta gtccagggtc ccaatggcac tattgagagc ccagggtttc 60
ctcacgggta tccgaactat gccaactgca cctggatcat catcacgggc gagcgcaata 120
ggatacagtt gtccttccat acctttgctc ttgaagaaga ttttgatatt ttatcagttt 180
acgatggaca gcctcaacaa gggaatttaa aagtgag 217
<![CDATA[<210> 259]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 259]]>
attatcggga tttcagctgc cctcctctat agtgagtaca ggatctatcc tcactctgtg 60
gttcacgaca gacttcgctg tgagtgccca aggtttcaaa gcattatatg aag 113
<![CDATA[<210> 260]]>
<![CDATA[<211> 195]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 260]]>
ttttacctag ccacacttgt ggaaatcctg gagaaatcct gaaaggagtt ctgcatggaa 60
cgagattcaa cataggagac aaaatccggt acagctgcct ccctggctac atcttggaag 120
gccacgccat cctgacctgc atcgtcagcc caggaaatgg tgcatcgtgg gacttcccag 180
ctcccttttg cagag 195
<![CDATA[<210> 261]]>
<![CDATA[<211> 208]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 261]]>
ctgagggagc ctgcggagga accttacgcg ggaccagcag ctccatctcc agcccgcact 60
tcccttcaga gtacgagaac aacgcggact gcacctggac cattctggct gagcccgggg 120
acaccattgc gctggtcttc actgactttc agctagaaga aggatatgat ttcttagaga 180
tcagtggcac ggaagctcca tccatatg 208
<![CDATA[<210> 262]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 262]]>
gctaactggc atgaacctcc cctctccagt tatcagtagc aagaattggc tacgactcca 60
tttcacctct gacagcaacc accgacgcaa aggatttaac gctcagttcc aag 113
<![CDATA[<210> 263]]>
<![CDATA[<211> 78]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 263]]>
tgaaaaaggc gattgagttg aagtcaagag gagtcaagat gctgcccagc aaggatggaa 60
gccataaaaa ctctgtct 78
<![CDATA[<210> 264]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 264]]>
tgagccaagg aggtgttgca ttggtctctg acatgtgtcc agatcctggg attccagaaa 60
atggtagaag agcaggttcc gacttcag 88
<![CDATA[<210> 265]]>
<![CDATA[<211> 125]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 265]]>
ggttggtgca aatgtacagt tttcatgtga ggacaattac gtgctccagg gatctaaaag 60
catcacctgt cagagagtta cagagacgct cgctgcttgg agtgaccaca ggcccatctg 120
ccgag 125
<![CDATA[<210> 266]]>
<![CDATA[<211> 122]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 266]]>
cgagaacatg tggatccaat ctgcgtgggc ccagcggcgt cattacctcc cctaattatc 60
cggttcagta tgaagataat gcacactgtg tgtgggtcat caccaccacc gacccggaca 120
ag 122
<![CDATA[<210> 267]]>
<![CDATA[<211> 104]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 267]]>
gtcatcaagc ttgcctttga agagtttgag ctggagcgag gctatgacac cctgacggtt 60
ggtgatgctg ggaaggtggg agacaccaga tcggtcttgt acgt 104
<![CDATA[<210> 268]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 268]]>
gctcacggga tccagtgttc ctgacctcat tgtgagcatg agcaaccaga tgtggctaca 60
tctgcagtcg gatgatagca ttggctcacc tgggtttaaa gctgtttacc aag 113
<![CDATA[<210> 269]]>
<![CDATA[<211> 183]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 269]]>
aaattgaaaa gggagggtgt ggggatcctg gaatccccgc ctatgggaag cggacgggca 60
gcagtttcct ccatggagat acactcacct ttgaatgccc ggcggccttt gagctggtgg 120
gggagagagt tatcacctgt cagcagaaca atcagtggtc tggcaacaag cccagctgtg 180
tat 183
<![CDATA[<210> 270]]>
<![CDATA[<211> 327]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 270]]>
tttcatgttt cttcaacttt acggcatcat ctgggattat tctgtcacca aattatccag 60
aggaatatgg gaacaacatg aactgtgtct ggttgattat ctcggagcca ggaagtcgaa 120
ttcacctaat ctttaatgat tttgatgttg agcctcagtt tgactttctc gcggtcaagg 180
atgatggcat ttctgacata actgtcctgg gtactttttc tggcaatgaa gtgccttccc 240
agctggccag cagtgggcat atagttcgct tggaatttca gtctgaccat tccactactg 300
gcagagggtt caacatcact tacacca 327
<![CDATA[<210> 271]]>
<![CDATA[<211> 195]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 271]]>
catttggtca gaatgagtgc catgatcctg gcattcctat aaacggacga cgttttggtg 60
acaggtttct actcgggagc tcggtttctt tccactgtga tgatggcttt gtcaagaccc 120
agggatccga gtccattacc tgcatactgc aagacgggaa cgtggtctgg agctccaccg 180
tgccccgctg tgaag 195
<![CDATA[<210> 272]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 272]]>
ctccatgtgg tggacatctg acagcgtcca gcggagtcat tttgcctcct ggatggccag 60
gatattataa ggattcttta cattgtgaat ggataattga agcaaaacca ggccactcta 120
tcaaaataac ttttgacag 139
<![CDATA[<210> 273]]>
<![CDATA[<211> 188]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 273]]>
atttcagaca gaggtcaatt atgacacctt ggaggtcaga gatgggccag ccagttcgtc 60
cccactgatc ggcgagtacc acggcaccca ggcaccccag ttcctcatca gcaccgggaa 120
cttcatgtac ctgctgttca ccactgacaa cagccgctcc agcatcggct tcctcatcca 180
ctatgaga 188
<![CDATA[<210> 274]]>
<![CDATA[<211]]>> 189]]>
<br/><![CDATA[<212> DNA]]>
<br/><![CDATA[<213> 人類]]>
<br/>
<br/><![CDATA[<400> 274]]>
<br/><![CDATA[gtgtgacgct tgagtcggat tcctgcctgg acccgggcat ccctgtgaac ggccatcgcc 60
acggtggaga ctttggcatc aggtccacag tgactttcag ctgtgacccg gggtacacac 120
taagtgacga cgagcccctc gtctgtgaga ggaaccacca gtggaaccac gccttgccca 180
gctgcgacg 189
<![CDATA[<210> 275]]>
<![CDATA[<211> 117]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 275]]>
ctctatgtgg aggctacatc caagggaaga gtggaacagt cctttctcct gggtttccag 60
atttttatcc aaactctcta aactgcacgt ggaccattga agtgtctcat gggaaag 117
<![CDATA[<210> 276]]>
<![CDATA[<211> 216]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 276]]>
gagttcaaat gatctttcac acctttcatc ttgagagttc ccacgactat ttactgatca 60
cagaggatgg aagtttttcc gagcccgttg ccaggctcac cgggtcggtg ttgcctcata 120
cgatcaaggc aggcctgttt ggaaacttca ctgcccagct tcggtttata tcagacttct 180
caatttcgta cgagggcttc aatatcacat tttcag 216
<![CDATA[<210> 277]]>
<![CDATA[<211> 189]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 277]]>
aatatgacct ggagccatgt gatgatcctg gagtccctgc cttcagccga agaattggtt 60
ttcactttgg tgtgggagac tctctgacgt tttcctgctt cctgggatat cgtttagaag 120
gtgccaccaa gcttacctgc ctgggtgggg gccgccgtgt gtggagtgca cctctgccaa 180
ggtgtgtgg 189
<![CDATA[<210> 278]]>
<![CDATA[<211> 170]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 278]]>
ccgaatgtgg agcaagtgtc aaaggaaatg aaggaacatt actgtctcca aattttccat 60
ccaattatga taataaccat gagtgtatct ataaaataga aacagaagcc ggcaagggca 120
tccaccttag aacacgaagc ttccagctgt ttgaaggaga tactctaaag 170
<![CDATA[<210> 279]]>
<![CDATA[<211> 157]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 279]]>
gtatatgatg gaaaagacag ttcctcacgt ccactgggca cgttcactaa aaatgaactt 60
ctggggctga tcctaaacag cacatccaat cacctgtggc tagagttcaa caccaatgga 120
tctgacaccg accaaggttt tcaactcacc tatacca 157
<![CDATA[<210> 280]]>
<![CDATA[<211> 192]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 280]]>
gttttgatct ggtaaaatgt gaggatccgg gcatccctaa ctacggctat aggatccgtg 60
atgaaggcca ctttaccgac actgtagttc tgtacagttg caacccgggg tacgccatgc 120
atggcagcaa caccctgacc tgtttgagtg gagacaggag agtgtgggac aaaccactac 180
cttcgtgcat ag 192
<![CDATA[<210> 281]]>
<![CDATA[<211> 10288]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 281]]>
gttttgatct ggtaaaatgt gaggatccgg gcatccctaa ctacggctat aggatccgtg 60
atgaaggcca ctttaccgac actgtagttc tgtacagttg caacccgggg tacgccatgc 120
atggcagcaa caccctgacc tgtttgagtg gagacaggag agtgtgggac aaaccactac 180
cttcgtgcat agcggaatgt ggtggtcaga tccatgcagc cacatcagga cgaatattgt 240
cccctggcta tccagctccg tatgacaaca acctccactg cacctggatt atagaggcag 300
acccaggaaa gaccattagc ctccatttca ttgttttcga cacggagatg gctcacgaca 360
tcctcaaggt ctgggacggg ccggtggaca gtgacatcct gctgaaggag tggagtggct 420
ccgcccttcc ggaggacatc cacagcacct tcaactcact caccctgcag ttcgacagcg 480
acttcttcat cagcaagtct ggcttctcca tccagttctc cacctcaatt gcagccacct 540
gtaacgatcc aggtatgccc caaaatggca cccgctatgg agacagcaga gaggctggag 600
acaccgtcac attccagtgt gaccctggct atcagctcca aggacaagcc aaaatcacct 660
gtgtgcagct gaataaccgg ttcttttggc aaccagaccc tcctacatgc atagctgctt 720
gtggagggaa tctgacgggc ccagcaggtg ttattttgtc acccaactac ccacagccgt 780
atcctcctgg gaaggaatgt gactggagag taaaagtgaa cccggacttt gtcatcgcct 840
tgatattcaa aagtttcaac atggagccca gctatgactt cctacacatc tatgaagggg 900
aagattccaa cagccccctc attgggagtt accagggctc tcaggcccca gaaagaatag 960
agagtagcgg aaacagcctg tttctggcat ttcggagtga tgcctccgtg ggcctttcag 1020
ggttcgccat tgaatttaaa gagaaaccac gggaagcttg ttttgaccca ggaaatataa 1080
tgaatgggac aagagttgga acagacttca agcttggctc caccatcacc taccagtgtg 1140
actctggcta taagattctt gacccctcat ccatcacctg tgtgattggg gctgatggga 1200
aaccctcctg ggaccaagtg ctgccctcct gcaatgctcc ctgtggaggc cagtacacgg 1260
gatcagaagg ggtagtttta tcaccaaact acccccataa ttacacagct ggtcaaatat 1320
gcctctattc catcacggta ccaaaggaat tcgtggtctt tggacagttt gcctatttcc 1380
agacagccct gaatgatttg gcagaattat ttgatggaac ccatgcacag gccagacttc 1440
tcagctcact ctcggggtct cactcagggg aaacattgcc cttggctacg tcaaatcaaa 1500
ttctgctccg attcagtgca aagagcggtg cctctgcccg cggcttccac ttcgtgtatc 1560
aagctgttcc tcgtaccagt gacacccaat gcagctctgt ccccgagccc agatacggaa 1620
ggagaattgg ttctgagttt tctgccggct ccatcgtccg attcgagtgc aacccgggat 1680
acctgcttca gggttccacg gcgctccact gccagtccgt gcccaacgcc ttggcacagt 1740
ggaacgacac gatccccagc tgtgtggtac cctgcagtgg caatttcact caacgaagag 1800
gtacaatcct gtcccccggc taccctgagc catacggaaa caacttgaac tgtatatgga 1860
agatcatagt tacggagggc tcgggaattc agatccaagt gatcagtttt gccacggagc 1920
agaactggga ctcccttgag atccacgatg gtggggatgt gaccgcaccc agactgggaa 1980
gcttctcagg caccacagta ccggcactgc tgaacagtac ttccaaccaa ctctacctgc 2040
atttccagtc tgacattagt gtggcagctg ctggtttcca cctggaatac aaaactgtag 2100
gtcttgctgc atgccaagaa ccagccctcc ccagcaacag catcaaaatc ggagatcggt 2160
acatggtgaa cgacgtgctc tccttccagt gcgagcccgg gtacaccctg cagggccgtt 2220
cccacatttc ctgtatgcca gggaccgttc gccgttggaa ctatccgtct cccctgtgca 2280
ttgcaacctg tggagggacg ctgagcacct tgggtggtgt gatcctgagc cccggcttcc 2340
caggttctta ccccaacaac ttagactgca cctggaggat ctcattaccc atcggctatg 2400
gtgcacatat tcagtttctg aatttttcta ccgaagctaa tcatgacttc cttgaaattc 2460
aaaatggacc ttaccacacc agccccatga ttggacaatt tagcggcacg gatctccccg 2520
cggccctgct gagcacaacg catgaaaccc tcatccactt ttatagtgac cattcgcaaa 2580
accggcaagg atttaaactt gcttaccaag cctatgaatt acagaactgt ccagatccac 2640
ccccatttca gaatgggtac atgatcaact cggattacag cgtggggcaa tcagtatctt 2700
tcgagtgtta tcctgggtac attctaatag gccatcctgt cctcacttgt cagcatggga 2760
tcaacagaaa ctggaactac ccttttccaa gatgtgatgc cccttgtggg tacaacgtaa 2820
cttctcagaa cggcaccatc tactcccctg gctttcctga tgagtatccg atcctgaagg 2880
actgcatttg gctcatcacg gtgcctccag ggcacggagt ttacatcaac ttcaccctgt 2940
tacagacgga agctgtcaac gattacattg ctgtttggga cggtcccgat cagaactcac 3000
cccagctggg agttttcagt ggcaacacag ccctcgaaac ggcgtatagc tccaccaacc 3060
aagtcctgct caagttccac agcgactttt caaatggagg cttctttgtc ctcaatttcc 3120
acgcatttca gctcaagaaa tgtcaacctc ccccagcggt tccacaggca gaaatgctta 3180
ctgaggatga tgatttcgaa ataggagatt ttgtgaagta ccagtgccac cccgggtaca 3240
ccttggtggg gaccgacatt ctgacttgca agctcagttc ccagttgcag tttgagggtt 3300
ctctcccaac atgtgaagca caatgcccag caaatgaagt ccggactgga tcatcgggag 3360
tcattctcag tccagggtat ccgggtaatt attttaactc ccagacttgc tcttggagta 3420
ttaaagtgga accaaactac aacattacca tctttgtgga cacatttcaa agtgaaaagc 3480
agtttgatgc actggaagtg tttgatggtt cttctgggca aagtcctctg ctagtagtct 3540
taagtgggaa tcatactgaa caatcaaatt ttacaagcag gagtaatcag ttatatctcc 3600
gctggtccac tgaccatgcc accagtaaga aaggattcaa gattcgctat gcagcacctt 3660
actgcagttt gacccacccc ctgaagaatg ggggtattct aaacaggact gcaggagcgg 3720
ttggaagcaa agtgcattat ttttgcaagc ctggataccg aatggtcggc cacagcaatg 3780
caacctgtag acgaaaccca cttggcatgt accagtggga ctccctcacg ccactctgcc 3840
aggctgtgtc ctgtggaatc ccagaatccc caggaaacgg ttcatttacc gggaacgagt 3900
tcactttgga cagtaaagtg gtctatgaat gtcatgaggg cttcaagctt gaatccagcc 3960
agcaagcaac agccgtgtgt caagaagatg ggttgtggag taacaagggg aagccgccca 4020
cgtgtaagcc ggtcgcttgc cccagcattg aagctcagct ctcagaacat gtcatctgga 4080
ggctggtttc aggatccttg aatgagtacg gtgctcaagt attgctgagc tgcagtcctg 4140
gttactactt agaaggctgg aggctcctgc ggtgccaggc caatgggacg tggaacatag 4200
gagatgagag gccaagctgt cgagttatct cgtgtggaag cctttccttt cccccaaatg 4260
gcaacaagat tggaacgttg acagtttatg gggccacagc tatatttacg tgcaacaccg 4320
gctacacgct tgtggggtct catgtcagag agtgcttggc aaatgggctc tggagcggca 4380
gcgaaactcg atgtctggct ggccactgcg gttccccaga cccgattgtg aacggtcaca 4440
ttagtggaga tggcttcagt tacagagaca cggtggttta ccagtgcaat cctggtttcc 4500
ggcttgtggg aacttccgtg aggatatgcc tgcaagacca caagtggtct ggacaaacgc 4560
ctgtctgtgt ccccatcaca tgtggtcacc ctggaaaccc tgcccacgga ttcactaatg 4620
gcagtgagtt caacctgaat gatgtcgtga atttcacctg caacacgggc tatttgctgc 4680
agggcgtgtc tcgagcccag tgtcggagca acggccagtg gagtagccct ctgcccacgt 4740
gtcgagtggt gaactgttct gatccaggct ttgtggaaaa tgccattcgt cacgggcaac 4800
agaacttccc tgagagtttt gagtatggaa tgagtatcct gtaccattgc aagaagggat 4860
tttacttgct gggatcttca gccttgacct gtatggcaaa tggcttatgg gaccgatccc 4920
tgcccaagtg tttggctata tcgtgtggac acccaggggt ccctgccaac gccgtcctca 4980
ctggagagct gtttacctat ggcgccgtcg tgcactactc ctgcagaggg agcgagagcc 5040
tcataggcaa cgacacgaga gtgtgccagg aagacagtca ctggagcggg gcactgcccc 5100
actgcacagg aaataatcct ggattctgtg gtgatccggg gaccccagca catgggtctc 5160
ggcttggtga tgactttaag acaaagagtc ttctccgctt ctcctgtgaa atggggcacc 5220
agctgagggg ctcccctgaa cgcacgtgtt tgctcaatgg gtcatggtca ggactgcagc 5280
cggtgtgtga ggccgtgtcc tgtggcaacc ctggcacacc caccaacgga atgattgtca 5340
gtagtgatgg cattctgttc tccagctcgg tcatctatgc ctgctgggaa ggctacaaga 5400
cctcagggct catgacacgg cattgcacag ccaatgggac ctggacaggc actgctcccg 5460
actgcacaat tataagttgt ggggatccag gcacactagc aaatggcatc cagtttggga 5520
ccgacttcac cttcaacaag actgtgagct atcagtgtaa cccaggctat gtcatggaag 5580
cagtcacatc cgccactatt cgctgtacca aagacggcag gtggaatccg agcaaacctg 5640
tctgcaaagc cgtgctgtgt cctcagccgc cgccggtgca gaatggaaca gtggagggaa 5700
gtgatttccg ctggggctcc agcataagtt acagctgcat ggacggttac cagctctctc 5760
actccgccat cctctcctgt gaaggtcgcg gggtgtggaa aggagagatc ccccagtgtc 5820
tccctgtgtt ctgcggagac cctggcatcc ccgcagaagg gcgacttagt gggaaaagtt 5880
tcacctataa gtccgaagtc ttcttccagt gcaaatctcc atttatactc gtgggatcct 5940
ccagaagagt ctgccaagct gacggcacgt ggagcggcat acaacccacc tgcattgatc 6000
ctgctcataa cacctgccca gaccctggta cgccacactt tggaatacag aatagctcca 6060
gaggctatga ggttggaagc acggtttttt tcaggtgcag aaaaggctac catattcaag 6120
gttccacgac tcgcacctgc cttgccaatt taacatggag tgggatacag accgaatgta 6180
tacctcatgc ctgcagacag ccagaaaccc cggcacacgc ggatgtgaga gccatcgatc 6240
ttcctacttt cggctacacc ttagtgtaca cctgccatcc aggctttttc ctcgcagggg 6300
gatctgagca cagaacatgt aaagcagaca tgaaatggac aggaaagtcg cctgtgtgta 6360
aaagtaaagg agtgagagaa gttaatgaaa cagttactaa aactccagtt ccttcagatg 6420
tctttttcgt caattcactg tggaaggggt attatgaata tttagggaaa agacaacccg 6480
ccactctaac tgttgactgg ttcaatgcaa caagcagtaa ggtgaatgcc accttcagcg 6540
aagcctcgcc agtggagctg aagttgacag gcatttacaa gaaggaggag gcccacttac 6600
tcctgaaagc ttttcaaatt aaaggccagg cagatatttt tgtaagcaag ttcgaaaatg 6660
acaactgggg actagatggt tatgtgtcat ctggacttga aagaggagga tttacttttc 6720
aaggtgacat tcatggaaaa gactttggaa aatttaagct agaaaggcaa gatcctttaa 6780
acccagatca agactcttcc agtcattacc acggcaccag cagtggctct gtggcggctg 6840
ccattctggt tcctttcttt gctctaattt tatcagggtt tgcattttac ctctacaaac 6900
acagaacgag accaaaagtt caatacaatg gctatgctgg gcatgaaaac agcaatggac 6960
aagcatcgtt tgaaaacccc atgtatgata caaacttaaa acccacagaa gccaaggctg 7020
tgaggtttga cacaactctg aacacagtct gtacagtggt atagccctca gtgccccaac 7080
aggactgatt catagccata cctctgatgg acaagcagtg attcctttgg tgccatatac 7140
cactctccct tccactctgg ctttactgca gcgatcttca accttgtcta ctggcataag 7200
tgcagcgggg atctctactc aaatgtgtca gggtcttcta cggatcaaac tacacatgcg 7260
ttttcattcc aaaagtgggt tctaaatgcc tggctgcatc tgtatgaaat caaggcacac 7320
tccaggaaga ctgccacgtc gcgccaacac gtcatactca atgcctcaga ctttcatatt 7380
tctgtgttgc tgagatgcct ttcaatgcaa tcgtctgggc tcgtggatat gtccctcagg 7440
tgcggtgaca gaatggtggc accacgatat gtgttctctt gtgttgtttt tcctttttaa 7500
acccccatga acacgaatac tctgaaaaaa ataaaaagct ttctggaaga agacaccttt 7560
ctgatagagg ctcacaccta caaatgcttc actctgtcct tccgagacct gacaagcttt 7620
gaggacctca cagctcccct gtgtgttcat ctctagggat gtttgcaatt tcccagtcag 7680
ctgttctgtc gcagaatgtt taatgcacaa ttttttgcac tagtgtgtta tgaatgacta 7740
agattctgat aaaaaaaata aattatttac acagggttta tacacactat ccattgtata 7800
taagcattat ttcatattat caagctaaac attcccccat cagcttagtt ggagtgttag 7860
ggaaaagtat tcctagatat ggcacagatt ttaaaaggaa atacagtatt gaagagattt 7920
attttattat tgcttcaatt agctccattt acgtgttgaa ttcattgaag aggtccaatg 7980
agaaaaaaac agaagcctcc ttatttcaca cgttttcctc ctttagtacc atcctcatcc 8040
aattactgtc tctctgatac tacttaatag cagggggttt gcagaaattt ctgtttgcca 8100
tgtaaaactg tgaatagtaa tttattttag atagtcgatg aacttgtggg ttttagctca 8160
caatgcagcc ttcccttttg cagtgttttt tttttgtttt tttttttttt tgtcttttac 8220
tgtgccatcg atctttgata ttgcattgaa agacaatata ccacagtagc accttgaact 8280
cagtgaaaat tgttcaggat caaaatacca agtgttcttt tagagggaag gaaaaagtac 8340
acacactctc ctctcacaat gatatatttt atacattcat ttgttatttg tttcatgctt 8400
tatgattcca gatggaaagg taatttcagt gacttttcaa gtttaaattc cattataggt 8460
aaatgataag ttatgatgca aataaaatct ataagatccc cagggcaaat aaaaatcaaa 8520
acatgaagta gaagatgtgg ccgtgaggta gtttatgtaa caaattcaaa gtgaaaatca 8580
tgtttacttt tacttatact tatttgataa aaatattttt gaaacgatag tacttatttt 8640
attatttgat atttcagttc ctattcaatt gtggcagatt ttctctgttt cacattttag 8700
attggcgttg gtaatagaaa tgtcagaatg ttcaaattgg ccttcacgtt gtcggagtga 8760
acacattgac acctagcttt aagactgatt tatctgttgg tgtactgaag gtttccatgt 8820
aggacttcaa atgtggaaaa ggaaaagcag tcaggaaaat ggggcattct ttggagagtc 8880
acgcgttttg attcggacat ttccgtagag ctcggctccc agtgttgtgt tcctcggtcg 8940
aaagggtctc tgctgtttgg ggactcactg gcctctccta gggactcctt tgtcttgtga 9000
accccacgct gttggattct gtatcattat gctgaattct ctgcacagtt ttccctggcc 9060
aacctgccca catccttgga gatttgcttt gccagtggga atccttacat tgctgtttca 9120
cagtagacgg gacgaggtca gcgggagtcg tgctcctaac acacacattg aacgaaacag 9180
aagatgattg aaagtgtgag gaggctcgtg tgcaagggag aacagggtta ctatacatat 9240
tagtgtatat atatacatac atatatatat atatatattg tacatatcta agtttgagtc 9300
attcaaacta ggtgcaaaat gctgacttca gagtctgaat taacatctct gttcccatat 9360
ccctgacctg ctccctggtc aacgatgcta tgaaatcctg aaatgacagg acatacatac 9420
atacaagaaa ccacatatca aattagatat gattttcctt tgtgtgcaaa gtcaaactgt 9480
cctagggttg ccagtttgaa gcatgttatt taaatgaaaa aaaaaatcag tgaaattctc 9540
gtgtgagaat tctgcctagt ttcttcctaa ggttgtgtgc agtgttgaac ggcgtctccg 9600
caaggtgttg gaggatctca ttttagggca gtcaggagct gtgcttgctg agttaggtct 9660
agaagactct tccctgaagg caacgggaac acgcgtgagg gacgcgacca cacactaaca 9720
gaggacacgt gcttcagagc tgtttaaaac tgctgcttgt tttacacaca catcttgcct 9780
tttttcaggc tagctgcaat aatttttttc ttctgtaaaa tattttgtaa acaacaacaa 9840
aaagctatta taaaaagggg gtaaaaaaaa gaacgctggc attatgatca ggaaaaccca 9900
ttgtcatcgc cgaccctccc tcccgtccca ccacacgctg ctgtcacgac gtaggtgcga 9960
aagacctttt tgtacagaga tatatttttt atgaagaatt tgtaaaatta ttaaatatgc 10020
tgtaattttt tgattaatgt aggtacattg ttaaaaaata aatgttttta caatacagaa 10080
ctgtaatttt cccaataatg taaaatgtac catctctagc tgattttcag ttccaatcct 10140
attacacatg tattaatatt aaagtggcct gttaaaatga acagtatctt ttttttgtca 10200
aaaaaattat aaagagggtg taatatagcc tgtgcaatgc caccaatctt taaagcaaat 10260
cagagttcta attaaatatt taatttta 10288
<![CDATA[<210> 282]]>
<![CDATA[<211> 14417]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 282]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agggtcagaa ctgtggaggc 600
ttagtccagg gtcccaatgg cactattgag agcccagggt ttcctcacgg gtatccgaac 660
tatgccaact gcacctggat catcatcacg ggcgagcgca ataggataca gttgtccttc 720
catacctttg ctcttgaaga agattttgat attttatcag tttacgatgg acagcctcaa 780
caagggaatt taaaagtgag attatcggga tttcagctgc cctcctctat agtgagtaca 840
ggatctatcc tcactctgtg gttcacgaca gacttcgctg tgagtgccca aggtttcaaa 900
gcattatatg aagttttacc tagccacact tgtggaaatc ctggagaaat cctgaaagga 960
gttctgcatg gaacgagatt caacatagga gacaaaatcc ggtacagctg cctccctggc 1020
tacatcttgg aaggccacgc catcctgacc tgcatcgtca gcccaggaaa tggtgcatcg 1080
tgggacttcc cagctccctt ttgcagagct gagggagcct gcggaggaac cttacgcggg 1140
accagcagct ccatctccag cccgcacttc ccttcagagt acgagaacaa cgcggactgc 1200
acctggacca ttctggctga gcccggggac accattgcgc tggtcttcac tgactttcag 1260
ctagaagaag gatatgattt cttagagatc agtggcacgg aagctccatc catatggcta 1320
actggcatga acctcccctc tccagttatc agtagcaaga attggctacg actccatttc 1380
acctctgaca gcaaccaccg acgcaaagga tttaacgctc agttccaagt gaaaaaggcg 1440
attgagttga agtcaagagg agtcaagatg ctgcccagca aggatggaag ccataaaaac 1500
tctgtcttga gccaaggagg tgttgcattg gtctctgaca tgtgtccaga tcctgggatt 1560
ccagaaaatg gtagaagagc aggttccgac ttcagggttg gtgcaaatgt acagttttca 1620
tgtgaggaca attacgtgct ccagggatct aaaagcatca cctgtcagag agttacagag 1680
acgctcgctg cttggagtga ccacaggccc atctgccgag cgagaacatg tggatccaat 1740
ctgcgtgggc ccagcggcgt cattacctcc cctaattatc cggttcagta tgaagataat 1800
gcacactgtg tgtgggtcat caccaccacc gacccggaca aggtcatcaa gcttgccttt 1860
gaagagtttg agctggagcg aggctatgac accctgacgg ttggtgatgc tgggaaggtg 1920
ggagacacca gatcggtctt gtacgtgctc acgggatcca gtgttcctga cctcattgtg 1980
agcatgagca accagatgtg gctacatctg cagtcggatg atagcattgg ctcacctggg 2040
tttaaagctg tttaccaaga aattgaaaag ggagggtgtg gggatcctgg aatccccgcc 2100
tatgggaagc ggacgggcag cagtttcctc catggagata cactcacctt tgaatgcccg 2160
gcggcctttg agctggtggg ggagagagtt atcacctgtc agcagaacaa tcagtggtct 2220
ggcaacaagc ccagctgtgt attttcatgt ttcttcaact ttacggcatc atctgggatt 2280
attctgtcac caaattatcc agaggaatat gggaacaaca tgaactgtgt ctggttgatt 2340
atctcggagc caggaagtcg aattcaccta atctttaatg attttgatgt tgagcctcag 2400
tttgactttc tcgcggtcaa ggatgatggc atttctgaca taactgtcct gggtactttt 2460
tctggcaatg aagtgccttc ccagctggcc agcagtgggc atatagttcg cttggaattt 2520
cagtctgacc attccactac tggcagaggg ttcaacatca cttacaccac atttggtcag 2580
aatgagtgcc atgatcctgg cattcctata aacggacgac gttttggtga caggtttcta 2640
ctcgggagct cggtttcttt ccactgtgat gatggctttg tcaagaccca gggatccgag 2700
tccattacct gcatactgca agacgggaac gtggtctgga gctccaccgt gccccgctgt 2760
gaagctccat gtggtggaca tctgacagcg tccagcggag tcattttgcc tcctggatgg 2820
ccaggatatt ataaggattc tttacattgt gaatggataa ttgaagcaaa accaggccac 2880
tctatcaaaa taacttttga cagatttcag acagaggtca attatgacac cttggaggtc 2940
agagatgggc cagccagttc gtccccactg atcggcgagt accacggcac ccaggcaccc 3000
cagttcctca tcagcaccgg gaacttcatg tacctgctgt tcaccactga caacagccgc 3060
tccagcatcg gcttcctcat ccactatgag agtgtgacgc ttgagtcgga ttcctgcctg 3120
gacccgggca tccctgtgaa cggccatcgc cacggtggag actttggcat caggtccaca 3180
gtgactttca gctgtgaccc ggggtacaca ctaagtgacg acgagcccct cgtctgtgag 3240
aggaaccacc agtggaacca cgccttgccc agctgcgacg ctctatgtgg aggctacatc 3300
caagggaaga gtggaacagt cctttctcct gggtttccag atttttatcc aaactctcta 3360
aactgcacgt ggaccattga agtgtctcat gggaaaggag ttcaaatgat ctttcacacc 3420
tttcatcttg agagttccca cgactattta ctgatcacag aggatggaag tttttccgag 3480
cccgttgcca ggctcaccgg gtcggtgttg cctcatacga tcaaggcagg cctgtttgga 3540
aacttcactg cccagcttcg gtttatatca gacttctcaa tttcgtacga gggcttcaat 3600
atcacatttt cagaatatga cctggagcca tgtgatgatc ctggagtccc tgccttcagc 3660
cgaagaattg gttttcactt tggtgtggga gactctctga cgttttcctg cttcctggga 3720
tatcgtttag aaggtgccac caagcttacc tgcctgggtg ggggccgccg tgtgtggagt 3780
gcacctctgc caaggtgtgt ggccgaatgt ggagcaagtg tcaaaggaaa tgaaggaaca 3840
ttactgtctc caaattttcc atccaattat gataataacc atgagtgtat ctataaaata 3900
gaaacagaag ccggcaaggg catccacctt agaacacgaa gcttccagct gtttgaagga 3960
gatactctaa aggtatatga tggaaaagac agttcctcac gtccactggg cacgttcact 4020
aaaaatgaac ttctggggct gatcctaaac agcacatcca atcacctgtg gctagagttc 4080
aacaccaatg gatctgacac cgaccaaggt tttcaactca cctataccag ttttgatctg 4140
gtaaaatgtg aggatccggg catccctaac tacggctata ggatccgtga tgaaggccac 4200
tttaccgaca ctgtagttct gtacagttgc aacccggggt acgccatgca tggcagcaac 4260
accctgacct gtttgagtgg agacaggaga gtgtgggaca aaccactacc ttcgtgcata 4320
gcggaatgtg gtggtcagat ccatgcagcc acatcaggac gaatattgtc ccctggctat 4380
ccagctccgt atgacaacaa cctccactgc acctggatta tagaggcaga cccaggaaag 4440
accattagcc tccatttcat tgttttcgac acggagatgg ctcacgacat cctcaaggtc 4500
tgggacgggc cggtggacag tgacatcctg ctgaaggagt ggagtggctc cgcccttccg 4560
gaggacatcc acagcacctt caactcactc accctgcagt tcgacagcga cttcttcatc 4620
agcaagtctg gcttctccat ccagttctcc acctcaattg cagccacctg taacgatcca 4680
ggtatgcccc aaaatggcac ccgctatgga gacagcagag aggctggaga caccgtcaca 4740
ttccagtgtg accctggcta tcagctccaa ggacaagcca aaatcacctg tgtgcagctg 4800
aataaccggt tcttttggca accagaccct cctacatgca tagctgcttg tggagggaat 4860
ctgacgggcc cagcaggtgt tattttgtca cccaactacc cacagccgta tcctcctggg 4920
aaggaatgtg actggagagt aaaagtgaac ccggactttg tcatcgcctt gatattcaaa 4980
agtttcaaca tggagcccag ctatgacttc ctacacatct atgaagggga agattccaac 5040
agccccctca ttgggagtta ccagggctct caggccccag aaagaataga gagtagcgga 5100
aacagcctgt ttctggcatt tcggagtgat gcctccgtgg gcctttcagg gttcgccatt 5160
gaatttaaag agaaaccacg ggaagcttgt tttgacccag gaaatataat gaatgggaca 5220
agagttggaa cagacttcaa gcttggctcc accatcacct accagtgtga ctctggctat 5280
aagattcttg acccctcatc catcacctgt gtgattgggg ctgatgggaa accctcctgg 5340
gaccaagtgc tgccctcctg caatgctccc tgtggaggcc agtacacggg atcagaaggg 5400
gtagttttat caccaaacta cccccataat tacacagctg gtcaaatatg cctctattcc 5460
atcacggtac caaaggaatt cgtggtcttt ggacagtttg cctatttcca gacagccctg 5520
aatgatttgg cagaattatt tgatggaacc catgcacagg ccagacttct cagctcactc 5580
tcggggtctc actcagggga aacattgccc ttggctacgt caaatcaaat tctgctccga 5640
ttcagtgcaa agagcggtgc ctctgcccgc ggcttccact tcgtgtatca agctgttcct 5700
cgtaccagtg acacccaatg cagctctgtc cccgagccca gatacggaag gagaattggt 5760
tctgagtttt ctgccggctc catcgtccga ttcgagtgca acccgggata cctgcttcag 5820
ggttccacgg cgctccactg ccagtccgtg cccaacgcct tggcacagtg gaacgacacg 5880
atccccagct gtgtggtacc ctgcagtggc aatttcactc aacgaagagg tacaatcctg 5940
tcccccggct accctgagcc atacggaaac aacttgaact gtatatggaa gatcatagtt 6000
acggagggct cgggaattca gatccaagtg atcagttttg ccacggagca gaactgggac 6060
tcccttgaga tccacgatgg tggggatgtg accgcaccca gactgggaag cttctcaggc 6120
accacagtac cggcactgct gaacagtact tccaaccaac tctacctgca tttccagtct 6180
gacattagtg tggcagctgc tggtttccac ctggaataca aaactgtagg tcttgctgca 6240
tgccaagaac cagccctccc cagcaacagc atcaaaatcg gagatcggta catggtgaac 6300
gacgtgctct ccttccagtg cgagcccggg tacaccctgc agggccgttc ccacatttcc 6360
tgtatgccag ggaccgttcg ccgttggaac tatccgtctc ccctgtgcat tgcaacctgt 6420
ggagggacgc tgagcacctt gggtggtgtg atcctgagcc ccggcttccc aggttcttac 6480
cccaacaact tagactgcac ctggaggatc tcattaccca tcggctatgg tgcacatatt 6540
cagtttctga atttttctac cgaagctaat catgacttcc ttgaaattca aaatggacct 6600
taccacacca gccccatgat tggacaattt agcggcacgg atctccccgc ggccctgctg 6660
agcacaacgc atgaaaccct catccacttt tatagtgacc attcgcaaaa ccggcaagga 6720
tttaaacttg cttaccaagc ctatgaatta cagaactgtc cagatccacc cccatttcag 6780
aatgggtaca tgatcaactc ggattacagc gtggggcaat cagtatcttt cgagtgttat 6840
cctgggtaca ttctaatagg ccatcctgtc ctcacttgtc agcatgggat caacagaaac 6900
tggaactacc cttttccaag atgtgatgcc ccttgtgggt acaacgtaac ttctcagaac 6960
ggcaccatct actcccctgg ctttcctgat gagtatccga tcctgaagga ctgcatttgg 7020
ctcatcacgg tgcctccagg gcacggagtt tacatcaact tcaccctgtt acagacggaa 7080
gctgtcaacg attacattgc tgtttgggac ggtcccgatc agaactcacc ccagctggga 7140
gttttcagtg gcaacacagc cctcgaaacg gcgtatagct ccaccaacca agtcctgctc 7200
aagttccaca gcgacttttc aaatggaggc ttctttgtcc tcaatttcca cgcatttcag 7260
ctcaagaaat gtcaacctcc cccagcggtt ccacaggcag aaatgcttac tgaggatgat 7320
gatttcgaaa taggagattt tgtgaagtac cagtgccacc ccgggtacac cttggtgggg 7380
accgacattc tgacttgcaa gctcagttcc cagttgcagt ttgagggttc tctcccaaca 7440
tgtgaagcac aatgcccagc aaatgaagtc cggactggat catcgggagt cattctcagt 7500
ccagggtatc cgggtaatta ttttaactcc cagacttgct cttggagtat taaagtggaa 7560
ccaaactaca acattaccat ctttgtggac acatttcaaa gtgaaaagca gtttgatgca 7620
ctggaagtgt ttgatggttc ttctgggcaa agtcctctgc tagtagtctt aagtgggaat 7680
catactgaac aatcaaattt tacaagcagg agtaatcagt tatatctccg ctggtccact 7740
gaccatgcca ccagtaagaa aggattcaag attcgctatg cagcacctta ctgcagtttg 7800
acccaccccc tgaagaatgg gggtattcta aacaggactg caggagcggt tggaagcaaa 7860
gtgcattatt tttgcaagcc tggataccga atggtcggcc acagcaatgc aacctgtaga 7920
cgaaacccac ttggcatgta ccagtgggac tccctcacgc cactctgcca ggctgtgtcc 7980
tgtggaatcc cagaatcccc aggaaacggt tcatttaccg ggaacgagtt cactttggac 8040
agtaaagtgg tctatgaatg tcatgagggc ttcaagcttg aatccagcca gcaagcaaca 8100
gccgtgtgtc aagaagatgg gttgtggagt aacaagggga agccgcccac gtgtaagccg 8160
gtcgcttgcc ccagcattga agctcagctc tcagaacatg tcatctggag gctggtttca 8220
ggatccttga atgagtacgg tgctcaagta ttgctgagct gcagtcctgg ttactactta 8280
gaaggctgga ggctcctgcg gtgccaggcc aatgggacgt ggaacatagg agatgagagg 8340
ccaagctgtc gagttatctc gtgtggaagc ctttcctttc ccccaaatgg caacaagatt 8400
ggaacgttga cagtttatgg ggccacagct atatttacgt gcaacaccgg ctacacgctt 8460
gtggggtctc atgtcagaga gtgcttggca aatgggctct ggagcggcag cgaaactcga 8520
tgtctggctg gccactgcgg ttccccagac ccgattgtga acggtcacat tagtggagat 8580
ggcttcagtt acagagacac ggtggtttac cagtgcaatc ctggtttccg gcttgtggga 8640
acttccgtga ggatatgcct gcaagaccac aagtggtctg gacaaacgcc tgtctgtgtc 8700
cccatcacat gtggtcaccc tggaaaccct gcccacggat tcactaatgg cagtgagttc 8760
aacctgaatg atgtcgtgaa tttcacctgc aacacgggct atttgctgca gggcgtgtct 8820
cgagcccagt gtcggagcaa cggccagtgg agtagccctc tgcccacgtg tcgagtggtg 8880
aactgttctg atccaggctt tgtggaaaat gccattcgtc acgggcaaca gaacttccct 8940
gagagttttg agtatggaat gagtatcctg taccattgca agaagggatt ttacttgctg 9000
ggatcttcag ccttgacctg tatggcaaat ggcttatggg accgatccct gcccaagtgt 9060
ttggctatat cgtgtggaca cccaggggtc cctgccaacg ccgtcctcac tggagagctg 9120
tttacctatg gcgccgtcgt gcactactcc tgcagaggga gcgagagcct cataggcaac 9180
gacacgagag tgtgccagga agacagtcac tggagcgggg cactgcccca ctgcacagga 9240
aataatcctg gattctgtgg tgatccgggg accccagcac atgggtctcg gcttggtgat 9300
gactttaaga caaagagtct tctccgcttc tcctgtgaaa tggggcacca gctgaggggc 9360
tcccctgaac gcacgtgttt gctcaatggg tcatggtcag gactgcagcc ggtgtgtgag 9420
gccgtgtcct gtggcaaccc tggcacaccc accaacggaa tgattgtcag tagtgatggc 9480
attctgttct ccagctcggt catctatgcc tgctgggaag gctacaagac ctcagggctc 9540
atgacacggc attgcacagc caatgggacc tggacaggca ctgctcccga ctgcacaatt 9600
ataagttgtg gggatccagg cacactagca aatggcatcc agtttgggac cgacttcacc 9660
ttcaacaaga ctgtgagcta tcagtgtaac ccaggctatg tcatggaagc agtcacatcc 9720
gccactattc gctgtaccaa agacggcagg tggaatccga gcaaacctgt ctgcaaagcc 9780
gtgctgtgtc ctcagccgcc gccggtgcag aatggaacag tggagggaag tgatttccgc 9840
tggggctcca gcataagtta cagctgcatg gacggttacc agctctctca ctccgccatc 9900
ctctcctgtg aaggtcgcgg ggtgtggaaa ggagagatcc cccagtgtct ccctgtgttc 9960
tgcggagacc ctggcatccc cgcagaaggg cgacttagtg ggaaaagttt cacctataag 10020
tccgaagtct tcttccagtg caaatctcca tttatactcg tgggatcctc cagaagagtc 10080
tgccaagctg acggcacgtg gagcggcata caacccacct gcattgatcc tgctcataac 10140
acctgcccag accctggtac gccacacttt ggaatacaga atagctccag aggctatgag 10200
gttggaagca cggttttttt caggtgcaga aaaggctacc atattcaagg ttccacgact 10260
cgcacctgcc ttgccaattt aacatggagt gggatacaga ccgaatgtat acctcatgcc 10320
tgcagacagc cagaaacccc ggcacacgcg gatgtgagag ccatcgatct tcctactttc 10380
ggctacacct tagtgtacac ctgccatcca ggctttttcc tcgcaggggg atctgagcac 10440
agaacatgta aagcagacat gaaatggaca ggaaagtcgc ctgtgtgtaa aagtaaagga 10500
gtgagagaag ttaatgaaac agttactaaa actccagttc cttcagatgt ctttttcgtc 10560
aattcactgt ggaaggggta ttatgaatat ttagggaaaa gacaacccgc cactctaact 10620
gttgactggt tcaatgcaac aagcagtaag gtgaatgcca ccttcagcga agcctcgcca 10680
gtggagctga agttgacagg catttacaag aaggaggagg cccacttact cctgaaagct 10740
tttcaaatta aaggccaggc agatattttt gtaagcaagt tcgaaaatga caactgggga 10800
ctagatggtt atgtgtcatc tggacttgaa agaggaggat ttacttttca aggtgacatt 10860
catggaaaag actttggaaa atttaagcta gaaaggcaag atcctttaaa cccagatcaa 10920
gactcttcca gtcattacca cggcaccagc agtggctctg tggcggctgc cattctggtt 10980
cctttctttg ctctaatttt atcagggttt gcattttacc tctacaaaca cagaacgaga 11040
ccaaaagttc aatacaatgg ctatgctggg catgaaaaca gcaatggaca agcatcgttt 11100
gaaaacccca tgtatgatac aaacttaaaa cccacagaag ccaaggctgt gaggtttgac 11160
acaactctga acacagtctg tacagtggta tagccctcag tgccccaaca ggactgattc 11220
atagccatac ctctgatgga caagcagtga ttcctttggt gccatatacc actctccctt 11280
ccactctggc tttactgcag cgatcttcaa ccttgtctac tggcataagt gcagcgggga 11340
tctctactca aatgtgtcag ggtcttctac ggatcaaact acacatgcgt tttcattcca 11400
aaagtgggtt ctaaatgcct ggctgcatct gtatgaaatc aaggcacact ccaggaagac 11460
tgccacgtcg cgccaacacg tcatactcaa tgcctcagac tttcatattt ctgtgttgct 11520
gagatgcctt tcaatgcaat cgtctgggct cgtggatatg tccctcaggt gcggtgacag 11580
aatggtggca ccacgatatg tgttctcttg tgttgttttt cctttttaaa cccccatgaa 11640
cacgaatact ctgaaaaaaa taaaaagctt tctggaagaa gacacctttc tgatagaggc 11700
tcacacctac aaatgcttca ctctgtcctt ccgagacctg acaagctttg aggacctcac 11760
agctcccctg tgtgttcatc tctagggatg tttgcaattt cccagtcagc tgttctgtcg 11820
cagaatgttt aatgcacaat tttttgcact agtgtgttat gaatgactaa gattctgata 11880
aaaaaaataa attatttaca cagggtttat acacactatc cattgtatat aagcattatt 11940
tcatattatc aagctaaaca ttcccccatc agcttagttg gagtgttagg gaaaagtatt 12000
cctagatatg gcacagattt taaaaggaaa tacagtattg aagagattta ttttattatt 12060
gcttcaatta gctccattta cgtgttgaat tcattgaaga ggtccaatga gaaaaaaaca 12120
gaagcctcct tatttcacac gttttcctcc tttagtacca tcctcatcca attactgtct 12180
ctctgatact acttaatagc agggggtttg cagaaatttc tgtttgccat gtaaaactgt 12240
gaatagtaat ttattttaga tagtcgatga acttgtgggt tttagctcac aatgcagcct 12300
tcccttttgc agtgtttttt ttttgttttt tttttttttt gtcttttact gtgccatcga 12360
tctttgatat tgcattgaaa gacaatatac cacagtagca ccttgaactc agtgaaaatt 12420
gttcaggatc aaaataccaa gtgttctttt agagggaagg aaaaagtaca cacactctcc 12480
tctcacaatg atatatttta tacattcatt tgttatttgt ttcatgcttt atgattccag 12540
atggaaaggt aatttcagtg acttttcaag tttaaattcc attataggta aatgataagt 12600
tatgatgcaa ataaaatcta taagatcccc agggcaaata aaaatcaaaa catgaagtag 12660
aagatgtggc cgtgaggtag tttatgtaac aaattcaaag tgaaaatcat gtttactttt 12720
acttatactt atttgataaa aatatttttg aaacgatagt acttatttta ttatttgata 12780
tttcagttcc tattcaattg tggcagattt tctctgtttc acattttaga ttggcgttgg 12840
taatagaaat gtcagaatgt tcaaattggc cttcacgttg tcggagtgaa cacattgaca 12900
cctagcttta agactgattt atctgttggt gtactgaagg tttccatgta ggacttcaaa 12960
tgtggaaaag gaaaagcagt caggaaaatg gggcattctt tggagagtca cgcgttttga 13020
ttcggacatt tccgtagagc tcggctccca gtgttgtgtt cctcggtcga aagggtctct 13080
gctgtttggg gactcactgg cctctcctag ggactccttt gtcttgtgaa ccccacgctg 13140
ttggattctg tatcattatg ctgaattctc tgcacagttt tccctggcca acctgcccac 13200
atccttggag atttgctttg ccagtgggaa tccttacatt gctgtttcac agtagacggg 13260
acgaggtcag cgggagtcgt gctcctaaca cacacattga acgaaacaga agatgattga 13320
aagtgtgagg aggctcgtgt gcaagggaga acagggttac tatacatatt agtgtatata 13380
tatacataca tatatatata tatatattgt acatatctaa gtttgagtca ttcaaactag 13440
gtgcaaaatg ctgacttcag agtctgaatt aacatctctg ttcccatatc cctgacctgc 13500
tccctggtca acgatgctat gaaatcctga aatgacagga catacataca tacaagaaac 13560
cacatatcaa attagatatg attttccttt gtgtgcaaag tcaaactgtc ctagggttgc 13620
cagtttgaag catgttattt aaatgaaaaa aaaaatcagt gaaattctcg tgtgagaatt 13680
ctgcctagtt tcttcctaag gttgtgtgca gtgttgaacg gcgtctccgc aaggtgttgg 13740
aggatctcat tttagggcag tcaggagctg tgcttgctga gttaggtcta gaagactctt 13800
ccctgaaggc aacgggaaca cgcgtgaggg acgcgaccac acactaacag aggacacgtg 13860
cttcagagct gtttaaaact gctgcttgtt ttacacacac atcttgcctt ttttcaggct 13920
agctgcaata atttttttct tctgtaaaat attttgtaaa caacaacaaa aagctattat 13980
aaaaaggggg taaaaaaaag aacgctggca ttatgatcag gaaaacccat tgtcatcgcc 14040
gaccctccct cccgtcccac cacacgctgc tgtcacgacg taggtgcgaa agaccttttt 14100
gtacagagat atatttttta tgaagaattt gtaaaattat taaatatgct gtaatttttt 14160
gattaatgta ggtacattgt taaaaaataa atgtttttac aatacagaac tgtaattttc 14220
ccaataatgt aaaatgtacc atctctagct gattttcagt tccaatccta ttacacatgt 14280
attaatatta aagtggcctg ttaaaatgaa cagtatcttt tttttgtcaa aaaaattata 14340
aagagggtgt aatatagcct gtgcaatgcc accaatcttt aaagcaaatc agagttctaa 14400
ttaaatattt aatttta 14417
<![CDATA[<210> 283]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 283]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys
20 25
<![CDATA[<210> 284]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 284]]>
Gln Asn Cys Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser
1 5 10 15
Pro Gly Phe Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile
20 25 30
Ile Ile Thr Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe
35 40 45
Ala Leu Glu Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro
50 55 60
Gln Gln Gly Asn Leu Lys Val
65 70
<![CDATA[<210> 285]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 285]]>
Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val Ser Thr Gly Ser Ile
1 5 10 15
Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val Ser Ala Gln Gly Phe
20 25 30
Lys Ala Leu Tyr Glu
35
<![CDATA[<210> 286]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 286]]>
Leu Pro Ser His Thr Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val
1 5 10 15
Leu His Gly Thr Arg Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys
20 25 30
Leu Pro Gly Tyr Ile Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val
35 40 45
Ser Pro Gly Asn Gly Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg
50 55 60
<![CDATA[<210> 287]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 287]]>
Glu Gly Ala Cys Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ile Ser
1 5 10 15
Ser Pro His Phe Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp
20 25 30
Thr Ile Leu Ala Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp
35 40 45
Phe Gln Leu Glu Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu
50 55 60
Ala Pro Ser Ile
65
<![CDATA[<210> 288]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 288]]>
Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn Trp
1 5 10 15
Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly Phe
20 25 30
Asn Ala Gln Phe Gln
35
<![CDATA[<210> 289]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 289]]>
Lys Lys Ala Ile Glu Leu Lys Ser Arg Gly Val Lys Met Leu Pro Ser
1 5 10 15
Lys Asp Gly Ser His Lys Asn Ser Val
20 25
<![CDATA[<210> 290]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 290]]>
Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro Gly
1 5 10 15
Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe
20 25
<![CDATA[<210> 291]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 291]]>
Val Gly Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln
1 5 10 15
Gly Ser Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala
20 25 30
Trp Ser Asp His Arg Pro Ile Cys Arg
35 40
<![CDATA[<210> 292]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 292]]>
Arg Thr Cys Gly Ser Asn Leu Arg Gly Pro Ser Gly Val Ile Thr Ser
1 5 10 15
Pro Asn Tyr Pro Val Gln Tyr Glu Asp Asn Ala His Cys Val Trp Val
20 25 30
Ile Thr Thr Thr Asp Pro Asp Lys
35 40
<![CDATA[<210> 293]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 293]]>
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
1 5 10 15
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
20 25 30
Leu Tyr
<![CDATA[<210> 294]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 294]]>
Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met Ser Asn Gln
1 5 10 15
Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser Pro Gly Phe
20 25 30
Lys Ala Val Tyr Gln
35
<![CDATA[<210> 295]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 295]]>
Ile Glu Lys Gly Gly Cys Gly Asp Pro Gly Ile Pro Ala Tyr Gly Lys
1 5 10 15
Arg Thr Gly Ser Ser Phe Leu His Gly Asp Thr Leu Thr Phe Glu Cys
20 25 30
Pro Ala Ala Phe Glu Leu Val Gly Glu Arg Val Ile Thr Cys Gln Gln
35 40 45
Asn Asn Gln Trp Ser Gly Asn Lys Pro Ser Cys Val
50 55 60
<![CDATA[<210> 296]]>
<![CDATA[<211> 108]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 296]]>
Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser Gly Ile Ile Leu Ser Pro
1 5 10 15
Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met Asn Cys Val Trp Leu Ile
20 25 30
Ile Ser Glu Pro Gly Ser Arg Ile His Leu Ile Phe Asn Asp Phe Asp
35 40 45
Val Glu Pro Gln Phe Asp Phe Leu Ala Val Lys Asp Asp Gly Ile Ser
50 55 60
Asp Ile Thr Val Leu Gly Thr Phe Ser Gly Asn Glu Val Pro Ser Gln
65 70 75 80
Leu Ala Ser Ser Gly His Ile Val Arg Leu Glu Phe Gln Ser Asp His
85 90 95
Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr Tyr Thr
100 105
<![CDATA[<210> 297]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 297]]>
Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile Asn Gly Arg
1 5 10 15
Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser Phe His Cys
20 25 30
Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile Thr Cys Ile
35 40 45
Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro Arg Cys Glu
50 55 60
<![CDATA[<210> 298]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 298]]>
Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val Ile Leu Pro Pro
1 5 10 15
Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys Glu Trp Ile Ile
20 25 30
Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe Asp
35 40 45
<![CDATA[<210> 299]]>
<![CDATA[<211> 62]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 299]]>
Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp Gly Pro
1 5 10 15
Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln Ala Pro
20 25 30
Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe Thr Thr
35 40 45
Asp Asn Ser Arg Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
50 55 60
<![CDATA[<210> 300]]>
<![CDATA[<211> 62]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 300]]>
Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val Asn
1 5 10 15
Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr Phe
20 25 30
Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val Cys
35 40 45
Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp
50 55 60
<![CDATA[<210> 301]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 301]]>
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
1 5 10 15
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
20 25 30
Glu Val Ser His Gly Lys
35
<![CDATA[<210> 302]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 302]]>
Val Gln Met Ile Phe His Thr Phe His Leu Glu Ser Ser His Asp Tyr
1 5 10 15
Leu Leu Ile Thr Glu Asp Gly Ser Phe Ser Glu Pro Val Ala Arg Leu
20 25 30
Thr Gly Ser Val Leu Pro His Thr Ile Lys Ala Gly Leu Phe Gly Asn
35 40 45
Phe Thr Ala Gln Leu Arg Phe Ile Ser Asp Phe Ser Ile Ser Tyr Glu
50 55 60
Gly Phe Asn Ile Thr Phe Ser
65 70
<![CDATA[<210> 303]]>
<![CDATA[<211> 62]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 303]]>
Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg
1 5 10 15
Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys
20 25 30
Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu Thr Cys Leu Gly
35 40 45
Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro Arg Cys Val
50 55 60
<![CDATA[<210> 304]]>
<![CDATA[<211> 56]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 304]]>
Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro
1 5 10 15
Asn Phe Pro Ser Asn Tyr Asp Asn Asn His Glu Cys Ile Tyr Lys Ile
20 25 30
Glu Thr Glu Ala Gly Lys Gly Ile His Leu Arg Thr Arg Ser Phe Gln
35 40 45
Leu Phe Glu Gly Asp Thr Leu Lys
50 55
<![CDATA[<210> 305]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 305]]>
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe Thr
1 5 10 15
Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn His Leu
20 25 30
Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln Gly Phe Gln
35 40 45
Leu Thr Tyr Thr
50
<![CDATA[<210> 306]]>
<![CDATA[<211> 63]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 306]]>
Phe Asp Leu Val Lys Cys Glu Asp Pro Gly Ile Pro Asn Tyr Gly Tyr
1 5 10 15
Arg Ile Arg Asp Glu Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser
20 25 30
Cys Asn Pro Gly Tyr Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu
35 40 45
Ser Gly Asp Arg Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile
50 55 60
<![CDATA[<210> 307]]>
<![CDATA[<211> 2353]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 307]]>
Phe Asp Leu Val Lys Cys Glu Asp Pro Gly Ile Pro Asn Tyr Gly Tyr
1 5 10 15
Arg Ile Arg Asp Glu Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser
20 25 30
Cys Asn Pro Gly Tyr Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu
35 40 45
Ser Gly Asp Arg Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile Ala
50 55 60
Glu Cys Gly Gly Gln Ile His Ala Ala Thr Ser Gly Arg Ile Leu Ser
65 70 75 80
Pro Gly Tyr Pro Ala Pro Tyr Asp Asn Asn Leu His Cys Thr Trp Ile
85 90 95
Ile Glu Ala Asp Pro Gly Lys Thr Ile Ser Leu His Phe Ile Val Phe
100 105 110
Asp Thr Glu Met Ala His Asp Ile Leu Lys Val Trp Asp Gly Pro Val
115 120 125
Asp Ser Asp Ile Leu Leu Lys Glu Trp Ser Gly Ser Ala Leu Pro Glu
130 135 140
Asp Ile His Ser Thr Phe Asn Ser Leu Thr Leu Gln Phe Asp Ser Asp
145 150 155 160
Phe Phe Ile Ser Lys Ser Gly Phe Ser Ile Gln Phe Ser Thr Ser Ile
165 170 175
Ala Ala Thr Cys Asn Asp Pro Gly Met Pro Gln Asn Gly Thr Arg Tyr
180 185 190
Gly Asp Ser Arg Glu Ala Gly Asp Thr Val Thr Phe Gln Cys Asp Pro
195 200 205
Gly Tyr Gln Leu Gln Gly Gln Ala Lys Ile Thr Cys Val Gln Leu Asn
210 215 220
Asn Arg Phe Phe Trp Gln Pro Asp Pro Pro Thr Cys Ile Ala Ala Cys
225 230 235 240
Gly Gly Asn Leu Thr Gly Pro Ala Gly Val Ile Leu Ser Pro Asn Tyr
245 250 255
Pro Gln Pro Tyr Pro Pro Gly Lys Glu Cys Asp Trp Arg Val Lys Val
260 265 270
Asn Pro Asp Phe Val Ile Ala Leu Ile Phe Lys Ser Phe Asn Met Glu
275 280 285
Pro Ser Tyr Asp Phe Leu His Ile Tyr Glu Gly Glu Asp Ser Asn Ser
290 295 300
Pro Leu Ile Gly Ser Tyr Gln Gly Ser Gln Ala Pro Glu Arg Ile Glu
305 310 315 320
Ser Ser Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val
325 330 335
Gly Leu Ser Gly Phe Ala Ile Glu Phe Lys Glu Lys Pro Arg Glu Ala
340 345 350
Cys Phe Asp Pro Gly Asn Ile Met Asn Gly Thr Arg Val Gly Thr Asp
355 360 365
Phe Lys Leu Gly Ser Thr Ile Thr Tyr Gln Cys Asp Ser Gly Tyr Lys
370 375 380
Ile Leu Asp Pro Ser Ser Ile Thr Cys Val Ile Gly Ala Asp Gly Lys
385 390 395 400
Pro Ser Trp Asp Gln Val Leu Pro Ser Cys Asn Ala Pro Cys Gly Gly
405 410 415
Gln Tyr Thr Gly Ser Glu Gly Val Val Leu Ser Pro Asn Tyr Pro His
420 425 430
Asn Tyr Thr Ala Gly Gln Ile Cys Leu Tyr Ser Ile Thr Val Pro Lys
435 440 445
Glu Phe Val Val Phe Gly Gln Phe Ala Tyr Phe Gln Thr Ala Leu Asn
450 455 460
Asp Leu Ala Glu Leu Phe Asp Gly Thr His Ala Gln Ala Arg Leu Leu
465 470 475 480
Ser Ser Leu Ser Gly Ser His Ser Gly Glu Thr Leu Pro Leu Ala Thr
485 490 495
Ser Asn Gln Ile Leu Leu Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala
500 505 510
Arg Gly Phe His Phe Val Tyr Gln Ala Val Pro Arg Thr Ser Asp Thr
515 520 525
Gln Cys Ser Ser Val Pro Glu Pro Arg Tyr Gly Arg Arg Ile Gly Ser
530 535 540
Glu Phe Ser Ala Gly Ser Ile Val Arg Phe Glu Cys Asn Pro Gly Tyr
545 550 555 560
Leu Leu Gln Gly Ser Thr Ala Leu His Cys Gln Ser Val Pro Asn Ala
565 570 575
Leu Ala Gln Trp Asn Asp Thr Ile Pro Ser Cys Val Val Pro Cys Ser
580 585 590
Gly Asn Phe Thr Gln Arg Arg Gly Thr Ile Leu Ser Pro Gly Tyr Pro
595 600 605
Glu Pro Tyr Gly Asn Asn Leu Asn Cys Ile Trp Lys Ile Ile Val Thr
610 615 620
Glu Gly Ser Gly Ile Gln Ile Gln Val Ile Ser Phe Ala Thr Glu Gln
625 630 635 640
Asn Trp Asp Ser Leu Glu Ile His Asp Gly Gly Asp Val Thr Ala Pro
645 650 655
Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro Ala Leu Leu Asn Ser
660 665 670
Thr Ser Asn Gln Leu Tyr Leu His Phe Gln Ser Asp Ile Ser Val Ala
675 680 685
Ala Ala Gly Phe His Leu Glu Tyr Lys Thr Val Gly Leu Ala Ala Cys
690 695 700
Gln Glu Pro Ala Leu Pro Ser Asn Ser Ile Lys Ile Gly Asp Arg Tyr
705 710 715 720
Met Val Asn Asp Val Leu Ser Phe Gln Cys Glu Pro Gly Tyr Thr Leu
725 730 735
Gln Gly Arg Ser His Ile Ser Cys Met Pro Gly Thr Val Arg Arg Trp
740 745 750
Asn Tyr Pro Ser Pro Leu Cys Ile Ala Thr Cys Gly Gly Thr Leu Ser
755 760 765
Thr Leu Gly Gly Val Ile Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro
770 775 780
Asn Asn Leu Asp Cys Thr Trp Arg Ile Ser Leu Pro Ile Gly Tyr Gly
785 790 795 800
Ala His Ile Gln Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe
805 810 815
Leu Glu Ile Gln Asn Gly Pro Tyr His Thr Ser Pro Met Ile Gly Gln
820 825 830
Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr His Glu
835 840 845
Thr Leu Ile His Phe Tyr Ser Asp His Ser Gln Asn Arg Gln Gly Phe
850 855 860
Lys Leu Ala Tyr Gln Ala Tyr Glu Leu Gln Asn Cys Pro Asp Pro Pro
865 870 875 880
Pro Phe Gln Asn Gly Tyr Met Ile Asn Ser Asp Tyr Ser Val Gly Gln
885 890 895
Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr Ile Leu Ile Gly His Pro
900 905 910
Val Leu Thr Cys Gln His Gly Ile Asn Arg Asn Trp Asn Tyr Pro Phe
915 920 925
Pro Arg Cys Asp Ala Pro Cys Gly Tyr Asn Val Thr Ser Gln Asn Gly
930 935 940
Thr Ile Tyr Ser Pro Gly Phe Pro Asp Glu Tyr Pro Ile Leu Lys Asp
945 950 955 960
Cys Ile Trp Leu Ile Thr Val Pro Pro Gly His Gly Val Tyr Ile Asn
965 970 975
Phe Thr Leu Leu Gln Thr Glu Ala Val Asn Asp Tyr Ile Ala Val Trp
980 985 990
Asp Gly Pro Asp Gln Asn Ser Pro Gln Leu Gly Val Phe Ser Gly Asn
995 1000 1005
Thr Ala Leu Glu Thr Ala Tyr Ser Ser Thr Asn Gln Val Leu Leu
1010 1015 1020
Lys Phe His Ser Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn
1025 1030 1035
Phe His Ala Phe Gln Leu Lys Lys Cys Gln Pro Pro Pro Ala Val
1040 1045 1050
Pro Gln Ala Glu Met Leu Thr Glu Asp Asp Asp Phe Glu Ile Gly
1055 1060 1065
Asp Phe Val Lys Tyr Gln Cys His Pro Gly Tyr Thr Leu Val Gly
1070 1075 1080
Thr Asp Ile Leu Thr Cys Lys Leu Ser Ser Gln Leu Gln Phe Glu
1085 1090 1095
Gly Ser Leu Pro Thr Cys Glu Ala Gln Cys Pro Ala Asn Glu Val
1100 1105 1110
Arg Thr Gly Ser Ser Gly Val Ile Leu Ser Pro Gly Tyr Pro Gly
1115 1120 1125
Asn Tyr Phe Asn Ser Gln Thr Cys Ser Trp Ser Ile Lys Val Glu
1130 1135 1140
Pro Asn Tyr Asn Ile Thr Ile Phe Val Asp Thr Phe Gln Ser Glu
1145 1150 1155
Lys Gln Phe Asp Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gln
1160 1165 1170
Ser Pro Leu Leu Val Val Leu Ser Gly Asn His Thr Glu Gln Ser
1175 1180 1185
Asn Phe Thr Ser Arg Ser Asn Gln Leu Tyr Leu Arg Trp Ser Thr
1190 1195 1200
Asp His Ala Thr Ser Lys Lys Gly Phe Lys Ile Arg Tyr Ala Ala
1205 1210 1215
Pro Tyr Cys Ser Leu Thr His Pro Leu Lys Asn Gly Gly Ile Leu
1220 1225 1230
Asn Arg Thr Ala Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys
1235 1240 1245
Lys Pro Gly Tyr Arg Met Val Gly His Ser Asn Ala Thr Cys Arg
1250 1255 1260
Arg Asn Pro Leu Gly Met Tyr Gln Trp Asp Ser Leu Thr Pro Leu
1265 1270 1275
Cys Gln Ala Val Ser Cys Gly Ile Pro Glu Ser Pro Gly Asn Gly
1280 1285 1290
Ser Phe Thr Gly Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr
1295 1300 1305
Glu Cys His Glu Gly Phe Lys Leu Glu Ser Ser Gln Gln Ala Thr
1310 1315 1320
Ala Val Cys Gln Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro
1325 1330 1335
Pro Thr Cys Lys Pro Val Ala Cys Pro Ser Ile Glu Ala Gln Leu
1340 1345 1350
Ser Glu His Val Ile Trp Arg Leu Val Ser Gly Ser Leu Asn Glu
1355 1360 1365
Tyr Gly Ala Gln Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu
1370 1375 1380
Glu Gly Trp Arg Leu Leu Arg Cys Gln Ala Asn Gly Thr Trp Asn
1385 1390 1395
Ile Gly Asp Glu Arg Pro Ser Cys Arg Val Ile Ser Cys Gly Ser
1400 1405 1410
Leu Ser Phe Pro Pro Asn Gly Asn Lys Ile Gly Thr Leu Thr Val
1415 1420 1425
Tyr Gly Ala Thr Ala Ile Phe Thr Cys Asn Thr Gly Tyr Thr Leu
1430 1435 1440
Val Gly Ser His Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser
1445 1450 1455
Gly Ser Glu Thr Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp
1460 1465 1470
Pro Ile Val Asn Gly His Ile Ser Gly Asp Gly Phe Ser Tyr Arg
1475 1480 1485
Asp Thr Val Val Tyr Gln Cys Asn Pro Gly Phe Arg Leu Val Gly
1490 1495 1500
Thr Ser Val Arg Ile Cys Leu Gln Asp His Lys Trp Ser Gly Gln
1505 1510 1515
Thr Pro Val Cys Val Pro Ile Thr Cys Gly His Pro Gly Asn Pro
1520 1525 1530
Ala His Gly Phe Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val
1535 1540 1545
Val Asn Phe Thr Cys Asn Thr Gly Tyr Leu Leu Gln Gly Val Ser
1550 1555 1560
Arg Ala Gln Cys Arg Ser Asn Gly Gln Trp Ser Ser Pro Leu Pro
1565 1570 1575
Thr Cys Arg Val Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn
1580 1585 1590
Ala Ile Arg His Gly Gln Gln Asn Phe Pro Glu Ser Phe Glu Tyr
1595 1600 1605
Gly Met Ser Ile Leu Tyr His Cys Lys Lys Gly Phe Tyr Leu Leu
1610 1615 1620
Gly Ser Ser Ala Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg
1625 1630 1635
Ser Leu Pro Lys Cys Leu Ala Ile Ser Cys Gly His Pro Gly Val
1640 1645 1650
Pro Ala Asn Ala Val Leu Thr Gly Glu Leu Phe Thr Tyr Gly Ala
1655 1660 1665
Val Val His Tyr Ser Cys Arg Gly Ser Glu Ser Leu Ile Gly Asn
1670 1675 1680
Asp Thr Arg Val Cys Gln Glu Asp Ser His Trp Ser Gly Ala Leu
1685 1690 1695
Pro His Cys Thr Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly
1700 1705 1710
Thr Pro Ala His Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys
1715 1720 1725
Ser Leu Leu Arg Phe Ser Cys Glu Met Gly His Gln Leu Arg Gly
1730 1735 1740
Ser Pro Glu Arg Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu
1745 1750 1755
Gln Pro Val Cys Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro
1760 1765 1770
Thr Asn Gly Met Ile Val Ser Ser Asp Gly Ile Leu Phe Ser Ser
1775 1780 1785
Ser Val Ile Tyr Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu
1790 1795 1800
Met Thr Arg His Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala
1805 1810 1815
Pro Asp Cys Thr Ile Ile Ser Cys Gly Asp Pro Gly Thr Leu Ala
1820 1825 1830
Asn Gly Ile Gln Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val
1835 1840 1845
Ser Tyr Gln Cys Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser
1850 1855 1860
Ala Thr Ile Arg Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys
1865 1870 1875
Pro Val Cys Lys Ala Val Leu Cys Pro Gln Pro Pro Pro Val Gln
1880 1885 1890
Asn Gly Thr Val Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser Ile
1895 1900 1905
Ser Tyr Ser Cys Met Asp Gly Tyr Gln Leu Ser His Ser Ala Ile
1910 1915 1920
Leu Ser Cys Glu Gly Arg Gly Val Trp Lys Gly Glu Ile Pro Gln
1925 1930 1935
Cys Leu Pro Val Phe Cys Gly Asp Pro Gly Ile Pro Ala Glu Gly
1940 1945 1950
Arg Leu Ser Gly Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe
1955 1960 1965
Gln Cys Lys Ser Pro Phe Ile Leu Val Gly Ser Ser Arg Arg Val
1970 1975 1980
Cys Gln Ala Asp Gly Thr Trp Ser Gly Ile Gln Pro Thr Cys Ile
1985 1990 1995
Asp Pro Ala His Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe
2000 2005 2010
Gly Ile Gln Asn Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val
2015 2020 2025
Phe Phe Arg Cys Arg Lys Gly Tyr His Ile Gln Gly Ser Thr Thr
2030 2035 2040
Arg Thr Cys Leu Ala Asn Leu Thr Trp Ser Gly Ile Gln Thr Glu
2045 2050 2055
Cys Ile Pro His Ala Cys Arg Gln Pro Glu Thr Pro Ala His Ala
2060 2065 2070
Asp Val Arg Ala Ile Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val
2075 2080 2085
Tyr Thr Cys His Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His
2090 2095 2100
Arg Thr Cys Lys Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val
2105 2110 2115
Cys Lys Ser Lys Gly Val Arg Glu Val Asn Glu Thr Val Thr Lys
2120 2125 2130
Thr Pro Val Pro Ser Asp Val Phe Phe Val Asn Ser Leu Trp Lys
2135 2140 2145
Gly Tyr Tyr Glu Tyr Leu Gly Lys Arg Gln Pro Ala Thr Leu Thr
2150 2155 2160
Val Asp Trp Phe Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe
2165 2170 2175
Ser Glu Ala Ser Pro Val Glu Leu Lys Leu Thr Gly Ile Tyr Lys
2180 2185 2190
Lys Glu Glu Ala His Leu Leu Leu Lys Ala Phe Gln Ile Lys Gly
2195 2200 2205
Gln Ala Asp Ile Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly
2210 2215 2220
Leu Asp Gly Tyr Val Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr
2225 2230 2235
Phe Gln Gly Asp Ile His Gly Lys Asp Phe Gly Lys Phe Lys Leu
2240 2245 2250
Glu Arg Gln Asp Pro Leu Asn Pro Asp Gln Asp Ser Ser Ser His
2255 2260 2265
Tyr His Gly Thr Ser Ser Gly Ser Val Ala Ala Ala Ile Leu Val
2270 2275 2280
Pro Phe Phe Ala Leu Ile Leu Ser Gly Phe Ala Phe Tyr Leu Tyr
2285 2290 2295
Lys His Arg Thr Arg Pro Lys Val Gln Tyr Asn Gly Tyr Ala Gly
2300 2305 2310
His Glu Asn Ser Asn Gly Gln Ala Ser Phe Glu Asn Pro Met Tyr
2315 2320 2325
Asp Thr Asn Leu Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp
2330 2335 2340
Thr Thr Leu Asn Thr Val Cys Thr Val Val
2345 2350
<![CDATA[<210> 308]]>
<![CDATA[<211> 3564]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 308]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys Gly Gln Asn Cys
20 25 30
Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser Pro Gly Phe
35 40 45
Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile Ile Ile Thr
50 55 60
Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe Ala Leu Glu
65 70 75 80
Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro Gln Gln Gly
85 90 95
Asn Leu Lys Val Arg Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val
100 105 110
Ser Thr Gly Ser Ile Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val
115 120 125
Ser Ala Gln Gly Phe Lys Ala Leu Tyr Glu Val Leu Pro Ser His Thr
130 135 140
Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val Leu His Gly Thr Arg
145 150 155 160
Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys Leu Pro Gly Tyr Ile
165 170 175
Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val Ser Pro Gly Asn Gly
180 185 190
Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg Ala Glu Gly Ala Cys
195 200 205
Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ile Ser Ser Pro His Phe
210 215 220
Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp Thr Ile Leu Ala
225 230 235 240
Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp Phe Gln Leu Glu
245 250 255
Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu Ala Pro Ser Ile
260 265 270
Trp Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn
275 280 285
Trp Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly
290 295 300
Phe Asn Ala Gln Phe Gln Val Lys Lys Ala Ile Glu Leu Lys Ser Arg
305 310 315 320
Gly Val Lys Met Leu Pro Ser Lys Asp Gly Ser His Lys Asn Ser Val
325 330 335
Leu Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro
340 345 350
Gly Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe Arg Val Gly
355 360 365
Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln Gly Ser
370 375 380
Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala Trp Ser
385 390 395 400
Asp His Arg Pro Ile Cys Arg Ala Arg Thr Cys Gly Ser Asn Leu Arg
405 410 415
Gly Pro Ser Gly Val Ile Thr Ser Pro Asn Tyr Pro Val Gln Tyr Glu
420 425 430
Asp Asn Ala His Cys Val Trp Val Ile Thr Thr Thr Asp Pro Asp Lys
435 440 445
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
450 455 460
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
465 470 475 480
Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met
485 490 495
Ser Asn Gln Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser
500 505 510
Pro Gly Phe Lys Ala Val Tyr Gln Glu Ile Glu Lys Gly Gly Cys Gly
515 520 525
Asp Pro Gly Ile Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu
530 535 540
His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val
545 550 555 560
Gly Glu Arg Val Ile Thr Cys Gln Gln Asn Asn Gln Trp Ser Gly Asn
565 570 575
Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser
580 585 590
Gly Ile Ile Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met
595 600 605
Asn Cys Val Trp Leu Ile Ile Ser Glu Pro Gly Ser Arg Ile His Leu
610 615 620
Ile Phe Asn Asp Phe Asp Val Glu Pro Gln Phe Asp Phe Leu Ala Val
625 630 635 640
Lys Asp Asp Gly Ile Ser Asp Ile Thr Val Leu Gly Thr Phe Ser Gly
645 650 655
Asn Glu Val Pro Ser Gln Leu Ala Ser Ser Gly His Ile Val Arg Leu
660 665 670
Glu Phe Gln Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr
675 680 685
Tyr Thr Thr Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile
690 695 700
Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser
705 710 715 720
Phe His Cys Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile
725 730 735
Thr Cys Ile Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro
740 745 750
Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val
755 760 765
Ile Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys
770 775 780
Glu Trp Ile Ile Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe
785 790 795 800
Asp Arg Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp
805 810 815
Gly Pro Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln
820 825 830
Ala Pro Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe
835 840 845
Thr Thr Asp Asn Ser Arg Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
850 855 860
Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val
865 870 875 880
Asn Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr
885 890 895
Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val
900 905 910
Cys Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp Ala
915 920 925
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
930 935 940
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
945 950 955 960
Glu Val Ser His Gly Lys Gly Val Gln Met Ile Phe His Thr Phe His
965 970 975
Leu Glu Ser Ser His Asp Tyr Leu Leu Ile Thr Glu Asp Gly Ser Phe
980 985 990
Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr Ile
995 1000 1005
Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gln Leu Arg Phe Ile
1010 1015 1020
Ser Asp Phe Ser Ile Ser Tyr Glu Gly Phe Asn Ile Thr Phe Ser
1025 1030 1035
Glu Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe
1040 1045 1050
Ser Arg Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr
1055 1060 1065
Phe Ser Cys Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu
1070 1075 1080
Thr Cys Leu Gly Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro
1085 1090 1095
Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly
1100 1105 1110
Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp Asn Asn His
1115 1120 1125
Glu Cys Ile Tyr Lys Ile Glu Thr Glu Ala Gly Lys Gly Ile His
1130 1135 1140
Leu Arg Thr Arg Ser Phe Gln Leu Phe Glu Gly Asp Thr Leu Lys
1145 1150 1155
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe
1160 1165 1170
Thr Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn
1175 1180 1185
His Leu Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln
1190 1195 1200
Gly Phe Gln Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu
1205 1210 1215
Asp Pro Gly Ile Pro Asn Tyr Gly Tyr Arg Ile Arg Asp Glu Gly
1220 1225 1230
His Phe Thr Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr
1235 1240 1245
Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg
1250 1255 1260
Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile Ala Glu Cys Gly
1265 1270 1275
Gly Gln Ile His Ala Ala Thr Ser Gly Arg Ile Leu Ser Pro Gly
1280 1285 1290
Tyr Pro Ala Pro Tyr Asp Asn Asn Leu His Cys Thr Trp Ile Ile
1295 1300 1305
Glu Ala Asp Pro Gly Lys Thr Ile Ser Leu His Phe Ile Val Phe
1310 1315 1320
Asp Thr Glu Met Ala His Asp Ile Leu Lys Val Trp Asp Gly Pro
1325 1330 1335
Val Asp Ser Asp Ile Leu Leu Lys Glu Trp Ser Gly Ser Ala Leu
1340 1345 1350
Pro Glu Asp Ile His Ser Thr Phe Asn Ser Leu Thr Leu Gln Phe
1355 1360 1365
Asp Ser Asp Phe Phe Ile Ser Lys Ser Gly Phe Ser Ile Gln Phe
1370 1375 1380
Ser Thr Ser Ile Ala Ala Thr Cys Asn Asp Pro Gly Met Pro Gln
1385 1390 1395
Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu Ala Gly Asp Thr Val
1400 1405 1410
Thr Phe Gln Cys Asp Pro Gly Tyr Gln Leu Gln Gly Gln Ala Lys
1415 1420 1425
Ile Thr Cys Val Gln Leu Asn Asn Arg Phe Phe Trp Gln Pro Asp
1430 1435 1440
Pro Pro Thr Cys Ile Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro
1445 1450 1455
Ala Gly Val Ile Leu Ser Pro Asn Tyr Pro Gln Pro Tyr Pro Pro
1460 1465 1470
Gly Lys Glu Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe Val
1475 1480 1485
Ile Ala Leu Ile Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr Asp
1490 1495 1500
Phe Leu His Ile Tyr Glu Gly Glu Asp Ser Asn Ser Pro Leu Ile
1505 1510 1515
Gly Ser Tyr Gln Gly Ser Gln Ala Pro Glu Arg Ile Glu Ser Ser
1520 1525 1530
Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val Gly
1535 1540 1545
Leu Ser Gly Phe Ala Ile Glu Phe Lys Glu Lys Pro Arg Glu Ala
1550 1555 1560
Cys Phe Asp Pro Gly Asn Ile Met Asn Gly Thr Arg Val Gly Thr
1565 1570 1575
Asp Phe Lys Leu Gly Ser Thr Ile Thr Tyr Gln Cys Asp Ser Gly
1580 1585 1590
Tyr Lys Ile Leu Asp Pro Ser Ser Ile Thr Cys Val Ile Gly Ala
1595 1600 1605
Asp Gly Lys Pro Ser Trp Asp Gln Val Leu Pro Ser Cys Asn Ala
1610 1615 1620
Pro Cys Gly Gly Gln Tyr Thr Gly Ser Glu Gly Val Val Leu Ser
1625 1630 1635
Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gln Ile Cys Leu Tyr
1640 1645 1650
Ser Ile Thr Val Pro Lys Glu Phe Val Val Phe Gly Gln Phe Ala
1655 1660 1665
Tyr Phe Gln Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly
1670 1675 1680
Thr His Ala Gln Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His
1685 1690 1695
Ser Gly Glu Thr Leu Pro Leu Ala Thr Ser Asn Gln Ile Leu Leu
1700 1705 1710
Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe
1715 1720 1725
Val Tyr Gln Ala Val Pro Arg Thr Ser Asp Thr Gln Cys Ser Ser
1730 1735 1740
Val Pro Glu Pro Arg Tyr Gly Arg Arg Ile Gly Ser Glu Phe Ser
1745 1750 1755
Ala Gly Ser Ile Val Arg Phe Glu Cys Asn Pro Gly Tyr Leu Leu
1760 1765 1770
Gln Gly Ser Thr Ala Leu His Cys Gln Ser Val Pro Asn Ala Leu
1775 1780 1785
Ala Gln Trp Asn Asp Thr Ile Pro Ser Cys Val Val Pro Cys Ser
1790 1795 1800
Gly Asn Phe Thr Gln Arg Arg Gly Thr Ile Leu Ser Pro Gly Tyr
1805 1810 1815
Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys Ile Trp Lys Ile Ile
1820 1825 1830
Val Thr Glu Gly Ser Gly Ile Gln Ile Gln Val Ile Ser Phe Ala
1835 1840 1845
Thr Glu Gln Asn Trp Asp Ser Leu Glu Ile His Asp Gly Gly Asp
1850 1855 1860
Val Thr Ala Pro Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro
1865 1870 1875
Ala Leu Leu Asn Ser Thr Ser Asn Gln Leu Tyr Leu His Phe Gln
1880 1885 1890
Ser Asp Ile Ser Val Ala Ala Ala Gly Phe His Leu Glu Tyr Lys
1895 1900 1905
Thr Val Gly Leu Ala Ala Cys Gln Glu Pro Ala Leu Pro Ser Asn
1910 1915 1920
Ser Ile Lys Ile Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser
1925 1930 1935
Phe Gln Cys Glu Pro Gly Tyr Thr Leu Gln Gly Arg Ser His Ile
1940 1945 1950
Ser Cys Met Pro Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser Pro
1955 1960 1965
Leu Cys Ile Ala Thr Cys Gly Gly Thr Leu Ser Thr Leu Gly Gly
1970 1975 1980
Val Ile Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu
1985 1990 1995
Asp Cys Thr Trp Arg Ile Ser Leu Pro Ile Gly Tyr Gly Ala His
2000 2005 2010
Ile Gln Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe Leu
2015 2020 2025
Glu Ile Gln Asn Gly Pro Tyr His Thr Ser Pro Met Ile Gly Gln
2030 2035 2040
Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr His
2045 2050 2055
Glu Thr Leu Ile His Phe Tyr Ser Asp His Ser Gln Asn Arg Gln
2060 2065 2070
Gly Phe Lys Leu Ala Tyr Gln Ala Tyr Glu Leu Gln Asn Cys Pro
2075 2080 2085
Asp Pro Pro Pro Phe Gln Asn Gly Tyr Met Ile Asn Ser Asp Tyr
2090 2095 2100
Ser Val Gly Gln Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr Ile
2105 2110 2115
Leu Ile Gly His Pro Val Leu Thr Cys Gln His Gly Ile Asn Arg
2120 2125 2130
Asn Trp Asn Tyr Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly Tyr
2135 2140 2145
Asn Val Thr Ser Gln Asn Gly Thr Ile Tyr Ser Pro Gly Phe Pro
2150 2155 2160
Asp Glu Tyr Pro Ile Leu Lys Asp Cys Ile Trp Leu Ile Thr Val
2165 2170 2175
Pro Pro Gly His Gly Val Tyr Ile Asn Phe Thr Leu Leu Gln Thr
2180 2185 2190
Glu Ala Val Asn Asp Tyr Ile Ala Val Trp Asp Gly Pro Asp Gln
2195 2200 2205
Asn Ser Pro Gln Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu
2210 2215 2220
Thr Ala Tyr Ser Ser Thr Asn Gln Val Leu Leu Lys Phe His Ser
2225 2230 2235
Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe
2240 2245 2250
Gln Leu Lys Lys Cys Gln Pro Pro Pro Ala Val Pro Gln Ala Glu
2255 2260 2265
Met Leu Thr Glu Asp Asp Asp Phe Glu Ile Gly Asp Phe Val Lys
2270 2275 2280
Tyr Gln Cys His Pro Gly Tyr Thr Leu Val Gly Thr Asp Ile Leu
2285 2290 2295
Thr Cys Lys Leu Ser Ser Gln Leu Gln Phe Glu Gly Ser Leu Pro
2300 2305 2310
Thr Cys Glu Ala Gln Cys Pro Ala Asn Glu Val Arg Thr Gly Ser
2315 2320 2325
Ser Gly Val Ile Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe Asn
2330 2335 2340
Ser Gln Thr Cys Ser Trp Ser Ile Lys Val Glu Pro Asn Tyr Asn
2345 2350 2355
Ile Thr Ile Phe Val Asp Thr Phe Gln Ser Glu Lys Gln Phe Asp
2360 2365 2370
Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gln Ser Pro Leu Leu
2375 2380 2385
Val Val Leu Ser Gly Asn His Thr Glu Gln Ser Asn Phe Thr Ser
2390 2395 2400
Arg Ser Asn Gln Leu Tyr Leu Arg Trp Ser Thr Asp His Ala Thr
2405 2410 2415
Ser Lys Lys Gly Phe Lys Ile Arg Tyr Ala Ala Pro Tyr Cys Ser
2420 2425 2430
Leu Thr His Pro Leu Lys Asn Gly Gly Ile Leu Asn Arg Thr Ala
2435 2440 2445
Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys Lys Pro Gly Tyr
2450 2455 2460
Arg Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro Leu
2465 2470 2475
Gly Met Tyr Gln Trp Asp Ser Leu Thr Pro Leu Cys Gln Ala Val
2480 2485 2490
Ser Cys Gly Ile Pro Glu Ser Pro Gly Asn Gly Ser Phe Thr Gly
2495 2500 2505
Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr Glu Cys His Glu
2510 2515 2520
Gly Phe Lys Leu Glu Ser Ser Gln Gln Ala Thr Ala Val Cys Gln
2525 2530 2535
Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro Pro Thr Cys Lys
2540 2545 2550
Pro Val Ala Cys Pro Ser Ile Glu Ala Gln Leu Ser Glu His Val
2555 2560 2565
Ile Trp Arg Leu Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gln
2570 2575 2580
Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu Glu Gly Trp Arg
2585 2590 2595
Leu Leu Arg Cys Gln Ala Asn Gly Thr Trp Asn Ile Gly Asp Glu
2600 2605 2610
Arg Pro Ser Cys Arg Val Ile Ser Cys Gly Ser Leu Ser Phe Pro
2615 2620 2625
Pro Asn Gly Asn Lys Ile Gly Thr Leu Thr Val Tyr Gly Ala Thr
2630 2635 2640
Ala Ile Phe Thr Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser His
2645 2650 2655
Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu Thr
2660 2665 2670
Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp Pro Ile Val Asn
2675 2680 2685
Gly His Ile Ser Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val
2690 2695 2700
Tyr Gln Cys Asn Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg
2705 2710 2715
Ile Cys Leu Gln Asp His Lys Trp Ser Gly Gln Thr Pro Val Cys
2720 2725 2730
Val Pro Ile Thr Cys Gly His Pro Gly Asn Pro Ala His Gly Phe
2735 2740 2745
Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val Val Asn Phe Thr
2750 2755 2760
Cys Asn Thr Gly Tyr Leu Leu Gln Gly Val Ser Arg Ala Gln Cys
2765 2770 2775
Arg Ser Asn Gly Gln Trp Ser Ser Pro Leu Pro Thr Cys Arg Val
2780 2785 2790
Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn Ala Ile Arg His
2795 2800 2805
Gly Gln Gln Asn Phe Pro Glu Ser Phe Glu Tyr Gly Met Ser Ile
2810 2815 2820
Leu Tyr His Cys Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser Ala
2825 2830 2835
Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg Ser Leu Pro Lys
2840 2845 2850
Cys Leu Ala Ile Ser Cys Gly His Pro Gly Val Pro Ala Asn Ala
2855 2860 2865
Val Leu Thr Gly Glu Leu Phe Thr Tyr Gly Ala Val Val His Tyr
2870 2875 2880
Ser Cys Arg Gly Ser Glu Ser Leu Ile Gly Asn Asp Thr Arg Val
2885 2890 2895
Cys Gln Glu Asp Ser His Trp Ser Gly Ala Leu Pro His Cys Thr
2900 2905 2910
Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala His
2915 2920 2925
Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu Arg
2930 2935 2940
Phe Ser Cys Glu Met Gly His Gln Leu Arg Gly Ser Pro Glu Arg
2945 2950 2955
Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gln Pro Val Cys
2960 2965 2970
Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly Met
2975 2980 2985
Ile Val Ser Ser Asp Gly Ile Leu Phe Ser Ser Ser Val Ile Tyr
2990 2995 3000
Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu Met Thr Arg His
3005 3010 3015
Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys Thr
3020 3025 3030
Ile Ile Ser Cys Gly Asp Pro Gly Thr Leu Ala Asn Gly Ile Gln
3035 3040 3045
Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val Ser Tyr Gln Cys
3050 3055 3060
Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser Ala Thr Ile Arg
3065 3070 3075
Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys Pro Val Cys Lys
3080 3085 3090
Ala Val Leu Cys Pro Gln Pro Pro Pro Val Gln Asn Gly Thr Val
3095 3100 3105
Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser Ile Ser Tyr Ser Cys
3110 3115 3120
Met Asp Gly Tyr Gln Leu Ser His Ser Ala Ile Leu Ser Cys Glu
3125 3130 3135
Gly Arg Gly Val Trp Lys Gly Glu Ile Pro Gln Cys Leu Pro Val
3140 3145 3150
Phe Cys Gly Asp Pro Gly Ile Pro Ala Glu Gly Arg Leu Ser Gly
3155 3160 3165
Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe Gln Cys Lys Ser
3170 3175 3180
Pro Phe Ile Leu Val Gly Ser Ser Arg Arg Val Cys Gln Ala Asp
3185 3190 3195
Gly Thr Trp Ser Gly Ile Gln Pro Thr Cys Ile Asp Pro Ala His
3200 3205 3210
Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe Gly Ile Gln Asn
3215 3220 3225
Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys
3230 3235 3240
Arg Lys Gly Tyr His Ile Gln Gly Ser Thr Thr Arg Thr Cys Leu
3245 3250 3255
Ala Asn Leu Thr Trp Ser Gly Ile Gln Thr Glu Cys Ile Pro His
3260 3265 3270
Ala Cys Arg Gln Pro Glu Thr Pro Ala His Ala Asp Val Arg Ala
3275 3280 3285
Ile Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys His
3290 3295 3300
Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His Arg Thr Cys Lys
3305 3310 3315
Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val Cys Lys Ser Lys
3320 3325 3330
Gly Val Arg Glu Val Asn Glu Thr Val Thr Lys Thr Pro Val Pro
3335 3340 3345
Ser Asp Val Phe Phe Val Asn Ser Leu Trp Lys Gly Tyr Tyr Glu
3350 3355 3360
Tyr Leu Gly Lys Arg Gln Pro Ala Thr Leu Thr Val Asp Trp Phe
3365 3370 3375
Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe Ser Glu Ala Ser
3380 3385 3390
Pro Val Glu Leu Lys Leu Thr Gly Ile Tyr Lys Lys Glu Glu Ala
3395 3400 3405
His Leu Leu Leu Lys Ala Phe Gln Ile Lys Gly Gln Ala Asp Ile
3410 3415 3420
Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly Leu Asp Gly Tyr
3425 3430 3435
Val Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr Phe Gln Gly Asp
3440 3445 3450
Ile His Gly Lys Asp Phe Gly Lys Phe Lys Leu Glu Arg Gln Asp
3455 3460 3465
Pro Leu Asn Pro Asp Gln Asp Ser Ser Ser His Tyr His Gly Thr
3470 3475 3480
Ser Ser Gly Ser Val Ala Ala Ala Ile Leu Val Pro Phe Phe Ala
3485 3490 3495
Leu Ile Leu Ser Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg Thr
3500 3505 3510
Arg Pro Lys Val Gln Tyr Asn Gly Tyr Ala Gly His Glu Asn Ser
3515 3520 3525
Asn Gly Gln Ala Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn Leu
3530 3535 3540
Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp Thr Thr Leu Asn
3545 3550 3555
Thr Val Cys Thr Val Val
3560
<![CDATA[<210> 309]]>
<![CDATA[<211> 4129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 309]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agggtcagaa ctgtggaggc 600
ttagtccagg gtcccaatgg cactattgag agcccagggt ttcctcacgg gtatccgaac 660
tatgccaact gcacctggat catcatcacg ggcgagcgca ataggataca gttgtccttc 720
catacctttg ctcttgaaga agattttgat attttatcag tttacgatgg acagcctcaa 780
caagggaatt taaaagtgag attatcggga tttcagctgc cctcctctat agtgagtaca 840
ggatctatcc tcactctgtg gttcacgaca gacttcgctg tgagtgccca aggtttcaaa 900
gcattatatg aagttttacc tagccacact tgtggaaatc ctggagaaat cctgaaagga 960
gttctgcatg gaacgagatt caacatagga gacaaaatcc ggtacagctg cctccctggc 1020
tacatcttgg aaggccacgc catcctgacc tgcatcgtca gcccaggaaa tggtgcatcg 1080
tgggacttcc cagctccctt ttgcagagct gagggagcct gcggaggaac cttacgcggg 1140
accagcagct ccatctccag cccgcacttc ccttcagagt acgagaacaa cgcggactgc 1200
acctggacca ttctggctga gcccggggac accattgcgc tggtcttcac tgactttcag 1260
ctagaagaag gatatgattt cttagagatc agtggcacgg aagctccatc catatggcta 1320
actggcatga acctcccctc tccagttatc agtagcaaga attggctacg actccatttc 1380
acctctgaca gcaaccaccg acgcaaagga tttaacgctc agttccaagt gaaaaaggcg 1440
attgagttga agtcaagagg agtcaagatg ctgcccagca aggatggaag ccataaaaac 1500
tctgtcttga gccaaggagg tgttgcattg gtctctgaca tgtgtccaga tcctgggatt 1560
ccagaaaatg gtagaagagc aggttccgac ttcagggttg gtgcaaatgt acagttttca 1620
tgtgaggaca attacgtgct ccagggatct aaaagcatca cctgtcagag agttacagag 1680
acgctcgctg cttggagtga ccacaggccc atctgccgag cgagaacatg tggatccaat 1740
ctgcgtgggc ccagcggcgt cattacctcc cctaattatc cggttcagta tgaagataat 1800
gcacactgtg tgtgggtcat caccaccacc gacccggaca aggtcatcaa gcttgccttt 1860
gaagagtttg agctggagcg aggctatgac accctgacgg ttggtgatgc tgggaaggtg 1920
ggagacacca gatcggtctt gtacgtgctc acgggatcca gtgttcctga cctcattgtg 1980
agcatgagca accagatgtg gctacatctg cagtcggatg atagcattgg ctcacctggg 2040
tttaaagctg tttaccaaga aattgaaaag ggagggtgtg gggatcctgg aatccccgcc 2100
tatgggaagc ggacgggcag cagtttcctc catggagata cactcacctt tgaatgcccg 2160
gcggcctttg agctggtggg ggagagagtt atcacctgtc agcagaacaa tcagtggtct 2220
ggcaacaagc ccagctgtgt attttcatgt ttcttcaact ttacggcatc atctgggatt 2280
attctgtcac caaattatcc agaggaatat gggaacaaca tgaactgtgt ctggttgatt 2340
atctcggagc caggaagtcg aattcaccta atctttaatg attttgatgt tgagcctcag 2400
tttgactttc tcgcggtcaa ggatgatggc atttctgaca taactgtcct gggtactttt 2460
tctggcaatg aagtgccttc ccagctggcc agcagtgggc atatagttcg cttggaattt 2520
cagtctgacc attccactac tggcagaggg ttcaacatca cttacaccac atttggtcag 2580
aatgagtgcc atgatcctgg cattcctata aacggacgac gttttggtga caggtttcta 2640
ctcgggagct cggtttcttt ccactgtgat gatggctttg tcaagaccca gggatccgag 2700
tccattacct gcatactgca agacgggaac gtggtctgga gctccaccgt gccccgctgt 2760
gaagctccat gtggtggaca tctgacagcg tccagcggag tcattttgcc tcctggatgg 2820
ccaggatatt ataaggattc tttacattgt gaatggataa ttgaagcaaa accaggccac 2880
tctatcaaaa taacttttga cagatttcag acagaggtca attatgacac cttggaggtc 2940
agagatgggc cagccagttc gtccccactg atcggcgagt accacggcac ccaggcaccc 3000
cagttcctca tcagcaccgg gaacttcatg tacctgctgt tcaccactga caacagccgc 3060
tccagcatcg gcttcctcat ccactatgag agtgtgacgc ttgagtcgga ttcctgcctg 3120
gacccgggca tccctgtgaa cggccatcgc cacggtggag actttggcat caggtccaca 3180
gtgactttca gctgtgaccc ggggtacaca ctaagtgacg acgagcccct cgtctgtgag 3240
aggaaccacc agtggaacca cgccttgccc agctgcgacg ctctatgtgg aggctacatc 3300
caagggaaga gtggaacagt cctttctcct gggtttccag atttttatcc aaactctcta 3360
aactgcacgt ggaccattga agtgtctcat gggaaaggag ttcaaatgat ctttcacacc 3420
tttcatcttg agagttccca cgactattta ctgatcacag aggatggaag tttttccgag 3480
cccgttgcca ggctcaccgg gtcggtgttg cctcatacga tcaaggcagg cctgtttgga 3540
aacttcactg cccagcttcg gtttatatca gacttctcaa tttcgtacga gggcttcaat 3600
atcacatttt cagaatatga cctggagcca tgtgatgatc ctggagtccc tgccttcagc 3660
cgaagaattg gttttcactt tggtgtggga gactctctga cgttttcctg cttcctggga 3720
tatcgtttag aaggtgccac caagcttacc tgcctgggtg ggggccgccg tgtgtggagt 3780
gcacctctgc caaggtgtgt ggccgaatgt ggagcaagtg tcaaaggaaa tgaaggaaca 3840
ttactgtctc caaattttcc atccaattat gataataacc atgagtgtat ctataaaata 3900
gaaacagaag ccggcaaggg catccacctt agaacacgaa gcttccagct gtttgaagga 3960
gatactctaa aggtatatga tggaaaagac agttcctcac gtccactggg cacgttcact 4020
aaaaatgaac ttctggggct gatcctaaac agcacatcca atcacctgtg gctagagttc 4080
aacaccaatg gatctgacac cgaccaaggt tttcaactca cctatacca 4129
<![CDATA[<210> 310]]>
<![CDATA[<211> 1210]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 310]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys Gly Gln Asn Cys
20 25 30
Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser Pro Gly Phe
35 40 45
Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile Ile Ile Thr
50 55 60
Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe Ala Leu Glu
65 70 75 80
Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro Gln Gln Gly
85 90 95
Asn Leu Lys Val Arg Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val
100 105 110
Ser Thr Gly Ser Ile Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val
115 120 125
Ser Ala Gln Gly Phe Lys Ala Leu Tyr Glu Val Leu Pro Ser His Thr
130 135 140
Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val Leu His Gly Thr Arg
145 150 155 160
Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys Leu Pro Gly Tyr Ile
165 170 175
Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val Ser Pro Gly Asn Gly
180 185 190
Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg Ala Glu Gly Ala Cys
195 200 205
Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ile Ser Ser Pro His Phe
210 215 220
Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp Thr Ile Leu Ala
225 230 235 240
Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp Phe Gln Leu Glu
245 250 255
Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu Ala Pro Ser Ile
260 265 270
Trp Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn
275 280 285
Trp Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly
290 295 300
Phe Asn Ala Gln Phe Gln Val Lys Lys Ala Ile Glu Leu Lys Ser Arg
305 310 315 320
Gly Val Lys Met Leu Pro Ser Lys Asp Gly Ser His Lys Asn Ser Val
325 330 335
Leu Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro
340 345 350
Gly Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe Arg Val Gly
355 360 365
Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln Gly Ser
370 375 380
Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala Trp Ser
385 390 395 400
Asp His Arg Pro Ile Cys Arg Ala Arg Thr Cys Gly Ser Asn Leu Arg
405 410 415
Gly Pro Ser Gly Val Ile Thr Ser Pro Asn Tyr Pro Val Gln Tyr Glu
420 425 430
Asp Asn Ala His Cys Val Trp Val Ile Thr Thr Thr Asp Pro Asp Lys
435 440 445
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
450 455 460
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
465 470 475 480
Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met
485 490 495
Ser Asn Gln Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser
500 505 510
Pro Gly Phe Lys Ala Val Tyr Gln Glu Ile Glu Lys Gly Gly Cys Gly
515 520 525
Asp Pro Gly Ile Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu
530 535 540
His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val
545 550 555 560
Gly Glu Arg Val Ile Thr Cys Gln Gln Asn Asn Gln Trp Ser Gly Asn
565 570 575
Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser
580 585 590
Gly Ile Ile Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met
595 600 605
Asn Cys Val Trp Leu Ile Ile Ser Glu Pro Gly Ser Arg Ile His Leu
610 615 620
Ile Phe Asn Asp Phe Asp Val Glu Pro Gln Phe Asp Phe Leu Ala Val
625 630 635 640
Lys Asp Asp Gly Ile Ser Asp Ile Thr Val Leu Gly Thr Phe Ser Gly
645 650 655
Asn Glu Val Pro Ser Gln Leu Ala Ser Ser Gly His Ile Val Arg Leu
660 665 670
Glu Phe Gln Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr
675 680 685
Tyr Thr Thr Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile
690 695 700
Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser
705 710 715 720
Phe His Cys Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile
725 730 735
Thr Cys Ile Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro
740 745 750
Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val
755 760 765
Ile Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys
770 775 780
Glu Trp Ile Ile Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe
785 790 795 800
Asp Arg Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp
805 810 815
Gly Pro Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln
820 825 830
Ala Pro Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe
835 840 845
Thr Thr Asp Asn Ser Arg Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
850 855 860
Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val
865 870 875 880
Asn Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr
885 890 895
Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val
900 905 910
Cys Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp Ala
915 920 925
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
930 935 940
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
945 950 955 960
Glu Val Ser His Gly Lys Gly Val Gln Met Ile Phe His Thr Phe His
965 970 975
Leu Glu Ser Ser His Asp Tyr Leu Leu Ile Thr Glu Asp Gly Ser Phe
980 985 990
Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr Ile
995 1000 1005
Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gln Leu Arg Phe Ile
1010 1015 1020
Ser Asp Phe Ser Ile Ser Tyr Glu Gly Phe Asn Ile Thr Phe Ser
1025 1030 1035
Glu Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe
1040 1045 1050
Ser Arg Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr
1055 1060 1065
Phe Ser Cys Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu
1070 1075 1080
Thr Cys Leu Gly Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro
1085 1090 1095
Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly
1100 1105 1110
Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp Asn Asn His
1115 1120 1125
Glu Cys Ile Tyr Lys Ile Glu Thr Glu Ala Gly Lys Gly Ile His
1130 1135 1140
Leu Arg Thr Arg Ser Phe Gln Leu Phe Glu Gly Asp Thr Leu Lys
1145 1150 1155
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe
1160 1165 1170
Thr Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn
1175 1180 1185
His Leu Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln
1190 1195 1200
Gly Phe Gln Leu Thr Tyr Thr
1205 1210
<![CDATA[<210> 311]]>
<![CDATA[<211> 102]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 311]]>
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 60
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca ag 102
<![CDATA[<210> 312]]>
<![CDATA[<211> 103]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 312]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag att 103
<![CDATA[<210> 313]]>
<![CDATA[<211> 205]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 313]]>
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 60
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca agccttgcct ccccgattga 120
aagagatgaa aagccaggaa tcggctgcag gttccaaact agtccttcgg tgtgaaacca 180
gttctgaata ctcctctctc agatt 205
<![CDATA[<210> 314]]>
<![CDATA[<211> 67]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 314]]>
Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn Ala Glu Cys Gln Lys
1 5 10 15
Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu Thr Lys Pro Lys Pro
20 25 30
Gln Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala
35 40 45
Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser
50 55 60
Ser Leu Arg
65
<![CDATA[<210> 315]]>
<![CDATA[<211> 532]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 315]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg ag 532
<![CDATA[<210> 316]]>
<![CDATA[<211> 116]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 316]]>
aatgcaggct caacagtacc agcagcagcg tcgaaaattt gcagctgcct tcttggcatt 60
cattttcata ctggcagctg tggatactgc tgaagcaggg aagaaagaga aaccag 116
<![CDATA[<210> 317]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 317]]>
aaaaaaaagt gaagaagtct gactgtggag aatggcagtg gagtgtgtgt gtgcccacca 60
gtggagactg tgggctgggc acacgggagg gcactcggac tggagctgag tgcaagcaaa 120
ccatgaagac ccagagatgt aagatcccct gcaactggaa gaagcaattt ggcg 174
<![CDATA[<210> 318]]>
<![CDATA[<211> 162]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 318]]>
cggagtgcaa ataccagttc caggcctggg gagaatgtga cctgaacaca gccctgaaga 60
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 120
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca ag 162
<![CDATA[<210> 319]]>
<![CDATA[<211> 735]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 319]]>
cagaatctaa gaagaagaaa aaggaaggca agaaacagga gaagatgctg gattaaaaga 60
tgtcacctgt ggaacataaa aaggacatca gcaaacagga tcagttaact attgcattta 120
tatgtaccgt aggctttgta ttcaaaaatt atctatagct aagtacacaa taagcaaaaa 180
caaaaagaaa agaaaatttt tgtagtagcg ttttttaaat gtatactata gtaccagtag 240
gggcttataa taaaggactg taatcttatt taggaagttg acttatagta catgataaat 300
gatagacaat tgaggtaagt tttttgaaat tatgtgacat tttacattaa atttttttta 360
cattttttgg gcagcaattt aaatgttatg actatgtaaa ctacttctct tgttaggtaa 420
tttttttcac ctagattttt ttcccaattg agaaaaatat atactaaaca aaatagcaat 480
aaaacataat cactctattt gaagaaaata tcttgttttc tgccaataga ttttttaaaa 540
tgtagtcagc aaaatggggg tggggaagca gagcatgtcc tagttcaatg ttgacttttt 600
ttttttttaa agaaaagcat taagacataa aattctttca ctttggcaga agcatttgtt 660
ttcttgatga aattattttt ccatctgagg aaaaaaatac taggaaaata aatcaaggtg 720
atgctgaaaa aaaaa 735
<![CDATA[<210> 320]]>
<![CDATA[<211> 1719]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 320]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg agaatgcagg 540
ctcaacagta ccagcagcag cgtcgaaaat ttgcagctgc cttcttggca ttcattttca 600
tactggcagc tgtggatact gctgaagcag ggaagaaaga gaaaccagaa aaaaaagtga 660
agaagtctga ctgtggagaa tggcagtgga gtgtgtgtgt gcccaccagt ggagactgtg 720
ggctgggcac acgggagggc actcggactg gagctgagtg caagcaaacc atgaagaccc 780
agagatgtaa gatcccctgc aactggaaga agcaatttgg cgcggagtgc aaataccagt 840
tccaggcctg gggagaatgt gacctgaaca cagccctgaa gaccagaact ggaagtctga 900
agcgagccct gcacaatgcc gaatgccaga agactgtcac catctccaag ccctgtggca 960
aactgaccaa gcccaaacct caagcagaat ctaagaagaa gaaaaaggaa ggcaagaaac 1020
aggagaagat gctggattaa aagatgtcac ctgtggaaca taaaaaggac atcagcaaac 1080
aggatcagtt aactattgca tttatatgta ccgtaggctt tgtattcaaa aattatctat 1140
agctaagtac acaataagca aaaacaaaaa gaaaagaaaa tttttgtagt agcgtttttt 1200
aaatgtatac tatagtacca gtaggggctt ataataaagg actgtaatct tatttaggaa 1260
gttgacttat agtacatgat aaatgataga caattgaggt aagttttttg aaattatgtg 1320
acattttaca ttaaattttt tttacatttt ttgggcagca atttaaatgt tatgactatg 1380
taaactactt ctcttgttag gtaatttttt tcacctagat ttttttccca attgagaaaa 1440
atatatacta aacaaaatag caataaaaca taatcactct atttgaagaa aatatcttgt 1500
tttctgccaa tagatttttt aaaatgtagt cagcaaaatg ggggtgggga agcagagcat 1560
gtcctagttc aatgttgact tttttttttt ttaaagaaaa gcattaagac ataaaattct 1620
ttcactttgg cagaagcatt tgttttcttg atgaaattat ttttccatct gaggaaaaaa 1680
atactaggaa aataaatcaa ggtgatgctg aaaaaaaaa 1719
<![CDATA[<210> 321]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 321]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro
35
<![CDATA[<210> 322]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 322]]>
Lys Lys Val Lys Lys Ser Asp Cys Gly Glu Trp Gln Trp Ser Val Cys
1 5 10 15
Val Pro Thr Ser Gly Asp Cys Gly Leu Gly Thr Arg Glu Gly Thr Arg
20 25 30
Thr Gly Ala Glu Cys Lys Gln Thr Met Lys Thr Gln Arg Cys Lys Ile
35 40 45
Pro Cys Asn Trp Lys Lys Gln Phe Gly
50 55
<![CDATA[<210> 323]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 323]]>
Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn Thr
1 5 10 15
Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn Ala
20 25 30
Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu Thr
35 40 45
Lys Pro Lys Pro Gln
50
<![CDATA[<210> 324]]>
<![CDATA[<211> 17]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 324]]>
Glu Ser Lys Lys Lys Lys Lys Glu Gly Lys Lys Gln Glu Lys Met Leu
1 5 10 15
Asp
<![CDATA[<210> 325]]>
<![CDATA[<211> 168]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 325]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro Glu Lys Lys Val Lys Lys Ser Asp Cys Gly
35 40 45
Glu Trp Gln Trp Ser Val Cys Val Pro Thr Ser Gly Asp Cys Gly Leu
50 55 60
Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys Gln Thr Met
65 70 75 80
Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys Gln Phe Gly
85 90 95
Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn
100 105 110
Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn
115 120 125
Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu
130 135 140
Thr Lys Pro Lys Pro Gln Ala Glu Ser Lys Lys Lys Lys Lys Glu Gly
145 150 155 160
Lys Lys Gln Glu Lys Met Leu Asp
165
<![CDATA[<210> 326]]>
<![CDATA[<211> 984]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 326]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg agaatgcagg 540
ctcaacagta ccagcagcag cgtcgaaaat ttgcagctgc cttcttggca ttcattttca 600
tactggcagc tgtggatact gctgaagcag ggaagaaaga gaaaccagaa aaaaaagtga 660
agaagtctga ctgtggagaa tggcagtgga gtgtgtgtgt gcccaccagt ggagactgtg 720
ggctgggcac acgggagggc actcggactg gagctgagtg caagcaaacc atgaagaccc 780
agagatgtaa gatcccctgc aactggaaga agcaatttgg cgcggagtgc aaataccagt 840
tccaggcctg gggagaatgt gacctgaaca cagccctgaa gaccagaact ggaagtctga 900
agcgagccct gcacaatgcc gaatgccaga agactgtcac catctccaag ccctgtggca 960
aactgaccaa gcccaaacct caag 984
<![CDATA[<210> 327]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 327]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro Glu Lys Lys Val Lys Lys Ser Asp Cys Gly
35 40 45
Glu Trp Gln Trp Ser Val Cys Val Pro Thr Ser Gly Asp Cys Gly Leu
50 55 60
Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys Gln Thr Met
65 70 75 80
Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys Gln Phe Gly
85 90 95
Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn
100 105 110
Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn
115 120 125
Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu
130 135 140
Thr Lys Pro Lys Pro Gln
145 150
<![CDATA[<210> 328]]>
<![CDATA[<211> 95]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 328]]>
caacagcaac aagatcacag ttcgcttcca ctcagatcag tcctacaccg acaccggctt 60
cttagctgaa tacctctcct acgactccag tgacc 95
<![CDATA[<210> 329]]>
<![CDATA[<211> 87]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 329]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[<210> 330]]>
<![CDATA[<211> 182]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 330]]>
caacagcaac aagatcacag ttcgcttcca ctcagatcag tcctacaccg acaccggctt 60
cttagctgaa tacctctcct acgactccag tgaccctaca tctacatcca ccactgggac 120
aagccatctt gtaaaatgtg cggagaagga gaaaactttc tgtgtgaatg gaggggagtg 180
ct 182
<![CDATA[<210> 331]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 331]]>
Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp Gln Ser Tyr Thr
1 5 10 15
Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp Ser Ser Asp Pro
20 25 30
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
35 40 45
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
<![CDATA[<210> 332]]>
<![CDATA[<211> 279]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 332]]>
gtgagagcgg agctgcagcc ggagaaagag gaagagggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgag 279
<![CDATA[<210> 333]]>
<![CDATA[<211> 160]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 333]]>
aaagtgaatg gcttggagga aggcgtggag ttcctgccag tcaacaacgt caagaaggtg 60
gaaaagcatg gcccggggcg ctgggtggtg ctggcagccg tgctgatcgg cctcctcttg 120
gtcttgctgg ggatcggctt cctggtgtgg catttgcagt 160
<![CDATA[<210> 334]]>
<![CDATA[<211> 128]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 334]]>
accgggacgt gcgtgtccag aaggtcttca atggctacat gaggatcaca aatgagaatt 60
ttgtggatgc ctacgagaac tccaactcca ctgagtttgt aagcctggcc agcaaggtga 120
aggacgcg 128
<![CDATA[<210> 335]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 335]]>
ctgaagctgc tgtacagcgg agtcccattc ctgggcccct accacaagga gtcggctgtg 60
acggccttca g 71
<![CDATA[<210> 336]]>
<![CDATA[<211> 158]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 336]]>
cgagggcagc gtcatcgcct actactggtc tgagttcagc atcccgcagc acctggtgga 60
ggaggccgag cgcgtcatgg ccgaggagcg cgtagtcatg ctgcccccgc gggcgcgctc 120
cctgaagtcc tttgtggtca cctcagtggt ggctttcc 158
<![CDATA[<210> 337]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 337]]>
ccacggactc caaaacagta cagaggaccc aggaca 36
<![CDATA[<210> 338]]>
<![CDATA[<211> 241]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 338]]>
acagctgcag ctttggcctg cacgcccgcg gtgtggagct gatgcgcttc accacgcccg 60
gcttccctga cagcccctac cccgctcatg cccgctgcca gtgggccctg cggggggacg 120
ccgactcagt gctgagcctc accttccgca gctttgacct tgcgtcctgc gacgagcgcg 180
gcagcgacct ggtgacggtg tacaacaccc tgagccccat ggagccccac gccctggtgc 240
a 241
<![CDATA[<210> 339]]>
<![CDATA[<211> 140]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 339]]>
gttgtgtggc acctaccctc cctcctacaa cctgaccttc cactcctccc agaacgtcct 60
gctcatcaca ctgataacca acactgagcg gcggcatccc ggctttgagg ccaccttctt 120
ccagctgcct aggatgagca 140
<![CDATA[<210> 340]]>
<![CDATA[<211> 98]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 340]]>
gctgtggagg ccgcttacgt aaagcccagg ggacattcaa cagcccctac tacccaggcc 60
actacccacc caacattgac tgcacatgga acattgag 98
<![CDATA[<210> 341]]>
<![CDATA[<211> 110]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 341]]>
gtgcccaaca accagcatgt gaaggtgcgc ttcaaattct tctacctgct ggagcccggc 60
gtgcctgcgg gcacctgccc caaggactac gtggagatca acggggagaa 110
<![CDATA[<210> 342]]>
<![CDATA[<211> 131]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 342]]>
atactgcgga gagaggtccc agttcgtcgt caccagcaac agcaacaaga tcacagttcg 60
cttccactca gatcagtcct acaccgacac cggcttctta gctgaatacc tctcctacga 120
ctccagtgac c 131
<![CDATA[<210> 343]]>
<![CDATA[<211> 105]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 343]]>
catgcccggg gcagttcacg tgccgcacgg ggcggtgtat ccggaaggag ctgcgctgtg 60
atggctgggc cgactgcacc gaccacagcg atgagctcaa ctgca 105
<![CDATA[<210> 344]]>
<![CDATA[<211> 111]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 344]]>
gttgcgacgc cggccaccag ttcacgtgca agaacaagtt ctgcaagccc ctcttctggg 60
tctgcgacag tgtgaacgac tgcggagaca acagcgacga gcaggggtgc a 111
<![CDATA[<210> 345]]>
<![CDATA[<211> 114]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 345]]>
gttgtccggc ccagaccttc aggtgttcca atgggaagtg cctctcgaaa agccagcagt 60
gcaatgggaa ggacgactgt ggggacgggt ccgacgaggc ctcctgcccc aagg 114
<![CDATA[<210> 346]]>
<![CDATA[<211> 123]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 346]]>
tgaacgtcgt cacttgtacc aaacacacct accgctgcct caatgggctc tgcttgagca 60
agggcaaccc tgagtgtgac gggaaggagg actgtagcga cggctcagat gagaaggact 120
gcg 123
<![CDATA[<210> 347]]>
<![CDATA[<211> 187]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 347]]>
actgtgggct gcggtcattc acgagacagg ctcgtgttgt tgggggcacg gatgcggatg 60
agggcgagtg gccctggcag gtaagcctgc atgctctggg ccagggccac atctgcggtg 120
cttccctcat ctctcccaac tggctggtct ctgccgcaca ctgctacatc gatgacagag 180
gattcag 187
<![CDATA[<210> 348]]>
<![CDATA[<211> 275]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 348]]>
gtactcagac cccacgcagt ggacggcctt cctgggcttg cacgaccaga gccagcgcag 60
cgcccctggg gtgcaggagc gcaggctcaa gcgcatcatc tcccacccct tcttcaatga 120
cttcaccttc gactatgaca tcgcgctgct ggagctggag aaaccggcag agtacagctc 180
catggtgcgg cccatctgcc tgccggacgc ctcccatgtc ttccctgccg gcaaggccat 240
ctgggtcacg ggctggggac acacccagta tggag 275
<![CDATA[<210> 349]]>
<![CDATA[<211> 137]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 349]]>
gcactggcgc gctgatcctg caaaagggtg agatccgcgt catcaaccag accacctgcg 60
agaacctcct gccgcagcag atcacgccgc gcatgatgtg cgtgggcttc ctcagcggcg 120
gcgtggactc ctgccag 137
<![CDATA[<210> 350]]>
<![CDATA[<211> 701]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 350]]>
ggtgattccg ggggacccct gtccagcgtg gaggcggatg ggcggatctt ccaggccggt 60
gtggtgagct ggggagacgg ctgcgctcag aggaacaagc caggcgtgta cacaaggctc 120
cctctgtttc gggactggat caaagagaac actggggtat aggggccggg gccacccaaa 180
tgtgtacacc tgcggggcca cccatcgtcc accccagtgt gcacgcctgc aggctggaga 240
ctggaccgct gactgcacca gcgcccccag aacatacact gtgaactcaa tctccagggc 300
tccaaatctg cctagaaaac ctctcgcttc ctcagcctcc aaagtggagc tgggaggtag 360
aaggggagga cactggtggt tctactgacc caactggggg caaaggtttg aagacacagc 420
ctcccccgcc agccccaagc tgggccgagg cgcgtttgtg catatctgcc tcccctgtct 480
ctaaggagca gcgggaacgg agcttcgggg cctcctcagt gaaggtggtg gggctgccgg 540
atctgggctg tggggccctt gggccacgct cttgaggaag cccaggctcg gaggaccctg 600
gaaaacagac gggtctgaga ctgaaattgt tttaccagct cccagggtgg acttcagtgt 660
gtgtatttgt gtaaatgagt aaaacatttt atttcttttt a 701
<![CDATA[<210> 351]]>
<![CDATA[<211> 3305]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> ST14之全mRNA多核苷酸序列 ]]>
<![CDATA[<400> 351]]>
gtgagagcgg agctgcagcc ggagaaagag gaagagggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgaga aagtgaatgg cttggaggaa 300
ggcgtggagt tcctgccagt caacaacgtc aagaaggtgg aaaagcatgg cccggggcgc 360
tgggtggtgc tggcagccgt gctgatcggc ctcctcttgg tcttgctggg gatcggcttc 420
ctggtgtggc atttgcagta ccgggacgtg cgtgtccaga aggtcttcaa tggctacatg 480
aggatcacaa atgagaattt tgtggatgcc tacgagaact ccaactccac tgagtttgta 540
agcctggcca gcaaggtgaa ggacgcgctg aagctgctgt acagcggagt cccattcctg 600
ggcccctacc acaaggagtc ggctgtgacg gccttcagcg agggcagcgt catcgcctac 660
tactggtctg agttcagcat cccgcagcac ctggtggagg aggccgagcg cgtcatggcc 720
gaggagcgcg tagtcatgct gcccccgcgg gcgcgctccc tgaagtcctt tgtggtcacc 780
tcagtggtgg ctttccccac ggactccaaa acagtacaga ggacccagga caacagctgc 840
agctttggcc tgcacgcccg cggtgtggag ctgatgcgct tcaccacgcc cggcttccct 900
gacagcccct accccgctca tgcccgctgc cagtgggccc tgcgggggga cgccgactca 960
gtgctgagcc tcaccttccg cagctttgac cttgcgtcct gcgacgagcg cggcagcgac 1020
ctggtgacgg tgtacaacac cctgagcccc atggagcccc acgccctggt gcagttgtgt 1080
ggcacctacc ctccctccta caacctgacc ttccactcct cccagaacgt cctgctcatc 1140
acactgataa ccaacactga gcggcggcat cccggctttg aggccacctt cttccagctg 1200
cctaggatga gcagctgtgg aggccgctta cgtaaagccc aggggacatt caacagcccc 1260
tactacccag gccactaccc acccaacatt gactgcacat ggaacattga ggtgcccaac 1320
aaccagcatg tgaaggtgcg cttcaaattc ttctacctgc tggagcccgg cgtgcctgcg 1380
ggcacctgcc ccaaggacta cgtggagatc aacggggaga aatactgcgg agagaggtcc 1440
cagttcgtcg tcaccagcaa cagcaacaag atcacagttc gcttccactc agatcagtcc 1500
tacaccgaca ccggcttctt agctgaatac ctctcctacg actccagtga cccatgcccg 1560
gggcagttca cgtgccgcac ggggcggtgt atccggaagg agctgcgctg tgatggctgg 1620
gccgactgca ccgaccacag cgatgagctc aactgcagtt gcgacgccgg ccaccagttc 1680
acgtgcaaga acaagttctg caagcccctc ttctgggtct gcgacagtgt gaacgactgc 1740
ggagacaaca gcgacgagca ggggtgcagt tgtccggccc agaccttcag gtgttccaat 1800
gggaagtgcc tctcgaaaag ccagcagtgc aatgggaagg acgactgtgg ggacgggtcc 1860
gacgaggcct cctgccccaa ggtgaacgtc gtcacttgta ccaaacacac ctaccgctgc 1920
ctcaatgggc tctgcttgag caagggcaac cctgagtgtg acgggaagga ggactgtagc 1980
gacggctcag atgagaagga ctgcgactgt gggctgcggt cattcacgag acaggctcgt 2040
gttgttgggg gcacggatgc ggatgagggc gagtggccct ggcaggtaag cctgcatgct 2100
ctgggccagg gccacatctg cggtgcttcc ctcatctctc ccaactggct ggtctctgcc 2160
gcacactgct acatcgatga cagaggattc aggtactcag accccacgca gtggacggcc 2220
ttcctgggct tgcacgacca gagccagcgc agcgcccctg gggtgcagga gcgcaggctc 2280
aagcgcatca tctcccaccc cttcttcaat gacttcacct tcgactatga catcgcgctg 2340
ctggagctgg agaaaccggc agagtacagc tccatggtgc ggcccatctg cctgccggac 2400
gcctcccatg tcttccctgc cggcaaggcc atctgggtca cgggctgggg acacacccag 2460
tatggaggca ctggcgcgct gatcctgcaa aagggtgaga tccgcgtcat caaccagacc 2520
acctgcgaga acctcctgcc gcagcagatc acgccgcgca tgatgtgcgt gggcttcctc 2580
agcggcggcg tggactcctg ccagggtgat tccgggggac ccctgtccag cgtggaggcg 2640
gatgggcgga tcttccaggc cggtgtggtg agctggggag acggctgcgc tcagaggaac 2700
aagccaggcg tgtacacaag gctccctctg tttcgggact ggatcaaaga gaacactggg 2760
gtataggggc cggggccacc caaatgtgta cacctgcggg gccacccatc gtccacccca 2820
gtgtgcacgc ctgcaggctg gagactggac cgctgactgc accagcgccc ccagaacata 2880
cactgtgaac tcaatctcca gggctccaaa tctgcctaga aaacctctcg cttcctcagc 2940
ctccaaagtg gagctgggag gtagaagggg aggacactgg tggttctact gacccaactg 3000
ggggcaaagg tttgaagaca cagcctcccc cgccagcccc aagctgggcc gaggcgcgtt 3060
tgtgcatatc tgcctcccct gtctctaagg agcagcggga acggagcttc ggggcctcct 3120
cagtgaaggt ggtggggctg ccggatctgg gctgtggggc ccttgggcca cgctcttgag 3180
gaagcccagg ctcggaggac cctggaaaac agacgggtct gagactgaaa ttgttttacc 3240
agctcccagg gtggacttca gtgtgtgtat ttgtgtaaat gagtaaaaca ttttatttct 3300
tttta 3305
<![CDATA[<210> 352]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 352]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu
20 25
<![CDATA[<210> 353]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 353]]>
Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn
1 5 10 15
Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu Ala
20 25 30
Ala Val Leu Ile Gly Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu
35 40 45
Val Trp His Leu Gln
50
<![CDATA[<210> 354]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 354]]>
Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile Thr
1 5 10 15
Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu Phe
20 25 30
Val Ser Leu Ala Ser Lys Val Lys Asp Ala
35 40
<![CDATA[<210> 355]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 355]]>
Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys
1 5 10 15
Glu Ser Ala Val Thr Ala Phe
20
<![CDATA[<210> 356]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 356]]>
Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser Ile Pro Gln
1 5 10 15
His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu Arg Val Val
20 25 30
Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val Val Thr Ser
35 40 45
Val Val Ala Phe
50
<![CDATA[<210> 357]]>
<![CDATA[<211> 11]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 357]]>
Thr Asp Ser Lys Thr Val Gln Arg Thr Gln Asp
1 5 10
<![CDATA[<210> 358]]>
<![CDATA[<211> 79]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 35]]>8
Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu Leu Met Arg Phe
1 5 10 15
Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala His Ala Arg Cys
20 25 30
Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu Ser Leu Thr Phe
35 40 45
Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly Ser Asp Leu Val
50 55 60
Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His Ala Leu Val
65 70 75
<![CDATA[<210> 359]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 359]]>
Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr Phe His Ser Ser
1 5 10 15
Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr Glu Arg Arg His
20 25 30
Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg Met Ser
35 40 45
<![CDATA[<210> 360]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 360]]>
Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn Ser Pro Tyr
1 5 10 15
Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp Asn Ile Glu
20 25 30
<![CDATA[<210> 361]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 361]]>
Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe Phe Tyr Leu
1 5 10 15
Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp Tyr Val Glu
20 25 30
Ile Asn Gly Glu
35
<![CDATA[<210> 362]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 362]]>
Tyr Cys Gly Glu Arg Ser Gln Phe Val Val Thr Ser Asn Ser Asn Lys
1 5 10 15
Ile Thr Val Arg Phe His Ser Asp Gln Ser Tyr Thr Asp Thr Gly Phe
20 25 30
Leu Ala Glu Tyr Leu Ser Tyr Asp Ser Ser Asp
35 40
<![CDATA[<210> 363]]>
<![CDATA[<211> 34]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 363]]>
Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys Ile Arg Lys Glu
1 5 10 15
Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His Ser Asp Glu Leu
20 25 30
Asn Cys
<![CDATA[<210> 364]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 36]]>4
Cys Asp Ala Gly His Gln Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro
1 5 10 15
Leu Phe Trp Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp
20 25 30
Glu Gln Gly Cys
35
<![CDATA[<210> 365]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 365]]>
Cys Pro Ala Gln Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys
1 5 10 15
Ser Gln Gln Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu
20 25 30
Ala Ser Cys Pro Lys
35
<![CDATA[<210> 366]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 366]]>
Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn Gly Leu
1 5 10 15
Cys Leu Ser Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp Cys Ser
20 25 30
Asp Gly Ser Asp Glu Lys Asp Cys
35 40
<![CDATA[<210> 367]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 367]]>
Cys Gly Leu Arg Ser Phe Thr Arg Gln Ala Arg Val Val Gly Gly Thr
1 5 10 15
Asp Ala Asp Glu Gly Glu Trp Pro Trp Gln Val Ser Leu His Ala Leu
20 25 30
Gly Gln Gly His Ile Cys Gly Ala Ser Leu Ile Ser Pro Asn Trp Leu
35 40 45
Val Ser Ala Ala His Cys Tyr Ile Asp Asp Arg Gly Phe
50 55 60
<![CDATA[<210> 368]]>
<![CDATA[<211> 91]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 368]]>
Tyr Ser Asp Pro Thr Gln Trp Thr Ala Phe Leu Gly Leu His Asp Gln
1 5 10 15
Ser Gln Arg Ser Ala Pro Gly Val Gln Glu Arg Arg Leu Lys Arg Ile
20 25 30
Ile Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr Asp Ile Ala
35 40 45
Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro
50 55 60
Ile Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala Ile
65 70 75 80
Trp Val Thr Gly Trp Gly His Thr Gln Tyr Gly
85 90
<![CDATA[<210> 369]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 369]]>
Thr Gly Ala Leu Ile Leu Gln Lys Gly Glu Ile Arg Val Ile Asn Gln
1 5 10 15
Thr Thr Cys Glu Asn Leu Leu Pro Gln Gln Ile Thr Pro Arg Met Met
20 25 30
Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gln
35 40 45
<![CDATA[<210> 370]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 370]]>
Gly Asp Ser Gly Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg Ile
1 5 10 15
Phe Gln Ala Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gln Arg Asn
20 25 30
Lys Pro Gly Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp Ile Lys
35 40 45
Glu Asn Thr Gly Val
50
<![CDATA[<210> 371]]>
<![CDATA[<211> 855]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 371]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn Gly Leu
20 25 30
Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn Val Lys Lys Val Glu
35 40 45
Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala Val Leu Ile Gly
50 55 60
Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu Val Trp His Leu Gln
65 70 75 80
Tyr Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile
85 90 95
Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu
100 105 110
Phe Val Ser Leu Ala Ser Lys Val Lys Asp Ala Leu Lys Leu Leu Tyr
115 120 125
Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys Glu Ser Ala Val Thr
130 135 140
Ala Phe Ser Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser
145 150 155 160
Ile Pro Gln His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu
165 170 175
Arg Val Val Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val
180 185 190
Val Thr Ser Val Val Ala Phe Pro Thr Asp Ser Lys Thr Val Gln Arg
195 200 205
Thr Gln Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu
210 215 220
Leu Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala
225 230 235 240
His Ala Arg Cys Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu
245 250 255
Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly
260 265 270
Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His
275 280 285
Ala Leu Val Gln Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr
290 295 300
Phe His Ser Ser Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr
305 310 315 320
Glu Arg Arg His Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg
325 330 335
Met Ser Ser Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn
340 345 350
Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp
355 360 365
Asn Ile Glu Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe
370 375 380
Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp
385 390 395 400
Tyr Val Glu Ile Asn Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gln Phe
405 410 415
Val Val Thr Ser Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp
420 425 430
Gln Ser Tyr Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp
435 440 445
Ser Ser Asp Pro Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys
450 455 460
Ile Arg Lys Glu Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His
465 470 475 480
Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala Gly His Gln Phe Thr Cys
485 490 495
Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp Val Cys Asp Ser Val Asn
500 505 510
Asp Cys Gly Asp Asn Ser Asp Glu Gln Gly Cys Ser Cys Pro Ala Gln
515 520 525
Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys Ser Gln Gln Cys
530 535 540
Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ser Cys Pro
545 550 555 560
Lys Val Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn
565 570 575
Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp
580 585 590
Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys Asp Cys Gly Leu Arg Ser
595 600 605
Phe Thr Arg Gln Ala Arg Val Val Gly Gly Thr Asp Ala Asp Glu Gly
610 615 620
Glu Trp Pro Trp Gln Val Ser Leu His Ala Leu Gly Gln Gly His Ile
625 630 635 640
Cys Gly Ala Ser Leu Ile Ser Pro Asn Trp Leu Val Ser Ala Ala His
645 650 655
Cys Tyr Ile Asp Asp Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gln Trp
660 665 670
Thr Ala Phe Leu Gly Leu His Asp Gln Ser Gln Arg Ser Ala Pro Gly
675 680 685
Val Gln Glu Arg Arg Leu Lys Arg Ile Ile Ser His Pro Phe Phe Asn
690 695 700
Asp Phe Thr Phe Asp Tyr Asp Ile Ala Leu Leu Glu Leu Glu Lys Pro
705 710 715 720
Ala Glu Tyr Ser Ser Met Val Arg Pro Ile Cys Leu Pro Asp Ala Ser
725 730 735
His Val Phe Pro Ala Gly Lys Ala Ile Trp Val Thr Gly Trp Gly His
740 745 750
Thr Gln Tyr Gly Gly Thr Gly Ala Leu Ile Leu Gln Lys Gly Glu Ile
755 760 765
Arg Val Ile Asn Gln Thr Thr Cys Glu Asn Leu Leu Pro Gln Gln Ile
770 775 780
Thr Pro Arg Met Met Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser
785 790 795 800
Cys Gln Gly Asp Ser Gly Gly Pro Leu Ser Ser Val Glu Ala Asp Gly
805 810 815
Arg Ile Phe Gln Ala Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gln
820 825 830
Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp
835 840 845
Ile Lys Glu Asn Thr Gly Val
850 855
<![CDATA[<210> 372]]>
<![CDATA[<211> 1552]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 372]]>
gtgagagcgg agctgcagcc ggagaaagag gaagagggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgaga aagtgaatgg cttggaggaa 300
ggcgtggagt tcctgccagt caacaacgtc aagaaggtgg aaaagcatgg cccggggcgc 360
tgggtggtgc tggcagccgt gctgatcggc ctcctcttgg tcttgctggg gatcggcttc 420
ctggtgtggc atttgcagta ccgggacgtg cgtgtccaga aggtcttcaa tggctacatg 480
aggatcacaa atgagaattt tgtggatgcc tacgagaact ccaactccac tgagtttgta 540
agcctggcca gcaaggtgaa ggacgcgctg aagctgctgt acagcggagt cccattcctg 600
ggcccctacc acaaggagtc ggctgtgacg gccttcagcg agggcagcgt catcgcctac 660
tactggtctg agttcagcat cccgcagcac ctggtggagg aggccgagcg cgtcatggcc 720
gaggagcgcg tagtcatgct gcccccgcgg gcgcgctccc tgaagtcctt tgtggtcacc 780
tcagtggtgg ctttccccac ggactccaaa acagtacaga ggacccagga caacagctgc 840
agctttggcc tgcacgcccg cggtgtggag ctgatgcgct tcaccacgcc cggcttccct 900
gacagcccct accccgctca tgcccgctgc cagtgggccc tgcgggggga cgccgactca 960
gtgctgagcc tcaccttccg cagctttgac cttgcgtcct gcgacgagcg cggcagcgac 1020
ctggtgacgg tgtacaacac cctgagcccc atggagcccc acgccctggt gcagttgtgt 1080
ggcacctacc ctccctccta caacctgacc ttccactcct cccagaacgt cctgctcatc 1140
acactgataa ccaacactga gcggcggcat cccggctttg aggccacctt cttccagctg 1200
cctaggatga gcagctgtgg aggccgctta cgtaaagccc aggggacatt caacagcccc 1260
tactacccag gccactaccc acccaacatt gactgcacat ggaacattga ggtgcccaac 1320
aaccagcatg tgaaggtgcg cttcaaattc ttctacctgc tggagcccgg cgtgcctgcg 1380
ggcacctgcc ccaaggacta cgtggagatc aacggggaga aatactgcgg agagaggtcc 1440
cagttcgtcg tcaccagcaa cagcaacaag atcacagttc gcttccactc agatcagtcc 1500
tacaccgaca ccggcttctt agctgaatac ctctcctacg actccagtga cc 1552
<![CDATA[<210> 373]]>
<![CDATA[<211> 451]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 373]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn Gly Leu
20 25 30
Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn Val Lys Lys Val Glu
35 40 45
Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala Val Leu Ile Gly
50 55 60
Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu Val Trp His Leu Gln
65 70 75 80
Tyr Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile
85 90 95
Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu
100 105 110
Phe Val Ser Leu Ala Ser Lys Val Lys Asp Ala Leu Lys Leu Leu Tyr
115 120 125
Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys Glu Ser Ala Val Thr
130 135 140
Ala Phe Ser Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser
145 150 155 160
Ile Pro Gln His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu
165 170 175
Arg Val Val Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val
180 185 190
Val Thr Ser Val Val Ala Phe Pro Thr Asp Ser Lys Thr Val Gln Arg
195 200 205
Thr Gln Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu
210 215 220
Leu Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala
225 230 235 240
His Ala Arg Cys Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu
245 250 255
Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly
260 265 270
Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His
275 280 285
Ala Leu Val Gln Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr
290 295 300
Phe His Ser Ser Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr
305 310 315 320
Glu Arg Arg His Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg
325 330 335
Met Ser Ser Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn
340 345 350
Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp
355 360 365
Asn Ile Glu Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe
370 375 380
Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp
385 390 395 400
Tyr Val Glu Ile Asn Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gln Phe
405 410 415
Val Val Thr Ser Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp
420 425 430
Gln Ser Tyr Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp
435 440 445
Ser Ser Asp
450
<![CDATA[<210> 374]]>
<![CDATA[<211> 56]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 374]]>
accctgtgaa ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gcccca 56
<![CDATA[<210> 375]]>
<![CDATA[<211> 89]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 375]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttc 89
<![CDATA[<210> 376]]>
<![CDATA[<211> 145]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 376]]>
accctgtgaa ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccactac 60
atctacatcc accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt 120
ctgtgtgaat ggaggggagt gcttc 145
<![CDATA[<210> 377]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 377]]>
Pro Cys Glu Gly Glu Ala Arg Glu Thr Lys Ala Cys Lys Lys Asp Ala
1 5 10 15
Cys Pro Thr Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys
20 25 30
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe
35 40 45
<![CDATA[<210> 378]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 378]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag 150
<![CDATA[<210> 379]]>
<![CDATA[<211> 96]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 379]]>
gatccctgct gggcaccaac agctccacca tggggctggc ctggggacta ggcgtcctgt 60
tcctgatgca tgtgtgtggc accaaccgca ttccag 96
<![CDATA[<210> 380]]>
<![CDATA[<211> 560]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 380]]>
agtctggcgg agacaacagc gtgtttgaca tctttgaact caccggggcc gcccgcaagg 60
ggtctgggcg ccgactggtg aagggccccg acccttccag cccagctttc cgcatcgagg 120
atgccaacct gatcccccct gtgcctgatg acaagttcca agacctggtg gatgctgtgc 180
gggcagaaaa gggtttcctc cttctggcat ccctgaggca gatgaagaag acccggggca 240
cgctgctggc cctggagcgg aaagaccact ctggccaggt cttcagcgtg gtgtccaatg 300
gcaaggcggg caccctggac ctcagcctga ccgtccaagg aaagcagcac gtggtgtctg 360
tggaagaagc tctcctggca accggccagt ggaagagcat caccctgttt gtgcaggaag 420
acagggccca gctgtacatc gactgtgaaa agatggagaa tgctgagttg gacgtcccca 480
tccaaagcgt cttcaccaga gacctggcca gcatcgccag actccgcatc gcaaaggggg 540
gcgtcaatga caatttccag 560
<![CDATA[<210> 381]]>
<![CDATA[<211> 76]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 381]]>
ggggtgctgc agaatgtgag gtttgtcttt ggaaccacac cagaagacat cctcaggaac 60
aaaggctgct ccagct 76
<![CDATA[<210> 382]]>
<![CDATA[<211> 200]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 382]]>
ctaccagtgt cctcctcacc cttgacaaca acgtggtgaa tggttccagc cctgccatcc 60
gcactaacta cattggccac aagacaaagg acttgcaagc catctgcggc atctcctgtg 120
atgagctgtc cagcatggtc ctggaactca ggggcctgcg caccattgtg accacgctgc 180
aggacagcat ccgcaaagtg 200
<![CDATA[<210> 383]]>
<![CDATA[<211> 123]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 383]]>
actgaagaga acaaagagtt ggccaatgag ctgaggcggc ctcccctatg ctatcacaac 60
ggagttcagt acagaaataa cgaggaatgg actgttgata gctgcactga gtgtcactgt 120
cag 123
<![CDATA[<210> 384]]>
<![CDATA[<211> 94]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 384]]>
aactcagtta ccatctgcaa aaaggtgtcc tgccccatca tgccctgctc caatgccaca 60
gttcctgatg gagaatgctg tcctcgctgt tggc 94
<![CDATA[<210> 385]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 385]]>
ccagcgactc tgcggacgat ggctggtctc catggtccga gtggacctcc tgttctacga 60
gctgtggcaa tggaattcag cagcgcggcc gctcctgcga tagcctcaac aaccgatgtg 120
agggctcctc ggtccagaca cggacctgcc acattcagga gtgtgacaag agat 174
<![CDATA[<210> 386]]>
<![CDATA[<211> 177]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 386]]>
ttaaacagga tggtggctgg agccactggt ccccgtggtc atcttgttct gtgacatgtg 60
gtgatggtgt gatcacaagg atccggctct gcaactctcc cagcccccag atgaacggga 120
aaccctgtga aggcgaagcg cgggagacca aagcctgcaa gaaagacgcc tgcccca 177
<![CDATA[<210> 387]]>
<![CDATA[<211> 4547]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 387]]>
ttaaacagga tggtggctgg agccactggt ccccgtggtc atcttgttct gtgacatgtg 60
gtgatggtgt gatcacaagg atccggctct gcaactctcc cagcccccag atgaacggga 120
aaccctgtga aggcgaagcg cgggagacca aagcctgcaa gaaagacgcc tgccccatca 180
atggaggctg gggtccttgg tcaccatggg acatctgttc tgtcacctgt ggaggagggg 240
tacagaaacg tagtcgtctc tgcaacaacc ccacacccca gtttggaggc aaggactgcg 300
ttggtgatgt aacagaaaac cagatctgca acaagcagga ctgtccaatt gatggatgcc 360
tgtccaatcc ctgctttgcc ggcgtgaagt gtactagcta ccctgatggc agctggaaat 420
gtggtgcttg tccccctggt tacagtggaa atggcatcca gtgcacagat gttgatgagt 480
gcaaagaagt gcctgatgcc tgcttcaacc acaatggaga gcaccggtgt gagaacacgg 540
accccggcta caactgcctg ccctgccccc cacgcttcac cggctcacag cccttcggcc 600
agggtgtcga acatgccacg gccaacaaac aggtgtgcaa gccccgtaac ccctgcacgg 660
atgggaccca cgactgcaac aagaacgcca agtgcaacta cctgggccac tatagcgacc 720
ccatgtaccg ctgcgagtgc aagcctggct acgctggcaa tggcatcatc tgcggggagg 780
acacagacct ggatggctgg cccaatgaga acctggtgtg cgtggccaat gcgacttacc 840
actgcaaaaa ggataattgc cccaaccttc ccaactcagg gcaggaagac tatgacaagg 900
atggaattgg tgatgcctgt gatgatgacg atgacaatga taaaattcca gatgacaggg 960
acaactgtcc attccattac aacccagctc agtatgacta tgacagagat gatgtgggag 1020
accgctgtga caactgtccc tacaaccaca acccagatca ggcagacaca gacaacaatg 1080
gggaaggaga cgcctgtgct gcagacattg atggagacgg tatcctcaat gaacgggaca 1140
actgccagta cgtctacaat gtggaccaga gagacactga tatggatggg gttggagatc 1200
agtgtgacaa ttgccccttg gaacacaatc cggatcagct ggactctgac tcagaccgca 1260
ttggagatac ctgtgacaac aatcaggata ttgatgaaga tggccaccag aacaatctgg 1320
acaactgtcc ctatgtgccc aatgccaacc aggctgacca tgacaaagat ggcaagggag 1380
atgcctgtga ccacgatgat gacaacgatg gcattcctga tgacaaggac aactgcagac 1440
tcgtgcccaa tcccgaccag aaggactctg acggcgatgg tcgaggtgat gcctgcaaag 1500
atgattttga ccatgacagt gtgccagaca tcgatgacat ctgtcctgag aatgttgaca 1560
tcagtgagac cgatttccgc cgattccaga tgattcctct ggaccccaaa gggacatccc 1620
aaaatgaccc taactgggtt gtacgccatc agggtaaaga actcgtccag actgtcaact 1680
gtgatcctgg actcgctgta ggttatgatg agtttaatgc tgtggacttc agtggcacct 1740
tcttcatcaa caccgaaagg gacgatgact atgctggatt tgtctttggc taccagtcca 1800
gcagccgctt ttatgttgtg atgtggaagc aagtcaccca gtcctactgg gacaccaacc 1860
ccacgagggc tcagggatac tcgggccttt ctgtgaaagt tgtaaactcc accacagggc 1920
ctggcgagca cctgcggaac gccctgtggc acacaggaaa cacccctggc caggtgcgca 1980
ccctgtggca tgaccctcgt cacataggct ggaaagattt caccgcctac agatggcgtc 2040
tcagccacag gccaaagacg ggtttcatta gagtggtgat gtatgaaggg aagaaaatca 2100
tggctgactc aggacccatc tatgataaaa cctatgctgg tggtagacta gggttgtttg 2160
tcttctctca agaaatggtg ttcttctctg acctgaaata cgaatgtaga gatccctaat 2220
catcaaattg ttgattgaaa gactgatcat aaaccaatgc tggtattgca ccttctggaa 2280
ctatgggctt gagaaaaccc ccaggatcac ttctccttgg cttccttctt ttctgtgctt 2340
gcatcagtgt ggactcctag aacgtgcgac ctgcctcaag aaaatgcagt tttcaaaaac 2400
agactcagca ttcagcctcc aatgaataag acatcttcca agcatataaa caattgcttt 2460
ggtttccttt tgaaaaagca tctacttgct tcagttggga aggtgcccat tccactctgc 2520
ctttgtcaca gagcagggtg ctattgtgag gccatctctg agcagtggac tcaaaagcat 2580
tttcaggcat gtcagagaag ggaggactca ctagaattag caaacaaaac caccctgaca 2640
tcctccttca ggaacacggg gagcagaggc caaagcacta aggggagggc gcatacccga 2700
gacgattgta tgaagaaaat atggaggaac tgttacatgt tcggtactaa gtcattttca 2760
ggggattgaa agactattgc tggatttcat gatgctgact ggcgttagct gattaaccca 2820
tgtaaatagg cacttaaata gaagcaggaa agggagacaa agactggctt ctggacttcc 2880
tccctgatcc ccacccttac tcatcacctg cagtggccag aattagggaa tcagaatcaa 2940
accagtgtaa ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt 3000
ctagatgtag cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca 3060
ccagctgcct cccaaaggag gggcagccgt gcttatattt ttatggttac aatggcacaa 3120
aattattatc aacctaacta aaacattcct tttctctttt ttcctgaatt atcatggagt 3180
tttctaattc tctcttttgg aatgtagatt ttttttaaat gctttacgat gtaaaatatt 3240
tattttttac ttattctgga agatctggct gaaggattat tcatggaaca ggaagaagcg 3300
taaagactat ccatgtcatc tttgttgaga gtcttcgtga ctgtaagatt gtaaatacag 3360
attatttatt aactctgttc tgcctggaaa tttaggcttc atacggaaag tgtttgagag 3420
caagtagttg acatttatca gcaaatctct tgcaagaaca gcacaaggaa aatcagtcta 3480
ataagctgct ctgccccttg tgctcagagt ggatgttatg ggattctttt tttctctgtt 3540
ttatcttttc aagtggaatt agttggttat ccatttgcaa atgttttaaa ttgcaaagaa 3600
agccatgagg tcttcaatac tgttttaccc catcccttgt gcatatttcc agggagaagg 3660
aaagcatata cacttttttc tttcattttt ccaaaagaga aaaaaatgac aaaaggtgaa 3720
acttacatac aaatattacc tcatttgttg tgtgactgag taaagaattt ttggatcaag 3780
cggaaagagt ttaagtgtct aacaaactta aagctactgt agtacctaaa aagtcagtgt 3840
tgtacatagc ataaaaactc tgcagagaag tattcccaat aaggaaatag cattgaaatg 3900
ttaaatacaa tttctgaaag ttatgttttt tttctatcat ctggtatacc attgctttat 3960
ttttataaat tattttctca ttgccattgg aatagatatc tcagattgtg tagatatgct 4020
atttaaataa tttatcagga aatactgcct gtagagttag tatttctatt tttatataat 4080
gtttgcacac tgaattgaag aattgttggt tttttctttt ttttgttttg tttttttttt 4140
tttttttttt tgcttttgac ctcccatttt tactatttgc caataccttt ttctaggaat 4200
gtgctttttt ttgtacacat ttttatccat tttacattct aaagcagtgt aagttgtata 4260
ttactgtttc ttatgtacaa ggaacaacaa taaatcatat ggaaatttat atttatactt 4320
actgtatcca tgcttatttg ttctctactg gctttatgtc atgaagtata tgcgtaaata 4380
ccattcataa atcaatatag catatacaaa aataaattac agtaagtcat agcaacattc 4440
acagtttgta tgtgattgag aaagactgag ttgctcaggc ctaggcttag aatttgctgc 4500
gtttgtggaa taaaagaaca aaatgataca ttagcctgcc atatcaa 4547
<![CDATA[<210> 388]]>
<![CDATA[<211> 6197]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> THBS1之全mRNA多核苷酸序列]]>
<![CDATA[<400> 388]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag gatccctgct gggcaccaac agctccacca 180
tggggctggc ctggggacta ggcgtcctgt tcctgatgca tgtgtgtggc accaaccgca 240
ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggccgccc 300
gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 360
tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac ctggtggatg 420
ctgtgcgggc agaaaagggt ttcctccttc tggcatccct gaggcagatg aagaagaccc 480
ggggcacgct gctggccctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 540
ccaatggcaa ggcgggcacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 600
tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 660
aggaagacag ggcccagctg tacatcgact gtgaaaagat ggagaatgct gagttggacg 720
tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 780
aggggggcgt caatgacaat ttccaggggg tgctgcagaa tgtgaggttt gtctttggaa 840
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtcctcctca 900
cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 960
acaagacaaa ggacttgcaa gccatctgcg gcatctcctg tgatgagctg tccagcatgg 1020
tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc atccgcaaag 1080
tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta tgctatcaca 1140
acggagttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact 1200
gtcagaactc agttaccatc tgcaaaaagg tgtcctgccc catcatgccc tgctccaatg 1260
ccacagttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg 1320
gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc 1380
agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac 1440
ggacctgcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc tggagccact 1500
ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg tgtgatcaca aggatccggc 1560
tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcgggaga 1620
ccaaagcctg caagaaagac gcctgcccca ttaaacagga tggtggctgg agccactggt 1680
ccccgtggtc atcttgttct gtgacatgtg gtgatggtgt gatcacaagg atccggctct 1740
gcaactctcc cagcccccag atgaacggga aaccctgtga aggcgaagcg cgggagacca 1800
aagcctgcaa gaaagacgcc tgccccatca atggaggctg gggtccttgg tcaccatggg 1860
acatctgttc tgtcacctgt ggaggagggg tacagaaacg tagtcgtctc tgcaacaacc 1920
ccacacccca gtttggaggc aaggactgcg ttggtgatgt aacagaaaac cagatctgca 1980
acaagcagga ctgtccaatt gatggatgcc tgtccaatcc ctgctttgcc ggcgtgaagt 2040
gtactagcta ccctgatggc agctggaaat gtggtgcttg tccccctggt tacagtggaa 2100
atggcatcca gtgcacagat gttgatgagt gcaaagaagt gcctgatgcc tgcttcaacc 2160
acaatggaga gcaccggtgt gagaacacgg accccggcta caactgcctg ccctgccccc 2220
cacgcttcac cggctcacag cccttcggcc agggtgtcga acatgccacg gccaacaaac 2280
aggtgtgcaa gccccgtaac ccctgcacgg atgggaccca cgactgcaac aagaacgcca 2340
agtgcaacta cctgggccac tatagcgacc ccatgtaccg ctgcgagtgc aagcctggct 2400
acgctggcaa tggcatcatc tgcggggagg acacagacct ggatggctgg cccaatgaga 2460
acctggtgtg cgtggccaat gcgacttacc actgcaaaaa ggataattgc cccaaccttc 2520
ccaactcagg gcaggaagac tatgacaagg atggaattgg tgatgcctgt gatgatgacg 2580
atgacaatga taaaattcca gatgacaggg acaactgtcc attccattac aacccagctc 2640
agtatgacta tgacagagat gatgtgggag accgctgtga caactgtccc tacaaccaca 2700
acccagatca ggcagacaca gacaacaatg gggaaggaga cgcctgtgct gcagacattg 2760
atggagacgg tatcctcaat gaacgggaca actgccagta cgtctacaat gtggaccaga 2820
gagacactga tatggatggg gttggagatc agtgtgacaa ttgccccttg gaacacaatc 2880
cggatcagct ggactctgac tcagaccgca ttggagatac ctgtgacaac aatcaggata 2940
ttgatgaaga tggccaccag aacaatctgg acaactgtcc ctatgtgccc aatgccaacc 3000
aggctgacca tgacaaagat ggcaagggag atgcctgtga ccacgatgat gacaacgatg 3060
gcattcctga tgacaaggac aactgcagac tcgtgcccaa tcccgaccag aaggactctg 3120
acggcgatgg tcgaggtgat gcctgcaaag atgattttga ccatgacagt gtgccagaca 3180
tcgatgacat ctgtcctgag aatgttgaca tcagtgagac cgatttccgc cgattccaga 3240
tgattcctct ggaccccaaa gggacatccc aaaatgaccc taactgggtt gtacgccatc 3300
agggtaaaga actcgtccag actgtcaact gtgatcctgg actcgctgta ggttatgatg 3360
agtttaatgc tgtggacttc agtggcacct tcttcatcaa caccgaaagg gacgatgact 3420
atgctggatt tgtctttggc taccagtcca gcagccgctt ttatgttgtg atgtggaagc 3480
aagtcaccca gtcctactgg gacaccaacc ccacgagggc tcagggatac tcgggccttt 3540
ctgtgaaagt tgtaaactcc accacagggc ctggcgagca cctgcggaac gccctgtggc 3600
acacaggaaa cacccctggc caggtgcgca ccctgtggca tgaccctcgt cacataggct 3660
ggaaagattt caccgcctac agatggcgtc tcagccacag gccaaagacg ggtttcatta 3720
gagtggtgat gtatgaaggg aagaaaatca tggctgactc aggacccatc tatgataaaa 3780
cctatgctgg tggtagacta gggttgtttg tcttctctca agaaatggtg ttcttctctg 3840
acctgaaata cgaatgtaga gatccctaat catcaaattg ttgattgaaa gactgatcat 3900
aaaccaatgc tggtattgca ccttctggaa ctatgggctt gagaaaaccc ccaggatcac 3960
ttctccttgg cttccttctt ttctgtgctt gcatcagtgt ggactcctag aacgtgcgac 4020
ctgcctcaag aaaatgcagt tttcaaaaac agactcagca ttcagcctcc aatgaataag 4080
acatcttcca agcatataaa caattgcttt ggtttccttt tgaaaaagca tctacttgct 4140
tcagttggga aggtgcccat tccactctgc ctttgtcaca gagcagggtg ctattgtgag 4200
gccatctctg agcagtggac tcaaaagcat tttcaggcat gtcagagaag ggaggactca 4260
ctagaattag caaacaaaac caccctgaca tcctccttca ggaacacggg gagcagaggc 4320
caaagcacta aggggagggc gcatacccga gacgattgta tgaagaaaat atggaggaac 4380
tgttacatgt tcggtactaa gtcattttca ggggattgaa agactattgc tggatttcat 4440
gatgctgact ggcgttagct gattaaccca tgtaaatagg cacttaaata gaagcaggaa 4500
agggagacaa agactggctt ctggacttcc tccctgatcc ccacccttac tcatcacctg 4560
cagtggccag aattagggaa tcagaatcaa accagtgtaa ggcagtgctg gctgccattg 4620
cctggtcaca ttgaaattgg tggcttcatt ctagatgtag cttgtgcaga tgtagcagga 4680
aaataggaaa acctaccatc tcagtgagca ccagctgcct cccaaaggag gggcagccgt 4740
gcttatattt ttatggttac aatggcacaa aattattatc aacctaacta aaacattcct 4800
tttctctttt ttcctgaatt atcatggagt tttctaattc tctcttttgg aatgtagatt 4860
ttttttaaat gctttacgat gtaaaatatt tattttttac ttattctgga agatctggct 4920
gaaggattat tcatggaaca ggaagaagcg taaagactat ccatgtcatc tttgttgaga 4980
gtcttcgtga ctgtaagatt gtaaatacag attatttatt aactctgttc tgcctggaaa 5040
tttaggcttc atacggaaag tgtttgagag caagtagttg acatttatca gcaaatctct 5100
tgcaagaaca gcacaaggaa aatcagtcta ataagctgct ctgccccttg tgctcagagt 5160
ggatgttatg ggattctttt tttctctgtt ttatcttttc aagtggaatt agttggttat 5220
ccatttgcaa atgttttaaa ttgcaaagaa agccatgagg tcttcaatac tgttttaccc 5280
catcccttgt gcatatttcc agggagaagg aaagcatata cacttttttc tttcattttt 5340
ccaaaagaga aaaaaatgac aaaaggtgaa acttacatac aaatattacc tcatttgttg 5400
tgtgactgag taaagaattt ttggatcaag cggaaagagt ttaagtgtct aacaaactta 5460
aagctactgt agtacctaaa aagtcagtgt tgtacatagc ataaaaactc tgcagagaag 5520
tattcccaat aaggaaatag cattgaaatg ttaaatacaa tttctgaaag ttatgttttt 5580
tttctatcat ctggtatacc attgctttat ttttataaat tattttctca ttgccattgg 5640
aatagatatc tcagattgtg tagatatgct atttaaataa tttatcagga aatactgcct 5700
gtagagttag tatttctatt tttatataat gtttgcacac tgaattgaag aattgttggt 5760
tttttctttt ttttgttttg tttttttttt tttttttttt tgcttttgac ctcccatttt 5820
tactatttgc caataccttt ttctaggaat gtgctttttt ttgtacacat ttttatccat 5880
tttacattct aaagcagtgt aagttgtata ttactgtttc ttatgtacaa ggaacaacaa 5940
taaatcatat ggaaatttat atttatactt actgtatcca tgcttatttg ttctctactg 6000
gctttatgtc atgaagtata tgcgtaaata ccattcataa atcaatatag catatacaaa 6060
aataaattac agtaagtcat agcaacattc acagtttgta tgtgattgag aaagactgag 6120
ttgctcaggc ctaggcttag aatttgctgc gtttgtggaa taaaagaaca aaatgataca 6180
ttagcctgcc atatcaa 6197
<![CDATA[<210> 389]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 389]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro
20
<![CDATA[<210> 390]]>
<![CDATA[<211> 186]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 390]]>
Ser Gly Gly Asp Asn Ser Val Phe Asp Ile Phe Glu Leu Thr Gly Ala
1 5 10 15
Ala Arg Lys Gly Ser Gly Arg Arg Leu Val Lys Gly Pro Asp Pro Ser
20 25 30
Ser Pro Ala Phe Arg Ile Glu Asp Ala Asn Leu Ile Pro Pro Val Pro
35 40 45
Asp Asp Lys Phe Gln Asp Leu Val Asp Ala Val Arg Ala Glu Lys Gly
50 55 60
Phe Leu Leu Leu Ala Ser Leu Arg Gln Met Lys Lys Thr Arg Gly Thr
65 70 75 80
Leu Leu Ala Leu Glu Arg Lys Asp His Ser Gly Gln Val Phe Ser Val
85 90 95
Val Ser Asn Gly Lys Ala Gly Thr Leu Asp Leu Ser Leu Thr Val Gln
100 105 110
Gly Lys Gln His Val Val Ser Val Glu Glu Ala Leu Leu Ala Thr Gly
115 120 125
Gln Trp Lys Ser Ile Thr Leu Phe Val Gln Glu Asp Arg Ala Gln Leu
130 135 140
Tyr Ile Asp Cys Glu Lys Met Glu Asn Ala Glu Leu Asp Val Pro Ile
145 150 155 160
Gln Ser Val Phe Thr Arg Asp Leu Ala Ser Ile Ala Arg Leu Arg Ile
165 170 175
Ala Lys Gly Gly Val Asn Asp Asn Phe Gln
180 185
<![CDATA[<210> 391]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 391]]>
Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu Asp
1 5 10 15
Ile Leu Arg Asn Lys Gly Cys Ser Ser
20 25
<![CDATA[<210> 392]]>
<![CDATA[<211> 66]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 392]]>
Thr Ser Val Leu Leu Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser
1 5 10 15
Pro Ala Ile Arg Thr Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln
20 25 30
Ala Ile Cys Gly Ile Ser Cys Asp Glu Leu Ser Ser Met Val Leu Glu
35 40 45
Leu Arg Gly Leu Arg Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg
50 55 60
Lys Val
65
<![CDATA[<210> 393]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 393]]>
Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu
1 5 10 15
Cys Tyr His Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val
20 25 30
Asp Ser Cys Thr Glu Cys His Cys Gln
35 40
<![CDATA[<210> 394]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 394]]>
Asn Ser Val Thr Ile Cys Lys Lys Val Ser Cys Pro Ile Met Pro Cys
1 5 10 15
Ser Asn Ala Thr Val Pro Asp Gly Glu Cys Cys Pro Arg Cys Trp
20 25 30
<![CDATA[<210> 395]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 395]]>
Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser
1 5 10 15
Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln Gln Arg Gly Arg Ser Cys
20 25 30
Asp Ser Leu Asn Asn Arg Cys Glu Gly Ser Ser Val Gln Thr Arg Thr
35 40 45
Cys His Ile Gln Glu Cys Asp Lys Arg
50 55
<![CDATA[<210> 396]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 396]]>
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
1 5 10 15
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
20 25 30
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
35 40 45
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro
50 55
<![CDATA[<210> 397]]>
<![CDATA[<211> 738]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 397]]>
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
1 5 10 15
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
20 25 30
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
35 40 45
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn Gly Gly Trp Gly
50 55 60
Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly Gly Gly Val
65 70 75 80
Gln Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr Pro Gln Phe Gly Gly
85 90 95
Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gln Ile Cys Asn Lys Gln
100 105 110
Asp Cys Pro Ile Asp Gly Cys Leu Ser Asn Pro Cys Phe Ala Gly Val
115 120 125
Lys Cys Thr Ser Tyr Pro Asp Gly Ser Trp Lys Cys Gly Ala Cys Pro
130 135 140
Pro Gly Tyr Ser Gly Asn Gly Ile Gln Cys Thr Asp Val Asp Glu Cys
145 150 155 160
Lys Glu Val Pro Asp Ala Cys Phe Asn His Asn Gly Glu His Arg Cys
165 170 175
Glu Asn Thr Asp Pro Gly Tyr Asn Cys Leu Pro Cys Pro Pro Arg Phe
180 185 190
Thr Gly Ser Gln Pro Phe Gly Gln Gly Val Glu His Ala Thr Ala Asn
195 200 205
Lys Gln Val Cys Lys Pro Arg Asn Pro Cys Thr Asp Gly Thr His Asp
210 215 220
Cys Asn Lys Asn Ala Lys Cys Asn Tyr Leu Gly His Tyr Ser Asp Pro
225 230 235 240
Met Tyr Arg Cys Glu Cys Lys Pro Gly Tyr Ala Gly Asn Gly Ile Ile
245 250 255
Cys Gly Glu Asp Thr Asp Leu Asp Gly Trp Pro Asn Glu Asn Leu Val
260 265 270
Cys Val Ala Asn Ala Thr Tyr His Cys Lys Lys Asp Asn Cys Pro Asn
275 280 285
Leu Pro Asn Ser Gly Gln Glu Asp Tyr Asp Lys Asp Gly Ile Gly Asp
290 295 300
Ala Cys Asp Asp Asp Asp Asp Asn Asp Lys Ile Pro Asp Asp Arg Asp
305 310 315 320
Asn Cys Pro Phe His Tyr Asn Pro Ala Gln Tyr Asp Tyr Asp Arg Asp
325 330 335
Asp Val Gly Asp Arg Cys Asp Asn Cys Pro Tyr Asn His Asn Pro Asp
340 345 350
Gln Ala Asp Thr Asp Asn Asn Gly Glu Gly Asp Ala Cys Ala Ala Asp
355 360 365
Ile Asp Gly Asp Gly Ile Leu Asn Glu Arg Asp Asn Cys Gln Tyr Val
370 375 380
Tyr Asn Val Asp Gln Arg Asp Thr Asp Met Asp Gly Val Gly Asp Gln
385 390 395 400
Cys Asp Asn Cys Pro Leu Glu His Asn Pro Asp Gln Leu Asp Ser Asp
405 410 415
Ser Asp Arg Ile Gly Asp Thr Cys Asp Asn Asn Gln Asp Ile Asp Glu
420 425 430
Asp Gly His Gln Asn Asn Leu Asp Asn Cys Pro Tyr Val Pro Asn Ala
435 440 445
Asn Gln Ala Asp His Asp Lys Asp Gly Lys Gly Asp Ala Cys Asp His
450 455 460
Asp Asp Asp Asn Asp Gly Ile Pro Asp Asp Lys Asp Asn Cys Arg Leu
465 470 475 480
Val Pro Asn Pro Asp Gln Lys Asp Ser Asp Gly Asp Gly Arg Gly Asp
485 490 495
Ala Cys Lys Asp Asp Phe Asp His Asp Ser Val Pro Asp Ile Asp Asp
500 505 510
Ile Cys Pro Glu Asn Val Asp Ile Ser Glu Thr Asp Phe Arg Arg Phe
515 520 525
Gln Met Ile Pro Leu Asp Pro Lys Gly Thr Ser Gln Asn Asp Pro Asn
530 535 540
Trp Val Val Arg His Gln Gly Lys Glu Leu Val Gln Thr Val Asn Cys
545 550 555 560
Asp Pro Gly Leu Ala Val Gly Tyr Asp Glu Phe Asn Ala Val Asp Phe
565 570 575
Ser Gly Thr Phe Phe Ile Asn Thr Glu Arg Asp Asp Asp Tyr Ala Gly
580 585 590
Phe Val Phe Gly Tyr Gln Ser Ser Ser Arg Phe Tyr Val Val Met Trp
595 600 605
Lys Gln Val Thr Gln Ser Tyr Trp Asp Thr Asn Pro Thr Arg Ala Gln
610 615 620
Gly Tyr Ser Gly Leu Ser Val Lys Val Val Asn Ser Thr Thr Gly Pro
625 630 635 640
Gly Glu His Leu Arg Asn Ala Leu Trp His Thr Gly Asn Thr Pro Gly
645 650 655
Gln Val Arg Thr Leu Trp His Asp Pro Arg His Ile Gly Trp Lys Asp
660 665 670
Phe Thr Ala Tyr Arg Trp Arg Leu Ser His Arg Pro Lys Thr Gly Phe
675 680 685
Ile Arg Val Val Met Tyr Glu Gly Lys Lys Ile Met Ala Asp Ser Gly
690 695 700
Pro Ile Tyr Asp Lys Thr Tyr Ala Gly Gly Arg Leu Gly Leu Phe Val
705 710 715 720
Phe Ser Gln Glu Met Val Phe Phe Ser Asp Leu Lys Tyr Glu Cys Arg
725 730 735
Asp Pro
<![CDATA[<210> 398]]>
<![CDATA[<211> 1170]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 398]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro Glu Ser Gly Gly Asp Asn Ser Val Phe Asp
20 25 30
Ile Phe Glu Leu Thr Gly Ala Ala Arg Lys Gly Ser Gly Arg Arg Leu
35 40 45
Val Lys Gly Pro Asp Pro Ser Ser Pro Ala Phe Arg Ile Glu Asp Ala
50 55 60
Asn Leu Ile Pro Pro Val Pro Asp Asp Lys Phe Gln Asp Leu Val Asp
65 70 75 80
Ala Val Arg Ala Glu Lys Gly Phe Leu Leu Leu Ala Ser Leu Arg Gln
85 90 95
Met Lys Lys Thr Arg Gly Thr Leu Leu Ala Leu Glu Arg Lys Asp His
100 105 110
Ser Gly Gln Val Phe Ser Val Val Ser Asn Gly Lys Ala Gly Thr Leu
115 120 125
Asp Leu Ser Leu Thr Val Gln Gly Lys Gln His Val Val Ser Val Glu
130 135 140
Glu Ala Leu Leu Ala Thr Gly Gln Trp Lys Ser Ile Thr Leu Phe Val
145 150 155 160
Gln Glu Asp Arg Ala Gln Leu Tyr Ile Asp Cys Glu Lys Met Glu Asn
165 170 175
Ala Glu Leu Asp Val Pro Ile Gln Ser Val Phe Thr Arg Asp Leu Ala
180 185 190
Ser Ile Ala Arg Leu Arg Ile Ala Lys Gly Gly Val Asn Asp Asn Phe
195 200 205
Gln Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu
210 215 220
Asp Ile Leu Arg Asn Lys Gly Cys Ser Ser Ser Thr Ser Val Leu Leu
225 230 235 240
Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser Pro Ala Ile Arg Thr
245 250 255
Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln Ala Ile Cys Gly Ile
260 265 270
Ser Cys Asp Glu Leu Ser Ser Met Val Leu Glu Leu Arg Gly Leu Arg
275 280 285
Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg Lys Val Thr Glu Glu
290 295 300
Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu Cys Tyr His
305 310 315 320
Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val Asp Ser Cys
325 330 335
Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys Lys Lys Val Ser
340 345 350
Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro Asp Gly Glu Cys
355 360 365
Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro
370 375 380
Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln
385 390 395 400
Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Arg Cys Glu Gly Ser
405 410 415
Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys Asp Lys Arg Phe
420 425 430
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
435 440 445
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
450 455 460
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
465 470 475 480
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn Gly Gly Trp Gly
485 490 495
Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly Gly Gly Val
500 505 510
Gln Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr Pro Gln Phe Gly Gly
515 520 525
Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gln Ile Cys Asn Lys Gln
530 535 540
Asp Cys Pro Ile Asp Gly Cys Leu Ser Asn Pro Cys Phe Ala Gly Val
545 550 555 560
Lys Cys Thr Ser Tyr Pro Asp Gly Ser Trp Lys Cys Gly Ala Cys Pro
565 570 575
Pro Gly Tyr Ser Gly Asn Gly Ile Gln Cys Thr Asp Val Asp Glu Cys
580 585 590
Lys Glu Val Pro Asp Ala Cys Phe Asn His Asn Gly Glu His Arg Cys
595 600 605
Glu Asn Thr Asp Pro Gly Tyr Asn Cys Leu Pro Cys Pro Pro Arg Phe
610 615 620
Thr Gly Ser Gln Pro Phe Gly Gln Gly Val Glu His Ala Thr Ala Asn
625 630 635 640
Lys Gln Val Cys Lys Pro Arg Asn Pro Cys Thr Asp Gly Thr His Asp
645 650 655
Cys Asn Lys Asn Ala Lys Cys Asn Tyr Leu Gly His Tyr Ser Asp Pro
660 665 670
Met Tyr Arg Cys Glu Cys Lys Pro Gly Tyr Ala Gly Asn Gly Ile Ile
675 680 685
Cys Gly Glu Asp Thr Asp Leu Asp Gly Trp Pro Asn Glu Asn Leu Val
690 695 700
Cys Val Ala Asn Ala Thr Tyr His Cys Lys Lys Asp Asn Cys Pro Asn
705 710 715 720
Leu Pro Asn Ser Gly Gln Glu Asp Tyr Asp Lys Asp Gly Ile Gly Asp
725 730 735
Ala Cys Asp Asp Asp Asp Asp Asn Asp Lys Ile Pro Asp Asp Arg Asp
740 745 750
Asn Cys Pro Phe His Tyr Asn Pro Ala Gln Tyr Asp Tyr Asp Arg Asp
755 760 765
Asp Val Gly Asp Arg Cys Asp Asn Cys Pro Tyr Asn His Asn Pro Asp
770 775 780
Gln Ala Asp Thr Asp Asn Asn Gly Glu Gly Asp Ala Cys Ala Ala Asp
785 790 795 800
Ile Asp Gly Asp Gly Ile Leu Asn Glu Arg Asp Asn Cys Gln Tyr Val
805 810 815
Tyr Asn Val Asp Gln Arg Asp Thr Asp Met Asp Gly Val Gly Asp Gln
820 825 830
Cys Asp Asn Cys Pro Leu Glu His Asn Pro Asp Gln Leu Asp Ser Asp
835 840 845
Ser Asp Arg Ile Gly Asp Thr Cys Asp Asn Asn Gln Asp Ile Asp Glu
850 855 860
Asp Gly His Gln Asn Asn Leu Asp Asn Cys Pro Tyr Val Pro Asn Ala
865 870 875 880
Asn Gln Ala Asp His Asp Lys Asp Gly Lys Gly Asp Ala Cys Asp His
885 890 895
Asp Asp Asp Asn Asp Gly Ile Pro Asp Asp Lys Asp Asn Cys Arg Leu
900 905 910
Val Pro Asn Pro Asp Gln Lys Asp Ser Asp Gly Asp Gly Arg Gly Asp
915 920 925
Ala Cys Lys Asp Asp Phe Asp His Asp Ser Val Pro Asp Ile Asp Asp
930 935 940
Ile Cys Pro Glu Asn Val Asp Ile Ser Glu Thr Asp Phe Arg Arg Phe
945 950 955 960
Gln Met Ile Pro Leu Asp Pro Lys Gly Thr Ser Gln Asn Asp Pro Asn
965 970 975
Trp Val Val Arg His Gln Gly Lys Glu Leu Val Gln Thr Val Asn Cys
980 985 990
Asp Pro Gly Leu Ala Val Gly Tyr Asp Glu Phe Asn Ala Val Asp Phe
995 1000 1005
Ser Gly Thr Phe Phe Ile Asn Thr Glu Arg Asp Asp Asp Tyr Ala
1010 1015 1020
Gly Phe Val Phe Gly Tyr Gln Ser Ser Ser Arg Phe Tyr Val Val
1025 1030 1035
Met Trp Lys Gln Val Thr Gln Ser Tyr Trp Asp Thr Asn Pro Thr
1040 1045 1050
Arg Ala Gln Gly Tyr Ser Gly Leu Ser Val Lys Val Val Asn Ser
1055 1060 1065
Thr Thr Gly Pro Gly Glu His Leu Arg Asn Ala Leu Trp His Thr
1070 1075 1080
Gly Asn Thr Pro Gly Gln Val Arg Thr Leu Trp His Asp Pro Arg
1085 1090 1095
His Ile Gly Trp Lys Asp Phe Thr Ala Tyr Arg Trp Arg Leu Ser
1100 1105 1110
His Arg Pro Lys Thr Gly Phe Ile Arg Val Val Met Tyr Glu Gly
1115 1120 1125
Lys Lys Ile Met Ala Asp Ser Gly Pro Ile Tyr Asp Lys Thr Tyr
1130 1135 1140
Ala Gly Gly Arg Leu Gly Leu Phe Val Phe Ser Gln Glu Met Val
1145 1150 1155
Phe Phe Ser Asp Leu Lys Tyr Glu Cys Arg Asp Pro
1160 1165 1170
<![CDATA[<210> 399]]>
<![CDATA[<211> 1650]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 399]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag gatccctgct gggcaccaac agctccacca 180
tggggctggc ctggggacta ggcgtcctgt tcctgatgca tgtgtgtggc accaaccgca 240
ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggccgccc 300
gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 360
tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac ctggtggatg 420
ctgtgcgggc agaaaagggt ttcctccttc tggcatccct gaggcagatg aagaagaccc 480
ggggcacgct gctggccctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 540
ccaatggcaa ggcgggcacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 600
tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 660
aggaagacag ggcccagctg tacatcgact gtgaaaagat ggagaatgct gagttggacg 720
tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 780
aggggggcgt caatgacaat ttccaggggg tgctgcagaa tgtgaggttt gtctttggaa 840
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtcctcctca 900
cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 960
acaagacaaa ggacttgcaa gccatctgcg gcatctcctg tgatgagctg tccagcatgg 1020
tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc atccgcaaag 1080
tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta tgctatcaca 1140
acggagttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact 1200
gtcagaactc agttaccatc tgcaaaaagg tgtcctgccc catcatgccc tgctccaatg 1260
ccacagttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg 1320
gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc 1380
agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac 1440
ggacctgcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc tggagccact 1500
ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg tgtgatcaca aggatccggc 1560
tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcgggaga 1620
ccaaagcctg caagaaagac gcctgcccca 1650
<![CDATA[<210> 400]]>
<![CDATA[<211> 490]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 400]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro Glu Ser Gly Gly Asp Asn Ser Val Phe Asp
20 25 30
Ile Phe Glu Leu Thr Gly Ala Ala Arg Lys Gly Ser Gly Arg Arg Leu
35 40 45
Val Lys Gly Pro Asp Pro Ser Ser Pro Ala Phe Arg Ile Glu Asp Ala
50 55 60
Asn Leu Ile Pro Pro Val Pro Asp Asp Lys Phe Gln Asp Leu Val Asp
65 70 75 80
Ala Val Arg Ala Glu Lys Gly Phe Leu Leu Leu Ala Ser Leu Arg Gln
85 90 95
Met Lys Lys Thr Arg Gly Thr Leu Leu Ala Leu Glu Arg Lys Asp His
100 105 110
Ser Gly Gln Val Phe Ser Val Val Ser Asn Gly Lys Ala Gly Thr Leu
115 120 125
Asp Leu Ser Leu Thr Val Gln Gly Lys Gln His Val Val Ser Val Glu
130 135 140
Glu Ala Leu Leu Ala Thr Gly Gln Trp Lys Ser Ile Thr Leu Phe Val
145 150 155 160
Gln Glu Asp Arg Ala Gln Leu Tyr Ile Asp Cys Glu Lys Met Glu Asn
165 170 175
Ala Glu Leu Asp Val Pro Ile Gln Ser Val Phe Thr Arg Asp Leu Ala
180 185 190
Ser Ile Ala Arg Leu Arg Ile Ala Lys Gly Gly Val Asn Asp Asn Phe
195 200 205
Gln Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu
210 215 220
Asp Ile Leu Arg Asn Lys Gly Cys Ser Ser Ser Thr Ser Val Leu Leu
225 230 235 240
Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser Pro Ala Ile Arg Thr
245 250 255
Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln Ala Ile Cys Gly Ile
260 265 270
Ser Cys Asp Glu Leu Ser Ser Met Val Leu Glu Leu Arg Gly Leu Arg
275 280 285
Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg Lys Val Thr Glu Glu
290 295 300
Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu Cys Tyr His
305 310 315 320
Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val Asp Ser Cys
325 330 335
Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys Lys Lys Val Ser
340 345 350
Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro Asp Gly Glu Cys
355 360 365
Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro
370 375 380
Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln
385 390 395 400
Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Arg Cys Glu Gly Ser
405 410 415
Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys Asp Lys Arg Phe
420 425 430
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
435 440 445
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
450 455 460
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
465 470 475 480
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro
485 490
<![CDATA[<210> 401]]>
<![CDATA[<211> 106]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 401]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccaggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgag 106
<![CDATA[<210> 402]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 402]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga c 101
<![CDATA[<210> 403]]>
<![CDATA[<211> 207]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 403]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccaggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgagctac atctacatcc 120
accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat 180
ggaggggagt gcttcatggt gaaagac 207
<![CDATA[<210> 404]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 404]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val
35 40 45
Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
Phe Met Val Lys Asp
65
<![CDATA[<210> 405]]>
<![CDATA[<211> 254]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 405]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caag 254
<![CDATA[<210> 406]]>
<![CDATA[<211> 262]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 406]]>
gttcgggtct ggcggtactt gaagggcaaa gacctggtgg cccgggagag cctgctggac 60
ggcggcaaca aggtggtgat cagcggcttt ggagaccccc tcatctgtga caaccaggtg 120
tccactgggg acaccaggat cttctttgtg aaccctgcac ccccatacct gtggccagcc 180
cacaagaacg agctgatgct caactccagc ctcatgcgga tcaccctgcg gaacctggag 240
gaggtggagt tctgtgtgga ag 262
<![CDATA[<210> 407]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 407]]>
ataaacccgg gacccacttc actccagtgc ctccgacgcc tcctgatg 48
<![CDATA[<210> 408]]>
<![CDATA[<211> 216]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 408]]>
cgtgccgggg aatgctgtgc ggcttcggcg ccgtgtgcga gcccaacgcg gaggggccgg 60
gccgggcgtc ctgcgtctgc aagaagagcc cgtgccccag cgtggtggcg cctgtgtgtg 120
ggtcggacgc ctccacctac agcaacgaat gcgagctgca gcgggcgcag tgcagccagc 180
agcgccgcat ccgcctgctc agccgcgggc cgtgcg 216
<![CDATA[<210> 409]]>
<![CDATA[<211> 225]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 409]]>
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 60
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgagggga 120
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 180
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtg 225
<![CDATA[<210> 410]]>
<![CDATA[<211> 225]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 410]]>
acccctgtca gggcgccctc cctgacccga gccgcagctg ccgtgtgaac ccgcgcacgc 60
ggcgccctga gatgctccta cggcccgaga gctgccctgc ccggcaggcg ccagtgtgtg 120
gggacgacgg agtcacctac gaaaacgact gtgtcatggg ccgatcgggg gccgcccggg 180
gtctcctcct gcagaaagtg cgctccggcc agtgccaggg tcgag 225
<![CDATA[<210> 411]]>
<![CDATA[<211> 207]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 411]]>
accagtgccc ggagccctgc cggttcaatg ccgtgtgcct gtcccgccgt ggccgtcccc 60
gctgctcctg cgaccgcgtc acctgtgacg gggcctacag gcccgtgtgt gcccaggacg 120
ggcgcacgta tgacagtgat tgctggcggc agcaggctga gtgccggcag cagcgtgcca 180
tccccagcaa gcaccagggc ccgtgtg 207
<![CDATA[<210> 412]]>
<![CDATA[<211> 219]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 412]]>
accaggcccc gtccccatgc ctcggggtgc agtgtgcatt tggggcgacg tgtgctgtga 60
agaacgggca ggcagcgtgt gaatgcctgc aggcgtgctc gagcctctac gatcctgtgt 120
gcggcagcga cggcgtcaca tacggcagcg cgtgcgagct ggaggccacg gcctgtaccc 180
tcgggcggga gatccaggtg gcgcgcaaag gaccctgtg 219
<![CDATA[<210> 413]]>
<![CDATA[<211> 195]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 413]]>
accgctgcgg gcagtgccgc tttggagccc tgtgcgaggc cgagaccggg cgctgcgtgt 60
gcccctctga atgcgtggct ttggcccagc ccgtgtgtgg ctccgacggg cacacgtacc 120
ccagcgagtg catgctgcac gtgcacgcct gcacacacca gatcagcctg cacgtggcct 180
cagctggacc ctgtg 195
<![CDATA[<210> 414]]>
<![CDATA[<211> 201]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 414]]>
agacctgtgg agatgccgtg tgtgcttttg gggctgtgtg ctccgcaggg cagtgtgtgt 60
gtccccggtg tgagcacccc ccgcccggcc ccgtgtgtgg cagcgacggt gtcacctacg 120
gcagtgcctg cgagctacgg gaagccgcct gcctccagca gacacagatc gaggaggccc 180
gggcagggcc gtgcgagcag g 201
<![CDATA[<210> 415]]>
<![CDATA[<211> 149]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 415]]>
ccgagtgcgg ttccggaggc tctggctctg gggaggacgg tgactgtgag caggagctgt 60
gccggcagcg cggtggcatc tgggacgagg actcggagga cgggccgtgt gtctgtgact 120
tcagctgcca gagtgtccca ggcagcccg 149
<![CDATA[<210> 416]]>
<![CDATA[<211> 106]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 416]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccaggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgag 106
<![CDATA[<210> 417]]>
<![CDATA[<211> 5088]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 417]]>
gccccacctt cgccccgctg ccgcctgtgg cccccttaca ctgtgcccag acgccctacg 60
gctgctgcca ggacaatatc accgcagccc ggggcgtggg cctggctggc tgccccagtg 120
cctgccagtg caacccccat ggctcttacg gcggcacctg tgacccagcc acaggccagt 180
gctcctgccg cccaggtgtg gggggcctca ggtgtgaccg ctgtgagcct ggcttctgga 240
actttcgagg catcgtcacc gatggccgga gtggctgtac accctgcagc tgtgatcccc 300
aaggcgccgt gcgggatgac tgtgagcaga tgacggggct gtgctcgtgt aagcccgggg 360
tggctggacc caagtgtggg cagtgtccag acggccgtgc cctgggcccc gcgggctgtg 420
aagctgacgc ttctgcgcct gcgacctgtg cggagatgcg ctgtgagttc ggtgcgcggt 480
gcgtggagga gtctggctca gcccactgtg tctgcccgat gctcacctgt ccagaggcca 540
acgctaccaa ggtctgtggg tcagatggag tcacatacgg caacgagtgt cagctgaaga 600
ccatcgcctg ccgccagggc ctgcaaatct ctatccagag cctgggcccg tgccaggagg 660
ctgttgctcc cagcactcac ccgacatctg cctccgtgac tgtgaccacc ccagggctcc 720
tcctgagcca ggcactgccg gccccccccg gcgccctccc cctggctccc agcagtaccg 780
cacacagcca gaccacccct ccgccctcat cacgacctcg gaccactgcc agcgtcccca 840
ggaccaccgt gtggcccgtg ctgacggtgc cccccacggc accctcccct gcacccagcc 900
tggtggcgtc cgcctttggt gaatctggca gcactgatgg aagcagcgat gaggaactga 960
gcggggacca ggaggccagt gggggtggct ctggggggct cgagcccttg gagggcagca 1020
gcgtggccac ccctgggcca cctgtcgaga gggcttcctg ctacaactcc gcgttgggct 1080
gctgctctga tgggaagacg ccctcgctgg acgcagaggg ctccaactgc cccgccacca 1140
aggtgttcca gggcgtcctg gagctggagg gcgtcgaggg ccaggagctg ttctacacgc 1200
ccgagatggc tgaccccaag tcagaactgt tcggggagac agccaggagc attgagagca 1260
ccctggacga cctcttccgg aattcagacg tcaagaagga ttttcggagt gtccgcttgc 1320
gggacctggg gcccggcaaa tccgtccgcg ccattgtgga tgtgcacttt gaccccacca 1380
cagccttcag ggcacccgac gtggcccggg ccctgctccg gcagatccag gtgtccaggc 1440
gccggtcctt gggggtgagg cggccgctgc aggagcacgt gcgatttatg gactttgact 1500
ggtttcctgc gtttatcacg ggggccacgt caggagccat tgctgcggga gccacggcca 1560
gagccaccac tgcatcgcgc ctgccgtcct ctgctgtgac ccctcgggcc ccgcacccca 1620
gtcacacaag ccagcccgtt gccaagacca cggcagcccc caccacacgt cggcccccca 1680
ccactgcccc cagccgtgtg cccggacgtc ggcccccggc cccccagcag cctccaaagc 1740
cctgtgactc acagccctgc ttccacgggg ggacctgcca ggactgggca ttgggcgggg 1800
gcttcacctg cagctgcccg gcaggcaggg gaggcgccgt ctgtgagaag gtgcttggcg 1860
cccctgtgcc ggccttcgag ggccgctcct tcctggcctt ccccactctc cgcgcctacc 1920
acacgctgcg cctggcactg gaattccggg cgctggagcc tcaggggctg ctgctgtaca 1980
atggcaacgc ccggggcaag gacttcctgg cattggcgct gctagatggc cgcgtgcagc 2040
tcaggtttga cacaggttcg gggccggcgg tgctgaccag tgccgtgccg gtagagccgg 2100
gccagtggca ccgcctggag ctgtcccggc actggcgccg gggcaccctc tcggtggatg 2160
gtgagacccc tgttctgggc gagagtccca gtggcaccga cggcctcaac ctggacacag 2220
acctctttgt gggcggcgta cccgaggacc aggctgccgt ggcgctggag cggaccttcg 2280
tgggcgccgg cctgaggggg tgcatccgtt tgctggacgt caacaaccag cgcctggagc 2340
ttggcattgg gccgggggct gccacccgag gctctggcgt gggcgagtgc ggggaccacc 2400
cctgcctgcc caacccctgc catggcgggg ccccatgcca gaacctggag gctggaaggt 2460
tccattgcca gtgcccgccc ggccgcgtcg gaccaacctg tgccgatgag aagagcccct 2520
gccagcccaa cccctgccat ggggcggcgc cctgccgtgt gctgcccgag ggtggtgctc 2580
agtgcgagtg ccccctgggg cgtgagggca ccttctgcca gacagcctcg gggcaggacg 2640
gctctgggcc cttcctggct gacttcaacg gcttctccca cctggagctg agaggcctgc 2700
acacctttgc acgggacctg ggggagaaga tggcgctgga ggtcgtgttc ctggcacgag 2760
gccccagcgg cctcctgctc tacaacgggc agaagacgga cggcaagggg gacttcgtgt 2820
cgctggcact gcgggaccgc cgcctggagt tccgctacga cctgggcaag ggggcagcgg 2880
tcatcaggag cagggagcca gtcaccctgg gagcctggac cagggtctca ctggagcgaa 2940
acggccgcaa gggtgccctg cgtgtgggcg acggcccccg tgtgttgggg gagtccccga 3000
aatcccgcaa ggttccgcac accgtcctca acctgaagga gccgctctac gtagggggcg 3060
ctcccgactt cagcaagctg gcccgtgctg ctgccgtgtc ctctggcttc gacggtgcca 3120
tccagctggt ctccctcgga ggccgccagc tgctgacccc ggagcacgtg ctgcggcagg 3180
tggacgtcac gtcctttgca ggtcacccct gcacccgggc ctcaggccac ccctgcctca 3240
atggggcctc ctgcgtcccg agggaggctg cctatgtgtg cctgtgtccc gggggattct 3300
caggaccgca ctgcgagaag gggctggtgg agaagtcagc gggggacgtg gataccttgg 3360
cctttgacgg gcggaccttt gtcgagtacc tcaacgctgt gaccgagagc gaactggcca 3420
atgagatccc cgtccccgaa actctggatt ccggggccct tcacagcgag aaggcactgc 3480
agagcaacca ctttgaactg agcctgcgca ctgaggccac gcaggggctg gtgctctgga 3540
gtggcaaggc cacggagcgg gcagactatg tggcactggc cattgtggac gggcacctgc 3600
aactgagcta caacctgggc tcccagcccg tggtgctgcg ttccaccgtg cccgtcaaca 3660
ccaaccgctg gttgcgggtc gtggcacata gggagcagag ggaaggttcc ctgcaggtgg 3720
gcaatgaggc ccctgtgacc ggctcctccc cgctgggcgc cacgcagctg gacactgatg 3780
gagccctgtg gcttgggggc ctgccggagc tgcccgtggg cccagcactg cccaaggcct 3840
acggcacagg ctttgtgggc tgcttgcggg acgtggtggt gggccggcac ccgctgcacc 3900
tgctggagga cgccgtcacc aagccagagc tgcggccctg ccccacccca tgagctggca 3960
ccagagcccc gcgcccgctg taattatttt ctatttttgt aaacttgttg ctttttgata 4020
tgattttctt gcctgagtgt tggccggagg gactgctggc ccggcctccc ttccgtccag 4080
gcagccgtgc tgcagacaga cctagtgccg agggatggac aggcgaggtg gcagcgtgga 4140
gggctcggcg tggatggcag cctcaggaca cacacccctg cctcaaggtg ctgagccccc 4200
gccttgcact gcgcctgccc cacggtgtcc ccgccgggaa gcagccccgg ctcctgaatc 4260
accctcgctc cgtcaggcgg gactcgtgtc ccagagagga aggggctgct gaggtctgat 4320
ggggcccttc ctccgggtga ccccacaggg cctttccaag cccccatttg agctgctcct 4380
tcctgtgtgt gctctgggcc ctgcctcggc ctcctgcgcc aatactgtga cttccaaaca 4440
atgttactgc tgggcacagc tctgcgttgc tcccgtgctg cctgcgccag ccccaggctg 4500
ctgaggagca gaggccagac cagggccgat ctgggtgtcc tgaccctcag ctggccctgc 4560
ccagccaccc tggacgtgac cgtatccctc tgccacaccc caggccctgc gaggggctat 4620
cgagaggagc tcactgtggg atggggttga cctctgccgc ctgcctgggt atctgggcct 4680
ggccatggct gtgttcttca tgtgttgatt ttatttgacc cctggagtgg tgggtctcat 4740
ctttcccatc tcgcctgaga gcggctgagg gctgcctcac tgcaaatcct ccccacagcg 4800
tcagtgaaag tcgtccttgt ctcagaatga ccaggggcca gccagtgtct gaccaaggtc 4860
aaggggcagg tgcagaggtg gcagggatgg ctccgaagcc agaaatgcct taaactgcaa 4920
cgtcccgtcc cttccccacc cccatcccat ccccaccccc agccccagcc cagtcctcct 4980
aggagcagga cccgatgaag cgggcggcgg tggggctggg tgccgtgtta ctaactctag 5040
tatgtttctg tgtcaatcgc tgtgaaataa agtctgaaaa ctttaaaa 5088
<![CDATA[<210> 418]]>
<![CDATA[<211> 7395]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> AGRN之全mRNA多核苷酸序列]]>
<![CDATA[<400> 418]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caaggttcgg gtctggcggt acttgaaggg caaagacctg gtggcccggg 300
agagcctgct ggacggcggc aacaaggtgg tgatcagcgg ctttggagac cccctcatct 360
gtgacaacca ggtgtccact ggggacacca ggatcttctt tgtgaaccct gcacccccat 420
acctgtggcc agcccacaag aacgagctga tgctcaactc cagcctcatg cggatcaccc 480
tgcggaacct ggaggaggtg gagttctgtg tggaagataa acccgggacc cacttcactc 540
cagtgcctcc gacgcctcct gatgcgtgcc ggggaatgct gtgcggcttc ggcgccgtgt 600
gcgagcccaa cgcggagggg ccgggccggg cgtcctgcgt ctgcaagaag agcccgtgcc 660
ccagcgtggt ggcgcctgtg tgtgggtcgg acgcctccac ctacagcaac gaatgcgagc 720
tgcagcgggc gcagtgcagc cagcagcgcc gcatccgcct gctcagccgc gggccgtgcg 780
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 840
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgagggga 900
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 960
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtgacccc tgtcagggcg 1020
ccctccctga cccgagccgc agctgccgtg tgaacccgcg cacgcggcgc cctgagatgc 1080
tcctacggcc cgagagctgc cctgcccggc aggcgccagt gtgtggggac gacggagtca 1140
cctacgaaaa cgactgtgtc atgggccgat cgggggccgc ccggggtctc ctcctgcaga 1200
aagtgcgctc cggccagtgc cagggtcgag accagtgccc ggagccctgc cggttcaatg 1260
ccgtgtgcct gtcccgccgt ggccgtcccc gctgctcctg cgaccgcgtc acctgtgacg 1320
gggcctacag gcccgtgtgt gcccaggacg ggcgcacgta tgacagtgat tgctggcggc 1380
agcaggctga gtgccggcag cagcgtgcca tccccagcaa gcaccagggc ccgtgtgacc 1440
aggccccgtc cccatgcctc ggggtgcagt gtgcatttgg ggcgacgtgt gctgtgaaga 1500
acgggcaggc agcgtgtgaa tgcctgcagg cgtgctcgag cctctacgat cctgtgtgcg 1560
gcagcgacgg cgtcacatac ggcagcgcgt gcgagctgga ggccacggcc tgtaccctcg 1620
ggcgggagat ccaggtggcg cgcaaaggac cctgtgaccg ctgcgggcag tgccgctttg 1680
gagccctgtg cgaggccgag accgggcgct gcgtgtgccc ctctgaatgc gtggctttgg 1740
cccagcccgt gtgtggctcc gacgggcaca cgtaccccag cgagtgcatg ctgcacgtgc 1800
acgcctgcac acaccagatc agcctgcacg tggcctcagc tggaccctgt gagacctgtg 1860
gagatgccgt gtgtgctttt ggggctgtgt gctccgcagg gcagtgtgtg tgtccccggt 1920
gtgagcaccc cccgcccggc cccgtgtgtg gcagcgacgg tgtcacctac ggcagtgcct 1980
gcgagctacg ggaagccgcc tgcctccagc agacacagat cgaggaggcc cgggcagggc 2040
cgtgcgagca ggccgagtgc ggttccggag gctctggctc tggggaggac ggtgactgtg 2100
agcaggagct gtgccggcag cgcggtggca tctgggacga ggactcggag gacgggccgt 2160
gtgtctgtga cttcagctgc cagagtgtcc caggcagccc ggtgtgcggc tcagatgggg 2220
tcacctacag caccgagtgt gagctgaaga aggccaggtg tgagtcacag cgagggctct 2280
acgtagcggc ccagggagcc tgccgaggcc ccaccttcgc cccgctgccg cctgtggccc 2340
ccttacactg tgcccagacg ccctacggct gctgccagga caatatcacc gcagcccggg 2400
gcgtgggcct ggctggctgc cccagtgcct gccagtgcaa cccccatggc tcttacggcg 2460
gcacctgtga cccagccaca ggccagtgct cctgccgccc aggtgtgggg ggcctcaggt 2520
gtgaccgctg tgagcctggc ttctggaact ttcgaggcat cgtcaccgat ggccggagtg 2580
gctgtacacc ctgcagctgt gatccccaag gcgccgtgcg ggatgactgt gagcagatga 2640
cggggctgtg ctcgtgtaag cccggggtgg ctggacccaa gtgtgggcag tgtccagacg 2700
gccgtgccct gggccccgcg ggctgtgaag ctgacgcttc tgcgcctgcg acctgtgcgg 2760
agatgcgctg tgagttcggt gcgcggtgcg tggaggagtc tggctcagcc cactgtgtct 2820
gcccgatgct cacctgtcca gaggccaacg ctaccaaggt ctgtgggtca gatggagtca 2880
catacggcaa cgagtgtcag ctgaagacca tcgcctgccg ccagggcctg caaatctcta 2940
tccagagcct gggcccgtgc caggaggctg ttgctcccag cactcacccg acatctgcct 3000
ccgtgactgt gaccacccca gggctcctcc tgagccaggc actgccggcc ccccccggcg 3060
ccctccccct ggctcccagc agtaccgcac acagccagac cacccctccg ccctcatcac 3120
gacctcggac cactgccagc gtccccagga ccaccgtgtg gcccgtgctg acggtgcccc 3180
ccacggcacc ctcccctgca cccagcctgg tggcgtccgc ctttggtgaa tctggcagca 3240
ctgatggaag cagcgatgag gaactgagcg gggaccagga ggccagtggg ggtggctctg 3300
gggggctcga gcccttggag ggcagcagcg tggccacccc tgggccacct gtcgagaggg 3360
cttcctgcta caactccgcg ttgggctgct gctctgatgg gaagacgccc tcgctggacg 3420
cagagggctc caactgcccc gccaccaagg tgttccaggg cgtcctggag ctggagggcg 3480
tcgagggcca ggagctgttc tacacgcccg agatggctga ccccaagtca gaactgttcg 3540
gggagacagc caggagcatt gagagcaccc tggacgacct cttccggaat tcagacgtca 3600
agaaggattt tcggagtgtc cgcttgcggg acctggggcc cggcaaatcc gtccgcgcca 3660
ttgtggatgt gcactttgac cccaccacag ccttcagggc acccgacgtg gcccgggccc 3720
tgctccggca gatccaggtg tccaggcgcc ggtccttggg ggtgaggcgg ccgctgcagg 3780
agcacgtgcg atttatggac tttgactggt ttcctgcgtt tatcacgggg gccacgtcag 3840
gagccattgc tgcgggagcc acggccagag ccaccactgc atcgcgcctg ccgtcctctg 3900
ctgtgacccc tcgggccccg caccccagtc acacaagcca gcccgttgcc aagaccacgg 3960
cagcccccac cacacgtcgg ccccccacca ctgcccccag ccgtgtgccc ggacgtcggc 4020
ccccggcccc ccagcagcct ccaaagccct gtgactcaca gccctgcttc cacgggggga 4080
cctgccagga ctgggcattg ggcgggggct tcacctgcag ctgcccggca ggcaggggag 4140
gcgccgtctg tgagaaggtg cttggcgccc ctgtgccggc cttcgagggc cgctccttcc 4200
tggccttccc cactctccgc gcctaccaca cgctgcgcct ggcactggaa ttccgggcgc 4260
tggagcctca ggggctgctg ctgtacaatg gcaacgcccg gggcaaggac ttcctggcat 4320
tggcgctgct agatggccgc gtgcagctca ggtttgacac aggttcgggg ccggcggtgc 4380
tgaccagtgc cgtgccggta gagccgggcc agtggcaccg cctggagctg tcccggcact 4440
ggcgccgggg caccctctcg gtggatggtg agacccctgt tctgggcgag agtcccagtg 4500
gcaccgacgg cctcaacctg gacacagacc tctttgtggg cggcgtaccc gaggaccagg 4560
ctgccgtggc gctggagcgg accttcgtgg gcgccggcct gagggggtgc atccgtttgc 4620
tggacgtcaa caaccagcgc ctggagcttg gcattgggcc gggggctgcc acccgaggct 4680
ctggcgtggg cgagtgcggg gaccacccct gcctgcccaa cccctgccat ggcggggccc 4740
catgccagaa cctggaggct ggaaggttcc attgccagtg cccgcccggc cgcgtcggac 4800
caacctgtgc cgatgagaag agcccctgcc agcccaaccc ctgccatggg gcggcgccct 4860
gccgtgtgct gcccgagggt ggtgctcagt gcgagtgccc cctggggcgt gagggcacct 4920
tctgccagac agcctcgggg caggacggct ctgggccctt cctggctgac ttcaacggct 4980
tctcccacct ggagctgaga ggcctgcaca cctttgcacg ggacctgggg gagaagatgg 5040
cgctggaggt cgtgttcctg gcacgaggcc ccagcggcct cctgctctac aacgggcaga 5100
agacggacgg caagggggac ttcgtgtcgc tggcactgcg ggaccgccgc ctggagttcc 5160
gctacgacct gggcaagggg gcagcggtca tcaggagcag ggagccagtc accctgggag 5220
cctggaccag ggtctcactg gagcgaaacg gccgcaaggg tgccctgcgt gtgggcgacg 5280
gcccccgtgt gttgggggag tccccgaaat cccgcaaggt tccgcacacc gtcctcaacc 5340
tgaaggagcc gctctacgta gggggcgctc ccgacttcag caagctggcc cgtgctgctg 5400
ccgtgtcctc tggcttcgac ggtgccatcc agctggtctc cctcggaggc cgccagctgc 5460
tgaccccgga gcacgtgctg cggcaggtgg acgtcacgtc ctttgcaggt cacccctgca 5520
cccgggcctc aggccacccc tgcctcaatg gggcctcctg cgtcccgagg gaggctgcct 5580
atgtgtgcct gtgtcccggg ggattctcag gaccgcactg cgagaagggg ctggtggaga 5640
agtcagcggg ggacgtggat accttggcct ttgacgggcg gacctttgtc gagtacctca 5700
acgctgtgac cgagagcgaa ctggccaatg agatccccgt ccccgaaact ctggattccg 5760
gggcccttca cagcgagaag gcactgcaga gcaaccactt tgaactgagc ctgcgcactg 5820
aggccacgca ggggctggtg ctctggagtg gcaaggccac ggagcgggca gactatgtgg 5880
cactggccat tgtggacggg cacctgcaac tgagctacaa cctgggctcc cagcccgtgg 5940
tgctgcgttc caccgtgccc gtcaacacca accgctggtt gcgggtcgtg gcacataggg 6000
agcagaggga aggttccctg caggtgggca atgaggcccc tgtgaccggc tcctccccgc 6060
tgggcgccac gcagctggac actgatggag ccctgtggct tgggggcctg ccggagctgc 6120
ccgtgggccc agcactgccc aaggcctacg gcacaggctt tgtgggctgc ttgcgggacg 6180
tggtggtggg ccggcacccg ctgcacctgc tggaggacgc cgtcaccaag ccagagctgc 6240
ggccctgccc caccccatga gctggcacca gagccccgcg cccgctgtaa ttattttcta 6300
tttttgtaaa cttgttgctt tttgatatga ttttcttgcc tgagtgttgg ccggagggac 6360
tgctggcccg gcctcccttc cgtccaggca gccgtgctgc agacagacct agtgccgagg 6420
gatggacagg cgaggtggca gcgtggaggg ctcggcgtgg atggcagcct caggacacac 6480
acccctgcct caaggtgctg agcccccgcc ttgcactgcg cctgccccac ggtgtccccg 6540
ccgggaagca gccccggctc ctgaatcacc ctcgctccgt caggcgggac tcgtgtccca 6600
gagaggaagg ggctgctgag gtctgatggg gcccttcctc cgggtgaccc cacagggcct 6660
ttccaagccc ccatttgagc tgctccttcc tgtgtgtgct ctgggccctg cctcggcctc 6720
ctgcgccaat actgtgactt ccaaacaatg ttactgctgg gcacagctct gcgttgctcc 6780
cgtgctgcct gcgccagccc caggctgctg aggagcagag gccagaccag ggccgatctg 6840
ggtgtcctga ccctcagctg gccctgccca gccaccctgg acgtgaccgt atccctctgc 6900
cacaccccag gccctgcgag gggctatcga gaggagctca ctgtgggatg gggttgacct 6960
ctgccgcctg cctgggtatc tgggcctggc catggctgtg ttcttcatgt gttgatttta 7020
tttgacccct ggagtggtgg gtctcatctt tcccatctcg cctgagagcg gctgagggct 7080
gcctcactgc aaatcctccc cacagcgtca gtgaaagtcg tccttgtctc agaatgacca 7140
ggggccagcc agtgtctgac caaggtcaag gggcaggtgc agaggtggca gggatggctc 7200
cgaagccaga aatgccttaa actgcaacgt cccgtccctt ccccaccccc atcccatccc 7260
cacccccagc cccagcccag tcctcctagg agcaggaccc gatgaagcgg gcggcggtgg 7320
ggctgggtgc cgtgttacta actctagtat gtttctgtgt caatcgctgt gaaataaagt 7380
ctgaaaactt taaaa 7395
<![CDATA[<210> 419]]>
<![CDATA[<211> 67]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 419]]>
Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu
1 5 10 15
Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro
20 25 30
Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Ala Asn Val Val Leu Thr
35 40 45
Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr
50 55 60
Ser Cys Lys
65
<![CDATA[<210> 420]]>
<![CDATA[<211> 87]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 420]]>
Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val Ala Arg Glu
1 5 10 15
Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly Phe Gly Asp
20 25 30
Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr Arg Ile Phe
35 40 45
Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His Lys Asn Glu
50 55 60
Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg Asn Leu Glu
65 70 75 80
Glu Val Glu Phe Cys Val Glu
85
<![CDATA[<210> 421]]>
<![CDATA[<211> 15]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 421]]>
Lys Pro Gly Thr His Phe Thr Pro Val Pro Pro Thr Pro Pro Asp
1 5 10 15
<![CDATA[<210> 422]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 422]]>
Cys Arg Gly Met Leu Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala
1 5 10 15
Glu Gly Pro Gly Arg Ala Ser Cys Val Cys Lys Lys Ser Pro Cys Pro
20 25 30
Ser Val Val Ala Pro Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn
35 40 45
Glu Cys Glu Leu Gln Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg
50 55 60
Leu Leu Ser Arg Gly Pro Cys
65 70
<![CDATA[<210> 423]]>
<![CDATA[<211> 74]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 423]]>
Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly Ser Thr Cys
1 5 10 15
Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys Pro Ala Thr
20 25 30
Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp Gly Ala Asp
35 40 45
Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala Arg Gln Glu
50 55 60
Asn Val Phe Lys Lys Phe Asp Gly Pro Cys
65 70
<![CDATA[<210> 424]]>
<![CDATA[<211> 74]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 424]]>
Pro Cys Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn
1 5 10 15
Pro Arg Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro
20 25 30
Ala Arg Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn
35 40 45
Asp Cys Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln
50 55 60
Lys Val Arg Ser Gly Gln Cys Gln Gly Arg
65 70
<![CDATA[<210> 425]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 425]]>
Gln Cys Pro Glu Pro Cys Arg Phe Asn Ala Val Cys Leu Ser Arg Arg
1 5 10 15
Gly Arg Pro Arg Cys Ser Cys Asp Arg Val Thr Cys Asp Gly Ala Tyr
20 25 30
Arg Pro Val Cys Ala Gln Asp Gly Arg Thr Tyr Asp Ser Asp Cys Trp
35 40 45
Arg Gln Gln Ala Glu Cys Arg Gln Gln Arg Ala Ile Pro Ser Lys His
50 55 60
Gln Gly Pro Cys
65
<![CDATA[<210> 426]]>
<![CDATA[<211> 72]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 426]]>
Gln Ala Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr
1 5 10 15
Cys Ala Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys
20 25 30
Ser Ser Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly
35 40 45
Ser Ala Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile
50 55 60
Gln Val Ala Arg Lys Gly Pro Cys
65 70
<![CDATA[<210> 427]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 427]]>
Arg Cys Gly Gln Cys Arg Phe Gly Ala Leu Cys Glu Ala Glu Thr Gly
1 5 10 15
Arg Cys Val Cys Pro Ser Glu Cys Val Ala Leu Ala Gln Pro Val Cys
20 25 30
Gly Ser Asp Gly His Thr Tyr Pro Ser Glu Cys Met Leu His Val His
35 40 45
Ala Cys Thr His Gln Ile Ser Leu His Val Ala Ser Ala Gly Pro Cys
50 55 60
<![CDATA[<210> 428]]>
<![CDATA[<211> 66]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 428]]>
Thr Cys Gly Asp Ala Val Cys Ala Phe Gly Ala Val Cys Ser Ala Gly
1 5 10 15
Gln Cys Val Cys Pro Arg Cys Glu His Pro Pro Pro Gly Pro Val Cys
20 25 30
Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala Cys Glu Leu Arg Glu Ala
35 40 45
Ala Cys Leu Gln Gln Thr Gln Ile Glu Glu Ala Arg Ala Gly Pro Cys
50 55 60
Glu Gln
65
<![CDATA[<210> 429]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 429]]>
Glu Cys Gly Ser Gly Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu
1 5 10 15
Gln Glu Leu Cys Arg Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu
20 25 30
Asp Gly Pro Cys Val Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser
35 40 45
Pro
<![CDATA[<210> 430]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 430]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg
35
<![CDATA[<210> 431]]>
<![CDATA[<211> 1352]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 431]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg Gly Pro Thr Phe Ala Pro Leu Pro Pro Val Ala Pro Leu
35 40 45
His Cys Ala Gln Thr Pro Tyr Gly Cys Cys Gln Asp Asn Ile Thr Ala
50 55 60
Ala Arg Gly Val Gly Leu Ala Gly Cys Pro Ser Ala Cys Gln Cys Asn
65 70 75 80
Pro His Gly Ser Tyr Gly Gly Thr Cys Asp Pro Ala Thr Gly Gln Cys
85 90 95
Ser Cys Arg Pro Gly Val Gly Gly Leu Arg Cys Asp Arg Cys Glu Pro
100 105 110
Gly Phe Trp Asn Phe Arg Gly Ile Val Thr Asp Gly Arg Ser Gly Cys
115 120 125
Thr Pro Cys Ser Cys Asp Pro Gln Gly Ala Val Arg Asp Asp Cys Glu
130 135 140
Gln Met Thr Gly Leu Cys Ser Cys Lys Pro Gly Val Ala Gly Pro Lys
145 150 155 160
Cys Gly Gln Cys Pro Asp Gly Arg Ala Leu Gly Pro Ala Gly Cys Glu
165 170 175
Ala Asp Ala Ser Ala Pro Ala Thr Cys Ala Glu Met Arg Cys Glu Phe
180 185 190
Gly Ala Arg Cys Val Glu Glu Ser Gly Ser Ala His Cys Val Cys Pro
195 200 205
Met Leu Thr Cys Pro Glu Ala Asn Ala Thr Lys Val Cys Gly Ser Asp
210 215 220
Gly Val Thr Tyr Gly Asn Glu Cys Gln Leu Lys Thr Ile Ala Cys Arg
225 230 235 240
Gln Gly Leu Gln Ile Ser Ile Gln Ser Leu Gly Pro Cys Gln Glu Ala
245 250 255
Val Ala Pro Ser Thr His Pro Thr Ser Ala Ser Val Thr Val Thr Thr
260 265 270
Pro Gly Leu Leu Leu Ser Gln Ala Leu Pro Ala Pro Pro Gly Ala Leu
275 280 285
Pro Leu Ala Pro Ser Ser Thr Ala His Ser Gln Thr Thr Pro Pro Pro
290 295 300
Ser Ser Arg Pro Arg Thr Thr Ala Ser Val Pro Arg Thr Thr Val Trp
305 310 315 320
Pro Val Leu Thr Val Pro Pro Thr Ala Pro Ser Pro Ala Pro Ser Leu
325 330 335
Val Ala Ser Ala Phe Gly Glu Ser Gly Ser Thr Asp Gly Ser Ser Asp
340 345 350
Glu Glu Leu Ser Gly Asp Gln Glu Ala Ser Gly Gly Gly Ser Gly Gly
355 360 365
Leu Glu Pro Leu Glu Gly Ser Ser Val Ala Thr Pro Gly Pro Pro Val
370 375 380
Glu Arg Ala Ser Cys Tyr Asn Ser Ala Leu Gly Cys Cys Ser Asp Gly
385 390 395 400
Lys Thr Pro Ser Leu Asp Ala Glu Gly Ser Asn Cys Pro Ala Thr Lys
405 410 415
Val Phe Gln Gly Val Leu Glu Leu Glu Gly Val Glu Gly Gln Glu Leu
420 425 430
Phe Tyr Thr Pro Glu Met Ala Asp Pro Lys Ser Glu Leu Phe Gly Glu
435 440 445
Thr Ala Arg Ser Ile Glu Ser Thr Leu Asp Asp Leu Phe Arg Asn Ser
450 455 460
Asp Val Lys Lys Asp Phe Arg Ser Val Arg Leu Arg Asp Leu Gly Pro
465 470 475 480
Gly Lys Ser Val Arg Ala Ile Val Asp Val His Phe Asp Pro Thr Thr
485 490 495
Ala Phe Arg Ala Pro Asp Val Ala Arg Ala Leu Leu Arg Gln Ile Gln
500 505 510
Val Ser Arg Arg Arg Ser Leu Gly Val Arg Arg Pro Leu Gln Glu His
515 520 525
Val Arg Phe Met Asp Phe Asp Trp Phe Pro Ala Phe Ile Thr Gly Ala
530 535 540
Thr Ser Gly Ala Ile Ala Ala Gly Ala Thr Ala Arg Ala Thr Thr Ala
545 550 555 560
Ser Arg Leu Pro Ser Ser Ala Val Thr Pro Arg Ala Pro His Pro Ser
565 570 575
His Thr Ser Gln Pro Val Ala Lys Thr Thr Ala Ala Pro Thr Thr Arg
580 585 590
Arg Pro Pro Thr Thr Ala Pro Ser Arg Val Pro Gly Arg Arg Pro Pro
595 600 605
Ala Pro Gln Gln Pro Pro Lys Pro Cys Asp Ser Gln Pro Cys Phe His
610 615 620
Gly Gly Thr Cys Gln Asp Trp Ala Leu Gly Gly Gly Phe Thr Cys Ser
625 630 635 640
Cys Pro Ala Gly Arg Gly Gly Ala Val Cys Glu Lys Val Leu Gly Ala
645 650 655
Pro Val Pro Ala Phe Glu Gly Arg Ser Phe Leu Ala Phe Pro Thr Leu
660 665 670
Arg Ala Tyr His Thr Leu Arg Leu Ala Leu Glu Phe Arg Ala Leu Glu
675 680 685
Pro Gln Gly Leu Leu Leu Tyr Asn Gly Asn Ala Arg Gly Lys Asp Phe
690 695 700
Leu Ala Leu Ala Leu Leu Asp Gly Arg Val Gln Leu Arg Phe Asp Thr
705 710 715 720
Gly Ser Gly Pro Ala Val Leu Thr Ser Ala Val Pro Val Glu Pro Gly
725 730 735
Gln Trp His Arg Leu Glu Leu Ser Arg His Trp Arg Arg Gly Thr Leu
740 745 750
Ser Val Asp Gly Glu Thr Pro Val Leu Gly Glu Ser Pro Ser Gly Thr
755 760 765
Asp Gly Leu Asn Leu Asp Thr Asp Leu Phe Val Gly Gly Val Pro Glu
770 775 780
Asp Gln Ala Ala Val Ala Leu Glu Arg Thr Phe Val Gly Ala Gly Leu
785 790 795 800
Arg Gly Cys Ile Arg Leu Leu Asp Val Asn Asn Gln Arg Leu Glu Leu
805 810 815
Gly Ile Gly Pro Gly Ala Ala Thr Arg Gly Ser Gly Val Gly Glu Cys
820 825 830
Gly Asp His Pro Cys Leu Pro Asn Pro Cys His Gly Gly Ala Pro Cys
835 840 845
Gln Asn Leu Glu Ala Gly Arg Phe His Cys Gln Cys Pro Pro Gly Arg
850 855 860
Val Gly Pro Thr Cys Ala Asp Glu Lys Ser Pro Cys Gln Pro Asn Pro
865 870 875 880
Cys His Gly Ala Ala Pro Cys Arg Val Leu Pro Glu Gly Gly Ala Gln
885 890 895
Cys Glu Cys Pro Leu Gly Arg Glu Gly Thr Phe Cys Gln Thr Ala Ser
900 905 910
Gly Gln Asp Gly Ser Gly Pro Phe Leu Ala Asp Phe Asn Gly Phe Ser
915 920 925
His Leu Glu Leu Arg Gly Leu His Thr Phe Ala Arg Asp Leu Gly Glu
930 935 940
Lys Met Ala Leu Glu Val Val Phe Leu Ala Arg Gly Pro Ser Gly Leu
945 950 955 960
Leu Leu Tyr Asn Gly Gln Lys Thr Asp Gly Lys Gly Asp Phe Val Ser
965 970 975
Leu Ala Leu Arg Asp Arg Arg Leu Glu Phe Arg Tyr Asp Leu Gly Lys
980 985 990
Gly Ala Ala Val Ile Arg Ser Arg Glu Pro Val Thr Leu Gly Ala Trp
995 1000 1005
Thr Arg Val Ser Leu Glu Arg Asn Gly Arg Lys Gly Ala Leu Arg
1010 1015 1020
Val Gly Asp Gly Pro Arg Val Leu Gly Glu Ser Pro Lys Ser Arg
1025 1030 1035
Lys Val Pro His Thr Val Leu Asn Leu Lys Glu Pro Leu Tyr Val
1040 1045 1050
Gly Gly Ala Pro Asp Phe Ser Lys Leu Ala Arg Ala Ala Ala Val
1055 1060 1065
Ser Ser Gly Phe Asp Gly Ala Ile Gln Leu Val Ser Leu Gly Gly
1070 1075 1080
Arg Gln Leu Leu Thr Pro Glu His Val Leu Arg Gln Val Asp Val
1085 1090 1095
Thr Ser Phe Ala Gly His Pro Cys Thr Arg Ala Ser Gly His Pro
1100 1105 1110
Cys Leu Asn Gly Ala Ser Cys Val Pro Arg Glu Ala Ala Tyr Val
1115 1120 1125
Cys Leu Cys Pro Gly Gly Phe Ser Gly Pro His Cys Glu Lys Gly
1130 1135 1140
Leu Val Glu Lys Ser Ala Gly Asp Val Asp Thr Leu Ala Phe Asp
1145 1150 1155
Gly Arg Thr Phe Val Glu Tyr Leu Asn Ala Val Thr Glu Ser Glu
1160 1165 1170
Leu Ala Asn Glu Ile Pro Val Pro Glu Thr Leu Asp Ser Gly Ala
1175 1180 1185
Leu His Ser Glu Lys Ala Leu Gln Ser Asn His Phe Glu Leu Ser
1190 1195 1200
Leu Arg Thr Glu Ala Thr Gln Gly Leu Val Leu Trp Ser Gly Lys
1205 1210 1215
Ala Thr Glu Arg Ala Asp Tyr Val Ala Leu Ala Ile Val Asp Gly
1220 1225 1230
His Leu Gln Leu Ser Tyr Asn Leu Gly Ser Gln Pro Val Val Leu
1235 1240 1245
Arg Ser Thr Val Pro Val Asn Thr Asn Arg Trp Leu Arg Val Val
1250 1255 1260
Ala His Arg Glu Gln Arg Glu Gly Ser Leu Gln Val Gly Asn Glu
1265 1270 1275
Ala Pro Val Thr Gly Ser Ser Pro Leu Gly Ala Thr Gln Leu Asp
1280 1285 1290
Thr Asp Gly Ala Leu Trp Leu Gly Gly Leu Pro Glu Leu Pro Val
1295 1300 1305
Gly Pro Ala Leu Pro Lys Ala Tyr Gly Thr Gly Phe Val Gly Cys
1310 1315 1320
Leu Arg Asp Val Val Val Gly Arg His Pro Leu His Leu Leu Glu
1325 1330 1335
Asp Ala Val Thr Lys Pro Glu Leu Arg Pro Cys Pro Thr Pro
1340 1345 1350
<![CDATA[<210> 432]]>
<![CDATA[<211> 2068]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 432]]>
Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu
1 5 10 15
Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro
20 25 30
Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Ala Asn Val Val Leu Thr
35 40 45
Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr
50 55 60
Ser Cys Lys Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val
65 70 75 80
Ala Arg Glu Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly
85 90 95
Phe Gly Asp Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr
100 105 110
Arg Ile Phe Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His
115 120 125
Lys Asn Glu Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg
130 135 140
Asn Leu Glu Glu Val Glu Phe Cys Val Glu Asp Lys Pro Gly Thr His
145 150 155 160
Phe Thr Pro Val Pro Pro Thr Pro Pro Asp Ala Cys Arg Gly Met Leu
165 170 175
Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala Glu Gly Pro Gly Arg
180 185 190
Ala Ser Cys Val Cys Lys Lys Ser Pro Cys Pro Ser Val Val Ala Pro
195 200 205
Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn Glu Cys Glu Leu Gln
210 215 220
Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg Leu Leu Ser Arg Gly
225 230 235 240
Pro Cys Gly Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly
245 250 255
Ser Thr Cys Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys
260 265 270
Pro Ala Thr Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp
275 280 285
Gly Ala Asp Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala
290 295 300
Arg Gln Glu Asn Val Phe Lys Lys Phe Asp Gly Pro Cys Asp Pro Cys
305 310 315 320
Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn Pro Arg
325 330 335
Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro Ala Arg
340 345 350
Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn Asp Cys
355 360 365
Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln Lys Val
370 375 380
Arg Ser Gly Gln Cys Gln Gly Arg Asp Gln Cys Pro Glu Pro Cys Arg
385 390 395 400
Phe Asn Ala Val Cys Leu Ser Arg Arg Gly Arg Pro Arg Cys Ser Cys
405 410 415
Asp Arg Val Thr Cys Asp Gly Ala Tyr Arg Pro Val Cys Ala Gln Asp
420 425 430
Gly Arg Thr Tyr Asp Ser Asp Cys Trp Arg Gln Gln Ala Glu Cys Arg
435 440 445
Gln Gln Arg Ala Ile Pro Ser Lys His Gln Gly Pro Cys Asp Gln Ala
450 455 460
Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr Cys Ala
465 470 475 480
Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys Ser Ser
485 490 495
Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala
500 505 510
Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile Gln Val
515 520 525
Ala Arg Lys Gly Pro Cys Asp Arg Cys Gly Gln Cys Arg Phe Gly Ala
530 535 540
Leu Cys Glu Ala Glu Thr Gly Arg Cys Val Cys Pro Ser Glu Cys Val
545 550 555 560
Ala Leu Ala Gln Pro Val Cys Gly Ser Asp Gly His Thr Tyr Pro Ser
565 570 575
Glu Cys Met Leu His Val His Ala Cys Thr His Gln Ile Ser Leu His
580 585 590
Val Ala Ser Ala Gly Pro Cys Glu Thr Cys Gly Asp Ala Val Cys Ala
595 600 605
Phe Gly Ala Val Cys Ser Ala Gly Gln Cys Val Cys Pro Arg Cys Glu
610 615 620
His Pro Pro Pro Gly Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly
625 630 635 640
Ser Ala Cys Glu Leu Arg Glu Ala Ala Cys Leu Gln Gln Thr Gln Ile
645 650 655
Glu Glu Ala Arg Ala Gly Pro Cys Glu Gln Ala Glu Cys Gly Ser Gly
660 665 670
Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu Gln Glu Leu Cys Arg
675 680 685
Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu Asp Gly Pro Cys Val
690 695 700
Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser Pro Val Cys Gly Ser
705 710 715 720
Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys Lys Ala Arg Cys
725 730 735
Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly Ala Cys Arg Gly
740 745 750
Pro Thr Phe Ala Pro Leu Pro Pro Val Ala Pro Leu His Cys Ala Gln
755 760 765
Thr Pro Tyr Gly Cys Cys Gln Asp Asn Ile Thr Ala Ala Arg Gly Val
770 775 780
Gly Leu Ala Gly Cys Pro Ser Ala Cys Gln Cys Asn Pro His Gly Ser
785 790 795 800
Tyr Gly Gly Thr Cys Asp Pro Ala Thr Gly Gln Cys Ser Cys Arg Pro
805 810 815
Gly Val Gly Gly Leu Arg Cys Asp Arg Cys Glu Pro Gly Phe Trp Asn
820 825 830
Phe Arg Gly Ile Val Thr Asp Gly Arg Ser Gly Cys Thr Pro Cys Ser
835 840 845
Cys Asp Pro Gln Gly Ala Val Arg Asp Asp Cys Glu Gln Met Thr Gly
850 855 860
Leu Cys Ser Cys Lys Pro Gly Val Ala Gly Pro Lys Cys Gly Gln Cys
865 870 875 880
Pro Asp Gly Arg Ala Leu Gly Pro Ala Gly Cys Glu Ala Asp Ala Ser
885 890 895
Ala Pro Ala Thr Cys Ala Glu Met Arg Cys Glu Phe Gly Ala Arg Cys
900 905 910
Val Glu Glu Ser Gly Ser Ala His Cys Val Cys Pro Met Leu Thr Cys
915 920 925
Pro Glu Ala Asn Ala Thr Lys Val Cys Gly Ser Asp Gly Val Thr Tyr
930 935 940
Gly Asn Glu Cys Gln Leu Lys Thr Ile Ala Cys Arg Gln Gly Leu Gln
945 950 955 960
Ile Ser Ile Gln Ser Leu Gly Pro Cys Gln Glu Ala Val Ala Pro Ser
965 970 975
Thr His Pro Thr Ser Ala Ser Val Thr Val Thr Thr Pro Gly Leu Leu
980 985 990
Leu Ser Gln Ala Leu Pro Ala Pro Pro Gly Ala Leu Pro Leu Ala Pro
995 1000 1005
Ser Ser Thr Ala His Ser Gln Thr Thr Pro Pro Pro Ser Ser Arg
1010 1015 1020
Pro Arg Thr Thr Ala Ser Val Pro Arg Thr Thr Val Trp Pro Val
1025 1030 1035
Leu Thr Val Pro Pro Thr Ala Pro Ser Pro Ala Pro Ser Leu Val
1040 1045 1050
Ala Ser Ala Phe Gly Glu Ser Gly Ser Thr Asp Gly Ser Ser Asp
1055 1060 1065
Glu Glu Leu Ser Gly Asp Gln Glu Ala Ser Gly Gly Gly Ser Gly
1070 1075 1080
Gly Leu Glu Pro Leu Glu Gly Ser Ser Val Ala Thr Pro Gly Pro
1085 1090 1095
Pro Val Glu Arg Ala Ser Cys Tyr Asn Ser Ala Leu Gly Cys Cys
1100 1105 1110
Ser Asp Gly Lys Thr Pro Ser Leu Asp Ala Glu Gly Ser Asn Cys
1115 1120 1125
Pro Ala Thr Lys Val Phe Gln Gly Val Leu Glu Leu Glu Gly Val
1130 1135 1140
Glu Gly Gln Glu Leu Phe Tyr Thr Pro Glu Met Ala Asp Pro Lys
1145 1150 1155
Ser Glu Leu Phe Gly Glu Thr Ala Arg Ser Ile Glu Ser Thr Leu
1160 1165 1170
Asp Asp Leu Phe Arg Asn Ser Asp Val Lys Lys Asp Phe Arg Ser
1175 1180 1185
Val Arg Leu Arg Asp Leu Gly Pro Gly Lys Ser Val Arg Ala Ile
1190 1195 1200
Val Asp Val His Phe Asp Pro Thr Thr Ala Phe Arg Ala Pro Asp
1205 1210 1215
Val Ala Arg Ala Leu Leu Arg Gln Ile Gln Val Ser Arg Arg Arg
1220 1225 1230
Ser Leu Gly Val Arg Arg Pro Leu Gln Glu His Val Arg Phe Met
1235 1240 1245
Asp Phe Asp Trp Phe Pro Ala Phe Ile Thr Gly Ala Thr Ser Gly
1250 1255 1260
Ala Ile Ala Ala Gly Ala Thr Ala Arg Ala Thr Thr Ala Ser Arg
1265 1270 1275
Leu Pro Ser Ser Ala Val Thr Pro Arg Ala Pro His Pro Ser His
1280 1285 1290
Thr Ser Gln Pro Val Ala Lys Thr Thr Ala Ala Pro Thr Thr Arg
1295 1300 1305
Arg Pro Pro Thr Thr Ala Pro Ser Arg Val Pro Gly Arg Arg Pro
1310 1315 1320
Pro Ala Pro Gln Gln Pro Pro Lys Pro Cys Asp Ser Gln Pro Cys
1325 1330 1335
Phe His Gly Gly Thr Cys Gln Asp Trp Ala Leu Gly Gly Gly Phe
1340 1345 1350
Thr Cys Ser Cys Pro Ala Gly Arg Gly Gly Ala Val Cys Glu Lys
1355 1360 1365
Val Leu Gly Ala Pro Val Pro Ala Phe Glu Gly Arg Ser Phe Leu
1370 1375 1380
Ala Phe Pro Thr Leu Arg Ala Tyr His Thr Leu Arg Leu Ala Leu
1385 1390 1395
Glu Phe Arg Ala Leu Glu Pro Gln Gly Leu Leu Leu Tyr Asn Gly
1400 1405 1410
Asn Ala Arg Gly Lys Asp Phe Leu Ala Leu Ala Leu Leu Asp Gly
1415 1420 1425
Arg Val Gln Leu Arg Phe Asp Thr Gly Ser Gly Pro Ala Val Leu
1430 1435 1440
Thr Ser Ala Val Pro Val Glu Pro Gly Gln Trp His Arg Leu Glu
1445 1450 1455
Leu Ser Arg His Trp Arg Arg Gly Thr Leu Ser Val Asp Gly Glu
1460 1465 1470
Thr Pro Val Leu Gly Glu Ser Pro Ser Gly Thr Asp Gly Leu Asn
1475 1480 1485
Leu Asp Thr Asp Leu Phe Val Gly Gly Val Pro Glu Asp Gln Ala
1490 1495 1500
Ala Val Ala Leu Glu Arg Thr Phe Val Gly Ala Gly Leu Arg Gly
1505 1510 1515
Cys Ile Arg Leu Leu Asp Val Asn Asn Gln Arg Leu Glu Leu Gly
1520 1525 1530
Ile Gly Pro Gly Ala Ala Thr Arg Gly Ser Gly Val Gly Glu Cys
1535 1540 1545
Gly Asp His Pro Cys Leu Pro Asn Pro Cys His Gly Gly Ala Pro
1550 1555 1560
Cys Gln Asn Leu Glu Ala Gly Arg Phe His Cys Gln Cys Pro Pro
1565 1570 1575
Gly Arg Val Gly Pro Thr Cys Ala Asp Glu Lys Ser Pro Cys Gln
1580 1585 1590
Pro Asn Pro Cys His Gly Ala Ala Pro Cys Arg Val Leu Pro Glu
1595 1600 1605
Gly Gly Ala Gln Cys Glu Cys Pro Leu Gly Arg Glu Gly Thr Phe
1610 1615 1620
Cys Gln Thr Ala Ser Gly Gln Asp Gly Ser Gly Pro Phe Leu Ala
1625 1630 1635
Asp Phe Asn Gly Phe Ser His Leu Glu Leu Arg Gly Leu His Thr
1640 1645 1650
Phe Ala Arg Asp Leu Gly Glu Lys Met Ala Leu Glu Val Val Phe
1655 1660 1665
Leu Ala Arg Gly Pro Ser Gly Leu Leu Leu Tyr Asn Gly Gln Lys
1670 1675 1680
Thr Asp Gly Lys Gly Asp Phe Val Ser Leu Ala Leu Arg Asp Arg
1685 1690 1695
Arg Leu Glu Phe Arg Tyr Asp Leu Gly Lys Gly Ala Ala Val Ile
1700 1705 1710
Arg Ser Arg Glu Pro Val Thr Leu Gly Ala Trp Thr Arg Val Ser
1715 1720 1725
Leu Glu Arg Asn Gly Arg Lys Gly Ala Leu Arg Val Gly Asp Gly
1730 1735 1740
Pro Arg Val Leu Gly Glu Ser Pro Lys Ser Arg Lys Val Pro His
1745 1750 1755
Thr Val Leu Asn Leu Lys Glu Pro Leu Tyr Val Gly Gly Ala Pro
1760 1765 1770
Asp Phe Ser Lys Leu Ala Arg Ala Ala Ala Val Ser Ser Gly Phe
1775 1780 1785
Asp Gly Ala Ile Gln Leu Val Ser Leu Gly Gly Arg Gln Leu Leu
1790 1795 1800
Thr Pro Glu His Val Leu Arg Gln Val Asp Val Thr Ser Phe Ala
1805 1810 1815
Gly His Pro Cys Thr Arg Ala Ser Gly His Pro Cys Leu Asn Gly
1820 1825 1830
Ala Ser Cys Val Pro Arg Glu Ala Ala Tyr Val Cys Leu Cys Pro
1835 1840 1845
Gly Gly Phe Ser Gly Pro His Cys Glu Lys Gly Leu Val Glu Lys
1850 1855 1860
Ser Ala Gly Asp Val Asp Thr Leu Ala Phe Asp Gly Arg Thr Phe
1865 1870 1875
Val Glu Tyr Leu Asn Ala Val Thr Glu Ser Glu Leu Ala Asn Glu
1880 1885 1890
Ile Pro Val Pro Glu Thr Leu Asp Ser Gly Ala Leu His Ser Glu
1895 1900 1905
Lys Ala Leu Gln Ser Asn His Phe Glu Leu Ser Leu Arg Thr Glu
1910 1915 1920
Ala Thr Gln Gly Leu Val Leu Trp Ser Gly Lys Ala Thr Glu Arg
1925 1930 1935
Ala Asp Tyr Val Ala Leu Ala Ile Val Asp Gly His Leu Gln Leu
1940 1945 1950
Ser Tyr Asn Leu Gly Ser Gln Pro Val Val Leu Arg Ser Thr Val
1955 1960 1965
Pro Val Asn Thr Asn Arg Trp Leu Arg Val Val Ala His Arg Glu
1970 1975 1980
Gln Arg Glu Gly Ser Leu Gln Val Gly Asn Glu Ala Pro Val Thr
1985 1990 1995
Gly Ser Ser Pro Leu Gly Ala Thr Gln Leu Asp Thr Asp Gly Ala
2000 2005 2010
Leu Trp Leu Gly Gly Leu Pro Glu Leu Pro Val Gly Pro Ala Leu
2015 2020 2025
Pro Lys Ala Tyr Gly Thr Gly Phe Val Gly Cys Leu Arg Asp Val
2030 2035 2040
Val Val Gly Arg His Pro Leu His Leu Leu Glu Asp Ala Val Thr
2045 2050 2055
Lys Pro Glu Leu Arg Pro Cys Pro Thr Pro
2060 2065
<![CDATA[<210> 433]]>
<![CDATA[<211> 2307]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 433]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caaggttcgg gtctggcggt acttgaaggg caaagacctg gtggcccggg 300
agagcctgct ggacggcggc aacaaggtgg tgatcagcgg ctttggagac cccctcatct 360
gtgacaacca ggtgtccact ggggacacca ggatcttctt tgtgaaccct gcacccccat 420
acctgtggcc agcccacaag aacgagctga tgctcaactc cagcctcatg cggatcaccc 480
tgcggaacct ggaggaggtg gagttctgtg tggaagataa acccgggacc cacttcactc 540
cagtgcctcc gacgcctcct gatgcgtgcc ggggaatgct gtgcggcttc ggcgccgtgt 600
gcgagcccaa cgcggagggg ccgggccggg cgtcctgcgt ctgcaagaag agcccgtgcc 660
ccagcgtggt ggcgcctgtg tgtgggtcgg acgcctccac ctacagcaac gaatgcgagc 720
tgcagcgggc gcagtgcagc cagcagcgcc gcatccgcct gctcagccgc gggccgtgcg 780
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 840
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgagggga 900
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 960
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtgacccc tgtcagggcg 1020
ccctccctga cccgagccgc agctgccgtg tgaacccgcg cacgcggcgc cctgagatgc 1080
tcctacggcc cgagagctgc cctgcccggc aggcgccagt gtgtggggac gacggagtca 1140
cctacgaaaa cgactgtgtc atgggccgat cgggggccgc ccggggtctc ctcctgcaga 1200
aagtgcgctc cggccagtgc cagggtcgag accagtgccc ggagccctgc cggttcaatg 1260
ccgtgtgcct gtcccgccgt ggccgtcccc gctgctcctg cgaccgcgtc acctgtgacg 1320
gggcctacag gcccgtgtgt gcccaggacg ggcgcacgta tgacagtgat tgctggcggc 1380
agcaggctga gtgccggcag cagcgtgcca tccccagcaa gcaccagggc ccgtgtgacc 1440
aggccccgtc cccatgcctc ggggtgcagt gtgcatttgg ggcgacgtgt gctgtgaaga 1500
acgggcaggc agcgtgtgaa tgcctgcagg cgtgctcgag cctctacgat cctgtgtgcg 1560
gcagcgacgg cgtcacatac ggcagcgcgt gcgagctgga ggccacggcc tgtaccctcg 1620
ggcgggagat ccaggtggcg cgcaaaggac cctgtgaccg ctgcgggcag tgccgctttg 1680
gagccctgtg cgaggccgag accgggcgct gcgtgtgccc ctctgaatgc gtggctttgg 1740
cccagcccgt gtgtggctcc gacgggcaca cgtaccccag cgagtgcatg ctgcacgtgc 1800
acgcctgcac acaccagatc agcctgcacg tggcctcagc tggaccctgt gagacctgtg 1860
gagatgccgt gtgtgctttt ggggctgtgt gctccgcagg gcagtgtgtg tgtccccggt 1920
gtgagcaccc cccgcccggc cccgtgtgtg gcagcgacgg tgtcacctac ggcagtgcct 1980
gcgagctacg ggaagccgcc tgcctccagc agacacagat cgaggaggcc cgggcagggc 2040
cgtgcgagca ggccgagtgc ggttccggag gctctggctc tggggaggac ggtgactgtg 2100
agcaggagct gtgccggcag cgcggtggca tctgggacga ggactcggag gacgggccgt 2160
gtgtctgtga cttcagctgc cagagtgtcc caggcagccc ggtgtgcggc tcagatgggg 2220
tcacctacag caccgagtgt gagctgaaga aggccaggtg tgagtcacag cgagggctct 2280
acgtagcggc ccagggagcc tgccgag 2307
<![CDATA[<210> 434]]>
<![CDATA[<211> 751]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 434]]>
Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu
1 5 10 15
Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro
20 25 30
Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Ala Asn Val Val Leu Thr
35 40 45
Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr
50 55 60
Ser Cys Lys Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val
65 70 75 80
Ala Arg Glu Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly
85 90 95
Phe Gly Asp Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr
100 105 110
Arg Ile Phe Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His
115 120 125
Lys Asn Glu Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg
130 135 140
Asn Leu Glu Glu Val Glu Phe Cys Val Glu Asp Lys Pro Gly Thr His
145 150 155 160
Phe Thr Pro Val Pro Pro Thr Pro Pro Asp Ala Cys Arg Gly Met Leu
165 170 175
Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala Glu Gly Pro Gly Arg
180 185 190
Ala Ser Cys Val Cys Lys Lys Ser Pro Cys Pro Ser Val Val Ala Pro
195 200 205
Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn Glu Cys Glu Leu Gln
210 215 220
Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg Leu Leu Ser Arg Gly
225 230 235 240
Pro Cys Gly Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly
245 250 255
Ser Thr Cys Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys
260 265 270
Pro Ala Thr Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp
275 280 285
Gly Ala Asp Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala
290 295 300
Arg Gln Glu Asn Val Phe Lys Lys Phe Asp Gly Pro Cys Asp Pro Cys
305 310 315 320
Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn Pro Arg
325 330 335
Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro Ala Arg
340 345 350
Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn Asp Cys
355 360 365
Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln Lys Val
370 375 380
Arg Ser Gly Gln Cys Gln Gly Arg Asp Gln Cys Pro Glu Pro Cys Arg
385 390 395 400
Phe Asn Ala Val Cys Leu Ser Arg Arg Gly Arg Pro Arg Cys Ser Cys
405 410 415
Asp Arg Val Thr Cys Asp Gly Ala Tyr Arg Pro Val Cys Ala Gln Asp
420 425 430
Gly Arg Thr Tyr Asp Ser Asp Cys Trp Arg Gln Gln Ala Glu Cys Arg
435 440 445
Gln Gln Arg Ala Ile Pro Ser Lys His Gln Gly Pro Cys Asp Gln Ala
450 455 460
Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr Cys Ala
465 470 475 480
Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys Ser Ser
485 490 495
Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala
500 505 510
Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile Gln Val
515 520 525
Ala Arg Lys Gly Pro Cys Asp Arg Cys Gly Gln Cys Arg Phe Gly Ala
530 535 540
Leu Cys Glu Ala Glu Thr Gly Arg Cys Val Cys Pro Ser Glu Cys Val
545 550 555 560
Ala Leu Ala Gln Pro Val Cys Gly Ser Asp Gly His Thr Tyr Pro Ser
565 570 575
Glu Cys Met Leu His Val His Ala Cys Thr His Gln Ile Ser Leu His
580 585 590
Val Ala Ser Ala Gly Pro Cys Glu Thr Cys Gly Asp Ala Val Cys Ala
595 600 605
Phe Gly Ala Val Cys Ser Ala Gly Gln Cys Val Cys Pro Arg Cys Glu
610 615 620
His Pro Pro Pro Gly Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly
625 630 635 640
Ser Ala Cys Glu Leu Arg Glu Ala Ala Cys Leu Gln Gln Thr Gln Ile
645 650 655
Glu Glu Ala Arg Ala Gly Pro Cys Glu Gln Ala Glu Cys Gly Ser Gly
660 665 670
Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu Gln Glu Leu Cys Arg
675 680 685
Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu Asp Gly Pro Cys Val
690 695 700
Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser Pro Val Cys Gly Ser
705 710 715 720
Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys Lys Ala Arg Cys
725 730 735
Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly Ala Cys Arg
740 745 750
<![CDATA[<210> 435]]>
<![CDATA[<211> 102]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 435]]>
taaaaggctt ctccccagat gccagagacc tgtctgctaa agaaaccaag atgctgatgg 60
ctgctggaga caaagatggg gacggcaaaa ttggggttga cg 102
<![CDATA[<210> 436]]>
<![CDATA[<211> 125]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 436]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttg 125
<![CDATA[<210> 437]]>
<![CDATA[<211> 227]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 437]]>
taaaaggctt ctccccagat gccagagacc tgtctgctaa agaaaccaag atgctgatgg 60
ctgctggaga caaagatggg gacggcaaaa ttggggttga cgctacatct acatccacca 120
ctgggacaag ccatcttgta aaatgtgcgg agaaggagaa aactttctgt gtgaatggag 180
gggagtgctt catggtgaaa gacctttcaa acccctcgag atacttg 227
<![CDATA[<210> 438]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 438]]>
Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys
1 5 10 15
Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val
20 25 30
Asp Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys
35 40 45
Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met
50 55 60
Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu
65 70 75
<![CDATA[<210> 439]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 439]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccacccgag 69
<![CDATA[<210> 440]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 440]]>
ttgcaggatg tcgatgacag acttgctgaa cgctgaggac atcaagaagg cggtgggagc 60
ctttagcg 68
<![CDATA[<210> 441]]>
<![CDATA[<211> 133]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 441]]>
ctaccgactc cttcgaccac aaaaagttct tccaaatggt cggcctgaag aaaaagagtg 60
cggatgatgt gaagaaggtg tttcacatgc tggacaagga caaaagtggc ttcatcgagg 120
aggatgagct ggg 133
<![CDATA[<210> 442]]>
<![CDATA[<211> 110]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 442]]>
attcatccta aaaggcttct ccccagatgc cagagacctg tctgctaaag aaaccaagat 60
gctgatggct gctggagaca aagatgggga cggcaaaatt ggggttgacg 110
<![CDATA[<210> 443]]>
<![CDATA[<211> 216]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 443]]>
aattctccac tctggtggct gaaagctaag aagcactgac tgcccctggt cttccacctc 60
tctgccctga acacccaatc tcggcccctc tcgccaccct cctgcatttc tgttcagttc 120
gtttatgtta ttttttactc ccccatcccc tgtggccctc taatgacacc attcttctgg 180
aaaatgctgg agaagcaata aaggttgtac cagtca 216
<![CDATA[<210> 444]]>
<![CDATA[<211> 596]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> PVALB之全mRNA多核苷酸序列]]>
<![CDATA[<400> 444]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccacccgagt tgcaggatgt cgatgacaga cttgctgaac gctgaggaca tcaagaaggc 120
ggtgggagcc tttagcgcta ccgactcctt cgaccacaaa aagttcttcc aaatggtcgg 180
cctgaagaaa aagagtgcgg atgatgtgaa gaaggtgttt cacatgctgg acaaggacaa 240
aagtggcttc atcgaggagg atgagctggg attcatccta aaaggcttct ccccagatgc 300
cagagacctg tctgctaaag aaaccaagat gctgatggct gctggagaca aagatgggga 360
cggcaaaatt ggggttgacg aattctccac tctggtggct gaaagctaag aagcactgac 420
tgcccctggt cttccacctc tctgccctga acacccaatc tcggcccctc tcgccaccct 480
cctgcatttc tgttcagttc gtttatgtta ttttttactc ccccatcccc tgtggccctc 540
taatgacacc attcttctgg aaaatgctgg agaagcaata aaggttgtac cagtca 596
<![CDATA[<210> 445]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 445]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser
20
<![CDATA[<210> 446]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 446]]>
Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln Met Val Gly Leu Lys
1 5 10 15
Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys
20 25 30
Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
35 40
<![CDATA[<210> 447]]>
<![CDATA[<211> 36]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 447]]>
Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys
1 5 10 15
Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys
20 25 30
Ile Gly Val Asp
35
<![CDATA[<210> 448]]>
<![CDATA[<211> 8]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 448]]>
Phe Ser Thr Leu Val Ala Glu Ser
1 5
<![CDATA[<210> 449]]>
<![CDATA[<211> 110]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 449]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser Ala Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln
20 25 30
Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
35 40 45
His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
50 55 60
Gly Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala
65 70 75 80
Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
85 90 95
Lys Ile Gly Val Asp Glu Phe Ser Thr Leu Val Ala Glu Ser
100 105 110
<![CDATA[<210> 450]]>
<![CDATA[<211> 380]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 450]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccacccgagt tgcaggatgt cgatgacaga cttgctgaac gctgaggaca tcaagaaggc 120
ggtgggagcc tttagcgcta ccgactcctt cgaccacaaa aagttcttcc aaatggtcgg 180
cctgaagaaa aagagtgcgg atgatgtgaa gaaggtgttt cacatgctgg acaaggacaa 240
aagtggcttc atcgaggagg atgagctggg attcatccta aaaggcttct ccccagatgc 300
cagagacctg tctgctaaag aaaccaagat gctgatggct gctggagaca aagatgggga 360
cggcaaaatt ggggttgacg 380
<![CDATA[<210> 451]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 451]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser Ala Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln
20 25 30
Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
35 40 45
His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
50 55 60
Gly Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala
65 70 75 80
Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
85 90 95
Lys Ile Gly Val Asp
100
<![CDATA[<210> 452]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 452]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tag 93
<![CDATA[<210> 453]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 453]]>
ctacatctac atccaccact gggacaag 28
<![CDATA[<210> 454]]>
<![CDATA[<211> 121]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 454]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tagctacatc tacatccacc actgggacaa 120
g 121
<![CDATA[<210> 455]]>
<![CDATA[<211> 39]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 455]]>
Leu Gly Ser His Cys Val Ala Gln Thr Gly Leu Glu Leu Leu Ala Ser
1 5 10 15
Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn Ala Glu Met Ile Ala Thr
20 25 30
Ser Thr Ser Thr Thr Gly Thr
35
<![CDATA[<210> 456]]>
<![CDATA[<211> 273]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 456]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cag 273
<![CDATA[<210> 457]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 457]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tag 93
<![CDATA[<210> 458]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 458]]>
agacggggtc tgactgtgtt acccaggctg gtcttcaact cttggcctca agtgatcctc 60
ctgccttagc ttccaagaat gctgaggtta cag 93
<![CDATA[<210> 459]]>
<![CDATA[<211> 429]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 459]]>
gcaccatgag ccaggacacc gaggtggata tgaaggaggt ggagctgaat gagttagagc 60
ccgagaagca gccgatgaac gcggcgtctg gggcggccat gtccctggcg ggagccgaga 120
agaatggtct ggtgaagatc aaggtggcgg aagacgaggc ggaggcggca gccgcggcta 180
agttcacggg cctgtccaag gaggagctgc tgaaggtggc aggcagcccc ggctgggtac 240
gcacccgctg ggcactgctg ctgctcttct ggctcggctg gctcggcatg cttgctggtg 300
ccgtggtcat aatcgtgcga gcgccgcgtt gtcgcgagct accggcgcag aagtggtggc 360
acacgggcgc cctctaccgc atcggcgacc ttcaggcctt ccagggccac ggcgcgggca 420
acctggcgg 429
<![CDATA[<210> 460]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 460]]>
gtctgaaggg gcgtctcgat tacctgagct ctctgaaggt gaagggcctt gtgctgggtc 60
caattcacaa gaaccagaag gatgatgtcg ctcagactga cttgctgcag atcgacccca 120
attttggctc caaggaagat tttgacagtc tcttgcaatc ggctaaaaaa aaga 174
<![CDATA[<210> 461]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 461]]>
gcatccgtgt cattctggac cttactccca actaccgggg tgagaactcg tggttctcca 60
ctcaggttga cactgtggcc accaaggtga ag 92
<![CDATA[<210> 462]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 462]]>
gatgctctgg agttttggct gcaagctggc gtggatgggt tccaggttcg ggacatagag 60
aatctgaag 69
<![CDATA[<210> 463]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 463]]>
gatgcatcct cattcttggc tgagtggcaa aatatcacca agggcttcag tgaagacag 59
<![CDATA[<210> 464]]>
<![CDATA[<211> 181]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 464]]>
gctcttgatt gcggggacta actcctccga ccttcagcag atcctgagcc tactcgaatc 60
caacaaagac ttgctgttga ctagctcata cctgtctgat tctggttcta ctggggagca 120
tacaaaatcc ctagtcacac agtatttgaa tgccactggc aatcgctggt gcagctggag 180
t 181
<![CDATA[<210> 465]]>
<![CDATA[<211> 144]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 465]]>
ttgtctcagg caaggctcct gacttccttc ttgccggctc aacttctccg actctaccag 60
ctgatgctct tcaccctgcc agggacccct gttttcagct acggggatga gattggcctg 120
gatgcagctg cccttcctgg acag 144
<![CDATA[<210> 466]]>
<![CDATA[<211> 84]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 466]]>
cctatggagg ctccagtcat gctgtgggat gagtccagct tccctgacat cccaggggct 60
gtaagtgcca acatgactgt gaag 84
<![CDATA[<210> 467]]>
<![CDATA[<211> 530]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 467]]>
ggccagagtg aagaccctgg ctccctcctt tccttgttcc ggcggctgag tgaccagcgg 60
agtaaggagc gctccctact gcatggggac ttccacgcgt tctccgctgg gcctggactc 120
ttctcctata tccgccactg ggaccagaat gagcgttttc tggtagtgct taactttggg 180
gatgtgggcc tctcggctgg actgcaggcc tccgacctgc ctgccagcgc cagcctgcca 240
gccaaggctg acctcctgct cagcacccag ccaggccgtg aggagggctc ccctcttgag 300
ctggaacgcc tgaaactgga gcctcacgaa gggctgctgc tccgcttccc ctacgcggcc 360
tgacttcagc ctgacatgga cccactaccc ttctcctttc cttcccaggc cctttggctt 420
ctgatttttc tcttttttaa aaacaaacaa acaaactgtt gcagattatg agtgaacccc 480
caaatagggt gttttctgcc ttcaaataaa agtcacccct gcatggtgaa 530
<![CDATA[<210> 468]]>
<![CDATA[<211> 2221]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> SLC3A2之全mRNA多核苷酸序列]]>
<![CDATA[<400> 468]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cagagttggg gtctcactgt gttgcccaga 300
ctggtctcga actcttggcc tcaggtgatc ctcttccctc agcttcccag aatgccgaga 360
tgatagagac ggggtctgac tgtgttaccc aggctggtct tcaactcttg gcctcaagtg 420
atcctcctgc cttagcttcc aagaatgctg aggttacagg caccatgagc caggacaccg 480
aggtggatat gaaggaggtg gagctgaatg agttagagcc cgagaagcag ccgatgaacg 540
cggcgtctgg ggcggccatg tccctggcgg gagccgagaa gaatggtctg gtgaagatca 600
aggtggcgga agacgaggcg gaggcggcag ccgcggctaa gttcacgggc ctgtccaagg 660
aggagctgct gaaggtggca ggcagccccg gctgggtacg cacccgctgg gcactgctgc 720
tgctcttctg gctcggctgg ctcggcatgc ttgctggtgc cgtggtcata atcgtgcgag 780
cgccgcgttg tcgcgagcta ccggcgcaga agtggtggca cacgggcgcc ctctaccgca 840
tcggcgacct tcaggccttc cagggccacg gcgcgggcaa cctggcgggt ctgaaggggc 900
gtctcgatta cctgagctct ctgaaggtga agggccttgt gctgggtcca attcacaaga 960
accagaagga tgatgtcgct cagactgact tgctgcagat cgaccccaat tttggctcca 1020
aggaagattt tgacagtctc ttgcaatcgg ctaaaaaaaa gagcatccgt gtcattctgg 1080
accttactcc caactaccgg ggtgagaact cgtggttctc cactcaggtt gacactgtgg 1140
ccaccaaggt gaaggatgct ctggagtttt ggctgcaagc tggcgtggat gggttccagg 1200
ttcgggacat agagaatctg aaggatgcat cctcattctt ggctgagtgg caaaatatca 1260
ccaagggctt cagtgaagac aggctcttga ttgcggggac taactcctcc gaccttcagc 1320
agatcctgag cctactcgaa tccaacaaag acttgctgtt gactagctca tacctgtctg 1380
attctggttc tactggggag catacaaaat ccctagtcac acagtatttg aatgccactg 1440
gcaatcgctg gtgcagctgg agtttgtctc aggcaaggct cctgacttcc ttcttgccgg 1500
ctcaacttct ccgactctac cagctgatgc tcttcaccct gccagggacc cctgttttca 1560
gctacgggga tgagattggc ctggatgcag ctgcccttcc tggacagcct atggaggctc 1620
cagtcatgct gtgggatgag tccagcttcc ctgacatccc aggggctgta agtgccaaca 1680
tgactgtgaa gggccagagt gaagaccctg gctccctcct ttccttgttc cggcggctga 1740
gtgaccagcg gagtaaggag cgctccctac tgcatgggga cttccacgcg ttctccgctg 1800
ggcctggact cttctcctat atccgccact gggaccagaa tgagcgtttt ctggtagtgc 1860
ttaactttgg ggatgtgggc ctctcggctg gactgcaggc ctccgacctg cctgccagcg 1920
ccagcctgcc agccaaggct gacctcctgc tcagcaccca gccaggccgt gaggagggct 1980
cccctcttga gctggaacgc ctgaaactgg agcctcacga agggctgctg ctccgcttcc 2040
cctacgcggc ctgacttcag cctgacatgg acccactacc cttctccttt ccttcccagg 2100
ccctttggct tctgattttt ctctttttta aaaacaaaca aacaaactgt tgcagattat 2160
gagtgaaccc ccaaataggg tgttttctgc cttcaaataa aagtcacccc tgcatggtga 2220
a 2221
<![CDATA[<210> 469]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 469]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser
35
<![CDATA[<210> 470]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> ]]> PRT
<![CDATA[<213> 人類]]>
<![CDATA[<400> 470]]>
Leu Gly Ser His Cys Val Ala Gln Thr Gly Leu Glu Leu Leu Ala Ser
1 5 10 15
Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn Ala Glu Met Ile
20 25 30
<![CDATA[<210> 471]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 471]]>
Thr Gly Ser Asp Cys Val Thr Gln Ala Gly Leu Gln Leu Leu Ala Ser
1 5 10 15
Ser Asp Pro Pro Ala Leu Ala Ser Lys Asn Ala Glu Val Thr
20 25 30
<![CDATA[<210> 472]]>
<![CDATA[<211> 142]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 472]]>
Thr Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn
1 5 10 15
Glu Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala
20 25 30
Met Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val
35 40 45
Ala Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu
50 55 60
Ser Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg
65 70 75 80
Thr Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met
85 90 95
Leu Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu
100 105 110
Leu Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly
115 120 125
Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala
130 135 140
<![CDATA[<210> 473]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 473]]>
Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu
1 5 10 15
Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr
20 25 30
Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp
35 40 45
Ser Leu Leu Gln Ser Ala Lys Lys Lys
50 55
<![CDATA[<210> 474]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 474]]>
Ile Arg Val Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser
1 5 10 15
Trp Phe Ser Thr Gln Val Asp Thr Val Ala Thr Lys Val Lys
20 25 30
<![CDATA[<210> 475]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 475]]>
Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val
1 5 10 15
Arg Asp Ile Glu Asn Leu Lys
20
<![CDATA[<210> 476]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 476]]>
Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe
1 5 10 15
Ser Glu Asp
<![CDATA[<210> 477]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 477]]>
Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu Ser
1 5 10 15
Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu Ser
20 25 30
Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln Tyr
35 40 45
Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser
50 55 60
<![CDATA[<210> 478]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 478]]>
Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu
1 5 10 15
Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe
20 25 30
Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln
35 40 45
<![CDATA[<210> 479]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 479]]>
Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp
1 5 10 15
Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys
20 25
<![CDATA[<210> 480]]>
<![CDATA[<211> 120]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 480]]>
Gly Gln Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu
1 5 10 15
Ser Asp Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His
20 25 30
Ala Phe Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp
35 40 45
Gln Asn Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu
50 55 60
Ser Ala Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro
65 70 75 80
Ala Lys Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly
85 90 95
Ser Pro Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu
100 105 110
Leu Leu Arg Phe Pro Tyr Ala Ala
115 120
<![CDATA[<210> 481]]>
<![CDATA[<211> 630]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 481]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser Glu Leu Gly Ser His Cys Val Ala Gln Thr Gly
35 40 45
Leu Glu Leu Leu Ala Ser Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn
50 55 60
Ala Glu Met Ile Glu Thr Gly Ser Asp Cys Val Thr Gln Ala Gly Leu
65 70 75 80
Gln Leu Leu Ala Ser Ser Asp Pro Pro Ala Leu Ala Ser Lys Asn Ala
85 90 95
Glu Val Thr Gly Thr Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu
100 105 110
Val Glu Leu Asn Glu Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala
115 120 125
Ser Gly Ala Ala Met Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val
130 135 140
Lys Ile Lys Val Ala Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Lys
145 150 155 160
Phe Thr Gly Leu Ser Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro
165 170 175
Gly Trp Val Arg Thr Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly
180 185 190
Trp Leu Gly Met Leu Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro
195 200 205
Arg Cys Arg Glu Leu Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu
210 215 220
Tyr Arg Ile Gly Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn
225 230 235 240
Leu Ala Gly Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val
245 250 255
Lys Gly Leu Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val
260 265 270
Ala Gln Thr Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu
275 280 285
Asp Phe Asp Ser Leu Leu Gln Ser Ala Lys Lys Lys Ser Ile Arg Val
290 295 300
Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser Trp Phe Ser
305 310 315 320
Thr Gln Val Asp Thr Val Ala Thr Lys Val Lys Asp Ala Leu Glu Phe
325 330 335
Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val Arg Asp Ile Glu Asn
340 345 350
Leu Lys Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys
355 360 365
Gly Phe Ser Glu Asp Arg Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp
370 375 380
Leu Gln Gln Ile Leu Ser Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu
385 390 395 400
Thr Ser Ser Tyr Leu Ser Asp Ser Gly Ser Thr Gly Glu His Thr Lys
405 410 415
Ser Leu Val Thr Gln Tyr Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser
420 425 430
Trp Ser Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln
435 440 445
Leu Leu Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro
450 455 460
Val Phe Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro
465 470 475 480
Gly Gln Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe
485 490 495
Pro Asp Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys Gly Gln
500 505 510
Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu Ser Asp
515 520 525
Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His Ala Phe
530 535 540
Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp Gln Asn
545 550 555 560
Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu Ser Ala
565 570 575
Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro Ala Lys
580 585 590
Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly Ser Pro
595 600 605
Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu Leu Leu
610 615 620
Arg Phe Pro Tyr Ala Ala
625 630
<![CDATA[<210> 482]]>
<![CDATA[<211> 366]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 482]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cagagttggg gtctcactgt gttgcccaga 300
ctggtctcga actcttggcc tcaggtgatc ctcttccctc agcttcccag aatgccgaga 360
tgatag 366
<![CDATA[<210> 483]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 483]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser Glu Leu Gly Ser His Cys Val Ala Gln Thr Gly
35 40 45
Leu Glu Leu Leu Ala Ser Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn
50 55 60
Ala Glu Met Ile
65
<![CDATA[<210> 484]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 484]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccag 54
<![CDATA[<210> 485]]>
<![CDATA[<211> 87]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 485]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[<210> 486]]>
<![CDATA[<211> 141]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 486]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccagctacat 60
ctacatccac cactgggaca agccatcttg taaaatgtgc ggagaaggag aaaactttct 120
gtgtgaatgg aggggagtgc t 141
<![CDATA[<210> 487]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 487]]>
Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg
1 5 10 15
Pro Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys
20 25 30
Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
35 40 45
<![CDATA[<210> 488]]>
<![CDATA[<211> 207]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 488]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggag 207
<![CDATA[<210> 489]]>
<![CDATA[<211> 130]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 489]]>
gtctaccctg aactgcagat caccaatgtg gtagaagcca accaaccagt gaccatccag 60
aactggtgca agcggggccg caagcagtgc aagacccatc cccactttgt gattccctac 120
cgctgcttag 130
<![CDATA[<210> 490]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 490]]>
ttggtgagtt tgtaagtgat gcccttctcg ttcctgacaa gtgcaaattc ttacaccagg 60
agaggatgga tgtttgcgaa actcatcttc actggcacac cgtcgccaaa gag 113
<![CDATA[<210> 491]]>
<![CDATA[<211> 194]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 491]]>
acatgcagtg agaagagtac caacttgcat gactacggca tgttgctgcc ctgcggaatt 60
gacaagttcc gaggggtaga gtttgtgtgt tgcccactgg ctgaagaaag tgacaatgtg 120
gattctgctg atgcggagga ggatgactcg gatgtctggt ggggcggagc agacacagac 180
tatgcagatg ggag 194
<![CDATA[<210> 492]]>
<![CDATA[<211> 203]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 492]]>
tgaagacaaa gtagtagaag tagcagagga ggaagaagtg gctgaggtgg aagaagaaga 60
agccgatgat gacgaggacg atgaggatgg tgatgaggta gaggaagagg ctgaggaacc 120
ctacgaagaa gccacagaga gaaccaccag cattgccacc accaccacca ccaccacaga 180
gtctgtggaa gaggtggttc gag 203
<![CDATA[<210> 493]]>
<![CDATA[<211> 168]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 493]]>
aggtgtgctc tgaacaagcc gagacggggc cgtgccgagc aatgatctcc cgctggtact 60
ttgatgtgac tgaagggaag tgtgccccat tcttttacgg cggatgtggc ggcaaccgga 120
acaactttga cacagaagag tactgcatgg ccgtgtgtgg cagcgcca 168
<![CDATA[<210> 494]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 494]]>
tgtcccaaag tttactcaag actacccagg aacctcttgc ccgagatcct gttaaac 57
<![CDATA[<210> 495]]>
<![CDATA[<211> 134]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 495]]>
ttcctacaac agcagccagt acccctgatg ccgttgacaa gtatctcgag acacctgggg 60
atgagaatga acatgcccat ttccagaaag ccaaagagag gcttgaggcc aagcaccgag 120
agagaatgtc ccag 134
<![CDATA[<210> 496]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> ]]> 496
gtcatgagag aatgggaaga ggcagaacgt caagcaaaga acttgcctaa agctgataag 60
aaggcagtta tccag 75
<![CDATA[<210> 497]]>
<![CDATA[<211> 159]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 497]]>
catttccagg agaaagtgga atctttggaa caggaagcag ccaacgagag acagcagctg 60
gtggagacac acatggccag agtggaagcc atgctcaatg accgccgccg cctggccctg 120
gagaactaca tcaccgctct gcaggctgtt cctcctcgg 159
<![CDATA[<210> 498]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 498]]>
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 60
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 120
cggtcccag 129
<![CDATA[<210> 499]]>
<![CDATA[<211> 100]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 499]]>
gttatgacac acctccgtgt gatttatgag cgcatgaatc agtctctctc cctgctctac 60
aacgtgcctg cagtggccga ggagattcag gatgaagttg 100
<![CDATA[<210> 500]]>
<![CDATA[<211> 222]]>
<![CDATA[<212> DNA]]>
<![CDATA[<21]]>3> 人類]]>
<br/>
<br/><![CDATA[<400> 500]]>
<br/><![CDATA[atgagctgct tcagaaagag caaaactatt cagatgacgt cttggccaac atgattagtg 60
aaccaaggat cagttacgga aacgatgctc tcatgccatc tttgaccgaa acgaaaacca 120
ccgtggagct ccttcccgtg aatggagagt tcagcctgga cgatctccag ccgtggcatt 180
cttttggggc tgactctgtg ccagccaaca cagaaaacga ag 222
<![CDATA[<210> 501]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 501]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccag 54
<![CDATA[<210> 502]]>
<![CDATA[<211> 101]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 502]]>
gttctgggtt gacaaatatc aagacggagg agatctctga agtgaagatg gatgcagaat 60
tccgacatga ctcaggatat gaagttcatc atcaaaaatt g 101
<![CDATA[<210> 503]]>
<![CDATA[<211> 147]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 503]]>
gtgttctttg cagaagatgt gggttcaaac aaaggtgcaa tcattggact catggtgggc 60
ggtgttgtca tagcgacagt gatcgtcatc accttggtga tgctgaagaa gaaacagtac 120
acatccattc atcatggtgt ggtggag 147
<![CDATA[<210> 504]]>
<![CDATA[<211> 1222]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 504]]>
gttgacgccg ctgtcacccc agaggagcgc cacctgtcca agatgcagca gaacggctac 60
gaaaatccaa cctacaagtt ctttgagcag atgcagaact agacccccgc cacagcagcc 120
tctgaagttg gacagcaaaa ccattgcttc actacccatc ggtgtccatt tatagaataa 180
tgtgggaaga aacaaacccg ttttatgatt tactcattat cgccttttga cagctgtgct 240
gtaacacaag tagatgcctg aacttgaatt aatccacaca tcagtaatgt attctatctc 300
tctttacatt ttggtctcta tactacatta ttaatgggtt ttgtgtactg taaagaattt 360
agctgtatca aactagtgca tgaatagatt ctctcctgat tatttatcac atagcccctt 420
agccagttgt atattattct tgtggtttgt gacccaatta agtcctactt tacatatgct 480
ttaagaatcg atgggggatg cttcatgtga acgtgggagt tcagctgctt ctcttgccta 540
agtattcctt tcctgatcac tatgcatttt aaagttaaac atttttaagt atttcagatg 600
ctttagagag attttttttc catgactgca ttttactgta cagattgctg cttctgctat 660
atttgtgata taggaattaa gaggatacac acgtttgttt cttcgtgcct gttttatgtg 720
cacacattag gcattgagac ttcaagcttt tctttttttg tccacgtatc tttgggtctt 780
tgataaagaa aagaatccct gttcattgta agcactttta cggggcgggt ggggaggggt 840
gctctgctgg tcttcaatta ccaagaattc tccaaaacaa ttttctgcag gatgattgta 900
cagaatcatt gcttatgaca tgatcgcttt ctacactgta ttacataaat aaattaaata 960
aaataacccc gggcaagact tttctttgaa ggatgactac agacattaaa taatcgaagt 1020
aattttgggt ggggagaaga ggcagattca attttcttta accagtctga agtttcattt 1080
atgatacaaa agaagatgaa aatggaagtg gcaatataag gggatgagga aggcatgcct 1140
ggacaaaccc ttcttttaag atgtgtcttc aatttgtata aaatggtgtt ttcatgtaaa 1200
taaatacatt cttggaggag ca 1222
<![CDATA[<210> 505]]>
<![CDATA[<211> 3415]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> APP之全mRNA多核苷酸序列]]>
<![CDATA[<400> 505]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggaggtc taccctgaac tgcagatcac caatgtggta 240
gaagccaacc aaccagtgac catccagaac tggtgcaagc ggggccgcaa gcagtgcaag 300
acccatcccc actttgtgat tccctaccgc tgcttagttg gtgagtttgt aagtgatgcc 360
cttctcgttc ctgacaagtg caaattctta caccaggaga ggatggatgt ttgcgaaact 420
catcttcact ggcacaccgt cgccaaagag acatgcagtg agaagagtac caacttgcat 480
gactacggca tgttgctgcc ctgcggaatt gacaagttcc gaggggtaga gtttgtgtgt 540
tgcccactgg ctgaagaaag tgacaatgtg gattctgctg atgcggagga ggatgactcg 600
gatgtctggt ggggcggagc agacacagac tatgcagatg ggagtgaaga caaagtagta 660
gaagtagcag aggaggaaga agtggctgag gtggaagaag aagaagccga tgatgacgag 720
gacgatgagg atggtgatga ggtagaggaa gaggctgagg aaccctacga agaagccaca 780
gagagaacca ccagcattgc caccaccacc accaccacca cagagtctgt ggaagaggtg 840
gttcgagagg tgtgctctga acaagccgag acggggccgt gccgagcaat gatctcccgc 900
tggtactttg atgtgactga agggaagtgt gccccattct tttacggcgg atgtggcggc 960
aaccggaaca actttgacac agaagagtac tgcatggccg tgtgtggcag cgccatgtcc 1020
caaagtttac tcaagactac ccaggaacct cttgcccgag atcctgttaa acttcctaca 1080
acagcagcca gtacccctga tgccgttgac aagtatctcg agacacctgg ggatgagaat 1140
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagagaatg 1200
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1260
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 1320
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1380
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1440
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1500
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1560
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1620
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1680
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1740
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1800
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1860
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1920
gaccgaggac tgaccactcg accaggttct gggttgacaa atatcaagac ggaggagatc 1980
tctgaagtga agatggatgc agaattccga catgactcag gatatgaagt tcatcatcaa 2040
aaattggtgt tctttgcaga agatgtgggt tcaaacaaag gtgcaatcat tggactcatg 2100
gtgggcggtg ttgtcatagc gacagtgatc gtcatcacct tggtgatgct gaagaagaaa 2160
cagtacacat ccattcatca tggtgtggtg gaggttgacg ccgctgtcac cccagaggag 2220
cgccacctgt ccaagatgca gcagaacggc tacgaaaatc caacctacaa gttctttgag 2280
cagatgcaga actagacccc cgccacagca gcctctgaag ttggacagca aaaccattgc 2340
ttcactaccc atcggtgtcc atttatagaa taatgtggga agaaacaaac ccgttttatg 2400
atttactcat tatcgccttt tgacagctgt gctgtaacac aagtagatgc ctgaacttga 2460
attaatccac acatcagtaa tgtattctat ctctctttac attttggtct ctatactaca 2520
ttattaatgg gttttgtgta ctgtaaagaa tttagctgta tcaaactagt gcatgaatag 2580
attctctcct gattatttat cacatagccc cttagccagt tgtatattat tcttgtggtt 2640
tgtgacccaa ttaagtccta ctttacatat gctttaagaa tcgatggggg atgcttcatg 2700
tgaacgtggg agttcagctg cttctcttgc ctaagtattc ctttcctgat cactatgcat 2760
tttaaagtta aacattttta agtatttcag atgctttaga gagatttttt ttccatgact 2820
gcattttact gtacagattg ctgcttctgc tatatttgtg atataggaat taagaggata 2880
cacacgtttg tttcttcgtg cctgttttat gtgcacacat taggcattga gacttcaagc 2940
ttttcttttt ttgtccacgt atctttgggt ctttgataaa gaaaagaatc cctgttcatt 3000
gtaagcactt ttacggggcg ggtggggagg ggtgctctgc tggtcttcaa ttaccaagaa 3060
ttctccaaaa caattttctg caggatgatt gtacagaatc attgcttatg acatgatcgc 3120
tttctacact gtattacata aataaattaa ataaaataac cccgggcaag acttttcttt 3180
gaaggatgac tacagacatt aaataatcga agtaattttg ggtggggaga agaggcagat 3240
tcaattttct ttaaccagtc tgaagtttca tttatgatac aaaagaagat gaaaatggaa 3300
gtggcaatat aaggggatga ggaaggcatg cctggacaaa cccttctttt aagatgtgtc 3360
ttcaatttgt ataaaatggt gttttcatgt aaataaatac attcttggag gagca 3415
<![CDATA[<210> 506]]>
<![CDATA[<211> 19]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 506]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu
<![CDATA[<21]]>0> 507]]>
<br/><![CDATA[<211> 43]]>
<br/><![CDATA[<212> PRT]]>
<br/><![CDATA[<213> 人類]]>
<br/>
<br/><![CDATA[<400> 507]]>
<br/>
<br/><![CDATA[Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro
1 5 10 15
Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr
20 25 30
His Pro His Phe Val Ile Pro Tyr Arg Cys Leu
35 40
<![CDATA[<210> 508]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 508]]>
Gly Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe
1 5 10 15
Leu His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His
20 25 30
Thr Val Ala Lys Glu
35
<![CDATA[<210> 509]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 509]]>
Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu
1 5 10 15
Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro
20 25 30
Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp
35 40 45
Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly
50 55 60
<![CDATA[<210> 510]]>
<![CDATA[<211> 67]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 510]]>
Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val
1 5 10 15
Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu
20 25 30
Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr
35 40 45
Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu
50 55 60
Val Val Arg
65
<![CDATA[<210> 511]]>
<![CDATA[<211> 55]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 511]]>
Val Cys Ser Glu Gln Ala Glu Thr Gly Pro Cys Arg Ala Met Ile Ser
1 5 10 15
Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe Tyr
20 25 30
Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr Cys
35 40 45
Met Ala Val Cys Gly Ser Ala
50 55
<![CDATA[<210> 512]]>
<![CDATA[<211> 18]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 512]]>
Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp Pro
1 5 10 15
Val Lys
<![CDATA[<210> 513]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 513]]>
Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu
1 5 10 15
Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys Glu
20 25 30
Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
35 40
<![CDATA[<210> 514]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 514]]>
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
1 5 10 15
Lys Ala Asp Lys Lys Ala Val Ile Gln
20 25
<![CDATA[<210> 515]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 515]]>
His Phe Gln Glu Lys Val Glu Ser Leu Glu Gln Glu Ala Ala Asn Glu
1 5 10 15
Arg Gln Gln Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu
20 25 30
Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr Ile Thr Ala Leu Gln
35 40 45
Ala Val Pro Pro Arg
50
<![CDATA[<210> 516]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 516]]>
Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln
1 5 10 15
Lys Asp Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val
20 25 30
Asp Pro Lys Lys Ala Ala Gln Ile Arg Ser Gln
35 40
<![CDATA[<210> 517]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 517]]>
Val Met Thr His Leu Arg Val Ile Tyr Glu Arg Met Asn Gln Ser Leu
1 5 10 15
Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu Ile Gln Asp Glu
20 25 30
Val
<![CDATA[<210> 518]]>
<![CDATA[<211> 73]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 518]]>
Glu Leu Leu Gln Lys Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn
1 5 10 15
Met Ile Ser Glu Pro Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro
20 25 30
Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly
35 40 45
Glu Phe Ser Leu Asp Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp
50 55 60
Ser Val Pro Ala Asn Thr Glu Asn Glu
65 70
<![CDATA[<210> 519]]>
<![CDATA[<211> 17]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 519]]>
Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg
1 5 10 15
Pro
<![CDATA[<210> 520]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 520]]>
Ser Gly Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser Glu Val Lys Met
1 5 10 15
Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gln Lys
20 25 30
Leu
<![CDATA[<210> 521]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 521]]>
Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala Ile Ile Gly
1 5 10 15
Leu Met Val Gly Gly Val Val Ile Ala Thr Val Ile Val Ile Thr Leu
20 25 30
Val Met Leu Lys Lys Lys Gln Tyr Thr Ser Ile His His Gly Val Val
35 40 45
Glu
<![CDATA[<210> 522]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 522]]>
Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gln
1 5 10 15
Gln Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gln Met Gln
20 25 30
Asn
<![CDATA[<210> 523]]>
<![CDATA[<211> 714]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 523]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala
20 25 30
Asn Gln Pro Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln
35 40 45
Cys Lys Thr His Pro His Phe Val Ile Pro Tyr Arg Cys Leu Val Gly
50 55 60
Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe Leu
65 70 75 80
His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His Thr
85 90 95
Val Ala Lys Glu Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr
100 105 110
Gly Met Leu Leu Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe
115 120 125
Val Cys Cys Pro Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp
130 135 140
Ala Glu Glu Asp Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp
145 150 155 160
Tyr Ala Asp Gly Ser Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu
165 170 175
Glu Val Ala Glu Val Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp
180 185 190
Glu Asp Gly Asp Glu Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu
195 200 205
Ala Thr Glu Arg Thr Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr
210 215 220
Glu Ser Val Glu Glu Val Val Arg Glu Val Cys Ser Glu Gln Ala Glu
225 230 235 240
Thr Gly Pro Cys Arg Ala Met Ile Ser Arg Trp Tyr Phe Asp Val Thr
245 250 255
Glu Gly Lys Cys Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg
260 265 270
Asn Asn Phe Asp Thr Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala
275 280 285
Met Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp
290 295 300
Pro Val Lys Leu Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp
305 310 315 320
Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln
325 330 335
Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
340 345 350
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
355 360 365
Lys Ala Asp Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu
370 375 380
Ser Leu Glu Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr
385 390 395 400
His Met Ala Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala
405 410 415
Leu Glu Asn Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg
420 425 430
His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp
435 440 445
Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro
450 455 460
Lys Lys Ala Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val
465 470 475 480
Ile Tyr Glu Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro
485 490 495
Ala Val Ala Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys
500 505 510
Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro
515 520 525
Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr
530 535 540
Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp
545 550 555 560
Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn
565 570 575
Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg
580 585 590
Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr Asn Ile Lys Thr Glu
595 600 605
Glu Ile Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly
610 615 620
Tyr Glu Val His His Gln Lys Leu Val Phe Phe Ala Glu Asp Val Gly
625 630 635 640
Ser Asn Lys Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile
645 650 655
Ala Thr Val Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr
660 665 670
Thr Ser Ile His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro
675 680 685
Glu Glu Arg His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu Asn Pro
690 695 700
Thr Tyr Lys Phe Phe Glu Gln Met Gln Asn
705 710
<![CDATA[<210> 524]]>
<![CDATA[<211> 1945]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 524]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggaggtc taccctgaac tgcagatcac caatgtggta 240
gaagccaacc aaccagtgac catccagaac tggtgcaagc ggggccgcaa gcagtgcaag 300
acccatcccc actttgtgat tccctaccgc tgcttagttg gtgagtttgt aagtgatgcc 360
cttctcgttc ctgacaagtg caaattctta caccaggaga ggatggatgt ttgcgaaact 420
catcttcact ggcacaccgt cgccaaagag acatgcagtg agaagagtac caacttgcat 480
gactacggca tgttgctgcc ctgcggaatt gacaagttcc gaggggtaga gtttgtgtgt 540
tgcccactgg ctgaagaaag tgacaatgtg gattctgctg atgcggagga ggatgactcg 600
gatgtctggt ggggcggagc agacacagac tatgcagatg ggagtgaaga caaagtagta 660
gaagtagcag aggaggaaga agtggctgag gtggaagaag aagaagccga tgatgacgag 720
gacgatgagg atggtgatga ggtagaggaa gaggctgagg aaccctacga agaagccaca 780
gagagaacca ccagcattgc caccaccacc accaccacca cagagtctgt ggaagaggtg 840
gttcgagagg tgtgctctga acaagccgag acggggccgt gccgagcaat gatctcccgc 900
tggtactttg atgtgactga agggaagtgt gccccattct tttacggcgg atgtggcggc 960
aaccggaaca actttgacac agaagagtac tgcatggccg tgtgtggcag cgccatgtcc 1020
caaagtttac tcaagactac ccaggaacct cttgcccgag atcctgttaa acttcctaca 1080
acagcagcca gtacccctga tgccgttgac aagtatctcg agacacctgg ggatgagaat 1140
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagagaatg 1200
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1260
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 1320
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1380
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1440
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1500
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1560
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1620
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1680
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1740
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1800
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1860
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1920
gaccgaggac tgaccactcg accag 1945
<![CDATA[<210> 525]]>
<![CDATA[<211> 598]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 525]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala
20 25 30
Asn Gln Pro Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln
35 40 45
Cys Lys Thr His Pro His Phe Val Ile Pro Tyr Arg Cys Leu Val Gly
50 55 60
Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe Leu
65 70 75 80
His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His Thr
85 90 95
Val Ala Lys Glu Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr
100 105 110
Gly Met Leu Leu Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe
115 120 125
Val Cys Cys Pro Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp
130 135 140
Ala Glu Glu Asp Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp
145 150 155 160
Tyr Ala Asp Gly Ser Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu
165 170 175
Glu Val Ala Glu Val Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp
180 185 190
Glu Asp Gly Asp Glu Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu
195 200 205
Ala Thr Glu Arg Thr Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr
210 215 220
Glu Ser Val Glu Glu Val Val Arg Glu Val Cys Ser Glu Gln Ala Glu
225 230 235 240
Thr Gly Pro Cys Arg Ala Met Ile Ser Arg Trp Tyr Phe Asp Val Thr
245 250 255
Glu Gly Lys Cys Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg
260 265 270
Asn Asn Phe Asp Thr Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala
275 280 285
Met Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp
290 295 300
Pro Val Lys Leu Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp
305 310 315 320
Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln
325 330 335
Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
340 345 350
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
355 360 365
Lys Ala Asp Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu
370 375 380
Ser Leu Glu Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr
385 390 395 400
His Met Ala Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala
405 410 415
Leu Glu Asn Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Arg Pro Arg
420 425 430
His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp
435 440 445
Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro
450 455 460
Lys Lys Ala Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val
465 470 475 480
Ile Tyr Glu Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro
485 490 495
Ala Val Ala Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys
500 505 510
Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro
515 520 525
Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr
530 535 540
Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp
545 550 555 560
Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn
565 570 575
Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg
580 585 590
Gly Leu Thr Thr Arg Pro
595
<![CDATA[<210> 526]]>
<![CDATA[<211> 96]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 526]]>
aagctggctg cccccttgat ttggagcgag caggcctgac tccagaggtt cagaagatta 60
ttgctgatgt tatccgaaac cctcccgtca actcag 96
<![CDATA[<210> 527]]>
<![CDATA[<211> 87]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 527]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[<210> 528]]>
<![CDATA[<211> 182]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 528]]>
aagctggctg cccccttgat ttggagcgag caggcctgac tccagaggtt cagaagatta 60
ttgctgatgt tatccgaaac cctcccgtca actcagctac atctacatcc accactggga 120
caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat ggaggggagt 180
gc 182
<![CDATA[<210> 529]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 529]]>
Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu Thr Pro Glu Val
1 5 10 15
Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro Pro Val Asn Ser Ala
20 25 30
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
35 40 45
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
<![CDATA[<210> 530]]>
<![CDATA[<211> 164]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 530]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cggg 164
<![CDATA[<210> 531]]>
<![CDATA[<211> 172]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 531]]>
ttttctttca gatattgttt tgtatttacc catgaagaca ttgttttttg gactctgcaa 60
ataggacatt tcaaagatga gtgaaaaaaa attggaaaca actgcacagc agcggaaatg 120
tcctgaatgg atgaatgtgc agaataaaag atgtgctgta gaagaaagaa ag 172
<![CDATA[<210> 532]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 532]]>
gcatgtgttc ggaagagtgt ttttgaagat gacctcccct tcttagaatt cactggatcc 60
attgtgtata gttacgatgc tagtgattgc tctttcctgt cagaagatat tag 113
<![CDATA[<210> 533]]>
<![CDATA[<211> 146]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 533]]>
catgagtcta tcagatgggg atgtggtggg atttgacatg gagtggccac cattatacaa 60
tagagggaaa cttggcaaag ttgcactaat tcagttgtgt gtttctgaga gcaaatgtta 120
cttgttccac gtttcttcca tgtcag 146
<![CDATA[<210> 534]]>
<![CDATA[<211> 149]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 534]]>
tttttcccca gggattaaaa atgttgcttg aaaataaagc agttaaaaag gcaggtgtag 60
gaattgaagg agatcagtgg aaacttctac gtgactttga tatcaaattg aagaattttg 120
tggagttgac agatgttgcc aataaaaag 149
<![CDATA[<210> 535]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 535]]>
ctgaaatgca cagagacctg gagccttaac agtctggtta aacacctctt aggtaaacag 60
ctcctgaaag acaagtctat ccgctgtagc aattggagta aatttcctct cactgaggac 120
cagaaactgt atgcagccac tgatgcttat 150
<![CDATA[<210> 536]]>
<![CDATA[<211> 70]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 536]]>
gctggtttta ttatttaccg aaatttagag attttggatg atactgtgca aaggtttgct 60
ataaataaag 70
<![CDATA[<210> 537]]>
<![CDATA[<211> 115]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 537]]>
aggaagaaat cctacttagc gacatgaaca aacagttgac ttcaatctct gaggaagtga 60
tggatctggc taagcatctt cctcatgctt tcagtaaatt ggaaaaccca cggag 115
<![CDATA[<210> 538]]>
<![CDATA[<211> 430]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 538]]>
ggtttctatc ttactaaagg atatttcaga aaatctatat tcactgagga ggatgataat 60
tgggtctact aacattgaga ctgaactgag gcccagcaat aatttaaact tattatcctt 120
tgaagattca actactgggg gagtacaaca gaaacaaatt agagaacatg aagttttaat 180
tcacgttgaa gatgaaacat gggacccaac acttgatcat ttagctaaac atgatggaga 240
agatgtactt ggaaataaag tggaacgaaa agaagatgga tttgaagatg gagtagaaga 300
caacaaattg aaagagaata tggaaagagc ttgtttgatg tcgttagata ttacagaaca 360
tgaactccaa attttggaac agcagtctca ggaagaatat cttagtgata ttgcttataa 420
atctactgag 430
<![CDATA[<210> 539]]>
<![CDATA[<211> 81]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 539]]>
catttatctc ccaatgataa tgaaaacgat acgtcctatg taattgagag tgatgaagat 60
ttagaaatgg agatgcttaa g 81
<![CDATA[<210> 540]]>
<![CDATA[<211> 81]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 540]]>
catttatctc ccaatgataa tgaaaacgat acgtcctatg taattgagag tgatgaagat 60
ttagaaatgg agatgcttaa g 81
<![CDATA[<210> 541]]>
<![CDATA[<211> 145]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 541]]>
tctttagaaa acctcaatag tggcacggta gaaccaactc attctaaatg cttaaaaatg 60
gaaagaaatc tgggtcttcc tactaaagaa gaagaagaag atgatgaaaa tgaagctaat 120
gaaggggaag aagatgatga taagg 145
<![CDATA[<210> 542]]>
<![CDATA[<211> 76]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 542]]>
actttttgtg gccagcaccc aatgaagagc aagttacttg cctcaagatg tactttggcc 60
attccagttt taaacc 76
<![CDATA[<210> 543]]>
<![CDATA[<211> 68]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 543]]>
agttcagtgg aaagtgattc attcagtatt agaagaaaga agagataatg ttgctgtcat 60
ggcaactg 68
<![CDATA[<210> 544]]>
<![CDATA[<211> 109]]>
<![CDATA[<212> DNA]]>
<![CDATA[<21]]>3> 人類]]>
<br/>
<br/><![CDATA[<400> 544]]>
<br/><![CDATA[gatatggaaa gagtttgtgc ttccagtatc cacctgttta tgtaggcaag attggccttg 60
ttatctctcc ccttatttct ctgatggaag accaagtgct acagcttaa 109
<![CDATA[<210> 545]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 545]]>
aatgtccaac atcccagctt gcttccttgg atcagcacag tcagaaaatg ttctaacaga 60
tattaaatt 69
<![CDATA[<210> 546]]>
<![CDATA[<211> 83]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 546]]>
aggtaaatac cggattgtat acgtaactcc agaatactgt tcaggtaaca tgggcctgct 60
ccagcaactt gaggctgata ttg 83
<![CDATA[<210> 547]]>
<![CDATA[<211> 107]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 547]]>
gtatcacgct cattgctgtg gatgaggctc actgtatttc tgagtggggg catgatttta 60
gggattcatt caggaagttg ggctccctaa agacagcact gccaatg 107
<![CDATA[<210> 548]]>
<![CDATA[<211> 185]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 548]]>
gttccaatcg ttgcacttac tgctactgca agttcttcaa tccgggaaga cattgtacgt 60
tgcttaaatc tgagaaatcc tcagatcacc tgtactggtt ttgatcgacc aaacctgtat 120
ttagaagtta ggcgaaaaac agggaatatc cttcaggatc tgcagccatt tcttgtcaaa 180
acaag 185
<![CDATA[<210> 549]]>
<![CDATA[<211> 175]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 549]]>
ttcccactgg gaatttgaag gtccaacaat catctactgt ccttctagaa aaatgacaca 60
acaagttaca ggtgaactta ggaaactgaa tctatcctgt ggaacatacc atgcgggcat 120
gagttttagc acaaggaaag acattcatca taggtttgta agagatgaaa ttcag 175
<![CDATA[<210> 550]]>
<![CDATA[<211> 182]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 550]]>
tgtgtcatag ctaccatagc ttttggaatg ggcattaata aagctgacat tcgccaagtc 60
attcattacg gtgctcctaa ggacatggaa tcatattatc aggagattgg tagagctggt 120
cgtgatggac ttcaaagttc ttgtcacgtc ctctgggctc ctgcagacat taacttaaat 180
ag 182
<![CDATA[<210> 551]]>
<![CDATA[<211> 102]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 551]]>
gcaccttctt actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc 60
aaagatggaa aaatatcttc attctagcag atgtaggaga ca 102
<![CDATA[<210> 552]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 552]]>
aatcatcttg tctcattttg aggacaaaca agtacaaaaa gcctccttgg gaattatggg 60
aactgaaaaa tgctgtgata attgcaggtc cag 93
<![CDATA[<210> 553]]>
<![CDATA[<211> 142]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 553]]>
attggatcat tgctattcca tggatgactc agaggataca tcctgggact ttggtccaca 60
agcatttaag cttttgtctg ctgtggacat cttaggcgaa aaatttggaa ttgggcttcc 120
aattttattt ctccgaggat ct 142
<![CDATA[<210> 554]]>
<![CDATA[<211> 171]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 554]]>
aattctcagc gtcttgccga tcaatatcgc aggcacagtt tatttggcac tggcaaggat 60
caaacagaga gttggtggaa ggctttttcc cgtcagctga tcactgaggg attcttggta 120
gaagtttctc ggtataacaa atttatgaag atttgcgccc ttacgaaaaa g 171
<![CDATA[<210> 555]]>
<![CDATA[<211> 95]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 555]]>
ggtagaaatt ggcttcataa agctaataca gaatctcaga gcctcatcct tcaagctaat 60
gaagaattgt gtccaaagaa gttgcttctg cctag 95
<![CDATA[<210> 556]]>
<![CDATA[<211> 76]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 556]]>
ttcgaaaact gtatcttcgg gcaccaaaga gcattgttat aatcaagtac cagttgaatt 60
aagtacagag aagaag 76
<![CDATA[<210> 557]]>
<![CDATA[<211> 74]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 557]]>
tctaacttgg agaagttata ttcttataaa ccatgtgata agatttcttc tgggagtaac 60
atttctaaaa aaag 74
<![CDATA[<210> 558]]>
<![CDATA[<211> 76]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 558]]>
tatcatggta cagtcaccag aaaaagctta cagttcctca cagcctgtta tttcggcaca 60
agagcaggag actcag 76
<![CDATA[<210> 559]]>
<![CDATA[<211> 113]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 559]]>
attgtgttat atggcaaatt ggtagaagct aggcagaaac atgccaataa aatggatgtt 60
cccccagcta ttctggcaac aaacaagata ctggtggata tggccaaaat gag 113
<![CDATA[<210> 560]]>
<![CDATA[<211> 115]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 560]]>
accaactacg gttgaaaacg taaaaaggat tgatggtgtt tctgaaggca aagctgccat 60
gttggcccct ctgttggaag tcatcaaaca tttctgccaa acaaatagtg ttcag 115
<![CDATA[<210> 561]]>
<![CDATA[<211> 132]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 561]]>
acagacctct tttcaagtac aaaacctcaa gaagaacaga agacgagtct ggtagcaaaa 60
aataaaatat gcacactttc acagtctatg gccatcacat actctttatt ccaagaaaag 120
aagatgcctt tg 132
<![CDATA[<210> 562]]>
<![CDATA[<211> 163]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 562]]>
aagagcatag ctgagagcag gattctgcct ctcatgacaa ttggcatgca cttatcccaa 60
gcggtgaaag ctggctgccc ccttgatttg gagcgagcag gcctgactcc agaggttcag 120
aagattattg ctgatgttat ccgaaaccct cccgtcaact cag 163
<![CDATA[<210> 563]]>
<![CDATA[<211> 209]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 563]]>
atatgagtaa aattagccta atcagaatgt tagttcctga aaacattgac acgtacctta 60
tccacatggc aattgagatc cttaaacatg gtcctgacag cggacttcaa ccttcatgtg 120
atgtcaacaa aaggagatgt tttcccggtt ctgaagagat ctgttcaagt tctaagagaa 180
gcaaggaaga agtaggcatc aatactgag 209
<![CDATA[<210> 564]]>
<![CDATA[<211> 3144]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 564]]>
acttcatctg cagagagaaa gagacgatta cctgtgtggt ttgccaaagg aagtgatacc 60
agcaagaaat taatggacaa aacgaaaagg ggaggtcttt ttagttaagc tggcaattac 120
cagaacaatt atgtttcttg ctgtattata agaggatagc tatattttat ttctgaagag 180
taaggagtag tattttggct taaaaatcat tctaattaca aagttcactg tttattgaag 240
aactggcatc ttaaatcagc cttccgcaat tcatgtagtt tctgggtctt ctgggagcct 300
acgtgagtac atcacctaac agaatattaa attagacttc ctgtaagatt gctttaagaa 360
actgttactg tcctgttttc taatctcttt attaaaacag tgtatttgga aaatgttatg 420
tgctctgatt tgatatagat aacagattag tagttacatg gtaattatgt gatataaaat 480
attcatatat tatcaaaatt ctgttttgta aatgtaagaa agcatagtta ttttacaaat 540
tgtttttact gtcttttgaa gaagttctta aatacgttgt taaatggtat tagttgacca 600
gggcagtgaa aatgaaaccg cattttgggt gccattaaat agggaaaaaa catgtaaaaa 660
atgtaaaatg gagaccaatt gcactaggca agtgtatatt ttgtatttta tatacaattt 720
ctattatttt tcaagtaata aaacaatgtt tttcatactg aatattatat atatattttt 780
tagctttcat ttacttaatt attttaagta cctttatttt tccaggatgt cagaatttga 840
ttctaatctc tcttatgtag cacatgtgac ttaatttaaa acctatactg tgacacagag 900
ttgggtaaac gatgattatt taactttaag cagttcacca tccatttcaa agcctttgat 960
tggctttttt gtaaataaaa ataacttgtt aagaaacaaa tatatctgtc atagaagaac 1020
tagaaaatcc agggaagtga gaaaaatgaa aataaaaatc attcatagtt ttactagtag 1080
ctaatcacag tcaacctctt ttgtgtatcc caccagactt ttttatattc atttgttttt 1140
agttaaaata taaaagtctc gtatattccc atttttctgc attgcattac cagaaggtag 1200
tggcgcctat taaatatgtg atatgttgtt gtccagccat ggcttctgca tttgcatgct 1260
tttgtgtgtg catctgcaat accctgtgaa tatcctgtgt gatggagtgg caagtacgca 1320
cagacacgtc tgctgcatgc ctaggtacga ggctgtctcc aggagaagca cttgtttgat 1380
tatttgagtt gccaattgaa tttgctgctt tttttcatgg cttgccattt tcactgaaaa 1440
gaatgactaa tgaaaaacga tgattggtta ttagatttgg atgtttggca gacattttct 1500
caaaattgaa ctaagttggc ctcttcacgg aaaacaactg gtatttgttg tgccaatgat 1560
aaaattggag atttctagca aaatgtataa ttttggaaaa gttgtgttcc tccactggaa 1620
gcttgacagc tttccttaac ataaagactt ctctttctct tcgctttcac tactactact 1680
actaattctt cttctgattc ttcttcttct ccttcttcct tcttccttcc ttcctcctcc 1740
tcctccttct tcttcctctt cctcttcttc tttctctctt tccttccttc ccttcccttc 1800
cccttccttc cttccttcct tccttcctcc ctccctccct ccctccctcc ctccctcctt 1860
tctttttctt tctctttctt tctttctttc tctctctctc tctctttctt tctttttctt 1920
tctctttttc tttctttcaa gcagtcctcc cgcctcagtc ccccaaaata gtgggattac 1980
aggtgtgagc caccatgcac agccttacat aaagcctttt ctaatgagat ggatagtaat 2040
taacaaatgt gagtttttga tattatataa agattttttc tgtgtttcga agatccgtat 2100
aactcagtga atcagtatgt tctggatgac taatatgtga tgttaagaaa tcatgactga 2160
ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggcgga 2220
tcacgagatc aggagatcga gaccaccctg gccaacatgg tgaaaccccg tctctactaa 2280
aaatacaaaa attagctggg tgtgttggtg cgtgcctata atcccagcta ctcgggaggc 2340
tgaggcagga gaatcgcttg aactcaggag gcggagattg cagtgagctg agactgcgcc 2400
actgcacccc agcctggcga cagagcaaga ctccgtctca aaaataaaaa aagaaatcat 2460
gactgggtaa aagatctgtt cagagtacaa gatggaccaa tggatttgat atatttgaat 2520
ataacagagt atgaaaaagt ttattgatat agtttcagat tacacactgc aactaatctt 2580
taagaaacta ttacttgtcc actttttggt aaaatttcag agaacaatgt ccaccattat 2640
ctgaacaggc tattaaaata ctcttctctt ttccaactac gtgcctgtgc aaagtcagat 2700
ttttttcata tacttcagcc aaaacagcat atcaaaatgg attgaatgca gaagtagatc 2760
tgagaataca gccacttttg ttaagccaga caatgagatt tgcaaaatgt aaacaatgct 2820
gctgttctca gtttttaaaa atatgttttt taaaagtatt tatgttaatg tgtacttggt 2880
ttactactgc tatttttaaa taaaacaaga aacattttta aatgtctgtt ttaatttcta 2940
aagtggtagt gatagatata acccatatta ataaaagctc tttggggtcc tcagtgattt 3000
ttttttaaga gtatggaagg gttctcagac ctaagagatt gagaaatgct gatgtaatgt 3060
tttattataa aggtgtacca tgaattatgt accttacttc atattgttgg acattaaagt 3120
tgctttcagt ttttttgttt taaa 3144
<![CDATA[<210> 565]]>
<![CDATA[<211> 7575]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> WRN之全mRNA多核苷酸序列]]>
<![CDATA[<400> 565]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cgggttttct ttcagatatt 180
gttttgtatt tacccatgaa gacattgttt tttggactct gcaaatagga catttcaaag 240
atgagtgaaa aaaaattgga aacaactgca cagcagcgga aatgtcctga atggatgaat 300
gtgcagaata aaagatgtgc tgtagaagaa agaaaggcat gtgttcggaa gagtgttttt 360
gaagatgacc tccccttctt agaattcact ggatccattg tgtatagtta cgatgctagt 420
gattgctctt tcctgtcaga agatattagc atgagtctat cagatgggga tgtggtggga 480
tttgacatgg agtggccacc attatacaat agagggaaac ttggcaaagt tgcactaatt 540
cagttgtgtg tttctgagag caaatgttac ttgttccacg tttcttccat gtcagttttt 600
ccccagggat taaaaatgtt gcttgaaaat aaagcagtta aaaaggcagg tgtaggaatt 660
gaaggagatc agtggaaact tctacgtgac tttgatatca aattgaagaa ttttgtggag 720
ttgacagatg ttgccaataa aaagctgaaa tgcacagaga cctggagcct taacagtctg 780
gttaaacacc tcttaggtaa acagctcctg aaagacaagt ctatccgctg tagcaattgg 840
agtaaatttc ctctcactga ggaccagaaa ctgtatgcag ccactgatgc ttatgctggt 900
tttattattt accgaaattt agagattttg gatgatactg tgcaaaggtt tgctataaat 960
aaagaggaag aaatcctact tagcgacatg aacaaacagt tgacttcaat ctctgaggaa 1020
gtgatggatc tggctaagca tcttcctcat gctttcagta aattggaaaa cccacggagg 1080
gtttctatct tactaaagga tatttcagaa aatctatatt cactgaggag gatgataatt 1140
gggtctacta acattgagac tgaactgagg cccagcaata atttaaactt attatccttt 1200
gaagattcaa ctactggggg agtacaacag aaacaaatta gagaacatga agttttaatt 1260
cacgttgaag atgaaacatg ggacccaaca cttgatcatt tagctaaaca tgatggagaa 1320
gatgtacttg gaaataaagt ggaacgaaaa gaagatggat ttgaagatgg agtagaagac 1380
aacaaattga aagagaatat ggaaagagct tgtttgatgt cgttagatat tacagaacat 1440
gaactccaaa ttttggaaca gcagtctcag gaagaatatc ttagtgatat tgcttataaa 1500
tctactgagc atttatctcc caatgataat gaaaacgata cgtcctatgt aattgagagt 1560
gatgaagatt tagaaatgga gatgcttaag catttatctc ccaatgataa tgaaaacgat 1620
acgtcctatg taattgagag tgatgaagat ttagaaatgg agatgcttaa gtctttagaa 1680
aacctcaata gtggcacggt agaaccaact cattctaaat gcttaaaaat ggaaagaaat 1740
ctgggtcttc ctactaaaga agaagaagaa gatgatgaaa atgaagctaa tgaaggggaa 1800
gaagatgatg ataaggactt tttgtggcca gcacccaatg aagagcaagt tacttgcctc 1860
aagatgtact ttggccattc cagttttaaa ccagttcagt ggaaagtgat tcattcagta 1920
ttagaagaaa gaagagataa tgttgctgtc atggcaactg gatatggaaa gagtttgtgc 1980
ttccagtatc cacctgttta tgtaggcaag attggccttg ttatctctcc ccttatttct 2040
ctgatggaag accaagtgct acagcttaaa atgtccaaca tcccagcttg cttccttgga 2100
tcagcacagt cagaaaatgt tctaacagat attaaattag gtaaataccg gattgtatac 2160
gtaactccag aatactgttc aggtaacatg ggcctgctcc agcaacttga ggctgatatt 2220
ggtatcacgc tcattgctgt ggatgaggct cactgtattt ctgagtgggg gcatgatttt 2280
agggattcat tcaggaagtt gggctcccta aagacagcac tgccaatggt tccaatcgtt 2340
gcacttactg ctactgcaag ttcttcaatc cgggaagaca ttgtacgttg cttaaatctg 2400
agaaatcctc agatcacctg tactggtttt gatcgaccaa acctgtattt agaagttagg 2460
cgaaaaacag ggaatatcct tcaggatctg cagccatttc ttgtcaaaac aagttcccac 2520
tgggaatttg aaggtccaac aatcatctac tgtccttcta gaaaaatgac acaacaagtt 2580
acaggtgaac ttaggaaact gaatctatcc tgtggaacat accatgcggg catgagtttt 2640
agcacaagga aagacattca tcataggttt gtaagagatg aaattcagtg tgtcatagct 2700
accatagctt ttggaatggg cattaataaa gctgacattc gccaagtcat tcattacggt 2760
gctcctaagg acatggaatc atattatcag gagattggta gagctggtcg tgatggactt 2820
caaagttctt gtcacgtcct ctgggctcct gcagacatta acttaaatag gcaccttctt 2880
actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc aaagatggaa 2940
aaatatcttc attctagcag atgtaggaga caaatcatct tgtctcattt tgaggacaaa 3000
caagtacaaa aagcctcctt gggaattatg ggaactgaaa aatgctgtga taattgcagg 3060
tccagattgg atcattgcta ttccatggat gactcagagg atacatcctg ggactttggt 3120
ccacaagcat ttaagctttt gtctgctgtg gacatcttag gcgaaaaatt tggaattggg 3180
cttccaattt tatttctccg aggatctaat tctcagcgtc ttgccgatca atatcgcagg 3240
cacagtttat ttggcactgg caaggatcaa acagagagtt ggtggaaggc tttttcccgt 3300
cagctgatca ctgagggatt cttggtagaa gtttctcggt ataacaaatt tatgaagatt 3360
tgcgccctta cgaaaaaggg tagaaattgg cttcataaag ctaatacaga atctcagagc 3420
ctcatccttc aagctaatga agaattgtgt ccaaagaagt tgcttctgcc tagttcgaaa 3480
actgtatctt cgggcaccaa agagcattgt tataatcaag taccagttga attaagtaca 3540
gagaagaagt ctaacttgga gaagttatat tcttataaac catgtgataa gatttcttct 3600
gggagtaaca tttctaaaaa aagtatcatg gtacagtcac cagaaaaagc ttacagttcc 3660
tcacagcctg ttatttcggc acaagagcag gagactcaga ttgtgttata tggcaaattg 3720
gtagaagcta ggcagaaaca tgccaataaa atggatgttc ccccagctat tctggcaaca 3780
aacaagatac tggtggatat ggccaaaatg agaccaacta cggttgaaaa cgtaaaaagg 3840
attgatggtg tttctgaagg caaagctgcc atgttggccc ctctgttgga agtcatcaaa 3900
catttctgcc aaacaaatag tgttcagaca gacctctttt caagtacaaa acctcaagaa 3960
gaacagaaga cgagtctggt agcaaaaaat aaaatatgca cactttcaca gtctatggcc 4020
atcacatact ctttattcca agaaaagaag atgcctttga agagcatagc tgagagcagg 4080
attctgcctc tcatgacaat tggcatgcac ttatcccaag cggtgaaagc tggctgcccc 4140
cttgatttgg agcgagcagg cctgactcca gaggttcaga agattattgc tgatgttatc 4200
cgaaaccctc ccgtcaactc agatatgagt aaaattagcc taatcagaat gttagttcct 4260
gaaaacattg acacgtacct tatccacatg gcaattgaga tccttaaaca tggtcctgac 4320
agcggacttc aaccttcatg tgatgtcaac aaaaggagat gttttcccgg ttctgaagag 4380
atctgttcaa gttctaagag aagcaaggaa gaagtaggca tcaatactga gacttcatct 4440
gcagagagaa agagacgatt acctgtgtgg tttgccaaag gaagtgatac cagcaagaaa 4500
ttaatggaca aaacgaaaag gggaggtctt tttagttaag ctggcaatta ccagaacaat 4560
tatgtttctt gctgtattat aagaggatag ctatatttta tttctgaaga gtaaggagta 4620
gtattttggc ttaaaaatca ttctaattac aaagttcact gtttattgaa gaactggcat 4680
cttaaatcag ccttccgcaa ttcatgtagt ttctgggtct tctgggagcc tacgtgagta 4740
catcacctaa cagaatatta aattagactt cctgtaagat tgctttaaga aactgttact 4800
gtcctgtttt ctaatctctt tattaaaaca gtgtatttgg aaaatgttat gtgctctgat 4860
ttgatataga taacagatta gtagttacat ggtaattatg tgatataaaa tattcatata 4920
ttatcaaaat tctgttttgt aaatgtaaga aagcatagtt attttacaaa ttgtttttac 4980
tgtcttttga agaagttctt aaatacgttg ttaaatggta ttagttgacc agggcagtga 5040
aaatgaaacc gcattttggg tgccattaaa tagggaaaaa acatgtaaaa aatgtaaaat 5100
ggagaccaat tgcactaggc aagtgtatat tttgtatttt atatacaatt tctattattt 5160
ttcaagtaat aaaacaatgt ttttcatact gaatattata tatatatttt ttagctttca 5220
tttacttaat tattttaagt acctttattt ttccaggatg tcagaatttg attctaatct 5280
ctcttatgta gcacatgtga cttaatttaa aacctatact gtgacacaga gttgggtaaa 5340
cgatgattat ttaactttaa gcagttcacc atccatttca aagcctttga ttggcttttt 5400
tgtaaataaa aataacttgt taagaaacaa atatatctgt catagaagaa ctagaaaatc 5460
cagggaagtg agaaaaatga aaataaaaat cattcatagt tttactagta gctaatcaca 5520
gtcaacctct tttgtgtatc ccaccagact tttttatatt catttgtttt tagttaaaat 5580
ataaaagtct cgtatattcc catttttctg cattgcatta ccagaaggta gtggcgccta 5640
ttaaatatgt gatatgttgt tgtccagcca tggcttctgc atttgcatgc ttttgtgtgt 5700
gcatctgcaa taccctgtga atatcctgtg tgatggagtg gcaagtacgc acagacacgt 5760
ctgctgcatg cctaggtacg aggctgtctc caggagaagc acttgtttga ttatttgagt 5820
tgccaattga atttgctgct ttttttcatg gcttgccatt ttcactgaaa agaatgacta 5880
atgaaaaacg atgattggtt attagatttg gatgtttggc agacattttc tcaaaattga 5940
actaagttgg cctcttcacg gaaaacaact ggtatttgtt gtgccaatga taaaattgga 6000
gatttctagc aaaatgtata attttggaaa agttgtgttc ctccactgga agcttgacag 6060
ctttccttaa cataaagact tctctttctc ttcgctttca ctactactac tactaattct 6120
tcttctgatt cttcttcttc tccttcttcc ttcttccttc cttcctcctc ctcctccttc 6180
ttcttcctct tcctcttctt ctttctctct ttccttcctt cccttccctt ccccttcctt 6240
ccttccttcc ttccttcctc cctccctccc tccctccctc cctccctcct ttctttttct 6300
ttctctttct ttctttcttt ctctctctct ctctctttct ttctttttct ttctcttttt 6360
ctttctttca agcagtcctc ccgcctcagt cccccaaaat agtgggatta caggtgtgag 6420
ccaccatgca cagccttaca taaagccttt tctaatgaga tggatagtaa ttaacaaatg 6480
tgagtttttg atattatata aagatttttt ctgtgtttcg aagatccgta taactcagtg 6540
aatcagtatg ttctggatga ctaatatgtg atgttaagaa atcatgactg aggccgggcg 6600
cggtggctca cgcctgtaat cccagcactt tgggaggccg aggcgggcgg atcacgagat 6660
caggagatcg agaccaccct ggccaacatg gtgaaacccc gtctctacta aaaatacaaa 6720
aattagctgg gtgtgttggt gcgtgcctat aatcccagct actcgggagg ctgaggcagg 6780
agaatcgctt gaactcagga ggcggagatt gcagtgagct gagactgcgc cactgcaccc 6840
cagcctggcg acagagcaag actccgtctc aaaaataaaa aaagaaatca tgactgggta 6900
aaagatctgt tcagagtaca agatggacca atggatttga tatatttgaa tataacagag 6960
tatgaaaaag tttattgata tagtttcaga ttacacactg caactaatct ttaagaaact 7020
attacttgtc cactttttgg taaaatttca gagaacaatg tccaccatta tctgaacagg 7080
ctattaaaat actcttctct tttccaacta cgtgcctgtg caaagtcaga tttttttcat 7140
atacttcagc caaaacagca tatcaaaatg gattgaatgc agaagtagat ctgagaatac 7200
agccactttt gttaagccag acaatgagat ttgcaaaatg taaacaatgc tgctgttctc 7260
agtttttaaa aatatgtttt ttaaaagtat ttatgttaat gtgtacttgg tttactactg 7320
ctatttttaa ataaaacaag aaacattttt aaatgtctgt tttaatttct aaagtggtag 7380
tgatagatat aacccatatt aataaaagct ctttggggtc ctcagtgatt tttttttaag 7440
agtatggaag ggttctcaga cctaagagat tgagaaatgc tgatgtaatg ttttattata 7500
aaggtgtacc atgaattatg taccttactt catattgttg gacattaaag ttgctttcag 7560
tttttttgtt ttaaa 7575
<![CDATA[<210> 566]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 566]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
<![CDATA[<210> 567]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 567]]>
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
1 5 10 15
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
20 25 30
Leu Ser Glu Asp Ile
35
<![CDATA[<210> 568]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 568]]>
Met Ser Leu Ser Asp Gly Asp Val Val Gly Phe Asp Met Glu Trp Pro
1 5 10 15
Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys Val Ala Leu Ile Gln Leu
20 25 30
Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe His Val Ser Ser Met Ser
35 40 45
<![CDATA[<210> 569]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 569]]>
Phe Pro Gln Gly Leu Lys Met Leu Leu Glu Asn Lys Ala Val Lys Lys
1 5 10 15
Ala Gly Val Gly Ile Glu Gly Asp Gln Trp Lys Leu Leu Arg Asp Phe
20 25 30
Asp Ile Lys Leu Lys Asn Phe Val Glu Leu Thr Asp Val Ala Asn Lys
35 40 45
Lys
<![CDATA[<210> 570]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 570]]>
Leu Lys Cys Thr Glu Thr Trp Ser Leu Asn Ser Leu Val Lys His Leu
1 5 10 15
Leu Gly Lys Gln Leu Leu Lys Asp Lys Ser Ile Arg Cys Ser Asn Trp
20 25 30
Ser Lys Phe Pro Leu Thr Glu Asp Gln Lys Leu Tyr Ala Ala Thr Asp
35 40 45
Ala Tyr
50
<![CDATA[<210> 571]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 571]]>
Ala Gly Phe Ile Ile Tyr Arg Asn Leu Glu Ile Leu Asp Asp Thr Val
1 5 10 15
Gln Arg Phe Ala Ile Asn Lys
20
<![CDATA[<210> 572]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 572]]>
Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser Ile Ser
1 5 10 15
Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe Ser Lys
20 25 30
Leu Glu Asn Pro Arg
35
<![CDATA[<210> 573]]>
<![CDATA[<211> 143]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 573]]>
Val Ser Ile Leu Leu Lys Asp Ile Ser Glu Asn Leu Tyr Ser Leu Arg
1 5 10 15
Arg Met Ile Ile Gly Ser Thr Asn Ile Glu Thr Glu Leu Arg Pro Ser
20 25 30
Asn Asn Leu Asn Leu Leu Ser Phe Glu Asp Ser Thr Thr Gly Gly Val
35 40 45
Gln Gln Lys Gln Ile Arg Glu His Glu Val Leu Ile His Val Glu Asp
50 55 60
Glu Thr Trp Asp Pro Thr Leu Asp His Leu Ala Lys His Asp Gly Glu
65 70 75 80
Asp Val Leu Gly Asn Lys Val Glu Arg Lys Glu Asp Gly Phe Glu Asp
85 90 95
Gly Val Glu Asp Asn Lys Leu Lys Glu Asn Met Glu Arg Ala Cys Leu
100 105 110
Met Ser Leu Asp Ile Thr Glu His Glu Leu Gln Ile Leu Glu Gln Gln
115 120 125
Ser Gln Glu Glu Tyr Leu Ser Asp Ile Ala Tyr Lys Ser Thr Glu
130 135 140
<![CDATA[<210> 574]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 574]]>
His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val Ile Glu
1 5 10 15
Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys
20 25
<![CDATA[<210> 575]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 575]]>
His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val Ile Glu
1 5 10 15
Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys
20 25
<![CDATA[<210> 576]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 576]]>
Ser Leu Glu Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys
1 5 10 15
Cys Leu Lys Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu
20 25 30
Glu Asp Asp Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys
35 40 45
<![CDATA[<210> 577]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 577]]>
Phe Leu Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met
1 5 10 15
Tyr Phe Gly His Ser Ser Phe Lys
20
<![CDATA[<210> 578]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 578]]>
Val Gln Trp Lys Val Ile His Ser Val Leu Glu Glu Arg Arg Asp Asn
1 5 10 15
Val Ala Val Met Ala Thr
20
<![CDATA[<210> 579]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 579]]>
Tyr Gly Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys
1 5 10 15
Ile Gly Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val
20 25 30
Leu Gln Leu
35
<![CDATA[<210> 580]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 580]]>
Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser Glu Asn
1 5 10 15
Val Leu Thr Asp Ile Lys
20
<![CDATA[<210> 581]]>
<![CDATA[<211> 27]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 581]]>
Gly Lys Tyr Arg Ile Val Tyr Val Thr Pro Glu Tyr Cys Ser Gly Asn
1 5 10 15
Met Gly Leu Leu Gln Gln Leu Glu Ala Asp Ile
20 25
<![CDATA[<210> 582]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 58]]>2
Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys Ile Ser Glu Trp Gly
1 5 10 15
His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly Ser Leu Lys Thr Ala
20 25 30
Leu Pro Met
35
<![CDATA[<210> 583]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 583]]>
Val Pro Ile Val Ala Leu Thr Ala Thr Ala Ser Ser Ser Ile Arg Glu
1 5 10 15
Asp Ile Val Arg Cys Leu Asn Leu Arg Asn Pro Gln Ile Thr Cys Thr
20 25 30
Gly Phe Asp Arg Pro Asn Leu Tyr Leu Glu Val Arg Arg Lys Thr Gly
35 40 45
Asn Ile Leu Gln Asp Leu Gln Pro Phe Leu Val Lys Thr
50 55 60
<![CDATA[<210> 584]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 584]]>
Ser His Trp Glu Phe Glu Gly Pro Thr Ile Ile Tyr Cys Pro Ser Arg
1 5 10 15
Lys Met Thr Gln Gln Val Thr Gly Glu Leu Arg Lys Leu Asn Leu Ser
20 25 30
Cys Gly Thr Tyr His Ala Gly Met Ser Phe Ser Thr Arg Lys Asp Ile
35 40 45
His His Arg Phe Val Arg Asp Glu Ile Gln
50 55
<![CDATA[<210> 585]]>
<![CDATA[<211> 60]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 585]]>
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
1 5 10 15
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
20 25 30
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
35 40 45
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn
50 55 60
<![CDATA[<210> 586]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 586]]>
His Leu Leu Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu
1 5 10 15
Lys Met Met Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg
20 25 30
Arg
<![CDATA[<210> 587]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 587]]>
Ile Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu
1 5 10 15
Gly Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser
20 25 30
<![CDATA[<210> 588]]>
<![CDATA[<211> 47]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 588]]>
Leu Asp His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp
1 5 10 15
Phe Gly Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly
20 25 30
Glu Lys Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser
35 40 45
<![CDATA[<210> 589]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 589]]>
Asn Ser Gln Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly
1 5 10 15
Thr Gly Lys Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln
20 25 30
Leu Ile Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe
35 40 45
Met Lys Ile Cys Ala Leu Thr Lys Lys
50 55
<![CDATA[<210> 590]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 590]]>
Gly Arg Asn Trp Leu His Lys Ala Asn Thr Glu Ser Gln Ser Leu Ile
1 5 10 15
Leu Gln Ala Asn Glu Glu Leu Cys Pro Lys Lys Leu Leu Leu Pro
20 25 30
<![CDATA[<210> 591]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 591]]>
Ser Lys Thr Val Ser Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val
1 5 10 15
Pro Val Glu Leu Ser Thr Glu Lys Lys
20 25
<![CDATA[<210> 592]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 592]]>
Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys Pro Cys Asp Lys Ile Ser
1 5 10 15
Ser Gly Ser Asn Ile Ser Lys Lys
20
<![CDATA[<210> 593]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 593]]>
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Gln Pro Val
1 5 10 15
Ile Ser Ala Gln Glu Gln Glu Thr Gln
20 25
<![CDATA[<210> 594]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 594]]>
Ile Val Leu Tyr Gly Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn
1 5 10 15
Lys Met Asp Val Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val
20 25 30
Asp Met Ala Lys Met
35
<![CDATA[<210> 595]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 595]]>
Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly Val Ser Glu Gly
1 5 10 15
Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val Ile Lys His Phe Cys
20 25 30
Gln Thr Asn Ser Val Gln
35
<![CDATA[<210> 596]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 596]]>
Thr Asp Leu Phe Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser
1 5 10 15
Leu Val Ala Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile
20 25 30
Thr Tyr Ser Leu Phe Gln Glu Lys Lys Met Pro Leu
35 40
<![CDATA[<210> 597]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 597]]>
Lys Ser Ile Ala Glu Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met
1 5 10 15
His Leu Ser Gln Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg
20 25 30
Ala Gly Leu Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg
35 40 45
Asn Pro Pro Val Asn Ser
50
<![CDATA[<210> 598]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 598]]>
Met Ser Lys Ile Ser Leu Ile Arg Met Leu Val Pro Glu Asn Ile Asp
1 5 10 15
Thr Tyr Leu Ile His Met Ala Ile Glu Ile Leu Lys His Gly Pro Asp
20 25 30
Ser Gly Leu Gln Pro Ser Cys Asp Val Asn Lys Arg Arg Cys Phe Pro
35 40 45
Gly Ser Glu Glu Ile Cys Ser Ser Ser Lys Arg Ser Lys Glu Glu Val
50 55 60
Gly Ile Asn Thr Glu
65
<![CDATA[<210> 599]]>
<![CDATA[<211> 35]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 599]]>
Thr Ser Ser Ala Glu Arg Lys Arg Arg Leu Pro Val Trp Phe Ala Lys
1 5 10 15
Gly Ser Asp Thr Ser Lys Lys Leu Met Asp Lys Thr Lys Arg Gly Gly
20 25 30
Leu Phe Ser
35
<![CDATA[<210> 600]]>
<![CDATA[<211> 1432]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 600]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
35 40 45
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
50 55 60
Leu Ser Glu Asp Ile Ser Met Ser Leu Ser Asp Gly Asp Val Val Gly
65 70 75 80
Phe Asp Met Glu Trp Pro Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys
85 90 95
Val Ala Leu Ile Gln Leu Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe
100 105 110
His Val Ser Ser Met Ser Val Phe Pro Gln Gly Leu Lys Met Leu Leu
115 120 125
Glu Asn Lys Ala Val Lys Lys Ala Gly Val Gly Ile Glu Gly Asp Gln
130 135 140
Trp Lys Leu Leu Arg Asp Phe Asp Ile Lys Leu Lys Asn Phe Val Glu
145 150 155 160
Leu Thr Asp Val Ala Asn Lys Lys Leu Lys Cys Thr Glu Thr Trp Ser
165 170 175
Leu Asn Ser Leu Val Lys His Leu Leu Gly Lys Gln Leu Leu Lys Asp
180 185 190
Lys Ser Ile Arg Cys Ser Asn Trp Ser Lys Phe Pro Leu Thr Glu Asp
195 200 205
Gln Lys Leu Tyr Ala Ala Thr Asp Ala Tyr Ala Gly Phe Ile Ile Tyr
210 215 220
Arg Asn Leu Glu Ile Leu Asp Asp Thr Val Gln Arg Phe Ala Ile Asn
225 230 235 240
Lys Glu Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser
245 250 255
Ile Ser Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe
260 265 270
Ser Lys Leu Glu Asn Pro Arg Arg Val Ser Ile Leu Leu Lys Asp Ile
275 280 285
Ser Glu Asn Leu Tyr Ser Leu Arg Arg Met Ile Ile Gly Ser Thr Asn
290 295 300
Ile Glu Thr Glu Leu Arg Pro Ser Asn Asn Leu Asn Leu Leu Ser Phe
305 310 315 320
Glu Asp Ser Thr Thr Gly Gly Val Gln Gln Lys Gln Ile Arg Glu His
325 330 335
Glu Val Leu Ile His Val Glu Asp Glu Thr Trp Asp Pro Thr Leu Asp
340 345 350
His Leu Ala Lys His Asp Gly Glu Asp Val Leu Gly Asn Lys Val Glu
355 360 365
Arg Lys Glu Asp Gly Phe Glu Asp Gly Val Glu Asp Asn Lys Leu Lys
370 375 380
Glu Asn Met Glu Arg Ala Cys Leu Met Ser Leu Asp Ile Thr Glu His
385 390 395 400
Glu Leu Gln Ile Leu Glu Gln Gln Ser Gln Glu Glu Tyr Leu Ser Asp
405 410 415
Ile Ala Tyr Lys Ser Thr Glu His Leu Ser Pro Asn Asp Asn Glu Asn
420 425 430
Asp Thr Ser Tyr Val Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met
435 440 445
Leu Lys His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val
450 455 460
Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys Ser Leu Glu
465 470 475 480
Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys Cys Leu Lys
485 490 495
Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu Glu Asp Asp
500 505 510
Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys Asp Phe Leu
515 520 525
Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met Tyr Phe
530 535 540
Gly His Ser Ser Phe Lys Pro Val Gln Trp Lys Val Ile His Ser Val
545 550 555 560
Leu Glu Glu Arg Arg Asp Asn Val Ala Val Met Ala Thr Gly Tyr Gly
565 570 575
Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys Ile Gly
580 585 590
Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val Leu Gln
595 600 605
Leu Lys Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser
610 615 620
Glu Asn Val Leu Thr Asp Ile Lys Leu Gly Lys Tyr Arg Ile Val Tyr
625 630 635 640
Val Thr Pro Glu Tyr Cys Ser Gly Asn Met Gly Leu Leu Gln Gln Leu
645 650 655
Glu Ala Asp Ile Gly Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys
660 665 670
Ile Ser Glu Trp Gly His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly
675 680 685
Ser Leu Lys Thr Ala Leu Pro Met Val Pro Ile Val Ala Leu Thr Ala
690 695 700
Thr Ala Ser Ser Ser Ile Arg Glu Asp Ile Val Arg Cys Leu Asn Leu
705 710 715 720
Arg Asn Pro Gln Ile Thr Cys Thr Gly Phe Asp Arg Pro Asn Leu Tyr
725 730 735
Leu Glu Val Arg Arg Lys Thr Gly Asn Ile Leu Gln Asp Leu Gln Pro
740 745 750
Phe Leu Val Lys Thr Ser Ser His Trp Glu Phe Glu Gly Pro Thr Ile
755 760 765
Ile Tyr Cys Pro Ser Arg Lys Met Thr Gln Gln Val Thr Gly Glu Leu
770 775 780
Arg Lys Leu Asn Leu Ser Cys Gly Thr Tyr His Ala Gly Met Ser Phe
785 790 795 800
Ser Thr Arg Lys Asp Ile His His Arg Phe Val Arg Asp Glu Ile Gln
805 810 815
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
820 825 830
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
835 840 845
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
850 855 860
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn Arg His Leu Leu
865 870 875 880
Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu Lys Met Met
885 890 895
Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg Arg Gln Ile
900 905 910
Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu Gly
915 920 925
Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser Arg Leu Asp
930 935 940
His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp Phe Gly
945 950 955 960
Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly Glu Lys
965 970 975
Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser Asn Ser Gln
980 985 990
Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly Thr Gly Lys
995 1000 1005
Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln Leu Ile
1010 1015 1020
Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe Met
1025 1030 1035
Lys Ile Cys Ala Leu Thr Lys Lys Gly Arg Asn Trp Leu His Lys
1040 1045 1050
Ala Asn Thr Glu Ser Gln Ser Leu Ile Leu Gln Ala Asn Glu Glu
1055 1060 1065
Leu Cys Pro Lys Lys Leu Leu Leu Pro Ser Ser Lys Thr Val Ser
1070 1075 1080
Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val Pro Val Glu Leu
1085 1090 1095
Ser Thr Glu Lys Lys Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys
1100 1105 1110
Pro Cys Asp Lys Ile Ser Ser Gly Ser Asn Ile Ser Lys Lys Ser
1115 1120 1125
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Gln Pro
1130 1135 1140
Val Ile Ser Ala Gln Glu Gln Glu Thr Gln Ile Val Leu Tyr Gly
1145 1150 1155
Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn Lys Met Asp Val
1160 1165 1170
Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val Asp Met Ala
1175 1180 1185
Lys Met Arg Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly
1190 1195 1200
Val Ser Glu Gly Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val
1205 1210 1215
Ile Lys His Phe Cys Gln Thr Asn Ser Val Gln Thr Asp Leu Phe
1220 1225 1230
Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser Leu Val Ala
1235 1240 1245
Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile Thr Tyr
1250 1255 1260
Ser Leu Phe Gln Glu Lys Lys Met Pro Leu Lys Ser Ile Ala Glu
1265 1270 1275
Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met His Leu Ser Gln
1280 1285 1290
Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu
1295 1300 1305
Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro
1310 1315 1320
Pro Val Asn Ser Asp Met Ser Lys Ile Ser Leu Ile Arg Met Leu
1325 1330 1335
Val Pro Glu Asn Ile Asp Thr Tyr Leu Ile His Met Ala Ile Glu
1340 1345 1350
Ile Leu Lys His Gly Pro Asp Ser Gly Leu Gln Pro Ser Cys Asp
1355 1360 1365
Val Asn Lys Arg Arg Cys Phe Pro Gly Ser Glu Glu Ile Cys Ser
1370 1375 1380
Ser Ser Lys Arg Ser Lys Glu Glu Val Gly Ile Asn Thr Glu Thr
1385 1390 1395
Ser Ser Ala Glu Arg Lys Arg Arg Leu Pro Val Trp Phe Ala Lys
1400 1405 1410
Gly Ser Asp Thr Ser Lys Lys Leu Met Asp Lys Thr Lys Arg Gly
1415 1420 1425
Gly Leu Phe Ser
1430
<![CDATA[<210> 601]]>
<![CDATA[<211> 4222]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 601]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cgggttttct ttcagatatt 180
gttttgtatt tacccatgaa gacattgttt tttggactct gcaaatagga catttcaaag 240
atgagtgaaa aaaaattgga aacaactgca cagcagcgga aatgtcctga atggatgaat 300
gtgcagaata aaagatgtgc tgtagaagaa agaaaggcat gtgttcggaa gagtgttttt 360
gaagatgacc tccccttctt agaattcact ggatccattg tgtatagtta cgatgctagt 420
gattgctctt tcctgtcaga agatattagc atgagtctat cagatgggga tgtggtggga 480
tttgacatgg agtggccacc attatacaat agagggaaac ttggcaaagt tgcactaatt 540
cagttgtgtg tttctgagag caaatgttac ttgttccacg tttcttccat gtcagttttt 600
ccccagggat taaaaatgtt gcttgaaaat aaagcagtta aaaaggcagg tgtaggaatt 660
gaaggagatc agtggaaact tctacgtgac tttgatatca aattgaagaa ttttgtggag 720
ttgacagatg ttgccaataa aaagctgaaa tgcacagaga cctggagcct taacagtctg 780
gttaaacacc tcttaggtaa acagctcctg aaagacaagt ctatccgctg tagcaattgg 840
agtaaatttc ctctcactga ggaccagaaa ctgtatgcag ccactgatgc ttatgctggt 900
tttattattt accgaaattt agagattttg gatgatactg tgcaaaggtt tgctataaat 960
aaagaggaag aaatcctact tagcgacatg aacaaacagt tgacttcaat ctctgaggaa 1020
gtgatggatc tggctaagca tcttcctcat gctttcagta aattggaaaa cccacggagg 1080
gtttctatct tactaaagga tatttcagaa aatctatatt cactgaggag gatgataatt 1140
gggtctacta acattgagac tgaactgagg cccagcaata atttaaactt attatccttt 1200
gaagattcaa ctactggggg agtacaacag aaacaaatta gagaacatga agttttaatt 1260
cacgttgaag atgaaacatg ggacccaaca cttgatcatt tagctaaaca tgatggagaa 1320
gatgtacttg gaaataaagt ggaacgaaaa gaagatggat ttgaagatgg agtagaagac 1380
aacaaattga aagagaatat ggaaagagct tgtttgatgt cgttagatat tacagaacat 1440
gaactccaaa ttttggaaca gcagtctcag gaagaatatc ttagtgatat tgcttataaa 1500
tctactgagc atttatctcc caatgataat gaaaacgata cgtcctatgt aattgagagt 1560
gatgaagatt tagaaatgga gatgcttaag catttatctc ccaatgataa tgaaaacgat 1620
acgtcctatg taattgagag tgatgaagat ttagaaatgg agatgcttaa gtctttagaa 1680
aacctcaata gtggcacggt agaaccaact cattctaaat gcttaaaaat ggaaagaaat 1740
ctgggtcttc ctactaaaga agaagaagaa gatgatgaaa atgaagctaa tgaaggggaa 1800
gaagatgatg ataaggactt tttgtggcca gcacccaatg aagagcaagt tacttgcctc 1860
aagatgtact ttggccattc cagttttaaa ccagttcagt ggaaagtgat tcattcagta 1920
ttagaagaaa gaagagataa tgttgctgtc atggcaactg gatatggaaa gagtttgtgc 1980
ttccagtatc cacctgttta tgtaggcaag attggccttg ttatctctcc ccttatttct 2040
ctgatggaag accaagtgct acagcttaaa atgtccaaca tcccagcttg cttccttgga 2100
tcagcacagt cagaaaatgt tctaacagat attaaattag gtaaataccg gattgtatac 2160
gtaactccag aatactgttc aggtaacatg ggcctgctcc agcaacttga ggctgatatt 2220
ggtatcacgc tcattgctgt ggatgaggct cactgtattt ctgagtgggg gcatgatttt 2280
agggattcat tcaggaagtt gggctcccta aagacagcac tgccaatggt tccaatcgtt 2340
gcacttactg ctactgcaag ttcttcaatc cgggaagaca ttgtacgttg cttaaatctg 2400
agaaatcctc agatcacctg tactggtttt gatcgaccaa acctgtattt agaagttagg 2460
cgaaaaacag ggaatatcct tcaggatctg cagccatttc ttgtcaaaac aagttcccac 2520
tgggaatttg aaggtccaac aatcatctac tgtccttcta gaaaaatgac acaacaagtt 2580
acaggtgaac ttaggaaact gaatctatcc tgtggaacat accatgcggg catgagtttt 2640
agcacaagga aagacattca tcataggttt gtaagagatg aaattcagtg tgtcatagct 2700
accatagctt ttggaatggg cattaataaa gctgacattc gccaagtcat tcattacggt 2760
gctcctaagg acatggaatc atattatcag gagattggta gagctggtcg tgatggactt 2820
caaagttctt gtcacgtcct ctgggctcct gcagacatta acttaaatag gcaccttctt 2880
actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc aaagatggaa 2940
aaatatcttc attctagcag atgtaggaga caaatcatct tgtctcattt tgaggacaaa 3000
caagtacaaa aagcctcctt gggaattatg ggaactgaaa aatgctgtga taattgcagg 3060
tccagattgg atcattgcta ttccatggat gactcagagg atacatcctg ggactttggt 3120
ccacaagcat ttaagctttt gtctgctgtg gacatcttag gcgaaaaatt tggaattggg 3180
cttccaattt tatttctccg aggatctaat tctcagcgtc ttgccgatca atatcgcagg 3240
cacagtttat ttggcactgg caaggatcaa acagagagtt ggtggaaggc tttttcccgt 3300
cagctgatca ctgagggatt cttggtagaa gtttctcggt ataacaaatt tatgaagatt 3360
tgcgccctta cgaaaaaggg tagaaattgg cttcataaag ctaatacaga atctcagagc 3420
ctcatccttc aagctaatga agaattgtgt ccaaagaagt tgcttctgcc tagttcgaaa 3480
actgtatctt cgggcaccaa agagcattgt tataatcaag taccagttga attaagtaca 3540
gagaagaagt ctaacttgga gaagttatat tcttataaac catgtgataa gatttcttct 3600
gggagtaaca tttctaaaaa aagtatcatg gtacagtcac cagaaaaagc ttacagttcc 3660
tcacagcctg ttatttcggc acaagagcag gagactcaga ttgtgttata tggcaaattg 3720
gtagaagcta ggcagaaaca tgccaataaa atggatgttc ccccagctat tctggcaaca 3780
aacaagatac tggtggatat ggccaaaatg agaccaacta cggttgaaaa cgtaaaaagg 3840
attgatggtg tttctgaagg caaagctgcc atgttggccc ctctgttgga agtcatcaaa 3900
catttctgcc aaacaaatag tgttcagaca gacctctttt caagtacaaa acctcaagaa 3960
gaacagaaga cgagtctggt agcaaaaaat aaaatatgca cactttcaca gtctatggcc 4020
atcacatact ctttattcca agaaaagaag atgcctttga agagcatagc tgagagcagg 4080
attctgcctc tcatgacaat tggcatgcac ttatcccaag cggtgaaagc tggctgcccc 4140
cttgatttgg agcgagcagg cctgactcca gaggttcaga agattattgc tgatgttatc 4200
cgaaaccctc ccgtcaactc ag 4222
<![CDATA[<210> 602]]>
<![CDATA[<211> 1327]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 602]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
35 40 45
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
50 55 60
Leu Ser Glu Asp Ile Ser Met Ser Leu Ser Asp Gly Asp Val Val Gly
65 70 75 80
Phe Asp Met Glu Trp Pro Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys
85 90 95
Val Ala Leu Ile Gln Leu Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe
100 105 110
His Val Ser Ser Met Ser Val Phe Pro Gln Gly Leu Lys Met Leu Leu
115 120 125
Glu Asn Lys Ala Val Lys Lys Ala Gly Val Gly Ile Glu Gly Asp Gln
130 135 140
Trp Lys Leu Leu Arg Asp Phe Asp Ile Lys Leu Lys Asn Phe Val Glu
145 150 155 160
Leu Thr Asp Val Ala Asn Lys Lys Leu Lys Cys Thr Glu Thr Trp Ser
165 170 175
Leu Asn Ser Leu Val Lys His Leu Leu Gly Lys Gln Leu Leu Lys Asp
180 185 190
Lys Ser Ile Arg Cys Ser Asn Trp Ser Lys Phe Pro Leu Thr Glu Asp
195 200 205
Gln Lys Leu Tyr Ala Ala Thr Asp Ala Tyr Ala Gly Phe Ile Ile Tyr
210 215 220
Arg Asn Leu Glu Ile Leu Asp Asp Thr Val Gln Arg Phe Ala Ile Asn
225 230 235 240
Lys Glu Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser
245 250 255
Ile Ser Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe
260 265 270
Ser Lys Leu Glu Asn Pro Arg Arg Val Ser Ile Leu Leu Lys Asp Ile
275 280 285
Ser Glu Asn Leu Tyr Ser Leu Arg Arg Met Ile Ile Gly Ser Thr Asn
290 295 300
Ile Glu Thr Glu Leu Arg Pro Ser Asn Asn Leu Asn Leu Leu Ser Phe
305 310 315 320
Glu Asp Ser Thr Thr Gly Gly Val Gln Gln Lys Gln Ile Arg Glu His
325 330 335
Glu Val Leu Ile His Val Glu Asp Glu Thr Trp Asp Pro Thr Leu Asp
340 345 350
His Leu Ala Lys His Asp Gly Glu Asp Val Leu Gly Asn Lys Val Glu
355 360 365
Arg Lys Glu Asp Gly Phe Glu Asp Gly Val Glu Asp Asn Lys Leu Lys
370 375 380
Glu Asn Met Glu Arg Ala Cys Leu Met Ser Leu Asp Ile Thr Glu His
385 390 395 400
Glu Leu Gln Ile Leu Glu Gln Gln Ser Gln Glu Glu Tyr Leu Ser Asp
405 410 415
Ile Ala Tyr Lys Ser Thr Glu His Leu Ser Pro Asn Asp Asn Glu Asn
420 425 430
Asp Thr Ser Tyr Val Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met
435 440 445
Leu Lys His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val
450 455 460
Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys Ser Leu Glu
465 470 475 480
Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys Cys Leu Lys
485 490 495
Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu Glu Asp Asp
500 505 510
Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys Asp Phe Leu
515 520 525
Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met Tyr Phe
530 535 540
Gly His Ser Ser Phe Lys Pro Val Gln Trp Lys Val Ile His Ser Val
545 550 555 560
Leu Glu Glu Arg Arg Asp Asn Val Ala Val Met Ala Thr Gly Tyr Gly
565 570 575
Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys Ile Gly
580 585 590
Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val Leu Gln
595 600 605
Leu Lys Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser
610 615 620
Glu Asn Val Leu Thr Asp Ile Lys Leu Gly Lys Tyr Arg Ile Val Tyr
625 630 635 640
Val Thr Pro Glu Tyr Cys Ser Gly Asn Met Gly Leu Leu Gln Gln Leu
645 650 655
Glu Ala Asp Ile Gly Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys
660 665 670
Ile Ser Glu Trp Gly His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly
675 680 685
Ser Leu Lys Thr Ala Leu Pro Met Val Pro Ile Val Ala Leu Thr Ala
690 695 700
Thr Ala Ser Ser Ser Ile Arg Glu Asp Ile Val Arg Cys Leu Asn Leu
705 710 715 720
Arg Asn Pro Gln Ile Thr Cys Thr Gly Phe Asp Arg Pro Asn Leu Tyr
725 730 735
Leu Glu Val Arg Arg Lys Thr Gly Asn Ile Leu Gln Asp Leu Gln Pro
740 745 750
Phe Leu Val Lys Thr Ser Ser His Trp Glu Phe Glu Gly Pro Thr Ile
755 760 765
Ile Tyr Cys Pro Ser Arg Lys Met Thr Gln Gln Val Thr Gly Glu Leu
770 775 780
Arg Lys Leu Asn Leu Ser Cys Gly Thr Tyr His Ala Gly Met Ser Phe
785 790 795 800
Ser Thr Arg Lys Asp Ile His His Arg Phe Val Arg Asp Glu Ile Gln
805 810 815
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
820 825 830
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
835 840 845
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
850 855 860
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn Arg His Leu Leu
865 870 875 880
Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu Lys Met Met
885 890 895
Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg Arg Gln Ile
900 905 910
Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu Gly
915 920 925
Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser Arg Leu Asp
930 935 940
His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp Phe Gly
945 950 955 960
Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly Glu Lys
965 970 975
Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser Asn Ser Gln
980 985 990
Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly Thr Gly Lys
995 1000 1005
Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln Leu Ile
1010 1015 1020
Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe Met
1025 1030 1035
Lys Ile Cys Ala Leu Thr Lys Lys Gly Arg Asn Trp Leu His Lys
1040 1045 1050
Ala Asn Thr Glu Ser Gln Ser Leu Ile Leu Gln Ala Asn Glu Glu
1055 1060 1065
Leu Cys Pro Lys Lys Leu Leu Leu Pro Ser Ser Lys Thr Val Ser
1070 1075 1080
Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val Pro Val Glu Leu
1085 1090 1095
Ser Thr Glu Lys Lys Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys
1100 1105 1110
Pro Cys Asp Lys Ile Ser Ser Gly Ser Asn Ile Ser Lys Lys Ser
1115 1120 1125
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Gln Pro
1130 1135 1140
Val Ile Ser Ala Gln Glu Gln Glu Thr Gln Ile Val Leu Tyr Gly
1145 1150 1155
Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn Lys Met Asp Val
1160 1165 1170
Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val Asp Met Ala
1175 1180 1185
Lys Met Arg Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly
1190 1195 1200
Val Ser Glu Gly Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val
1205 1210 1215
Ile Lys His Phe Cys Gln Thr Asn Ser Val Gln Thr Asp Leu Phe
1220 1225 1230
Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser Leu Val Ala
1235 1240 1245
Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile Thr Tyr
1250 1255 1260
Ser Leu Phe Gln Glu Lys Lys Met Pro Leu Lys Ser Ile Ala Glu
1265 1270 1275
Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met His Leu Ser Gln
1280 1285 1290
Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu
1295 1300 1305
Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro
1310 1315 1320
Pro Val Asn Ser
1325
<![CDATA[<210> 603]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 603]]>
gaaggaaact gtttaaccgg atcccattgt acccagagtg cagagccgcc tttccagcat 60
gcaggggctg ctcag 75
<![CDATA[<210> 604]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 604]]>
gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac tcgcgggtcc 60
cgctccgctc cggca 75
<![CDATA[<210> 605]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 605]]>
gaaggaaact gtttaaccgg atcccattgt acccagagtg cagagccgcc tttccagcat 60
gcaggggctg ctcaggacag agagggagga ggcgcgcggg gacggggacg cccaggagga 120
cccactcgcg ggtcccgctc cgctccggca 150
<![CDATA[<210> 606]]>
<![CDATA[<211> 102]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 606]]>
gcgagttagt aacctacgag cggctgtgaa ggaaactgtt taaccggatc ccattgtacc 60
cagagtgcag agccgccttt ccagcatgca ggggctgctc ag 102
<![CDATA[<210> 607]]>
<![CDATA[<211> 220]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 607]]>
cgtttagtca catcaagaaa tagaacagaa ttcagccatg gccccaagaa agagaggtgg 60
acgaggtatt tcattcatct tttgctgttt ccgaaataat gatcacccag aaatcacgta 120
tcggctgcga aatgatagca actttgcgct tcagaccatg gaaccagcat tgcccatgcc 180
ccctgtggag gagctggatg tcatgttcag tgaactggtg 220
<![CDATA[<210> 608]]>
<![CDATA[<211> 90]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 608]]>
gatgaactgg acctcacaga caaacacaga gaagccatgt ttgcacttcc agctgagaaa 60
aaatggcaaa tatactgtag caagaaaaag 90
<![CDATA[<210> 609]]>
<![CDATA[<211> 72]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 609]]>
gaccaggaag aaaacaaggg agctacaagt tggcctgaat tctacattga tcagctcaat 60
tccatggctg ct 72
<![CDATA[<210> 610]]>
<![CDATA[<211> 9]]>5
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 610]]>
agaaaatctc tgctggcttt agagaaggaa gaagaagaag aaagaagtaa aactatagag 60
agtttaaaga cagcactgag gacaaaacca atgag 95
<![CDATA[<210> 611]]>
<![CDATA[<211> 334]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 611]]>
gtttgtaacc agattcatcg acttggatgg cctatcatgt atcctcaact ttctaaagac 60
catggactac gagacctcag agtctcgaat acatacttct ctcattggct gtataaaggc 120
gttaatgaac aactctcaag gccgggctca cgtcctggct cattctgaga gtattaatgt 180
aattgctcag agtctgagca cagagaacat taaaacgaag gtggccgtgc tggaaatctt 240
gggcgccgtg tgcctggttc ccgggggcca caagaaggtt ctgcaggcca tgctgcacta 300
ccagaagtat gccagcgaaa ggacccgctt tcag 334
<![CDATA[<210> 612]]>
<![CDATA[<211> 111]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 612]]>
acattaatta acgacttgga taaaagcact gggcggtatc gagatgaagt gagtctcaag 60
actgccatca tgtccttcat taatgcagtg ctcagccaag gtgcaggagt g 111
<![CDATA[<210> 613]]>
<![CDATA[<211> 104]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 613]]>
gagagtttgg actttagact tcatcttcgc tatgaatttc tgatgttagg aattcaacct 60
gtaatagata aattaaggga acacgaaaat tcaacattag atag 104
<![CDATA[<210> 614]]>
<![CDATA[<211> 67]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 614]]>
gcatttagac ttttttgaaa tgctccgaaa tgaagatgaa ctagaatttg ccaaaagatt 60
tgaactg 67
<![CDATA[<210> 615]]>
<![CDATA[<211> 118]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 615]]>
gttcacatag acacaaaaag tgcaactcag atgtttgagc tgaccaggaa gaggctgaca 60
catagtgaag cttacccgca tttcatgtcc atcctgcacc actgcctcca aatgcctt 118
<![CDATA[<210> 616]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 616]]>
acaagaggag tggcaacact gttcagtact ggctactact agatagaatt atacagcaga 60
tagttatcca gaatgacaaa ggacaggacc ctgactccac acctttggaa aactttaata 120
ttaagaatgt cgtacgaat 139
<![CDATA[<210> 617]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 617]]>
gttggttaat gaaaatgaag ttaagcagtg gaaagaacaa gcggaaaaaa tgagaaaag 59
<![CDATA[<210> 618]]>
<![CDATA[<211> 188]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 618]]>
agcacaatga gctacaacag aaactggaaa agaaagaacg agaatgtgat gctaagactc 60
aagagaagga agagatgatg cagaccttaa ataaaatgaa agagaaactt gaaaaggaga 120
ctactgagca taagcaagtc aagcagcagg tggcggacct cacagcacag ctccatgagc 180
tcagcagg 188
<![CDATA[<210> 619]]>
<![CDATA[<211> 300]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 619]]>
agggccgtct gtgcttcaat cccaggtgga ccctcgcctg gagcaccagg agggcccttt 60
ccttcctctg tgcctggatc tctccttcct cccccaccac ccccacctct accaggtggg 120
atgcttcccc ctccaccgcc tcccctccct ccaggtggcc ctcctcctcc cccagggcct 180
cctcccttag gggcaatcat gccacctcct ggtgctccaa tgggcctagc actgaagaag 240
aaaagcattc ctcagcccac aaatgccctg aaatccttca actggtctaa actgcccgag 300
<![CDATA[<210> 620]]>
<![CDATA[<211> 108]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 620]]>
aacaaactgg aaggaacagt atggaccgaa attgatgata caaaagtctt caaaattcta 60
gatcttgaag acctggaaag aaccttctct gcctatcaaa gacagcag 108
<![CDATA[<210> 621]]>
<![CDATA[<211> 107]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 621]]>
aaagaagcag atgccattga tgacactctg agttccaaac ttaaagttaa agagctttcg 60
gtgattgatg gtcggagagc tcagaattgc aacatccttc tatcgag 107
<![CDATA[<210> 622]]>
<![CDATA[<211> 85]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 622]]>
gttgaaatta tccaatgacg aaatcaaacg ggcaattcta acaatggacg aacaggaaga 60
tctgcccaag gacatgttgg aacag 85
<![CDATA[<210> 623]]>
<![CDATA[<211> 107]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 623]]>
ctcttgaaat ttgttcctga aaaaagtgac attgacctat tggaggaaca taaacacgaa 60
ctggatcgga tggccaaggc tgataggttc ctttttgaga tgagccg 107
<![CDATA[<210> 6]]>24
<![CDATA[<211> 89]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 624]]>
aattaatcac tatcagcaaa ggttgcaatc gctgtacttc aaaaagaagt ttgcagagcg 60
tgtggcagaa gtgaaaccta aagtggaag 89
<![CDATA[<210> 625]]>
<![CDATA[<211> 169]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 625]]>
caattcgttc tggctcagaa gaggtgttta ggagtggtgc cctcaagcag ttgctggagg 60
tggttttggc atttggaaat tatatgaata aaggtcaaag agggaatgca tatggattca 120
agatatctag cctaaacaaa attgctgaca caaaatccag catcgacaa 169
<![CDATA[<210> 626]]>
<![CDATA[<211> 108]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 626]]>
aaacattacc cttttgcact atctcatcac tattgtggaa aataagtacc ccagtgttct 60
caatctaaat gaagaattgc gagatattcc tcaagctgcg aaagtaaa 108
<![CDATA[<210> 627]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 627]]>
catgactgag ctggacaaag aaataagtac cttgagaagt ggcttgaaag cagtagagac 60
a 61
<![CDATA[<210> 628]]>
<![CDATA[<211> 132]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 628]]>
gagctggaat atcagaagtc tcagccccca cagcccggag ataagtttgt gtctgttgtc 60
agccagttca tcacagtagc cagcttcagc ttctctgatg ttgaagacct tctagcagaa 120
gctaaagacc tg 132
<![CDATA[<210> 629]]>
<![CDATA[<211> 171]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 629]]>
tttactaaag cagtgaagca ctttggggaa gaggctggca aaatacaacc agatgagttc 60
tttggcattt ttgatcaatt tcttcaagct gtgtcagaag ccaaacaaga aaacgaaaat 120
atgagaaaga aaaaggagga agaagaacgt cgagctcgca tggaagctca g 171
<![CDATA[<210> 630]]>
<![CDATA[<211> 2754]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 630]]>
ctcaaagaac aacgtgaaag ggaacgtaaa atgagaaaag ctaaagagaa tagtgaagaa 60
agcggagagt ttgatgacct tgtttcagct ttacgctcag gagaagtgtt tgacaaagac 120
ctttctaaat tgaaacggaa tcgcaaacgt attaccaacc agatgactga cagcagcaga 180
gagagaccaa tcacaaaact taatttctaa ttttccatga atactttttt ttagaaagct 240
cattagcagc cctctaaagt gactagaacg tttcattaca ctgccttgca atccaaacag 300
tggcaatttt ttccttcatc tgtgagtgaa tgtgtgaacg tgtgtatgta aatgtatgtg 360
tgtatatatt aaaaaatgta tatagatgtc tgagtgttgt ctggagacct atacgtatgg 420
ttaaaaagat ttatgttaat gtatgtgctc caaaaccttt cgtgtatgca ttcacattga 480
gtgtggctca ttttctttcc ccgaacgcca tgactgttca gaagcacaat actatctcct 540
gaaagagata agagacattc cctagattca aaggcaaaac agaagaaaca aacaaacaaa 600
caaaaaaagc ttgcaaaata ttttatggtt tccaagcttg atatccttta aaattatttt 660
cattgatgga actggagttg ttggaaaaac atagatttaa aatgattttt gatagctgac 720
attgtgatgt tgatgtatca catcagtaat aggaccagct ttgaatttct gacattggtg 780
tggggataca gtctgtaaat gtttattgag aacatcttgc acacaatttg aattatgtag 840
aatgtcaatc aagtttttgt atatttaaaa gttggacatc aattttttcc cctgatttca 900
tcaagttatc tctgccaagt gctcttgata atttcttcag atttttggaa aaaaacacta 960
tataaatgca atccatgctt tttttaaaga acaacattgc cagagtatgc ttgttctaac 1020
aatatagata tataaacctt aaaaataata aaatatctca cccaagactt aaaggaagaa 1080
ttctctgaag ggataaagat tactaaaaaa aaaaaaaaaa aaaaaaaatt aatggggtgc 1140
ctttttgtta tagtttctat tttctgtttt gtaggacaag ctgcattttc tgtaaatata 1200
ggtctggact aaaggataca taaagaatgc acaaaatgtc aacatcagca gagatgccca 1260
gatctattta tctctaagta tatttgaagt gattgctgtt tatatgttgt cattttaaaa 1320
ttgtgtgtca gtaaagctac ctgtaaaatt tcagtccaaa aaaataaagc tctcagggag 1380
acatgaataa aatcaatgaa cattagaaaa taaaatatag atgcttacca ttaacctacc 1440
aactcttaat atccttaaat tatgtgatat ataaagagga ctgttacttt tttacttttt 1500
tttttttttt tttttttttt tggctttgct ttatttattt gtagttgggg gctaacgttt 1560
tctcttttct ttctattgat cctgttgtgg ttgggtttcc tgtggagaga gtagtttgtc 1620
ctgttgcact agaacattat ttactcacta aattgagttt ttcagtcaat taacaaatat 1680
ttattaagtt cctactatgt accaggcata gtagggctac aatggtaagc aagacagagt 1740
ccctgccccc aaagagctta tattctaatg gggatattaa gggatgaata gaataccaat 1800
gtgtgcactg tacaggaatc atactattta aaaataattt gtataaacta taatgcttag 1860
cacagatggc gagttatctg tgctatgtga aagctgtgaa atagtgctct aagagttgtc 1920
aagagctgtg gttttacatc ttttcctcat tgcaaattta gtgactttct acacactata 1980
tggaaataaa tgactagcaa ataaaacagt cataaataca aagcagaggt tgcactcccc 2040
caatcccgag ttaacccagg tctgcaataa caccatgtta aaggtgcaga tagagacttg 2100
gctcaaaaag gcttggagat gaggaagatt ggaatataat gatggttttg tctgttcctt 2160
aactaaagtg cctctatgta tattcttttc tatttgtagc aggataaatt tggtctggct 2220
cagttttgga actgtatttt gaaaatggct ttgtcttaca gtttaaggaa tagacaggtg 2280
gagggaaagt cacataaagg agcaagtttg tgtagctgtc cctcttgccc cttttaatca 2340
tcctcctttg atatggccat cctggtgggc ctcctttgcc atttccattt ttggtttctt 2400
tccctgaaaa ctgtgtgcag gtaattccat gtgccattgt tgaaaagaaa aaaaaaaaac 2460
aaaaaaaaaa cctacttttt agattggtgc tggtgtaagt agccactttt ctctcttggg 2520
tgtgtatttt aaactttttt tgttttttta aattaatgcc aaaaagaaaa tgcataattt 2580
gtaaacttaa ttatatgtct tatatcttat tagcttagta gttggaacca cttagtcttt 2640
aggtgcaaga ctgttgttag atagtactga gaaaaaaaaa gtatgtgtta tgagactgta 2700
catgtttttt taaaaatagc aatatgcaat aaagagatga attcattggg tgta 2754
<![CDATA[<210> 631]]>
<![CDATA[<211> 5890]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> DAAM1之全mRNA多核苷酸序列]]>
<![CDATA[<400> 631]]>
gcgagttagt aacctacgag cggctgtgaa ggaaactgtt taaccggatc ccattgtacc 60
cagagtgcag agccgccttt ccagcatgca ggggctgctc agcgtttagt cacatcaaga 120
aatagaacag aattcagcca tggccccaag aaagagaggt ggacgaggta tttcattcat 180
cttttgctgt ttccgaaata atgatcaccc agaaatcacg tatcggctgc gaaatgatag 240
caactttgcg cttcagacca tggaaccagc attgcccatg ccccctgtgg aggagctgga 300
tgtcatgttc agtgaactgg tggatgaact ggacctcaca gacaaacaca gagaagccat 360
gtttgcactt ccagctgaga aaaaatggca aatatactgt agcaagaaaa aggaccagga 420
agaaaacaag ggagctacaa gttggcctga attctacatt gatcagctca attccatggc 480
tgctagaaaa tctctgctgg ctttagagaa ggaagaagaa gaagaaagaa gtaaaactat 540
agagagttta aagacagcac tgaggacaaa accaatgagg tttgtaacca gattcatcga 600
cttggatggc ctatcatgta tcctcaactt tctaaagacc atggactacg agacctcaga 660
gtctcgaata catacttctc tcattggctg tataaaggcg ttaatgaaca actctcaagg 720
ccgggctcac gtcctggctc attctgagag tattaatgta attgctcaga gtctgagcac 780
agagaacatt aaaacgaagg tggccgtgct ggaaatcttg ggcgccgtgt gcctggttcc 840
cgggggccac aagaaggttc tgcaggccat gctgcactac cagaagtatg ccagcgaaag 900
gacccgcttt cagacattaa ttaacgactt ggataaaagc actgggcggt atcgagatga 960
agtgagtctc aagactgcca tcatgtcctt cattaatgca gtgctcagcc aaggtgcagg 1020
agtggagagt ttggacttta gacttcatct tcgctatgaa tttctgatgt taggaattca 1080
acctgtaata gataaattaa gggaacacga aaattcaaca ttagataggc atttagactt 1140
ttttgaaatg ctccgaaatg aagatgaact agaatttgcc aaaagatttg aactggttca 1200
catagacaca aaaagtgcaa ctcagatgtt tgagctgacc aggaagaggc tgacacatag 1260
tgaagcttac ccgcatttca tgtccatcct gcaccactgc ctccaaatgc cttacaagag 1320
gagtggcaac actgttcagt actggctact actagataga attatacagc agatagttat 1380
ccagaatgac aaaggacagg accctgactc cacacctttg gaaaacttta atattaagaa 1440
tgtcgtacga atgttggtta atgaaaatga agttaagcag tggaaagaac aagcggaaaa 1500
aatgagaaaa gagcacaatg agctacaaca gaaactggaa aagaaagaac gagaatgtga 1560
tgctaagact caagagaagg aagagatgat gcagacctta aataaaatga aagagaaact 1620
tgaaaaggag actactgagc ataagcaagt caagcagcag gtggcggacc tcacagcaca 1680
gctccatgag ctcagcagga gggccgtctg tgcttcaatc ccaggtggac cctcgcctgg 1740
agcaccagga gggccctttc cttcctctgt gcctggatct ctccttcctc ccccaccacc 1800
cccacctcta ccaggtggga tgcttccccc tccaccgcct cccctccctc caggtggccc 1860
tcctcctccc ccagggcctc ctcccttagg ggcaatcatg ccacctcctg gtgctccaat 1920
gggcctagca ctgaagaaga aaagcattcc tcagcccaca aatgccctga aatccttcaa 1980
ctggtctaaa ctgcccgaga acaaactgga aggaacagta tggaccgaaa ttgatgatac 2040
aaaagtcttc aaaattctag atcttgaaga cctggaaaga accttctctg cctatcaaag 2100
acagcagaaa gaagcagatg ccattgatga cactctgagt tccaaactta aagttaaaga 2160
gctttcggtg attgatggtc ggagagctca gaattgcaac atccttctat cgaggttgaa 2220
attatccaat gacgaaatca aacgggcaat tctaacaatg gacgaacagg aagatctgcc 2280
caaggacatg ttggaacagc tcttgaaatt tgttcctgaa aaaagtgaca ttgacctatt 2340
ggaggaacat aaacacgaac tggatcggat ggccaaggct gataggttcc tttttgagat 2400
gagccgaatt aatcactatc agcaaaggtt gcaatcgctg tacttcaaaa agaagtttgc 2460
agagcgtgtg gcagaagtga aacctaaagt ggaagcaatt cgttctggct cagaagaggt 2520
gtttaggagt ggtgccctca agcagttgct ggaggtggtt ttggcatttg gaaattatat 2580
gaataaaggt caaagaggga atgcatatgg attcaagata tctagcctaa acaaaattgc 2640
tgacacaaaa tccagcatcg acaaaaacat tacccttttg cactatctca tcactattgt 2700
ggaaaataag taccccagtg ttctcaatct aaatgaagaa ttgcgagata ttcctcaagc 2760
tgcgaaagta aacatgactg agctggacaa agaaataagt accttgagaa gtggcttgaa 2820
agcagtagag acagagctgg aatatcagaa gtctcagccc ccacagcccg gagataagtt 2880
tgtgtctgtt gtcagccagt tcatcacagt agccagcttc agcttctctg atgttgaaga 2940
ccttctagca gaagctaaag acctgtttac taaagcagtg aagcactttg gggaagaggc 3000
tggcaaaata caaccagatg agttctttgg catttttgat caatttcttc aagctgtgtc 3060
agaagccaaa caagaaaacg aaaatatgag aaagaaaaag gaggaagaag aacgtcgagc 3120
tcgcatggaa gctcagctca aagaacaacg tgaaagggaa cgtaaaatga gaaaagctaa 3180
agagaatagt gaagaaagcg gagagtttga tgaccttgtt tcagctttac gctcaggaga 3240
agtgtttgac aaagaccttt ctaaattgaa acggaatcgc aaacgtatta ccaaccagat 3300
gactgacagc agcagagaga gaccaatcac aaaacttaat ttctaatttt ccatgaatac 3360
ttttttttag aaagctcatt agcagccctc taaagtgact agaacgtttc attacactgc 3420
cttgcaatcc aaacagtggc aattttttcc ttcatctgtg agtgaatgtg tgaacgtgtg 3480
tatgtaaatg tatgtgtgta tatattaaaa aatgtatata gatgtctgag tgttgtctgg 3540
agacctatac gtatggttaa aaagatttat gttaatgtat gtgctccaaa acctttcgtg 3600
tatgcattca cattgagtgt ggctcatttt ctttccccga acgccatgac tgttcagaag 3660
cacaatacta tctcctgaaa gagataagag acattcccta gattcaaagg caaaacagaa 3720
gaaacaaaca aacaaacaaa aaaagcttgc aaaatatttt atggtttcca agcttgatat 3780
cctttaaaat tattttcatt gatggaactg gagttgttgg aaaaacatag atttaaaatg 3840
atttttgata gctgacattg tgatgttgat gtatcacatc agtaatagga ccagctttga 3900
atttctgaca ttggtgtggg gatacagtct gtaaatgttt attgagaaca tcttgcacac 3960
aatttgaatt atgtagaatg tcaatcaagt ttttgtatat ttaaaagttg gacatcaatt 4020
ttttcccctg atttcatcaa gttatctctg ccaagtgctc ttgataattt cttcagattt 4080
ttggaaaaaa acactatata aatgcaatcc atgctttttt taaagaacaa cattgccaga 4140
gtatgcttgt tctaacaata tagatatata aaccttaaaa ataataaaat atctcaccca 4200
agacttaaag gaagaattct ctgaagggat aaagattact aaaaaaaaaa aaaaaaaaaa 4260
aaaattaatg gggtgccttt ttgttatagt ttctattttc tgttttgtag gacaagctgc 4320
attttctgta aatataggtc tggactaaag gatacataaa gaatgcacaa aatgtcaaca 4380
tcagcagaga tgcccagatc tatttatctc taagtatatt tgaagtgatt gctgtttata 4440
tgttgtcatt ttaaaattgt gtgtcagtaa agctacctgt aaaatttcag tccaaaaaaa 4500
taaagctctc agggagacat gaataaaatc aatgaacatt agaaaataaa atatagatgc 4560
ttaccattaa cctaccaact cttaatatcc ttaaattatg tgatatataa agaggactgt 4620
tactttttta cttttttttt tttttttttt tttttttggc tttgctttat ttatttgtag 4680
ttgggggcta acgttttctc ttttctttct attgatcctg ttgtggttgg gtttcctgtg 4740
gagagagtag tttgtcctgt tgcactagaa cattatttac tcactaaatt gagtttttca 4800
gtcaattaac aaatatttat taagttccta ctatgtacca ggcatagtag ggctacaatg 4860
gtaagcaaga cagagtccct gcccccaaag agcttatatt ctaatgggga tattaaggga 4920
tgaatagaat accaatgtgt gcactgtaca ggaatcatac tatttaaaaa taatttgtat 4980
aaactataat gcttagcaca gatggcgagt tatctgtgct atgtgaaagc tgtgaaatag 5040
tgctctaaga gttgtcaaga gctgtggttt tacatctttt cctcattgca aatttagtga 5100
ctttctacac actatatgga aataaatgac tagcaaataa aacagtcata aatacaaagc 5160
agaggttgca ctcccccaat cccgagttaa cccaggtctg caataacacc atgttaaagg 5220
tgcagataga gacttggctc aaaaaggctt ggagatgagg aagattggaa tataatgatg 5280
gttttgtctg ttccttaact aaagtgcctc tatgtatatt cttttctatt tgtagcagga 5340
taaatttggt ctggctcagt tttggaactg tattttgaaa atggctttgt cttacagttt 5400
aaggaataga caggtggagg gaaagtcaca taaaggagca agtttgtgta gctgtccctc 5460
ttgccccttt taatcatcct cctttgatat ggccatcctg gtgggcctcc tttgccattt 5520
ccatttttgg tttctttccc tgaaaactgt gtgcaggtaa ttccatgtgc cattgttgaa 5580
aagaaaaaaa aaaaacaaaa aaaaaaccta ctttttagat tggtgctggt gtaagtagcc 5640
acttttctct cttgggtgtg tattttaaac tttttttgtt tttttaaatt aatgccaaaa 5700
agaaaatgca taatttgtaa acttaattat atgtcttata tcttattagc ttagtagttg 5760
gaaccactta gtctttaggt gcaagactgt tgttagatag tactgagaaa aaaaaagtat 5820
gtgttatgag actgtacatg tttttttaaa aatagcaata tgcaataaag agatgaattc 5880
attgggtgta 5890
<![CDATA[<210> 632]]>
<![CDATA[<211> 1068]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 632]]>
Met Ala Pro Arg Lys Arg Gly Gly Arg Gly Ile Ser Phe Ile Phe Cys
1 5 10 15
Cys Phe Arg Asn Asn Asp His Pro Glu Ile Thr Tyr Arg Leu Arg Asn
20 25 30
Asp Ser Asn Phe Ala Leu Gln Thr Met Glu Pro Ala Leu Pro Met Pro
35 40 45
Pro Val Glu Glu Leu Asp Val Met Phe Ser Glu Leu Val Asp Glu Leu
50 55 60
Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala Glu
65 70 75 80
Lys Lys Trp Gln Ile Tyr Cys Ser Lys Lys Lys Asp Gln Glu Glu Asn
85 90 95
Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr Ile Asp Gln Leu Asn Ser
100 105 110
Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu Glu
115 120 125
Glu Arg Ser Lys Thr Ile Glu Ser Leu Lys Thr Ala Leu Arg Thr Lys
130 135 140
Pro Met Arg Phe Val Thr Arg Phe Ile Asp Leu Asp Gly Leu Ser Cys
145 150 155 160
Ile Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser Arg
165 170 175
Ile His Thr Ser Leu Ile Gly Cys Ile Lys Ala Leu Met Asn Asn Ser
180 185 190
Gln Gly Arg Ala His Val Leu Ala His Ser Glu Ser Ile Asn Val Ile
195 200 205
Ala Gln Ser Leu Ser Thr Glu Asn Ile Lys Thr Lys Val Ala Val Leu
210 215 220
Glu Ile Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys Val
225 230 235 240
Leu Gln Ala Met Leu His Tyr Gln Lys Tyr Ala Ser Glu Arg Thr Arg
245 250 255
Phe Gln Thr Leu Ile Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr Arg
260 265 270
Asp Glu Val Ser Leu Lys Thr Ala Ile Met Ser Phe Ile Asn Ala Val
275 280 285
Leu Ser Gln Gly Ala Gly Val Glu Ser Leu Asp Phe Arg Leu His Leu
290 295 300
Arg Tyr Glu Phe Leu Met Leu Gly Ile Gln Pro Val Ile Asp Lys Leu
305 310 315 320
Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe Glu
325 330 335
Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu Leu
340 345 350
Val His Ile Asp Thr Lys Ser Ala Thr Gln Met Phe Glu Leu Thr Arg
355 360 365
Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser Ile Leu
370 375 380
His His Cys Leu Gln Met Pro Tyr Lys Arg Ser Gly Asn Thr Val Gln
385 390 395 400
Tyr Trp Leu Leu Leu Asp Arg Ile Ile Gln Gln Ile Val Ile Gln Asn
405 410 415
Asp Lys Gly Gln Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn Ile
420 425 430
Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gln Trp
435 440 445
Lys Glu Gln Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gln Gln
450 455 460
Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gln Glu Lys
465 470 475 480
Glu Glu Met Met Gln Thr Leu Asn Lys Met Lys Glu Lys Leu Glu Lys
485 490 495
Glu Thr Thr Glu His Lys Gln Val Lys Gln Gln Val Ala Asp Leu Thr
500 505 510
Ala Gln Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser Ile Pro
515 520 525
Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser Val
530 535 540
Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly Gly
545 550 555 560
Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro Pro
565 570 575
Pro Pro Gly Pro Pro Pro Leu Gly Ala Ile Met Pro Pro Pro Gly Ala
580 585 590
Pro Met Gly Leu Ala Leu Lys Lys Lys Ser Ile Pro Gln Pro Thr Asn
595 600 605
Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu Glu
610 615 620
Gly Thr Val Trp Thr Glu Ile Asp Asp Thr Lys Val Phe Lys Ile Leu
625 630 635 640
Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gln Arg Gln Gln
645 650 655
Lys Glu Ala Asp Ala Ile Asp Asp Thr Leu Ser Ser Lys Leu Lys Val
660 665 670
Lys Glu Leu Ser Val Ile Asp Gly Arg Arg Ala Gln Asn Cys Asn Ile
675 680 685
Leu Leu Ser Arg Leu Lys Leu Ser Asn Asp Glu Ile Lys Arg Ala Ile
690 695 700
Leu Thr Met Asp Glu Gln Glu Asp Leu Pro Lys Asp Met Leu Glu Gln
705 710 715 720
Leu Leu Lys Phe Val Pro Glu Lys Ser Asp Ile Asp Leu Leu Glu Glu
725 730 735
His Lys His Glu Leu Asp Arg Met Ala Lys Ala Asp Arg Phe Leu Phe
740 745 750
Glu Met Ser Arg Ile Asn His Tyr Gln Gln Arg Leu Gln Ser Leu Tyr
755 760 765
Phe Lys Lys Lys Phe Ala Glu Arg Val Ala Glu Val Lys Pro Lys Val
770 775 780
Glu Ala Ile Arg Ser Gly Ser Glu Glu Val Phe Arg Ser Gly Ala Leu
785 790 795 800
Lys Gln Leu Leu Glu Val Val Leu Ala Phe Gly Asn Tyr Met Asn Lys
805 810 815
Gly Gln Arg Gly Asn Ala Tyr Gly Phe Lys Ile Ser Ser Leu Asn Lys
820 825 830
Ile Ala Asp Thr Lys Ser Ser Ile Asp Lys Asn Ile Thr Leu Leu His
835 840 845
Tyr Leu Ile Thr Ile Val Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu
850 855 860
Asn Glu Glu Leu Arg Asp Ile Pro Gln Ala Ala Lys Val Asn Met Thr
865 870 875 880
Glu Leu Asp Lys Glu Ile Ser Thr Leu Arg Ser Gly Leu Lys Ala Val
885 890 895
Glu Thr Glu Leu Glu Tyr Gln Lys Ser Gln Pro Pro Gln Pro Gly Asp
900 905 910
Lys Phe Val Ser Val Val Ser Gln Phe Ile Thr Val Ala Ser Phe Ser
915 920 925
Phe Ser Asp Val Glu Asp Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr
930 935 940
Lys Ala Val Lys His Phe Gly Glu Glu Ala Gly Lys Ile Gln Pro Asp
945 950 955 960
Glu Phe Phe Gly Ile Phe Asp Gln Phe Leu Gln Ala Val Ser Glu Ala
965 970 975
Lys Gln Glu Asn Glu Asn Met Arg Lys Lys Lys Glu Glu Glu Glu Arg
980 985 990
Arg Ala Arg Met Glu Ala Gln Leu Lys Glu Gln Arg Glu Arg Glu Arg
995 1000 1005
Lys Met Arg Lys Ala Lys Glu Asn Ser Glu Glu Ser Gly Glu Phe
1010 1015 1020
Asp Asp Leu Val Ser Ala Leu Arg Ser Gly Glu Val Phe Asp Lys
1025 1030 1035
Asp Leu Ser Lys Leu Lys Arg Asn Arg Lys Arg Ile Thr Asn Gln
1040 1045 1050
Met Thr Asp Ser Ser Arg Glu Arg Pro Ile Thr Lys Leu Asn Phe
1055 1060 1065
<![CDATA[<210> 633]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 633]]>
aaggtctctt cctgcctgag gatgaaaacc tgagggaaaa aggggactgg agccagttca 60
cgctgtggca gcaag 75
<![CDATA[<210> 634]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 634]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[<210> 635]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 635]]>
aaggtctctt cctgcctgag gatgaaaacc tgagggaaaa aggggactgg agccagttca 60
cgctgtggca gcaagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[<210> 636]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人]]>類
<![CDATA[<400> 636]]>
Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu Lys Gly Asp Trp
1 5 10 15
Ser Gln Phe Thr Leu Trp Gln Gln Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[<210> 637]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 637]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaag 139
<![CDATA[<210> 638]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 638]]>
agacaaagca tggaggacac aagaatggga ggaaaggcgg actctcagga acttcattct 60
tcacgtggtt tatggtgatt gcattgctgg gcgtctggac atctgtagct gtcgtttggt 120
ttgatcttgt tgactatgag gaagttctag 150
<![CDATA[<210> 639]]>
<![CDATA[<211> 69]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 639]]>
gaaaactagg aatctatgat gctgatggtg atggagattt tgatgtggat gatgccaaag 60
ttttattag 69
<![CDATA[<210> 640]]>
<![CDATA[<211> 93]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 640]]>
gacttaaaga gagatctact tcagagccag cagtcccgcc agaagaggct gagccacaca 60
ctgagcccga ggagcaggtt cctgtggagg cag 93
<![CDATA[<210> 641]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 641]]>
aaccccagaa tatcgaagat gaagcaaaag aacaaattca gtcccttctc catgaaatgg 60
tacacgcaga acatg 75
<![CDATA[<210> 642]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 642]]>
ttgagggaga agacttgcaa caagaagatg gacccacagg agaaccacaa caagaggatg 60
atgagtttct tatggcgact gatgtagatg atagatttga gaccctggaa cctgaagtat 120
ctcatgaag 129
<![CDATA[<210> 643]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 643]]>
aaaccgagca tagttaccac gtggaagaga cag 33
<![CDATA[<210> 644]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 644]]>
tttcacaaga ctgtaatcag gatatggaag agatgatgtc tgagcaggaa aatccag 57
<![CDATA[<210> 645]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 645]]>
attccagtga accagtagta gaagatgaaa gattgcacca tgatacag 48
<![CDATA[<210> 646]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 646]]>
atgatgtaac ataccaagtc tatgaggaac aag 33
<![CDATA[<210> 647]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 647]]>
cagtatatga acctctagaa aatgaaggga tagaaatcac ag 42
<![CDATA[<210> 648]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 648]]>
aagtaactgc tccccctgag gataatcctg tagaagattc acaggtaatt gtagaag 57
<![CDATA[<210> 649]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 649]]>
aagtaagcat ttttcctgtg gaagaacagc aggaagtacc accag 45
<![CDATA[<210> 650]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 650]]>
aaacaaatag aaaaacagat gatccagaac aaaaagcaaa ag 42
<![CDATA[<210> 651]]>
<![CDATA[<211> 86]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 651]]>
ttaagaaaaa gaagcctaaa cttttaaata aatttgataa gactattaaa gctgaacttg 60
atgctgcaga aaaactccgt aaaagg 86
<![CDATA[<210> 652]]>
<![CDATA[<211> 87]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 652]]>
ggaaaaattg aggaagcagt gaatgcattt aaagaactag tacgcaaata ccctcagagt 60
ccacgagcaa gatatgggaa ggcgcag 87
<![CDATA[<210> 653]]>
<![CDATA[<211> 151]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 653]]>
tgtgaggatg atttggctga gaagaggaga agtaatgagg tgctacgtgg agccatcgag 60
acctaccaag aggtggccag cctacctgat gtccctgcag acctgctgaa gctgagtttg 120
aagcgtcgct cagacaggca acaatttcta g 151
<![CDATA[<210> 654]]>
<![CDATA[<211> 137]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 654]]>
gtcatatgag aggttccctg cttaccctgc agagattagt tcaactattt cccaatgata 60
cttccttaaa aaatgacctt ggcgtgggat acctcttgat aggagataat gacaatgcaa 120
agaaagttta tgaagag 137
<![CDATA[<210> 655]]>
<![CDATA[<211> 99]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 655]]>
gtgctgagtg tgacacctaa tgatggcttt gctaaagtcc attatggctt catcctgaag 60
gcacagaaca aaattgctga gagcatccca tatttaaag 99
<![CDATA[<210> 656]]>
<![CDATA[<211> 90]]>
<![CDATA[<212> ]]> DNA
<![CDATA[<213> 人類]]>
<![CDATA[<400> 656]]>
gaaggaatag aatccggaga tcctggcact gatgatggga gattttattt ccacctgggg 60
gatgccatgc agagggttgg gaacaaagag 90
<![CDATA[<210> 657]]>
<![CDATA[<211> 138]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 657]]>
gcatataagt ggtatgagct tgggcacaag agaggacact ttgcatctgt ctggcaacgc 60
tcactctaca atgtgaatgg actgaaagca cagccttggt ggaccccaaa agaaacgggc 120
tacacagagt tagtaaag 138
<![CDATA[<210> 658]]>
<![CDATA[<211> 136]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 658]]>
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 60
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 120
acgctgtggc agcaag 136
<![CDATA[<210> 659]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 659]]>
gaagaagaaa tgaaaatgcc tgcaaaggag ctcctaaaac ctgtacctta ctagaaaagt 60
tccccgagac aacaggatgc agaagaggac ag 92
<![CDATA[<210> 660]]>
<![CDATA[<211> 134]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 660]]>
atcaaatatt ccatcatgca ccccgggact cacgtgtggc cgcacacagg gcccacaaac 60
tgcaggctcc gaatgcacct gggcttggtg attcccaagg aaggctgcaa gattcgatgt 120
gccaacgaga ccaa 134
<![CDATA[<210> 661]]>
<![CDATA[<211> 2954]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 661]]>
gacctgggag gaaggcaagg tgctcatctt tgatgactcc tttgagcacg aggtatggca 60
ggatgcctca tctttccggc tgatattcat cgtggatgtg tggcatccgg aactgacacc 120
acagcagaga cgcagccttc cagcaattta gcatgaattc atgcaagctt gggaaactct 180
ggagagaggc tgcctttctg gttccatctc cttgggtgtg aggatagaat ttcgaacacc 240
aagagtcaat tcccttgact tgcagcccga gtaattcaaa gcctcctcct agggtcagaa 300
gacactaaag ggaatatttg cctcgctgca attcatttag gaaacaccct gctgtgtgtc 360
atctcatgac agcactggtc ttctgccagt atttaaggtg aacatttgat agcttctacc 420
ttaccagcca aagatatttt ttccacatag aataggtcta attcaatgta taatgagaac 480
atatgtagaa actgtgaatg gattgcttta gtttgtaatt tttctatgca gttatatttt 540
tctagtgtag ctagactatt ttgtcatcat gtaccactac atttttgttt attttaatga 600
caagctgtat aaatgcttta cttctagcta tttaatggta gcattactgg ggaactcaga 660
cttccctctt ttaattcttc ttagtaaaag atactcatga aaaaagcagt tttattttcc 720
taacaaaaaa gaaagagctc attatgtcag tgtctatgaa ctgtacccat cccaactctc 780
aaatcgtttg gtttttttta tcttgattga gatcctcttc tcactatgct agtggtggag 840
atattgacaa aatcctattt ctttcaaaga ggaacttttc acaccgaaaa aagagcatgg 900
aattatttta tattgttata aaaatcccag atgcaaattt ttttaatgcc aattattaga 960
gcttctgggg aaaaagtata gttcacggaa ataaaactat gttctttcag ggttgggtgg 1020
ataggtggct gctagggtgt ctggctcctg gcggctttgc catccatgag gcaagggctg 1080
ggaacacagt gtctttgcct atggtagatc catgtgaatg tcaggaagcc agctcttcag 1140
tcttggagat gatttctgct acaattctgt agaaagatta aggatggcag agtaaaaggt 1200
taccaagaat gccaggatgt ttttcttggg cgtaggaggt ccagattact ttcctttttg 1260
atgaaagagt ttggaagact gtcccatctc tctggcttga gaaatctctg ccattttaaa 1320
catcactgtg aaatagcaat tattatcatc tgtatttagt tttaacatta cccacaacat 1380
agaaataata ggtaaaaatc gtcttgccta ctcattccaa agatgatcaa gtcattaatc 1440
tagcaaagta ttcatgtatc agattttgta tattttgaat caaagctaac taggaatgtt 1500
agatataaga atgtaatgat attcatgcac tgaattctaa gccaatatga acaaaaatgc 1560
tgcatgaatg gcacatatag gtcaccaaag ttcattcaca ggtagaaaaa acttgtgctt 1620
tcttttccat ctaaaaacaa aaggagactt tctttatctc atttaaagaa cagctctttg 1680
aaattgaaat tgaccctttt tgcttgacct taaggagatt agcttccagt agatgagttt 1740
gcaaaatact tttcctgttc ttttgttttg ctggtattga aaacatccca ctaaatcaga 1800
tgaagaggca tgggaggaaa aatatccaaa ttaattacta aaatcgagaa gagaaggcaa 1860
actcttgaaa agtaaaaagg tgtttgtgac cttcagtatt tattgaacag aggaaataac 1920
tgacaagggc aatacaattc aatgttcatg tagtaacatt catgtcactt gttgaatttg 1980
gttctcatat gtatattgca tacacataaa ttcaaactat aagtcgtcat ttttgagcca 2040
tcatcttaca ttcatgtaat gaaattatgg aagagagtaa aaactagctc ttaacttagt 2100
aaatataata tggtatttaa aatcaggtca ctacagtaag gttctaagta ttgccaattg 2160
aaaagctaga aatggtatta ctgttgcaaa gtgttgtcaa taattgactc caatagcatt 2220
gtaaatactt gtatcccaca actattttaa acccaagcaa taaaatggat tttctaattc 2280
acttcaattc tttatttctc ttacctatct atgtcttggt acataagaaa agtgtttcca 2340
tcactgcatt ggtagaatta tttcagtgtt attatttttg tgatttcgta tgtctacaca 2400
aaaatgaatt agtttaattt atattgtgat acagttgttt gagaaatatt tttaattctg 2460
tttcaacttt atttctccaa gtgtataata taaaaattac ttctgttatt gttctctacc 2520
aataggggaa aaaattaaaa cattagatcc attgagaaag agatgatgta ataaataatt 2580
aagattagta ataatattat caggggtgat tatgaccagt tgaataatct ctttcccttg 2640
aattatttag ctaacaaatt aactctccca aatatttaaa ataatgtaaa atcatatttt 2700
actgcccatt attaactaaa atatttttgt ttgactttga gcaccaactg gtaatactaa 2760
taaataccca tgtcatgcag atggctgggc gaataagaga tgtctaaaaa tatgcactgg 2820
tcttggaaaa catggcacaa gtaaggatat catatatgat gtctgtttat tttatgtctg 2880
atttcttttg aatgagtagt tggggactcc atttctaagg agactaggta aataaaatga 2940
cctttgacat ttca 2954
<![CDATA[<210> 662]]>
<![CDATA[<211> 5116]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> ASPH之全mRNA多核苷酸序列]]>
<![CDATA[<400> 662]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaaga gacaaagcat ggaggacaca agaatgggag gaaaggcgga 180
ctctcaggaa cttcattctt cacgtggttt atggtgattg cattgctggg cgtctggaca 240
tctgtagctg tcgtttggtt tgatcttgtt gactatgagg aagttctagg aaaactagga 300
atctatgatg ctgatggtga tggagatttt gatgtggatg atgccaaagt tttattagga 360
cttaaagaga gatctacttc agagccagca gtcccgccag aagaggctga gccacacact 420
gagcccgagg agcaggttcc tgtggaggca gaaccccaga atatcgaaga tgaagcaaaa 480
gaacaaattc agtcccttct ccatgaaatg gtacacgcag aacatgttga gggagaagac 540
ttgcaacaag aagatggacc cacaggagaa ccacaacaag aggatgatga gtttcttatg 600
gcgactgatg tagatgatag atttgagacc ctggaacctg aagtatctca tgaagaaacc 660
gagcatagtt accacgtgga agagacagtt tcacaagact gtaatcagga tatggaagag 720
atgatgtctg agcaggaaaa tccagattcc agtgaaccag tagtagaaga tgaaagattg 780
caccatgata cagatgatgt aacataccaa gtctatgagg aacaagcagt atatgaacct 840
ctagaaaatg aagggataga aatcacagaa gtaactgctc cccctgagga taatcctgta 900
gaagattcac aggtaattgt agaagaagta agcatttttc ctgtggaaga acagcaggaa 960
gtaccaccag aaacaaatag aaaaacagat gatccagaac aaaaagcaaa agttaagaaa 1020
aagaagccta aacttttaaa taaatttgat aagactatta aagctgaact tgatgctgca 1080
gaaaaactcc gtaaaagggg aaaaattgag gaagcagtga atgcatttaa agaactagta 1140
cgcaaatacc ctcagagtcc acgagcaaga tatgggaagg cgcagtgtga ggatgatttg 1200
gctgagaaga ggagaagtaa tgaggtgcta cgtggagcca tcgagaccta ccaagaggtg 1260
gccagcctac ctgatgtccc tgcagacctg ctgaagctga gtttgaagcg tcgctcagac 1320
aggcaacaat ttctaggtca tatgagaggt tccctgctta ccctgcagag attagttcaa 1380
ctatttccca atgatacttc cttaaaaaat gaccttggcg tgggatacct cttgatagga 1440
gataatgaca atgcaaagaa agtttatgaa gaggtgctga gtgtgacacc taatgatggc 1500
tttgctaaag tccattatgg cttcatcctg aaggcacaga acaaaattgc tgagagcatc 1560
ccatatttaa aggaaggaat agaatccgga gatcctggca ctgatgatgg gagattttat 1620
ttccacctgg gggatgccat gcagagggtt gggaacaaag aggcatataa gtggtatgag 1680
cttgggcaca agagaggaca ctttgcatct gtctggcaac gctcactcta caatgtgaat 1740
ggactgaaag cacagccttg gtggacccca aaagaaacgg gctacacaga gttagtaaag 1800
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 1860
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 1920
acgctgtggc agcaaggaag aagaaatgaa aatgcctgca aaggagctcc taaaacctgt 1980
accttactag aaaagttccc cgagacaaca ggatgcagaa gaggacagat caaatattcc 2040
atcatgcacc ccgggactca cgtgtggccg cacacagggc ccacaaactg caggctccga 2100
atgcacctgg gcttggtgat tcccaaggaa ggctgcaaga ttcgatgtgc caacgagacc 2160
aagacctggg aggaaggcaa ggtgctcatc tttgatgact cctttgagca cgaggtatgg 2220
caggatgcct catctttccg gctgatattc atcgtggatg tgtggcatcc ggaactgaca 2280
ccacagcaga gacgcagcct tccagcaatt tagcatgaat tcatgcaagc ttgggaaact 2340
ctggagagag gctgcctttc tggttccatc tccttgggtg tgaggataga atttcgaaca 2400
ccaagagtca attcccttga cttgcagccc gagtaattca aagcctcctc ctagggtcag 2460
aagacactaa agggaatatt tgcctcgctg caattcattt aggaaacacc ctgctgtgtg 2520
tcatctcatg acagcactgg tcttctgcca gtatttaagg tgaacatttg atagcttcta 2580
ccttaccagc caaagatatt ttttccacat agaataggtc taattcaatg tataatgaga 2640
acatatgtag aaactgtgaa tggattgctt tagtttgtaa tttttctatg cagttatatt 2700
tttctagtgt agctagacta ttttgtcatc atgtaccact acatttttgt ttattttaat 2760
gacaagctgt ataaatgctt tacttctagc tatttaatgg tagcattact ggggaactca 2820
gacttccctc ttttaattct tcttagtaaa agatactcat gaaaaaagca gttttatttt 2880
cctaacaaaa aagaaagagc tcattatgtc agtgtctatg aactgtaccc atcccaactc 2940
tcaaatcgtt tggttttttt tatcttgatt gagatcctct tctcactatg ctagtggtgg 3000
agatattgac aaaatcctat ttctttcaaa gaggaacttt tcacaccgaa aaaagagcat 3060
ggaattattt tatattgtta taaaaatccc agatgcaaat ttttttaatg ccaattatta 3120
gagcttctgg ggaaaaagta tagttcacgg aaataaaact atgttctttc agggttgggt 3180
ggataggtgg ctgctagggt gtctggctcc tggcggcttt gccatccatg aggcaagggc 3240
tgggaacaca gtgtctttgc ctatggtaga tccatgtgaa tgtcaggaag ccagctcttc 3300
agtcttggag atgatttctg ctacaattct gtagaaagat taaggatggc agagtaaaag 3360
gttaccaaga atgccaggat gtttttcttg ggcgtaggag gtccagatta ctttcctttt 3420
tgatgaaaga gtttggaaga ctgtcccatc tctctggctt gagaaatctc tgccatttta 3480
aacatcactg tgaaatagca attattatca tctgtattta gttttaacat tacccacaac 3540
atagaaataa taggtaaaaa tcgtcttgcc tactcattcc aaagatgatc aagtcattaa 3600
tctagcaaag tattcatgta tcagattttg tatattttga atcaaagcta actaggaatg 3660
ttagatataa gaatgtaatg atattcatgc actgaattct aagccaatat gaacaaaaat 3720
gctgcatgaa tggcacatat aggtcaccaa agttcattca caggtagaaa aaacttgtgc 3780
tttcttttcc atctaaaaac aaaaggagac tttctttatc tcatttaaag aacagctctt 3840
tgaaattgaa attgaccctt tttgcttgac cttaaggaga ttagcttcca gtagatgagt 3900
ttgcaaaata cttttcctgt tcttttgttt tgctggtatt gaaaacatcc cactaaatca 3960
gatgaagagg catgggagga aaaatatcca aattaattac taaaatcgag aagagaaggc 4020
aaactcttga aaagtaaaaa ggtgtttgtg accttcagta tttattgaac agaggaaata 4080
actgacaagg gcaatacaat tcaatgttca tgtagtaaca ttcatgtcac ttgttgaatt 4140
tggttctcat atgtatattg catacacata aattcaaact ataagtcgtc atttttgagc 4200
catcatctta cattcatgta atgaaattat ggaagagagt aaaaactagc tcttaactta 4260
gtaaatataa tatggtattt aaaatcaggt cactacagta aggttctaag tattgccaat 4320
tgaaaagcta gaaatggtat tactgttgca aagtgttgtc aataattgac tccaatagca 4380
ttgtaaatac ttgtatccca caactatttt aaacccaagc aataaaatgg attttctaat 4440
tcacttcaat tctttatttc tcttacctat ctatgtcttg gtacataaga aaagtgtttc 4500
catcactgca ttggtagaat tatttcagtg ttattatttt tgtgatttcg tatgtctaca 4560
caaaaatgaa ttagtttaat ttatattgtg atacagttgt ttgagaaata tttttaattc 4620
tgtttcaact ttatttctcc aagtgtataa tataaaaatt acttctgtta ttgttctcta 4680
ccaatagggg aaaaaattaa aacattagat ccattgagaa agagatgatg taataaataa 4740
ttaagattag taataatatt atcaggggtg attatgacca gttgaataat ctctttccct 4800
tgaattattt agctaacaaa ttaactctcc caaatattta aaataatgta aaatcatatt 4860
ttactgccca ttattaacta aaatattttt gtttgacttt gagcaccaac tggtaatact 4920
aataaatacc catgtcatgc agatggctgg gcgaataaga gatgtctaaa aatatgcact 4980
ggtcttggaa aacatggcac aagtaaggat atcatatatg atgtctgttt attttatgtc 5040
tgatttcttt tgaatgagta gttggggact ccatttctaa ggagactagg taaataaaat 5100
gacctttgac atttca 5116
<![CDATA[<210> 663]]>
<![CDATA[<211> 5]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 663]]>
Met Ala Glu Asp Lys
1 5
<![CDATA[<210> 664]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 664]]>
Thr Lys His Gly Gly His Lys Asn Gly Arg Lys Gly Gly Leu Ser Gly
1 5 10 15
Thr Ser Phe Phe Thr Trp Phe Met Val Ile Ala Leu Leu Gly Val Trp
20 25 30
Thr Ser Val Ala Val Val Trp Phe Asp Leu Val Asp Tyr Glu Glu Val
35 40 45
Leu
<![CDATA[<210> 665]]>
<![CDATA[<211> 22]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 665]]>
Lys Leu Gly Ile Tyr Asp Ala Asp Gly Asp Gly Asp Phe Asp Val Asp
1 5 10 15
Asp Ala Lys Val Leu Leu
20
<![CDATA[<210> 666]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 666]]>
Leu Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala
1 5 10 15
Glu Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala
20 25 30
<![CDATA[<210> 667]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 667]]>
Pro Gln Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu
1 5 10 15
His Glu Met Val His Ala Glu His
20
<![CDATA[<210> 668]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 668]]>
Glu Gly Glu Asp Leu Gln Gln Glu Asp Gly Pro Thr Gly Glu Pro Gln
1 5 10 15
Gln Glu Asp Asp Glu Phe Leu Met Ala Thr Asp Val Asp Asp Arg Phe
20 25 30
Glu Thr Leu Glu Pro Glu Val Ser His Glu
35 40
<![CDATA[<210> 669]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 669]]>
Thr Glu His Ser Tyr His Val Glu Glu Thr
1 5 10
<![CDATA[<210> 670]]>
<![CDATA[<211> 18]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 670]]>
Ser Gln Asp Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu
1 5 10 15
Asn Pro
<![CDATA[<210> 671]]>
<![CDATA[<211> 15]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 671]]>
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr
1 5 10 15
<![CDATA[<210> 672]]>
<![CDATA[<211> 10]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 672]]>
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln
1 5 10
<![CDATA[<210> 673]]>
<![CDATA[<211> 13]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 673]]>
Val Tyr Glu Pro Leu Glu Asn Glu Gly Ile Glu Ile Thr
1 5 10
<![CDATA[<210> 674]]>
<![CDATA[<211> 18]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 674]]>
Val Thr Ala Pro Pro Glu Asp Asn Pro Val Glu Asp Ser Gln Val Ile
1 5 10 15
Val Glu
<![CDATA[<210> 675]]>
<![CDATA[<211> 14]]>
<![CDATA[<212> PRT]]>
<![CDATA[<21]]>3> 人類]]>
<br/>
<br/><![CDATA[<400> 675]]>
<br/>
<br/><![CDATA[Val Ser Ile Phe Pro Val Glu Glu Gln Gln Glu Val Pro Pro
1 5 10
<![CDATA[<210> 676]]>
<![CDATA[<211> 13]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 676]]>
Thr Asn Arg Lys Thr Asp Asp Pro Glu Gln Lys Ala Lys
1 5 10
<![CDATA[<210> 677]]>
<![CDATA[<211> 28]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 677]]>
Lys Lys Lys Lys Pro Lys Leu Leu Asn Lys Phe Asp Lys Thr Ile Lys
1 5 10 15
Ala Glu Leu Asp Ala Ala Glu Lys Leu Arg Lys Arg
20 25
<![CDATA[<210> 678]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 678]]>
Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys Glu Leu Val Arg Lys
1 5 10 15
Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys Ala Gln
20 25
<![CDATA[<210> 679]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PR]]>T
<![CDATA[<213> 人類]]>
<![CDATA[<400> 679]]>
Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val Leu Arg
1 5 10 15
Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp Val Pro
20 25 30
Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg Gln Gln
35 40 45
Phe Leu
50
<![CDATA[<210> 680]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 680]]>
His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg Leu Val Gln Leu Phe
1 5 10 15
Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly Val Gly Tyr Leu Leu
20 25 30
Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr Glu Glu
35 40 45
<![CDATA[<210> 681]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 681]]>
Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His Tyr Gly
1 5 10 15
Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro Tyr Leu
20 25 30
Lys
<![CDATA[<210> 682]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 682]]>
Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly Arg Phe Tyr
1 5 10 15
Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys Glu
20 25 30
<![CDATA[<210> 683]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 683]]>
Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala Ser
1 5 10 15
Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln Pro
20 25 30
Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys
35 40 45
<![CDATA[<210> 684]]>
<![CDATA[<211> 45]]>
<![CDATA[<212> PRT]]>
<![CDATA[<21]]>3> 人類]]>
<br/>
<br/><![CDATA[<400> 684]]>
<br/>
<br/><![CDATA[Ser Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val
1 5 10 15
Met Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg
20 25 30
Glu Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln
35 40 45
<![CDATA[<210> 685]]>
<![CDATA[<211> 30]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 685]]>
Arg Arg Asn Glu Asn Ala Cys Lys Gly Ala Pro Lys Thr Cys Thr Leu
1 5 10 15
Leu Glu Lys Phe Pro Glu Thr Thr Gly Cys Arg Arg Gly Gln
20 25 30
<![CDATA[<210> 686]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 686]]>
Ile Lys Tyr Ser Ile Met His Pro Gly Thr His Val Trp Pro His Thr
1 5 10 15
Gly Pro Thr Asn Cys Arg Leu Arg Met His Leu Gly Leu Val Ile Pro
20 25 30
Lys Glu Gly Cys Lys Ile Arg Cys Ala Asn Glu Thr
35 40
<![CDATA[<210> 687]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 687]]>
Thr Trp Glu Glu Gly Lys Val Leu Ile Phe Asp Asp Ser Phe Glu His
1 5 10 15
Glu Val Trp Gln Asp Ala Ser Ser Phe Arg Leu Ile Phe Ile Val Asp
20 25 30
Val Trp His Pro Glu Leu Thr Pro Gln Gln Arg Arg Ser Leu Pro Ala
35 40 45
Ile
<![CDATA[<210> 688]]>
<![CDATA[<211> 729]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 688]]>
Met Ala Glu Asp Lys Glu Thr Lys His Gly Gly His Lys Asn Gly Arg
1 5 10 15
Lys Gly Gly Leu Ser Gly Thr Ser Phe Phe Thr Trp Phe Met Val Ile
20 25 30
Ala Leu Leu Gly Val Trp Thr Ser Val Ala Val Val Trp Phe Asp Leu
35 40 45
Val Asp Tyr Glu Glu Val Leu Gly Lys Leu Gly Ile Tyr Asp Ala Asp
50 55 60
Gly Asp Gly Asp Phe Asp Val Asp Asp Ala Lys Val Leu Leu Gly Leu
65 70 75 80
Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala Glu
85 90 95
Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala Glu Pro Gln
100 105 110
Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu His Glu
115 120 125
Met Val His Ala Glu His Val Glu Gly Glu Asp Leu Gln Gln Glu Asp
130 135 140
Gly Pro Thr Gly Glu Pro Gln Gln Glu Asp Asp Glu Phe Leu Met Ala
145 150 155 160
Thr Asp Val Asp Asp Arg Phe Glu Thr Leu Glu Pro Glu Val Ser His
165 170 175
Glu Glu Thr Glu His Ser Tyr His Val Glu Glu Thr Val Ser Gln Asp
180 185 190
Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu Asn Pro Asp
195 200 205
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr Asp
210 215 220
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln Ala Val Tyr Glu Pro Leu
225 230 235 240
Glu Asn Glu Gly Ile Glu Ile Thr Glu Val Thr Ala Pro Pro Glu Asp
245 250 255
Asn Pro Val Glu Asp Ser Gln Val Ile Val Glu Glu Val Ser Ile Phe
260 265 270
Pro Val Glu Glu Gln Gln Glu Val Pro Pro Glu Thr Asn Arg Lys Thr
275 280 285
Asp Asp Pro Glu Gln Lys Ala Lys Val Lys Lys Lys Lys Pro Lys Leu
290 295 300
Leu Asn Lys Phe Asp Lys Thr Ile Lys Ala Glu Leu Asp Ala Ala Glu
305 310 315 320
Lys Leu Arg Lys Arg Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys
325 330 335
Glu Leu Val Arg Lys Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys
340 345 350
Ala Gln Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val
355 360 365
Leu Arg Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp
370 375 380
Val Pro Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg
385 390 395 400
Gln Gln Phe Leu Gly His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg
405 410 415
Leu Val Gln Leu Phe Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly
420 425 430
Val Gly Tyr Leu Leu Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr
435 440 445
Glu Glu Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His
450 455 460
Tyr Gly Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro
465 470 475 480
Tyr Leu Lys Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly
485 490 495
Arg Phe Tyr Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys
500 505 510
Glu Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala
515 520 525
Ser Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln
530 535 540
Pro Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys Ser
545 550 555 560
Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val Met
565 570 575
Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu
580 585 590
Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln Gly Arg Arg Asn
595 600 605
Glu Asn Ala Cys Lys Gly Ala Pro Lys Thr Cys Thr Leu Leu Glu Lys
610 615 620
Phe Pro Glu Thr Thr Gly Cys Arg Arg Gly Gln Ile Lys Tyr Ser Ile
625 630 635 640
Met His Pro Gly Thr His Val Trp Pro His Thr Gly Pro Thr Asn Cys
645 650 655
Arg Leu Arg Met His Leu Gly Leu Val Ile Pro Lys Glu Gly Cys Lys
660 665 670
Ile Arg Cys Ala Asn Glu Thr Lys Thr Trp Glu Glu Gly Lys Val Leu
675 680 685
Ile Phe Asp Asp Ser Phe Glu His Glu Val Trp Gln Asp Ala Ser Ser
690 695 700
Phe Arg Leu Ile Phe Ile Val Asp Val Trp His Pro Glu Leu Thr Pro
705 710 715 720
Gln Gln Arg Arg Ser Leu Pro Ala Ile
725
<![CDATA[<210> 689]]>
<![CDATA[<211> 1936]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 689]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaaga gacaaagcat ggaggacaca agaatgggag gaaaggcgga 180
ctctcaggaa cttcattctt cacgtggttt atggtgattg cattgctggg cgtctggaca 240
tctgtagctg tcgtttggtt tgatcttgtt gactatgagg aagttctagg aaaactagga 300
atctatgatg ctgatggtga tggagatttt gatgtggatg atgccaaagt tttattagga 360
cttaaagaga gatctacttc agagccagca gtcccgccag aagaggctga gccacacact 420
gagcccgagg agcaggttcc tgtggaggca gaaccccaga atatcgaaga tgaagcaaaa 480
gaacaaattc agtcccttct ccatgaaatg gtacacgcag aacatgttga gggagaagac 540
ttgcaacaag aagatggacc cacaggagaa ccacaacaag aggatgatga gtttcttatg 600
gcgactgatg tagatgatag atttgagacc ctggaacctg aagtatctca tgaagaaacc 660
gagcatagtt accacgtgga agagacagtt tcacaagact gtaatcagga tatggaagag 720
atgatgtctg agcaggaaaa tccagattcc agtgaaccag tagtagaaga tgaaagattg 780
caccatgata cagatgatgt aacataccaa gtctatgagg aacaagcagt atatgaacct 840
ctagaaaatg aagggataga aatcacagaa gtaactgctc cccctgagga taatcctgta 900
gaagattcac aggtaattgt agaagaagta agcatttttc ctgtggaaga acagcaggaa 960
gtaccaccag aaacaaatag aaaaacagat gatccagaac aaaaagcaaa agttaagaaa 1020
aagaagccta aacttttaaa taaatttgat aagactatta aagctgaact tgatgctgca 1080
gaaaaactcc gtaaaagggg aaaaattgag gaagcagtga atgcatttaa agaactagta 1140
cgcaaatacc ctcagagtcc acgagcaaga tatgggaagg cgcagtgtga ggatgatttg 1200
gctgagaaga ggagaagtaa tgaggtgcta cgtggagcca tcgagaccta ccaagaggtg 1260
gccagcctac ctgatgtccc tgcagacctg ctgaagctga gtttgaagcg tcgctcagac 1320
aggcaacaat ttctaggtca tatgagaggt tccctgctta ccctgcagag attagttcaa 1380
ctatttccca atgatacttc cttaaaaaat gaccttggcg tgggatacct cttgatagga 1440
gataatgaca atgcaaagaa agtttatgaa gaggtgctga gtgtgacacc taatgatggc 1500
tttgctaaag tccattatgg cttcatcctg aaggcacaga acaaaattgc tgagagcatc 1560
ccatatttaa aggaaggaat agaatccgga gatcctggca ctgatgatgg gagattttat 1620
ttccacctgg gggatgccat gcagagggtt gggaacaaag aggcatataa gtggtatgag 1680
cttgggcaca agagaggaca ctttgcatct gtctggcaac gctcactcta caatgtgaat 1740
ggactgaaag cacagccttg gtggacccca aaagaaacgg gctacacaga gttagtaaag 1800
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 1860
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 1920
acgctgtggc agcaag 1936
<![CDATA[<210> 690]]>
<![CDATA[<211> 604]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 690]]>
Met Ala Glu Asp Lys Glu Thr Lys His Gly Gly His Lys Asn Gly Arg
1 5 10 15
Lys Gly Gly Leu Ser Gly Thr Ser Phe Phe Thr Trp Phe Met Val Ile
20 25 30
Ala Leu Leu Gly Val Trp Thr Ser Val Ala Val Val Trp Phe Asp Leu
35 40 45
Val Asp Tyr Glu Glu Val Leu Gly Lys Leu Gly Ile Tyr Asp Ala Asp
50 55 60
Gly Asp Gly Asp Phe Asp Val Asp Asp Ala Lys Val Leu Leu Gly Leu
65 70 75 80
Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala Glu
85 90 95
Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala Glu Pro Gln
100 105 110
Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu His Glu
115 120 125
Met Val His Ala Glu His Val Glu Gly Glu Asp Leu Gln Gln Glu Asp
130 135 140
Gly Pro Thr Gly Glu Pro Gln Gln Glu Asp Asp Glu Phe Leu Met Ala
145 150 155 160
Thr Asp Val Asp Asp Arg Phe Glu Thr Leu Glu Pro Glu Val Ser His
165 170 175
Glu Glu Thr Glu His Ser Tyr His Val Glu Glu Thr Val Ser Gln Asp
180 185 190
Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu Asn Pro Asp
195 200 205
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr Asp
210 215 220
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln Ala Val Tyr Glu Pro Leu
225 230 235 240
Glu Asn Glu Gly Ile Glu Ile Thr Glu Val Thr Ala Pro Pro Glu Asp
245 250 255
Asn Pro Val Glu Asp Ser Gln Val Ile Val Glu Glu Val Ser Ile Phe
260 265 270
Pro Val Glu Glu Gln Gln Glu Val Pro Pro Glu Thr Asn Arg Lys Thr
275 280 285
Asp Asp Pro Glu Gln Lys Ala Lys Val Lys Lys Lys Lys Pro Lys Leu
290 295 300
Leu Asn Lys Phe Asp Lys Thr Ile Lys Ala Glu Leu Asp Ala Ala Glu
305 310 315 320
Lys Leu Arg Lys Arg Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys
325 330 335
Glu Leu Val Arg Lys Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys
340 345 350
Ala Gln Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val
355 360 365
Leu Arg Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp
370 375 380
Val Pro Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg
385 390 395 400
Gln Gln Phe Leu Gly His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg
405 410 415
Leu Val Gln Leu Phe Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly
420 425 430
Val Gly Tyr Leu Leu Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr
435 440 445
Glu Glu Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His
450 455 460
Tyr Gly Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro
465 470 475 480
Tyr Leu Lys Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly
485 490 495
Arg Phe Tyr Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys
500 505 510
Glu Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala
515 520 525
Ser Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln
530 535 540
Pro Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys Ser
545 550 555 560
Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val Met
565 570 575
Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu
580 585 590
Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln
595 600
<![CDATA[<210> 691]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 691]]>
cctcctgtac tccaggctcc acctgcatcg accgtgtggc ctccttctct tgcatgtgcc 60
cagaggggaa ggcag 75
<![CDATA[<210> 692]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 692]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatg 75
<![CDATA[<210> 693]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 693]]>
cctcctgtac tccaggctcc acctgcatcg accgtgtggc ctccttctct tgcatgtgcc 60
cagaggggaa ggcagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[<210> 694]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 694]]>
Ser Cys Thr Pro Gly Ser Thr Cys Ile Asp Arg Val Ala Ser Phe Ser
1 5 10 15
Cys Met Cys Pro Glu Gly Lys Ala Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[<210> 695]]>
<![CDATA[<211> 329]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 695]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatg 329
<![CDATA[<210> 696]]>
<![CDATA[<211> 82]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 696]]>
cattgcagtg tcgagatggc tatgaaccct gtgtaaatga aggaatgtgt gttacctacc 60
acaatggcac aggatactgc aa 82
<![CDATA[<210> 697]]>
<![CDATA[<211> 260]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 697]]>
atgtccagaa ggcttcttgg gggaatattg tcaacatcga gacccctgtg agaagaaccg 60
ctgccagaat ggtgggactt gtgtggccca ggccatgctg gggaaagcca cgtgccgatg 120
tgcctcaggg tttacaggag aggactgcca gtactcaaca tctcatccat gctttgtgtc 180
tcgaccctgc ctgaatggcg gcacatgcca tatgctcagc cgggatacct atgagtgcac 240
ctgtcaagtc gggtttacag 260
<![CDATA[<210> 698]]>
<![CDATA[<211> 336]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 698]]>
gtaaggagtg ccaatggacg gatgcctgcc tgtctcatcc ctgtgcaaat ggaagtacct 60
gtaccactgt ggccaaccag ttctcctgca aatgcctcac aggcttcaca gggcagaaat 120
gtgagactga tgtcaatgag tgtgacattc caggacactg ccagcatggt ggcacctgcc 180
tcaacctgcc tggttcctac cagtgccagt gccctcaggg cttcacaggc cagtactgtg 240
acagcctgta tgtgccctgt gcaccctcac cttgtgtcaa tggaggcacc tgtcggcaga 300
ctggtgactt cacttttgag tgcaactgcc ttccag 336
<![CDATA[<210> 699]]>
<![CDATA[<211> 123]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 699]]>
gttttgaagg gagcacctgt gagaggaata ttgatgactg ccctaaccac aggtgtcaga 60
atggaggggt ttgtgtggat ggggtcaaca cttacaactg ccgctgtccc ccacaatgga 120
cag 123
<![CDATA[<210> 700]]>
<![CDATA[<211> 234]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 700]]>
gacagttctg cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg 60
ggggcacctg tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg 120
gagatgactg cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca 180
cctgcatcga ccgtgtggcc tccttctctt gcatgtgccc agaggggaag gcag 234
<![CDATA[<210> 701]]>
<![CDATA[<211> 156]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 701]]>
gtctcctgtg tcatctggat gatgcatgca tcagcaatcc ttgccacaag ggggcactgt 60
gtgacaccaa ccccctaaat gggcaatata tttgcacctg cccacaaggc tacaaagggg 120
ctgactgcac agaagatgtg gatgaatgtg ccatgg 156
<![CDATA[<210> 702]]>
<![CDATA[<211> 9905]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 702]]>
ccaatagcaa tccttgtgag catgcaggaa aatgtgtgaa cacggatggc gccttccact 60
gtgagtgtct gaagggttat gcaggacctc gttgtgagat ggacatcaat gagtgccatt 120
cagacccctg ccagaatgat gctacctgtc tggataagat tggaggcttc acatgtctgt 180
gcatgccagg tttcaaaggt gtgcattgtg aattagaaat aaatgaatgt cagagcaacc 240
cttgtgtgaa caatgggcag tgtgtggata aagtcaatcg tttccagtgc ctgtgtcctc 300
ctggtttcac tgggccagtt tgccagattg atattgatga ctgttccagt actccgtgtc 360
tgaatggggc aaagtgtatc gatcacccga atggctatga atgccagtgt gccacaggtt 420
tcactggtgt gttgtgtgag gagaacattg acaactgtga ccccgatcct tgccaccatg 480
gtcagtgtca ggatggtatt gattcctaca cctgcatctg caatcccggg tacatgggcg 540
ccatctgcag tgaccagatt gatgaatgtt acagcagccc ttgcctgaac gatggtcgct 600
gcattgacct ggtcaatggc taccagtgca actgccagcc aggcacgtca ggggttaatt 660
gtgaaattaa ttttgatgac tgtgcaagta acccttgtat ccatggaatc tgtatggatg 720
gcattaatcg ctacagttgt gtctgctcac caggattcac agggcagaga tgtaacattg 780
acattgatga gtgtgcctcc aatccctgtc gcaagggtgc aacatgtatc aacggtgtga 840
atggtttccg ctgtatatgc cccgagggac cccatcaccc cagctgctac tcacaggtga 900
acgaatgcct gagcaatccc tgcatccatg gaaactgtac tggaggtctc agtggatata 960
agtgtctctg tgatgcaggc tgggttggca tcaactgtga agtggacaaa aatgaatgcc 1020
tttcgaatcc atgccagaat ggaggaactt gtgacaatct ggtgaatgga tacaggtgta 1080
cttgcaagaa gggctttaaa ggctataact gccaggtgaa tattgatgaa tgtgcctcaa 1140
atccatgcct gaaccaagga acctgctttg atgacataag tggctacact tgccactgtg 1200
tgctgccata cacaggcaag aattgtcaga cagtattggc tccctgttcc ccaaaccctt 1260
gtgagaatgc tgctgtttgc aaagagtcac caaattttga gagttatact tgcttgtgtg 1320
ctcctggctg gcaaggtcag cggtgtacca ttgacattga cgagtgtatc tccaagccct 1380
gcatgaacca tggtctctgc cataacaccc agggcagcta catgtgtgaa tgtccaccag 1440
gcttcagtgg tatggactgt gaggaggaca ttgatgactg ccttgccaat ccttgccaga 1500
atggaggttc ctgtatggat ggagtgaata ctttctcctg cctctgcctt ccgggtttca 1560
ctggggataa gtgccagaca gacatgaatg agtgtctgag tgaaccctgt aagaatggag 1620
ggacctgctc tgactacgtc aacagttaca cttgcaagtg ccaggcagga tttgatggag 1680
tccattgtga gaacaacatc aatgagtgca ctgagagctc ctgtttcaat ggtggcacat 1740
gtgttgatgg gattaactcc ttctcttgct tgtgccctgt gggtttcact ggatccttct 1800
gcctccatga gatcaatgaa tgcagctctc atccatgcct gaatgaggga acgtgtgttg 1860
atggcctggg tacctaccgc tgcagctgcc ccctgggcta cactgggaaa aactgtcaga 1920
ccctggtgaa tctctgcagt cggtctccat gtaaaaacaa aggtacttgc gttcagaaaa 1980
aagcagagtc ccagtgccta tgtccatctg gatgggctgg tgcctattgt gacgtgccca 2040
atgtctcttg tgacatagca gcctccagga gaggtgtgct tgttgaacac ttgtgccagc 2100
actcaggtgt ctgcatcaat gctggcaaca cgcattactg tcagtgcccc ctgggctata 2160
ctgggagcta ctgtgaggag caactcgatg agtgtgcgtc caacccctgc cagcacgggg 2220
caacatgcag tgacttcatt ggtggataca gatgcgagtg tgtcccaggc tatcagggtg 2280
tcaactgtga gtatgaagtg gatgagtgcc agaatcagcc ctgccagaat ggaggcacct 2340
gtattgacct tgtgaaccat ttcaagtgct cttgcccacc aggcactcgg ggcctactct 2400
gtgaagagaa cattgatgac tgtgcccggg gtccccattg ccttaatggt ggtcagtgca 2460
tggataggat tggaggctac agttgtcgct gcttgcctgg ctttgctggg gagcgttgtg 2520
agggagacat caacgagtgc ctctccaacc cctgcagctc tgagggcagc ctggactgta 2580
tacagctcac caatgactac ctgtgtgttt gccgtagtgc ctttactggc cggcactgtg 2640
aaaccttcgt cgatgtgtgt ccccagatgc cctgcctgaa tggagggact tgtgctgtgg 2700
ccagtaacat gcctgatggt ttcatttgcc gttgtccccc gggattttcc ggggcaaggt 2760
gccagagcag ctgtggacaa gtgaaatgta ggaaggggga gcagtgtgtg cacaccgcct 2820
ctggaccccg ctgcttctgc cccagtcccc gggactgcga gtcaggctgt gccagtagcc 2880
cctgccagca cgggggcagc tgccaccctc agcgccagcc tccttattac tcctgccagt 2940
gtgccccacc attctcgggt agccgctgtg aactctacac ggcacccccc agcacccctc 3000
ctgccacctg tctgagccag tattgtgccg acaaagctcg ggatggcgtc tgtgatgagg 3060
cctgcaacag ccatgcctgc cagtgggatg ggggtgactg ttctctcacc atggagaacc 3120
cctgggccaa ctgctcctcc ccacttccct gctgggatta tatcaacaac cagtgtgatg 3180
agctgtgcaa cacggtcgag tgcctgtttg acaactttga atgccagggg aacagcaaga 3240
catgcaagta tgacaaatac tgtgcagacc acttcaaaga caaccactgt gaccaggggt 3300
gcaacagtga ggagtgtggt tgggatgggc tggactgtgc tgctgaccaa cctgagaacc 3360
tggcagaagg taccctggtt attgtggtat tgatgccacc tgaacaactg ctccaggatg 3420
ctcgcagctt cttgcgggca ctgggtaccc tgctccacac caacctgcgc attaagcggg 3480
actcccaggg ggaactcatg gtgtacccct attatggtga gaagtcagct gctatgaaga 3540
aacagaggat gacacgcaga tcccttcctg gtgaacaaga acaggaggtg gctggctcta 3600
aagtctttct ggaaattgac aaccgccagt gtgttcaaga ctcagaccac tgcttcaaga 3660
acacggatgc agcagcagct ctcctggcct ctcacgccat acaggggacc ctgtcatacc 3720
ctcttgtgtc tgtcgtcagt gaatccctga ctccagaacg cactcagctc ctctatctcc 3780
ttgctgttgc tgttgtcatc attctgttta ttattctgct gggggtaatc atggcaaaac 3840
gaaagcgtaa gcatggctct ctctggctgc ctgaaggttt cactcttcgc cgagatgcaa 3900
gcaatcacaa gcgtcgtgag ccagtgggac aggatgctgt ggggctgaaa aatctctcag 3960
tgcaagtctc agaagctaac ctaattggta ctggaacaag tgaacactgg gtcgatgatg 4020
aagggcccca gccaaagaaa gtaaaggctg aagatgaggc cttactctca gaagaagatg 4080
accccattga tcgacggcca tggacacagc agcaccttga agctgcagac atccgtagga 4140
caccatcgct ggctctcacc cctcctcagg cagagcagga ggtggatgtg ttagatgtga 4200
atgtccgtgg cccagatggc tgcaccccat tgatgttggc ttctctccga ggaggcagct 4260
cagatttgag tgatgaagat gaagatgcag aggactcttc tgctaacatc atcacagact 4320
tggtctacca gggtgccagc ctccaggccc agacagaccg gactggtgag atggccctgc 4380
accttgcagc ccgctactca cgggctgatg ctgccaagcg tctcctggat gcaggtgcag 4440
atgccaatgc ccaggacaac atgggccgct gtccactcca tgctgcagtg gcagctgatg 4500
cccaaggtgt cttccagatt ctgattcgca accgagtaac tgatctagat gccaggatga 4560
atgatggtac tacacccctg atcctggctg cccgcctggc tgtggaggga atggtggcag 4620
aactgatcaa ctgccaagcg gatgtgaatg cagtggatga ccatggaaaa tctgctcttc 4680
actgggcagc tgctgtcaat aatgtggagg caactctttt gttgttgaaa aatggggcca 4740
accgagacat gcaggacaac aaggaagaga cacctctgtt tcttgctgcc cgggagggga 4800
gctatgaagc agccaagatc ctgttagacc attttgccaa tcgagacatc acagaccata 4860
tggatcgtct tccccgggat gtggctcggg atcgcatgca ccatgacatt gtgcgccttc 4920
tggatgaata caatgtgacc ccaagccctc caggcaccgt gttgacttct gctctctcac 4980
ctgtcatctg tgggcccaac agatctttcc tcagcctgaa gcacacccca atgggcaaga 5040
agtctagacg gcccagtgcc aagagtacca tgcctactag cctccctaac cttgccaagg 5100
aggcaaagga tgccaagggt agtaggagga agaagtctct gagtgagaag gtccaactgt 5160
ctgagagttc agtaacttta tcccctgttg attccctaga atctcctcac acgtatgttt 5220
ccgacaccac atcctctcca atgattacat cccctgggat cttacaggcc tcacccaacc 5280
ctatgttggc cactgccgcc cctcctgccc cagtccatgc ccagcatgca ctatcttttt 5340
ctaaccttca tgaaatgcag cctttggcac atggggccag cactgtgctt ccctcagtga 5400
gccagttgct atcccaccac cacattgtgt ctccaggcag tggcagtgct ggaagcttga 5460
gtaggctcca tccagtccca gtcccagcag attggatgaa ccgcatggag gtgaatgaga 5520
cccagtacaa tgagatgttt ggtatggtcc tggctccagc tgagggcacc catcctggca 5580
tagctcccca gagcaggcca cctgaaggga agcacataac cacccctcgg gagcccttgc 5640
cccccattgt gactttccag ctcatcccta aaggcagtat tgcccaacca gcgggggctc 5700
cccagcctca gtccacctgc cctccagctg ttgcgggccc cctgcccacc atgtaccaga 5760
ttccagaaat ggcccgtttg cccagtgtgg ctttccccac tgccatgatg ccccagcagg 5820
acgggcaggt agctcagacc attctcccag cctatcatcc tttcccagcc tctgtgggca 5880
agtaccccac acccccttca cagcacagtt atgcttcctc aaatgctgct gagcgaacac 5940
ccagtcacag tggtcacctc cagggtgagc atccctacct gacaccatcc ccagagtctc 6000
ctgaccagtg gtcaagttca tcaccccact ctgcttctga ctggtcagat gtgaccacca 6060
gccctacccc tgggggtgct ggaggaggtc agcggggacc tgggacacac atgtctgagc 6120
caccacacaa caacatgcag gtttatgcgt gagagagtcc acctccagtg tagagacata 6180
actgactttt gtaaatgctg ctgaggaaca aatgaaggtc atccgggaga gaaatgaaga 6240
aatctctgga gccagcttct agaggtagga aagagaagat gttcttattc agataatgca 6300
agagaagcaa ttcgtcagtt tcactgggta tctgcaaggc ttattgatta ttctaatcta 6360
ataagacaag tttgtggaaa tgcaagatga atacaagcct tgggtccatg tttactctct 6420
tctatttgga gaataagatg gatgcttatt gaagcccaga cattcttgca gcttggactg 6480
cattttaagc cctgcaggct tctgccatat ccatgagaag attctacact agcgtcctgt 6540
tgggaattat gccctggaat tctgcctgaa ttgacctacg catctcctcc tccttggaca 6600
ttcttttgtc ttcatttggt gcttttggtt ttgcacctct ccgtgattgt agccctacca 6660
gcatgttata gggcaagacc tttgtgcttt tgatcattct ggcccatgaa agcaactttg 6720
gtctcctttc ccctcctgtc ttcccggtat cccttggagt ctcacaaggt ttactttggt 6780
atggttctca gcacaaacct ttcaagtatg ttgtttcttt ggaaaatgga catactgtat 6840
tgtgttctcc tgcatatatc attcctggag agagaagggg agaagaatac ttttcttcaa 6900
caaattttgg gggcaggaga tcccttcaag aggctgcacc ttaatttttc ttgtctgtgt 6960
gcaggtcttc atataaactt taccaggaag aagggtgtga gtttgttgtt tttctgtgta 7020
tgggcctggt cagtgtaaag ttttatcctt gatagtctag ttactatgac cctccccact 7080
tttttaaaac cagaaaaagg tttggaatgt tggaatgacc aagagacaag ttaactcgtg 7140
caagagccag ttacccaccc acaggtcccc ctacttcctg ccaagcattc cattgactgc 7200
ctgtatggaa cacatttgtc ccagatctga gcattctagg cctgtttcac tcactcaccc 7260
agcatatgaa actagtctta actgttgagc ctttcctttc atatccacag aagacactgt 7320
ctcaaatgtt gtacccttgc catttaggac tgaactttcc ttagcccaag ggacccagtg 7380
acagttgtct tccgtttgtc agatgatcag tctctactga ttatcttgct gcttaaaggc 7440
ctgctcacca atctttcttt cacaccgtgt ggtccgtgtt actggtatac ccagtatgtt 7500
ctcactgaag acatggactt tatatgttca agtgcaggaa ttggaaagtt ggacttgttt 7560
tctatgatcc aaaacagccc tataagaagg ttggaaaagg aggaactata tagcagcctt 7620
tgctattttc tgctaccatt tcttttcctc tgaagcggcc atgacattcc ctttggcaac 7680
taacgtagaa actcaacaga acattttcct ttcctagagt caccttttag atgataatgg 7740
acaactatag acttgctcat tgttcagact gattgcccct cacctgaatc cactctctgt 7800
attcatgctc ttggcaattt ctttgacttt cttttaaggg cagaagcatt ttagttaatt 7860
gtagataaag aatagttttc ttcctcttct ccttgggcca gttaataatt ggtccatggc 7920
tacactgcaa cttccgtcca gtgctgtgat gcccatgaca cctgcaaaat aagttctgcc 7980
tgggcatttt gtagatatta acaggtgaat tcccgactct tttggtttga atgacagttc 8040
tcattccttc tatggctgca agtatgcatc agtgcttccc acttacctga tttgtctgtc 8100
ggtggcccca tatggaaacc ctgcgtgtct gttggcataa tagtttacaa atggtttttt 8160
cagtcctatc caaatttatt gaaccaacaa aaataattac ttctgccctg agataagcag 8220
attaagtttg ttcattctct gctttattct ctccatgtgg caacattctg tcagcctctt 8280
tcatagtgtg caaacatttt atcattctaa atggtgactc tctgcccttg gacccattta 8340
ttattcacag atggggagaa cctatctgca tggacctctg tggaccacag cgtacctgcc 8400
cctttctgcc ctcctgctcc agccccactt ctgaaagtat cagctactga tccagccact 8460
ggatatttta tatcctccct tttccttaag cacaatgtca gaccaaattg cttgtttctt 8520
tttcttggac tactttaatt tggatccttt gggtttggag aaagggaatg tgaaagctgt 8580
cattacagac aacaggtttc agtgatgagg aggacaacac tgcctttcaa actttttact 8640
gatctcttag attttaagaa ctcttgaatt gtgtggtatc taataaaagg gaaggtaaga 8700
tggataatca ctttctcatt tgggttctga attggagact cagtttttat gagacacatc 8760
ttttatgcca tgtatagatc ctcccctgct atttttggtt tatttttatt gttataaatg 8820
ctttctttct ttgactcctc ttctgcctgc ctttggggat aggttttttt gtttgtttat 8880
ttgcttcctc tgttttgttt taagcatcat tttcttatgt gaggtgggga agggaaaggt 8940
atgagggaaa gagagtctga gaattaaaat attttagtat aagcaattgg ctgtgatgct 9000
caaatccatt gcatcctctt attgaatttg ccaatttgta atttttgcat aataaagaac 9060
caaaggtgta atgttttgtt gagaggtggt ttagggattt tggccctaac caatacattg 9120
aatgtatgat gactatttgg gaggacacat ttatgtaccc agaggccccc actaataagt 9180
ggtactatgg ttacttcctt gtgtacattt ctcttaaaag tgatattata tctgtttgta 9240
tgagaaaccc agtaaccaat aaaatgaccg catattcctg actaaacgta gtaaggaaaa 9300
tgcacacttt gtttttactt ttccgtttca ttctaaaggt agttaagatg aaatttatat 9360
gaaagcattt ttatcacaaa ataaaaaagg tttgccaagc tcagtggtgt tgtatttttt 9420
attttccaat actgcatcca tggcctggca gtgttacctc atgatgtcat aatttgctga 9480
gagagcaaat tttcttttct ttctgaatcc cacaaagcct agcaccaaac ttcttttttt 9540
cttcctttaa ttagatcata aataaatgat cctggggaaa aagcatctgt caaataggaa 9600
acatcacaaa actgagcact cttctgtgca ctagccatag ctggtgacaa acagatggtt 9660
gctcagggac aaggtgcctt ccaatggaaa tgcgaagtag ttgctatagc aagaattggg 9720
aactgggata taagtcataa tattaattat gctgttatgt aaatgattgg tttgtaacat 9780
tccttaagtg aaatttgtgt agaacttaat atacaggatt ataaaataat attttgtgta 9840
taaatttgtt ataagttcac attcatacat ttatttataa agtcagtgag atatttgaac 9900
atgaa 9905
<![CDATA[<210> 703]]>
<![CDATA[<211> 11425]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> NOTCH2之全mRNA多核苷酸序列]]>
<![CDATA[<400> 703]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatgc attgcagtgt cgagatggct atgaaccctg 360
tgtaaatgaa ggaatgtgtg ttacctacca caatggcaca ggatactgca aatgtccaga 420
aggcttcttg ggggaatatt gtcaacatcg agacccctgt gagaagaacc gctgccagaa 480
tggtgggact tgtgtggccc aggccatgct ggggaaagcc acgtgccgat gtgcctcagg 540
gtttacagga gaggactgcc agtactcaac atctcatcca tgctttgtgt ctcgaccctg 600
cctgaatggc ggcacatgcc atatgctcag ccgggatacc tatgagtgca cctgtcaagt 660
cgggtttaca ggtaaggagt gccaatggac ggatgcctgc ctgtctcatc cctgtgcaaa 720
tggaagtacc tgtaccactg tggccaacca gttctcctgc aaatgcctca caggcttcac 780
agggcagaaa tgtgagactg atgtcaatga gtgtgacatt ccaggacact gccagcatgg 840
tggcacctgc ctcaacctgc ctggttccta ccagtgccag tgccctcagg gcttcacagg 900
ccagtactgt gacagcctgt atgtgccctg tgcaccctca ccttgtgtca atggaggcac 960
ctgtcggcag actggtgact tcacttttga gtgcaactgc cttccaggtt ttgaagggag 1020
cacctgtgag aggaatattg atgactgccc taaccacagg tgtcagaatg gaggggtttg 1080
tgtggatggg gtcaacactt acaactgccg ctgtccccca caatggacag gacagttctg 1140
cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg ggggcacctg 1200
tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg gagatgactg 1260
cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca cctgcatcga 1320
ccgtgtggcc tccttctctt gcatgtgccc agaggggaag gcaggtctcc tgtgtcatct 1380
ggatgatgca tgcatcagca atccttgcca caagggggca ctgtgtgaca ccaaccccct 1440
aaatgggcaa tatatttgca cctgcccaca aggctacaaa ggggctgact gcacagaaga 1500
tgtggatgaa tgtgccatgg ccaatagcaa tccttgtgag catgcaggaa aatgtgtgaa 1560
cacggatggc gccttccact gtgagtgtct gaagggttat gcaggacctc gttgtgagat 1620
ggacatcaat gagtgccatt cagacccctg ccagaatgat gctacctgtc tggataagat 1680
tggaggcttc acatgtctgt gcatgccagg tttcaaaggt gtgcattgtg aattagaaat 1740
aaatgaatgt cagagcaacc cttgtgtgaa caatgggcag tgtgtggata aagtcaatcg 1800
tttccagtgc ctgtgtcctc ctggtttcac tgggccagtt tgccagattg atattgatga 1860
ctgttccagt actccgtgtc tgaatggggc aaagtgtatc gatcacccga atggctatga 1920
atgccagtgt gccacaggtt tcactggtgt gttgtgtgag gagaacattg acaactgtga 1980
ccccgatcct tgccaccatg gtcagtgtca ggatggtatt gattcctaca cctgcatctg 2040
caatcccggg tacatgggcg ccatctgcag tgaccagatt gatgaatgtt acagcagccc 2100
ttgcctgaac gatggtcgct gcattgacct ggtcaatggc taccagtgca actgccagcc 2160
aggcacgtca ggggttaatt gtgaaattaa ttttgatgac tgtgcaagta acccttgtat 2220
ccatggaatc tgtatggatg gcattaatcg ctacagttgt gtctgctcac caggattcac 2280
agggcagaga tgtaacattg acattgatga gtgtgcctcc aatccctgtc gcaagggtgc 2340
aacatgtatc aacggtgtga atggtttccg ctgtatatgc cccgagggac cccatcaccc 2400
cagctgctac tcacaggtga acgaatgcct gagcaatccc tgcatccatg gaaactgtac 2460
tggaggtctc agtggatata agtgtctctg tgatgcaggc tgggttggca tcaactgtga 2520
agtggacaaa aatgaatgcc tttcgaatcc atgccagaat ggaggaactt gtgacaatct 2580
ggtgaatgga tacaggtgta cttgcaagaa gggctttaaa ggctataact gccaggtgaa 2640
tattgatgaa tgtgcctcaa atccatgcct gaaccaagga acctgctttg atgacataag 2700
tggctacact tgccactgtg tgctgccata cacaggcaag aattgtcaga cagtattggc 2760
tccctgttcc ccaaaccctt gtgagaatgc tgctgtttgc aaagagtcac caaattttga 2820
gagttatact tgcttgtgtg ctcctggctg gcaaggtcag cggtgtacca ttgacattga 2880
cgagtgtatc tccaagccct gcatgaacca tggtctctgc cataacaccc agggcagcta 2940
catgtgtgaa tgtccaccag gcttcagtgg tatggactgt gaggaggaca ttgatgactg 3000
ccttgccaat ccttgccaga atggaggttc ctgtatggat ggagtgaata ctttctcctg 3060
cctctgcctt ccgggtttca ctggggataa gtgccagaca gacatgaatg agtgtctgag 3120
tgaaccctgt aagaatggag ggacctgctc tgactacgtc aacagttaca cttgcaagtg 3180
ccaggcagga tttgatggag tccattgtga gaacaacatc aatgagtgca ctgagagctc 3240
ctgtttcaat ggtggcacat gtgttgatgg gattaactcc ttctcttgct tgtgccctgt 3300
gggtttcact ggatccttct gcctccatga gatcaatgaa tgcagctctc atccatgcct 3360
gaatgaggga acgtgtgttg atggcctggg tacctaccgc tgcagctgcc ccctgggcta 3420
cactgggaaa aactgtcaga ccctggtgaa tctctgcagt cggtctccat gtaaaaacaa 3480
aggtacttgc gttcagaaaa aagcagagtc ccagtgccta tgtccatctg gatgggctgg 3540
tgcctattgt gacgtgccca atgtctcttg tgacatagca gcctccagga gaggtgtgct 3600
tgttgaacac ttgtgccagc actcaggtgt ctgcatcaat gctggcaaca cgcattactg 3660
tcagtgcccc ctgggctata ctgggagcta ctgtgaggag caactcgatg agtgtgcgtc 3720
caacccctgc cagcacgggg caacatgcag tgacttcatt ggtggataca gatgcgagtg 3780
tgtcccaggc tatcagggtg tcaactgtga gtatgaagtg gatgagtgcc agaatcagcc 3840
ctgccagaat ggaggcacct gtattgacct tgtgaaccat ttcaagtgct cttgcccacc 3900
aggcactcgg ggcctactct gtgaagagaa cattgatgac tgtgcccggg gtccccattg 3960
ccttaatggt ggtcagtgca tggataggat tggaggctac agttgtcgct gcttgcctgg 4020
ctttgctggg gagcgttgtg agggagacat caacgagtgc ctctccaacc cctgcagctc 4080
tgagggcagc ctggactgta tacagctcac caatgactac ctgtgtgttt gccgtagtgc 4140
ctttactggc cggcactgtg aaaccttcgt cgatgtgtgt ccccagatgc cctgcctgaa 4200
tggagggact tgtgctgtgg ccagtaacat gcctgatggt ttcatttgcc gttgtccccc 4260
gggattttcc ggggcaaggt gccagagcag ctgtggacaa gtgaaatgta ggaaggggga 4320
gcagtgtgtg cacaccgcct ctggaccccg ctgcttctgc cccagtcccc gggactgcga 4380
gtcaggctgt gccagtagcc cctgccagca cgggggcagc tgccaccctc agcgccagcc 4440
tccttattac tcctgccagt gtgccccacc attctcgggt agccgctgtg aactctacac 4500
ggcacccccc agcacccctc ctgccacctg tctgagccag tattgtgccg acaaagctcg 4560
ggatggcgtc tgtgatgagg cctgcaacag ccatgcctgc cagtgggatg ggggtgactg 4620
ttctctcacc atggagaacc cctgggccaa ctgctcctcc ccacttccct gctgggatta 4680
tatcaacaac cagtgtgatg agctgtgcaa cacggtcgag tgcctgtttg acaactttga 4740
atgccagggg aacagcaaga catgcaagta tgacaaatac tgtgcagacc acttcaaaga 4800
caaccactgt gaccaggggt gcaacagtga ggagtgtggt tgggatgggc tggactgtgc 4860
tgctgaccaa cctgagaacc tggcagaagg taccctggtt attgtggtat tgatgccacc 4920
tgaacaactg ctccaggatg ctcgcagctt cttgcgggca ctgggtaccc tgctccacac 4980
caacctgcgc attaagcggg actcccaggg ggaactcatg gtgtacccct attatggtga 5040
gaagtcagct gctatgaaga aacagaggat gacacgcaga tcccttcctg gtgaacaaga 5100
acaggaggtg gctggctcta aagtctttct ggaaattgac aaccgccagt gtgttcaaga 5160
ctcagaccac tgcttcaaga acacggatgc agcagcagct ctcctggcct ctcacgccat 5220
acaggggacc ctgtcatacc ctcttgtgtc tgtcgtcagt gaatccctga ctccagaacg 5280
cactcagctc ctctatctcc ttgctgttgc tgttgtcatc attctgttta ttattctgct 5340
gggggtaatc atggcaaaac gaaagcgtaa gcatggctct ctctggctgc ctgaaggttt 5400
cactcttcgc cgagatgcaa gcaatcacaa gcgtcgtgag ccagtgggac aggatgctgt 5460
ggggctgaaa aatctctcag tgcaagtctc agaagctaac ctaattggta ctggaacaag 5520
tgaacactgg gtcgatgatg aagggcccca gccaaagaaa gtaaaggctg aagatgaggc 5580
cttactctca gaagaagatg accccattga tcgacggcca tggacacagc agcaccttga 5640
agctgcagac atccgtagga caccatcgct ggctctcacc cctcctcagg cagagcagga 5700
ggtggatgtg ttagatgtga atgtccgtgg cccagatggc tgcaccccat tgatgttggc 5760
ttctctccga ggaggcagct cagatttgag tgatgaagat gaagatgcag aggactcttc 5820
tgctaacatc atcacagact tggtctacca gggtgccagc ctccaggccc agacagaccg 5880
gactggtgag atggccctgc accttgcagc ccgctactca cgggctgatg ctgccaagcg 5940
tctcctggat gcaggtgcag atgccaatgc ccaggacaac atgggccgct gtccactcca 6000
tgctgcagtg gcagctgatg cccaaggtgt cttccagatt ctgattcgca accgagtaac 6060
tgatctagat gccaggatga atgatggtac tacacccctg atcctggctg cccgcctggc 6120
tgtggaggga atggtggcag aactgatcaa ctgccaagcg gatgtgaatg cagtggatga 6180
ccatggaaaa tctgctcttc actgggcagc tgctgtcaat aatgtggagg caactctttt 6240
gttgttgaaa aatggggcca accgagacat gcaggacaac aaggaagaga cacctctgtt 6300
tcttgctgcc cgggagggga gctatgaagc agccaagatc ctgttagacc attttgccaa 6360
tcgagacatc acagaccata tggatcgtct tccccgggat gtggctcggg atcgcatgca 6420
ccatgacatt gtgcgccttc tggatgaata caatgtgacc ccaagccctc caggcaccgt 6480
gttgacttct gctctctcac ctgtcatctg tgggcccaac agatctttcc tcagcctgaa 6540
gcacacccca atgggcaaga agtctagacg gcccagtgcc aagagtacca tgcctactag 6600
cctccctaac cttgccaagg aggcaaagga tgccaagggt agtaggagga agaagtctct 6660
gagtgagaag gtccaactgt ctgagagttc agtaacttta tcccctgttg attccctaga 6720
atctcctcac acgtatgttt ccgacaccac atcctctcca atgattacat cccctgggat 6780
cttacaggcc tcacccaacc ctatgttggc cactgccgcc cctcctgccc cagtccatgc 6840
ccagcatgca ctatcttttt ctaaccttca tgaaatgcag cctttggcac atggggccag 6900
cactgtgctt ccctcagtga gccagttgct atcccaccac cacattgtgt ctccaggcag 6960
tggcagtgct ggaagcttga gtaggctcca tccagtccca gtcccagcag attggatgaa 7020
ccgcatggag gtgaatgaga cccagtacaa tgagatgttt ggtatggtcc tggctccagc 7080
tgagggcacc catcctggca tagctcccca gagcaggcca cctgaaggga agcacataac 7140
cacccctcgg gagcccttgc cccccattgt gactttccag ctcatcccta aaggcagtat 7200
tgcccaacca gcgggggctc cccagcctca gtccacctgc cctccagctg ttgcgggccc 7260
cctgcccacc atgtaccaga ttccagaaat ggcccgtttg cccagtgtgg ctttccccac 7320
tgccatgatg ccccagcagg acgggcaggt agctcagacc attctcccag cctatcatcc 7380
tttcccagcc tctgtgggca agtaccccac acccccttca cagcacagtt atgcttcctc 7440
aaatgctgct gagcgaacac ccagtcacag tggtcacctc cagggtgagc atccctacct 7500
gacaccatcc ccagagtctc ctgaccagtg gtcaagttca tcaccccact ctgcttctga 7560
ctggtcagat gtgaccacca gccctacccc tgggggtgct ggaggaggtc agcggggacc 7620
tgggacacac atgtctgagc caccacacaa caacatgcag gtttatgcgt gagagagtcc 7680
acctccagtg tagagacata actgactttt gtaaatgctg ctgaggaaca aatgaaggtc 7740
atccgggaga gaaatgaaga aatctctgga gccagcttct agaggtagga aagagaagat 7800
gttcttattc agataatgca agagaagcaa ttcgtcagtt tcactgggta tctgcaaggc 7860
ttattgatta ttctaatcta ataagacaag tttgtggaaa tgcaagatga atacaagcct 7920
tgggtccatg tttactctct tctatttgga gaataagatg gatgcttatt gaagcccaga 7980
cattcttgca gcttggactg cattttaagc cctgcaggct tctgccatat ccatgagaag 8040
attctacact agcgtcctgt tgggaattat gccctggaat tctgcctgaa ttgacctacg 8100
catctcctcc tccttggaca ttcttttgtc ttcatttggt gcttttggtt ttgcacctct 8160
ccgtgattgt agccctacca gcatgttata gggcaagacc tttgtgcttt tgatcattct 8220
ggcccatgaa agcaactttg gtctcctttc ccctcctgtc ttcccggtat cccttggagt 8280
ctcacaaggt ttactttggt atggttctca gcacaaacct ttcaagtatg ttgtttcttt 8340
ggaaaatgga catactgtat tgtgttctcc tgcatatatc attcctggag agagaagggg 8400
agaagaatac ttttcttcaa caaattttgg gggcaggaga tcccttcaag aggctgcacc 8460
ttaatttttc ttgtctgtgt gcaggtcttc atataaactt taccaggaag aagggtgtga 8520
gtttgttgtt tttctgtgta tgggcctggt cagtgtaaag ttttatcctt gatagtctag 8580
ttactatgac cctccccact tttttaaaac cagaaaaagg tttggaatgt tggaatgacc 8640
aagagacaag ttaactcgtg caagagccag ttacccaccc acaggtcccc ctacttcctg 8700
ccaagcattc cattgactgc ctgtatggaa cacatttgtc ccagatctga gcattctagg 8760
cctgtttcac tcactcaccc agcatatgaa actagtctta actgttgagc ctttcctttc 8820
atatccacag aagacactgt ctcaaatgtt gtacccttgc catttaggac tgaactttcc 8880
ttagcccaag ggacccagtg acagttgtct tccgtttgtc agatgatcag tctctactga 8940
ttatcttgct gcttaaaggc ctgctcacca atctttcttt cacaccgtgt ggtccgtgtt 9000
actggtatac ccagtatgtt ctcactgaag acatggactt tatatgttca agtgcaggaa 9060
ttggaaagtt ggacttgttt tctatgatcc aaaacagccc tataagaagg ttggaaaagg 9120
aggaactata tagcagcctt tgctattttc tgctaccatt tcttttcctc tgaagcggcc 9180
atgacattcc ctttggcaac taacgtagaa actcaacaga acattttcct ttcctagagt 9240
caccttttag atgataatgg acaactatag acttgctcat tgttcagact gattgcccct 9300
cacctgaatc cactctctgt attcatgctc ttggcaattt ctttgacttt cttttaaggg 9360
cagaagcatt ttagttaatt gtagataaag aatagttttc ttcctcttct ccttgggcca 9420
gttaataatt ggtccatggc tacactgcaa cttccgtcca gtgctgtgat gcccatgaca 9480
cctgcaaaat aagttctgcc tgggcatttt gtagatatta acaggtgaat tcccgactct 9540
tttggtttga atgacagttc tcattccttc tatggctgca agtatgcatc agtgcttccc 9600
acttacctga tttgtctgtc ggtggcccca tatggaaacc ctgcgtgtct gttggcataa 9660
tagtttacaa atggtttttt cagtcctatc caaatttatt gaaccaacaa aaataattac 9720
ttctgccctg agataagcag attaagtttg ttcattctct gctttattct ctccatgtgg 9780
caacattctg tcagcctctt tcatagtgtg caaacatttt atcattctaa atggtgactc 9840
tctgcccttg gacccattta ttattcacag atggggagaa cctatctgca tggacctctg 9900
tggaccacag cgtacctgcc cctttctgcc ctcctgctcc agccccactt ctgaaagtat 9960
cagctactga tccagccact ggatatttta tatcctccct tttccttaag cacaatgtca 10020
gaccaaattg cttgtttctt tttcttggac tactttaatt tggatccttt gggtttggag 10080
aaagggaatg tgaaagctgt cattacagac aacaggtttc agtgatgagg aggacaacac 10140
tgcctttcaa actttttact gatctcttag attttaagaa ctcttgaatt gtgtggtatc 10200
taataaaagg gaaggtaaga tggataatca ctttctcatt tgggttctga attggagact 10260
cagtttttat gagacacatc ttttatgcca tgtatagatc ctcccctgct atttttggtt 10320
tatttttatt gttataaatg ctttctttct ttgactcctc ttctgcctgc ctttggggat 10380
aggttttttt gtttgtttat ttgcttcctc tgttttgttt taagcatcat tttcttatgt 10440
gaggtgggga agggaaaggt atgagggaaa gagagtctga gaattaaaat attttagtat 10500
aagcaattgg ctgtgatgct caaatccatt gcatcctctt attgaatttg ccaatttgta 10560
atttttgcat aataaagaac caaaggtgta atgttttgtt gagaggtggt ttagggattt 10620
tggccctaac caatacattg aatgtatgat gactatttgg gaggacacat ttatgtaccc 10680
agaggccccc actaataagt ggtactatgg ttacttcctt gtgtacattt ctcttaaaag 10740
tgatattata tctgtttgta tgagaaaccc agtaaccaat aaaatgaccg catattcctg 10800
actaaacgta gtaaggaaaa tgcacacttt gtttttactt ttccgtttca ttctaaaggt 10860
agttaagatg aaatttatat gaaagcattt ttatcacaaa ataaaaaagg tttgccaagc 10920
tcagtggtgt tgtatttttt attttccaat actgcatcca tggcctggca gtgttacctc 10980
atgatgtcat aatttgctga gagagcaaat tttcttttct ttctgaatcc cacaaagcct 11040
agcaccaaac ttcttttttt cttcctttaa ttagatcata aataaatgat cctggggaaa 11100
aagcatctgt caaataggaa acatcacaaa actgagcact cttctgtgca ctagccatag 11160
ctggtgacaa acagatggtt gctcagggac aaggtgcctt ccaatggaaa tgcgaagtag 11220
ttgctatagc aagaattggg aactgggata taagtcataa tattaattat gctgttatgt 11280
aaatgattgg tttgtaacat tccttaagtg aaatttgtgt agaacttaat atacaggatt 11340
ataaaataat attttgtgta taaatttgtt ataagttcac attcatacat ttatttataa 11400
agtcagtgag atatttgaac atgaa 11425
<![CDATA[<210> 704]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 704]]>
Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp
1 5 10 15
Leu Cys Cys Ala Ala Pro Ala His
20
<![CDATA[<210> 705]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 705]]>
Leu Gln Cys Arg Asp Gly Tyr Glu Pro Cys Val Asn Glu Gly Met Cys
1 5 10 15
Val Thr Tyr His Asn Gly Thr Gly Tyr Cys
20 25
<![CDATA[<210> 706]]>
<![CDATA[<211> 86]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 706]]>
Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His Arg Asp Pro Cys
1 5 10 15
Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val Ala Gln Ala Met
20 25 30
Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe Thr Gly Glu Asp
35 40 45
Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser Arg Pro Cys Leu
50 55 60
Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr Tyr Glu Cys Thr
65 70 75 80
Cys Gln Val Gly Phe Thr
85
<![CDATA[<210> 707]]>
<![CDATA[<211> 111]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 707]]>
Lys Glu Cys Gln Trp Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn
1 5 10 15
Gly Ser Thr Cys Thr Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu
20 25 30
Thr Gly Phe Thr Gly Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp
35 40 45
Ile Pro Gly His Cys Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly
50 55 60
Ser Tyr Gln Cys Gln Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp
65 70 75 80
Ser Leu Tyr Val Pro Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr
85 90 95
Cys Arg Gln Thr Gly Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro
100 105 110
<![CDATA[<210> 708]]>
<![CDATA[<211> 40]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 708]]>
Phe Glu Gly Ser Thr Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His
1 5 10 15
Arg Cys Gln Asn Gly Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn
20 25 30
Cys Arg Cys Pro Pro Gln Trp Thr
35 40
<![CDATA[<210> 709]]>
<![CDATA[<211> 77]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 709]]>
Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu Gln Pro Asn Ala
1 5 10 15
Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly Gly Tyr Gly Cys
20 25 30
Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser Glu Asn Ile Asp
35 40 45
Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr Cys Ile Asp Arg
50 55 60
Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys Ala
65 70 75
<![CDATA[<210> 710]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 710]]>
Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys His Lys
1 5 10 15
Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile Cys Thr
20 25 30
Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val Asp Glu
35 40 45
Cys Ala Met
50
<![CDATA[<210> 711]]>
<![CDATA[<211> 2049]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 711]]>
Asn Ser Asn Pro Cys Glu His Ala Gly Lys Cys Val Asn Thr Asp Gly
1 5 10 15
Ala Phe His Cys Glu Cys Leu Lys Gly Tyr Ala Gly Pro Arg Cys Glu
20 25 30
Met Asp Ile Asn Glu Cys His Ser Asp Pro Cys Gln Asn Asp Ala Thr
35 40 45
Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys Leu Cys Met Pro Gly Phe
50 55 60
Lys Gly Val His Cys Glu Leu Glu Ile Asn Glu Cys Gln Ser Asn Pro
65 70 75 80
Cys Val Asn Asn Gly Gln Cys Val Asp Lys Val Asn Arg Phe Gln Cys
85 90 95
Leu Cys Pro Pro Gly Phe Thr Gly Pro Val Cys Gln Ile Asp Ile Asp
100 105 110
Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly Ala Lys Cys Ile Asp His
115 120 125
Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr Gly Phe Thr Gly Val Leu
130 135 140
Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro Asp Pro Cys His His Gly
145 150 155 160
Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr Cys Ile Cys Asn Pro Gly
165 170 175
Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile Asp Glu Cys Tyr Ser Ser
180 185 190
Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp Leu Val Asn Gly Tyr Gln
195 200 205
Cys Asn Cys Gln Pro Gly Thr Ser Gly Val Asn Cys Glu Ile Asn Phe
210 215 220
Asp Asp Cys Ala Ser Asn Pro Cys Ile His Gly Ile Cys Met Asp Gly
225 230 235 240
Ile Asn Arg Tyr Ser Cys Val Cys Ser Pro Gly Phe Thr Gly Gln Arg
245 250 255
Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys Arg Lys Gly
260 265 270
Ala Thr Cys Ile Asn Gly Val Asn Gly Phe Arg Cys Ile Cys Pro Glu
275 280 285
Gly Pro His His Pro Ser Cys Tyr Ser Gln Val Asn Glu Cys Leu Ser
290 295 300
Asn Pro Cys Ile His Gly Asn Cys Thr Gly Gly Leu Ser Gly Tyr Lys
305 310 315 320
Cys Leu Cys Asp Ala Gly Trp Val Gly Ile Asn Cys Glu Val Asp Lys
325 330 335
Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn Gly Gly Thr Cys Asp Asn
340 345 350
Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys Lys Gly Phe Lys Gly Tyr
355 360 365
Asn Cys Gln Val Asn Ile Asp Glu Cys Ala Ser Asn Pro Cys Leu Asn
370 375 380
Gln Gly Thr Cys Phe Asp Asp Ile Ser Gly Tyr Thr Cys His Cys Val
385 390 395 400
Leu Pro Tyr Thr Gly Lys Asn Cys Gln Thr Val Leu Ala Pro Cys Ser
405 410 415
Pro Asn Pro Cys Glu Asn Ala Ala Val Cys Lys Glu Ser Pro Asn Phe
420 425 430
Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly Trp Gln Gly Gln Arg Cys
435 440 445
Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys Pro Cys Met Asn His Gly
450 455 460
Leu Cys His Asn Thr Gln Gly Ser Tyr Met Cys Glu Cys Pro Pro Gly
465 470 475 480
Phe Ser Gly Met Asp Cys Glu Glu Asp Ile Asp Asp Cys Leu Ala Asn
485 490 495
Pro Cys Gln Asn Gly Gly Ser Cys Met Asp Gly Val Asn Thr Phe Ser
500 505 510
Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp Lys Cys Gln Thr Asp Met
515 520 525
Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn Gly Gly Thr Cys Ser Asp
530 535 540
Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln Ala Gly Phe Asp Gly Val
545 550 555 560
His Cys Glu Asn Asn Ile Asn Glu Cys Thr Glu Ser Ser Cys Phe Asn
565 570 575
Gly Gly Thr Cys Val Asp Gly Ile Asn Ser Phe Ser Cys Leu Cys Pro
580 585 590
Val Gly Phe Thr Gly Ser Phe Cys Leu His Glu Ile Asn Glu Cys Ser
595 600 605
Ser His Pro Cys Leu Asn Glu Gly Thr Cys Val Asp Gly Leu Gly Thr
610 615 620
Tyr Arg Cys Ser Cys Pro Leu Gly Tyr Thr Gly Lys Asn Cys Gln Thr
625 630 635 640
Leu Val Asn Leu Cys Ser Arg Ser Pro Cys Lys Asn Lys Gly Thr Cys
645 650 655
Val Gln Lys Lys Ala Glu Ser Gln Cys Leu Cys Pro Ser Gly Trp Ala
660 665 670
Gly Ala Tyr Cys Asp Val Pro Asn Val Ser Cys Asp Ile Ala Ala Ser
675 680 685
Arg Arg Gly Val Leu Val Glu His Leu Cys Gln His Ser Gly Val Cys
690 695 700
Ile Asn Ala Gly Asn Thr His Tyr Cys Gln Cys Pro Leu Gly Tyr Thr
705 710 715 720
Gly Ser Tyr Cys Glu Glu Gln Leu Asp Glu Cys Ala Ser Asn Pro Cys
725 730 735
Gln His Gly Ala Thr Cys Ser Asp Phe Ile Gly Gly Tyr Arg Cys Glu
740 745 750
Cys Val Pro Gly Tyr Gln Gly Val Asn Cys Glu Tyr Glu Val Asp Glu
755 760 765
Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly Thr Cys Ile Asp Leu Val
770 775 780
Asn His Phe Lys Cys Ser Cys Pro Pro Gly Thr Arg Gly Leu Leu Cys
785 790 795 800
Glu Glu Asn Ile Asp Asp Cys Ala Arg Gly Pro His Cys Leu Asn Gly
805 810 815
Gly Gln Cys Met Asp Arg Ile Gly Gly Tyr Ser Cys Arg Cys Leu Pro
820 825 830
Gly Phe Ala Gly Glu Arg Cys Glu Gly Asp Ile Asn Glu Cys Leu Ser
835 840 845
Asn Pro Cys Ser Ser Glu Gly Ser Leu Asp Cys Ile Gln Leu Thr Asn
850 855 860
Asp Tyr Leu Cys Val Cys Arg Ser Ala Phe Thr Gly Arg His Cys Glu
865 870 875 880
Thr Phe Val Asp Val Cys Pro Gln Met Pro Cys Leu Asn Gly Gly Thr
885 890 895
Cys Ala Val Ala Ser Asn Met Pro Asp Gly Phe Ile Cys Arg Cys Pro
900 905 910
Pro Gly Phe Ser Gly Ala Arg Cys Gln Ser Ser Cys Gly Gln Val Lys
915 920 925
Cys Arg Lys Gly Glu Gln Cys Val His Thr Ala Ser Gly Pro Arg Cys
930 935 940
Phe Cys Pro Ser Pro Arg Asp Cys Glu Ser Gly Cys Ala Ser Ser Pro
945 950 955 960
Cys Gln His Gly Gly Ser Cys His Pro Gln Arg Gln Pro Pro Tyr Tyr
965 970 975
Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly Ser Arg Cys Glu Leu Tyr
980 985 990
Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr Cys Leu Ser Gln Tyr Cys
995 1000 1005
Ala Asp Lys Ala Arg Asp Gly Val Cys Asp Glu Ala Cys Asn Ser
1010 1015 1020
His Ala Cys Gln Trp Asp Gly Gly Asp Cys Ser Leu Thr Met Glu
1025 1030 1035
Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys Trp Asp Tyr
1040 1045 1050
Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu Cys Leu
1055 1060 1065
Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys Lys Tyr
1070 1075 1080
Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys Asp Gln
1085 1090 1095
Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala
1100 1105 1110
Ala Asp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val Ile Val
1115 1120 1125
Val Leu Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg Ser Phe
1130 1135 1140
Leu Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg Ile Lys
1145 1150 1155
Arg Asp Ser Gln Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu
1160 1165 1170
Lys Ser Ala Ala Met Lys Lys Gln Arg Met Thr Arg Arg Ser Leu
1175 1180 1185
Pro Gly Glu Gln Glu Gln Glu Val Ala Gly Ser Lys Val Phe Leu
1190 1195 1200
Glu Ile Asp Asn Arg Gln Cys Val Gln Asp Ser Asp His Cys Phe
1205 1210 1215
Lys Asn Thr Asp Ala Ala Ala Ala Leu Leu Ala Ser His Ala Ile
1220 1225 1230
Gln Gly Thr Leu Ser Tyr Pro Leu Val Ser Val Val Ser Glu Ser
1235 1240 1245
Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr Leu Leu Ala Val Ala
1250 1255 1260
Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly Val Ile Met Ala
1265 1270 1275
Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro Glu Gly Phe
1280 1285 1290
Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu Pro Val
1295 1300 1305
Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln Val Ser
1310 1315 1320
Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp Val Asp
1325 1330 1335
Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp Glu Ala
1340 1345 1350
Leu Leu Ser Glu Glu Asp Asp Pro Ile Asp Arg Arg Pro Trp Thr
1355 1360 1365
Gln Gln His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro Ser Leu
1370 1375 1380
Ala Leu Thr Pro Pro Gln Ala Glu Gln Glu Val Asp Val Leu Asp
1385 1390 1395
Val Asn Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala
1400 1405 1410
Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp
1415 1420 1425
Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu Val Tyr Gln
1430 1435 1440
Gly Ala Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly Glu Met Ala
1445 1450 1455
Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys Arg
1460 1465 1470
Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn Met Gly
1475 1480 1485
Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln Gly Val
1490 1495 1500
Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp Ala Arg
1505 1510 1515
Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg Leu Ala
1520 1525 1530
Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala Asp Val
1535 1540 1545
Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala
1550 1555 1560
Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly
1565 1570 1575
Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro Leu Phe
1580 1585 1590
Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile Leu Leu
1595 1600 1605
Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met Asp Arg Leu
1610 1615 1620
Pro Arg Asp Val Ala Arg Asp Arg Met His His Asp Ile Val Arg
1625 1630 1635
Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val
1640 1645 1650
Leu Thr Ser Ala Leu Ser Pro Val Ile Cys Gly Pro Asn Arg Ser
1655 1660 1665
Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys Ser Arg Arg
1670 1675 1680
Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn Leu Ala
1685 1690 1695
Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys Ser Leu
1700 1705 1710
Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu Ser Pro
1715 1720 1725
Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr
1730 1735 1740
Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala Ser Pro
1745 1750 1755
Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala
1760 1765 1770
Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln Pro Leu
1775 1780 1785
Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln Leu Leu
1790 1795 1800
Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala Gly Ser
1805 1810 1815
Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp Met Asn
1820 1825 1830
Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe Gly Met
1835 1840 1845
Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala Pro Gln
1850 1855 1860
Ser Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg Glu Pro
1865 1870 1875
Leu Pro Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly Ser Ile
1880 1885 1890
Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys Pro Pro
1895 1900 1905
Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile Pro Glu Met
1910 1915 1920
Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met Pro Gln
1925 1930 1935
Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr His Pro
1940 1945 1950
Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gln His
1955 1960 1965
Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser
1970 1975 1980
Gly His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu
1985 1990 1995
Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser Ala Ser Asp
2000 2005 2010
Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly
2015 2020 2025
Gly Gln Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro His Asn
2030 2035 2040
Asn Met Gln Val Tyr Ala
2045
<![CDATA[<210> 712]]>
<![CDATA[<211> 2471]]>
<![CDATA[<212>]]> PRT
<![CDATA[<213> 人類]]>
<![CDATA[<400> 712]]>
Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp
1 5 10 15
Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr
20 25 30
Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr
35 40 45
Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His
50 55 60
Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val
65 70 75 80
Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe
85 90 95
Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser
100 105 110
Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr
115 120 125
Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp
130 135 140
Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr
145 150 155 160
Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly
165 170 175
Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys
180 185 190
Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln
195 200 205
Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro
210 215 220
Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly
225 230 235 240
Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr
245 250 255
Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly
260 265 270
Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro
275 280 285
Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu
290 295 300
Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly
305 310 315 320
Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser
325 330 335
Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr
340 345 350
Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys
355 360 365
Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys
370 375 380
His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile
385 390 395 400
Cys Thr Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val
405 410 415
Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys
420 425 430
Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr
435 440 445
Ala Gly Pro Arg Cys Glu Met Asp Ile Asn Glu Cys His Ser Asp Pro
450 455 460
Cys Gln Asn Asp Ala Thr Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys
465 470 475 480
Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu Ile Asn
485 490 495
Glu Cys Gln Ser Asn Pro Cys Val Asn Asn Gly Gln Cys Val Asp Lys
500 505 510
Val Asn Arg Phe Gln Cys Leu Cys Pro Pro Gly Phe Thr Gly Pro Val
515 520 525
Cys Gln Ile Asp Ile Asp Asp Cys Ser Ser Thr Pro Cys Leu Asn Gly
530 535 540
Ala Lys Cys Ile Asp His Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr
545 550 555 560
Gly Phe Thr Gly Val Leu Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro
565 570 575
Asp Pro Cys His His Gly Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr
580 585 590
Cys Ile Cys Asn Pro Gly Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile
595 600 605
Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp
610 615 620
Leu Val Asn Gly Tyr Gln Cys Asn Cys Gln Pro Gly Thr Ser Gly Val
625 630 635 640
Asn Cys Glu Ile Asn Phe Asp Asp Cys Ala Ser Asn Pro Cys Ile His
645 650 655
Gly Ile Cys Met Asp Gly Ile Asn Arg Tyr Ser Cys Val Cys Ser Pro
660 665 670
Gly Phe Thr Gly Gln Arg Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser
675 680 685
Asn Pro Cys Arg Lys Gly Ala Thr Cys Ile Asn Gly Val Asn Gly Phe
690 695 700
Arg Cys Ile Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gln
705 710 715 720
Val Asn Glu Cys Leu Ser Asn Pro Cys Ile His Gly Asn Cys Thr Gly
725 730 735
Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly Ile
740 745 750
Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn
755 760 765
Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys
770 775 780
Lys Gly Phe Lys Gly Tyr Asn Cys Gln Val Asn Ile Asp Glu Cys Ala
785 790 795 800
Ser Asn Pro Cys Leu Asn Gln Gly Thr Cys Phe Asp Asp Ile Ser Gly
805 810 815
Tyr Thr Cys His Cys Val Leu Pro Tyr Thr Gly Lys Asn Cys Gln Thr
820 825 830
Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys
835 840 845
Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly
850 855 860
Trp Gln Gly Gln Arg Cys Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys
865 870 875 880
Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gln Gly Ser Tyr Met
885 890 895
Cys Glu Cys Pro Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp Ile
900 905 910
Asp Asp Cys Leu Ala Asn Pro Cys Gln Asn Gly Gly Ser Cys Met Asp
915 920 925
Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp
930 935 940
Lys Cys Gln Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn
945 950 955 960
Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln
965 970 975
Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn Ile Asn Glu Cys Thr
980 985 990
Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser
995 1000 1005
Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu
1010 1015 1020
His Glu Ile Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly
1025 1030 1035
Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu
1040 1045 1050
Gly Tyr Thr Gly Lys Asn Cys Gln Thr Leu Val Asn Leu Cys Ser
1055 1060 1065
Arg Ser Pro Cys Lys Asn Lys Gly Thr Cys Val Gln Lys Lys Ala
1070 1075 1080
Glu Ser Gln Cys Leu Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys
1085 1090 1095
Asp Val Pro Asn Val Ser Cys Asp Ile Ala Ala Ser Arg Arg Gly
1100 1105 1110
Val Leu Val Glu His Leu Cys Gln His Ser Gly Val Cys Ile Asn
1115 1120 1125
Ala Gly Asn Thr His Tyr Cys Gln Cys Pro Leu Gly Tyr Thr Gly
1130 1135 1140
Ser Tyr Cys Glu Glu Gln Leu Asp Glu Cys Ala Ser Asn Pro Cys
1145 1150 1155
Gln His Gly Ala Thr Cys Ser Asp Phe Ile Gly Gly Tyr Arg Cys
1160 1165 1170
Glu Cys Val Pro Gly Tyr Gln Gly Val Asn Cys Glu Tyr Glu Val
1175 1180 1185
Asp Glu Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly Thr Cys Ile
1190 1195 1200
Asp Leu Val Asn His Phe Lys Cys Ser Cys Pro Pro Gly Thr Arg
1205 1210 1215
Gly Leu Leu Cys Glu Glu Asn Ile Asp Asp Cys Ala Arg Gly Pro
1220 1225 1230
His Cys Leu Asn Gly Gly Gln Cys Met Asp Arg Ile Gly Gly Tyr
1235 1240 1245
Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly
1250 1255 1260
Asp Ile Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Glu Gly Ser
1265 1270 1275
Leu Asp Cys Ile Gln Leu Thr Asn Asp Tyr Leu Cys Val Cys Arg
1280 1285 1290
Ser Ala Phe Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys
1295 1300 1305
Pro Gln Met Pro Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser
1310 1315 1320
Asn Met Pro Asp Gly Phe Ile Cys Arg Cys Pro Pro Gly Phe Ser
1325 1330 1335
Gly Ala Arg Cys Gln Ser Ser Cys Gly Gln Val Lys Cys Arg Lys
1340 1345 1350
Gly Glu Gln Cys Val His Thr Ala Ser Gly Pro Arg Cys Phe Cys
1355 1360 1365
Pro Ser Pro Arg Asp Cys Glu Ser Gly Cys Ala Ser Ser Pro Cys
1370 1375 1380
Gln His Gly Gly Ser Cys His Pro Gln Arg Gln Pro Pro Tyr Tyr
1385 1390 1395
Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly Ser Arg Cys Glu Leu
1400 1405 1410
Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr Cys Leu Ser Gln
1415 1420 1425
Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp Glu Ala Cys
1430 1435 1440
Asn Ser His Ala Cys Gln Trp Asp Gly Gly Asp Cys Ser Leu Thr
1445 1450 1455
Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys Trp
1460 1465 1470
Asp Tyr Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu
1475 1480 1485
Cys Leu Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys
1490 1495 1500
Lys Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys
1505 1510 1515
Asp Gln Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp
1520 1525 1530
Cys Ala Ala Asp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val
1535 1540 1545
Ile Val Val Leu Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg
1550 1555 1560
Ser Phe Leu Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg
1565 1570 1575
Ile Lys Arg Asp Ser Gln Gly Glu Leu Met Val Tyr Pro Tyr Tyr
1580 1585 1590
Gly Glu Lys Ser Ala Ala Met Lys Lys Gln Arg Met Thr Arg Arg
1595 1600 1605
Ser Leu Pro Gly Glu Gln Glu Gln Glu Val Ala Gly Ser Lys Val
1610 1615 1620
Phe Leu Glu Ile Asp Asn Arg Gln Cys Val Gln Asp Ser Asp His
1625 1630 1635
Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala Leu Leu Ala Ser His
1640 1645 1650
Ala Ile Gln Gly Thr Leu Ser Tyr Pro Leu Val Ser Val Val Ser
1655 1660 1665
Glu Ser Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr Leu Leu Ala
1670 1675 1680
Val Ala Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly Val Ile
1685 1690 1695
Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro Glu
1700 1705 1710
Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu
1715 1720 1725
Pro Val Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln
1730 1735 1740
Val Ser Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp
1745 1750 1755
Val Asp Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp
1760 1765 1770
Glu Ala Leu Leu Ser Glu Glu Asp Asp Pro Ile Asp Arg Arg Pro
1775 1780 1785
Trp Thr Gln Gln His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro
1790 1795 1800
Ser Leu Ala Leu Thr Pro Pro Gln Ala Glu Gln Glu Val Asp Val
1805 1810 1815
Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met
1820 1825 1830
Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp
1835 1840 1845
Glu Asp Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu Val
1850 1855 1860
Tyr Gln Gly Ala Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly Glu
1865 1870 1875
Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala
1880 1885 1890
Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn
1895 1900 1905
Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln
1910 1915 1920
Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp
1925 1930 1935
Ala Arg Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg
1940 1945 1950
Leu Ala Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala
1955 1960 1965
Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp
1970 1975 1980
Ala Ala Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys
1985 1990 1995
Asn Gly Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro
2000 2005 2010
Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile
2015 2020 2025
Leu Leu Asp His Phe Ala Asn Arg Asp Ile Thr Asp His Met Asp
2030 2035 2040
Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met His His Asp Ile
2045 2050 2055
Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly
2060 2065 2070
Thr Val Leu Thr Ser Ala Leu Ser Pro Val Ile Cys Gly Pro Asn
2075 2080 2085
Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys Ser
2090 2095 2100
Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn
2105 2110 2115
Leu Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys
2120 2125 2130
Ser Leu Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu
2135 2140 2145
Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp
2150 2155 2160
Thr Thr Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala
2165 2170 2175
Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val
2180 2185 2190
His Ala Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln
2195 2200 2205
Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln
2210 2215 2220
Leu Leu Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala
2225 2230 2235
Gly Ser Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp
2240 2245 2250
Met Asn Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe
2255 2260 2265
Gly Met Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala
2270 2275 2280
Pro Gln Ser Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg
2285 2290 2295
Glu Pro Leu Pro Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly
2300 2305 2310
Ser Ile Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys
2315 2320 2325
Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile Pro
2330 2335 2340
Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met
2345 2350 2355
Pro Gln Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr
2360 2365 2370
His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser
2375 2380 2385
Gln His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser
2390 2395 2400
His Ser Gly His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser
2405 2410 2415
Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Pro His Ser Ala
2420 2425 2430
Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala
2435 2440 2445
Gly Gly Gly Gln Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro
2450 2455 2460
His Asn Asn Met Gln Val Tyr Ala
2465 2470
<![CDATA[<210> 713]]>
<![CDATA[<211> 1364]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人]]>類
<![CDATA[<400> 713]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatgc attgcagtgt cgagatggct atgaaccctg 360
tgtaaatgaa ggaatgtgtg ttacctacca caatggcaca ggatactgca aatgtccaga 420
aggcttcttg ggggaatatt gtcaacatcg agacccctgt gagaagaacc gctgccagaa 480
tggtgggact tgtgtggccc aggccatgct ggggaaagcc acgtgccgat gtgcctcagg 540
gtttacagga gaggactgcc agtactcaac atctcatcca tgctttgtgt ctcgaccctg 600
cctgaatggc ggcacatgcc atatgctcag ccgggatacc tatgagtgca cctgtcaagt 660
cgggtttaca ggtaaggagt gccaatggac ggatgcctgc ctgtctcatc cctgtgcaaa 720
tggaagtacc tgtaccactg tggccaacca gttctcctgc aaatgcctca caggcttcac 780
agggcagaaa tgtgagactg atgtcaatga gtgtgacatt ccaggacact gccagcatgg 840
tggcacctgc ctcaacctgc ctggttccta ccagtgccag tgccctcagg gcttcacagg 900
ccagtactgt gacagcctgt atgtgccctg tgcaccctca ccttgtgtca atggaggcac 960
ctgtcggcag actggtgact tcacttttga gtgcaactgc cttccaggtt ttgaagggag 1020
cacctgtgag aggaatattg atgactgccc taaccacagg tgtcagaatg gaggggtttg 1080
tgtggatggg gtcaacactt acaactgccg ctgtccccca caatggacag gacagttctg 1140
cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg ggggcacctg 1200
tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg gagatgactg 1260
cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca cctgcatcga 1320
ccgtgtggcc tccttctctt gcatgtgccc agaggggaag gcag 1364
<![CDATA[<210> 714]]>
<![CDATA[<211> 369]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 714]]>
Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp
1 5 10 15
Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr
20 25 30
Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr
35 40 45
Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His
50 55 60
Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val
65 70 75 80
Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe
85 90 95
Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser
100 105 110
Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr
115 120 125
Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp
130 135 140
Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr
145 150 155 160
Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly
165 170 175
Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys
180 185 190
Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln
195 200 205
Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro
210 215 220
Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly
225 230 235 240
Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr
245 250 255
Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly
260 265 270
Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro
275 280 285
Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu
290 295 300
Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly
305 310 315 320
Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser
325 330 335
Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr
340 345 350
Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys
355 360 365
Ala
<![CDATA[<210> 715]]>
<![CDATA[<211> 7]]>5
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 715]]>
agggccggct ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca 60
tgaagcttcc caagc 75
<![CDATA[<210> 716]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 716]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[<210> 717]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 717]]>
agggccggct ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca 60
tgaagcttcc caagcccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[<210> 718]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 718]]>
Gly Arg Leu Asp Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu
1 5 10 15
Asn Leu Arg Met Lys Leu Pro Lys Pro Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[<210> 719]]>
<![CDATA[<211> 286]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 719]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagag 286
<![CDATA[<210> 720]]>
<![CDATA[<211> 173]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 720]]>
caagtgcagc cgcggagccc tgtacacagg cttttccatc ctggtgactc tgctcctcgc 60
tggccaggcc accaccgcct acttcctgta ccagcagcag ggccggctgg acaaactgac 120
agtcacctcc cagaacctgc agctggagaa cctgcgcatg aagcttccca agc 173
<![CDATA[<210> 721]]>
<![CDATA[<211> 80]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 721]]>
ctcccaagcc tgtgagcaag atgcgcatgg ccaccccgct gctgatgcag gcgctgccca 60
tgggagccct gccccagggg 80
<![CDATA[<210> 722]]>
<![CDATA[<211> 63]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 722]]>
cccatgcaga atgccaccaa gtatggcaac atgacagagg accatgtgat gcacctgctc 60
cag 63
<![CDATA[<210> 723]]>
<![CDATA[<211> 96]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 723]]>
aatgctgacc ccctgaaggt gtacccgcca ctgaagggga gcttcccgga gaacctgaga 60
caccttaaga acaccatgga gaccatagac tggaag 96
<![CDATA[<210> 724]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 724]]>
gtctttgaga gctggatgca ccattggctc ctgtttgaaa tgagcaggca ctccttggag 60
caaaagccca ctgacgctcc accgaaag 88
<![CDATA[<210> 725]]>
<![CDATA[<211> 192]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 725]]>
tactgaccaa gtgccaggaa gaggtcagcc acatccctgc tgtccacccg ggttcattca 60
ggcccaagtg cgacgagaac ggcaactatc tgccactcca gtgctatggg agcatcggct 120
actgctggtg tgtcttcccc aacggcacgg aggtccccaa caccagaagc cgcgggcacc 180
ataactgcag tg 192
<![CDATA[<210> 726]]>
<![CDATA[<211> 63]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 726]]>
agtcactgga actggaggac ccgtcttctg ggctgggtgt gaccaagcag gatctgggcc 60
cag 63
<![CDATA[<210> 727]]>
<![CDATA[<211> 612]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 727]]>
tccccatgtg agagcagcag aggcggtctt caacatcctg ccagccccac acagctacag 60
ctttcttgct cccttcagcc cccagcccct cccccatctc ccaccctgta cctcatccca 120
tgagaccctg gtgcctggct ctttcgtcac ccttggacaa gacaaaccaa gtcggaacag 180
cagataacaa tgcagcaagg ccctgctgcc caatctccat ctgtcaacag gggcgtgagg 240
tcccaggaag tggccaaaag ctagacagat ccccgttcct gacatcacag cagcctccaa 300
cacaaggctc caagacctag gctcatggac gagatgggaa ggcacaggga gaagggataa 360
ccctacaccc agaccccagg ctggacatgc tgactgtcct ctcccctcca gcctttggcc 420
ttggcttttc tagcctattt acctgcaggc tgagccactc tcttcccttt ccccagcatc 480
actccccaag gaagagccaa tgttttccac ccataatcct ttctgccgac ccctagttcc 540
ctctgctcag ccaagcttgt tatcagcttt cagggccatg gttcacatta gaataaaagg 600
tagtaattag aa 612
<![CDATA[<210> 728]]>
<![CDATA[<211> 1653]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> CD74之全mRNA多核苷酸序列]]>
<![CDATA[<400> 728]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagagcaag tgcagccgcg 300
gagccctgta cacaggcttt tccatcctgg tgactctgct cctcgctggc caggccacca 360
ccgcctactt cctgtaccag cagcagggcc ggctggacaa actgacagtc acctcccaga 420
acctgcagct ggagaacctg cgcatgaagc ttcccaagcc tcccaagcct gtgagcaaga 480
tgcgcatggc caccccgctg ctgatgcagg cgctgcccat gggagccctg ccccaggggc 540
ccatgcagaa tgccaccaag tatggcaaca tgacagagga ccatgtgatg cacctgctcc 600
agaatgctga ccccctgaag gtgtacccgc cactgaaggg gagcttcccg gagaacctga 660
gacaccttaa gaacaccatg gagaccatag actggaaggt ctttgagagc tggatgcacc 720
attggctcct gtttgaaatg agcaggcact ccttggagca aaagcccact gacgctccac 780
cgaaagtact gaccaagtgc caggaagagg tcagccacat ccctgctgtc cacccgggtt 840
cattcaggcc caagtgcgac gagaacggca actatctgcc actccagtgc tatgggagca 900
tcggctactg ctggtgtgtc ttccccaacg gcacggaggt ccccaacacc agaagccgcg 960
ggcaccataa ctgcagtgag tcactggaac tggaggaccc gtcttctggg ctgggtgtga 1020
ccaagcagga tctgggccca gtccccatgt gagagcagca gaggcggtct tcaacatcct 1080
gccagcccca cacagctaca gctttcttgc tcccttcagc ccccagcccc tcccccatct 1140
cccaccctgt acctcatccc atgagaccct ggtgcctggc tctttcgtca cccttggaca 1200
agacaaacca agtcggaaca gcagataaca atgcagcaag gccctgctgc ccaatctcca 1260
tctgtcaaca ggggcgtgag gtcccaggaa gtggccaaaa gctagacaga tccccgttcc 1320
tgacatcaca gcagcctcca acacaaggct ccaagaccta ggctcatgga cgagatggga 1380
aggcacaggg agaagggata accctacacc cagaccccag gctggacatg ctgactgtcc 1440
tctcccctcc agcctttggc cttggctttt ctagcctatt tacctgcagg ctgagccact 1500
ctcttccctt tccccagcat cactccccaa ggaagagcca atgttttcca cccataatcc 1560
tttctgccga cccctagttc cctctgctca gccaagcttg ttatcagctt tcagggccat 1620
ggttcacatt agaataaaag gtagtaatta gaa 1653
<![CDATA[<210> 729]]>
<![CDATA[<211> 41]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 729]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu
35 40
<![CDATA[<210> 730]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 730]]>
Lys Cys Ser Arg Gly Ala Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr
1 5 10 15
Leu Leu Leu Ala Gly Gln Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln
20 25 30
Gln Gly Arg Leu Asp Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu
35 40 45
Glu Asn Leu Arg Met Lys Leu Pro Lys
50 55
<![CDATA[<210> 731]]>
<![CDATA[<211> 26]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 731]]>
Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro Leu Leu Met Gln
1 5 10 15
Ala Leu Pro Met Gly Ala Leu Pro Gln Gly
20 25
<![CDATA[<210> 732]]>
<![CDATA[<211> 21]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 732]]>
Pro Met Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val
1 5 10 15
Met His Leu Leu Gln
20
<![CDATA[<210> 733]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 733]]>
Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly Ser Phe Pro
1 5 10 15
Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile Asp Trp Lys
20 25 30
<![CDATA[<210> 734]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 734]]>
Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu Met Ser Arg
1 5 10 15
His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys
20 25
<![CDATA[<210> 735]]>
<![CDATA[<211> 63]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 735]]>
Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala Val His Pro
1 5 10 15
Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr Leu Pro Leu
20 25 30
Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe Pro Asn Gly
35 40 45
Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn Cys Ser
50 55 60
<![CDATA[<210> 736]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 736]]>
Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val Thr Lys Gln
1 5 10 15
Asp Leu Gly Pro
20
<![CDATA[<210> 737]]>
<![CDATA[<211> 2]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 737]]>
Pro Met
1
<![CDATA[<210> 738]]>
<![CDATA[<211> 296]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 738]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala
35 40 45
Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly Gln
50 55 60
Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp Lys
65 70 75 80
Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met Lys
85 90 95
Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro
100 105 110
Leu Leu Met Gln Ala Leu Pro Met Gly Ala Leu Pro Gln Gly Pro Met
115 120 125
Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val Met His
130 135 140
Leu Leu Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly
145 150 155 160
Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile
165 170 175
Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu
180 185 190
Met Ser Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys
195 200 205
Val Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala Val His
210 215 220
Pro Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr Leu Pro
225 230 235 240
Leu Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe Pro Asn
245 250 255
Gly Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn Cys Ser
260 265 270
Glu Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val Thr Lys
275 280 285
Gln Asp Leu Gly Pro Val Pro Met
290 295
<![CDATA[<210> 739]]>
<![CDATA[<211> 459]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 739]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagagcaag tgcagccgcg 300
gagccctgta cacaggcttt tccatcctgg tgactctgct cctcgctggc caggccacca 360
ccgcctactt cctgtaccag cagcagggcc ggctggacaa actgacagtc acctcccaga 420
acctgcagct ggagaacctg cgcatgaagc ttcccaagc 459
<![CDATA[<210> 740]]>
<![CDATA[<211> 99]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 740]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala
35 40 45
Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly Gln
50 55 60
Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp Lys
65 70 75 80
Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met Lys
85 90 95
Leu Pro Lys
<![CDATA[<210> 741]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 741]]>
taccagacga tgaggatgta gtggggcccg ggcaggaatc tgatgacttt gagctgtctg 60
gctctggaga tctgg 75
<![CDATA[<210> 742]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 742]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[<210> 743]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 743]]>
taccagacga tgaggatgta gtggggcccg ggcaggaatc tgatgacttt gagctgtctg 60
gctctggaga tctggccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[<210> 744]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 744]]>
Pro Asp Asp Glu Asp Val Val Gly Pro Gly Gln Glu Ser Asp Asp Phe
1 5 10 15
Glu Leu Ser Gly Ser Gly Asp Leu Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[<210> 745]]>
<![CDATA[<211> 100]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 745]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg 100
<![CDATA[<210> 746]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 746]]>
atccgagaga ctgaggtcat cgacccccag gacctcctag aaggccgata cttctccgga 60
gccctaccag acgatgagga tgtagtgggg cccgggcagg aatctgatga ctttgagctg 120
tctggctctg gagatctgg 139
<![CDATA[<210> 747]]>
<![CDATA[<211> 47]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 747]]>
atgacttgga agactccatg atcggccctg aagttgtcca tcccttg 47
<![CDATA[<210> 748]]>
<![CDATA[<211> 199]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 748]]>
gtgcctctag ataaccatat ccctgagagg gcagggtctg ggagccaagt ccccaccgaa 60
cccaagaaac tagaggagaa tgaggttatc cccaagagaa tctcacccgt tgaagagagt 120
gaggatgtgt ccaacaaggt gtcaatgtcc agcactgtgc agggcagcaa catctttgag 180
agaacggagg tcctggcag 199
<![CDATA[<210> 749]]>
<![CDATA[<211> 2128]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 749]]>
ctctgattgt gggtggcatc gtgggcatcc tctttgccgt cttcctgatc ctactgctca 60
tgtaccgtat gaagaagaag gatgaaggca gctatgacct gggcaagaaa cccatctaca 120
agaaagcccc caccaatgag ttctacgcgt gaagcttgct tgtgggcact ggcttggact 180
ttagcgggga gggaagccag gggattttga agggtggaca ttagggtagg gtgaggtcaa 240
cctaatactg acttgtcagt atctccagct ctgattacct ttgaagtgtt cagaagagac 300
attgtcttct actgttctgc caggttcttc ttgagctttg ggcctcagtt gccctggcag 360
aaaaatggat tcaacttggc ctttctgaag gcaagactgg gattggatca cttcttaaac 420
ttccagttaa gaatctaggt ccgccctcaa gcccatactg accatgcctc atccagagct 480
cctctgaagc cagggggcta acggatgttg tgtggagtcc tggctggagg tcctccccca 540
gtggccttcc tcccttcctt tcacagccgg tctctctgcc aggaaatggg ggaaggaact 600
agaaccacct gcaccttgag atgtttctgt aaatgggtac ttgtgatcac actacgggaa 660
tctctgtggt atatacctgg ggccattcta ggctctttca agtgactttt ggaaatcaac 720
cttttttatt tgggggggag gatggggaaa agagctgaga gtttatgctg aaatggattt 780
atagaatatt tgtaaatcta tttttagtgt ttgttcgttt ttttaactgt tcattccttt 840
gtgcagagtg tatatctctg cctgggcaag agtgtggagg tgccgaggtg tcttcattct 900
ctcgcacatt tccacagcac ctgctaagtt tgtatttaat ggtttttgtt tttgtttttg 960
tttgtttctt gaaaatgaga gaagagccgg agagatgatt tttattaatt tttttttttt 1020
tttttttttt ttactattta tagctttaga tagggcctcc cttcccctct tctttctttg 1080
ttctctttca ttaaacccct tccccagttt tttttttata ctttaaaccc cgctcctcat 1140
ggccttggcc ctttctgaag ctgcttcctc ttataaaata gcttttgccg aaacatagtt 1200
tttttttagc agatcccaaa atataatgaa ggggatggtg ggatatttgt gtctgtgttc 1260
ttataatata ttattattct tccttggttc tagaaaaata gataaatata tttttttcag 1320
gaaatagtgt ggtgtttcca gtttgatgtt gctgggtggt tgagtgagtg aattttcatg 1380
tggctgggtg ggtttttgcc tttttctctt gccctgttcc tggtgccttc tgatggggct 1440
ggaatagttg aggtggatgg ttctaccctt tctgccttct gtttgggacc cagctggtgt 1500
tctttggttt gctttcttca ggctctaggg ctgtgctatc caatacagta accacatgcg 1560
gctgtttaaa gttaagccaa ttaaaatcac ataagattaa aaattccttc ctcagttgca 1620
ctaaccacgt ttctagaggc gtcactgtat gtagttcatg gctactgtac tgacagcgag 1680
agcatgtcca tctgttggac agcactattc tagagaacta aactggctta acgagtcaca 1740
gcctcagctg tgctgggacg acccttgtct ccctgggtag ggggggggga atgggggagg 1800
gctgatgagg ccccagctgg ggcctgttgt ctgggaccct ccctctcctg agaggggagg 1860
cctggtggct tagcctgggc aggtcgtgtc tcctcctgac cccagtggct gcggtgaggg 1920
gaaccaccct cccttgctgc accagtggcc attagctccc gtcaccactg caacccaggg 1980
tcccagctgg ctgggtcctc ttctgccccc agtgcccttc cccttgggct gtgttggagt 2040
gagcacctcc tctgtaggca cctctcacac tgttgtctgt tactgatttt ttttgataaa 2100
aagataataa aacctggtac tttctaaa 2128
<![CDATA[<210> 750]]>
<![CDATA[<211> 2613]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> SDC4之全mRNA多核苷酸序列]]>
<![CDATA[<400> 750]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgga 240
tgacttggaa gactccatga tcggccctga agttgtccat cccttggtgc ctctagataa 300
ccatatccct gagagggcag ggtctgggag ccaagtcccc accgaaccca agaaactaga 360
ggagaatgag gttatcccca agagaatctc acccgttgaa gagagtgagg atgtgtccaa 420
caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa cggaggtcct 480
ggcagctctg attgtgggtg gcatcgtggg catcctcttt gccgtcttcc tgatcctact 540
gctcatgtac cgtatgaaga agaaggatga aggcagctat gacctgggca agaaacccat 600
ctacaagaaa gcccccacca atgagttcta cgcgtgaagc ttgcttgtgg gcactggctt 660
ggactttagc ggggagggaa gccaggggat tttgaagggt ggacattagg gtagggtgag 720
gtcaacctaa tactgacttg tcagtatctc cagctctgat tacctttgaa gtgttcagaa 780
gagacattgt cttctactgt tctgccaggt tcttcttgag ctttgggcct cagttgccct 840
ggcagaaaaa tggattcaac ttggcctttc tgaaggcaag actgggattg gatcacttct 900
taaacttcca gttaagaatc taggtccgcc ctcaagccca tactgaccat gcctcatcca 960
gagctcctct gaagccaggg ggctaacgga tgttgtgtgg agtcctggct ggaggtcctc 1020
ccccagtggc cttcctccct tcctttcaca gccggtctct ctgccaggaa atgggggaag 1080
gaactagaac cacctgcacc ttgagatgtt tctgtaaatg ggtacttgtg atcacactac 1140
gggaatctct gtggtatata cctggggcca ttctaggctc tttcaagtga cttttggaaa 1200
tcaacctttt ttatttgggg gggaggatgg ggaaaagagc tgagagttta tgctgaaatg 1260
gatttataga atatttgtaa atctattttt agtgtttgtt cgttttttta actgttcatt 1320
cctttgtgca gagtgtatat ctctgcctgg gcaagagtgt ggaggtgccg aggtgtcttc 1380
attctctcgc acatttccac agcacctgct aagtttgtat ttaatggttt ttgtttttgt 1440
ttttgtttgt ttcttgaaaa tgagagaaga gccggagaga tgatttttat taattttttt 1500
tttttttttt tttttttact atttatagct ttagataggg cctcccttcc cctcttcttt 1560
ctttgttctc tttcattaaa ccccttcccc agtttttttt ttatacttta aaccccgctc 1620
ctcatggcct tggccctttc tgaagctgct tcctcttata aaatagcttt tgccgaaaca 1680
tagttttttt ttagcagatc ccaaaatata atgaagggga tggtgggata tttgtgtctg 1740
tgttcttata atatattatt attcttcctt ggttctagaa aaatagataa atatattttt 1800
ttcaggaaat agtgtggtgt ttccagtttg atgttgctgg gtggttgagt gagtgaattt 1860
tcatgtggct gggtgggttt ttgccttttt ctcttgccct gttcctggtg ccttctgatg 1920
gggctggaat agttgaggtg gatggttcta ccctttctgc cttctgtttg ggacccagct 1980
ggtgttcttt ggtttgcttt cttcaggctc tagggctgtg ctatccaata cagtaaccac 2040
atgcggctgt ttaaagttaa gccaattaaa atcacataag attaaaaatt ccttcctcag 2100
ttgcactaac cacgtttcta gaggcgtcac tgtatgtagt tcatggctac tgtactgaca 2160
gcgagagcat gtccatctgt tggacagcac tattctagag aactaaactg gcttaacgag 2220
tcacagcctc agctgtgctg ggacgaccct tgtctccctg ggtagggggg ggggaatggg 2280
ggagggctga tgaggcccca gctggggcct gttgtctggg accctccctc tcctgagagg 2340
ggaggcctgg tggcttagcc tgggcaggtc gtgtctcctc ctgaccccag tggctgcggt 2400
gaggggaacc accctccctt gctgcaccag tggccattag ctcccgtcac cactgcaacc 2460
cagggtccca gctggctggg tcctcttctg cccccagtgc ccttcccctt gggctgtgtt 2520
ggagtgagca cctcctctgt aggcacctct cacactgttg tctgttactg attttttttg 2580
ataaaaagat aataaaacct ggtactttct aaa 2613
<![CDATA[<210> 751]]>
<![CDATA[<211> 20]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 751]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser
20
<![CDATA[<210> 752]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 752]]>
Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu Leu Glu Gly Arg
1 5 10 15
Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val Val Gly Pro Gly
20 25 30
Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly Asp Leu
35 40 45
<![CDATA[<210> 753]]>
<![CDATA[<211> 15]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 753]]>
Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His Pro Leu
1 5 10 15
<![CDATA[<210> 754]]>
<![CDATA[<211> 66]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 754]]>
Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly Ser Gln
1 5 10 15
Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile Pro Lys
20 25 30
Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys Val Ser
35 40 45
Met Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr Glu Val
50 55 60
Leu Ala
65
<![CDATA[<210> 755]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 755]]>
Leu Ile Val Gly Gly Ile Val Gly Ile Leu Phe Ala Val Phe Leu Ile
1 5 10 15
Leu Leu Leu Met Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Asp
20 25 30
Leu Gly Lys Lys Pro Ile Tyr Lys Lys Ala Pro Thr Asn Glu Phe Tyr
35 40 45
Ala
<![CDATA[<210> 756]]>
<![CDATA[<211> 198]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 756]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu Asp Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His
65 70 75 80
Pro Leu Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly
85 90 95
Ser Gln Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile
100 105 110
Pro Lys Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys
115 120 125
Val Ser Met Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr
130 135 140
Glu Val Leu Ala Ala Leu Ile Val Gly Gly Ile Val Gly Ile Leu Phe
145 150 155 160
Ala Val Phe Leu Ile Leu Leu Leu Met Tyr Arg Met Lys Lys Lys Asp
165 170 175
Glu Gly Ser Tyr Asp Leu Gly Lys Lys Pro Ile Tyr Lys Lys Ala Pro
180 185 190
Thr Asn Glu Phe Tyr Ala
195
<![CDATA[<210> 757]]>
<![CDATA[<211> 239]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 757]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgg 239
<![CDATA[<210> 758]]>
<![CDATA[<211> 66]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 758]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu
65
<![CDATA[<210> 759]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 759]]>
tttctactgt acaccccatc ccagacgaag acagtccctg gatcaccgac agcacagaca 60
gaatccctgc tacca 75
<![CDATA[<210> 760]]>
<![CDATA[<21]]>1> 75]]>
<br/><![CDATA[<212> DNA]]>
<br/><![CDATA[<213> 人類]]>
<br/>
<br/><![CDATA[<400> 760]]>
<br/><![CDATA[ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatg 75
<![CDATA[<210> 761]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 761]]>
tttctactgt acaccccatc ccagacgaag acagtccctg gatcaccgac agcacagaca 60
gaatccctgc taccactaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[<210> 762]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 762]]>
Ser Thr Val His Pro Ile Pro Asp Glu Asp Ser Pro Trp Ile Thr Asp
1 5 10 15
Ser Thr Asp Arg Ile Pro Ala Thr Thr Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[<210> 763]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> ]]>DNA
<![CDATA[<213> 人類]]>
<![CDATA[<400> 763]]>
actaccccat caactccaac ttcaaagtgg gctacaacac tctcttttcc tgtacctgtg 60
tgccacctga cccag 75
<![CDATA[<210> 764]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 764]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatg 75
<![CDATA[<210> 765]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 765]]>
actaccccat caactccaac ttcaaagtgg gctacaacac tctcttttcc tgtacctgtg 60
tgccacctga cccagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[<210> 766]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 766]]>
Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu Phe Ser
1 5 10 15
Cys Thr Cys Val Pro Pro Asp Pro Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[<210> 767]]>
<![CDATA[<211> 143]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 767]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gag 143
<![CDATA[<210> 768]]>
<![CDATA[<211> 74]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 768]]>
gatggaggat gaagctgtcc tggacagagg ggcttccttc ctcaagcatg tgtgtgatga 60
agaagaagta gaag 74
<![CDATA[<210> 769]]>
<![CDATA[<211> 180]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 769]]>
gccaccatac catttacatc ggagtccatg tgccgaagag ttacaggaga aggagacgtc 60
acaagagaaa gacagggcac aaagaaaaga aggaaaagga gagaatctct gagaactact 120
ctgacaaatc agatattgaa aatgctgatg aatccagcag cagcatccta aaacctctca 180
<![CDATA[<210> 770]]>
<![CDATA[<211> 136]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 770]]>
tctctcctgc tgcagaacgc atccgattca tcttgggaga ggaggatgac agcccagctc 60
cccctcagct cttcacggaa ctggatgagc tgctggccgt ggatgggcag gagatggagt 120
ggaaggaaac agccag 136
<![CDATA[<210> 771]]>
<![CDATA[<211> 161]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 771]]>
gtggatcaag tttgaagaaa aagtggaaca gggtggggaa agatggagca agccccatgt 60
ggccacattg tcccttcata gtttatttga gctgaggaca tgtatggaga aaggatccat 120
catgcttgat cgggaggctt cttctctccc acagttggtg g 161
<![CDATA[<210> 772]]>
<![CDATA[<211> 180]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 772]]>
agatgattgt tgaccatcag attgagacag gcctattgaa acctgaactt aaggataagg 60
tgacctatac tttgctccgg aagcaccggc atcaaaccaa gaaatccaac cttcggtccc 120
tggctgacat tgggaagaca gtctccagtg caagtaggat gtttaccaac cctgataatg 180
<![CDATA[<210> 773]]>
<![CDATA[<211> 77]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 773]]>
gtagcccagc catgacccat aggaatctga cttcctccag tctgaatgac atttctgata 60
aaccggagaa ggaccag 77
<![CDATA[<210> 774]]>
<![CDATA[<211> 158]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 774]]>
ctgaagaata agttcatgaa aaaattgcca cgtgatgcag aagcttccaa cgtgcttgtt 60
ggggaggttg actttttgga tactcctttc attgcctttg ttaggctaca gcaggctgtc 120
atgctgggtg ccctgactga agttcctgtg cccacaag 158
<![CDATA[<210> 775]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 775]]>
gttcttgttc attctcttag gtcctaaggg gaaagccaag tcctaccacg agattggcag 60
agccattgcc accctgatgt ctgatgag 88
<![CDATA[<210> 776]]>
<![CDATA[<211> 155]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 776]]>
gtgttccatg acattgctta taaagcaaaa gacaggcacg acctgattgc tggtattgat 60
gagttcctag atgaagtcat cgtccttcca cctggggaat gggatccagc aattaggata 120
gagcctccta agagtcttcc atcctctgac aaaag 155
<![CDATA[<210> 777]]>
<![CDATA[<211> 114]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 777]]>
aaagaatatg tactcaggtg gagagaatgt tcagatgaat ggggatacgc cccatgatgg 60
aggtcacgga ggaggaggac atggggattg tgaagaattg cagcgaactg gacg 114
<![CDATA[<210> 778]]>
<![CDATA[<211> 175]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 778]]>
gttctgtggt ggactaatta aagacataaa gaggaaagcg ccattttttg ccagtgattt 60
ttatgatgct ttaaatattc aagctctttc ggcaattctc ttcatttatc tggcaactgt 120
aactaatgct atcacttttg gaggactgct tggggatgcc actgacaaca tgcag 175
<![CDATA[<210> 779]]>
<![CDATA[<211> 134]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 779]]>
ggcgtgttgg agagtttcct gggcactgct gtctctggag ccatcttttg cctttttgct 60
ggtcaaccac tcactattct gagcagcacc ggacctgtcc tagtttttga gaggcttcta 120
tttaatttca gcaa 134
<![CDATA[<210> 780]]>
<![CDATA[<211> 272]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 780]]>
ggacaataat tttgactatt tggagtttcg cctttggatt ggcctgtggt ccgccttcct 60
atgtctcatt ttggtagcca ctgatgccag cttcttggtt caatacttca cacgtttcac 120
ggaggagggc ttttcctctc tgattagctt catctttatc tatgatgctt tcaagaagat 180
gatcaagctt gcagattact accccatcaa ctccaacttc aaagtgggct acaacactct 240
cttttcctgt acctgtgtgc cacctgaccc ag 272
<![CDATA[<210> 781]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 781]]>
ctaatatctc aatatctaat gacaccacac tggccccaga gtatttgcca actatgtctt 60
ctactgacat g 71
<![CDATA[<210> 782]]>
<![CDATA[<211> 192]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 782]]>
taccataata ctacctttga ctgggcattt ttgtcgaaga aggagtgttc aaaatacgga 60
ggaaacctcg tcgggaacaa ctgtaatttt gttcctgata tcacactcat gtcttttatc 120
ctcttcttgg gaacctacac ctcttccatg gctctgaaaa aattcaaaac tagtccttat 180
tttccaacca ca 192
<![CDATA[<210> 783]]>
<![CDATA[<211> 114]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 783]]>
gcaagaaaac tgatcagtga ttttgccatt atcttgtcca ttctcatctt ttgtgtaata 60
gatgccctag taggcgtgga caccccaaaa ctaattgtgc caagtgagtt caag 114
<![CDATA[<210> 784]]>
<![CDATA[<211> 162]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 784]]>
ccaacaagtc caaaccgagg ttggttcgtt ccaccgtttg gagaaaaccc ctggtgggtg 60
tgccttgctg ctgctatccc ggctttgttg gtcactatac tgattttcat ggaccaacaa 120
attacagctg tgattgtaaa caggaaagaa cataaactca ag 162
<![CDATA[<210> 785]]>
<![CDATA[<211> 179]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 785]]>
aaaggagcag ggtatcactt ggatctcttt tgggtggcca tcctcatggt tatatgctcc 60
ctcatggctc ttccgtggta tgtagctgct acggtcatct ccattgctca catcgacagt 120
ttgaagatgg agacagagac ttctgcacct ggagaacaac caaagtttct aggagtgag 179
<![CDATA[<210> 786]]>
<![CDATA[<211]]>> 73]]>
<br/><![CDATA[<212> DNA]]>
<br/><![CDATA[<213> 人類]]>
<br/>
<br/><![CDATA[<400> 786]]>
<br/><![CDATA[ggaacaaaga gtcactggaa cccttgtgtt tattctgact ggtctgtcag tctttatggc 60
tcccatcttg aag 73
<![CDATA[<210> 787]]>
<![CDATA[<211> 70]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 787]]>
gtttataccc atgcctgtac tctatggtgt gttcctgtat atgggagtag catcccttaa 60
tggtgtgcag 70
<![CDATA[<210> 788]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 788]]>
ttcatggatc gtctgaagct gcttctgatg cctctgaagc atcagcctga cttcatctac 60
ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt tcctgcaggt gttgtgtctg 120
gccctgcttt ggatcctcaa gtcaacggtg gctgctatca tttttccagt aatg 174
<![CDATA[<210> 789]]>
<![CDATA[<211> 162]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 789]]>
atcttggcac ttgtagctgt cagaaaaggc atggactacc tcttctccca gcacgacctc 60
agcttcctgg atgatgtcat tccagaaaag gacaagaaaa agaaggagga tgagaagaaa 120
aagaaaaaga agaagggaag tctggacagt gacaatgatg at 162
<![CDATA[<210> 790]]>
<![CDATA[<211> 97]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 790]]>
tctgactgcc catactcaga aaaagttcca agtattaaaa ttccaatgga catcatggaa 60
cagcaacctt tcctaagcga tagcaaacct tctgaca 97
<![CDATA[<210> 791]]>
<![CDATA[<211> 80]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 791]]>
gagaaagatc accaacattc cttgaacgcc acacatcatg ctgataaaat tcctttcctt 60
cagtcactcg gtatgccaag 80
<![CDATA[<210> 792]]>
<![CDATA[<211> 4296]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 792]]>
tcctcctaga actccagtaa aagttgtgcc tcaaattaga atagaacttg aacctgaaga 60
caatgattat ttctggagga gcaagggaac agaaactaca ttgtaacctg tttgtctttc 120
ttaaaactga catttgttgt taatgtcatt tgtttttgtt tggctgtttg tttatttttt 180
aacttttatt tcgtctcagt ttttggtcac aggccaaata atacagcgct ctctctgctt 240
ctctcttgca tagacacaat caagacaata gtgcaccgtt ccttaaaaac agcatctgag 300
gaatccccct tttgttctta aactttcaga tgtgtccttt gataaccaaa ttctgtcact 360
caagacacag acacgcacag accctgtcct ttgcctctat taagcagagg atggaagtat 420
taaggatttt gtaacacctt ttatgaaaat gttgaaggaa cttaaaactt tagctttgga 480
gctgtgctta ctggcttgtc tttgtctggt agaacaaacc ttgacctcca gacagagtcc 540
cttctcactt atagagctct ccaggactgg aaaaagtgct gctattttaa cttgctcttg 600
cttgtaaatc ctaatcttag agttatcaaa agaagaaaaa actgaaggta ctttactccc 660
tatagagaaa ccattgccat cattgtagca agtgctggaa tgtccctttt ttcctatgca 720
acttttttta accctttaat gaacttatct gttgagtaca ttgaagaata tttttcttcc 780
tagattttgt tgtttaaatt atggggccta acctgccact tattttttgt caatttttaa 840
aacttttttt taattactgt aaagaaaatg aattttttcc tgcagcagga aacatagttt 900
tgagtagttc tacctcttat ttgtagctgc caggctttct gtaaaaattg tattgtatat 960
aatgtgattt ttacacatac atacacacac aaatacacaa tctctagggt aagccagaag 1020
gcaagatcag attaaaaaca ccatgtttct aagcatccat ttttcccttt ctttaaaaga 1080
aacttaactg ttctatgaag gagattgagg gagaagagac aaactcctat gtcatgagaa 1140
taaccgatgt tctgataata gtagcatcta ggtacagatg ctggttgtat taccacgtca 1200
atgtcctatg cagtattgtt agacattttc tcattttgaa atatttgtgt gtttgtgtat 1260
gtgctctgtg ccatggctgg tgtatatatg tgcaatgtta gaaggcaaaa gagtgatggt 1320
aggcagaggg caaagtcatt gaatctctta tgccagtttt cataaaaccc aaaccacata 1380
tgaaaaaatc cattaagggt ccaagaagtc tgtccatatg aaaatgaggg taaatatagt 1440
ttatttccca ggtatcagtc attataattg atataatagc tctaacatgc aatataaaat 1500
tcataggagt attaatagcc catttacaca tctataaaat gtaatgggat tgcagagctg 1560
cagagtacag tgtaacagta ctctcatgca atttttttca ggatgcaaag gcaattattc 1620
tttgtaagcg ggacatttag aatatatttg tgtacatatt atatgtatgt atatttcaaa 1680
gtaccacact gaaaattaga catttattaa ccaaatttaa cgtggtattt aaaggtaata 1740
tttttaatat gatacattac atattgtgaa tgtatactaa aaaaacattt taaatgttaa 1800
aattataatt tcagattcat ataaccacaa ctgtgatata tcctaactat aaccagttgt 1860
tgaggggtat actagaagca gaatgaaacc acattttttg gtttgataat atgcacttat 1920
tgactcccac tcattgttat gttaattaag ttattattct gtctccttgt aattttgatt 1980
acaaaaattt tattatcctg agttagctgt tacttttaca gtacctgata ctcctaaaac 2040
ttttaactta tacaaattag tcaataatga ccccaatttt ttcattaaaa taatagtggt 2100
gaattatatg ttattgtgtt aaaacctcac ttgccaaatt ctggcttcac atttgtattt 2160
agggctatcc ttaaaatgat gagtctatat tatctagctt tctattaccc taatataaac 2220
tggtataaga agactttcct tttttcttta tgcatggaag catcaataaa ttgtttaaaa 2280
accatgtata gtaaattcag cttaacccgt gatcttctta agttaaaggt acttttgttt 2340
tataaaagct ctagataaaa ctttcttttc tgatcatgaa tcaagtatct gtggtttcat 2400
gcccctctct atacctttca aagaactcct gaagcaactt aactcatcat ttcagcctct 2460
gagtagaggt aaaacctatg tgtacttctg tttatgatcc atattgatat ttatgacatg 2520
aacacagaat agtaccttac atttgctaaa cagacagtta atatcaaatc ctttcaatat 2580
tctgggaacc cagggaagtt tttaaaaatg tcattacttt caaaggaaca gaagtagtta 2640
accaaactaa caagcaaaac ctgaggttta cctagtgaca ccaaattatc ggtattttaa 2700
ctgaatttac ccattgacta agaatgaacc agatttggtg gtggttttgt ttctatgcaa 2760
actggacaca aattacaaca gtaaattttt ttataagtgc ttctcccttc tccatgatgt 2820
gacttccgga gataaaggat tcaaaagata aagacaaagt acgctcagag ttgttaacca 2880
gaaagtcctg gctgtggttg cagaaacact gttggaagaa aagagatgac taagtcaagt 2940
gtctgcctta tcaaaagagc aaaaatgcct ctggttttgt gtttgggaga aaagtatctt 3000
ggacgcactg ttttccttga taaaagtcat cttctctact gtgtgaaatg aatacttgga 3060
attctaattg ttttgtgtgc caggggcagt aatgtccctg cctcttctcc caatcaaggt 3120
tgaggagtgg ggctggggag aggacttaac tgacttaaga agtaggaaaa caaaaacctc 3180
tctcctcagc cttccacctc caagagagga ggaaaaacag ttgtctgctg tctgtaattc 3240
agtttgcgtg tattttatgc tcatgcacca acccatacag agtaaatctt ttatcaactg 3300
tatactggtg tttaatagag aatgattgtc ttccgagttt tttggttcct tttttaactg 3360
tgttaaagta cttgaaatgt attgactgct gactatattt taaaaacaaa atgaaataat 3420
ttgagttgta ttacagaggt tgacattgtt cagggatggg acaaagcctt cttcaatcct 3480
tttcatacta cttaatgatt ttggtgcagg aacctgagat tttctgattt atatttcatg 3540
atatttcaca tttgctcttc acagcatgag catgaagccc agtggcacca aatggctggg 3600
tacaatcaag tgatattttg tagcacctca ctatctgaaa ggccatgagt tttcagatga 3660
tttcattgag cttcattgca gcctgaaatt ttaaaaaagt tgtgtaatac gccaaccagt 3720
caagttgtgt tttggccaga gatttagata tgtccaattt cctggctcat ttcattgtgc 3780
tctatgggta cgtataaaaa gcaagaattc tgtttcctag gcaaacattg caactcaggg 3840
ctaaagtcat ccagtgaaac ttttagagcc agaagtaact ttgtcccagt cctacaatgt 3900
gaaaagagtg aatagttgcc tctttttagc cattttcatg gctggtacat attcgtacgc 3960
attacttttc agaatcaata cgcactttca gatattctta tttttattct cttaagtctt 4020
tattaacttt ggagagagaa atgatgcatc tttttatttt aaatgaagta gatcaacatg 4080
gtggaacaaa atgataaaga acagaaaaca tttcaatata ttactaataa ctttttccaa 4140
tataaatcct aaaattccta taacatagta ttttacagtt ttatgaagct ttctattgtg 4200
acttttatgg aattaagaga tgaagaagat gagatatttt agcatttata tttttcaaaa 4260
ttatatgtat acttaaaaat aaagtaactt tatgca 4296
<![CDATA[<210> 793]]>
<![CDATA[<211> 7716]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> SLC4A4之全mRNA多核苷酸序列]]>
<![CDATA[<400> 793]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gaggatggag gatgaagctg tcctggacag aggggcttcc 180
ttcctcaagc atgtgtgtga tgaagaagaa gtagaaggcc accataccat ttacatcgga 240
gtccatgtgc cgaagagtta caggagaagg agacgtcaca agagaaagac agggcacaaa 300
gaaaagaagg aaaaggagag aatctctgag aactactctg acaaatcaga tattgaaaat 360
gctgatgaat ccagcagcag catcctaaaa cctctcatct ctcctgctgc agaacgcatc 420
cgattcatct tgggagagga ggatgacagc ccagctcccc ctcagctctt cacggaactg 480
gatgagctgc tggccgtgga tgggcaggag atggagtgga aggaaacagc caggtggatc 540
aagtttgaag aaaaagtgga acagggtggg gaaagatgga gcaagcccca tgtggccaca 600
ttgtcccttc atagtttatt tgagctgagg acatgtatgg agaaaggatc catcatgctt 660
gatcgggagg cttcttctct cccacagttg gtggagatga ttgttgacca tcagattgag 720
acaggcctat tgaaacctga acttaaggat aaggtgacct atactttgct ccggaagcac 780
cggcatcaaa ccaagaaatc caaccttcgg tccctggctg acattgggaa gacagtctcc 840
agtgcaagta ggatgtttac caaccctgat aatggtagcc cagccatgac ccataggaat 900
ctgacttcct ccagtctgaa tgacatttct gataaaccgg agaaggacca gctgaagaat 960
aagttcatga aaaaattgcc acgtgatgca gaagcttcca acgtgcttgt tggggaggtt 1020
gactttttgg atactccttt cattgccttt gttaggctac agcaggctgt catgctgggt 1080
gccctgactg aagttcctgt gcccacaagg ttcttgttca ttctcttagg tcctaagggg 1140
aaagccaagt cctaccacga gattggcaga gccattgcca ccctgatgtc tgatgaggtg 1200
ttccatgaca ttgcttataa agcaaaagac aggcacgacc tgattgctgg tattgatgag 1260
ttcctagatg aagtcatcgt ccttccacct ggggaatggg atccagcaat taggatagag 1320
cctcctaaga gtcttccatc ctctgacaaa agaaagaata tgtactcagg tggagagaat 1380
gttcagatga atggggatac gccccatgat ggaggtcacg gaggaggagg acatggggat 1440
tgtgaagaat tgcagcgaac tggacggttc tgtggtggac taattaaaga cataaagagg 1500
aaagcgccat tttttgccag tgatttttat gatgctttaa atattcaagc tctttcggca 1560
attctcttca tttatctggc aactgtaact aatgctatca cttttggagg actgcttggg 1620
gatgccactg acaacatgca gggcgtgttg gagagtttcc tgggcactgc tgtctctgga 1680
gccatctttt gcctttttgc tggtcaacca ctcactattc tgagcagcac cggacctgtc 1740
ctagtttttg agaggcttct atttaatttc agcaaggaca ataattttga ctatttggag 1800
tttcgccttt ggattggcct gtggtccgcc ttcctatgtc tcattttggt agccactgat 1860
gccagcttct tggttcaata cttcacacgt ttcacggagg agggcttttc ctctctgatt 1920
agcttcatct ttatctatga tgctttcaag aagatgatca agcttgcaga ttactacccc 1980
atcaactcca acttcaaagt gggctacaac actctctttt cctgtacctg tgtgccacct 2040
gacccagcta atatctcaat atctaatgac accacactgg ccccagagta tttgccaact 2100
atgtcttcta ctgacatgta ccataatact acctttgact gggcattttt gtcgaagaag 2160
gagtgttcaa aatacggagg aaacctcgtc gggaacaact gtaattttgt tcctgatatc 2220
acactcatgt cttttatcct cttcttggga acctacacct cttccatggc tctgaaaaaa 2280
ttcaaaacta gtccttattt tccaaccaca gcaagaaaac tgatcagtga ttttgccatt 2340
atcttgtcca ttctcatctt ttgtgtaata gatgccctag taggcgtgga caccccaaaa 2400
ctaattgtgc caagtgagtt caagccaaca agtccaaacc gaggttggtt cgttccaccg 2460
tttggagaaa acccctggtg ggtgtgcctt gctgctgcta tcccggcttt gttggtcact 2520
atactgattt tcatggacca acaaattaca gctgtgattg taaacaggaa agaacataaa 2580
ctcaagaaag gagcagggta tcacttggat ctcttttggg tggccatcct catggttata 2640
tgctccctca tggctcttcc gtggtatgta gctgctacgg tcatctccat tgctcacatc 2700
gacagtttga agatggagac agagacttct gcacctggag aacaaccaaa gtttctagga 2760
gtgagggaac aaagagtcac tggaaccctt gtgtttattc tgactggtct gtcagtcttt 2820
atggctccca tcttgaagtt tatacccatg cctgtactct atggtgtgtt cctgtatatg 2880
ggagtagcat cccttaatgg tgtgcagttc atggatcgtc tgaagctgct tctgatgcct 2940
ctgaagcatc agcctgactt catctacctg cgtcatgttc ctctgcgcag agtccacctg 3000
ttcactttcc tgcaggtgtt gtgtctggcc ctgctttgga tcctcaagtc aacggtggct 3060
gctatcattt ttccagtaat gatcttggca cttgtagctg tcagaaaagg catggactac 3120
ctcttctccc agcacgacct cagcttcctg gatgatgtca ttccagaaaa ggacaagaaa 3180
aagaaggagg atgagaagaa aaagaaaaag aagaagggaa gtctggacag tgacaatgat 3240
gattctgact gcccatactc agaaaaagtt ccaagtatta aaattccaat ggacatcatg 3300
gaacagcaac ctttcctaag cgatagcaaa ccttctgaca gagaaagatc accaacattc 3360
cttgaacgcc acacatcatg ctgataaaat tcctttcctt cagtcactcg gtatgccaag 3420
tcctcctaga actccagtaa aagttgtgcc tcaaattaga atagaacttg aacctgaaga 3480
caatgattat ttctggagga gcaagggaac agaaactaca ttgtaacctg tttgtctttc 3540
ttaaaactga catttgttgt taatgtcatt tgtttttgtt tggctgtttg tttatttttt 3600
aacttttatt tcgtctcagt ttttggtcac aggccaaata atacagcgct ctctctgctt 3660
ctctcttgca tagacacaat caagacaata gtgcaccgtt ccttaaaaac agcatctgag 3720
gaatccccct tttgttctta aactttcaga tgtgtccttt gataaccaaa ttctgtcact 3780
caagacacag acacgcacag accctgtcct ttgcctctat taagcagagg atggaagtat 3840
taaggatttt gtaacacctt ttatgaaaat gttgaaggaa cttaaaactt tagctttgga 3900
gctgtgctta ctggcttgtc tttgtctggt agaacaaacc ttgacctcca gacagagtcc 3960
cttctcactt atagagctct ccaggactgg aaaaagtgct gctattttaa cttgctcttg 4020
cttgtaaatc ctaatcttag agttatcaaa agaagaaaaa actgaaggta ctttactccc 4080
tatagagaaa ccattgccat cattgtagca agtgctggaa tgtccctttt ttcctatgca 4140
acttttttta accctttaat gaacttatct gttgagtaca ttgaagaata tttttcttcc 4200
tagattttgt tgtttaaatt atggggccta acctgccact tattttttgt caatttttaa 4260
aacttttttt taattactgt aaagaaaatg aattttttcc tgcagcagga aacatagttt 4320
tgagtagttc tacctcttat ttgtagctgc caggctttct gtaaaaattg tattgtatat 4380
aatgtgattt ttacacatac atacacacac aaatacacaa tctctagggt aagccagaag 4440
gcaagatcag attaaaaaca ccatgtttct aagcatccat ttttcccttt ctttaaaaga 4500
aacttaactg ttctatgaag gagattgagg gagaagagac aaactcctat gtcatgagaa 4560
taaccgatgt tctgataata gtagcatcta ggtacagatg ctggttgtat taccacgtca 4620
atgtcctatg cagtattgtt agacattttc tcattttgaa atatttgtgt gtttgtgtat 4680
gtgctctgtg ccatggctgg tgtatatatg tgcaatgtta gaaggcaaaa gagtgatggt 4740
aggcagaggg caaagtcatt gaatctctta tgccagtttt cataaaaccc aaaccacata 4800
tgaaaaaatc cattaagggt ccaagaagtc tgtccatatg aaaatgaggg taaatatagt 4860
ttatttccca ggtatcagtc attataattg atataatagc tctaacatgc aatataaaat 4920
tcataggagt attaatagcc catttacaca tctataaaat gtaatgggat tgcagagctg 4980
cagagtacag tgtaacagta ctctcatgca atttttttca ggatgcaaag gcaattattc 5040
tttgtaagcg ggacatttag aatatatttg tgtacatatt atatgtatgt atatttcaaa 5100
gtaccacact gaaaattaga catttattaa ccaaatttaa cgtggtattt aaaggtaata 5160
tttttaatat gatacattac atattgtgaa tgtatactaa aaaaacattt taaatgttaa 5220
aattataatt tcagattcat ataaccacaa ctgtgatata tcctaactat aaccagttgt 5280
tgaggggtat actagaagca gaatgaaacc acattttttg gtttgataat atgcacttat 5340
tgactcccac tcattgttat gttaattaag ttattattct gtctccttgt aattttgatt 5400
acaaaaattt tattatcctg agttagctgt tacttttaca gtacctgata ctcctaaaac 5460
ttttaactta tacaaattag tcaataatga ccccaatttt ttcattaaaa taatagtggt 5520
gaattatatg ttattgtgtt aaaacctcac ttgccaaatt ctggcttcac atttgtattt 5580
agggctatcc ttaaaatgat gagtctatat tatctagctt tctattaccc taatataaac 5640
tggtataaga agactttcct tttttcttta tgcatggaag catcaataaa ttgtttaaaa 5700
accatgtata gtaaattcag cttaacccgt gatcttctta agttaaaggt acttttgttt 5760
tataaaagct ctagataaaa ctttcttttc tgatcatgaa tcaagtatct gtggtttcat 5820
gcccctctct atacctttca aagaactcct gaagcaactt aactcatcat ttcagcctct 5880
gagtagaggt aaaacctatg tgtacttctg tttatgatcc atattgatat ttatgacatg 5940
aacacagaat agtaccttac atttgctaaa cagacagtta atatcaaatc ctttcaatat 6000
tctgggaacc cagggaagtt tttaaaaatg tcattacttt caaaggaaca gaagtagtta 6060
accaaactaa caagcaaaac ctgaggttta cctagtgaca ccaaattatc ggtattttaa 6120
ctgaatttac ccattgacta agaatgaacc agatttggtg gtggttttgt ttctatgcaa 6180
actggacaca aattacaaca gtaaattttt ttataagtgc ttctcccttc tccatgatgt 6240
gacttccgga gataaaggat tcaaaagata aagacaaagt acgctcagag ttgttaacca 6300
gaaagtcctg gctgtggttg cagaaacact gttggaagaa aagagatgac taagtcaagt 6360
gtctgcctta tcaaaagagc aaaaatgcct ctggttttgt gtttgggaga aaagtatctt 6420
ggacgcactg ttttccttga taaaagtcat cttctctact gtgtgaaatg aatacttgga 6480
attctaattg ttttgtgtgc caggggcagt aatgtccctg cctcttctcc caatcaaggt 6540
tgaggagtgg ggctggggag aggacttaac tgacttaaga agtaggaaaa caaaaacctc 6600
tctcctcagc cttccacctc caagagagga ggaaaaacag ttgtctgctg tctgtaattc 6660
agtttgcgtg tattttatgc tcatgcacca acccatacag agtaaatctt ttatcaactg 6720
tatactggtg tttaatagag aatgattgtc ttccgagttt tttggttcct tttttaactg 6780
tgttaaagta cttgaaatgt attgactgct gactatattt taaaaacaaa atgaaataat 6840
ttgagttgta ttacagaggt tgacattgtt cagggatggg acaaagcctt cttcaatcct 6900
tttcatacta cttaatgatt ttggtgcagg aacctgagat tttctgattt atatttcatg 6960
atatttcaca tttgctcttc acagcatgag catgaagccc agtggcacca aatggctggg 7020
tacaatcaag tgatattttg tagcacctca ctatctgaaa ggccatgagt tttcagatga 7080
tttcattgag cttcattgca gcctgaaatt ttaaaaaagt tgtgtaatac gccaaccagt 7140
caagttgtgt tttggccaga gatttagata tgtccaattt cctggctcat ttcattgtgc 7200
tctatgggta cgtataaaaa gcaagaattc tgtttcctag gcaaacattg caactcaggg 7260
ctaaagtcat ccagtgaaac ttttagagcc agaagtaact ttgtcccagt cctacaatgt 7320
gaaaagagtg aatagttgcc tctttttagc cattttcatg gctggtacat attcgtacgc 7380
attacttttc agaatcaata cgcactttca gatattctta tttttattct cttaagtctt 7440
tattaacttt ggagagagaa atgatgcatc tttttatttt aaatgaagta gatcaacatg 7500
gtggaacaaa atgataaaga acagaaaaca tttcaatata ttactaataa ctttttccaa 7560
tataaatcct aaaattccta taacatagta ttttacagtt ttatgaagct ttctattgtg 7620
acttttatgg aattaagaga tgaagaagat gagatatttt agcatttata tttttcaaaa 7680
ttatatgtat acttaaaaat aaagtaactt tatgca 7716
<![CDATA[<210> 794]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 794]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu
20
<![CDATA[<210> 795]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 795]]>
His His Thr Ile Tyr Ile Gly Val His Val Pro Lys Ser Tyr Arg Arg
1 5 10 15
Arg Arg Arg His Lys Arg Lys Thr Gly His Lys Glu Lys Lys Glu Lys
20 25 30
Glu Arg Ile Ser Glu Asn Tyr Ser Asp Lys Ser Asp Ile Glu Asn Ala
35 40 45
Asp Glu Ser Ser Ser Ser Ile Leu Lys Pro Leu
50 55
<![CDATA[<210> 796]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 796]]>
Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu Gly Glu Glu Asp Asp
1 5 10 15
Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu Asp Glu Leu Leu Ala
20 25 30
Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr Ala
35 40
<![CDATA[<210> 797]]>
<![CDATA[<211> 53]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 797]]>
Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg Trp Ser
1 5 10 15
Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu Leu Arg
20 25 30
Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala Ser Ser
35 40 45
Leu Pro Gln Leu Val
50
<![CDATA[<210> 798]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 798]]>
Met Ile Val Asp His Gln Ile Glu Thr Gly Leu Leu Lys Pro Glu Leu
1 5 10 15
Lys Asp Lys Val Thr Tyr Thr Leu Leu Arg Lys His Arg His Gln Thr
20 25 30
Lys Lys Ser Asn Leu Arg Ser Leu Ala Asp Ile Gly Lys Thr Val Ser
35 40 45
Ser Ala Ser Arg Met Phe Thr Asn Pro Asp Asn
50 55
<![CDATA[<210> 799]]>
<![CDATA[<211> 25]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 799]]>
Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser Ser Leu Asn Asp
1 5 10 15
Ile Ser Asp Lys Pro Glu Lys Asp Gln
20 25
<![CDATA[<210> 800]]>
<![CDATA[<211> 52]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 800]]>
Leu Lys Asn Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser
1 5 10 15
Asn Val Leu Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala
20 25 30
Phe Val Arg Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val
35 40 45
Pro Val Pro Thr
50
<![CDATA[<210> 801]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 801]]>
Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser Tyr His
1 5 10 15
Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu
20 25
<![CDATA[<210> 802]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 802]]>
Val Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile
1 5 10 15
Ala Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly
20 25 30
Glu Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser
35 40 45
Ser Asp Lys
50
<![CDATA[<210> 803]]>
<![CDATA[<211> 37]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 803]]>
Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn Gly Asp Thr
1 5 10 15
Pro His Asp Gly Gly His Gly Gly Gly Gly His Gly Asp Cys Glu Glu
20 25 30
Leu Gln Arg Thr Gly
35
<![CDATA[<210> 804]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 804]]>
Phe Cys Gly Gly Leu Ile Lys Asp Ile Lys Arg Lys Ala Pro Phe Phe
1 5 10 15
Ala Ser Asp Phe Tyr Asp Ala Leu Asn Ile Gln Ala Leu Ser Ala Ile
20 25 30
Leu Phe Ile Tyr Leu Ala Thr Val Thr Asn Ala Ile Thr Phe Gly Gly
35 40 45
Leu Leu Gly Asp Ala Thr Asp Asn Met Gln
50 55
<![CDATA[<210> 805]]>
<![CDATA[<211> 44]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 805]]>
Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly Ala Ile Phe
1 5 10 15
Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser Thr Gly Pro
20 25 30
Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser
35 40
<![CDATA[<210> 806]]>
<![CDATA[<211> 90]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 806]]>
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
1 5 10 15
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
20 25 30
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
35 40 45
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
50 55 60
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
65 70 75 80
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro
85 90
<![CDATA[<210> 807]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 807]]>
Asn Ile Ser Ile Ser Asn Asp Thr Thr Leu Ala Pro Glu Tyr Leu Pro
1 5 10 15
Thr Met Ser Ser Thr Asp Met
20
<![CDATA[<210> 808]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 808]]>
Tyr His Asn Thr Thr Phe Asp Trp Ala Phe Leu Ser Lys Lys Glu Cys
1 5 10 15
Ser Lys Tyr Gly Gly Asn Leu Val Gly Asn Asn Cys Asn Phe Val Pro
20 25 30
Asp Ile Thr Leu Met Ser Phe Ile Leu Phe Leu Gly Thr Tyr Thr Ser
35 40 45
Ser Met Ala Leu Lys Lys Phe Lys Thr Ser Pro Tyr Phe Pro Thr Thr
50 55 60
<![CDATA[<210> 809]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 809]]>
Ala Arg Lys Leu Ile Ser Asp Phe Ala Ile Ile Leu Ser Ile Leu Ile
1 5 10 15
Phe Cys Val Ile Asp Ala Leu Val Gly Val Asp Thr Pro Lys Leu Ile
20 25 30
Val Pro Ser Glu Phe Lys
35
<![CDATA[<210> 810]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 810]]>
Pro Thr Ser Pro Asn Arg Gly Trp Phe Val Pro Pro Phe Gly Glu Asn
1 5 10 15
Pro Trp Trp Val Cys Leu Ala Ala Ala Ile Pro Ala Leu Leu Val Thr
20 25 30
Ile Leu Ile Phe Met Asp Gln Gln Ile Thr Ala Val Ile Val Asn Arg
35 40 45
Lys Glu His Lys Leu Lys
50
<![CDATA[<210> 811]]>
<![CDATA[<211> 59]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 811]]>
Lys Gly Ala Gly Tyr His Leu Asp Leu Phe Trp Val Ala Ile Leu Met
1 5 10 15
Val Ile Cys Ser Leu Met Ala Leu Pro Trp Tyr Val Ala Ala Thr Val
20 25 30
Ile Ser Ile Ala His Ile Asp Ser Leu Lys Met Glu Thr Glu Thr Ser
35 40 45
Ala Pro Gly Glu Gln Pro Lys Phe Leu Gly Val
50 55
<![CDATA[<210> 812]]>
<![CDATA[<211> 24]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 812]]>
Glu Gln Arg Val Thr Gly Thr Leu Val Phe Ile Leu Thr Gly Leu Ser
1 5 10 15
Val Phe Met Ala Pro Ile Leu Lys
20
<![CDATA[<210> 813]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 813]]>
Phe Ile Pro Met Pro Val Leu Tyr Gly Val Phe Leu Tyr Met Gly Val
1 5 10 15
Ala Ser Leu Asn Gly Val Gln
20
<![CDATA[<210> 814]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 814]]>
Phe Met Asp Arg Leu Lys Leu Leu Leu Met Pro Leu Lys His Gln Pro
1 5 10 15
Asp Phe Ile Tyr Leu Arg His Val Pro Leu Arg Arg Val His Leu Phe
20 25 30
Thr Phe Leu Gln Val Leu Cys Leu Ala Leu Leu Trp Ile Leu Lys Ser
35 40 45
Thr Val Ala Ala Ile Ile Phe Pro Val Met
50 55
<![CDATA[<210> 815]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 815]]>
Ile Leu Ala Leu Val Ala Val Arg Lys Gly Met Asp Tyr Leu Phe Ser
1 5 10 15
Gln His Asp Leu Ser Phe Leu Asp Asp Val Ile Pro Glu Lys Asp Lys
20 25 30
Lys Lys Lys Glu Asp Glu Lys Lys Lys Lys Lys Lys Lys Gly Ser Leu
35 40 45
Asp Ser Asp Asn Asp Asp
50
<![CDATA[<210> 816]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 816]]>
Ser Asp Cys Pro Tyr Ser Glu Lys Val Pro Ser Ile Lys Ile Pro Met
1 5 10 15
Asp Ile Met Glu Gln Gln Pro Phe Leu Ser Asp Ser Lys Pro Ser Asp
20 25 30
<![CDATA[<210> 817]]>
<![CDATA[<211> 13]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 817]]>
Glu Arg Ser Pro Thr Phe Leu Glu Arg His Thr Ser Cys
1 5 10
<![CDATA[<210> 818]]>
<![CDATA[<211> 0]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> 略過序列]]>
<![CDATA[<400> 818]]>
000
<![CDATA[<210> 819]]>
<![CDATA[<211> 1079]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 819]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu Gly His His Thr Ile Tyr Ile Gly
20 25 30
Val His Val Pro Lys Ser Tyr Arg Arg Arg Arg Arg His Lys Arg Lys
35 40 45
Thr Gly His Lys Glu Lys Lys Glu Lys Glu Arg Ile Ser Glu Asn Tyr
50 55 60
Ser Asp Lys Ser Asp Ile Glu Asn Ala Asp Glu Ser Ser Ser Ser Ile
65 70 75 80
Leu Lys Pro Leu Ile Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu
85 90 95
Gly Glu Glu Asp Asp Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu
100 105 110
Asp Glu Leu Leu Ala Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr
115 120 125
Ala Arg Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg
130 135 140
Trp Ser Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu
145 150 155 160
Leu Arg Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala
165 170 175
Ser Ser Leu Pro Gln Leu Val Glu Met Ile Val Asp His Gln Ile Glu
180 185 190
Thr Gly Leu Leu Lys Pro Glu Leu Lys Asp Lys Val Thr Tyr Thr Leu
195 200 205
Leu Arg Lys His Arg His Gln Thr Lys Lys Ser Asn Leu Arg Ser Leu
210 215 220
Ala Asp Ile Gly Lys Thr Val Ser Ser Ala Ser Arg Met Phe Thr Asn
225 230 235 240
Pro Asp Asn Gly Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser
245 250 255
Ser Leu Asn Asp Ile Ser Asp Lys Pro Glu Lys Asp Gln Leu Lys Asn
260 265 270
Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser Asn Val Leu
275 280 285
Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala Phe Val Arg
290 295 300
Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val Pro Val Pro
305 310 315 320
Thr Arg Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser
325 330 335
Tyr His Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu Val
340 345 350
Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile Ala
355 360 365
Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly Glu
370 375 380
Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser Ser
385 390 395 400
Asp Lys Arg Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn
405 410 415
Gly Asp Thr Pro His Asp Gly Gly His Gly Gly Gly Gly His Gly Asp
420 425 430
Cys Glu Glu Leu Gln Arg Thr Gly Arg Phe Cys Gly Gly Leu Ile Lys
435 440 445
Asp Ile Lys Arg Lys Ala Pro Phe Phe Ala Ser Asp Phe Tyr Asp Ala
450 455 460
Leu Asn Ile Gln Ala Leu Ser Ala Ile Leu Phe Ile Tyr Leu Ala Thr
465 470 475 480
Val Thr Asn Ala Ile Thr Phe Gly Gly Leu Leu Gly Asp Ala Thr Asp
485 490 495
Asn Met Gln Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly
500 505 510
Ala Ile Phe Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser
515 520 525
Thr Gly Pro Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser Lys
530 535 540
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
545 550 555 560
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
565 570 575
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
580 585 590
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
595 600 605
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
610 615 620
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro Ala Asn Ile Ser Ile Ser
625 630 635 640
Asn Asp Thr Thr Leu Ala Pro Glu Tyr Leu Pro Thr Met Ser Ser Thr
645 650 655
Asp Met Tyr His Asn Thr Thr Phe Asp Trp Ala Phe Leu Ser Lys Lys
660 665 670
Glu Cys Ser Lys Tyr Gly Gly Asn Leu Val Gly Asn Asn Cys Asn Phe
675 680 685
Val Pro Asp Ile Thr Leu Met Ser Phe Ile Leu Phe Leu Gly Thr Tyr
690 695 700
Thr Ser Ser Met Ala Leu Lys Lys Phe Lys Thr Ser Pro Tyr Phe Pro
705 710 715 720
Thr Thr Ala Arg Lys Leu Ile Ser Asp Phe Ala Ile Ile Leu Ser Ile
725 730 735
Leu Ile Phe Cys Val Ile Asp Ala Leu Val Gly Val Asp Thr Pro Lys
740 745 750
Leu Ile Val Pro Ser Glu Phe Lys Pro Thr Ser Pro Asn Arg Gly Trp
755 760 765
Phe Val Pro Pro Phe Gly Glu Asn Pro Trp Trp Val Cys Leu Ala Ala
770 775 780
Ala Ile Pro Ala Leu Leu Val Thr Ile Leu Ile Phe Met Asp Gln Gln
785 790 795 800
Ile Thr Ala Val Ile Val Asn Arg Lys Glu His Lys Leu Lys Lys Gly
805 810 815
Ala Gly Tyr His Leu Asp Leu Phe Trp Val Ala Ile Leu Met Val Ile
820 825 830
Cys Ser Leu Met Ala Leu Pro Trp Tyr Val Ala Ala Thr Val Ile Ser
835 840 845
Ile Ala His Ile Asp Ser Leu Lys Met Glu Thr Glu Thr Ser Ala Pro
850 855 860
Gly Glu Gln Pro Lys Phe Leu Gly Val Arg Glu Gln Arg Val Thr Gly
865 870 875 880
Thr Leu Val Phe Ile Leu Thr Gly Leu Ser Val Phe Met Ala Pro Ile
885 890 895
Leu Lys Phe Ile Pro Met Pro Val Leu Tyr Gly Val Phe Leu Tyr Met
900 905 910
Gly Val Ala Ser Leu Asn Gly Val Gln Phe Met Asp Arg Leu Lys Leu
915 920 925
Leu Leu Met Pro Leu Lys His Gln Pro Asp Phe Ile Tyr Leu Arg His
930 935 940
Val Pro Leu Arg Arg Val His Leu Phe Thr Phe Leu Gln Val Leu Cys
945 950 955 960
Leu Ala Leu Leu Trp Ile Leu Lys Ser Thr Val Ala Ala Ile Ile Phe
965 970 975
Pro Val Met Ile Leu Ala Leu Val Ala Val Arg Lys Gly Met Asp Tyr
980 985 990
Leu Phe Ser Gln His Asp Leu Ser Phe Leu Asp Asp Val Ile Pro Glu
995 1000 1005
Lys Asp Lys Lys Lys Lys Glu Asp Glu Lys Lys Lys Lys Lys Lys
1010 1015 1020
Lys Gly Ser Leu Asp Ser Asp Asn Asp Asp Ser Asp Cys Pro Tyr
1025 1030 1035
Ser Glu Lys Val Pro Ser Ile Lys Ile Pro Met Asp Ile Met Glu
1040 1045 1050
Gln Gln Pro Phe Leu Ser Asp Ser Lys Pro Ser Asp Arg Glu Arg
1055 1060 1065
Ser Pro Thr Phe Leu Glu Arg His Thr Ser Cys
1070 1075
<![CDATA[<210> 820]]>
<![CDATA[<211> 2047]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 820]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gaggatggag gatgaagctg tcctggacag aggggcttcc 180
ttcctcaagc atgtgtgtga tgaagaagaa gtagaaggcc accataccat ttacatcgga 240
gtccatgtgc cgaagagtta caggagaagg agacgtcaca agagaaagac agggcacaaa 300
gaaaagaagg aaaaggagag aatctctgag aactactctg acaaatcaga tattgaaaat 360
gctgatgaat ccagcagcag catcctaaaa cctctcatct ctcctgctgc agaacgcatc 420
cgattcatct tgggagagga ggatgacagc ccagctcccc ctcagctctt cacggaactg 480
gatgagctgc tggccgtgga tgggcaggag atggagtgga aggaaacagc caggtggatc 540
aagtttgaag aaaaagtgga acagggtggg gaaagatgga gcaagcccca tgtggccaca 600
ttgtcccttc atagtttatt tgagctgagg acatgtatgg agaaaggatc catcatgctt 660
gatcgggagg cttcttctct cccacagttg gtggagatga ttgttgacca tcagattgag 720
acaggcctat tgaaacctga acttaaggat aaggtgacct atactttgct ccggaagcac 780
cggcatcaaa ccaagaaatc caaccttcgg tccctggctg acattgggaa gacagtctcc 840
agtgcaagta ggatgtttac caaccctgat aatggtagcc cagccatgac ccataggaat 900
ctgacttcct ccagtctgaa tgacatttct gataaaccgg agaaggacca gctgaagaat 960
aagttcatga aaaaattgcc acgtgatgca gaagcttcca acgtgcttgt tggggaggtt 1020
gactttttgg atactccttt cattgccttt gttaggctac agcaggctgt catgctgggt 1080
gccctgactg aagttcctgt gcccacaagg ttcttgttca ttctcttagg tcctaagggg 1140
aaagccaagt cctaccacga gattggcaga gccattgcca ccctgatgtc tgatgaggtg 1200
ttccatgaca ttgcttataa agcaaaagac aggcacgacc tgattgctgg tattgatgag 1260
ttcctagatg aagtcatcgt ccttccacct ggggaatggg atccagcaat taggatagag 1320
cctcctaaga gtcttccatc ctctgacaaa agaaagaata tgtactcagg tggagagaat 1380
gttcagatga atggggatac gccccatgat ggaggtcacg gaggaggagg acatggggat 1440
tgtgaagaat tgcagcgaac tggacggttc tgtggtggac taattaaaga cataaagagg 1500
aaagcgccat tttttgccag tgatttttat gatgctttaa atattcaagc tctttcggca 1560
attctcttca tttatctggc aactgtaact aatgctatca cttttggagg actgcttggg 1620
gatgccactg acaacatgca gggcgtgttg gagagtttcc tgggcactgc tgtctctgga 1680
gccatctttt gcctttttgc tggtcaacca ctcactattc tgagcagcac cggacctgtc 1740
ctagtttttg agaggcttct atttaatttc agcaaggaca ataattttga ctatttggag 1800
tttcgccttt ggattggcct gtggtccgcc ttcctatgtc tcattttggt agccactgat 1860
gccagcttct tggttcaata cttcacacgt ttcacggagg agggcttttc ctctctgatt 1920
agcttcatct ttatctatga tgctttcaag aagatgatca agcttgcaga ttactacccc 1980
atcaactcca acttcaaagt gggctacaac actctctttt cctgtacctg tgtgccacct 2040
gacccag 2047
<![CDATA[<210> 821]]>
<![CDATA[<211> 634]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 821]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu Gly His His Thr Ile Tyr Ile Gly
20 25 30
Val His Val Pro Lys Ser Tyr Arg Arg Arg Arg Arg His Lys Arg Lys
35 40 45
Thr Gly His Lys Glu Lys Lys Glu Lys Glu Arg Ile Ser Glu Asn Tyr
50 55 60
Ser Asp Lys Ser Asp Ile Glu Asn Ala Asp Glu Ser Ser Ser Ser Ile
65 70 75 80
Leu Lys Pro Leu Ile Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu
85 90 95
Gly Glu Glu Asp Asp Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu
100 105 110
Asp Glu Leu Leu Ala Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr
115 120 125
Ala Arg Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg
130 135 140
Trp Ser Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu
145 150 155 160
Leu Arg Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala
165 170 175
Ser Ser Leu Pro Gln Leu Val Glu Met Ile Val Asp His Gln Ile Glu
180 185 190
Thr Gly Leu Leu Lys Pro Glu Leu Lys Asp Lys Val Thr Tyr Thr Leu
195 200 205
Leu Arg Lys His Arg His Gln Thr Lys Lys Ser Asn Leu Arg Ser Leu
210 215 220
Ala Asp Ile Gly Lys Thr Val Ser Ser Ala Ser Arg Met Phe Thr Asn
225 230 235 240
Pro Asp Asn Gly Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser
245 250 255
Ser Leu Asn Asp Ile Ser Asp Lys Pro Glu Lys Asp Gln Leu Lys Asn
260 265 270
Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser Asn Val Leu
275 280 285
Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala Phe Val Arg
290 295 300
Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val Pro Val Pro
305 310 315 320
Thr Arg Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser
325 330 335
Tyr His Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu Val
340 345 350
Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile Ala
355 360 365
Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly Glu
370 375 380
Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser Ser
385 390 395 400
Asp Lys Arg Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn
405 410 415
Gly Asp Thr Pro His Asp Gly Gly His Gly Gly Gly Gly His Gly Asp
420 425 430
Cys Glu Glu Leu Gln Arg Thr Gly Arg Phe Cys Gly Gly Leu Ile Lys
435 440 445
Asp Ile Lys Arg Lys Ala Pro Phe Phe Ala Ser Asp Phe Tyr Asp Ala
450 455 460
Leu Asn Ile Gln Ala Leu Ser Ala Ile Leu Phe Ile Tyr Leu Ala Thr
465 470 475 480
Val Thr Asn Ala Ile Thr Phe Gly Gly Leu Leu Gly Asp Ala Thr Asp
485 490 495
Asn Met Gln Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly
500 505 510
Ala Ile Phe Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser
515 520 525
Thr Gly Pro Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser Lys
530 535 540
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
545 550 555 560
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
565 570 575
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
580 585 590
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
595 600 605
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
610 615 620
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro
625 630
<![CDATA[<210> 822]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 822]]>
atgtgtccaa caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa 60
cggaggtcct ggcag 75
<![CDATA[<210> 823]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 823]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[<210> 824]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 824]]>
atgtgtccaa caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa 60
cggaggtcct ggcagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[<210> 825]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 825]]>
Val Ser Asn Lys Val Ser Met Ser Ser Thr Val Gln Gly Ser Asn Ile
1 5 10 15
Phe Glu Arg Thr Glu Val Leu Ala Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[<210> 826]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 826]]>
acaggaagca ccctaatgag gagtatgcca acgtgggcac cggggagctg gcagcggagg 60
tgctcatcca gcaag 75
<![CDATA[<210> 827]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 827]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatg 75
<![CDATA[<210> 828]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 828]]>
acaggaagca ccctaatgag gagtatgcca acgtgggcac cggggagctg gcagcggagg 60
tgctcatcca gcaagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[<210> 829]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 829]]>
Arg Lys His Pro Asn Glu Glu Tyr Ala Asn Val Gly Thr Gly Glu Leu
1 5 10 15
Ala Ala Glu Val Leu Ile Gln Gln Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[<210> 830]]>
<![CDATA[<211> 187]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 830]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcag 187
<![CDATA[<210> 831]]>
<![CDATA[<211> 177]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 831]]>
aaaacacggc catctttatg tgtaaatgtt gtaacctctt ctcaccaaat cagtcggaac 60
tcctctccca cgtttcagag aagcacatgg aagaaggggt taatgttgat gagattatta 120
ttccccttag gcctctgagt acacctgaac cccccaactc aagcaaaacc ggagatg 177
<![CDATA[<210> 832]]>
<![CDATA[<211> 252]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 832]]>
agtttttggt catgaagagg aagagaggca ggcctaaggg gtccacgaag aagtccagca 60
cagaagagga gctggcagaa aacatcgtga gtccgactga ggacagcccg ctggctccgg 120
aggaagggaa cagcctgcct ccaagcagct tggagtgtag caagtgctgt cggaagttct 180
ccaacacgcg ccagctgcgg aagcacatct gcattatcgt gctgaatttg ggtgaggagg 240
aaggagaagc ag 252
<![CDATA[<210> 833]]>
<![CDATA[<211> 186]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 833]]>
gtaacgagtc tgaccttgaa ctagaaaaga agtgtaagga agatgatcgg gaaaaagcct 60
cgaaaagacc acggtcacag aaaacagaga aagtccagaa gatctcagga aaggaggcca 120
gacagctttc tggggcgaag aaacccatca taagtgtggt tttaactgca cacgaagcaa 180
ttccag 186
<![CDATA[<210> 834]]>
<![CDATA[<211> 151]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 834]]>
gtgctaccaa gattgtgcca gtggaggctg ggccccctga aacaggagct acaaattctg 60
agaccacttc agcagacctg gtgcctcgga gaggctacca ggaatacgcc attcagcaga 120
caccttatga gcaaccaatg aagtcaagca g 151
<![CDATA[<210> 835]]>
<![CDATA[<211> 1457]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 835]]>
gctaggtccc actcagctca aaatcttcac ttgtgaatac tgcaacaagg tcttcaagtt 60
caagcactcg ctgcaggccc acctgaggat ccacaccaat gaaaagccat acaagtgccc 120
ccagtgcagc tatgccagtg ccatcaaggc caacctcaat gtgcacctgc gcaagcacac 180
tggagagaag ttcgcctgcg actattgctc gttcacctgc ctgagcaagg gccacctcaa 240
ggtgcacatc gagcgagtgc acaagaagat caagcagcac tgccgcttct gcaagaagaa 300
gtactctgac gtcaagaacc tcatcaagca catccgagac gcgcatgacc cacaggacaa 360
gaaggtcaaa gaggccttgg acgagctctg cctgatgacg agggagggca agcggcagct 420
gctctatgac tgccacatct gtgagcgcaa gttcaagaac gagctggacc gtgaccgcca 480
tatgctggtc cacggagaca agtggccttt tgcctgtgag ctctgtggcc atggggccac 540
caagtaccag gcgctggaac tgcatgtcag gaagcacccc ttcgtgtacg tctgtgccgt 600
ctgccgcaag aagttcgtca gctccatcag gctgcgcacc cacatcaaag aggtgcacgg 660
ggctgcccag gaggccttgg tcttcaccag ttccatcaac cagagcttct gcctcctgga 720
acctggtggg gacatccagc aagaagctct gggggaccag ctacagctgg tggaagagga 780
gtttgccctc cagggcgtga atgcactcaa ggaagaggcc tgtcctgggg acactcagct 840
ggaggagggc cggaaggagc cggaggcccc tggggaaatg cctgccccag ctgtgcacct 900
ggcctccccg caggccgaaa gcacagccct gccaccctgt gagctggaaa ccaccgtggt 960
ctcctcctca gacctgcatt ctcaagaggt ggtttcagat gattttttgt tgaaaaatga 1020
tacctcctcc gcagaggctc atgctgctcc tgagaagccc ccagacatgc agcacagaag 1080
ctcagtccag acgcaaggtg aagtgatcac actactgctg tccaaggccc agagtgctgg 1140
gtcagatcag gaaagccatg gcgcccagag ccccctaggg gaagggcaga acatggctgt 1200
gctttcagct ggtgacccag atcccagcag gtgtctcagg tcaaacccag ctgaggcctc 1260
agacctcctc cctccagtag ctggtggtgg ggacaccatc acacatcagc ctgactcttg 1320
caaagctgcc cctgagcacc ggtcaggcat caccgctttc atgaaggtcc tgaacagttt 1380
acagaagaag caaatgaaca ccagcttgtg tgagcggatc cggaaggttt atggagacct 1440
ggagtgtgaa tactgtg 1457
<![CDATA[<210> 836]]>
<![CDATA[<211> 233]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 836]]>
gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac acccgggaac 60
atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc cttaaacgcc 120
acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc tgtgactact 180
caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca ctg 233
<![CDATA[<210> 837]]>
<![CDATA[<211> 88]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 837]]>
gacaaaagga gttattcttg tcctgtttgt gaaaagtctt tttcagagga tcgattgata 60
aagtcacata tcaagaccaa ccatcctg 88
<![CDATA[<210> 838]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 838]]>
aggtctccat gagcaccatt tctgaggttc tcgggaggag ggttcagctg aaagggctaa 60
ttggaaagag agccatgaaa tgcccatatt gtgactttta tttcatgaag aatggctcag 120
accttcagcg tcatatttgg gctcatgaag 150
<![CDATA[<210> 839]]>
<![CDATA[<211> 174]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 839]]>
gtgtgaagcc cttcaagtgt tctttgtgtg agtatgcaac tcgtagcaag agtaacctca 60
aggctcatat gaatcgtcac agcactgaga aaacccacct atgtgacatg tgtggcaaga 120
aattcaaatc aaaagggaca ctgaaaagtc acaaactcct tcacactgca gatg 174
<![CDATA[<210> 840]]>
<![CDATA[<211> 89]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 840]]>
ggaagcagtt taagtgcacg gtgtgtgact acacagcggc ccagaagcca cagctgctgc 60
ggcacatgga acagcatgtc tccttcaag 89
<![CDATA[<210> 841]]>
<![CDATA[<211> 139]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 841]]>
cctttccgct gtgcccattg ccattactcc tgcaacatat ctggctctct gaagcggcac 60
tacaacagga agcaccctaa tgaggagtat gccaacgtgg gcaccgggga gctggcagcg 120
gaggtgctca tccagcaag 139
<![CDATA[<210> 842]]>
<![CDATA[<211> 119]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 842]]>
gtggtttgaa gtgtcctgtt tgcagctttg tatatggcac caaatgggag ttcaataggc 60
acttgaagaa caaacatggc ttgaaggtgg tggaaattga tggagacccc aagtgggag 119
<![CDATA[<210> 843]]>
<![CDATA[<211> 127]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 843]]>
acagcaacag aagctcctga ggagccctcc acccagtatc tccacatcac agaggccgaa 60
gaagacgttc aagggacaca ggcagcggtg gccgcgctcc aggacctgag atacacctct 120
gagagtg 127
<![CDATA[<210> 844]]>
<![CDATA[<211> 131]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 844]]>
gcgaccgact ggaccccacg gccgtgaaca tcctgcagca gatcattgag ctgggcgccg 60
agacccatga cgccactgcc cttgcctcgg tggttgccat ggcaccaggg acggtgactg 120
tggttaagca g 131
<![CDATA[<210> 845]]>
<![CDATA[<211> 934]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 845]]>
gtcaccgagg aggagcccag ctccaaccac acggtcatga tccaggagac ggtccagcaa 60
gcgtccgtgg agcttgccga gcagcaccac ctggtggtgt cctccgacga cgtggagggc 120
attgagacgg tgactgtcta cacgcagggc ggggaggcct cggagttcat cgtctacgtg 180
caggaggcca tgcagcctgt ggaggagcag gctgtggagc agccggccca ggaactctag 240
aggacatgtg gcatcggatg gccacagggc ggggctgcca ggctctgcag gcacccaggg 300
tggggaggcc acccttcctg ccctacccgc agaatggtgc tctcctttgc cctccctgcc 360
cagcagcctg ataggactct cctagtccaa cttggggtgg gcaaggcagt cagcatcacc 420
agcaatacca caggaccctc accccagcat agacacacac cccctgaccc ttaccatctg 480
cttcctgaaa gacttcagtg tcagctcccc tacacacacc ccacaccttc accccttgct 540
tcaagattca aacagagact cccagtcccc ctcagcatct tccctgaatc acagccccag 600
ctccttgacc cccatctagg tgccaaatgt tcatctgcaa ccgctatgca gtctggtgag 660
agggagacag ccatcacata gaaagtgacc gtacgggttt ttaatcactg ctgggtgggg 720
tgggggtagg gggattgtcc tggctttgtc gacaaagtcc cacttccccg agtattaagg 780
gcccttggta tcaagtgagg taaattcacc catcacaggg tctcgcccta ccatcctgga 840
attatttcac ttttaagata aatgcactat ttcactgttc gcctcccatt ctaaggaggt 900
gaggtggttg gaataaaaac agttcctgtc tgaa 934
<![CDATA[<210> 846]]>
<![CDATA[<211> 4594]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> ZFAT之全mRNA多核苷酸序列]]>
<![CDATA[<400> 846]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcagaaa acacggccat ctttatgtgt aaatgttgta acctcttctc accaaatcag 240
tcggaactcc tctcccacgt ttcagagaag cacatggaag aaggggttaa tgttgatgag 300
attattattc cccttaggcc tctgagtaca cctgaacccc ccaactcaag caaaaccgga 360
gatgagtttt tggtcatgaa gaggaagaga ggcaggccta aggggtccac gaagaagtcc 420
agcacagaag aggagctggc agaaaacatc gtgagtccga ctgaggacag cccgctggct 480
ccggaggaag ggaacagcct gcctccaagc agcttggagt gtagcaagtg ctgtcggaag 540
ttctccaaca cgcgccagct gcggaagcac atctgcatta tcgtgctgaa tttgggtgag 600
gaggaaggag aagcaggtaa cgagtctgac cttgaactag aaaagaagtg taaggaagat 660
gatcgggaaa aagcctcgaa aagaccacgg tcacagaaaa cagagaaagt ccagaagatc 720
tcaggaaagg aggccagaca gctttctggg gcgaagaaac ccatcataag tgtggtttta 780
actgcacacg aagcaattcc aggtgctacc aagattgtgc cagtggaggc tgggccccct 840
gaaacaggag ctacaaattc tgagaccact tcagcagacc tggtgcctcg gagaggctac 900
caggaatacg ccattcagca gacaccttat gagcaaccaa tgaagtcaag caggctaggt 960
cccactcagc tcaaaatctt cacttgtgaa tactgcaaca aggtcttcaa gttcaagcac 1020
tcgctgcagg cccacctgag gatccacacc aatgaaaagc catacaagtg cccccagtgc 1080
agctatgcca gtgccatcaa ggccaacctc aatgtgcacc tgcgcaagca cactggagag 1140
aagttcgcct gcgactattg ctcgttcacc tgcctgagca agggccacct caaggtgcac 1200
atcgagcgag tgcacaagaa gatcaagcag cactgccgct tctgcaagaa gaagtactct 1260
gacgtcaaga acctcatcaa gcacatccga gacgcgcatg acccacagga caagaaggtc 1320
aaagaggcct tggacgagct ctgcctgatg acgagggagg gcaagcggca gctgctctat 1380
gactgccaca tctgtgagcg caagttcaag aacgagctgg accgtgaccg ccatatgctg 1440
gtccacggag acaagtggcc ttttgcctgt gagctctgtg gccatggggc caccaagtac 1500
caggcgctgg aactgcatgt caggaagcac cccttcgtgt acgtctgtgc cgtctgccgc 1560
aagaagttcg tcagctccat caggctgcgc acccacatca aagaggtgca cggggctgcc 1620
caggaggcct tggtcttcac cagttccatc aaccagagct tctgcctcct ggaacctggt 1680
ggggacatcc agcaagaagc tctgggggac cagctacagc tggtggaaga ggagtttgcc 1740
ctccagggcg tgaatgcact caaggaagag gcctgtcctg gggacactca gctggaggag 1800
ggccggaagg agccggaggc ccctggggaa atgcctgccc cagctgtgca cctggcctcc 1860
ccgcaggccg aaagcacagc cctgccaccc tgtgagctgg aaaccaccgt ggtctcctcc 1920
tcagacctgc attctcaaga ggtggtttca gatgattttt tgttgaaaaa tgatacctcc 1980
tccgcagagg ctcatgctgc tcctgagaag cccccagaca tgcagcacag aagctcagtc 2040
cagacgcaag gtgaagtgat cacactactg ctgtccaagg cccagagtgc tgggtcagat 2100
caggaaagcc atggcgccca gagcccccta ggggaagggc agaacatggc tgtgctttca 2160
gctggtgacc cagatcccag caggtgtctc aggtcaaacc cagctgaggc ctcagacctc 2220
ctccctccag tagctggtgg tggggacacc atcacacatc agcctgactc ttgcaaagct 2280
gcccctgagc accggtcagg catcaccgct ttcatgaagg tcctgaacag tttacagaag 2340
aagcaaatga acaccagctt gtgtgagcgg atccggaagg tttatggaga cctggagtgt 2400
gaatactgtg gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac 2460
acccgggaac atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc 2520
cttaaacgcc acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc 2580
tgtgactact caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca 2640
ctggacaaaa ggagttattc ttgtcctgtt tgtgaaaagt ctttttcaga ggatcgattg 2700
ataaagtcac atatcaagac caaccatcct gaggtctcca tgagcaccat ttctgaggtt 2760
ctcgggagga gggttcagct gaaagggcta attggaaaga gagccatgaa atgcccatat 2820
tgtgactttt atttcatgaa gaatggctca gaccttcagc gtcatatttg ggctcatgaa 2880
ggtgtgaagc ccttcaagtg ttctttgtgt gagtatgcaa ctcgtagcaa gagtaacctc 2940
aaggctcata tgaatcgtca cagcactgag aaaacccacc tatgtgacat gtgtggcaag 3000
aaattcaaat caaaagggac actgaaaagt cacaaactcc ttcacactgc agatgggaag 3060
cagtttaagt gcacggtgtg tgactacaca gcggcccaga agccacagct gctgcggcac 3120
atggaacagc atgtctcctt caagcctttc cgctgtgccc attgccatta ctcctgcaac 3180
atatctggct ctctgaagcg gcactacaac aggaagcacc ctaatgagga gtatgccaac 3240
gtgggcaccg gggagctggc agcggaggtg ctcatccagc aaggtggttt gaagtgtcct 3300
gtttgcagct ttgtatatgg caccaaatgg gagttcaata ggcacttgaa gaacaaacat 3360
ggcttgaagg tggtggaaat tgatggagac cccaagtggg agacagcaac agaagctcct 3420
gaggagccct ccacccagta tctccacatc acagaggccg aagaagacgt tcaagggaca 3480
caggcagcgg tggccgcgct ccaggacctg agatacacct ctgagagtgg cgaccgactg 3540
gaccccacgg ccgtgaacat cctgcagcag atcattgagc tgggcgccga gacccatgac 3600
gccactgccc ttgcctcggt ggttgccatg gcaccaggga cggtgactgt ggttaagcag 3660
gtcaccgagg aggagcccag ctccaaccac acggtcatga tccaggagac ggtccagcaa 3720
gcgtccgtgg agcttgccga gcagcaccac ctggtggtgt cctccgacga cgtggagggc 3780
attgagacgg tgactgtcta cacgcagggc ggggaggcct cggagttcat cgtctacgtg 3840
caggaggcca tgcagcctgt ggaggagcag gctgtggagc agccggccca ggaactctag 3900
aggacatgtg gcatcggatg gccacagggc ggggctgcca ggctctgcag gcacccaggg 3960
tggggaggcc acccttcctg ccctacccgc agaatggtgc tctcctttgc cctccctgcc 4020
cagcagcctg ataggactct cctagtccaa cttggggtgg gcaaggcagt cagcatcacc 4080
agcaatacca caggaccctc accccagcat agacacacac cccctgaccc ttaccatctg 4140
cttcctgaaa gacttcagtg tcagctcccc tacacacacc ccacaccttc accccttgct 4200
tcaagattca aacagagact cccagtcccc ctcagcatct tccctgaatc acagccccag 4260
ctccttgacc cccatctagg tgccaaatgt tcatctgcaa ccgctatgca gtctggtgag 4320
agggagacag ccatcacata gaaagtgacc gtacgggttt ttaatcactg ctgggtgggg 4380
tgggggtagg gggattgtcc tggctttgtc gacaaagtcc cacttccccg agtattaagg 4440
gcccttggta tcaagtgagg taaattcacc catcacaggg tctcgcccta ccatcctgga 4500
attatttcac ttttaagata aatgcactat ttcactgttc gcctcccatt ctaaggaggt 4560
gaggtggttg gaataaaaac agttcctgtc tgaa 4594
<![CDATA[<210> 847]]>
<![CDATA[<211> 5]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 847]]>
Met Glu Thr Arg Ala
1 5
<![CDATA[<210> 848]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 848]]>
Asn Thr Ala Ile Phe Met Cys Lys Cys Cys Asn Leu Phe Ser Pro Asn
1 5 10 15
Gln Ser Glu Leu Leu Ser His Val Ser Glu Lys His Met Glu Glu Gly
20 25 30
Val Asn Val Asp Glu Ile Ile Ile Pro Leu Arg Pro Leu Ser Thr Pro
35 40 45
Glu Pro Pro Asn Ser Ser Lys Thr Gly Asp
50 55
<![CDATA[<210> 849]]>
<![CDATA[<211> 83]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 849]]>
Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser Thr Lys
1 5 10 15
Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser Pro Thr
20 25 30
Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro Pro Ser
35 40 45
Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr Arg Gln
50 55 60
Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu Glu Glu
65 70 75 80
Gly Glu Ala
<![CDATA[<210> 850]]>
<![CDATA[<211> 61]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 850]]>
Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys Cys Lys Glu Asp Asp Arg
1 5 10 15
Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln Lys Thr Glu Lys Val Gln
20 25 30
Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu Ser Gly Ala Lys Lys Pro
35 40 45
Ile Ile Ser Val Val Leu Thr Ala His Glu Ala Ile Pro
50 55 60
<![CDATA[<210> 851]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 851]]>
Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro Glu Thr Gly Ala
1 5 10 15
Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro Arg Arg Gly Tyr
20 25 30
Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln Pro Met Lys Ser
35 40 45
Ser
<![CDATA[<210> 852]]>
<![CDATA[<211> 485]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> ]]> 人類
<![CDATA[<400> 852]]>
Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr Cys Glu Tyr Cys Asn Lys
1 5 10 15
Val Phe Lys Phe Lys His Ser Leu Gln Ala His Leu Arg Ile His Thr
20 25 30
Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys Ser Tyr Ala Ser Ala Ile
35 40 45
Lys Ala Asn Leu Asn Val His Leu Arg Lys His Thr Gly Glu Lys Phe
50 55 60
Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu Ser Lys Gly His Leu Lys
65 70 75 80
Val His Ile Glu Arg Val His Lys Lys Ile Lys Gln His Cys Arg Phe
85 90 95
Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn Leu Ile Lys His Ile Arg
100 105 110
Asp Ala His Asp Pro Gln Asp Lys Lys Val Lys Glu Ala Leu Asp Glu
115 120 125
Leu Cys Leu Met Thr Arg Glu Gly Lys Arg Gln Leu Leu Tyr Asp Cys
130 135 140
His Ile Cys Glu Arg Lys Phe Lys Asn Glu Leu Asp Arg Asp Arg His
145 150 155 160
Met Leu Val His Gly Asp Lys Trp Pro Phe Ala Cys Glu Leu Cys Gly
165 170 175
His Gly Ala Thr Lys Tyr Gln Ala Leu Glu Leu His Val Arg Lys His
180 185 190
Pro Phe Val Tyr Val Cys Ala Val Cys Arg Lys Lys Phe Val Ser Ser
195 200 205
Ile Arg Leu Arg Thr His Ile Lys Glu Val His Gly Ala Ala Gln Glu
210 215 220
Ala Leu Val Phe Thr Ser Ser Ile Asn Gln Ser Phe Cys Leu Leu Glu
225 230 235 240
Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu Gly Asp Gln Leu Gln Leu
245 250 255
Val Glu Glu Glu Phe Ala Leu Gln Gly Val Asn Ala Leu Lys Glu Glu
260 265 270
Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu Gly Arg Lys Glu Pro Glu
275 280 285
Ala Pro Gly Glu Met Pro Ala Pro Ala Val His Leu Ala Ser Pro Gln
290 295 300
Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu Leu Glu Thr Thr Val Val
305 310 315 320
Ser Ser Ser Asp Leu His Ser Gln Glu Val Val Ser Asp Asp Phe Leu
325 330 335
Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala His Ala Ala Pro Glu Lys
340 345 350
Pro Pro Asp Met Gln His Arg Ser Ser Val Gln Thr Gln Gly Glu Val
355 360 365
Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser Ala Gly Ser Asp Gln Glu
370 375 380
Ser His Gly Ala Gln Ser Pro Leu Gly Glu Gly Gln Asn Met Ala Val
385 390 395 400
Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg Cys Leu Arg Ser Asn Pro
405 410 415
Ala Glu Ala Ser Asp Leu Leu Pro Pro Val Ala Gly Gly Gly Asp Thr
420 425 430
Ile Thr His Gln Pro Asp Ser Cys Lys Ala Ala Pro Glu His Arg Ser
435 440 445
Gly Ile Thr Ala Phe Met Lys Val Leu Asn Ser Leu Gln Lys Lys Gln
450 455 460
Met Asn Thr Ser Leu Cys Glu Arg Ile Arg Lys Val Tyr Gly Asp Leu
465 470 475 480
Glu Cys Glu Tyr Cys
485
<![CDATA[<210> 853]]>
<![CDATA[<211> 77]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 853]]>
Lys Leu Phe Trp Tyr Gln Val His Phe Asp Met His Val Arg Thr His
1 5 10 15
Thr Arg Glu His Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile
20 25 30
Thr Lys Asn Cys Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile
35 40 45
Leu Leu Lys Cys Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys
50 55 60
Tyr Lys Leu Gln Ala His Leu Lys Val His Thr Ala Leu
65 70 75
<![CDATA[<210> 854]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 854]]>
Asp Lys Arg Ser Tyr Ser Cys Pro Val Cys Glu Lys Ser Phe Ser Glu
1 5 10 15
Asp Arg Leu Ile Lys Ser His Ile Lys Thr Asn His Pro
20 25
<![CDATA[<210> 855]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 855]]>
Val Ser Met Ser Thr Ile Ser Glu Val Leu Gly Arg Arg Val Gln Leu
1 5 10 15
Lys Gly Leu Ile Gly Lys Arg Ala Met Lys Cys Pro Tyr Cys Asp Phe
20 25 30
Tyr Phe Met Lys Asn Gly Ser Asp Leu Gln Arg His Ile Trp Ala His
35 40 45
Glu
<![CDATA[<210> 856]]>
<![CDATA[<211> 57]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 856]]>
Val Lys Pro Phe Lys Cys Ser Leu Cys Glu Tyr Ala Thr Arg Ser Lys
1 5 10 15
Ser Asn Leu Lys Ala His Met Asn Arg His Ser Thr Glu Lys Thr His
20 25 30
Leu Cys Asp Met Cys Gly Lys Lys Phe Lys Ser Lys Gly Thr Leu Lys
35 40 45
Ser His Lys Leu Leu His Thr Ala Asp
50 55
<![CDATA[<210> 857]]>
<![CDATA[<211> 29]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 857]]>
Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala Gln Lys Pro
1 5 10 15
Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
20 25
<![CDATA[<210> 858]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 858]]>
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
1 5 10 15
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala Asn
20 25 30
Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
35 40 45
<![CDATA[<210> 859]]>
<![CDATA[<211> 39]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 859]]>
Gly Leu Lys Cys Pro Val Cys Ser Phe Val Tyr Gly Thr Lys Trp Glu
1 5 10 15
Phe Asn Arg His Leu Lys Asn Lys His Gly Leu Lys Val Val Glu Ile
20 25 30
Asp Gly Asp Pro Lys Trp Glu
35
<![CDATA[<210> 860]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 860]]>
Thr Ala Thr Glu Ala Pro Glu Glu Pro Ser Thr Gln Tyr Leu His Ile
1 5 10 15
Thr Glu Ala Glu Glu Asp Val Gln Gly Thr Gln Ala Ala Val Ala Ala
20 25 30
Leu Gln Asp Leu Arg Tyr Thr Ser Glu Ser
35 40
<![CDATA[<210> 861]]>
<![CDATA[<211> 43]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 861]]>
Asp Arg Leu Asp Pro Thr Ala Val Asn Ile Leu Gln Gln Ile Ile Glu
1 5 10 15
Leu Gly Ala Glu Thr His Asp Ala Thr Ala Leu Ala Ser Val Val Ala
20 25 30
Met Ala Pro Gly Thr Val Thr Val Val Lys Gln
35 40
<![CDATA[<210> 862]]>
<![CDATA[<211> 79]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 862]]>
Val Thr Glu Glu Glu Pro Ser Ser Asn His Thr Val Met Ile Gln Glu
1 5 10 15
Thr Val Gln Gln Ala Ser Val Glu Leu Ala Glu Gln His His Leu Val
20 25 30
Val Ser Ser Asp Asp Val Glu Gly Ile Glu Thr Val Thr Val Tyr Thr
35 40 45
Gln Gly Gly Glu Ala Ser Glu Phe Ile Val Tyr Val Gln Glu Ala Met
50 55 60
Gln Pro Val Glu Glu Gln Ala Val Glu Gln Pro Ala Gln Glu Leu
65 70 75
<![CDATA[<210> 863]]>
<![CDATA[<211> 1243]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 863]]>
Met Glu Thr Arg Ala Ala Glu Asn Thr Ala Ile Phe Met Cys Lys Cys
1 5 10 15
Cys Asn Leu Phe Ser Pro Asn Gln Ser Glu Leu Leu Ser His Val Ser
20 25 30
Glu Lys His Met Glu Glu Gly Val Asn Val Asp Glu Ile Ile Ile Pro
35 40 45
Leu Arg Pro Leu Ser Thr Pro Glu Pro Pro Asn Ser Ser Lys Thr Gly
50 55 60
Asp Glu Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser
65 70 75 80
Thr Lys Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser
85 90 95
Pro Thr Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro
100 105 110
Pro Ser Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr
115 120 125
Arg Gln Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu
130 135 140
Glu Glu Gly Glu Ala Gly Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys
145 150 155 160
Cys Lys Glu Asp Asp Arg Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln
165 170 175
Lys Thr Glu Lys Val Gln Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu
180 185 190
Ser Gly Ala Lys Lys Pro Ile Ile Ser Val Val Leu Thr Ala His Glu
195 200 205
Ala Ile Pro Gly Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro
210 215 220
Glu Thr Gly Ala Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro
225 230 235 240
Arg Arg Gly Tyr Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln
245 250 255
Pro Met Lys Ser Ser Arg Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr
260 265 270
Cys Glu Tyr Cys Asn Lys Val Phe Lys Phe Lys His Ser Leu Gln Ala
275 280 285
His Leu Arg Ile His Thr Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys
290 295 300
Ser Tyr Ala Ser Ala Ile Lys Ala Asn Leu Asn Val His Leu Arg Lys
305 310 315 320
His Thr Gly Glu Lys Phe Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu
325 330 335
Ser Lys Gly His Leu Lys Val His Ile Glu Arg Val His Lys Lys Ile
340 345 350
Lys Gln His Cys Arg Phe Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn
355 360 365
Leu Ile Lys His Ile Arg Asp Ala His Asp Pro Gln Asp Lys Lys Val
370 375 380
Lys Glu Ala Leu Asp Glu Leu Cys Leu Met Thr Arg Glu Gly Lys Arg
385 390 395 400
Gln Leu Leu Tyr Asp Cys His Ile Cys Glu Arg Lys Phe Lys Asn Glu
405 410 415
Leu Asp Arg Asp Arg His Met Leu Val His Gly Asp Lys Trp Pro Phe
420 425 430
Ala Cys Glu Leu Cys Gly His Gly Ala Thr Lys Tyr Gln Ala Leu Glu
435 440 445
Leu His Val Arg Lys His Pro Phe Val Tyr Val Cys Ala Val Cys Arg
450 455 460
Lys Lys Phe Val Ser Ser Ile Arg Leu Arg Thr His Ile Lys Glu Val
465 470 475 480
His Gly Ala Ala Gln Glu Ala Leu Val Phe Thr Ser Ser Ile Asn Gln
485 490 495
Ser Phe Cys Leu Leu Glu Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu
500 505 510
Gly Asp Gln Leu Gln Leu Val Glu Glu Glu Phe Ala Leu Gln Gly Val
515 520 525
Asn Ala Leu Lys Glu Glu Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu
530 535 540
Gly Arg Lys Glu Pro Glu Ala Pro Gly Glu Met Pro Ala Pro Ala Val
545 550 555 560
His Leu Ala Ser Pro Gln Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu
565 570 575
Leu Glu Thr Thr Val Val Ser Ser Ser Asp Leu His Ser Gln Glu Val
580 585 590
Val Ser Asp Asp Phe Leu Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala
595 600 605
His Ala Ala Pro Glu Lys Pro Pro Asp Met Gln His Arg Ser Ser Val
610 615 620
Gln Thr Gln Gly Glu Val Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser
625 630 635 640
Ala Gly Ser Asp Gln Glu Ser His Gly Ala Gln Ser Pro Leu Gly Glu
645 650 655
Gly Gln Asn Met Ala Val Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg
660 665 670
Cys Leu Arg Ser Asn Pro Ala Glu Ala Ser Asp Leu Leu Pro Pro Val
675 680 685
Ala Gly Gly Gly Asp Thr Ile Thr His Gln Pro Asp Ser Cys Lys Ala
690 695 700
Ala Pro Glu His Arg Ser Gly Ile Thr Ala Phe Met Lys Val Leu Asn
705 710 715 720
Ser Leu Gln Lys Lys Gln Met Asn Thr Ser Leu Cys Glu Arg Ile Arg
725 730 735
Lys Val Tyr Gly Asp Leu Glu Cys Glu Tyr Cys Gly Lys Leu Phe Trp
740 745 750
Tyr Gln Val His Phe Asp Met His Val Arg Thr His Thr Arg Glu His
755 760 765
Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile Thr Lys Asn Cys
770 775 780
Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile Leu Leu Lys Cys
785 790 795 800
Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys Tyr Lys Leu Gln
805 810 815
Ala His Leu Lys Val His Thr Ala Leu Asp Lys Arg Ser Tyr Ser Cys
820 825 830
Pro Val Cys Glu Lys Ser Phe Ser Glu Asp Arg Leu Ile Lys Ser His
835 840 845
Ile Lys Thr Asn His Pro Glu Val Ser Met Ser Thr Ile Ser Glu Val
850 855 860
Leu Gly Arg Arg Val Gln Leu Lys Gly Leu Ile Gly Lys Arg Ala Met
865 870 875 880
Lys Cys Pro Tyr Cys Asp Phe Tyr Phe Met Lys Asn Gly Ser Asp Leu
885 890 895
Gln Arg His Ile Trp Ala His Glu Gly Val Lys Pro Phe Lys Cys Ser
900 905 910
Leu Cys Glu Tyr Ala Thr Arg Ser Lys Ser Asn Leu Lys Ala His Met
915 920 925
Asn Arg His Ser Thr Glu Lys Thr His Leu Cys Asp Met Cys Gly Lys
930 935 940
Lys Phe Lys Ser Lys Gly Thr Leu Lys Ser His Lys Leu Leu His Thr
945 950 955 960
Ala Asp Gly Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala
965 970 975
Gln Lys Pro Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
980 985 990
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
995 1000 1005
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala
1010 1015 1020
Asn Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
1025 1030 1035
Gly Gly Leu Lys Cys Pro Val Cys Ser Phe Val Tyr Gly Thr Lys
1040 1045 1050
Trp Glu Phe Asn Arg His Leu Lys Asn Lys His Gly Leu Lys Val
1055 1060 1065
Val Glu Ile Asp Gly Asp Pro Lys Trp Glu Thr Ala Thr Glu Ala
1070 1075 1080
Pro Glu Glu Pro Ser Thr Gln Tyr Leu His Ile Thr Glu Ala Glu
1085 1090 1095
Glu Asp Val Gln Gly Thr Gln Ala Ala Val Ala Ala Leu Gln Asp
1100 1105 1110
Leu Arg Tyr Thr Ser Glu Ser Gly Asp Arg Leu Asp Pro Thr Ala
1115 1120 1125
Val Asn Ile Leu Gln Gln Ile Ile Glu Leu Gly Ala Glu Thr His
1130 1135 1140
Asp Ala Thr Ala Leu Ala Ser Val Val Ala Met Ala Pro Gly Thr
1145 1150 1155
Val Thr Val Val Lys Gln Val Thr Glu Glu Glu Pro Ser Ser Asn
1160 1165 1170
His Thr Val Met Ile Gln Glu Thr Val Gln Gln Ala Ser Val Glu
1175 1180 1185
Leu Ala Glu Gln His His Leu Val Val Ser Ser Asp Asp Val Glu
1190 1195 1200
Gly Ile Glu Thr Val Thr Val Tyr Thr Gln Gly Gly Glu Ala Ser
1205 1210 1215
Glu Phe Ile Val Tyr Val Gln Glu Ala Met Gln Pro Val Glu Glu
1220 1225 1230
Gln Ala Val Glu Gln Pro Ala Gln Glu Leu
1235 1240
<![CDATA[<210> 864]]>
<![CDATA[<211> 3283]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 864]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcagaaa acacggccat ctttatgtgt aaatgttgta acctcttctc accaaatcag 240
tcggaactcc tctcccacgt ttcagagaag cacatggaag aaggggttaa tgttgatgag 300
attattattc cccttaggcc tctgagtaca cctgaacccc ccaactcaag caaaaccgga 360
gatgagtttt tggtcatgaa gaggaagaga ggcaggccta aggggtccac gaagaagtcc 420
agcacagaag aggagctggc agaaaacatc gtgagtccga ctgaggacag cccgctggct 480
ccggaggaag ggaacagcct gcctccaagc agcttggagt gtagcaagtg ctgtcggaag 540
ttctccaaca cgcgccagct gcggaagcac atctgcatta tcgtgctgaa tttgggtgag 600
gaggaaggag aagcaggtaa cgagtctgac cttgaactag aaaagaagtg taaggaagat 660
gatcgggaaa aagcctcgaa aagaccacgg tcacagaaaa cagagaaagt ccagaagatc 720
tcaggaaagg aggccagaca gctttctggg gcgaagaaac ccatcataag tgtggtttta 780
actgcacacg aagcaattcc aggtgctacc aagattgtgc cagtggaggc tgggccccct 840
gaaacaggag ctacaaattc tgagaccact tcagcagacc tggtgcctcg gagaggctac 900
caggaatacg ccattcagca gacaccttat gagcaaccaa tgaagtcaag caggctaggt 960
cccactcagc tcaaaatctt cacttgtgaa tactgcaaca aggtcttcaa gttcaagcac 1020
tcgctgcagg cccacctgag gatccacacc aatgaaaagc catacaagtg cccccagtgc 1080
agctatgcca gtgccatcaa ggccaacctc aatgtgcacc tgcgcaagca cactggagag 1140
aagttcgcct gcgactattg ctcgttcacc tgcctgagca agggccacct caaggtgcac 1200
atcgagcgag tgcacaagaa gatcaagcag cactgccgct tctgcaagaa gaagtactct 1260
gacgtcaaga acctcatcaa gcacatccga gacgcgcatg acccacagga caagaaggtc 1320
aaagaggcct tggacgagct ctgcctgatg acgagggagg gcaagcggca gctgctctat 1380
gactgccaca tctgtgagcg caagttcaag aacgagctgg accgtgaccg ccatatgctg 1440
gtccacggag acaagtggcc ttttgcctgt gagctctgtg gccatggggc caccaagtac 1500
caggcgctgg aactgcatgt caggaagcac cccttcgtgt acgtctgtgc cgtctgccgc 1560
aagaagttcg tcagctccat caggctgcgc acccacatca aagaggtgca cggggctgcc 1620
caggaggcct tggtcttcac cagttccatc aaccagagct tctgcctcct ggaacctggt 1680
ggggacatcc agcaagaagc tctgggggac cagctacagc tggtggaaga ggagtttgcc 1740
ctccagggcg tgaatgcact caaggaagag gcctgtcctg gggacactca gctggaggag 1800
ggccggaagg agccggaggc ccctggggaa atgcctgccc cagctgtgca cctggcctcc 1860
ccgcaggccg aaagcacagc cctgccaccc tgtgagctgg aaaccaccgt ggtctcctcc 1920
tcagacctgc attctcaaga ggtggtttca gatgattttt tgttgaaaaa tgatacctcc 1980
tccgcagagg ctcatgctgc tcctgagaag cccccagaca tgcagcacag aagctcagtc 2040
cagacgcaag gtgaagtgat cacactactg ctgtccaagg cccagagtgc tgggtcagat 2100
caggaaagcc atggcgccca gagcccccta ggggaagggc agaacatggc tgtgctttca 2160
gctggtgacc cagatcccag caggtgtctc aggtcaaacc cagctgaggc ctcagacctc 2220
ctccctccag tagctggtgg tggggacacc atcacacatc agcctgactc ttgcaaagct 2280
gcccctgagc accggtcagg catcaccgct ttcatgaagg tcctgaacag tttacagaag 2340
aagcaaatga acaccagctt gtgtgagcgg atccggaagg tttatggaga cctggagtgt 2400
gaatactgtg gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac 2460
acccgggaac atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc 2520
cttaaacgcc acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc 2580
tgtgactact caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca 2640
ctggacaaaa ggagttattc ttgtcctgtt tgtgaaaagt ctttttcaga ggatcgattg 2700
ataaagtcac atatcaagac caaccatcct gaggtctcca tgagcaccat ttctgaggtt 2760
ctcgggagga gggttcagct gaaagggcta attggaaaga gagccatgaa atgcccatat 2820
tgtgactttt atttcatgaa gaatggctca gaccttcagc gtcatatttg ggctcatgaa 2880
ggtgtgaagc ccttcaagtg ttctttgtgt gagtatgcaa ctcgtagcaa gagtaacctc 2940
aaggctcata tgaatcgtca cagcactgag aaaacccacc tatgtgacat gtgtggcaag 3000
aaattcaaat caaaagggac actgaaaagt cacaaactcc ttcacactgc agatgggaag 3060
cagtttaagt gcacggtgtg tgactacaca gcggcccaga agccacagct gctgcggcac 3120
atggaacagc atgtctcctt caagcctttc cgctgtgccc attgccatta ctcctgcaac 3180
atatctggct ctctgaagcg gcactacaac aggaagcacc ctaatgagga gtatgccaac 3240
gtgggcaccg gggagctggc agcggaggtg ctcatccagc aag 3283
<![CDATA[<210> 865]]>
<![CDATA[<211> 1038]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 865]]>
Met Glu Thr Arg Ala Ala Glu Asn Thr Ala Ile Phe Met Cys Lys Cys
1 5 10 15
Cys Asn Leu Phe Ser Pro Asn Gln Ser Glu Leu Leu Ser His Val Ser
20 25 30
Glu Lys His Met Glu Glu Gly Val Asn Val Asp Glu Ile Ile Ile Pro
35 40 45
Leu Arg Pro Leu Ser Thr Pro Glu Pro Pro Asn Ser Ser Lys Thr Gly
50 55 60
Asp Glu Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser
65 70 75 80
Thr Lys Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser
85 90 95
Pro Thr Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro
100 105 110
Pro Ser Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr
115 120 125
Arg Gln Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu
130 135 140
Glu Glu Gly Glu Ala Gly Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys
145 150 155 160
Cys Lys Glu Asp Asp Arg Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln
165 170 175
Lys Thr Glu Lys Val Gln Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu
180 185 190
Ser Gly Ala Lys Lys Pro Ile Ile Ser Val Val Leu Thr Ala His Glu
195 200 205
Ala Ile Pro Gly Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro
210 215 220
Glu Thr Gly Ala Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro
225 230 235 240
Arg Arg Gly Tyr Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln
245 250 255
Pro Met Lys Ser Ser Arg Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr
260 265 270
Cys Glu Tyr Cys Asn Lys Val Phe Lys Phe Lys His Ser Leu Gln Ala
275 280 285
His Leu Arg Ile His Thr Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys
290 295 300
Ser Tyr Ala Ser Ala Ile Lys Ala Asn Leu Asn Val His Leu Arg Lys
305 310 315 320
His Thr Gly Glu Lys Phe Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu
325 330 335
Ser Lys Gly His Leu Lys Val His Ile Glu Arg Val His Lys Lys Ile
340 345 350
Lys Gln His Cys Arg Phe Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn
355 360 365
Leu Ile Lys His Ile Arg Asp Ala His Asp Pro Gln Asp Lys Lys Val
370 375 380
Lys Glu Ala Leu Asp Glu Leu Cys Leu Met Thr Arg Glu Gly Lys Arg
385 390 395 400
Gln Leu Leu Tyr Asp Cys His Ile Cys Glu Arg Lys Phe Lys Asn Glu
405 410 415
Leu Asp Arg Asp Arg His Met Leu Val His Gly Asp Lys Trp Pro Phe
420 425 430
Ala Cys Glu Leu Cys Gly His Gly Ala Thr Lys Tyr Gln Ala Leu Glu
435 440 445
Leu His Val Arg Lys His Pro Phe Val Tyr Val Cys Ala Val Cys Arg
450 455 460
Lys Lys Phe Val Ser Ser Ile Arg Leu Arg Thr His Ile Lys Glu Val
465 470 475 480
His Gly Ala Ala Gln Glu Ala Leu Val Phe Thr Ser Ser Ile Asn Gln
485 490 495
Ser Phe Cys Leu Leu Glu Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu
500 505 510
Gly Asp Gln Leu Gln Leu Val Glu Glu Glu Phe Ala Leu Gln Gly Val
515 520 525
Asn Ala Leu Lys Glu Glu Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu
530 535 540
Gly Arg Lys Glu Pro Glu Ala Pro Gly Glu Met Pro Ala Pro Ala Val
545 550 555 560
His Leu Ala Ser Pro Gln Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu
565 570 575
Leu Glu Thr Thr Val Val Ser Ser Ser Asp Leu His Ser Gln Glu Val
580 585 590
Val Ser Asp Asp Phe Leu Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala
595 600 605
His Ala Ala Pro Glu Lys Pro Pro Asp Met Gln His Arg Ser Ser Val
610 615 620
Gln Thr Gln Gly Glu Val Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser
625 630 635 640
Ala Gly Ser Asp Gln Glu Ser His Gly Ala Gln Ser Pro Leu Gly Glu
645 650 655
Gly Gln Asn Met Ala Val Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg
660 665 670
Cys Leu Arg Ser Asn Pro Ala Glu Ala Ser Asp Leu Leu Pro Pro Val
675 680 685
Ala Gly Gly Gly Asp Thr Ile Thr His Gln Pro Asp Ser Cys Lys Ala
690 695 700
Ala Pro Glu His Arg Ser Gly Ile Thr Ala Phe Met Lys Val Leu Asn
705 710 715 720
Ser Leu Gln Lys Lys Gln Met Asn Thr Ser Leu Cys Glu Arg Ile Arg
725 730 735
Lys Val Tyr Gly Asp Leu Glu Cys Glu Tyr Cys Gly Lys Leu Phe Trp
740 745 750
Tyr Gln Val His Phe Asp Met His Val Arg Thr His Thr Arg Glu His
755 760 765
Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile Thr Lys Asn Cys
770 775 780
Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile Leu Leu Lys Cys
785 790 795 800
Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys Tyr Lys Leu Gln
805 810 815
Ala His Leu Lys Val His Thr Ala Leu Asp Lys Arg Ser Tyr Ser Cys
820 825 830
Pro Val Cys Glu Lys Ser Phe Ser Glu Asp Arg Leu Ile Lys Ser His
835 840 845
Ile Lys Thr Asn His Pro Glu Val Ser Met Ser Thr Ile Ser Glu Val
850 855 860
Leu Gly Arg Arg Val Gln Leu Lys Gly Leu Ile Gly Lys Arg Ala Met
865 870 875 880
Lys Cys Pro Tyr Cys Asp Phe Tyr Phe Met Lys Asn Gly Ser Asp Leu
885 890 895
Gln Arg His Ile Trp Ala His Glu Gly Val Lys Pro Phe Lys Cys Ser
900 905 910
Leu Cys Glu Tyr Ala Thr Arg Ser Lys Ser Asn Leu Lys Ala His Met
915 920 925
Asn Arg His Ser Thr Glu Lys Thr His Leu Cys Asp Met Cys Gly Lys
930 935 940
Lys Phe Lys Ser Lys Gly Thr Leu Lys Ser His Lys Leu Leu His Thr
945 950 955 960
Ala Asp Gly Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala
965 970 975
Gln Lys Pro Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
980 985 990
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
995 1000 1005
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala
1010 1015 1020
Asn Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
1025 1030 1035
<![CDATA[<210> 866]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 866]]>
gcctcatccc ctcttcagtg caggaatatg ttagcgttgt atcttgggag aaagacacag 60
tctccatcat cccag 75
<![CDATA[<210> 867]]>
<![CDATA[<211> 75]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 867]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[<210> 868]]>
<![CDATA[<211> 150]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 868]]>
gcctcatccc ctcttcagtg caggaatatg ttagcgttgt atcttgggag aaagacacag 60
tctccatcat cccagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[<210> 869]]>
<![CDATA[<211> 49]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 869]]>
Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser Trp Glu
1 5 10 15
Lys Asp Thr Val Ser Ile Ile Pro Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[<210> 870]]>
<![CDATA[<211> 183]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 870]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggaccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aag 183
<![CDATA[<210> 871]]>
<![CDATA[<211> 318]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 871]]>
cccgccctga agatgttggc accagcctct actttgtaaa tgactccttg cagcaggtga 60
ccttttccag ctccgtgggg gtggtggtgc cctgcccggc cgcgggctcc cccagcgcgg 120
cccttcgatg gtacctggcc acaggggacg acatctacga cgtgccgcac atccggcacg 180
tccacgccaa cgggacgctg cagctctacc ccttctcccc ctccgccttc aatagcttta 240
tccacgacaa tgactacttc tgcaccgcgg agaacgctgc cggcaagatc cggagcccca 300
acatccgcgt caaagcag 318
<![CDATA[<210> 872]]>
<![CDATA[<211> 147]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 872]]>
ttttcaggga accctacacc gtccgggtgg aggatcaaag gtcaatgcgt ggcaacgtgg 60
ccgtcttcaa gtgcctcatc ccctcttcag tgcaggaata tgttagcgtt gtatcttggg 120
agaaagacac agtctccatc atcccag 147
<![CDATA[<210> 873]]>
<![CDATA[<211> 147]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 873]]>
aacacaggtt ttttattacc taccacggcg ggctgtacat ctctgacgta cagaaggagg 60
acgccctctc cacctatcgc tgcatcacca agcacaagta tagcggggag acccggcaga 120
gcaatggggc acgcctctct gtgacag 147
<![CDATA[<210> 874]]>
<![CDATA[<211> 279]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 874]]>
accctgctga gtcgatcccc accatcctgg atggcttcca ctcccaggaa gtgtgggccg 60
gccacaccgt ggagctgccc tgcaccgcct cgggctaccc tatccccgcc atccgctggc 120
tcaaggatgg ccggcccctc ccggctgaca gccgctggac caagcgcatc acagggctga 180
ccatcagcga cttgcggacc gaggacagcg gcacctacat ttgtgaggtc accaacacct 240
tcggttcggc agaggccaca ggcatcctca tggtcattg 279
<![CDATA[<210> 875]]>
<![CDATA[<211> 276]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 875]]>
atccccttca tgtgaccctg acaccaaaga agctgaagac cggcattggc agcacggtca 60
tcctctcctg tgccctgacg ggctccccag agttcaccat ccgctggtat cgcaacacgg 120
agctggtgct gcctgacgag gccatctcca tccgcgggct cagcaacgag acgctgctca 180
tcacctcggc ccagaagagc cattccgggg cctaccagtg cttcgctacc cgcaaggccc 240
agaccgccca ggactttgcc atcattgcac ttgagg 276
<![CDATA[<210> 876]]>
<![CDATA[<211> 297]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 876]]>
atggcacgcc ccgcatcgtc tcgtccttca gcgagaaggt ggtcaacccc ggggagcagt 60
tctcactgat gtgtgcggcc aagggcgccc cgccccccac ggtcacctgg gccctcgacg 120
atgagcccat cgtgcgggat ggcagccacc gcaccaacca gtacaccatg tcggacggca 180
ccaccatcag ccacatgaac gtcacaggcc cccagatccg cgacgggggc gtgtaccggt 240
gcacagcgcg gaacttggtg ggcagtgctg aatatcaggc gcgaataaac gtaagag 297
<![CDATA[<210> 877]]>
<![CDATA[<211> 273]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 877]]>
gcccacccag catccgggct atgcggaaca tcacagcagt cgccgggcgg gacaccctta 60
tcaactgcag ggtcatcggc tatccctact actccatcaa gtggtacaag gatgccctgc 120
tgctgccaga caaccaccgc caggtggtgt ttgagaatgg gaccctcaag ctgactgacg 180
tgcagaaggg catggatgag ggggagtacc tgtgcagtgt cctcatccag ccccagctct 240
ccatcagcca gagcgttcac gtagccgtca aag 273
<![CDATA[<210> 878]]>
<![CDATA[<211> 279]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 878]]>
tgccccctct gatccagccc ttcgaattcc cacccgcctc catcggccag ctgctctaca 60
ttccctgtgt ggtgtcctcg ggggacatgc ccatccgtat cacctggagg aaggacggac 120
aggtgatcat ctcaggctcg ggcgtgacca tcgagagcaa ggaattcatg agctccctgc 180
agatctctag cgtctccctc aagcacaacg gcaactatac atgcatcgcc agcaacgcag 240
ccgccaccgt gagccgggag cgccagctca tcgtgcgtg 279
<![CDATA[<210> 879]]>
<![CDATA[<211> 120]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 879]]>
tgccccctcg atttgtggtg caacccaaca accaggatgg catctacggc aaagctggtg 60
tgctcaactg ctcggtggac ggctaccccc cacccaaggt catgtggaag catgccaagg 120
<![CDATA[<210> 880]]>
<![CDATA[<211> 177]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 880]]>
ggagcgggaa cccccagcag taccaccctg tgcccctcac tggccgcatc cagatcctgc 60
ccaacagctc gctgctgatc cgccacgtcc tagaagagga catcggctac tacctctgcc 120
aggccagcaa cggcgtaggc accgacatca gcaagtccat gttcctcaca gtcaaga 177
<![CDATA[<210> 881]]>
<![CDATA[<211> 200]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 881]]>
tcccggccat gatcacttcc caccccaaca ccaccatcgc catcaagggc catgcgaagg 60
agctaaactg cacggcacgg ggtgagcggc ccatcatcat ccgctgggag aagggggaca 120
cagtcatcga ccctgaccgc gtcatgcggt atgccatcgc caccaaggac aacggcgacg 180
aggtcgtctc cacactgaag 200
<![CDATA[<210> 882]]>
<![CDATA[<211> 97]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 882]]>
ctcaagcccg ctgaccgtgg ggactctgtg ttcttcagct gccatgccat caactcgtat 60
ggggaggacc ggggcttgat ccaactcact gtgcaag 97
<![CDATA[<210> 883]]>
<![CDATA[<211> 129]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 883]]>
agccccccga ccccccagag ctggagatcc gggaggtgaa ggcccggagc atgaacctgc 60
gctggaccca gcgattcgac gggaacagca tcatcacggg cttcgacatt gaatacaaga 120
acaaatcag 129
<![CDATA[<210> 884]]>
<![CDATA[<211> 168]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 884]]>
attcctggga cttcaagcag tccacacgca acatctcccc caccatcaac caggccaaca 60
ttgtggactt gcacccggca tctgtgtaca gcatccgcat gtactctttc aacaagattg 120
gccgcagtga accaagcaag gagctcacca tcagcactga ggaggccg 168
<![CDATA[<210> 885]]>
<![CDATA[<211> 71]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 885]]>
ctcccgatgg gccccccatg gatgttacct tgcagccagt gacctcacag agcatccagg 60
tgacctggaa g 71
<![CDATA[<210> 886]]>
<![CDATA[<211> 241]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 886]]>
gcacccaaga aggagctgca gaacggtgtc atccggggct accagattgg ctacagagag 60
aacagccccg gcagcaacgg gcagtacagc atcgtggaga tgaaggccac gggggacagc 120
gaggtctaca ccctggacaa cctcaagaag ttcgcccagt atggggtggt ggtccaagcc 180
ttcaatcggg ctggcacggg gccctcttcc agcgagatca atgccaccac tctggaggat 240
g 241
<![CDATA[<210> 887]]>
<![CDATA[<211> 147]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 887]]>
tgcccagcca gccccctgag aacgtccggg ccctgtccat cacttctgac gtggccgtca 60
tctcctggtc agagcccccg cgcagcaccc tcaatggcgt cctcaaaggc tatcgggtca 120
tcttctggtc cctctatgtt gatgggg 147
<![CDATA[<210> 888]]>
<![CDATA[<211> 156]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 888]]>
agtggggcga gatgcagaac atcaccacca cgcgggagcg ggtggagctg cggggcatgg 60
agaagttcac caactacagc gtccaggtgc tggcctacac ccaggctggg gacggcgtac 120
gcagcagtgt gctctacatc cagaccaagg aggacg 156
<![CDATA[<210> 889]]>
<![CDATA[<211> 140]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 889]]>
ttccaggtcc ccctgctggc atcaaagctg tcccttcatc agctagcagt gtggttgtgt 60
cttggctccc ccctaccaag cccaacgggg tgatccgcaa gtacaccatc ttctgttcca 120
gccccgggtc tggccagccg 140
<![CDATA[<210> 890]]>
<![CDATA[<211> 154]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 890]]>
gctcccagcg agtacgagac gagtccagag cagctcttct accggatcgc ccacctaaac 60
cgcggtcagc agtatctgct gtgggtggcc gccgtcacct ctgccggccg gggcaacagc 120
agcgagaagg tgaccatcga gcctgctggc aagg 154
<![CDATA[<210> 891]]>
<![CDATA[<211> 118]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 891]]>
ccccagcaaa gatcatctcc tttgggggca ccgtgacaac accttggatg aaagatgttc 60
ggctgccttg caattcagtg ggagatccag cccctgctgt gaagtggacc aaggacag 118
<![CDATA[<210> 892]]>
<![CDATA[<211> 164]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 892]]>
tgaagactcg gccattccag tgtccatgga tgggcaccgg ctcatccaca ccaatggcac 60
actgctgctg cgtgcagtga aggctgagga ctctggctac tacacgtgca cggccaccaa 120
cactggtggc tttgacacca tcatcgtcaa ccttctggtg caag 164
<![CDATA[<210> 893]]>
<![CDATA[<211> 99]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 893]]>
ttcccccgga ccagccccgc ctcactgtct ccaaaacctc agcttcgtcc atcaccctga 60
cctggattcc aggtgacaat gggggcagct ccatccgag 99
<![CDATA[<210> 894]]>
<![CDATA[<211> 189]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 894]]>
gcttcgtgct acagtactcg gtggacaaca gcgaggagtg gaaggatgtg ttcatcagct 60
ccagcgagcg ctccttcaag ctggacagcc tcaagtgtgg cacgtggtac aaggtgaagc 120
tggcagccaa gaacagcgtg ggctctgggc gcatcagcga gatcatcgag gccaagaccc 180
acgggcggg 189
<![CDATA[<210> 895]]>
<![CDATA[<211> 288]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 895]]>
agccctcctt cagcaaagac caacacctct tcacccacat caactccacg catgctcggc 60
ttaacctgca gggctggaac aatgggggct gccctatcac agccatcgtt ctggagtacc 120
ggcccaaggg gacctgggcc tggcagggcc tccgggccaa cagctccggg gaggtgtttc 180
tgacggaact gcgagaggcc acgtggtacg agctgcgcat gagggcttgc aacagtgcgg 240
gctgcggcaa tgaaacagcc cagttcgcca ccctggacta cgatggca 288
<![CDATA[<210> 896]]>
<![CDATA[<211> 156]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 896]]>
gcaccattcc acccatcaag tctgctcaag gtgaagggga tgatgtgaag aagctgttca 60
ccatcggctg ccctgtcatc ctggccacac tgggggtggc actgctcttc atcgtacgca 120
agaagaggaa ggagaaacgg ctgaagcgac tccgag 156
<![CDATA[<210> 897]]>
<![CDATA[<211> 31]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 897]]>
atgcaaagag tttggcagaa atgttgataa g 31
<![CDATA[<210> 898]]>
<![CDATA[<211> 119]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 898]]>
caagaacaat agaagctttg acacccctgt gaaagggcca ccccagggcc cacggctaca 60
cattgacatc cccagggtcc agctgctcat cgaggacaaa gaaggcatca agcaactgg 119
<![CDATA[<210> 899]]>
<![CDATA[<211> 153]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 899]]>
gagatgacaa ggccaccatc cctgtgacag atgctgagtt cagccaagct gtcaacccac 60
agagcttctg tactggcgtc tccttgcacc acccaaccct catccagagc acaggacccc 120
tcatcgacat gtctgacatc cggccaggaa cca 153
<![CDATA[<210> 900]]>
<![CDATA[<211> 195]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 900]]>
atccagtgtc caggaagaat gtgaagtcag cccacagcac ccggaaccgg tactcaagcc 60
agtggaccct gaccaagtgc caggcctcca cacctgcccg caccctcacc tccgactggc 120
gcaccgtggg ctcccagcat ggtgtcacgg tcactgagag tgacagctac agtgccagcc 180
tgtcccagga cacag 195
<![CDATA[<210> 901]]>
<![CDATA[<211> 312]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 901]]>
acaaaggaag gaacagcatg gtgtccactg agagtgcctc ttccacctac gaggagctgg 60
cccgggccta tgagcatgcc aagctggagg agcagctgca gcacgccaag tttgagatca 120
ccgagtgctt catctctgac agttcctctg accagatgac cacaggcacc aacgagaacg 180
ccgacagcat gacatccatg agcacaccct cagagcctgg catctgccgc tttaccgcct 240
caccacccaa gccccaggat gcggaccggg gcaaaaacgt ggctgtgccc atccctcacc 300
gggccaacaa ga 312
<![CDATA[<210> 902]]>
<![CDATA[<211> 1032]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 902]]>
gtgactactg caacctgccc ctgtatgcca agtcagaggc cttctttcga aaggcagatg 60
gacgtgagcc ctgccccgtg gtcccacccc gtgaggcctc catccggaac ctggctcgaa 120
cctaccacac ccaggctcgc cacctgaccc tggaccctgc cagcaagtcc ttgggccttc 180
cccacccagg ggcccccgct gccgcctcca cagccacctt acctcagagg actctggcca 240
tgccagcccc cccagccggc acagcccccc cagcccccgg ccccacccct gctgagccac 300
ccaccgcccc cagcgctgcc cctccggccc ccagcaccga gcctccacga gccgggggcc 360
cacacaccaa aatggggggc tccagggact cgcttctcga gatgagcaca tcgggggtag 420
ggaggtctca gaagcagggg gccggggcct actccaaatc ctacaccctg gtgtagggcc 480
cgcaggaaga gcagccacgc ctggaccgcg ccgcgccgca gccccacacg ccagctcggc 540
tgtttttctg cattatttat attcaactga cagacaaaaa ccaaccaacg acaaaacaaa 600
aacccccaat catgaacgcc tgtacataga actcttttgt acaaatgaaa ctattttctt 660
cttctccatg aagccagggc acaaagaatt tgacagtaca agtcaaatcc cccaccccac 720
aaaatatgtg tggagatata tatacatata tagacagaca ggaacgcgtc cacgagctat 780
atatctatat atttctctca ccctattttg agacagaggc acaaagactc agcaattttt 840
ttccctcctc ctcaccttcc ccccagtcta ggtggttttg acaaagacca aaatcccaac 900
tcagagacac tgcatgcgat tttactgttc caagaaaacc aggagttgct tcaatttgca 960
gatgcttatg tgttaatacc tttttctatg aaaaaagacc cagcgccgtg tgcaataaag 1020
gttatgtttc ta 1032
<![CDATA[<210> 903]]>
<![CDATA[<211> 6855]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> DSCAML1之全mRNA多核苷酸序列]]>
<![CDATA[<400> 903]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggaccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aagcccgccc tgaagatgtt ggcaccagcc tctactttgt aaatgactcc ttgcagcagg 240
tgaccttttc cagctccgtg ggggtggtgg tgccctgccc ggccgcgggc tcccccagcg 300
cggcccttcg atggtacctg gccacagggg acgacatcta cgacgtgccg cacatccggc 360
acgtccacgc caacgggacg ctgcagctct accccttctc cccctccgcc ttcaatagct 420
ttatccacga caatgactac ttctgcaccg cggagaacgc tgccggcaag atccggagcc 480
ccaacatccg cgtcaaagca gttttcaggg aaccctacac cgtccgggtg gaggatcaaa 540
ggtcaatgcg tggcaacgtg gccgtcttca agtgcctcat cccctcttca gtgcaggaat 600
atgttagcgt tgtatcttgg gagaaagaca cagtctccat catcccagaa cacaggtttt 660
ttattaccta ccacggcggg ctgtacatct ctgacgtaca gaaggaggac gccctctcca 720
cctatcgctg catcaccaag cacaagtata gcggggagac ccggcagagc aatggggcac 780
gcctctctgt gacagaccct gctgagtcga tccccaccat cctggatggc ttccactccc 840
aggaagtgtg ggccggccac accgtggagc tgccctgcac cgcctcgggc taccctatcc 900
ccgccatccg ctggctcaag gatggccggc ccctcccggc tgacagccgc tggaccaagc 960
gcatcacagg gctgaccatc agcgacttgc ggaccgagga cagcggcacc tacatttgtg 1020
aggtcaccaa caccttcggt tcggcagagg ccacaggcat cctcatggtc attgatcccc 1080
ttcatgtgac cctgacacca aagaagctga agaccggcat tggcagcacg gtcatcctct 1140
cctgtgccct gacgggctcc ccagagttca ccatccgctg gtatcgcaac acggagctgg 1200
tgctgcctga cgaggccatc tccatccgcg ggctcagcaa cgagacgctg ctcatcacct 1260
cggcccagaa gagccattcc ggggcctacc agtgcttcgc tacccgcaag gcccagaccg 1320
cccaggactt tgccatcatt gcacttgagg atggcacgcc ccgcatcgtc tcgtccttca 1380
gcgagaaggt ggtcaacccc ggggagcagt tctcactgat gtgtgcggcc aagggcgccc 1440
cgccccccac ggtcacctgg gccctcgacg atgagcccat cgtgcgggat ggcagccacc 1500
gcaccaacca gtacaccatg tcggacggca ccaccatcag ccacatgaac gtcacaggcc 1560
cccagatccg cgacgggggc gtgtaccggt gcacagcgcg gaacttggtg ggcagtgctg 1620
aatatcaggc gcgaataaac gtaagaggcc cacccagcat ccgggctatg cggaacatca 1680
cagcagtcgc cgggcgggac acccttatca actgcagggt catcggctat ccctactact 1740
ccatcaagtg gtacaaggat gccctgctgc tgccagacaa ccaccgccag gtggtgtttg 1800
agaatgggac cctcaagctg actgacgtgc agaagggcat ggatgagggg gagtacctgt 1860
gcagtgtcct catccagccc cagctctcca tcagccagag cgttcacgta gccgtcaaag 1920
tgccccctct gatccagccc ttcgaattcc cacccgcctc catcggccag ctgctctaca 1980
ttccctgtgt ggtgtcctcg ggggacatgc ccatccgtat cacctggagg aaggacggac 2040
aggtgatcat ctcaggctcg ggcgtgacca tcgagagcaa ggaattcatg agctccctgc 2100
agatctctag cgtctccctc aagcacaacg gcaactatac atgcatcgcc agcaacgcag 2160
ccgccaccgt gagccgggag cgccagctca tcgtgcgtgt gccccctcga tttgtggtgc 2220
aacccaacaa ccaggatggc atctacggca aagctggtgt gctcaactgc tcggtggacg 2280
gctacccccc acccaaggtc atgtggaagc atgccaaggg gagcgggaac ccccagcagt 2340
accaccctgt gcccctcact ggccgcatcc agatcctgcc caacagctcg ctgctgatcc 2400
gccacgtcct agaagaggac atcggctact acctctgcca ggccagcaac ggcgtaggca 2460
ccgacatcag caagtccatg ttcctcacag tcaagatccc ggccatgatc acttcccacc 2520
ccaacaccac catcgccatc aagggccatg cgaaggagct aaactgcacg gcacggggtg 2580
agcggcccat catcatccgc tgggagaagg gggacacagt catcgaccct gaccgcgtca 2640
tgcggtatgc catcgccacc aaggacaacg gcgacgaggt cgtctccaca ctgaagctca 2700
agcccgctga ccgtggggac tctgtgttct tcagctgcca tgccatcaac tcgtatgggg 2760
aggaccgggg cttgatccaa ctcactgtgc aagagccccc cgacccccca gagctggaga 2820
tccgggaggt gaaggcccgg agcatgaacc tgcgctggac ccagcgattc gacgggaaca 2880
gcatcatcac gggcttcgac attgaataca agaacaaatc agattcctgg gacttcaagc 2940
agtccacacg caacatctcc cccaccatca accaggccaa cattgtggac ttgcacccgg 3000
catctgtgta cagcatccgc atgtactctt tcaacaagat tggccgcagt gaaccaagca 3060
aggagctcac catcagcact gaggaggccg ctcccgatgg gccccccatg gatgttacct 3120
tgcagccagt gacctcacag agcatccagg tgacctggaa ggcacccaag aaggagctgc 3180
agaacggtgt catccggggc taccagattg gctacagaga gaacagcccc ggcagcaacg 3240
ggcagtacag catcgtggag atgaaggcca cgggggacag cgaggtctac accctggaca 3300
acctcaagaa gttcgcccag tatggggtgg tggtccaagc cttcaatcgg gctggcacgg 3360
ggccctcttc cagcgagatc aatgccacca ctctggagga tgtgcccagc cagccccctg 3420
agaacgtccg ggccctgtcc atcacttctg acgtggccgt catctcctgg tcagagcccc 3480
cgcgcagcac cctcaatggc gtcctcaaag gctatcgggt catcttctgg tccctctatg 3540
ttgatgggga gtggggcgag atgcagaaca tcaccaccac gcgggagcgg gtggagctgc 3600
ggggcatgga gaagttcacc aactacagcg tccaggtgct ggcctacacc caggctgggg 3660
acggcgtacg cagcagtgtg ctctacatcc agaccaagga ggacgttcca ggtccccctg 3720
ctggcatcaa agctgtccct tcatcagcta gcagtgtggt tgtgtcttgg ctccccccta 3780
ccaagcccaa cggggtgatc cgcaagtaca ccatcttctg ttccagcccc gggtctggcc 3840
agccggctcc cagcgagtac gagacgagtc cagagcagct cttctaccgg atcgcccacc 3900
taaaccgcgg tcagcagtat ctgctgtggg tggccgccgt cacctctgcc ggccggggca 3960
acagcagcga gaaggtgacc atcgagcctg ctggcaaggc cccagcaaag atcatctcct 4020
ttgggggcac cgtgacaaca ccttggatga aagatgttcg gctgccttgc aattcagtgg 4080
gagatccagc ccctgctgtg aagtggacca aggacagtga agactcggcc attccagtgt 4140
ccatggatgg gcaccggctc atccacacca atggcacact gctgctgcgt gcagtgaagg 4200
ctgaggactc tggctactac acgtgcacgg ccaccaacac tggtggcttt gacaccatca 4260
tcgtcaacct tctggtgcaa gttcccccgg accagccccg cctcactgtc tccaaaacct 4320
cagcttcgtc catcaccctg acctggattc caggtgacaa tgggggcagc tccatccgag 4380
gcttcgtgct acagtactcg gtggacaaca gcgaggagtg gaaggatgtg ttcatcagct 4440
ccagcgagcg ctccttcaag ctggacagcc tcaagtgtgg cacgtggtac aaggtgaagc 4500
tggcagccaa gaacagcgtg ggctctgggc gcatcagcga gatcatcgag gccaagaccc 4560
acgggcggga gccctccttc agcaaagacc aacacctctt cacccacatc aactccacgc 4620
atgctcggct taacctgcag ggctggaaca atgggggctg ccctatcaca gccatcgttc 4680
tggagtaccg gcccaagggg acctgggcct ggcagggcct ccgggccaac agctccgggg 4740
aggtgtttct gacggaactg cgagaggcca cgtggtacga gctgcgcatg agggcttgca 4800
acagtgcggg ctgcggcaat gaaacagccc agttcgccac cctggactac gatggcagca 4860
ccattccacc catcaagtct gctcaaggtg aaggggatga tgtgaagaag ctgttcacca 4920
tcggctgccc tgtcatcctg gccacactgg gggtggcact gctcttcatc gtacgcaaga 4980
agaggaagga gaaacggctg aagcgactcc gagatgcaaa gagtttggca gaaatgttga 5040
taagcaagaa caatagaagc tttgacaccc ctgtgaaagg gccaccccag ggcccacggc 5100
tacacattga catccccagg gtccagctgc tcatcgagga caaagaaggc atcaagcaac 5160
tgggagatga caaggccacc atccctgtga cagatgctga gttcagccaa gctgtcaacc 5220
cacagagctt ctgtactggc gtctccttgc accacccaac cctcatccag agcacaggac 5280
ccctcatcga catgtctgac atccggccag gaaccaatcc agtgtccagg aagaatgtga 5340
agtcagccca cagcacccgg aaccggtact caagccagtg gaccctgacc aagtgccagg 5400
cctccacacc tgcccgcacc ctcacctccg actggcgcac cgtgggctcc cagcatggtg 5460
tcacggtcac tgagagtgac agctacagtg ccagcctgtc ccaggacaca gacaaaggaa 5520
ggaacagcat ggtgtccact gagagtgcct cttccaccta cgaggagctg gcccgggcct 5580
atgagcatgc caagctggag gagcagctgc agcacgccaa gtttgagatc accgagtgct 5640
tcatctctga cagttcctct gaccagatga ccacaggcac caacgagaac gccgacagca 5700
tgacatccat gagcacaccc tcagagcctg gcatctgccg ctttaccgcc tcaccaccca 5760
agccccagga tgcggaccgg ggcaaaaacg tggctgtgcc catccctcac cgggccaaca 5820
agagtgacta ctgcaacctg cccctgtatg ccaagtcaga ggccttcttt cgaaaggcag 5880
atggacgtga gccctgcccc gtggtcccac cccgtgaggc ctccatccgg aacctggctc 5940
gaacctacca cacccaggct cgccacctga ccctggaccc tgccagcaag tccttgggcc 6000
ttccccaccc aggggccccc gctgccgcct ccacagccac cttacctcag aggactctgg 6060
ccatgccagc ccccccagcc ggcacagccc ccccagcccc cggccccacc cctgctgagc 6120
cacccaccgc ccccagcgct gcccctccgg cccccagcac cgagcctcca cgagccgggg 6180
gcccacacac caaaatgggg ggctccaggg actcgcttct cgagatgagc acatcggggg 6240
tagggaggtc tcagaagcag ggggccgggg cctactccaa atcctacacc ctggtgtagg 6300
gcccgcagga agagcagcca cgcctggacc gcgccgcgcc gcagccccac acgccagctc 6360
ggctgttttt ctgcattatt tatattcaac tgacagacaa aaaccaacca acgacaaaac 6420
aaaaaccccc aatcatgaac gcctgtacat agaactcttt tgtacaaatg aaactatttt 6480
cttcttctcc atgaagccag ggcacaaaga atttgacagt acaagtcaaa tcccccaccc 6540
cacaaaatat gtgtggagat atatatacat atatagacag acaggaacgc gtccacgagc 6600
tatatatcta tatatttctc tcaccctatt ttgagacaga ggcacaaaga ctcagcaatt 6660
tttttccctc ctcctcacct tccccccagt ctaggtggtt ttgacaaaga ccaaaatccc 6720
aactcagaga cactgcatgc gattttactg ttccaagaaa accaggagtt gcttcaattt 6780
gcagatgctt atgtgttaat acctttttct atgaaaaaag acccagcgcc gtgtgcaata 6840
aaggttatgt ttcta 6855
<![CDATA[<210> 904]]>
<![CDATA[<211> 15]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 904]]>
Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys
1 5 10 15
<![CDATA[<210> 905]]>
<![CDATA[<211> 105]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 905]]>
Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu
1 5 10 15
Gln Gln Val Thr Phe Ser Ser Ser Val Gly Val Val Val Pro Cys Pro
20 25 30
Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly
35 40 45
Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly
50 55 60
Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile
65 70 75 80
His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile
85 90 95
Arg Ser Pro Asn Ile Arg Val Lys Ala
100 105
<![CDATA[<210> 906]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 906]]>
Phe Arg Glu Pro Tyr Thr Val Arg Val Glu Asp Gln Arg Ser Met Arg
1 5 10 15
Gly Asn Val Ala Val Phe Lys Cys Leu Ile Pro Ser Ser Val Gln Glu
20 25 30
Tyr Val Ser Val Val Ser Trp Glu Lys Asp Thr Val Ser Ile Ile Pro
35 40 45
<![CDATA[<210> 907]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213]]>> 人類]]>
<br/>
<br/><![CDATA[<400> 907]]>
<br/>
<br/><![CDATA[His Arg Phe Phe Ile Thr Tyr His Gly Gly Leu Tyr Ile Ser Asp Val
1 5 10 15
Gln Lys Glu Asp Ala Leu Ser Thr Tyr Arg Cys Ile Thr Lys His Lys
20 25 30
Tyr Ser Gly Glu Thr Arg Gln Ser Asn Gly Ala Arg Leu Ser Val Thr
35 40 45
<![CDATA[<210> 908]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 908]]>
Pro Ala Glu Ser Ile Pro Thr Ile Leu Asp Gly Phe His Ser Gln Glu
1 5 10 15
Val Trp Ala Gly His Thr Val Glu Leu Pro Cys Thr Ala Ser Gly Tyr
20 25 30
Pro Ile Pro Ala Ile Arg Trp Leu Lys Asp Gly Arg Pro Leu Pro Ala
35 40 45
Asp Ser Arg Trp Thr Lys Arg Ile Thr Gly Leu Thr Ile Ser Asp Leu
50 55 60
Arg Thr Glu Asp Ser Gly Thr Tyr Ile Cys Glu Val Thr Asn Thr Phe
65 70 75 80
Gly Ser Ala Glu Ala Thr Gly Ile Leu Met Val Ile
85 90
<![CDATA[<210> 909]]>
<![CDATA[<211> 91]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 909]]>
Pro Leu His Val Thr Leu Thr Pro Lys Lys Leu Lys Thr Gly Ile Gly
1 5 10 15
Ser Thr Val Ile Leu Ser Cys Ala Leu Thr Gly Ser Pro Glu Phe Thr
20 25 30
Ile Arg Trp Tyr Arg Asn Thr Glu Leu Val Leu Pro Asp Glu Ala Ile
35 40 45
Ser Ile Arg Gly Leu Ser Asn Glu Thr Leu Leu Ile Thr Ser Ala Gln
50 55 60
Lys Ser His Ser Gly Ala Tyr Gln Cys Phe Ala Thr Arg Lys Ala Gln
65 70 75 80
Thr Ala Gln Asp Phe Ala Ile Ile Ala Leu Glu
85 90
<![CDATA[<210> 910]]>
<![CDATA[<211> 98]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 910]]>
Gly Thr Pro Arg Ile Val Ser Ser Phe Ser Glu Lys Val Val Asn Pro
1 5 10 15
Gly Glu Gln Phe Ser Leu Met Cys Ala Ala Lys Gly Ala Pro Pro Pro
20 25 30
Thr Val Thr Trp Ala Leu Asp Asp Glu Pro Ile Val Arg Asp Gly Ser
35 40 45
His Arg Thr Asn Gln Tyr Thr Met Ser Asp Gly Thr Thr Ile Ser His
50 55 60
Met Asn Val Thr Gly Pro Gln Ile Arg Asp Gly Gly Val Tyr Arg Cys
65 70 75 80
Thr Ala Arg Asn Leu Val Gly Ser Ala Glu Tyr Gln Ala Arg Ile Asn
85 90 95
Val Arg
<![CDATA[<210> 911]]>
<![CDATA[<211> 90]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> ]]> 911
Pro Pro Ser Ile Arg Ala Met Arg Asn Ile Thr Ala Val Ala Gly Arg
1 5 10 15
Asp Thr Leu Ile Asn Cys Arg Val Ile Gly Tyr Pro Tyr Tyr Ser Ile
20 25 30
Lys Trp Tyr Lys Asp Ala Leu Leu Leu Pro Asp Asn His Arg Gln Val
35 40 45
Val Phe Glu Asn Gly Thr Leu Lys Leu Thr Asp Val Gln Lys Gly Met
50 55 60
Asp Glu Gly Glu Tyr Leu Cys Ser Val Leu Ile Gln Pro Gln Leu Ser
65 70 75 80
Ile Ser Gln Ser Val His Val Ala Val Lys
85 90
<![CDATA[<210> 912]]>
<![CDATA[<211> 92]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 912]]>
Pro Pro Leu Ile Gln Pro Phe Glu Phe Pro Pro Ala Ser Ile Gly Gln
1 5 10 15
Leu Leu Tyr Ile Pro Cys Val Val Ser Ser Gly Asp Met Pro Ile Arg
20 25 30
Ile Thr Trp Arg Lys Asp Gly Gln Val Ile Ile Ser Gly Ser Gly Val
35 40 45
Thr Ile Glu Ser Lys Glu Phe Met Ser Ser Leu Gln Ile Ser Ser Val
50 55 60
Ser Leu Lys His Asn Gly Asn Tyr Thr Cys Ile Ala Ser Asn Ala Ala
65 70 75 80
Ala Thr Val Ser Arg Glu Arg Gln Leu Ile Val Arg
85 90
<![CDATA[<210> 913]]>
<![CDATA[<211> 39]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 913]]>
Pro Pro Arg Phe Val Val Gln Pro Asn Asn Gln Asp Gly Ile Tyr Gly
1 5 10 15
Lys Ala Gly Val Leu Asn Cys Ser Val Asp Gly Tyr Pro Pro Pro Lys
20 25 30
Val Met Trp Lys His Ala Lys
35
<![CDATA[<210> 914]]>
<![CDATA[<211> 58]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 914]]>
Ser Gly Asn Pro Gln Gln Tyr His Pro Val Pro Leu Thr Gly Arg Ile
1 5 10 15
Gln Ile Leu Pro Asn Ser Ser Leu Leu Ile Arg His Val Leu Glu Glu
20 25 30
Asp Ile Gly Tyr Tyr Leu Cys Gln Ala Ser Asn Gly Val Gly Thr Asp
35 40 45
Ile Ser Lys Ser Met Phe Leu Thr Val Lys
50 55
<![CDATA[<210> 915]]>
<![CDATA[<211> 66]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 915]]>
Pro Ala Met Ile Thr Ser His Pro Asn Thr Thr Ile Ala Ile Lys Gly
1 5 10 15
His Ala Lys Glu Leu Asn Cys Thr Ala Arg Gly Glu Arg Pro Ile Ile
20 25 30
Ile Arg Trp Glu Lys Gly Asp Thr Val Ile Asp Pro Asp Arg Val Met
35 40 45
Arg Tyr Ala Ile Ala Thr Lys Asp Asn Gly Asp Glu Val Val Ser Thr
50 55 60
Leu Lys
65
<![CDATA[<210> 916]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 916]]>
Leu Lys Pro Ala Asp Arg Gly Asp Ser Val Phe Phe Ser Cys His Ala
1 5 10 15
Ile Asn Ser Tyr Gly Glu Asp Arg Gly Leu Ile Gln Leu Thr Val Gln
20 25 30
<![CDATA[<210> 917]]>
<![CDATA[<211> 42]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 917]]>
Pro Pro Asp Pro Pro Glu Leu Glu Ile Arg Glu Val Lys Ala Arg Ser
1 5 10 15
Met Asn Leu Arg Trp Thr Gln Arg Phe Asp Gly Asn Ser Ile Ile Thr
20 25 30
Gly Phe Asp Ile Glu Tyr Lys Asn Lys Ser
35 40
<![CDATA[<210> 918]]>
<![CDATA[<211> 55]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 918]]>
Ser Trp Asp Phe Lys Gln Ser Thr Arg Asn Ile Ser Pro Thr Ile Asn
1 5 10 15
Gln Ala Asn Ile Val Asp Leu His Pro Ala Ser Val Tyr Ser Ile Arg
20 25 30
Met Tyr Ser Phe Asn Lys Ile Gly Arg Ser Glu Pro Ser Lys Glu Leu
35 40 45
Thr Ile Ser Thr Glu Glu Ala
50 55
<![CDATA[<210> 919]]>
<![CDATA[<211> 23]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 919]]>
Pro Asp Gly Pro Pro Met Asp Val Thr Leu Gln Pro Val Thr Ser Gln
1 5 10 15
Ser Ile Gln Val Thr Trp Lys
20
<![CDATA[<210> 920]]>
<![CDATA[<211> 80]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 920]]>
Ala Pro Lys Lys Glu Leu Gln Asn Gly Val Ile Arg Gly Tyr Gln Ile
1 5 10 15
Gly Tyr Arg Glu Asn Ser Pro Gly Ser Asn Gly Gln Tyr Ser Ile Val
20 25 30
Glu Met Lys Ala Thr Gly Asp Ser Glu Val Tyr Thr Leu Asp Asn Leu
35 40 45
Lys Lys Phe Ala Gln Tyr Gly Val Val Val Gln Ala Phe Asn Arg Ala
50 55 60
Gly Thr Gly Pro Ser Ser Ser Glu Ile Asn Ala Thr Thr Leu Glu Asp
65 70 75 80
<![CDATA[<210> 921]]>
<![CDATA[<211> 48]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 921]]>
Pro Ser Gln Pro Pro Glu Asn Val Arg Ala Leu Ser Ile Thr Ser Asp
1 5 10 15
Val Ala Val Ile Ser Trp Ser Glu Pro Pro Arg Ser Thr Leu Asn Gly
20 25 30
Val Leu Lys Gly Tyr Arg Val Ile Phe Trp Ser Leu Tyr Val Asp Gly
35 40 45
<![CDATA[<210> 922]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 922]]>
Trp Gly Glu Met Gln Asn Ile Thr Thr Thr Arg Glu Arg Val Glu Leu
1 5 10 15
Arg Gly Met Glu Lys Phe Thr Asn Tyr Ser Val Gln Val Leu Ala Tyr
20 25 30
Thr Gln Ala Gly Asp Gly Val Arg Ser Ser Val Leu Tyr Ile Gln Thr
35 40 45
Lys Glu Asp
50
<![CDATA[<210> 923]]>
<![CDATA[<211> 46]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 923]]>
Pro Gly Pro Pro Ala Gly Ile Lys Ala Val Pro Ser Ser Ala Ser Ser
1 5 10 15
Val Val Val Ser Trp Leu Pro Pro Thr Lys Pro Asn Gly Val Ile Arg
20 25 30
Lys Tyr Thr Ile Phe Cys Ser Ser Pro Gly Ser Gly Gln Pro
35 40 45
<![CDATA[<210> 924]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 924]]>
Ala Pro Ser Glu Tyr Glu Thr Ser Pro Glu Gln Leu Phe Tyr Arg Ile
1 5 10 15
Ala His Leu Asn Arg Gly Gln Gln Tyr Leu Leu Trp Val Ala Ala Val
20 25 30
Thr Ser Ala Gly Arg Gly Asn Ser Ser Glu Lys Val Thr Ile Glu Pro
35 40 45
Ala Gly Lys
50
<![CDATA[<210> 925]]>
<![CDATA[<211> 38]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 925]]>
Pro Ala Lys Ile Ile Ser Phe Gly Gly Thr Val Thr Thr Pro Trp Met
1 5 10 15
Lys Asp Val Arg Leu Pro Cys Asn Ser Val Gly Asp Pro Ala Pro Ala
20 25 30
Val Lys Trp Thr Lys Asp
35
<![CDATA[<210> 926]]>
<![CDATA[<211> 54]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 926]]>
Glu Asp Ser Ala Ile Pro Val Ser Met Asp Gly His Arg Leu Ile His
1 5 10 15
Thr Asn Gly Thr Leu Leu Leu Arg Ala Val Lys Ala Glu Asp Ser Gly
20 25 30
Tyr Tyr Thr Cys Thr Ala Thr Asn Thr Gly Gly Phe Asp Thr Ile Ile
35 40 45
Val Asn Leu Leu Val Gln
50
<![CDATA[<210> 927]]>
<![CDATA[<211> 32]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 927]]>
Pro Pro Asp Gln Pro Arg Leu Thr Val Ser Lys Thr Ser Ala Ser Ser
1 5 10 15
Ile Thr Leu Thr Trp Ile Pro Gly Asp Asn Gly Gly Ser Ser Ile Arg
20 25 30
<![CDATA[<210> 928]]>
<![CDATA[<211> 62]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 928]]>
Phe Val Leu Gln Tyr Ser Val Asp Asn Ser Glu Glu Trp Lys Asp Val
1 5 10 15
Phe Ile Ser Ser Ser Glu Arg Ser Phe Lys Leu Asp Ser Leu Lys Cys
20 25 30
Gly Thr Trp Tyr Lys Val Lys Leu Ala Ala Lys Asn Ser Val Gly Ser
35 40 45
Gly Arg Ile Ser Glu Ile Ile Glu Ala Lys Thr His Gly Arg
50 55 60
<![CDATA[<210> 929]]>
<![CDATA[<211> 95]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 929]]>
Pro Ser Phe Ser Lys Asp Gln His Leu Phe Thr His Ile Asn Ser Thr
1 5 10 15
His Ala Arg Leu Asn Leu Gln Gly Trp Asn Asn Gly Gly Cys Pro Ile
20 25 30
Thr Ala Ile Val Leu Glu Tyr Arg Pro Lys Gly Thr Trp Ala Trp Gln
35 40 45
Gly Leu Arg Ala Asn Ser Ser Gly Glu Val Phe Leu Thr Glu Leu Arg
50 55 60
Glu Ala Thr Trp Tyr Glu Leu Arg Met Arg Ala Cys Asn Ser Ala Gly
65 70 75 80
Cys Gly Asn Glu Thr Ala Gln Phe Ala Thr Leu Asp Tyr Asp Gly
85 90 95
<![CDATA[<210> 930]]>
<![CDATA[<211> 51]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 930]]>
Thr Ile Pro Pro Ile Lys Ser Ala Gln Gly Glu Gly Asp Asp Val Lys
1 5 10 15
Lys Leu Phe Thr Ile Gly Cys Pro Val Ile Leu Ala Thr Leu Gly Val
20 25 30
Ala Leu Leu Phe Ile Val Arg Lys Lys Arg Lys Glu Lys Arg Leu Lys
35 40 45
Arg Leu Arg
50
<![CDATA[<210> 931]]>
<![CDATA[<211> 9]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 931]]>
Ala Lys Ser Leu Ala Glu Met Leu Ile
1 5
<![CDATA[<210> 932]]>
<![CDATA[<211> 39]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 932]]>
Lys Asn Asn Arg Ser Phe Asp Thr Pro Val Lys Gly Pro Pro Gln Gly
1 5 10 15
Pro Arg Leu His Ile Asp Ile Pro Arg Val Gln Leu Leu Ile Glu Asp
20 25 30
Lys Glu Gly Ile Lys Gln Leu
35
<![CDATA[<210> 933]]>
<![CDATA[<211> 50]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 933]]>
Asp Asp Lys Ala Thr Ile Pro Val Thr Asp Ala Glu Phe Ser Gln Ala
1 5 10 15
Val Asn Pro Gln Ser Phe Cys Thr Gly Val Ser Leu His His Pro Thr
20 25 30
Leu Ile Gln Ser Thr Gly Pro Leu Ile Asp Met Ser Asp Ile Arg Pro
35 40 45
Gly Thr
50
<![CDATA[<210> 934]]>
<![CDATA[<211> 64]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 934]]>
Pro Val Ser Arg Lys Asn Val Lys Ser Ala His Ser Thr Arg Asn Arg
1 5 10 15
Tyr Ser Ser Gln Trp Thr Leu Thr Lys Cys Gln Ala Ser Thr Pro Ala
20 25 30
Arg Thr Leu Thr Ser Asp Trp Arg Thr Val Gly Ser Gln His Gly Val
35 40 45
Thr Val Thr Glu Ser Asp Ser Tyr Ser Ala Ser Leu Ser Gln Asp Thr
50 55 60
<![CDATA[<210> 935]]>
<![CDATA[<211> 103]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 935]]>
Lys Gly Arg Asn Ser Met Val Ser Thr Glu Ser Ala Ser Ser Thr Tyr
1 5 10 15
Glu Glu Leu Ala Arg Ala Tyr Glu His Ala Lys Leu Glu Glu Gln Leu
20 25 30
Gln His Ala Lys Phe Glu Ile Thr Glu Cys Phe Ile Ser Asp Ser Ser
35 40 45
Ser Asp Gln Met Thr Thr Gly Thr Asn Glu Asn Ala Asp Ser Met Thr
50 55 60
Ser Met Ser Thr Pro Ser Glu Pro Gly Ile Cys Arg Phe Thr Ala Ser
65 70 75 80
Pro Pro Lys Pro Gln Asp Ala Asp Arg Gly Lys Asn Val Ala Val Pro
85 90 95
Ile Pro His Arg Ala Asn Lys
100
<![CDATA[<210> 936]]>
<![CDATA[<211> 157]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 936]]>
Asp Tyr Cys Asn Leu Pro Leu Tyr Ala Lys Ser Glu Ala Phe Phe Arg
1 5 10 15
Lys Ala Asp Gly Arg Glu Pro Cys Pro Val Val Pro Pro Arg Glu Ala
20 25 30
Ser Ile Arg Asn Leu Ala Arg Thr Tyr His Thr Gln Ala Arg His Leu
35 40 45
Thr Leu Asp Pro Ala Ser Lys Ser Leu Gly Leu Pro His Pro Gly Ala
50 55 60
Pro Ala Ala Ala Ser Thr Ala Thr Leu Pro Gln Arg Thr Leu Ala Met
65 70 75 80
Pro Ala Pro Pro Ala Gly Thr Ala Pro Pro Ala Pro Gly Pro Thr Pro
85 90 95
Ala Glu Pro Pro Thr Ala Pro Ser Ala Ala Pro Pro Ala Pro Ser Thr
100 105 110
Glu Pro Pro Arg Ala Gly Gly Pro His Thr Lys Met Gly Gly Ser Arg
115 120 125
Asp Ser Leu Leu Glu Met Ser Thr Ser Gly Val Gly Arg Ser Gln Lys
130 135 140
Gln Gly Ala Gly Ala Tyr Ser Lys Ser Tyr Thr Leu Val
145 150 155
<![CDATA[<210> 937]]>
<![CDATA[<211> 2053]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 937]]>
Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys Ala
1 5 10 15
Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu
20 25 30
Gln Gln Val Thr Phe Ser Ser Ser Val Gly Val Val Val Pro Cys Pro
35 40 45
Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly
50 55 60
Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly
65 70 75 80
Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile
85 90 95
His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile
100 105 110
Arg Ser Pro Asn Ile Arg Val Lys Ala Val Phe Arg Glu Pro Tyr Thr
115 120 125
Val Arg Val Glu Asp Gln Arg Ser Met Arg Gly Asn Val Ala Val Phe
130 135 140
Lys Cys Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser
145 150 155 160
Trp Glu Lys Asp Thr Val Ser Ile Ile Pro Glu His Arg Phe Phe Ile
165 170 175
Thr Tyr His Gly Gly Leu Tyr Ile Ser Asp Val Gln Lys Glu Asp Ala
180 185 190
Leu Ser Thr Tyr Arg Cys Ile Thr Lys His Lys Tyr Ser Gly Glu Thr
195 200 205
Arg Gln Ser Asn Gly Ala Arg Leu Ser Val Thr Asp Pro Ala Glu Ser
210 215 220
Ile Pro Thr Ile Leu Asp Gly Phe His Ser Gln Glu Val Trp Ala Gly
225 230 235 240
His Thr Val Glu Leu Pro Cys Thr Ala Ser Gly Tyr Pro Ile Pro Ala
245 250 255
Ile Arg Trp Leu Lys Asp Gly Arg Pro Leu Pro Ala Asp Ser Arg Trp
260 265 270
Thr Lys Arg Ile Thr Gly Leu Thr Ile Ser Asp Leu Arg Thr Glu Asp
275 280 285
Ser Gly Thr Tyr Ile Cys Glu Val Thr Asn Thr Phe Gly Ser Ala Glu
290 295 300
Ala Thr Gly Ile Leu Met Val Ile Asp Pro Leu His Val Thr Leu Thr
305 310 315 320
Pro Lys Lys Leu Lys Thr Gly Ile Gly Ser Thr Val Ile Leu Ser Cys
325 330 335
Ala Leu Thr Gly Ser Pro Glu Phe Thr Ile Arg Trp Tyr Arg Asn Thr
340 345 350
Glu Leu Val Leu Pro Asp Glu Ala Ile Ser Ile Arg Gly Leu Ser Asn
355 360 365
Glu Thr Leu Leu Ile Thr Ser Ala Gln Lys Ser His Ser Gly Ala Tyr
370 375 380
Gln Cys Phe Ala Thr Arg Lys Ala Gln Thr Ala Gln Asp Phe Ala Ile
385 390 395 400
Ile Ala Leu Glu Asp Gly Thr Pro Arg Ile Val Ser Ser Phe Ser Glu
405 410 415
Lys Val Val Asn Pro Gly Glu Gln Phe Ser Leu Met Cys Ala Ala Lys
420 425 430
Gly Ala Pro Pro Pro Thr Val Thr Trp Ala Leu Asp Asp Glu Pro Ile
435 440 445
Val Arg Asp Gly Ser His Arg Thr Asn Gln Tyr Thr Met Ser Asp Gly
450 455 460
Thr Thr Ile Ser His Met Asn Val Thr Gly Pro Gln Ile Arg Asp Gly
465 470 475 480
Gly Val Tyr Arg Cys Thr Ala Arg Asn Leu Val Gly Ser Ala Glu Tyr
485 490 495
Gln Ala Arg Ile Asn Val Arg Gly Pro Pro Ser Ile Arg Ala Met Arg
500 505 510
Asn Ile Thr Ala Val Ala Gly Arg Asp Thr Leu Ile Asn Cys Arg Val
515 520 525
Ile Gly Tyr Pro Tyr Tyr Ser Ile Lys Trp Tyr Lys Asp Ala Leu Leu
530 535 540
Leu Pro Asp Asn His Arg Gln Val Val Phe Glu Asn Gly Thr Leu Lys
545 550 555 560
Leu Thr Asp Val Gln Lys Gly Met Asp Glu Gly Glu Tyr Leu Cys Ser
565 570 575
Val Leu Ile Gln Pro Gln Leu Ser Ile Ser Gln Ser Val His Val Ala
580 585 590
Val Lys Val Pro Pro Leu Ile Gln Pro Phe Glu Phe Pro Pro Ala Ser
595 600 605
Ile Gly Gln Leu Leu Tyr Ile Pro Cys Val Val Ser Ser Gly Asp Met
610 615 620
Pro Ile Arg Ile Thr Trp Arg Lys Asp Gly Gln Val Ile Ile Ser Gly
625 630 635 640
Ser Gly Val Thr Ile Glu Ser Lys Glu Phe Met Ser Ser Leu Gln Ile
645 650 655
Ser Ser Val Ser Leu Lys His Asn Gly Asn Tyr Thr Cys Ile Ala Ser
660 665 670
Asn Ala Ala Ala Thr Val Ser Arg Glu Arg Gln Leu Ile Val Arg Val
675 680 685
Pro Pro Arg Phe Val Val Gln Pro Asn Asn Gln Asp Gly Ile Tyr Gly
690 695 700
Lys Ala Gly Val Leu Asn Cys Ser Val Asp Gly Tyr Pro Pro Pro Lys
705 710 715 720
Val Met Trp Lys His Ala Lys Gly Ser Gly Asn Pro Gln Gln Tyr His
725 730 735
Pro Val Pro Leu Thr Gly Arg Ile Gln Ile Leu Pro Asn Ser Ser Leu
740 745 750
Leu Ile Arg His Val Leu Glu Glu Asp Ile Gly Tyr Tyr Leu Cys Gln
755 760 765
Ala Ser Asn Gly Val Gly Thr Asp Ile Ser Lys Ser Met Phe Leu Thr
770 775 780
Val Lys Ile Pro Ala Met Ile Thr Ser His Pro Asn Thr Thr Ile Ala
785 790 795 800
Ile Lys Gly His Ala Lys Glu Leu Asn Cys Thr Ala Arg Gly Glu Arg
805 810 815
Pro Ile Ile Ile Arg Trp Glu Lys Gly Asp Thr Val Ile Asp Pro Asp
820 825 830
Arg Val Met Arg Tyr Ala Ile Ala Thr Lys Asp Asn Gly Asp Glu Val
835 840 845
Val Ser Thr Leu Lys Leu Lys Pro Ala Asp Arg Gly Asp Ser Val Phe
850 855 860
Phe Ser Cys His Ala Ile Asn Ser Tyr Gly Glu Asp Arg Gly Leu Ile
865 870 875 880
Gln Leu Thr Val Gln Glu Pro Pro Asp Pro Pro Glu Leu Glu Ile Arg
885 890 895
Glu Val Lys Ala Arg Ser Met Asn Leu Arg Trp Thr Gln Arg Phe Asp
900 905 910
Gly Asn Ser Ile Ile Thr Gly Phe Asp Ile Glu Tyr Lys Asn Lys Ser
915 920 925
Asp Ser Trp Asp Phe Lys Gln Ser Thr Arg Asn Ile Ser Pro Thr Ile
930 935 940
Asn Gln Ala Asn Ile Val Asp Leu His Pro Ala Ser Val Tyr Ser Ile
945 950 955 960
Arg Met Tyr Ser Phe Asn Lys Ile Gly Arg Ser Glu Pro Ser Lys Glu
965 970 975
Leu Thr Ile Ser Thr Glu Glu Ala Ala Pro Asp Gly Pro Pro Met Asp
980 985 990
Val Thr Leu Gln Pro Val Thr Ser Gln Ser Ile Gln Val Thr Trp Lys
995 1000 1005
Ala Pro Lys Lys Glu Leu Gln Asn Gly Val Ile Arg Gly Tyr Gln
1010 1015 1020
Ile Gly Tyr Arg Glu Asn Ser Pro Gly Ser Asn Gly Gln Tyr Ser
1025 1030 1035
Ile Val Glu Met Lys Ala Thr Gly Asp Ser Glu Val Tyr Thr Leu
1040 1045 1050
Asp Asn Leu Lys Lys Phe Ala Gln Tyr Gly Val Val Val Gln Ala
1055 1060 1065
Phe Asn Arg Ala Gly Thr Gly Pro Ser Ser Ser Glu Ile Asn Ala
1070 1075 1080
Thr Thr Leu Glu Asp Val Pro Ser Gln Pro Pro Glu Asn Val Arg
1085 1090 1095
Ala Leu Ser Ile Thr Ser Asp Val Ala Val Ile Ser Trp Ser Glu
1100 1105 1110
Pro Pro Arg Ser Thr Leu Asn Gly Val Leu Lys Gly Tyr Arg Val
1115 1120 1125
Ile Phe Trp Ser Leu Tyr Val Asp Gly Glu Trp Gly Glu Met Gln
1130 1135 1140
Asn Ile Thr Thr Thr Arg Glu Arg Val Glu Leu Arg Gly Met Glu
1145 1150 1155
Lys Phe Thr Asn Tyr Ser Val Gln Val Leu Ala Tyr Thr Gln Ala
1160 1165 1170
Gly Asp Gly Val Arg Ser Ser Val Leu Tyr Ile Gln Thr Lys Glu
1175 1180 1185
Asp Val Pro Gly Pro Pro Ala Gly Ile Lys Ala Val Pro Ser Ser
1190 1195 1200
Ala Ser Ser Val Val Val Ser Trp Leu Pro Pro Thr Lys Pro Asn
1205 1210 1215
Gly Val Ile Arg Lys Tyr Thr Ile Phe Cys Ser Ser Pro Gly Ser
1220 1225 1230
Gly Gln Pro Ala Pro Ser Glu Tyr Glu Thr Ser Pro Glu Gln Leu
1235 1240 1245
Phe Tyr Arg Ile Ala His Leu Asn Arg Gly Gln Gln Tyr Leu Leu
1250 1255 1260
Trp Val Ala Ala Val Thr Ser Ala Gly Arg Gly Asn Ser Ser Glu
1265 1270 1275
Lys Val Thr Ile Glu Pro Ala Gly Lys Ala Pro Ala Lys Ile Ile
1280 1285 1290
Ser Phe Gly Gly Thr Val Thr Thr Pro Trp Met Lys Asp Val Arg
1295 1300 1305
Leu Pro Cys Asn Ser Val Gly Asp Pro Ala Pro Ala Val Lys Trp
1310 1315 1320
Thr Lys Asp Ser Glu Asp Ser Ala Ile Pro Val Ser Met Asp Gly
1325 1330 1335
His Arg Leu Ile His Thr Asn Gly Thr Leu Leu Leu Arg Ala Val
1340 1345 1350
Lys Ala Glu Asp Ser Gly Tyr Tyr Thr Cys Thr Ala Thr Asn Thr
1355 1360 1365
Gly Gly Phe Asp Thr Ile Ile Val Asn Leu Leu Val Gln Val Pro
1370 1375 1380
Pro Asp Gln Pro Arg Leu Thr Val Ser Lys Thr Ser Ala Ser Ser
1385 1390 1395
Ile Thr Leu Thr Trp Ile Pro Gly Asp Asn Gly Gly Ser Ser Ile
1400 1405 1410
Arg Gly Phe Val Leu Gln Tyr Ser Val Asp Asn Ser Glu Glu Trp
1415 1420 1425
Lys Asp Val Phe Ile Ser Ser Ser Glu Arg Ser Phe Lys Leu Asp
1430 1435 1440
Ser Leu Lys Cys Gly Thr Trp Tyr Lys Val Lys Leu Ala Ala Lys
1445 1450 1455
Asn Ser Val Gly Ser Gly Arg Ile Ser Glu Ile Ile Glu Ala Lys
1460 1465 1470
Thr His Gly Arg Glu Pro Ser Phe Ser Lys Asp Gln His Leu Phe
1475 1480 1485
Thr His Ile Asn Ser Thr His Ala Arg Leu Asn Leu Gln Gly Trp
1490 1495 1500
Asn Asn Gly Gly Cys Pro Ile Thr Ala Ile Val Leu Glu Tyr Arg
1505 1510 1515
Pro Lys Gly Thr Trp Ala Trp Gln Gly Leu Arg Ala Asn Ser Ser
1520 1525 1530
Gly Glu Val Phe Leu Thr Glu Leu Arg Glu Ala Thr Trp Tyr Glu
1535 1540 1545
Leu Arg Met Arg Ala Cys Asn Ser Ala Gly Cys Gly Asn Glu Thr
1550 1555 1560
Ala Gln Phe Ala Thr Leu Asp Tyr Asp Gly Ser Thr Ile Pro Pro
1565 1570 1575
Ile Lys Ser Ala Gln Gly Glu Gly Asp Asp Val Lys Lys Leu Phe
1580 1585 1590
Thr Ile Gly Cys Pro Val Ile Leu Ala Thr Leu Gly Val Ala Leu
1595 1600 1605
Leu Phe Ile Val Arg Lys Lys Arg Lys Glu Lys Arg Leu Lys Arg
1610 1615 1620
Leu Arg Asp Ala Lys Ser Leu Ala Glu Met Leu Ile Ser Lys Asn
1625 1630 1635
Asn Arg Ser Phe Asp Thr Pro Val Lys Gly Pro Pro Gln Gly Pro
1640 1645 1650
Arg Leu His Ile Asp Ile Pro Arg Val Gln Leu Leu Ile Glu Asp
1655 1660 1665
Lys Glu Gly Ile Lys Gln Leu Gly Asp Asp Lys Ala Thr Ile Pro
1670 1675 1680
Val Thr Asp Ala Glu Phe Ser Gln Ala Val Asn Pro Gln Ser Phe
1685 1690 1695
Cys Thr Gly Val Ser Leu His His Pro Thr Leu Ile Gln Ser Thr
1700 1705 1710
Gly Pro Leu Ile Asp Met Ser Asp Ile Arg Pro Gly Thr Asn Pro
1715 1720 1725
Val Ser Arg Lys Asn Val Lys Ser Ala His Ser Thr Arg Asn Arg
1730 1735 1740
Tyr Ser Ser Gln Trp Thr Leu Thr Lys Cys Gln Ala Ser Thr Pro
1745 1750 1755
Ala Arg Thr Leu Thr Ser Asp Trp Arg Thr Val Gly Ser Gln His
1760 1765 1770
Gly Val Thr Val Thr Glu Ser Asp Ser Tyr Ser Ala Ser Leu Ser
1775 1780 1785
Gln Asp Thr Asp Lys Gly Arg Asn Ser Met Val Ser Thr Glu Ser
1790 1795 1800
Ala Ser Ser Thr Tyr Glu Glu Leu Ala Arg Ala Tyr Glu His Ala
1805 1810 1815
Lys Leu Glu Glu Gln Leu Gln His Ala Lys Phe Glu Ile Thr Glu
1820 1825 1830
Cys Phe Ile Ser Asp Ser Ser Ser Asp Gln Met Thr Thr Gly Thr
1835 1840 1845
Asn Glu Asn Ala Asp Ser Met Thr Ser Met Ser Thr Pro Ser Glu
1850 1855 1860
Pro Gly Ile Cys Arg Phe Thr Ala Ser Pro Pro Lys Pro Gln Asp
1865 1870 1875
Ala Asp Arg Gly Lys Asn Val Ala Val Pro Ile Pro His Arg Ala
1880 1885 1890
Asn Lys Ser Asp Tyr Cys Asn Leu Pro Leu Tyr Ala Lys Ser Glu
1895 1900 1905
Ala Phe Phe Arg Lys Ala Asp Gly Arg Glu Pro Cys Pro Val Val
1910 1915 1920
Pro Pro Arg Glu Ala Ser Ile Arg Asn Leu Ala Arg Thr Tyr His
1925 1930 1935
Thr Gln Ala Arg His Leu Thr Leu Asp Pro Ala Ser Lys Ser Leu
1940 1945 1950
Gly Leu Pro His Pro Gly Ala Pro Ala Ala Ala Ser Thr Ala Thr
1955 1960 1965
Leu Pro Gln Arg Thr Leu Ala Met Pro Ala Pro Pro Ala Gly Thr
1970 1975 1980
Ala Pro Pro Ala Pro Gly Pro Thr Pro Ala Glu Pro Pro Thr Ala
1985 1990 1995
Pro Ser Ala Ala Pro Pro Ala Pro Ser Thr Glu Pro Pro Arg Ala
2000 2005 2010
Gly Gly Pro His Thr Lys Met Gly Gly Ser Arg Asp Ser Leu Leu
2015 2020 2025
Glu Met Ser Thr Ser Gly Val Gly Arg Ser Gln Lys Gln Gly Ala
2030 2035 2040
Gly Ala Tyr Ser Lys Ser Tyr Thr Leu Val
2045 2050
<![CDATA[<210> 938]]>
<![CDATA[<211> 648]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 938]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggaccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aagcccgccc tgaagatgtt ggcaccagcc tctactttgt aaatgactcc ttgcagcagg 240
tgaccttttc cagctccgtg ggggtggtgg tgccctgccc ggccgcgggc tcccccagcg 300
cggcccttcg atggtacctg gccacagggg acgacatcta cgacgtgccg cacatccggc 360
acgtccacgc caacgggacg ctgcagctct accccttctc cccctccgcc ttcaatagct 420
ttatccacga caatgactac ttctgcaccg cggagaacgc tgccggcaag atccggagcc 480
ccaacatccg cgtcaaagca gttttcaggg aaccctacac cgtccgggtg gaggatcaaa 540
ggtcaatgcg tggcaacgtg gccgtcttca agtgcctcat cccctcttca gtgcaggaat 600
atgttagcgt tgtatcttgg gagaaagaca cagtctccat catcccag 648
<![CDATA[<210> 939]]>
<![CDATA[<211> 170]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 939]]>
Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys Ala
1 5 10 15
Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu
20 25 30
Gln Gln Val Thr Phe Ser Ser Ser Val Gly Val Val Val Pro Cys Pro
35 40 45
Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly
50 55 60
Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly
65 70 75 80
Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile
85 90 95
His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile
100 105 110
Arg Ser Pro Asn Ile Arg Val Lys Ala Val Phe Arg Glu Pro Tyr Thr
115 120 125
Val Arg Val Glu Asp Gln Arg Ser Met Arg Gly Asn Val Ala Val Phe
130 135 140
Lys Cys Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser
145 150 155 160
Trp Glu Lys Asp Thr Val Ser Ile Ile Pro
165 170
<![CDATA[<210> 940]]>
<![CDATA[<211> 485]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 940]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgga 240
tgacttggaa gactccatga tcggccctga agttgtccat cccttggtgc ctctagataa 300
ccatatccct gagagggcag ggtctgggag ccaagtcccc accgaaccca agaaactaga 360
ggagaatgag gttatcccca agagaatctc acccgttgaa gagagtgagg atgtgtccaa 420
caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa cggaggtcct 480
ggcag 485
<![CDATA[<210> 941]]>
<![CDATA[<211> 148]]>
<![CDATA[<212> PRT]]>
<![CDATA[<213> 人類]]>
<![CDATA[<400> 941]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu Asp Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His
65 70 75 80
Pro Leu Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly
85 90 95
Ser Gln Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile
100 105 110
Pro Lys Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys
115 120 125
Val Ser Met Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr
130 135 140
Glu Val Leu Ala
145
<![CDATA[<210> 942]]>
<![CDATA[<211> 33]]>
<![CDATA[<212> DNA]]>
<![CDATA[<213> 人工序列]]>
<![CDATA[<220>]]>
<![CDATA[<223> T7啟動子]]>
<![CDATA[<400> 942]]>
cagagatgca taatacgact cactataggg aga 33
<![CDATA[ <110> MERUS N.V.]]>
<![CDATA[ <120> Novel NRG1 fusions, fusion junctions and methods of detecting them]]>
<![CDATA[ <140> TW 111120470]]>
<![CDATA[ <141> 2022-06-01]]>
<![CDATA[ <150> NL 2028384]]>
<![CDATA[ <151> 2021-06-03]]>
<![CDATA[ <150> NL 2030006]]>
<![CDATA[ <151> 2021-12-03]]>
<![CDATA[ <160> 942 ]]>
<![CDATA[ <170> PatentIn version 3.5]]>
<![CDATA[ <210> 1]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 1]]>
caggtcctga gcctcgagcc gcagcacgag ctcaaattcc gag 43
<![CDATA[ <210> 2]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 2]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt t 51
<![CDATA[ <210> 3]]>
<![CDATA[ <211> 94]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 3]]>
caggtcctga gcctcgagcc gcagcacgag ctcaaattcc gagccttgcc tccccgattg 60
aaagagatga aaagccagga atcggctgca ggtt 94
<![CDATA[ <210> 4]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 4]]>
Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu Lys Phe Arg Ala Leu
1 5 10 15
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
20 25 30
<![CDATA[ <210> 5]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 5]]>
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acg 53
<![CDATA[ <210> 6]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 6]]>
ctacatctac atccaccact gggacaagcc at 32
<![CDATA[ <210> 7]]>
<![CDATA[ <211> 85]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 7]]>
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acgctacatc 60
tacatccacc actgggacaa gccat 85
<![CDATA[ <210> 8]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 8]]>
Ala Ser Asn Ile Val Gly Lys Ala His Ser Asp Tyr Met Leu Tyr Val
1 5 10 15
Tyr Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His
20 25
<![CDATA[ <210> 9]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 9]]>
gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac ca 52
<![CDATA[ <210> 10]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 10]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaact 58
<![CDATA[ <210> 11]]>
<![CDATA[ <211> 110]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 11]]>
gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac caccttgcct 60
ccccgattga aagagatgaa aagccaggaa tcggctgcag gttccaaact 110
<![CDATA[ <210> 12]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 12]]>
Asp Glu Asp Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala
1 5 10 15
Thr Thr Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala
20 25 30
Ala Gly Ser Lys
35
<![CDATA[ <210> 13]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 13]]>
ccgcatcggc gaccttcagg ccttccaggg ccacggcgcg ggcaacctgg cgg 53
<![CDATA[ <210> 14]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 14]]>
catctacatc tacatccacc actgggacaa gccat 35
<![CDATA[ <210> 15]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 15]]>
ccgcatcggc gaccttcagg ccttccaggg ccacggcgcg ggcaacctgg cggcatctac 60
atctacatcc accactggga caagccat 88
<![CDATA[ <210> 16]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 16]]>
Arg Ile Gly Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu
1 5 10 15
Ala Ala Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His
20 25
<![CDATA[ <210> 17]]>
<![CDATA[ <211> 399]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 17]]>
agtccccgcc cctggagccg gcggcgcagg gcgcagcttc ccgccgccag agcgggccag 60
cctgctgcgt gcgtgcgtgt gtacgactct gcgtgcgtgc gtgcgtgcgt gcgtgccgtc 120
agctcgccgg gcaccgcggc ctcgccctcg ccctccgccc ctgcgcctgc accgcgtaga 180
ccgacccccc cccagcgcgc ccaccccggta gaggaccccc gcccgtgccc cgaccggtcc 240
ccgccttttt gtaaaactta aagcgggcgc agcattaacg cttcccgccc cggtgacctc 300
tcaggggtct ccccgccaaa ggtgctccgc cgctaaggaa catggcgaag gtggagcagg 360
tcctgagcct cgagccgcag cacgagctca aattccgag 399
<![CDATA[ <210> 18]]>
<![CDATA[ <211> 153]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 18]]>
gtcccttcac cgatgttgtc accaccaacc taaagcttgg caacccgaca gaccgaaatg 60
tgtgttttaa ggtgaagact acagcaccac gtaggtactg tgtgaggccc aacagcggaa 120
tcatcgatgc aggggcctca attaatgtat ctg 153
<![CDATA[ <210> 19]]>
<![CDATA[ <211> 104]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 19]]>
tgatgttaca gcctttcgat tatgatccca atgagaaaag taaacacaag tttatggttc 60
agtctatgtt tgctccaact gacacttcag atatggaagc agta 104
<![CDATA[ <210> 20]]>
<![CDATA[ <211> 81]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 20]]>
tggaaggagg caaaaccgga agaccttatg gattcaaaac ttagatgtgtgtttgaattg 60
ccagcagaga atgataaacc a 81
<![CDATA[ <210> 21]]>
<![CDATA[ <211> 177]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 21]]>
catgatgtag aaataaataa aattatatcc acaactgcat caaagacaga aacaccaata 60
gtgtctaagt ctctgagttc ttctttggat gacaccgaag ttaagaaggt tatggaagaa 120
tgtaagaggc tgcaaggtga agttcagagg ctacgggagg agaacaagca gttcaag 177
<![CDATA[ <210> 22]]>
<![CDATA[ <211> 7035]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 22]]>
gaagaagatg gactgcggat gaggaagaca gtgcagagca acagccccat ttcagcatta 60
gccccaactg ggaaggaaga aggccttagc acccggctct tggctctggt ggttttgttc 120
tttatcgttg gtgtaattat tgggaagatt gccttgtaga ggtagcatgc acaggatggt 180
aaattggatt ggtggatcca ccatatcatg ggatttaaat ttatcataac catgtgtaaa 240
aagaaattaa tgtatgatga catctcacag gtcttgcctt taaattaccc ctccctgcac 300
acacatacac agatacacac acacaaatat aatgtaacga tcttttagaa agttaaaaat 360
gtatagtaac tgattgaggg ggaaaagaat gatctttat aatgacaagg gaaaccatga 420
gtaatgccac aatggcatat tgtaaatgtc attttaaaca ttggtaggcc ttggtacatg 480
atgctggatt acctctctta aaatgacacc cttcctcgcc tgttggtgct ggcccttggg 540
gagctggagc ccagcatgct ggggagtgcg gtcagctcca cacagtagtc cccacgtggc 600
ccactcccgg cccaggctgc tttccgtgtc ttcagttctg tccaagccat cagctccttg 660
ggactgatga acagagtcag aagcccaaag gaattgcact gtggcagcat cagacgtact 720
cgtcataagt gagaggcgtg tgttgactga ttgacccagc gctttggaaa taaatggcag 780
tgctttgttc acttaaaggg accaagctaa atttgtattg gttcatgtag tgaagtcaaa 840
ctgttattca gagatgttta atgcatattt aacttattta atgtatttca tctcatgttt 900
tcttattgtc acaagagtac agttaatgct gcgtgctgct gaactctgtt gggtgaactg 960
gtattgctgc tggagggctg tgggctcctc tgtctctgga gagtctggtc atgtggaggt 1020
ggggtttatt gggatgctgg agaagagctg ccaggaagtg ttttttctgg gtcagtaaat 1080
aacaactgtc atagggaggg aaattctcag tagtgacagt caactctagg ttaccttttt 1140
taatgaagag tagtcagtct tctagattgt tcttatacca cctctcaacc attackcaca 1200
cttccagcgc ccaggtccaa gtctgagcct gacctcccct tggggaccta gcctggagtc 1260
aggacaaatg gatcgggctg cagagggtta gaagcgaggg caccagcagt tgtgggtggg 1320
gagcaaggga agagagaaac tcttcagcga atccttctag tactagttga gagtttgact 1380
gtgaattaat tttatgccat aaaagaccaa cccagttctg tttgactatg tagcatcttg 1440
aaaagaaaaa ttataataaa gccccaaaat taagaattct tttgtcattt tgtcacattt 1500
gctctatggg gggaattatt attttatcat ttttattatt ttgccattgg aaggttaact 1560
ttaaaatgag ccctatcact gagaaatacg tgtttcatga tttaactctg tgtgtgtgtg 1620
tgtgtgtgtg tgtgtgtgtg tgtgtatttt ttttttggtt gtcttcagct gacagtatga 1680
aaaatgaaac tgctgaaaaa gctgagcacc tggtcaccct tggccttcca ttgctttggc 1740
cttcagtaaa aagcagcctc ccttctaggt cagggaacca tgccattgag actagtaacg 1800
ggcgttctgg gcacagtccc actgtgcaca ggtttgagag gacaagttca tcagaaggaa 1860
ggcagtcctt agaagtcaca tacgttgagc cacgttgctc ctaagcctgg ctctgtcaag 1920
ctgggtcagg ggccttgaaa ctggagaagt ggaagtctat ggttggtctg agtaagtaac 1980
ttcctgtctt catgaaaaaa gttgactttg aatcccaggt actcacagaa atggtgaaca 2040
gacttagttg ttacccaggc acccatggat tgtgttgagt gtgcagacag ggaggccacc 2100
ccaataggaa ttcgtctcca ggatttttcc catgtgtccc ccagtactta taaaagggag 2160
tgaaaagacc gagctgtaag gcatgtgcct tctgccacct gactttccgt gaggggacta 2220
aaatttacta attgtagttg ctgcagccag ttaagtcctg tagcttccag gccctcatgt 2280
ctttgatagg agagtgctta ggtggtcccc aacagtgcct aggggtacag tacagtccca 2340
ttacactaga gcagggctct atttattttt aaaggatatg gccgtgtgtt ttgataaaac 2400
tttattcaca aaaacagccg ggtcatggga tttggcttgt gagtctgtaa cagttcttaa 2460
aaagaatatc tgagaaacta ctctgtttta gacctttgaa ggtgatttag agttttgtgt 2520
acatctagga gaaggtgttc agcttctcag aggatgtgga cattttggtt gcagctaaaa 2580
atcagtctct gaagtctctc tcccttctag aggttaggac ttggtgaaca tgtttgtggg 2640
ccttttgact gagtggcaga aggaaactgc tcaggaagag aaacaggtga ctgatgggaa 2700
ggttgattat tttctcagtc atcctggcag ccaaaaatgt gccaggaaaa gaaagaatgt 2760
ggagcacgcg tggctcctgg aggacttgga gatgcatgca catttagggt gttttcccta 2820
gaattacata atgaaaaaaa gaataaggca aagaggaggt gaatatgggg cctgtcacaa 2880
cggcctgccc tgccccaaga gggttaagag tcagataatc gggacgaaac tggcatggaa 2940
agagcgagcc tagggaggat gccgctgggc agtgtgcatg ggggagctgc tgccaggctg 3000
ccctccagtc tgctcctgtg gttactggct ccacagcacc tcagagaggg cggccctggc 3060
ttcagaaatg ccagccatag tgctcacaaa tgcagaagag atggaagcgg tgacagaatc 3120
ctgaaagttt ttattgattg aagttttaaa ttggtaactt aagcttcctt ggcacgatac 3180
aaaatacctc ttaaagacag caggcttttt tatttgtagg tgtgaggaac tggctttaac 3240
ttttttctcc tcctagtttg catgttttcc ttctctcgtc ttctgaactg ctggcaccag 3300
cagtaataca tactgataaa atcaaaattg atttttacca gtggccagtt tatggctaga 3360
gagaacgactt atacctccat aacacagaag ggggaaaaat gaagaacctc cagtgatccg 3420
tgaaaaccta aacgctttca aacaaatccc aggaacagaa ttgctatcga aagatatcat 3480
tgcccagttt gcaggctatg ttgagtcaga tagaactgaa tgtagtgaga gctcagagct 3540
acagagcctt tcagatgaat ttgaaaacag actctgtgtg tgtgtgcatg tgtgcatgtg 3600
tgcatgtgtg gcatatgtgc cgtatgtcag tagcttgaca gttttcaaat cgtgcctata 3660
tttttttgca tacacaaatt tttgtgtttg caaactcaga atccatgcca aaatacaatg 3720
ttatatgtca ttttcagctc cttctctaaa ggaatggccc atttctcatt gtagtttgag 3780
aaatacatgt atgaagagat aggggtcttg ggcttcccag tgtcactttg aacacctgaa 3840
taacatttaa ctcctgagac cttctcggtg tagaggccac tgcttccccc tgctggagat 3900
ggcatttcat tgaagggcct ctcgtggctt tccctgcccc cggctgtctg gcctgaagaa 3960
ggagaaagaa ccaaactgaa ctatgaaaag ttaccactct gaggagacct ctcttaatta 4020
acacttgggg ccatgtttgc tgttgttgag aaggagtgtt ctcaaagatg agctggaatg 4080
gaattgtatt tagaaaggcc cctgcaaagt atatagatgg atgactctag ttcatgacat 4140
acaaatccca taaggccaac gaccactctt ctggaacacc aagagcagct ctgagatcat 4200
gctggcccta cgcgaattga gtttctgtgg cctaattgga tttggagaac gccttccctg 4260
gccccttttc ctcagacaga tctgctctga taggaacctt ttcaagaaag ttactgttgt 4320
ttcaatgcca ctccttacct gtatagaaca tttccaatac attcgctcat tgaacttaat 4380
ccttgcaact gtgactgggg ggtagatggc tctgtttgca tacgaagaaa taaaggctcc 4440
aggagggttaa atcgggcaac tttttagaac taaatcagtc tctgtaaggc ctacattgct 4500
aagataccat ttcagctctg aaaatctgct tcagggaagt gagtggatga ggccttcctg 4560
cctcagctac tctgcccgtc tgtacatctt ttgtgtctgc ctccgtacct tattcagtta 4620
ttttcacact aaagtaagta gaattaagac tgtagttcag atgctttttc tttttctgtt 4680
ggaaactgaa cacactacag acagtgaaaa aaggtacata ttccattttc tcattgcctg 4740
aagatctctg ctgatgctcc tggagaatga ctttgggggc tttagaaaga atattgccag 4800
tccgtctcgg caaggagatg atgggagcgc tttatatgga ggctttacat gacttgtaaa 4860
ttaaatgtga atgagggcag ttgattaaaa ttggtattac agaagggccc tgctgaggtt 4920
tgaaaacagc tgagctgctg atgtctcagg cctttccctg aattagcact gcggttctcc 4980
aggatatcag caaagagggc aagtaataga agcccctgat aaggagcgtc agccgacagg 5040
caagcttggg aggctgtggg aatgggtctg cccccagctt cacagacctc ttcctccagc 5100
ctctgaatcc cattagccac agcctagaac attagctgag ctgcacaagc tcacccaccc 5160
ctgtgccagg gggccctgac ctccctccat gccatgtttt tggctgtatc tacggcactt 5220
aacaataggg gctttttatt ttcattacag agatattttg aaaaatttaa aagacatgaa 5280
ctcacataaa cagttatgga tgatagttaa aagagaaacg ggtggaggtg gatgagaggt 5340
tgtcttcatg aatataatta cttgagattt ttttttctta atggaattag tttattagaa 5400
aatgtctgtg ttaaatccgt agaaaaggaa gaaaagtgta gcaacaaaaa tgtagccatt 5460
atctaacttg ccataaatat ttgcagttat gataccttgg aatgttgcca cgatatggat 5520
tgctttgatt aaaagatgtc agttgaataa aacagtactg tgggagaatc gctttctgct 5580
gctagataaa tgctgatgtt tatttttaaa ccaggaaaca ttgatcctgt aacaatgccc 5640
gattacaatt gctttattac accccagggc tgatggagat gtaatcactt ggctaatgga 5700
tgtgggtgca ggacagatgc tcgcttgctg gcctgctttc ctgcttgcat tctgatgagc 5760
tgcaggagtg cgcctggcct tctgcaggtg gagctgctgt cagagcttcg tttcactgat 5820
acccaaagcc atgtctgact gaaataaaac aggttccctt tttttttccc tttggaaaat 5880
gccaactaag ggagactaat cagatatctt aacacaattt catccaggct tagtgctaac 5940
aagattgcgg ggctttttag ggtttaagaa gatgagaaat gagtgtgcac gtttcacacg 6000
ttgacttgcc ggtttttcca tgtcatacaa aaaagtcctg gctgtttctc cgaactggct 6060
gcctgcattc ccgtctttct tttgttttta agaaatagac tgaattcagc tgttaatcct 6120
ctagtacagt atccatgtta aaatgttttt ccattgcatc ttttatgtga attcaaaggt 6180
cagaatttt tgtctgtgat attgagacca tgtgtacaag aactactttt tgcttttcat 6240
cattcactcc ttagcaaacg tttcgtaagt accctctgtc tgtttgctac tatatgaggt 6300
gctgcgaaat tagtgggcgt ggctttttat atttttcatt cgtgtgtagc ctaagtaagg 6360
tgactcaaga tgatacaccg agagaaaaat gcaaaatata tttggttctc atttctgttg 6420
ctgtcgtttc ctttttaaag acgatttatc aactgctgcc atttggaact tcctataaga 6480
aactaaaaat gatctatttc agtgttcctt tcgcctttcc tctgctttct gaataaatgg 6540
tttcagtaac ccatgctgtt ctctccctat tctacgtctt tctccctatg ttgaaaaaag 6600
attcccacag tttctgatgt gtgtgtttat agtcttcaat gtatgttaac atgttaggaa 6660
ctgagtatct taagagatgt cttagaatgc tttagttttc ataatttgtc ctttatgtat 6720
ttttcattgt atttgctgtt ttgacatgga agtaatttaa aaagttggtg caggaaagga 6780
ctctttactg ttgcacattt tggttttctg atatgtaata aattcatggc ttggcagctg 6840
acatgatgtt tccccagagag aaggagatgt atttctgcag ggtccagacc aaaagagcca 6900
tttacagcat gttctcccat gttccattat cagcctgatg aaacctgccc tgccaaggca 6960
taaacttttg tactagctgt ctccatatta tgttcaataa attctgtgct ctgaatatat 7020
ttaaaaaaaaaaaaa 7035
<![CDATA[ <210> 23]]>
<![CDATA[ <211> 7949]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 23]]>
agtccccgcc cctggagccg gcggcgcagg gcgcagcttc ccgccgccag agcgggccag 60
cctgctgcgt gcgtgcgtgt gtacgactct gcgtgcgtgc gtgcgtgcgt gcgtgccgtc 120
agctcgccgg gcaccgcggc ctcgccctcg ccctccgccc ctgcgcctgc accgcgtaga 180
ccgacccccc cccagcgcgc ccaccccggta gaggaccccc gcccgtgccc cgaccggtcc 240
ccgccttttt gtaaaactta aagcgggcgc agcattaacg cttcccgccc cggtgacctc 300
tcaggggtct ccccgccaaa ggtgctccgc cgctaaggaa catggcgaag gtggagcagg 360
tcctgagcct cgagccgcag cacgagctca aattccgagg tcccttcacc gatgttgtca 420
ccaccaacct aaagcttggc aacccgacag accgaaatgt gtgttttaag gtgaagacta 480
cagcaccacg taggtactgt gtgaggccca acagcggaat catcgatgca ggggcctcaa 540
ttaatgtatc tgtgatgtta cagcctttcg attatgatcc caatgagaaa agtaaacaca 600
agtttatggt tcagtctatg tttgctccaa ctgacacttc agatatggaa gcagtatgga 660
aggaggcaaa accggaagac cttatggatt caaaacttag atgtgtgttt gaattgccag 720
cagagaatga taaaccacat gatgtagaaa taaataaaat tatatccaca actgcatcaa 780
agacagaaac accaatagtg tctaagtctc tgagttcttc tttggatgac accgaagtta 840
agaaggttat ggaagaatgt aagaggctgc aaggtgaagt tcagaggcta cgggaggaga 900
acaagcagtt caaggaagaa gatggactgc ggatgaggaa gacagtgcag agcaacagcc 960
ccatttcagc attagcccca actgggaagg aagaaggcct tagcacccgg ctcttggctc 1020
tggtggtttt gttctttatc gttggtgtaa ttatgggaa gattgccttg tagaggtagc 1080
atgcacagga tggtaaattg gattggtgga tccaccatat catgggattt aaatttatca 1140
taaccatgtg taaaaagaaa ttaatgtatg atgacatctc acaggtcttg cctttaaatt 1200
acccctccct gcacacacat acacagatac acacacacaa atataatgta acgatctttt 1260
agaaagttaa aaatgtatag taactgattg agggggaaaa gaatgatctt tattaatgac 1320
aagggaaacc atgagtaatg ccacaatggc atattgtaaa tgtcatttta aacattggta 1380
ggccttggta catgatgctg gattacctct cttaaaatga cacccttcct cgcctgttgg 1440
tgctggccct tggggagctg gagcccagca tgctggggag tgcggtcagc tccacacagt 1500
agtccccacg tggcccactc ccggcccagg ctgctttccg tgtcttcagt tctgtccaag 1560
ccatcagctc cttgggactg atgaacagag tcagaagccc aaaggaattg cactgtggca 1620
gcatcagacg tactcgtcat aagtgagagg cgtgtgttga ctgattgacc cagcgctttg 1680
gaaataaatg gcagtgcttt gttcacttaa agggaccaag ctaaatttgt attggttcat 1740
gtagtgaagt caaactgtta ttcagagatg tttaatgcat atttaactta tttaatgtat 1800
ttcatctcat gttttcttat tgtcacaaga gtacagttaa tgctgcgtgc tgctgaactc 1860
tgttgggtga actggtattg ctgctgggagg gctgtgggct cctctgtctc tggagagtct 1920
ggtcatgtgg aggtggggtt tattgggatg ctggagaaga gctgccagga agtgtttttt 1980
ctgggtcagt aaataacaac tgtcatagggg agggaaattc tcagtcagtga cagtcaactc 2040
taggttacct tttttaatga agagtagtca gtcttctaga ttgttcttat accacctctc 2100
aaccattact cacacttcca gcgcccaggt ccaagtctga gcctgacctc cccttgggga 2160
cctagcctgg agtcaggaca aatggatcgg gctgcagagg gttagaagcg agggcaccag 2220
cagttgtggg tggggagcaa gggaagagag aaactcttca gcgaatcctt ctagtactag 2280
ttgagagttt gactgtgaat taattttatg ccataaaaga ccaacccagt tctgtttgac 2340
tatgtagcat cttgaaaaga aaaattataa taaagcccca aaattaagaa ttcttttgtc 2400
attttgtcac atttgctcta tggggggaat tattatttta tcatttttat tattttgcca 2460
ttggaaggtt aactttaaaa tgagccctat cactgagaaa tacgtgtttc atgatttaac 2520
tctgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtgta tttttttttt ggttgtcttc 2580
agctgacagt atgaaaaatg aaactgctga aaaagctgag cacctggtca cccttggcct 2640
tccattgctt tggccttcag taaaaagcag cctcccttct aggtcaggga accatgccat 2700
tgagactagt aacgggcgtt ctgggcacag tcccactgtg cacaggtttg agaggacaag 2760
ttcatcagaa ggaaggcagt ccttagaagt cacatacgtt gagccacgtt gctcctaagc 2820
ctggctctgt caagctgggt caggggcctt gaaactggag aagtggaagt ctatggttgg 2880
tctgagtaag taacttcctg tcttcatgaa aaaagttgac tttgaatccc aggtactcac 2940
agaaatggtg aacagactta gttgttaccc aggcacccat ggattgtgtt gagtgtgcag 3000
acagggaggc caccccaata ggaattcgtc tccaggattt ttcccatgtg tcccccagta 3060
ccttataaaag ggagtgaaaa gaccgagctg taaggcatgt gccttctgcc acctgacttt 3120
ccgtgagggg actaaaattt actaattgta gttgctgcag ccagttaagt cctgtagctt 3180
ccaggccctc atgtctttga taggagtg cttaggtggt ccccaacagt gcctaggggt 3240
acagtacagt cccattacac tagagcaggg ctctatttat ttttaaagga tatggccgtg 3300
tgttttgata aaactttat cacaaaaaca gccgggtcat gggatttggc ttgtgagtct 3360
gtaacagttc ttaaaaagaa tatctgagaa actactctgt tttagacctt tgaaggtgat 3420
ttagagtttt gtgtacatct aggagaaggt gttcagcttc tcagaggatg tggacatttt 3480
ggttgcagct aaaaatcagt ctctgaagtc tctctccctt ctagaggtta ggacttggtg 3540
aacatgtttg tgggcctttt gactgagtgg cagaaggaaa ctgctcagga agagaaacag 3600
gtgactgatg ggaaggttga ttattttctc agtcatcctg gcagccaaaa atgtgccagg 3660
aaaagaaaga atgtggagca cgcgtggctc ctggaggact tggagatgca tgcacattta 3720
gggtgttttc cctagaatta cataatgaaa aaaagaataa ggcaaagagg aggtgaatat 3780
ggggcctgtc acaacggcct gccctgcccc aagagggtta agagtcagat aatcgggacg 3840
aaactggcat ggaaagagcg agcctaggga ggatgccgct gggcagtgtg catgggggag 3900
ctgctgccag gctgccctcc agtctgctcc tgtggttact ggctccacag cacctcagag 3960
agggcggccc tggcttcaga aatgccagcc atagtgctca caaatgcaga agagatggaa 4020
gcggtgacag aatcctgaaa gtttttattg attgaagttt taaattggta acttaagctt 4080
ccttggcacg atacaaaata cctcttaaag acagcaggct tttttatttg taggtgtgag 4140
gaactggctt taactttttt ctcctcctag tttgcatgtt ttccttctct cgtcttctga 4200
actgctggca ccagcagtaa tacatactga taaaatcaaa attgattttt accagtggcc 4260
agtttatggc tagagagacg acttatacct ccataacaca gaagggggaa aaatgaagaa 4320
cctccagtga tccgtgaaaa cctaaacgct ttcaaacaaa tcccaggaac agaattgcta 4380
tcgaaagata tcattgccca gtttgcaggc tatgttgagt cagatagaac tgaatgtagt 4440
gagagctcag agctacagag cctttcagat gaatttgaaa acagactctg tgtgtgtgtg 4500
catgtgtgca tgtgtgcatg tgtggcatat gtgccgtatg tcagtagctt gacagttttc 4560
aaatcgtgcc tatatttttt tgcatacaca aatttttgtg tttgcaaact cagaatccat 4620
gccaaaatac aatgttatat gtcattttca gctccttctc taaaggaatg gcccatttct 4680
cattgtagtt tgagaaatac atgtatgaag agataggggt cttgggcttc ccagtgtcac 4740
tttgaacacc tgaataacat ttaactcctg agaccttctc ggtgtagagg ccactgcttc 4800
cccctgctgg agatggcatt tcattgaagg gcctctcgtg gctttccctg cccccggctg 4860
tctggcctga agaaggagaa agaaccaaac tgaactatga aaagttacca ctctgaggag 4920
acctctctta attaacactt ggggccatgt ttgctgttgt tgagaaggag tgttctcaaa 4980
gatgagctgg aatggaattg tattagaaa ggcccctgca aagtatatag atggatgact 5040
ctagttcatg acatacaaat cccataaggc caacgaccac tcttctggaa caccaagagc 5100
agctctgaga tcatgctggc cctacgcgaa ttgagtttct gtggcctaat tggatttgga 5160
gaacgccttc cctggcccct tttcctcaga cagatctgct ctgataggaa ccttttcaag 5220
aaagttactg ttgtttcaat gccactcctt acctgtatag aacatttcca atacattcgc 5280
tcattgaact taatccttgc aactgtgact ggggggtaga tggctctgtt tgcatacgaa 5340
gaaataaagg ctccaggagg ttaaatcggg caacttttta gaactaaatc agtctctgta 5400
aggcctacat tgctaagata ccatttcagc tctgaaaatc tgcttcaggg aagtgagtgg 5460
atgaggcctt cctgcctcag ctactctgcc cgtctgtaca tcttttgtgt ctgcctccgt 5520
accttattca gttattttca cactaaagta agtagaatta agactgtagt tcagatgctt 5580
tttctttttc tgttggaaac tgaacacact acagacagtg aaaaaaggta catattccat 5640
tttctcattg cctgaagatc tctgctgatg ctcctggaga atgactttgg gggctttaga 5700
aagaatattg ccagtccgtc tcggcaagga gatgatggga gcgctttata tggaggcttt 5760
acatgacttg taaattaaat gtgaatgagg gcagttgatt aaaattggta ttacagaagg 5820
gccctgctga ggtttgaaaa cagctgagct gctgatgtct caggcctttc cctgaattag 5880
cactgcggtt ctccaggata tcagcaaaga gggcaagtaa tagaagcccc tgataaggag 5940
cgtcagccga caggcaagct tgggaggctg tgggaatggg tctgccccca gcttcacaga 6000
cctcttcctc cagcctctga atcccattag ccacagccta gaacattagc tgagctgcac 6060
aagctcaccc acccctgtgc cagggggccc tgacctccct ccatgccatg tttttggctg 6120
tatctacggc acttaacaat aggggctttt tattttcatt acagagatat tttgaaaaat 6180
ttaaaagaca tgaactcaca taaacagtta tggatgatag ttaaaagaga aacgggtgga 6240
ggtggatgag aggttgtctt catgaatata attacttgag attttttttt cttaatggaa 6300
ttagtttatt agaaaatgtc tgtgttaaat ccgtagaaaa ggaagaaaag tgtagcaaca 6360
aaaatgtagc cattatctaa cttgccataa atatttgcag ttatgatacc ttggaatgtt 6420
gccacgatat ggattgcttt gattaaaaga tgtcagttga ataaaacagt actgtggggag 6480
aatcgctttc tgctgctaga taaatgctga tgtttatttt taaaccagga aacattgatc 6540
ctgtaacaat gcccgattac aattgcttta ttacacccca gggctgatgg agatgtaatc 6600
acttggctaa tggatgtggg tgcaggacag atgctcgctt gctggcctgc tttcctgctt 6660
gcattctgat gagctgcagg agtgcgcctg gccttctgca ggtggagctg ctgtcagagc 6720
ttcgtttcac tgatacccaa agccatgtct gactgaaata aaacaggttc cctttttttt 6780
tccctttgga aaatgccaac taagggagac taatcagata tcttaacaca atttcatcca 6840
ggcttagtgc taacaagatt gcggggcttt ttagggttta agaagatgag aaatgagtgt 6900
gcacgtttca cacgttgact tgccggtttt tccatgtcat acaaaaaagt cctggctgtt 6960
tctccgaact ggctgcctgc attcccgtct ttcttttgtt tttaagaaat agactgaatt 7020
cagctgttaa tcctctagta cagtatccat gttaaaatgt ttttccattg catcttttat 7080
gtgaattcaa aggtcagaat ttaattgtctg tgatattgag accatgtgta caagaactac 7140
tttttgcttt tcatcattca ctccttagca aacgtttcgt aagtaccctc tgtctgtttg 7200
ctactatatg aggtgctgcg aaattagtgg gcgtggcttt ttatattttt cattcgtgtg 7260
tagcctaagt aaggtgactc aagatgatac accgagagaa aaatgcaaaa tatatttggt 7320
tctcatttct gttgctgtcg tttccttttt aaagacgatt tatcaactgc tgccatttgg 7380
aacttcctat aagaaactaa aaatgatcta tttcagtgtt cctttcgcct ttcctctgct 7440
ttctgaataa atggtttcag taacccatgc tgttctctcc ctattctacg tctttctccc 7500
tatgttgaaa aaagattccc acagtttctg atgtgtgtgt ttatagtctt caatgtatgt 7560
taacatgtta ggaactgagt atcttaagag atgtcttaga atgctttagt tttcataatt 7620
tgtcctttat gtatttttca tgtattttgc tgttttgaca tggaagtaat ttaaaaagtt 7680
ggtgcaggaa aggactcttt actgttgcac attttggttt tctgatatgt aataaattca 7740
tggcttggca gctgacatga tgtttcccag agagaaggag atgtatttct gcagggtcca 7800
gaccaaaaga gccattaca gcatgttctc ccatgttcca ttatcagcct gatgaaacct 7860
gccctgccaa ggcataaact tttgtactag ctgtctccat attatgttca ataaattctg 7920
tgctctgaat atatttaaaaaaaaaaaaa 7949
<![CDATA[ <210> 24]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 24]]>
Met Ala Lys Val Glu Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu
1 5 10 15
Lys Phe Arg
<![CDATA[ <210> 25]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 25]]>
Pro Phe Thr Asp Val Val Thr Thr Asn Leu Lys Leu Gly Asn Pro Thr
1 5 10 15
Asp Arg Asn Val Cys Phe Lys Val Lys Thr Thr Ala Pro Arg Arg Tyr
20 25 30
Cys Val Arg Pro Asn Ser Gly Ile Ile Asp Ala Gly Ala Ser Ile Asn
35 40 45
Val Ser
50
<![CDATA[ <210> 26]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 26]]>
Met Leu Gln Pro Phe Asp Tyr Asp Pro Asn Glu Lys Ser Lys His Lys
1 5 10 15
Phe Met Val Gln Ser Met Phe Ala Pro Thr Asp Thr Ser Asp Met Glu
20 25 30
Ala Val
<![CDATA[ <210> 27]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 27]]>
Trp Lys Glu Ala Lys Pro Glu Asp Leu Met Asp Ser Lys Leu Arg Cys
1 5 10 15
Val Phe Glu Leu Pro Ala Glu Asn Asp Lys Pro
20 25
<![CDATA[ <210> 28]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 28]]>
His Asp Val Glu Ile Asn Lys Ile Ile Ser Thr Thr Ala Ser Lys Thr
1 5 10 15
Glu Thr Pro Ile Val Ser Lys Ser Leu Ser Ser Ser Leu Asp Asp Thr
20 25 30
Glu Val Lys Lys Val Met Glu Glu Cys Lys Arg Leu Gln Gly Glu Val
35 40 45
Gln Arg Leu Arg Glu Glu Asn Lys Gln Phe Lys
50 55
<![CDATA[ <210> 29]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 29]]>
Glu Glu Asp Gly Leu Arg Met Arg Lys Thr Val Gln Ser Asn Ser Pro
1 5 10 15
Ile Ser Ala Leu Ala Pro Thr Gly Lys Glu Glu Gly Leu Ser Thr Arg
20 25 30
Leu Leu Ala Leu Val Val Leu Phe Phe Ile Val Gly Val Ile Ile Gly
35 40 45
Lys Ile Ala Leu
50
<![CDATA[ <210> 30]]>
<![CDATA[ <211> 243]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 30]]>
Met Ala Lys Val Glu Gln Val Leu Ser Leu Glu Pro Gln His Glu Leu
1 5 10 15
Lys Phe Arg Gly Pro Phe Thr Asp Val Val Thr Thr Asn Leu Lys Leu
20 25 30
Gly Asn Pro Thr Asp Arg Asn Val Cys Phe Lys Val Lys Thr Thr Ala
35 40 45
Pro Arg Arg Tyr Cys Val Arg Pro Asn Ser Gly Ile Ile Asp Ala Gly
50 55 60
Ala Ser Ile Asn Val Ser Val Met Leu Gln Pro Phe Asp Tyr Asp Pro
65 70 75 80
Asn Glu Lys Ser Lys His Lys Phe Met Val Gln Ser Met Phe Ala Pro
85 90 95
Thr Asp Thr Ser Asp Met Glu Ala Val Trp Lys Glu Ala Lys Pro Glu
100 105 110
Asp Leu Met Asp Ser Lys Leu Arg Cys Val Phe Glu Leu Pro Ala Glu
115 120 125
Asn Asp Lys Pro His Asp Val Glu Ile Asn Lys Ile Ile Ser Thr Thr
130 135 140
Ala Ser Lys Thr Glu Thr Pro Ile Val Ser Lys Ser Leu Ser Ser Ser Ser
145 150 155 160
Leu Asp Asp Thr Glu Val Lys Lys Val Met Glu Glu Cys Lys Arg Leu
165 170 175
Gln Gly Glu Val Gln Arg Leu Arg Glu Glu Asn Lys Gln Phe Lys Glu
180 185 190
Glu Asp Gly Leu Arg Met Arg Lys Thr Val Gln Ser Asn Ser Pro Ile
195 200 205
Ser Ala Leu Ala Pro Thr Gly Lys Glu Glu Gly Leu Ser Thr Arg Leu
210 215 220
Leu Ala Leu Val Val Leu Phe Phe Ile Val Gly Val Ile Ile Gly Lys
225 230 235 240
Ile Ala Leu
<![CDATA[ <210> 31]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 31]]>
000
<![CDATA[ <210> 32]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 32]]>
000
<![CDATA[ <210> 33]]>
<![CDATA[ <211> 253]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 33]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca cag 253
<![CDATA[ <210> 34]]>
<![CDATA[ <211> 147]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 34]]>
gtgatgggca gaatctgttt acgaaagacg tgacagtgat cgaggggagag gttgcgacca 60
tcagttgcca agtcaataag agtgacgact ctgtgattca gctactgaat cccaacaggc 120
agaccattta tttcagggac ttcaggc 147
<![CDATA[ <210> 35]]>
<![CDATA[ <211> 153]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 35]]>
ctttgaagga cagcaggttt cagttgctga atttttctag cagtgaactc aaagtatcat 60
tgacaaacgt ctcaatttct gatgaaggaa gatacttttg ccagctctat accgatcccc 120
cacaggaaag ttacaccacc atcacagtcc tgg 153
<![CDATA[ <210> 36]]>
<![CDATA[ <211> 138]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 36]]>
tcccaccacg taatctgatg atcgatatcc agaaagacac tgcggtggaa ggtgaggaga 60
ttgaagtcaa ctgcactgct atggccagca agccagccac gactatcagg tggttcaaag 120
ggaacacaga gctaaaag 138
<![CDATA[ <210> 37]]>
<![CDATA[ <211> 159]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 37]]>
gcaaatcgga ggtggaagag tggtcagaca tgtacactgt gaccagtcag ctgatgctga 60
aggtgcacaa ggaggacgat gggtcccag tgatctgcca ggtggagcac cctgcggtca 120
ctggaaacct gcagacccag cggtatctag aagtacagt 159
<![CDATA[ <210> 38]]>
<![CDATA[ <211> 100]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 38]]>
ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc cgggaagggg 60
acgcgcttga gttaacatgt gaagccatcg ggaagcccca 100
<![CDATA[ <210> 39]]>
<![CDATA[ <211> 173]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 39]]>
gcctgtgatg gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc 60
tgggcccaac ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga 120
agcttcaaac atagtgggga aagctcactc ggattatatg ctgtatgtat acg 173
<![CDATA[ <210> 40]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 40]]>
acacaacggc gacgacagaa ccagcagttc acg 33
<![CDATA[ <210> 41]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 41]]>
gccttactca gttgcccaat tccgcagaag aactggacag tgaggacctc tcag 54
<![CDATA[ <210> 42]]>
<![CDATA[ <211> 132]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 42]]>
attcccgagc aggtgaagaa ggctcgatca gggcagtgga tcatgccgtg atcggtggcg 60
tcgtggcggt ggtggtgttc gccatgctgt gcttgctcat cattctgggg cgctattttg 120
ccagacataa ag 132
<![CDATA[ <210> 43]]>
<![CDATA[ <211> 2968]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 43]]>
gtacatactt cactcatgaa gccaaaggag ccgatgacgc agcagacgca gacacagcta 60
taatcaatgc agaaggagga cagaacaact ccgaagaaaa gaaagagtac ttcatctaga 120
tcagcctttt tgtttcaatg aggtgtccaa ctggccctat ttagatgata aagagacagt 180
gatattggaa cttgcgagaa attcgtgtgtgtttttttatga atgggtggaa aggtgtgaga 240
ctgggaaggc ttgggaatttg ctgtgtaaaa aaaaaaaaaa tgttctttgg aaagtacact 300
ctgctgtttg acacctcttt tttcgtttgt ttgtttgttt aatttttatt tcttcctacc 360
aagtcaaact tggatacttg gatttagttt cagtagattg cagaaaattc tgtgccttgt 420
tttttgtttg tttgttgcgt tcctttcttt tccccctttg tgcacattta tttcctccct 480
ctaccccaat ttcggatttt ttccaaaatc tcccattttg gaatttgcct gctgggattc 540
cttagactct tttccttccc ttttctgttc tagtttttta cttttgttta tttttatggt 600
aactgctttc tgttccaaat tcagtttcat aaaaggagaa ccagcacagc ttagatttca 660
tagttcagaa tttagtgtat ccataatgca ttcttctctg ttgtcgtaaa gatttgggtg 720
aacaaacaat gaaaactctt tgctgctgcc catgtttcaa atacttagag cagtgaagac 780
tagaaaatta gactgtgatt cagaaaatgt tctgtttgct gtggaactac attackgtac 840
agggttatct gcaagtgagg tgtgtcacaa tgagattgaa tttcactgtc tttaattctg 900
tatctgtaga cggctcagta tagataccct acgctgtcca gaaaggtttg gggcagaaag 960
gactcctcct ttttccatgc cctaaacaga cctgacaggt gaggtctgtt ccttttatat 1020
aagtggaca attttgagtt gccacaggag gggaagtagg gaggggggaa atacagttct 1080
gctctggttg tttctgttcc aaatgattcc atccaccttt cccaatcggc cttacttctc 1140
actaatttgt aggaaaaagc aagttcgtct gttgtgcgaa tgactgaatg ggacagagtt 1200
gatttttttt tttttttcct ttgtgcttag ttaggaaggc agtaggatgt ggcctgcatg 1260
tactgtatat tacagatatt tgtcatgctg ggatttccaa ctcgaatctg tgtgaaactt 1320
tcattccttc agatttggct tgacaaaggc aggaggtaca aaagaagggc tggtattgtt 1380
ctcacactgg tctgctgtcg ctctcagttc tcgataggtc agagcagagg tggaaaaaca 1440
gcatgtacgg attttcagtt acttaatcaa aactcaaatg tgagtgtttt tatcttttta 1500
cctttcatac actagccttg gcctctttcc tcagccttaa gaaccatctg ccaaaaatta 1560
ctgatcctcg catgatggca gccatagtgc atagctacta aaatcagtga ccttgaacat 1620
atcttagatg gggagcctcg ggaaaaggta gaggagtcac gttaccattt acatgtttta 1680
aagaaagaag tgtggggatt ttcactgaaa cgtctaggaa atctagaagt agtcctgaag 1740
gacagaaact aaactcttac catatgtttg gtaagactcc agactccagc taacagtccc 1800
tatggaaaga tggcatcaaa aaagatagat ctatatatat atataaatat atattctatt 1860
acattttcag tgagtaattt tggattttgc aaggtgcatt tttactattg ttacattatg 1920
tggaaaactt atgctgattt atttaagggg gaaaaagtgt caactctttg ttatttgaaa 1980
acatgtttatttttcttgtc tttattttaa cctttgatag aaccattgca atatgggggc 2040
cttttgggaa cggactggta tgtaaaagaa aatccattat cgagcagcat tttattacc 2100
cctcccctat ccctaggcac ttaaccaaga caaaaagcca caatgaacat ccctttttca 2160
atgaatttta taatctgcag ctctattccg agcccttagc acccattccg accatagtat 2220
aatcatatca aagggtgaga atcatttagc atgttgttga aaggtttttt ttcagttgtt 2280
ctttttagaaaaaaagaaaaaaaaaacaaaaacaaaaaaaaaaaatcac accattgctc 2340
acagaattgg catctcattt ttgggacctc ccatctttct gttttgaaaa gtgtacagta 2400
gtgcagtgtt cctgatgtaa ctttatggct tacaatgttg acatgtctca ggttcatgtg 2460
ttgcgattgg tgttttccgt ctcaggtaga ttgcaaagtg taggccccac aattggaaa 2520
aaataataat aaaacaaagc aaaaacagga aattatggat tttagttgta tattggttta 2580
tgtatttttt cttaagtata cagtgcactg tttgaaatgt attgttgagt attactttgt 2640
acaggttgat cacttttttt agagtgaaga aagaacaaac ttgttttttg tgttttttaa 2700
aggaatataa aataatgaag gatgtataat tgatgccaaa taagcttgtt ctttagtcac 2760
accgacgtct tatttttccc tttaggccag ttctgttttt aaggtgtaca tggacaatgt 2820
tacagtgtaa gaaactccat atccatatgt tcccattcgc attttgtatt ggttcatgta 2880
taccattttt acaaaaaaaaaaagaaaaaaaagaagtact ataaaatatc tgtcttctta 2940
ataaaaaaaa attaatgtta caaagtga 2968
<![CDATA[ <210> 44]]>
<![CDATA[ <211> 4310]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 44]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca caggtgatgg gcagaatctg tttacgaaag acgtgacagt gatcgaggga 300
gaggttgcga ccatcagttg ccaagtcaat aagagtgacg actctgtgat tcagctactg 360
aatcccaaca ggcagaccat ttatttcagg gacttcaggc ctttgaagga cagcaggttt 420
cagttgctga atttttctag cagtgaactc aaagtatcat tgacaaacgt ctcaatttct 480
gatgaaggaa gatacttttg ccagctctat accgatcccc cacaggaaag ttacaccacc 540
atcacagtcc tggtcccacc acgtaatctg atgatcgata tccagaaaga cactgcggtg 600
gaaggtgagg agattgaagt caactgcact gctatggcca gcaagccagc cacgactatc 660
aggtggttca aagggaacac agagctaaaa ggcaaatcgg aggtggaaga gtggtcagac 720
atgtacactg tgaccagtca gctgatgctg aaggtgcaca aggaggacga tggggtccca 780
gtgatctgcc aggtggagca ccctgcggtc actggaaacc tgcagaccca gcggtatcta 840
gaagtacagt ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc 900
cgggaagggg acgcgcttga gttaacatgt gaagccatcg ggaagcccca gcctgtgatg 960
gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc tgggcccaac 1020
ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga agcttcaaac 1080
atagtgggga aagctcactc ggattatatg ctgtatgtat acgacacaac ggcgacgaca 1140
gaaccagcag ttcacggcct tactcagttg cccaattccg cagaagaact ggacagtgag 1200
gacctctcag attcccgagc aggtgaagaa ggctcgatca gggcagtgga tcatgccgtg 1260
atcggtggcg tcgtggcggt ggtggtgttc gccatgctgt gcttgctcat cattctgggg 1320
cgctattttg ccagacataa aggtacatac ttcactcatg aagccaaagg agccgatgac 1380
gcagcagacg cagacacagc tataatcaat gcagaaggag gacagaacaa ctccgaagaa 1440
aagaaagagt acttcatcta gatcagcctt tttgtttcaa tgaggtgtcc aactggccct 1500
atttagatga taaagagaca gtgatattgg aacttgcgag aaattcgtgtgtttttttat 1560
gaatgggtgg aaaggtgtga gactgggaag gcttgggat tgctgtgtaa aaaaaaaaaa 1620
aatgttcttt ggaaagtaca ctctgctgtt tgacacctct tttttcgttt gtttgtttgt 1680
ttaattttta tttcttccta ccaagtcaaa cttggatact tggatttagt ttcagtagat 1740
tgcagaaaat tctgtgcctt gttttttgtt tgtttgttgc gttcctttct tttccccctt 1800
tgtgcacatt tatttcctcc ctctacccca atttcggatt ttttccaaaa tctcccattt 1860
tggaatttgc ctgctgggat tccttagact cttttccttc ccttttctgt tctagttttt 1920
tacttttgtt tatttttatg gtaactgctt tctgttccaa attcagtttc ataaaaggag 1980
aaccagcaca gcttagattt catagttcag aatttaggtgt atccataatg cattcttctc 2040
tgttgtcgta aagatttggg tgaacaaaca atgaaaactc tttgctgctg cccatgtttc 2100
aaatacttag agcagtgaag actagaaaat tagactgtga ttcagaaaat gttctgtttg 2160
ctgtggaact aattactgt acagggttat ctgcaagtga ggtgtgtcac aatgagattg 2220
aatttcactg tctttaattc tgtatctgta gacggctcag tatagatacc ctacgctgtc 2280
cagaaaggtt tggggcagaa aggactcctc ctttttccat gccctaaaca gacctgacag 2340
gtgaggtctg ttccttttat ataagtggac aaattttgag ttgccacagg agggggaagta 2400
gggagggggg aaatacagtt ctgctctggt tgtttctgtt ccaaatgatt ccatccacct 2460
ttcccaatcg gccttacttc tcactaattt gtaggaaaaa gcaagttcgt ctgttgtgcg 2520
aatgactgaa tgggacagag ttgatttttttttttttttc ctttgtgctt agttaggaag 2580
gcagtaggat gtggcctgca tgtactgtat attacagata tttgtcatgc tgggatttcc 2640
aactcgaatc tgtgtgaaac tttcattcct tcagatttgg cttgacaaag gcaggaggta 2700
caaaagaagg gctggtattg ttctcacact ggtctgctgt cgctctcagt tctcgatagg 2760
tcagagcaga ggtggaaaaa cagcatgtac ggattttcag ttacttaatc aaaactcaaa 2820
tgtgagtgtt tttatctttt tacctttcat aacactagcct tggcctcttt cctcagcctt 2880
aagaaccatc tgccaaaaat tactgatcct cgcatgatgg cagccatagt gcatagctac 2940
taaaatcagt gaccttgaac atatcttaga tggggagcct cgggaaaagg tagagaggtc 3000
acgttaccat ttacatgttt taaagaaaga agtgtggggga ttttcactga aacgtctagg 3060
aaatctagaa gtagtcctga aggacagaaa ctaaactctt accatatgtt tggtaagact 3120
ccagactcca gctaacagtc cctatggaaa gatggcatca aaaaagatag atctatatat 3180
atatataaat atatattcta ttacattttc agtgagtaat tttggatttt gcaaggtgca 3240
tttttactat tgttacatta tgtggaaaac ttatgctgat ttattaagg gggaaaaagt 3300
gtcaactctt tgttatttga aaacatgttt atttttcttg tctttattt aacctttgat 3360
agaaccattg caatatgggg gccttttgggg aacggactgg tatgtaaaag aaaatccatt 3420
atcgagcagc attttatta cccctcccct atccctaggc acttaaccaa gacaaaaagc 3480
cacaatgaac atcccttttt caatgaattt tataatctgc agctctattc cgagccctta 3540
gcacccattc cgaccatagt ataatcatat caaagggtga gaatcattta gcatgttgtt 3600
gaaaggttttttttcagttg ttctttttag aaaaaaagaaaaacaaaaac aaaaacaaaa 3660
aaaaaaaatc acaccattgc tcacagaatt ggcatctcat ttttgggacc tcccatcttt 3720
ctgttttgaa aagtgtacag tagtgcagtg ttcctgatgt aactttatgg cttacaatgt 3780
tgacatgtct caggttcatg tgttgcgatt ggtgttttcc gtctcaggta gattgcaaag 3840
tgtaggcccc acacattgga aaaaataata ataaaacaaa gcaaaaacag gaaattatgg 3900
attttagttg tatattggtt tatgtattttttcttaagta tacagtgcac tgtttgaaat 3960
gtattgttga gtattacttt gtacaggttg atcacttttt ttagagtgaa gaaagaacaa 4020
acttgttttttgtgtttttt aaaggaatat aaaataatga aggatgtata attgatgcca 4080
aataagcttg ttctttagtc acaccgacgt cttatttttc cctttaggcc agttctgttt 4140
ttaaggtgta catggacaat gttacagtgt aagaaactcc atatccatat gttcccattc 4200
gcattttgta ttggttcatg tataccattt ttacaaaaaaaaaaagaaaaaaaagaagta 4260
ctataaaata tctgtcttct taataaaaaa aaattaatgt tacaaagtga 4310
<![CDATA[ <210> 45]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 45]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Leu Ile Pro Thr
35 40
<![CDATA[ <210> 46]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 46]]>
Asp Gly Gln Asn Leu Phe Thr Lys Asp Val Thr Val Ile Glu Gly Glu
1 5 10 15
Val Ala Thr Ile Ser Cys Gln Val Asn Lys Ser Asp Asp Ser Val Ile
20 25 30
Gln Leu Leu Asn Pro Asn Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg
35 40 45
<![CDATA[ <210> 47]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 47]]>
Leu Lys Asp Ser Arg Phe Gln Leu Leu Asn Phe Ser Ser Ser Ser Glu Leu
1 5 10 15
Lys Val Ser Leu Thr Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe
20 25 30
Cys Gln Leu Tyr Thr Asp Pro Gln Glu Ser Tyr Thr Thr Ile Thr
35 40 45
Val Leu
50
<![CDATA[ <210> 48]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 48]]>
Pro Pro Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu
1 5 10 15
Gly Glu Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala
20 25 30
Thr Thr Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys
35 40 45
<![CDATA[ <210> 49]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 49]]>
Lys Ser Glu Val Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln
1 5 10 15
Leu Met Leu Lys Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys
20 25 30
Gln Val Glu His Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr
35 40 45
Leu Glu Val Gln
50
<![CDATA[ <210> 50]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 50]]>
Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu Thr
1 5 10 15
Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys Pro
20 25 30
<![CDATA[ <210> 51]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 51]]>
Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro Gln His
1 5 10 15
Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn Lys Thr
20 25 30
Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly Lys Ala
35 40 45
His Ser Asp Tyr Met Leu Tyr Val Tyr
50 55
<![CDATA[ <210> 52]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 52]]>
Thr Thr Ala Thr Thr Glu Pro Ala Val His
1 5 10
<![CDATA[ <210> 53]]>
<![CDATA[ <211> 17]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 53]]>
Leu Thr Gln Leu Pro Asn Ser Ala Glu Glu Leu Asp Ser Glu Asp Leu
1 5 10 15
Ser
<![CDATA[ <210> 54]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 54]]>
Ser Arg Ala Gly Glu Glu Gly Ser Ile Arg Ala Val Asp His Ala Val
1 5 10 15
Ile Gly Gly Val Val Ala Val Val Val Phe Ala Met Leu Cys Leu Leu
20 25 30
Ile Ile Leu Gly Arg Tyr Phe Ala Arg His Lys
35 40
<![CDATA[ <210> 55]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 55]]>
Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp Asp Ala Ala Asp Ala
1 5 10 15
Asp Thr Ala Ile Ile Asn Ala Glu Gly Gly Gln Asn Asn Ser Glu Glu
20 25 30
Lys Lys Glu Tyr Phe Ile
35
<![CDATA[ <210> 56]]>
<![CDATA[ <211> 443]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 56]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe
35 40 45
Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys
50 55 60
Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn
65 70 75 80
Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg
85 90 95
Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr
100 105 110
Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr
115 120 125
Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro
130 135 140
Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu
145 150 155 160
Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr
165 170 175
Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val
180 185 190
Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys
195 200 205
Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His
210 215 220
Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln
225 230 235 240
Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu
245 250 255
Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys
260 265 270
Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro
275 280 285
Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn
290 295 300
Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly
305 310 315 320
Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr Asp Thr Thr Ala Thr
325 330 335
Thr Glu Pro Ala Val His Gly Leu Thr Gln Leu Pro Asn Ser Ala Glu
340 345 350
Glu Leu Asp Ser Glu Asp Leu Ser Asp Ser Arg Ala Gly Glu Glu Gly
355 360 365
Ser Ile Arg Ala Val Asp His Ala Val Ile Gly Gly Val Val Ala Val
370 375 380
Val Val Phe Ala Met Leu Cys Leu Leu Ile Ile Leu Gly Arg Tyr Phe
385 390 395 400
Ala Arg His Lys Gly Thr Tyr Phe Thr His Glu Ala Lys Gly Ala Asp
405 410 415
Asp Ala Ala Asp Ala Asp Thr Ala Ile Ile Asn Ala Glu Gly Gly Gln
420 425 430
Asn Asn Ser Glu Glu Lys Lys Glu Tyr Phe Ile
435 440
<![CDATA[ <210> 57]]>
<![CDATA[ <211> 1123]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 57]]>
ggttgggctc gcggcgctgt gattggtctg cccggactcc gcctccagcg catgtcatta 60
gcatctcatt agctgtccgc tcgggctccg gaggcagcca acgccgccag tctgaggcag 120
gtgcccgaca tggcgagtgt agtgctgccg agcggatccc agtgtgcggc ggcagcggcg 180
gcggcggcgc ctcccgggct ccggctccgg cttctgctgt tgctcttctc cgccgcggca 240
ctgatcccca caggtgatgg gcagaatctg tttacgaaag acgtgacagt gatcgaggga 300
gaggttgcga ccatcagttg ccaagtcaat aagagtgacg actctgtgat tcagctactg 360
aatcccaaca ggcagaccat ttatttcagg gacttcaggc ctttgaagga cagcaggttt 420
cagttgctga atttttctag cagtgaactc aaagtatcat tgacaaacgt ctcaatttct 480
gatgaaggaa gatacttttg ccagctctat accgatcccc cacaggaaag ttacaccacc 540
atcacagtcc tggtcccacc acgtaatctg atgatcgata tccagaaaga cactgcggtg 600
gaaggtgagg agattgaagt caactgcact gctatggcca gcaagccagc cacgactatc 660
aggtggttca aagggaacac agagctaaaa ggcaaatcgg aggtggaaga gtggtcagac 720
atgtacactg tgaccagtca gctgatgctg aaggtgcaca aggaggacga tggggtccca 780
gtgatctgcc aggtggagca ccctgcggtc actggaaacc tgcagaccca gcggtatcta 840
gaagtacagt ataagcctca agtgcacatt cagatgactt atcctctaca aggcttaacc 900
cgggaagggg acgcgcttga gttaacatgt gaagccatcg ggaagcccca gcctgtgatg 960
gtaacttggg tgagagtcga tgatgaaatg cctcaacacg ccgtactgtc tgggcccaac 1020
ctgttcatca ataacctaaa caaaacagat aatggtacat accgctgtga agcttcaaac 1080
atagtgggga aagctcactc ggattatatg ctgtatgtat acg 1123
<![CDATA[ <210> 58]]>
<![CDATA[ <211> 331]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 58]]>
Met Ala Ser Val Val Leu Pro Ser Gly Ser Gln Cys Ala Ala Ala Ala
1 5 10 15
Ala Ala Ala Ala Pro Pro Gly Leu Arg Leu Arg Leu Leu Leu Leu Leu Leu
20 25 30
Phe Ser Ala Ala Ala Ala Leu Ile Pro Thr Gly Asp Gly Gln Asn Leu Phe
35 40 45
Thr Lys Asp Val Thr Val Ile Glu Gly Glu Val Ala Thr Ile Ser Cys
50 55 60
Gln Val Asn Lys Ser Asp Asp Ser Val Ile Gln Leu Leu Asn Pro Asn
65 70 75 80
Arg Gln Thr Ile Tyr Phe Arg Asp Phe Arg Pro Leu Lys Asp Ser Arg
85 90 95
Phe Gln Leu Leu Asn Phe Ser Ser Ser Glu Leu Lys Val Ser Leu Thr
100 105 110
Asn Val Ser Ile Ser Asp Glu Gly Arg Tyr Phe Cys Gln Leu Tyr Thr
115 120 125
Asp Pro Pro Gln Glu Ser Tyr Thr Thr Ile Thr Val Leu Val Pro Pro
130 135 140
Arg Asn Leu Met Ile Asp Ile Gln Lys Asp Thr Ala Val Glu Gly Glu
145 150 155 160
Glu Ile Glu Val Asn Cys Thr Ala Met Ala Ser Lys Pro Ala Thr Thr
165 170 175
Ile Arg Trp Phe Lys Gly Asn Thr Glu Leu Lys Gly Lys Ser Glu Val
180 185 190
Glu Glu Trp Ser Asp Met Tyr Thr Val Thr Ser Gln Leu Met Leu Lys
195 200 205
Val His Lys Glu Asp Asp Gly Val Pro Val Ile Cys Gln Val Glu His
210 215 220
Pro Ala Val Thr Gly Asn Leu Gln Thr Gln Arg Tyr Leu Glu Val Gln
225 230 235 240
Tyr Lys Pro Gln Val His Ile Gln Met Thr Tyr Pro Leu Gln Gly Leu
245 250 255
Thr Arg Glu Gly Asp Ala Leu Glu Leu Thr Cys Glu Ala Ile Gly Lys
260 265 270
Pro Gln Pro Val Met Val Thr Trp Val Arg Val Asp Asp Glu Met Pro
275 280 285
Gln His Ala Val Leu Ser Gly Pro Asn Leu Phe Ile Asn Asn Leu Asn
290 295 300
Lys Thr Asp Asn Gly Thr Tyr Arg Cys Glu Ala Ser Asn Ile Val Gly
305 310 315 320
Lys Ala His Ser Asp Tyr Met Leu Tyr Val Tyr
325 330
<![CDATA[ <210> 59]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 59]]>
000
<![CDATA[ <210> 60]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 60]]>
000
<![CDATA[ <210> 61]]>
<![CDATA[ <211> 200]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 61]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg 200
<![CDATA[ <210> 62]]>
<![CDATA[ <211> 166]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 62]]>
atttgaatat aacctgccgc tttgcaggtg tattccacgt ggagaaaaat ggtcgctaca 60
gcatctctcg gacggaggcc gctgacctct gcaaggcttt caatagcacc ttgcccacaa 120
tggccccagat ggagaaagct ctgagcatcg gatttgagac ctgcag 166
<![CDATA[ <210> 63]]>
<![CDATA[ <211> 134]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 63]]>
gtatgggttc atagaagggc acgtggtgat tccccggatc caccccaact ccatctgtgc 60
agcaaacaac acagggggtgt acatcctcac atccaacacc tccccagtatg acacatattg 120
cttcaatgct tcag 134
<![CDATA[ <210> 64]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 64]]>
ctccacctga agaagattgt acatcagtca cagacctgcc caatgccttt gatggaccaa 60
ttaccataa 69
<![CDATA[ <210> 65]]>
<![CDATA[ <211> 231]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 65]]>
ctattgttaa ccgtgatggc acccgctatg tccagaaagg agaatacaga acgaatcctg 60
aagacatcta ccccagcaac cctactgatg atgacgtgag cagcggctcc tccagtgaaa 120
ggagcagcac ttcaggaggt tacatctttt acaccttttc tactgtacac cccatcccag 180
acgaagacag tccctggatc accgacagca cagacagaat ccctgctacc a 231
<![CDATA[ <210> 66]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 66]]>
ctttgatgag cactagtgct acagcaactg agacagcaac caagaggcaa gaaacctggg 60
attggttttc atggttgttt ctaccatcag agtcaaagaa tcatcttcac acaacaacac 120
aaatggctg 129
<![CDATA[ <210> 67]]>
<![CDATA[ <211> 126]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 67]]>
gtacgtcttc aaataccatc tcagcaggct gggagccaaa tgaagaaaat gaagatgaaa 60
gagacagaca cctcagtttt tctggatcag gcattgatga tgatgaagat tttatctcca 120
gcacca 126
<![CDATA[ <210> 68]]>
<![CDATA[ <211> 114]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 68]]>
tttcaaccac accacgggct tttgaccaca caaaacagaa ccaggactgg acccagtgga 60
acccaagcca ttcaaatccg gaagtgctac ttcagacaac cacaaggatg actg 114
<![CDATA[ <210> 69]]>
<![CDATA[ <211> 117]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 69]]>
atgtagacag aaatggcacc actgcttatg aaggaaactg gaacccagaa gcacaccctc 60
ccctcattca ccatgagcat catgaggaag aagagacccc aattctaca agcacaa 117
<![CDATA[ <210> 70]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 70]]>
tccaggcaac tcctagtagt acaacggaag aaacagctac ccagaaggaa cagtggtttg 60
gcaacagatg gcatgaggga tatcgccaaa cacccaaaga agactcccat tcgacaacag 120
ggacagctg 129
<![CDATA[ <210> 71]]>
<![CDATA[ <211> 132]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 71]]>
cagcctcagc tcataccagc catccaatgc aaggaaggac aacaccaagc ccagaggaca 60
gttcctggac tgatttcttc aacccaatct cacaccccat gggacgaggt catcaagcag 120
gaagaaggat gg 132
<![CDATA[ <210> 72]]>
<![CDATA[ <211> 102]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 72]]>
atatggactc cagtcatagt ataacgcttc agcctactgc aaatccaaac acaggtttgg 60
tggaagattt ggacaggaca ggacctcttt caatgacaac gc 102
<![CDATA[ <210> 73]]>
<![CDATA[ <211> 90]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 73]]>
agcagagtaa ttctcagagc ttctctacat cacatgaagg cttggaagaa gataaagacc 60
atccaacaac ttctactctg acatcaagca 90
<![CDATA[ <210> 74]]>
<![CDATA[ <211> 204]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 74]]>
ataggaatga tgtcacaggt ggaagaagag acccaaatca ttctgaaggc tcaactactt 60
tactggaagg ttatacctct cattacccac acacgaagga aagcaggacc ttcatcccag 120
tgacctcagc taagactggg tcctttggag ttactgcagt tactgttgga gattccaact 180
ctaatgtcaa tcgttcctta tcag 204
<![CDATA[ <210> 75]]>
<![CDATA[ <211> 63]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 75]]>
gagaccaaga cacattccac cccagtgggg ggtcccatac cactcatgga tctgaatcag 60
atg 63
<![CDATA[ <210> 76]]>
<![CDATA[ <211> 72]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400>]]> 76
gacactcaca tgggagtcaa gaaggtggag caaacacaac ctctggtcct ataaggacac 60
cccaaattcc ag 72
<![CDATA[ <210> 77]]>
<![CDATA[ <211> 79]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 77]]>
aatggctgat catcttggca tccctcttgg ccttggcttt gattcttgca gtttgcattg 60
cagtcaacag tcgaagaag 79
<![CDATA[ <210> 78]]>
<![CDATA[ <211> 3274]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 78]]>
gtgtgggcag aagaaaaagc tagtgatcaa cagtggcaat ggagctgtgg aggacagaaa 60
gccaagtgga ctcaacggag aggccagcaa gtctcaggaa atggtgcatt tggtgaacaa 120
ggagtcgtca gaaactccag accagtttat gacagctgat gagacaagga acctgcagaa 180
tgtggacatg aagattgggg tgtaacacct acaccattat cttggaaaga aacaaccgtt 240
ggaaacataa ccattacagg gagctgggac acttaacaga tgcaatgtgc tactgattgt 300
ttcattgcga atctttttta gcataaaatt ttctactctt tttgtttttt gtgttttgtt 360
ctttaaagtc aggtccaatt tgtaaaaaca gcattgcttt ctgaaattag ggcccaatta 420
ataatcagca agaatttgat cgttccagtt cccacttgga ggcctttcat ccctcgggtg 480
tgctatggat ggcttctaac aaaaactaca catatgtatt cctgatcgcc aacctttccc 540
ccaccagcta aggacatttc ccagggttaa tagggcctgg tccctgggag gaaatttgaa 600
tgggtccatt ttgcccttcc atagcctaat ccctgggcat tgctttccac tgaggttggg 660
ggttggggtg tactagttac acatcttcaa cagaccccct ctagaaattt ttcagatgct 720
tctgggagac acccaaaggg tgaagctatt tatctgtagt aaactattta tctgtgtttt 780
tgaaatatta aaccctggat cagtcctttg atcagtataa ttttttaaag ttactttgtc 840
agaggcacaa aagggtttaa actgattcat aataaatatc tgtacttctt cgatcttcac 900
cttttgtgct gtgattcttc agtttctaaa ccagcactgt ctgggtccct acaatgtatc 960
aggaagagct gagaatggta aggagactct tctaagtctt catctcagag accctgagtt 1020
cccactcaga cccactcagc caaatctcat ggaagaccaa ggagggcagc actgtttttg 1080
ttttttgttt tttgttttttttttttgaca ctgtccaaag gttttccatc ctgtcctgga 1140
atcagagttg gaagctgagg agcttcagcc tcttttatgg tttaatggcc acctgttctc 1200
tcctgtgaaa ggctttgcaa agtcacatta agtttgcatg acctgttatc cctggggccc 1260
tatttcatag aggctggccc tattagtgat ttccaaaaac aatatggaag tgccttttga 1320
tgtcttacaa taagagaaga agccaatgga aatgaaagag attggcaaag gggaaggatg 1380
atgccatgta gatcctgttt gacatttta tggctgtatt tgtaaactta aacacaccag 1440
tgtctgttct tgatgcagtt gctatttagg atgagttaag tgcctgggga gtccctcaaa 1500
aggttaaagg gattcccatc attggaatct tatcaccaga taggcaagtt tatgaccaaa 1560
caagagagta ctggctttat cctctaacct catattttct cccacttggc aagtcctttg 1620
tggcatttat tcatcagtca gggtgtccga ttggtcctag aacttccaaa ggctgcttgt 1680
catagaagcc attgcatcta taaagcaacg gctcctgtta aatggtatct cctttctgag 1740
gctcctacta aaagtcattt gttacctaaa cttatgtgct taacaggcaa tgcttctcag 1800
accacaaagc agaaagaaga agaaaagctc ctgactaaat cagggctggg cttagacaga 1860
gttgatctgt agaatatctt taaaggagag atgtcaactt tctgcactat tcccagcctc 1920
tgctcctccc tgtctaccct ctcccctccc tctctccctc cacttcaccc cacaatcttg 1980
aaaaacttcc tttctcttct gtgaacatca ttggccagat ccattttcag tggtctggat 2040
ttctttttat tttcttttca acttgaaaga aactggacat taggccacta tgtgttgtta 2100
ctgccactag tgttcaagtg cctcttgttt tcccagagat ttcctgggtc tgccagaggc 2160
ccagacaggc tcactcaagc tctttaactg aaaagcaaca agccactcca ggacaaggtt 2220
caaaatggtt acaacagcct ctacctgtcg ccccaggggag aaaggggtag tgatacaagt 2280
ctcatagcca gagatggttt tccactcctt ctagatattc ccaaaaagag gctgagacag 2340
gaggttattt tcaattttat tttggaatta aatacttttt tccctttatt actgttgtag 2400
tccctcactt ggatatacct ctgttttcac gatagaaata aggggaggtct agagcttcta 2460
ttccttggcc attgtcaacg gagagctggc caagtcttca caaacccttg caacattgcc 2520
tgaagtttat ggaataagat gtattctcac tcccttgatc tcaagggcgt aactctggaa 2580
gcacagcttg actacacgtc atttttacca atgattttca ggtgacctgg gctaagtcat 2640
ttaaactggg tctttataaa agtaaaaggc caacatttaa ttatttgca aagcaaccta 2700
agagctaaag atgtaatttt tcttgcaatt gtaaatcttt tgtgtctcct gaagacttcc 2760
cttaaaatta gctctgagtg aaaaatcaaa agagacaaaa gacatcttcg aatccatatt 2820
tcaagcctgg tagaattggc ttttctagca gaacctttcc aaaagtttta tattgagatt 2880
cataacaaca ccaagaattg attttgtagc caacattcat tcaatactgt tatatcagag 2940
gagtaggaga gaggaaacat ttgacttatc tggaaaagca aaatgtactt aagaataaga 3000
ataacatggt ccattcacct ttatgttata gatatgtctt tgtgtaaatc atttgttttg 3060
agttttcaaa gaatagccca ttgttcattc ttgtgctgta caatgaccac tgttattgtt 3120
actttgactt ttcagagcac acccttcctc tggtttttgt atatttattg atggatcaat 3180
aataatgagg aaagcatgat atgtatattg ctgagttgaa agcacttatt ggaaaatatt 3240
aaaaggctaa cattaaaaga ctaaaggaaa caga 3274
<![CDATA[ <210> 79]]>
<![CDATA[ <211> 5431]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 79]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg atttgaatat aacctgccgc tttgcaggtg tattccacgt 240
ggagaaaaat ggtcgctaca gcatctctcg gacggaggcc gctgacctct gcaaggcttt 300
caatagcacc ttgcccacaa tggcccagat ggagaaagct ctgagcatcg gatttgagac 360
ctgcaggtat gggttcatag aagggcacgt ggtgattccc cggatccacc ccaactccat 420
ctgtgcagca aacaacacag gggtgtacat cctcacatcc aacacctccc agtatgacac 480
atattgcttc aatgcttcag ctccacctga agaagattgt acatcagtca cagacctgcc 540
caatgccttt gatggaccaa ttaccataac tattgttaac cgtgatggca cccgctatgt 600
ccagaaagga gaatacagaa cgaatcctga agacatctac cccagcaacc ctactgatga 660
tgacgtgagc agcggctcct ccagtgaaag gagcagcact tcaggaggtt acatctttta 720
caccttttct actgtacacc ccatcccaga cgaagacagt ccctggatca ccgacagcac 780
agacagaatc cctgctacca ctttgatgag cactagtgct acagcaactg agacagcaac 840
caagaggcaa gaaacctggg atggttttc atggttgttt ctaccatcag agtcaaagaa 900
tcatcttcac acaacaacac aaatggctgg tacgtcttca aataccatct cagcaggctg 960
ggagccaaat gaagaaaatg aagatgaaag aagacagacac ctcagttttt ctggatcagg 1020
cattgatgat gatgaagatt ttatctccag caccatttca accacacac gggcttttga 1080
ccacacaaaa cagaaccagg actggaccca gtggaaccca agccatcaa atccggaagt 1140
gctacttcag acaaccacaa ggatgactga tgtagacaga aatggcacca ctgcttatga 1200
aggaaactgg aacccagaag cacaccctcc cctcattcac catgagcatc atgaggaaga 1260
agagacccca cattctacaa gcacaatcca ggcaactcct agtagtacaa cggaagaaac 1320
agctacccag aaggaacagt ggtttggcaa cagatggcat gagggatatc gccaaacacc 1380
caaagaagac tcccattcga caacagggac agctgcagcc tcagctcata ccagccatcc 1440
aatgcaagga aggacaacac caagcccaga ggacagttcc tggactgatt tcttcaaccc 1500
aatctcacac cccatgggac gaggtcatca agcaggaaga aggatggata tggactccag 1560
tcatagtata acgcttcagc ctactgcaaa tccaaacaca ggtttggtgg aagatttgga 1620
caggacagga cctctttcaa tgacaacgca gcagagtaat tctcagagct tctctacatc 1680
acatgaaggc ttggaagaag ataaagacca tccaacaact tctactctga catcaagcaa 1740
taggaatgat gtcacaggtg gaagaagaga cccaaatcat tctgaaggct caactacttt 1800
actggaaggt tatacctctc attacccaca cacgaaggaa agcaggacct tcatcccagt 1860
gacctcagct aagactgggt cctttggagt tactgcagtt actgttggag attccaactc 1920
taatgtcaat cgttccttat caggagacca agaacacattc caccccagtg gggggtccca 1980
taccactcat ggatctgaat cagatggaca ctcacatggg agtcaagaag gtggagcaaa 2040
cacaacctct ggtcctataa ggacacccca aattccagaa tggctgatca tcttggcatc 2100
cctcttggcc ttggctttga ttcttgcagt ttgcattgca gtcaacagtc gaagaaggtg 2160
tgggcagaag aaaaagctag tgatcaacag tggcaatgga gctgtggagg acagaaagcc 2220
aagtggactc aacggagagg ccagcaagtc tcaggaaatg gtgcatttgg tgaacaagga 2280
gtcgtcagaa actccagacc agtttatgac agctgatgag acaaggaacc tgcagaatgt 2340
ggacatgaag attggggtgt aacacctaca ccattatctt ggaaagaaac aaccgttgga 2400
aacataacca ttacaggggag ctgggacact taacagatgc aatgtgctac tgattgtttc 2460
attgcgaatc ttttttagca taaaattttc tactcttttt gttttttgtg ttttgttctt 2520
taaagtcagg tccaatttgt aaaaacagca ttgctttctg aaattagggc ccaattaata 2580
atcagcaaga atttgatcgt tccagttccc acttggaggc ctttcatccc tcgggtgtgc 2640
tatggatggc ttctaacaaa aactacacat atgtattcct gatcgccaac ctttccccca 2700
ccagctaagg aatttccca gggttaatag ggcctggtcc ctgggagga atttgaatgg 2760
gtccattttg cccttccata gcctaatccc tgggcattgc tttccactga ggttgggggt 2820
tggggtgtac tagttacaca tcttcaacag accccctcta gaaatttttc agatgcttct 2880
gggagacacc caaagggtga agctatttat ctgtagtaaa ctatttatct gtgtttttga 2940
aatattaaac cctggatcag tcctttgatc agtataattt tttaaagtta ctttgtcaga 3000
ggcacaaaag ggtttaaact gattcataat aaatatctgt acttcttcga tcttcacctt 3060
ttgtgctgtg attcttcagt ttctaaacca gcactgtctg ggtccctaca atgtatcagg 3120
aagagctgag aatggtaagg agactcttct aagtcttcat ctcagagacc ctgagttccc 3180
actcagaccc actcagccaa atctcatgga agaccaagga gggcagcact gtttttgttt 3240
tttgtttttt gttttttttt tttgacactg tccaaaggtt ttccatcctg tcctggaatc 3300
agagttggaa gctgaggagc ttcagcctct tttatggttt aatggccacc tgttctctcc 3360
tgtgaaaggc tttgcaaagt cacattaagt ttgcatgacc tgttatccct ggggccctat 3420
ttcatagagg ctggccctat tagtgatttc caaaaacaat atggaagtgc cttttgatgt 3480
cttacaataa gagaagaagc caatggaaat gaaagagatt ggcaaaagggg aaggatgatg 3540
ccatgtagat cctgtttgac atttttatgg ctgtatttgt aaacttaaac acaccagtgt 3600
ctgttcttga tgcagttgct atttaggatg agttaagtgc ctggggagtc cctcaaaagg 3660
ttaaagggat tcccatcatt ggaatcttat caccagatag gcaagtttat gaccaaacaa 3720
gagagtactg gctttatcct ctaacctcat attttctccc acttggcaag tcctttgtgg 3780
catttatca tcagtcaggg tgtccgattg gtcctagaac ttccaaaggc tgcttgtcat 3840
agaagccatt gcatctataa agcaacggct cctgttaaat ggtatctcct ttctgaggct 3900
cctactaaaa gtcatttgtt acctaaactt atgtgcttaa caggcaatgc ttctcagacc 3960
acaaagcaga aagaagaagaaaagctcctg actaaatcag ggctgggctt agacagagtt 4020
gatctgtaga atatctttaa aggagagatg tcaactttct gcactattcc cagcctctgc 4080
tcctccctgt ctacccctctc ccctccctct ctccctccac ttcaccccac aatcttgaaa 4140
aacttccttt ctcttctgtg aacatcattg gccagatcca ttttcagtgg tctggatttc 4200
tttttatttt cttttcaact tgaaagaaac tggacattag gccactatgt gttgttactg 4260
ccactagtgt tcaagtgcct cttgttttcc cagagatttc ctgggtctgc cagaggccca 4320
gacaggctca ctcaagctct ttaactgaaa agcaacaagc cactccagga caaggttcaa 4380
aatggttaca acagcctcta cctgtcgccc cagggagaaa ggggtagtga tacaagtctc 4440
atagccagag atggttttcc actccttcta gatattccca aaaagaggct gagacaggag 4500
gttattttca attttatttt ggaattaaat acttttttcc ctttattact gttgtagtcc 4560
ctcacttgga tatacctctg ttttcacgat agaaataagg gaggtctaga gcttctattc 4620
cttggccatt gtcaacggag agctggccaa gtcttcacaa acccttgcaa cattgcctga 4680
agtttatgga ataagatgta ttctcactcc cttgatctca agggcgtaac tctggaagca 4740
cagcttgact aacacgtcatt tttaccaatg attttcaggt gacctgggct aagtcattta 4800
aactgggtct ttataaaagt aaaaggccaa catttaatta ttttgcaaag caacctaaga 4860
gctaaagatg taatttttct tgcaattgta aatcttttgt gtctcctgaa gacttccctt 4920
aaaattagct ctgagtgaaa aatcaaaaga gacaaaagac atcttcgaat ccatatttca 4980
agcctggtag aattggcttt tctagcagaa cctttccaaa agttttatat tgagattcat 5040
aacaacacca agaattgatt ttgtagccaa cattcattca atactgttat atcagaggag 5100
taggagagag gaaacatttg acttatctgg aaaagcaaaa tgtacttaag aataagaata 5160
acatggtcca ttcaccttta tgttatagat atgtctttgt gtaaatcatt tgttttgagt 5220
tttcaaagaa tagcccattg ttcattcttg tgctgtacaa tgaccactgt tattgttact 5280
ttgacttttc agagcacacc cttcctctgg tttttgtata tttattgatg gatcaataat 5340
aatgaggaaa gcatgatatg tatattgctg agttgaaagc acttattgga aaatattaaa 5400
aggctaacat taaaagacta aaggaaacag a 5431
<![CDATA[ <210> 80]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 80]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile
20
<![CDATA[ <210> 81]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 81]]>
Leu Asn Ile Thr Cys Arg Phe Ala Gly Val Phe His Val Glu Lys Asn
1 5 10 15
Gly Arg Tyr Ser Ile Ser Arg Thr Glu Ala Ala Asp Leu Cys Lys Ala
20 25 30
Phe Asn Ser Thr Leu Pro Thr Met Ala Gln Met Glu Lys Ala Leu Ser
35 40 45
Ile Gly Phe Glu Thr Cys
50
<![CDATA[ <210> 82]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 82]]>
Tyr Gly Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn
1 5 10 15
Ser Ile Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn
20 25 30
Thr Ser Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser
35 40
<![CDATA[ <210> 83]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 83]]>
Pro Pro Glu Glu Asp Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe
1 5 10 15
Asp Gly Pro Ile Thr Ile
20
<![CDATA[ <210> 84]]>
<![CDATA[ <211> 76]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 84]]>
Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu Tyr Arg
1 5 10 15
Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp Asp Val
20 25 30
Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly Tyr Ile
35 40 45
Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp Ser Pro
50 55 60
Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr
65 70 75
<![CDATA[ <210> 85]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 85]]>
Leu Met Ser Thr Ser Ala Thr Ala Thr Glu Thr Ala Thr Lys Arg Gln
1 5 10 15
Glu Thr Trp Asp Trp Phe Ser Trp Leu Phe Leu Pro Ser Glu Ser Lys
20 25 30
Asn His Leu His Thr Thr Thr Gln Met Ala
35 40
<![CDATA[ <210> 86]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 86]]>
Thr Ser Ser Asn Thr Ile Ser Ala Gly Trp Glu Pro Asn Glu Glu Asn
1 5 10 15
Glu Asp Glu Arg Asp Arg His Leu Ser Phe Ser Gly Ser Gly Ile Asp
20 25 30
Asp Asp Glu Asp Phe Ile Ser Ser Thr
35 40
<![CDATA[ <210> 87]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 87]]>
Ser Thr Thr Pro Arg Ala Phe Asp His Thr Lys Gln Asn Gln Asp Trp
1 5 10 15
Thr Gln Trp Asn Pro Ser His Ser Asn Pro Glu Val Leu Leu Gln Thr
20 25 30
Thr Thr Arg Met Thr
35
<![CDATA[ <210> 88]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 88]]>
Val Asp Arg Asn Gly Thr Thr Ala Tyr Glu Gly Asn Trp Asn Pro Glu
1 5 10 15
Ala His Pro Pro Leu Ile His His Glu His His Glu Glu Glu Glu Thr
20 25 30
Pro His Ser Thr Ser Thr
35
<![CDATA[ <210> 89]]>
<![CDATA[ <211> 4]]>2
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 89]]>
Gln Ala Thr Pro Ser Ser Thr Thr Glu Glu Thr Ala Thr Gln Lys Glu
1 5 10 15
Gln Trp Phe Gly Asn Arg Trp His Glu Gly Tyr Arg Gln Thr Pro Lys
20 25 30
Glu Asp Ser His Ser Thr Thr Gly Thr Ala
35 40
<![CDATA[ <210> 90]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 90]]>
Ala Ser Ala His Thr Ser His Pro Met Gln Gly Arg Thr Thr Pro Ser
1 5 10 15
Pro Glu Asp Ser Ser Trp Thr Asp Phe Phe Asn Pro Ile Ser His Pro
20 25 30
Met Gly Arg Gly His Gln Ala Gly Arg Arg Met
35 40
<![CDATA[ <210> 91]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 91]]>
Met Asp Ser Ser His Ser Ile Thr Leu Gln Pro Thr Ala Asn Pro Asn
1 5 10 15
Thr Gly Leu Val Glu Asp Leu Asp Arg Thr Gly Pro Leu Ser Met Thr
20 25 30
Thr
<![CDATA[ <210> 92]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 92]]>
Gln Ser Asn Ser Gln Ser Phe Ser Thr Ser His Glu Gly Leu Glu Glu
1 5 10 15
Asp Lys Asp His Pro Thr Thr Ser Ser Thr Leu Thr Ser Ser
20 25
<![CDATA[ <210> 93]]>
<![CDATA[ <211> ]]> 67
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 93]]>
Arg Asn Asp Val Thr Gly Gly Arg Arg Asp Pro Asn His Ser Glu Gly
1 5 10 15
Ser Thr Thr Leu Leu Glu Gly Tyr Thr Ser His Tyr Pro His Thr Lys
20 25 30
Glu Ser Arg Thr Phe Ile Pro Val Thr Ser Ala Lys Thr Gly Ser Phe
35 40 45
Gly Val Thr Ala Val Thr Val Gly Asp Ser Asn Ser Asn Val Asn Arg
50 55 60
Ser Leu Ser
65
<![CDATA[ <210> 94]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 94]]>
Asp Gln Asp Thr Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly
1 5 10 15
Ser Glu Ser Asp
20
<![CDATA[ <210> 95]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 95]]>
His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr Thr Ser Gly Pro
1 5 10 15
Ile Arg Thr Pro Gln Ile Pro
20
<![CDATA[ <210> 96]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 96]]>
Trp Leu Ile Ile Leu Ala Ser Leu Leu Ala Leu Ala Leu Ile Leu Ala
1 5 10 15
Val Cys Ile Ala Val Asn Ser Arg Arg
20 25
<![CDATA[ <210> 97]]>
<![CDATA[ <211> 67]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 97]]>
Cys Gly Gln Lys Lys Lys Leu Val Ile Asn Ser Gly Asn Gly Ala Val
1 5 10 15
Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu Ala Ser Lys Ser Gln
20 25 30
Glu Met Val His Leu Val Asn Lys Glu Ser Ser Glu Thr Pro Asp Gln
35 40 45
Phe Met Thr Ala Asp Glu Thr Arg Asn Leu Gln Asn Val Asp Met Lys
50 55 60
Ile Gly Val
65
<![CDATA[ <210> 98]]>
<![CDATA[ <211> 742]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 98]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30
Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45
Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met Ala
50 55 60
Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly
65 70 75 80
Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser Ile
85 90 95
Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110
Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125
Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro Ile Thr
130 135 140
Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu
145 150 155 160
Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175
Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190
Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205
Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr Thr Leu
210 215 220
Met Ser Thr Ser Ala Thr Ala Thr Glu Thr Ala Thr Lys Arg Gln Glu
225 230 235 240
Thr Trp Asp Trp Phe Ser Trp Leu Phe Leu Pro Ser Glu Ser Lys Asn
245 250 255
His Leu His Thr Thr Thr Gln Met Ala Gly Thr Ser Ser Asn Thr Ile
260 265 270
Ser Ala Gly Trp Glu Pro Asn Glu Glu Asn Glu Asp Glu Arg Asp Arg
275 280 285
His Leu Ser Phe Ser Gly Ser Gly Ile Asp Asp Asp Glu Asp Phe Ile
290 295 300
Ser Ser Thr Ile Ser Thr Thr Pro Arg Ala Phe Asp His Thr Lys Gln
305 310 315 320
Asn Gln Asp Trp Thr Gln Trp Asn Pro Ser His Ser Asn Pro Glu Val
325 330 335
Leu Leu Gln Thr Thr Thr Arg Met Thr Asp Val Asp Arg Asn Gly Thr
340 345 350
Thr Ala Tyr Glu Gly Asn Trp Asn Pro Glu Ala His Pro Pro Leu Ile
355 360 365
His His Glu His His Glu Glu Glu Glu Thr Pro His Ser Thr Ser Thr
370 375 380
Ile Gln Ala Thr Pro Ser Ser Thr Thr Glu Glu Thr Ala Thr Gln Lys
385 390 395 400
Glu Gln Trp Phe Gly Asn Arg Trp His Glu Gly Tyr Arg Gln Thr Pro
405 410 415
Lys Glu Asp Ser His Ser Thr Thr Gly Thr Ala Ala Ala Ser Ala His
420 425 430
Thr Ser His Pro Met Gln Gly Arg Thr Thr Pro Ser Pro Glu Asp Ser
435 440 445
Ser Trp Thr Asp Phe Phe Asn Pro Ile Ser His Pro Met Gly Arg Gly
450 455 460
His Gln Ala Gly Arg Arg Met Asp Met Asp Ser Ser His Ser Ile Thr
465 470 475 480
Leu Gln Pro Thr Ala Asn Pro Asn Thr Gly Leu Val Glu Asp Leu Asp
485 490 495
Arg Thr Gly Pro Leu Ser Met Thr Thr Gln Gln Ser Asn Ser Gln Ser
500 505 510
Phe Ser Thr Ser His Glu Gly Leu Glu Glu Asp Lys Asp His Pro Thr
515 520 525
Thr Ser Thr Leu Thr Ser Ser Asn Arg Asn Asp Val Thr Gly Gly Arg
530 535 540
Arg Asp Pro Asn His Ser Glu Gly Ser Thr Thr Leu Leu Glu Gly Tyr
545 550 555 560
Thr Ser His Tyr Pro His Thr Lys Glu Ser Arg Thr Phe Ile Pro Val
565 570 575
Thr Ser Ala Lys Thr Gly Ser Phe Gly Val Thr Ala Val Thr Val Gly
580 585 590
Asp Ser Asn Ser Asn Val Asn Arg Ser Leu Ser Gly Asp Gln Asp Thr
595 600 605
Phe His Pro Ser Gly Gly Ser His Thr Thr His Gly Ser Glu Ser Asp
610 615 620
Gly His Ser His Gly Ser Gln Glu Gly Gly Ala Asn Thr Thr Ser Gly
625 630 635 640
Pro Ile Arg Thr Pro Gln Ile Pro Glu Trp Leu Ile Ile Leu Ala Ser
645 650 655
Leu Leu Ala Leu Ala Leu Ile Leu Ala Val Cys Ile Ala Val Asn Ser
660 665 670
Arg Arg Arg Cys Gly Gln Lys Lys Lys Leu Val Ile Asn Ser Gly Asn
675 680 685
Gly Ala Val Glu Asp Arg Lys Pro Ser Gly Leu Asn Gly Glu Ala Ser
690 695 700
Lys Ser Gln Glu Met Val His Leu Val Asn Lys Glu Ser Ser Ser Glu Thr
705 710 715 720
Pro Asp Gln Phe Met Thr Ala Asp Glu Thr Arg Asn Leu Gln Asn Val
725 730 735
Asp Met Lys Ile Gly Val
740
<![CDATA[ <210> 99]]>
<![CDATA[ <211> 800]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 99]]>
ctcattgccc agcggacccc agcctctgcc aggttcggtc cgccatcctc gtcccgtcct 60
ccgccggccc ctgccccgcg cccagggatc ctccagctcc tttcgcccgc gccctccgtt 120
cgctccggac accatggaca agttttggtg gcacgcagcc tggggactct gcctcgtgcc 180
gctgagcctg gcgcagatcg atttgaatat aacctgccgc tttgcaggtg tattccacgt 240
ggagaaaaat ggtcgctaca gcatctctcg gacggaggcc gctgacctct gcaaggcttt 300
caatagcacc ttgcccacaa tggcccagat ggagaaagct ctgagcatcg gatttgagac 360
ctgcaggtat gggttcatag aagggcacgt ggtgattccc cggatccacc ccaactccat 420
ctgtgcagca aacaacacag gggtgtacat cctcacatcc aacacctccc agtatgacac 480
atattgcttc aatgcttcag ctccacctga agaagattgt acatcagtca cagacctgcc 540
caatgccttt gatggaccaa ttaccataac tattgttaac cgtgatggca cccgctatgt 600
ccagaaagga gaatacagaa cgaatcctga agacatctac cccagcaacc ctactgatga 660
tgacgtgagc agcggctcct ccagtgaaag gagcagcact tcaggaggtt acatctttta 720
caccttttct actgtacacc ccatcccaga cgaagacagt ccctggatca ccgacagcac 780
agacagaatc cctgctacca 800
<![CDATA[ <210> 100]]>
<![CDATA[ <211> 222]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 100]]>
Met Asp Lys Phe Trp Trp His Ala Ala Trp Gly Leu Cys Leu Val Pro
1 5 10 15
Leu Ser Leu Ala Gln Ile Asp Leu Asn Ile Thr Cys Arg Phe Ala Gly
20 25 30
Val Phe His Val Glu Lys Asn Gly Arg Tyr Ser Ile Ser Arg Thr Glu
35 40 45
Ala Ala Asp Leu Cys Lys Ala Phe Asn Ser Thr Leu Pro Thr Met Ala
50 55 60
Gln Met Glu Lys Ala Leu Ser Ile Gly Phe Glu Thr Cys Arg Tyr Gly
65 70 75 80
Phe Ile Glu Gly His Val Val Ile Pro Arg Ile His Pro Asn Ser Ile
85 90 95
Cys Ala Ala Asn Asn Thr Gly Val Tyr Ile Leu Thr Ser Asn Thr Ser
100 105 110
Gln Tyr Asp Thr Tyr Cys Phe Asn Ala Ser Ala Pro Pro Glu Glu Asp
115 120 125
Cys Thr Ser Val Thr Asp Leu Pro Asn Ala Phe Asp Gly Pro Ile Thr
130 135 140
Ile Thr Ile Val Asn Arg Asp Gly Thr Arg Tyr Val Gln Lys Gly Glu
145 150 155 160
Tyr Arg Thr Asn Pro Glu Asp Ile Tyr Pro Ser Asn Pro Thr Asp Asp
165 170 175
Asp Val Ser Ser Gly Ser Ser Ser Glu Arg Ser Ser Thr Ser Gly Gly
180 185 190
Tyr Ile Phe Tyr Thr Phe Ser Thr Val His Pro Ile Pro Asp Glu Asp
195 200 205
Ser Pro Trp Ile Thr Asp Ser Thr Asp Arg Ile Pro Ala Thr
210 215 220
<![CDATA[ <210> 101]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 101]]>
000
<![CDATA[ <210> 102]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 102]]>
000
<![CDATA[ <210> 103]]>
<![CDATA[ <211> 552]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 103]]>
agatgcagta gccgaaactg cgcggaggca cagaggccgg ggagagcgtt ctgggtccga 60
gggtccaggt aggggttgag ccaccacatctg accgcaagct gcgtcgtgtc gccggttctg 120
caggcaccat gagccaggac accgaggtgg atatgaagga ggtggagctg aatgagttag 180
agcccgagaa gcagccgatg aacgcggcgt ctggggcggc catgtccctg gcgggagccg 240
agaagaatgg tctggtgaag atcaaggtgg cggaagacga ggcggaggcg gcagccgcgg 300
ctaagttcac gggcctgtcc aaggaggagc tgctgaaggt ggcaggcagc cccggctggg 360
tacgcacccg ctgggcactg ctgctgctct tctggctcgg ctggctcggc atgcttgctg 420
gtgccgtggt cataatcgtg cgagcgccgc gttgtcgcga gctaccggcg cagaagtggt 480
ggcacacggg cgccctctac cgcatcggcg accttcaggc cttccagggc cacggcgcgg 540
gcaacctggc gg 552
<![CDATA[ <210> 104]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 104]]>
gtctgaaggg gcgtctcgat tacctgagct ctctgaaggt gaagggcctt gtgctgggtc 60
caattcacaa gaaccagaag gatgatgtcg ctcagactga cttgctgcag atcgacccca 120
atttggctc caaggaagat tttgacagtc tcttgcaatc ggctaaaaaa aaga 174
<![CDATA[ <210> 105]]>
<![CDATA[ <211> 91]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 105]]>
gcatccgtgt cattctggac cttactccca actaccgggg tgagaactcg tggttctcca 60
ctcaggttga cactgtggcc accaaggtga a 91
<![CDATA[ <210> 106]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 106]]>
gatgctctgg agttttggct gcaagctggc gtggatgggt tccaggttcg ggacatagag 60
aatctgaag 69
<![CDATA[ <210> 107]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 107]]>
gatgcatcct cattcttggc tgagtggcaa aatatcacca agggcttcag tgaagacag 59
<![CDATA[ <210> 108]]>
<![CDATA[ <211> 181]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 108]]>
gctcttgatt gcggggacta actcctccga ccttcagcag atcctgagcc tactcgaatc 60
caacaaagac ttgctgttga ctagctcata cctgtctgat tctggttcta ctggggagca 120
tacaaaatcc ctagtcacac agtatttgaa tgccactggc aatcgctggt gcagctggag 180
t 181
<![CDATA[ <210> 109]]>
<![CDATA[ <211> 144]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 109]]>
ttgtctcagg caaggctcct gacttccttc ttgccggctc aacttctccg actctaccag 60
ctgatgctct tcaccctgcc agggacccct gttttcagct acggggatga gattggcctg 120
gatgcagctg cccttcctgg acag 144
<![CDATA[ <210> 110]]>
<![CDATA[ <211> 84]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 110]]>
cctatggagg ctccagtcat gctgtggggat gagtccagct tccctgacat cccaggggct 60
gtaagtgcca acatgactgt gaag 84
<![CDATA[ <210> 111]]>
<![CDATA[ <211> 530]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 111]]>
ggccagagtg aagaccctgg ctccctcctt tccttgttcc ggcggctgag tgaccagcgg 60
agtaaggagc gctccctact gcatggggac ttccacgcgt tctccgctgg gcctggactc 120
ttctcctata tccgccactg ggaccagaat gagcgttttc tggtagtgct taactttggg 180
gatgtgggcc tctcggctgg actgcaggcc tccgacctgc ctgccagcgc cagcctgcca 240
gccaaggctg acctcctgct cagcacccag ccaggccgtg aggagggctc ccctcttgag 300
ctggaacgcc tgaaactgga gcctcacgaa gggctgctgc tccgcttccc ctacgcggcc 360
tgacttcagc ctgacatgga cccactaccc ttctcctttc cttcccaggc cctttggctt 420
ctgatttttc tcttttttaa aaacaaacaa acaaactgtt gcagattatg agtgaacccc 480
caaatagggt gttttctgcc ttcaaataaa agtcacccct gcatggtgaa 530
<![CDATA[ <210> 112]]>
<![CDATA[ <211> 1884]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 112]]>
agatgcagta gccgaaactg cgcggaggca cagaggccgg ggagagcgtt ctgggtccga 60
gggtccaggt aggggttgag ccaccacatctg accgcaagct gcgtcgtgtc gccggttctg 120
caggcaccat gagccaggac accgaggtgg atatgaagga ggtggagctg aatgagttag 180
agcccgagaa gcagccgatg aacgcggcgt ctggggcggc catgtccctg gcgggagccg 240
agaagaatgg tctggtgaag atcaaggtgg cggaagacga ggcggaggcg gcagccgcgg 300
ctaagttcac gggcctgtcc aaggaggagc tgctgaaggt ggcaggcagc cccggctggg 360
tacgcacccg ctgggcactg ctgctgctct tctggctcgg ctggctcggc atgcttgctg 420
gtgccgtggt cataatcgtg cgagcgccgc gttgtcgcga gctaccggcg cagaagtggt 480
ggcacacggg cgccctctac cgcatcggcg accttcaggc cttccagggc cacggcgcgg 540
gcaacctggc gggtctgaag gggcgtctcg attacctgag ctctctgaag gtgaagggcc 600
ttgtgctggg tccaattcac aagaaccaga aggatgatgt cgctcagact gacttgctgc 660
agatcgaccc caattttggc tccaaggaag attttgacag tctcttgcaa tcggctaaaa 720
aaaagagcat ccgtgtcatt ctggacctta ctcccaacta ccggggtgag aactcgtggt 780
tctccactca ggttgacact gtggccacca aggtgaagat gctctggagt tttggctgca 840
agctggcgtg gatgggttcc aggttcggga catagagaat ctgaaggatg catcctcatt 900
cttggctgag tggcaaaata tcaccaaggg cttcagtgaa gacaggctct tgattgcggg 960
gactaactcc tccgaccttc agcagatcct gagcctactc gaatccaaca aagacttgct 1020
gttgactagc tcatacctgt ctgattctgg ttctactggg gagcatacaa aatccctagt 1080
cacacagtat ttgaatgcca ctggcaatcg ctggtgcagc tggagtttgt ctcaggcaag 1140
gctcctgact tccttcttgc cggctcaact tctccgactc taccagctga tgctcttcac 1200
cctgccaggg acccctgttt tcagctacgg ggatgagatt ggcctggatg cagctgccct 1260
tcctggacag cctatgggagg ctccagtcat gctgtgggat gagtccagct tccctgacat 1320
cccaggggct gtaagtgcca acatgactgt gaagggccag agtgaagacc ctggctccct 1380
cctttccttg ttccggcggc tgagtgacca gcggagtaag gagcgctccc tactgcatgg 1440
ggacttccac gcgttctccg ctgggcctgg actcttctcc tatatccgcc actgggacca 1500
gaatgagcgt tttctggtag tgcttaactt tggggatgtg ggcctctcgg ctggactgca 1560
ggcctccgac ctgcctgcca gcgccagcct gccagccaag gctgacctcc tgctcagcac 1620
ccagccaggc cgtgaggagg gctcccctct tgagctggaa cgcctgaaac tggagcctca 1680
cgaagggctg ctgctccgct tcccctacgc ggcctgactt cagcctgaca tggacccact 1740
acccttctcc tttccttccc aggccctttg gcttctgatt tttctctttt ttaaaaacaa 1800
acaaacaaac tgttgcagat tatgagtgaa cccccaaata gggtgttttc tgccttcaaa 1860
taaaagtcac ccctgcatgg tgaa 1884
<![CDATA[ <210> 113]]>
<![CDATA[ <211> 141]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 113]]>
Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn Glu
1 5 10 15
Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala Met
20 25 30
Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val Ala
35 40 45
Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu Ser
50 55 60
Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg Thr
65 70 75 80
Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met Leu
85 90 95
Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu Leu
100 105 110
Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly Asp
115 120 125
Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala
130 135 140
<![CDATA[ <210> 114]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 114]]>
Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu
1 5 10 15
Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr
20 25 30
Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp
35 40 45
Ser Leu Leu Gln Ser Ala Lys Lys Lys Lys
50 55
<![CDATA[ <210> 115]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 115]]>
Ile Arg Val Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser
1 5 10 15
Trp Phe Ser Thr Gln Val Asp Thr Val Ala Thr Lys Val
20 25
<![CDATA[ <210> 116]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 116]]>
Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val
1 5 10 15
Arg Asp Ile Glu Asn Leu Lys
20
<![CDATA[ <210> 117]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 117]]>
Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe
1 5 10 15
Ser Glu Asp
<![CDATA[ <210> 118]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 118]]>
Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu Ser
1 5 10 15
Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu Ser
20 25 30
Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln Tyr
35 40 45
Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser
50 55 60
<![CDATA[ <210> 119]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 119]]>
Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu
1 5 10 15
Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe
20 25 30
Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln
35 40 45
<![CDATA[ <210> 120]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 120]]>
Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp
1 5 10 15
Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys
20 25
<![CDATA[ <210> 121]]>
<![CDATA[ <211> 120]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 121]]>
Gly Gln Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu
1 5 10 15
Ser Asp Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His
20 25 30
Ala Phe Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp
35 40 45
Gln Asn Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu
50 55 60
Ser Ala Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro
65 70 75 80
Ala Lys Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly
85 90 95
Ser Pro Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu
100 105 110
Leu Leu Arg Phe Pro Tyr Ala Ala
115 120
<![CDATA[ <210> 122]]>
<![CDATA[ <211> 529]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 122]]>
Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn Glu
1 5 10 15
Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala Met
20 25 30
Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val Ala
35 40 45
Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu Ser
50 55 60
Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg Thr
65 70 75 80
Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met Leu
85 90 95
Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu Leu
100 105 110
Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly Asp
115 120 125
Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala Gly Leu Lys
130 135 140
Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu Val Leu
145 150 155 160
Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr Asp Leu
165 170 175
Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp Ser Leu
180 185 190
Leu Gln Ser Ala Lys Lys Lys Ser Ile Arg Val Ile Leu Asp Leu Thr
195 200 205
Pro Asn Tyr Arg Gly Glu Asn Ser Trp Phe Ser Thr Gln Val Asp Thr
210 215 220
Val Ala Thr Lys Val Lys Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly
225 230 235 240
Val Asp Gly Phe Gln Val Arg Asp Ile Glu Asn Leu Lys Asp Ala Ser
245 250 255
Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe Ser Glu Asp
260 265 270
Arg Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu
275 280 285
Ser Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu
290 295 300
Ser Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln
305 310 315 320
Tyr Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser Leu Ser Gln
325 330 335
Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu Arg Leu Tyr
340 345 350
Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe Ser Tyr Gly
355 360 365
Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln Pro Met Glu
370 375 380
Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp Ile Pro Gly
385 390 395 400
Ala Val Ser Ala Asn Met Thr Val Lys Gly Gln Ser Glu Asp Pro Gly
405 410 415
Ser Leu Leu Ser Leu Phe Arg Arg Leu Ser Asp Gln Arg Ser Lys Glu
420 425 430
Arg Ser Leu Leu His Gly Asp Phe His Ala Phe Ser Ala Gly Pro Gly
435 440 445
Leu Phe Ser Tyr Ile Arg His Trp Asp Gln Asn Glu Arg Phe Leu Val
450 455 460
Val Leu Asn Phe Gly Asp Val Gly Leu Ser Ala Gly Leu Gln Ala Ser
465 470 475 480
Asp Leu Pro Ala Ser Ala Ser Leu Pro Ala Lys Ala Asp Leu Leu Leu
485 490 495
Ser Thr Gln Pro Gly Arg Glu Glu Gly Ser Pro Leu Glu Leu Glu Arg
500 505 510
Leu Lys Leu Glu Pro His Glu Gly Leu Leu Leu Arg Phe Pro Tyr Ala
515 520 525
Ala
<![CDATA[ <210> 123]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 123]]>
000
<![CDATA[ <210> 124]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 124]]>
000
<![CDATA[ <210> 125]]>
<![CDATA[ <211> 187]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 125]]>
agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60
cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120
tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180
accacag 187
<![CDATA[ <210> 126]]>
<![CDATA[ <211> 178]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 126]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120
ggaatgaatt gaatcgaaaa aacaaaccac aaaatatcaa gatacaaaaa aagccagg 178
<![CDATA[ <210> 127]]>
<![CDATA[ <211> 122]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 127]]>
gaagtcagaa cttcgcatta acaaagcatc actggctgat tctggagagt atatgtgcaa 60
agtgatcagc aaattaggaa atgacagtgc ctctgccaat atcaccatcg tggaatcaaa 120
cg 122
<![CDATA[ <210> 128]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 128]]>
agatcatcac tggtatgcca gcctcaactg aaggagcata tgtgtcttca g 51
<![CDATA[ <210> 129]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 129]]>
agtctcccat tagaatatca gtatccacag aaggagcaaa tacttcttca t 51
<![CDATA[ <210> 130]]>
<![CDATA[ <211> 1]]>30
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 130]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttgtgcaa 130
<![CDATA[ <210> 131]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 131]]>
gtgcccaaat ggtttactg gtgatcgctg ccaaaactac gtaatggcca gcttctaca 59
<![CDATA[ <210> 132]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 132]]>
agcatcttgg gattgaattt atgg 24
<![CDATA[ <210> 133]]>
<![CDATA[ <211> 103]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 133]]>
aggcggagga gctgtaccag aagagagtgc tgaccataac cggcatctgc atcgccctcc 60
ttgtggtcgg catcatgtgt gtggtggcct actgcaaaac caa 103
<![CDATA[ <210> 134]]>
<![CDATA[ <211> 127]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 134]]>
gaaacagcgg aaaaagctgc atgaccgtct tcggcagagc cttcggtctg aacgaaacaa 60
tatgatgaac attgccaatg ggcctcacca tcctaaccca ccccccgaga atgtccagct 120
ggtgaat 127
<![CDATA[ <210> 135]]>
<![CDATA[ <211> 131]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 135]]>
caatacgtat ctaaaaacgt catctccagt gagcatattg ttgagagaga agcagagaca 60
tccttttcca ccagtcacta tacttccaca gcccatcact ccactactgt cacccagact 120
cctagccaca g 131
<![CDATA[ <210> 136]]>
<![CDATA[ <211> 207]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 136]]>
ctggagcaac ggacacactg aaagcatcct ttccgaaagc cactctgtaa tcgtgatgtc 60
atccgtagaa aacagtaggc acagcagccc aactgggggc ccaagaggac gtcttaatgg 120
cacaggaggc cctcgtgaat gtaacagctt cctcaggcat gccagagaaa cccctgattc 180
ctaccgagac tctcctcata gtgaaag 207
<![CDATA[ <210> 137]]>
<![CDATA[ <211> 10299]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 137]]>
gtatgtgtca gccatgacca ccccggctcg tatgtcacct gtagatttcc acacgccaag 60
ctcccccaaa tcgccccctt cggaaatgtc tccacccgtg tccagcatga cggtgtccat 120
gccttccatg gcggtcagcc ccttcatgga agaagagaga cctctacttc tcgtgacacc 180
accaaggctg cgggagaaga agtttgacca tcaccctcag cagttcagct ccttccacca 240
caaccccgcg catgacagta acagcctccc tgctagcccc ttgaggatag tggaggatga 300
ggagtatgaa acgacccaag agtacgagcc agcccaagag cctgttaaga aactcgccaa 360
tagccggcgg gccaaaagaa ccaagcccaa tggccacatt gctaacagat tggaagtgga 420
cagcaacaca agctccccaga gcagtaactc agagagtgaa acagaagatg aaagagtagg 480
tgaagatacg cctttcctgg gcatacagaa ccccctggca gccagtcttg aggcaacacc 540
tgccttccgc ctggctgaca gcaggactaa cccagcaggc cgcttctcga cacaggaaga 600
aatccaggcc aggctgtcta gtgtaattgc taaccaagac cctattgctg tataaaacct 660
aaataaacac atagattcac ctgtaaaact ttattattata taataaagta ttccacctta 720
aattaaacaa tttattttat tttagcagtt ctgcaaatag aaaacaggaa aaaaactttt 780
ataaattaaa tatatgtatg taaaaatgtg ttatgtgcca tatgtagcaa ttttttacag 840
tatttcaaaa cgagaaagat atcaatggtg cctttatgtt atgttatgtc gagagcaagt 900
tttgtacagt tacagtgatt gcttttccac agtatttctg caaaacctct catagattca 960
gtttttgctg gcttcttgtg cattgcatta tgatgttgac tggatgtatg atttgcaaga 1020
cttgcaactg tccctctgtt tgcttgtagt agcacccgat cagtatgtct tgtaatggca 1080
catccatcca gatatgcctc tcttgtgtat gaagttttct ttgctttcag aatatgaaat 1140
gagttgtgtc tactctgcca gccaaaggtt tgcctcattg ggctctgaga taatagtaga 1200
tccaacagca tgctactatt aaatacagca agaaactgca ttaagtaatg ttaaatatta 1260
ggaagaaagt aatactgtga tttaaaaaaa actatattat taatcagaag acagcttgct 1320
cttactaaaa ggagctctca tttactttat ttgattttat ttttcttgac aaaaagcaac 1380
agttttaggg atagcttaga aaatgggttc tggcttgcta tcagggtaaa tctaacacct 1440
tacaagagga ctgagtgtca ctttctctct gggggaatga tccagcagct tatctagttg 1500
acaatcaaaa cacggctgat aaaggtgcaa tcatttctga catgtatttt tcactgattt 1560
tgaagctagt gattggttgt gtcttcttgg ctcaaaaaga agcatattac ggcacaaaaa 1620
gcccagccca gacagcacat gcagcatttt gtctgaaata cttctagagt caaacgtgcc 1680
tgctgtacat agcgatgact tgtcatcata gggaagtatt tccatcgtag agtgttcaga 1740
aggagtgact gtataggtgg agagaagctt agtgactccg ttgaaatttt aaaatgtgga 1800
tgaccacccc tttctccccc ttatttttct tttatctttc catgttgcct tgatcaggtc 1860
ataactatgc atgaacattt tttatcagga atggccgatg tgtatgtgat ttgtaatcac 1920
aagtaatgat tcatcaggaa atgtcaatcc tgttggaaag attgcacctt tacttgcaga 1980
agtgaccccc acctgtgtcc tgacctctcc atttacaggc tctctcaccc atttccccca 2040
cctcctttaa tttttgcttt actgtcataa agtaggacta agattggtct aagcattgca 2100
tgttcttttg tgatggtaaa tccaaaggaa ggcctataag tattaacatt tgaaataact 2160
gctaattcag gaaaatggaa gaaaaaaaat tatttgaaac acagaaccca tttcatggcc 2220
tgcctgatat ctgtgaaatc agggctggag ctttacttag gattcacatg gcctcctagg 2280
aaccatggga caaatgggaa acaggttatc gggggattca tgaagtcagt gagagtaatt 2340
gcttcttttt tgcgggtgaa ctgaatgtat ttcttcacca aatcttgatg ttaacaatta 2400
aaaagaagaa atgacatgca agtaggtctt agcagaaaaa tgcaggctgg gcatgagtca 2460
tgttgttacc ctcccacatg ctcctacaat ccacagagat gcctgtctgc aggttcttga 2520
agttattgtt agtatttggt atctcaaatt tttcgtcact gttcacatgc cactttctct 2580
gtgcacagtg gtatcctcat ttgcttttta acctacactg aggagtcttt gtcaggttgc 2640
actgattttc caattctgca gtaatgagta agctcacggc atggggaaga agacagtcag 2700
tccaatgaag ttctctaaat tattttaaca ttgcctttga aggccttgac tcatccttag 2760
ctatttcaat gaagaaattc ctaccatgaa tttaaaaccc taaaaattct gtttcaaatt 2820
ctttgggcat tggggtactc agatatccca ttgtggaaga attttaagaa taaatagaag 2880
tttctgttga gaaccatgag caacatgttt cttacaatga gaattgctat gcattttaaa 2940
attgcaaata tatatgaaaa ttgaagacaa gaggaaattg tatttctaac ttgattctga 3000
tcactcacag aggtggcata ttattatagt tgggacatcc tttgcaccct tcataaaaaa 3060
ggccagctga ctgctcagca tcacctgcca aggccactag atttgtgttt acaggggtat 3120
ctctgtgatg cttgtcacat cactcttgac cacctctgtt aataaattcc gacagtgcag 3180
tggcgatcgg agtgtgaact tatgttccca gcatatggaa agctatctta ggttttaagg 3240
tagtagaaat tgcccaggag tttgacagca actttgtttc ccgggtctaa aatcgtatcc 3300
cactgaggtg tatgcagtgg agcataatac atgcaaatac atgcaaaact ccttttgttt 3360
cacctaagat tcactttcta tcttactttc ccttcctgcc tagtgtgact tttgccccca 3420
agagtgcctg gacagcattc tagtttctac aaaatggtcc tctgtgtagg tgaatgtgtc 3480
ccaaacctgc tatcactttc ttgtttcagt gtgactgtct tgttagaggt gaagtttatc 3540
cagggtaact tgctcactaa ctattccttt ttatggcctg gggttaaagg gcgcatggct 3600
cacactggtg aaaataagga aggcctggtc ttatcttgta ttaataatac tggctgcatt 3660
ccaccagcca gagatttcta tctgcgaaga cctatgaaac actgaagaga aatgtaggca 3720
gaaggaaatg gccacatatc acaagttcta ttatatattc ttttgtaaat acatattgta 3780
tattacttgg atgttttctt atatcattta ctgtcttttt gagttaatgt cagtttttac 3840
tctctcaact tactatgtaa cattgtaaat aacataatgt cctttattat ttatatttaa 3900
gcatctaaca tatagagttg ttttcatata agtttaagat aaatgtcaaa aatatatgtt 3960
cttttgtttt tctttgcttt aaaattatgt atcttttcct tttctttttt ttaagaataa 4020
tttattgttc aggagaaaga atgtatatgt aactgaaact atctgaagaa tgcacattga 4080
aggccgtgag gtactgataa actaaagaat ttattattca aaatactaag caataagtaa 4140
ttgtgattta tttaaagttt tgtccatttt ccatgaaaga catactgcaa taaaaatgct 4200
actctgtgga gacctgggag tgttgctcag cagactacag cttcagtctg ttagaccagc 4260
acctttcatc tcattcccat agttatgcta atttaggatt gtgttccatg gaccccatga 4320
tcaccttgtc tatacgttgc ttcttgtctg tccattgctt ttgccacacc acctgttctc 4380
aaatcatctc ctccctacca atgctgttta tcactttctt ccttgttgaa gaggccaacac 4440
aaccagacag tactatgctt cctttttcct ccatacacaa taacagagag agaatattct 4500
agggcatgac tgcctggatc ctggctgttg ctatcttttg tagtggcagt aagaaactcc 4560
ttcagactaa tgaaaatgtc aacgtgccat tcaatcacga aaggtaacga aaaatgctct 4620
catggttcaa atagtccaat ggcccatagt ggcctaaaag gcagccagtt gacacctggc 4680
catgctaagc ttccttatac catccgctaa tgactttcca ttgggcccac aatttacgga 4740
ttcataattt taaaagagga gaaggccaag ttaggttcat tccccttat ctgtcaataa 4800
aacaaatcaa actcatgtct atctaactgc tcaggggagga gcctttgcat gagaaaattc 4860
tcatattcta agactgagtc atagaaatga gggtattact tttcttactg caattaacct 4920
aaacaaaagc catattttaa caaatagata tttgcatggt acccttcata tattccaagc 4980
atttcactca ttattccagg taggttaaga gcttctgaag tgtatgaagt aaaggtcagc 5040
aatcctttgg ggtgaacagt ggcctccttt gagtttggg ggtaacctga gacttcccac 5100
caatgtccac ctccatctgt gtacctaatt cctattacct agttatggct cctctaggat 5160
catttccaaa cactctggat gtccaggaaa tttaaattgt agcttttgac tgagctagtt 5220
tttcctattt atattaataa attttcaaaa atgcttgaaa tcttcacatt tgcaacaact 5280
ttagttttca tgcacataca aacacagaga gacaaaaatt ccaaacagac actctccaaa 5340
agccaccaca ctctttcact tgctctatag tcatttagcc aaccagccat gcagagaata 5400
tttaaaactt aaagattgag aattattct cagtttttgc tgaggctttg taacgaaatt 5460
gaacactata agcagctatt gtagtaattt tggttaaaat tgtttgcctg ggatatagta 5520
tttgaggcag aagcacgtgt gtgaaggagg tgaggtggtt tggaaagagt gaagactcgc 5580
agccagattg aatgtctgga taattactat aattctccct tcttggttga aaccatgttc 5640
tctcttgatt tttaaaccca ggctgcctct ggaaacaagc aaacctgagt ctttctaacc 5700
tgagtcttcc caatcattag atttcttttc tgtcctaacg atgaatgata aaaggacttg 5760
atgttcacaa tttggggtta taaggcaggt ctgaaatctg gagactcaag atgctggaag 5820
gagtggaaag tttcgatgac tttatatgaa tcactttgca ctctatgttt ggcttgtcct 5880
ctttgaaact gatttactaa aataaatgta agggaactat tactccaaaa gatttaacttg 5940
gcaggaaata ccaatacttt cagtttatga aagacaaaac tgtcttgttg ctacaggaag 6000
ctgcaatgtt cctaaccttt aaggttggtg ttgaataggg tggtcatgcc ctcccctgca 6060
ggtatcttta ggctcctgtt gacctcctgg tactataact gttcgtcttc tctgggtagc 6120
tattgatttt gaactttaac atgcttcaaa actttatca tcagggaaat aggaaaagag 6180
ttttgttacc tggaggaaat ctattgtgat ctacctgagc tttttaaaaa cagaccagga 6240
gaaggaaacc agtaattttt aaagaagaga cagagaatgg gataatagtt tcacccagga 6300
tctctttcta accctttccc ttcaaatgaa cttattggaa cagaattgga aagaagaaag 6360
gacatctctg cccaccccac aggatgccaa aaaggctaaa gaattacctc tgtagattta 6420
aacatcttta atggcttatg tatagatttg ctaatacaga gagaaatgaa ctattaaata 6480
aaaatcacat tttataatat ttttatggct taaaacatcc tttatctcct ttttgttctc 6540
tctacatgat atggtaagtg atgaggaaaa ttaggctca ggaaggttaa aatctttctt 6600
ggagttacac atctaagaga gctgcagagc tgacacttgt accccaggttt tctgactgca 6660
aatccagttt ctttctattg cgttcttccc ctttccctgc ctcaagcaga aacaggtttt 6720
ttattttcaa cctttatgta tacagtatgt tatgttacat ctacagctaa gtttcttttt 6780
agaagaatgt gagcccttct agctttggtt tagagtgatt ctagaagcca atttccttgg 6840
cttagtgatt ctatgcacct ttcctaaact tagctttcta aggaaatgaa gtgtacgagt 6900
gagaatgaat tcacaatttc gacatgtagg tagcatccta aagtgaaaag aggaggaaat 6960
ttgtggtcaa agcactctcc ccaccactta gaaacttact gactgtgggc agcttcctcc 7020
tccaagtttc cttcctgatt tacaagaccg tggtgtggtc aggattaaac ttgaatacat 7080
gtaaggaagc ctgaaagtgt ctaacacata gcgagtattc aaatgccacc ttctatttga 7140
tccttcccct ccagttccctt aagttttgga atctaggttt ctcagttcca aatggattga 7200
catttgcata tccccattgc acaatggatc aaataaactt tatgttatca tttctccaac 7260
atagtgccag taagcaaatc ctttttaata acaacagtat gttgagaaac atatcaccaa 7320
ataatatta actttgtagc tttgataagt tctttaggtt ttggttttgg ttttgttttc 7380
tgagacagggg tcttgatctg tcacccagac tggagtgcaa tggttcaatt ttagctcact 7440
gcaacctgta actcctgagc tcaagtgatc ctccccacctc agcctctcaa gtagctggga 7500
ctacaggtgt gctccaccat gcccagcgat ttttttttttttttttttttttttttttgg 7560
tagggacaag gtctcgctat gttgcccagg ctggtcttca attcctggcc ttaagtgatc 7620
ctcctgcctc ggcctcccaa agtgctggaa ttacaggcat gagccaccac ccataacttt 7680
atgtttgttt ttttgatgca gtataagttc agcttgcttc ttatgcagcc ataccatttc 7740
atgttaactc tgatttttag cagcttatta cattagtgtt ttattattaa tataatttta 7800
cagaaattta ctaaaccatg actctgtaga gttttaataa tactacctcc aaacatcatt 7860
gcaaacatct agaagaatga acaaaaatga tcttagatcg acagtatatc tgtttgtctt 7920
agtttctaca caggatgttc agacatattc catttcttta aaaaaaaaat atatatatat 7980
atatatatat ataggcctgg cacggtggct catgactgta atcccagcac tttgggaggc 8040
ttaggcaggc aaatcacctg aggtcaggag tttgagactg gcctgaccaa catggtgaaa 8100
cctcgtctct attaaaatta caaaaattag ccgggcgtgg tggcacatgc ctgtaatccc 8160
agctactcgg gaggctgagg caggagaatc acttgaacct gggaggcaga ggttgcagtg 8220
agttgagatc acgccattgc actccagcct tggtgacaag agtgaaactc cgtctcaaaa 8280
aaaaaaaaaa tatgtatata tatatatata tatatatata tatatatata tatatatata 8340
taaaatccca ccaaaagtct gcagagtgac caaattagac ggctctggtt tcagattaaa 8400
ttctaaatgt gagaaaccac atagctccca tgatccatcc aataattcct cacgtcctct 8460
tcactcttta ctccatgcat aaaacagaat ttttttttct catcctgggt atgaagcaat 8520
taataattta cggatttagc ctatttggat tcaatccctt caaactccat actacatcca 8580
aggtggaagt gacttaaact ctgatatcaa tcatcaggct tgtaatatag gctttgttaa 8640
tggcaggaga gtctaataaa actttctgtt ccttatcctt catttaaatg aaaaactttt 8700
tattgaaaac aatcataact ctagctcatc ataaatataa ttcatgagga cattttatta 8760
tttttatatt aaagaaataa tattatagat gtaaactttg cacctttcta attattatca 8820
tgagttaagc taatacttgt cttctggtcc ctagatgatg attctttttt gccttactgg 8880
aggagccctt gtcttgaagt gagttgcttc aacagcagag gacttctagt ttctcccagt 8940
tgagcctaaa gtgaactttt catcttcttc agaggaaggg gcttccttga tttgtacttt 9000
tgtggctcct cagataacac agacaatttt atcttggatc ccaggttctc ttcaccatta 9060
agaataagaa agagagaaaa tgctgtgcat gacagccacc tactccaaac tacccaaccc 9120
cctgtaacca ggtaccttcc aacaacgaga tgattctgcc ctcactcaag agtctccccc 9180
acaaagattc cattctccct ttacttttta ttttttatttttttgcaaaa caaaggcctc 9240
ctttagtacc tccttagttt ataaccctta tcttcccagc tcttcccttc actgatacct 9300
ctgatttcaa aagttctgaa gtcggaagac cacacaattt cagactgtga acagaaattc 9360
agtcagaaat tattggagtt agaaagaatt tagagaaatt gtattgaact agaaagtcct 9420
ctttgattag tggtggccct ttagaaagtt ctagggcaga gtcccatggt gtttcatctt 9480
ttccatgatt tgcttgacca aaaacttttc tttcacaaca tgaaaatatg ctaatccacc 9540
aacacattttg gattctgctc tgtttgcctg aggtgttaga tctctagcca ggactgtgaa 9600
gggaaggaac ttgaatcctt cctattgagc tattaatgca gagtcagtga gatgaagggt 9660
tccactcggg gtcaaaatca tgtcagttac caagcaaagg agcaagtaag gggaaacatc 9720
tcctcatctg gttagtggag ccacatttca cccactgatc aagccagacc tgagcaatag 9780
tctagatttc tccctccaca tctaattggt gacaatggtg attgtaccac tgacaggtca 9840
cacaagtcca caccctcctt cacagcctca ctgcttcggc tcttgtttat gctcttatca 9900
gcatctgtca ctaggatcca tttgtctcct gactcatcta cttcctttca ctcccctggg 9960
ccatccttca catcatgcta gaagactgtg tctaacttgc agaacttatt gtgtcaatct 10020
cctggttacg gcccctccat ggctccccac ctgacatagg aggcccttca caatctggct 10080
tctgtcactc ataacttgtc tccagccttc attcttcagt ctgattttat ggttttctag 10140
ttccccaata caccacacca gtgtttatga aaccctagtc attggcctac gaactttatg 10200
attattgaca tattcatgta ccacctgtat tattttttgc atagtgttta tttttaattg 10260
actacatttt taactacaat aaagtaattt caactaaaa 10299
<![CDATA[ <210> 138]]>
<![CDATA[ <211> 11669]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 138]]> agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60 cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120 tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180 accacagcct tgcctccccg attgaaagag atgaaaagcc aggaatcggc tgcaggttcc 240 aaactagtcc ttcggtgtga aaccagttct gaatactcct ctctcagatt caagtggttc 300 aagaatggga atgaattgaa tcgaaaaaac aaaccaaa atatcaagat acaaaaaaag 360 ccagggaagt cagaacttcg cattaacaaa gcatcactgg ctgattctgg agagtatatg 420 tgcaaagtga tcagcaaatt aggaaatgac agtgcctctg ccaatatcac catcgtggaa 480 tcaaacgaga tcatcactgg tatgccagcc tcaactgaag gagcatatgt gtcttcagag 540 t ctcccatta gaatatcagt atccacagaa ggagcaaata cttcttcatc tacatctaca 600 tccaccactg ggacaagcca tcttgtaaaa tgtgcggaga aggagaaaac tttctgtgtg 660 aatggagggg agtgcttcat ggtgaaagac ctttcaaacc cctcgagata cttgtgcaag 720 tgcccaaatg agtttactgg tgatcgctgc caaaactacg taatggccag cttctacaag 780 catcttggga ttgaatttat ggaggcggag gagctgtacc agaagagagt gctgaccata 840 accggcatct gcatcgccct ccttgtggtc ggcatcatgt gtgtggtggc ctactgcaaa 900 accaagaaac agcggaaaaa gctgcatgac cgtcttcggc agagccttcg gtctgaacga 960 aaca atatga tgaacattgc caatgggcct caccatccta accaccccc cgagaatgtc 1020 cagctggtga atcaatacgt atctaaaaac gtcatctcca gtgagcatat tgttgagaga 1080 gaagcagaga catccttttc caccagtcac tatacttcca cagcccatca ctccactact 1 140 gtcacccaga ctcctagcca cagctggagc aacggacaca ctgaaagcat cctttccgaa 1200 agccactctg taatcgtgat gtcatccgta gaaaacagta ggcacagcag cccaactggg 1260 ggcccaagag gacgtcttaa tggcacagga ggccctcgtg aatgtaacag cttcctcagg 1320 catgccagag aaacccctga ttcctaccga gactctcctc atagtgaaag gtatgtgtca 1380 gccatgacca cc ccggctcg tatgtcacct gtagatttcc acacgccaag ctcccccaaa 1440 tcgccccctt cggaaatgtc tccaccgtg tccagcatga cggtgtccat gccttccatg 1500 gcggtcagcc ccttcatgga agaagagaga cctctacttc tcgtgacacc accaaggctg 1560 cgggagaaga agtttgacca tcaccctcag cagttcagct ccttccacca caacccgcg 1620 catgacagta acagcctccc tgctagcccc ttgaggatag tggaggatga ggagtatgaa 1680 acgacccaag agtacgagcc agcccaagag cctgttaaga aactcgccaa tagccggcgg 1740 gccaaaagaa ccaagcccaa tggccacatt gctaacagat tggaagtgga cagcaacaca 1800 agctcccaga gcagtaactc agagagt gaa acagaagatg aaagagtagg tgaagatacg 1860 cctttcctgg gcatacagaa ccccctggca gccagtcttg aggcaacacc tgccttccgc 1920 ctggctgaca gcaggactaa cccagcaggc cgcttctcga cacaggaaga aatccaggcc 1980 aggctgtcta gtgta attgc taaccaagac cctattgctg tataaaacct aaataaacac 2040 atagattcac ctgtaaaact ttaattttata taataaagta ttccacctta aattaaacaa 2100 tttattttat tttagcagtt ctgcaaatag aaaacaggaa aaaaactttt ataaattaaa 2160 tatatgtatg taaaaatgtg ttatgtgcca tatgtagcaa ttttttacag tatttcaaaa 2220 cgagaaagat atcaat ggtg cctttatgtt atgttatgtc gagagcaagt tttgtacagt 2280 tacagtgatt gcttttccac agtatttctg caaaacctct catagattca gtttttgctg 2340 gcttcttgtg cattgcatta tgatgttgac tggatgtatg atttgcaaga ctt gcaactg 2400 tccctctgtt tgcttgtagt agcacccgat cagtatgtct tgtaatggca catccatcca 2460 gatatgcctc tcttgtgtat gaagttttct ttgctttcag aatatgaaat gagttgtgtc 2520 tactctgcca gccaaaaggtt tgcctcattg ggctctgaga taatagtaga tccaacagca 2580 tgctactatt aaatacagca agaaactgca ttaagtaatg ttaaatatta ggaagaaagt 2640 aatactgtga tttaaaaaa actatattat taatcagaag acagcttg ct cttactaaaa 2700 ggagctctca tttactttat ttgattttat ttttcttgac aaaaagcaac agttttaggg 2760 atagcttaga aaatgggttc tggcttgcta tcagggtaaa tctaacacct tacaagagga 2820 ctgagtgtca ctttctctct gggggaatga tccagcagct tatctagttg acaatcaaaa 2880 cacggctgat aaaggtgcaa tcatttctga catgtatttt tcactgattt tgaagctagt 2940 gattggttgt gtcttcttgg ctcaaaaaga agcatattac ggcacaaaaa gcccagccca 3000 gacagcacat gcagcatttt gtctgaaata cttctagagt caaacgtgcc tgctgtacat 3060 agcgatgact tgtcatcata gggaagtatt tccatcgtag agtgtt caga aggagtgact 3120 gtataggtgg agagaagctt agtgactccg ttgaaatttt aaaatgtgga tgaccacccc 3180 tttctccccc ttatttttct tttatctttc catgttgcct tgatcaggtc ataactatgc 3240 atgaacattt tttatcagga atggcc gatg tgtatgtgat ttgtaatcac aagtaatgat 3300 tcatcaggaa atgtcaatcc tgttggaaag attgcacctt tacttgcaga agtgaccccc 3360 acctgtgtcc tgacctctcc atttacaggc tctctcaccc atttccccca cctcctttaa 3420 tttttgcttt actgtcataa agtaggacta agattggtct aagcattgca tgttcttttg 3480 tgatggtaaa tccaaaggaa ggcctataag tattaacatt tgaaataact gctaattcag 3 540 gaaaatggaa gaaaaaaaat tatttgaaac acagaaccca tttcatggcc tgcctgatat 3600 ctgtgaaatc agggctggag ctttacttag gattcacatg gcctcctagg aaccatggga 3660 caaatgggaa acaggttatc gggggattca tgaagtcagt gagagta att gcttcttttt 3720 tgcgggtgaa ctgaatgtat ttcttcacca aatcttgatg ttaacaatta aaaagaagaa 3780 atgacatgca agtaggtctt agcagaaaaa tgcaggctgg gcatgagtca tgttgttacc 3840 ctcccacatg ctcctacaat ccacagagat gcctgtctgc aggttcttga agttattgtt 3900 agtatttggt atctcaaatt tttcgtcact gttcacatgc cactttctct gtgcacagtg 3960 gtatcctcat ttgcttttta acctacactg aggagtcttt gtcaggttgc actgattttc 4020 caattctgca gtaatgagta agctcacggc atggggaaga agacagtcag tccaatgaag 4080 ttctctaaat tattttaaca ttgcctttga aggccttgac tcatccttag ctatttcaat 4140 gaagaaattc ctaccatgaa tttaaaaccc taaaaattct gtttcaaatt ctttgggcat 4200 tggggtactc agatatccca ttgtggaaga attttaagaa taaatagaag tttctgttga 4260 gaaccatgag caacatgttt cttacaatga gaattgctat gcattttaaa attgcaaata 4320 tatatgaaaa ttgaagacaa gaggaaattg tatttctaac ttgattctga tcactcacag 4380 aggtggcata ttat atagt tgggacatcc tttgcaccct tcataaaaaa ggccagctga 4440 ctgctcagca tcacctgcca aggccactag atttgtgttt acagggggtat ctctgtgatg 4500 cttgtcacat cactcttgac cacctctgtt aataaattcc gacagtgcag tggcgatcgg 4 560 agtgtgaact tatgttccca gcatatggaa agctatctta ggttttaagg tagtagaaat 4620 tgcccaggag tttgacagca actttgtttc ccgggtctaa aatcgtatcc cactgaggtg 4680 tatgcagtgg agcataatac atgcaaatac atgcaaaact ccttttgttt cacctaagat 4740 tcactttcta tcttactttc ccttcctgcc tagtgtgact tttgccccca agagtgcctg 4800 gacagcattc tagt ttctac aaaatggtcc tctgtgtagg tgaatgtgtc ccaaacctgc 4860 tatcactttc ttgtttcagt gtgactgtct tgttagaggt gaagtttatc cagggtaact 4920 tgctcactaa ctattccttt ttatggcctg gggttaaagg gcgcatggct cacact ggtg 4980 aaaataagga aggcctggtc ttatcttgta ttaataatac tggctgcatt ccaccagcca 5040 gagatttcta tctgcgaaga cctatgaaac actgaagaga aatgtaggca gaaggaaatg 5100 gccacatatc acaagttcta ttatatattc ttttgtaaat acatattgta tattacttgg 5160 atgttttctt atatcattta ctgtcttttt gagttaatgt cagtttttac tctctcaact 5220 tactatgtaa cattgtaaat aacataatgt c ctttattat ttatatttaa gcatctaaca 5280 tatagagttg ttttcataagat aaatgtcaaa aatatatgtt cttttgtttt 5340 tctttgcttt aaaattatgt atcttttcct tttctttttt ttaagaataa tttatgttc 5400 aggagaaaga atgtatatgt aactgaaact atctgaagaa tgcacattga aggccgtgag 5460 gtactgataa actaaagaat ttattatca aaatactaag caataagtaa ttgtgattaa 5520 tttaaagttt tgtccatttt ccatgaaaga catactgcaa taaaaatgct actctgtgga 5580 gacctgggag tgttgctcag cagactacag cttcagtctg ttagaccagc acctttcatc 5640 tcattcccat agttatgcta atttaggatt gtgttccat g gaccccatga tcaccttgtc 5700 tatacgttgc ttcttgtctg tccattgctt ttgccacacc acctgttctc aaatcatctc 5760 ctccctacca atgctgttta tcactttctt ccttgttgaa gaggccaacac aaccagacag 5820 tactatgctt ccttt ttcct ccatacacaa taacagagag agaatattct agggcatgac 5880 tgcctggatc ctggctgttg ctatcttttg tagtggcagt aagaaactcc ttcagactaa 5940 tgaaaatgtc aacgtgccat tcaatcacga aaggtaacga aaaatgctct catggttcaa 6000 atagtccaat ggcccatagt ggcctaaaag gcagccagtt gacacctggc catgctaagc 6060 ttccttatac catccgctaa tgactttcca ttgggcccac aattta cgga ttcataattt 6120 taaaagagga gaaggccaag ttaggttcat tccccttat ctgtcaataa aacaaatcaa 6180 actcatgtct atctaactgc tcagggagga gcctttgcat gagaaaattc tcatattcta 6240 agactgagtc atagaaatga gggtattact ttt cttactg caattaacct aaacaaaagc 6300 catattttaa caaaagata tttgcatggt acccttcata tattccaagc atttcactca 6360 ttatccagg taggttaaga gcttctgaag tgtatgaagt aaaggtcagc aatcctttgg 6420 ggtgaacagt ggcctccttt gagtttggg ggtaacctga gacttcccac caatgtccac 6480 ctccatctgt gtacctaatt cctattacct agttatggct cctctaggat catttcca aa 6540 cactctggat gtccaggaaa tttaaattgt agcttttgac tgagctagtt tttcctatttt 6600 atattaataa attttcaaaa atgcttgaaa tcttcacatt tgcaacaact ttagttttca 6660 tgcacataca aacacagaga gacaaaaatt ccaaac agac actctccaaa agccaccaca 6720 ctctttcact tgctctatag tcatttagcc aaccagccat gcagagaata tttaaaactt 6780 aaagattgag aattattct cagtttttgc tgaggctttg taacgaaatt gaacactata 6840 agcagctatt gtagtaattt tggttaaaat tgtttgcctg ggatatagta tttgaggcag 6900 aagcacgtgt gtgaaggagg tgaggtggtt tggaaagagt gaagactcgc agccagattg 69 60 aatgtctgga taattactat aattctccct tcttggttga aaccatgttc tctcttgatt 7020 tttaaaccca ggctgcctct ggaaacaagc aaacctgagt ctttctaacc tgagtcttcc 7080 caatcattag atttcttttc tgtcctaacg atgaatgata aaagg acttg atgttcacaa 7140 tttggggtta taaggcaggt ctgaaatctg gagactcaag atgctggaag gagtggaaag 7200 tttcgatgac tttatatgaa tcactttgca ctctatgttt ggcttgtcct ctttgaaact 7260 gatttactaa aataaatgta agggaactat tactccaaaa gattaacttg gcaggaaata 7320 ccaatacttt cagtttatga aagacaaaac tgtcttgttg ctacaggaag ctgcaatgtt 7380 cctaacct tt aaggttggtg ttgaataggg tggtcatgcc ctcccctgca ggtatcttta 7440 ggctcctgtt gacctcctgg tactataact gttcgtcttc tctgggtagc tattgatttt 7500 gaactttaac atgcttcaaa actttattca tcagggaaat aggaaaagag tttt gttacc 7560 tggaggaaat ctattgtgat ctacctgagc tttttaaaaa cagaccagga gaaggaaacc 7620 agtaattttt aaagaagaga cagagaatgg gataatagtt tcacccagga tctctttcta 7680 accctttccc ttcaaatgaa cttattggaa cagaattgga aagaagaaag gacatctctg 7740 cccaccccac aggatgccaa aaaggctaaa gaattacctc tgtagattta aacatcttta 7800 atggcttatg tatagatttg cta atacaga gagaaatgaa cttattaaata aaaatcacat 7860 tttataatat ttttatggct taaaacatcc tttatctcct ttttgttctc tctacatgat 7920 atggtaagtg atgaggaaaa tttaggctca ggaaggttaa aatctttctt ggagttacac 7980 at ctaagaga gctgcagagc tgacacttgt accccaggttt tctgactgca aatccagttt 8040 ctttctattg cgttcttccc ctttccctgc ctcaagcaga aacaggtttt ttattttcaa 8100 cctttatgta tacagtatgt tatgttacat ctacagctaa gtttcttttt agaagaatgt 8160 gagcccttct agctttggtt tagagtgatt ctagaagcca atttccttgg cttagtgatt 8220 ctatgcacct ttcctaaact tagctt tcta aggaaatgaa gtgtacgagt gagaatgaat 8280 tcacaatttc gacatgtagg tagcatccta aagtgaaaag aggaggaaat ttgtggtcaa 8340 agcactctcc ccaccactta gaaacttact gactgtgggc agcttcctcc tccaagtttc 8400 cttcctgatt tacaagaccg tggtgtggtc aggattaaac ttgaatacat gtaaggaagc 8460 ctgaaagtgt ctaacacata gcgagtattc aaatgccacc ttctatttga tccttcccct 8520 ccagttcctt aagttttgga atctaggttt ctcagttcca aatggattga catttgcata 8580 tccccattgc acaatggatc aaataaactt tatgttatca tttctccaac atagtgccag 8640 taagcaaatc ctttttaata acaacagtat gttgaga aac atatcaccaa ataatatta 8700 actttgtagc tttgataagt tctttaggtt ttggttttgg ttttgttttc tgagacaggg 8760 tcttgatctg tcacccagac tggagtgcaa tggttcaatt ttagctcact gcaacctgta 8820 actcctgagc tcaag tgatc ctccccacctc agcctctcaa gtagctggga ctacaggtgt 8880 gctccaccat gcccagcgat ttttttttttttttttttttttttttttgg tagggacaag 8940 gtctcgctat gttgcccagg ctggtcttca attcctggcc ttaagtgatc ctcctgcctc 9000 ggcctcccaa agtgctggaa ttacaggcat gagccaccac ccataacttt atgtttgttt 9060 ttttgatgca gtataagttc agcttgcttc ttatgc agcc ataccatttc atgttaactc 9120 tgatttttag cagcttatta cattagtgtt ttattattaa tataatttta cagaaattta 9180 ctaaaccatg actctgtaga gttttaataa tactacctcc aaacatcatt gcaaacatct 9240 agaagaatga acaaaaatga tcttagatc g acagtatatc tgtttgtctt agtttctaca 9300 caggatgttc agacatattc catttcttta aaaaaaaaat atatatatat atatatatat 9360 ataggcctgg cacggtggct catgactgta atcccagcac tttgggaggc ttaggcaggc 9420 aaatcacctg aggtcaggag tttgagactg gcctgaccaa catggtgaaa cctcgtctct 9480 attaaaatta caaaaattag ccgggcgtgg tggcacatgc ctgtaatccc agctact cgg 9540 gaggctgagg caggagaatc acttgaacct gggaggcaga ggttgcagtg agttgagatc 9600 acgccattgc actccagcct tggtgacaag agtgaaactc cgtctcaaaa aaaaaaaaaa 9660 tatgtatata tatatatata tatatatata tatatata tatatatata taaaatccca 9720 ccaaaagtct gcagagtgac caaattagac ggctctggtt tcagattaaa ttctaaatgt 9780 gagaaaccac atagctccca tgatccatcc aataattcct cacgtcctct tcactcttta 9840 ctccatgcat aaaacagaat ttttttttct catcctgggt atgaagcaat taataattta 9900 cggatttagc ctatttggat tcaatccctt caaactccat actacatcca aggtggaagt 9960 gacttaaact ctgatatcaa tcatcaggct tgtaatatag gctttgttaa tggcaggaga 10020 gtctaataaa actttctgtt ccttatcctt catttaaatg aaaaactttt tattgaaaac 10080 aatcataact ctagctcatc ataaatataa ttcatgagga cattttatta tttttatatt 1 0140 aaagaaataa tattatagat gtaaactttg cacctttcta attattatca tgagttaagc 10200 taatacttgt cttctggtcc ctagatgatg attctttttt gccttactgg aggagccctt 10260 gtcttgaagt gagttgcttc aacagcagag gacttctagt ttctcccagt tgagcctaaa 10320 gtgaactttt catcttcttc agaggaaggg gcttccttga tttgtacttt tgtggctcct 10380 c agataacac agacaatttt atcttggatc ccaggttctc ttcaccatta agaataagaa 10440 agagagaaaa tgctgtgcat gacagccacc tactccaaac tacccaaccc cctgtaacca 10500 ggtaccttcc aacaacgaga tgattctgcc ctcactcaag agtctccccc acaaagatt c 10560 cattctccct ttacttttta ttttttattt ttttgcaaaa caaaggcctc ctttagtacc 10620 tccttagttt ataaccctta tcttcccagc tcttcccttc actgatacct ctgatttcaa 10680 aagttctgaa gtcggaagac cacacaattt cagactgtga acagaaattc agtcagaaat 10740 tattggagtt agaaagaatt tagagaaatt gtattgaact agaaagtcct ctttgattag 10800 tggtggccct ttagaaagtt ct agggcaga gtcccatggt gtttcatctt ttccatgatt 10860 tgcttgacca aaaacttttc tttcacaaca tgaaaatatg ctaatccacc acaattttg 10920 gattctgctc tgtttgcctg aggtgttaga tctctagcca ggactgtgaa gggaaggaac 1098 0 ttgaatcctt cctattgagc tattaatgca gagtcagtga gatgaagggt tccactcggg 11040 gtcaaaatca tgtcagttac caagcaaagg agcaagtaag gggaaacatc tcctcatctg 11100 gttagtggag ccacatttca cccactgatc aagccagacc tgagcaatag tctagatttc 11160 tccctccaca tctaattggt gacaatggtg attgtaccac tgacaggtca cacaagtcca 11220 caccctcctt cacagcctca ctgcttcggc tct tgtttat gctcttatca gcatctgtca 11280 ctaggatcca tttgtctcct gactcatcta cttcctttca ctcccctggg ccatccttca 11340 catcatgcta gaagactgtg tctaacttgc agaacttatt gtgtcaatct cctggttacg 11400 gcccctccat gg ctccccac ctgacatagg aggcccttca caatctggct tctgtcactc 11460 ataacttgtc tccagccttc attcttcagt ctgattttat ggttttctag ttccccaata 11520 caccacacca gtgtttatga aaccctagtc attggcctac gaactttatg attattgaca 11580 tattcatgta ccacctgtat tattttttgc atagtgttta tttttaattg actacatttt 11640 taactacaat aaagtaattt caactaaaa 1166 9 <![CDATA[ <210> 139]]>
<![CDATA[ <211> 12]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 139]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr
1 5 10
<![CDATA[ <210> 140]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 140]]>
Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
1 5 10 15
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
20 25 30
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys
35 40 45
Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro
50 55
<![CDATA[ <210> 141]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 141]]>
Lys Ser Glu Leu Arg Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu
1 5 10 15
Tyr Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala
20 25 30
Asn Ile Thr Ile Val Glu Ser Asn
35 40
<![CDATA[ <210> 142]]>
<![CDATA[ <211> 16]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 142]]>
Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser
1 5 10 15
<![CDATA[ <210> 143]]>
<![CDATA[ <211> 16]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 143]]>
Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr Ser Ser
1 5 10 15
<![CDATA[ <210> 144]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 144]]>
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
1 5 10 15
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys
20 25 30
Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys
35 40
<![CDATA[ <210> 145]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 145]]>
Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala
1 5 10 15
Ser Phe Tyr
<![CDATA[ <210> 146]]>
<![CDATA[ <211> 7]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 146]]>
His Leu Gly Ile Glu Phe Met
1 5
<![CDATA[ <210> 147]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 147]]>
Ala Glu Glu Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys
1 5 10 15
Ile Ala Leu Leu Val Val Gly Ile Met Cys Val Val Ala Tyr Cys Lys
20 25 30
Thr
<![CDATA[ <210> 148]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 148]]>
Lys Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser
1 5 10 15
Glu Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn
20 25 30
Pro Pro Pro Glu Asn Val Gln Leu Val Asn
35 40
<![CDATA[ <210> 149]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 149]]>
Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Glu His Ile Val Glu Arg
1 5 10 15
Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His
20 25 30
His Ser Thr Thr Val Thr Gln Thr Pro Ser His
35 40
<![CDATA[ <210> 150]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 150]]>
Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Val
1 5 10 15
Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly
20 25 30
Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn
35 40 45
Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser
50 55 60
Pro His Ser Glu
65
<![CDATA[ <210> 151]]>
<![CDATA[ <211> 217]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 151]]>
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp Phe
1 5 10 15
His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro
20 25 30
Val Ser Ser Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe
35 40 45
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg
50 55 60
Glu Lys Lys Phe Asp His His Pro Gln Gln Phe Ser Ser Phe His His
65 70 75 80
Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile
85 90 95
Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln
100 105 110
Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys
115 120 125
Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser
130 135 140
Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly
145 150 155 160
Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu
165 170 175
Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala
180 185 190
Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala Arg Leu Ser Ser Ser Val
195 200 205
Ile Ala Asn Gln Asp Pro Ile Ala Val
210 215
<![CDATA[ <210> 152]]>
<![CDATA[ <211> 624]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 152]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr Ala Leu Pro Pro
1 5 10 15
Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu
20 25 30
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys
35 40 45
Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gln Asn
50 55 60
Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg Ile Asn Lys
65 70 75 80
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys
85 90 95
Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn
100 105 110
Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser
115 120 125
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr
130 135 140
Ser Ser Ser Thr Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys
145 150 155 160
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe
165 170 175
Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro
180 185 190
Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe
195 200 205
Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln
210 215 220
Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val
225 230 235 240
Gly Ile Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys
245 250 255
Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn
260 265 270
Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu
275 280 285
Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser
290 295 300
Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His
305 310 315 320
Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser
325 330 335
His Ser Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His
340 345 350
Ser Val Ile Val Met Ser Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro
355 360 365
Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu
370 375 380
Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg
385 390 395 400
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala
405 410 415
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro
420 425 430
Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro
435 440 445
Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu
450 455 460
Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln
465 470 475 480
Gln Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu
485 490 495
Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr
500 505 510
Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser
515 520 525
Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu
530 535 540
Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu
545 550 555 560
Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln
565 570 575
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala
580 585 590
Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile
595 600 605
Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
610 615 620
<![CDATA[ <210> 153]]>
<![CDATA[ <211> 11482]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 153]]> ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60 tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120 ggaatgaatt gaatcgaaaa aacaaaccac aaaat atcaa gatacaaaaa aagccaggga 180 agtcagaact tcgcattaac aaagcatcac tggctgattc tggagagtat atgtgcaaag 240 tgatcagcaa attaggaaat gacagtgcct ctgccaatat caccatcgtg gaatcaaacg 300 agatcatcac tggtatgcca gcct caactg aaggagcata tgtgtcttca gagtctccca 360 ttagaatatc agtatccaca gaaggagcaa atacttcttc atctacatct acatccacca 420 ctgggacaag ccatcttgta aaatgtgcgg agaaggagaa aactttctgt gtgaatggag 480 gggagtgctt catggtgaaa gacctttcaa accctcgag atacttgtgc aagtgcccaa 540 atgagtttac tggtgatcgc tgccaaaact acgtaatggc cagcttctac aagcatcttg 600 ggattgaatt tatggaggcg gaggagctgt accagaagag agtgctgacc ataaccggca 660 tctgcatcgc cctccttgtg gtcggcatca tgtgtgtggt ggcctactgc aaaaccaaga 720 aacagcggaa aaagctgcat gaccgtcttc ggcagagcct tcggtctgaa cgaaacaata 780 tgatgaacat tgccaatggg cctcaccatc ctaacccacc ccccgagaat gtccagctgg 840 tgaatcaata cgtatctaaa aacgtcatct ccagtgagca tattgttgag agagaagcag 900 agacatcctt ttccaccagt cactatactt ccacagccca tcactccact actgtcaccc 960 agactcctag ccacagct gg agcaacggac acactgaaag catcctttcc gaaagccact 1020 ctgtaatcgt gatgtcatcc gtagaaaaca gtaggcacag cagcccaact gggggcccaa 1080 gaggacgtct taatggcaca ggaggccctc gtgaatgtaa cagcttcctc aggcatgcca 1140 gagaaac ccc tgattcctac cgagactctc ctcatagtga aaggtatgtg tcagccatga 1200 ccaccccggc tcgtatgtca cctgtagatt tccaacacgcc aagctccccc aaatcgcccc 1260 cttcggaaat gtctccaccc gtgtccagca tgacggtgtc catgccttcc atggcggtca 1320 gccccttcat ggaagaagag agacctctac ttctcgtgac accaccaagg ctgcggggaga 1380 agaagttt ga ccatcaccct cagcagttca gctccttcca ccacaacccc gcgcatgaca 1440 gtaacagcct ccctgctagc cccttgagga tagtggagga tgaggagtat gaaacgaccc 1500 aagagtacga gccagcccaa gagcctgtta agaaactcgc caatagccgg cgggccaaaa 1 560 gaaccaagcc caatggccac attgctaaca gattggaagt ggacagcaac acaagctccc 1620 agagcagtaa ctcagagagt gaaacagaag atgaaagagt aggtgaagat acgcctttcc 1680 tgggcataca gaaccccctg gcagccagtc ttgaggcaac acctgccttc cgcctggctg 1740 acagcaggac taacccagca ggccgcttct cgacacagga agaaatccag gccaggctgt 1800 ctagtgtaat tgctaaccaa gaccctattg ctgtataaaa cctaaataaa cacatagatt 1860 cacctgtaaa actttatttt atataataaa gtattccacc ttaaattaaa caatttattt 1920 tattttagca gttctgcaaa tagaaaacag gaaaaaaact tttataaatt aaatatatgt 1980 atgtaaaaat gtgttatgtg ccatatgtag caatttttta cagtatttca aaacgagaaa 2040 gatatcaatg gtgcctttat gttatgttat gtcgagagca agttttgtac agttacagtg 2100 attgcttttc cacagtattt ctgcaaaacc tctcatagat tcagtttttg ctggcttctt 2160 gtgcattgca ttatgatgtt gactggatgt atgatttgca agacttgcaa ctgtccctct 2220 gtttgcttgt agtagc accc gatcagtatg tcttgtaatg gcacatccat ccagatatgc 2280 ctctcttgtg tatgaagttt tctttgcttt cagaatatga aatgagttgt gtctactctg 2340 ccagccaaag gtttgcctca ttgggctctg agataatagt agatccaaca gcatgctact 2 400 attaaataca gcaagaaact gcattaagta atgttaaata ttaggaagaa agtaatactg 2460 tgatttaaaa aaaactatat tattaatcag aagacagctt gctcttacta aaaggagctc 2520 tcatttactt tatttgattt tatttttctt gacaaaaagc aacagtttta gggatagctt 2580 agaaaatggg ttctggcttg ctatcagggt aaatctaaca ccttacaaga ggactgagtg 2640 tcactttctc tctgggggaa tgatccagca gcttatctag t tgacaatca aaacacggct 2700 gataaaggtg caatcatttc tgacatgtat ttttcactga ttttgaagct agtgattggt 2760 tgtgtcttct tggctcaaaa agaagcatat tacggcacaa aaagcccagc ccagacagca 2820 catgcagcat tttgtctgaa atact tctag agtcaaacgt gcctgctgta catagcgatg 2880 acttgtcatc atagggaagt atttccatcg tagagtgttc agaaggagtg actgtatagg 2940 tggagagaag cttagtgact ccgttgaaat tttaaaatgt ggatgaccac ccctttctcc 3000 cccttatttt tcttttatct ttccatgttg ccttgatcag gtcataacta tgcatgaaca 3060 ttttttca ggaatggccg atgtgtatgt gatttgtaat cacaagtaat gattcatcag 3120 gaaatgtcaa tcctgttgga aagattgcac ctttacttgc agaagtgacc cccacctgtg 3180 tcctgacctc tccattaca ggctctctca cccatttccc ccacctcctt taatttttgc 3240 tttactgtca taaagtagga c taagattgg tctaagcatt gcatgttctt ttgtgatggt 3300 aaatccaaag gaaggcctat aagtattaac atttgaaata actgctaatt caggaaaatg 3360 gaagaaaaaa aattatttga aacacagaac ccatttcatg gcctgcctga tatctgtgaa 3420 atcagggctg gagctttact taggattcac atggcctcct aggaaccatg ggacaaatgg 3480 gaaacaggtt atcgggggat tcatgaagtc agtgagagta attgcttctt ttttgcggg t 3540 gaactgaatg tatttcttca ccaaatcttg atgttaacaa ttaaaaagaa gaaatgacat 3600 gcaagtaggt cttagcagaa aaatgcaggc tgggcatgag tcatgttgtt accctcccac 3660 atgctcctac aatccacaga gatgcctgtc tgcaggt tct tgaagttat gttagtattt 3720 ggtatctcaa atttttcgtc actgttcaca tgccactttc tctgtgcaca gtggtatcct 3780 catttgcttt ttaacctaca ctgaggagtc tttgtcaggt tgcactgatt ttccaattct 3840 gcagtaatga gtaagctcac ggcatgggga agaagacagt cagtccaatg aagttctcta 3900 aattatttta aattgcctt tgaaggcctt gactcatcct tagctatttc aatgaagaaa 3960 ttcctac cat gaatttaaaa ccctaaaaat tctgtttcaa attctttggg cattggggta 4020 ctcagattc ccattgtgga agaattttaa gaataaatag aagtttctgt tgagaaccat 4080 gagcaacatg tttcttacaa tgagaattgc tatgcatttt aaaattgcaa atatatatga 4140 aaattgaaga caagaggaaa ttgtatttct aacttgattc tgatcactca cagaggtggc 4200 atattattat agttgggaca tcctttgcac ccttcataaa aaaggccagc tgactgctca 4260 gcatcacctg ccaaggccac tagatttgtg tttacagggg tatctctgtg atgcttgtca 4320 catcactctt gaccacctct gttaataaat tccgacagtg cagtggcgat cggagtgtga 4380 acttatgttc ccagcatatg gaaagctatc ttaggtttta aggtagtaga aattgcccag 4440 gagtttgaca gcaactttgt ttcccgggtc taaaatcgta tcccactgag gtgtatgcag 4500 tggagcataa tacatgcaaa tacatgcaaa actccttttg tttcacctaa gatt cacttt 4560 ctatcttact ttcccttcct gcctagtgtg acttttgccc ccaagagtgc ctggacagca 4620 ttctagtttc tacaaaatgg tcctctgtgt aggtgaatgt gtcccaaacc tgctatcact 4680 ttcttgtttc agtgtgactg tcttgttaga ggtgaagttt atccagggta acttgctcac 4740 taactattcc tttttatggc ctggggttaa agggcgcatg gctcacactg gtgaaaataa 4800 ggaaggcctg gt cttatctt gtattaataa tactggctgc attccaccag ccagagattt 4860 ctatctgcga agacctatga aacactgaag agaaatgtag gcagaaggaa atggccacat 4920 atcacaagtt ctattatata ttcttttgta aatacatatt gtatattact tggatgtttt 4980 cttatat cat ttactgtctt tttgagttaa tgtcagtttt tactctctca acttactatg 5040 taacattgta aataacataa tgtcctttat tattatatt taagcatcta acatatagag 5100 ttgttttcat ataagtttaa gataaatgtc aaaaatatat gttcttttgt ttttctttgc 5160 tttaaaatta tgtatctttt ccttttcttt tttttaagaa taatttattg ttcaggagaa 5220 agaatgtata tgtaactga a actatctgaa gaatgcacat tgaaggccgt gaggtactga 5280 taaactaaag aatttattat tcaaaatact aagcaataag taattgtgat ttattaaag 5340 ttttgtccat tttccatgaa agacatactg caataaaaat gctactctgt ggagacctgg 5400 gagtgttg ct cagcagacta cagcttcagt ctgttagacc agcacctttc atctcattcc 5460 catagttatg ctaatttagg attgtgttcc atggaccccca tgatcacctt gtctatacgt 5520 tgcttcttgt ctgtccattg cttttgccac accacctgtt ctcaaatcat ctcctcccta 5580 ccaatgctgt ttatcacttt cttccttgtt gaagaggcca cacaaccaga cagtactatg 5640 cttccttttt cctccataca caataacaga ga gagaatat tctagggcat gactgcctgg 5700 atcctggctg ttgctatctt ttgtagtggc agtaagaaac tccttcagac taatgaaaat 5760 gtcaacgtgc cattcaatca cgaaaggtaa cgaaaaatgc tctcatggtt caaatagtcc 5820 aatggcccat agtgg cctaa aaggcagcca gttgacacct ggccatgcta agcttcctta 5880 taccatccgc taatgacttt ccattgggcc cacaatttac ggattcataa ttttaaaaga 5940 ggagaaggcc aagttaggtt cattcccctt attctgtcaa taaaacaaat caaactcatg 6000 tctatctaac tgctcaggga ggagcctttg catgagaaaa ttctcatatt ctaagactga 6060 gtcatagaaa tgagggtatt acttttctta ctgcaattaa ctaaacaaa agccatattt 6120 taacaaatag atatttgcat ggtacccttc atatattcca agcatttcac tcattattcc 6180 aggtaggtta agagcttctg aagtgtatga agtaaaggtc agcaatcctt tggggtgaac 6240 agtggcctcc tttggagttt gggggtaacc tg agacttcc caccaatgtc cacctccatc 6300 tgtgtaccta attcctatta cctagttatg gctcctctag gatcattcc aaacactctg 6360 gatgtccagg aaatttaaat tgtagctttt gactgagcta gtttttccta tttatattaa 6420 taaattttca aaaatgcttg aaatcttcac atttgcaaca actttagttt tcatgcacat 6480 acaaacacag agagacaaaa attccaaaca gacactctcc aaaagccacc acact ctttc 6540 acttgctcta tagtcattta gccaaccagc catgcagaga atatttaaaa cttaaagatt 6600 gagacattat tctcagtttt tgctgaggct ttgtaacgaa attgaacact ataagcagct 6660 attgtagtaa ttttggttaa aattgtttgc ctgggat ata gtatttgagg cagaagcacg 6720 tgtgtgaagg aggtgaggtg gtttggaaag agtgaagact cgcagccaga ttgaatgtct 6780 ggataattac tataattctc ccttcttggt tgaaaccatg ttctctcttg atttttaaac 6840 ccaggctgcc tctggaaaca agcaaacctg agtctttcta acctgagtct tcccaatcat 6900 tagatttctt ttctgtccta acgatgaatg ataaaaggac ttgatgttca caatttgggg 696 0 ttataaggca ggtctgaaat ctggagactc aagatgctgg aaggagtgga aagtttcgat 7020 gactttatat gaatcacttt gcactctatg tttggcttgt cctctttgaa actgatttac 7080 taaaataaat gtaagggaac tattactcca aaagattaac ttggcaggaa ataccaatac 7140 tttcagttta tgaaagacaa aactgtcttg ttgctacagg aagctgcaat gttcctaacc 7200 tttaaggttg gtgttgaata gggtggtcat gccctcccct gcaggtatct ttaggctcct 7260 gttgacctcc tggtactata actgttcgtc ttctctgggt agctattgat tttgaacttt 7320 aacatgcttc aaaactttat tcatcaggga aataggaaaa gagttttgtt acctggagga 7380 aatctattgt g atctacctg agctttttaa aaacagacca ggagaaggaa accagtaatt 7440 tttaaagaag agacagagaa tgggataata gtttcaccca ggatctcttt ctaacccttt 7500 cccttcaaat gaacttattg gaacagaatt ggaaagaaga aaggacatct ctgcccaccc 7560 cacaggatgc caaaaaggct aaagaattac ctctgtagat ttaaacatct ttaatggctt 7620 atgtatagat ttgctaatac agagagaaat gaactattaa ataaaaatca cattttaggc tcaggaaggt taaaatcttt cttggagtta cacatctaag 7800 agagctgcag agctg acact tgtacccagg ttttctgact gcaaatccag tttctttcta 7860 ttgcgttctt cccctttccc tgcctcaagc agaaacaggt tttttttt caacctttat 7920 gtatacagta tgttatgtta catctacagc taagtttctt tttagaagaa tgtg agccct 7980 tctagctttg gtttagagtg attctagaag ccaatttcct tggcttagtg attctatgca 8040 cctttcctaa acttagcttt ctaaggaaat gaagtgtacg agtgagaatg aattcacaat 8100 ttcgacatgt aggtagcatc ctaaagtgaa aagaggagga aatttgtggt caaagcactc 8160 tccccaccac ttagaaactt actgactgtg ggcagcttcc tcctccaagt ttccttcctg 8220 atttacaaga ccgtggtgtg gtcaggatta aacttgaata catgtaagga agcctgaaag 8280 tgtctaacac atagcgagta ttcaaatgcc accttctatt tgatccttcc cctccagttc 8340 cttaagtttt ggaatctagg tttctcagtt ccaaatggat tgacatttgc atatccccat 8400 tgcacaatgg at caaataaa ctttatgtta tcatttctcc aacatagtgc cagtaagcaa 8460 atccttttta ataacaacag tatgttgaga aacatatcac caaataatat ttaactttgt 8520 agctttgata agttctttag gttttggttt tggttttgtt ttctgagaca gggtcttgat 8580 ctgtcaccca gactggagtg caatggttca attttagctc actgcaacct gtaactcctg 8640 agctcaagtg atcctcccac ctcagcctct caagtagctg ggactacagg tgtgctccac 8700 catgcccagc gatttttttttttttttttttttttttttttggtagggac aaggtctcgc 8760 tatgttgccc aggctggtct tcaattcctg gccttaagtg atcctcctgc ctcggcctcc 88 20 caaagtgctg gaattacagg catgagccac cacccataac tttatgtttg tttttttgat 8880 gcagtataag ttcagcttgc ttcttatgca gccataccat ttcatgttaa ctctgatttt 8940 tagcagctta ttacattagt gttttattat taatataatt ttacagaaat ttactaaacc 9000 atgactctgt agagttttaa taatactacc tccaaacatc attgcaaaca tctagaagaa 9060 tgaacaaaaa tgatcttaga tcgacagtat atctgtttgt ctta gtttct acacaggatg 9120 ttcagacata ttccatttct ttaaaaaaaa aatatatata tatatata tatataggcc 9180 tggcacggtg gctcatgact gtaatcccag cactttggga ggcttaggca ggcaaatcac 9240 ctgaggtcag gagtttgaga ctggcctgac ca acatggtg aaacctcgtc tctattaaaa 9300 ttacaaaaat tagccgggcg tggtggcaca tgcctgtaat cccagctact cgggaggctg 9360 aggcaggaga atcacttgaa cctgggaggc agaggttgca gtgagttgag atcacgccat 9420 tgcactccag ccttggtgac aagagtgaaa ctccgtctca aaaaaaaaaa aaatatgtat 9480 atatatatat atatatatat atatatatat atatatatat atataaaatc ccaccaaaag 9540 tctgcagagt gaccaaatta gacggctctg gtttcagatt aaattctaaa tgtgagaaac 9600 cacatagctc ccatgatcca tccaataatt cctcacgtcc tcttcactct ttactccatg 9660 cataaaacag aatttttttttctcatcctg ggtatgaag c aattaataat ttacggattt 9720 agcctatttg gattcaatcc cttcaaactc catactacat ccaaggtgga agtgacttaa 9780 actctgatat caatcatcag gcttgtaata taggctttgt taatggcagg agagtctaat 9840 aaaactttct gttccttatc cttcatttaa atgaaaaact ttttattgaa aacaatcata 9900 actctagctc atcataaata taattcatga ggacatttta ttatttttat attaaagaaa 9960 taatattata gatgtaaact ttgcaccttt ctaattatta tcatgagtta agctaatact 10020 tgtcttctgg tccctagatg atgattcttt tttgccttac tggaggagcc cttgtcttga 10080 agtgagttgc ttcaacagca gaggacttct agtttctccc agtt gagcct aaagtgaact 10140 tttcatcttc ttcagaggaa ggggcttcct tgatttgtac ttttgtggct cctcagataa 10200 cacagacaat tttatcttgg atcccaggtt ctcttcacca ttaagaataa gaaagagaga 10260 aaatgctgtg catgacagcc acctactcca aactacccaa ccccctgtaa ccaggtacct 10320 tccaacaacg agatgattct gccctcactc aagagtctcc cccacaaaga ttccattctc 10380 cctttacttt ttatttttta tttttttgca aaacaaaggc ctcctttagt acctccttag 10440 tttataaccc ttatcttccc agctcttccc ttcactgata cctctgattt caaaagttct 10500 gaagtcggaa gaccacacaa tttcagactg tgaacagaaa ttcag tcaga aattattgga 10560 gttagaaaga atttagagaa attgtattga actagaaagt cctctttgat tagtggtggc 10620 cctttagaaa gttctagggc agagtcccat ggtgtttcat cttttccatg atttgcttga 10680 ccaaaaactt ttctttcaca acatgaaaat atgctaatcc accacacatt ttggattctg 10740 ctctgtttgc ctgaggtgtt agatctctag ccaggactgt gaagggaagg aacttgaatc 10800 cttcctattg ag ctattaat gcagagtcag tgagatgaag ggttccactc ggggtcaaaa 10860 tcatgtcagt taccaagcaa aggagcaagt aaggggaaac atctcctcat ctggttagtg 10920 gagccacatt tcacccactg atcaagccag acctgagcaa tagtctagat ttctccctcc 109 80 acatctaatt ggtgacaatg gtgattgtac cactgacagg tcacacaagt ccacaccctc 11040 cttcacagcc tcactgcttc ggctcttgtt tatgctctta tcagcatctg tcactaggat 11100 ccatttgtct cctgactcat ctacttccctt tcactcccct gggccatcct tcacatcatg 11160 ctagaagact gtgtctaact tgcagaactt attgtgtcaa tctcctggtt acggcccctc 11220 catggctccc cacctgacat aggaggccct tcacaatct g gcttctgtca ctcataactt 11280 gtctccagcc ttcattcttc agtctgattt tatggttttc tagttcccca atacaccaca 11340 ccagtgttta tgaaacccta gtcattggcc tacgaacttt atgattattg acatattcat 11400 gtaccacctg tattattttt tg catagtgt ttatttttaa ttgactacat ttttaactac 11460 aataaagtaa tttcaactaa aa 11482 <![CDATA[ <210> 154]]>
<![CDATA[ <211> 611]]>
<![CDATA[ <212>]]> PRT
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 154]]>
Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
1 5 10 15
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
20 25 30
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys
35 40 45
Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg
50 55 60
Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val
65 70 75 80
Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val
85 90 95
Glu Ser Asn Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala
100 105 110
Tyr Val Ser Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly
115 120 125
Ala Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Thr Gly Thr Ser His
130 135 140
Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly
145 150 155 160
Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys
165 170 175
Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met
180 185 190
Ala Ser Phe Tyr Lys His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu
195 200 205
Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu
210 215 220
Leu Val Val Gly Ile Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys
225 230 235 240
Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu
245 250 255
Arg Asn Asn Met Met Asn Ile Ala Asn Gly Pro His His Pro Asn Pro
260 265 270
Pro Pro Glu Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val
275 280 285
Ile Ser Ser Glu His Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser
290 295 300
Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gln
305 310 315 320
Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser
325 330 335
Glu Ser His Ser Val Ile Val Met Ser Ser Val Glu Asn Ser Arg His
340 345 350
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly
355 360 365
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp
370 375 380
Ser Tyr Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr
385 390 395 400
Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Ser Pro
405 410 415
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val
420 425 430
Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro
435 440 445
Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His
450 455 460
His Pro Gln Gln Phe Ser Ser Ser Phe His His Asn Pro Ala His Asp Ser
465 470 475 480
Asn Ser Leu Pro Ala Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr
485 490 495
Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu
500 505 510
Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala
515 520 525
Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser
530 535 540
Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu
545 550 555 560
Gly Ile Gln Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe
565 570 575
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln
580 585 590
Glu Glu Ile Gln Ala Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro
595 600 605
Ile Ala Val
610
<![CDATA[ <210> 155]]>
<![CDATA[ <211> 11080]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 155]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttgtgcaa gtgcccaaat gagtttactg gtgatcgctg ccaaaactac gtaatggcca 180
gcttctacaa gcatcttggg attgaattta tggaggcgga ggagctgtac cagaagagag 240
tgctgaccat aaccggcatc tgcatcgccc tccttgtggt cggcatcatg tgtgtggtgg 300
cctactgcaa aaccaagaaa cagcggaaaa agctgcatga ccgtcttcgg cagagccttc 360
ggtctgaacg aaacaatatg atgaacattg ccaatgggcc tcaccatcct aacccacccc 420
ccgagaatgt ccagctggtg aatcaatacg tatctaaaaa cgtcatctcc agtgagcata 480
ttgttgagag agaagcagag acatcctttt ccaccagtca ctatacttcc acagcccatc 540
actccactac tgtcacccag actcctagcc acagctggag caacggacac actgaaagca 600
tcctttccga aagccactct gtaatcgtga tgtcatccgt agaaaacagt aggcacagca 660
gcccaactgg gggcccaaga ggacgtctta atggcacagg aggccctcgt gaatgtaaca 720
gcttcctcag gcatgccaga gaaacccctg attcctaccg agactctcct catagtgaaa 780
ggtatgtgtc agccatgacc accccggctc gtatgtcacc tgtagatttc cacacgccaa 840
gctcccccaa atcgccccct tcggaaatgt ctccacccgt gtccagcatg acggtgtcca 900
tgccttccat ggcggtcagc cccttcatgg aagaagagag acctctactt ctcgtgacac 960
caccaaggct gcgggagaag aagtttgacc atcaccctca gcagttcagc tccttccacc 1020
acaaccccgc gcatgacagt aacagcctcc ctgctagccc cttgaggata gtggaggatg 1080
aggagtatga aacgacccaa gagtacgagc cagcccaaga gcctgttaag aaactcgcca 1140
atagccggcg ggccaaaaga accaagccca atggccacat tgctaacaga ttggaagtgg 1200
acagcaacac aagctccccag agcagtaact cagagagtga aacagaagat gaaagagtag 1260
gtgaagatac gcctttcctg ggcatacaga accccctggc agccagtctt gaggcaacac 1320
ctgccttccg cctggctgac agcaggacta acccagcagg ccgcttctcg acacaggaag 1380
aaatccaggc caggctgtct agtgtaattg ctaaccaaga ccctattgct gtataaaacc 1440
taaataaaca catagattca cctgtaaaac tttattttat ataataaagt attccacctt 1500
aaattaaaca atttatttta ttttagcagt tctgcaaata gaaaacagga aaaaaacttt 1560
tataaattaa atatatgtat gtaaaaatgt gttatgtgcc atatgtagca attttttaca 1620
gtatttcaaa acgagaaaga tatcaatggt gcctttatgt tatgttatgt cgagagcaag 1680
ttttgtacag ttacagtgat tgcttttcca cagtatttct gcaaaacctc tcatagattc 1740
agtttttgct ggcttcttgt gcattgcatt atgatgttga ctggatgtat gatttgcaag 1800
acttgcaact gtccctctgt ttgcttgtag tagcacccga tcagtatgtc ttgtaatggc 1860
acatccatcc agatatgcct ctcttgtgta tgaagttttc tttgctttca gaatatgaaa 1920
tgagttgtgt ctactctgcc agccaaaggt ttgcctcatt gggctctgag ataatagtag 1980
atccaacagc atgctactat taaatacagc aagaaactgc attaagtaat gttaaatatt 2040
aggaagaaag taatactgtg atttaaaaaa aactatatta ttaatcagaa gacagcttgc 2100
tcttactaaa aggagctctc atttacttta tttgatttta tttttcttga caaaaagcaa 2160
cagttttagg gatagcttag aaaatgggtt ctggcttgct atcagggtaa atctaacacc 2220
ttacaagagg actgagtgtc actttctctc tgggggaatg atccagcagc ttatctagtt 2280
gacaatcaaa acacggctga taaaggtgca atcatttctg acatgtattt ttcactgatt 2340
ttgaagctag tgattggttg tgtcttcttg gctcaaaaag aagcatatta cggcacaaaa 2400
agcccagccc agacagcaca tgcagcattt tgtctgaaat acttctagag tcaaacgtgc 2460
ctgctgtaca tagcgatgac ttgtcatcat agggaagtat ttccatcgta gagtgttcag 2520
aaggagtgac tgtataggtg gagagaagct tagtgactcc gttgaaattt taaaatgtgg 2580
atgaccaccc ctttctcccc cttatttttc ttttatcttt ccatgttgcc ttgatcaggt 2640
cataactatg catgaacatt ttttatcagg aatggccgat gtgtatgtga tttgtaatca 2700
caagtaatga ttcatcagga aatgtcaatc ctgttggaaa gattgcacct ttacttgcag 2760
aagtgacccc cacctgtgtc ctgacctctc catttacagg ctctctcacc catttccccc 2820
acctccttta atttttgctt tactgtcata aagtaggact aagattggtc taagcattgc 2880
atgttctttt gtgatggtaa atccaaagga aggcctataa gtattaacat ttgaaataac 2940
tgctaattca ggaaaatgga agaaaaaaaa ttatttgaaa cacagaaccc atttcatggc 3000
ctgcctgata tctgtgaaat cagggctgga gctttactta ggattcacat ggcctcctag 3060
gaaccatggg acaaatggga aacaggttat cggggattc atgaagtcag tgagagtaat 3120
tgcttctttt ttgcgggtga actgaatgta tttcttcacc aaatcttgat gttaacaatt 3180
aaaaagaaga aatgacatgc aagtaggtct tagcagaaaa atgcaggctg ggcatgagtc 3240
atgttgttac cctcccacat gctcctacaa tccacagaga tgcctgtctg caggttcttg 3300
aagttatgt tagtatttgg tatctcaaat ttttcgtcac tgttcacatg ccactttctc 3360
tgtgcacagt ggtatcctca tttgcttttt aacctacact gaggagtctt tgtcaggttg 3420
cactgatttt ccaattctgc agtaatgagt aagctcacgg catggggaag aagacagtca 3480
gtccaatgaa gttctctaaa ttattttaac attgcctttg aaggccttga ctcatcctta 3540
gctatttcaa tgaagaaatt cctaccatga atttaaaacc ctaaaaattc tgtttcaaat 3600
tctttgggca ttggggtact cagatatccc attgtggaag aattttaaga ataaatagaa 3660
gtttctgttg agaaccatga gcaacatgtt tcttacaatg agaattgcta tgcattttaa 3720
aattgcaaat atatatgaaa attgaagaca agaggaaatt gtatttctaa cttgattctg 3780
atcactcaca gaggtggcat atttattatag ttgggacatc ctttgcaccc ttcataaaaa 3840
aggccagctg actgctcagc atcacctgcc aaggccacta gatttgtgtt tacaggggta 3900
tctctgtgat gcttgtcaca tcactcttga ccaccctctgt taataaattc cgacagtgca 3960
gtggcgatcg gagtgtgaac ttatgttccc agcatatgga aagctatctt aggttttaag 4020
gtagtagaaa ttgcccagga gtttgacagc aactttgttt cccgggtcta aaatcgtatc 4080
ccactgaggt gtatgcagtg gagcataata catgcaaata catgcaaaac tccttttgtt 4140
tcacctaaga ttcactttct atcttacttt cccttcctgc ctagtgtgac ttttgccccc 4200
aagagtgcct ggacagcatt ctagtttcta caaaatggtc ctctgtgtag gtgaatgtgt 4260
cccaaacctg ctatcacttt cttgtttcag tgtgactgtc ttgttagagg tgaagtttat 4320
ccagggtaac ttgctcacta actattcctt tttatggcct ggggttaaag ggcgcatggc 4380
tcacactggt gaaaataagg aaggcctggt cttatcttgt attaataata ctggctgcat 4440
tccaccagcc agagatttct atctgcgaag acctatgaaa cactgaagag aaatgtaggc 4500
agaaggaaat ggccacatat cacaagttct atttatatt cttttgtaaa tacatattgt 4560
atattacttg gatgttttct tatatcattt actgtctttt tgagttaatg tcagttttta 4620
ctctctcaac ttactatgta acattgtaaa taacataatg tcctttatta ttattattatta 4680
agcatctaac atatagagtt gttttcatat aagtttaaga taaatgtcaa aaatatatgt 4740
tcttttgtttttctttgctt taaaattatg tatcttttcc ttttcttttttttaagaata 4800
atttattgtt caggagaaag aatgtatatg taactgaaac tatctgaaga atgcacattg 4860
aaggccgtga ggtactgata aactaaagaa tttaattattc aaaatactaa gcaataagta 4920
attgtgattt atttaaagtt ttgtccattt tccatgaaag acatactgca ataaaaatgc 4980
tactctgtgg agacctggga gtgttgctca gcagactaca gcttcagtct gttagaccag 5040
cacctttcat ctcattccca tagttatgct aatttaggat tgtgttccat ggaccccatg 5100
atcaccttgt ctatacgttg cttcttgtct gtccattgct tttgccaacac cacctgttct 5160
caaatcatct cctccctacc aatgctgttt atcactttct tccttgttga agaggccaca 5220
caaccagaca gtactatgct tcctttttcc tccatacaca ataacagaga gagaatattc 5280
tagggcatga ctgcctggat cctggctgtt gctatctttt gtagtggcag taagaaactc 5340
cttcagacta atgaaaatgt caacgtgcca ttcaatcacg aaaggtaacg aaaaatgctc 5400
tcatggttca aatagtccaa tggcccatag tggcctaaaa ggcagccagt tgacacctgg 5460
ccatgctaag cttccttata ccatccgcta atgactttcc attgggccca caatttacgg 5520
attcataatt ttaaaagagg agaaggccaa gttaggttca ttccccttat tctgtcaata 5580
aaacaaatca aactcatgtc tatctaactg ctcagggagg agcctttgca tgagaaaatt 5640
ctcatattct aagactgagt catagaaatg agggtattac ttttcttact gcaattaacc 5700
taaacaaaag ccatatttta acaaatagat atttgcatgg tacccttcat atattccaag 5760
catttcactc attattccag gtaggttaag agcttctgaa gtgtatgaag taaaggtcag 5820
caatcctttg gggtgaacag tggcctcctt tggagtttgg gggtaacctg agacttccca 5880
ccaatgtcca cctccatctg tgtacctaat tccttattacc tagttatggc tcctctagga 5940
tcatttccaa acactctgga tgtccaggaa atttaaattg tagcttttga ctgagctagt 6000
ttttcctatt tatattaata aattttcaaa aatgcttgaa atcttcacat ttgcaacaac 6060
tttagttttc atgcacatac aaacacagag agacaaaaat tccaaacaga cactctccaa 6120
aagccaccac actctttcac ttgctctata gtcatttagc caaccagcca tgcagagaat 6180
atttaaaact taaagattga gacattattc tcagtttttg ctgaggcttt gtaacgaaat 6240
tgaacactat aagcagctat tgtagtaatt ttggttaaaa ttgtttgcct gggatatagt 6300
atttgaggca gaagcacgtg tgtgaaggag gtgaggtggt ttggaaagag tgaagactcg 6360
cagccagatt gaatgtctgg ataattacta taattctccc ttcttggttg aaaccatgtt 6420
ctctcttgat ttttaaaccc aggctgcctc tggaaacaag caaacctgag tctttctaac 6480
ctgagtcttc ccaatcatta gatttctttt ctgtcctaac gatgaatgat aaaaggactt 6540
gatgttcaca atttggggtt ataaggcagg tctgaaatct ggagactcaa gatgctggaa 6600
ggagtggaaa gtttcgatga ctttatatga atcactttgc actctatgtt tggcttgtcc 6660
tctttgaaac tgattacta aaataaatgt aagggaacta ttactccaaa agattaactt 6720
ggcaggaaat accaatactt tcagtttatg aaagacaaaa ctgtcttgtt gctacaggaa 6780
gctgcaatgt tcctaacctt taaggttggt gttgaatagg gtggtcatgc cctcccctgc 6840
aggtatcttt aggctcctgt tgacctcctg gtactataac tgttcgtctt ctctgggtag 6900
ctattgattt tgaactttaa catgcttcaa aactttattc atcagggaaa taggaaaaga 6960
gttttgttac ctggaggaaa tctattgtga tctacctgag ctttttaaaa acagaccagg 7020
agaaggaaac cagtaatttt taaagaagag acagagaatg ggataatagt ttcacccagg 7080
atctctttct aaccctttcc cttcaaatga acttattgga acagaattgg aaagaagaaa 7140
ggacatctct gcccacccca caggatgcca aaaaggctaa agaattacct ctgtagattt 7200
aaacatcttt aatggcttat gtatagattt gctaatacag agagaaatga actattaaat 7260
aaaaatcaca ttttataata tttttatggc ttaaaacatc ctttatctcc tttttgttct 7320
ctctacatga tatggtaagt gatgaggaaa atttaggctc aggaaggtta aaatctttct 7380
tggagttaca catctaagag agctgcagag ctgacacttg tacccaggtt ttctgactgc 7440
aaatccagtt tctttctatt gcgttcttcc cctttccctg cctcaagcag aaacaggttt 7500
tttattttca acctttatgt atacagtatg ttatgttaca tctacagcta agtttctttt 7560
tagaagaatg tgagcccttc tagctttggt ttagagtgat tctagaagcc aatttccttg 7620
gcttagtgat tctatgcacc tttcctaaac ttagctttct aaggaaatga agtgtacgag 7680
tgagaatgaa ttcacaattt cgacatgtag gtagcatcct aaagtgaaaa gaggaggaaa 7740
tttgtggtca aagcactctc cccaccactt agaaacttac tgactgtggtcagcttcctc 7800
ctccaagttt ccttcctgat ttacaagacc gtggtgtggt caggattaaa cttgaataca 7860
tgtaaggaag cctgaaagtg tctaacacat agcgagtatt caaatgccac cttctatttg 7920
atccttcccc tccagttcct taagttttgg aatctaggtt tctcagttcc aaatggattg 7980
acatttgcat atccccattg cacaatggat caaataaact ttatgttatc atttctccaa 8040
catagtgcca gtaagcaaat cctttttaat aacaacagta tgttgagaaa catatcacca 8100
aataatattt aactttgtag ctttgataag ttctttaggt tttggttttg gttttgtttt 8160
ctgagacagg gtcttgatct gtcacccaga ctggagtgca atggttcaat tttagctcac 8220
tgcaacctgt aactcctgag ctcaagtgat cctcccacct cagcctctca agtagctggg 8280
actacaggtg tgctccacca tgcccagcga tttttttttttttttttttttttttttttg 8340
gtagggacaa ggtctcgcta tgttgcccag gctggtcttc aattcctggc cttaagtgat 8400
cctcctgcct cggcctccca aagtgctgga attacaggca tgagccacca cccataactt 8460
tatgtttgtt tttttgatgc agtataagtt cagcttgctt cttatgcagc cataccattt 8520
catgttaact ctgattttta gcagcttatt aacttagtgt tttattatta atataatttt 8580
acagaaattt actaaaccat gactctgtag agttttaata atactacctc caaacatcat 8640
tgcaaacatc tagaagaatg aacaaaaatg atcttagatc gacagtatat ctgtttgtct 8700
tagtttctac acaggatgtt cagacatatt ccatttcttt aaaaaaaaaa tatatatata 8760
tatatatata tataggcctg gcacggtggc tcatgactgt aatcccagca ctttgggagg 8820
cttaggcagg caaatcacct gaggtcagga gtttgagact ggcctgacca acatggtgaa 8880
acctcgtctc tattaaaatt acaaaaatta gccgggcgtg gtggcacatg cctgtaatcc 8940
cagctactcg ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt 9000
gagttgagat cacgccattg cactccagcc ttggtgacaa gagtgaaact ccgtctcaaa 9060
aaaaaaaaaa atatgtatat atatatatat atatatatat atatatatat atatatatat 9120
ataaaatccc accaaaagtc tgcagagtga ccaaattaga cggctctggt ttcagattaa 9180
attctaaatg tgagaaacca catagctccc atgatccatc caataattcc tcacgtcctc 9240
ttcactcttt actccatgca taaaacagaa tttttttttc tcatcctggg tatgaagcaa 9300
ttaataattt acggatttag cctatttgga ttcaatccct tcaaactcca tactacatcc 9360
aaggtggaag tgacttaaac tctgatatca atcatcaggc ttgtaatata ggctttgtta 9420
atggcaggag agtctaataa aactttctgt tccttatcct tcatttaaat gaaaaacttt 9480
ttattgaaaa caatcataac tctagctcat cataaatata attcatgagg aattttatt 9540
atttttatat taaagaaata atattataga tgtaaacttt gcacctttct aattattatc 9600
atgagttaag ctaatacttg tcttctggtc cctagatgat gattcttttt tgccttactg 9660
gaggagccct tgtcttgaag tgagttgctt caacagcaga ggacttctag tttctcccag 9720
ttgagcctaa agtgaacttt tcatcttctt cagaggaagg ggcttccttg atttgtactt 9780
ttgtggctcc tcagataaca cagacaattt tatcttggat cccaggttct cttcaccatt 9840
aagaataaga aagagagaaa atgctgtgca tgacagccac ctactccaaa ctacccaacc 9900
ccctgtaacc aggtaccttc caacaacgag atgattctgc cctcactcaa gagtctcccc 9960
cacaaagatt ccattctccc tttacttttt attttttttttttgcaaa acaaaggcct 10020
cctttagtac ctccttagtt tataaccctt atcttcccag ctcttccctt cactgatacc 10080
tctgatttca aaagttctga agtcggaaga ccacacaatt tcagactgtg aacagaaatt 10140
cagtcagaaa ttattggagt tagaaagaat ttagagaaat tgtattgaac tagaaagtcc 10200
tctttgatta gtggtggccc tttagaaagt tctagggcag agtcccatgg tgtttcatct 10260
tttccatgat ttgcttgacc aaaaactttt ctttcacaac atgaaaatat gctaatccac 10320
caacacatttt ggattctgct ctgtttgcct gaggtgttag atctctagcc aggactgtga 10380
agggaaggaa cttgaatcct tcctattgag ctattaatgc agagtcagtg agatgaaggg 10440
ttccactcgg ggtcaaaatc atgtcagtta ccaagcaaag gagcaagtaa ggggaaacat 10500
ctcctcatct ggttagtgga gccacatttc accactgat caagccagac ctgagcaata 10560
gtctagattt ctccctccac atctaattgg tgacaatggt gattgtacca ctgacaggtc 10620
acacaagtcc acaccctcct tcacagcctc actgcttcgg ctcttgttta tgctcttatc 10680
agcatctgtc actaggatcc atttgtctcc tgactcatct acttccctttc actcccctgg 10740
gccatccttc acatcatgct agaagactgt gtctaacttg cagaacttat tgtgtcaatc 10800
tcctggttac ggcccctcca tggctcccca cctgacatag gaggcccttc acaatctggc 10860
ttctgtcact cataacttgt ctccagcctt cattcttcag tctgatttta tggttttcta 10920
gttccccaat acaccacacc agtgtttatg aaaccctagt cattggccta cgaactttat 10980
gattattgac atattcatgt accacctgta ttattttttg catagtgttt atttttaatt 11040
gactacattt ttaactacaa taaagtaatt tcaactaaaa 11080
<![CDATA[ <210> 156]]>
<![CDATA[ <211> 477]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> person]]> class
<![CDATA[ <400> 156]]>
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
1 5 10 15
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys
20 25 30
Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe
35 40 45
Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Lys His
50 55 60
Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln Lys Arg Val
65 70 75 80
Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val Gly Ile Met
85 90 95
Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys Lys Lys Leu His
100 105 110
Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn
115 120 125
Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val Gln
130 135 140
Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Ser Glu His Ile
145 150 155 160
Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser
165 170 175
Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser His Ser Trp
180 185 190
Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Ser Val Ile
195 200 205
Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly Gly
210 215 220
Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser
225 230 235 240
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro
245 250 255
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser
260 265 270
Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu
275 280 285
Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met Ala
290 295 300
Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro
305 310 315 320
Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln Gln Phe Ser
325 330 335
Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala Ser
340 345 350
Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr
355 360 365
Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala
370 375 380
Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val Asp
385 390 395 400
Ser Asn Thr Ser Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp
405 410 415
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro Leu
420 425 430
Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg
435 440 445
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala Arg
450 455 460
Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
465 470 475
<![CDATA[ <210> 157]]>
<![CDATA[ <211> 11083]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 157]]>
catctacatc tacatccacc actgggacaa gccatcttgt aaaatgtgcg gagaaggaga 60
aaactttctg tgtgaatgga ggggagtgct tcatggtgaa agacctttca aacccctcga 120
gatacttgtg caagtgccca aatgagttta ctggtgatcg ctgccaaaac tacgtaatgg 180
ccagcttcta caagcatctt gggattgaat ttatggaggc ggaggagctg taccagaaga 240
gagtgctgac cataaccggc atctgcatcg ccctccttgt ggtcggcatc atgtgtgtgg 300
tggcctactg caaaaccaag aaacagcgga aaaagctgca tgaccgtctt cggcagagcc 360
ttcggtctga acgaaacaat atgatgaaca ttgccaatgg gcctcaccat cctaacccac 420
cccccgagaa tgtccagctg gtgaatcaat acgtatctaa aaacgtcatc tccagtgagc 480
atattgttga gagagaagca gagacatcct tttccaccag tcactatact tccacagccc 540
atcactccac tactgtcacc cagactccta gccacagctg gagcaacgga cacactgaaa 600
gcatcctttc cgaaagccac tctgtaatcg tgatgtcatc cgtagaaaac agtaggcaca 660
gcagcccaac tgggggccca agaggacgtc ttaatggcac aggaggccct cgtgaatgta 720
acagcttcct caggcatgcc agagaaaccc ctgattccta ccgagactct cctcatagtg 780
aaaggtatgt gtcagccatg accacccccgg ctcgtatgtc acctgtagat ttccaacacgc 840
caagctcccc caaatcgccc ccttcggaaa tgtctccacc cgtgtccagc atgacggtgt 900
ccatgccttc catggcggtc agccccttca tggaagaaga gagacctcta cttctcgtga 960
caccaccaag gctgcggggag aagaagtttg accatcacccc tcagcagttc agctccttcc 1020
accacaaccc cgcgcatgac agtaacagcc tccctgctag ccccttgagg atagtggagg 1080
atgaggagta tgaaacgacc caagagtacg agccagccca agagcctgtt aagaaactcg 1140
ccaatagccg gcgggccaaa agaaccaagc ccaatggcca cattgctaac agatggaag 1200
tggacagcaa cacaagctcc cagagcagta actcagagag tgaaacagaa gatgaaagag 1260
taggtgaaga tacgcctttc ctgggcatac agaaccccct ggcagccagt cttgaggcaa 1320
cacctgcctt ccgcctggct gacagcagga ctaacccagc aggccgcttc tcgacacagg 1380
aagaaatcca ggccaggctg tctagtgtaa ttgctaacca agaccctatt gctgtataaa 1440
acctaaataa acacatagat tcacctgtaa aactttattt tatataataa agtattccac 1500
cttaaattaa acaatttatt ttattttagc agttctgcaa atagaaaaca ggaaaaaaac 1560
ttttataaat taaatatatg tatgtaaaaa tgtgttatgt gccatatgta gcaatttttt 1620
acagtatttc aaaacgagaa agatatcaat ggtgccttta tgttatgtta tgtcgagagc 1680
aagttttgta cagttacagt gattgctttt ccacagtatt tctgcaaaac ctctcataga 1740
ttcagttttt gctggcttct tgtgcattgc attatgatgt tgactggatg tatgatttgc 1800
aagacttgca actgtccctc tgtttgcttg tagtagcacc cgatcagtat gtcttgtaat 1860
ggcacatcca tccagatatg cctctcttgt gtatgaagtt ttctttgctt tcagaatatg 1920
aaatgagttg tgtctactct gccagccaaa ggtttgcctc attgggctct gagataatag 1980
tagatccaac agcatgctac tattaaatac agcaagaaac tgcattaagt aatgttaaat 2040
attaggaaga aagtaatact gtgattaaa aaaaactata ttattaatca gaagacagct 2100
tgctcttact aaaaggagct ctcatttact ttatttgatt ttatttttct tgacaaaaag 2160
caacagtttt agggatagct tagaaaatgg gttctggctt gctatcaggg taaatctaac 2220
accttacaag aggactgagt gtcactttct ctctggggga atgatccagc agcttatcta 2280
gttgacaatc aaaacacggc tgataaaggt gcaatcattt ctgacatgta tttttcactg 2340
attttgaagc tagtgattgg ttgtgtcttc ttggctcaaa aagaagcata ttacggcaca 2400
aaaagcccag cccagacagc acatgcagca ttttgtctga aatacttcta gagtcaaacg 2460
tgcctgctgt acatagcgat gacttgtcat catagggaag tatttccatc gtagaggtgtt 2520
cagaaggagt gactgtatag gtggagagaa gcttagtgac tccgttgaaa ttttaaaatg 2580
tggatgacca cccctttctc ccccttatttttcttttatc tttccatgtt gccttgatca 2640
ggtcataact atgcatgaac attttttatc aggaatggcc gatgtgtatg tgatttgtaa 2700
tcacaagtaa tgattcatca ggaaatgtca atcctgttgg aaagattgca cctttacttg 2760
cagaagtgac ccccacctgt gtcctgacct ctccattac aggctctctc accccatttcc 2820
cccacctcct ttaatttttg ctttactgtc ataaagtagg actaagattg gtctaagcat 2880
tgcatgttct tttgtgatgg taaatccaaa ggaaggccta taagtattaa catttgaaat 2940
aactgctaat tcaggaaaat ggaagaaaaa aaattatttg aaacacagaa cccatttcat 3000
ggcctgcctg atatctgtga aatcagggct ggagctttac ttaggattca catggcctcc 3060
taggaaccat gggacaaatg ggaaacaggt tatcggggga ttcatgaagt cagtgagagt 3120
aattgcttct tttttgcggg tgaactgaat gtatttcttc accaaatctt gatgttaaca 3180
attaaaaaga agaaatgaca tgcaagtagg tcttagcaga aaaatgcagg ctgggcatga 3240
gtcatgttgt taccctccca catgctccta caatccacag agatgcctgt ctgcaggttc 3300
ttgaagttat tgttagtatt tggtatctca aatttttcgt cactgttcac atgccacttt 3360
ctctgtgcac agtggtatcc tcatttgctt tttaacctac actgaggagt ctttgtcagg 3420
ttgcactgat tttccaattc tgcagtaatg agtaagctca cggcatgggg aagaagacag 3480
tcagtccaat gaagttctct aaattatttt aacattgcct ttgaaggcct tgactcatcc 3540
ttagctattt caatgaagaa attcctacca tgaatttaaa accctaaaaa ttctgtttca 3600
aattctttgg gcattggggt actcagatat cccattgtgg aagaatttta agaataaata 3660
gaagtttctg ttgagaacca tgagcaacat gtttcttaca atgagaattg ctatgcattt 3720
taaaattgca aatatatatg aaaattgaag acaagaggaa attgtatttc taacttgatt 3780
ctgatcactc acagaggtgg catattatta tagttgggac atcctttgca cccttcataa 3840
aaaaggccag ctgactgctc agcatcacct gccaaggcca ctagatttgt gtttacaggg 3900
gtatctctgt gatgcttgtc acatcactct tgaccacctc tgttaataaa ttccgacagt 3960
gcagtggcga tcggagtgtg aacttatgtt cccagcatat ggaaagctat cttaggtttt 4020
aaggtagtag aaattgccca ggagtttgac agcaactttg tttcccgggt ctaaaatcgt 4080
atcccactga ggtgtatgca gtggagcata atacatgcaa atacatgcaa aactcctttt 4140
gtttcaccta agattcactt tctatcttac tttcccttcc tgcctagtgt gacttttgcc 4200
cccaagagtg cctggacagc attctagttt ctacaaaatg gtcctctgtg taggtgaatg 4260
tgtcccaaac ctgctatcac tttcttgttt cagtgtgact gtcttgttag aggtgaagtt 4320
tatccagggt aacttgctca ctaactattc ctttttatgg cctggggtta aagggcgcat 4380
ggctcacact ggtgaaaata aggaaggcct ggtcttatct tgtattaata atactggctg 4440
cattccacca gccagagatt tctatctgcg aagacctatg aaacactgaa gagaaatgta 4500
ggcagaagga aatggccaca tatcacaagt tctattatat attcttttgt aaatacatat 4560
tgtatattac ttggatgttt tcttatatca tttactgtct ttttgagtta atgtcagttt 4620
ttactctctc aacttactat gtaacattgt aaataacata atgtccttta ttattatatat 4680
ttaagcatct aacatataga gttgttttca tataagttta agataaatgt caaaaatata 4740
tgttcttttg tttttctttg ctttaaaatt atgtatcttt tccttttctt ttttttaaga 4800
ataatttatt gttcaggaga aagaatgtat atgtaactga aactatctga agaatgcaca 4860
ttgaaggccg tgaggtactg ataaactaaa gaattttatta ttcaaaatac taagcaataa 4920
gtaattgtga tttattaaa gttttgtcca ttttccatga aagacatact gcaataaaaa 4980
tgctactctg tggagacctg ggagtgttgc tcagcagact acagcttcag tctgttagac 5040
cagcaccttt catctcattc ccatagttat gctaatttag gattgtgttc catggaccccc 5100
atgatcacct tgtctatacg ttgcttcttg tctgtccatt gcttttgcca caccacctgt 5160
tctcaaatca tctcctccct accaatgctg tttatcactt tcttccttgt tgaagaggcc 5220
acacaaccag acagtactat gcttcctttt tcctccatac acaataacag agagagaata 5280
ttctagggca tgactgcctg gatcctggct gttgctatct tttgtagtgg cagtaagaaa 5340
ctccttcaga ctaatgaaaa tgtcaacgtg ccattcaatc acgaaaggta acgaaaaatg 5400
ctctcatggt tcaaatagtc caatggccca tagtggccta aaaggcagcc agttgacacc 5460
tggccatgct aagcttcctt ataccatccg ctaatgactt tccatgggc ccacaattta 5520
cggattcata attttaaaag aggagaaggc caagttaggt tcattcccct tattctgtca 5580
ataaaacaaa tcaaactcat gtctatctaa ctgctcaggg aggagccttt gcatgagaaa 5640
attctcatat tctaagactg agtcatagaa atgagggtat tacttttctt actgcaatta 5700
acctaaacaa aagccatatt ttaacaaata gatatttgca tggtaccctt catatattcc 5760
aagcatttca ctcattattc caggtaggtt aagagcttct gaagtgtatg aagtaaaggt 5820
cagcaatcct ttggggtgaa cagtggcctc ctttggagtt tgggggtaac ctgagacttc 5880
ccaccaatgt ccacctccat ctgtgtacct aattcctatt acctagttat ggctcctcta 5940
ggatcatttc caaacactct ggatgtccag gaaatttaaa ttgtagcttt tgactgagct 6000
agtttttcct atttatatta ataaattttc aaaaatgctt gaaatcttca catttgcaac 6060
aactttagtt ttcatgcaca tacaaacaca gagagacaaa aattccaaac agacactctc 6120
caaaagccac cacactcttt cacttgctct atagtcattt agccaaccag ccatgcagag 6180
aatatttaaa acttaaagat tgagacatta ttctcagttt ttgctgaggc tttgtaacga 6240
aattgaacac tataagcagc tattgtagta attttggtta aaattgtttg cctgggatat 6300
agtatttgag gcagaagcac gtgtgtgaag gaggtgaggt ggtttggaaa gagtgaagac 6360
tcgcagccag attgaatgtc tggataatta ctataattct cccttcttgg ttgaaaccat 6420
gttctctctt gatttttaaa cccaggctgc ctctggaaac aagcaaacct gagtctttct 6480
aacctgagtc ttcccaatca ttagatttct tttctgtcct aacgatgaat gataaaagga 6540
cttgatgttc acaatttggg gttataaggc aggtctgaaa tctggagact caagatgctg 6600
gaaggagtgg aaagtttcga tgactttata tgaatcactt tgcactctat gtttggcttg 6660
tcctctttga aactgattaa ctaaaataaa tgtaagggaa ctattactcc aaaagattaa 6720
cttggcagga aataccaata ctttcagttt atgaaagaca aaactgtctt gttgctacag 6780
gaagctgcaa tgttcctaac ctttaaggtt ggtgttgaat agggtggtca tgccctcccc 6840
tgcaggtatc tttaggctcc tgttgacctc ctggtactat aactgttcgt cttctctggg 6900
tagctattga ttttgaactt taacatgctt caaaacttta ttcatcaggg aaataggaaa 6960
agagttttgt tacctggagg aaatctattg tgatctacct gagcttttta aaaacagacc 7020
aggagaagga aaccagtaat ttttaaagaa gagacagaga atgggataat agtttcaccc 7080
aggatctctt tctaaccctt tcccttcaaa tgaacttatt ggaacagaat tggaaagaag 7140
aaaggacatc tctgcccacc ccacaggatg ccaaaaaggc taaagaatta cctctgtaga 7200
tttaaacatc tttaatggct tatgtataga tttgctaata cagagagaaa tgaactatta 7260
aataaaaatc aattttata atatttttat ggcttaaaac atcctttatc tcctttttgt 7320
tctctctaca tgatatggta agtgatgagg aaaatttagg ctcaggaagg ttaaaatctt 7380
tcttggagtt acacatctaa gagagctgca gagctgacac ttgtacccag gttttctgac 7440
tgcaaatcca gtttctttct attgcgttct tcccctttcc ctgcctcaag cagaaacagg 7500
ttttttattt tcaaccttta tgtatacagt atgttatgtt acatctacag ctaagtttct 7560
ttttagaaga atgtgagccc ttctagcttt ggtttagagt gattctagaa gccaatttcc 7620
ttggcttagt gattctatgc acctttccta aacttagctt tctaaggaaa tgaagtgtac 7680
gagtgagaat gaattcacaa tttcgacatg taggtagcat cctaaagtga aaagaggagg 7740
aaatttgtgg tcaaagcact ctccccacca cttagaaact tactgactgt gggcagcttc 7800
ctcctccaag tttccttcct gatttacaag accgtggtgt ggtcaggatt aaacttgaat 7860
acatgtaagg aagcctgaaa gtgtctaaca catagcgagt attcaaatgc caccttctat 7920
ttgatccttc ccctccagtt ccttaagttt tggaatctag gtttctcagt tccaaatgga 7980
ttgacatttg catatcccca ttgcacaatg gatcaaataa actttatgtt atcatttctc 8040
caacatagtg ccagtaagca aatccttttt aataacaaca gtatgttgag aaacatatca 8100
ccaaataata tttaactttg tagctttgat aagttcttta ggttttggtt ttggttttgt 8160
tttctgagac agggtcttga tctgtcaccc agactggagt gcaatggttc aattttagct 8220
cactgcaacc tgtaactcct gagctcaagt gatcctccca cctcagcctc tcaagtagct 8280
gggactacag gtgtgctcca ccatgcccag cgattttttttttttttttttttttttttt 8340
ttggtaggga caaggtctcg ctatgttgcc caggctggtc ttcaattcct ggccttaagt 8400
gatcctcctg cctcggcctc ccaaagtgct ggaattacag gcatgagcca ccacccataa 8460
ctttatgttt gtttttttga tgcagtataa gttcagcttg cttcttatgc agccatacca 8520
tttcatgtta actctgattt ttagcagctt attacattag tgttttatta ttaatataat 8580
tttacagaaa tttactaaac catgactctg tagagtttta ataatactac ctccaaacat 8640
cattgcaaac atctagaaga atgaacaaaa atgatcttag atcgacagta tatctgtttg 8700
tcttagtttc tacacaggat gttcagacat attccatttc tttaaaaaaaaaatatatat 8760
atatatatat atatataggc ctggcacggt ggctcatgac tgtaatccca gcactttggg 8820
aggcttaggc aggcaaatca cctgaggtca ggagtttgag actggcctga ccaacatggt 8880
gaaacctcgt ctctattaaa attacaaaaa ttagccgggc gtggtggcac atgcctgtaa 8940
tcccagctac tcgggaggct gaggcaggag aatcacttga acctgggagg cagaggttgc 9000
agtgagttga gatcacgcca ttgcactcca gccttggtga caagagtgaa actccgtctc 9060
aaaaaaaaaa aaaatatgta tatatatata tatatatata tatatatata tatatatata 9120
tatataaaat cccaccaaaa gtctgcagag tgaccaaatt agacggctct ggtttcagat 9180
taaattctaa atgtgagaaa ccacatagct cccatgatcc atccaataat tcctcacgtc 9240
ctcttcactc tttactccat gcataaaaca gaattttttt ttctcatcct gggtatgaag 9300
caattaataa tttacggatt tagcctattt ggattcaatc ccttcaaact ccatactaca 9360
tccaaggtgg aagtgactta aactctgata tcaatcatca ggcttgtaat ataggctttg 9420
ttaatggcag gagagtctaa taaaactttc tgttccttat ccttcattta aatgaaaaac 9480
tttttattga aaacaatcat aactctagct catcataaat ataattcatg aggacatttt 9540
attattttta tattaaagaa ataatattat agatgtaaac tttgcacctt tctaattatt 9600
atcatgagtt aagctaatac ttgtcttctg gtccctagat gatgattctt ttttgcctta 9660
ctggaggagc ccttgtcttg aagtgagttg cttcaacagc agaggacttc tagtttctcc 9720
cagttgagcc taaagtgaac ttttcatctt cttcagagga aggggcttcc ttgatttgta 9780
cttttgtggc tcctcagata acacagacaa ttttatcttg gatcccaggt tctcttcacc 9840
attaagaata agaaagagag aaaatgctgt gcatgacagc cacctactcc aaactaccca 9900
accccctgta accaggtacc ttccaacaac gagatgattc tgccctcact caagagtctc 9960
ccccacaaag attccattct ccctttactt tttatttttt atttttttgc aaaacaaagg 10020
cctcctttag tacctcctta gtttataacc cttatcttcc cagctcttcc cttcactgat 10080
acctctgatt tcaaaagttc tgaagtcgga agaccacaca atttcagact gtgaacagaa 10140
attcagtcag aaattattgg agttagaaag aatttagaga aattgtattg aactagaaag 10200
tcctctttga ttagtggtgg ccctttagaa agttctaggg cagagtccca tggtgtttca 10260
tcttttccat gatttgcttg accaaaaact tttctttcac aacatgaaaa tatgctaatc 10320
caccacacat tttggattct gctctgtttg cctgaggtgt tagatctcta gccaggactg 10380
tgaagggaag gaacttgaat ccttcctatt gagctattaa tgcagagtca gtgagatgaa 10440
gggttccact cggggtcaaa atcatgtcag ttaccaagca aaggagcaag taaggggaaa 10500
catctcctca tctggttagt ggagccacat ttcacccact gatcaagcca gacctgagca 10560
atagtctaga tttctccctc cacatctaat tggtgacaat ggtgattgta ccactgacag 10620
gtcacacaag tccacaccct ccttcacagc ctcactgctt cggctcttgt ttatgctctt 10680
atcagcatct gtcactagga tccattgtc tcctgactca tctacttcct ttcactcccc 10740
tgggccatcc ttcacatcat gctagaagac tgtgtctaac ttgcagaact tattgtgtca 10800
atctcctggt tacggcccct ccatggctcc ccacctgaca taggaggccc ttcacaatct 10860
ggcttctgtc actcataact tgtctccagc cttcattctt cagtctgatt ttatggtttt 10920
ctagttcccc aatacaccac accagtgttt atgaaaccct agtcattggc ctacgaactt 10980
tatgattatt gacatattca tgtaccacct gtattatttt ttgcatagtg tttattttta 11040
attgactaca tttttaacta caataaagta atttcaacta aaa 11083
<![CDATA[ <210> 158]]>
<![CDATA[ <211> 478]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 158]]>
Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala
1 5 10 15
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val
20 25 30
Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu
35 40 45
Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Lys
50 55 60
His Leu Gly Ile Glu Phe Met Glu Ala Glu Glu Leu Tyr Gln Lys Arg
65 70 75 80
Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val Val Gly Ile
85 90 95
Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gln Arg Lys Lys Lys Leu
100 105 110
His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn Asn Met Met
115 120 125
Asn Ile Ala Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val
130 135 140
Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser Ser Ser Glu His
145 150 155 160
Ile Val Glu Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr
165 170 175
Ser Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro Ser His Ser
180 185 190
Trp Ser Asn Gly His Thr Glu Ser Ile Leu Ser Glu Ser His Ser Val
195 200 205
Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr Gly
210 215 220
Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn
225 230 235 240
Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser
245 250 255
Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met
260 265 270
Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser
275 280 285
Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met
290 295 300
Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr
305 310 315 320
Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gln Gln Phe
325 330 335
Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala
340 345 350
Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu
355 360 365
Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg
370 375 380
Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala Asn Arg Leu Glu Val
385 390 395 400
Asp Ser Asn Thr Ser Ser Gln Ser Ser Asn Ser Glu Ser Glu Thr Glu
405 410 415
Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly Ile Gln Asn Pro
420 425 430
Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser
435 440 445
Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gln Glu Glu Ile Gln Ala
450 455 460
Arg Leu Ser Ser Val Ile Ala Asn Gln Asp Pro Ile Ala Val
465 470 475
<![CDATA[ <210> 159]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 159]]>
000
<![CDATA[ <210> 160]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 160]]>
000
<![CDATA[ <210> 161]]>
<![CDATA[ <211> 586]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 161]]>
agtaagcctc cgcagcccac tcggactgca gcctgtttgc cgcccgtcct cccattgcag 60
cactcggggc gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac 120
tcgcgggtcc cgctccgctc cggcagcagc atggggaaag gacgcgcggg ccgagttggc 180
accacagcct tgcctccccg attgaaagag atgaaaagcc aggaatcggc tgcaggttcc 240
aaactagtcc ttcggtgtga aaccagttct gaatactcct ctctcagatt caagtggttc 300
aagaatggga atgaattgaa tcgaaaaaac aaaccacaaa atatcaagat acaaaaaaag 360
ccagggaagt cagaacttcg cattaacaaa gcatcactgg ctgattctgg agagtatatg 420
tgcaaagtga tcagcaaatt aggaaatgac agtgcctctg ccaatatcac catcgtggaa 480
tcaaacgaga tcatcactgg tatgccagcc tcaactgaag gagcatatgt gtcttcagag 540
tctcccatta gaatatcagt atccacagaa ggagcaaata cttctt 586
<![CDATA[ <210> 162]]>
<![CDATA[ <211> 145]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 162]]>
Met Gly Lys Gly Arg Ala Gly Arg Val Gly Thr Thr Ala Leu Pro Pro
1 5 10 15
Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu
20 25 30
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys
35 40 45
Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gln Asn
50 55 60
Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg Ile Asn Lys
65 70 75 80
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys
85 90 95
Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn
100 105 110
Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser
115 120 125
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala Asn Thr
130 135 140
Ser
145
<![CDATA[ <210> 163]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 163]]>
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly
1 5 10 15
Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu
20 25 30
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val
35 40 45
Met Ala Ser Phe
50
<![CDATA[ <210> 164]]>
<![CDATA[ <211> 65]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 164]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcag 65
<![CDATA[ <210> 165]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 165]]>
ccttgcctcc ccgattgaaa gagatgaa 28
<![CDATA[ <210> 166]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 166]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcagccttg cctccccgat tgaaagagat gaa 93
<![CDATA[ <210> ]]>167
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 167]]>
Ile Ile Ser Ile Ile Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile
1 5 10 15
Gly Phe Gly Ile Ser Ala Leu Pro Pro Arg Leu Lys Glu Met
20 25 30
<![CDATA[ <210> 168]]>
<![CDATA[ <211> 104]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 168]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggag 104
<![CDATA[ <210> 169]]>
<![CDATA[ <211> 65]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 169]]>
cataattagc atcatcatta ttctggctgg agcaattgca ctcatcattg gctttggtat 60
ttcag 65
<![CDATA[ <210> 170]]>
<![CDATA[ <211> 348]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 170]]>
ggagacactc catcacagtc actactgtcg cctcagctgg gaacattggg gaggatggaa 60
tcctgagctg cacttttgaa cctgacatca aactttctga tatcgtgata caatggctga 120
aggaaggtgt tttaggcttg gtccatgagt tcaaagaagg caaagatgag ctgtcggagc 180
aggatgaaat gttcagaggc cggacagcag tgtttgctga tcaagtgata gttggcaatg 240
cctctttgcg gctgaaaaac gtgcaactca cagatgctgg cacctacaaa tgttatatca 300
tcacttctaa aggcaagggg aatgctaacc ttgagtataa aactggag 348
<![CDATA[ <210> 171]]>
<![CDATA[ <211> 279]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 171]]>
ccttcagcat gccggaagtg aatgtggact ataatgccag ctcagagacc ttgcggtgtg 60
aggctccccg atggttcccc cagcccacag tggtctgggc atcccaagtt gaccaggggag 120
ccaacttctc ggaagtctcc aataccagct ttgagctgaa ctctgagaat gtgaccatga 180
aggttgtgtc tgtgctctac aatgttacga tcaacaacac atactcctgt atgattgaaa 240
atgacattgc caaagcaaca ggggatatca aagtgacag 279
<![CDATA[ <210> 172]]>
<![CDATA[ <211> 170]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 172]]>
aatcggagat caaaaggcgg agtcacctac agctgctaaa ctcaaaggct tctctgtgtg 60
tctcttcttt ctttgccatc agctgggcac ttctgcctct cagcccttac ctgatgctaa 120
aataatgtgc ctcggccaca aaaaagcatg caaagtcatt gttacaacag 170
<![CDATA[ <210> 173]]>
<![CDATA[ <211> 1639]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 173]]>
ggatctacag aactatttca ccaccagata tgacctagtt ttatatttct gggaggaaat 60
gaattcatat ctagaagtct ggagtgagca aacaagagca agaaacaaaa agaagccaaa 120
agcagaaggc tccaatatga acaagataaa tctatcttca aagacatatt agaagttggg 180
aaaataattc atgtgaacta gacaagtgtg ttaagagtga taagtaaaat gcacgtggag 240
acaagtgcat ccccagatct cagggacctc cccctgcctg tcacctgggg agtgagagga 300
caggatagtg catgttcttt gtctctgaat ttttagttat atgtgctgta atgttgctct 360
gaggaagccc ctggaaagtc tatcccaaca tatccacatc ttatattcca caaattaagc 420
tgtagtatgt accctaagac gctgctaatt gactgccact tcgcaactca ggggcggctg 480
cattttagta atgggtcaaa tgattcactt tttatgatgc ttccaaaggt gccttggctt 540
ctcttcccaa ctgacaaatg ccaaagttga gaaaaatgat cataatttta gcataaacag 600
agcagtcggc gacaccgatt ttataaataa actgagcacc ttctttttaa acaaacaaat 660
gcgggtttat ttctcagatg atgttcatcc gtgaatggtc cagggaagga cctttcacct 720
tgtctatatg gcattatgtc atcacaagct ctgaggcttc tcctttccat cctgcgtgga 780
cagctaagac ctcagttttc aatagcatct agagcagtgg gactcagctg gggtgatttc 840
gccccccatc tccgggggaa tgtctgaaga caattttggt tacctcaatg agggagtgga 900
ggaggataca gtgctactac caactagtgg atagaggcca gggatgctgc tcaacctcct 960
accatgtaca ggacgtctcc ccattacaac tacccaatcc gaagtgtcaa ctgtgtcagg 1020
gctaagaaac cctggttttg agtagaaaag ggcctggaaa gaggggagcc aacaaatctg 1080
tctgcttcct cacattagtc attggcaaat aagcattctg tctctttggc tgctgcctca 1140
gcacagagag ccagaactct atcgggcacc aggataacat ctctcagtga acagagttga 1200
caaggcctat gggaaatgcc tgatggggatt atcttcagct tgttgagctt ctaagtttct 1260
ttcccttcat tctaccctgc aagccaagtt ctgtaagaga aatgcctgag ttctagctca 1320
ggttttctta ctctgaattt agatctccag accctgcctg gccacaattc aaattaaggc 1380
aacaaacata taccttccat gaagcacaca cagacttttg aaagcaagga caatgactgc 1440
ttgaattgag gccttgagga atgaagcttt gaaggaaaag aatactttgt ttccagcccc 1500
cttcccacac tcttcatgtg ttaaccactg ccttcctgga ccttggagcc acggtgactg 1560
tattacatgt tgttatagaa aactgatttt agagttctga tcgttcaaga gaatgattaa 1620
atatacattt cctacacca 1639
<![CDATA[ <210> 174]]>
<![CDATA[ <211> 2605]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 174]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggagcataat tagcatcatc 120
attattctgg ctggagcaat tgcactcatc attggctttg gtatttcagg gagacactcc 180
atcacagtca ctactgtcgc ctcagctggg aacattgggg aggatggaat cctgagctgc 240
acttttgaac ctgacatcaa actttctgat atcgtgatac aatggctgaa ggaaggtgtt 300
ttaggcttgg tccatgagtt caaagaaggc aaagatgagc tgtcggagca ggatgaaatg 360
ttcagaggcc ggacagcagt gtttgctgat caagtgatag ttggcaatgc ctctttgcgg 420
ctgaaaaacg tgcaactcac agatgctggc acctacaaat gttatatcat cacttctaaa 480
ggcaagggga atgctaacct tgagtataaa actggagcct tcagcatgcc ggaagtgaat 540
gtggactata atgccagctc agagaccttg cggtgtgagg ctccccgatg gttcccccag 600
cccacagtgg tctgggcatc ccaagttgac cagggagcca acttctcgga agtctccaat 660
accagctttg agctgaactc tgagaatgtg accatgaagg ttgtgtctgt gctctacaat 720
gttacgatca acaacacata ctcctgtatg attgaaaatg aattgccaa agcaacaggg 780
gatatcaaag tgacagaatc ggagatcaaa aggcggagtc acctacagct gctaaactca 840
aaggcttctc tgtgtgtctc ttctttcttt gccatcagct gggcacttct gcctctcagc 900
ccttacctga tgctaaaata atgtgcctcg gccacaaaaa agcatgcaaa gtcattgtta 960
caacagggat ctacagaact atttcaccac cagatatgac ctagttttat atttctggga 1020
ggaaatgaat tcatatctag aagtctggag tgagcaaaca agagcaagaa acaaaaagaa 1080
gccaaaagca gaaggctcca atatgaacaa gataaatcta tcttcaaaga catattagaa 1140
gttgggaaaa taattcatgt gaactagaca agtgtgttaa gagtgataag taaaatgcac 1200
gtggagacaa gtgcatcccc agatctcagg gacctccccc tgcctgtcac ctggggagtg 1260
agaggacagg atagtgcatg ttctttgtct ctgaattttt agttatatgt gctgtaatgt 1320
tgctctgagg aagcccctgg aaagtctatc ccaacatatc cacatcttat attccacaaa 1380
ttaagctgta gtatgtaccc taagacgctg ctaattgact gccacttcgc aactcagggg 1440
cggctgcatt ttagtaatgg gtcaaatgat tcacttttta tgatgcttcc aaaggtgcct 1500
tggcttctct tcccaactga caaatgccaa agttgagaaa aatgatcata attttagcat 1560
aaacagagca gtcggcgaca ccgattttat aaataaactg agcaccttct ttttaaacaa 1620
acaaatgcgg gtttatttct cagatgatgt tcatccgtga atggtccagg gaaggacctt 1680
tcaccttgtc tatatggcat tatgtcatca caagctctga ggcttctcct ttccatcctg 1740
cgtggacagc taagacctca gttttcaata gcatctagag cagtggggact cagctggggt 1800
gatttcgccc cccatctccg ggggaatgtc tgaagacaat tttggttacc tcaatgaggg 1860
agtgggaggag gatacagtgc tactaccaac tagtggatag aggccaggga tgctgctcaa 1920
cctcctacca tgtacaggac gtctccccat tacaactacc caatccgaag tgtcaactgt 1980
gtcagggcta agaaaccctg gttttgagta gaaaagggcc tggaaagagg ggagccaaca 2040
aatctgtctg cttcctcaca ttagtcattg gcaaataagc attctgtctc tttggctgct 2100
gcctcagcac agagagccag aactctatcg ggcaccagga taacatctct cagtgaacag 2160
agttgacaag gcctatggga aatgcctgat gggattatct tcagcttgtt gagcttctaa 2220
gtttctttcc cttcattcta ccctgcaagc caagttctgt aagagaaatg cctgagttct 2280
agctcaggtt ttcttactct gaatttagat ctccagaccc tgcctggcca caattcaaat 2340
taaggcaaca aacatatacc ttccatgaag cacacacaga cttttgaaag caaggacaat 2400
gactgcttga attgaggcct tgaggaatga agctttgaag gaaaagaata ctttgtttcc 2460
agcccccttc ccacactctt catgtgttaa ccactgcctt cctggacctt ggagccacgg 2520
tgactgtatt acatgttgtt atagaaaact gattttagag ttctgatcgt tcaagagaat 2580
gattaaatat aatttccta cacca 2605
<![CDATA[ <210> 175]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 175]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp
1 5 10
<![CDATA[ <210> 176]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 176]]>
Ile Ile Ser Ile Ile Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile
1 5 10 15
Gly Phe Gly Ile Ser
20
<![CDATA[ <210> 177]]>
<![CDATA[ <211> 115]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 177]]>
Arg His Ser Ile Thr Val Thr Thr Val Ala Ser Ala Gly Asn Ile Gly
1 5 10 15
Glu Asp Gly Ile Leu Ser Cys Thr Phe Glu Pro Asp Ile Lys Leu Ser
20 25 30
Asp Ile Val Ile Gln Trp Leu Lys Glu Gly Val Leu Gly Leu Val His
35 40 45
Glu Phe Lys Glu Gly Lys Asp Glu Leu Ser Glu Gln Asp Glu Met Phe
50 55 60
Arg Gly Arg Thr Ala Val Phe Ala Asp Gln Val Ile Val Gly Asn Ala
65 70 75 80
Ser Leu Arg Leu Lys Asn Val Gln Leu Thr Asp Ala Gly Thr Tyr Lys
85 90 95
Cys Tyr Ile Ile Thr Ser Lys Gly Lys Gly Asn Ala Asn Leu Glu Tyr
100 105 110
Lys Thr Gly
115
<![CDATA[ <210> 178]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 178]]>
Phe Ser Met Pro Glu Val Asn Val Asp Tyr Asn Ala Ser Ser Glu Thr
1 5 10 15
Leu Arg Cys Glu Ala Pro Arg Trp Phe Pro Gln Pro Thr Val Val Trp
20 25 30
Ala Ser Gln Val Asp Gln Gly Ala Asn Phe Ser Glu Val Ser Asn Thr
35 40 45
Ser Phe Glu Leu Asn Ser Glu Asn Val Thr Met Lys Val Val Ser Val
50 55 60
Leu Tyr Asn Val Thr Ile Asn Asn Thr Tyr Ser Cys Met Ile Glu Asn
65 70 75 80
Asp Ile Ala Lys Ala Thr Gly Asp Ile Lys Val Thr
85 90
<![CDATA[ <210> 179]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 179]]>
Ser Glu Ile Lys Arg Arg Ser His Leu Gln Leu Leu Asn Ser Lys Ala
1 5 10 15
Ser Leu Cys Val Ser Ser Phe Phe Ala Ile Ser Trp Ala Leu Leu Pro
20 25 30
Leu Ser Pro Tyr Leu Met Leu Lys
35 40
<![CDATA[ <210> 180]]>
<![CDATA[ <211> 282]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 180]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp Ser Ile Ile Ser Ile Ile
1 5 10 15
Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile Gly Phe Gly Ile Ser
20 25 30
Gly Arg His Ser Ile Thr Val Thr Thr Val Ala Ser Ala Gly Asn Ile
35 40 45
Gly Glu Asp Gly Ile Leu Ser Cys Thr Phe Glu Pro Asp Ile Lys Leu
50 55 60
Ser Asp Ile Val Ile Gln Trp Leu Lys Glu Gly Val Leu Gly Leu Val
65 70 75 80
His Glu Phe Lys Glu Gly Lys Asp Glu Leu Ser Glu Gln Asp Glu Met
85 90 95
Phe Arg Gly Arg Thr Ala Val Phe Ala Asp Gln Val Ile Val Gly Asn
100 105 110
Ala Ser Leu Arg Leu Lys Asn Val Gln Leu Thr Asp Ala Gly Thr Tyr
115 120 125
Lys Cys Tyr Ile Ile Thr Ser Lys Gly Lys Gly Asn Ala Asn Leu Glu
130 135 140
Tyr Lys Thr Gly Ala Phe Ser Met Pro Glu Val Asn Val Asp Tyr Asn
145 150 155 160
Ala Ser Ser Glu Thr Leu Arg Cys Glu Ala Pro Arg Trp Phe Pro Gln
165 170 175
Pro Thr Val Val Trp Ala Ser Gln Val Asp Gln Gly Ala Asn Phe Ser
180 185 190
Glu Val Ser Asn Thr Ser Phe Glu Leu Asn Ser Glu Asn Val Thr Met
195 200 205
Lys Val Val Ser Val Leu Tyr Asn Val Thr Ile Asn Asn Thr Tyr Ser
210 215 220
Cys Met Ile Glu Asn Asp Ile Ala Lys Ala Thr Gly Asp Ile Lys Val
225 230 235 240
Thr Glu Ser Glu Ile Lys Arg Arg Ser His Leu Gln Leu Leu Asn Ser
245 250 255
Lys Ala Ser Leu Cys Val Ser Ser Phe Phe Ala Ile Ser Trp Ala Leu
260 265 270
Leu Pro Leu Ser Pro Tyr Leu Met Leu Lys
275 280
<![CDATA[ <210> 181]]>
<![CDATA[ <211> 169]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 181]]>
gtgagtcacc aaggaaggca gcggcagctc cactcagcca gtacccagat acgctgggaa 60
ccttccccag ccatggcttc cctggggcag atcctcttct ggagcataat tagcatcatc 120
attattctgg ctggagcaat tgcactcatc attggctttg gtatttcag 169
<![CDATA[ <210> 182]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 182]]>
Met Ala Ser Leu Gly Gln Ile Leu Phe Trp Ser Ile Ile Ser Ile Ile
1 5 10 15
Ile Ile Leu Ala Gly Ala Ile Ala Leu Ile Ile Gly Phe Gly Ile Ser
20 25 30
<![CDATA[ <210> 183]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 183]]>
000
<![CDATA[ <210> 184]]>
<![CDATA[ <211> 119]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 184]]>
ctggctggag attaatccgg acactggtgc catttccact cgggctgagc tggacaggga 60
ggattttgag cacgtgaaga acagcacgta cacagcccta atcatagcta cagacaatg 119
<![CDATA[ <210> 185]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 185]]>
ccttgcctcc ccgattgaaa gagatgaaaa 30
<![CDATA[ <210> 186]]>
<![CDATA[ <211> 148]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 186]]>
ctggctggag attaatccgg acactggtgc catttccact cgggctgagc tggacaggga 60
ggattttgag cacgtgaaga acagcacgta cacagcccta atcatagcta cagacaatgc 120
cttgcctccc cgattgaaag agatgaaa 148
<![CDATA[ <210> 187]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 187]]>
Trp Leu Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu
1 5 10 15
Leu Asp Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala
20 25 30
Leu Ile Ile Ala Thr Asp Asn Ala Leu Pro Pro Arg Leu Lys Glu Met
35 40 45
Lys
<![CDATA[ <210> 188]]>
<![CDATA[ <211> 172]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 188]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc ag 172
<![CDATA[ <210> 189]]>
<![CDATA[ <211> 115]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 189]]>
gtctcctctt ggctctgcca ggagccggag ccctgccacc ctggctttga cgccgagagc 60
tacacgttca cggtgccccg gcgccacctg gagagaggcc gcgtcctggg cagag 115
<![CDATA[ <210> 190]]>
<![CDATA[ <211> 224]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 190]]>
tgaattttga agattgcacc ggtcgacaaa ggacagccta tttttccctc gacacccgat 60
tcaaagtggg cacagatggt gtgattacag tcaaaaggcc tctacggttt cataacccac 120
agatccattt cttggtctac gcctgggact ccacctacag aaagttttcc accaaagtca 180
cgctgaatac agtggggcac caccaccgcc ccccgcccca tcag 224
<![CDATA[ <210> 191]]>
<![CDATA[ <211> 144]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 191]]>
gcctccgttt ctggaatcca agcagaattg ctcacatttc ccaactcctc tcctggcctc 60
agaagacaga agagagactg ggttattcct cccatcagct gcccagaaaa tgaaaaaggc 120
ccatttccta aaaacctggt tcag 144
<![CDATA[ <210> 192]]>
<![CDATA[ <211> 156]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 192]]>
atcaaatcca acaaagacaa agaaggcaag gttttctaca gcatcactgg ccaaggagct 60
gacacacccc ctgttggtgt ctttattatt gaaagagaaa caggatggct gaaggtgaca 120
gagcctctgg atagagaacg cattgccaca tacact 156
<![CDATA[ <210> 193]]>
<![CDATA[ <211> 145]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 193]]>
ctcttctctc acgctgtgtc atccaacggg aatgcagttg aggatccaat ggagattttg 60
atcacggtaa ccgatcagaa tgacaacaag cccgaattca cccaggaggt ctttaagggg 120
tctgtcatgg aaggtgctct tccag 145
<![CDATA[ <210> 194]]>
<![CDATA[ <211> 176]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 194]]>
gaacctctgt gatggaggtc acagccacag acgcggacga tgatgtgaac acctacaatg 60
ccgccatcgc ttacaccatc ctcagccaag atcctgagct ccctgacaaa aatatgttca 120
ccattaacag gaacacagga gtcatcagtg tggtcaccac tgggctggac cgagag 176
<![CDATA[ <210> 195]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 195]]>
agtttcccta cgtataccct ggtggttcaa gctgctgacc ttcaaggtga ggggttaagc 60
acaacagcaa cagctgtgat cacagtcact gacaccaacg ataatcctcc gatcttcaat 120
cccaccacg 129
<![CDATA[ <210> 196]]>
<![CDATA[ <211> 250]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 196]]>
tacaagggtc aggtgcctga gaacgaggct aacgtcgtaa tcaccacact gaaagtgact 60
gatgctgatg cccccaatac cccagcgtgg gaggctgtat acaccatatt gaatgatgat 120
ggtggacaat ttgtcgtcac cacaaatcca gtgaacaacg atggcatttt gaaaacagca 180
aaggtttgta tggtacctgg caagatgcag aaactggcat cctcacagct gttcataccc 240
ttgtcccctg 250
<![CDATA[ <210> 197]]>
<![CDATA[ <211> 245]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 197]]>
ggcttggatt ttgaggccaa gcagcagtac attctacacg tagcagtgac gaatgtggta 60
ccttttgagg tctctctcac cacctccaca gccaccgtca ccgtggatgt gctggatgtg 120
aatgaagccc ccatctttgt gcctcctgaa aagagagtgg aagtgtccga ggactttggc 180
gtgggccagg aaatcacatc ctacactgcc caggagccag acacatttat ggaacagaaa 240
ataac 245
<![CDATA[ <210> 198]]>
<![CDATA[ <211> 146]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 198]]>
atatcggatt tggagagaca ctgccaactg gctggagatt aatccggaca ctggtgccat 60
ttccactcgg gctgagctgg acaggggagga ttttgagcac gtgaagaaca gcacgtacac 120
agccctaatc atagctacag acaatg 146
<![CDATA[ <210> 199]]>
<![CDATA[ <211> 225]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 199]]>
gttctccagt tgctactgga acagggacac ttctgctgat cctgtctgat gtgaatgaca 60
acgcccccat accagaacct cgaactatat tcttctgtga gaggaatcca aagcctcagg 120
tcataaacat cattgatgca gaccttcctc ccaatacatc tcccttcaca gcagaactaa 180
cacacggggc gagtgccaac tggaccattc agtacaacga cccaa 225
<![CDATA[ <210> 200]]>
<![CDATA[ <211> 228]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 200]]>
cccaagaatc tatcattttg aagccaaaga tggccttaga ggtgggtgac tacaaaatca 60
atctcaagct catggataac cagaataaag accaagtgac caccttagag gtcagcgtgt 120
gtgactgtga aggggccgct ggcgtctgta ggaaggcaca gcctgtcgaa gcaggattgc 180
aaattcctgc cattctgggg attcttggag gaattcttgc tttgctaa 228
<![CDATA[ <210> 201]]>
<![CDATA[ <211> 131]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 201]]>
ttctgattct gctgctcttg ctgtttcttc ggaggagagc ggtggtcaaa gagcccttac 60
tgccccccaga ggatgacacc cgggacaacg tttattaacta tgatgaagaa ggaggcggag 120
aagaggacca g 131
<![CDATA[ <210> 202]]>
<![CDATA[ <211> 144]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 202]]>
gactttgact tgagccagct gcacaggggc ctggacgctc ggcctgaagt gactcgtaac 60
gacgttgcac caaccctcat gagtgtcccc cggtatcttc cccgccctgc caatcccgat 120
gaaattggaa attttattga tgaa 144
<![CDATA[ <210> 203]]>
<![CDATA[ <211> 2248]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 203]]>
aatctgaaag cggctgatac tgaccccaca gccccgcctt atgattctct gctcgtgttt 60
gactatgaag gaagcggttc cgaagctgct agtctgagct ccctgaactc ctcagagtca 120
gacaaagacc aggactatga ctacttgaac gaatggggca atcgcttcaa gaagctggct 180
gacatgtacg gaggcggcga ggacgactag gggactcgag agaggcgggc cccagaccca 240
tgtgctggga aatgcagaaa tcacgttgct ggtggttttt cagctccctt cccttgagat 300
gagtttctgg ggaaaaaaaa gagactggtt agtgatgcag ttagtatagc tttatactct 360
ctccacttta tagctctaat aagtttgtgttagaaaagtt tcgacttatt tcttaaagct 420
tttttttttt tcccatcact ctttacatgg tggtgatgtc caaaagatac ccaaatttta 480
atattccaga agaacaactt tagcatcaga aggttcaccc agcaccttgc agattttctt 540
aaggaatttt gtctcacttt taaaaagaag gggagaagtc agctactcta gttctgttgt 600
tttgtgtata taatttttta aaaaaaattt gtgtgcttct gctcattact acactggtgt 660
gtccctctgc cttttttttt tttttaagac agggtctcat tctatcggcc aggctggagt 720
gcagtggtgc aatcacagct cactgcagcc ttgtcctccc aggctcaagc tatccttgca 780
cctcagcctc ccaagtagct gggaccacag gcatgcacca ctacgcatga ctaatttttt 840
aaatatttga gacggggtct ccctgtgtta cccaggctgg tctcaaactc ctggggctcaa 900
gtgatcctcc catcttggcc tcccagagta ttgggattac agacatgagc cactgcacct 960
gcccagctcc ccaactccct gccatttttt aagagacagt ttcgctccat cgcccaggcc 1020
tgggatgcag tgatgtgatc atagctcact gtaacctcaa actctggggc tcaagcagtt 1080
ctcccaccag cctccttttt atttttttgt acagatgggg tcttgctatg ttgcccaagc 1140
tggtcttaaa ctcctggcct caagcaatcc ttctgccttg gccccccaaa gtgctgggat 1200
tgtgggcatg agctgctgtg cccagcctcc atgttttaat atcaactctc actcctgaat 1260
tcagttgctt tgcccaagat aggagttctc tgatgcagaa attattgggc tcttttaggg 1320
taagaagttt gtgtctttgt ctggccacat cttgactagg tattgtctac tctgaagacc 1380
tttaatggct tccctctttc atctcctgag tatgtaactt gcaatgggca gctatccagt 1440
gacttgttct gagtaagtgt gttcattaat gtttattag ctctgaagca agagtgatat 1500
actccaggac ttagaatagt gcctaaagtg ctgcagccaa agacagagcg gaactatgaa 1560
aagtggggctt ggagatggca ggagagcttg tcattgagcc tggcaattta gcaaactgat 1620
gctgaggatg attgaggtgg gtctacctca tctctgaaaa ttctggaagg aatggaggag 1680
tctcaacatg tgtttctgac acaagatccg tggtttgtac tcaaagccca gaatccccaa 1740
gtgcctgctt ttgatgatgt ctacagaaaa tgctggctga gctgaacaca tttgcccaat 1800
tccaggtgtg cacagaaaac cgagaatatt caaaattcca aatttttttc ttaggagcaa 1860
gaagaaaatg tggccctaaa ggggggttagt tgaggggtag ggggtagtga ggatcttgat 1920
ttggatctct ttttatttaa atgtgaattt caacttttga caatcaaaga aaagactttt 1980
gttgaaatag ctttactgtt tctcaagtgt tttggagaaa aaaatcaacc ctgcaatcac 2040
tttttggaat tgtcttgatt tttcggcagt tcaagctata tcgaatatag ttctgtgtag 2100
agaatgtcac tgtagttttg agtgtataca tgtgtgggtg ctgataattg tgtattttct 2160
ttgggggtgg aaaaggaaaa caattcaagc tgagaaaagt attctcaaag atgcattttt 2220
ataaatttta ttaaacaatt ttgttaaa 2248
<![CDATA[ <210> 204]]>
<![CDATA[ <211> 4878]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 204]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc aggtctcctc 180
ttggctctgc caggagccgg agccctgcca ccctggcttt gacgccgaga gctacacgtt 240
cacggtgccc cggcgccacc tggagagagg ccgcgtcctg ggcagagtga attttgaaga 300
ttgcaccggt cgacaaagga cagcctatttttccctcgac acccgattca aagtgggcac 360
agatggtgtg attacagtca aaaggcctct acggtttcat aacccacaga tccattctt 420
ggtctacgcc tgggactcca cctacagaaa gttttccacc aaagtcacgc tgaatacagt 480
ggggcaccac caccgccccc cgccccatca ggcctccgtt tctggaatcc aagcagaatt 540
gctcacattt cccaactcct ctcctggcct cagaagacag aagagagact gggttatcc 600
tcccatcagc tgcccagaaa atgaaaaagg cccatttcct aaaaacctgg ttcagatcaa 660
atccaacaaa gacaaagaag gcaaggtttt ctacagcatc actggccaag gagctgacac 720
accccctgtt ggtgtcttta ttattgaaag agaaacagga tggctgaagg tgacagagcc 780
tctggataga gaacgcattg ccacatacac tctcttctct cacgctgtgt catccaacgg 840
gaatgcagtt gaggatccaa tggagatttt gatcacggta accgatcaga atgacaacaa 900
gcccgaattc acccaggagg tctttaaggg gtctgtcatg gaaggtgctc ttccaggaac 960
ctctgtgatg gaggtcacag ccacagacgc ggacgatgat gtgaacacct acaatgccgc 1020
catcgcttac accatcctca gccaagatcc tgagctccct gacaaaaata tgttcaccat 1080
taacaggaac acaggaggtca tcagtgtggt caccactggg ctggaccgag agagtttccc 1140
tacgtatacc ctggtggttc aagctgctga ccttcaaggt gaggggttaa gcacaacagc 1200
aacagctgtg atcacagtca ctgacaccaa cgataatcct ccgatcttca atcccaccac 1260
gtacaagggt caggtgcctg agaacgaggc taacgtcgta atcaccacac tgaaagtgac 1320
tgatgctgat gcccccaata ccccagcgtg ggaggctgta tacaccatat tgaatgatga 1380
tggtggacaa tttgtcgtca ccacaaatcc agtgaacaac gatggcattt tgaaaacagc 1440
aaaggtttgt atggtacctg gcaagatgca gaaactggca tcctcacagc tgttcatacc 1500
cttgtcccct gggcttggat tttgaggcca agcagcagta cattctacac gtagcagtga 1560
cgaatgtggt accttttgag gtctctctca ccacctccac agccaccgtc accgtggatg 1620
tgctggatgt gaatgaagcc cccatctttg tgcctcctga aaagagagtg gaagtgtccg 1680
aggactttgg cgtggggccag gaaatcacat cctacactgc ccaggagcca gacacattta 1740
tggaacagaa aataacatat cggatttgga gagacactgc caactggctg gagattaatc 1800
cggacactgg tgccatttcc actcgggctg agctggacag ggaggatttt gagcacgtga 1860
agaacagcac gtacacagcc ctaatcatag ctacagacaa tggttctcca gttgctactg 1920
gaacagggac acttctgctg atcctgtctg atgtgaatga caacgccccc ataccagaac 1980
ctcgaactat attcttctgt gagaggaatc caaagcctca ggtcataaac atcattgatg 2040
cagaccttcc tcccaataca tctcccttca cagcagaact aacacacggg gcgagtgcca 2100
actggaccat tcagtacaac gacccaaccc aagaatctat cattttgaag ccaaagatgg 2160
ccttagaggt gggtgactac aaaatcaatc tcaagctcat ggataaccag aataaagacc 2220
aagtgaccac cttagaggtc agcgtgtgtg actgtgaagg ggccgctggc gtctgtagga 2280
aggcacagcc tgtcgaagca ggattgcaaa ttcctgccat tctggggatt cttggaggaa 2340
ttcttgcttt gctaattctg attctgctgc tcttgctgtt tcttcggagg agagcggtgg 2400
tcaaagagcc cttactgccc ccagaggatg acacccggga caacgtttat tactatgatg 2460
aagaaggagg cggagaagag gaccaggact ttgacttgag ccagctgcac aggggcctgg 2520
acgctcggcc tgaagtgact cgtaacgacg ttgcaccaac cctcatgagt gtcccccggt 2580
atcttccccg ccctgccaat cccgatgaaa ttggaaattt tattgatgaa aatctgaaag 2640
cggctgatac tgaccccaca gccccgcctt atgattctct gctcgtgttt gactatgaag 2700
gaagcggttc cgaagctgct agtctgagct ccctgaactc ctcagagtca gacaaagacc 2760
aggactatga ctacttgaac gaatggggca atcgcttcaa gaagctggct gacatgtacg 2820
gaggcggcga ggacgactag gggactcgag agaggcgggc cccagaccca tgtgctggga 2880
aatgcagaaa tcacgttgct ggtggttttt cagctccctt cccttgagat gagtttctgg 2940
ggaaaaaaaa gagactggtt agtgatgcag ttagtatagc tttatactct ctccacttta 3000
tagctctaat aagtttgtgttagaaaagtt tcgacttatt tcttaaagct tttttttttt 3060
tcccatcact ctttacatgg tggtgatgtc caaaagatac ccaaatttta atattccaga 3120
agaacaactt tagcatcaga aggttcaccc agcaccttgc aagattttctt aaggaatttt 3180
gtctcacttt taaaaagaag gggagaagtc agctactcta gttctgttgt tttgtgtata 3240
taatttttta aaaaaaattt gtgtgcttct gctcattact acactggtgt gtccctctgc 3300
cttttttttt tttttaagac agggtctcat tctatcggcc aggctggagt gcagtggtgc 3360
aatcacagct cactgcagcc ttgtcctccc aggctcaagc tatccttgca cctcagcctc 3420
ccaagtagct gggaccacag gcatgcacca ctacgcatga ctaatttttt aaatatttga 3480
gacggggtct ccctgtgtta cccaggctgg tctcaaactc ctggggctcaa gtgatcctcc 3540
catcttggcc tcccagagta ttgggattac agacatgagc cactgcacct gcccagctcc 3600
ccaactccct gccatttttt aagagacagt ttcgctccat cgcccaggcc tgggatgcag 3660
tgatgtgatc atagctcact gtaacctcaa actctggggc tcaagcagtt ctcccaccag 3720
cctccttttt atttttttgt acagatgggg tcttgctatg ttgcccaagc tggtcttaaa 3780
ctcctggcct caagcaatcc ttctgccttg gccccccaaa gtgctgggat tgtgggcatg 3840
agctgctgtg cccagcctcc atgttttaat atcaactctc actcctgaat tcagttgctt 3900
tgcccaagat aggagttctc tgatgcagaa attattgggc tcttttaggg taagaagttt 3960
gtgtctttgt ctggccacat cttgactagg tattgtctac tctgaagacc tttaatggct 4020
tccctctttc atctcctgag tatgtaactt gcaatgggca gctatccagt gacttgttct 4080
gagtaagtgt gttcattaat gtttatttag ctctgaagca agagtgatat actccaggac 4140
ttagaatagt gcctaaagtg ctgcagccaa agacagagcg gaactatgaa aagtgggctt 4200
ggagatggca ggagagcttg tcattgagcc tggcaattta gcaaactgat gctgaggatg 4260
attgaggtgg gtctacctca tctctgaaaa ttctggaagg aatggaggag tctcaacatg 4320
tgtttctgac acaagatccg tggtttgtac tcaaagccca gaatccccaa gtgcctgctt 4380
ttgatgatgt ctacagaaaa tgctggctga gctgaacaca tttgcccaat tccaggtgtg 4440
cacagaaaac cgagaatatt caaaattcca aatttttttc ttaggagcaa gaagaaaatg 4500
tggccctaaa gggggttagt tgaggggtag ggggtagtga ggatcttgat ttggatctct 4560
ttttatttaa atgtgaattt caacttttga caatcaaaga aaagactttt gttgaaatag 4620
ctttactgtt tctcaagtgt tttggagaaa aaaatcaacc ctgcaatcac tttttggaat 4680
tgtcttgatt tttcggcagt tcaagctata tcgaatatag ttctgtgtag agaatgtcac 4740
tgtagttttg agtgtataca tgtgtgggtg ctgataattg tgtattttct ttgggggtgg 4800
aaaaggaaaa caattcaagc tgagaaaagt attctcaaag atgcattttt ataaatttta 4860
ttaaacaatt ttgttaaa 4878
<![CDATA[ <210> 205]]>
<![CDATA[ <211> 5]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 205]]>
Met Glu Gln Lys Ile
1 5
<![CDATA[ <210> 206]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 206]]>
Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp Leu Glu Ile Asn Pro Asp
1 5 10 15
Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu Asp Arg Glu Asp Phe Glu
20 25 30
His Val Lys Asn Ser Thr Tyr Thr Ala Leu Ile Ile Ala Thr Asp Asn
35 40 45
<![CDATA[ <210> 207]]>
<![CDATA[ <211> 74]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 207]]>
Ser Pro Val Ala Thr Gly Thr Gly Thr Leu Leu Leu Ile Leu Ser Asp
1 5 10 15
Val Asn Asp Asn Ala Pro Ile Pro Glu Pro Arg Thr Ile Phe Phe Cys
20 25 30
Glu Arg Asn Pro Lys Pro Gln Val Ile Asn Ile Ile Asp Ala Asp Leu
35 40 45
Pro Pro Asn Thr Ser Pro Phe Thr Ala Glu Leu Thr His Gly Ala Ser
50 55 60
Ala Asn Trp Thr Ile Gln Tyr Asn Asp Pro
65 70
<![CDATA[ <210> 208]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 208]]>
Gln Glu Ser Ile Ile Leu Lys Pro Lys Met Ala Leu Glu Val Gly Asp
1 5 10 15
Tyr Lys Ile Asn Leu Lys Leu Met Asp Asn Gln Asn Lys Asp Gln Val
20 25 30
Thr Thr Leu Glu Val Ser Val Cys Asp Cys Glu Gly Ala Ala Gly Val
35 40 45
Cys Arg Lys Ala Gln Pro Val Glu Ala Gly Leu Gln Ile Pro Ala Ile
50 55 60
Leu Gly Ile Leu Gly Gly Ile Leu Ala Leu Leu
65 70 75
<![CDATA[ <210> 209]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 209]]>
Leu Ile Leu Leu Leu Leu Leu Leu Phe Leu Arg Arg Arg Arg Ala Val Val Lys
1 5 10 15
Glu Pro Leu Leu Pro Pro Glu Asp Asp Thr Arg Asp Asn Val Tyr Tyr
20 25 30
Tyr Asp Glu Glu Gly Gly Gly Gly Glu Glu Asp Gln
35 40
<![CDATA[ <210> 210]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 210]]>
Asp Phe Asp Leu Ser Gln Leu His Arg Gly Leu Asp Ala Arg Pro Glu
1 5 10 15
Val Thr Arg Asn Asp Val Ala Pro Thr Leu Met Ser Val Pro Arg Tyr
20 25 30
Leu Pro Arg Pro Ala Asn Pro Asp Glu Ile Gly Asn Phe Ile Asp Glu
35 40 45
<![CDATA[ <210> 211]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 211]]>
Asn Leu Lys Ala Ala Asp Thr Asp Pro Thr Ala Pro Pro Tyr Asp Ser
1 5 10 15
Leu Leu Val Phe Asp Tyr Glu Gly Ser Gly Ser Glu Ala Ala Ser Leu
20 25 30
Ser Ser Leu Asn Ser Ser Glu Ser Asp Lys Asp Gln Asp Tyr Asp Tyr
35 40 45
Leu Asn Glu Trp Gly Asn Arg Phe Lys Lys Leu Ala Asp Met Tyr Gly
50 55 60
Gly Gly Glu Asp Asp
65
<![CDATA[ <210> 212]]>
<![CDATA[ <211> 366]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 212]]>
Met Glu Gln Lys Ile Thr Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp
1 5 10 15
Leu Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu
20 25 30
Asp Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala Leu
35 40 45
Ile Ile Ala Thr Asp Asn Gly Ser Pro Val Ala Thr Gly Thr Gly Thr
50 55 60
Leu Leu Leu Ile Leu Ser Asp Val Asn Asp Asn Ala Pro Ile Pro Glu
65 70 75 80
Pro Arg Thr Ile Phe Phe Cys Glu Arg Asn Pro Lys Pro Gln Val Ile
85 90 95
Asn Ile Ile Asp Ala Asp Leu Pro Asn Thr Ser Pro Phe Thr Ala
100 105 110
Glu Leu Thr His Gly Ala Ser Ala Asn Trp Thr Ile Gln Tyr Asn Asp
115 120 125
Pro Thr Gln Glu Ser Ile Ile Leu Lys Pro Lys Met Ala Leu Glu Val
130 135 140
Gly Asp Tyr Lys Ile Asn Leu Lys Leu Met Asp Asn Gln Asn Lys Asp
145 150 155 160
Gln Val Thr Thr Leu Glu Val Ser Val Cys Asp Cys Glu Gly Ala Ala
165 170 175
Gly Val Cys Arg Lys Ala Gln Pro Val Glu Ala Gly Leu Gln Ile Pro
180 185 190
Ala Ile Leu Gly Ile Leu Gly Gly Ile Leu Ala Leu Leu Ile Leu Ile
195 200 205
Leu Leu Leu Leu Leu Leu Phe Leu Arg Arg Arg Ala Val Val Lys Glu Pro
210 215 220
Leu Leu Pro Pro Glu Asp Asp Thr Arg Asp Asn Val Tyr Tyr Tyr Asp
225 230 235 240
Glu Glu Gly Gly Gly Glu Glu Asp Gln Asp Phe Asp Leu Ser Gln Leu
245 250 255
His Arg Gly Leu Asp Ala Arg Pro Glu Val Thr Arg Asn Asp Val Ala
260 265 270
Pro Thr Leu Met Ser Val Pro Arg Tyr Leu Pro Arg Pro Ala Asn Pro
275 280 285
Asp Glu Ile Gly Asn Phe Ile Asp Glu Asn Leu Lys Ala Ala Asp Thr
290 295 300
Asp Pro Thr Ala Pro Pro Tyr Asp Ser Leu Leu Val Phe Asp Tyr Glu
305 310 315 320
Gly Ser Gly Ser Glu Ala Ala Ser Leu Ser Ser Leu Asn Ser Ser Glu
325 330 335
Ser Asp Lys Asp Gln Asp Tyr Asp Tyr Leu Asn Glu Trp Gly Asn Arg
340 345 350
Phe Lys Lys Leu Ala Asp Met Tyr Gly Gly Gly Glu Asp Asp
355 360 365
<![CDATA[ <210> 213]]>
<![CDATA[ <211> 1902]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 213]]>
agtggcgtcg gaactgcaaa gcacctgtga gcttgcggaa gtcagttcag actccagccc 60
gctccagccc ggcccgaccc gaccgcaccc ggcgcctgcc ctcgctcggc gtccccggcc 120
agccatgggc ccttggagcc gcagcctctc ggcgctgctg ctgctgctgc aggtctcctc 180
ttggctctgc caggagccgg agccctgcca ccctggcttt gacgccgaga gctacacgtt 240
cacggtgccc cggcgccacc tggagagagg ccgcgtcctg ggcagagtga attttgaaga 300
ttgcaccggt cgacaaagga cagcctatttttccctcgac acccgattca aagtgggcac 360
agatggtgtg attacagtca aaaggcctct acggtttcat aacccacaga tccattctt 420
ggtctacgcc tgggactcca cctacagaaa gttttccacc aaagtcacgc tgaatacagt 480
ggggcaccac caccgccccc cgccccatca ggcctccgtt tctggaatcc aagcagaatt 540
gctcacattt cccaactcct ctcctggcct cagaagacag aagagagact gggttatcc 600
tcccatcagc tgcccagaaa atgaaaaagg cccatttcct aaaaacctgg ttcagatcaa 660
atccaacaaa gacaaagaag gcaaggtttt ctacagcatc actggccaag gagctgacac 720
accccctgtt ggtgtcttta ttattgaaag agaaacagga tggctgaagg tgacagagcc 780
tctggataga gaacgcattg ccacatacac tctcttctct cacgctgtgt catccaacgg 840
gaatgcagtt gaggatccaa tggagatttt gatcacggta accgatcaga atgacaacaa 900
gcccgaattc acccaggagg tctttaaggg gtctgtcatg gaaggtgctc ttccaggaac 960
ctctgtgatg gaggtcacag ccacagacgc ggacgatgat gtgaacacct acaatgccgc 1020
catcgcttac accatcctca gccaagatcc tgagctccct gacaaaaata tgttcaccat 1080
taacaggaac acaggaggtca tcagtgtggt caccactggg ctggaccgag agagtttccc 1140
tacgtatacc ctggtggttc aagctgctga ccttcaaggt gaggggttaa gcacaacagc 1200
aacagctgtg atcacagtca ctgacaccaa cgataatcct ccgatcttca atcccaccac 1260
gtacaagggt caggtgcctg agaacgaggc taacgtcgta atcaccacac tgaaagtgac 1320
tgatgctgat gcccccaata ccccagcgtg ggaggctgta tacaccatat tgaatgatga 1380
tggtggacaa tttgtcgtca ccacaaatcc agtgaacaac gatggcattt tgaaaacagc 1440
aaaggtttgt atggtacctg gcaagatgca gaaactggca tcctcacagc tgttcatacc 1500
cttgtcccct gggcttggat tttgaggcca agcagcagta cattctacac gtagcagtga 1560
cgaatgtggt accttttgag gtctctctca ccacctccac agccaccgtc accgtggatg 1620
tgctggatgt gaatgaagcc cccatctttg tgcctcctga aaagagagtg gaagtgtccg 1680
aggactttgg cgtggggccag gaaatcacat cctacactgc ccaggagcca gacacattta 1740
tggaacagaa aataacatat cggatttgga gagacactgc caactggctg gagattaatc 1800
cggacactgg tgccatttcc actcgggctg agctggacag ggaggatttt gagcacgtga 1860
agaacagcac gtacacagcc ctaatcatag ctacagacaa tg 1902
<![CDATA[ <210> 214]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 214]]>
Met Glu Gln Lys Ile Tyr Arg Ile Trp Arg Asp Thr Ala Asn Trp Leu
1 5 10 15
Glu Ile Asn Pro Asp Thr Gly Ala Ile Ser Thr Arg Ala Glu Leu Asp
20 25 30
Arg Glu Asp Phe Glu His Val Lys Asn Ser Thr Tyr Thr Ala Leu Ile
35 40 45
Ile Ala Thr Asp Asn
50
<![CDATA[ <210> 215]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 215]]>
atggcgctcc tgctgtgctt cgtgctcctg tgcggagtag tgg 43
<![CDATA[ <210> 216]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 216]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaact 58
<![CDATA[ <210> 217]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 217]]>
atggcgctcc tgctgtgctt cgtgctcctg tgcggagtag tggccttgcc tccccgattg 60
aaagagatga aaagccagga atcggctgca ggttccaaac t 101
<![CDATA[ <210> 218]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 218]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val Ala Leu
1 5 10 15
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser
20 25 30
Lys
<![CDATA[ <210> 219]]>
<![CDATA[ <211> 130]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 219]]>
agtcgggagc gcgcgaggcg cggggagcct gggaccagga gcgagagccg cctacctgca 60
gccgccgccc acggcacggc agccaccatg gcgctcctgc tgtgcttcgt gctcctgtgc 120
ggagtagtgg 130
<![CDATA[ <210> 220]]>
<![CDATA[ <211> 167]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 220]]>
atttcgccag aagtttgagt atcactactc ctgaagagat gattgaaaaa gccaaagggg 60
aaactgccta tctgccatgc aaatttacgc ttagtcccga agaccaggga ccgctggaca 120
tcgagtggct gatatcacca gctgataatc agaaggtgga tcaagtg 167
<![CDATA[ <210> 221]]>
<![CDATA[ <211> 205]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 221]]>
attattttat attctggaga caaaatttat gatgactact atccagatct gaaaggccga 60
gtacatttta cgagtaatga tctcaaatct ggtgatgcat caataaatgt aacgaattta 120
caactgtcag atattggcac atatcagtgc aaagtgaaaa aagctcctgg tgttgcaaat 180
aagaagattc atctggtagt tcttg 205
<![CDATA[ <210> 222]]>
<![CDATA[ <211> 156]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 222]]>
ttaagccttc aggtgcgaga tgttacgttg atggatctga agaaattgga agtgacttta 60
agataaaatg tgaaccaaaa gaaggttcac ttccattaca gtatgagtgg caaaaattgt 120
ctgactcaca gaaaatgccc acttcatggt tagcag 156
<![CDATA[ <210> 223]]>
<![CDATA[ <211> 4683]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 223]]>
ggaagatgtg ccacctccaa agagccgtac gtccactgcc agaagctaca tcggcagtaa 60
tcattcatcc ctggggtcca tgtctccttc caacatggaa ggatattcca agactcagta 120
taaccaagta ccaagtgaag actttgaacg cactcctcag agtccgactc tcccacctgc 180
taaggtagct gcccctaatc taagtcgaat gggtgcgatt cctgtgatga ttccagcaca 240
gagcaaggat gggtctatag tatagagcct ccatatgtct catctgtgct ctccgtgttc 300
ctttcctttt tttgatatat gaaaacctat tctggtctaa attgtgttac tagcctcaaa 360
atacatcaaa aaataagtta atcaggaact gtacggaata tatttttaaa aatttttgtt 420
tggttattc gaaatagtta caggcactaa agttagtaaa gaaaagttta ccatctgaaa 480
aagctggatt ttctttaaga ggttgattat aaagttttct aaatttatca gtacctaagt 540
aagatgtagc gctttgaata tgaaatcata ggtgaagaca tgggtgaact tacttgcata 600
ccaagttgat acttgaataa ccatctgaaa gtggtacttg atcattttta ccattatttt 660
taggatgtgt atttcattta tttatggccc accagtctcc cccaaattag tacagaaata 720
tccatgacaa aattacttac gtatgtttgt acttggtttt acagctcctt tgaaaactct 780
gtgtttggaa tatctctaaa aacatagaaa acactacagt ggtttagaaa ttactaattt 840
tacttctaag tcattcataa accttgtcta tgaaatgact tcttaaatat ttagttgata 900
gactgctaca ggtaataggg acttagcaag ctcttttata tgctaaagga gcatctatca 960
gattaagtta gaacatttgc tgtcagccac atattgagat gacactaggt gcaatagcag 1020
ggatagattt tgttggtgag tagtctcatg ccttgagatc tgtggtggtc ttcaaaatgg 1080
tggccagcca gatcaaggat gtagtatctc atagttccca ggtgatattt ttcttattag 1140
aaaaatatta taactcattt gttgtttgac acttatagat tgaaatttcc taatttattc 1200
taaattttaa gtggttcttt ggttccagtg ctttatgttg ttgttgtttt tggatggtgt 1260
tacatattat atgttctaga aacatgtaat cctaaattta ccctcttgaa tataatccct 1320
ggatgatatt ttttatcata aatgcagaat aatcaaatac attttaagca agttaagtgt 1380
cctccatcaa ttctgtattc cagacttggg aggatgtaca gttgctgttg tgtgatcaaa 1440
catgtctctg tgtagttcca gcaaatcaag ctgagctttg aaaaagtttg tcttagtttt 1500
gtgaaggtga tttattctta aaaaaaaaaa agaaagaaaa agaaaaaaag ataagaagga 1560
ggagtaaagg gactactcct ccttgccaaa tgtgctaaat atcattttag gagaagaaag 1620
tgggtttatt gtatttccct taagattgtg agggagtgtg gatacagtag aatgagccaa 1680
cagtttcttt aataataaata cggtctgcaa taaattattt cactagctct aaaacctttc 1740
cctagatttt agtaggggagt tggtttctgt taatatcttt gggtgctgtg gtggtaaatg 1800
ctatattatg aacggtggca tgtatttaca gttagagagtat tgtgtgtaca ctttttaatg 1860
gtaaacttaa gctgaatgtg taatggattt gtgtatagtt ttacatattt ggaagcattt 1920
taaaaacagg ttttaacctt atgtaaaatt acttttatac tcgtgttaac attttcatct 1980
gtgccttttg gtaatttaat ttctattatg aatttctggt gcctatgagc tagctatcac 2040
ctacctgaaa ggtgcttaga ggtgaaggta ctgtttctaa aaacacatca ctgtgatacc 2100
tttctatcct cacattttca agcttgcctc ttttctgttc tttgtggata taacttaagc 2160
aattgtgtta ttcataaagg tttagaaatt tcaatattcc caacactcta tgtttctgat 2220
tttataacag tagccatttt tgaaagtcag atgtttggcc tgttttatat gaataaagtt 2280
tattataaa atattataaa aaataagtaa ataaacagaa cattaataat aaagttttgg 2340
ttcttcctat tcctgacttt catataatga aaattatcct attgatctaa gtagaagtta 2400
tcatagaaag tggacacgta taagactttc cttccttttt ttttttaata acatatgagg 2460
aacaagactt ctcttcccat atacttcata ttttagagga cattgtttta aaggcttatg 2520
tctcactgta aaattctgtc agccaaatag taccaatacg ttttcaagta gttctcactg 2580
ataatttagt tgaaccagag atcaaatatt tgctcccgaa ttactactgg taatcaagta 2640
gttgaacaaa aaattactaa agcatttccg ttagatcagt caaggacagt actgcatctt 2700
ttttttttttttttttttta agacggagtc tcggtctgtc acccaggctg gagtgcagtg 2760
gcgggatctc ggctcactgc aagctccgcc tcccaggttc acgccattct cctgcctcag 2820
cctcccgagc agctgggact acaggctccc atcaccacgc tcggctaagt ttttgtaatt 2880
ttagtagaga cagggtttca ccgtgttagc caggatggtc tcgatctcct gacctcgtga 2940
tctgcctgcc tcggcctccc aaagtgctgg gattacaggc gtgaaccact gcacccggcc 3000
cagtactgca tcttaacagc aaagccattt tattctactt tataactgag agacttgata 3060
ccatccatct ctttaggtta cagaggataa tttgaagaga aatgttactg tagaatatat 3120
agttctgtac tttttttttt tttttttaag agatagggtt tcactattgc tcaggctggt 3180
ctcaaactgc tgggctcagg agatcctcct gccttggcct cccaaattgc tgggattaca 3240
ggtgtgagcc gcagcatcca gccagttctg tactttgaat atggagtagt ttacagctat 3300
ttttttttct tactggtaat cttaactaat atgattccct tgttagagag cctctcactc 3360
ccccacccccc aaaaatgtct actattcatg acagtaacca attattctgg acaaattgct 3420
tctttttaat ttgagctatc tgccatggac tttctaaaat ggaaacacag cctgagtgta 3480
tcttagggag agtttgattg aaaaaatcca aatcactatc catatagatc atggatataa 3540
agagatacct gatttttatt aaaaagatac tttttcaaat ttaagagtta atcttggaaa 3600
tttggaacaa gtaaaggggc aagtaaacct tttgatgaaa tataaaagga actcattgca 3660
tgaagttgac tatcaaattc tgtgatgtgt ggcttcttaa aaatattctc agtgtctttt 3720
gtgtgcgtgc agcatgtaca tttgatgtta tgtgaatgtt gagttttttc ttctaatttt 3780
cacttcagca gtgtttaggg ctttcagatg ccttattcca gtgtgaacag aaaaagttca 3840
tattttatgt ggttaatgct ttgatgtgtc acataaagag tagtttgtag aaaatgttgg 3900
cacaatttta acttcttagt ggcttgtgac attatatatt atatatatat atgtacatat 3960
atctttataa cattcctgtg tttagtagtg taaatgttct gggcaagttt taatattttg 4020
aatgcctttg gatattccag caataaaggc atcatgttct gcaataggat ttcttactca 4080
tttacctatt ttaacactaa aatagaccac aactgagcac aaattccttt tataaatgtt 4140
atagaagcag ggaagaataa taaacacatt tgtgaattgt ggttcagttt atttatcttt 4200
agggaaggct gatcatttat cttatagcag ataacccccag cctcttattc attatggtta 4260
acttttataa tttatcttat tttataattt aagaatatag tacatatcag ttgggtttgg 4320
ttttggtcat cgagactaaa agctccatca aaacagaact ttgtgttttc tgctaactta 4380
tttaatgaca caagttttaa gagaaccaca attcattgat tcacttattc ttttccctaa 4440
ttgtgaattt tagtgataaa tacacctgta ctactgagga aaatattctg acacttcacg 4500
tgtgcaaagt atagaactga cagtgtcagt ttcagatttt gtatgtacga tttctggctt 4560
atatatccaa tggtgcagat tttgaaattt gtaagaacaa aatttgttaa gaaaaacaac 4620
ttgctctagt tttgtgacct tgtgtacttt tgaaataaaa tcaagaaagc agttctctgc 4680
ctc 4683
<![CDATA[ <210> 224]]>
<![CDATA[ <211> 5341]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 224]]>
agtcgggagc gcgcgaggcg cggggagcct gggaccagga gcgagagccg cctacctgca 60
gccgccgccc acggcacggc agccaccatg gcgctcctgc tgtgcttcgt gctcctgtgc 120
ggagtagtgg atttcgccag aagtttgagt atcactactc ctgaagagat gattgaaaaa 180
gccaaagggg aaactgccta tctgccatgc aaatttacgc ttagtcccga agaccaggga 240
ccgctggaca tcgagtggct gatatcacca gctgataatc agaaggtgga tcaagtgatt 300
attttatatt ctggagacaa aatttatgat gactactatc cagatctgaa aggccgagta 360
cattttacga gtaatgatct caaatctggt gatgcatcaa taaatgtaac gaatttacaa 420
ctgtcagata ttggcacata tcagtgcaaa gtgaaaaaag ctcctggtgt tgcaaataag 480
aagattcatc tggtagttct tgttaagcct tcaggtgcga gatgttacgt tgatggatct 540
gaagaaattg gaagtgactt taagataaaa tgtgaaccaa aagaaggttc acttccatta 600
cagtatgagt ggcaaaaatt gtctgactca cagaaaatgc ccacttcatg gttagcaggg 660
aagatgtgcc acctccaaag agccgtacgt ccactgccag aagctacatc ggcagtaatc 720
attcatccct ggggtccatg tctccttcca acatggaagg atattccaag actcagtata 780
accaagtacc aagtgaagac tttgaacgca ctcctcagag tccgactctc ccacctgcta 840
aggtagctgc ccctaatcta agtcgaatgg gtgcgattcc tgtgatgatt ccagcacaga 900
gcaaggatgg gtctatagta tagagcctcc atatgtctca tctgtgctct ccgtgttcct 960
ttcctttttt tgatatatga aaacctattc tggtctaaat tgtgttacta gcctcaaaat 1020
acatcaaaaa ataagttaat caggaactgt acggaatata tttttaaaaa tttttgtttg 1080
gttatatcga aatagttaca ggcactaaag ttagtaaaga aaagtttacc atctgaaaaa 1140
gctggatttt ctttaagagg ttgattataa agttttctaa atttatcagt acctaagtaa 1200
gatgtagcgc tttgaatatg aaatcatagg tgaagacatg ggtgaactta cttgcatacc 1260
aagttgatac ttgaataacc atctgaaagt ggtacttgat catttttacc attattttta 1320
ggatgtgtat ttcatttatt tatggcccac cagtctcccc caaattagta cagaaatatc 1380
catgacaaaa ttacttacgt atgtttgtac ttggttttac agctcctttg aaaactctgt 1440
gtttggaata tctctaaaaa catagaaaac actacagtgg tttagaaatt actaatttta 1500
cttctaagtc attcataaac cttgtctatg aaatgacttc ttaaatattt agttgataga 1560
ctgctacagg taatagggac ttagcaagct cttttatatg ctaaaaggagc atctatcaga 1620
ttaagttaga aatttgctg tcagccacat attgagatga cactaggtgc aatagcaggg 1680
atagattttg ttggtgagta gtctcatgcc ttgagatctg tggtggtctt caaaatggtg 1740
gccagccaga tcaaggatgt agtatctcat agttcccagg tgatattttt cttattagaa 1800
aaatattata actcatttgt tgtttgacac ttatagattg aaatttccta atttattcta 1860
aattttaagt ggttctttgg ttccagtgct ttatgttgtt gttgtttttg gatggtgtta 1920
catattatat gttctagaaa catgtaatcc taaatttacc ctcttgaata taatccctgg 1980
atgatatttt ttatcataaa tgcagaataa tcaaatacat ttaagcaag ttaagtgtcc 2040
tccatcaatt ctgtattcca gacttgggag gatgtacagt tgctgttgtg tgatcaaaca 2100
tgtctctgtg tagttccagc aaatcaagct gagctttgaa aaagtttgtc ttagttttgt 2160
gaaggtgatt tattcttaaaaaaaaaaaag aaagaaaaag aaaaaaagat aagaaggagg 2220
agtaaaggga ctactcctcc ttgccaaatg tgctaaatat cattttagga gaagaaagtg 2280
ggtttattgt atttccctta agattgtgag ggagtgtgga tacagtagaa tgagccaaca 2340
gtttctttat aataaatacg gtctgcaata aattatttca ctagctctaa aacctttccc 2400
tagattttag tagggagttg gtttctgtta atatctttgg gtgctgtggt ggtaaatgct 2460
atattatgaa cggtggcatg tattatcagt tagagtattg tgtgtacact ttttaatggt 2520
aaacttaagc tgaatgtgta atggatttgt gtatagtttt acatatttgg aagcatttta 2580
aaaacaggtt ttaaccttat gtaaaattac ttttatactc gtgttaacat tttcatctgt 2640
gccttttggt aatttaattt ctattatgaa tttctggtgc ctatgagcta gctatcacct 2700
acctgaaagg tgcttagagg tgaaggtact gtttctaaaa acacatcact gtgatacctt 2760
tctatcctca cattttcaag cttgcctctt ttctgttctt tgtggatata acttaagcaa 2820
ttgtgttatt cataaaggtt tagaaatttc aatattccca acactctatg tttctgattt 2880
tataacagta gccatttttg aaagtcagat gtttggcctg ttttatatga ataaagttta 2940
tttataaaat attataaaaa ataagtaaat aaacagaaca ttaataataa agttttggtt 3000
cttcctattc ctgactttca tataatgaaa attatcctat tgatctaagt agaagttatc 3060
atagaaagtg gacacgtata agactttcct tcctttttttttttaataac atatgaggaa 3120
caagacttct cttcccatat acttcatatt ttagaggaca ttgttttaaa ggcttatgtc 3180
tcactgtaaa attctgtcag ccaaatagta ccaatacgtt ttcaagtagt tctcactgat 3240
aatttagttg aaccagagat caaatatttg ctcccgaatt actactggta atcaagtagt 3300
tgaacaaaaa attackaaag catttccgtt agatcagtca aggacagtac tgcatctttt 3360
tttttttttttttttttaag acggagtctc ggtctgtcac ccaggctgga gtgcagtggc 3420
gggatctcgg ctcactgcaa gctccgcctc ccaggttcac gccattctcc tgcctcagcc 3480
tcccgagcag ctgggactac aggctcccat caccacgctc ggctaagttt ttgtaatttt 3540
agtagagaca gggtttcacc gtgttagcca ggatggtctc gatctcctga cctcgtgatc 3600
tgcctgcctc ggcctcccaa agtgctggga ttacaggcgt gaaccactgc acccggccca 3660
gtactgcatc ttaacagcaa agccatttta ttctacttta taactgagag acttgatacc 3720
atccatctct ttaggttaca gaggataatt tgaagagaaa tgttactgta gaatatatag 3780
ttctgtactt tttttttttt tttttaagag atagggtttc actattgctc aggctggtct 3840
caaactgctg ggctcaggag atcctcctgc cttggcctcc caaattgctg ggattacagg 3900
tgtgagccgc agcatccagc cagttctgta ctttgaatat ggagtagttt acagctattt 3960
ttttttctta ctggtaatct taactaatat gattcccttg ttagagagcc tctcactccc 4020
ccacccccaa aaatgtctac tattcatgac agtaaccaat tattctggac aaattgcttc 4080
tttttaattt gagctatctg ccatggactt tctaaaatgg aaacacagcc tgagtgtatc 4140
ttagggagag tttgattgaa aaaatccaaa tcactatcca tatagatcat ggatataaag 4200
agatacctga tttttattaa aaagatactt tttcaaattt aagagttaat cttggaaatt 4260
tggaacaagt aaaggggcaa gtaaaccttt tgatgaaata taaaaggaac tcattgcatg 4320
aagttgacta tcaaattctg tgatgtgtgg cttcttaaaa atattctcag tgtcttttgt 4380
gtgcgtgcag catgtacatt tgatgttatg tgaatgttga gttttttctt ctaattttca 4440
cttcagcagt gtttagggct ttcagatgcc ttattccagt gtgaacagaa aaagttcata 4500
ttttatgtgg ttaatgcttt gatgtgtcac ataaagagta gtttgtagaa aatgttggca 4560
caattttaac ttcttagtgg cttgtgacat tatatattat atatatatat gtacatatat 4620
ctttataaca ttcctgtgtt tagtagtgta aatgttctgg gcaagtttta atattttgaa 4680
tgcctttgga tattccagca ataaaggcat catgttctgc aataggattt cttactcatt 4740
tacctatttt aacactaaaa tagaccaa ctgagcacaa attcctttta taaatgttat 4800
agaagcaggg aagaataata aacacatttg tgaattgtgg ttcagtttat ttatctttag 4860
ggaaggctga tcatttatct tatagcagat aacccccagcc tcttattcat tatggttaac 4920
ttttataatt tatcttattt tataatttaa gaatatagta catatcagtt gggtttggtt 4980
ttggtcatcg agactaaaag ctccatcaaa acagaacttt gtgttttctg ctaacttatt 5040
taatgacaca agttttaaga gaaccacaat tcattgattc acttattctt ttccctaatt 5100
gtgaatttta gtgataaata cacctgtact actgaggaaa atattctgac acttcacgtg 5160
tgcaaagtat agaactgaca gtgtcagttt cagattttgt atgtacgatt tctggcttat 5220
atatccaatg gtgcagattt tgaaatttgt aagaacaaaa tttgttaaga aaaacaactt 5280
gctctagttt tgtgaccttg tgtacttttg aaataaaatc aagaaagcag ttctctgcct 5340
c 5341
<![CDATA[ <210> 225]]>
<![CDATA[ <211> 14]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 225]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val
1 5 10
<![CDATA[ <210> 226]]>
<![CDATA[ <211> 55]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 226]]>
Phe Ala Arg Ser Leu Ser Ile Thr Thr Pro Glu Glu Met Ile Glu Lys
1 5 10 15
Ala Lys Gly Glu Thr Ala Tyr Leu Pro Cys Lys Phe Thr Leu Ser Pro
20 25 30
Glu Asp Gln Gly Pro Leu Asp Ile Glu Trp Leu Ile Ser Pro Ala Asp
35 40 45
Asn Gln Lys Val Asp Gln Val
50 55
<![CDATA[ <210> 227]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 227]]>
Ile Ile Leu Tyr Ser Gly Asp Lys Ile Tyr Asp Asp Tyr Tyr Pro Asp
1 5 10 15
Leu Lys Gly Arg Val His Phe Thr Ser Asn Asp Leu Lys Ser Gly Asp
20 25 30
Ala Ser Ile Asn Val Thr Asn Leu Gln Leu Ser Asp Ile Gly Thr Tyr
35 40 45
Gln Cys Lys Val Lys Lys Ala Pro Gly Val Ala Asn Lys Lys Ile His
50 55 60
Leu Val Val Leu
65
<![CDATA[ <210> 228]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 228]]>
Lys Pro Ser Gly Ala Arg Cys Tyr Val Asp Gly Ser Glu Glu Ile Gly
1 5 10 15
Ser Asp Phe Lys Ile Lys Cys Glu Pro Lys Glu Gly Ser Leu Pro Leu
20 25 30
Gln Tyr Glu Trp Gln Lys Leu Ser Asp Ser Gln Lys Met Pro Thr Ser
35 40 45
Trp Leu Ala
50
<![CDATA[ <210> 229]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 229]]>
Lys Met Cys His Leu Gln Arg Ala Val Arg Pro Leu Pro Glu Ala Thr
1 5 10 15
Ser Ala Val Ile Ile His Pro Trp Gly Pro Cys Leu Leu Pro Thr Trp
20 25 30
Lys Asp Ile Pro Arg Leu Ser Ile Thr Lys Tyr Gln Val Lys Thr Leu
35 40 45
Asn Ala Leu Leu Arg Val Arg Leu Ser His Leu Leu Arg
50 55 60
<![CDATA[ <210> 230]]>
<![CDATA[ <211> 252]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 230]]>
Met Ala Leu Leu Leu Cys Phe Val Leu Leu Cys Gly Val Val Asp Phe
1 5 10 15
Ala Arg Ser Leu Ser Ile Thr Thr Pro Glu Glu Met Ile Glu Lys Ala
20 25 30
Lys Gly Glu Thr Ala Tyr Leu Pro Cys Lys Phe Thr Leu Ser Pro Glu
35 40 45
Asp Gln Gly Pro Leu Asp Ile Glu Trp Leu Ile Ser Pro Ala Asp Asn
50 55 60
Gln Lys Val Asp Gln Val Ile Ile Leu Tyr Ser Gly Asp Lys Ile Tyr
65 70 75 80
Asp Asp Tyr Tyr Pro Asp Leu Lys Gly Arg Val His Phe Thr Ser Asn
85 90 95
Asp Leu Lys Ser Gly Asp Ala Ser Ile Asn Val Thr Asn Leu Gln Leu
100 105 110
Ser Asp Ile Gly Thr Tyr Gln Cys Lys Val Lys Lys Ala Pro Gly Val
115 120 125
Ala Asn Lys Lys Ile His Leu Val Val Leu Val Lys Pro Ser Gly Ala
130 135 140
Arg Cys Tyr Val Asp Gly Ser Glu Glu Ile Gly Ser Asp Phe Lys Ile
145 150 155 160
Lys Cys Glu Pro Lys Glu Gly Ser Leu Pro Leu Gln Tyr Glu Trp Gln
165 170 175
Lys Leu Ser Asp Ser Gln Lys Met Pro Thr Ser Trp Leu Ala Gly Lys
180 185 190
Met Cys His Leu Gln Arg Ala Val Arg Pro Leu Pro Glu Ala Thr Ser
195 200 205
Ala Val Ile Ile His Pro Trp Gly Pro Cys Leu Leu Pro Thr Trp Lys
210 215 220
Asp Ile Pro Arg Leu Ser Ile Thr Lys Tyr Gln Val Lys Thr Leu Asn
225 230 235 240
Ala Leu Leu Arg Val Arg Leu Ser His Leu Leu Arg
245 250
<![CDATA[ <210> 231]]>
<![CDATA[ <211> 141]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 231]]>
gggagctgtt caaaaaacga gctctttcta ctcctgtagt agaaaaacgt tcagcatctt 60
ctgagtcatc atcatcatcg tcaaagaaga agaaaacaaa ggtagaacat ggaggatcgt 120
caggctctaa acaaaattct g 141
<![CDATA[ <210> 232]]>
<![CDATA[ <211> 127]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 232]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag attcaagtgg ttcaagaatg 120
ggaatga 127
<![CDATA[ <210> 233]]>
<![CDATA[ <211> 268]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 233]]>
gggagctgtt caaaaaacga gctctttcta ctcctgtagt agaaaaacgt tcagcatctt 60
ctgagtcatc atcatcatcg tcaaagaaga agaaaacaaa ggtagaacat ggaggatcgt 120
caggctctaa acaaaattct gccttgcctc cccgattgaa agagatgaaa agccaggaat 180
cggctgcagg ttccaaacta gtccttcggt gtgaaaccag ttctgaatac tcctctctca 240
gattcaagtg gttcaagaat gggaatga 268
<![CDATA[ <210> 234]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 234]]>
Glu Leu Phe Lys Lys Arg Ala Leu Ser Thr Pro Val Val Glu Lys Arg
1 5 10 15
Ser Ala Ser Ser Ser Glu Ser Ser Ser Ser Ser Ser Ser Lys Lys Lys Lys Lys Thr
20 25 30
Lys Val Glu His Gly Gly Ser Ser Ser Gly Ser Lys Gln Asn Ser Ala Leu
35 40 45
Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly Ser
50 55 60
Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg
65 70 75 80
Phe Lys Trp Phe Lys Asn Gly Asn
85
<![CDATA[ <210> 235]]>
<![CDATA[ <211> 264]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 2]]>35
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcag 264
<![CDATA[ <210> 236]]>
<![CDATA[ <211> 170]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 236]]>
cattatggat ccaagcctgt tgagagaaag ggagctgttc aaaaaacgag ctctttctac 60
tcctgtagta gaaaaacgtt cagcatcttc tgagtcatca tcatcatcgt caaagaagaa 120
gaaaacaaag gtagaacatg gaggatcgtc aggctctaaa caaaattctg 170
<![CDATA[ <210> 237]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 237]]>
atcatagcaa tggatcattt aacttgaaag ctttgtcagg aagctctgga tataagtttg 60
gtgttcttgc taagattgtg aattacatga ag 92
<![CDATA[ <210> 238]]>
<![CDATA[ <211> 108]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 238]]>
acacggcatc agcgaggaga tacgcatcct ctaaccttag atgaaatttt ggatgaaaca 60
caacatttag atattggact caagcagaaa caatggctaa tgactgag 108
<![CDATA[ <210> 239]]>
<![CDATA[ <211> 183]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 239]]>
gctttagtca acaatcccaa aattgaagta atagatggga agtatgcttt caagcccaag 60
tacaacgtga gagataagaa ggccctactt aggctcttag atcagcatga ccagcgagga 120
ttaggaggaa ttcttttaga agacatagaa gaagcactgc ccaattccca gaaagctgtc 180
aag 183
<![CDATA[ <210> 240]]>
<![CDATA[ <211> 94]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 240]]>
gctttggggg accagatact atttgtaaat cgtcccgata agaagaaaat acttttcttc 60
aatgataaga gctgtcagtt ttctgtggat gaag 94
<![CDATA[ <210> 241]]>
<![CDATA[ <211> 116]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 241]]>
aatttcagaa actgtgggagg agtgtcactg tagattccat ggacgaggag aaaattgaag 60
aatatctgaa gcgacagggt atttcttcca tgcaggaatc tggaccaaag aaagtg 116
<![CDATA[ <210> 242]]>
<![CDATA[ <211> 720]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 242]]>
gcccctattc agagaaggaa aaagcctgct tcacagaaaa agcgacgctt taagactcat 60
aacgaacact tggctggagt gctgaaggat tactctgaca ttacttccag caaataggga 120
acagttttgc cctggaacag agttacagat acacaatcaa gagtgttctt gctgatgctc 180
ggggtctgaa gactgtcttc ctatctgctt cttgcggctg aggagaggag cagttcagtt 240
tacaaaacaa gtgcaaatta ccaaactcaa agcttatttg agtagaatgg gctcatgggc 300
aatgtgatgt tccctgttaa ccttctgtta ctccctggga gaaaggcgct gagcgtggca 360
tgcaggtgtc tttgctgtgtttttctccac ttctaaatgg ttcctggttc ctttcttcct 420
cgtttgttac tttagagcaa gtttgcccat agtcttgaat gcaatatttg tttattccaa 480
aagaacatat ttataataaa atcactgtag aaggattttt aagatgttag tgaattctgt 540
ttcttttcat tctcggaaat ggcaggaagc agctccagtc tctgatttcc atgggtcacg 600
tgctggggat gtgatgaagc ctgcagtctg cactgtgttg ctgagcacat ggatttcacc 660
actggaacac aggtgtgctg cttgttagca agcagagcaa taaagatgtg ctggatgtca 720
<![CDATA[ <210> 243]]>
<![CDATA[ <211> 1747]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 243]]>
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcagcatttat ggatccaagc ctgttgagag aaagggagct 300
gttcaaaaaa cgagctcttt ctactcctgt agtagaaaaa cgttcagcat cttctgagtc 360
atcatcatca tcgtcaaaga agaagaaaac aaaggtagaa catggaggat cgtcaggctc 420
taaacaaaat tctgatcata gcaatggatc atttaacttg aaagctttgt caggaagctc 480
tggatataag tttggtgttc ttgctaagat tgtgaattac atgaagacac ggcatcagcg 540
aggagatacg catcctctaa ccttagatga aattttggat gaaacacaac atttagatat 600
tggactcaag cagaaacaat ggctaatgac tgaggcttta gtcaacaatc ccaaaattga 660
agtaatagat gggaagtatg ctttcaagcc caagtacaac gtgagagata agaaggccct 720
acttaggctc ttagatcagc atgaccagcg aggattagga ggaattcttt tagaagacat 780
agaagaagca ctgcccaatt cccagaaagc tgtcaaggct ttgggggacc agatactatt 840
tgtaaatcgt cccgataaga agaaaatact tttcttcaat gataagagct gtcagttttc 900
tgtggatgaa gaatttcaga aactgtggag gagtgtcact gtagattcca tggacgagga 960
gaaaattgaa gaatatctga agcgacaggg tatttcttcc atgcaggaat ctggaccaaa 1020
gaaagtggcc cctattcaga gaaggaaaaa gcctgcttca cagaaaaagc gacgctttaa 1080
gactcataac gaacacttgg ctggagtgct gaaggattac tctgacatta cttccagcaa 1140
atagggaaca gttttgccct ggaacagagt tacagataca caatcaagag tgttcttgct 1200
gatgctcggg gtctgaagac tgtcttccta tctgcttctt gcggctgagg agaggagcag 1260
ttcagtttac aaaacaagtg caaattacca aactcaaagc ttatttgagt agaatgggct 1320
catgggcaat gtgatgttcc ctgttaacct tctgttactc cctgggagaa aggcgctgag 1380
cgtggcatgc aggtgtcttt gctgtgtttt tctccacttc taaatggttc ctggttcctt 1440
tcttcctcgt ttgttacttt agagcaagtt tgcccatagt cttgaatgca atatttgttt 1500
attccaaaag aacatattta taataaaatc actgtagaag gatttttaag atgttagtga 1560
attctgtttc ttttcattct cggaaatggc aggaagcagc tccagtctct gatttccatg 1620
ggtcacgtgc tggggatgtg atgaagcctg cagtctgcac tgtgttgctg agcacatgga 1680
tttcaccact ggaacacagg tgtgctgctt gttagcaagc agagcaataa agatgtgctg 1740
gatgtca 1747
<![CDATA[ <210> 244]]>
<![CDATA[ <211> 55]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 244]]>
Met Asp Pro Ser Leu Leu Arg Glu Arg Glu Leu Phe Lys Lys Arg Ala
1 5 10 15
Leu Ser Thr Pro Val Val Glu Lys Arg Ser Ala Ser Ser Glu Ser Ser
20 25 30
Ser Ser Ser Ser Lys Lys Lys Lys Thr Lys Val Glu His Gly Gly Ser
35 40 45
Ser Gly Ser Lys Gln Asn Ser
50 55
<![CDATA[ <210> 245]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 245]]>
His Ser Asn Gly Ser Phe Asn Leu Lys Ala Leu Ser Gly Ser Ser Gly
1 5 10 15
Tyr Lys Phe Gly Val Leu Ala Lys Ile Val Asn Tyr Met Lys
20 25 30
<![CDATA[ <210> 246]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 246]]>
Thr Arg His Gln Arg Gly Asp Thr His Pro Leu Thr Leu Asp Glu Ile
1 5 10 15
Leu Asp Glu Thr Gln His Leu Asp Ile Gly Leu Lys Gln Lys Gln Trp
20 25 30
Leu Met Thr Glu
35
<![CDATA[ <210> 247]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 247]]>
Ala Leu Val Asn Asn Pro Lys Ile Glu Val Ile Asp Gly Lys Tyr Ala
1 5 10 15
Phe Lys Pro Lys Tyr Asn Val Arg Asp Lys Lys Ala Leu Leu Arg Leu
20 25 30
Leu Asp Gln His Asp Gln Arg Gly Leu Gly Gly Ile Leu Leu Glu Asp
35 40 45
Ile Glu Glu Ala Leu Pro Asn Ser Gln Lys Ala Val Lys
50 55 60
<![CDATA[ <210> 248]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 248]]>
Ala Leu Gly Asp Gln Ile Leu Phe Val Asn Arg Pro Asp Lys Lys Lys Lys
1 5 10 15
Ile Leu Phe Phe Asn Asp Lys Ser Cys Gln Phe Ser Val Asp Glu
20 25 30
<![CDATA[ <210> 249]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 249]]>
Phe Gln Lys Leu Trp Arg Ser Val Thr Val Asp Ser Met Asp Glu Glu
1 5 10 15
Lys Ile Glu Glu Tyr Leu Lys Arg Gln Gly Ile Ser Ser Met Gln Glu
20 25 30
Ser Gly Pro Lys Lys Val
35
<![CDATA[ <210> 250]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 250]]>
Ala Pro Ile Gln Arg Arg Lys Lys Pro Ala Ser Gln Lys Lys Arg Arg
1 5 10 15
Phe Lys Thr His Asn Glu His Leu Ala Gly Val Leu Lys Asp Tyr Ser
20 25 30
Asp Ile Thr Ser Ser Lys
35
<![CDATA[ <210> 251]]>
<![CDATA[ <211> 291]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 251]]>
Met Asp Pro Ser Leu Leu Arg Glu Arg Glu Leu Phe Lys Lys Arg Ala
1 5 10 15
Leu Ser Thr Pro Val Val Glu Lys Arg Ser Ala Ser Ser Glu Ser Ser
20 25 30
Ser Ser Ser Ser Lys Lys Lys Lys Thr Lys Val Glu His Gly Gly Ser
35 40 45
Ser Gly Ser Lys Gln Asn Ser Asp His Ser Asn Gly Ser Phe Asn Leu
50 55 60
Lys Ala Leu Ser Gly Ser Ser Ser Gly Tyr Lys Phe Gly Val Leu Ala Lys
65 70 75 80
Ile Val Asn Tyr Met Lys Thr Arg His Gln Arg Gly Asp Thr His Pro
85 90 95
Leu Thr Leu Asp Glu Ile Leu Asp Glu Thr Gln His Leu Asp Ile Gly
100 105 110
Leu Lys Gln Lys Gln Trp Leu Met Thr Glu Ala Leu Val Asn Asn Pro
115 120 125
Lys Ile Glu Val Ile Asp Gly Lys Tyr Ala Phe Lys Pro Lys Tyr Asn
130 135 140
Val Arg Asp Lys Lys Ala Leu Leu Arg Leu Leu Asp Gln His Asp Gln
145 150 155 160
Arg Gly Leu Gly Gly Ile Leu Leu Glu Asp Ile Glu Glu Ala Leu Pro
165 170 175
Asn Ser Gln Lys Ala Val Lys Ala Leu Gly Asp Gln Ile Leu Phe Val
180 185 190
Asn Arg Pro Asp Lys Lys Lys Lys Ile Leu Phe Phe Asn Asp Lys Ser Cys
195 200 205
Gln Phe Ser Val Asp Glu Glu Phe Gln Lys Leu Trp Arg Ser Val Thr
210 215 220
Val Asp Ser Met Asp Glu Glu Lys Ile Glu Glu Tyr Leu Lys Arg Gln
225 230 235 240
Gly Ile Ser Ser Met Gln Glu Ser Gly Pro Lys Lys Val Ala Pro Ile
245 250 255
Gln Arg Arg Lys Lys Pro Ala Ser Gln Lys Lys Arg Arg Phe Lys Thr
260 265 270
His Asn Glu His Leu Ala Gly Val Leu Lys Asp Tyr Ser Asp Ile Thr
275 280 285
Ser Ser Lys
290
<![CDATA[ <210> 252]]>
<![CDATA[ <211> 434]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 252]]>
actgagctcc tagcacccga tcgggaagtg gcgggcggag tcccgggtcc agtcgccgcc 60
tcagctaccg ccgctgccgc cgccgccgcc gccaccgcca gtggtgagac cccgacctgg 120
cgggtcagcg ctgggcgtgc gtgcgggcag gcgggggcgc tgacgagaag caggaagagg 180
gtgcagtgcc ggcgtgggcg gccggccgag gcggaggcgc aggaaggggg cggcgagtcg 240
tgcgaggctg cccttctcac tcagcatttat ggatccaagc ctgttgagag aaagggagct 300
gttcaaaaaa cgagctcttt ctactcctgt agtagaaaaa cgttcagcat cttctgagtc 360
atcatcatca tcgtcaaaga agaagaaaac aaaggtagaa catggaggat cgtcaggctc 420
taaacaaaat tctg 434
<![CDATA[ <210> 253]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 253]]>
atcctaaaca gcacatccaa tcacctgtgg ctagagttca acaccaatgg atctgacacc 60
gaccaaggtt ttcaactcac ctatacca 88
<![CDATA[ <210> 254]]>
<![CDATA[ <211> 62]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 254]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ct 62
<![CDATA[ <210> 255]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 255]]>
atcctaaaca gcacatccaa tcacctgtgg ctagagttca acaccaatgg atctgacacc 60
gaccaaggtt ttcaactcac ctataccact acatctacat ccaccactgg gacaagccat 120
cttgtaaaat gtgcggagaa ggagaaaact 150
<![CDATA[ <210> 256]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 256]]>
Ile Leu Asn Ser Thr Ser Asn His Leu Trp Leu Glu Phe Asn Thr Asn
1 5 10 15
Gly Ser Asp Thr Asp Gln Gly Phe Gln Leu Thr Tyr Thr Thr Thr Thr Ser
20 25 30
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu
35 40 45
Lys Thr
50
<![CDATA[ <210> 257]]>
<![CDATA[ <211> 583]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 257]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agg 583
<![CDATA[ <210> 258]]>
<![CDATA[ <211> 217]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 258]]>
gtcagaactg tggaggctta gtccagggtc ccaatggcac tattgagagc ccagggtttc 60
ctcacgggta tccgaactat gccaactgca cctggatcat catcacgggc gagcgcaata 120
ggatacagtt gtccttccat acctttgctc ttgaagaaga ttttgatatt ttatcagttt 180
acgatggaca gcctcaacaa gggaatttaa aagtgag 217
<![CDATA[ <210> 259]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 259]]>
attatcggga tttcagctgc cctcctctat agtgagtaca ggatctatcc tcactctgtg 60
gttcacgaca gacttcgctg tgagtgccca aggtttcaaa gcatttatg aag 113
<![CDATA[ <210> 260]]>
<![CDATA[ <211> 195]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 260]]>
ttttacctag ccacacttgt ggaaatcctg gagaaatcct gaaaggagtt ctgcatggaa 60
cgagattcaa cataggagac aaaatccggt acagctgcct ccctggctac atcttggaag 120
gccacgccat cctgacctgc atcgtcagcc caggaaatgg tgcatcgtgg gacttcccag 180
ctcccttttg cagag 195
<![CDATA[ <210> 261]]>
<![CDATA[ <211> 208]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 261]]>
ctgagggagc ctgcggagga accttacgcg ggaccagcag ctccatctcc agcccgcact 60
tcccttcaga gtacgagaac aacgcggact gcacctggac cattctggct gagcccgggg 120
accacattgc gctggtcttc actgactttc agctagaaga aggatatgat ttcttagaga 180
tcagtggcac ggaagctcca tccatatg 208
<![CDATA[ <210> 262]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 262]]>
gctaactggc atgaacctcc cctctccagt tatcagtagc aagaattggc tacgactcca 60
tttcacctct gacagcaacc accgacgcaa aggatttaac gctcagttcc aag 113
<![CDATA[ <210> 263]]>
<![CDATA[ <211> 78]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 263]]>
tgaaaaaggc gattgagttg aagtcaagag gagtcaagat gctgcccagc aaggatggaa 60
gccataaaaa ctctgtct 78
<![CDATA[ <210> 264]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 264]]>
tgagccaagg aggtgttgca ttggtctctg acatgtgtcc agatcctggg attccagaaa 60
atggtagaag agcaggttcc gacttcag 88
<![CDATA[ <210> 265]]>
<![CDATA[ <211> 125]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 265]]>
ggttggtgca aatgtacagt tttcatgtga ggacaattac gtgctccagg gatctaaaag 60
catcacctgt cagagagtta cagagacgct cgctgcttgg agtgaccaca ggcccatctg 120
ccgag 125
<![CDATA[ <210> 266]]>
<![CDATA[ <211> 122]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 266]]>
cgagaacatg tggatccaat ctgcgtgggc ccagcggcgt cattacctcc cctaattatc 60
cggttcagta tgaagataat gcacactgtg tgtgggtcat caccaccacc gacccggaca 120
ag 122
<![CDATA[ <210> 267]]>
<![CDATA[ <211> 104]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 267]]>
gtcatcaagc ttgcctttga agagtttgag ctggagcgag gctatgacac cctgacggtt 60
ggtgatgctg ggaaggtggg agacaccaga tcggtcttgt acgt 104
<![CDATA[ <210> 268]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 268]]>
gctcacggga tccagtgttc ctgacctcat tgtgagcatg agcaaccaga tgtggctaca 60
tctgcagtcg gatgatagca ttggctcacc tgggtttaaa gctgtttacc aag 113
<![CDATA[ <210> 269]]>
<![CDATA[ <211> 183]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 269]]>
aaattgaaaa gggagggtgt ggggatcctg gaatccccgc ctatgggaag cggacgggca 60
gcagtttcct ccatggagat acactcacct ttgaatgccc ggcggccttt gagctggtgg 120
gggagagagt tatcacctgt cagcagaaca atcagtggtc tggcaacaag cccagctgtg 180
tat 183
<![CDATA[ <210> 270]]>
<![CDATA[ <211> 327]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 270]]>
tttcatgttt cttcaacttt acggcatcat ctgggattat tctgtcacca aattatccag 60
aggaatatgg gaacaacatg aactgtgtct ggttgattat ctcggagcca ggaagtcgaa 120
ttcacctaat ctttaatgat tttgatgttg agcctcagtt tgactttctc gcggtcaagg 180
atgatggcat ttctgacata actgtcctgg gtactttttc tggcaatgaa gtgccttccc 240
agctggccag cagtggggcat atagttcgct tggaatttca gtctgaccat tccactactg 300
gcagagggtt caacatcact tacacca 327
<![CDATA[ <210> 271]]>
<![CDATA[ <211> 195]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 271]]>
catttggtca gaatgagtgc catgatcctg gcattcctat aaacggacga cgttttggtg 60
acaggtttct actcgggagc tcggtttctt tccactgtga tgatggcttt gtcaagaccc 120
agggatccga gtccattacc tgcatactgc aagacgggaa cgtggtctgg agctccaccg 180
tgccccgctg tgaag 195
<![CDATA[ <210> 272]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 272]]>
ctccatgtgg tggacatctg acagcgtcca gcggagtcat tttgcctcct ggatggccag 60
gatattataa ggattcttta cattgtgaat ggataattga agcaaaacca ggccactcta 120
tcaaaataac ttttgacag 139
<![CDATA[ <210> 273]]>
<![CDATA[ <211> 188]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 273]]>
atttcagaca gaggtcaatt atgacacctt ggaggtcaga gatgggccag ccagttcgtc 60
cccactgatc ggcgagtacc acggcaccca ggcaccccag ttcctcatca gcaccgggaa 120
cttcatgtac ctgctgttca ccactgacaa cagccgctcc agcatcggct tcctcatcca 180
ctatgaga 188
<![CDATA[ <210> 274]]>
<![CDATA[ <211]]>> 189]]>
<br/> <![CDATA[ <212>DNA]]>
<br/> <![CDATA[ <213>Human]]>
<br/>
<br/> <![CDATA[ <400>274]]>
<br/> <![CDATA[gtgtgacgct tgagtcggat tcctgcctgg acccgggcat ccctgtgaac ggccatcgcc 60
acggtggaga ctttggcatc aggtccacag tgactttcag ctgtgacccg gggtacacac 120
taagtgacga cgagcccctc gtctgtgaga ggaaccacca gtggaaccac gccttgccca 180
gctgcgacg 189
<![CDATA[ <210> 275]]>
<![CDATA[ <211> 117]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 275]]>
ctctatgtgg aggctacatc caagggaaga gtggaacagt cctttctcct gggtttccag 60
atttttatcc aaactctcta aactgcacgt ggaccattga agtgtctcat gggaaag 117
<![CDATA[ <210> 276]]>
<![CDATA[ <211> 216]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 276]]>
gagttcaaat gatctttcac acctttcatc ttgagagttc ccacgactat ttactgatca 60
cagaggatgg aagtttttcc gagcccgttg ccaggctcac cgggtcggtg ttgcctcata 120
cgatcaaggc aggcctgttt ggaaacttca ctgcccagct tcggtttata tcagacttct 180
caatttcgta cgagggcttc aatatcacat tttcag 216
<![CDATA[ <210> 277]]>
<![CDATA[ <211> 189]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 277]]>
aatatgacct ggagccatgt gatgatcctg gagtccctgc cttcagccga agaattggtt 60
ttcactttgg tgtgggagac tctctgacgt tttcctgctt cctgggatat cgtttagaag 120
gtgccaccaa gcttacctgc ctgggtgggg gccgccgtgtgtggagtgca cctctgccaa 180
ggtgtgtgg 189
<![CDATA[ <210> 278]]>
<![CDATA[ <211> 170]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 278]]>
ccgaatgtgg agcaagtgtc aaaggaaatg aaggaacatt actgtctcca aattttccat 60
ccaattatga taataaccat gagtgtatct ataaaataga aacagaagcc ggcaagggca 120
tccaccttag aacacgaagc ttccagctgt ttgaaggaga tactctaaag 170
<![CDATA[ <210> 279]]>
<![CDATA[ <211> 157]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 279]]>
gtatatgatg gaaaagacag ttcctcacgt ccactgggca cgttcactaa aaatgaactt 60
ctggggctga tcctaaacag cacatccaat cacctgtggc tagagttcaa caccaatgga 120
tctgacaccg accaaggttttcaactcacc tatacca 157
<![CDATA[ <210> 280]]>
<![CDATA[ <211> 192]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 280]]>
gttttgatct ggtaaaatgt gaggatccgg gcatccctaa ctacggctat aggatccgtg 60
atgaaggcca ctttaccgac actgtagttc tgtacagttg caacccgggg tacgccatgc 120
atggcagcaa caccctgacc tgtttgagtg gagacaggag agtgtgggac aaaccactac 180
cttcgtgcat ag 192
<![CDATA[ <210> 281]]>
<![CDATA[ <211> 10288]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 281]]>
gttttgatct ggtaaaatgt gaggatccgg gcatccctaa ctacggctat aggatccgtg 60
atgaaggcca ctttaccgac actgtagttc tgtacagttg caacccgggg tacgccatgc 120
atggcagcaa caccctgacc tgtttgagtg gagacaggag agtgtgggac aaaccactac 180
cttcgtgcat agcggaatgt ggtggtcaga tccatgcagc cacatcagga cgaatattgt 240
cccctggcta tccagctccg tatgacaaca acctccactg cacctggatt atagaggcag 300
acccaggaaa gacccattagc ctccatttca ttgttttcga cacggagatg gctcacgaca 360
tcctcaaggt ctgggacggg ccggtggaca gtgacatcct gctgaaggag tggagtggct 420
ccgcccttcc ggaggacatc cacagcacct tcaactcact caccctgcag ttcgacagcg 480
acttcttcat cagcaagtct ggcttctcca tccagttctc cacctcaatt gcagccacct 540
gtaacgatcc aggtatgccc caaaatggca cccgctatgg agacagcaga gaggctggag 600
acaccgtcac attccagtgt gaccctggct atcagctcca aggacaagcc aaaatcacct 660
gtgtgcagct gaataaccgg ttcttttggc aaccagaccc tcctacatgc atagctgctt 720
gtggagggaa tctgacgggc ccagcaggtg ttattttgtc acccaactac ccacagccgt 780
atcctcctgg gaaggaatgt gactggagag taaaagtgaa cccggacttt gtcatcgcct 840
tgatattcaa aagtttcaac atggagccca gctatgactt cttacacatc tatgaagggg 900
aagattccaa cagccccctc attgggagtt accagggctc tcaggcccca gaaagaatag 960
agagtagcgg aaacagcctg tttctggcat ttcggagtga tgcctccgtg ggcctttcag 1020
ggttcgccat tgaatttaaa gagaaaccac gggaagcttg ttttgaccca ggaaatataa 1080
tgaatgggac aagagttgga acagacttca agcttggctc caccatcacc taccagtgtg 1140
actctggcta taagattctt gacccctcat ccatcacctg tgtgattggg gctgatggga 1200
aaccctcctg ggaccaagtg ctgccctcct gcaatgctcc ctgtggaggc cagtacacgg 1260
gatcagaagg ggtagtttta tcaccaaact acccccataa ttacacagct ggtcaaatat 1320
gcctctattc catcacggta ccaaaggaat tcgtggtctt tggacagttt gcctatttcc 1380
agacagccct gaatgatttg gcagaattat ttgatggaac ccatgcacag gccagacttc 1440
tcagctcact ctcggggtct cactcagggg aaacattgcc cttggctacg tcaaatcaaa 1500
ttctgctccg attcagtgca aagagcggtg cctctgcccg cggcttccac ttcgtgtatc 1560
aagctgttcc tcgtaccagt gacacccaat gcagctctgt ccccgagccc agatacggaa 1620
ggagaattgg ttctgagttt tctgccggct ccatcgtccg attcgagtgc aacccgggat 1680
acctgcttca gggttccacg gcgctccact gccagtccgt gcccaacgcc ttggcacagt 1740
ggaacgacac gatccccagc tgtgtggtac cctgcagtgg caatttcact caacgaagag 1800
gtacaatcct gtcccccggc taccctgagc catacggaaa caacttgaac tgtatatgga 1860
agatcatagt tacggagggc tcgggaattc agatccaagt gatcagtttt gccacggagc 1920
agaactggga ctcccttgag atccacgatg gtggggatgt gaccgcaccc agactgggaa 1980
gcttctcagg caccacagta ccggcactgc tgaacagtac ttccaaccaa ctctacctgc 2040
atttccagtc tgacattagt gtggcagctg ctggtttcca cctggaatac aaaactgtag 2100
gtcttgctgc atgccaagaa ccagccctcc ccagcaacag catcaaaatc ggagatcggt 2160
acatggtgaa cgacgtgctc tccttccagt gcgagcccgg gtacaccctg cagggccgtt 2220
cccacatttc ctgtatgcca gggaccgttc gccgttggaa ctatccgtct cccctgtgca 2280
ttgcaacctg tggagggacg ctgagcacct tgggtggtgt gatcctgagc cccggcttcc 2340
caggttctta ccccaacaac ttagactgca cctggaggat ctcattaccc atcggctatg 2400
gtgcacatat tcagtttctg aatttttcta ccgaagctaa tcatgacttc cttgaaattc 2460
aaaatggacc ttaccacacc agccccatga ttggacaatt tagcggcacg gatctccccg 2520
cggccctgct gagcacaacg catgaaaccc tcatccactt ttatagtgac cattcgcaaa 2580
accggcaagg atttaaactt gcttaccaag cctatgaatt acagaactgt ccagatccac 2640
ccccatttca gaatgggtac atgatcaact cggattacag cgtggggcaa tcagtatctt 2700
tcgagtgtta tcctgggtac attctaatag gccatcctgt cctcacttgt cagcatggga 2760
tcaacagaaa ctggaactac ccttttccaa gatgtgatgc cccttgtggg tacaacgtaa 2820
cttctcagaa cggcaccatc tactcccctg gctttcctga tgagtatccg atcctgaagg 2880
actgcatttg gctcatcacg gtgcctccag ggcacggagt ttacatcaac ttcaccctgt 2940
tacagacgga agctgtcaac gattacattg ctgtttggga cggtcccgat cagaactcac 3000
cccagctggg agttttcagt ggcaacacag ccctcgaaac ggcgtatagc tccaccaacc 3060
aagtcctgct caagttccac agcgactttt caaatggagg cttctttgtc ctcaatttcc 3120
acgcatttca gctcaagaaa tgtcaacctc ccccagcggt tccacaggca gaaatgctta 3180
ctgaggatga tgatttcgaa ataggagatt ttgtgaagta ccagtgccac cccgggtaca 3240
ccttggtggg gaccgacatt ctgacttgca agctcagttc ccagttgcag tttgagggtt 3300
ctctcccaac atgtgaagca caatgcccag caaatgaagt ccggactgga tcatcggggag 3360
tcattctcag tccagggtat ccgggtaatt attttaactc ccagacttgc tcttggagta 3420
ttaaagtgga accaaactac aacattacca tctttgtgga cacatttcaa agtgaaaagc 3480
agtttgatgc actggaagtg tttgatggtt cttctgggca aagtcctctg ctagtagtct 3540
taagtgggaa tcatactgaa caatcaaatt ttacaagcag gagtaatcag ttatatctcc 3600
gctggtccac tgaccatgcc accagtaaga aaggattcaa gattcgctat gcagcacctt 3660
actgcagttt gacccacccc ctgaagaatg ggggtattct aaacaggact gcaggagcgg 3720
ttggaagcaa agtgcattat ttttgcaagc ctggataccg aatggtcggc cacagcaatg 3780
caacctgtag acgaaaccca cttggcatgt accagtggga ctccctcacg ccactctgcc 3840
aggctgtgtc ctgtggaatc ccagaatccc caggaaacgg ttcatttacc gggaacgagt 3900
tcactttgga cagtaaagtg gtctatgaat gtcatgaggg cttcaagctt gaatccagcc 3960
agcaagcaac agccgtgtgt caagaagatg ggttgtggag taacaagggg aagccgccca 4020
cgtgtaagcc ggtcgcttgc cccagcattg aagctcagct ctcagaacat gtcatctgga 4080
ggctggtttc aggatccttg aatgagtacg gtgctcaagt attgctgagc tgcagtcctg 4140
gttactactt agaaggctgg aggctcctgc ggtgccaggc caatgggacg tggaacatag 4200
gagatgagag gccaagctgt cgagttatct cgtgtggaag cctttccttt cccccaaatg 4260
gcaacaagat tggaacgttg acagtttatg gggccacagc tatatttacg tgcaacaccg 4320
gctacacgct tgtggggtct catgtcagag agtgcttggc aaatgggctc tggagcggca 4380
gcgaaactcg atgtctggct ggccactgcg gttccccaga cccgattgtg aacggtcaca 4440
ttagtggaga tggcttcagt tacagagaca cggtggttta ccagtgcaat cctggtttcc 4500
ggcttgtggg aacttccgtg aggatatgcc tgcaagacca caagtggtct ggacaaacgc 4560
ctgtctgtgt ccccatcaca tgtggtcacc ctggaaaccc tgcccacgga ttcactaatg 4620
gcagtgagtt caacctgaat gatgtcgtga atttcacctg caacacgggc tatttgctgc 4680
agggcgtgtc tcgagcccag tgtcggagca acggccagtg gagtagccct ctgcccacgt 4740
gtcgagtggt gaactgttct gatccaggct ttgtggaaaa tgccattcgt cacgggcaac 4800
agaacttccc tgagagtttt gagtatggaa tgagtatcct gtaccattgc aagaagggat 4860
tttacttgct gggatcttca gccttgacct gtatggcaaa tggcttatgg gaccgatccc 4920
tgcccaagtg tttggctata tcgtgtggac accccagggggt ccctgccaac gccgtcctca 4980
ctggagagct gtttacctat ggcgccgtcg tgcactactc ctgcagaggg agcgagagcc 5040
tcataggcaa cgacacgaga gtgtgccagg aagacagtca ctggagcggg gcactgcccc 5100
actgcacagg aaataatcct ggattctgtg gtgatccggg gaccccagca catgggtctc 5160
ggcttggtga tgactttaag acaaagagtc ttctccgctt ctcctgtgaa atggggcacc 5220
agctgagggg ctcccctgaa cgcacgtgtt tgctcaatgg gtcatggtca ggactgcagc 5280
cggtgtgtga ggccgtgtcc tgtggcaacc ctggcacacc caccaacgga atgattgtca 5340
gtagtgatgg cattctgttc tccagctcgg tcatctatgc ctgctgggaa ggctacaaga 5400
cctcagggct catgacacgg cattgcacag ccaatgggac ctggacaggc actgctcccg 5460
actgcacaat tataagttgt ggggatccag gcaacactagc aaatggcatc cagtttggga 5520
ccgacttcac cttcaacaag actgtgagct atcagtgtaa cccaggctat gtcatggaag 5580
cagtcacatc cgccactatt cgctgtacca aagacggcag gtggaatccg agcaaacctg 5640
tctgcaaagc cgtgctgtgt cctcagccgc cgccggtgca gaatggaaca gtggagggaa 5700
gtgatttccg ctggggctcc agcataagtt acagctgcat ggacggttac cagctctctc 5760
actccgccat cctctcctgt gaaggtcgcg gggtgtggaa aggagagatc ccccagtgtc 5820
tccctgtgtt ctgcggagac cctggcatcc ccgcagaagg gcgacttagt gggaaaagtt 5880
tcacctataa gtccgaagtc ttcttccagt gcaaatctcc atttatactc gtgggatcct 5940
ccagaagagt ctgccaagct gacggcacgt ggagcggcat acaacccacc tgcattgatc 6000
ctgctcataa cacctgccca gaccctggta cgccaacactt tggaatacag aatagctcca 6060
gaggctatga ggttggaagc acggtttttt tcaggtgcag aaaaggctac catattcaag 6120
gttccacgac tcgcacctgc cttgccaatt taacatggag tgggatacag accgaatgta 6180
tacctcatgc ctgcagacag ccagaaaccc cggcacacgc ggatgtgaga gccatcgatc 6240
ttcctacttt cggctacacc ttagtgtaca cctgccatcc aggctttttc ctcgcagggg 6300
gatctgagca cagaacatgt aaagcagaca tgaaatggac aggaaagtcg cctgtgtgta 6360
aaagtaaagg agtgagagaa gttaatgaaa cagttactaa aactccagtt ccttcagatg 6420
tctttttcgt caattcactg tggaaggggt attatgaata ttagggaaa agacaacccg 6480
ccactctaac tgttgactgg ttcaatgcaa caagcagtaa ggtgaatgcc accttcagcg 6540
aagcctcgcc agtggagctg aagttgacag gcatttacaa gaaggaggag gcccacttac 6600
tcctgaaagc ttttcaaatt aaaggccagg cagatatttt tgtaagcaag ttcgaaaatg 6660
acaactgggg actagatggt tatgtgtcat ctggacttga aagaggagga tttacttttc 6720
aaggtgacat tcatggaaaa gactttggaa aatttaagct agaaaggcaa gatcctttaa 6780
accccagatca agactcttcc agtcattacc acggcaccag cagtggctct gtggcggctg 6840
ccattctggt tcctttcttt gctctaattt tatcagggtt tgcattttac ctctacaaac 6900
acagaacgag accaaaagtt caatacaatg gctatgctgg gcatgaaaac agcaatggac 6960
aagcatcgtt tgaaaaccccc atgtatgata caaacttaaa accccacagaa gccaaggctg 7020
tgaggtttga cacaactctg aacacagtct gtacagtggt atagccctca gtgccccaac 7080
aggactgatt catagccata cctctgatgg acaagcagtg attcctttgg tgccatatac 7140
cactctccct tccactctgg ctttactgca gcgatcttca accttgtcta ctggcataag 7200
tgcagcgggg atctctactc aaatgtgtca gggtcttcta cggatcaaac tacacatgcg 7260
ttttcattcc aaaagtgggt tctaaatgcc tggctgcatc tgtatgaaat caaggcacac 7320
tccaggaaga ctgccacgtc gcgccaacac gtcatactca atgcctcaga ctttcatatt 7380
tctgtgttgc tgagatgcct ttcaatgcaa tcgtctgggc tcgtggatat gtccctcagg 7440
tgcggtgaca gaatggtggc accacgatat gtgttctctt gtgttgtttt tcctttttaa 7500
acccccatga acacgaatac tctgaaaaaa ataaaaagct ttctggaaga agaacaccttt 7560
ctgatagagg ctcacaccta caaatgcttc actctgtcct tccgagacct gacaagcttt 7620
gaggacctca cagctcccct gtgtgttcat ctctagggat gtttgcaatt tcccagtcag 7680
ctgttctgtc gcagaatgtt taatgcacaa ttttttgcac tagtgtgtta tgaatgacta 7740
agattctgat aaaaaaaata aattatttac acagggttta tacacactat ccattgtata 7800
taagcattat ttcatattat caagctaaac attcccccat cagcttagtt gaggtgttag 7860
ggaaaagtat tcctagatat ggcacagatt ttaaaaggaa atacagtatt gaagagattt 7920
atttattat tgcttcaatt agctccattt acgtgttgaa ttcattgaag aggtccaatg 7980
agaaaaaaac agaagcctcc ttatttcaca cgttttcctc ctttagtacc atcctcatcc 8040
aattactgtc tctctgatac tacttaatag caggggggttt gcagaaattt ctgtttgcca 8100
tgtaaaactg tgaatagtaa tttattttag atagtcgatg aacttgtggg ttttagctca 8160
caatgcagcc ttcccttttg cagtgttttttttttgtttttttttttttttgtcttttac 8220
tgtgccatcg atctttgata ttgcattgaa agacaatata ccacagtagc accttgaact 8280
cagtgaaaat tgttcaggat caaaatacca agtgttcttt tagagggaag gaaaaagtac 8340
acacactctc ctctcacaat gatatatttt atacattcat ttgttatttg tttcatgctt 8400
tatgattcca gatggaaagg taatttcagt gacttttcaa gtttaaattc catttataggt 8460
aaatgataag ttatgatgca aataaaatct ataagatccc cagggcaaat aaaaatcaaa 8520
acatgaagta gaagatgtgg ccgtgaggta gtttatgtaa caaattcaaa gtgaaaatca 8580
tgtttacttt tacttatact tatttgataa aaatattttt gaaacgatag tacttatttt 8640
attatttgat atttcagttc ctattcaatt gtggcagatt ttctctgttt cacattttag 8700
attggcgttg gtaatagaaa tgtcagaatg ttcaaattgg ccttcacgtt gtcggagtga 8760
aacacattgac acctagcttt aagactgatt tatctgttgg tgtactgaag gtttccatgt 8820
aggacttcaa atgtggaaaa ggaaaagcag tcaggaaaat ggggcattct ttggagagtc 8880
acgcgttttg attcggacat ttccgtagag ctcggctccc agtgttgtgt tcctcggtcg 8940
aaagggtctc tgctgtttgg ggactcactg gcctctccta gggactcctt tgtcttgtga 9000
accccacgct gttggattct gtatcattat gctgaattct ctgcacagtt ttccctggcc 9060
aacctgccca catccttgga gatttgcttt gccagtggga atccttacat tgctgtttca 9120
cagtagacgg gacgaggtca gcgggagtcg tgctcctaac aacacacattg aacgaaacag 9180
aagatgattg aaagtgtgag gaggctcgtg tgcaagggag aacagggtta ctatacatat 9240
tagtgtatat atatacatac atatatatat atatatattg tacatatcta agtttgagtc 9300
attcaaacta ggtgcaaaat gctgacttca gagtctgaat taacatctct gttcccatat 9360
ccctgacctg ctccctggtc aacgatgcta tgaaatcctg aaatgacagg acatacatac 9420
atacaagaaa ccacatatca aattagatat gattttcctt tgtgtgcaaa gtcaaactgt 9480
cctagggttg ccagtttgaa gcatgttatt taaatgaaaa aaaaaatcag tgaaattctc 9540
gtgtgagaat tctgcctagt ttcttcctaa ggttgtgtgc agtgttgaac ggcgtctccg 9600
caaggtgttg gaggatctca ttttagggca gtcaggagct gtgcttgctg agttaggtct 9660
agaagactct tccctgaagg caacgggaac acgcgtgagg gacgcgacca cacactaaca 9720
gaggacacgt gcttcagagc tgtttaaaac tgctgcttgt tttacacaca catcttgcct 9780
tttttcaggc tagctgcaat aatttttttc ttctgtaaaa tattttgtaa acaacaacaa 9840
aaagctatta taaaaagggg gtaaaaaaaa gaacgctggc attatgatca ggaaaaccca 9900
ttgtcatcgc cgaccctccc tcccgtccca ccaacacgctg ctgtcacgac gtaggtgcga 9960
aagacctttt tgtacagaga tatatttttt atgaagaatt tgtaaaatta ttaaatatgc 10020
tgtaattttt tgattaatgt aggtacattg ttaaaaaata aatgttttta caatacagaa 10080
ctgtaatttt cccaataatg taaaatgtac catctctagc tgattttcag ttccaatcct 10140
attacacatg tattaatatt aaagtggcct gttaaaatga acagtatctt ttttttgtca 10200
aaaaaattat aaagagggtg taatatagcc tgtgcaatgc caccaatctt taaagcaaat 10260
cagagttcta attaaatatt taatttta 10288
<![CDATA[ <210> 282]]>
<![CDATA[ <211> 14417]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 282]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agggtcagaa ctgtggaggc 600
ttagtccagg gtcccaatgg cactattgag agcccagggt ttcctcacgg gtatccgaac 660
tatgccaact gcacctggat catcatcacg ggcgagcgca ataggataca gttgtccttc 720
catacctttg ctcttgaaga agattttgat attttatcag tttacgatgg acagcctcaa 780
caagggaatt taaaagtgag attatcggga tttcagctgc cctcctctat agtgagtaca 840
ggatctatcc tcactctgtg gttcacgaca gacttcgctg tgagtgccca aggtttcaaa 900
gcatttatg aagttttacc tagccaacact tgtggaaatc ctggagaaat cctgaaagga 960
gttctgcatg gaacgagatt caacatagga gacaaaatcc ggtacagctg cctccctggc 1020
tacatcttgg aaggccacgc catcctgacc tgcatcgtca gcccaggaaa tggtgcatcg 1080
tgggacttcc cagctccctt ttgcagagct gagggagcct gcggaggaac cttacgcggg 1140
accagcagct ccatctccag cccgcacttc ccttcagagt acgagaacaa cgcggactgc 1200
acctggacca ttctggctga gcccggggac accattgcgc tggtcttcac tgactttcag 1260
ctagaagaag gatatgattt cttagagatc agtggcacgg aagctccatc catatggcta 1320
actggcatga acctcccctc tccagttatc agtagcaaga attggctacg actccatttc 1380
acctctgaca gcaaccaccg acgcaaagga tttaacgctc agttccaagt gaaaaaggcg 1440
attgagttga agtcaagagg agtcaagatg ctgcccagca aggatggaag ccataaaaac 1500
tctgtcttga gccaaggagg tgttgcattg gtctctgaca tgtgtccaga tcctggggatt 1560
ccagaaaatg gtagaagagc aggttccgac ttcagggttg gtgcaaatgt acagttttca 1620
tgtgaggaca attacgtgct ccagggatct aaaagcatca cctgtcagag agttacagag 1680
acgctcgctg cttggagtga ccacaggccc atctgccgag cgagaacatg tggatccaat 1740
ctgcgtgggc ccagcggcgt cattacctcc cctaattatc cggttcagta tgaagataat 1800
gcacactgtg tgtgggtcat caccaccacc gacccggaca aggtcatcaa gcttgccttt 1860
gaagagtttg agctggagcg aggctatgac accctgacgg ttggtgatgc tgggaaggtg 1920
ggagacacca gatcggtctt gtacgtgctc acgggatcca gtgttcctga cctcattgtg 1980
agcatgagca accagatgtg gctacatctg cagtcggatg atagcattgg ctcacctggg 2040
tttaaagctg tttaccaaga aattgaaaag gggaggtgtggggatcctgg aatccccgcc 2100
tatgggaagc ggacggggcag cagtttcctc catggagata cactcacctt tgaatgcccg 2160
gcggcctttg agctggtggg ggagagagtt atcacctgtc agcagaacaa tcagtggtct 2220
ggcaacaagc ccagctgtgt attttcatgt ttcttcaact ttacggcatc atctgggatt 2280
attctgtcac caaattatcc agaggaatat gggaacaaca tgaactgtgt ctggttgatt 2340
atctcggagc caggaagtcg aattcaccta atctttaatg attttgatgt tgagcctcag 2400
tttgactttc tcgcggtcaa ggatgatggc atttctgaca taactgtcct gggtactttt 2460
tctggcaatg aagtgccttc ccagctggcc agcagtgggc atatagttcg cttggaattt 2520
cagtctgacc attccactac tggcagaggg ttcaacatca cttacaccac atttggtcag 2580
aatgagtgcc atgatcctgg cattcctata aacggacgac gttttggtga caggtttcta 2640
ctcggggagct cggtttcttt ccactgtgat gatggctttg tcaagaccca gggatccgag 2700
tccattacct gcatactgca agacgggaac gtggtctgga gctccaccgt gccccgctgt 2760
gaagctccat gtggtggaca tctgacagcg tccagcggag tcattttgcc tcctggatgg 2820
ccaggatatt ataaggattc tttacattgt gaatggataa ttgaagcaaa accaggccac 2880
tctatcaaaa taacttttga cagatttcag acagaggtca attatgacac cttggaggtc 2940
agagatgggc cagccagttc gtccccactg atcggcgagt accacggcac ccaggcaccc 3000
cagttcctca tcagcaccgg gaacttcatg tacctgctgt tcaccactga caacagccgc 3060
tccagcatcg gcttcctcat ccactatgag agtgtgacgc ttgagtcgga ttcctgcctg 3120
gacccgggca tccctgtgaa cggccatcgc cacggtggag actttggcat caggtccaca 3180
gtgactttca gctgtgaccc ggggtacaca ctaagtgacg acgagcccct cgtctgtgag 3240
aggaaccacc agtggaacca cgccttgccc agctgcgacg ctctatgtgg aggctacatc 3300
caagggaaga gtggaacagt cctttctcct gggtttccag atttttatcc aaactctcta 3360
aactgcacgt ggaccatga agtgtctcat gggaaaggag ttcaaatgat ctttcacacc 3420
tttcatcttg agagttccca cgactattta ctgatcacag aggatggaag tttttccgag 3480
cccgttgcca ggctcaccgg gtcggtgttg cctcatacga tcaaggcagg cctgtttgga 3540
aacttcactg cccagcttcg gtttatatca gacttctcaa tttcgtacga gggcttcaat 3600
atcacatttt cagaatatga cctggagcca tgtgatgatc ctggagtccc tgccttcagc 3660
cgaagaattg gttttcactt tggtgtggga gactctctga cgttttcctg cttcctggga 3720
tatcgtttag aaggtgccac caagcttacc tgcctgggtg ggggccgccg tgtgtggagt 3780
gcacctctgc caaggtgtgt ggccgaatgt ggagcaagtg tcaaaggaaa tgaaggaaca 3840
ttactgtctc caaattttcc atccaattat gataataacc atgagtgtat ctataaaata 3900
gaaacagaag ccggcaaggg catccacctt agaacacgaa gcttccagct gtttgaagga 3960
gatactctaa aggtatatga tggaaaagac agttcctcac gtccactggg cacgttcact 4020
aaaaatgaac ttctggggct gatcctaaac agcacatcca atcacctgtg gctagagttc 4080
aacaccaatg gatctgacac cgaccaaggt tttcaactca cctataccag ttttgatctg 4140
gtaaaatgtg aggatccggg catccctaac tacggctata ggatccgtga tgaaggccac 4200
tttaccgaca ctgtagttct gtacagttgc aacccggggt acgccatgca tggcagcaac 4260
accctgacct gtttgagtgg agacaggaga gtgtgggaca aacccactacc ttcgtgcata 4320
gcggaatgtg gtggtcagat ccatgcagcc acatcaggac gaatattgtc ccctggctat 4380
ccagctccgt atgacaacaa cctccactgc acctggatta tagaggcaga cccaggaaag 4440
accattagcc tccatttcat tgttttcgac acggagatgg ctcacgacat cctcaaggtc 4500
tgggacgggc cggtggacag tgacatcctg ctgaaggagt ggagtggctc cgcccttccg 4560
gaggacatcc acagcacctt caactcactc accctgcagt tcgacagcga cttcttcatc 4620
agcaagtctg gcttctccat ccagttctcc acctcaattg cagccacctg taacgatcca 4680
ggtatgcccc aaaatggcac ccgctatgga gacagcagag aggctggaga caccgtcaca 4740
ttccagtgtg accctggcta tcagctccaa ggacaagcca aaatcacctg tgtgcagctg 4800
aataaccggt tcttttggca accagaccct cctacatgca tagctgcttg tggagggaat 4860
ctgacgggcc cagcaggtgttttttgtca cccaactacc cacagccgta tcctcctggg 4920
aaggaatgtg actggagagt aaaagtgaac ccggactttg tcatcgcctt gatattcaaa 4980
agtttcaaca tggagcccag ctatgacttc ctacacatct atgaagggga agattccaac 5040
agccccctca ttggggagtta ccagggctct caggccccag aaagaataga gagtagcgga 5100
aacagcctgt ttctggcatt tcggagtgat gcctccgtgg gcctttcagg gttcgccatt 5160
gaatttaaag agaaaccacg ggaagcttgt tttgacccag gaaatataat gaatgggaca 5220
agagttggaa cagacttcaa gcttggctcc accatcacct accacgtgtga ctctggctat 5280
aagattcttg acccctcatc catcacctgt gtgattgggg ctgatgggaa accctcctgg 5340
gaccaagtgc tgccctcctg caatgctccc tgtggaggcc agtacacggg atcagaaggg 5400
gtagttttat caccaaacta cccccataat tacacagctg gtcaaatatg cctctattcc 5460
atcacggtac caaaggaatt cgtggtcttt ggacagtttg cctatttcca gacagccctg 5520
aatgatttgg cagaattatt tgatggaacc catgcacagg ccagacttct cagctcactc 5580
tcggggtctc actcagggga aacattgccc ttggctacgt caaatcaaat tctgctccga 5640
ttcagtgcaa agagcggtgc ctctgcccgc ggcttccact tcgtgtatca agctgttcct 5700
cgtaccagtg acacccaatg cagctctgtc cccgagccca gatacggaag gagaattggt 5760
tctgagtttt ctgccggctc catcgtccga ttcgagtgca acccgggata cctgcttcag 5820
ggttccacgg cgctccactg ccagtccgtg cccaacgcct tggcacagtg gaacgacacg 5880
atccccagct gtgtggtacc ctgcagtggc aatttcactc aacgaagagg tacaatcctg 5940
tcccccggct accctgagcc atacggaaac aacttgaact gtatatggaa gatcatagtt 6000
acggagggct cgggaattca gatccaagtg atcagttttg ccacggagca gaactgggac 6060
tcccttgaga tccacgatgg tggggatgtg accgcaccca gactgggaag cttctcaggc 6120
accacagtac cggcactgct gaacagtact tccaaccaac tctacctgca tttccagtct 6180
gacattagtg tggcagctgc tggtttccac ctggaataca aaactgtagg tcttgctgca 6240
tgccaagaac cagccctccc cagcaacagc atcaaaatcg gagatcggta catggtgaac 6300
gacgtgctct ccttccagtg cgagcccggg tacaccctgc agggccgttc ccacatttcc 6360
tgtatgccag ggaccgttcg ccgttggaac tatccgtctc ccctgtgcat tgcaacctgt 6420
ggagggacgc tgagcacctt gggtggtgtg atcctgagcc ccggcttccc aggttcttac 6480
cccaacaact tagactgcac ctggaggatc tcattaccca tcggctatgg tgcacatatt 6540
cagtttctga atttttctac cgaagctaat catgacttcc ttgaaattca aaatggacct 6600
taccacacca gccccatgat tggacaattt agcggcacgg atctccccgc ggccctgctg 6660
agcacaacgc atgaaaccct catccacttt tatagtgacc attcgcaaaa ccggcaagga 6720
tttaaacttg cttaccaagc ctatgaatta cagaactgtc cagatccacc cccatttcag 6780
aatgggtaca tgatcaactc ggattacagc gtggggcaat cagtatcttt cgagtgttat 6840
cctgggtaca ttctaatagg ccatcctgtc ctcacttgtc agcatgggat caacagaaac 6900
tggaactacc cttttccaag atgtgatgcc ccttgtgggt acaacgtaac ttctcagaac 6960
ggcaccatct actcccctgg ctttcctgat gagtatccga tcctgaagga ctgcatttgg 7020
ctcatcacgg tgcctccagg gcacggagtt tacatcaact tcaccctgtt acagacggaa 7080
gctgtcaacg attacattgc tgtttgggac ggtcccgatc agaactcacc ccagctggga 7140
gttttcagtg gcaacacagc cctcgaaacg gcgtatagct ccaccaacca agtcctgctc 7200
aagttccaca gcgacttttc aaatggaggc ttctttgtcc tcaatttcca cgcatttcag 7260
ctcaagaaat gtcaacctcc cccagcggtt ccacaggcag aaatgcttac tgaggatgat 7320
gatttcgaaa tagggagattt tgtgaagtac cagtgccacc ccgggtacac cttggtgggg 7380
accgacattc tgacttgcaa gctcagttcc cagttgcagtttgagggttc tctcccaaca 7440
tgtgaagcac aatgcccagc aaatgaagtc cggactggat catcgggagt cattctcagt 7500
ccagggtatc cgggtaatta ttttaactcc cagacttgct cttggagtat taaagtggaa 7560
ccaaactaca aattaccat ctttgtggac aatttcaaa gtgaaaagca gtttgatgca 7620
ctggaagtgtttgatggttc ttctgggcaa agtcctctgc tagtagtctt aagtgggaat 7680
catactgaac aatcaaattt tacaagcagg agtaatcagt tatatctccg ctggtccact 7740
gaccatgcca ccagtaagaa aggattcaag attcgctatg cagcacctta ctgcagtttg 7800
acccacccccc tgaagaatgg gggtattcta aacaggactg caggagcggt tggaagcaaa 7860
gtgcattatt tttgcaagcc tggataccga atggtcggcc acagcaatgc aacctgtaga 7920
cgaaacccac ttggcatgta ccagtgggac tccctcacgc cactctgcca ggctgtgtcc 7980
tgtggaatcc cagaatcccc aggaaacggt tcatttaccg ggaacgagtt cactttggac 8040
agtaaagtgg tctatgaatg tcatgagggc ttcaagcttg aatccagcca gcaagcaaca 8100
gccgtgtgtc aagaagatgg gttgtggagt aacaagggga agccgcccac gtgtaagccg 8160
gtcgcttgcc ccagcattga agctcagctc tcagaacatg tcatctggag gctggtttca 8220
ggatccttga atgagtacgg tgctcaagta ttgctgagct gcagtcctgg ttactactta 8280
gaaggctgga ggctcctgcg gtgccaggcc aatgggacgt ggaacatagg agatgagagg 8340
ccaagctgtc gagttatctc gtgtggaagc ctttcctttc ccccaaatgg caacaagatt 8400
ggaacgttga cagtttatgg ggccacagct atattacgt gcaacaccgg ctacacgctt 8460
gtggggtctc atgtcagaga gtgcttggca aatggggctct ggagcggcag cgaaactcga 8520
tgtctggctg gccactgcgg ttccccagac ccgattgtga acggtcacat tagtggagat 8580
ggcttcagtt acagagacac ggtggtttac cagtgcaatc ctggtttccg gcttgtggga 8640
acttccgtga ggatatgcct gcaagaccac aagtggtctg gacaaacgcc tgtctgtgtc 8700
cccatcacat gtggtcaccc tggaaaccct gcccacggat tcactaatgg cagtgagttc 8760
aacctgaatg atgtcgtgaa tttcacctgc aacacgggct atttgctgca gggcgtgtct 8820
cgagcccagt gtcggagcaa cggccagtgg agtagccctc tgcccacgtg tcgagtggtg 8880
aactgttctg atccaggctt tgtggaaaat gccattcgtc acgggcaaca gaacttccct 8940
gagagttttg agtatggaat gagtatcctg taccatgca agaagggatt ttacttgctg 9000
ggatcttcag ccttgacctg tatggcaaat ggcttatggg accgatccct gcccaagtgt 9060
ttggctatat cgtgtggaca cccaggggtc cctgccaacg ccgtcctcac tggagagctg 9120
tttacctatg gcgccgtcgt gcactactcc tgcagaggga gcgagagcct cataggcaac 9180
gacacgagag tgtgccagga agacagtcac tggagcgggg cactgcccca ctgcacagga 9240
aataatcctg gattctgtgg tgatccgggg accccagcac atgggtctcg gcttggtgat 9300
gactttaaga caaagagtct tctccgcttc tcctgtgaaa tggggcacca gctgaggggc 9360
tcccctgaac gcacgtgttt gctcaatggg tcatggtcag gactgcagcc ggtgtgtgag 9420
gccgtgtcct gtggcaaccc tggcacaccc accaacggaa tgattgtcag tagtgatggc 9480
attctgttct ccagctcggt catctatgcc tgctgggaag gctacaagac ctcagggctc 9540
atgacacggc attgcacagc caatgggacc tggacaggca ctgctcccga ctgcacaatt 9600
ataagttgtg gggatccagg caactagca aatggcatcc agtttgggac cgacttcacc 9660
ttcaacaaga ctgtgagcta tcagtgtaac ccaggctatg tcatggaagc agtcacatcc 9720
gccactattc gctgtaccaa agacggcagg tggaatccga gcaaacctgt ctgcaaagcc 9780
gtgctgtgtc ctcagccgcc gccggtgcag aatggaacag tggagggaag tgatttccgc 9840
tggggctcca gcataagtta cagctgcatg gacggttacc agctctctca ctccgccatc 9900
ctctcctgtg aaggtcgcgg ggtgtggaaa ggagagatcc cccagtgtct ccctgtgttc 9960
tgcggagacc ctggcatccc cgcagaaggg cgacttagtg ggaaaagttt cacctataag 10020
tccgaagtct tcttccagtg caaatctcca tttatactcg tgggatcctc cagaagagtc 10080
tgccaagctg acggcacgtg gagcggcata caacccacct gcattgatcc tgctcataac 10140
acctgcccag accctggtac gccaacacttt ggaatacaga atagctccag aggctatgag 10200
gttggaagca cggttttttt caggtgcaga aaaggctacc atattcaagg ttccacgact 10260
cgcacctgcc ttgccaattt aacatggagt gggatacaga ccgaatgtat acctcatgcc 10320
tgcagacagc cagaaaccccc ggcacacgcg gatgtgagag ccatcgatct tcctactttc 10380
ggctacacct tagtgtacac ctgccatcca ggctttttcc tcgcagggggg atctgagcac 10440
agaacatgta aagcagacat gaaatggaca ggaaagtcgc ctgtgtgtaa aagtaaagga 10500
gtgagagaag ttaatgaaac agttactaaa actccagttc cttcagatgt ctttttcgtc 10560
aattcactgt ggaaggggta ttatgaatat ttagggaaaa gacaacccgc cactctaact 10620
gttgactggt tcaatgcaac aagcagtaag gtgaatgcca ccttcagcga agcctcgcca 10680
gtggagctga agttgacagg catttacaag aaggaggagg cccacttact cctgaaagct 10740
tttcaaatta aaggccaggc agatattttt gtaagcaagt tcgaaaatga caactgggga 10800
ctagatggtt atgtgtcatc tggacttgaa agaggagat ttacttttca aggtgacatt 10860
catggaaaag actttggaaa atttaagcta gaaaggcaag atcctttaaa cccagatcaa 10920
gactcttcca gtcattacca cggcaccagc agtggctctg tggcggctgc cattctggtt 10980
cctttctttg ctctaatttt atcagggttt gcattttacc tctacaaaca cagaacgaga 11040
ccaaaagttc aatacaatgg ctatgctggg catgaaaaca gcaatggaca agcatcgttt 11100
gaaaacccca tgtatgatac aaacttaaaa cccacagaag ccaaggctgt gaggtttgac 11160
acaactctga acacagtctg tacagtggta tagccctcag tgccccaaca ggactgattc 11220
atagccatac ctctgatgga caagcagtga ttcctttggt gccatatacc actctccctt 11280
ccactctggc tttactgcag cgatcttcaa ccttgtctac tggcataagt gcagcgggga 11340
tctctactca aatgtgtcag ggtcttctac ggatcaaact acacatgcgt tttcattcca 11400
aaagtgggtt ctaaatgcct ggctgcatct gtatgaaatc aaggcacact ccaggaagac 11460
tgccacgtcg cgccaacacg tcatactcaa tgcctcagac tttcatattt ctgtgttgct 11520
gagatgcctt tcaatgcaat cgtctgggct cgtggatatg tccctcaggt gcggtgacag 11580
aatggtggca ccacgatatg tgttctcttg tgttgttttt cctttttaaa cccccatgaa 11640
cacgaatact ctgaaaaaaa taaaaagctt tctggaagaa gacacctttc tgatagaggc 11700
tcacacctac aaatgcttca ctctgtcctt ccgagacctg acaagctttg aggacctcac 11760
agctcccctg tgtgttcatc tctagggatg tttgcaattt cccagtcagc tgttctgtcg 11820
cagaatgttt aatgcacaat tttttgcact agtgtgttat gaatgactaa gattctgata 11880
aaaaaaataa atttattaca cagggtttat acacactatc cattgtatat aagcattatt 11940
tcatattatc aagctaaaca ttcccccatc agcttagttg gagtgttagg gaaaagtatt 12000
cctagatatg gcacagattt taaaaggaaa tacagtattg aagagatta ttttattatt 12060
gcttcaatta gctccatta cgtgttgaat tcattgaaga ggtccaatga gaaaaaaaca 12120
gaagcctcct tatttcacac gttttcctcc tttagtacca tcctcatcca attackgtct 12180
ctctgatact acttaatagc agggggtttg cagaaatttc tgtttgccat gtaaaactgt 12240
gaatagtaat ttattttaga tagtcgatga acttgtgggt tttagctcac aatgcagcct 12300
tcccttttgc agtgttttttttttgtttttttttttttttgtcttttact gtgccatcga 12360
tctttgatat tgcattgaaa gacaatatac cacagtagca ccttgaactc agtgaaaatt 12420
gttcaggatc aaaataccaa gtgttctttt agagggaagg aaaaagtaca cacactctcc 12480
tctcacaatg atatatttta tacattcatt tgttatttgt ttcatgcttt atgattccag 12540
atggaaaggt aatttcagtg acttttcaag tttaaattcc attataggta aatgataagt 12600
tatgatgcaa ataaaatcta taagatcccc agggcaaata aaaatcaaaa catgaagtag 12660
aagatgtggc cgtgaggtag tttatgtaac aaattcaaag tgaaaatcat gtttactttt 12720
acttatactt atttgataaa aatatttttg aaacgatagt acttatttta ttatttgata 12780
tttcagttcc tattcaattg tggcagattt tctctgtttc aattttaga ttggcgttgg 12840
taatagaaat gtcagaatgt tcaaattggc cttcacgttg tcggagtgaa cacattgaca 12900
cctagcttta agactgattt atctgttggt gtactgaagg tttccatgta ggacttcaaa 12960
tgtggaaaag gaaaagcagt caggaaaatg gggcattctt tggagagtca cgcgttttga 13020
ttcggacatt tccgtagagc tcggctccca gtgttgtgtt cctcggtcga aagggtctct 13080
gctgtttggg gactcactgg cctctcctag ggactccttt gtcttgtgaa ccccacgctg 13140
ttggattctg tatcattatg ctgaattctc tgcacagttt tccctggcca acctgcccac 13200
atccttggag atttgctttg ccagtgggaa tccttacatt gctgtttcac agtagacggg 13260
acgaggtcag cgggagtcgt gctcctaaca caacacattga acgaaacaga agatgattga 13320
aagtgtgagg aggctcgtgt gcaagggaga acagggttac tatacatatt agtgtatata 13380
tatacataca tatatatata tatatattgt acatatctaa gtttgagtca ttcaaactag 13440
gtgcaaaatg ctgacttcag agtctgaatt aacatctctg ttcccatatc cctgacctgc 13500
tccctggtca acgatgctat gaaatcctga aatgacagga catacataca tacaagaaac 13560
cacatatcaa attagatatg attttccttt gtgtgcaaag tcaaactgtc ctagggttgc 13620
cagtttgaag catgttattt aaatgaaaaa aaaaatcagt gaaattctcg tgtgagaatt 13680
ctgcctagtt tcttcctaag gttgtgtgca gtgttgaacg gcgtctccgc aaggtgttgg 13740
aggatctcat tttagggcag tcaggagctg tgcttgctga gttaggtcta gaagactctt 13800
ccctgaaggc aacgggaaca cgcgtgaggg acgcgaccac aactaacag aggacacgtg 13860
cttcagagct gtttaaaact gctgcttgtt ttacacacac atcttgcctt ttttcaggct 13920
agctgcaata atttttttct tctgtaaaat attttgtaaa caacaacaaa aagctattat 13980
aaaaaggggg taaaaaaaag aacgctggca ttatgatcag gaaaacccat tgtcatcgcc 14040
gaccctccct cccgtccccac cacacgctgc tgtcacgacg taggtgcgaa agaccttttt 14100
gtacagagat atatttttta tgaagaattt gtaaaattat taaatatgct gtaatttttt 14160
gattaatgta ggtacattgt taaaaaataa atgtttttac aatacagaac tgtaattttc 14220
ccaataatgt aaaatgtacc atctctagct gattttcagt tccaatccta ttacacatgt 14280
attaatatta aagtggcctg ttaaaatgaa cagtatcttt tttttgtcaa aaaaattata 14340
aagagggtgt aatatagcct gtgcaatgcc accaatcttt aaagcaaatc agagttctaa 14400
ttaaatattt aatttta 14417
<![CDATA[ <210> 283]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 283]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys
20 25
<![CDATA[ <210> 284]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 284]]>
Gln Asn Cys Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser
1 5 10 15
Pro Gly Phe Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile
20 25 30
Ile Ile Thr Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe
35 40 45
Ala Leu Glu Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro
50 55 60
Gln Gln Gly Asn Leu Lys Val
65 70
<![CDATA[ <210> 285]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 285]]>
Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val Ser Thr Gly Ser Ile
1 5 10 15
Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val Ser Ala Gln Gly Phe
20 25 30
Lys Ala Leu Tyr Glu
35
<![CDATA[ <210> 286]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 286]]>
Leu Pro Ser His Thr Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val
1 5 10 15
Leu His Gly Thr Arg Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys
20 25 30
Leu Pro Gly Tyr Ile Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val
35 40 45
Ser Pro Gly Asn Gly Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg
50 55 60
<![CDATA[ <210> 287]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 287]]>
Glu Gly Ala Cys Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ser Ile Ser
1 5 10 15
Ser Pro His Phe Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp
20 25 30
Thr Ile Leu Ala Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp
35 40 45
Phe Gln Leu Glu Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu
50 55 60
Ala Pro Ser Ile
65
<![CDATA[ <210> 288]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 288]]>
Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn Trp
1 5 10 15
Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly Phe
20 25 30
Asn Ala Gln Phe Gln
35
<![CDATA[ <210> 289]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 289]]>
Lys Lys Ala Ile Glu Leu Lys Ser Arg Gly Val Lys Met Leu Pro Ser
1 5 10 15
Lys Asp Gly Ser His Lys Asn Ser Val
20 25
<![CDATA[ <210> 290]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 290]]>
Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro Gly
1 5 10 15
Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe
20 25
<![CDATA[ <210> 291]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 291]]>
Val Gly Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln
1 5 10 15
Gly Ser Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala
20 25 30
Trp Ser Asp His Arg Pro Ile Cys Arg
35 40
<![CDATA[ <210> 292]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 292]]>
Arg Thr Cys Gly Ser Asn Leu Arg Gly Pro Ser Gly Val Ile Thr Ser
1 5 10 15
Pro Asn Tyr Pro Val Gln Tyr Glu Asp Asn Ala His Cys Val Trp Val
20 25 30
Ile Thr Thr Thr Asp Pro Asp Lys
35 40
<![CDATA[ <210> 293]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 293]]>
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
1 5 10 15
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
20 25 30
Leu Tyr
<![CDATA[ <210> 294]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 294]]>
Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met Ser Asn Gln
1 5 10 15
Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser Pro Gly Phe
20 25 30
Lys Ala Val Tyr Gln
35
<![CDATA[ <210> 295]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 295]]>
Ile Glu Lys Gly Gly Cys Gly Asp Pro Gly Ile Pro Ala Tyr Gly Lys
1 5 10 15
Arg Thr Gly Ser Ser Phe Leu His Gly Asp Thr Leu Thr Phe Glu Cys
20 25 30
Pro Ala Ala Phe Glu Leu Val Gly Glu Arg Val Ile Thr Cys Gln Gln
35 40 45
Asn Asn Gln Trp Ser Gly Asn Lys Pro Ser Cys Val
50 55 60
<![CDATA[ <210> 296]]>
<![CDATA[ <211> 108]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 296]]>
Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser Gly Ile Ile Leu Ser Pro
1 5 10 15
Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met Asn Cys Val Trp Leu Ile
20 25 30
Ile Ser Glu Pro Gly Ser Arg Ile His Leu Ile Phe Asn Asp Phe Asp
35 40 45
Val Glu Pro Gln Phe Asp Phe Leu Ala Val Lys Asp Asp Gly Ile Ser
50 55 60
Asp Ile Thr Val Leu Gly Thr Phe Ser Gly Asn Glu Val Pro Ser Gln
65 70 75 80
Leu Ala Ser Ser Gly His Ile Val Arg Leu Glu Phe Gln Ser Asp His
85 90 95
Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr Tyr Thr
100 105
<![CDATA[ <210> 297]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 297]]>
Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile Asn Gly Arg
1 5 10 15
Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser Phe His Cys
20 25 30
Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile Thr Cys Ile
35 40 45
Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro Arg Cys Glu
50 55 60
<![CDATA[ <210> 298]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 298]]>
Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val Ile Leu Pro Pro
1 5 10 15
Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys Glu Trp Ile Ile
20 25 30
Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe Asp
35 40 45
<![CDATA[ <210> 299]]>
<![CDATA[ <211> 62]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 299]]>
Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp Gly Pro
1 5 10 15
Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln Ala Pro
20 25 30
Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe Thr Thr
35 40 45
Asp Asn Ser Arg Ser Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
50 55 60
<![CDATA[ <210> 300]]>
<![CDATA[ <211> 62]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 300]]>
Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val Asn
1 5 10 15
Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr Phe
20 25 30
Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val Cys
35 40 45
Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp
50 55 60
<![CDATA[ <210> 301]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 301]]>
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
1 5 10 15
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
20 25 30
Glu Val Ser His Gly Lys
35
<![CDATA[ <210> 302]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 302]]>
Val Gln Met Ile Phe His Thr Phe His Leu Glu Ser Ser His Asp Tyr
1 5 10 15
Leu Leu Ile Thr Glu Asp Gly Ser Phe Ser Glu Pro Val Ala Arg Leu
20 25 30
Thr Gly Ser Val Leu Pro His Thr Ile Lys Ala Gly Leu Phe Gly Asn
35 40 45
Phe Thr Ala Gln Leu Arg Phe Ile Ser Asp Phe Ser Ile Ser Tyr Glu
50 55 60
Gly Phe Asn Ile Thr Phe Ser
65 70
<![CDATA[ <210> 303]]>
<![CDATA[ <211> 62]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 303]]>
Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg
1 5 10 15
Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys
20 25 30
Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu Thr Cys Leu Gly
35 40 45
Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro Arg Cys Val
50 55 60
<![CDATA[ <210> 304]]>
<![CDATA[ <211> 56]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 304]]>
Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro
1 5 10 15
Asn Phe Pro Ser Asn Tyr Asp Asn Asn His Glu Cys Ile Tyr Lys Ile
20 25 30
Glu Thr Glu Ala Gly Lys Gly Ile His Leu Arg Thr Arg Ser Phe Gln
35 40 45
Leu Phe Glu Gly Asp Thr Leu Lys
50 55
<![CDATA[ <210> 305]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 305]]>
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe Thr
1 5 10 15
Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn His Leu
20 25 30
Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln Gly Phe Gln
35 40 45
Leu Thr Tyr Thr
50
<![CDATA[ <210> 306]]>
<![CDATA[ <211> 63]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 306]]>
Phe Asp Leu Val Lys Cys Glu Asp Pro Gly Ile Pro Asn Tyr Gly Tyr
1 5 10 15
Arg Ile Arg Asp Glu Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser
20 25 30
Cys Asn Pro Gly Tyr Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu
35 40 45
Ser Gly Asp Arg Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile
50 55 60
<![CDATA[ <210> 307]]>
<![CDATA[ <211> 2353]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 307]]> Phe Asp Leu Val Lys Cys Glu Asp Pro Gly Ile Pro Asn Tyr Gly Tyr 1 5 10 15 Arg Ile Arg Asp Glu Gly His Phe Thr Asp Thr Val Val Leu Tyr Ser 20 25 30 Cys Asn Pro Gly Tyr Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu 35 40 45 Ser Gly Asp Arg Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile Ala 50 55 60 Glu Cys Gly Gly Gln Ile His Ala Ala Thr Ser Gly Arg Ile Leu Ser 65 70 75 80 Pro Gly Tyr Pro Ala Pro Tyr Asp Asn Asn Leu His Cys Thr Trp Ile 85 90 95 Ile Glu Ala Asp Pro Gly Lys Thr Ile Ser Leu His Phe Ile Val Phe 100 105 110 Asp Thr Glu Met Ala His Asp Ile Leu LYS Val Trp Asp Gly Pro Val 115 120 125 ASP Serite Leu LEU LYS GLU TRP Serp Serge Serge Seru Glu 130 135 140 ASP ILE HIS THR Leu Thr Leu Gln PHE ASP SP S ER ASP 145 150 155 160 Phe Phe Ile Ser Lys Ser Gly Phe Ser Ile Gln Phe Ser Thr Ser Ile 165 170 175 Ala Ala Thr Cys Asn Asp Pro Gly Met Pro Gln Asn Gly Thr Arg Tyr 180 185 190 Gly Asp Ser Arg Glu Ala Gly Asp Thr Val Thr Phe Gln Cys Asp Pro 195 200 205 Gly Tyr Gln Leu Gln Gly Gln Ala Lys Ile Thr Cys Val Gln Leu Asn 210 215 220 Asn Arg Phe Phe Trp Gln Pro Asp Pro Pro Thr Cys Ile Ala Ala Cys 225 230 235 240 Gly Gly Asn Leu Thr Gly Pro Ala Gly Val Ile Leu Ser Pro Asn Tyr 245 250 255 Pro Gln Pro Tyr Pro Pro Gly Lys Glu Cys Asp Trp Arg Val Lys Val 260 265 270 Asn Pro Asp Phe Val Ile Ala Leu Ile Phe Lys Ser Phe Asn Met Glu 275 280 285 Pro Ser Tyr Asp Phe Leu His Ile Tyr Glu Gly Glu Asp Ser Asn Ser 290 295 300 Pro Leu Ile Gly Ser Tyr Gln Gly Ser Gln Ala Pro Glu Arg Ile Glu 305 310 315 320 Ser Ser Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val 325 330 335 Gly Leu Ser Gly Phe Ala Ile Glu Phe Lys Glu Lys Pro Arg Glu Ala 340 345 350 Cys Phe Asp Pro Gly Asn Ile Met Asn Gly Thr Arg Val Gly Thr Asp 355 360 365 Phe Lys Leu Gly Ser Thr Ile Thr Tyr Gln Cys Asp Ser Gly Tyr Lys 370 375 380 Ile Leu Asp Pro Ser Ser Ile Thr Cys Val Ile Gly Ala Asp Gly Lys 385 390 395 400 Pro Ser Trp Asp Gln Val Leu Pro Ser Cys Asn Ala Pro Cys Gly Gly 410 415 GLN TYR THR GLY Ser Gly Val Val Leu Sern Tyr Pro His 420 425 430 ASN TYR Ala Gln Ile Cys Leu Tyr Val Pro Lys 43 5 440 445 GLU PHE VAL VAL PHE Gly Gln Phe Ala Tyr Phe Gln Thr Ala Leu Asn 450 455 460 Asp Leu Ala Glu Leu Phe Asp Gly Thr His Ala Gln Ala Arg Leu Leu 465 470 475 480 Ser Ser Leu Ser Gly Ser His Ser Gly Glu Thr Leu Pro Leu Ala Thr 485 490 495 Ser Asn Gln Ile Leu Leu Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala 500 505 510 Arg Gly Phe His Phe Val Tyr Gln Ala Val Pro Arg Thr Ser Asp Thr 515 520 525 Gln Cys Ser Ser Val Pro Glu Pro Arg Tyr Gly Arg Arg Ile Gly Ser 530 535 540 Glu Phe Ser Ala Gly Ser Ile Val Arg Phe Glu Cys Asn Pro Gly Tyr 545 550 555 560 Leu Leu Gln Gly Ser Thr Ala Leu His Cys Gln Ser Val Pro Asn Ala 565 570 575 Leu Ala Gln Trp Asn Asp Thr Ile Pro Ser Cys Val Val Pro Cys Ser 580 585 590 Gly Asn Phe Thr Gln Arg Arg Gly Thr Ile Leu Ser Pro Gly Tyr Pro 595 600 605 Glu Pro Tyr Gly Asn Asn Asn Leu Asn Cys Ile Trp Lys Ile Ile Val Thr 610 615 620 Glu Gly Ser Gly Ile Gln Ile Gln Val Ile Ser Phe Ala Thr Glu Gln 625 630 635 640 Asn Trp Asp Ser Leu Glu Ile His Asp Gly Gly Asp Val Thr Ala Pro 645 650 655 Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro Ala Leu Leu Asn Ser 660 665 670 Thr Ser Asn Gln Leu Tyr Leu His Phe Gln Ser Asp Ile Ser Val Ala 675 680 685 Ala Ala Gly Phe His Leu Glu Tyr Lys Thr Val Gly Leu Ala Ala Cys 690 695 700 GLN GLU Pro Ala Leu Pro Sero Serite Gly ASP ARG TYR 705 710 715 720 Met Val Leu Seru Gln Gln Glu GLY TYR Leu 725 735 Gln Gly. ARG Ser His Ile Ser Cys Met Pro Gly Thr Val Arg Arg Trp 740 745 750 Asn Tyr Pro Ser Pro Leu Cys Ile Ala Thr Cys Gly Gly Thr Leu Ser 755 760 765 Thr Leu Gly Gly Val Ile Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro 770 775 780 Asn Asn Leu Asp Cys Thr Trp Arg Ile Ser Leu Pro Ile Gly Tyr Gly 785 790 795 800 Ala His Ile Gln Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe 805 810 815 Leu Glu Ile Gln Asn Gly Pro Tyr His Thr Ser Pro Met Ile Gly Gln 820 825 830 Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr His Glu 835 840 845 Thr Leu Ile His Phe Tyr Ser Asp His Ser Gln Asn Arg Gln Gly Phe 850 855 860 Lys Leu Ala Tyr Gln Ala Tyr Glu Leu Gln Asn Cys Pro Asp Pro Pro 865 870 875 880 Pro Phe Gln Asn Gly Tyr Met Ile Asn Ser Asp Tyr Ser Val Gly Gln 885 890 895 Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr Ile Leu Ile Gly His Pro 900 905 910 Val Leu ThR CYS GLN HIS GLY ILE Asn ARG Asn Trp Asn Tyr Pro PHE 915 920 925 Pro ARG CYS Asp Ala Pro Cys GLY TYR ASN GLN GLN GLN GLN GLN 935 940 Thr Ile Tyr Ser Pro Gly PHE Pro ASP GLU Tyr Pro Ile Leu Lys Asp 945 950 955 960 Cys Ile Trp Leu Ile Thr Val Pro Gly His Gly Val Tyr Ile Asn 965 970 975 Phe Thr Leu Leu Gln Thr Glu Ala Val Asn Asp Tyr Ile Ala Val Trp 980 985 990 A sp Gly Pro Asp Gln Asn Ser Pro Gln Leu Gly Val Phe Ser Gly Asn 995 1000 1005 Thr Ala Leu Glu Thr Ala Tyr Ser Ser Thr Asn Gln Val Leu Leu 1010 1015 1020 Lys Phe His Ser Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn 1025 1030 1035 Phe His Ala Phe Gln Leu Lys Lys Cys Gln Pro Pro Pro Ala Val 1040 1045 1050 Pro Gln Ala Glu Met Leu Thr Glu Asp Asp Asp Phe Glu Ile Gly 1055 1060 1065 Asp Phe Val Lys Tyr Gln Cys His Pro Gly Tyr Thr Leu Val Gly 1070 1075 1080 Thr Asp Ile Leu Thr Cys Lys Leu Ser Ser Gln Leu Gln Phe Glu 1085 1090 1095 Gly Ser Leu Pro Thr Cys Glu Ala Gln Cys Pro Ala Asn Glu Val 1100 1105 1110 Arg Thr Gly S er Ser Gly Val Ile Leu Ser Pro Gly Tyr Pro Gly 1115 1120 1125 Asn Tyr Phe Asn Ser Gln Thr Cys Ser Trp Ser Ile Lys Val Glu 1130 1135 1140 Pro Asn Tyr Asn Ile Thr Ile Phe Val Asp Thr Phe Gln Ser Glu 1145 115 0 1155 Lys Gln Phe Asp Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gln 1160 1165 1170 Ser Pro Leu Leu Val Val Leu Ser Gly Asn His Thr Glu Gln Ser 1175 1180 1185 Asn Phe Thr Ser Arg Ser Asn Gln Leu Tyr Leu Arg Trp Ser Thr 1190 1195 1200 Asp His Ala Thr Ser Lys Lys Gly Phe Lys Ile Arg Tyr Ala Ala 1205 1210 1215 Pro Tyr Cys Ser Leu Thr His Pro Leu Lys Asn Gly Gly Ile Leu 1220 1225 1230 Asn Arg Thr Ala Gly Ala Val Gly Ser Lys Val His T yr Phe Cys 1235 1240 1245 Lys Pro Gly Tyr Arg Met Val Gly His Ser Asn Ala Thr Cys Arg 1250 1255 1260 Arg Asn Pro Leu Gly Met Tyr Gln Trp Asp Ser Leu Thr Pro Leu 1265 1270 1275 Cys Gln Ala Val S er Cys Gly Ile Pro Glu Ser Pro Gly Asn Gly 1280 1285 1290 Ser Phe Thr Gly Asn Glu Phe Thr Leu Asp Ser Lys Val Tyr 1295 1300 1305 Glu Cys His Glu Gly Phe Lys Leu Glu Ser Ser Gln Gln Ala Thr 1310 1315 1320 A la Val Cys Gln Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro 1325 1330 1335 Pro Thr Cys Lys Pro Val Ala Cys Pro Ser Ile Glu Ala Gln Leu 1340 1345 1350 Ser Glu His Val Ile Trp Arg Leu Val Ser Gly Ser Leu Asn Glu 1355 1360 1365 Tyr Gly Ala Gln Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu 1370 1375 1380 Glu Gly Trp Arg Leu Leu Arg Cys Gln Ala Asn Gly Thr Trp Asn 1385 1390 1395 Ile Gly Asp Glu Arg Pro Ser Cys Arg Val Ile Ser Cys Gly Ser 1400 1405 1410 Leu Ser Phe Pro Pro Asn Gly Asn Lys Ile Gly Thr Leu Thr Val 1415 1420 1425 Tyr Gly Ala Thr Ala Ile Phe Thr Cys Asn Thr Gly Tyr Thr Leu 1430 1435 1440 Val Gly Ser His Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser 1445 1450 1455 Gly Ser Glu Thr Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp 1460 1465 1470 Pro Ile Val Asn Gly His Ile Ser Gly Asp Gly Phe Ser Tyr Arg 1475 1480 1485 Asp Thr Val Val Tyr Gln Cys Asn Pro Gly Phe Arg Leu Val Gly 1490 1495 1500 Thr Ser Val Arg Ile Cys Leu Gln Asp His Lys Trp Ser Gly Gln 1505 1510 1515 Thr Pro Val Cys Val Pro Ile Thr Cys Gly His Pro Gly Asn Pro 1520 1525 1530 A la His Gly Phe Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val 1535 1540 1545 Val Asn Phe Thr Cys Asn Thr Gly Tyr Leu Leu Gln Gly Val Ser 1550 1555 1560 Arg Ala Gln Cys Arg Ser Asn Gly Gln Trp Ser Ser Pro Leu Pro 1565 1570 1575 Thr Cys Arg Val Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn 1580 1585 1590 Ala Ile Arg His Gly Gln Gln Asn Phe Pro Glu Ser Phe Glu Tyr 1595 1600 1605 Gly Met Ser Ile Leu Tyr His Cys Lys Lys Gly Phe Tyr Leu Leu 1610 1615 1620 Gly Ser Ala Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg 1625 1630 1635 Ser Leu Pro Lys Cys Leu Ala Ile Ser Cys Gly His Pro Gly Val 1640 1645 1650 Pro Ala Asn Ala Val Leu Th r Gly Glu Leu Phe Thr Tyr Gly Ala 1655 1660 1665 Val Val His Tyr Ser Cys Arg Gly Ser Glu Ser Leu Ile Gly Asn 1670 1675 1680 Asp Thr Arg Val Cys Gln Glu Asp Ser His Trp Ser Gly Ala Leu 1685 1690 1695 Pro His Cys Thr Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly 1700 1705 1710 Thr Pro Ala His Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys 1715 1720 1725 Ser Leu Leu Arg Phe Ser Cys Glu Met Gly His Gln Leu Arg Gly 1730 1735 1 740 Ser Pro Glu Arg Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu 1745 1750 1755 Gln Pro Val Cys Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro 1760 1765 1770 Thr Asn Gly Met Ile Val Ser Ser Asp Gly Ile Leu Phe Ser Ser 1775 1780 1785 Ser Val Ile Tyr Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu 1790 1795 1800 Met Thr Arg His Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala 1805 1810 1815 Pro Asp Cys Thr Ile Ile S er Cys Gly Asp Pro Gly Thr Leu Ala 1820 1825 1830 Asn Gly Ile Gln Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val 1835 1840 1845 Ser Tyr Gln Cys Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser 1850 1855 1860 Ala Th r Ile Arg Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys 1865 1870 1875 Pro Val Cys Lys Ala Val Leu Cys Pro Gln Pro Pro Pro Val Gln 1880 1885 1890 Asn Gly Thr Val Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser Ile 1895 1900 1905 S er Tyr Ser Cys Met Asp Gly Tyr Gln Leu Ser His Ser Ala Ile 1910 1915 1920 Leu Ser Cys Glu Gly Arg Gly Val Trp Lys Gly Glu Ile Pro Gln 1925 1930 1935 Cys Leu Pro Val Phe Cys Gly Asp Pro Gly Ile Pro Ala Glu Gly 1940 19 45 1950 Arg Leu Ser Gly Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe 1955 1960 1965 Gln Cys Lys Ser Pro Phe Ile Leu Val Gly Ser Ser Arg Arg Val 1970 1975 1980 Cys Gln Ala Asp Gly Thr Trp Ser Gly Ile Gln Pro Thr Cys Ile 1985 1990 1995 Asp Pro Ala His Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe 2000 2005 2010 Gly Ile Gln Asn Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val 2015 2020 2025 Phe Phe Arg Cys Arg Lys G ly Tyr His Ile Gln Gly Ser Thr Thr 2030 2035 2040 Arg Thr Cys Leu Ala Asn Leu Thr Trp Ser Gly Ile Gln Thr Glu 2045 2050 2055 Cys Ile Pro His Ala Cys Arg Gln Pro Glu Thr Pro Ala His Ala 2060 2065 2070 Asp Val Arg Ala Ile Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val 2075 2080 2085 Tyr Thr Cys His Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His 2090 2095 2100 Arg Thr Cys Lys Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val 2105 2110 2 115 Cys LYS Ser Lys Gly Val ARG Glu Val Asn Glu Thr Val THR LYS 2125 2130 THR Pro Val Pro Ser ASP Val PHE Val Asn Serou TRP LYS 2135 2145 GLY Tyr Tyr Glu Tyr Leu GLY LYS ARG GLN Pro Ala THR Leu ThR 2150 2155 2160 Val Asp Trp Phe Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe 2165 2170 2175 Ser Glu Ala Ser Pro Val Glu Leu Lys Leu Thr Gly Ile Tyr Lys 2180 2185 2190 Lys Glu Glu Ala His Leu Leu Leu Lys Ala Phe Gln Ile Lys Gly 2195 2200 2205 Gln Ala Asp Ile Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly 2210 2215 2220 Leu Asp Gly Tyr Val Ser Gly Leu Glu Arg Gly Gly Phe Thr 2225 2230 2235 Phe G ln Gly Asp Ile His Gly Lys Asp Phe Gly Lys Phe Lys Leu 2240 2245 2250 Glu Arg Gln Asp Pro Leu Asn Pro Asp Gln Asp Ser Ser Ser His 2255 2260 2265 Tyr His Gly Thr Ser Ser Ser Gly Ser Val Ala Ala Ala Ile Leu Val 2270 2275 228 0 Pro Phe Phe Ala Leu Ile Leu Ser Gly Phe Ala Phe Tyr Leu Tyr 2285 2290 2295 Lys His Arg Thr Arg Pro Lys Val Gln Tyr Asn Gly Tyr Ala Gly 2300 2305 2310 His Glu Asn Ser Asn Gly Gln Ala Ser Phe Glu Asn Pro Met Ty r 2315 2320 2325 Asp Thr Asn Leu Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp 2330 2335 2340 Thr Thr Leu Asn Thr Val Cys Thr Val Val 2345 2350 <![CDATA[ <210> 308]]>
<![CDATA[ <211> 3564]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 308]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys Gly Gln Asn Cys
20 25 30
Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser Pro Gly Phe
35 40 45
Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile Ile Ile Thr
50 55 60
Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe Ala Leu Glu
65 70 75 80
Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro Gln Gln Gly
85 90 95
Asn Leu Lys Val Arg Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val
100 105 110
Ser Thr Gly Ser Ile Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val
115 120 125
Ser Ala Gln Gly Phe Lys Ala Leu Tyr Glu Val Leu Pro Ser His Thr
130 135 140
Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val Leu His Gly Thr Arg
145 150 155 160
Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys Leu Pro Gly Tyr Ile
165 170 175
Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val Ser Pro Gly Asn Gly
180 185 190
Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg Ala Glu Gly Ala Cys
195 200 205
Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ile Ser Ser Pro His Phe
210 215 220
Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp Thr Ile Leu Ala
225 230 235 240
Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp Phe Gln Leu Glu
245 250 255
Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu Ala Pro Ser Ile
260 265 270
Trp Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn
275 280 285
Trp Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly
290 295 300
Phe Asn Ala Gln Phe Gln Val Lys Lys Ala Ile Glu Leu Lys Ser Arg
305 310 315 320
Gly Val Lys Met Leu Pro Ser Lys Asp Gly Ser His Lys Asn Ser Val
325 330 335
Leu Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro
340 345 350
Gly Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe Arg Val Gly
355 360 365
Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln Gly Ser
370 375 380
Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala Trp Ser
385 390 395 400
Asp His Arg Pro Ile Cys Arg Ala Arg Thr Cys Gly Ser Asn Leu Arg
405 410 415
Gly Pro Ser Gly Val Ile Thr Ser Pro Asn Tyr Pro Val Gln Tyr Glu
420 425 430
Asp Asn Ala His Cys Val Trp Val Ile Thr Thr Thr Asp Pro Asp Lys
435 440 445
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
450 455 460
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
465 470 475 480
Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met
485 490 495
Ser Asn Gln Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser
500 505 510
Pro Gly Phe Lys Ala Val Tyr Gln Glu Ile Glu Lys Gly Gly Cys Gly
515 520 525
Asp Pro Gly Ile Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu
530 535 540
His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val
545 550 555 560
Gly Glu Arg Val Ile Thr Cys Gln Gln Asn Asn Gln Trp Ser Gly Asn
565 570 575
Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser
580 585 590
Gly Ile Ile Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met
595 600 605
Asn Cys Val Trp Leu Ile Ile Ser Glu Pro Gly Ser Arg Ile His Leu
610 615 620
Ile Phe Asn Asp Phe Asp Val Glu Pro Gln Phe Asp Phe Leu Ala Val
625 630 635 640
Lys Asp Asp Gly Ile Ser Asp Ile Thr Val Leu Gly Thr Phe Ser Gly
645 650 655
Asn Glu Val Pro Ser Gln Leu Ala Ser Ser Gly His Ile Val Arg Leu
660 665 670
Glu Phe Gln Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr
675 680 685
Tyr Thr Thr Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile
690 695 700
Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser
705 710 715 720
Phe His Cys Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile
725 730 735
Thr Cys Ile Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro
740 745 750
Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val
755 760 765
Ile Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys
770 775 780
Glu Trp Ile Ile Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe
785 790 795 800
Asp Arg Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp
805 810 815
Gly Pro Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln
820 825 830
Ala Pro Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe
835 840 845
Thr Thr Asp Asn Ser Arg Ser Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
850 855 860
Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val
865 870 875 880
Asn Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr
885 890 895
Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val
900 905 910
Cys Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp Ala
915 920 925
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
930 935 940
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
945 950 955 960
Glu Val Ser His Gly Lys Gly Val Gln Met Ile Phe His Thr Phe His
965 970 975
Leu Glu Ser Ser His Asp Tyr Leu Leu Ile Thr Glu Asp Gly Ser Phe
980 985 990
Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr Ile
995 1000 1005
Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gln Leu Arg Phe Ile
1010 1015 1020
Ser Asp Phe Ser Ile Ser Tyr Glu Gly Phe Asn Ile Thr Phe Ser
1025 1030 1035
Glu Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe
1040 1045 1050
Ser Arg Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr
1055 1060 1065
Phe Ser Cys Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu
1070 1075 1080
Thr Cys Leu Gly Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro
1085 1090 1095
Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly
1100 1105 1110
Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp Asn Asn His
1115 1120 1125
Glu Cys Ile Tyr Lys Ile Glu Thr Glu Ala Gly Lys Gly Ile His
1130 1135 1140
Leu Arg Thr Arg Ser Phe Gln Leu Phe Glu Gly Asp Thr Leu Lys
1145 1150 1155
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe
1160 1165 1170
Thr Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn
1175 1180 1185
His Leu Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln
1190 1195 1200
Gly Phe Gln Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu
1205 1210 1215
Asp Pro Gly Ile Pro Asn Tyr Gly Tyr Arg Ile Arg Asp Glu Gly
1220 1225 1230
His Phe Thr Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr
1235 1240 1245
Ala Met His Gly Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg
1250 1255 1260
Arg Val Trp Asp Lys Pro Leu Pro Ser Cys Ile Ala Glu Cys Gly
1265 1270 1275
Gly Gln Ile His Ala Ala Thr Ser Gly Arg Ile Leu Ser Pro Gly
1280 1285 1290
Tyr Pro Ala Pro Tyr Asp Asn Asn Leu His Cys Thr Trp Ile Ile
1295 1300 1305
Glu Ala Asp Pro Gly Lys Thr Ile Ser Leu His Phe Ile Val Phe
1310 1315 1320
Asp Thr Glu Met Ala His Asp Ile Leu Lys Val Trp Asp Gly Pro
1325 1330 1335
Val Asp Ser Asp Ile Leu Leu Lys Glu Trp Ser Gly Ser Ala Leu
1340 1345 1350
Pro Glu Asp Ile His Ser Thr Phe Asn Ser Leu Thr Leu Gln Phe
1355 1360 1365
Asp Ser Asp Phe Phe Ile Ser Lys Ser Gly Phe Ser Ile Gln Phe
1370 1375 1380
Ser Thr Ser Ile Ala Ala Thr Cys Asn Asp Pro Gly Met Pro Gln
1385 1390 1395
Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu Ala Gly Asp Thr Val
1400 1405 1410
Thr Phe Gln Cys Asp Pro Gly Tyr Gln Leu Gln Gly Gln Ala Lys
1415 1420 1425
Ile Thr Cys Val Gln Leu Asn Asn Arg Phe Phe Trp Gln Pro Asp
1430 1435 1440
Pro Pro Thr Cys Ile Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro
1445 1450 1455
Ala Gly Val Ile Leu Ser Pro Asn Tyr Pro Gln Pro Tyr Pro Pro
1460 1465 1470
Gly Lys Glu Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe Val
1475 1480 1485
Ile Ala Leu Ile Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr Asp
1490 1495 1500
Phe Leu His Ile Tyr Glu Gly Glu Asp Ser Asn Ser Pro Leu Ile
1505 1510 1515
Gly Ser Tyr Gln Gly Ser Gln Ala Pro Glu Arg Ile Glu Ser Ser
1520 1525 1530
Gly Asn Ser Leu Phe Leu Ala Phe Arg Ser Asp Ala Ser Val Gly
1535 1540 1545
Leu Ser Gly Phe Ala Ile Glu Phe Lys Glu Lys Pro Arg Glu Ala
1550 1555 1560
Cys Phe Asp Pro Gly Asn Ile Met Asn Gly Thr Arg Val Gly Thr
1565 1570 1575
Asp Phe Lys Leu Gly Ser Thr Ile Thr Tyr Gln Cys Asp Ser Gly
1580 1585 1590
Tyr Lys Ile Leu Asp Pro Ser Ser Ile Thr Cys Val Ile Gly Ala
1595 1600 1605
Asp Gly Lys Pro Ser Trp Asp Gln Val Leu Pro Ser Cys Asn Ala
1610 1615 1620
Pro Cys Gly Gly Gln Tyr Thr Gly Ser Glu Gly Val Val Leu Ser
1625 1630 1635
Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gln Ile Cys Leu Tyr
1640 1645 1650
Ser Ile Thr Val Pro Lys Glu Phe Val Val Phe Gly Gln Phe Ala
1655 1660 1665
Tyr Phe Gln Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly
1670 1675 1680
Thr His Ala Gln Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His
1685 1690 1695
Ser Gly Glu Thr Leu Pro Leu Ala Thr Ser Asn Gln Ile Leu Leu
1700 1705 1710
Arg Phe Ser Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe
1715 1720 1725
Val Tyr Gln Ala Val Pro Arg Thr Ser Asp Thr Gln Cys Ser Ser
1730 1735 1740
Val Pro Glu Pro Arg Tyr Gly Arg Arg Ile Gly Ser Glu Phe Ser
1745 1750 1755
Ala Gly Ser Ile Val Arg Phe Glu Cys Asn Pro Gly Tyr Leu Leu
1760 1765 1770
Gln Gly Ser Thr Ala Leu His Cys Gln Ser Val Pro Asn Ala Leu
1775 1780 1785
Ala Gln Trp Asn Asp Thr Ile Pro Ser Cys Val Val Pro Cys Ser
1790 1795 1800
Gly Asn Phe Thr Gln Arg Arg Gly Thr Ile Leu Ser Pro Gly Tyr
1805 1810 1815
Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys Ile Trp Lys Ile Ile
1820 1825 1830
Val Thr Glu Gly Ser Gly Ile Gln Ile Gln Val Ile Ser Phe Ala
1835 1840 1845
Thr Glu Gln Asn Trp Asp Ser Leu Glu Ile His Asp Gly Gly Asp
1850 1855 1860
Val Thr Ala Pro Arg Leu Gly Ser Phe Ser Gly Thr Thr Val Pro
1865 1870 1875
Ala Leu Leu Asn Ser Thr Ser Asn Gln Leu Tyr Leu His Phe Gln
1880 1885 1890
Ser Asp Ile Ser Val Ala Ala Ala Gly Phe His Leu Glu Tyr Lys
1895 1900 1905
Thr Val Gly Leu Ala Ala Cys Gln Glu Pro Ala Leu Pro Ser Asn
1910 1915 1920
Ser Ile Lys Ile Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser
1925 1930 1935
Phe Gln Cys Glu Pro Gly Tyr Thr Leu Gln Gly Arg Ser His Ile
1940 1945 1950
Ser Cys Met Pro Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser Pro
1955 1960 1965
Leu Cys Ile Ala Thr Cys Gly Gly Thr Leu Ser Thr Leu Gly Gly
1970 1975 1980
Val Ile Leu Ser Pro Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu
1985 1990 1995
Asp Cys Thr Trp Arg Ile Ser Leu Pro Ile Gly Tyr Gly Ala His
2000 2005 2010
Ile Gln Phe Leu Asn Phe Ser Thr Glu Ala Asn His Asp Phe Leu
2015 2020 2025
Glu Ile Gln Asn Gly Pro Tyr His Thr Ser Pro Met Ile Gly Gln
2030 2035 2040
Phe Ser Gly Thr Asp Leu Pro Ala Ala Leu Leu Ser Thr Thr His
2045 2050 2055
Glu Thr Leu Ile His Phe Tyr Ser Asp His Ser Gln Asn Arg Gln
2060 2065 2070
Gly Phe Lys Leu Ala Tyr Gln Ala Tyr Glu Leu Gln Asn Cys Pro
2075 2080 2085
Asp Pro Pro Pro Phe Gln Asn Gly Tyr Met Ile Asn Ser Asp Tyr
2090 2095 2100
Ser Val Gly Gln Ser Val Ser Phe Glu Cys Tyr Pro Gly Tyr Ile
2105 2110 2115
Leu Ile Gly His Pro Val Leu Thr Cys Gln His Gly Ile Asn Arg
2120 2125 2130
Asn Trp Asn Tyr Pro Phe Pro Arg Cys Asp Ala Pro Cys Gly Tyr
2135 2140 2145
Asn Val Thr Ser Gln Asn Gly Thr Ile Tyr Ser Pro Gly Phe Pro
2150 2155 2160
Asp Glu Tyr Pro Ile Leu Lys Asp Cys Ile Trp Leu Ile Thr Val
2165 2170 2175
Pro Pro Gly His Gly Val Tyr Ile Asn Phe Thr Leu Leu Gln Thr
2180 2185 2190
Glu Ala Val Asn Asp Tyr Ile Ala Val Trp Asp Gly Pro Asp Gln
2195 2200 2205
Asn Ser Pro Gln Leu Gly Val Phe Ser Gly Asn Thr Ala Leu Glu
2210 2215 2220
Thr Ala Tyr Ser Ser Thr Asn Gln Val Leu Leu Lys Phe His Ser
2225 2230 2235
Asp Phe Ser Asn Gly Gly Phe Phe Val Leu Asn Phe His Ala Phe
2240 2245 2250
Gln Leu Lys Lys Cys Gln Pro Pro Pro Ala Val Pro Gln Ala Glu
2255 2260 2265
Met Leu Thr Glu Asp Asp Asp Phe Glu Ile Gly Asp Phe Val Lys
2270 2275 2280
Tyr Gln Cys His Pro Gly Tyr Thr Leu Val Gly Thr Asp Ile Leu
2285 2290 2295
Thr Cys Lys Leu Ser Ser Gln Leu Gln Phe Glu Gly Ser Leu Pro
2300 2305 2310
Thr Cys Glu Ala Gln Cys Pro Ala Asn Glu Val Arg Thr Gly Ser
2315 2320 2325
Ser Gly Val Ile Leu Ser Pro Gly Tyr Pro Gly Asn Tyr Phe Asn
2330 2335 2340
Ser Gln Thr Cys Ser Trp Ser Ile Lys Val Glu Pro Asn Tyr Asn
2345 2350 2355
Ile Thr Ile Phe Val Asp Thr Phe Gln Ser Glu Lys Gln Phe Asp
2360 2365 2370
Ala Leu Glu Val Phe Asp Gly Ser Ser Gly Gln Ser Pro Leu Leu
2375 2380 2385
Val Val Leu Ser Gly Asn His Thr Glu Gln Ser Asn Phe Thr Ser
2390 2395 2400
Arg Ser Asn Gln Leu Tyr Leu Arg Trp Ser Thr Asp His Ala Thr
2405 2410 2415
Ser Lys Lys Gly Phe Lys Ile Arg Tyr Ala Ala Pro Tyr Cys Ser
2420 2425 2430
Leu Thr His Pro Leu Lys Asn Gly Gly Ile Leu Asn Arg Thr Ala
2435 2440 2445
Gly Ala Val Gly Ser Lys Val His Tyr Phe Cys Lys Pro Gly Tyr
2450 2455 2460
Arg Met Val Gly His Ser Asn Ala Thr Cys Arg Arg Asn Pro Leu
2465 2470 2475
Gly Met Tyr Gln Trp Asp Ser Leu Thr Pro Leu Cys Gln Ala Val
2480 2485 2490
Ser Cys Gly Ile Pro Glu Ser Pro Gly Asn Gly Ser Phe Thr Gly
2495 2500 2505
Asn Glu Phe Thr Leu Asp Ser Lys Val Val Tyr Glu Cys His Glu
2510 2515 2520
Gly Phe Lys Leu Glu Ser Ser Gln Gln Ala Thr Ala Val Cys Gln
2525 2530 2535
Glu Asp Gly Leu Trp Ser Asn Lys Gly Lys Pro Pro Thr Cys Lys
2540 2545 2550
Pro Val Ala Cys Pro Ser Ile Glu Ala Gln Leu Ser Glu His Val
2555 2560 2565
Ile Trp Arg Leu Val Ser Gly Ser Leu Asn Glu Tyr Gly Ala Gln
2570 2575 2580
Val Leu Leu Ser Cys Ser Pro Gly Tyr Tyr Leu Glu Gly Trp Arg
2585 2590 2595
Leu Leu Arg Cys Gln Ala Asn Gly Thr Trp Asn Ile Gly Asp Glu
2600 2605 2610
Arg Pro Ser Cys Arg Val Ile Ser Cys Gly Ser Leu Ser Phe Pro
2615 2620 2625
Pro Asn Gly Asn Lys Ile Gly Thr Leu Thr Val Tyr Gly Ala Thr
2630 2635 2640
Ala Ile Phe Thr Cys Asn Thr Gly Tyr Thr Leu Val Gly Ser His
2645 2650 2655
Val Arg Glu Cys Leu Ala Asn Gly Leu Trp Ser Gly Ser Glu Thr
2660 2665 2670
Arg Cys Leu Ala Gly His Cys Gly Ser Pro Asp Pro Ile Val Asn
2675 2680 2685
Gly His Ile Ser Gly Asp Gly Phe Ser Tyr Arg Asp Thr Val Val
2690 2695 2700
Tyr Gln Cys Asn Pro Gly Phe Arg Leu Val Gly Thr Ser Val Arg
2705 2710 2715
Ile Cys Leu Gln Asp His Lys Trp Ser Gly Gln Thr Pro Val Cys
2720 2725 2730
Val Pro Ile Thr Cys Gly His Pro Gly Asn Pro Ala His Gly Phe
2735 2740 2745
Thr Asn Gly Ser Glu Phe Asn Leu Asn Asp Val Val Asn Phe Thr
2750 2755 2760
Cys Asn Thr Gly Tyr Leu Leu Gln Gly Val Ser Arg Ala Gln Cys
2765 2770 2775
Arg Ser Asn Gly Gln Trp Ser Ser Pro Leu Pro Thr Cys Arg Val
2780 2785 2790
Val Asn Cys Ser Asp Pro Gly Phe Val Glu Asn Ala Ile Arg His
2795 2800 2805
Gly Gln Gln Asn Phe Pro Glu Ser Phe Glu Tyr Gly Met Ser Ile
2810 2815 2820
Leu Tyr His Cys Lys Lys Lys Gly Phe Tyr Leu Leu Gly Ser Ser Ser Ala
2825 2830 2835
Leu Thr Cys Met Ala Asn Gly Leu Trp Asp Arg Ser Leu Pro Lys
2840 2845 2850
Cys Leu Ala Ile Ser Cys Gly His Pro Gly Val Pro Ala Asn Ala
2855 2860 2865
Val Leu Thr Gly Glu Leu Phe Thr Tyr Gly Ala Val Val His Tyr
2870 2875 2880
Ser Cys Arg Gly Ser Glu Ser Leu Ile Gly Asn Asp Thr Arg Val
2885 2890 2895
Cys Gln Glu Asp Ser His Trp Ser Gly Ala Leu Pro His Cys Thr
2900 2905 2910
Gly Asn Asn Pro Gly Phe Cys Gly Asp Pro Gly Thr Pro Ala His
2915 2920 2925
Gly Ser Arg Leu Gly Asp Asp Phe Lys Thr Lys Ser Leu Leu Arg
2930 2935 2940
Phe Ser Cys Glu Met Gly His Gln Leu Arg Gly Ser Pro Glu Arg
2945 2950 2955
Thr Cys Leu Leu Asn Gly Ser Trp Ser Gly Leu Gln Pro Val Cys
2960 2965 2970
Glu Ala Val Ser Cys Gly Asn Pro Gly Thr Pro Thr Asn Gly Met
2975 2980 2985
Ile Val Ser Ser Asp Gly Ile Leu Phe Ser Ser Ser Val Ile Tyr
2990 2995 3000
Ala Cys Trp Glu Gly Tyr Lys Thr Ser Gly Leu Met Thr Arg His
3005 3010 3015
Cys Thr Ala Asn Gly Thr Trp Thr Gly Thr Ala Pro Asp Cys Thr
3020 3025 3030
Ile Ile Ser Cys Gly Asp Pro Gly Thr Leu Ala Asn Gly Ile Gln
3035 3040 3045
Phe Gly Thr Asp Phe Thr Phe Asn Lys Thr Val Ser Tyr Gln Cys
3050 3055 3060
Asn Pro Gly Tyr Val Met Glu Ala Val Thr Ser Ala Thr Ile Arg
3065 3070 3075
Cys Thr Lys Asp Gly Arg Trp Asn Pro Ser Lys Pro Val Cys Lys
3080 3085 3090
Ala Val Leu Cys Pro Gln Pro Pro Pro Val Gln Asn Gly Thr Val
3095 3100 3105
Glu Gly Ser Asp Phe Arg Trp Gly Ser Ser Ile Ser Tyr Ser Cys
3110 3115 3120
Met Asp Gly Tyr Gln Leu Ser His Ser Ala Ile Leu Ser Cys Glu
3125 3130 3135
Gly Arg Gly Val Trp Lys Gly Glu Ile Pro Gln Cys Leu Pro Val
3140 3145 3150
Phe Cys Gly Asp Pro Gly Ile Pro Ala Glu Gly Arg Leu Ser Gly
3155 3160 3165
Lys Ser Phe Thr Tyr Lys Ser Glu Val Phe Phe Gln Cys Lys Ser
3170 3175 3180
Pro Phe Ile Leu Val Gly Ser Ser Arg Arg Val Cys Gln Ala Asp
3185 3190 3195
Gly Thr Trp Ser Gly Ile Gln Pro Thr Cys Ile Asp Pro Ala His
3200 3205 3210
Asn Thr Cys Pro Asp Pro Gly Thr Pro His Phe Gly Ile Gln Asn
3215 3220 3225
Ser Ser Arg Gly Tyr Glu Val Gly Ser Thr Val Phe Phe Arg Cys
3230 3235 3240
Arg Lys Gly Tyr His Ile Gln Gly Ser Thr Thr Arg Thr Cys Leu
3245 3250 3255
Ala Asn Leu Thr Trp Ser Gly Ile Gln Thr Glu Cys Ile Pro His
3260 3265 3270
Ala Cys Arg Gln Pro Glu Thr Pro Ala His Ala Asp Val Arg Ala
3275 3280 3285
Ile Asp Leu Pro Thr Phe Gly Tyr Thr Leu Val Tyr Thr Cys His
3290 3295 3300
Pro Gly Phe Phe Leu Ala Gly Gly Ser Glu His Arg Thr Cys Lys
3305 3310 3315
Ala Asp Met Lys Trp Thr Gly Lys Ser Pro Val Cys Lys Ser Lys
3320 3325 3330
Gly Val Arg Glu Val Asn Glu Thr Val Thr Lys Thr Pro Val Pro
3335 3340 3345
Ser Asp Val Phe Phe Val Asn Ser Leu Trp Lys Gly Tyr Tyr Glu
3350 3355 3360
Tyr Leu Gly Lys Arg Gln Pro Ala Thr Leu Thr Val Asp Trp Phe
3365 3370 3375
Asn Ala Thr Ser Ser Lys Val Asn Ala Thr Phe Ser Glu Ala Ser
3380 3385 3390
Pro Val Glu Leu Lys Leu Thr Gly Ile Tyr Lys Lys Glu Glu Ala
3395 3400 3405
His Leu Leu Leu Lys Ala Phe Gln Ile Lys Gly Gln Ala Asp Ile
3410 3415 3420
Phe Val Ser Lys Phe Glu Asn Asp Asn Trp Gly Leu Asp Gly Tyr
3425 3430 3435
Val Ser Ser Gly Leu Glu Arg Gly Gly Phe Thr Phe Gln Gly Asp
3440 3445 3450
Ile His Gly Lys Asp Phe Gly Lys Phe Lys Leu Glu Arg Gln Asp
3455 3460 3465
Pro Leu Asn Pro Asp Gln Asp Ser Ser Ser His Tyr His Gly Thr
3470 3475 3480
Ser Ser Gly Ser Val Ala Ala Ala Ile Leu Val Pro Phe Phe Ala
3485 3490 3495
Leu Ile Leu Ser Gly Phe Ala Phe Tyr Leu Tyr Lys His Arg Thr
3500 3505 3510
Arg Pro Lys Val Gln Tyr Asn Gly Tyr Ala Gly His Glu Asn Ser
3515 3520 3525
Asn Gly Gln Ala Ser Phe Glu Asn Pro Met Tyr Asp Thr Asn Leu
3530 3535 3540
Lys Pro Thr Glu Ala Lys Ala Val Arg Phe Asp Thr Thr Leu Asn
3545 3550 3555
Thr Val Cys Thr Val Val
3560
<![CDATA[ <210> 309]]>
<![CDATA[ <211> 4129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 309]]>
ctccacggca gcggctcctt gtgccactag cagcccttct tctgcgctct ccgccttttc 60
tctctagact ggatctctcc tcccccccgc gcccccctcc ccgcatctcc cactcgctgg 120
ctctctctcc agctgcctcc tctccaggtc tctcctggct gcgcgcgctc ctctccccgc 180
ttctccccct cccgcagcct cgccgccttg gtgccttcct gcccggctcg gccggcgctc 240
gtccccggcc ccggccccgc cagcccgggt ctccgcgctc ggagcagctc agccctgcag 300
tggctcggga cccgatgcta tgagagggaa gcgagccggg cgcccagacc ttcaggaggc 360
gtcggatgcg cggcgggtct tgggaccggg ctctctctcc ggctcgcctt gccctcgggt 420
gattatttgg ctccgctcat agccctgcct tcctcggagg agccatcggt gtcgcgtgcg 480
tgtggagtat ctgcagacat gactgcgtgg aggagattcc agtcgctgct cctgcttctc 540
gggctgctgg tgctgtgcgc gaggctcctc actgcagcga agggtcagaa ctgtggaggc 600
ttagtccagg gtcccaatgg cactattgag agcccagggt ttcctcacgg gtatccgaac 660
tatgccaact gcacctggat catcatcacg ggcgagcgca ataggataca gttgtccttc 720
catacctttg ctcttgaaga agattttgat attttatcag tttacgatgg acagcctcaa 780
caagggaatt taaaagtgag attatcggga tttcagctgc cctcctctat agtgagtaca 840
ggatctatcc tcactctgtg gttcacgaca gacttcgctg tgagtgccca aggtttcaaa 900
gcatttatg aagttttacc tagccaacact tgtggaaatc ctggagaaat cctgaaagga 960
gttctgcatg gaacgagatt caacatagga gacaaaatcc ggtacagctg cctccctggc 1020
tacatcttgg aaggccacgc catcctgacc tgcatcgtca gcccaggaaa tggtgcatcg 1080
tgggacttcc cagctccctt ttgcagagct gagggagcct gcggaggaac cttacgcggg 1140
accagcagct ccatctccag cccgcacttc ccttcagagt acgagaacaa cgcggactgc 1200
acctggacca ttctggctga gcccggggac accattgcgc tggtcttcac tgactttcag 1260
ctagaagaag gatatgattt cttagagatc agtggcacgg aagctccatc catatggcta 1320
actggcatga acctcccctc tccagttatc agtagcaaga attggctacg actccatttc 1380
acctctgaca gcaaccaccg acgcaaagga tttaacgctc agttccaagt gaaaaaggcg 1440
attgagttga agtcaagagg agtcaagatg ctgcccagca aggatggaag ccataaaaac 1500
tctgtcttga gccaaggagg tgttgcattg gtctctgaca tgtgtccaga tcctggggatt 1560
ccagaaaatg gtagaagagc aggttccgac ttcagggttg gtgcaaatgt acagttttca 1620
tgtgaggaca attacgtgct ccagggatct aaaagcatca cctgtcagag agttacagag 1680
acgctcgctg cttggagtga ccacaggccc atctgccgag cgagaacatg tggatccaat 1740
ctgcgtgggc ccagcggcgt cattacctcc cctaattatc cggttcagta tgaagataat 1800
gcacactgtg tgtgggtcat caccaccacc gacccggaca aggtcatcaa gcttgccttt 1860
gaagagtttg agctggagcg aggctatgac accctgacgg ttggtgatgc tgggaaggtg 1920
ggagacacca gatcggtctt gtacgtgctc acgggatcca gtgttcctga cctcattgtg 1980
agcatgagca accagatgtg gctacatctg cagtcggatg atagcattgg ctcacctggg 2040
tttaaagctg tttaccaaga aattgaaaag gggaggtgtggggatcctgg aatccccgcc 2100
tatgggaagc ggacggggcag cagtttcctc catggagata cactcacctt tgaatgcccg 2160
gcggcctttg agctggtggg ggagagagtt atcacctgtc agcagaacaa tcagtggtct 2220
ggcaacaagc ccagctgtgt attttcatgt ttcttcaact ttacggcatc atctgggatt 2280
attctgtcac caaattatcc agaggaatat gggaacaaca tgaactgtgt ctggttgatt 2340
atctcggagc caggaagtcg aattcaccta atctttaatg attttgatgt tgagcctcag 2400
tttgactttc tcgcggtcaa ggatgatggc atttctgaca taactgtcct gggtactttt 2460
tctggcaatg aagtgccttc ccagctggcc agcagtgggc atatagttcg cttggaattt 2520
cagtctgacc attccactac tggcagaggg ttcaacatca cttacaccac atttggtcag 2580
aatgagtgcc atgatcctgg cattcctata aacggacgac gttttggtga caggtttcta 2640
ctcggggagct cggtttcttt ccactgtgat gatggctttg tcaagaccca gggatccgag 2700
tccattacct gcatactgca agacgggaac gtggtctgga gctccaccgt gccccgctgt 2760
gaagctccat gtggtggaca tctgacagcg tccagcggag tcattttgcc tcctggatgg 2820
ccaggatatt ataaggattc tttacattgt gaatggataa ttgaagcaaa accaggccac 2880
tctatcaaaa taacttttga cagatttcag acagaggtca attatgacac cttggaggtc 2940
agagatgggc cagccagttc gtccccactg atcggcgagt accacggcac ccaggcaccc 3000
cagttcctca tcagcaccgg gaacttcatg tacctgctgt tcaccactga caacagccgc 3060
tccagcatcg gcttcctcat ccactatgag agtgtgacgc ttgagtcgga ttcctgcctg 3120
gacccgggca tccctgtgaa cggccatcgc cacggtggag actttggcat caggtccaca 3180
gtgactttca gctgtgaccc ggggtacaca ctaagtgacg acgagcccct cgtctgtgag 3240
aggaaccacc agtggaacca cgccttgccc agctgcgacg ctctatgtgg aggctacatc 3300
caagggaaga gtggaacagt cctttctcct gggtttccag atttttatcc aaactctcta 3360
aactgcacgt ggaccatga agtgtctcat gggaaaggag ttcaaatgat ctttcacacc 3420
tttcatcttg agagttccca cgactattta ctgatcacag aggatggaag tttttccgag 3480
cccgttgcca ggctcaccgg gtcggtgttg cctcatacga tcaaggcagg cctgtttgga 3540
aacttcactg cccagcttcg gtttatatca gacttctcaa tttcgtacga gggcttcaat 3600
atcacatttt cagaatatga cctggagcca tgtgatgatc ctggagtccc tgccttcagc 3660
cgaagaattg gttttcactt tggtgtggga gactctctga cgttttcctg cttcctggga 3720
tatcgtttag aaggtgccac caagcttacc tgcctgggtg ggggccgccg tgtgtggagt 3780
gcacctctgc caaggtgtgt ggccgaatgt ggagcaagtg tcaaaggaaa tgaaggaaca 3840
ttactgtctc caaattttcc atccaattat gataataacc atgagtgtat ctataaaata 3900
gaaacagaag ccggcaaggg catccacctt agaacacgaa gcttccagct gtttgaagga 3960
gatactctaa aggtatatga tggaaaagac agttcctcac gtccactggg cacgttcact 4020
aaaaatgaac ttctggggct gatcctaaac agcacatcca atcacctgtg gctagagttc 4080
aacaccaatg gatctgacac cgaccaaggt tttcaactca cctatacca 4129
<![CDATA[ <210> 310]]>
<![CDATA[ <211> 1210]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 310]]>
Met Thr Ala Trp Arg Arg Phe Gln Ser Leu Leu Leu Leu Leu Leu Gly Leu
1 5 10 15
Leu Val Leu Cys Ala Arg Leu Leu Thr Ala Ala Lys Gly Gln Asn Cys
20 25 30
Gly Gly Leu Val Gln Gly Pro Asn Gly Thr Ile Glu Ser Pro Gly Phe
35 40 45
Pro His Gly Tyr Pro Asn Tyr Ala Asn Cys Thr Trp Ile Ile Ile Thr
50 55 60
Gly Glu Arg Asn Arg Ile Gln Leu Ser Phe His Thr Phe Ala Leu Glu
65 70 75 80
Glu Asp Phe Asp Ile Leu Ser Val Tyr Asp Gly Gln Pro Gln Gln Gly
85 90 95
Asn Leu Lys Val Arg Leu Ser Gly Phe Gln Leu Pro Ser Ser Ile Val
100 105 110
Ser Thr Gly Ser Ile Leu Thr Leu Trp Phe Thr Thr Asp Phe Ala Val
115 120 125
Ser Ala Gln Gly Phe Lys Ala Leu Tyr Glu Val Leu Pro Ser His Thr
130 135 140
Cys Gly Asn Pro Gly Glu Ile Leu Lys Gly Val Leu His Gly Thr Arg
145 150 155 160
Phe Asn Ile Gly Asp Lys Ile Arg Tyr Ser Cys Leu Pro Gly Tyr Ile
165 170 175
Leu Glu Gly His Ala Ile Leu Thr Cys Ile Val Ser Pro Gly Asn Gly
180 185 190
Ala Ser Trp Asp Phe Pro Ala Pro Phe Cys Arg Ala Glu Gly Ala Cys
195 200 205
Gly Gly Thr Leu Arg Gly Thr Ser Ser Ser Ile Ser Ser Pro His Phe
210 215 220
Pro Ser Glu Tyr Glu Asn Asn Ala Asp Cys Thr Trp Thr Ile Leu Ala
225 230 235 240
Glu Pro Gly Asp Thr Ile Ala Leu Val Phe Thr Asp Phe Gln Leu Glu
245 250 255
Glu Gly Tyr Asp Phe Leu Glu Ile Ser Gly Thr Glu Ala Pro Ser Ile
260 265 270
Trp Leu Thr Gly Met Asn Leu Pro Ser Pro Val Ile Ser Ser Lys Asn
275 280 285
Trp Leu Arg Leu His Phe Thr Ser Asp Ser Asn His Arg Arg Lys Gly
290 295 300
Phe Asn Ala Gln Phe Gln Val Lys Lys Ala Ile Glu Leu Lys Ser Arg
305 310 315 320
Gly Val Lys Met Leu Pro Ser Lys Asp Gly Ser His Lys Asn Ser Val
325 330 335
Leu Ser Gln Gly Gly Val Ala Leu Val Ser Asp Met Cys Pro Asp Pro
340 345 350
Gly Ile Pro Glu Asn Gly Arg Arg Ala Gly Ser Asp Phe Arg Val Gly
355 360 365
Ala Asn Val Gln Phe Ser Cys Glu Asp Asn Tyr Val Leu Gln Gly Ser
370 375 380
Lys Ser Ile Thr Cys Gln Arg Val Thr Glu Thr Leu Ala Ala Trp Ser
385 390 395 400
Asp His Arg Pro Ile Cys Arg Ala Arg Thr Cys Gly Ser Asn Leu Arg
405 410 415
Gly Pro Ser Gly Val Ile Thr Ser Pro Asn Tyr Pro Val Gln Tyr Glu
420 425 430
Asp Asn Ala His Cys Val Trp Val Ile Thr Thr Thr Asp Pro Asp Lys
435 440 445
Val Ile Lys Leu Ala Phe Glu Glu Phe Glu Leu Glu Arg Gly Tyr Asp
450 455 460
Thr Leu Thr Val Gly Asp Ala Gly Lys Val Gly Asp Thr Arg Ser Val
465 470 475 480
Leu Tyr Val Leu Thr Gly Ser Ser Val Pro Asp Leu Ile Val Ser Met
485 490 495
Ser Asn Gln Met Trp Leu His Leu Gln Ser Asp Asp Ser Ile Gly Ser
500 505 510
Pro Gly Phe Lys Ala Val Tyr Gln Glu Ile Glu Lys Gly Gly Cys Gly
515 520 525
Asp Pro Gly Ile Pro Ala Tyr Gly Lys Arg Thr Gly Ser Ser Phe Leu
530 535 540
His Gly Asp Thr Leu Thr Phe Glu Cys Pro Ala Ala Phe Glu Leu Val
545 550 555 560
Gly Glu Arg Val Ile Thr Cys Gln Gln Asn Asn Gln Trp Ser Gly Asn
565 570 575
Lys Pro Ser Cys Val Phe Ser Cys Phe Phe Asn Phe Thr Ala Ser Ser
580 585 590
Gly Ile Ile Leu Ser Pro Asn Tyr Pro Glu Glu Tyr Gly Asn Asn Met
595 600 605
Asn Cys Val Trp Leu Ile Ile Ser Glu Pro Gly Ser Arg Ile His Leu
610 615 620
Ile Phe Asn Asp Phe Asp Val Glu Pro Gln Phe Asp Phe Leu Ala Val
625 630 635 640
Lys Asp Asp Gly Ile Ser Asp Ile Thr Val Leu Gly Thr Phe Ser Gly
645 650 655
Asn Glu Val Pro Ser Gln Leu Ala Ser Ser Gly His Ile Val Arg Leu
660 665 670
Glu Phe Gln Ser Asp His Ser Thr Thr Gly Arg Gly Phe Asn Ile Thr
675 680 685
Tyr Thr Thr Phe Gly Gln Asn Glu Cys His Asp Pro Gly Ile Pro Ile
690 695 700
Asn Gly Arg Arg Phe Gly Asp Arg Phe Leu Leu Gly Ser Ser Val Ser
705 710 715 720
Phe His Cys Asp Asp Gly Phe Val Lys Thr Gln Gly Ser Glu Ser Ile
725 730 735
Thr Cys Ile Leu Gln Asp Gly Asn Val Val Trp Ser Ser Thr Val Pro
740 745 750
Arg Cys Glu Ala Pro Cys Gly Gly His Leu Thr Ala Ser Ser Gly Val
755 760 765
Ile Leu Pro Pro Gly Trp Pro Gly Tyr Tyr Lys Asp Ser Leu His Cys
770 775 780
Glu Trp Ile Ile Glu Ala Lys Pro Gly His Ser Ile Lys Ile Thr Phe
785 790 795 800
Asp Arg Phe Gln Thr Glu Val Asn Tyr Asp Thr Leu Glu Val Arg Asp
805 810 815
Gly Pro Ala Ser Ser Ser Pro Leu Ile Gly Glu Tyr His Gly Thr Gln
820 825 830
Ala Pro Gln Phe Leu Ile Ser Thr Gly Asn Phe Met Tyr Leu Leu Phe
835 840 845
Thr Thr Asp Asn Ser Arg Ser Ser Ser Ile Gly Phe Leu Ile His Tyr Glu
850 855 860
Ser Val Thr Leu Glu Ser Asp Ser Cys Leu Asp Pro Gly Ile Pro Val
865 870 875 880
Asn Gly His Arg His Gly Gly Asp Phe Gly Ile Arg Ser Thr Val Thr
885 890 895
Phe Ser Cys Asp Pro Gly Tyr Thr Leu Ser Asp Asp Glu Pro Leu Val
900 905 910
Cys Glu Arg Asn His Gln Trp Asn His Ala Leu Pro Ser Cys Asp Ala
915 920 925
Leu Cys Gly Gly Tyr Ile Gln Gly Lys Ser Gly Thr Val Leu Ser Pro
930 935 940
Gly Phe Pro Asp Phe Tyr Pro Asn Ser Leu Asn Cys Thr Trp Thr Ile
945 950 955 960
Glu Val Ser His Gly Lys Gly Val Gln Met Ile Phe His Thr Phe His
965 970 975
Leu Glu Ser Ser His Asp Tyr Leu Leu Ile Thr Glu Asp Gly Ser Phe
980 985 990
Ser Glu Pro Val Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr Ile
995 1000 1005
Lys Ala Gly Leu Phe Gly Asn Phe Thr Ala Gln Leu Arg Phe Ile
1010 1015 1020
Ser Asp Phe Ser Ile Ser Tyr Glu Gly Phe Asn Ile Thr Phe Ser
1025 1030 1035
Glu Tyr Asp Leu Glu Pro Cys Asp Asp Pro Gly Val Pro Ala Phe
1040 1045 1050
Ser Arg Arg Ile Gly Phe His Phe Gly Val Gly Asp Ser Leu Thr
1055 1060 1065
Phe Ser Cys Phe Leu Gly Tyr Arg Leu Glu Gly Ala Thr Lys Leu
1070 1075 1080
Thr Cys Leu Gly Gly Gly Arg Arg Val Trp Ser Ala Pro Leu Pro
1085 1090 1095
Arg Cys Val Ala Glu Cys Gly Ala Ser Val Lys Gly Asn Glu Gly
1100 1105 1110
Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr Asp Asn Asn His
1115 1120 1125
Glu Cys Ile Tyr Lys Ile Glu Thr Glu Ala Gly Lys Gly Ile His
1130 1135 1140
Leu Arg Thr Arg Ser Phe Gln Leu Phe Glu Gly Asp Thr Leu Lys
1145 1150 1155
Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr Phe
1160 1165 1170
Thr Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn
1175 1180 1185
His Leu Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln
1190 1195 1200
Gly Phe Gln Leu Thr Tyr Thr
1205 1210
<![CDATA[ <210> 311]]>
<![CDATA[ <211> 102]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 311]]>
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 60
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca ag 102
<![CDATA[ <210> 312]]>
<![CDATA[ <211> 103]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 312]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaaccagt tctgaatact cctctctcag att 103
<![CDATA[ <210> 313]]>
<![CDATA[ <211> 205]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 313]]>
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 60
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca agccttgcct ccccgattga 120
aagagatgaa aagccaggaa tcggctgcag gttccaaact agtccttcgg tgtgaaacca 180
gttctgaata ctcctctctc agatt 205
<![CDATA[ <210> 314]]>
<![CDATA[ <211> 67]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 314]]>
Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn Ala Glu Cys Gln Lys
1 5 10 15
Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu Thr Lys Pro Lys Pro
20 25 30
Gln Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Ala
35 40 45
Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser
50 55 60
Ser Leu Arg
65
<![CDATA[ <210> 315]]>
<![CDATA[ <211> 532]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 315]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg ag 532
<![CDATA[ <210> 316]]>
<![CDATA[ <211> 116]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 316]]>
aatgcaggct caacagtacc agcagcagcg tcgaaaattt gcagctgcct tcttggcatt 60
cattttcata ctggcagctg tggatactgc tgaagcaggg aagaaagaga aaccag 116
<![CDATA[ <210> 317]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 317]]>
aaaaaaaagt gaagaagtct gactgtggag aatggcagtg gagtgtgtgtgtgcccacca 60
gtggagactg tgggctgggc acacgggagg gcactcggac tggagctgag tgcaagcaaa 120
ccatgaagac ccagagatgt aagatcccct gcaactggaa gaagcaattt ggcg 174
<![CDATA[ <210> 318]]>
<![CDATA[ <211> 162]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 318]]>
cggagtgcaa ataccagttc caggcctggg gagaatgtga cctgaacaca gccctgaaga 60
ccagaactgg aagtctgaag cgagccctgc acaatgccga atgccagaag actgtcacca 120
tctccaagcc ctgtggcaaa ctgaccaagc ccaaacctca ag 162
<![CDATA[ <210> 319]]>
<![CDATA[ <211> 735]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 319]]>
cagaatctaa gaagaagaaa aaggaaggca agaaacagga gaagatgctg gattaaaaga 60
tgtcacctgt ggaacataaa aaggacatca gcaaacagga tcagttaact attgcattta 120
tatgtaccgt aggctttgta ttcaaaaatt atctatagct aagtacacaa taagcaaaaa 180
caaaaagaaa agaaaatttt tgtagtagcg ttttttaaat gtatactata gtaccagtag 240
gggcttataa taaaggactg taatcttatt taggaagttg acttatagta catgataaat 300
gatagacaat tgaggtaagt tttttgaaat tatgtgacat tttacattaa atttttttta 360
cattttttgg gcagcaattt aaatgttatg actatgtaaa ctacttctct tgttaggtaa 420
tttttttcac ctagattttt ttcccaattg agaaaaatat atactaaaca aaatagcaat 480
aaaacataat cactctattt gaagaaaata tcttgttttc tgccaataga ttttttaaaa 540
tgtagtcagc aaaatggggg tggggaagca gagcatgtcc tagttcaatg ttgacttttt 600
ttttttttaa agaaaagcat taagacataa aattctttca ctttggcaga agcatttgtt 660
ttcttgatga aattattttt ccatctgagg aaaaaaatac taggaaaata aatcaaggtg 720
atgctgaaaaaaaaa 735
<![CDATA[ <210> 320]]>
<![CDATA[ <211> 1719]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 320]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg agaatgcagg 540
ctcaacagta ccagcagcag cgtcgaaaat ttgcagctgc cttcttggca ttcattttca 600
tactggcagc tgtggatact gctgaagcag ggaagaaaga gaaaccagaa aaaaaagtga 660
agaagtctga ctgtggagaa tggcagtgga gtgtgtgtgt gcccaccagt ggagactgtg 720
ggctgggcac acgggagggc actcggactg gagctgagtg caagcaaacc atgaagaccc 780
agagatgtaa gatcccctgc aactggaaga agcaatttgg cgcggagtgc aaataccagt 840
tccaggcctg gggagaatgt gacctgaaca cagccctgaa gaccagaact ggaagtctga 900
agcgagccct gcacaatgcc gaatgccaga agactgtcac catctccaag ccctgtggca 960
aactgaccaa gcccaaacct caagcagaat ctaagaagaa gaaaaaggaa ggcaagaaac 1020
aggagaagat gctggattaa aagatgtcac ctgtggaaca taaaaaggac atcagcaaac 1080
aggatcagtt aactattgca tttatatgta ccgtaggctt tgtattcaaa aattatctat 1140
agctaagtac acaataagca aaaacaaaaa gaaaagaaaa tttttgtagt agcgtttttt 1200
aaatgtatac tatagtacca gtaggggctt ataataaagg actgtaatct tattaggaa 1260
gttgacttat agtacatgat aaatgataga caattgaggt aagttttttg aaattatgtg 1320
acattttaca ttaaattttt tttacatttt ttgggcagca atttaaatgt tatgactatg 1380
taaactactt ctcttgttag gtaatttttt tcacctagat ttttttccca attgagaaaa 1440
atatatacta aacaaaatag caataaaaca taatcactct atttgaagaa aatatcttgt 1500
tttctgccaa tagatttttt aaaatgtagt cagcaaaatgggggtgggga agcagagcat 1560
gtcctagttc aatgttgact tttttttttt ttaaagaaaa gcattaagac ataaaattct 1620
ttcactttgg cagaagcatt tgttttcttg atgaaattat ttttccatct gaggaaaaaa 1680
atactaggaa aataaatcaa ggtgatgctg aaaaaaaaa 1719
<![CDATA[ <210> 321]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 321]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro
35
<![CDATA[ <210> 322]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 322]]>
Lys Lys Val Lys Lys Ser Asp Cys Gly Glu Trp Gln Trp Ser Val Cys
1 5 10 15
Val Pro Thr Ser Gly Asp Cys Gly Leu Gly Thr Arg Glu Gly Thr Arg
20 25 30
Thr Gly Ala Glu Cys Lys Gln Thr Met Lys Thr Gln Arg Cys Lys Ile
35 40 45
Pro Cys Asn Trp Lys Lys Gln Phe Gly
50 55
<![CDATA[ <210> 323]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 323]]>
Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn Thr
1 5 10 15
Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn Ala
20 25 30
Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu Thr
35 40 45
Lys Pro Lys Pro Gln
50
<![CDATA[ <210> 324]]>
<![CDATA[ <211> 17]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 324]]>
Glu Ser Lys Lys Lys Lys Lys Lys Glu Gly Lys Lys Gln Glu Lys Met Leu
1 5 10 15
Asp
<![CDATA[ <210> 325]]>
<![CDATA[ <211> 168]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 325]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro Glu Lys Lys Val Lys Lys Ser Asp Cys Gly
35 40 45
Glu Trp Gln Trp Ser Val Cys Val Pro Thr Ser Gly Asp Cys Gly Leu
50 55 60
Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys Gln Thr Met
65 70 75 80
Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys Gln Phe Gly
85 90 95
Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn
100 105 110
Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn
115 120 125
Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu
130 135 140
Thr Lys Pro Lys Pro Gln Ala Glu Ser Lys Lys Lys Lys Lys Lys Glu Gly
145 150 155 160
Lys Lys Gln Glu Lys Met Leu Asp
165
<![CDATA[ <210> 326]]>
<![CDATA[ <211> 984]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 326]]>
aaggggaaat aagggacaag agagaccctc tcatattgtt ttatattatt tcatactcag 60
aaaaggaaag agaagccaaa caaaaggcag gtaacccagc gcctaggaac cagacccgaa 120
accaaggaac cagatctgaa accaggcctg ggcctgcctg acctaagcct ggtagtaaaa 180
attccacccc tgacctgacc tggcaactgt tgttatctac agattccaga cattgtatgg 240
aaggacactg tgaaacctcc cgttctgttc tgtttcactc tgaccatcgg tgctcacagc 300
ccctatcacg taccccctgg cttgctcagt cgatcacgac cctctcacgt ggaccccctt 360
agagttgtta gcccttaaaa gggacagaag ttgagcacct gaggagctca gattttaaga 420
cgctaggctg ctgatgctcc cagctgatta aagccactcc cttcactatc tcggtgtctc 480
ctgtccgcgg ctcgtcctgc tacatttctt ggttccctga ccggcaagcg agaatgcagg 540
ctcaacagta ccagcagcag cgtcgaaaat ttgcagctgc cttcttggca ttcattttca 600
tactggcagc tgtggatact gctgaagcag ggaagaaaga gaaaccagaa aaaaaagtga 660
agaagtctga ctgtggagaa tggcagtgga gtgtgtgtgt gcccaccagt ggagactgtg 720
ggctgggcac acgggagggc actcggactg gagctgagtg caagcaaacc atgaagaccc 780
agagatgtaa gatcccctgc aactggaaga agcaatttgg cgcggagtgc aaataccagt 840
tccaggcctg gggagaatgt gacctgaaca cagccctgaa gaccagaact ggaagtctga 900
agcgagccct gcacaatgcc gaatgccaga agactgtcac catctccaag ccctgtggca 960
aactgaccaa gcccaaacct caag 984
<![CDATA[ <210> 327]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 327]]>
Met Gln Ala Gln Gln Tyr Gln Gln Gln Arg Arg Lys Phe Ala Ala Ala
1 5 10 15
Phe Leu Ala Phe Ile Phe Ile Leu Ala Ala Val Asp Thr Ala Glu Ala
20 25 30
Gly Lys Lys Glu Lys Pro Glu Lys Lys Val Lys Lys Ser Asp Cys Gly
35 40 45
Glu Trp Gln Trp Ser Val Cys Val Pro Thr Ser Gly Asp Cys Gly Leu
50 55 60
Gly Thr Arg Glu Gly Thr Arg Thr Gly Ala Glu Cys Lys Gln Thr Met
65 70 75 80
Lys Thr Gln Arg Cys Lys Ile Pro Cys Asn Trp Lys Lys Gln Phe Gly
85 90 95
Ala Glu Cys Lys Tyr Gln Phe Gln Ala Trp Gly Glu Cys Asp Leu Asn
100 105 110
Thr Ala Leu Lys Thr Arg Thr Gly Ser Leu Lys Arg Ala Leu His Asn
115 120 125
Ala Glu Cys Gln Lys Thr Val Thr Ile Ser Lys Pro Cys Gly Lys Leu
130 135 140
Thr Lys Pro Lys Pro Gln
145 150
<![CDATA[ <210> 328]]>
<![CDATA[ <211> 95]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 328]]>
caacagcaac aagatcacag ttcgcttcca ctcagatcag tcctacaccg acaccggctt 60
cttagctgaa tacctctcct acgactccag tgacc 95
<![CDATA[ <210> 329]]>
<![CDATA[ <211> 87]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 329]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[ <210> 330]]>
<![CDATA[ <211> 182]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 330]]>
caacagcaac aagatcacag ttcgcttcca ctcagatcag tcctacaccg acaccggctt 60
cttagctgaa taccctctcct acgactccag tgaccctaca tctacatcca ccactgggac 120
aagccatctt gtaaaatgtg cggagaagga gaaaactttc tgtgtgaatg gaggggagtg 180
ct 182
<![CDATA[ <210> 331]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 331]]>
Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp Gln Ser Tyr Thr
1 5 10 15
Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp Ser Ser Ser Asp Pro
20 25 30
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
35 40 45
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
<![CDATA[ <210> 332]]>
<![CDATA[ <211> 279]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 332]]>
gtgagagcgg agctgcagcc ggagaaagag gaagaggggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcgggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgag 279
<![CDATA[ <210> 333]]>
<![CDATA[ <211> 160]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 333]]>
aaagtgaatg gcttggagga aggcgtggag ttcctgccag tcaacaacgt caagaaggtg 60
gaaaagcatg gcccggggcg ctgggtggtg ctggcagccg tgctgatcgg cctcctcttg 120
gtcttgctgg ggatcggctt cctggtgtgg catttgcagt 160
<![CDATA[ <210> 334]]>
<![CDATA[ <211> 128]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 334]]>
accgggacgt gcgtgtccag aaggtcttca atggctacat gaggatcaca aatgagaatt 60
ttgtggatgc ctacgagaac tccaactcca ctgagtttgt aagcctggcc agcaaggtga 120
aggacgcg 128
<![CDATA[ <210> 335]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 335]]>
ctgaagctgc tgtacagcgg agtcccattc ctgggcccct accacaagga gtcggctgtg 60
acggccttca g 71
<![CDATA[ <210> 336]]>
<![CDATA[ <211> 158]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 336]]>
cgagggcagc gtcatcgcct actactggtc tgagttcagc atcccgcagc acctggtgga 60
ggaggccgag cgcgtcatgg ccgaggagcg cgtagtcatg ctgcccccgc gggcgcgctc 120
cctgaagtcc tttgtggtca cctcagtggt ggctttcc 158
<![CDATA[ <210> 337]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 337]]>
ccacggactc caaaacagta cagaggaccc aggaca 36
<![CDATA[ <210> 338]]>
<![CDATA[ <211> 241]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 338]]>
acagctgcag ctttggcctg cacgcccgcg gtgtggagct gatgcgcttc accacgcccg 60
gcttccctga cagcccctac cccgctcatg cccgctgcca gtgggccctg cggggggacg 120
ccgactcagt gctgagcctc accttccgca gctttgacct tgcgtcctgc gacgagcgcg 180
gcagcgacct ggtgacggtg tacaacaccc tgagccccat ggagccccac gccctggtgc 240
a 241
<![CDATA[ <210> 339]]>
<![CDATA[ <211> 140]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 339]]>
gttgtgtggc acctaccctc cctcctacaa cctgaccttc cactcctccc agaacgtcct 60
gctcatcaca ctgataacca acactgagcg gcggcatccc ggctttgagg ccaccttctt 120
ccagctgcct aggatgagca 140
<![CDATA[ <210> 340]]>
<![CDATA[ <211> 98]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 340]]>
gctgtggagg ccgcttacgt aaagcccagg ggacattcaa cagcccctac tacccaggcc 60
actacccacc caacattgac tgcacatgga actattgag 98
<![CDATA[ <210> 341]]>
<![CDATA[ <211> 110]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 341]]>
gtgcccaaca accagcatgt gaaggtgcgc ttcaaattct tctacctgct ggagcccggc 60
gtgcctgcgg gcacctgccc caaggactac gtggagatca acggggagaa 110
<![CDATA[ <210> 342]]>
<![CDATA[ <211> 131]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 342]]>
atactgcgga gagaggtccc agttcgtcgt caccagcaac agcaacaaga tcacagttcg 60
cttccactca gatcagtcct acaccgacac cggcttctta gctgaatacc tctcctacga 120
ctccagtgac c 131
<![CDATA[ <210> 343]]>
<![CDATA[ <211> 105]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 343]]>
catgcccggg gcagttcacg tgccgcacgg ggcggtgtat ccggaaggag ctgcgctgtg 60
atggctgggc cgactgcacc gaccacagcg atgagctcaa ctgca 105
<![CDATA[ <210> 344]]>
<![CDATA[ <211> 111]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 344]]>
gttgcgacgc cggccaccag ttcacgtgca agaacaagtt ctgcaagccc ctcttctggg 60
tctgcgacag tgtgaacgac tgcggagaca acagcgacga gcaggggtgc a 111
<![CDATA[ <210> 345]]>
<![CDATA[ <211> 114]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 345]]>
gttgtccggc ccagaccttc aggtgttcca atgggaagtg cctctcgaaa agccagcagt 60
gcaatgggaa ggacgactgt ggggacgggt ccgacgaggc ctcctgcccc aagg 114
<![CDATA[ <210> 346]]>
<![CDATA[ <211> 123]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 346]]>
tgaacgtcgt cacttgtacc aaacacacct accgctgcct caatgggctc tgcttgagca 60
agggcaaccc tgagtgtgac gggaaggagg actgtagcga cggctcagat gagaaggact 120
gcg 123
<![CDATA[ <210> 347]]>
<![CDATA[ <211> 187]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 347]]>
actgtggggct gcggtcattc acgagacagg ctcgtgttgt tgggggcacg gatgcggatg 60
agggcgagtg gccctggcag gtaagcctgc atgctctggg ccagggccac atctgcggtg 120
cttccctcat ctctcccaac tggctggtct ctgccgcaca ctgctacatc gatgacagag 180
gattcag 187
<![CDATA[ <210> 348]]>
<![CDATA[ <211> 275]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 348]]>
gtactcagac cccacgcagt ggacggcctt cctgggcttg cacgaccaga gccagcgcag 60
cgcccctggg gtgcaggagc gcaggctcaa gcgcatcatc tcccaccccct tcttcaatga 120
cttcaccttc gactatgaca tcgcgctgct ggagctggag aaaccggcag agtacagctc 180
catggtgcgg cccatctgcc tgccggacgc ctcccatgtc ttccctgccg gcaaggccat 240
ctgggtcacg ggctggggac acacccagta tggag 275
<![CDATA[ <210> 349]]>
<![CDATA[ <211> 137]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 349]]>
gcactggcgc gctgatcctg caaaagggtg agatccgcgt catcaaccag accacctgcg 60
agaacctcct gccgcagcag atcacgccgc gcatgatgtg cgtgggcttc ctcagcggcg 120
gcgtggactc ctgccag 137
<![CDATA[ <210> 350]]>
<![CDATA[ <211> 701]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 350]]>
ggtgattccg ggggacccct gtccagcgtg gaggcggatg ggcggatctt ccaggccggt 60
gtggtgagct ggggagacgg ctgcgctcag aggaacaagc caggcgtgta cacaaggctc 120
cctctgtttc gggactggat caaagagaac actggggtat aggggccggg gccacccaaa 180
tgtgtacaccc tgcggggcca cccatcgtcc accccagtgt gcacgcctgc aggctggaga 240
ctggaccgct gactgcacca gcgcccccag aacataacact gtgaactcaa tctccagggc 300
tccaaatctg cctagaaaac ctctcgcttc ctcagcctcc aaagtggagc tgggaggtag 360
aagggggagga cactggtggt tctactgacc caactggggg caaaggtttg aagacacagc 420
ctcccccgcc agccccaagc tgggccgagg cgcgtttgtg catatctgcc tcccctgtct 480
ctaaggagca gcgggaacgg agcttcgggg cctcctcagt gaaggtggtg gggctgccgg 540
atctgggctg tggggccctt gggccacgct cttgaggaag cccaggctcg gaggaccctg 600
gaaaacagac gggtctgaga ctgaaattgt ttaccagct cccagggtgg acttcagtgt 660
gtgtatttgt gtaaatgagt aaaacatttt atttcttttt a 701
<![CDATA[ <210> 351]]>
<![CDATA[ <211> 3305]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Whole mRNA polynucleotide sequence of ST14 ]]>
<![CDATA[ <400> 351]]>
gtgagagcgg agctgcagcc ggagaaagag gaagaggggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcgggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgaga aagtgaatgg cttggaggaa 300
ggcgtggagt tcctgccagt caacaacgtc aagaaggtgg aaaagcatgg cccggggcgc 360
tgggtggtgc tggcagccgt gctgatcggc ctcctcttgg tcttgctggg gatcggcttc 420
ctggtgtggc atttgcagta ccgggacgtg cgtgtccaga aggtcttcaa tggctacatg 480
aggatcacaa atgagaattt tgtggatgcc tacgagaact ccaactccac tgagtttgta 540
agcctggcca gcaaggtgaa ggacgcgctg aagctgctgt acagcggagt cccattcctg 600
ggcccctacc acaaggagtc ggctgtgacg gccttcagcg agggcagcgt catcgcctac 660
tactggtctg agttcagcat cccgcagcac ctggtggagg aggccgagcg cgtcatggcc 720
gaggagcgcg tagtcatgct gcccccgcgg gcgcgctccc tgaagtcctt tgtggtcacc 780
tcagtggtgg ctttccccac ggactccaaa acagtacaga ggacccagga caacagctgc 840
agctttggcc tgcacgcccg cggtgtggag ctgatgcgct tcaccacgcc cggcttccct 900
gacagcccct accccgctca tgcccgctgc cagtgggccc tgcgggggga cgccgactca 960
gtgctgagcc tcaccttccg cagctttgac cttgcgtcct gcgacgagcg cggcagcgac 1020
ctggtgacgg tgtacaacac cctgagcccc atggagcccc acgccctggt gcagttgtgt 1080
ggcacctacc ctccctccta caacctgacc ttccactcct cccagaacgt cctgctcatc 1140
acactgataa ccaacactga gcggcggcat cccggctttg aggccacctt cttccagctg 1200
cctaggatga gcagctgtgg aggccgctta cgtaaagccc aggggacatt caacagcccc 1260
tactacccag gccactaccc acccaacatt gactgcacat ggaacattga ggtgcccaac 1320
aaccagcatg tgaaggtgcg cttcaaattc ttctacctgc tggagcccgg cgtgcctgcg 1380
ggcacctgcc ccaaggacta cgtggagatc aacggggaga aatactgcgg agagaggtcc 1440
cagttcgtcg tcaccagcaa cagcaacaag atcacagttc gcttccactc agatcagtcc 1500
tacaccgaca ccggcttctt agctgaatac ctctcctacg actccagtga cccatgcccg 1560
gggcagttca cgtgccgcac ggggcggtgt atccggaagg agctgcgctg tgatggctgg 1620
gccgactgca ccgaccacag cgatgagctc aactgcagtt gcgacgccgg ccaccagttc 1680
acgtgcaaga acaagttctg caagcccctc ttctgggtct gcgacagtgt gaacgactgc 1740
ggagacaaca gcgacgagca ggggtgcagt tgtccggccc agaccttcag gtgttccaat 1800
gggaagtgcc tctcgaaaag ccagcagtgc aatgggaagg acgactgtgg ggacgggtcc 1860
gacgaggcct cctgccccaa ggtgaacgtc gtcacttgta ccaaacacac ctaccgctgc 1920
ctcaatgggc tctgcttgag caagggcaac cctgagtgtg acgggaagga ggactgtagc 1980
gacggctcag atgagaagga ctgcgactgt gggctgcggt cattcacgag acaggctcgt 2040
gttgttgggg gcacggatgc ggatgagggc gagtggccct ggcaggtaag cctgcatgct 2100
ctgggccagg gccacatctg cggtgcttcc ctcatctctc ccaactggct ggtctctgcc 2160
gcacactgct acatcgatga cagaggattc aggtactcag accccacgca gtggacggcc 2220
ttcctgggct tgcacgacca gagccagcgc agcgcccctg gggtgcagga gcgcaggctc 2280
aagcgcatca tctcccaccc cttcttcaat gacttcacct tcgactatga catcgcgctg 2340
ctggagctgg agaaaccggc agagtacagc tccatggtgc ggcccatctg cctgccggac 2400
gcctcccatg tcttccctgc cggcaaggcc atctgggtca cgggctgggg acacacccag 2460
tatggaggca ctggcgcgct gatcctgcaa aagggtgaga tccgcgtcat caaccagacc 2520
acctgcgaga acctcctgcc gcagcagatc acgccgcgca tgatgtgcgt gggcttcctc 2580
agcggcggcg tggactcctg ccagggtgat tccgggggac ccctgtccag cgtgggaggcg 2640
gatgggcgga tcttccaggc cggtgtggtg agctggggag acggctgcgc tcagaggaac 2700
aagccaggcg tgtacacaag gctccctctg tttcgggact ggatcaaaga gaacactggg 2760
gtataggggc cggggccacc caaatgtgta cacctgcggg gccacccatc gtccacccca 2820
gtgtgcacgc ctgcaggctg gagactggac cgctgactgc accagcgccc ccagaacata 2880
cactgtgaac tcaatctcca gggctccaaa tctgcctaga aaacctctcg cttcctcagc 2940
ctccaaagtg gagctggggag gtagaagggg aggacactgg tggttctact gacccaactg 3000
ggggcaaagg tttgaagaca cagcctcccc cgccagcccc aagctgggcc gaggcgcgtt 3060
tgtgcatatc tgcctcccct gtctctaagg agcagcggga acggagcttc ggggcctcct 3120
cagtgaaggt ggtggggctg ccggatctgg gctgtggggc ccttgggcca cgctcttgag 3180
gaagcccagg ctcggaggac cctggaaaac agacgggtct gagactgaaa ttgttttacc 3240
agctcccagg gtggacttca gtgtgtgtat ttgtgtaaat gagtaaaaca ttttatttct 3300
TTTT 3305
<![CDATA[ <210> 352]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 352]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu
20 25
<![CDATA[ <210> 353]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 353]]>
Lys Val Asn Gly Leu Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn
1 5 10 15
Val Lys Lys Val Glu Lys His Gly Pro Gly Arg Trp Val Val Leu Ala
20 25 30
Ala Val Leu Ile Gly Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu
35 40 45
Val Trp His Leu Gln
50
<![CDATA[ <210> 354]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 354]]>
Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile Thr
1 5 10 15
Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu Phe
20 25 30
Val Ser Leu Ala Ser Lys Val Lys Asp Ala
35 40
<![CDATA[ <210> 355]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 355]]>
Leu Lys Leu Leu Tyr Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys
1 5 10 15
Glu Ser Ala Val Thr Ala Phe
20
<![CDATA[ <210> 356]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 356]]>
Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser Ile Pro Gln
1 5 10 15
His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu Arg Val Val
20 25 30
Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val Val Thr Ser
35 40 45
Val Val Ala Phe
50
<![CDATA[ <210> 357]]>
<![CDATA[ <211> 11]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 357]]>
Thr Asp Ser Lys Thr Val Gln Arg Thr Gln Asp
1 5 10
<![CDATA[ <210> 358]]>
<![CDATA[ <211> 79]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 35]]>8
Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu Leu Met Arg Phe
1 5 10 15
Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala His Ala Arg Cys
20 25 30
Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu Ser Leu Thr Phe
35 40 45
Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly Ser Asp Leu Val
50 55 60
Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His Ala Leu Val
65 70 75
<![CDATA[ <210> 359]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 359]]>
Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr Phe His Ser Ser
1 5 10 15
Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr Glu Arg Arg His
20 25 30
Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg Met Ser
35 40 45
<![CDATA[ <210> 360]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 360]]>
Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn Ser Pro Tyr
1 5 10 15
Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp Asn Ile Glu
20 25 30
<![CDATA[ <210> 361]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 361]]>
Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe Phe Tyr Leu
1 5 10 15
Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp Tyr Val Glu
20 25 30
Ile Asn Gly Glu
35
<![CDATA[ <210> 362]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 362]]>
Tyr Cys Gly Glu Arg Ser Gln Phe Val Val Thr Ser Ser Asn Ser Asn Lys
1 5 10 15
Ile Thr Val Arg Phe His Ser Asp Gln Ser Tyr Thr Asp Thr Gly Phe
20 25 30
Leu Ala Glu Tyr Leu Ser Tyr Asp Ser Ser Asp
35 40
<![CDATA[ <210> 363]]>
<![CDATA[ <211> 34]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 363]]>
Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys Ile Arg Lys Glu
1 5 10 15
Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His Ser Asp Glu Leu
20 25 30
Asn Cys
<![CDATA[ <210> 364]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 36]]>4
Cys Asp Ala Gly His Gln Phe Thr Cys Lys Asn Lys Phe Cys Lys Pro
1 5 10 15
Leu Phe Trp Val Cys Asp Ser Val Asn Asp Cys Gly Asp Asn Ser Asp
20 25 30
Glu Gln Gly Cys
35
<![CDATA[ <210> 365]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 365]]>
Cys Pro Ala Gln Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys
1 5 10 15
Ser Gln Gln Cys Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu
20 25 30
Ala Ser Cys Pro Lys
35
<![CDATA[ <210> 366]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 366]]>
Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn Gly Leu
1 5 10 15
Cys Leu Ser Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp Cys Ser
20 25 30
Asp Gly Ser Asp Glu Lys Asp Cys
35 40
<![CDATA[ <210> 367]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 367]]>
Cys Gly Leu Arg Ser Phe Thr Arg Gln Ala Arg Val Val Gly Gly Thr
1 5 10 15
Asp Ala Asp Glu Gly Glu Trp Pro Trp Gln Val Ser Leu His Ala Leu
20 25 30
Gly Gln Gly His Ile Cys Gly Ala Ser Leu Ile Ser Pro Asn Trp Leu
35 40 45
Val Ser Ala Ala His Cys Tyr Ile Asp Asp Arg Gly Phe
50 55 60
<![CDATA[ <210> 368]]>
<![CDATA[ <211> 91]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 368]]>
Tyr Ser Asp Pro Thr Gln Trp Thr Ala Phe Leu Gly Leu His Asp Gln
1 5 10 15
Ser Gln Arg Ser Ala Pro Gly Val Gln Glu Arg Arg Leu Lys Arg Ile
20 25 30
Ile Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr Asp Ile Ala
35 40 45
Leu Leu Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met Val Arg Pro
50 55 60
Ile Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly Lys Ala Ile
65 70 75 80
Trp Val Thr Gly Trp Gly His Thr Gln Tyr Gly
85 90
<![CDATA[ <210> 369]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 369]]>
Thr Gly Ala Leu Ile Leu Gln Lys Gly Glu Ile Arg Val Ile Asn Gln
1 5 10 15
Thr Thr Cys Glu Asn Leu Leu Pro Gln Gln Ile Thr Pro Arg Met Met
20 25 30
Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gln
35 40 45
<![CDATA[ <210> 370]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 370]]>
Gly Asp Ser Gly Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg Ile
1 5 10 15
Phe Gln Ala Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gln Arg Asn
20 25 30
Lys Pro Gly Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp Ile Lys
35 40 45
Glu Asn Thr Gly Val
50
<![CDATA[ <210> 371]]>
<![CDATA[ <211> 855]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 371]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn Gly Leu
20 25 30
Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn Val Lys Lys Val Glu
35 40 45
Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala Val Leu Ile Gly
50 55 60
Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu Val Trp His Leu Gln
65 70 75 80
Tyr Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile
85 90 95
Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu
100 105 110
Phe Val Ser Leu Ala Ser Lys Val Lys Asp Ala Leu Lys Leu Leu Tyr
115 120 125
Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys Glu Ser Ala Val Thr
130 135 140
Ala Phe Ser Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser
145 150 155 160
Ile Pro Gln His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu
165 170 175
Arg Val Val Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val
180 185 190
Val Thr Ser Val Val Ala Phe Pro Thr Asp Ser Lys Thr Val Gln Arg
195 200 205
Thr Gln Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu
210 215 220
Leu Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala
225 230 235 240
His Ala Arg Cys Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu
245 250 255
Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly
260 265 270
Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His
275 280 285
Ala Leu Val Gln Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr
290 295 300
Phe His Ser Ser Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr
305 310 315 320
Glu Arg Arg His Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg
325 330 335
Met Ser Ser Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn
340 345 350
Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp
355 360 365
Asn Ile Glu Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe
370 375 380
Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp
385 390 395 400
Tyr Val Glu Ile Asn Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gln Phe
405 410 415
Val Val Thr Ser Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp
420 425 430
Gln Ser Tyr Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp
435 440 445
Ser Ser Asp Pro Cys Pro Gly Gln Phe Thr Cys Arg Thr Gly Arg Cys
450 455 460
Ile Arg Lys Glu Leu Arg Cys Asp Gly Trp Ala Asp Cys Thr Asp His
465 470 475 480
Ser Asp Glu Leu Asn Cys Ser Cys Asp Ala Gly His Gln Phe Thr Cys
485 490 495
Lys Asn Lys Phe Cys Lys Pro Leu Phe Trp Val Cys Asp Ser Val Asn
500 505 510
Asp Cys Gly Asp Asn Ser Asp Glu Gln Gly Cys Ser Cys Pro Ala Gln
515 520 525
Thr Phe Arg Cys Ser Asn Gly Lys Cys Leu Ser Lys Ser Gln Gln Cys
530 535 540
Asn Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ser Cys Pro
545 550 555 560
Lys Val Asn Val Val Thr Cys Thr Lys His Thr Tyr Arg Cys Leu Asn
565 570 575
Gly Leu Cys Leu Ser Lys Gly Asn Pro Glu Cys Asp Gly Lys Glu Asp
580 585 590
Cys Ser Asp Gly Ser Asp Glu Lys Asp Cys Asp Cys Gly Leu Arg Ser
595 600 605
Phe Thr Arg Gln Ala Arg Val Val Gly Gly Thr Asp Ala Asp Glu Gly
610 615 620
Glu Trp Pro Trp Gln Val Ser Leu His Ala Leu Gly Gln Gly His Ile
625 630 635 640
Cys Gly Ala Ser Leu Ile Ser Pro Asn Trp Leu Val Ser Ala Ala His
645 650 655
Cys Tyr Ile Asp Asp Arg Gly Phe Arg Tyr Ser Asp Pro Thr Gln Trp
660 665 670
Thr Ala Phe Leu Gly Leu His Asp Gln Ser Gln Arg Ser Ala Pro Gly
675 680 685
Val Gln Glu Arg Arg Leu Lys Arg Ile Ile Ser His Pro Phe Phe Asn
690 695 700
Asp Phe Thr Phe Asp Tyr Asp Ile Ala Leu Leu Glu Leu Glu Lys Pro
705 710 715 720
Ala Glu Tyr Ser Ser Met Val Arg Pro Ile Cys Leu Pro Asp Ala Ser
725 730 735
His Val Phe Pro Ala Gly Lys Ala Ile Trp Val Thr Gly Trp Gly His
740 745 750
Thr Gln Tyr Gly Gly Thr Gly Ala Leu Ile Leu Gln Lys Gly Glu Ile
755 760 765
Arg Val Ile Asn Gln Thr Thr Cys Glu Asn Leu Leu Pro Gln Gln Ile
770 775 780
Thr Pro Arg Met Met Cys Val Gly Phe Leu Ser Gly Gly Val Asp Ser
785 790 795 800
Cys Gln Gly Asp Ser Gly Gly Pro Leu Ser Ser Ser Val Glu Ala Asp Gly
805 810 815
Arg Ile Phe Gln Ala Gly Val Val Ser Trp Gly Asp Gly Cys Ala Gln
820 825 830
Arg Asn Lys Pro Gly Val Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp
835 840 845
Ile Lys Glu Asn Thr Gly Val
850 855
<![CDATA[ <210> 372]]>
<![CDATA[ <211> 1552]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 372]]>
gtgagagcgg agctgcagcc ggagaaagag gaagaggggag agagagcgcg ccagggcgag 60
ggcaccgccg ccggtcgggc gcgctgggcc tgcccggaat cccgccgcct gcgccccgcg 120
ccccgcgccc tgcgggccat gggagccggc cgccggcagg gacgacgcct gtgagacccg 180
cgagcggcct cggggaccat ggggagcgat cgggcccgca agggcgggagg gggcccgaag 240
gacttcggcg cgggactcaa gtacaactcc cggcacgaga aagtgaatgg cttggaggaa 300
ggcgtggagt tcctgccagt caacaacgtc aagaaggtgg aaaagcatgg cccggggcgc 360
tgggtggtgc tggcagccgt gctgatcggc ctcctcttgg tcttgctggg gatcggcttc 420
ctggtgtggc atttgcagta ccgggacgtg cgtgtccaga aggtcttcaa tggctacatg 480
aggatcacaa atgagaattt tgtggatgcc tacgagaact ccaactccac tgagtttgta 540
agcctggcca gcaaggtgaa ggacgcgctg aagctgctgt acagcggagt cccattcctg 600
ggcccctacc acaaggagtc ggctgtgacg gccttcagcg agggcagcgt catcgcctac 660
tactggtctg agttcagcat cccgcagcac ctggtggagg aggccgagcg cgtcatggcc 720
gaggagcgcg tagtcatgct gcccccgcgg gcgcgctccc tgaagtcctt tgtggtcacc 780
tcagtggtgg ctttccccac ggactccaaa acagtacaga ggacccagga caacagctgc 840
agctttggcc tgcacgcccg cggtgtggag ctgatgcgct tcaccacgcc cggcttccct 900
gacagcccct accccgctca tgcccgctgc cagtgggccc tgcgggggga cgccgactca 960
gtgctgagcc tcaccttccg cagctttgac cttgcgtcct gcgacgagcg cggcagcgac 1020
ctggtgacgg tgtacaacac cctgagcccc atggagcccc acgccctggt gcagttgtgt 1080
ggcacctacc ctccctccta caacctgacc ttccactcct cccagaacgt cctgctcatc 1140
acactgataa ccaacactga gcggcggcat cccggctttg aggccacctt cttccagctg 1200
cctaggatga gcagctgtgg aggccgctta cgtaaagccc aggggacatt caacagcccc 1260
tactacccag gccactaccc acccaacatt gactgcacat ggaacattga ggtgcccaac 1320
aaccagcatg tgaaggtgcg cttcaaattc ttctacctgc tggagcccgg cgtgcctgcg 1380
ggcacctgcc ccaaggacta cgtggagatc aacggggaga aatactgcgg agagaggtcc 1440
cagttcgtcg tcaccagcaa cagcaacaag atcacagttc gcttccactc agatcagtcc 1500
tacaccgaca ccggcttctt agctgaatac ctctcctacg actccagtga cc 1552
<![CDATA[ <210> 373]]>
<![CDATA[ <211> 451]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 373]]>
Met Gly Ser Asp Arg Ala Arg Lys Gly Gly Gly Gly Pro Lys Asp Phe
1 5 10 15
Gly Ala Gly Leu Lys Tyr Asn Ser Arg His Glu Lys Val Asn Gly Leu
20 25 30
Glu Glu Gly Val Glu Phe Leu Pro Val Asn Asn Val Lys Lys Val Glu
35 40 45
Lys His Gly Pro Gly Arg Trp Val Val Leu Ala Ala Val Leu Ile Gly
50 55 60
Leu Leu Leu Val Leu Leu Gly Ile Gly Phe Leu Val Trp His Leu Gln
65 70 75 80
Tyr Arg Asp Val Arg Val Gln Lys Val Phe Asn Gly Tyr Met Arg Ile
85 90 95
Thr Asn Glu Asn Phe Val Asp Ala Tyr Glu Asn Ser Asn Ser Thr Glu
100 105 110
Phe Val Ser Leu Ala Ser Lys Val Lys Asp Ala Leu Lys Leu Leu Tyr
115 120 125
Ser Gly Val Pro Phe Leu Gly Pro Tyr His Lys Glu Ser Ala Val Thr
130 135 140
Ala Phe Ser Glu Gly Ser Val Ile Ala Tyr Tyr Trp Ser Glu Phe Ser
145 150 155 160
Ile Pro Gln His Leu Val Glu Glu Ala Glu Arg Val Met Ala Glu Glu
165 170 175
Arg Val Val Met Leu Pro Pro Arg Ala Arg Ser Leu Lys Ser Phe Val
180 185 190
Val Thr Ser Val Val Ala Phe Pro Thr Asp Ser Lys Thr Val Gln Arg
195 200 205
Thr Gln Asp Asn Ser Cys Ser Phe Gly Leu His Ala Arg Gly Val Glu
210 215 220
Leu Met Arg Phe Thr Thr Pro Gly Phe Pro Asp Ser Pro Tyr Pro Ala
225 230 235 240
His Ala Arg Cys Gln Trp Ala Leu Arg Gly Asp Ala Asp Ser Val Leu
245 250 255
Ser Leu Thr Phe Arg Ser Phe Asp Leu Ala Ser Cys Asp Glu Arg Gly
260 265 270
Ser Asp Leu Val Thr Val Tyr Asn Thr Leu Ser Pro Met Glu Pro His
275 280 285
Ala Leu Val Gln Leu Cys Gly Thr Tyr Pro Pro Ser Tyr Asn Leu Thr
290 295 300
Phe His Ser Ser Gln Asn Val Leu Leu Ile Thr Leu Ile Thr Asn Thr
305 310 315 320
Glu Arg Arg His Pro Gly Phe Glu Ala Thr Phe Phe Gln Leu Pro Arg
325 330 335
Met Ser Ser Cys Gly Gly Arg Leu Arg Lys Ala Gln Gly Thr Phe Asn
340 345 350
Ser Pro Tyr Tyr Pro Gly His Tyr Pro Pro Asn Ile Asp Cys Thr Trp
355 360 365
Asn Ile Glu Val Pro Asn Asn Gln His Val Lys Val Arg Phe Lys Phe
370 375 380
Phe Tyr Leu Leu Glu Pro Gly Val Pro Ala Gly Thr Cys Pro Lys Asp
385 390 395 400
Tyr Val Glu Ile Asn Gly Glu Lys Tyr Cys Gly Glu Arg Ser Gln Phe
405 410 415
Val Val Thr Ser Asn Ser Asn Lys Ile Thr Val Arg Phe His Ser Asp
420 425 430
Gln Ser Tyr Thr Asp Thr Gly Phe Leu Ala Glu Tyr Leu Ser Tyr Asp
435 440 445
Ser Ser Asp
450
<![CDATA[ <210> 374]]>
<![CDATA[ <211> 56]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 374]]>
accctgtgaa ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gcccca 56
<![CDATA[ <210> 375]]>
<![CDATA[ <211> 89]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 375]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttc 89
<![CDATA[ <210> 376]]>
<![CDATA[ <211> 145]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 376]]>
accctgtgaa ggcgaagcgc gggagaccaa agcctgcaag aaagacgcct gccccactac 60
atctacatcc accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt 120
ctgtgtgaat ggaggggagt gcttc 145
<![CDATA[ <210> 377]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 377]]>
Pro Cys Glu Gly Glu Ala Arg Glu Thr Lys Ala Cys Lys Lys Asp Ala
1 5 10 15
Cys Pro Thr Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys
20 25 30
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe
35 40 45
<![CDATA[ <210> 378]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 378]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag 150
<![CDATA[ <210> 379]]>
<![CDATA[ <211> 96]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 379]]>
gatccctgct gggcaccaac agctccacca tggggctggc ctggggacta ggcgtcctgt 60
tcctgatgca tgtgtgtggc accaaccgca ttccag 96
<![CDATA[ <210> 380]]>
<![CDATA[ <211> 560]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 380]]>
agtctggcgg agacaacagc gtgtttgaca tctttgaact caccggggcc gcccgcaagg 60
ggtctgggcg ccgactggtg aagggccccg acccttccag cccagctttc cgcatcgagg 120
atgccaacct gatcccccct gtgcctgatg acaagttcca agacctggtg gatgctgtgc 180
gggcagaaaa gggtttcctc cttctggcat ccctgaggca gatgaagaag acccggggca 240
cgctgctggc cctggagcgg aaagaccact ctggccaggt cttcagcgtg gtgtccaatg 300
gcaaggcggg caccctggac ctcagcctga ccgtccaagg aaagcagcac gtggtgtctg 360
tggaagaagc tctcctggca accggccagt ggaagagcat caccctgttt gtgcaggaag 420
acagggccca gctgtacatc gactgtgaaa agatggagaa tgctgagttg gacgtcccca 480
tccaaagcgt cttcaccaga gacctggcca gcatcgccag actccgcatc gcaaaggggg 540
gcgtcaatga caatttccag 560
<![CDATA[ <210> 381]]>
<![CDATA[ <211> 76]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 381]]>
ggggtgctgc agaatgtgag gtttgtcttt ggaaccacac cagaagacat cctcaggaac 60
aaaggctgct ccagct 76
<![CDATA[ <210> 382]]>
<![CDATA[ <211> 200]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 382]]>
ctaccagtgt cctcctcacc cttgacaaca acgtggtgaa tggttccagc cctgccatcc 60
gcactaacta cattggccac aagacaaagg acttgcaagc catctgcggc atctcctgtg 120
atgagctgtc cagcatggtc ctggaactca ggggcctgcg caccatgtg accacgctgc 180
aggacagcat ccgcaaagtg 200
<![CDATA[ <210> 383]]>
<![CDATA[ <211> 123]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 383]]>
actgaagaga acaaagagtt ggccaatgag ctgaggcggc ctcccctatg ctatcacaac 60
ggagttcagt acagaaataa cgaggaatgg actgttgata gctgcactga gtgtcactgt 120
cag 123
<![CDATA[ <210> 384]]>
<![CDATA[ <211> 94]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 384]]>
aactcagtta ccatctgcaa aaaggtgtcc tgccccatca tgccctgctc caatgccaca 60
gttcctgatg gagaatgctg tcctcgctgt tggc 94
<![CDATA[ <210> 385]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 385]]>
ccagcgactc tgcggacgat ggctggtctc catggtccga gtggacctcc tgttctacga 60
gctgtggcaa tggaattcag cagcgcggcc gctcctgcga tagcctcaac aaccgatgtg 120
agggctcctc ggtccagaca cggacctgcc aattcagga gtgtgacaag agat 174
<![CDATA[ <210> 386]]>
<![CDATA[ <211> 177]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 386]]>
ttaaacagga tggtggctgg agccactggt ccccgtggtc atcttgttct gtgacatgtg 60
gtgatggtgt gatcacaagg atccggctct gcaactctcc cagcccccag atgaacggga 120
aaccctgtga aggcgaagcg cgggagacca aagcctgcaa gaaagacgcc tgcccca 177
<![CDATA[ <210> 387]]>
<![CDATA[ <211> 4547]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 387]]>
ttaaacagga tggtggctgg agccactggt ccccgtggtc atcttgttct gtgacatgtg 60
gtgatggtgt gatcacaagg atccggctct gcaactctcc cagcccccag atgaacggga 120
aaccctgtga aggcgaagcg cgggagacca aagcctgcaa gaaagacgcc tgccccatca 180
atggaggctg gggtccttgg tcaccatggg acatctgttc tgtcacctgt ggaggagggg 240
tacagaaacg tagtcgtctc tgcaacaacc ccacacccca gtttggaggc aaggactgcg 300
ttggtgatgt aacagaaaac cagatctgca acaagcagga ctgtccaatt gatggatgcc 360
tgtccaatcc ctgctttgcc ggcgtgaagt gtactagcta ccctgatggc agctggaaat 420
gtggtgcttg tccccctggt tacagtggaa atggcatcca gtgcacagat gttgatgagt 480
gcaaagaagt gcctgatgcc tgcttcaacc acaatggaga gcaccggtgt gagaacacgg 540
accccggcta caactgcctg ccctgccccc cacgcttcac cggctcacag cccttcggcc 600
agggtgtcga acatgccacg gccaacaaac aggtgtgcaa gccccgtaac ccctgcacgg 660
atgggaccca cgactgcaac aagaacgcca agtgcaacta cctgggccac tatagcgacc 720
ccatgtaccg ctgcgagtgc aagcctggct acgctggcaa tggcatcatc tgcgggggagg 780
acacagacct ggatggctgg cccaatgaga acctggtgtg cgtggccaat gcgacttacc 840
actgcaaaaa ggataattgc cccaaccttc ccaactcagg gcaggaagac tatgacaagg 900
atggaattgg tgatgcctgt gatgatgacg atgacaatga taaaattcca gatgacaggg 960
acaactgtcc attccattac aacccagctc agtatgacta tgacagagat gatgtggggag 1020
accgctgtga caactgtccc tacaaccaca accccagatca ggcagacaca gacaacaatg 1080
gggaaggaga cgcctgtgct gcagacattg atggagacgg tatcctcaat gaacgggaca 1140
actgccagta cgtctacaat gtggaccaga gagacactga tatggatggg gttggagatc 1200
agtgtgacaa ttgccccttg gaacacaatc cggatcagct ggactctgac tcagaccgca 1260
ttggagatac ctgtgacaac aatcaggata ttgatgaaga tggccaccag aacaatctgg 1320
acaactgtcc ctatgtgccc aatgccaacc aggctgacca tgacaaagat ggcaagggag 1380
atgcctgtga ccacgatgat gacaacgatg gcattcctga tgacaaggac aactgcagac 1440
tcgtgcccaa tcccgaccag aaggactctg acggcgatgg tcgaggtgat gcctgcaaag 1500
atgattttga ccatgacagt gtgccagaca tcgatgacat ctgtcctgag aatgttgaca 1560
tcagtgagac cgatttccgc cgattccaga tgattcctct ggaccccaaa gggacatccc 1620
aaaatgaccc taactgggtt gtacgccatc agggtaaaga actcgtccag actgtcaact 1680
gtgatcctgg actcgctgta ggttatgatg agtttaatgc tgtggacttc agtggcacct 1740
tcttcatcaa caccgaaagg gacgatgact atgctggatt tgtctttggc taccagtcca 1800
gcagccgctt ttatgttgtg atgtggaagc aagtcaccca gtcctactgg gacaccaacc 1860
ccacgagggc tcagggatac tcgggccttt ctgtgaaagt tgtaaactcc accacagggc 1920
ctggcgagca cctgcggaac gccctgtggc acacaggaaa cacccctggc caggtgcgca 1980
ccctgtggca tgaccctcgt cacataggct ggaaagattt caccgcctac agatggcgtc 2040
tcagccacag gccaaagacg ggtttcatta gagtggtgat gtatgaaggg aagaaaatca 2100
tggctgactc aggaccccatc tatgataaaa cctatgctgg tggtagacta gggttgtttg 2160
tcttctctca agaaatggtg ttcttctctg acctgaaata cgaatgtaga gatccctaat 2220
catcaaattg ttgattgaaa gactgatcat aaaccaatgc tggtattgca ccttctggaa 2280
ctatggggctt gagaaaaccc ccaggatcac ttctccttgg cttccttctt ttctgtgctt 2340
gcatcagtgt ggactcctag aacgtgcgac ctgcctcaag aaaatgcagt tttcaaaaac 2400
agactcagca ttcagcctcc aatgaataag acatcttcca agcatataaa caattgcttt 2460
ggtttccttttgaaaaagca tctacttgct tcagttggga aggtgcccat tccactctgc 2520
ctttgtcaca gagcagggtg ctattgtgag gccatctctg agcagtggac tcaaaagcat 2580
tttcaggcat gtcagagaag ggaggactca ctagaattag caaacaaaac caccctgaca 2640
tcctccttca ggaacacggg gagcagaggc caaagcacta aggggagggc gcatacccga 2700
gacgattgta tgaagaaaat atggaggaac tgttacatgt tcggtactaa gtcattttca 2760
ggggattgaa agactattgc tggatttcat gatgctgact ggcgttagct gattaaccca 2820
tgtaaatagg cacttaaata gaagcaggaa agggagacaa agactggctt ctggacttcc 2880
tccctgatcc ccacccttac tcatcacctg cagtggccag aattagggaa tcagaatcaa 2940
accagtgtaa ggcagtgctg gctgccattg cctggtcaca ttgaaattgg tggcttcatt 3000
ctagatgtag cttgtgcaga tgtagcagga aaataggaaa acctaccatc tcagtgagca 3060
ccagctgcct cccaaaggag gggcagccgt gcttatattt ttatggttac aatggcacaa 3120
aattattatc aacctaacta aaacattcct tttctctttt ttcctgaatt atcatggagt 3180
tttctaattc tctcttttgg aatgtagatt ttttttaaat gctttacgat gtaaaatatt 3240
tattttttac ttattctgga agatctggct gaaggattat tcatggaaca ggaagaagcg 3300
taaagactat ccatgtcatc tttgttgaga gtcttcgtga ctgtaagatt gtaaatacag 3360
attatttatt aactctgttc tgcctggaaa tttaggcttc atacggaaag tgtttgagag 3420
caagtagttg acatttatca gcaaatctct tgcaagaaca gcacaaggaa aatcagtcta 3480
ataagctgct ctgccccttg tgctcagagt ggatgttatg ggattctttt tttctctgtt 3540
ttatcttttc aagtggaatt agttggttat ccatttgcaa atgttttaaa ttgcaaagaa 3600
agccatgagg tcttcaatac tgttttaccc catcccttgt gcatatttcc agggagaagg 3660
aaagcatata cacttttttc tttcattttt ccaaaagaga aaaaaatgac aaaaggtgaa 3720
acttacatac aaatattacc tcatttgttg tgtgactgag taaagaattt ttggatcaag 3780
cggaaagagt ttaagtgtct aacaaactta aagctactgt agtacctaaa aagtcagtgt 3840
tgtacatagc ataaaaactc tgcagagaag tattcccaat aaggaaatag cattgaaatg 3900
ttaaatacaa tttctgaaag ttatgttttt tttctatcat ctggtatacc attgctttat 3960
ttttataaat tattttctca ttgccattgg aatagatc tcagattgtg tagatatgct 4020
atttaaataa tttatcagga aatactgcct gtagagttag tatttctatt tttatataat 4080
gtttgcacac tgaattgaag aattgttggt tttttcttttttttgttttg tttttttttt 4140
tttttttttttgcttttgac ctcccatttttactattttgc caatacctttttctaggaat 4200
gtgctttttt ttgtacacat ttttatccat tttacattct aaagcagtgt aagttgtata 4260
ttatgtttc ttatgtacaa ggaacaacaa taaatcatat ggaaatttat atttatactt 4320
actgtatcca tgcttatttg ttctctactg gctttatgtc atgaagtata tgcgtaaata 4380
ccattcataa atcaatatag catatacaaa aataaattac agtaagtcat agcaacattc 4440
acagtttgta tgtgattgag aaagactgag ttgctcaggc ctaggcttag aatttgctgc 4500
gtttgtggaa taaaagaaca aaatgataca ttagcctgcc atatcaa 4547
<![CDATA[ <210> 388]]>
<![CDATA[ <211> 6197]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Whole mRNA polynucleotide sequence of THBS1]]>
<![CDATA[ <400> 388]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag gatccctgct gggcaccaac agctccacca 180
tggggctggc ctggggacta ggcgtcctgt tcctgatgca tgtgtgtggc accaaccgca 240
ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggccgccc 300
gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 360
tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac ctggtggatg 420
ctgtgcgggc agaaaagggt ttcctccttc tggcatccct gaggcagatg aagaagaccc 480
ggggcacgct gctggccctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 540
ccaatggcaa ggcgggcacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 600
tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 660
aggaagacag ggcccagctg tacatcgact gtgaaaagat ggagaatgct gagttggacg 720
tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 780
aggggggcgt caatgacaat ttccagggggg tgctgcagaa tgtgaggttt gtctttggaa 840
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtcctcctca 900
cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 960
acaagacaaa ggacttgcaa gccatctgcg gcatctcctg tgatgagctg tccagcatgg 1020
tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc atccgcaaag 1080
tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta tgctatcaca 1140
acggagttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact 1200
gtcagaactc agttaccatc tgcaaaaagg tgtcctgccc catcatgccc tgctccaatg 1260
ccacagttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg 1320
gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc 1380
agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac 1440
ggacctgcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc tggagccact 1500
ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg tgtgatcaca aggatccggc 1560
tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcggggaga 1620
ccaaagcctg caagaaagac gcctgcccca ttaaacagga tggtggctgg agccactggt 1680
ccccgtggtc atcttgttct gtgacatgtg gtgatggtgt gatcacaagg atccggctct 1740
gcaactctcc cagcccccag atgaacggga aaccctgtga aggcgaagcg cgggagacca 1800
aagcctgcaa gaaagacgcc tgccccatca atggaggctg gggtccttgg tcaccatggg 1860
acatctgttc tgtcacctgt ggaggagggg tacagaaacg tagtcgtctc tgcaacaacc 1920
ccacacccca gtttggaggc aaggactgcg ttggtgatgt aacagaaaac cagatctgca 1980
acaagcagga ctgtccaatt gatggatgcc tgtccaatcc ctgctttgcc ggcgtgaagt 2040
gtactagcta ccctgatggc agctggaaat gtggtgcttg tccccctggt tacagtggaa 2100
atggcatcca gtgcacagat gttgatgagt gcaaagaagt gcctgatgcc tgcttcaacc 2160
acaatggaga gcaccggtgt gagaacacgg accccggcta caactgcctg ccctgccccc 2220
cacgcttcac cggctcacag cccttcggcc agggtgtcga acatgccacg gccaacaaac 2280
aggtgtgcaa gccccgtaac ccctgcacgg atgggaccca cgactgcaac aagaacgcca 2340
agtgcaacta cctgggccac tatagcgacc ccatgtaccg ctgcgagtgc aagcctggct 2400
acgctggcaa tggcatcatc tgcgggggagg acacagacct ggatggctgg cccaatgaga 2460
acctggtgtg cgtggccaat gcgacttacc actgcaaaaa ggataattgc cccaaccttc 2520
ccaactcagg gcaggaagac tatgacaagg atggaattgg tgatgcctgt gatgatgacg 2580
atgacaatga taaaattcca gatgacagggg acaactgtcc attccattac aacccagctc 2640
agtatgacta tgacagagat gatgtggggag accgctgtga caactgtccc tacaaccaca 2700
accccagatca ggcagacaca gacaacaatg gggaaggaga cgcctgtgct gcagacattg 2760
atggagacgg tatcctcaat gaacgggaca actgccagta cgtctacaat gtggaccaga 2820
gagacactga tatggatggg gttggagatc agtgtgacaa ttgccccttg gaacacaatc 2880
cggatcagct ggactctgac tcagaccgca ttggagatac ctgtgacaac aatcaggata 2940
ttgatgaaga tggccaccag aacaatctgg acaactgtcc ctatgtgccc aatgccaacc 3000
aggctgacca tgacaaagat ggcaagggag atgcctgtga ccacgatgat gacaacgatg 3060
gcattcctga tgacaaggac aactgcagac tcgtgcccaa tcccgaccag aaggactctg 3120
acggcgatgg tcgaggtgat gcctgcaaag atgattttga ccatgacagt gtgccagaca 3180
tcgatgacat ctgtcctgag aatgttgaca tcagtgagac cgatttccgc cgattccaga 3240
tgattcctct ggaccccaaa gggacatccc aaaatgaccc taactgggtt gtacgccatc 3300
agggtaaaga actcgtccag actgtcaact gtgatcctgg actcgctgta ggttatgatg 3360
agtttaatgc tgtggacttc agtggcacct tcttcatcaa caccgaaagg gacgatgact 3420
atgctggatt tgtctttggc taccagtcca gcagccgctt ttatgttgtg atgtggaagc 3480
aagtcaccca gtcctactgg gacaccaacc ccacgagggc tcagggatac tcgggccttt 3540
ctgtgaaagt tgtaaactcc accacagggc ctggcgagca cctgcggaac gccctgtggc 3600
acacaggaaa cacccctggc caggtgcgca ccctgtggca tgaccctcgt cacataggct 3660
ggaaagattt caccgcctac agatggcgtc tcagccacag gccaaagacg ggtttcatta 3720
gagtggtgat gtatgaaggg aagaaaatca tggctgactc aggaccccatc tatgataaaa 3780
cctatgctgg tggtagacta gggttgtttg tcttctctca agaaatggtg ttcttctctg 3840
acctgaaata cgaatgtaga gatccctaat catcaaattg ttgattgaaa gactgatcat 3900
aaaccaatgc tggtattgca ccttctggaa ctatgggctt gagaaaaccc ccaggatcac 3960
ttctccttgg cttccttctt ttctgtgctt gcatcagtgt ggactcctag aacgtgcgac 4020
ctgcctcaag aaaatgcagt tttcaaaaac agactcagca ttcagcctcc aatgaataag 4080
acatcttcca agcatataaa caattgcttt ggtttccttt tgaaaaagca tctacttgct 4140
tcagttggga aggtgcccat tccactctgc ctttgtcaca gagcagggtg ctattgtgag 4200
gccatctctg agcagtggac tcaaaagcat tttcaggcat gtcagagaag ggaggactca 4260
ctagaattag caaacaaaac caccctgaca tcctccttca ggaacacggg gagcagaggc 4320
caaagcacta aggggagggc gcatacccga gacgattgta tgaagaaaat atggaggaac 4380
tgttacatgt tcggtactaa gtcattttca ggggattgaa agactattgc tggatttcat 4440
gatgctgact ggcgttagct gattaaccca tgtaaatagg cacttaaata gaagcaggaa 4500
agggagacaa agactggctt ctggacttcc tccctgatcc ccacccttac tcatcacctg 4560
cagtggccag aattagggaa tcagaatcaa accagtgtaa ggcagtgctg gctgccattg 4620
cctggtcaca ttgaaattgg tggcttcatt ctagatgtag cttgtgcaga tgtagcagga 4680
aaataggaaa acctaccatc tcagtgagca ccagctgcct cccaaaggag gggcagccgt 4740
gcttatattt ttatggttac aatggcacaa aattattatc aacctaacta aaacattcct 4800
tttctctttt ttcctgaatt atcatggagt tttctaattc tctcttttgg aatgtagatt 4860
ttttttaaat gctttacgat gtaaaatatt tattttttac ttatctgga agatctggct 4920
gaaggattat tcatggaaca ggaagaagcg taaagactat ccatgtcatc tttgttgaga 4980
gtcttcgtga ctgtaagatt gtaaatacag attatttatt aactctgttc tgcctggaaa 5040
tttaggcttc atacggaaag tgtttgagag caagtagttg acatttatca gcaaatctct 5100
tgcaagaaca gcacaaggaa aatcagtcta ataagctgct ctgccccttg tgctcagagt 5160
ggatgttatg ggattctttt tttctctgtt ttatcttttc aagtggaatt agttggttat 5220
ccatttgcaa atgttttaaa ttgcaaagaa agccatgagg tcttcaatac tgttttaccc 5280
catcccttgt gcatatttcc agggagaagg aaagcatata cacttttttc tttcattttt 5340
ccaaaagaga aaaaaatgac aaaaggtgaa acttacatac aaatattacc tcatttgttg 5400
tgtgactgag taaagaattt ttggatcaag cggaaagagt ttaagtgtct aacaaactta 5460
aagctactgt agtacctaaa aagtcagtgt tgtacatagc ataaaaactc tgcagagaag 5520
tattcccaat aaggaaatag cattgaaatg ttaaatacaa tttctgaaag ttatgttttt 5580
tttctatcat ctggtatacc attgctttat ttttataaat tattttctca ttgccattgg 5640
aatagatatc tcagattgtg tagatatgct atttaaataa tttatcagga aatactgcct 5700
gtagagttag tatttctatt tttatataat gtttgcacac tgaattgaag aattgttggt 5760
tttttcttttttttgttttg tttttttttttttttttttttgcttttgac ctcccatttt 5820
tactattttgc caataccttt ttctaggaat gtgctttttt ttgtacacat ttttatccat 5880
tttacattct aaagcagtgt aagttgtata ttactgtttc ttatgtacaa ggaacaacaa 5940
taaatcatat ggaaatttat atttatactt actgtatcca tgcttatttg ttctctactg 6000
gctttatgtc atgaagtata tgcgtaaata ccattcataa atcaatatag catatacaaa 6060
aataaattac agtaagtcat agcaacattc acagtttgta tgtgattgag aaagactgag 6120
ttgctcaggc ctaggcttag aatttgctgc gtttgtggaa taaaagaaca aaatgataca 6180
ttagcctgcc atatcaa 6197
<![CDATA[ <210> 389]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 389]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro
20
<![CDATA[ <210> 390]]>
<![CDATA[ <211> 186]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 390]]>
Ser Gly Gly Asp Asn Ser Val Phe Asp Ile Phe Glu Leu Thr Gly Ala
1 5 10 15
Ala Arg Lys Gly Ser Gly Arg Arg Leu Val Lys Gly Pro Asp Pro Ser
20 25 30
Ser Pro Ala Phe Arg Ile Glu Asp Ala Asn Leu Ile Pro Pro Val Pro
35 40 45
Asp Asp Lys Phe Gln Asp Leu Val Asp Ala Val Arg Ala Glu Lys Gly
50 55 60
Phe Leu Leu Leu Ala Ser Leu Arg Gln Met Lys Lys Thr Arg Gly Thr
65 70 75 80
Leu Leu Ala Leu Glu Arg Lys Asp His Ser Gly Gln Val Phe Ser Val
85 90 95
Val Ser Asn Gly Lys Ala Gly Thr Leu Asp Leu Ser Leu Thr Val Gln
100 105 110
Gly Lys Gln His Val Val Ser Val Glu Glu Ala Leu Leu Ala Thr Gly
115 120 125
Gln Trp Lys Ser Ile Thr Leu Phe Val Gln Glu Asp Arg Ala Gln Leu
130 135 140
Tyr Ile Asp Cys Glu Lys Met Glu Asn Ala Glu Leu Asp Val Pro Ile
145 150 155 160
Gln Ser Val Phe Thr Arg Asp Leu Ala Ser Ile Ala Arg Leu Arg Ile
165 170 175
Ala Lys Gly Gly Val Asn Asp Asn Phe Gln
180 185
<![CDATA[ <210> 391]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 391]]>
Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu Asp
1 5 10 15
Ile Leu Arg Asn Lys Gly Cys Ser Ser
20 25
<![CDATA[ <210> 392]]>
<![CDATA[ <211> 66]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 392]]>
Thr Ser Val Leu Leu Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser
1 5 10 15
Pro Ala Ile Arg Thr Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln
20 25 30
Ala Ile Cys Gly Ile Ser Cys Asp Glu Leu Ser Ser Met Val Leu Glu
35 40 45
Leu Arg Gly Leu Arg Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg
50 55 60
Lys Val
65
<![CDATA[ <210> 393]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 393]]>
Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu
1 5 10 15
Cys Tyr His Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val
20 25 30
Asp Ser Cys Thr Glu Cys His Cys Gln
35 40
<![CDATA[ <210> 394]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 394]]>
Asn Ser Val Thr Ile Cys Lys Lys Val Ser Cys Pro Ile Met Pro Cys
1 5 10 15
Ser Asn Ala Thr Val Pro Asp Gly Glu Cys Cys Pro Arg Cys Trp
20 25 30
<![CDATA[ <210> 395]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 395]]>
Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser
1 5 10 15
Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln Gln Arg Gly Arg Ser Cys
20 25 30
Asp Ser Leu Asn Asn Arg Cys Glu Gly Ser Ser Val Gln Thr Arg Thr
35 40 45
Cys His Ile Gln Glu Cys Asp Lys Arg
50 55
<![CDATA[ <210> 396]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 396]]>
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
1 5 10 15
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
20 25 30
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
35 40 45
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro
50 55
<![CDATA[ <210> 397]]>
<![CDATA[ <211> 738]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 397]]>
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
1 5 10 15
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
20 25 30
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
35 40 45
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn Gly Gly Trp Gly
50 55 60
Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly Gly Gly Val
65 70 75 80
Gln Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr Pro Gln Phe Gly Gly
85 90 95
Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gln Ile Cys Asn Lys Gln
100 105 110
Asp Cys Pro Ile Asp Gly Cys Leu Ser Asn Pro Cys Phe Ala Gly Val
115 120 125
Lys Cys Thr Ser Tyr Pro Asp Gly Ser Trp Lys Cys Gly Ala Cys Pro
130 135 140
Pro Gly Tyr Ser Gly Asn Gly Ile Gln Cys Thr Asp Val Asp Glu Cys
145 150 155 160
Lys Glu Val Pro Asp Ala Cys Phe Asn His Asn Gly Glu His Arg Cys
165 170 175
Glu Asn Thr Asp Pro Gly Tyr Asn Cys Leu Pro Cys Pro Pro Arg Phe
180 185 190
Thr Gly Ser Gln Pro Phe Gly Gln Gly Val Glu His Ala Thr Ala Asn
195 200 205
Lys Gln Val Cys Lys Pro Arg Asn Pro Cys Thr Asp Gly Thr His Asp
210 215 220
Cys Asn Lys Asn Ala Lys Cys Asn Tyr Leu Gly His Tyr Ser Asp Pro
225 230 235 240
Met Tyr Arg Cys Glu Cys Lys Pro Gly Tyr Ala Gly Asn Gly Ile Ile
245 250 255
Cys Gly Glu Asp Thr Asp Leu Asp Gly Trp Pro Asn Glu Asn Leu Val
260 265 270
Cys Val Ala Asn Ala Thr Tyr His Cys Lys Lys Asp Asn Cys Pro Asn
275 280 285
Leu Pro Asn Ser Gly Gln Glu Asp Tyr Asp Lys Asp Gly Ile Gly Asp
290 295 300
Ala Cys Asp Asp Asp Asp Asp Asn Asp Lys Ile Pro Asp Asp Arg Asp
305 310 315 320
Asn Cys Pro Phe His Tyr Asn Pro Ala Gln Tyr Asp Tyr Asp Arg Asp
325 330 335
Asp Val Gly Asp Arg Cys Asp Asn Cys Pro Tyr Asn His Asn Pro Asp
340 345 350
Gln Ala Asp Thr Asp Asn Asn Gly Glu Gly Asp Ala Cys Ala Ala Asp
355 360 365
Ile Asp Gly Asp Gly Ile Leu Asn Glu Arg Asp Asn Cys Gln Tyr Val
370 375 380
Tyr Asn Val Asp Gln Arg Asp Thr Asp Met Asp Gly Val Gly Asp Gln
385 390 395 400
Cys Asp Asn Cys Pro Leu Glu His Asn Pro Asp Gln Leu Asp Ser Asp
405 410 415
Ser Asp Arg Ile Gly Asp Thr Cys Asp Asn Asn Gln Asp Ile Asp Glu
420 425 430
Asp Gly His Gln Asn Asn Leu Asp Asn Cys Pro Tyr Val Pro Asn Ala
435 440 445
Asn Gln Ala Asp His Asp Lys Asp Gly Lys Gly Asp Ala Cys Asp His
450 455 460
Asp Asp Asp Asn Asp Gly Ile Pro Asp Asp Lys Asp Asn Cys Arg Leu
465 470 475 480
Val Pro Asn Pro Asp Gln Lys Asp Ser Asp Gly Asp Gly Arg Gly Asp
485 490 495
Ala Cys Lys Asp Asp Phe Asp His Asp Ser Val Pro Asp Ile Asp Asp
500 505 510
Ile Cys Pro Glu Asn Val Asp Ile Ser Glu Thr Asp Phe Arg Arg Phe
515 520 525
Gln Met Ile Pro Leu Asp Pro Lys Gly Thr Ser Gln Asn Asp Pro Asn
530 535 540
Trp Val Val Arg His Gln Gly Lys Glu Leu Val Gln Thr Val Asn Cys
545 550 555 560
Asp Pro Gly Leu Ala Val Gly Tyr Asp Glu Phe Asn Ala Val Asp Phe
565 570 575
Ser Gly Thr Phe Phe Ile Asn Thr Glu Arg Asp Asp Asp Tyr Ala Gly
580 585 590
Phe Val Phe Gly Tyr Gln Ser Ser Ser Arg Phe Tyr Val Val Met Trp
595 600 605
Lys Gln Val Thr Gln Ser Tyr Trp Asp Thr Asn Pro Thr Arg Ala Gln
610 615 620
Gly Tyr Ser Gly Leu Ser Val Lys Val Val Asn Ser Thr Thr Gly Pro
625 630 635 640
Gly Glu His Leu Arg Asn Ala Leu Trp His Thr Gly Asn Thr Pro Gly
645 650 655
Gln Val Arg Thr Leu Trp His Asp Pro Arg His Ile Gly Trp Lys Asp
660 665 670
Phe Thr Ala Tyr Arg Trp Arg Leu Ser His Arg Pro Lys Thr Gly Phe
675 680 685
Ile Arg Val Val Met Tyr Glu Gly Lys Lys Ile Met Ala Asp Ser Gly
690 695 700
Pro Ile Tyr Asp Lys Thr Tyr Ala Gly Gly Arg Leu Gly Leu Phe Val
705 710 715 720
Phe Ser Gln Glu Met Val Phe Phe Ser Asp Leu Lys Tyr Glu Cys Arg
725 730 735
Asp Pro
<![CDATA[ <210> 398]]>
<![CDATA[ <211> 1170]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 398]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro Glu Ser Gly Gly Asp Asn Ser Val Phe Asp
20 25 30
Ile Phe Glu Leu Thr Gly Ala Ala Arg Lys Gly Ser Gly Arg Arg Leu
35 40 45
Val Lys Gly Pro Asp Pro Ser Ser Ser Pro Ala Phe Arg Ile Glu Asp Ala
50 55 60
Asn Leu Ile Pro Pro Val Pro Asp Asp Lys Phe Gln Asp Leu Val Asp
65 70 75 80
Ala Val Arg Ala Glu Lys Gly Phe Leu Leu Leu Ala Ser Leu Arg Gln
85 90 95
Met Lys Lys Thr Arg Gly Thr Leu Leu Ala Leu Glu Arg Lys Asp His
100 105 110
Ser Gly Gln Val Phe Ser Val Val Ser Asn Gly Lys Ala Gly Thr Leu
115 120 125
Asp Leu Ser Leu Thr Val Gln Gly Lys Gln His Val Val Ser Val Glu
130 135 140
Glu Ala Leu Leu Ala Thr Gly Gln Trp Lys Ser Ile Thr Leu Phe Val
145 150 155 160
Gln Glu Asp Arg Ala Gln Leu Tyr Ile Asp Cys Glu Lys Met Glu Asn
165 170 175
Ala Glu Leu Asp Val Pro Ile Gln Ser Val Phe Thr Arg Asp Leu Ala
180 185 190
Ser Ile Ala Arg Leu Arg Ile Ala Lys Gly Gly Val Asn Asp Asn Phe
195 200 205
Gln Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu
210 215 220
Asp Ile Leu Arg Asn Lys Gly Cys Ser Ser Ser Thr Ser Val Leu Leu
225 230 235 240
Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser Pro Ala Ile Arg Thr
245 250 255
Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln Ala Ile Cys Gly Ile
260 265 270
Ser Cys Asp Glu Leu Ser Ser Ser Met Val Leu Glu Leu Arg Gly Leu Arg
275 280 285
Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg Lys Val Thr Glu Glu
290 295 300
Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu Cys Tyr His
305 310 315 320
Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val Asp Ser Cys
325 330 335
Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys Lys Lys Val Ser
340 345 350
Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro Asp Gly Glu Cys
355 360 365
Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro
370 375 380
Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln
385 390 395 400
Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Asn Arg Cys Glu Gly Ser
405 410 415
Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys Asp Lys Arg Phe
420 425 430
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
435 440 445
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
450 455 460
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
465 470 475 480
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn Gly Gly Trp Gly
485 490 495
Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly Gly Gly Val
500 505 510
Gln Lys Arg Ser Arg Leu Cys Asn Asn Pro Thr Pro Gln Phe Gly Gly
515 520 525
Lys Asp Cys Val Gly Asp Val Thr Glu Asn Gln Ile Cys Asn Lys Gln
530 535 540
Asp Cys Pro Ile Asp Gly Cys Leu Ser Asn Pro Cys Phe Ala Gly Val
545 550 555 560
Lys Cys Thr Ser Tyr Pro Asp Gly Ser Trp Lys Cys Gly Ala Cys Pro
565 570 575
Pro Gly Tyr Ser Gly Asn Gly Ile Gln Cys Thr Asp Val Asp Glu Cys
580 585 590
Lys Glu Val Pro Asp Ala Cys Phe Asn His Asn Gly Glu His Arg Cys
595 600 605
Glu Asn Thr Asp Pro Gly Tyr Asn Cys Leu Pro Cys Pro Pro Arg Phe
610 615 620
Thr Gly Ser Gln Pro Phe Gly Gln Gly Val Glu His Ala Thr Ala Asn
625 630 635 640
Lys Gln Val Cys Lys Pro Arg Asn Pro Cys Thr Asp Gly Thr His Asp
645 650 655
Cys Asn Lys Asn Ala Lys Cys Asn Tyr Leu Gly His Tyr Ser Asp Pro
660 665 670
Met Tyr Arg Cys Glu Cys Lys Pro Gly Tyr Ala Gly Asn Gly Ile Ile
675 680 685
Cys Gly Glu Asp Thr Asp Leu Asp Gly Trp Pro Asn Glu Asn Leu Val
690 695 700
Cys Val Ala Asn Ala Thr Tyr His Cys Lys Lys Asp Asn Cys Pro Asn
705 710 715 720
Leu Pro Asn Ser Gly Gln Glu Asp Tyr Asp Lys Asp Gly Ile Gly Asp
725 730 735
Ala Cys Asp Asp Asp Asp Asp Asn Asp Lys Ile Pro Asp Asp Arg Asp
740 745 750
Asn Cys Pro Phe His Tyr Asn Pro Ala Gln Tyr Asp Tyr Asp Arg Asp
755 760 765
Asp Val Gly Asp Arg Cys Asp Asn Cys Pro Tyr Asn His Asn Pro Asp
770 775 780
Gln Ala Asp Thr Asp Asn Asn Gly Glu Gly Asp Ala Cys Ala Ala Asp
785 790 795 800
Ile Asp Gly Asp Gly Ile Leu Asn Glu Arg Asp Asn Cys Gln Tyr Val
805 810 815
Tyr Asn Val Asp Gln Arg Asp Thr Asp Met Asp Gly Val Gly Asp Gln
820 825 830
Cys Asp Asn Cys Pro Leu Glu His Asn Pro Asp Gln Leu Asp Ser Asp
835 840 845
Ser Asp Arg Ile Gly Asp Thr Cys Asp Asn Asn Gln Asp Ile Asp Glu
850 855 860
Asp Gly His Gln Asn Asn Leu Asp Asn Cys Pro Tyr Val Pro Asn Ala
865 870 875 880
Asn Gln Ala Asp His Asp Lys Asp Gly Lys Gly Asp Ala Cys Asp His
885 890 895
Asp Asp Asp Asn Asp Gly Ile Pro Asp Asp Lys Asp Asn Cys Arg Leu
900 905 910
Val Pro Asn Pro Asp Gln Lys Asp Ser Asp Gly Asp Gly Arg Gly Asp
915 920 925
Ala Cys Lys Asp Asp Phe Asp His Asp Ser Val Pro Asp Ile Asp Asp
930 935 940
Ile Cys Pro Glu Asn Val Asp Ile Ser Glu Thr Asp Phe Arg Arg Phe
945 950 955 960
Gln Met Ile Pro Leu Asp Pro Lys Gly Thr Ser Gln Asn Asp Pro Asn
965 970 975
Trp Val Val Arg His Gln Gly Lys Glu Leu Val Gln Thr Val Asn Cys
980 985 990
Asp Pro Gly Leu Ala Val Gly Tyr Asp Glu Phe Asn Ala Val Asp Phe
995 1000 1005
Ser Gly Thr Phe Phe Ile Asn Thr Glu Arg Asp Asp Asp Tyr Ala
1010 1015 1020
Gly Phe Val Phe Gly Tyr Gln Ser Ser Ser Arg Phe Tyr Val Val
1025 1030 1035
Met Trp Lys Gln Val Thr Gln Ser Tyr Trp Asp Thr Asn Pro Thr
1040 1045 1050
Arg Ala Gln Gly Tyr Ser Gly Leu Ser Val Lys Val Val Asn Ser
1055 1060 1065
Thr Thr Gly Pro Gly Glu His Leu Arg Asn Ala Leu Trp His Thr
1070 1075 1080
Gly Asn Thr Pro Gly Gln Val Arg Thr Leu Trp His Asp Pro Arg
1085 1090 1095
His Ile Gly Trp Lys Asp Phe Thr Ala Tyr Arg Trp Arg Leu Ser
1100 1105 1110
His Arg Pro Lys Thr Gly Phe Ile Arg Val Val Met Tyr Glu Gly
1115 1120 1125
Lys Lys Ile Met Ala Asp Ser Gly Pro Ile Tyr Asp Lys Thr Tyr
1130 1135 1140
Ala Gly Gly Arg Leu Gly Leu Phe Val Phe Ser Gln Glu Met Val
1145 1150 1155
Phe Phe Ser Asp Leu Lys Tyr Glu Cys Arg Asp Pro
1160 1165 1170
<![CDATA[ <210> 399]]>
<![CDATA[ <211> 1650]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 399]]>
agccgctgcg cccgagctgg cctgcgagtt cagggctcct gtcgctctcc aggagcaacc 60
tctactccgg acgcacaggc attccccgcg cccctccagc cctcgccgcc ctcgccaccg 120
ctcccggccg ccgcgctccg gtacacacag gatccctgct gggcaccaac agctccacca 180
tggggctggc ctggggacta ggcgtcctgt tcctgatgca tgtgtgtggc accaaccgca 240
ttccagagtc tggcggagac aacagcgtgt ttgacatctt tgaactcacc ggggccgccc 300
gcaaggggtc tgggcgccga ctggtgaagg gccccgaccc ttccagccca gctttccgca 360
tcgaggatgc caacctgatc ccccctgtgc ctgatgacaa gttccaagac ctggtggatg 420
ctgtgcgggc agaaaagggt ttcctccttc tggcatccct gaggcagatg aagaagaccc 480
ggggcacgct gctggccctg gagcggaaag accactctgg ccaggtcttc agcgtggtgt 540
ccaatggcaa ggcgggcacc ctggacctca gcctgaccgt ccaaggaaag cagcacgtgg 600
tgtctgtgga agaagctctc ctggcaaccg gccagtggaa gagcatcacc ctgtttgtgc 660
aggaagacag ggcccagctg tacatcgact gtgaaaagat ggagaatgct gagttggacg 720
tccccatcca aagcgtcttc accagagacc tggccagcat cgccagactc cgcatcgcaa 780
aggggggcgt caatgacaat ttccagggggg tgctgcagaa tgtgaggttt gtctttggaa 840
ccacaccaga agacatcctc aggaacaaag gctgctccag ctctaccagt gtcctcctca 900
cccttgacaa caacgtggtg aatggttcca gccctgccat ccgcactaac tacattggcc 960
acaagacaaa ggacttgcaa gccatctgcg gcatctcctg tgatgagctg tccagcatgg 1020
tcctggaact caggggcctg cgcaccattg tgaccacgct gcaggacagc atccgcaaag 1080
tgactgaaga gaacaaagag ttggccaatg agctgaggcg gcctccccta tgctatcaca 1140
acggagttca gtacagaaat aacgaggaat ggactgttga tagctgcact gagtgtcact 1200
gtcagaactc agttaccatc tgcaaaaagg tgtcctgccc catcatgccc tgctccaatg 1260
ccacagttcc tgatggagaa tgctgtcctc gctgttggcc cagcgactct gcggacgatg 1320
gctggtctcc atggtccgag tggacctcct gttctacgag ctgtggcaat ggaattcagc 1380
agcgcggccg ctcctgcgat agcctcaaca accgatgtga gggctcctcg gtccagacac 1440
ggacctgcca cattcaggag tgtgacaaga gatttaaaca ggatggtggc tggagccact 1500
ggtccccgtg gtcatcttgt tctgtgacat gtggtgatgg tgtgatcaca aggatccggc 1560
tctgcaactc tcccagcccc cagatgaacg ggaaaccctg tgaaggcgaa gcgcggggaga 1620
ccaaagcctg caagaaagac gcctgcccca 1650
<![CDATA[ <210> 400]]>
<![CDATA[ <211> 490]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 400]]>
Met Gly Leu Ala Trp Gly Leu Gly Val Leu Phe Leu Met His Val Cys
1 5 10 15
Gly Thr Asn Arg Ile Pro Glu Ser Gly Gly Asp Asn Ser Val Phe Asp
20 25 30
Ile Phe Glu Leu Thr Gly Ala Ala Arg Lys Gly Ser Gly Arg Arg Leu
35 40 45
Val Lys Gly Pro Asp Pro Ser Ser Ser Pro Ala Phe Arg Ile Glu Asp Ala
50 55 60
Asn Leu Ile Pro Pro Val Pro Asp Asp Lys Phe Gln Asp Leu Val Asp
65 70 75 80
Ala Val Arg Ala Glu Lys Gly Phe Leu Leu Leu Ala Ser Leu Arg Gln
85 90 95
Met Lys Lys Thr Arg Gly Thr Leu Leu Ala Leu Glu Arg Lys Asp His
100 105 110
Ser Gly Gln Val Phe Ser Val Val Ser Asn Gly Lys Ala Gly Thr Leu
115 120 125
Asp Leu Ser Leu Thr Val Gln Gly Lys Gln His Val Val Ser Val Glu
130 135 140
Glu Ala Leu Leu Ala Thr Gly Gln Trp Lys Ser Ile Thr Leu Phe Val
145 150 155 160
Gln Glu Asp Arg Ala Gln Leu Tyr Ile Asp Cys Glu Lys Met Glu Asn
165 170 175
Ala Glu Leu Asp Val Pro Ile Gln Ser Val Phe Thr Arg Asp Leu Ala
180 185 190
Ser Ile Ala Arg Leu Arg Ile Ala Lys Gly Gly Val Asn Asp Asn Phe
195 200 205
Gln Gly Val Leu Gln Asn Val Arg Phe Val Phe Gly Thr Thr Pro Glu
210 215 220
Asp Ile Leu Arg Asn Lys Gly Cys Ser Ser Ser Thr Ser Val Leu Leu
225 230 235 240
Thr Leu Asp Asn Asn Val Val Asn Gly Ser Ser Pro Ala Ile Arg Thr
245 250 255
Asn Tyr Ile Gly His Lys Thr Lys Asp Leu Gln Ala Ile Cys Gly Ile
260 265 270
Ser Cys Asp Glu Leu Ser Ser Ser Met Val Leu Glu Leu Arg Gly Leu Arg
275 280 285
Thr Ile Val Thr Thr Leu Gln Asp Ser Ile Arg Lys Val Thr Glu Glu
290 295 300
Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu Cys Tyr His
305 310 315 320
Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val Asp Ser Cys
325 330 335
Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys Lys Lys Val Ser
340 345 350
Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro Asp Gly Glu Cys
355 360 365
Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp Gly Trp Ser Pro
370 375 380
Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly Asn Gly Ile Gln
385 390 395 400
Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Asn Arg Cys Glu Gly Ser
405 410 415
Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys Asp Lys Arg Phe
420 425 430
Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser Ser Cys Ser
435 440 445
Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Arg Leu Cys Asn Ser
450 455 460
Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu Ala Arg Glu
465 470 475 480
Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro
485 490
<![CDATA[ <210> 401]]>
<![CDATA[ <211> 106]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 401]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccagggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgag 106
<![CDATA[ <210> 402]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 402]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga c 101
<![CDATA[ <210> 403]]>
<![CDATA[ <211> 207]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 403]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccagggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgagctac atctacatcc 120
accactggga caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat 180
ggaggggagt gcttcatggt gaaagac 207
<![CDATA[ <210> 404]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 404]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val
35 40 45
Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
Phe Met Val Lys Asp
65
<![CDATA[ <210> 405]]>
<![CDATA[ <211> 254]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 405]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caag 254
<![CDATA[ <210> 406]]>
<![CDATA[ <211> 262]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 406]]>
gttcgggtct ggcggtactt gaagggcaaa gacctggtgg cccggggagag cctgctggac 60
ggcggcaaca aggtggtgat cagcggcttt ggagacccccc tcatctgtga caaccaggtg 120
tccactgggg acaccaggat cttctttgtg aaccctgcac ccccatacct gtggccagcc 180
cacaagaacg agctgatgct caactccagc ctcatgcgga tcaccctgcg gaacctggag 240
gaggtggagt tctgtgtgga ag 262
<![CDATA[ <210> 407]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 407]]>
ataaacccgg gacccacttc actccagtgc ctccgacgcc tcctgatg 48
<![CDATA[ <210> 408]]>
<![CDATA[ <211> 216]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 408]]>
cgtgccgggg aatgctgtgc ggcttcggcg ccgtgtgcga gcccaacgcg gaggggccgg 60
gccgggcgtc ctgcgtctgc aagaagagcc cgtgccccag cgtggtggcg cctgtgtgtg 120
ggtcggacgc ctccacctac agcaacgaat gcgagctgca gcgggcgcag tgcagccagc 180
agcgccgcat ccgcctgctc agccgcgggc cgtgcg 216
<![CDATA[ <210> 409]]>
<![CDATA[ <211> 225]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 409]]>
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 60
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgaggggga 120
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 180
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtg 225
<![CDATA[ <210> 410]]>
<![CDATA[ <211> 225]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 410]]>
accccctgtca gggcgccctc cctgacccga gccgcagctg ccgtgtgaac ccgcgcacgc 60
ggcgccctga gatgctccta cggcccgaga gctgccctgc ccggcaggcg ccagtgtgtg 120
gggacgacgg agtcacctac gaaaacgact gtgtcatggg ccgatcgggg gccgcccggg 180
gtctcctcct gcagaaagtg cgctccggcc agtgccaggg tcgag 225
<![CDATA[ <210> 411]]>
<![CDATA[ <211> 207]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 411]]>
accagtgccc ggagccctgc cggttcaatg ccgtgtgcct gtcccgccgt ggccgtcccc 60
gctgctcctg cgaccgcgtc acctgtgacg gggcctacag gcccgtgtgt gcccaggacg 120
ggcgcacgta tgacagtgat tgctggcggc agcaggctga gtgccggcag cagcgtgcca 180
tccccagcaa gcaccagggc ccgtgtg 207
<![CDATA[ <210> 412]]>
<![CDATA[ <211> 219]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 412]]>
accaggcccc gtccccatgc ctcggggtgc agtgtgcatt tggggcgacg tgtgctgtga 60
agaacgggca ggcagcgtgt gaatgcctgc aggcgtgctc gagcctctac gatcctgtgt 120
gcggcagcga cggcgtcaca tacggcagcg cgtgcgagct ggaggccacg gcctgtaccc 180
tcgggcggga gatccaggtg gcgcgcaaag gaccctgtg 219
<![CDATA[ <210> 413]]>
<![CDATA[ <211> 195]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 413]]>
accgctgcgg gcagtgccgc tttggagccc tgtgcgaggc cgagaccggg cgctgcgtgt 60
gcccctctga atgcgtggct ttggcccagc ccgtgtgtgg ctccgacggg cacacgtacc 120
ccagcgagtg catgctgcac gtgcacgcct gcacacacca gatcagcctg cacgtggcct 180
cagctggacc ctgtg 195
<![CDATA[ <210> 414]]>
<![CDATA[ <211> 201]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 414]]>
agacctgtgg agatgccgtg tgtgcttttg gggctgtgtg ctccgcaggg cagtgtgtgt 60
gtccccggtg tgagcacccc ccgcccggcc ccgtgtgtgg cagcgacggt gtcacctacg 120
gcagtgcctg cgagctacgg gaagccgcct gcctccagca gacacagatc gaggaggccc 180
gggcagggcc gtgcgagcag g 201
<![CDATA[ <210> 415]]>
<![CDATA[ <211> 149]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 415]]>
ccgagtgcgg ttccggaggc tctggctctg gggaggacgg tgactgtgag caggagctgt 60
gccggcagcg cggtggcatc tgggacgagg actcggagga cgggccgtgt gtctgtgact 120
tcagctgcca gagtgtccca ggcagcccg 149
<![CDATA[ <210> 416]]>
<![CDATA[ <211> 106]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 416]]>
gtgtgcggct cagatggggt cacctacagc accgagtgtg agctgaagaa ggccagggtgt 60
gagtcacagc gagggctcta cgtagcggcc cagggagcct gccgag 106
<![CDATA[ <210> 417]]>
<![CDATA[ <211> 5088]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 417]]>
gccccacctt cgccccgctg ccgcctgtgg cccccttaca ctgtgcccag acgccctacg 60
gctgctgcca ggacaatatc accgcagccc ggggcgtggg cctggctggc tgccccagtg 120
cctgccagtg caacccccat ggctcttacg gcggcacctg tgacccagcc acaggccagt 180
gctcctgccg cccaggtgtg gggggcctca ggtgtgaccg ctgtgagcct ggcttctgga 240
actttcgagg catcgtcacc gatggccgga gtggctgtac accctgcagc tgtgatcccc 300
aaggcgccgt gcgggatgac tgtgagcaga tgacggggct gtgctcgtgt aagcccgggg 360
tggctggacc caagtgtggg cagtgtccag acggccgtgc cctgggcccc gcgggctgtg 420
aagctgacgc ttctgcgcct gcgacctgtg cggagatgcg ctgtgagttc ggtgcgcggt 480
gcgtggagga gtctggctca gcccactgtg tctgcccgat gctcacctgt ccagaggcca 540
acgctaccaa ggtctgtggg tcagatggag tcacatacgg caacgagtgt cagctgaaga 600
ccatcgcctg ccgccagggc ctgcaaatct ctatccagag cctgggcccg tgccaggagg 660
ctgttgctcc cagcactcac ccgacatctg cctccgtgac tgtgaccacc ccagggctcc 720
tcctgagcca ggcactgccg gccccccccg gcgccctccc cctggctccc agcagtaccg 780
cacacagcca gaccacccct ccgccctcat cacgacctcg gaccactgcc agcgtcccca 840
ggaccaccgt gtggcccgtg ctgacggtgc cccccacggc accctcccct gcacccagcc 900
tggtggcgtc cgcctttggt gaatctggca gcactgatgg aagcagcgat gaggaactga 960
gcggggacca ggaggccagt gggggtggct ctggggggct cgagcccttg gagggcagca 1020
gcgtggccac ccctgggcca cctgtcgaga gggcttcctg ctacaactcc gcgttgggct 1080
gctgctctga tgggaagacg ccctcgctgg acgcagagggg ctccaactgc cccgccacca 1140
aggtgttcca gggcgtcctg gagctggagg gcgtcgaggg ccaggagctg ttctacacgc 1200
ccgagatggc tgaccccaag tcagaactgt tcggggagac agccaggagc attgagagca 1260
ccctggacga cctcttccgg aattcagacg tcaagaagga ttttcggagt gtccgcttgc 1320
gggacctggg gcccggcaaa tccgtccgcg ccattgtgga tgtgcacttt gaccccacca 1380
cagccttcag ggcacccgac gtggcccggg ccctgctccg gcagatccag gtgtccaggc 1440
gccggtcctt gggggtgagg cggccgctgc aggagcacgt gcgattatg gactttgact 1500
ggtttcctgc gtttatcacg ggggccacgt caggagccat tgctgcggga gccacggcca 1560
gagccaccac tgcatcgcgc ctgccgtcct ctgctgtgac ccctcgggcc ccgcacccca 1620
gtcacacaag ccagccccgtt gccaagacca cggcagcccc caccaacacgt cggcccccca 1680
ccactgcccc cagccgtgtg cccggacgtc ggcccccggc cccccagcag cctccaaagc 1740
cctgtgactc acagccctgc ttccacgggg ggacctgcca ggactgggca ttgggcgggg 1800
gcttcacctg cagctgcccg gcaggcaggg gaggcgccgt ctgtgagaag gtgcttggcg 1860
cccctgtgcc ggccttcgag ggccgctcct tcctggcctt ccccactctc cgcgcctacc 1920
acacgctgcg cctggcactg gaattccggg cgctggagcc tcaggggctg ctgctgtaca 1980
atggcaacgc ccggggcaag gacttcctgg cattggcgct gctagatggc cgcgtgcagc 2040
tcaggtttga cacaggttcg gggccggcgg tgctgaccag tgccgtgccg gtagagccgg 2100
gccagtggca ccgcctggag ctgtcccggc actggcgccg gggcaccctc tcggtggatg 2160
gtgagacccc tgttctgggc gagagtccca gtggcaccga cggcctcaac ctggacacag 2220
acctctttgt gggcggcgta cccgaggacc aggctgccgt ggcgctggag cggaccttcg 2280
tgggcgccgg cctgaggggg tgcatccgtt tgctggacgt caacaaccag cgcctggagc 2340
ttggcattgg gccgggggct gccacccgag gctctggcgt gggcgagtgc ggggaccacc 2400
cctgcctgcc caacccctgc catggcgggg ccccatgcca gaacctggag gctggaaggt 2460
tccattgcca gtgcccgccc ggccgcgtcg gaccaacctg tgccgatgag aagagcccct 2520
gccagcccaa cccctgccat ggggcggcgc cctgccgtgt gctgcccgag ggtggtgctc 2580
agtgcgagtg ccccctgggg cgtgagggca ccttctgcca gacagcctcg gggcaggacg 2640
gctctgggcc cttcctggct gacttcaacg gcttctccca cctggagctg agaggcctgc 2700
acacctttgc acgggacctg ggggagaaga tggcgctgga ggtcgtgttc ctggcacgag 2760
gccccagcgg cctcctgctc tacaacgggc agaagacgga cggcaagggg gacttcgtgt 2820
cgctggcact gcgggaccgc cgcctggagt tccgctacga cctgggcaag ggggcagcgg 2880
tcatcaggag cagggagcca gtcaccctgg gagcctggac cagggtctca ctggagcgaa 2940
acggccgcaa gggtgccctg cgtgtgggcg acggcccccg tgtgttgggg gagtccccga 3000
aatcccgcaa ggttccgcac accgtcctca acctgaagga gccgctctac gtagggggcg 3060
ctcccgactt cagcaagctg gcccgtgctg ctgccgtgtc ctctggcttc gacggtgcca 3120
tccagctggt ctccctcgga ggccgccagc tgctgacccc ggagcacgtg ctgcggcagg 3180
tggacgtcac gtcctttgca ggtcacccct gcacccgggc ctcaggccac ccctgcctca 3240
atggggcctc ctgcgtcccg agggaggctg cctatgtgtg cctgtgtccc gggggattct 3300
caggaccgca ctgcgagaag gggctggtgg agaagtcagc gggggacgtg gataccttgg 3360
cctttgacgg gcggaccttt gtcgagtacc tcaacgctgt gaccgagagc gaactggcca 3420
atgagatccc cgtccccgaa actctggatt ccggggccct tcacagcgag aaggcactgc 3480
agagcaacca ctttgaactg agcctgcgca ctgaggccac gcaggggctg gtgctctgga 3540
gtggcaaggc cacggagcgg gcagactatg tggcactggc cattgtggac gggcacctgc 3600
aactgagcta caacctgggc tcccagcccg tggtgctgcg ttccaccgtg cccgtcaaca 3660
ccaaccgctg gttgcgggtc gtggcacata gggagcagag ggaaggttcc ctgcaggtgg 3720
gcaatgaggc ccctgtgacc ggctcctccc cgctgggcgc cacgcagctg gacactgatg 3780
gagccctgtg gcttgggggc ctgccggagc tgcccgtggg cccagcactg cccaaggcct 3840
acggcacagg ctttgtgggc tgcttgcggg acgtggtggt gggccggcac ccgctgcacc 3900
tgctggagga cgccgtcacc aagccagagc tgcggccctg ccccacccca tgagctggca 3960
ccagagcccc gcgcccgctg taattatttt ctatttttgt aaacttgttg ctttttgata 4020
tgattttctt gcctgagtgttggccggagg gactgctggc ccggcctccc ttccgtccag 4080
gcagccgtgc tgcagacaga cctagtgccg agggatggac aggcgaggtg gcagcgtgga 4140
gggctcggcg tggatggcag cctcaggaca cacacccctg cctcaaggtg ctgagccccc 4200
gccttgcact gcgcctgccc cacggtgtcc ccgccgggaa gcagccccgg ctcctgaatc 4260
accctcgctc cgtcaggcgg gactcgtgtc ccagagagga aggggctgct gaggtctgat 4320
ggggcccttc ctccgggtga ccccacagggg cctttccaag cccccatttg agctgctcct 4380
tcctgtgtgt gctctgggcc ctgcctcggc ctcctgcgcc aatactgtga cttccaaaca 4440
atgttactgc tgggcacagc tctgcgttgc tcccgtgctg cctgcgccag ccccaggctg 4500
ctgaggagca gaggccagac cagggccgat ctgggtgtcc tgaccctcag ctggccctgc 4560
ccagccacccc tggacgtgac cgtatccctc tgccacacccc caggccctgc gaggggctat 4620
cgagaggagc tcactgtggg atggggttga cctctgccgc ctgcctgggt atctgggcct 4680
ggccatggct gtgttcttca tgtgttgatt ttatttgacc cctggagtgg tgggtctcat 4740
ctttcccatc tcgcctgaga gcggctgagg gctgcctcac tgcaaatcct ccccacagcg 4800
tcagtgaaag tcgtccttgt ctcagaatga ccaggggcca gccagtgtct gaccaaggtc 4860
aaggggcagg tgcagaggtg gcagggatgg ctccgaagcc agaaatgcct taaactgcaa 4920
cgtcccgtcc cttccccacc cccatcccat ccccacccccc agccccagcc cagtcctcct 4980
aggagcagga cccgatgaag cgggcggcgg tggggctggg tgccgtgtta ctaactctag 5040
tatgtttctg tgtcaatcgc tgtgaaataa agtctgaaaa ctttaaaa 5088
<![CDATA[ <210> 418]]>
<![CDATA[ <211> 7395]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of AGRN]]>
<![CDATA[ <400> 418]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caaggttcgg gtctggcggt acttgaaggg caaagacctg gtggcccggg 300
agagcctgct ggacggcggc aacaaggtgg tgatcagcgg ctttggagac cccctcatct 360
gtgacaacca ggtgtccact ggggaccacca ggatcttctt tgtgaaccct gcacccccat 420
acctgtggcc agcccacaag aacgagctga tgctcaactc cagcctcatg cggatcaccc 480
tgcggaacct gaggaggtg gagttctgtg tggaagataa acccgggacc cacttcactc 540
cagtgcctcc gacgcctcct gatgcgtgcc ggggaatgct gtgcggcttc ggcgccgtgt 600
gcgagcccaa cgcggagggg ccgggccggg cgtcctgcgt ctgcaagaag agcccgtgcc 660
ccagcgtggt ggcgcctgtg tgtgggtcgg acgcctccac ctacagcaac gaatgcgagc 720
tgcagcgggc gcagtgcagc cagcagcgcc gcatccgcct gctcagccgc gggccgtgcg 780
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 840
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgagggga 900
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 960
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtgacccc tgtcagggcg 1020
ccctccctga cccgagccgc agctgccgtg tgaacccgcg cacgcggcgc cctgagatgc 1080
tcctacggcc cgagagctgc cctgcccggc aggcgccagt gtgtggggac gacggagtca 1140
cctacgaaaa cgactgtgtc atgggccgat cgggggccgc ccggggtctc ctcctgcaga 1200
aagtgcgctc cggccagtgc cagggtcgag accagtgccc ggagccctgc cggttcaatg 1260
ccgtgtgcct gtcccgccgt ggccgtcccc gctgctcctg cgaccgcgtc acctgtgacg 1320
gggcctacag gcccgtgtgt gcccaggacg ggcgcacgta tgacagtgat tgctggcggc 1380
agcaggctga gtgccggcag cagcgtgcca tccccagcaa gcaccagggc ccgtgtgacc 1440
aggccccgtc cccatgcctc ggggtgcagt gtgcattgg ggcgacgtgt gctgtgaaga 1500
acgggcaggc agcgtgtgaa tgcctgcagg cgtgctcgag cctctacgat cctgtgtgcg 1560
gcagcgacgg cgtcacatac ggcagcgcgt gcgagctgga ggccacggcc tgtaccctcg 1620
ggcggggagat ccaggtggcg cgcaaaggac cctgtgaccg ctgcggggcag tgccgctttg 1680
gagccctgtg cgaggccgag accgggcgct gcgtgtgccc ctctgaatgc gtggctttgg 1740
cccagcccgt gtgtggctcc gacgggcaca cgtaccccag cgagtgcatg ctgcacgtgc 1800
acgcctgcac acaccagatc agcctgcacg tggcctcagc tggaccctgt gagacctgtg 1860
gagatgccgt gtgtgctttt ggggctgtgt gctccgcagg gcagtgtgtg tgtccccggt 1920
gtgagcaccc cccgcccggc cccgtgtgtg gcagcgacgg tgtcacctac ggcagtgcct 1980
gcgagctacg ggaagccgcc tgcctccagc agacacagat cgaggaggcc cgggcagggc 2040
cgtgcgagca ggccgagtgc ggttccggag gctctggctc tggggaggac ggtgactgtg 2100
agcaggagct gtgccggcag cgcggtggca tctgggacga ggactcggag gacgggccgt 2160
gtgtctgtga cttcagctgc cagagtgtcc caggcagccc ggtgtgcggc tcagatgggg 2220
tcacctacag caccgagtgt gagctgaaga aggccaggtg tgagtcacag cgagggctct 2280
acgtagcggc ccagggagcc tgccgaggcc ccaccttcgc cccgctgccg cctgtggccc 2340
ccttacactg tgcccagacg ccctacggct gctgccagga caatatcacc gcagcccggg 2400
gcgtgggcct ggctggctgc cccagtgcct gccagtgcaa cccccatggc tcttacggcg 2460
gcacctgtga cccagccaca ggccagtgct cctgccgccc aggtgtgggg ggcctcaggt 2520
gtgaccgctg tgagcctggc ttctggaact ttcgaggcat cgtcaccgat ggccggagtg 2580
gctgtacacc ctgcagctgt gatccccaag gcgccgtgcg ggatgactgt gagcagatga 2640
cggggctgtg ctcgtgtaag cccggggtgg ctggacccaa gtgtggggcag tgtccagacg 2700
gccgtgccct gggccccgcg ggctgtgaag ctgacgcttc tgcgcctgcg acctgtgcgg 2760
agatgcgctg tgagttcggt gcgcggtgcg tggaggagtc tggctcagcc cactgtgtct 2820
gcccgatgct cacctgtcca gaggccaacg ctaccaaggt ctgtgggtca gatggagtca 2880
catacggcaa cgagtgtcag ctgaagacca tcgcctgccg ccagggcctg caaatctcta 2940
tccagagcct gggcccgtgc caggaggctg ttgctcccag cactcacccg acatctgcct 3000
ccgtgactgt gaccacccca gggctcctcc tgagccaggc actgccggcc ccccccggcg 3060
ccctccccct ggctcccagc agtaccgcac acagccagac cacccctccg ccctcatcac 3120
gacctcggac cactgccagc gtccccagga ccaccgtgtg gcccgtgctg acggtgcccc 3180
ccacggcacc ctcccctgca cccagcctgg tggcgtccgc ctttggtgaa tctggcagca 3240
ctgatggaag cagcgatgag gaactgagcg gggaccagga ggccagtggg ggtggctctg 3300
gggggctcga gcccttggag ggcagcagcg tggccacccc tgggccacct gtcgagaggg 3360
cttcctgcta caactccgcg ttgggctgct gctctgatgg gaagacgccc tcgctggacg 3420
cagagggctc caactgcccc gccaccaagg tgttccaggg cgtcctggag ctggagggcg 3480
tcgagggcca ggagctgttc tacacgcccg agatggctga ccccaagtca gaactgttcg 3540
gggagacagc caggagcatt gagagcaccc tggacgacct cttccggaat tcagacgtca 3600
agaaggattt tcggagtgtc cgcttgcggg acctggggcc cggcaaatcc gtccgcgcca 3660
ttgtggatgt gcactttgac cccaccacag ccttcagggc acccgacgtg gcccgggccc 3720
tgctccggca gatccaggtg tccaggcgcc ggtccttggg ggtgaggcgg ccgctgcagg 3780
agcacgtgcg atttatggac tttgactggt ttcctgcgtt tatcacgggg gccacgtcag 3840
gagccattgc tgcggggagcc acggccagag ccaccactgc atcgcgcctg ccgtcctctg 3900
ctgtgacccc tcgggccccg caccccagtc acacaagcca gcccgttgcc aagaccacgg 3960
cagcccccac cacacgtcgg cccccacca ctgcccccag ccgtgtgccc ggacgtcggc 4020
ccccggcccc ccagcagcct ccaaagccct gtgactcaca gccctgcttc cacggggggga 4080
cctgccagga ctgggcattg ggcggggggct tcacctgcag ctgcccggca ggcaggggag 4140
gcgccgtctg tgagaaggtg cttggcgccc ctgtgccggc cttcgagggc cgctccttcc 4200
tggccttccc cactctccgc gcctaccaca cgctgcgcct ggcactggaa ttccgggcgc 4260
tggagcctca ggggctgctg ctgtacaatg gcaacgcccg gggcaaggac ttcctggcat 4320
tggcgctgct agatggccgc gtgcagctca ggtttgacac aggttcgggg ccggcggtgc 4380
tgaccagtgc cgtgccggta gagccgggcc agtggcaccg cctggagctg tcccggcact 4440
ggcgccgggg caccctctcg gtggatggtg agaccccctgt tctgggcgag agtccccagtg 4500
gcaccgacgg cctcaacctg gacacagacc tctttgtggg cggcgtaccc gaggaccagg 4560
ctgccgtggc gctggagcgg accttcgtgg gcgccggcct gagggggtgc atccgtttgc 4620
tggacgtcaa caaccagcgc ctggagcttg gcattgggcc gggggctgcc acccgaggct 4680
ctggcgtggg cgagtgcggg gaccacccct gcctgcccaa cccctgccat ggcggggccc 4740
catgccagaa cctggaggct ggaaggttcc attgccagtg cccgcccggc cgcgtcggac 4800
caacctgtgc cgatgagaag agcccctgcc agcccaaccc ctgccatggg gcggcgccct 4860
gccgtgtgct gcccgagggt ggtgctcagt gcgagtgccc cctggggcgt gagggcacct 4920
tctgccagac agcctcgggg caggacggct ctgggccctt cctggctgac ttcaacggct 4980
tctcccacct ggagctgaga ggcctgcaca cctttgcacg ggacctgggg gagaagatgg 5040
cgctggaggt cgtgttcctg gcacgaggcc ccagcggcct cctgctctac aacgggcaga 5100
agacggacgg caagggggac ttcgtgtcgc tggcactgcg ggaccgccgc ctggagttcc 5160
gctacgacct gggcaagggg gcagcggtca tcaggagcag ggagccagtc accctgggag 5220
cctggaccag ggtctcactg gagcgaaacg gccgcaaggg tgccctgcgt gtgggcgacg 5280
gcccccgtgtgttgggggag tccccgaaat cccgcaaggt tccgcacacc gtcctcaacc 5340
tgaaggagcc gctctacgta gggggcgctc ccgacttcag caagctggcc cgtgctgctg 5400
ccgtgtcctc tggcttcgac ggtgccatcc agctggtctc cctcggaggc cgccagctgc 5460
tgaccccgga gcacgtgctg cggcaggtgg acgtcacgtc ctttgcaggt cacccctgca 5520
cccgggcctc aggccacccc tgcctcaatg gggcctcctg cgtcccgagg gaggctgcct 5580
atgtgtgcct gtgtcccgggg ggattctcag gaccgcactg cgagaagggg ctggtggaga 5640
agtcagcggg ggacgtggat accttggcct ttgacgggcg gacctttgtc gagtacctca 5700
acgctgtgac cgagagcgaa ctggccaatg agatccccgt ccccgaaact ctggattccg 5760
gggcccttca cagcgagaag gcactgcaga gcaaccactt tgaactgagc ctgcgcactg 5820
aggccacgca ggggctggtg ctctggagtg gcaaggccac ggagcgggca gactatgtgg 5880
cactggccat tgtggacggg cacctgcaac tgagctacaa cctgggctcc cagcccgtgg 5940
tgctgcgttc caccgtgccc gtcaacacca accgctggtt gcgggtcgtg gcacataggg 6000
agcagaggga aggttccctg caggtgggca atgaggcccc tgtgaccggc tcctccccgc 6060
tgggcgccac gcagctggac actgatggag ccctgtggct tgggggcctg ccggagctgc 6120
ccgtgggccc agcactgccc aaggcctacg gcacaggctt tgtgggctgc ttgcgggacg 6180
tggtggtggg ccggcacccg ctgcacctgc tggaggacgc cgtcaccaag ccagagctgc 6240
ggccctgccc caccccatga gctggcacca gagccccgcg cccgctgtaa ttattttcta 6300
tttttgtaaa cttgttgctt tttgatatga ttttcttgcc tgagtgttgg ccggagggac 6360
tgctggcccg gcctcccttc cgtccaggca gccgtgctgc agacagacct agtgccgagg 6420
gatggacagg cgaggtggca gcgtggaggg ctcggcgtgg atggcagcct caggacacac 6480
acccctgcct caaggtgctg agcccccgcc ttgcactgcg cctgccccac ggtgtccccg 6540
ccgggaagca gccccggctc ctgaatcacc ctcgctccgt caggcgggac tcgtgtccca 6600
gagaggaagg ggctgctgag gtctgatggg gcccttcctc cgggtgaccc cacagggcct 6660
ttccaagccc ccatttgagc tgctccttcc tgtgtgtgct ctgggccctg cctcggcctc 6720
ctgcgccaat actgtgactt ccaaacaatg ttactgctgg gcacagctct gcgttgctcc 6780
cgtgctgcct gcgccagccc caggctgctg aggagcagag gccagaccag ggccgatctg 6840
ggtgtcctga ccctcagctg gccctgccca gccaccctgg acgtgaccgt atccctctgc 6900
cacaccccag gccctgcgag gggctatcga gaggagctca ctgtgggatg gggttgacct 6960
ctgccgcctg cctgggtatc tgggcctggc catggctgtg ttcttcatgt gttgatttta 7020
tttgacccct ggagtggtgg gtctcatctt tcccatctcg cctgagagcg gctgagggct 7080
gcctcactgc aaatcctccc cacagcgtca gtgaaagtcg tccttgtctc agaatgacca 7140
ggggccagcc agtgtctgac caaggtcaag gggcaggtgc agaggtggca gggatggctc 7200
cgaagccaga aatgccttaa actgcaacgt cccgtccctt ccccacccccc atcccatccc 7260
cacccccagc cccagcccag tcctcctagg agcaggaccc gatgaagcgg gcggcggtgg 7320
ggctgggtgc cgtgttacta actctagtat gtttctgtgt caatcgctgt gaaataaagt 7380
ctgaaaactt taaaa 7395
<![CDATA[ <210> 419]]>
<![CDATA[ <211> 67]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 419]]>
Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu
1 5 10 15
Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro
20 25 30
Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Glu Ala Asn Val Val Leu Thr
35 40 45
Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr
50 55 60
Ser Cys Lys
65
<![CDATA[ <210> 420]]>
<![CDATA[ <211> 87]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 420]]>
Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val Ala Arg Glu
1 5 10 15
Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly Phe Gly Asp
20 25 30
Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr Arg Ile Phe
35 40 45
Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His Lys Asn Glu
50 55 60
Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg Asn Leu Glu
65 70 75 80
Glu Val Glu Phe Cys Val Glu
85
<![CDATA[ <210> 421]]>
<![CDATA[ <211> 15]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 421]]>
Lys Pro Gly Thr His Phe Thr Pro Val Pro Pro Thr Pro Pro Asp
1 5 10 15
<![CDATA[ <210> 422]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 422]]>
Cys Arg Gly Met Leu Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala
1 5 10 15
Glu Gly Pro Gly Arg Ala Ser Cys Val Cys Lys Lys Ser Pro Cys Pro
20 25 30
Ser Val Val Ala Pro Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn
35 40 45
Glu Cys Glu Leu Gln Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg
50 55 60
Leu Leu Ser Arg Gly Pro Cys
65 70
<![CDATA[ <210> 423]]>
<![CDATA[ <211> 74]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 423]]>
Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly Ser Thr Cys
1 5 10 15
Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys Pro Ala Thr
20 25 30
Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp Gly Ala Asp
35 40 45
Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala Arg Gln Glu
50 55 60
Asn Val Phe Lys Lys Phe Asp Gly Pro Cys
65 70
<![CDATA[ <210> 424]]>
<![CDATA[ <211> 74]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 424]]>
Pro Cys Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn
1 5 10 15
Pro Arg Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro
20 25 30
Ala Arg Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn
35 40 45
Asp Cys Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln
50 55 60
Lys Val Arg Ser Gly Gln Cys Gln Gly Arg
65 70
<![CDATA[ <210> 425]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 425]]>
Gln Cys Pro Glu Pro Cys Arg Phe Asn Ala Val Cys Leu Ser Arg Arg
1 5 10 15
Gly Arg Pro Arg Cys Ser Cys Asp Arg Val Thr Cys Asp Gly Ala Tyr
20 25 30
Arg Pro Val Cys Ala Gln Asp Gly Arg Thr Tyr Asp Ser Asp Cys Trp
35 40 45
Arg Gln Gln Ala Glu Cys Arg Gln Gln Arg Ala Ile Pro Ser Lys His
50 55 60
Gln Gly Pro Cys
65
<![CDATA[ <210> 426]]>
<![CDATA[ <211> 72]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 426]]>
Gln Ala Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr
1 5 10 15
Cys Ala Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys
20 25 30
Ser Ser Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly
35 40 45
Ser Ala Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile
50 55 60
Gln Val Ala Arg Lys Gly Pro Cys
65 70
<![CDATA[ <210> 427]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 427]]>
Arg Cys Gly Gln Cys Arg Phe Gly Ala Leu Cys Glu Ala Glu Thr Gly
1 5 10 15
Arg Cys Val Cys Pro Ser Glu Cys Val Ala Leu Ala Gln Pro Val Cys
20 25 30
Gly Ser Asp Gly His Thr Tyr Pro Ser Glu Cys Met Leu His Val His
35 40 45
Ala Cys Thr His Gln Ile Ser Leu His Val Ala Ser Ala Gly Pro Cys
50 55 60
<![CDATA[ <210> 428]]>
<![CDATA[ <211> 66]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 428]]>
Thr Cys Gly Asp Ala Val Cys Ala Phe Gly Ala Val Cys Ser Ala Gly
1 5 10 15
Gln Cys Val Cys Pro Arg Cys Glu His Pro Pro Pro Gly Pro Val Cys
20 25 30
Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala Cys Glu Leu Arg Glu Ala
35 40 45
Ala Cys Leu Gln Gln Thr Gln Ile Glu Glu Ala Arg Ala Gly Pro Cys
50 55 60
Glu Gln
65
<![CDATA[ <210> 429]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 429]]>
Glu Cys Gly Ser Gly Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu
1 5 10 15
Gln Glu Leu Cys Arg Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu
20 25 30
Asp Gly Pro Cys Val Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser
35 40 45
Pro
<![CDATA[ <210> 430]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 430]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg
35
<![CDATA[ <210> 431]]>
<![CDATA[ <211> 1352]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 431]]>
Val Cys Gly Ser Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys
1 5 10 15
Lys Ala Arg Cys Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly
20 25 30
Ala Cys Arg Gly Pro Thr Phe Ala Pro Leu Pro Pro Val Ala Pro Leu
35 40 45
His Cys Ala Gln Thr Pro Tyr Gly Cys Cys Gln Asp Asn Ile Thr Ala
50 55 60
Ala Arg Gly Val Gly Leu Ala Gly Cys Pro Ser Ala Cys Gln Cys Asn
65 70 75 80
Pro His Gly Ser Tyr Gly Gly Thr Cys Asp Pro Ala Thr Gly Gln Cys
85 90 95
Ser Cys Arg Pro Gly Val Gly Gly Leu Arg Cys Asp Arg Cys Glu Pro
100 105 110
Gly Phe Trp Asn Phe Arg Gly Ile Val Thr Asp Gly Arg Ser Gly Cys
115 120 125
Thr Pro Cys Ser Cys Asp Pro Gln Gly Ala Val Arg Asp Asp Cys Glu
130 135 140
Gln Met Thr Gly Leu Cys Ser Cys Lys Pro Gly Val Ala Gly Pro Lys
145 150 155 160
Cys Gly Gln Cys Pro Asp Gly Arg Ala Leu Gly Pro Ala Gly Cys Glu
165 170 175
Ala Asp Ala Ser Ala Pro Ala Thr Cys Ala Glu Met Arg Cys Glu Phe
180 185 190
Gly Ala Arg Cys Val Glu Glu Ser Gly Ser Ala His Cys Val Cys Pro
195 200 205
Met Leu Thr Cys Pro Glu Ala Asn Ala Thr Lys Val Cys Gly Ser Asp
210 215 220
Gly Val Thr Tyr Gly Asn Glu Cys Gln Leu Lys Thr Ile Ala Cys Arg
225 230 235 240
Gln Gly Leu Gln Ile Ser Ile Gln Ser Leu Gly Pro Cys Gln Glu Ala
245 250 255
Val Ala Pro Ser Thr His Pro Thr Ser Ala Ser Val Thr Val Thr Thr
260 265 270
Pro Gly Leu Leu Leu Ser Gln Ala Leu Pro Ala Pro Pro Gly Ala Leu
275 280 285
Pro Leu Ala Pro Ser Ser Thr Ala His Ser Gln Thr Thr Pro Pro Pro
290 295 300
Ser Ser Arg Pro Arg Thr Thr Ala Ser Val Pro Arg Thr Thr Val Trp
305 310 315 320
Pro Val Leu Thr Val Pro Pro Thr Ala Pro Ser Pro Ala Pro Ser Leu
325 330 335
Val Ala Ser Ala Phe Gly Glu Ser Gly Ser Thr Asp Gly Ser Ser Asp
340 345 350
Glu Glu Leu Ser Gly Asp Gln Glu Ala Ser Gly Gly Gly Ser Gly Gly
355 360 365
Leu Glu Pro Leu Glu Gly Ser Ser Val Ala Thr Pro Gly Pro Pro Val
370 375 380
Glu Arg Ala Ser Cys Tyr Asn Ser Ala Leu Gly Cys Cys Ser Asp Gly
385 390 395 400
Lys Thr Pro Ser Leu Asp Ala Glu Gly Ser Asn Cys Pro Ala Thr Lys
405 410 415
Val Phe Gln Gly Val Leu Glu Leu Glu Gly Val Glu Gly Gln Glu Leu
420 425 430
Phe Tyr Thr Pro Glu Met Ala Asp Pro Lys Ser Glu Leu Phe Gly Glu
435 440 445
Thr Ala Arg Ser Ile Glu Ser Thr Leu Asp Asp Leu Phe Arg Asn Ser
450 455 460
Asp Val Lys Lys Asp Phe Arg Ser Val Arg Leu Arg Asp Leu Gly Pro
465 470 475 480
Gly Lys Ser Val Arg Ala Ile Val Asp Val His Phe Asp Pro Thr Thr
485 490 495
Ala Phe Arg Ala Pro Asp Val Ala Arg Ala Leu Leu Arg Gln Ile Gln
500 505 510
Val Ser Arg Arg Arg Ser Leu Gly Val Arg Arg Pro Leu Gln Glu His
515 520 525
Val Arg Phe Met Asp Phe Asp Trp Phe Pro Ala Phe Ile Thr Gly Ala
530 535 540
Thr Ser Gly Ala Ile Ala Ala Gly Ala Thr Ala Arg Ala Thr Thr Ala
545 550 555 560
Ser Arg Leu Pro Ser Ser Ala Val Thr Pro Arg Ala Pro His Pro Ser
565 570 575
His Thr Ser Gln Pro Val Ala Lys Thr Thr Ala Ala Pro Thr Thr Arg
580 585 590
Arg Pro Pro Thr Thr Ala Pro Ser Arg Val Pro Gly Arg Arg Pro Pro
595 600 605
Ala Pro Gln Gln Pro Pro Lys Pro Cys Asp Ser Gln Pro Cys Phe His
610 615 620
Gly Gly Thr Cys Gln Asp Trp Ala Leu Gly Gly Gly Phe Thr Cys Ser
625 630 635 640
Cys Pro Ala Gly Arg Gly Gly Ala Val Cys Glu Lys Val Leu Gly Ala
645 650 655
Pro Val Pro Ala Phe Glu Gly Arg Ser Phe Leu Ala Phe Pro Thr Leu
660 665 670
Arg Ala Tyr His Thr Leu Arg Leu Ala Leu Glu Phe Arg Ala Leu Glu
675 680 685
Pro Gln Gly Leu Leu Leu Tyr Asn Gly Asn Ala Arg Gly Lys Asp Phe
690 695 700
Leu Ala Leu Ala Leu Leu Asp Gly Arg Val Gln Leu Arg Phe Asp Thr
705 710 715 720
Gly Ser Gly Pro Ala Val Leu Thr Ser Ala Val Pro Val Glu Pro Gly
725 730 735
Gln Trp His Arg Leu Glu Leu Ser Arg His Trp Arg Arg Gly Thr Leu
740 745 750
Ser Val Asp Gly Glu Thr Pro Val Leu Gly Glu Ser Pro Ser Gly Thr
755 760 765
Asp Gly Leu Asn Leu Asp Thr Asp Leu Phe Val Gly Gly Val Pro Glu
770 775 780
Asp Gln Ala Ala Val Ala Leu Glu Arg Thr Phe Val Gly Ala Gly Leu
785 790 795 800
Arg Gly Cys Ile Arg Leu Leu Asp Val Asn Asn Gln Arg Leu Glu Leu
805 810 815
Gly Ile Gly Pro Gly Ala Ala Thr Arg Gly Ser Gly Val Gly Glu Cys
820 825 830
Gly Asp His Pro Cys Leu Pro Asn Pro Cys His Gly Gly Ala Pro Cys
835 840 845
Gln Asn Leu Glu Ala Gly Arg Phe His Cys Gln Cys Pro Pro Gly Arg
850 855 860
Val Gly Pro Thr Cys Ala Asp Glu Lys Ser Pro Cys Gln Pro Asn Pro
865 870 875 880
Cys His Gly Ala Ala Pro Cys Arg Val Leu Pro Glu Gly Gly Ala Gln
885 890 895
Cys Glu Cys Pro Leu Gly Arg Glu Gly Thr Phe Cys Gln Thr Ala Ser
900 905 910
Gly Gln Asp Gly Ser Gly Pro Phe Leu Ala Asp Phe Asn Gly Phe Ser
915 920 925
His Leu Glu Leu Arg Gly Leu His Thr Phe Ala Arg Asp Leu Gly Glu
930 935 940
Lys Met Ala Leu Glu Val Val Phe Leu Ala Arg Gly Pro Ser Gly Leu
945 950 955 960
Leu Leu Tyr Asn Gly Gln Lys Thr Asp Gly Lys Gly Asp Phe Val Ser
965 970 975
Leu Ala Leu Arg Asp Arg Arg Leu Glu Phe Arg Tyr Asp Leu Gly Lys
980 985 990
Gly Ala Ala Val Ile Arg Ser Arg Glu Pro Val Thr Leu Gly Ala Trp
995 1000 1005
Thr Arg Val Ser Leu Glu Arg Asn Gly Arg Lys Gly Ala Leu Arg
1010 1015 1020
Val Gly Asp Gly Pro Arg Val Leu Gly Glu Ser Pro Lys Ser Arg
1025 1030 1035
Lys Val Pro His Thr Val Leu Asn Leu Lys Glu Pro Leu Tyr Val
1040 1045 1050
Gly Gly Ala Pro Asp Phe Ser Lys Leu Ala Arg Ala Ala Ala Val
1055 1060 1065
Ser Ser Gly Phe Asp Gly Ala Ile Gln Leu Val Ser Leu Gly Gly
1070 1075 1080
Arg Gln Leu Leu Thr Pro Glu His Val Leu Arg Gln Val Asp Val
1085 1090 1095
Thr Ser Phe Ala Gly His Pro Cys Thr Arg Ala Ser Gly His Pro
1100 1105 1110
Cys Leu Asn Gly Ala Ser Cys Val Pro Arg Glu Ala Ala Tyr Val
1115 1120 1125
Cys Leu Cys Pro Gly Gly Phe Ser Gly Pro His Cys Glu Lys Gly
1130 1135 1140
Leu Val Glu Lys Ser Ala Gly Asp Val Asp Thr Leu Ala Phe Asp
1145 1150 1155
Gly Arg Thr Phe Val Glu Tyr Leu Asn Ala Val Thr Glu Ser Glu
1160 1165 1170
Leu Ala Asn Glu Ile Pro Val Pro Glu Thr Leu Asp Ser Gly Ala
1175 1180 1185
Leu His Ser Glu Lys Ala Leu Gln Ser Asn His Phe Glu Leu Ser
1190 1195 1200
Leu Arg Thr Glu Ala Thr Gln Gly Leu Val Leu Trp Ser Gly Lys
1205 1210 1215
Ala Thr Glu Arg Ala Asp Tyr Val Ala Leu Ala Ile Val Asp Gly
1220 1225 1230
His Leu Gln Leu Ser Tyr Asn Leu Gly Ser Gln Pro Val Val Leu
1235 1240 1245
Arg Ser Thr Val Pro Val Asn Thr Asn Arg Trp Leu Arg Val Val
1250 1255 1260
Ala His Arg Glu Gln Arg Glu Gly Ser Leu Gln Val Gly Asn Glu
1265 1270 1275
Ala Pro Val Thr Gly Ser Ser Pro Leu Gly Ala Thr Gln Leu Asp
1280 1285 1290
Thr Asp Gly Ala Leu Trp Leu Gly Gly Leu Pro Glu Leu Pro Val
1295 1300 1305
Gly Pro Ala Leu Pro Lys Ala Tyr Gly Thr Gly Phe Val Gly Cys
1310 1315 1320
Leu Arg Asp Val Val Val Gly Arg His Pro Leu His Leu Leu Glu
1325 1330 1335
Asp Ala Val Thr Lys Pro Glu Leu Arg Pro Cys Pro Thr Pro
1340 1345 1350
<![CDATA[ <210> 432]]>
<![CDATA[ <211> 2068]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 432]]> Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu 1 5 10 15 Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro 20 25 30 Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Ala Asn Val Leu Thr 35 40 45 Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr 50 55 60 Ser Cys Lys Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val 65 70 75 80 Ala Arg Glu Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly 85 90 95 Phe Gly Asp Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr 100 105 110 Arg Ile Phe Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His 115 120 125 Lys Asn Glu Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg 130 135 140 Asn Leu Glu Glu Val Glu Phe Cys Val Glu Asp Lys Pro Gly Thr His 145 150 155 160 Phe Thr Pro Val Pro Pro Thr Pro Pro Asp Ala Cys Arg Gly Met Leu 165 170 175 Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala Glu Gly Pro Gly Arg 180 185 190 Ala Ser Cys Val Cys Lys Lys Lys Ser Pro Cys Pro Ser Val Val Ala Pro 195 200 205 Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn Glu Cys Glu Leu Gln 210 215 220 Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg Leu Leu Ser Arg Gly 225 230 235 240 Pro Cys Gly Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly 245 250 255 Ser Thr Cys Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys 260 265 270 Pro Ala Thr Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp 275 280 285 Gly Ala Asp Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala 290 295 300 Arg Gln Glu Asn Val Phe Lys Lys Phe Asp Gly Pro Cys Asp Pro Cys 305 310 315 320 Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn Pro Arg 325 330 335 Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro Ala Arg 340 345 350 Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn Asp Cys 355 360 365 Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln Lys Val 370 375 380 Arg Ser Gly Gln Cys Gln Gly Arg Asp Gln Cys Pro Glu Pro Cys Arg 385 390 395 400 Phe Asn Ala Val Cys Leu Ser Arg Arg Gly Arg Pro Arg Cys Ser Cys 405 410 415 Asp Arg Val Thr Cys Asp Gly Ala Tyr Arg Pro Val Cys Ala Gln Asp 420 425 430 Gly Arg Thr Tyr Asp Ser Asp Cys Trp Arg Gln Gln Ala Glu Cys Arg 435 440 445 Gln Gln Arg A la ile Pro Ser Lys His Gln Gly Pro Cys Asp Gln Ala 450 455 460 Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr Cys Ala 465 470 475 480 Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys Ser Ser 485 490 495 Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala 500 505 510 Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile Gln Val 515 520 525 Ala Arg Lys Gly Pro Cys Asp Arg Cys Gly Gln Cys Arg Phe Gly Ala 530 535 540 Leu Cys Glu Ala Glu Thr Gly Arg Cys Val Cys Pro Ser Glu Cys Val 545 550 555 560 Ala Leu Ala Gln Pro Val Cys Gly Ser Asp Gly His Thr Tyr Pro Ser 565 570 575 Glu Cys Met Leu His Val His Ala Cys Thr His Gln Ile Ser Leu His 580 585 590 Val Ala Ser Ala Gly Pro Cys Glu Thr Cys Gly Asp Ala Val Cys Ala 595 600 605 Phe Gly Ala Val Cys Ser Ala Gly Gln Cys Val Cys Pro Arg Cys Glu 610 615 620 His Pro Pro Pro Gly Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly 625 630 635 640 Ser Ala Cys Glu Leu Arg Glu Ala Ala Cys Leu Gln Gln Thr Gln Ile 645 650 655 Glu Glu Ala Arg Ala Gly Pro Cys Glu Gln Ala Glu Cys Gly Ser Gly 660 665 670 Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu Gln Glu Leu Cys Arg 675 680 685 Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu Asp Gly Pro Cys Val 690 695 700 Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser Pro Val Cys Gly Ser 705 710 715 720 Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys Lys Ala Arg Cys 725 730 735 Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly Ala Cys Arg Gly 740 745 750 Pro Thr Phe Ala Pro Leu Pro Pro Val Ala Pro Leu His Cys Ala Gln 755 760 765 Thr Pro Tyr Gly Cys Cys Gln Asp Asn Ile Thr Ala Ala Arg Gly Val 770 775 780 Gly Leu Ala Gly Cys Pro Ser Ala Cys Gln Cys Asn Pro His Gly Ser 785 790 795 800 Tyr Gly Gly Thr Cys Asp Pro Ala Thr Gly Gln Cys Ser Cys Arg Pro 805 810 815 Gly Val Gly Gly Leu Arg Cys Asp Arg Cys Glu Pro Gly Phe Trp Asn 820 825 830 Phe Arg Gly Ile Val Thr Asp Gly Arg Ser Gly Cys Thr Pro Cys Ser 835 840 845 Cys Asp Pro Gln Gly Ala Val Arg Asp Asp Cys Glu Gln Met Thr Gly 850 855 860 Leu Cys Ser Cys Lys Pro Gly Val Ala Gly Pro Lys Cys Gly Gln Cys 865 870 875 880 Pro Asp Gly Arg Ala Leu Gly Pro Ala Gly Cys Glu Ala Asp Ala Ser 885 890 895 Ala Pro Ala Thr Cys Ala Glu Met Arg Cys Glu Phe Gly Ala Arg Cys 900 905 910 Val Glu Glu Ser Gly Ser Ala His Cys Val Cys Pro Met Leu Thr Cys 915 920 925 Pro Glu Ala Asn Ala Thr Lys Val Cys Gly Ser Asp Gly Val Thr Tyr 930 935 940 Gly Asn Glu Cys Gln Leu Lys Thr Ile Ala Cys Arg Gln Gly Leu Gln 945 950 955 960 Ile Ser Ile Gln Ser Leu Gly Pro Cys Gln Glu Ala Val Ala Pro Ser 965 970 975 Thr His Pro Thr Ser Ala Ser Val Thr Val Thr Pro Gly Leu Leu 980 985 990 Leu Ser Gln Ala Leu Pro Ala Pro Pro Gly Ala Leu Pro Leu Ala Pro 995 1000 1005 Ser Ser Thr Ala His Ser Gln Thr Thr Pro Pro Pro Ser Ser Arg 1010 1015 1020 Pro Arg Thr Thr Ala Ser Val Pro Arg Thr Thr Val Trp Pro Val 1025 1030 1035 Leu Thr Val Pro Pro Thr Ala Pro Ser Pro Ala Pro Ser Leu Val 1040 1045 1050 Ala Ser Ala Phe Gly Glu Ser Gly Ser Thr Asp Gly Ser Ser Asp 1055 1060 1065 Glu Glu Leu Ser Gly Asp Gln Glu Ala Ser G ly Gly Gly Ser Gly 1070 1075 1080 Gly Leu Glu Pro Leu Glu Gly Ser Ser Val Ala Thr Pro Gly Pro 1085 1090 1095 Pro Val Glu Arg Ala Ser Cys Tyr Asn Ser Ala Leu Gly Cys Cys 1100 1105 1110 Ser Asp Gly Lys Thr Pro Ser Leu Asp Ala Glu Gly Ser Asn Cys 1115 1120 1125 Pro Ala Thr Lys Val Phe Gln Gly Val Leu Glu Leu Glu Gly Val 1130 1135 1140 Glu Gly Gln Glu Leu Phe Tyr Thr Pro Glu Met Ala Asp Pro Lys 1145 1150 1155 Ser Glu Leu Phe Gly Glu Thr Ala Arg Ser Ile Glu Ser Leu 1160 1165 1170 Asp Asp Leu Phe Arg Asn Ser Asp Val Lys Lys Asp Phe Arg Ser 1175 1180 1185 Val Arg Leu Arg Asp Leu Gly Pro Gly Lys Ser Val Arg Ala Ile 1190 11 95 1200 Val Asp Val His Phe Asp Pro Thr Thr Ala Phe Arg Ala Pro Asp 1205 1210 1215 Val Ala Arg Ala Leu Leu Arg Gln Ile Gln Val Ser Arg Arg Arg 1220 1225 1230 Ser Leu Gly Val Arg Arg Pro Leu Gln Glu His Val Arg Phe Met 1235 1240 1245 Asp Phe Asp Trp Phe Pro Ala Phe Ile Thr Gly Ala Thr Ser Gly 1250 1255 1260 Ala Ile Ala Ala Gly Ala Thr Ala Arg Ala Thr Ala Ser Arg 1265 1270 1275 Leu Pro Ser Ser Ala Val Thr Pro Arg Ala Pro His Pro Ser His 1280 1285 1290 Thr Ser Gln Pro Val Ala Lys Thr Thr Ala Ala Pro Thr Thr Arg 1295 1300 1305 Arg Pro Pro Thr Thr Ala Pro Ser Arg Val Pro Gly Arg Arg Pro 1310 1315 1320 Pro Ala Pro Gln Gln Pro Pro Lys Pro Cys Asp Ser Gln Pro Cys 1325 1330 1335 Phe His Gly Gly Thr Cys Gln Asp Trp Ala Leu Gly Gly Gly Phe 1340 1345 1350 Thr Cys Ser Cys Pro Ala Gly Arg Gly Gly Ala Val Cys Glu Lys 1355 1360 136 5 Val Leu Gly Ala Pro Val Pro Ala Phe Glu Gly Arg Ser Phe Leu 1370 1375 1380 Ala Phe Pro Thr Leu Arg Ala Tyr His Thr Leu Arg Leu Ala Leu 1385 1390 1395 Glu Phe Arg Ala Leu Glu Pro Gln Gly Leu Leu Leu Tyr Asn Gly 1400 1405 1410 Asn Ala Arg Gly Lys Asp Phe Leu Ala Leu Ala Leu Leu Asp Gly 1415 1420 1425 Arg Val Gln Leu Arg Phe Asp Thr Gly Ser Gly Pro Ala Val Leu 1430 1435 1440 Thr Ser Ala Val Pro Val Glu Pro Gly Gln Trp His Arg Leu Glu 1445 1450 1455 Leu Ser Arg His Trp Arg Arg Gly Thr Leu Ser Val Asp Gly Glu 1460 1465 1470 Thr Pro Val Leu Gly Glu Ser Pro Ser Pro Ser Gly Thr Asp Gly Leu Asn 1475 1480 1485 Leu Asp Thr Asp Le u Phe Val Gly Gly Val Pro Glu Asp Gln Ala 1490 1495 1500 Ala Val Ala Leu Glu Arg Thr Phe Val Gly Ala Gly Leu Arg Gly 1505 1510 1515 Cys Ile Arg Leu Leu Asp Val Asn Asn Gln Arg Leu Glu Leu Gly 1520 1525 15 30 Ile Gly Pro Gly Ala Ala THR ALG GLY VAL VAL GLY GLU CYS 1535 1540 Gly Asp His Pro Cys Leu Pro ASN PRS HIS GLY GLY ALA PRO 155555 1560 CYS GLU GLU GLY ARG P HE HIS CYS GLN CYS Pro Pro 1565 1570 1575 Gly Arg Val Gly Pro Thr Cys Ala Asp Glu Lys Ser Pro Cys Gln 1580 1585 1590 Pro Asn Pro Cys His Gly Ala Ala Pro Cys Arg Val Leu Pro Glu 1595 1600 1605 Gly Gly Ala Gln Cys Glu Cys Pro Leu Gly Arg Glu Gly Thr Phe 1610 1615 1620 Cys Gln Thr Ala Ser Gly Gln Asp Gly Ser Gly Pro Phe Leu Ala 1625 1630 1635 Asp Phe Asn Gly Phe Ser His Leu Glu Leu Arg Gly Leu His Thr 1640 1645 1650 Phe Ala Arg Asp Leu G ly Glu Lys Met Ala Leu Glu Val Val Phe 1655 1660 1665 Leu Ala Arg Gly Pro Ser Gly Leu Leu Leu Tyr Asn Gly Gln Lys 1670 1675 1680 Thr Asp Gly Lys Gly Asp Phe Val Ser Leu Ala Leu Arg Asp Arg 1685 1690 1695 Arg Le u Glu Phe Arg Tyr Asp Leu Gly Lys Gly Ala Ala Val Ile 1700 1705 1710 Arg Ser Arg Glu Pro Val Thr Leu Gly Ala Trp Thr Arg Val Ser 1715 1720 1725 Leu Glu Arg Asn Gly Arg Lys Gly Ala Leu Arg Val Gly Asp Gly 1730 1735 1740Pro Arg Val Leu Gly Glu Ser Pro Lys Ser Arg Lys Val Pro His 1745 1750 1755 Thr Val Leu Asn Leu Lys Glu Pro Leu Tyr Val Gly Gly Ala Pro 1760 1765 1770 Asp Phe Ser Lys Leu Ala Arg Ala Ala Ala Val Ser Ser Gly Phe 1775 1780 1785 Asp Gly Ala Ile Gln Leu Val Ser Leu Gly Gly Arg Gln Leu Leu 1790 1795 1800 Thr Pro Glu His Val Leu Arg Gln Val Asp Val Thr Ser Phe Ala 1805 1810 1815 Gly His Pro Cys Thr Arg Ala Ser Gly His Pro Cys Leu Asn Gly 1820 1825 1830 Ala Ser Cys Val Pro Arg Glu Ala Ala Tyr Val Cys Leu Cys Pro 1835 1840 1845 Gly Gly Phe Ser Gly Pro His Cys Glu Lys Gly Leu Val Glu Lys 1850 1855 1860 Ser Ala Gly Asp Val Asp Thr Leu Ala Phe Asp Gly Arg Thr Phe 1865 1870 1875 Val Glu Tyr Leu Asn Ala Val Thr Glu Ser Glu Leu Ala Asn Glu 1880 1885 1890 Ile Pro Val Pro Glu Thr Leu Asp Ser Gly Ala Leu His Ser Glu 1895 1900 1905 Lys Ala Leu Gln Ser Asn His Phe Glu Leu Ser Leu Arg Thr Glu 1910 1915 1920 Ala Thr Gln Gly Leu Val Leu Trp Ser Gly Lys Ala Thr Glu Arg 1925 1930 1935 Ala Asp Tyr Val Ala Leu Ala Ile Val Asp Gly His Leu Gln Leu 1 940 1945 1950 Ser Tyr Asn Leu Gly Ser Gln Pro Val Val Leu Arg Ser Thr Val 1955 1960 1965 Pro Val Asn Thr Asn Arg Trp Leu Arg Val Val Ala His Arg Glu 1970 1975 1980 Gln Arg Glu Gly Ser Leu Gln Val Gly Asn Glu Ala Pro Val Thr 1985 1990 1995 Gly Ser Pro Leu Gly Ala Thr Gln Leu Asp Thr Asp Gly Ala 2000 2005 2010 Leu Trp Leu Gly Gly Leu Pro Glu Leu Pro Val Gly Pro Ala Leu 2015 2020 2025 Pro Lys Ala Tyr Gly Thr Gly P he Val Gly Cys Leu Arg Asp Val 2030 2035 2040 Val Val Gly Arg His Pro Leu His Leu Leu Glu Asp Ala Val Thr 2045 2050 2055 Lys Pro Glu Leu Arg Pro Cys Pro Thr Pro 2060 2065 <![CDATA[ <210> 433]]>
<![CDATA[ <211> 2307]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 433]]>
agtcccgtcc ccggcgcggc ccgcgcgctc ctccgccgcc tctcgcctgc gccatggccg 60
gccggtccca cccgggcccg ctgcggccgc tgctgccgct ccttgtggtg gccgcgtgcg 120
tcctgcccgg agccggcggg acatgcccgg agcgcgcgct ggagcggcgc gaggaggagg 180
cgaacgtggt gctcaccggg acggtggagg agatcctcaa cgtggacccg gtgcagcaca 240
cgtactcctg caaggttcgg gtctggcggt acttgaaggg caaagacctg gtggcccggg 300
agagcctgct ggacggcggc aacaaggtgg tgatcagcgg ctttggagac cccctcatct 360
gtgacaacca ggtgtccact ggggaccacca ggatcttctt tgtgaaccct gcacccccat 420
acctgtggcc agcccacaag aacgagctga tgctcaactc cagcctcatg cggatcaccc 480
tgcggaacct gaggaggtg gagttctgtg tggaagataa acccgggacc cacttcactc 540
cagtgcctcc gacgcctcct gatgcgtgcc ggggaatgct gtgcggcttc ggcgccgtgt 600
gcgagcccaa cgcggagggg ccgggccggg cgtcctgcgt ctgcaagaag agcccgtgcc 660
ccagcgtggt ggcgcctgtg tgtgggtcgg acgcctccac ctacagcaac gaatgcgagc 720
tgcagcgggc gcagtgcagc cagcagcgcc gcatccgcct gctcagccgc gggccgtgcg 780
gctcgcggga cccctgctcc aacgtgacct gcagcttcgg cagcacctgt gcgcgctcgg 840
ccgacgggct gacggcctcg tgcctgtgcc ccgcgacctg ccgtggcgcc cccgaggggga 900
ccgtctgcgg cagcgacggc gccgactacc ccggcgagtg ccagctcctg cgccgcgcct 960
gcgcccgcca ggagaatgtc ttcaagaagt tcgacggccc ttgtgacccc tgtcagggcg 1020
ccctccctga cccgagccgc agctgccgtg tgaacccgcg cacgcggcgc cctgagatgc 1080
tcctacggcc cgagagctgc cctgcccggc aggcgccagt gtgtggggac gacggagtca 1140
cctacgaaaa cgactgtgtc atgggccgat cgggggccgc ccggggtctc ctcctgcaga 1200
aagtgcgctc cggccagtgc cagggtcgag accagtgccc ggagccctgc cggttcaatg 1260
ccgtgtgcct gtcccgccgt ggccgtcccc gctgctcctg cgaccgcgtc acctgtgacg 1320
gggcctacag gcccgtgtgt gcccaggacg ggcgcacgta tgacagtgat tgctggcggc 1380
agcaggctga gtgccggcag cagcgtgcca tccccagcaa gcaccagggc ccgtgtgacc 1440
aggccccgtc cccatgcctc ggggtgcagt gtgcattgg ggcgacgtgt gctgtgaaga 1500
acgggcaggc agcgtgtgaa tgcctgcagg cgtgctcgag cctctacgat cctgtgtgcg 1560
gcagcgacgg cgtcacatac ggcagcgcgt gcgagctgga ggccacggcc tgtaccctcg 1620
ggcggggagat ccaggtggcg cgcaaaggac cctgtgaccg ctgcggggcag tgccgctttg 1680
gagccctgtg cgaggccgag accgggcgct gcgtgtgccc ctctgaatgc gtggctttgg 1740
cccagcccgt gtgtggctcc gacgggcaca cgtaccccag cgagtgcatg ctgcacgtgc 1800
acgcctgcac acaccagatc agcctgcacg tggcctcagc tggaccctgt gagacctgtg 1860
gagatgccgt gtgtgctttt ggggctgtgt gctccgcagg gcagtgtgtg tgtccccggt 1920
gtgagcaccc cccgcccggc cccgtgtgtg gcagcgacgg tgtcacctac ggcagtgcct 1980
gcgagctacg ggaagccgcc tgcctccagc agacacagat cgaggaggcc cgggcagggc 2040
cgtgcgagca ggccgagtgc ggttccggag gctctggctc tggggaggac ggtgactgtg 2100
agcaggagct gtgccggcag cgcggtggca tctgggacga ggactcggag gacgggccgt 2160
gtgtctgtga cttcagctgc cagagtgtcc caggcagccc ggtgtgcggc tcagatgggg 2220
tcacctacag caccgagtgt gagctgaaga aggccaggtg tgagtcacag cgagggctct 2280
acgtagcggc ccagggagcc tgccgag 2307
<![CDATA[ <210> 434]]>
<![CDATA[ <211> 751]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 434]]>
Met Ala Gly Arg Ser His Pro Gly Pro Leu Arg Pro Leu Leu Pro Leu
1 5 10 15
Leu Val Val Ala Ala Cys Val Leu Pro Gly Ala Gly Gly Thr Cys Pro
20 25 30
Glu Arg Ala Leu Glu Arg Arg Glu Glu Glu Glu Ala Asn Val Val Leu Thr
35 40 45
Gly Thr Val Glu Glu Ile Leu Asn Val Asp Pro Val Gln His Thr Tyr
50 55 60
Ser Cys Lys Val Arg Val Trp Arg Tyr Leu Lys Gly Lys Asp Leu Val
65 70 75 80
Ala Arg Glu Ser Leu Leu Asp Gly Gly Asn Lys Val Val Ile Ser Gly
85 90 95
Phe Gly Asp Pro Leu Ile Cys Asp Asn Gln Val Ser Thr Gly Asp Thr
100 105 110
Arg Ile Phe Phe Val Asn Pro Ala Pro Pro Tyr Leu Trp Pro Ala His
115 120 125
Lys Asn Glu Leu Met Leu Asn Ser Ser Leu Met Arg Ile Thr Leu Arg
130 135 140
Asn Leu Glu Glu Val Glu Phe Cys Val Glu Asp Lys Pro Gly Thr His
145 150 155 160
Phe Thr Pro Val Pro Pro Thr Pro Pro Asp Ala Cys Arg Gly Met Leu
165 170 175
Cys Gly Phe Gly Ala Val Cys Glu Pro Asn Ala Glu Gly Pro Gly Arg
180 185 190
Ala Ser Cys Val Cys Lys Lys Ser Pro Cys Pro Ser Val Val Ala Pro
195 200 205
Val Cys Gly Ser Asp Ala Ser Thr Tyr Ser Asn Glu Cys Glu Leu Gln
210 215 220
Arg Ala Gln Cys Ser Gln Gln Arg Arg Ile Arg Leu Leu Ser Arg Gly
225 230 235 240
Pro Cys Gly Ser Arg Asp Pro Cys Ser Asn Val Thr Cys Ser Phe Gly
245 250 255
Ser Thr Cys Ala Arg Ser Ala Asp Gly Leu Thr Ala Ser Cys Leu Cys
260 265 270
Pro Ala Thr Cys Arg Gly Ala Pro Glu Gly Thr Val Cys Gly Ser Asp
275 280 285
Gly Ala Asp Tyr Pro Gly Glu Cys Gln Leu Leu Arg Arg Ala Cys Ala
290 295 300
Arg Gln Glu Asn Val Phe Lys Lys Phe Asp Gly Pro Cys Asp Pro Cys
305 310 315 320
Gln Gly Ala Leu Pro Asp Pro Ser Arg Ser Cys Arg Val Asn Pro Arg
325 330 335
Thr Arg Arg Pro Glu Met Leu Leu Arg Pro Glu Ser Cys Pro Ala Arg
340 345 350
Gln Ala Pro Val Cys Gly Asp Asp Gly Val Thr Tyr Glu Asn Asp Cys
355 360 365
Val Met Gly Arg Ser Gly Ala Ala Arg Gly Leu Leu Leu Gln Lys Val
370 375 380
Arg Ser Gly Gln Cys Gln Gly Arg Asp Gln Cys Pro Glu Pro Cys Arg
385 390 395 400
Phe Asn Ala Val Cys Leu Ser Arg Arg Gly Arg Pro Arg Cys Ser Cys
405 410 415
Asp Arg Val Thr Cys Asp Gly Ala Tyr Arg Pro Val Cys Ala Gln Asp
420 425 430
Gly Arg Thr Tyr Asp Ser Asp Cys Trp Arg Gln Gln Ala Glu Cys Arg
435 440 445
Gln Gln Arg Ala Ile Pro Ser Lys His Gln Gly Pro Cys Asp Gln Ala
450 455 460
Pro Ser Pro Cys Leu Gly Val Gln Cys Ala Phe Gly Ala Thr Cys Ala
465 470 475 480
Val Lys Asn Gly Gln Ala Ala Cys Glu Cys Leu Gln Ala Cys Ser Ser
485 490 495
Leu Tyr Asp Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly Ser Ala
500 505 510
Cys Glu Leu Glu Ala Thr Ala Cys Thr Leu Gly Arg Glu Ile Gln Val
515 520 525
Ala Arg Lys Gly Pro Cys Asp Arg Cys Gly Gln Cys Arg Phe Gly Ala
530 535 540
Leu Cys Glu Ala Glu Thr Gly Arg Cys Val Cys Pro Ser Glu Cys Val
545 550 555 560
Ala Leu Ala Gln Pro Val Cys Gly Ser Asp Gly His Thr Tyr Pro Ser
565 570 575
Glu Cys Met Leu His Val His Ala Cys Thr His Gln Ile Ser Leu His
580 585 590
Val Ala Ser Ala Gly Pro Cys Glu Thr Cys Gly Asp Ala Val Cys Ala
595 600 605
Phe Gly Ala Val Cys Ser Ala Gly Gln Cys Val Cys Pro Arg Cys Glu
610 615 620
His Pro Pro Pro Gly Pro Val Cys Gly Ser Asp Gly Val Thr Tyr Gly
625 630 635 640
Ser Ala Cys Glu Leu Arg Glu Ala Ala Cys Leu Gln Gln Thr Gln Ile
645 650 655
Glu Glu Ala Arg Ala Gly Pro Cys Glu Gln Ala Glu Cys Gly Ser Gly
660 665 670
Gly Ser Gly Ser Gly Glu Asp Gly Asp Cys Glu Gln Glu Leu Cys Arg
675 680 685
Gln Arg Gly Gly Ile Trp Asp Glu Asp Ser Glu Asp Gly Pro Cys Val
690 695 700
Cys Asp Phe Ser Cys Gln Ser Val Pro Gly Ser Pro Val Cys Gly Ser
705 710 715 720
Asp Gly Val Thr Tyr Ser Thr Glu Cys Glu Leu Lys Lys Ala Arg Cys
725 730 735
Glu Ser Gln Arg Gly Leu Tyr Val Ala Ala Gln Gly Ala Cys Arg
740 745 750
<![CDATA[ <210> 435]]>
<![CDATA[ <211> 102]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 435]]>
taaaaggctt ctccccagat gccagagacc tgtctgctaa agaaaccaag atgctgatgg 60
ctgctggaga caaagatggg gacggcaaaa ttggggttga cg 102
<![CDATA[ <210> 436]]>
<![CDATA[ <211> 125]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 436]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgcttca tggtgaaaga cctttcaaac ccctcgagat 120
acttg 125
<![CDATA[ <210> 437]]>
<![CDATA[ <211> 227]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 437]]>
taaaaggctt ctccccagat gccagagacc tgtctgctaa agaaaccaag atgctgatgg 60
ctgctggaga caaagatggg gacggcaaaa ttggggttga cgctacatct acatccacca 120
ctgggacaag ccatcttgta aaatgtgcgg agaaggagaa aactttctgt gtgaatggag 180
gggagtgctt catggtgaaa gacctttcaa accccctcgag atacttg 227
<![CDATA[ <210> 438]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 438]]>
Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys
1 5 10 15
Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys Ile Gly Val
20 25 30
Asp Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys
35 40 45
Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met
50 55 60
Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu
65 70 75
<![CDATA[ <210> 439]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 439]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccacccgag 69
<![CDATA[ <210> 440]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 440]]>
ttgcaggatg tcgatgacag acttgctgaa cgctgaggac atcaagaagg cggtgggagc 60
ctttagcg68
<![CDATA[ <210> 441]]>
<![CDATA[ <211> 133]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 441]]>
ctaccgactc cttcgaccac aaaaagttct tccaaatggt cggcctgaag aaaaagagtg 60
cggatgatgt gaagaaggtg tttcacatgc tggacaagga caaaagtggc ttcatcgagg 120
aggatgagct ggg 133
<![CDATA[ <210> 442]]>
<![CDATA[ <211> 110]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 442]]>
attcatccta aaaggcttct ccccagatgc cagagacctg tctgctaaag aaaccaagat 60
gctgatggct gctggagaca aagatgggga cggcaaaatt ggggttgacg 110
<![CDATA[ <210> 443]]>
<![CDATA[ <211> 216]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 443]]>
aattctccac tctggtggct gaaagctaag aagcactgac tgcccctggt cttccacctc 60
tctgccctga acacccaatc tcggcccctc tcgccaccct cctgcatttc tgttcagttc 120
gtttatgtta ttttttactc ccccatcccc tgtggccctc taatgacacc attcttctgg 180
aaaatgctgg agaagcaata aaggttgtac cagtca 216
<![CDATA[ <210> 444]]>
<![CDATA[ <211> 596]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of PVALB]]>
<![CDATA[ <400> 444]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccaccccgagt tgcaggatgt cgatgacaga cttgctgaac gctgaggaca tcaagaaggc 120
ggtgggagcc tttagcgcta ccgactcctt cgaccacaaa aagttcttcc aaatggtcgg 180
cctgaagaaa aagagtgcgg atgatgtgaa gaaggtgttt cacatgctgg acaaggacaa 240
aagtggcttc atcgaggagg atgagctggg attcatccta aaaggcttct ccccagatgc 300
cagagacctg tctgctaaag aaaccaagat gctgatggct gctggagaca aagatgggga 360
cggcaaaatt ggggttgacg aattctccac tctggtggct gaaagctaag aagcactgac 420
tgcccctggt cttccacctc tctgccctga acacccaatc tcggcccctc tcgccaccct 480
cctgcatttc tgttcagttc gtttatgtta ttttttactc ccccatcccc tgtggccctc 540
taatgacacc attcttctgg aaaatgctgg agaagcaata aaggttgtac cagtca 596
<![CDATA[ <210> 445]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 445]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser
20
<![CDATA[ <210> 446]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 446]]>
Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln Met Val Gly Leu Lys
1 5 10 15
Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys
20 25 30
Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
35 40
<![CDATA[ <210> 447]]>
<![CDATA[ <211> 36]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 447]]>
Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys
1 5 10 15
Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys
20 25 30
Ile Gly Val Asp
35
<![CDATA[ <210> 448]]>
<![CDATA[ <211> 8]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 448]]>
Phe Ser Thr Leu Val Ala Glu Ser
1 5
<![CDATA[ <210> 449]]>
<![CDATA[ <211> 110]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 449]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser Ala Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln
20 25 30
Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
35 40 45
His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
50 55 60
Gly Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala
65 70 75 80
Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
85 90 95
Lys Ile Gly Val Asp Glu Phe Ser Thr Leu Val Ala Glu Ser
100 105 110
<![CDATA[ <210> 450]]>
<![CDATA[ <211> 380]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 450]]>
acttcccgac aggacttccc accagcccag cctttcagtg caggctccag ccctccaccc 60
ccaccccgagt tgcaggatgt cgatgacaga cttgctgaac gctgaggaca tcaagaaggc 120
ggtgggagcc tttagcgcta ccgactcctt cgaccacaaa aagttcttcc aaatggtcgg 180
cctgaagaaa aagagtgcgg atgatgtgaa gaaggtgttt cacatgctgg acaaggacaa 240
aagtggcttc atcgaggagg atgagctggg attcatccta aaaggcttct ccccagatgc 300
cagagacctg tctgctaaag aaaccaagat gctgatggct gctggagaca aagatgggga 360
cggcaaaatt ggggttgacg 380
<![CDATA[ <210> 451]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 451]]>
Met Ser Met Thr Asp Leu Leu Asn Ala Glu Asp Ile Lys Lys Ala Val
1 5 10 15
Gly Ala Phe Ser Ala Thr Asp Ser Phe Asp His Lys Lys Phe Phe Gln
20 25 30
Met Val Gly Leu Lys Lys Lys Ser Ala Asp Asp Val Lys Lys Val Phe
35 40 45
His Met Leu Asp Lys Asp Lys Ser Gly Phe Ile Glu Glu Asp Glu Leu
50 55 60
Gly Phe Ile Leu Lys Gly Phe Ser Pro Asp Ala Arg Asp Leu Ser Ala
65 70 75 80
Lys Glu Thr Lys Met Leu Met Ala Ala Gly Asp Lys Asp Gly Asp Gly
85 90 95
Lys Ile Gly Val Asp
100
<![CDATA[ <210> 452]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 452]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tag 93
<![CDATA[ <210> 453]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 453]]>
ctacatctac atccaccact gggacaag 28
<![CDATA[ <210> 454]]>
<![CDATA[ <211> 121]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 454]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tagctacatc tacatccacc actgggacaa 120
g 121
<![CDATA[ <210> 455]]>
<![CDATA[ <211> 39]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 455]]>
Leu Gly Ser His Cys Val Ala Gln Thr Gly Leu Glu Leu Leu Ala Ser
1 5 10 15
Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn Ala Glu Met Ile Ala Thr
20 25 30
Ser Thr Ser Thr Thr Gly Thr
35
<![CDATA[ <210> 456]]>
<![CDATA[ <211> 273]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 456]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cag 273
<![CDATA[ <210> 457]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 457]]>
agttggggtc tcactgtgtt gcccagactg gtctcgaact cttggcctca ggtgatcctc 60
ttccctcagc ttcccagaat gccgagatga tag 93
<![CDATA[ <210> 458]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 458]]>
agacggggtc tgactgtgtt acccaggctg gtcttcaact cttggcctca agtgatcctc 60
ctgccttagc ttccaagaat gctgaggtta cag 93
<![CDATA[ <210> 459]]>
<![CDATA[ <211> 429]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 459]]>
gcaccatgag ccaggacacc gaggtggata tgaaggaggt ggagctgaat gagttagagc 60
ccgagaagca gccgatgaac gcggcgtctg gggcggccat gtccctggcg ggagccgaga 120
agaatggtct ggtgaagatc aaggtggcgg aagacgaggc ggaggcggca gccgcggcta 180
agttcacggg cctgtccaag gaggagctgc tgaaggtggc aggcagcccc ggctgggtac 240
gcacccgctg ggcactgctg ctgctcttct ggctcggctg gctcggcatg cttgctggtg 300
ccgtggtcat aatcgtgcga gcgccgcgtt gtcgcgagct accggcgcag aagtggtggc 360
acacgggcgc cctctaccgc atcggcgacc ttcaggcctt ccagggccac ggcgcgggca 420
acctggcgg 429
<![CDATA[ <210> 460]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 460]]>
gtctgaaggg gcgtctcgat tacctgagct ctctgaaggt gaagggcctt gtgctgggtc 60
caattcacaa gaaccagaag gatgatgtcg ctcagactga cttgctgcag atcgacccca 120
atttggctc caaggaagat tttgacagtc tcttgcaatc ggctaaaaaa aaga 174
<![CDATA[ <210> 461]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 461]]>
gcatccgtgt cattctggac cttactccca actaccgggg tgagaactcg tggttctcca 60
ctcaggttga cactgtggcc accaaggtga ag 92
<![CDATA[ <210> 462]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 462]]>
gatgctctgg agttttggct gcaagctggc gtggatgggt tccaggttcg ggacatagag 60
aatctgaag 69
<![CDATA[ <210> 463]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 463]]>
gatgcatcct cattcttggc tgagtggcaa aatatcacca agggcttcag tgaagacag 59
<![CDATA[ <210> 464]]>
<![CDATA[ <211> 181]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 464]]>
gctcttgatt gcggggacta actcctccga ccttcagcag atcctgagcc tactcgaatc 60
caacaaagac ttgctgttga ctagctcata cctgtctgat tctggttcta ctggggagca 120
tacaaaatcc ctagtcacac agtatttgaa tgccactggc aatcgctggt gcagctggag 180
t 181
<![CDATA[ <210> 465]]>
<![CDATA[ <211> 144]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 465]]>
ttgtctcagg caaggctcct gacttccttc ttgccggctc aacttctccg actctaccag 60
ctgatgctct tcaccctgcc agggacccct gttttcagct acggggatga gattggcctg 120
gatgcagctg cccttcctgg acag 144
<![CDATA[ <210> 466]]>
<![CDATA[ <211> 84]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 466]]>
cctatggagg ctccagtcat gctgtggggat gagtccagct tccctgacat cccaggggct 60
gtaagtgcca acatgactgt gaag 84
<![CDATA[ <210> 467]]>
<![CDATA[ <211> 530]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 467]]>
ggccagagtg aagaccctgg ctccctcctt tccttgttcc ggcggctgag tgaccagcgg 60
agtaaggagc gctccctact gcatggggac ttccacgcgt tctccgctgg gcctggactc 120
ttctcctata tccgccactg ggaccagaat gagcgttttc tggtagtgct taactttggg 180
gatgtgggcc tctcggctgg actgcaggcc tccgacctgc ctgccagcgc cagcctgcca 240
gccaaggctg acctcctgct cagcacccag ccaggccgtg aggagggctc ccctcttgag 300
ctggaacgcc tgaaactgga gcctcacgaa gggctgctgc tccgcttccc ctacgcggcc 360
tgacttcagc ctgacatgga cccactaccc ttctcctttc cttcccaggc cctttggctt 420
ctgatttttc tcttttttaa aaacaaacaa acaaactgtt gcagattatg agtgaacccc 480
caaatagggt gttttctgcc ttcaaataaa agtcacccct gcatggtgaa 530
<![CDATA[ <210> 468]]>
<![CDATA[ <211> 2221]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Full mRNA polynucleotide sequence of SLC3A2]]>
<![CDATA[ <400> 468]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cagagttggg gtctcactgt gttgcccaga 300
ctggtctcga actcttggcc tcaggtgatc ctcttccctc agcttcccag aatgccgaga 360
tgatagagac gggtctgac tgtgttaccc aggctggtct tcaactcttg gcctcaagtg 420
atcctcctgc cttagcttcc aagaatgctg aggttacagg caccatgagc caggacaccg 480
aggtggatat gaaggaggtg gagctgaatg agttagagcc cgagaagcag ccgatgaacg 540
cggcgtctgg ggcggccatg tccctggcgg gagccgagaa gaatggtctg gtgaagatca 600
aggtggcgga agacgaggcg gaggcggcag ccgcggctaa gttcacgggc ctgtccaagg 660
aggagctgct gaaggtggca ggcagccccg gctgggtacg cacccgctgg gcactgctgc 720
tgctcttctg gctcggctgg ctcggcatgc ttgctggtgc cgtggtcata atcgtgcgag 780
cgccgcgttg tcgcgagcta ccggcgcaga agtggtggca cacgggcgcc ctctaccgca 840
tcggcgacct tcaggccttc cagggccacg gcgcgggcaa cctggcgggt ctgaaggggc 900
gtctcgatta cctgagctct ctgaaggtga agggccttgt gctgggtcca attcacaaga 960
accagaagga tgatgtcgct cagactgact tgctgcagat cgaccccaat tttggctcca 1020
aggaagattt tgacagtctc ttgcaatcgg ctaaaaaaaa gagcatccgt gtcattctgg 1080
accttactcc caactaccgg ggtgagaact cgtggttctc cactcaggtt gacactgtgg 1140
ccaccaaggt gaaggatgct ctggagtttt ggctgcaagc tggcgtggat gggttccagg 1200
ttcgggacat agagaatctg aaggatgcat cctcattctt ggctgagtgg caaaatatca 1260
ccaagggctt cagtgaagac aggctcttga ttgcggggac taactcctcc gaccttcagc 1320
agatcctgag cctactcgaa tccaacaaag acttgctgtt gactagctca tacctgtctg 1380
attctggttc tactggggag catacaaaat ccctagtcac acagtatttg aatgccactg 1440
gcaatcgctg gtgcagctgg agtttgtctc aggcaaggct cctgacttcc ttcttgccgg 1500
ctcaacttct ccgactctac cagctgatgc tcttcaccct gccagggacc cctgttttca 1560
gctacgggga tgagattggc ctggatgcag ctgcccttcc tggacagcct atggaggctc 1620
cagtcatgct gtgggatgag tccagcttcc ctgacatccc aggggctgta agtgccaaca 1680
tgactgtgaa gggccagagt gaagaccctg gctccctcct ttccttgttc cggcggctga 1740
gtgaccagcg gagtaaggag cgctccctac tgcatgggga cttccacgcg ttctccgctg 1800
ggcctggact cttctcctat atccgccact gggaccagaa tgagcgtttt ctggtagtgc 1860
ttaactttgg ggatgtgggc ctctcggctg gactgcaggc ctccgacctg cctgccagcg 1920
ccagcctgcc agccaaggct gacctcctgc tcagcaccca gccaggccgt gaggagggct 1980
cccctcttga gctggaacgc ctgaaactgg agcctcacga agggctgctg ctccgcttcc 2040
cctacgcggc ctgacttcag cctgacatgg acccactacc cttctccttt ccttcccagg 2100
ccctttggct tctgattttt ctctttttta aaaacaaaca aacaaactgt tgcagattat 2160
gagtgaaccc ccaaataggg tgttttctgc cttcaaataa aagtcacccc tgcatggtga 2220
a 2221
<![CDATA[ <210> 469]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 469]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser
35
<![CDATA[ <210> 470]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> ]]> PRT
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 470]]>
Leu Gly Ser His Cys Val Ala Gln Thr Gly Leu Glu Leu Leu Ala Ser
1 5 10 15
Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn Ala Glu Met Ile
20 25 30
<![CDATA[ <210> 471]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 471]]>
Thr Gly Ser Asp Cys Val Thr Gln Ala Gly Leu Gln Leu Leu Ala Ser
1 5 10 15
Ser Asp Pro Pro Ala Leu Ala Ser Lys Asn Ala Glu Val Thr
20 25 30
<![CDATA[ <210> 472]]>
<![CDATA[ <211> 142]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 472]]>
Thr Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu Val Glu Leu Asn
1 5 10 15
Glu Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala Ser Gly Ala Ala
20 25 30
Met Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val Lys Ile Lys Val
35 40 45
Ala Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Ala Lys Phe Thr Gly Leu
50 55 60
Ser Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro Gly Trp Val Arg
65 70 75 80
Thr Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly Trp Leu Gly Met
85 90 95
Leu Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro Arg Cys Arg Glu
100 105 110
Leu Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu Tyr Arg Ile Gly
115 120 125
Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn Leu Ala
130 135 140
<![CDATA[ <210> 473]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 473]]>
Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val Lys Gly Leu
1 5 10 15
Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val Ala Gln Thr
20 25 30
Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu Asp Phe Asp
35 40 45
Ser Leu Leu Gln Ser Ala Lys Lys Lys Lys
50 55
<![CDATA[ <210> 474]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 474]]>
Ile Arg Val Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser
1 5 10 15
Trp Phe Ser Thr Gln Val Asp Thr Val Ala Thr Lys Val Lys
20 25 30
<![CDATA[ <210> 475]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 475]]>
Asp Ala Leu Glu Phe Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val
1 5 10 15
Arg Asp Ile Glu Asn Leu Lys
20
<![CDATA[ <210> 476]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 476]]>
Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys Gly Phe
1 5 10 15
Ser Glu Asp
<![CDATA[ <210> 477]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 477]]>
Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp Leu Gln Gln Ile Leu Ser
1 5 10 15
Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu Thr Ser Ser Tyr Leu Ser
20 25 30
Asp Ser Gly Ser Thr Gly Glu His Thr Lys Ser Leu Val Thr Gln Tyr
35 40 45
Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser Trp Ser
50 55 60
<![CDATA[ <210> 478]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 478]]>
Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln Leu Leu
1 5 10 15
Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro Val Phe
20 25 30
Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro Gly Gln
35 40 45
<![CDATA[ <210> 479]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 479]]>
Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe Pro Asp
1 5 10 15
Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys
20 25
<![CDATA[ <210> 480]]>
<![CDATA[ <211> 120]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 480]]>
Gly Gln Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu
1 5 10 15
Ser Asp Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His
20 25 30
Ala Phe Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp
35 40 45
Gln Asn Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu
50 55 60
Ser Ala Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro
65 70 75 80
Ala Lys Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly
85 90 95
Ser Pro Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu
100 105 110
Leu Leu Arg Phe Pro Tyr Ala Ala
115 120
<![CDATA[ <210> 481]]>
<![CDATA[ <211> 630]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 481]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser Glu Leu Gly Ser His Cys Val Ala Gln Thr Gly
35 40 45
Leu Glu Leu Leu Ala Ser Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn
50 55 60
Ala Glu Met Ile Glu Thr Gly Ser Asp Cys Val Thr Gln Ala Gly Leu
65 70 75 80
Gln Leu Leu Ala Ser Ser Asp Pro Pro Ala Leu Ala Ser Lys Asn Ala
85 90 95
Glu Val Thr Gly Thr Met Ser Gln Asp Thr Glu Val Asp Met Lys Glu
100 105 110
Val Glu Leu Asn Glu Leu Glu Pro Glu Lys Gln Pro Met Asn Ala Ala
115 120 125
Ser Gly Ala Ala Met Ser Leu Ala Gly Ala Glu Lys Asn Gly Leu Val
130 135 140
Lys Ile Lys Val Ala Glu Asp Glu Ala Glu Ala Ala Ala Ala Ala Ala Lys
145 150 155 160
Phe Thr Gly Leu Ser Lys Glu Glu Leu Leu Lys Val Ala Gly Ser Pro
165 170 175
Gly Trp Val Arg Thr Arg Trp Ala Leu Leu Leu Leu Phe Trp Leu Gly
180 185 190
Trp Leu Gly Met Leu Ala Gly Ala Val Val Ile Ile Val Arg Ala Pro
195 200 205
Arg Cys Arg Glu Leu Pro Ala Gln Lys Trp Trp His Thr Gly Ala Leu
210 215 220
Tyr Arg Ile Gly Asp Leu Gln Ala Phe Gln Gly His Gly Ala Gly Asn
225 230 235 240
Leu Ala Gly Leu Lys Gly Arg Leu Asp Tyr Leu Ser Ser Leu Lys Val
245 250 255
Lys Gly Leu Val Leu Gly Pro Ile His Lys Asn Gln Lys Asp Asp Val
260 265 270
Ala Gln Thr Asp Leu Leu Gln Ile Asp Pro Asn Phe Gly Ser Lys Glu
275 280 285
Asp Phe Asp Ser Leu Leu Gln Ser Ala Lys Lys Lys Lys Ser Ile Arg Val
290 295 300
Ile Leu Asp Leu Thr Pro Asn Tyr Arg Gly Glu Asn Ser Trp Phe Ser
305 310 315 320
Thr Gln Val Asp Thr Val Ala Thr Lys Val Lys Asp Ala Leu Glu Phe
325 330 335
Trp Leu Gln Ala Gly Val Asp Gly Phe Gln Val Arg Asp Ile Glu Asn
340 345 350
Leu Lys Asp Ala Ser Ser Phe Leu Ala Glu Trp Gln Asn Ile Thr Lys
355 360 365
Gly Phe Ser Glu Asp Arg Leu Leu Ile Ala Gly Thr Asn Ser Ser Asp
370 375 380
Leu Gln Gln Ile Leu Ser Leu Leu Glu Ser Asn Lys Asp Leu Leu Leu
385 390 395 400
Thr Ser Ser Tyr Leu Ser Asp Ser Gly Ser Thr Gly Glu His Thr Lys
405 410 415
Ser Leu Val Thr Gln Tyr Leu Asn Ala Thr Gly Asn Arg Trp Cys Ser
420 425 430
Trp Ser Leu Ser Gln Ala Arg Leu Leu Thr Ser Phe Leu Pro Ala Gln
435 440 445
Leu Leu Arg Leu Tyr Gln Leu Met Leu Phe Thr Leu Pro Gly Thr Pro
450 455 460
Val Phe Ser Tyr Gly Asp Glu Ile Gly Leu Asp Ala Ala Ala Leu Pro
465 470 475 480
Gly Gln Pro Met Glu Ala Pro Val Met Leu Trp Asp Glu Ser Ser Phe
485 490 495
Pro Asp Ile Pro Gly Ala Val Ser Ala Asn Met Thr Val Lys Gly Gln
500 505 510
Ser Glu Asp Pro Gly Ser Leu Leu Ser Leu Phe Arg Arg Leu Ser Asp
515 520 525
Gln Arg Ser Lys Glu Arg Ser Leu Leu His Gly Asp Phe His Ala Phe
530 535 540
Ser Ala Gly Pro Gly Leu Phe Ser Tyr Ile Arg His Trp Asp Gln Asn
545 550 555 560
Glu Arg Phe Leu Val Val Leu Asn Phe Gly Asp Val Gly Leu Ser Ala
565 570 575
Gly Leu Gln Ala Ser Asp Leu Pro Ala Ser Ala Ser Leu Pro Ala Lys
580 585 590
Ala Asp Leu Leu Leu Ser Thr Gln Pro Gly Arg Glu Glu Gly Ser Pro
595 600 605
Leu Glu Leu Glu Arg Leu Lys Leu Glu Pro His Glu Gly Leu Leu Leu
610 615 620
Arg Phe Pro Tyr Ala Ala
625 630
<![CDATA[ <210> 482]]>
<![CDATA[ <211> 366]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 482]]>
gcattgcggc ttggttttct cacccagtgc atgtggcagg agcggtgaga tcactgcctc 60
acggcgatcc tggactgacg gtcacgactg cctaccctct aaccctgttc tgagctgccc 120
cttgcccaca caccccaaac ctgtgtgcag gatccgcctc catggagcta cagcctcctg 180
aagcctcgat cgccgtcgtg tcgattccgc gccagttgcc tggctcacat tcggaggctg 240
gtgtccaggg tctcagcgcg ggggacgact cagagttggg gtctcactgt gttgcccaga 300
ctggtctcga actcttggcc tcaggtgatc ctcttccctc agcttcccag aatgccgaga 360
tgatag 366
<![CDATA[ <210> 483]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 483]]>
Met Glu Leu Gln Pro Pro Glu Ala Ser Ile Ala Val Val Ser Ile Pro
1 5 10 15
Arg Gln Leu Pro Gly Ser His Ser Glu Ala Gly Val Gln Gly Leu Ser
20 25 30
Ala Gly Asp Asp Ser Glu Leu Gly Ser His Cys Val Ala Gln Thr Gly
35 40 45
Leu Glu Leu Leu Ala Ser Gly Asp Pro Leu Pro Ser Ala Ser Gln Asn
50 55 60
Ala Glu Met Ile
65
<![CDATA[ <210> 484]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 484]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccag 54
<![CDATA[ <210> 485]]>
<![CDATA[ <211> 87]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 485]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[ <210> 486]]>
<![CDATA[ <211> 141]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 486]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccagctacat 60
ctacatccac cactgggaca agccatcttg taaaatgtgc ggagaaggag aaaactttct 120
gtgtgaatgg aggggagtgc t 141
<![CDATA[ <210> 487]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 487]]>
Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg
1 5 10 15
Pro Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys
20 25 30
Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
35 40 45
<![CDATA[ <210> 488]]>
<![CDATA[ <211> 207]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 488]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggag 207
<![CDATA[ <210> 489]]>
<![CDATA[ <211> 130]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 489]]>
gtctaccctg aactgcagat caccaatgtg gtagaagcca accaaccagt gaccatccag 60
aactggtgca agcggggccg caagcagtgc aagacccatc cccactttgt gattccctac 120
cgctgcttag 130
<![CDATA[ <210> 490]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 490]]>
ttggtgagtt tgtaagtgat gcccttctcg ttcctgacaa gtgcaaattc ttacaccagg 60
agaggatgga tgtttgcgaa actcatcttc actggcacac cgtcgccaaa gag 113
<![CDATA[ <210> 491]]>
<![CDATA[ <211> 194]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 491]]>
acatgcagtg agaagagtac caacttgcat gactacggca tgttgctgcc ctgcggaatt 60
gacaagttcc gaggggtaga gtttgtgtgt tgcccactgg ctgaagaaag tgacaatgtg 120
gattctgctg atgcggagga ggatgactcg gatgtctggt ggggcggagc agacacagac 180
tatgcagatgggag 194
<![CDATA[ <210> 492]]>
<![CDATA[ <211> 203]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 492]]>
tgaagacaaa gtagtagaag tagcagagga ggaagaagtg gctgaggtgg aagaagaaga 60
agccgatgat gacgaggacg atgaggatgg tgatgaggta gaggaagagg ctgaggaacc 120
ctacgaagaa gccacagaga gaaccaccag cattgccacc accaccacca ccaccacaga 180
gtctgtggaa gaggtggttc gag 203
<![CDATA[ <210> 493]]>
<![CDATA[ <211> 168]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 493]]>
aggtgtgctc tgaacaagcc gagacggggc cgtgccgagc aatgatctcc cgctggtact 60
ttgatgtgac tgaagggaag tgtgccccat tcttttacgg cggatgtggc ggcaaccgga 120
acaactttga cacagaagag tactgcatgg ccgtgtgtgg cagcgcca 168
<![CDATA[ <210> 494]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 494]]>
tgtcccaaag tttactcaag actacccagg aacctcttgc ccgagatcct gttaaac 57
<![CDATA[ <210> 495]]>
<![CDATA[ <211> 134]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 495]]>
ttcctacaac agcagccagt acccctgatg ccgttgacaa gtatctcgag acacctgggg 60
atgagaatga acatgcccat ttccagaaag ccaaagagag gcttgaggcc aagcaccgag 120
agagaatgtc ccag 134
<![CDATA[ <210> 496]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> ]]> 496
gtcatgagag aatgggaaga ggcagaacgt caagcaaaga acttgcctaa agctgataag 60
aaggcagtta tccag 75
<![CDATA[ <210> 497]]>
<![CDATA[ <211> 159]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 497]]>
catttccagg agaaagtgga atctttggaa caggaagcag ccaacgagag acagcagctg 60
gtggagacac acatggccag agtggaagcc atgctcaatg accgccgccg cctggccctg 120
gagaactaca tcaccgctct gcaggctgtt cctcctcgg 159
<![CDATA[ <210> 498]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 498]]>
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 60
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 120
cggtcccag 129
<![CDATA[ <210> 499]]>
<![CDATA[ <211> 100]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 499]]>
gttatgacac acctccgtgt gatttatgag cgcatgaatc agtctctctc cctgctctac 60
aacgtgcctg cagtggccga ggagattcag gatgaagttg 100
<![CDATA[ <210> 500]]>
<![CDATA[ <211> 222]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <21]]>3> Human]]>
<br/>
<br/> <![CDATA[ <400>500]]>
<br/> <![CDATA[atgagctgct tcagaaagag caaaactatt cagatgacgt cttggccaac atgattagtg 60
aaccaaggat cagttacgga aacgatgctc tcatgccatc tttgaccgaa acgaaaacca 120
ccgtggagct ccttcccgtg aatggagagt tcagcctgga cgatctccag ccgtggcatt 180
cttttggggc tgactctgtg ccagccaaca cagaaaacga ag 222
<![CDATA[ <210> 501]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 501]]>
ttgagcctgt tgatgcccgc cctgctgccg accgaggact gaccactcga ccag 54
<![CDATA[ <210> 502]]>
<![CDATA[ <211> 101]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 502]]>
gttctgggtt gacaaatatc aagacggagg agatctctga agtgaagatg gatgcagaat 60
tccgacatga ctcaggatat gaagttcatc atcaaaaatt g 101
<![CDATA[ <210> 503]]>
<![CDATA[ <211> 147]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 503]]>
gtgttctttg cagaagatgt gggttcaaac aaaggtgcaa tcattggact catggtgggc 60
ggtgttgtca tagcgacagt gatcgtcatc accttggtga tgctgaagaa gaaacagtac 120
acatccattc atcatggtgt ggtggag 147
<![CDATA[ <210> 504]]>
<![CDATA[ <211> 1222]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 504]]>
gttgacgccg ctgtcacccc agaggagcgc cacctgtcca agatgcagca gaacggctac 60
gaaaatccaa cctacaagtt ctttgagcag atgcagaact agaccccccgc cacagcagcc 120
tctgaagttg gacagcaaaa ccattgcttc actacccatc ggtgtccatt tatagaataa 180
tgtgggaaga aacaaacccg ttttatgatt tactcattat cgccttttga cagctgtgct 240
gtaacacaag tagatgcctg aacttgaatt aatccacaca tcagtaatgt attctatctc 300
tctttacatt ttggtctcta tactacatta ttaatgggtt ttgtgtactg taaagaattt 360
agctgtatca aactagtgca tgaatagatt ctctcctgat tattatcac atagcccctt 420
agccagttgt atattattct tgtggtttgt gacccaatta agtcctactt tacatatgct 480
ttaagaatcg atgggggatg cttcatgtga acgtgggagt tcagctgctt ctcttgccta 540
agtattcctt tcctgatcac tatgcatttt aaagttaaac atttttaagt atttcagatg 600
ctttagagag attttttttc catgactgca ttttactgta cagattgctg cttctgctat 660
atttgtgata taggaattaa gaggatacac acgtttgttt cttcgtgcct gttttatgtg 720
cacacattag gcattgagac ttcaagcttt tctttttttg tccacgtatc tttgggtctt 780
tgataaagaa aagaatccct gttcattgta agcactttta cggggcgggt ggggaggggt 840
gctctgctgg tcttcaatta ccaagaattc tccaaaacaa ttttctgcag gatgattgta 900
cagaatcatt gcttatgaca tgatcgcttt ctacactgta ttacataaat aaattaaata 960
aaataacccc gggcaagact tttctttgaa ggatgactac agacattaaa taatcgaagt 1020
aattttgggt ggggagaaga ggcagattca attttcttta accagtctga agtttcattt 1080
atgatacaaa agaagatgaa aatggaagtg gcaatataag gggatgagga aggcatgcct 1140
ggacaaaccc ttcttttaag atgtgtcttc aatttgtata aaatggtgtt ttcatgtaaa 1200
taaatacatt cttggaggag ca 1222
<![CDATA[ <210> 505]]>
<![CDATA[ <211> 3415]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of APP]]>
<![CDATA[ <400> 505]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggaggtc taccctgaac tgcagatcac caatgtggta 240
gaagccaacc aaccagtgac catccagaac tggtgcaagc ggggccgcaa gcagtgcaag 300
acccatcccc actttgtgat tccctaccgc tgcttagttg gtgagtttgt aagtgatgcc 360
cttctcgttc ctgacaagtg caaattctta caccaggaga ggatggatgt ttgcgaaact 420
catcttcact ggcacaccgt cgccaaagag acatgcagtg agaagagtac caacttgcat 480
gactacggca tgttgctgcc ctgcggaatt gacaagttcc gaggggtaga gtttgtgtgt 540
tgcccactgg ctgaagaaag tgacaatgtg gattctgctg atgcggagga ggatgactcg 600
gatgtctggt ggggcggagc agacacagac tatgcagatg ggagtgaaga caaagtagta 660
gaagtagcag aggaggaaga agtggctgag gtggaagaag aagaagccga tgatgacgag 720
gacgatgagg atggtgatga ggtagaggaa gaggctgagg aaccctacga agaagccaca 780
gagagaacca ccagcattgc caccaccacc accacccacca cagagtctgt ggaagaggtg 840
gttcgagagg tgtgctctga acaagccgag acggggccgt gccgagcaat gatctcccgc 900
tggtactttg atgtgactga agggaagtgt gccccattct tttacggcgg atgtggcggc 960
aaccggaaca actttgacac agaagagtac tgcatggccg tgtgtggcag cgccatgtcc 1020
caaagtttac tcaagactac ccaggaacct cttgcccgag atcctgttaa acttcctaca 1080
acagcagcca gtacccctga tgccgttgac aagtatctcg agaacacctgg ggatgagaat 1140
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagaatg 1200
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1260
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 1320
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1380
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1440
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1500
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1560
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1620
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1680
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1740
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1800
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1860
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1920
gaccgaggac tgaccactcg accaggttct gggttgacaa atatcaagac ggaggagatc 1980
tctgaagtga agatggatgc agaattccga catgactcag gatatgaagt tcatcatcaa 2040
aaattggtgt tctttgcaga agatgtgggt tcaaacaaag gtgcaatcat tggactcatg 2100
gtgggcggtg ttgtcatagc gacagtgatc gtcatcacct tggtgatgct gaagaagaaa 2160
cagtacacat ccattcatca tggtgtggtg gaggttgacg ccgctgtcac cccagaggag 2220
cgccacctgt ccaagatgca gcagaacggc tacgaaaatc caacctacaa gttctttgag 2280
cagatgcaga actagacccc cgccacagca gcctctgaag ttggacagca aaaccattgc 2340
ttcactaccc atcggtgtcc atttatagaa taatgtggga agaaacaaac ccgttttatg 2400
atttactcat tatcgccttt tgacagctgt gctgtaacac aagtagatgc ctgaacttga 2460
attaatccac acatcagtaa tgtattctat ctctctttac attttggtct ctatactaca 2520
ttattaatgg gttttgtgta ctgtaaagaa tttagctgta tcaaactagt gcatgaatag 2580
attctctcct gattatttt cacatagccc cttagccagt tgtatattat tcttgtggtt 2640
tgtgacccaa ttaagtccta ctttacatat gctttaagaa tcgatggggg atgcttcatg 2700
tgaacgtggg agttcagctg cttctcttgc ctaagtattc ctttcctgat cactatgcat 2760
tttaaagtta aacattttta agtatttcag atgctttaga gagattttttttccatgact 2820
gcattttact gtacagattg ctgcttctgc tatatttgtg atatagggaat taagaggata 2880
cacacgtttg tttcttcgtg cctgttttat gtgcacacat taggcattga gacttcaagc 2940
ttttcttttt ttgtccacgt atctttgggt ctttgataaa gaaaagaatc cctgttcatt 3000
gtaagcactt ttacggggcg ggtggggagg ggtgctctgc tggtcttcaa ttaccaagaa 3060
ttctccaaaa caattttctg caggatgatt gtacagaatc attgcttatg acatgatcgc 3120
tttctacact gtattacata aataaattaa ataaaataac cccgggcaag acttttcttt 3180
gaaggatgac tacagacatt aaataatcga agtaattttg ggtggggaga agaggcagat 3240
tcaattttct ttaaccagtc tgaagtttca tttatgatac aaaagaagat gaaaatggaa 3300
gtggcaatat aaggggatga ggaaggcatg cctggacaaa cccttctttt aagatgtgtc 3360
ttcaatttgt ataaaatggt gttttcatgt aaataaatac attcttggag gagca 3415
<![CDATA[ <210> 506]]>
<![CDATA[ <211> 19]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 506]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu
<![CDATA[ <21]]>0> 507]]>
<br/> <![CDATA[ <211>43]]>
<br/> <![CDATA[ <212>PRT]]>
<br/> <![CDATA[ <213>Human]]>
<br/>
<br/> <![CDATA[ <400>507]]>
<br/>
<br/> <![CDATA[Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala Asn Gln Pro
1 5 10 15
Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln Cys Lys Thr
20 25 30
His Pro His Phe Val Ile Pro Tyr Arg Cys Leu
35 40
<![CDATA[ <210> 508]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 508]]>
Gly Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe
1 5 10 15
Leu His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His
20 25 30
Thr Val Ala Lys Glu
35
<![CDATA[ <210> 509]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 509]]>
Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu
1 5 10 15
Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro
20 25 30
Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp
35 40 45
Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly
50 55 60
<![CDATA[ <210> 510]]>
<![CDATA[ <211> 67]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 510]]>
Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val
1 5 10 15
Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu
20 25 30
Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr
35 40 45
Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu
50 55 60
Val Val Arg
65
<![CDATA[ <210> 511]]>
<![CDATA[ <211> 55]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 511]]>
Val Cys Ser Glu Gln Ala Glu Thr Gly Pro Cys Arg Ala Met Ile Ser
1 5 10 15
Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe Tyr
20 25 30
Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr Cys
35 40 45
Met Ala Val Cys Gly Ser Ala
50 55
<![CDATA[ <210> 512]]>
<![CDATA[ <211> 18]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 512]]>
Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp Pro
1 5 10 15
Val Lys
<![CDATA[ <210> 513]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 513]]>
Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu
1 5 10 15
Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln Lys Ala Lys Glu
20 25 30
Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
35 40
<![CDATA[ <210> 514]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 514]]>
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
1 5 10 15
Lys Ala Asp Lys Lys Ala Val Ile Gln
20 25
<![CDATA[ <210> 515]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 515]]>
His Phe Gln Glu Lys Val Glu Ser Leu Glu Gln Glu Ala Ala Asn Glu
1 5 10 15
Arg Gln Gln Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu
20 25 30
Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr Ile Thr Ala Leu Gln
35 40 45
Ala Val Pro Pro Arg
50
<![CDATA[ <210> 516]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 516]]>
Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln
1 5 10 15
Lys Asp Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val
20 25 30
Asp Pro Lys Lys Ala Ala Gln Ile Arg Ser Gln
35 40
<![CDATA[ <210> 517]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 517]]>
Val Met Thr His Leu Arg Val Ile Tyr Glu Arg Met Asn Gln Ser Leu
1 5 10 15
Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu Ile Gln Asp Glu
20 25 30
Val
<![CDATA[ <210> 518]]>
<![CDATA[ <211> 73]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 518]]>
Glu Leu Leu Gln Lys Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn
1 5 10 15
Met Ile Ser Glu Pro Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro
20 25 30
Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly
35 40 45
Glu Phe Ser Leu Asp Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp
50 55 60
Ser Val Pro Ala Asn Thr Glu Asn Glu
65 70
<![CDATA[ <210> 519]]>
<![CDATA[ <211> 17]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 519]]>
Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg
1 5 10 15
Pro
<![CDATA[ <210> 520]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 520]]>
Ser Gly Leu Thr Asn Ile Lys Thr Glu Glu Ile Ser Glu Val Lys Met
1 5 10 15
Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gln Lys
20 25 30
Leu
<![CDATA[ <210> 521]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 521]]>
Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala Ile Ile Gly
1 5 10 15
Leu Met Val Gly Gly Val Val Ile Ala Thr Val Ile Val Ile Thr Leu
20 25 30
Val Met Leu Lys Lys Lys Lys Gln Tyr Thr Ser Ile His His Gly Val Val
35 40 45
Glu
<![CDATA[ <210> 522]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 522]]>
Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gln
1 5 10 15
Gln Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gln Met Gln
20 25 30
Asn
<![CDATA[ <210> 523]]>
<![CDATA[ <211> 714]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 523]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala
20 25 30
Asn Gln Pro Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln
35 40 45
Cys Lys Thr His Pro His Phe Val Ile Pro Tyr Arg Cys Leu Val Gly
50 55 60
Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe Leu
65 70 75 80
His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His Thr
85 90 95
Val Ala Lys Glu Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr
100 105 110
Gly Met Leu Leu Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe
115 120 125
Val Cys Cys Pro Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp
130 135 140
Ala Glu Glu Asp Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp
145 150 155 160
Tyr Ala Asp Gly Ser Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu Glu
165 170 175
Glu Val Ala Glu Val Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp
180 185 190
Glu Asp Gly Asp Glu Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu
195 200 205
Ala Thr Thr Glu Arg Thr Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr Thr Thr
210 215 220
Glu Ser Val Glu Glu Val Val Arg Glu Val Cys Ser Glu Gln Ala Glu
225 230 235 240
Thr Gly Pro Cys Arg Ala Met Ile Ser Arg Trp Tyr Phe Asp Val Thr
245 250 255
Glu Gly Lys Cys Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg
260 265 270
Asn Asn Phe Asp Thr Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala
275 280 285
Met Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp
290 295 300
Pro Val Lys Leu Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp
305 310 315 320
Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln
325 330 335
Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
340 345 350
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
355 360 365
Lys Ala Asp Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu
370 375 380
Ser Leu Glu Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr
385 390 395 400
His Met Ala Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala
405 410 415
Leu Glu Asn Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Pro Arg Pro Arg
420 425 430
His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp
435 440 445
Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro
450 455 460
Lys Lys Ala Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val
465 470 475 480
Ile Tyr Glu Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro
485 490 495
Ala Val Ala Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys
500 505 510
Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro
515 520 525
Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr
530 535 540
Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp
545 550 555 560
Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn
565 570 575
Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg
580 585 590
Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr Asn Ile Lys Thr Glu
595 600 605
Glu Ile Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly
610 615 620
Tyr Glu Val His His Gln Lys Leu Val Phe Phe Ala Glu Asp Val Gly
625 630 635 640
Ser Asn Lys Gly Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile
645 650 655
Ala Thr Val Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr
660 665 670
Thr Ser Ile His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro
675 680 685
Glu Glu Arg His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu Asn Pro
690 695 700
Thr Tyr Lys Phe Phe Glu Gln Met Gln Asn
705 710
<![CDATA[ <210> 524]]>
<![CDATA[ <211> 1945]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 524]]>
gtcagtttcc tcggcagcgg taggcgagag cacgcggagg agcgtgcgcg ggggccccgg 60
gagacggcgg cggtggcggc gcgggcagag caaggacgcg gcggatccca ctcgcacagc 120
agcgcactcg gtgccccgcg cagggtcgcg atgctgcccg gtttggcact gctcctgctg 180
gccgcctgga cggctcgggc gctggaggtc taccctgaac tgcagatcac caatgtggta 240
gaagccaacc aaccagtgac catccagaac tggtgcaagc ggggccgcaa gcagtgcaag 300
acccatcccc actttgtgat tccctaccgc tgcttagttg gtgagtttgt aagtgatgcc 360
cttctcgttc ctgacaagtg caaattctta caccaggaga ggatggatgt ttgcgaaact 420
catcttcact ggcacaccgt cgccaaagag acatgcagtg agaagagtac caacttgcat 480
gactacggca tgttgctgcc ctgcggaatt gacaagttcc gaggggtaga gtttgtgtgt 540
tgcccactgg ctgaagaaag tgacaatgtg gattctgctg atgcggagga ggatgactcg 600
gatgtctggt ggggcggagc agacacagac tatgcagatg ggagtgaaga caaagtagta 660
gaagtagcag aggaggaaga agtggctgag gtggaagaag aagaagccga tgatgacgag 720
gacgatgagg atggtgatga ggtagaggaa gaggctgagg aaccctacga agaagccaca 780
gagagaacca ccagcattgc caccaccacc accacccacca cagagtctgt ggaagaggtg 840
gttcgagagg tgtgctctga acaagccgag acggggccgt gccgagcaat gatctcccgc 900
tggtactttg atgtgactga agggaagtgt gccccattct tttacggcgg atgtggcggc 960
aaccggaaca actttgacac agaagagtac tgcatggccg tgtgtggcag cgccatgtcc 1020
caaagtttac tcaagactac ccaggaacct cttgcccgag atcctgttaa acttcctaca 1080
acagcagcca gtacccctga tgccgttgac aagtatctcg agaacacctgg ggatgagaat 1140
gaacatgccc atttccagaa agccaaagag aggcttgagg ccaagcaccg agagaatg 1200
tcccaggtca tgagagaatg ggaagaggca gaacgtcaag caaagaactt gcctaaagct 1260
gataagaagg cagttatcca gcatttccag gagaaagtgg aatctttgga acaggaagca 1320
gccaacgaga gacagcagct ggtggagaca cacatggcca gagtggaagc catgctcaat 1380
gaccgccgcc gcctggccct ggagaactac atcaccgctc tgcaggctgt tcctcctcgg 1440
cctcgtcacg tgttcaatat gctaaagaag tatgtccgcg cagaacagaa ggacagacag 1500
cacaccctaa agcatttcga gcatgtgcgc atggtggatc ccaagaaagc cgctcagatc 1560
cggtcccagg ttatgacaca cctccgtgtg atttatgagc gcatgaatca gtctctctcc 1620
ctgctctaca acgtgcctgc agtggccgag gagattcagg atgaagttga tgagctgctt 1680
cagaaagagc aaaactattc agatgacgtc ttggccaaca tgattagtga accaaggatc 1740
agttacggaa acgatgctct catgccatct ttgaccgaaa cgaaaaccac cgtggagctc 1800
cttcccgtga atggagagtt cagcctggac gatctccagc cgtggcattc ttttggggct 1860
gactctgtgc cagccaacac agaaaacgaa gttgagcctg ttgatgcccg ccctgctgcc 1920
gaccgaggac tgaccactcg accag 1945
<![CDATA[ <210> 525]]>
<![CDATA[ <211> 598]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 525]]>
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg
1 5 10 15
Ala Leu Glu Val Tyr Pro Glu Leu Gln Ile Thr Asn Val Val Glu Ala
20 25 30
Asn Gln Pro Val Thr Ile Gln Asn Trp Cys Lys Arg Gly Arg Lys Gln
35 40 45
Cys Lys Thr His Pro His Phe Val Ile Pro Tyr Arg Cys Leu Val Gly
50 55 60
Glu Phe Val Ser Asp Ala Leu Leu Val Pro Asp Lys Cys Lys Phe Leu
65 70 75 80
His Gln Glu Arg Met Asp Val Cys Glu Thr His Leu His Trp His Thr
85 90 95
Val Ala Lys Glu Thr Cys Ser Glu Lys Ser Thr Asn Leu His Asp Tyr
100 105 110
Gly Met Leu Leu Pro Cys Gly Ile Asp Lys Phe Arg Gly Val Glu Phe
115 120 125
Val Cys Cys Pro Leu Ala Glu Glu Ser Asp Asn Val Asp Ser Ala Asp
130 135 140
Ala Glu Glu Asp Asp Ser Asp Val Trp Trp Gly Gly Ala Asp Thr Asp
145 150 155 160
Tyr Ala Asp Gly Ser Glu Asp Lys Val Val Glu Val Ala Glu Glu Glu Glu
165 170 175
Glu Val Ala Glu Val Glu Glu Glu Glu Ala Asp Asp Asp Glu Asp Asp
180 185 190
Glu Asp Gly Asp Glu Val Glu Glu Glu Ala Glu Glu Pro Tyr Glu Glu
195 200 205
Ala Thr Thr Glu Arg Thr Thr Ser Ile Ala Thr Thr Thr Thr Thr Thr Thr Thr Thr
210 215 220
Glu Ser Val Glu Glu Val Val Arg Glu Val Cys Ser Glu Gln Ala Glu
225 230 235 240
Thr Gly Pro Cys Arg Ala Met Ile Ser Arg Trp Tyr Phe Asp Val Thr
245 250 255
Glu Gly Lys Cys Ala Pro Phe Phe Tyr Gly Gly Cys Gly Gly Asn Arg
260 265 270
Asn Asn Phe Asp Thr Glu Glu Tyr Cys Met Ala Val Cys Gly Ser Ala
275 280 285
Met Ser Gln Ser Leu Leu Lys Thr Thr Gln Glu Pro Leu Ala Arg Asp
290 295 300
Pro Val Lys Leu Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp
305 310 315 320
Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gln
325 330 335
Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gln
340 345 350
Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gln Ala Lys Asn Leu Pro
355 360 365
Lys Ala Asp Lys Lys Ala Val Ile Gln His Phe Gln Glu Lys Val Glu
370 375 380
Ser Leu Glu Gln Glu Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr
385 390 395 400
His Met Ala Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala
405 410 415
Leu Glu Asn Tyr Ile Thr Ala Leu Gln Ala Val Pro Pro Pro Arg Pro Arg
420 425 430
His Val Phe Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys Asp
435 440 445
Arg Gln His Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro
450 455 460
Lys Lys Ala Ala Gln Ile Arg Ser Gln Val Met Thr His Leu Arg Val
465 470 475 480
Ile Tyr Glu Arg Met Asn Gln Ser Leu Ser Leu Leu Tyr Asn Val Pro
485 490 495
Ala Val Ala Glu Glu Ile Gln Asp Glu Val Asp Glu Leu Leu Gln Lys
500 505 510
Glu Gln Asn Tyr Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro
515 520 525
Arg Ile Ser Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr
530 535 540
Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp
545 550 555 560
Asp Leu Gln Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn
565 570 575
Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg
580 585 590
Gly Leu Thr Thr Arg Pro
595
<![CDATA[ <210> 526]]>
<![CDATA[ <211> 96]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 526]]>
aagctggctg cccccttgat ttggagcgag caggcctgac tccagaggtt cagaagatta 60
ttgctgatgt tatccgaaac cctcccgtca actcag 96
<![CDATA[ <210> 527]]>
<![CDATA[ <211> 87]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 527]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgt gaatggaggg gagtgct 87
<![CDATA[ <210> 528]]>
<![CDATA[ <211> 182]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 528]]>
aagctggctg cccccttgat ttggagcgag caggcctgac tccagaggtt cagaagatta 60
ttgctgatgt tatccgaaac cctcccgtca actcagctac atctacatcc accactggga 120
caagccatct tgtaaaatgt gcggagaagg agaaaacttt ctgtgtgaat ggaggggagt 180
gc 182
<![CDATA[ <210> 529]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 529]]>
Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu Thr Pro Glu Val
1 5 10 15
Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro Pro Val Asn Ser Ala
20 25 30
Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu
35 40 45
Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
50 55 60
<![CDATA[ <210> 530]]>
<![CDATA[ <211> 164]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 530]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cggg 164
<![CDATA[ <210> 531]]>
<![CDATA[ <211> 172]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 531]]>
ttttctttca gatattgttt tgtatttacc catgaagaca ttgttttttg gactctgcaa 60
ataggacatt tcaaagatga gtgaaaaaaa attggaaaca actgcacagc agcggaaatg 120
tcctgaatgg atgaatgtgc agaataaaag atgtgctgta gaagaaagaa ag 172
<![CDATA[ <210> 532]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 532]]>
gcatgtgttc ggaagagtgt ttttgaagat gacctcccct tcttagaatt cactggatcc 60
attgtgtata gttacgatgc tagtgattgc tctttcctgt cagaagatat tag 113
<![CDATA[ <210> 533]]>
<![CDATA[ <211> 146]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 533]]>
catgagtcta tcagatgggg atgtggtgggg atttgacatg gagtggccac cattatacaa 60
tagagggaaa cttggcaaag ttgcactaat tcagttgtgtgtttctgaga gcaaatgtta 120
cttgttccac gtttcttcca tgtcag 146
<![CDATA[ <210> 534]]>
<![CDATA[ <211> 149]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 534]]>
tttttcccca gggattaaaa atgttgcttg aaaataaagc agttaaaaag gcaggtgtag 60
gaattgaagg agatcagtgg aaacttctac gtgactttga tatcaaattg aagaattttg 120
tggagttgac agatgttgcc aataaaaag 149
<![CDATA[ <210> 535]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 535]]>
ctgaaatgca cagagacctg gagccttaac agtctggtta aacacctctt aggtaaacag 60
ctcctgaaag acaagtctat ccgctgtagc aattggagta aatttcctct cactgaggac 120
cagaaactgt atgcagccac tgatgcttat 150
<![CDATA[ <210> 536]]>
<![CDATA[ <211> 70]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 536]]>
gctggtttta ttattaccg aaatttagag attttggatg atactgtgca aaggtttgct 60
ataaataaag 70
<![CDATA[ <210> 537]]>
<![CDATA[ <211> 115]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 537]]>
aggaagaaat cctacttagc gacatgaaca aacagttgac ttcaatctct gaggaagtga 60
tggatctggc taagcatctt cctcatgctt tcagtaaatt ggaaaaccca cggag 115
<![CDATA[ <210> 538]]>
<![CDATA[ <211> 430]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 538]]>
ggtttctatc ttactaaagg atatttcaga aaatctatat tcactgagga ggatgataat 60
tgggtctact aacattgaga ctgaactgag gcccagcaat aatttaaact tattatcctt 120
tgaagattca actactgggg gagtacaaca gaaacaaatt agagaacatg aagttttaat 180
tcacgttgaa gatgaaacat gggacccaac acttgatcat ttagctaaac atgatggaga 240
agatgtactt ggaaataaag tggaacgaaa agaagatgga tttgaagatg gagtagaaga 300
caacaaattg aaagagaata tggaaagagc ttgtttgatg tcgttagata ttacagaaca 360
tgaactccaa attttggaac agcagtctca ggaagaatat cttagtgata ttgcttataa 420
atctactgag 430
<![CDATA[ <210> 539]]>
<![CDATA[ <211> 81]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 539]]>
catttatctc ccaatgataa tgaaaacgat acgtcctatg taattgagag tgatgaagat 60
ttagaaatgg agatgcttaa g 81
<![CDATA[ <210> 540]]>
<![CDATA[ <211> 81]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 540]]>
catttatctc ccaatgataa tgaaaacgat acgtcctatg taattgagag tgatgaagat 60
ttagaaatgg agatgcttaa g 81
<![CDATA[ <210> 541]]>
<![CDATA[ <211> 145]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 541]]>
tctttagaaa acctcaatag tggcacggta gaaccaactc attctaaatg cttaaaaatg 60
gaaagaaatc tgggtcttcc tactaaagaa gaagaagaag atgatgaaaa tgaagctaat 120
gaaggggaag aagatgatga taagg 145
<![CDATA[ <210> 542]]>
<![CDATA[ <211> 76]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 542]]>
actttttgtg gccagcaccc aatgaagagc aagttacttg cctcaagatg tactttggcc 60
attccagttt taaacc 76
<![CDATA[ <210> 543]]>
<![CDATA[ <211> 68]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 543]]>
agttcagtgg aaagtgattc attcagtatt agaagaaaga agagataatg ttgctgtcat 60
ggcaactg 68
<![CDATA[ <210> 544]]>
<![CDATA[ <211> 109]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <21]]>3> Human]]>
<br/>
<br/> <![CDATA[ <400>544]]>
<br/> <![CDATA[gatatggaaa gagtttgtgc ttccagtatc cacctgttta tgtaggcaag attggccttg 60
ttatctctcc ccttatttct ctgatggaag accaagtgct acagcttaa 109
<![CDATA[ <210> 545]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 545]]>
aatgtccaac atcccagctt gcttccttgg atcagcacag tcagaaaatg ttctaacaga 60
Tattaaatt 69
<![CDATA[ <210> 546]]>
<![CDATA[ <211> 83]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 546]]>
aggtaaatac cggattgtat acgtaactcc agaatactgt tcaggtaaca tgggcctgct 60
ccagcaactt gaggctgata ttg 83
<![CDATA[ <210> 547]]>
<![CDATA[ <211> 107]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 547]]>
gtatcacgct cattgctgtg gatgaggctc actgtatttc tgagtggggg catgatttta 60
gggattcatt caggaagttg ggctccctaa aagacagcact gccaatg 107
<![CDATA[ <210> 548]]>
<![CDATA[ <211> 185]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 548]]>
gttccaatcg ttgcacttac tgctactgca agttcttcaa tccgggaaga cattgtacgt 60
tgcttaaatc tgagaaatcc tcagatcacc tgtactggtt ttgatcgacc aaacctgtat 120
ttagaagtta ggcgaaaaac agggaatatc cttcaggatc tgcagccatt tcttgtcaaa 180
acaag 185
<![CDATA[ <210> 549]]>
<![CDATA[ <211> 175]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 549]]>
ttcccactgg gaatttgaag gtccaacaat catctactgt ccttctagaa aaatgacaca 60
acaagttaca ggtgaactta ggaaactgaa tctatcctgt ggaacatacc atgcgggcat 120
gagttttagc acaaggaaag acattcatca taggtttgta agagatgaaa ttcag 175
<![CDATA[ <210> 550]]>
<![CDATA[ <211> 182]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 550]]>
tgtgtcatag ctaccatagc ttttggaatg ggcattaata aagctgacat tcgccaagtc 60
attcattacg gtgctcctaa ggacatggaa tcatattatc aggagattgg tagagctggt 120
cgtgatggac ttcaaagttc ttgtcacgtc ctctgggctc ctgcagacat taacttaaat 180
ag 182
<![CDATA[ <210> 551]]>
<![CDATA[ <211> 102]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 551]]>
gcaccttctt actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc 60
aaagatggaa aaatatcttc attctagcag atgtaggaga ca 102
<![CDATA[ <210> 552]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 552]]>
aatcatcttg tctcattttg aggacaaaca agtacaaaaa gcctccttgg gaattatggg 60
aactgaaaaa tgctgtgata attgcaggtc cag 93
<![CDATA[ <210> 553]]>
<![CDATA[ <211> 142]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 553]]>
attggatcat tgctattcca tggatgactc agaggataca tcctgggact ttggtccaca 60
agcatttaag cttttgtctg ctgtggacat cttaggcgaa aaatttggaa ttgggcttcc 120
aattttattt ctccgaggat ct 142
<![CDATA[ <210> 554]]>
<![CDATA[ <211> 171]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 554]]>
aattctcagc gtcttgccga tcaatatcgc aggcacagtttatttggcac tggcaaggat 60
caaacagaga gttggtggaa ggctttttcc cgtcagctga tcactgaggg attcttggta 120
gaagtttctc ggtataacaa atttatgaag atttgcgccc ttacgaaaaa g 171
<![CDATA[ <210> 555]]>
<![CDATA[ <211> 95]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 555]]>
ggtagaaatt ggcttcataa agctaataca gaatctcaga gcctcatcct tcaagctaat 60
gaagaattgt gtccaaagaa gttgcttctg cctag 95
<![CDATA[ <210> 556]]>
<![CDATA[ <211> 76]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 556]]>
ttcgaaaact gtatcttcgg gcaccaaaga gcattgttat aatcaagtac cagttgaatt 60
aagtacagag aagaag 76
<![CDATA[ <210> 557]]>
<![CDATA[ <211> 74]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 557]]>
tctaacttgg agaagttata ttcttataaa ccatgtgata agatttcttc tgggagtaac 60
atttctaaaaaaag 74
<![CDATA[ <210> 558]]>
<![CDATA[ <211> 76]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 558]]>
tatcatggta cagtcaccag aaaaagctta cagttcctca cagcctgtta tttcggcaca 60
agagcaggag actcag 76
<![CDATA[ <210> 559]]>
<![CDATA[ <211> 113]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 559]]>
attgtgttat atggcaaatt ggtagaagct aggcagaaac atgccaataa aatggatgtt 60
cccccagcta ttctggcaac aaacaagata ctggtggata tggccaaaat gag 113
<![CDATA[ <210> 560]]>
<![CDATA[ <211> 115]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 560]]>
accaactacg gttgaaaacg taaaaaggat tgatggtgtt tctgaaggca aagctgccat 60
gttggcccct ctgttggaag tcatcaaaca tttctgccaa acaaatagtg ttcag 115
<![CDATA[ <210> 561]]>
<![CDATA[ <211> 132]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 561]]>
acagacctct tttcaagtac aaaacctcaa gaagaacaga agacgagtct ggtagcaaaa 60
aataaaatat gcacactttc acagtctatg gccatcacat actctttatt ccaagaaaag 120
aagatgcctt tg 132
<![CDATA[ <210> 562]]>
<![CDATA[ <211> 163]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 562]]>
aagagcatag ctgagagcag gattctgcct ctcatgacaa ttggcatgca cttatcccaa 60
gcggtgaaag ctggctgccc ccttgatttg gagcgagcag gcctgactcc agaggttcag 120
aagattattg ctgatgttat ccgaaaccct cccgtcaact cag 163
<![CDATA[ <210> 563]]>
<![CDATA[ <211> 209]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 563]]>
atatgagtaa aattagccta atcagaatgt tagttcctga aaacattgac acgtacctta 60
tccacatggc aattgagatc cttaaacatg gtcctgacag cggacttcaa ccttcatgtg 120
atgtcaacaa aaggagatgt tttcccggtt ctgaagagat ctgttcaagt tctaagagaa 180
gcaaggaaga agtaggcatc aatactgag 209
<![CDATA[ <210> 564]]>
<![CDATA[ <211> 3144]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 564]]>
acttcatctg cagagagaaa gagacgatta cctgtgtggt ttgccaaagg aagtgatacc 60
agcaagaaat taatggacaa aacgaaaagg ggaggtcttt ttagttaagc tggcaattac 120
cagaacaatt atgtttcttg ctgtattata agaggatagc tatattttattctgaagag 180
taaggagtag tattttggct taaaaatcat tctaattaca aagttcactg tttattgaag 240
aactggcatc ttaaatcagc cttccgcaat tcatgtagtt tctgggtctt ctgggagcct 300
acgtgagtac atcacctaac agaatattaa attagacttc ctgtaagatt gctttaagaa 360
actgttactg tcctgttttc taatctcttt attaaaacag tgtatttgga aaatgttatg 420
tgctctgatt tgatatagat aacagattag tagttacatg gtaattatgt gatataaaat 480
attcatatat tatcaaaatt ctgttttgta aatgtaagaa agcatagtta ttttacaaat 540
tgtttttact gtcttttgaa gaagttctta aatacgttgt taaatggtat tagttgacca 600
gggcagtgaa aatgaaaccg cattttgggt gccattaaat agggaaaaaa catgtaaaaa 660
atgtaaaatg gagaccaatt gcactaggca agtgtatatt ttgtatttta tatacaattt 720
ctattatttt tcaagtaata aaacaatgtt tttcatactg aatattatat atatattttt 780
tagctttcat ttacttaatt attttaagta cctttattt tccaggatgt cagaatttga 840
ttctaatctc tcttatgtag cacatgtgac ttaatttaaa acctatactg tgacacagag 900
ttgggtaaac gatgattatt taactttaag cagttcacca tccatttcaa agcctttgat 960
tggctttttt gtaaataaaa ataacttgtt aagaaacaaa tatatctgtc atagaagaac 1020
tagaaaatcc agggaagtga gaaaaatgaa aataaaaatc attcatagtt ttactagtag 1080
ctaatcacag tcaacctctt ttgtgtatcc caccagactt ttttatattc atttgttttt 1140
agttaaaata taaaagtctc gtatattccc atttttctgc attgcattac cagaaggtag 1200
tggcgcctat taaatatgtg atatgttgtt gtccagccat ggcttctgca tttgcatgct 1260
tttgtgtgtg catctgcaat accctgtgaa tatcctgtgt gatggagtgg caagtacgca 1320
cagacacgtc tgctgcatgc ctaggtacga ggctgtctcc aggagaagca cttgtttgat 1380
tatttgagtt gccaattgaa tttgctgctt tttttcatgg cttgccattt tcactgaaaa 1440
gaatgactaa tgaaaaacga tgattggtta ttagatttgg atgtttggca gacattttct 1500
caaaattgaa ctaagttggc ctcttcacgg aaaacaactg gtatttgttg tgccaatgat 1560
aaaattggag atttctagca aaatgtataa ttttggaaaa gttgtgttcc tccactggaa 1620
gcttgacagc tttccttaac ataaagactt ctctttctct tcgctttcac tactactact 1680
actaattctt cttctgattc ttcttcttct ccttcttcct tcttccttcc ttcctcctcc 1740
tcctccttct tcttcctctt cctcttcttc tttctctctt tccttccttc ccttcccttc 1800
cccttccttc cttccttcct tccttcctcc ctccctccctccctccctcc ctccctcctt 1860
tctttttctt tctctttctt tctttctttc tctctctctc tctctttctt tctttttctt 1920
tctctttttc tttctttcaa gcagtcctcc cgcctcagtc ccccaaaata gtgggattac 1980
aggtgtgagc caccatgcac agccttacat aaagcctttt ctaatgagat ggatagtaat 2040
taacaaatgt gagtttttga tattatataa agattttttc tgtgtttcga agatccgtat 2100
aactcagtga atcagtatgt tctggatgac taatatgtga tgttaagaaa tcatgactga 2160
ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggcgga 2220
tcacgagatc aggagatcga gaccaccctg gccaacatgg tgaaaccccg tctctactaa 2280
aaatacaaaa attagctggg tgtgttggtg cgtgcctata atcccagcta ctcgggaggc 2340
tgaggcagga gaatcgcttg aactcaggag gcggagattg cagtgagctg agactgcgcc 2400
actgcacccc agcctggcga cagagcaaga ctccgtctca aaaataaaaa aagaaatcat 2460
gactgggtaa aagatctgtt cagagtacaa gatggaccaa tggatttgat atatttgaat 2520
ataacagagt atgaaaaagt ttattgatat agtttcagat tacacactgc aactaatctt 2580
taagaaacta ttacttgtcc actttttggt aaaatttcag agaacaatgt ccaccattat 2640
ctgaacaggc tattaaaata ctcttctctt ttccaactac gtgcctgtgc aaagtcagat 2700
ttttttcata tacttcagcc aaaacagcat atcaaaatgg attgaatgca gaagtagatc 2760
tgagaataca gccacttttg ttaagccaga caatgagatt tgcaaaatgt aaacaatgct 2820
gctgttctca gtttttaaaa atatgttttt taaaagtatt tatgttaatg tgtacttggt 2880
ttactactgc tatttttaaa taaaacaaga aacattttta aatgtctgtt ttaatttcta 2940
aagtggtagt gatagatata acccatatta ataaaagctc tttggggtcc tcagtgattt 3000
ttttttaaga gtatggaagg gttctcagac ctaagagatt gagaaatgct gatgtaatgt 3060
tttattataa aggtgtacca tgaattatgt accttacttc atattgttgg acattaaagt 3120
tgctttcagtttttttgttt taaa 3144
<![CDATA[ <210> 565]]>
<![CDATA[ <211> 7575]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223>WRN full mRNA polynucleotide sequence]]>
<![CDATA[ <400> 565]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cgggttttct ttcagatatt 180
gttttgtatt tacccatgaa gacattgttt tttggactct gcaaataggga catttcaaag 240
atgagtgaaa aaaaattgga aacaactgca cagcagcgga aatgtcctga atggatgaat 300
gtgcagaata aaagatgtgc tgtagaagaa agaaaggcat gtgttcggaa gagtgttttt 360
gaagatgacc tccccttctt agaattcact ggatccattg tgtatagtta cgatgctagt 420
gattgctctt tcctgtcaga agatattagc atgagtctat cagatgggga tgtggtggga 480
tttgacatgg agtggccacc attatacaat agagggaaac ttggcaaagt tgcactaatt 540
cagttgtgtg tttctgagag caaatgttac ttgttccacg tttcttccat gtcagttttt 600
ccccagggat taaaaatgtt gcttgaaaat aaagcagtta aaaaggcagg tgtaggaatt 660
gaaggagatc agtggaaact tctacgtgac tttgatatca aattgaagaa ttttgtggag 720
ttgacagatg ttgccaataa aaagctgaaa tgcacagaga cctggagcct taacagtctg 780
gttaaacacc tcttaggtaa acagctcctg aaagacaagt ctatccgctg tagcaattgg 840
agtaaatttc ctctcactga ggaccagaaa ctgtatgcag ccactgatgc ttatgctggt 900
tttattattt accgaaattt agagattttg gatgatactg tgcaaaggtt tgctataaat 960
aaagaggaag aaatcctact tagcgacatg aacaaacagt tgacttcaat ctctgaggaa 1020
gtgatggatc tggctaagca tcttcctcat gctttcagta aattggaaaa cccacggagg 1080
gtttctatct tactaaagga tatttcagaa aatctatatt cactgaggag gatgataatt 1140
gggtctacta acattgagac tgaactgagg cccagcaata atttaaactt attatccttt 1200
gaagattcaa ctactggggg agtacaacag aaacaaatta gagaacatga agttttaatt 1260
cacgttgaag atgaaacatg ggacccaaca cttgatcatt tagctaaaca tgatggagaa 1320
gatgtacttg gaaataaagt ggaacgaaaa gaagatggat ttgaagatgg agtagaagac 1380
aacaaattga aagagaatat ggaaagagct tgtttgatgt cgttagatat tacagaacat 1440
gaactccaaa ttttggaaca gcagtctcag gaagaatatc ttagtgatat tgcttataaa 1500
tctactgagc atttatctcc caatgataat gaaaacgata cgtcctatgt aattgagagt 1560
gatgaagatt tagaaatgga gatgcttaag catttatctc ccaatgataa tgaaaacgat 1620
acgtcctatg taattgagag tgatgaagat ttagaaatgg agatgcttaa gtctttagaa 1680
aacctcaata gtggcacggt agaaccaact cattctaaat gcttaaaaat ggaaagaaat 1740
ctgggtcttc ctactaaaga agaagaagaa gatgatgaaa atgaagctaa tgaaggggaa 1800
gaagatgatg ataaggactt tttgtggcca gcacccaatg aagagcaagt tacttgcctc 1860
aagatgtact ttggccattc cagttttaaa ccagttcagt ggaaagtgat tcattcagta 1920
ttagaagaaa gaagagataa tgttgctgtc atggcaactg gatatggaaa gagtttgtgc 1980
ttccagtatc cacctgttta tgtaggcaag attggccttg ttatctctcc ccttatttct 2040
ctgatggaag accaagtgct acagcttaaa atgtccaaca tcccagcttg cttccttgga 2100
tcagcacagt cagaaaatgt tctaacagat attaaattag gtaaataccg gattgtatac 2160
gtaactccag aatactgttc aggtaacatg ggcctgctcc agcaacttga ggctgatatt 2220
ggtatcacgc tcattgctgt ggatgaggct cactgtattt ctgagtgggg gcatgatttt 2280
agggattcat tcaggaagtt gggctcccta aagacagcac tgccaatggt tccaatcgtt 2340
gcacttactg ctactgcaag ttcttcaatc cgggaagaca ttgtacgttg cttaaatctg 2400
agaaatcctc agaatcacctg tactggtttt gatcgaccaa acctgtattt agaagttagg 2460
cgaaaaacag ggaatatcct tcaggatctg cagccatttc ttgtcaaaac aagttcccac 2520
tgggaatttg aaggtccaac aatcatctac tgtccttcta gaaaaatgac acaacaagtt 2580
acaggtgaac ttaggaaact gaatctatcc tgtggaacat accatgcggg catgagtttt 2640
agcacaagga aagacattca tcataggtt gtaagagatg aaattcagtg tgtcatagct 2700
accatagctt ttggaatggg cattaataaa gctgacattc gccaagtcat tcattacggt 2760
gctcctaagg acatggaatc atattatcag gagattggta gagctggtcg tgatggactt 2820
caaagttctt gtcacgtcct ctgggctcct gcagacatta acttaaatag gcaccttctt 2880
actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc aaagatggaa 2940
aaatatcttc attctagcag atgtaggaga caaatcatct tgtctcattt tgaggacaaa 3000
caagtacaaa aagcctcctt gggaattatg ggaactgaaa aatgctgtga taattgcagg 3060
tccagattgg atcattgcta ttccatggat gactcagagg atacatcctg ggactttggt 3120
ccacaagcat ttaagctttt gtctgctgtg gacatcttag gcgaaaaatt tggaattggg 3180
cttccaattt tatttctccg aggatctaat tctcagcgtc ttgccgatca atatcgcagg 3240
cacagtttat ttggcactgg caaggatcaa acagagagtt ggtggaaggc tttttcccgt 3300
cagctgatca ctgagggatt cttggtagaa gtttctcggt ataacaaatt tatgaagatt 3360
tgcgccctta cgaaaaaggg tagaaattgg cttcataaag ctaatacaga atctcagagc 3420
ctcatccttc aagctaatga agaattgtgt ccaaagaagt tgcttctgcc tagttcgaaa 3480
actgtatctt cgggcaccaa agagcattgt tataatcaag taccagttga attaagtaca 3540
gagaagaagt ctaacttgga gaagttatat tcttataaac catgtgataa gatttcttct 3600
gggagtaaca tttctaaaaa aagtatcatg gtacagtcac cagaaaaagc ttacagttcc 3660
tcacagcctg ttatttcggc acaagagcag gagactcaga ttgtgttata tggcaaattg 3720
gtagaagcta ggcagaaaca tgccaataaa atggatgttc ccccagctat tctggcaaca 3780
aacaagatac tggtggatat ggccaaaatg agaccaacta cggttgaaaa cgtaaaaagg 3840
attgatggtg tttctgaagg caaagctgcc atgttggccc ctctgttgga agtcatcaaa 3900
catttctgcc aaacaaatag tgttcagaca gacctctttt caagtacaaa acctcaagaa 3960
gaacagaaga cgagtctggt agcaaaaaat aaaatatgca cactttcaca gtctatggcc 4020
atcacatact ctttattcca agaaaagaag atgcctttga agagcatagc tgagagcagg 4080
attctgcctc tcatgacaat tggcatgcac ttatcccaag cggtgaaagc tggctgcccc 4140
cttgatttgg agcgagcagg cctgactcca gaggttcaga agattattgc tgatgttatc 4200
cgaaaccctc ccgtcaactc agatatgagt aaaattagcc taatcagaat gttagttcct 4260
gaaaacattg acacgtacct tatccacatg gcaattgaga tccttaaaca tggtcctgac 4320
agcggacttc aaccttcatg tgatgtcaac aaaaggagat gttttcccgg ttctgaagag 4380
atctgttcaa gttctaagag aagcaaggaa gaagtaggca tcaatactga gacttcatct 4440
gcagagagaa agagacgatt acctgtgtgg tttgccaaag gaagtgatac cagcaagaaa 4500
ttaatggaca aaacgaaaag gggaggtctt tttagttaag ctggcaatta ccagaacaat 4560
tatgtttctt gctgtattat aagaggatag ctatatttta tttctgaaga gtaaggagta 4620
gtattttggc ttaaaaatca ttctaattac aaagttcact gtttattgaa gaactggcat 4680
cttaaatcag ccttccgcaa ttcatgtagt ttctgggtct tctgggagcc tacgtgagta 4740
catcacctaa cagaatatta aattagactt cctgtaagat tgctttaaga aactgttact 4800
gtcctgtttt ctaatctctt tattaaaaca gtgtatttgg aaaatgttat gtgctctgat 4860
ttgatataga taacagatta gtagttacat ggtaattatg tgatataaaa tattcatata 4920
ttatcaaaat tctgttttgt aaatgtaaga aagcatagtt attttacaaa ttgtttttac 4980
tgtcttttga agaagttctt aaatacgttg ttaaatggta ttagttgacc agggcagtga 5040
aaatgaaacc gcattttggg tgccattaaa tagggaaaaa acatgtaaaa aatgtaaaat 5100
ggagaccaat tgcactaggc aagtgtatat tttgtatttt atatacaatt tctattattt 5160
ttcaagtaat aaaacaatgt ttttcatact gaatattata tatatatttt ttagctttca 5220
tttacttaat tattttaagt acctttattt ttccaggatg tcagaatttg attctaatct 5280
ctcttatgta gcacatgtga cttaatttaa aacctatact gtgacacaga gttgggtaaa 5340
cgatgattat ttaactttaa gcagttcacc atccatttca aagcctttga ttggcttttt 5400
tgtaaataaa aataacttgt taagaaacaa atatatctgt catagaagaa ctagaaaatc 5460
cagggaagtg agaaaaatga aaataaaaat cattcatagt tttactagta gctaatcaca 5520
gtcaacctct tttgtgtatc ccaccagact tttttatatt catttgtttt tagttaaaat 5580
ataaaagtct cgtatattcc catttttctg cattgcatta ccagaaggta gtggcgccta 5640
ttaaatatgt gatatgttgt tgtccagcca tggcttctgc atttgcatgc ttttgtgtgtgt 5700
gcatctgcaa taccctgtga atatcctgtg tgatggagtg gcaagtacgc acagacacgt 5760
ctgctgcatg cctaggtacg aggctgtctc caggagaagc acttgtttga ttatttgagt 5820
tgccaattga atttgctgct ttttttcatg gcttgccatt ttcactgaaa agaatgacta 5880
atgaaaaacg atgattggtt attagatttg gatgtttggc agacattttc tcaaaattga 5940
actaagttgg cctcttcacg gaaaacaact ggtatttgtt gtgccaatga taaaattgga 6000
gatttctagc aaaatgtata attttggaaa agttgtgttc ctccactgga agcttgacag 6060
ctttccttaa cataaagact tctctttctc ttcgctttca ctactactac tactaattct 6120
tcttctgatt cttcttcttc tccttcttcc ttcttccttc cttcctcctc ctcctccttc 6180
ttcttcctct tcctcttctt ctttctctct ttccttccctt cccttccctt ccccttcctt 6240
ccttccttcc ttccttcctc cctccctccc tccctccctc cctccctcct ttctttttct 6300
ttctctttct ttctttcttt ctctctctct ctctctttct ttctttttct ttctcttttt 6360
ctttctttca agcagtcctc ccgcctcagt cccccaaaat agtgggatta caggtgtgag 6420
ccaccatgca cagccttaca taaagccttt tctaatgaga tggatagtaa ttaacaaatg 6480
tgagtttttg atattatata aagatttttt ctgtgtttcg aagatccgta taactcagtg 6540
aatcagtatg ttctggatga ctaatatgtg atgttaagaa atcatgactg aggccgggcg 6600
cggtggctca cgcctgtaat cccagcactt tgggaggccg aggcgggcgg atcacgagat 6660
caggagatcg agaccaccct ggccaacatg gtgaaaccccc gtctctacta aaaatacaaa 6720
aattagctgg gtgtgttggt gcgtgcctat aatcccagct actcgggagg ctgaggcagg 6780
agaatcgctt gaactcagga ggcggagatt gcagtgagct gagactgcgc cactgcaccc 6840
cagcctggcg acagagcaag actccgtctc aaaaataaaa aaagaaatca tgactgggta 6900
aaagatctgt tcagagtaca agatggacca atggatttga tatatttgaa tataacagag 6960
tatgaaaaag tttattgata tagtttcaga ttacacactg caactaatct ttaagaaact 7020
attacktgtc cactttttgg taaaatttca gagaacaatg tccaccatta tctgaacagg 7080
ctattaaaat actcttctct tttccaacta cgtgcctgtg caaagtcaga tttttttcat 7140
atacttcagc caaaacagca tatcaaaatg gattgaatgc agaagtagat ctgagaatac 7200
agccactttt gttaagccag acaatgagat ttgcaaaatg taaacaatgc tgctgttctc 7260
agtttttaaa aatatgtttt ttaaaagtat ttatgttaat gtgtacttgg tttactactg 7320
ctatttttaa ataaaacaag aaacattttt aaatgtctgt tttaatttct aaagtggtag 7380
tgatagatat aacccatatt aataaaagct ctttggggtc ctcagtgatt tttttttaag 7440
agtatggaag ggttctcaga cctaagagat tgagaaatgc tgatgtaatg ttttattata 7500
aaggtgtacc atgaattatg taccttactt catattgttg gacattaaag ttgctttcag 7560
tttttttgtt ttaaa 7575
<![CDATA[ <210> 566]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 566]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
<![CDATA[ <210> 567]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 567]]>
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
1 5 10 15
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
20 25 30
Leu Ser Glu Asp Ile
35
<![CDATA[ <210> 568]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 568]]>
Met Ser Leu Ser Asp Gly Asp Val Val Gly Phe Asp Met Glu Trp Pro
1 5 10 15
Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys Val Ala Leu Ile Gln Leu
20 25 30
Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe His Val Ser Ser Met Ser
35 40 45
<![CDATA[ <210> 569]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 569]]>
Phe Pro Gln Gly Leu Lys Met Leu Leu Glu Asn Lys Ala Val Lys Lys
1 5 10 15
Ala Gly Val Gly Ile Glu Gly Asp Gln Trp Lys Leu Leu Arg Asp Phe
20 25 30
Asp Ile Lys Leu Lys Asn Phe Val Glu Leu Thr Asp Val Ala Asn Lys
35 40 45
Lys
<![CDATA[ <210> 570]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 570]]>
Leu Lys Cys Thr Glu Thr Trp Ser Leu Asn Ser Leu Val Lys His Leu
1 5 10 15
Leu Gly Lys Gln Leu Leu Lys Asp Lys Ser Ile Arg Cys Ser Asn Trp
20 25 30
Ser Lys Phe Pro Leu Thr Glu Asp Gln Lys Leu Tyr Ala Ala Thr Asp
35 40 45
Ala Tyr
50
<![CDATA[ <210> 571]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 571]]>
Ala Gly Phe Ile Ile Tyr Arg Asn Leu Glu Ile Leu Asp Asp Thr Val
1 5 10 15
Gln Arg Phe Ala Ile Asn Lys
20
<![CDATA[ <210> 572]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 572]]>
Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser Ile Ser
1 5 10 15
Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe Ser Lys
20 25 30
Leu Glu Asn Pro Arg
35
<![CDATA[ <210> 573]]>
<![CDATA[ <211> 143]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 573]]>
Val Ser Ile Leu Leu Lys Asp Ile Ser Glu Asn Leu Tyr Ser Leu Arg
1 5 10 15
Arg Met Ile Ile Gly Ser Thr Asn Ile Glu Thr Glu Leu Arg Pro Ser
20 25 30
Asn Asn Leu Asn Leu Leu Ser Phe Glu Asp Ser Thr Thr Gly Gly Val
35 40 45
Gln Gln Lys Gln Ile Arg Glu His Glu Val Leu Ile His Val Glu Asp
50 55 60
Glu Thr Trp Asp Pro Thr Leu Asp His Leu Ala Lys His Asp Gly Glu
65 70 75 80
Asp Val Leu Gly Asn Lys Val Glu Arg Lys Glu Asp Gly Phe Glu Asp
85 90 95
Gly Val Glu Asp Asn Lys Leu Lys Glu Asn Met Glu Arg Ala Cys Leu
100 105 110
Met Ser Leu Asp Ile Thr Glu His Glu Leu Gln Ile Leu Glu Gln Gln
115 120 125
Ser Gln Glu Glu Tyr Leu Ser Asp Ile Ala Tyr Lys Ser Thr Glu
130 135 140
<![CDATA[ <210> 574]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 574]]>
His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val Ile Glu
1 5 10 15
Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys
20 25
<![CDATA[ <210> 575]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 575]]>
His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val Ile Glu
1 5 10 15
Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys
20 25
<![CDATA[ <210> 576]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 576]]>
Ser Leu Glu Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys
1 5 10 15
Cys Leu Lys Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu Glu
20 25 30
Glu Asp Asp Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys
35 40 45
<![CDATA[ <210> 577]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 577]]>
Phe Leu Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met
1 5 10 15
Tyr Phe Gly His Ser Ser Phe Lys
20
<![CDATA[ <210> 578]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 578]]>
Val Gln Trp Lys Val Ile His Ser Val Leu Glu Glu Arg Arg Asp Asn
1 5 10 15
Val Ala Val Met Ala Thr
20
<![CDATA[ <210> 579]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 579]]>
Tyr Gly Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys
1 5 10 15
Ile Gly Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val
20 25 30
Leu Gln Leu
35
<![CDATA[ <210> 580]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 580]]>
Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser Glu Asn
1 5 10 15
Val Leu Thr Asp Ile Lys
20
<![CDATA[ <210> 581]]>
<![CDATA[ <211> 27]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 581]]>
Gly Lys Tyr Arg Ile Val Tyr Val Thr Pro Glu Tyr Cys Ser Gly Asn
1 5 10 15
Met Gly Leu Leu Gln Gln Leu Glu Ala Asp Ile
20 25
<![CDATA[ <210> 582]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 58]]>2
Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys Ile Ser Glu Trp Gly
1 5 10 15
His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly Ser Leu Lys Thr Ala
20 25 30
Leu Pro Met
35
<![CDATA[ <210> 583]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 583]]>
Val Pro Ile Val Ala Leu Thr Ala Thr Ala Ser Ser Ser Ile Arg Glu
1 5 10 15
Asp Ile Val Arg Cys Leu Asn Leu Arg Asn Pro Gln Ile Thr Cys Thr
20 25 30
Gly Phe Asp Arg Pro Asn Leu Tyr Leu Glu Val Arg Arg Lys Thr Gly
35 40 45
Asn Ile Leu Gln Asp Leu Gln Pro Phe Leu Val Lys Thr
50 55 60
<![CDATA[ <210> 584]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 584]]>
Ser His Trp Glu Phe Glu Gly Pro Thr Ile Ile Tyr Cys Pro Ser Arg
1 5 10 15
Lys Met Thr Gln Gln Val Thr Gly Glu Leu Arg Lys Leu Asn Leu Ser
20 25 30
Cys Gly Thr Tyr His Ala Gly Met Ser Phe Ser Thr Arg Lys Asp Ile
35 40 45
His His Arg Phe Val Arg Asp Glu Ile Gln
50 55
<![CDATA[ <210> 585]]>
<![CDATA[ <211> 60]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 585]]>
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
1 5 10 15
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
20 25 30
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
35 40 45
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn
50 55 60
<![CDATA[ <210> 586]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 586]]>
His Leu Leu Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu
1 5 10 15
Lys Met Met Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg
20 25 30
Arg
<![CDATA[ <210> 587]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 587]]>
Ile Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu
1 5 10 15
Gly Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser
20 25 30
<![CDATA[ <210> 588]]>
<![CDATA[ <211> 47]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 588]]>
Leu Asp His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp
1 5 10 15
Phe Gly Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly
20 25 30
Glu Lys Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser
35 40 45
<![CDATA[ <210> 589]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 589]]>
Asn Ser Gln Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly
1 5 10 15
Thr Gly Lys Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln
20 25 30
Leu Ile Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe
35 40 45
Met Lys Ile Cys Ala Leu Thr Lys Lys
50 55
<![CDATA[ <210> 590]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 590]]>
Gly Arg Asn Trp Leu His Lys Ala Asn Thr Glu Ser Gln Ser Leu Ile
1 5 10 15
Leu Gln Ala Asn Glu Glu Leu Cys Pro Lys Lys Leu Leu Leu Pro
20 25 30
<![CDATA[ <210> 591]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 591]]>
Ser Lys Thr Val Ser Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val
1 5 10 15
Pro Val Glu Leu Ser Thr Glu Lys Lys
20 25
<![CDATA[ <210> 592]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 592]]>
Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys Pro Cys Asp Lys Ile Ser
1 5 10 15
Ser Gly Ser Asn Ile Ser Lys Lys
20
<![CDATA[ <210> 593]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 593]]>
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Gln Pro Val
1 5 10 15
Ile Ser Ala Gln Glu Gln Glu Thr Gln
20 25
<![CDATA[ <210> 594]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 594]]>
Ile Val Leu Tyr Gly Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn
1 5 10 15
Lys Met Asp Val Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val
20 25 30
Asp Met Ala Lys Met
35
<![CDATA[ <210> 595]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 595]]>
Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly Val Ser Glu Gly
1 5 10 15
Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val Ile Lys His Phe Cys
20 25 30
Gln Thr Asn Ser Val Gln
35
<![CDATA[ <210> 596]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 596]]>
Thr Asp Leu Phe Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser
1 5 10 15
Leu Val Ala Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile
20 25 30
Thr Tyr Ser Leu Phe Gln Glu Lys Lys Met Pro Leu
35 40
<![CDATA[ <210> 597]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 597]]>
Lys Ser Ile Ala Glu Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met
1 5 10 15
His Leu Ser Gln Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg
20 25 30
Ala Gly Leu Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg
35 40 45
Asn Pro Pro Val Asn Ser
50
<![CDATA[ <210> 598]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 598]]>
Met Ser Lys Ile Ser Leu Ile Arg Met Leu Val Pro Glu Asn Ile Asp
1 5 10 15
Thr Tyr Leu Ile His Met Ala Ile Glu Ile Leu Lys His Gly Pro Asp
20 25 30
Ser Gly Leu Gln Pro Ser Cys Asp Val Asn Lys Arg Arg Cys Phe Pro
35 40 45
Gly Ser Glu Glu Ile Cys Ser Ser Ser Lys Arg Ser Lys Glu Glu Val
50 55 60
Gly Ile Asn Thr Glu
65
<![CDATA[ <210> 599]]>
<![CDATA[ <211> 35]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 599]]>
Thr Ser Ser Ala Glu Arg Lys Arg Arg Leu Pro Val Trp Phe Ala Lys
1 5 10 15
Gly Ser Asp Thr Ser Lys Lys Leu Met Asp Lys Thr Lys Arg Gly Gly
20 25 30
Leu Phe Ser
35
<![CDATA[ <210> 600]]>
<![CDATA[ <211> 1432]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 600]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
35 40 45
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
50 55 60
Leu Ser Glu Asp Ile Ser Met Ser Leu Ser Asp Gly Asp Val Val Gly
65 70 75 80
Phe Asp Met Glu Trp Pro Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys
85 90 95
Val Ala Leu Ile Gln Leu Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe
100 105 110
His Val Ser Ser Met Ser Val Phe Pro Gln Gly Leu Lys Met Leu Leu
115 120 125
Glu Asn Lys Ala Val Lys Lys Ala Gly Val Gly Ile Glu Gly Asp Gln
130 135 140
Trp Lys Leu Leu Arg Asp Phe Asp Ile Lys Leu Lys Asn Phe Val Glu
145 150 155 160
Leu Thr Asp Val Ala Asn Lys Lys Leu Lys Cys Thr Glu Thr Trp Ser
165 170 175
Leu Asn Ser Leu Val Lys His Leu Leu Gly Lys Gln Leu Leu Lys Asp
180 185 190
Lys Ser Ile Arg Cys Ser Asn Trp Ser Lys Phe Pro Leu Thr Glu Asp
195 200 205
Gln Lys Leu Tyr Ala Ala Thr Asp Ala Tyr Ala Gly Phe Ile Ile Tyr
210 215 220
Arg Asn Leu Glu Ile Leu Asp Asp Thr Val Gln Arg Phe Ala Ile Asn
225 230 235 240
Lys Glu Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser
245 250 255
Ile Ser Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe
260 265 270
Ser Lys Leu Glu Asn Pro Arg Arg Val Ser Ile Leu Leu Lys Asp Ile
275 280 285
Ser Glu Asn Leu Tyr Ser Leu Arg Arg Met Ile Ile Gly Ser Thr Asn
290 295 300
Ile Glu Thr Glu Leu Arg Pro Ser Asn Asn Leu Asn Leu Leu Ser Phe
305 310 315 320
Glu Asp Ser Thr Thr Gly Gly Val Gln Gln Lys Gln Ile Arg Glu His
325 330 335
Glu Val Leu Ile His Val Glu Asp Glu Thr Trp Asp Pro Thr Leu Asp
340 345 350
His Leu Ala Lys His Asp Gly Glu Asp Val Leu Gly Asn Lys Val Glu
355 360 365
Arg Lys Glu Asp Gly Phe Glu Asp Gly Val Glu Asp Asn Lys Leu Lys
370 375 380
Glu Asn Met Glu Arg Ala Cys Leu Met Ser Leu Asp Ile Thr Glu His
385 390 395 400
Glu Leu Gln Ile Leu Glu Gln Gln Ser Gln Glu Glu Tyr Leu Ser Asp
405 410 415
Ile Ala Tyr Lys Ser Thr Glu His Leu Ser Pro Asn Asp Asn Glu Asn
420 425 430
Asp Thr Ser Tyr Val Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met
435 440 445
Leu Lys His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val
450 455 460
Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys Ser Leu Glu
465 470 475 480
Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys Cys Leu Lys
485 490 495
Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu Glu Asp Asp
500 505 510
Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys Asp Phe Leu
515 520 525
Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met Tyr Phe
530 535 540
Gly His Ser Ser Phe Lys Pro Val Gln Trp Lys Val Ile His Ser Val
545 550 555 560
Leu Glu Glu Arg Arg Asp Asn Val Ala Val Met Ala Thr Gly Tyr Gly
565 570 575
Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys Ile Gly
580 585 590
Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val Leu Gln
595 600 605
Leu Lys Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser
610 615 620
Glu Asn Val Leu Thr Asp Ile Lys Leu Gly Lys Tyr Arg Ile Val Tyr
625 630 635 640
Val Thr Pro Glu Tyr Cys Ser Gly Asn Met Gly Leu Leu Gln Gln Leu
645 650 655
Glu Ala Asp Ile Gly Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys
660 665 670
Ile Ser Glu Trp Gly His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly
675 680 685
Ser Leu Lys Thr Ala Leu Pro Met Val Pro Ile Val Ala Leu Thr Ala
690 695 700
Thr Ala Ser Ser Ser Ile Arg Glu Asp Ile Val Arg Cys Leu Asn Leu
705 710 715 720
Arg Asn Pro Gln Ile Thr Cys Thr Gly Phe Asp Arg Pro Asn Leu Tyr
725 730 735
Leu Glu Val Arg Arg Lys Thr Gly Asn Ile Leu Gln Asp Leu Gln Pro
740 745 750
Phe Leu Val Lys Thr Ser Ser His Trp Glu Phe Glu Gly Pro Thr Ile
755 760 765
Ile Tyr Cys Pro Ser Arg Lys Met Thr Gln Gln Val Thr Gly Glu Leu
770 775 780
Arg Lys Leu Asn Leu Ser Cys Gly Thr Tyr His Ala Gly Met Ser Phe
785 790 795 800
Ser Thr Arg Lys Asp Ile His His Arg Phe Val Arg Asp Glu Ile Gln
805 810 815
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
820 825 830
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
835 840 845
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
850 855 860
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn Arg His Leu Leu
865 870 875 880
Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu Lys Met Met
885 890 895
Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg Arg Gln Ile
900 905 910
Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu Gly
915 920 925
Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser Arg Leu Asp
930 935 940
His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp Phe Gly
945 950 955 960
Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly Glu Lys
965 970 975
Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser Asn Ser Gln
980 985 990
Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly Thr Gly Lys
995 1000 1005
Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln Leu Ile
1010 1015 1020
Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe Met
1025 1030 1035
Lys Ile Cys Ala Leu Thr Lys Lys Gly Arg Asn Trp Leu His Lys
1040 1045 1050
Ala Asn Thr Glu Ser Gln Ser Leu Ile Leu Gln Ala Asn Glu Glu Glu
1055 1060 1065
Leu Cys Pro Lys Lys Leu Leu Leu Pro Ser Ser Lys Thr Val Ser
1070 1075 1080
Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val Pro Val Glu Leu
1085 1090 1095
Ser Thr Glu Lys Lys Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys
1100 1105 1110
Pro Cys Asp Lys Ile Ser Ser Gly Ser Asn Ile Ser Lys Lys Ser
1115 1120 1125
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Ser Gln Pro
1130 1135 1140
Val Ile Ser Ala Gln Glu Gln Glu Thr Gln Ile Val Leu Tyr Gly
1145 1150 1155
Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn Lys Met Asp Val
1160 1165 1170
Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val Asp Met Ala
1175 1180 1185
Lys Met Arg Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly
1190 1195 1200
Val Ser Glu Gly Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val
1205 1210 1215
Ile Lys His Phe Cys Gln Thr Asn Ser Val Gln Thr Asp Leu Phe
1220 1225 1230
Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser Leu Val Ala
1235 1240 1245
Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile Thr Tyr
1250 1255 1260
Ser Leu Phe Gln Glu Lys Lys Met Pro Leu Lys Ser Ile Ala Glu
1265 1270 1275
Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met His Leu Ser Gln
1280 1285 1290
Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu
1295 1300 1305
Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro
1310 1315 1320
Pro Val Asn Ser Asp Met Ser Lys Ile Ser Leu Ile Arg Met Leu
1325 1330 1335
Val Pro Glu Asn Ile Asp Thr Tyr Leu Ile His Met Ala Ile Glu
1340 1345 1350
Ile Leu Lys His Gly Pro Asp Ser Gly Leu Gln Pro Ser Cys Asp
1355 1360 1365
Val Asn Lys Arg Arg Cys Phe Pro Gly Ser Glu Glu Ile Cys Ser
1370 1375 1380
Ser Ser Lys Arg Ser Lys Glu Glu Val Gly Ile Asn Thr Glu Thr
1385 1390 1395
Ser Ser Ala Glu Arg Lys Arg Arg Leu Pro Val Trp Phe Ala Lys
1400 1405 1410
Gly Ser Asp Thr Ser Lys Lys Leu Met Asp Lys Thr Lys Arg Gly
1415 1420 1425
Gly Leu Phe Ser
1430
<![CDATA[ <210> 601]]>
<![CDATA[ <211> 4222]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 601]]>
gtgtactgtg tgcgccgggg aggcgccggc ttgtactcgg cagcgcggga ataaagtttg 60
ctgatttggt gtctagcctg gatgcctggg ttgcaggccc tgcttgtggt ggcgctccac 120
agtcatccgg ctgaagaaga cctgttggac tggatcttct cgggttttct ttcagatatt 180
gttttgtatt tacccatgaa gacattgttt tttggactct gcaaataggga catttcaaag 240
atgagtgaaa aaaaattgga aacaactgca cagcagcgga aatgtcctga atggatgaat 300
gtgcagaata aaagatgtgc tgtagaagaa agaaaggcat gtgttcggaa gagtgttttt 360
gaagatgacc tccccttctt agaattcact ggatccattg tgtatagtta cgatgctagt 420
gattgctctt tcctgtcaga agatattagc atgagtctat cagatgggga tgtggtggga 480
tttgacatgg agtggccacc attatacaat agagggaaac ttggcaaagt tgcactaatt 540
cagttgtgtg tttctgagag caaatgttac ttgttccacg tttcttccat gtcagttttt 600
ccccagggat taaaaatgtt gcttgaaaat aaagcagtta aaaaggcagg tgtaggaatt 660
gaaggagatc agtggaaact tctacgtgac tttgatatca aattgaagaa ttttgtggag 720
ttgacagatg ttgccaataa aaagctgaaa tgcacagaga cctggagcct taacagtctg 780
gttaaacacc tcttaggtaa acagctcctg aaagacaagt ctatccgctg tagcaattgg 840
agtaaatttc ctctcactga ggaccagaaa ctgtatgcag ccactgatgc ttatgctggt 900
tttattattt accgaaattt agagattttg gatgatactg tgcaaaggtt tgctataaat 960
aaagaggaag aaatcctact tagcgacatg aacaaacagt tgacttcaat ctctgaggaa 1020
gtgatggatc tggctaagca tcttcctcat gctttcagta aattggaaaa cccacggagg 1080
gtttctatct tactaaagga tatttcagaa aatctatatt cactgaggag gatgataatt 1140
gggtctacta acattgagac tgaactgagg cccagcaata atttaaactt attatccttt 1200
gaagattcaa ctactggggg agtacaacag aaacaaatta gagaacatga agttttaatt 1260
cacgttgaag atgaaacatg ggacccaaca cttgatcatt tagctaaaca tgatggagaa 1320
gatgtacttg gaaataaagt ggaacgaaaa gaagatggat ttgaagatgg agtagaagac 1380
aacaaattga aagagaatat ggaaagagct tgtttgatgt cgttagatat tacagaacat 1440
gaactccaaa ttttggaaca gcagtctcag gaagaatatc ttagtgatat tgcttataaa 1500
tctactgagc atttatctcc caatgataat gaaaacgata cgtcctatgt aattgagagt 1560
gatgaagatt tagaaatgga gatgcttaag catttatctc ccaatgataa tgaaaacgat 1620
acgtcctatg taattgagag tgatgaagat ttagaaatgg agatgcttaa gtctttagaa 1680
aacctcaata gtggcacggt agaaccaact cattctaaat gcttaaaaat ggaaagaaat 1740
ctgggtcttc ctactaaaga agaagaagaa gatgatgaaa atgaagctaa tgaaggggaa 1800
gaagatgatg ataaggactt tttgtggcca gcacccaatg aagagcaagt tacttgcctc 1860
aagatgtact ttggccattc cagttttaaa ccagttcagt ggaaagtgat tcattcagta 1920
ttagaagaaa gaagagataa tgttgctgtc atggcaactg gatatggaaa gagtttgtgc 1980
ttccagtatc cacctgttta tgtaggcaag attggccttg ttatctctcc ccttatttct 2040
ctgatggaag accaagtgct acagcttaaa atgtccaaca tcccagcttg cttccttgga 2100
tcagcacagt cagaaaatgt tctaacagat attaaattag gtaaataccg gattgtatac 2160
gtaactccag aatactgttc aggtaacatg ggcctgctcc agcaacttga ggctgatatt 2220
ggtatcacgc tcattgctgt ggatgaggct cactgtattt ctgagtgggg gcatgatttt 2280
agggattcat tcaggaagtt gggctcccta aagacagcac tgccaatggt tccaatcgtt 2340
gcacttactg ctactgcaag ttcttcaatc cgggaagaca ttgtacgttg cttaaatctg 2400
agaaatcctc agaatcacctg tactggtttt gatcgaccaa acctgtattt agaagttagg 2460
cgaaaaacag ggaatatcct tcaggatctg cagccatttc ttgtcaaaac aagttcccac 2520
tgggaatttg aaggtccaac aatcatctac tgtccttcta gaaaaatgac acaacaagtt 2580
acaggtgaac ttaggaaact gaatctatcc tgtggaacat accatgcggg catgagtttt 2640
agcacaagga aagacattca tcataggtt gtaagagatg aaattcagtg tgtcatagct 2700
accatagctt ttggaatggg cattaataaa gctgacattc gccaagtcat tcattacggt 2760
gctcctaagg acatggaatc atattatcag gagattggta gagctggtcg tgatggactt 2820
caaagttctt gtcacgtcct ctgggctcct gcagacatta acttaaatag gcaccttctt 2880
actgagatac gtaatgagaa gtttcgatta tacaaattaa agatgatggc aaagatggaa 2940
aaatatcttc attctagcag atgtaggaga caaatcatct tgtctcattt tgaggacaaa 3000
caagtacaaa aagcctcctt gggaattatg ggaactgaaa aatgctgtga taattgcagg 3060
tccagattgg atcattgcta ttccatggat gactcagagg atacatcctg ggactttggt 3120
ccacaagcat ttaagctttt gtctgctgtg gacatcttag gcgaaaaatt tggaattggg 3180
cttccaattt tatttctccg aggatctaat tctcagcgtc ttgccgatca atatcgcagg 3240
cacagtttat ttggcactgg caaggatcaa acagagagtt ggtggaaggc tttttcccgt 3300
cagctgatca ctgagggatt cttggtagaa gtttctcggt ataacaaatt tatgaagatt 3360
tgcgccctta cgaaaaaggg tagaaattgg cttcataaag ctaatacaga atctcagagc 3420
ctcatccttc aagctaatga agaattgtgt ccaaagaagt tgcttctgcc tagttcgaaa 3480
actgtatctt cgggcaccaa agagcattgt tataatcaag taccagttga attaagtaca 3540
gagaagaagt ctaacttgga gaagttatat tcttataaac catgtgataa gatttcttct 3600
gggagtaaca tttctaaaaa aagtatcatg gtacagtcac cagaaaaagc ttacagttcc 3660
tcacagcctg ttatttcggc acaagagcag gagactcaga ttgtgttata tggcaaattg 3720
gtagaagcta ggcagaaaca tgccaataaa atggatgttc ccccagctat tctggcaaca 3780
aacaagatac tggtggatat ggccaaaatg agaccaacta cggttgaaaa cgtaaaaagg 3840
attgatggtg tttctgaagg caaagctgcc atgttggccc ctctgttgga agtcatcaaa 3900
catttctgcc aaacaaatag tgttcagaca gacctctttt caagtacaaa acctcaagaa 3960
gaacagaaga cgagtctggt agcaaaaaat aaaatatgca cactttcaca gtctatggcc 4020
atcacatact ctttattcca agaaaagaag atgcctttga agagcatagc tgagagcagg 4080
attctgcctc tcatgacaat tggcatgcac ttatcccaag cggtgaaagc tggctgcccc 4140
cttgatttgg agcgagcagg cctgactcca gaggttcaga agattattgc tgatgttatc 4200
cgaaaccctc ccgtcaactc ag 4222
<![CDATA[ <210> 602]]>
<![CDATA[ <211> 1327]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 602]]>
Met Ser Glu Lys Lys Leu Glu Thr Thr Ala Gln Gln Arg Lys Cys Pro
1 5 10 15
Glu Trp Met Asn Val Gln Asn Lys Arg Cys Ala Val Glu Glu Arg Lys
20 25 30
Ala Cys Val Arg Lys Ser Val Phe Glu Asp Asp Leu Pro Phe Leu Glu
35 40 45
Phe Thr Gly Ser Ile Val Tyr Ser Tyr Asp Ala Ser Asp Cys Ser Phe
50 55 60
Leu Ser Glu Asp Ile Ser Met Ser Leu Ser Asp Gly Asp Val Val Gly
65 70 75 80
Phe Asp Met Glu Trp Pro Pro Leu Tyr Asn Arg Gly Lys Leu Gly Lys
85 90 95
Val Ala Leu Ile Gln Leu Cys Val Ser Glu Ser Lys Cys Tyr Leu Phe
100 105 110
His Val Ser Ser Met Ser Val Phe Pro Gln Gly Leu Lys Met Leu Leu
115 120 125
Glu Asn Lys Ala Val Lys Lys Ala Gly Val Gly Ile Glu Gly Asp Gln
130 135 140
Trp Lys Leu Leu Arg Asp Phe Asp Ile Lys Leu Lys Asn Phe Val Glu
145 150 155 160
Leu Thr Asp Val Ala Asn Lys Lys Leu Lys Cys Thr Glu Thr Trp Ser
165 170 175
Leu Asn Ser Leu Val Lys His Leu Leu Gly Lys Gln Leu Leu Lys Asp
180 185 190
Lys Ser Ile Arg Cys Ser Asn Trp Ser Lys Phe Pro Leu Thr Glu Asp
195 200 205
Gln Lys Leu Tyr Ala Ala Thr Asp Ala Tyr Ala Gly Phe Ile Ile Tyr
210 215 220
Arg Asn Leu Glu Ile Leu Asp Asp Thr Val Gln Arg Phe Ala Ile Asn
225 230 235 240
Lys Glu Glu Glu Ile Leu Leu Ser Asp Met Asn Lys Gln Leu Thr Ser
245 250 255
Ile Ser Glu Glu Val Met Asp Leu Ala Lys His Leu Pro His Ala Phe
260 265 270
Ser Lys Leu Glu Asn Pro Arg Arg Val Ser Ile Leu Leu Lys Asp Ile
275 280 285
Ser Glu Asn Leu Tyr Ser Leu Arg Arg Met Ile Ile Gly Ser Thr Asn
290 295 300
Ile Glu Thr Glu Leu Arg Pro Ser Asn Asn Leu Asn Leu Leu Ser Phe
305 310 315 320
Glu Asp Ser Thr Thr Gly Gly Val Gln Gln Lys Gln Ile Arg Glu His
325 330 335
Glu Val Leu Ile His Val Glu Asp Glu Thr Trp Asp Pro Thr Leu Asp
340 345 350
His Leu Ala Lys His Asp Gly Glu Asp Val Leu Gly Asn Lys Val Glu
355 360 365
Arg Lys Glu Asp Gly Phe Glu Asp Gly Val Glu Asp Asn Lys Leu Lys
370 375 380
Glu Asn Met Glu Arg Ala Cys Leu Met Ser Leu Asp Ile Thr Glu His
385 390 395 400
Glu Leu Gln Ile Leu Glu Gln Gln Ser Gln Glu Glu Tyr Leu Ser Asp
405 410 415
Ile Ala Tyr Lys Ser Thr Glu His Leu Ser Pro Asn Asp Asn Glu Asn
420 425 430
Asp Thr Ser Tyr Val Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met
435 440 445
Leu Lys His Leu Ser Pro Asn Asp Asn Glu Asn Asp Thr Ser Tyr Val
450 455 460
Ile Glu Ser Asp Glu Asp Leu Glu Met Glu Met Leu Lys Ser Leu Glu
465 470 475 480
Asn Leu Asn Ser Gly Thr Val Glu Pro Thr His Ser Lys Cys Leu Lys
485 490 495
Met Glu Arg Asn Leu Gly Leu Pro Thr Lys Glu Glu Glu Glu Asp Asp
500 505 510
Glu Asn Glu Ala Asn Glu Gly Glu Glu Asp Asp Asp Lys Asp Phe Leu
515 520 525
Trp Pro Ala Pro Asn Glu Glu Gln Val Thr Cys Leu Lys Met Tyr Phe
530 535 540
Gly His Ser Ser Phe Lys Pro Val Gln Trp Lys Val Ile His Ser Val
545 550 555 560
Leu Glu Glu Arg Arg Asp Asn Val Ala Val Met Ala Thr Gly Tyr Gly
565 570 575
Lys Ser Leu Cys Phe Gln Tyr Pro Pro Val Tyr Val Gly Lys Ile Gly
580 585 590
Leu Val Ile Ser Pro Leu Ile Ser Leu Met Glu Asp Gln Val Leu Gln
595 600 605
Leu Lys Met Ser Asn Ile Pro Ala Cys Phe Leu Gly Ser Ala Gln Ser
610 615 620
Glu Asn Val Leu Thr Asp Ile Lys Leu Gly Lys Tyr Arg Ile Val Tyr
625 630 635 640
Val Thr Pro Glu Tyr Cys Ser Gly Asn Met Gly Leu Leu Gln Gln Leu
645 650 655
Glu Ala Asp Ile Gly Ile Thr Leu Ile Ala Val Asp Glu Ala His Cys
660 665 670
Ile Ser Glu Trp Gly His Asp Phe Arg Asp Ser Phe Arg Lys Leu Gly
675 680 685
Ser Leu Lys Thr Ala Leu Pro Met Val Pro Ile Val Ala Leu Thr Ala
690 695 700
Thr Ala Ser Ser Ser Ile Arg Glu Asp Ile Val Arg Cys Leu Asn Leu
705 710 715 720
Arg Asn Pro Gln Ile Thr Cys Thr Gly Phe Asp Arg Pro Asn Leu Tyr
725 730 735
Leu Glu Val Arg Arg Lys Thr Gly Asn Ile Leu Gln Asp Leu Gln Pro
740 745 750
Phe Leu Val Lys Thr Ser Ser His Trp Glu Phe Glu Gly Pro Thr Ile
755 760 765
Ile Tyr Cys Pro Ser Arg Lys Met Thr Gln Gln Val Thr Gly Glu Leu
770 775 780
Arg Lys Leu Asn Leu Ser Cys Gly Thr Tyr His Ala Gly Met Ser Phe
785 790 795 800
Ser Thr Arg Lys Asp Ile His His Arg Phe Val Arg Asp Glu Ile Gln
805 810 815
Cys Val Ile Ala Thr Ile Ala Phe Gly Met Gly Ile Asn Lys Ala Asp
820 825 830
Ile Arg Gln Val Ile His Tyr Gly Ala Pro Lys Asp Met Glu Ser Tyr
835 840 845
Tyr Gln Glu Ile Gly Arg Ala Gly Arg Asp Gly Leu Gln Ser Ser Cys
850 855 860
His Val Leu Trp Ala Pro Ala Asp Ile Asn Leu Asn Arg His Leu Leu
865 870 875 880
Thr Glu Ile Arg Asn Glu Lys Phe Arg Leu Tyr Lys Leu Lys Met Met
885 890 895
Ala Lys Met Glu Lys Tyr Leu His Ser Ser Arg Cys Arg Arg Gln Ile
900 905 910
Ile Leu Ser His Phe Glu Asp Lys Gln Val Gln Lys Ala Ser Leu Gly
915 920 925
Ile Met Gly Thr Glu Lys Cys Cys Asp Asn Cys Arg Ser Arg Leu Asp
930 935 940
His Cys Tyr Ser Met Asp Asp Ser Glu Asp Thr Ser Trp Asp Phe Gly
945 950 955 960
Pro Gln Ala Phe Lys Leu Leu Ser Ala Val Asp Ile Leu Gly Glu Lys
965 970 975
Phe Gly Ile Gly Leu Pro Ile Leu Phe Leu Arg Gly Ser Asn Ser Gln
980 985 990
Arg Leu Ala Asp Gln Tyr Arg Arg His Ser Leu Phe Gly Thr Gly Lys
995 1000 1005
Asp Gln Thr Glu Ser Trp Trp Lys Ala Phe Ser Arg Gln Leu Ile
1010 1015 1020
Thr Glu Gly Phe Leu Val Glu Val Ser Arg Tyr Asn Lys Phe Met
1025 1030 1035
Lys Ile Cys Ala Leu Thr Lys Lys Gly Arg Asn Trp Leu His Lys
1040 1045 1050
Ala Asn Thr Glu Ser Gln Ser Leu Ile Leu Gln Ala Asn Glu Glu Glu
1055 1060 1065
Leu Cys Pro Lys Lys Leu Leu Leu Pro Ser Ser Lys Thr Val Ser
1070 1075 1080
Ser Gly Thr Lys Glu His Cys Tyr Asn Gln Val Pro Val Glu Leu
1085 1090 1095
Ser Thr Glu Lys Lys Ser Asn Leu Glu Lys Leu Tyr Ser Tyr Lys
1100 1105 1110
Pro Cys Asp Lys Ile Ser Ser Gly Ser Asn Ile Ser Lys Lys Ser
1115 1120 1125
Ile Met Val Gln Ser Pro Glu Lys Ala Tyr Ser Ser Ser Ser Gln Pro
1130 1135 1140
Val Ile Ser Ala Gln Glu Gln Glu Thr Gln Ile Val Leu Tyr Gly
1145 1150 1155
Lys Leu Val Glu Ala Arg Gln Lys His Ala Asn Lys Met Asp Val
1160 1165 1170
Pro Pro Ala Ile Leu Ala Thr Asn Lys Ile Leu Val Asp Met Ala
1175 1180 1185
Lys Met Arg Pro Thr Thr Val Glu Asn Val Lys Arg Ile Asp Gly
1190 1195 1200
Val Ser Glu Gly Lys Ala Ala Met Leu Ala Pro Leu Leu Glu Val
1205 1210 1215
Ile Lys His Phe Cys Gln Thr Asn Ser Val Gln Thr Asp Leu Phe
1220 1225 1230
Ser Ser Thr Lys Pro Gln Glu Glu Gln Lys Thr Ser Leu Val Ala
1235 1240 1245
Lys Asn Lys Ile Cys Thr Leu Ser Gln Ser Met Ala Ile Thr Tyr
1250 1255 1260
Ser Leu Phe Gln Glu Lys Lys Met Pro Leu Lys Ser Ile Ala Glu
1265 1270 1275
Ser Arg Ile Leu Pro Leu Met Thr Ile Gly Met His Leu Ser Gln
1280 1285 1290
Ala Val Lys Ala Gly Cys Pro Leu Asp Leu Glu Arg Ala Gly Leu
1295 1300 1305
Thr Pro Glu Val Gln Lys Ile Ile Ala Asp Val Ile Arg Asn Pro
1310 1315 1320
Pro Val Asn Ser
1325
<![CDATA[ <210> 603]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 603]]>
gaaggaaact gtttaaccgg atcccattgt accccagagtg cagagccgcc tttccagcat 60
gcaggggctg ctcag 75
<![CDATA[ <210> 604]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 604]]>
gacagagagg gaggaggcgc gcggggacgg ggacgcccag gaggacccac tcgcgggtcc 60
cgctccgctc cggca 75
<![CDATA[ <210> 605]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 605]]>
gaaggaaact gtttaaccgg atcccattgt accccagagtg cagagccgcc tttccagcat 60
gcaggggctg ctcaggacag aggggagga ggcgcgcggg gacggggacg cccaggagga 120
cccactcgcg ggtcccgctc cgctccggca 150
<![CDATA[ <210> 606]]>
<![CDATA[ <211> 102]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 606]]>
gcgagttagt aacctacgag cggctgtgaa ggaaactgtt taaccggatc ccattgtacc 60
cagagtgcag agccgccttt ccagcatgca ggggctgctc ag 102
<![CDATA[ <210> 607]]>
<![CDATA[ <211> 220]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 607]]>
cgtttagtca catcaagaaa tagaacagaa ttcagccatg gccccaagaa agagaggtgg 60
acgaggtatt tcattcatct tttgctgttt ccgaaataat gatcacccag aaatcacgta 120
tcggctgcga aatgatagca actttgcgct tcagaccatg gaaccagcat tgcccatgcc 180
ccctgtggag gagctggatg tcatgttcag tgaactggtg 220
<![CDATA[ <210> 608]]>
<![CDATA[ <211> 90]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 608]]>
gatgaactgg acctcacaga caaacacaga gaagccatgt ttgcacttcc agctgagaaa 60
aaatggcaaa tatactgtag caagaaaaag 90
<![CDATA[ <210> 609]]>
<![CDATA[ <211> 72]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 609]]>
gaccaggaag aaaacaaggg agctacaagt tggcctgaat tctacattga tcagctcaat 60
tccatggctg ct 72
<![CDATA[ <210> 610]]>
<![CDATA[ <211> 9]]>5
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 610]]>
agaaaatctc tgctggcttt agagaaggaa gaagaagaag aaagaagtaa aactatagag 60
agtttaaaga cagcactgag gacaaaacca atgag 95
<![CDATA[ <210> 611]]>
<![CDATA[ <211> 334]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 611]]>
gtttgtaacc agattcatcg acttggatgg cctatcatgt atcctcaact ttctaaagac 60
catggactac gagacctcag agtctcgaat acatacttct ctcattggct gtataaaggc 120
gttaatgaac aactctcaag gccgggctca cgtcctggct cattctgaga gttattaatgt 180
aattgctcag agtctgagca cagagaacat taaaacgaag gtggccgtgc tggaaatctt 240
gggcgccgtg tgcctggttc ccgggggcca caagaaggtt ctgcaggcca tgctgcacta 300
ccagaagtat gccagcgaaa ggacccgctt tcag 334
<![CDATA[ <210> 612]]>
<![CDATA[ <211> 111]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 612]]>
acattaatta acgacttgga taaaagcact gggcggtatc gagatgaagt gagtctcaag 60
actgccatca tgtccttcat taatgcagtg ctcagccaag gtgcaggagt g 111
<![CDATA[ <210> 613]]>
<![CDATA[ <211> 104]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 613]]>
gagagtttgg actttagact tcatcttcgc tatgaatttc tgatgttagg aattcaacct 60
gtaatagata aattaaggga acacgaaaat tcaacattag atag 104
<![CDATA[ <210> 614]]>
<![CDATA[ <211> 67]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 614]]>
gcatttagac ttttttgaaa tgctccgaaa tgaagatgaa ctagaatttg ccaaaagatt 60
tgaactg 67
<![CDATA[ <210> 615]]>
<![CDATA[ <211> 118]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 615]]>
gttcacatag acacaaaaag tgcaactcag atgtttgagc tgaccaggaa gaggctgaca 60
catagtgaag cttacccgca tttcatgtcc atcctgcacc actgcctcca aatgcctt 118
<![CDATA[ <210> 616]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 616]]>
acaagaggag tggcaacact gttcagtact ggctactact agatagaatt atacagcaga 60
tagttatcca gaatgacaaa ggacaggacc ctgactccac acctttggaa aactttaata 120
ttaagaatgt cgtacgaat 139
<![CDATA[ <210> 617]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 617]]>
gttggttaat gaaaatgaag ttaagcagtg gaaagaacaa gcggaaaaaa tgagaaaag 59
<![CDATA[ <210> 618]]>
<![CDATA[ <211> 188]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 618]]>
agcacaatga gctacaacag aaactggaaa agaaagaacg agaatgtgat gctaagactc 60
aagagaagga agagatgatg cagaccttaa ataaaatgaa agagaaactt gaaaaggaga 120
ctactgagca taagcaagtc aagcagcagg tggcggacct cacagcacag ctccatgagc 180
tcagcagg 188
<![CDATA[ <210> 619]]>
<![CDATA[ <211> 300]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 619]]>
agggccgtct gtgcttcaat cccaggtgga ccctcgcctg gagcaccagg agggcccttt 60
ccttcctctg tgcctggatc tctccttcct cccccaccac ccccacctct accagtggg 120
atgcttcccc ctccaccgcc tcccctccct ccaggtggcc ctcctcctcc cccagggcct 180
cctcccttag gggcaatcat gccacctcct ggtgctccaa tgggcctagc actgaagaag 240
aaaagcattc ctcagcccac aaatgccctg aaatccttca actggtctaa actgcccgag 300
<![CDATA[ <210> 620]]>
<![CDATA[ <211> 108]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 620]]>
aacaaactgg aaggaacagt atggaccgaa attgatgata caaaagtctt caaaattcta 60
gatcttgaag acctggaaag aaccttctct gcctatcaaa gacagcag 108
<![CDATA[ <210> 621]]>
<![CDATA[ <211> 107]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 621]]>
aaagaagcag atgccattga tgacactctg agttccaaac ttaaagttaa agagctttcg 60
gtgattgatg gtcggagagc tcagaattgc aacatccttc tatcgag 107
<![CDATA[ <210> 622]]>
<![CDATA[ <211> 85]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 622]]>
gttgaaatta tccaatgacg aaatcaaacg ggcaattcta acaatggacg aacaggaaga 60
tctgcccaag gacatgttgg aacag 85
<![CDATA[ <210> 623]]>
<![CDATA[ <211> 107]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 623]]>
ctcttgaaat ttgttcctga aaaaagtgac attgacctat tggaggaaca taaacacgaa 60
ctggatcgga tggccaaggc tgataggttc ctttttgaga tgagccg 107
<![CDATA[ <210> 6]]>24
<![CDATA[ <211> 89]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 624]]>
aattaatcac tatcagcaaa ggttgcaatc gctgtacttc aaaaagaagt ttgcagagcg 60
tgtggcagaa gtgaaaccta aagtggaag 89
<![CDATA[ <210> 625]]>
<![CDATA[ <211> 169]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 625]]>
caattcgttc tggctcagaa gaggtgttta ggagtggtgc cctcaagcag ttgctggagg 60
tggttttggc atttggaaat tatatgaata aaggtcaaag agggaatgca tatggattca 120
agatatctag cctaaacaaa attgctgaca caaaatccag catcgacaa 169
<![CDATA[ <210> 626]]>
<![CDATA[ <211> 108]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 626]]>
aaacattacc cttttgcact atctcatcac tattgtggaa aataagtacc ccagtgttct 60
caatctaaat gaagaattgc gagatattcc tcaagctgcg aaagtaaa 108
<![CDATA[ <210> 627]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 627]]>
catgactgag ctggacaaag aaataagtac cttgagaagt ggcttgaaag cagtagagac 60
a 61
<![CDATA[ <210> 628]]>
<![CDATA[ <211> 132]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 628]]>
gagctggaat atcagaagtc tcagccccca cagcccggag ataagtttgt gtctgttgtc 60
agccagttca tcacagtagc cagcttcagc ttctctgatg ttgaagacct tctagcagaa 120
gctaaagacc tg 132
<![CDATA[ <210> 629]]>
<![CDATA[ <211> 171]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 629]]>
tttactaaag cagtgaagca ctttggggaa gaggctggca aaatacaacc agatgagttc 60
tttggcattt ttgatcaatt tcttcaagct gtgtcagaag ccaaacaaga aaacgaaaat 120
atgagaaaga aaaaggagga agaagaacgt cgagctcgca tggaagctca g 171
<![CDATA[ <210> 630]]>
<![CDATA[ <211> 2754]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 630]]>
ctcaaagaac aacgtgaaag ggaacgtaaa atgagaaaag ctaaagagaa tagtgaagaa 60
agcggagagt ttgatgacct tgtttcagct ttacgctcag gagaagtgtt tgacaaagac 120
ctttctaaat tgaaacggaa tcgcaaacgt attaccaacc agatgactga cagcagcaga 180
gagagaccaa tcacaaaact taatttctaa ttttccatga atactttttt ttagaaagct 240
cattagcagc cctctaaagt gactagaacg tttcattaca ctgccttgca atccaaacag 300
tggcaatttt ttccttcatc tgtgagtgaa tgtgtgaacg tgtgtatgta aatgtatgtg 360
tgtatatatt aaaaaatgta tatagatgtc tgagtgttgt ctggagacct atacgtatgg 420
ttaaaaagat ttatgttaat gtatgtgctc caaaaccttt cgtgtatgca ttcacattga 480
gtgtggctca ttttctttcc ccgaacgcca tgactgttca gaagcacaat actatctcct 540
gaaagagata agagacattc cctagattca aaggcaaaac agaagaaaca aacaaacaaa 600
caaaaaaagc ttgcaaaata ttttatggtt tccaagcttg atatccttta aaattatttt 660
cattgatgga actggagttg ttggaaaaac atagatttaa aatgattttt gatagctgac 720
attgtgatgt tgatgtatca catcagtaat aggaccagct ttgaatttct gacattggtg 780
tggggataca gtctgtaaat gtttattgag aacatcttgc acacaatttg aattatgtag 840
aatgtcaatc aagtttttgt atatttaaaa gttggacatc aattttttcc cctgatttca 900
tcaagttatc tctgccaagt gctcttgata atttcttcag atttttggaaaaaaacacta 960
tataaatgca atccatgctt tttttaaaga acaacattgc cagagtatgc ttgttctaac 1020
aatatagata tataaacctt aaaaataata aaatatctca cccaagactt aaaggaagaa 1080
ttctctgaag ggataaagat tactaaaaaaaaaaaaaaaaaaaaaaaatt aatggggtgc 1140
ctttttgtta tagtttctat tttctgtttt gtaggacaag ctgcattttc tgtaaatata 1200
ggtctggact aaaggataca taaagaatgc acaaaatgtc aacatcagca gagatgccca 1260
gatctattta tctctaagta tattgaagt gattgctgtt tatatgttgt cattttaaaa 1320
ttgtgtgtca gtaaagctac ctgtaaaatt tcagtccaaa aaaataaagc tctcagggag 1380
acatgaataa aatcaatgaa cattagaaaa taaaatatag atgcttacca ttaacctacc 1440
aactcttaat atccttaaat tatgtgatat ataaagagga ctgttacttttttacttttt 1500
tttttttttttttttttttttggctttgct ttatttattt gtagttgggg gctaacgttt 1560
tctcttttct ttctattgat cctgttgtgg ttgggtttcc tgtggagaga gtagtttgtc 1620
ctgttgcact agaacattat ttactcacta aattgagttt ttcagtcaat taacaaatat 1680
ttattaagtt cctactatgt accaggcata gtagggctac aatggtaagc aagacagagt 1740
ccctgccccc aaagagctta tattctaatg gggatattaa gggatgaata gaataccaat 1800
gtgtgcactg tacaggaatc atactattta aaaataattt gtataaacta taatgcttag 1860
cacagatggc gagttatctg tgctatgtga aagctgtgaa atagtgctct aagagttgtc 1920
aagagctgtg gttttacatc ttttcctcat tgcaaattta gtgactttct acacactata 1980
tggaaataaa tgactagcaa ataaaacagt cataaataca aagcagaggt tgcactcccc 2040
caatcccgag ttaacccagg tctgcaataa caccatgtta aaggtgcaga tagagacttg 2100
gctcaaaaag gcttggagat gaggaagatt ggaatataat gatggttttg tctgttcctt 2160
aactaaagtg cctctatgta tattcttttc tatttgtagc aggataaatt tggtctggct 2220
cagttttgga actgtatttt gaaaatggct ttgtcttaca gtttaaggaa tagacaggtg 2280
gagggaaagt cacataaagg agcaagtttg tgtagctgtc cctcttgccc cttttaatca 2340
tcctcctttg atatggccat cctggtgggc ctcctttgcc atttccattt ttggtttctt 2400
tccctgaaaa ctgtgtgcag gtaattccat gtgccattgt tgaaaagaaaaaaaaaaaac 2460
aaaaaaaaaa cctacttttt agattggtgc tggtgtaagt agccactttt ctctcttggg 2520
tgtgtatttt aaactttttt tgttttttta aattaatgcc aaaaagaaaa tgcataattt 2580
gtaaacttaa ttatatgtct tatatcttat tagcttagta gttggaacca cttagtcttt 2640
aggtgcaaga ctgttgttag atagtactga gaaaaaaaaa gtatgtgtta tgagactgta 2700
catgtttttt taaaaatagc aatatgcaat aaagagatga attcattggg tgta 2754
<![CDATA[ <210> 631]]>
<![CDATA[ <211> 5890]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of DAAM1]]>
<![CDATA[ <400> 631]]>
gcgagttagt aacctacgag cggctgtgaa ggaaactgtt taaccggatc ccattgtacc 60
cagagtgcag agccgccttt ccagcatgca ggggctgctc agcgtttagt cacatcaaga 120
aatagaacag aattcagcca tggccccaag aaagagaggt ggacgaggta tttcattcat 180
cttttgctgt ttccgaaata atgatcaccc agaaatcacg tatcggctgc gaaatgatag 240
caactttgcg cttcagacca tggaaccagc attgcccatg ccccctgtgg aggagctgga 300
tgtcatgttc agtgaactgg tggatgaact ggacctcaca gacaaacaca gagaagccat 360
gtttgcactt ccagctgaga aaaaatggca aatatactgt agcaagaaaa aggaccagga 420
agaaaacaag ggagctacaa gttggcctga attctacatt gatcagctca attccatggc 480
tgctagaaaa tctctgctgg ctttagagaa ggaagaagaa gaagaaagaa gtaaaactat 540
agagagttta aagacagcac tgaggacaa accaatgagg tttgtaacca gattcatcga 600
cttggatggc ctatcatgta tcctcaactt tctaaagacc atggactacg agacctcaga 660
gtctcgaata catacttctc tcattggctg tataaaggcg ttaatgaaca actctcaagg 720
ccgggctcac gtcctggctc attctgagag tattaatgta attgctcaga gtctgagcac 780
agagaacatt aaaacgaagg tggccgtgct ggaaatcttg ggcgccgtgt gcctggttcc 840
cgggggccac aagaaggttc tgcaggccat gctgcactac cagaagtatg ccagcgaaag 900
gacccgcttt cagacattaa ttaacgactt ggataaaagc actgggcggt atcgagatga 960
agtgagtctc aagactgcca tcatgtccctt cattaatgca gtgctcagcc aaggtgcagg 1020
agtggagagt ttggacttta gacttcatct tcgctatgaa tttctgatgt taggaattca 1080
acctgtaata gataaattaa gggaacacga aaattcaaca ttagataggc atttagactt 1140
ttttgaaatg ctccgaaatg aagatgaact agaatttgcc aaaagatttg aactggttca 1200
catagacaca aaaagtgcaa ctcagatgtt tgagctgacc aggaagaggc tgacacatag 1260
tgaagcttac ccgcatttca tgtccatcct gcaccactgc ctccaaatgc cttacaagag 1320
gagtggcaac actgttcagt actggctact actagataga atttacagc agatagttat 1380
ccagaatgac aaaggacagg accctgactc cacacctttg gaaaacttta atattaagaa 1440
tgtcgtacga atgttggtta atgaaaatga agttaagcag tggaaagaac aagcggaaaa 1500
aatgagaaaa gagcacaatg agctacaaca gaaactggaa aagaaagaac gagaatgtga 1560
tgctaagact caagagaagg aagagatgat gcagacctta aataaaatga aagagaaact 1620
tgaaaaggag actactgagc ataagcaagt caagcagcag gtggcggacc tcacagcaca 1680
gctccatgag ctcagcagga gggccgtctg tgcttcaatc ccaggtggac cctcgcctgg 1740
agcaccagga gggccctttc cttcctctgt gcctggatct ctccttcctc ccccaccacc 1800
cccaccctcta ccaggtggga tgcttccccc tccaccgcct cccctccctc caggtggccc 1860
tcctcctccc ccagggcctc ctcccttagg ggcaatcatg ccacctcctg gtgctccaat 1920
gggcctagca ctgaagaaga aaagcattcc tcagcccaca aatgccctga aatccttcaa 1980
ctggtctaaa ctgcccgaga acaaactgga aggaacagta tggaccgaaa ttgatgatac 2040
aaaagtcttc aaaattctag atcttgaaga cctggaaaga accttctctg cctatcaaag 2100
acagcagaaa gaagcagatg ccattgatga cactctgagt tccaaactta aagttaaaga 2160
gctttcggtg attgatggtc ggagagctca gaattgcaac atccttctat cgaggttgaa 2220
attatccaat gacgaaatca aacgggcaat tctaacaatg gacgaacagg aagatctgcc 2280
caaggacatg ttggaacagc tcttgaaatt tgttcctgaa aaaagtgaca ttgacctatt 2340
ggaggaacat aaacacgaac tggatcggat ggccaaggct gtaggttcc tttttgagat 2400
gagccgaatt aatcactatc agcaaaggtt gcaatcgctg tacttcaaaa agaagtttgc 2460
agagcgtgtg gcagaagtga aacctaaagt ggaagcaatt cgttctggct cagaagaggt 2520
gtttaggagt ggtgccctca agcagttgct ggaggtggtt ttggcatttg gaaattatat 2580
gaataaaggt caaagaggga atgcatatgg attcaagata tctagcctaa acaaaattgc 2640
tgacacaaaa tccagcatcg acaaaaacat tacccttttg cactatctca tcactattgt 2700
ggaaaataag taccccagtg ttctcaatct aaatgaagaa ttgcgagata ttcctcaagc 2760
tgcgaaagta aacatgactg agctggacaa agaaataagt accttgagaa gtggcttgaa 2820
agcagtagag acagagctgg aatatcagaa gtctcagccc ccacagcccg gagataagtt 2880
tgtgtctgtt gtcagccagt tcatcacagt agccagcttc agcttctctg atgttgaaga 2940
ccttctagca gaagctaaag acctgtttac taaagcagtg aagcactttg gggaagaggc 3000
tggcaaaata caaccagatg agttctttgg catttttgat caatttcttc aagctgtgtc 3060
agaagccaaa caagaaaacg aaaatatgag aaagaaaaag gaggaagaag aacgtcgagc 3120
tcgcatggaa gctcagctca aagaacaacg tgaaagggaa cgtaaaatga gaaaagctaa 3180
agagaatagt gaagaaagcg gagagtttga tgaccttgtt tcagctttac gctcaggaga 3240
agtgtttgac aaagaccttt ctaaattgaa acggaatcgc aaacgtatta ccaaccagat 3300
gactgacagc agcagagaga gaccaatcac aaaacttaat ttctaatttt ccatgaatac 3360
ttttttttag aaagctcatt agcagccctc taaagtgact agaacgtttc attacactgc 3420
cttgcaatcc aaacagtggc aattttttcc ttcatctgtg agtgaatgtg tgaacgtgtg 3480
tatgtaaatg tatgtgtgtgta tatattaaaa aatgtatata gatgtctgag tgttgtctgg 3540
agacctatac gtatggttaa aaagatttat gttaatgtat gtgctccaaa acctttcgtg 3600
tatgcattca cattgagtgt ggctcatttt ctttccccga acgccatgac tgttcagaag 3660
cacaatacta tctcctgaaa gagataagag aattcccta gattcaaagg caaaacagaa 3720
gaaacaaaca aacaaacaaa aaaagcttgc aaaatatttt atggtttcca agcttgatat 3780
cctttaaaat tattttcatt gatggaactg gagttgttgg aaaaacatag atttaaaatg 3840
atttttgata gctgacattg tgatgttgat gtatcacatc agtaatagga ccagctttga 3900
atttctgaca ttggtgtggg gatacagtct gtaaatgttt attgagaaca tcttgcacac 3960
aatttgaatt atgtagaatg tcaatcaagt ttttgtatat ttaaaagttg gacatcaatt 4020
ttttcccctg atttcatcaa gttatctctg ccaagtgctc ttgataattt cttcagattt 4080
ttggaaaaaa acactatata aatgcaatcc atgctttttt taaagaacaa cattgccaga 4140
gtatgcttgt tctaacaata tagatatata aaccttaaaa ataataaaat atctcaccca 4200
agacttaaag gaagaattct ctgaagggat aaagattact aaaaaaaaaaaaaaaaaaaa 4260
aaaattaatg gggtgccttt ttgttatagt ttctattttc tgttttgtag gacaagctgc 4320
attttctgta aatataggtc tggactaaag gatacataaa gaatgcacaa aatgtcaaca 4380
tcagcagaga tgccccagatc tattattctc taagtatatt tgaagtgatt gctgtttata 4440
tgttgtcatt ttaaaattgt gtgtcagtaa agctacctgt aaaatttcag tccaaaaaaa 4500
taaagctctc agggagacat gaataaaatc aatgaacatt agaaaataaa atatagatgc 4560
ttaccattaa cctaccaact cttaatatcc ttaaattatg tgatatataa agaggactgt 4620
tactttttta cttttttttttttttttttttttttttggc tttgctttatttatttgtag 4680
ttgggggcta acgttttctc ttttctttct attgatcctg ttgtggttgg gtttcctgtg 4740
gagagagtag tttgtcctgt tgcactagaa catttattac tcactaaatt gagtttttca 4800
gtcaattaac aaatatttat taagttccta ctatgtacca ggcatagtag ggctacaatg 4860
gtaagcaaga cagagtccct gcccccaaag agcttatatt ctaatgggga tattaaggga 4920
tgaatagaat accaatgtgt gcactgtaca ggaatcatac tatttaaaaa taatttgtat 4980
aaactataat gcttagcaca gatggcgagt tatctgtgct atgtgaaagc tgtgaaatag 5040
tgctctaaga gttgtcaaga gctgtggttt tacatctttt cctcattgca aatttagtga 5100
ctttctacac actatatgga aataaatgac tagcaaataa aacagtcata aatacaaagc 5160
agaggttgca ctcccccaat cccgagttaa cccaggtctg caataacaccc atgttaaagg 5220
tgcagataga gacttggctc aaaaaggctt ggagatgagg aagattggaa tataatgatg 5280
gttttgtctg ttccttaact aaagtgcctc tatgtatatt cttttctatt tgtagcagga 5340
taaatttggt ctggctcagt tttggaactg tattttgaaa atggctttgt cttacagttt 5400
aaggaataga caggtggagg gaaagtcaca taaaggagca agtttgtgta gctgtccctc 5460
ttgccccttt taatcatcct cctttgatat ggccatcctg gtgggcctcc tttgccattt 5520
ccatttttgg tttctttccc tgaaaactgt gtgcaggtaa ttccatgtgc cattgttgaa 5580
aagaaaaaaaaaaaacaaaaaaaaaaccta ctttttagat tggtgctggt gtaagtagcc 5640
acttttctct cttgggtgtg tattttaaac tttttttgtt tttttaaatt aatgccaaaa 5700
agaaaatgca taatttgtaa acttaattat atgtcttata tcttattagc ttagtagttg 5760
gaaccactta gtctttaggt gcaagactgt tgttagatag tactgagaaa aaaaaagtat 5820
gtgttatgag actgtacatg tttttttaaa aatagcaata tgcaataaag agatgaattc 5880
attgggtgta 5890
<![CDATA[ <210> 632]]>
<![CDATA[ <211> 1068]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 632]]>
Met Ala Pro Arg Lys Arg Gly Gly Arg Gly Ile Ser Phe Ile Phe Cys
1 5 10 15
Cys Phe Arg Asn Asn Asp His Pro Glu Ile Thr Tyr Arg Leu Arg Asn
20 25 30
Asp Ser Asn Phe Ala Leu Gln Thr Met Glu Pro Ala Leu Pro Met Pro
35 40 45
Pro Val Glu Glu Leu Asp Val Met Phe Ser Glu Leu Val Asp Glu Leu
50 55 60
Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala Glu
65 70 75 80
Lys Lys Trp Gln Ile Tyr Cys Ser Lys Lys Lys Lys Asp Gln Glu Glu Asn
85 90 95
Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr Ile Asp Gln Leu Asn Ser
100 105 110
Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu Glu Glu
115 120 125
Glu Arg Ser Lys Thr Ile Glu Ser Leu Lys Thr Ala Leu Arg Thr Lys
130 135 140
Pro Met Arg Phe Val Thr Arg Phe Ile Asp Leu Asp Gly Leu Ser Cys
145 150 155 160
Ile Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser Arg
165 170 175
Ile His Thr Ser Leu Ile Gly Cys Ile Lys Ala Leu Met Asn Asn Ser
180 185 190
Gln Gly Arg Ala His Val Leu Ala His Ser Glu Ser Ile Asn Val Ile
195 200 205
Ala Gln Ser Leu Ser Thr Glu Asn Ile Lys Thr Lys Val Ala Val Leu
210 215 220
Glu Ile Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys Val
225 230 235 240
Leu Gln Ala Met Leu His Tyr Gln Lys Tyr Ala Ser Glu Arg Thr Arg
245 250 255
Phe Gln Thr Leu Ile Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr Arg
260 265 270
Asp Glu Val Ser Leu Lys Thr Ala Ile Met Ser Phe Ile Asn Ala Val
275 280 285
Leu Ser Gln Gly Ala Gly Val Glu Ser Leu Asp Phe Arg Leu His Leu
290 295 300
Arg Tyr Glu Phe Leu Met Leu Gly Ile Gln Pro Val Ile Asp Lys Leu
305 310 315 320
Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe Glu
325 330 335
Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu Leu
340 345 350
Val His Ile Asp Thr Lys Ser Ala Thr Gln Met Phe Glu Leu Thr Arg
355 360 365
Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser Ile Leu
370 375 380
His His Cys Leu Gln Met Pro Tyr Lys Arg Ser Gly Asn Thr Val Gln
385 390 395 400
Tyr Trp Leu Leu Leu Asp Arg Ile Ile Gln Gln Ile Val Ile Gln Asn
405 410 415
Asp Lys Gly Gln Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn Ile
420 425 430
Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gln Trp
435 440 445
Lys Glu Gln Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gln Gln
450 455 460
Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gln Glu Lys
465 470 475 480
Glu Glu Met Met Gln Thr Leu Asn Lys Met Lys Glu Lys Leu Glu Lys
485 490 495
Glu Thr Thr Glu His Lys Gln Val Lys Gln Gln Val Ala Asp Leu Thr
500 505 510
Ala Gln Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser Ile Pro
515 520 525
Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser Val
530 535 540
Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly Gly
545 550 555 560
Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro Pro
565 570 575
Pro Pro Gly Pro Pro Pro Leu Gly Ala Ile Met Pro Pro Pro Gly Ala
580 585 590
Pro Met Gly Leu Ala Leu Lys Lys Lys Ser Ile Pro Gln Pro Thr Asn
595 600 605
Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu Glu
610 615 620
Gly Thr Val Trp Thr Glu Ile Asp Asp Thr Lys Val Phe Lys Ile Leu
625 630 635 640
Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gln Arg Gln Gln
645 650 655
Lys Glu Ala Asp Ala Ile Asp Asp Thr Leu Ser Ser Lys Leu Lys Val
660 665 670
Lys Glu Leu Ser Val Ile Asp Gly Arg Arg Ala Gln Asn Cys Asn Ile
675 680 685
Leu Leu Ser Arg Leu Lys Leu Ser Asn Asp Glu Ile Lys Arg Ala Ile
690 695 700
Leu Thr Met Asp Glu Gln Glu Asp Leu Pro Lys Asp Met Leu Glu Gln
705 710 715 720
Leu Leu Lys Phe Val Pro Glu Lys Ser Asp Ile Asp Leu Leu Glu Glu
725 730 735
His Lys His Glu Leu Asp Arg Met Ala Lys Ala Asp Arg Phe Leu Phe
740 745 750
Glu Met Ser Arg Ile Asn His Tyr Gln Gln Arg Leu Gln Ser Leu Tyr
755 760 765
Phe Lys Lys Lys Phe Ala Glu Arg Val Ala Glu Val Lys Pro Lys Val
770 775 780
Glu Ala Ile Arg Ser Gly Ser Glu Glu Val Phe Arg Ser Gly Ala Leu
785 790 795 800
Lys Gln Leu Leu Glu Val Val Leu Ala Phe Gly Asn Tyr Met Asn Lys
805 810 815
Gly Gln Arg Gly Asn Ala Tyr Gly Phe Lys Ile Ser Ser Leu Asn Lys
820 825 830
Ile Ala Asp Thr Lys Ser Ser Ile Asp Lys Asn Ile Thr Leu Leu His
835 840 845
Tyr Leu Ile Thr Ile Val Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu
850 855 860
Asn Glu Glu Leu Arg Asp Ile Pro Gln Ala Ala Lys Val Asn Met Thr
865 870 875 880
Glu Leu Asp Lys Glu Ile Ser Thr Leu Arg Ser Gly Leu Lys Ala Val
885 890 895
Glu Thr Glu Leu Glu Tyr Gln Lys Ser Gln Pro Pro Gln Pro Gly Asp
900 905 910
Lys Phe Val Ser Val Val Ser Gln Phe Ile Thr Val Ala Ser Phe Ser
915 920 925
Phe Ser Asp Val Glu Asp Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr
930 935 940
Lys Ala Val Lys His Phe Gly Glu Glu Ala Gly Lys Ile Gln Pro Asp
945 950 955 960
Glu Phe Phe Gly Ile Phe Asp Gln Phe Leu Gln Ala Val Ser Glu Ala
965 970 975
Lys Gln Glu Asn Glu Asn Met Arg Lys Lys Lys Glu Glu Glu Glu Arg
980 985 990
Arg Ala Arg Met Glu Ala Gln Leu Lys Glu Gln Arg Glu Arg Glu Arg
995 1000 1005
Lys Met Arg Lys Ala Lys Glu Asn Ser Glu Glu Ser Gly Glu Phe
1010 1015 1020
Asp Asp Leu Val Ser Ala Leu Arg Ser Gly Glu Val Phe Asp Lys
1025 1030 1035
Asp Leu Ser Lys Leu Lys Arg Asn Arg Lys Arg Ile Thr Asn Gln
1040 1045 1050
Met Thr Asp Ser Ser Arg Glu Arg Pro Ile Thr Lys Leu Asn Phe
1055 1060 1065
<![CDATA[ <210> 633]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 633]]>
aaggtctctt cctgcctgag gatgaaaacc tgagggaaaa aggggactgg agccagttca 60
cgctgtggca gcaag 75
<![CDATA[ <210> 634]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 634]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[ <210> 635]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 635]]>
aaggtctctt cctgcctgag gatgaaaacc tgagggaaaa aggggactgg agccagttca 60
cgctgtggca gcaagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[ <210> 636]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> person]]> class
<![CDATA[ <400> 636]]>
Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu Lys Gly Asp Trp
1 5 10 15
Ser Gln Phe Thr Leu Trp Gln Gln Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[ <210> 637]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 637]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaag 139
<![CDATA[ <210> 638]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 638]]>
agacaaagca tggaggacac aagaatggga ggaaaggcgg actctcagga acttcattct 60
tcacgtggtt tatggtgatt gcattgctgg gcgtctggac atctgtagct gtcgtttggt 120
ttgatcttgt tgactatgag gaagttctag 150
<![CDATA[ <210> 639]]>
<![CDATA[ <211> 69]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 639]]>
gaaaactagg aatctatgat gctgatggtg atggagattt tgatgtggat gatgccaaag 60
tttttag 69
<![CDATA[ <210> 640]]>
<![CDATA[ <211> 93]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 640]]>
gacttaaaga gagatctact tcagagccag cagtcccgcc agaagaggct gagccacaca 60
ctgagcccga ggagcaggtt cctgtggagg cag 93
<![CDATA[ <210> 641]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 641]]>
aaccccagaa tatcgaagat gaagcaaaag aacaaattca gtcccttctc catgaaatgg 60
tacacgcaga acatg 75
<![CDATA[ <210> 642]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 642]]>
ttgaggggaga agacttgcaa caagaagatg gacccacagg agaaccacaa caagaggatg 60
atgagtttct tatggcgact gatgtagatg atagatttga gaccctggaa cctgaagtat 120
ctcatgaag 129
<![CDATA[ <210> 643]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 643]]>
aaaccgagca tagttaccac gtggaagaga cag 33
<![CDATA[ <210> 644]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 644]]>
tttcacaaga ctgtaatcag gatatggaag agatgatgtc tgagcaggaa aatccag 57
<![CDATA[ <210> 645]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 645]]>
attccagtga accagtagta gaagatgaaa gattgcacca tgatacag 48
<![CDATA[ <210> 646]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 646]]>
atgatgtaac ataccaagtc tatgaggaac aag 33
<![CDATA[ <210> 647]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 647]]>
cagtatatga acctctagaa aatgaaggga tagaaatcac ag 42
<![CDATA[ <210> 648]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 648]]>
aagtaactgc tccccctgag gataatcctg tagaagattc acaggtaatt gtagaag 57
<![CDATA[ <210> 649]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 649]]>
aagtaagcat ttttcctgtg gaagaacagc aggaagtacc accag 45
<![CDATA[ <210> 650]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 650]]>
aaacaaatag aaaaacagat gatccagaac aaaaagcaaa ag 42
<![CDATA[ <210> 651]]>
<![CDATA[ <211> 86]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 651]]>
ttaagaaaaa gaagcctaaa cttttaaata aatttgataa gactattaaa gctgaacttg 60
atgctgcaga aaaactccgt aaaagg 86
<![CDATA[ <210> 652]]>
<![CDATA[ <211> 87]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 652]]>
ggaaaaattg aggaagcagt gaatgcattt aaagaactag tacgcaaata ccctcagagt 60
ccacgagcaa gatatgggaa ggcgcag 87
<![CDATA[ <210> 653]]>
<![CDATA[ <211> 151]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 653]]>
tgtgaggatg atttggctga gaagaggaga agtaatgagg tgctacgtgg agccatcgag 60
acctaccaag aggtggccag cctacctgat gtccctgcag acctgctgaa gctgagtttg 120
aagcgtcgct cagacaggca acaatttcta g 151
<![CDATA[ <210> 654]]>
<![CDATA[ <211> 137]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 654]]>
gtcatatgag aggttccctg cttaccctgc agagattagt tcaactattt cccaatgata 60
cttccttaaa aaatgacctt ggcgtggggat acctcttgat aggagataat gacaatgcaa 120
agaaagttta tgaagag 137
<![CDATA[ <210> 655]]>
<![CDATA[ <211> 99]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 655]]>
gtgctgagtg tgacacctaa tgatggcttt gctaaagtcc attatggctt catcctgaag 60
gcacagaaca aaattgctga gagcatccca tattaaag 99
<![CDATA[ <210> 656]]>
<![CDATA[ <211> 90]]>
<![CDATA[ <212> ]]> DNA
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 656]]>
gaaggaatag aatccggaga tcctggcact gatgatggga gattttattt ccacctgggg 60
gatgccatgc agagggttgg gaacaaagag 90
<![CDATA[ <210> 657]]>
<![CDATA[ <211> 138]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 657]]>
gcatataagt ggtatgagct tgggcacaag agaggacact ttgcatctgt ctggcaacgc 60
tcactctaca atgtgaatgg actgaaagca cagccttggt ggaccccaaa agaaacgggc 120
tacacagagt tagtaaag 138
<![CDATA[ <210> 658]]>
<![CDATA[ <211> 136]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 658]]>
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 60
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 120
acgctgtggc agcaag 136
<![CDATA[ <210> 659]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 659]]>
gaagaagaaa tgaaaatgcc tgcaaaggag ctcctaaaac ctgtacctta ctagaaaagt 60
tccccgagac aacaggatgc agaagaggac ag 92
<![CDATA[ <210> 660]]>
<![CDATA[ <211> 134]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 660]]>
atcaaatatt ccatcatgca ccccggggact cacgtgtggc cgcacacagg gcccacaaac 60
tgcaggctcc gaatgcacct gggcttggtg attcccaagg aaggctgcaa gattcgatgt 120
gccaacgaga ccaa 134
<![CDATA[ <210> 661]]>
<![CDATA[ <211> 2954]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 661]]>
gacctggggag gaaggcaagg tgctcatctt tgatgactcc tttgagcacg aggtatggca 60
ggatgcctca tctttccggc tgatattcat cgtggatgtg tggcatccgg aactgacacc 120
acagcagaga cgcagccttc cagcaattta gcatgaattc atgcaagctt gggaaactct 180
ggagagaggc tgcctttctg gttccatctc cttgggtgtg aggatagaat ttcgaacacc 240
aagagtcaat tcccttgact tgcagcccga gtaattcaaa gcctcctcct agggtcagaa 300
gacactaaag ggaatatttg cctcgctgca attcatttag gaaacaccct gctgtgtgtc 360
atctcatgac agcactggtc ttctgccagt atttaaggtg aacatttgat agcttctacc 420
ttaccagcca aagatatttt ttccacatag aataggtcta attcaatgta taatgagaac 480
atatgtagaa actgtgaatg gattgcttta gtttgtaatt tttctatgca gttatatttt 540
tctagtgtag ctagactatt ttgtcatcat gtaccactac atttttgttt attttaatga 600
caagctgtat aaatgcttta cttctagcta tttaatggta gcattactgg ggaactcaga 660
cttccctctt ttaattcttc ttagtaaaag atactcatga aaaaagcagt tttattttcc 720
taacaaaaaa gaaagagctc attatgtcag tgtctatgaa ctgtacccat cccaactctc 780
aaatcgtttg gtttttttta tcttgattga gatcctcttc tcactatgct agtggtggag 840
atattgacaa aatcctattt ctttcaaaga ggaacttttc acaccgaaaa aagagcatgg 900
aattatttta tattgttata aaaatcccag atgcaaattt ttttaatgcc aattattaga 960
gcttctgggg aaaaagtata gttcacggaa ataaaactat gttctttcag ggttgggtgg 1020
ataggtggct gctagggtgt ctggctcctg gcggctttgc catccatgag gcaagggctg 1080
ggaacacagt gtctttgcct atggtagatc catgtgaatg tcaggaagcc agctcttcag 1140
tcttggagat gatttctgct acaattctgt agaaagatta aggatggcag agtaaaaggt 1200
taccaagaat gccaggatgt ttttcttggg cgtagggaggt ccagattact ttcctttttg 1260
atgaaagagt ttggaagact gtcccatctc tctggcttga gaaatctctg ccattttaaa 1320
catcactgtg aaatagcaat tattatcatc tgtatttagt tttaacatta cccacaacat 1380
agaaataata ggtaaaaatc gtcttgccta ctcattccaa agatgatcaa gtcattaatc 1440
tagcaaagta ttcatgtatc agattttgta tattttgaat caaagctaac taggaatgtt 1500
agatataaga atgtaatgat attcatgcac tgaattctaa gccaatatga acaaaaatgc 1560
tgcatgaatg gcacatatag gtcaccaaag ttcattcaca ggtagaaaaa acttgtgctt 1620
tcttttccat ctaaaaacaa aaggagactt tctttatctc atttaaagaa cagctctttg 1680
aaattgaaat tgaccctttt tgcttgacct taaggagatt agcttccagt agatgagttt 1740
gcaaaatact tttcctgttc ttttgttttg ctggtattga aaacatccca ctaaatcaga 1800
tgaagaggca tgggaggaaa aatatccaaa ttaattacta aaatcgagaa gagaaggcaa 1860
actcttgaaa agtaaaaagg tgtttgtgac cttcagtatt tattgaacag aggaaataac 1920
tgacaagggc aatacaattc aatgttcatg tagtaacatt catgtcactt gttgaatttg 1980
gttctcatat gtatattgca tacacataaa ttcaaactat aagtcgtcat ttttgagcca 2040
tcatcttaca ttcatgtaat gaaattatgg aagagagtaa aaactagctc ttaacttagt 2100
aaatataata tggtatttaa aatcaggtca ctacagtaag gttctaagta ttgccaattg 2160
aaaagctaga aatggtatta ctgttgcaaa gtgttgtcaa taattgactc caatagcatt 2220
gtaaatactt gtatcccaca actattttaa acccaagcaa taaaatggat tttctaattc 2280
acttcaattc tttatttctc ttacctatct atgtcttggt acataagaaa agtgtttcca 2340
tcactgcatt ggtagaatta tttcagtgtt attatttttg tgatttcgta tgtctacaca 2400
aaaatgaatt agtttaattt atattgtgat acagttgttt gagaaatatt tttaattctg 2460
tttcaacttt atttctccaa gtgtataata taaaaattac ttctgttatt gttctctacc 2520
aatagggggaa aaaattaaaa cattagatcc attgagaaag agatgatgta ataaataatt 2580
aagattagta aataatattat caggggtgat tatgaccagt tgaataatct ctttcccttg 2640
aattatttag ctaacaaatt aactctccca aatatttaaa ataatgtaaa atcatatttt 2700
actgcccatt attaactaaa atatttttgt ttgactttga gcaccaactg gtaatactaa 2760
taaataccca tgtcatgcag atggctgggc gaataagaga tgtctaaaaa tatgcactgg 2820
tcttggaaaa catggcacaa gtaaggatat catatatgat gtctgtttat tttatgtctg 2880
atttcttttg aatgagtagt tggggactcc atttctaagg agactaggta aataaaatga 2940
cctttgacat ttca 2954
<![CDATA[ <210> 662]]>
<![CDATA[ <211> 5116]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Complete mRNA polynucleotide sequence of ASPH]]>
<![CDATA[ <400> 662]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaaga gacaaagcat ggaggacaca agaatggggag gaaaggcgga 180
ctctcaggaa cttcattctt cacgtggttt atggtgattg cattgctggg cgtctggaca 240
tctgtagctg tcgtttggtt tgatcttgtt gactatgagg aagttctagg aaaactagga 300
atctatgatg ctgatggtga tggagatttt gatgtggatg atgccaaagt tttattagga 360
cttaaagaga gatctacttc agagccagca gtcccgccag aagaggctga gccacacact 420
gagcccgagg agcaggttcc tgtggaggca gaacccccaga atatcgaaga tgaagcaaaa 480
gaacaaattc agtcccttct ccatgaaatg gtacacgcag aacatgttga gggagaagac 540
ttgcaacaag aagatggacc cacaggagaa ccacaacaag aggatgatga gtttcttatg 600
gcgactgatg tagatgatag atttgagacc ctggaacctg aagtatctca tgaagaaacc 660
gagcatagtt accacgtgga agagacagtt tcacaagact gtaatcagga tatggaagag 720
atgatgtctg agcaggaaaa tccagattcc agtgaaccag tagtagaaga tgaaagattg 780
caccatgata cagatgatgt aacataccaa gtctatgagg aacaagcagt atatgaacct 840
ctagaaaatg aagggataga aatcacagaa gtaactgctc cccctgagga taatcctgta 900
gaagattcac aggtaattgt agaagaagta agcatttttc ctgtggaaga acagcaggaa 960
gtaccaccag aaacaaatag aaaaacagat gatccagaac aaaaagcaaa agttaagaaa 1020
aagaagccta aacttttaaa taaatttgat aagactatta aagctgaact tgatgctgca 1080
gaaaaactcc gtaaaagggg aaaaattgag gaagcagtga atgcatttaa agaactagta 1140
cgcaaatacc ctcagagtcc acgagcaaga tatgggaagg cgcagtgtga ggatgatttg 1200
gctgagaaga ggagaagtaa tgaggtgcta cgtggagcca tcgagaccta ccaagaggtg 1260
gccagcctac ctgatgtccc tgcagacctg ctgaagctga gtttgaagcg tcgctcagac 1320
aggcaacaat ttctaggtca tatgagaggt tccctgctta ccctgcagag attagttcaa 1380
ctatttccca atgatacttc cttaaaaaat gaccttggcg tgggatacct cttgatagga 1440
gataatgaca atgcaaagaa agtttatgaa gaggtgctga gtgtgacacc taatgatggc 1500
tttgctaaag tccattatgg cttcatcctg aaggcacaga acaaaattgc tgagagcatc 1560
ccatatttaa aggaaggaat agaatccgga gatcctggca ctgatgatgg gagattttat 1620
ttccacctgg gggatgccat gcagagggtt gggaacaaag aggcatataa gtggtatgag 1680
cttgggcaca agagaggaca ctttgcatct gtctggcaac gctcactcta caatgtgaat 1740
ggactgaaag cacagccttg gtggaccccca aaagaaacgg gctacacaga gttagtaaag 1800
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 1860
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 1920
acgctgtggc agcaaggaag aagaaatgaa aatgcctgca aaggagctcc taaaacctgt 1980
accttactag aaaagttccc cgagacaaca ggatgcagaa gaggacagat caaatattcc 2040
atcatgcacc ccgggactca cgtgtggccg cacacagggc ccacaaactg caggctccga 2100
atgcacctgg gcttggtgat tcccaaggaa ggctgcaaga ttcgatgtgc caacgagacc 2160
aagacctggg aggaaggcaa ggtgctcatc tttgatgact cctttgagca cgaggtatgg 2220
caggatgcct catctttccg gctgatattc atcgtggatg tgtggcatcc ggaactgaca 2280
ccacagcaga gacgcagcct tccagcaatt tagcatgaat tcatgcaagc ttgggaaact 2340
ctggagagag gctgcctttc tggttccatc tccttgggtg tgaggataga atttcgaaca 2400
ccaagagtca attcccttga cttgcagccc gagtaattca aagcctcctc ctagggtcag 2460
aagacactaa agggaatatt tgcctcgctg caattcatt aggaaacacc ctgctgtgtg 2520
tcatctcatg atagcactgg tcttctgcca gtatttaagg tgaacatttg atagcttcta 2580
ccttaccagc caaagatatt ttttccacat agaataggtc taattcaatg tataatgaga 2640
acatatgtag aaactgtgaa tggattgctt tagtttgtaa tttttctatg cagttatatt 2700
tttctagtgt agctagacta ttttgtcatc atgtaccact aatttttgt ttaattttaat 2760
gacaagctgt ataaatgctt tacttctagc tatttaatgg tagcattact ggggaactca 2820
gacttccctc ttttaattct tcttagtaaa agatactcat gaaaaaagca gttttatttt 2880
cctaacaaaa aagaaagagc tcatttatgtc agtgtctatg aactgtaccc atcccaactc 2940
tcaaatcgtt tggttttttt tatcttgatt gagatcctct tctcactatg ctagtggtgg 3000
agatattgac aaaatcctat ttctttcaaa gaggaacttt tcacaccgaa aaaagagcat 3060
ggaattattt tatattgtta taaaaatccc agatgcaaat ttttttaatg ccaattatta 3120
gagcttctgg ggaaaaagta tagttcacgg aaataaaact atgttctttc agggttgggt 3180
ggataggtgg ctgctagggt gtctggctcc tggcggcttt gccatccatg aggcaagggc 3240
tgggaacaca gtgtctttgc ctatggtaga tccatgtgaa tgtcaggaag ccagctcttc 3300
agtcttggag atgatttctg ctacaattct gtagaaagat taaggatggc agagtaaaag 3360
gttaccaaga atgccaggat gtttttcttg ggcgtaggag gtccagatta ctttcctttt 3420
tgatgaaaga gtttggaaga ctgtcccatc tctctggctt gagaaatctc tgccatttta 3480
aacatcactg tgaaatagca attattatca tctgtattta gttttaacat tacccacaac 3540
atagaaataa taggtaaaaa tcgtcttgcc tactcattcc aaagatgatc aagtcattaa 3600
tctagcaaag tatcatgta tcagattttg tatattttga atcaaagcta actaggaatg 3660
ttagatataa gaatgtaatg atattcatgc actgaattct aagccaatat gaacaaaaat 3720
gctgcatgaa tggcacatat aggtcaccaa agttcattca caggtagaaa aaacttgtgc 3780
tttcttttcc atctaaaaac aaaaggagac tttctttatc tcatttaaag aacagctctt 3840
tgaaattgaa attgaccctt tttgcttgac cttaaggaga ttagcttcca gtagatgagt 3900
ttgcaaaata cttttcctgt tcttttgttt tgctggtatt gaaaacatcc cactaaatca 3960
gatgaagagg catgggagga aaaatatcca aattaattac taaaatcgag aagagaaggc 4020
aaactcttga aaagtaaaaa ggtgtttgtg accttcagta tttatgaac agaggaaata 4080
actgacaagg gcaatacaat tcaatgttca tgtagtaaca ttcatgtcac ttgttgaatt 4140
tggttctcat atgtatattg catacacata aattcaaact ataagtcgtc atttttgagc 4200
catcatctta cattcatgta atgaaattat ggaagagagt aaaaactagc tcttaactta 4260
gtaaatataa tatggtattt aaaatcaggt cactacagta aggttctaag tattgccaat 4320
tgaaaagcta gaaatggtat tactgttgca aagtgttgtc aataattgac tccaatagca 4380
ttgtaaatac ttgtatccca caactatttt aaacccaagc aataaaatgg attttctaat 4440
tcacttcaat tctttatttc tcttacctat ctatgtcttg gtacataaga aaagtgtttc 4500
catcactgca ttggtagaat tatttcagtg ttaattatttt tgtgatttcg tatgtctaca 4560
caaaaatgaa ttagtttaat ttatattgtg atacagttgt ttgagaaata tttttaattc 4620
tgtttcaact ttatttctcc aagtgtataa tataaaaatt acttctgtta ttgttctcta 4680
ccaatagggg aaaaaattaa aacattagat ccattgagaa agagatgatg taataaataa 4740
ttaagattag taataatatt atcaggggtg attatgacca gttgaataat ctctttccct 4800
tgaattattt agctaacaaa ttaactctcc caaatatta aaataatgta aaatcatatt 4860
ttactgccca ttattaacta aaatattttt gtttgacttt gagcaccaac tggtaatact 4920
aataaatacc catgtcatgc agatggctgg gcgaataaga gatgtctaaa aatatgcact 4980
ggtcttggaa aacatggcac aagtaaggat atcatatatg atgtctgttt attttatgtc 5040
tgatttcttt tgaatgagta gttggggact ccatttctaa ggagactagg taaataaaat 5100
gacctttgac atttca 5116
<![CDATA[ <210> 663]]>
<![CDATA[ <211> 5]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 663]]>
Met Ala Glu Asp Lys
1 5
<![CDATA[ <210> 664]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 664]]>
Thr Lys His Gly Gly His Lys Asn Gly Arg Lys Gly Gly Leu Ser Gly
1 5 10 15
Thr Ser Phe Phe Thr Trp Phe Met Val Ile Ala Leu Leu Gly Val Trp
20 25 30
Thr Ser Val Ala Val Val Trp Phe Asp Leu Val Asp Tyr Glu Glu Val
35 40 45
Leu
<![CDATA[ <210> 665]]>
<![CDATA[ <211> 22]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 665]]>
Lys Leu Gly Ile Tyr Asp Ala Asp Gly Asp Gly Asp Phe Asp Val Asp
1 5 10 15
Asp Ala Lys Val Leu Leu
20
<![CDATA[ <210> 666]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 666]]>
Leu Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala
1 5 10 15
Glu Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala
20 25 30
<![CDATA[ <210> 667]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 667]]>
Pro Gln Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu
1 5 10 15
His Glu Met Val His Ala Glu His
20
<![CDATA[ <210> 668]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 668]]>
Glu Gly Glu Asp Leu Gln Gln Glu Asp Gly Pro Thr Gly Glu Pro Gln
1 5 10 15
Gln Glu Asp Asp Glu Phe Leu Met Ala Thr Asp Val Asp Asp Arg Phe
20 25 30
Glu Thr Leu Glu Pro Glu Val Ser His Glu
35 40
<![CDATA[ <210> 669]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 669]]>
Thr Glu His Ser Tyr His Val Glu Glu Thr
1 5 10
<![CDATA[ <210> 670]]>
<![CDATA[ <211> 18]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 670]]>
Ser Gln Asp Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu
1 5 10 15
Asn Pro
<![CDATA[ <210> 671]]>
<![CDATA[ <211> 15]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 671]]>
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr
1 5 10 15
<![CDATA[ <210> 672]]>
<![CDATA[ <211> 10]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 672]]>
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln
1 5 10
<![CDATA[ <210> 673]]>
<![CDATA[ <211> 13]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 673]]>
Val Tyr Glu Pro Leu Glu Asn Glu Gly Ile Glu Ile Thr
1 5 10
<![CDATA[ <210> 674]]>
<![CDATA[ <211> 18]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 674]]>
Val Thr Ala Pro Pro Glu Asp Asn Pro Val Glu Asp Ser Gln Val Ile
1 5 10 15
Val Glu
<![CDATA[ <210> 675]]>
<![CDATA[ <211> 14]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <21]]>3> Human]]>
<br/>
<br/> <![CDATA[ <400>675]]>
<br/>
<br/> <![CDATA[Val Ser Ile Phe Pro Val Glu Glu Gln Gln Glu Val Pro Pro
1 5 10
<![CDATA[ <210> 676]]>
<![CDATA[ <211> 13]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 676]]>
Thr Asn Arg Lys Thr Asp Asp Pro Glu Gln Lys Ala Lys
1 5 10
<![CDATA[ <210> 677]]>
<![CDATA[ <211> 28]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 677]]>
Lys Lys Lys Lys Pro Lys Leu Leu Asn Lys Phe Asp Lys Thr Ile Lys
1 5 10 15
Ala Glu Leu Asp Ala Ala Glu Lys Leu Arg Lys Arg
20 25
<![CDATA[ <210> 678]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 678]]>
Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys Glu Leu Val Arg Lys
1 5 10 15
Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys Ala Gln
20 25
<![CDATA[ <210> 679]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PR]]>T
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 679]]>
Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val Leu Arg
1 5 10 15
Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp Val Pro
20 25 30
Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg Gln Gln
35 40 45
Phe Leu
50
<![CDATA[ <210> 680]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 680]]>
His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg Leu Val Gln Leu Phe
1 5 10 15
Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly Val Gly Tyr Leu Leu
20 25 30
Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr Glu Glu
35 40 45
<![CDATA[ <210> 681]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 681]]>
Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His Tyr Gly
1 5 10 15
Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro Tyr Leu
20 25 30
Lys
<![CDATA[ <210> 682]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 682]]>
Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly Arg Phe Tyr
1 5 10 15
Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys Glu
20 25 30
<![CDATA[ <210> 683]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 683]]>
Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala Ser
1 5 10 15
Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln Pro
20 25 30
Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys
35 40 45
<![CDATA[ <210> 684]]>
<![CDATA[ <211> 45]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <21]]>3> Human]]>
<br/>
<br/> <![CDATA[ <400>684]]>
<br/>
<br/> <![CDATA[Ser Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val
1 5 10 15
Met Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg
20 25 30
Glu Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln
35 40 45
<![CDATA[ <210> 685]]>
<![CDATA[ <211> 30]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 685]]>
Arg Arg Asn Glu Asn Ala Cys Lys Gly Ala Pro Lys Thr Cys Thr Leu
1 5 10 15
Leu Glu Lys Phe Pro Glu Thr Thr Gly Cys Arg Arg Gly Gln
20 25 30
<![CDATA[ <210> 686]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 686]]>
Ile Lys Tyr Ser Ile Met His Pro Gly Thr His Val Trp Pro His Thr
1 5 10 15
Gly Pro Thr Asn Cys Arg Leu Arg Met His Leu Gly Leu Val Ile Pro
20 25 30
Lys Glu Gly Cys Lys Ile Arg Cys Ala Asn Glu Thr
35 40
<![CDATA[ <210> 687]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 687]]>
Thr Trp Glu Glu Gly Lys Val Leu Ile Phe Asp Asp Ser Phe Glu His
1 5 10 15
Glu Val Trp Gln Asp Ala Ser Ser Phe Arg Leu Ile Phe Ile Val Asp
20 25 30
Val Trp His Pro Glu Leu Thr Pro Gln Gln Arg Arg Ser Leu Pro Ala
35 40 45
Ile
<![CDATA[ <210> 688]]>
<![CDATA[ <211> 729]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 688]]>
Met Ala Glu Asp Lys Glu Thr Lys His Gly Gly His Lys Asn Gly Arg
1 5 10 15
Lys Gly Gly Leu Ser Gly Thr Ser Phe Phe Thr Trp Phe Met Val Ile
20 25 30
Ala Leu Leu Gly Val Trp Thr Ser Val Ala Val Val Trp Phe Asp Leu
35 40 45
Val Asp Tyr Glu Glu Val Leu Gly Lys Leu Gly Ile Tyr Asp Ala Asp
50 55 60
Gly Asp Gly Asp Phe Asp Val Asp Asp Ala Lys Val Leu Leu Gly Leu
65 70 75 80
Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala Glu
85 90 95
Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala Glu Pro Gln
100 105 110
Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu His Glu
115 120 125
Met Val His Ala Glu His Val Glu Gly Glu Asp Leu Gln Gln Glu Asp
130 135 140
Gly Pro Thr Gly Glu Pro Gln Gln Glu Asp Asp Glu Phe Leu Met Ala
145 150 155 160
Thr Asp Val Asp Asp Arg Phe Glu Thr Leu Glu Pro Glu Val Ser His
165 170 175
Glu Glu Thr Glu His Ser Tyr His Val Glu Glu Thr Val Ser Gln Asp
180 185 190
Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu Asn Pro Asp
195 200 205
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr Asp
210 215 220
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln Ala Val Tyr Glu Pro Leu
225 230 235 240
Glu Asn Glu Gly Ile Glu Ile Thr Glu Val Thr Ala Pro Pro Glu Asp
245 250 255
Asn Pro Val Glu Asp Ser Gln Val Ile Val Glu Glu Val Ser Ile Phe
260 265 270
Pro Val Glu Glu Gln Gln Glu Val Pro Pro Glu Thr Asn Arg Lys Thr
275 280 285
Asp Asp Pro Glu Gln Lys Ala Lys Val Lys Lys Lys Lys Lys Pro Lys Leu
290 295 300
Leu Asn Lys Phe Asp Lys Thr Ile Lys Ala Glu Leu Asp Ala Ala Glu
305 310 315 320
Lys Leu Arg Lys Arg Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys
325 330 335
Glu Leu Val Arg Lys Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys
340 345 350
Ala Gln Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val
355 360 365
Leu Arg Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp
370 375 380
Val Pro Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg
385 390 395 400
Gln Gln Phe Leu Gly His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg
405 410 415
Leu Val Gln Leu Phe Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly
420 425 430
Val Gly Tyr Leu Leu Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr
435 440 445
Glu Glu Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His
450 455 460
Tyr Gly Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro
465 470 475 480
Tyr Leu Lys Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly
485 490 495
Arg Phe Tyr Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys
500 505 510
Glu Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala
515 520 525
Ser Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln
530 535 540
Pro Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys Ser
545 550 555 560
Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val Met
565 570 575
Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu
580 585 590
Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln Gly Arg Arg Asn
595 600 605
Glu Asn Ala Cys Lys Gly Ala Pro Lys Thr Cys Thr Leu Leu Glu Lys
610 615 620
Phe Pro Glu Thr Thr Gly Cys Arg Arg Gly Gln Ile Lys Tyr Ser Ile
625 630 635 640
Met His Pro Gly Thr His Val Trp Pro His Thr Gly Pro Thr Asn Cys
645 650 655
Arg Leu Arg Met His Leu Gly Leu Val Ile Pro Lys Glu Gly Cys Lys
660 665 670
Ile Arg Cys Ala Asn Glu Thr Lys Thr Trp Glu Glu Gly Lys Val Leu
675 680 685
Ile Phe Asp Asp Ser Phe Glu His Glu Val Trp Gln Asp Ala Ser Ser
690 695 700
Phe Arg Leu Ile Phe Ile Val Asp Val Trp His Pro Glu Leu Thr Pro
705 710 715 720
Gln Gln Arg Arg Ser Leu Pro Ala Ile
725
<![CDATA[ <210> 689]]>
<![CDATA[ <211> 1936]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 689]]>
agtctcaagc tctggtaggc aagtgcatgc agtgtgccta aaacctgcca gcagtacttt 60
tgagtttttt tttttgtttt gttttacttt agcatttatt attcatggat tgaagaaatc 120
aaaatggctg aagataaaga gacaaagcat ggaggacaca agaatggggag gaaaggcgga 180
ctctcaggaa cttcattctt cacgtggttt atggtgattg cattgctggg cgtctggaca 240
tctgtagctg tcgtttggtt tgatcttgtt gactatgagg aagttctagg aaaactagga 300
atctatgatg ctgatggtga tggagatttt gatgtggatg atgccaaagt tttattagga 360
cttaaagaga gatctacttc agagccagca gtcccgccag aagaggctga gccacacact 420
gagcccgagg agcaggttcc tgtggaggca gaacccccaga atatcgaaga tgaagcaaaa 480
gaacaaattc agtcccttct ccatgaaatg gtacacgcag aacatgttga gggagaagac 540
ttgcaacaag aagatggacc cacaggagaa ccacaacaag aggatgatga gtttcttatg 600
gcgactgatg tagatgatag atttgagacc ctggaacctg aagtatctca tgaagaaacc 660
gagcatagtt accacgtgga agagacagtt tcacaagact gtaatcagga tatggaagag 720
atgatgtctg agcaggaaaa tccagattcc agtgaaccag tagtagaaga tgaaagattg 780
caccatgata cagatgatgt aacataccaa gtctatgagg aacaagcagt atatgaacct 840
ctagaaaatg aagggataga aatcacagaa gtaactgctc cccctgagga taatcctgta 900
gaagattcac aggtaattgt agaagaagta agcatttttc ctgtggaaga acagcaggaa 960
gtaccaccag aaacaaatag aaaaacagat gatccagaac aaaaagcaaa agttaagaaa 1020
aagaagccta aacttttaaa taaatttgat aagactatta aagctgaact tgatgctgca 1080
gaaaaactcc gtaaaagggg aaaaattgag gaagcagtga atgcatttaa agaactagta 1140
cgcaaatacc ctcagagtcc acgagcaaga tatgggaagg cgcagtgtga ggatgatttg 1200
gctgagaaga ggagaagtaa tgaggtgcta cgtggagcca tcgagaccta ccaagaggtg 1260
gccagcctac ctgatgtccc tgcagacctg ctgaagctga gtttgaagcg tcgctcagac 1320
aggcaacaat ttctaggtca tatgagaggt tccctgctta ccctgcagag attagttcaa 1380
ctatttccca atgatacttc cttaaaaaat gaccttggcg tgggatacct cttgatagga 1440
gataatgaca atgcaaagaa agtttatgaa gaggtgctga gtgtgacacc taatgatggc 1500
tttgctaaag tccattatgg cttcatcctg aaggcacaga acaaaattgc tgagagcatc 1560
ccatatttaa aggaaggaat agaatccgga gatcctggca ctgatgatgg gagattttat 1620
ttccacctgg gggatgccat gcagagggtt gggaacaaag aggcatataa gtggtatgag 1680
cttgggcaca agagaggaca ctttgcatct gtctggcaac gctcactcta caatgtgaat 1740
ggactgaaag cacagccttg gtggaccccca aaagaaacgg gctacacaga gttagtaaag 1800
tctttagaaa gaaactggaa gttaatccga gatgaaggcc ttgcagtgat ggataaagcc 1860
aaaggtctct tcctgcctga ggatgaaaac ctgagggaaa aaggggactg gagccagttc 1920
acgctgtggc agcaag 1936
<![CDATA[ <210> 690]]>
<![CDATA[ <211> 604]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 690]]>
Met Ala Glu Asp Lys Glu Thr Lys His Gly Gly His Lys Asn Gly Arg
1 5 10 15
Lys Gly Gly Leu Ser Gly Thr Ser Phe Phe Thr Trp Phe Met Val Ile
20 25 30
Ala Leu Leu Gly Val Trp Thr Ser Val Ala Val Val Trp Phe Asp Leu
35 40 45
Val Asp Tyr Glu Glu Val Leu Gly Lys Leu Gly Ile Tyr Asp Ala Asp
50 55 60
Gly Asp Gly Asp Phe Asp Val Asp Asp Ala Lys Val Leu Leu Gly Leu
65 70 75 80
Lys Glu Arg Ser Thr Ser Glu Pro Ala Val Pro Pro Glu Glu Ala Glu
85 90 95
Pro His Thr Glu Pro Glu Glu Gln Val Pro Val Glu Ala Glu Pro Gln
100 105 110
Asn Ile Glu Asp Glu Ala Lys Glu Gln Ile Gln Ser Leu Leu His Glu
115 120 125
Met Val His Ala Glu His Val Glu Gly Glu Asp Leu Gln Gln Glu Asp
130 135 140
Gly Pro Thr Gly Glu Pro Gln Gln Glu Asp Asp Glu Phe Leu Met Ala
145 150 155 160
Thr Asp Val Asp Asp Arg Phe Glu Thr Leu Glu Pro Glu Val Ser His
165 170 175
Glu Glu Thr Glu His Ser Tyr His Val Glu Glu Thr Val Ser Gln Asp
180 185 190
Cys Asn Gln Asp Met Glu Glu Met Met Ser Glu Gln Glu Asn Pro Asp
195 200 205
Ser Ser Glu Pro Val Val Glu Asp Glu Arg Leu His His Asp Thr Asp
210 215 220
Asp Val Thr Tyr Gln Val Tyr Glu Glu Gln Ala Val Tyr Glu Pro Leu
225 230 235 240
Glu Asn Glu Gly Ile Glu Ile Thr Glu Val Thr Ala Pro Pro Glu Asp
245 250 255
Asn Pro Val Glu Asp Ser Gln Val Ile Val Glu Glu Val Ser Ile Phe
260 265 270
Pro Val Glu Glu Gln Gln Glu Val Pro Pro Glu Thr Asn Arg Lys Thr
275 280 285
Asp Asp Pro Glu Gln Lys Ala Lys Val Lys Lys Lys Lys Lys Pro Lys Leu
290 295 300
Leu Asn Lys Phe Asp Lys Thr Ile Lys Ala Glu Leu Asp Ala Ala Glu
305 310 315 320
Lys Leu Arg Lys Arg Gly Lys Ile Glu Glu Ala Val Asn Ala Phe Lys
325 330 335
Glu Leu Val Arg Lys Tyr Pro Gln Ser Pro Arg Ala Arg Tyr Gly Lys
340 345 350
Ala Gln Cys Glu Asp Asp Leu Ala Glu Lys Arg Arg Ser Asn Glu Val
355 360 365
Leu Arg Gly Ala Ile Glu Thr Tyr Gln Glu Val Ala Ser Leu Pro Asp
370 375 380
Val Pro Ala Asp Leu Leu Lys Leu Ser Leu Lys Arg Arg Ser Asp Arg
385 390 395 400
Gln Gln Phe Leu Gly His Met Arg Gly Ser Leu Leu Thr Leu Gln Arg
405 410 415
Leu Val Gln Leu Phe Pro Asn Asp Thr Ser Leu Lys Asn Asp Leu Gly
420 425 430
Val Gly Tyr Leu Leu Ile Gly Asp Asn Asp Asn Ala Lys Lys Val Tyr
435 440 445
Glu Glu Val Leu Ser Val Thr Pro Asn Asp Gly Phe Ala Lys Val His
450 455 460
Tyr Gly Phe Ile Leu Lys Ala Gln Asn Lys Ile Ala Glu Ser Ile Pro
465 470 475 480
Tyr Leu Lys Glu Gly Ile Glu Ser Gly Asp Pro Gly Thr Asp Asp Gly
485 490 495
Arg Phe Tyr Phe His Leu Gly Asp Ala Met Gln Arg Val Gly Asn Lys
500 505 510
Glu Ala Tyr Lys Trp Tyr Glu Leu Gly His Lys Arg Gly His Phe Ala
515 520 525
Ser Val Trp Gln Arg Ser Leu Tyr Asn Val Asn Gly Leu Lys Ala Gln
530 535 540
Pro Trp Trp Thr Pro Lys Glu Thr Gly Tyr Thr Glu Leu Val Lys Ser
545 550 555 560
Leu Glu Arg Asn Trp Lys Leu Ile Arg Asp Glu Gly Leu Ala Val Met
565 570 575
Asp Lys Ala Lys Gly Leu Phe Leu Pro Glu Asp Glu Asn Leu Arg Glu
580 585 590
Lys Gly Asp Trp Ser Gln Phe Thr Leu Trp Gln Gln
595 600
<![CDATA[ <210> 691]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 691]]>
cctcctgtac tccaggctcc acctgcatcg accgtgtggc ctccttctct tgcatgtgcc 60
cagaggggaa ggcag 75
<![CDATA[ <210> 692]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 692]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgtgaatg 75
<![CDATA[ <210> 693]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 693]]>
cctcctgtac tccaggctcc acctgcatcg accgtgtggc ctccttctct tgcatgtgcc 60
cagaggggaa ggcagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[ <210> 694]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 694]]>
Ser Cys Thr Pro Gly Ser Thr Cys Ile Asp Arg Val Ala Ser Phe Ser
1 5 10 15
Cys Met Cys Pro Glu Gly Lys Ala Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[ <210> 695]]>
<![CDATA[ <211> 329]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 695]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatg 329
<![CDATA[ <210> 696]]>
<![CDATA[ <211> 82]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 696]]>
cattgcagtg tcgagatggc tatgaaccct gtgtaaatga aggaatgtgt gttacctacc 60
acaatggcac aggatactgc aa 82
<![CDATA[ <210> 697]]>
<![CDATA[ <211> 260]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 697]]>
atgtccagaa ggcttcttgg gggaatattg tcaacatcga gacccctgtg agaagaaccg 60
ctgccagaat ggtgggactt gtgtggccca ggccatgctg gggaaagcca cgtgccgatg 120
tgcctcaggg tttacaggag aggactgcca gtactcaaca tctcatccat gctttgtgtc 180
tcgaccctgc ctgaatggcg gcacatgcca tatgctcagc cgggatacct atgagtgcac 240
ctgtcaagtc gggtttacag 260
<![CDATA[ <210> 698]]>
<![CDATA[ <211> 336]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 698]]>
gtaaggagtg ccaatggacg gatgcctgcc tgtctcatcc ctgtgcaaat ggaagtacct 60
gtaccactgt ggccaaccag ttctcctgca aatgcctcac aggcttcaca gggcagaaat 120
gtgagactga tgtcaatgag tgtgacattc caggacactg ccagcatggt ggcacctgcc 180
tcaacctgcc tggttcctac cagtgccagt gccctcaggg cttcacaggc cagtactgtg 240
acagcctgta tgtgccctgt gcaccctcac cttgtgtcaa tggaggcacc tgtcggcaga 300
ctggtgactt cacttttgag tgcaactgcc ttccag 336
<![CDATA[ <210> 699]]>
<![CDATA[ <211> 123]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 699]]>
gttttgaagg gagcacctgt gagaggaata ttgatgactg ccctaaccac aggtgtcaga 60
atggaggggt ttgtgtggat ggggtcaaca cttacaactg ccgctgtccc ccacaatgga 120
cag 123
<![CDATA[ <210> 700]]>
<![CDATA[ <211> 234]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 700]]>
gacagttctg cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg 60
ggggcacctg tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg 120
gagatgactg cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca 180
cctgcatcga ccgtgtggcc tccttctctt gcatgtgccc agagggggaag gcag 234
<![CDATA[ <210> 701]]>
<![CDATA[ <211> 156]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 701]]>
gtctcctgtg tcatctggat gatgcatgca tcagcaatcc ttgccacaag ggggcactgt 60
gtgacaccaa ccccctaaat gggcaatata tttgcacctg cccacaaggc tacaaagggg 120
ctgactgcac agaagatgtg gatgaatgtg ccatgg 156
<![CDATA[ <210> 702]]>
<![CDATA[ <211> 9905]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 702]]>
ccaatagcaa tccttgtgag catgcaggaa aatgtgtgaa cacggatggc gccttccact 60
gtgagtgtct gaagggttat gcaggacctc gttgtgagat ggacatcaat gagtgccatt 120
cagacccctg ccagaatgat gctacctgtc tggataagat tggaggcttc acatgtctgt 180
gcatgccagg tttcaaaggt gtgcattgtg aattagaaat aaatgaatgt cagagcaacc 240
cttgtgtgaa caatgggcag tgtgtggata aagtcaatcg tttccagtgc ctgtgtcctc 300
ctggtttcac tgggccagtt tgccagattg atattgatga ctgttccagt actccgtgtc 360
tgaatggggc aaagtgtatc gatcacccga atggctatga atgccagtgt gccacaggtt 420
tcactggtgt gttgtgtgag gagaacattg acaactgtga ccccgatcct tgccaccatg 480
gtcagtgtca ggatggtatt gattcctaca cctgcatctg caatcccggg tacatgggcg 540
ccatctgcag tgaccagatt gatgaatgtt acagcagccc ttgcctgaac gatggtcgct 600
gcattgacct ggtcaatggc taccagtgca actgccagcc aggcacgtca ggggttaatt 660
gtgaaattaa ttttgatgac tgtgcaagta acccttgtat ccatggaatc tgtatggatg 720
gcattaatcg ctacagttgtgtctgctcac caggattcac agggcagaga tgtaacattg 780
acattgatga gtgtgcctcc aatccctgtc gcaagggtgc aacatgtatc aacggtgtga 840
atggtttccg ctgtatatgc cccgagggac cccatcaccc cagctgctac tcacaggtga 900
acgaatgcct gagcaatccc tgcatccatg gaaactgtac tggaggtctc agtggatata 960
agtgtctctg tgatgcaggc tgggttggca tcaactgtga agtggacaaa aatgaatgcc 1020
tttcgaatcc atgccagaat ggaggaactt gtgacaatct ggtgaatgga tacaggtgta 1080
cttgcaagaa gggctttaaa ggctataact gccaggtgaa tattgatgaa tgtgcctcaa 1140
atccatgcct gaaccaagga acctgctttg atgacataag tggctacact tgccactgtg 1200
tgctgccata cacaggcaag aattgtcaga cagtattggc tccctgttcc ccaaaccctt 1260
gtgagaatgc tgctgtttgc aaagagtcac caaattttga gagttatact tgcttgtgtg 1320
ctcctggctg gcaaggtcag cggtgtacca ttgacattga cgagtgtatc tccaagccct 1380
gcatgaacca tggtctctgc cataacaccc agggcagcta catgtgtgaa tgtccaccag 1440
gcttcagtgg tatggactgt gaggaggaca ttgatgactg ccttgccaat ccttgccaga 1500
atggaggttc ctgtatggat ggagtgaata ctttctcctg cctctgcctt ccgggtttca 1560
ctggggataa gtgccagaca gacatgaatg agtgtctgag tgaaccctgt aagaatggag 1620
ggacctgctc tgactacgtc aacagttaca cttgcaagtg ccaggcagga tttgatggag 1680
tccattgtga gaacaacatc aatgagtgca ctgagagctc ctgtttcaat ggtggcacat 1740
gtgttgatgg gattaactcc ttctcttgct tgtgccctgt gggtttcact ggatccttct 1800
gcctccatga gatcaatgaa tgcagctctc atccatgcct gaatgaggga acgtgtgttg 1860
atggcctggg tacctaccgc tgcagctgcc ccctgggcta cactgggaaa aactgtcaga 1920
ccctggtgaa tctctgcagt cggtctccat gtaaaaacaa aggtacttgc gttcagaaaa 1980
aagcagagtc ccagtgccta tgtccatctg gatgggctgg tgcctattgt gacgtgccca 2040
atgtctcttg tgacatagca gcctccagga gaggtgtgct tgttgaacac ttgtgccagc 2100
actcaggtgt ctgcatcaat gctggcaaca cgcattactg tcagtgcccc ctgggctata 2160
ctgggagcta ctgtgaggag caactcgatg agtgtgcgtc caacccctgc cagcacgggg 2220
caacatgcag tgacttcatt ggtggataca gatgcgagtg tgtcccaggc tatcagggtg 2280
tcaactgtga gtatgaagtg gatgagtgcc agaatcagcc ctgccagaat ggaggcacct 2340
gtattgacct tgtgaaccat ttcaagtgct cttgcccacc aggcactcgg ggcctactct 2400
gtgaagagaa cattgatgac tgtgcccggg gtccccattg ccttaatggt ggtcagtgca 2460
tggataggat tggaggctac agttgtcgct gcttgcctgg ctttgctggg gagcgttgtg 2520
agggagacat caacgagtgc ctctccaacc cctgcagctc tgagggcagc ctggactgta 2580
tacagctcac caatgactac ctgtgtgttt gccgtagtgc ctttactggc cggcactgtg 2640
aaaccttcgt cgatgtgtgt ccccagatgc cctgcctgaa tggagggact tgtgctgtgg 2700
ccagtaacat gcctgatggt ttcatttgcc gttgtccccc gggattttcc ggggcaaggt 2760
gccagagcag ctgtggacaa gtgaaatgta ggaaggggga gcagtgtgtg cacaccgcct 2820
ctggaccccg ctgcttctgc cccagtcccc gggactgcga gtcaggctgt gccagtagcc 2880
cctgccagca cgggggcagc tgccaccctc agcgccagcc tccttattac tcctgccagt 2940
gtgccccacc attctcgggt agccgctgtg aactctacac ggcacccccc agcaccccctc 3000
ctgccacctg tctgagccag tattgtgccg acaaagctcg ggatggcgtc tgtgatgagg 3060
cctgcaacag ccatgcctgc cagtgggatgggggtgactg ttctctcacc atggagaacc 3120
cctgggccaa ctgctcctcc ccacttccct gctgggatta tatcaacaac cagtgtgatg 3180
agctgtgcaa cacggtcgag tgcctgtttg acaactttga atgccagggg aacagcaaga 3240
catgcaagta tgacaaatac tgtgcagacc acttcaaaga caaccactgt gaccaggggt 3300
gcaacagtga ggagtgtggt tgggatgggc tggactgtgc tgctgaccaa cctgagaacc 3360
tggcagaagg taccctggtt attgtggtat tgatgccacc tgaacaactg ctccaggatg 3420
ctcgcagctt cttgcgggca ctgggtaccc tgctccacac caacctgcgc attaagcggg 3480
actcccaggg ggaactcatg gtgtacccct attatggtga gaagtcagct gctatgaaga 3540
aacagaggat gacacgcaga tcccttcctg gtgaacaaga acaggaggtg gctggctcta 3600
aagtctttct ggaaattgac aaccgccagt gtgttcaaga ctcagaccac tgcttcaaga 3660
acacggatgc agcagcagct ctcctggcct ctcacgccat acagggggacc ctgtcatacc 3720
ctcttgtgtc tgtcgtcagt gaatccctga ctccagaacg cactcagctc ctctatctcc 3780
ttgctgttgc tgttgtcatc attctgttta ttaattctgct gggggtaatc atggcaaaac 3840
gaaagcgtaa gcatggctct ctctggctgc ctgaaggttt cactcttcgc cgagatgcaa 3900
gcaatcacaa gcgtcgtgag ccagtgggac aggatgctgt ggggctgaaa aatctctcag 3960
tgcaagtctc agaagctaac ctaattggta ctggaacaag tgaacactgg gtcgatgatg 4020
aagggcccca gccaaagaaa gtaaaggctg aagatgaggc cttactctca gaagaagatg 4080
accccattga tcgacggcca tggacacagc agcaccttga agctgcagac atccgtagga 4140
caccatcgct ggctctcacc cctcctcagg cagagcagga ggtggatgtg ttagatgtga 4200
atgtccgtgg cccagatggc tgcaccccat tgatgttggc ttctctccga ggaggcagct 4260
cagatttgag tgatgaagat gaagatgcag aggactcttc tgctaacatc atcacagact 4320
tggtctacca gggtgccagc ctccaggccc agacagaccg gactggtgag atggccctgc 4380
accttgcagc ccgctactca cgggctgatg ctgccaagcg tctcctggat gcaggtgcag 4440
atgccaatgc ccaggacaac atgggccgct gtccactcca tgctgcagtg gcagctgatg 4500
cccaaggtgt cttccagatt ctgattcgca accgagtaac tgatctagat gccaggatga 4560
atgatggtac tacacccctg atcctggctg cccgcctggc tgtggaggga atggtggcag 4620
aactgatcaa ctgccaagcg gatgtgaatg cagtggatga ccatggaaaa tctgctcttc 4680
actgggcagc tgctgtcaat aatgtgggagg caactctttt gttgttgaaa aatggggcca 4740
accgagacat gcaggacaac aaggaagaga cacctctgtt tcttgctgcc cgggaggggga 4800
gctatgaagc agccaagatc ctgttagacc attttgccaa tcgagacatc acagaccata 4860
tggatcgtct tccccgggat gtggctcggg atcgcatgca ccatgacatt gtgcgccttc 4920
tggatgaata caatgtgacc ccaagccctc caggcaccgt gttgacttct gctctctcac 4980
ctgtcatctg tgggcccaac agatctttcc tcagcctgaa gcacacccca atgggcaaga 5040
agtctagacg gcccagtgcc aagagtacca tgcctactag cctccctaac cttgccaagg 5100
aggcaaagga tgccaagggt agtagggagga agaagtctct gagtgagaag gtccaactgt 5160
ctgagagttc agtaacttta tcccctgttg attccctaga atctccctcac acgtatgttt 5220
ccgacaccac atccctctcca atgattacat cccctgggat cttacaggcc tcacccaacc 5280
ctatgttggc cactgccgcc cctcctgccc cagtccatgc ccagcatgca ctatcttttt 5340
ctaaccttca tgaaatgcag cctttggcac atggggccag cactgtgctt ccctcagtga 5400
gccagttgct atcccaccac cacattgtgt ctccaggcag tggcagtgct ggaagcttga 5460
gtaggctcca tccagtccca gtcccagcag attggatgaa ccgcatggag gtgaatgaga 5520
cccagtacaa tgagatgttt ggtatggtcc tggctccagc tgagggcacc catcctggca 5580
tagctcccca gagcaggcca cctgaaggga agcacataac cacccctcgg gagcccttgc 5640
cccccattgt gactttccag ctcatcccta aaggcagtat tgcccaacca gcgggggctc 5700
cccagcctca gtccacctgc cctccagctg ttgcgggccc cctgcccacc atgtaccaga 5760
ttccagaaat ggcccgtttg cccagtgtgg ctttccccac tgccatgatg ccccagcagg 5820
acgggcaggt agctcagacc attctcccag cctatcatcc tttcccagcc tctgtgggca 5880
agtaccccac acccccttca cagcacagtt atgcttcctc aaatgctgct gagcgaacac 5940
ccagtcacag tggtcacctc cagggtgagc atccctacct gacaccatcc ccagagtctc 6000
ctgaccagtg gtcaagttca tcaccccact ctgcttctga ctggtcagat gtgaccacca 6060
gccctacccc tgggggtgct gggagggtc agcggggacc tgggacacac atgtctgagc 6120
caccacacaa caacatgcag gtttatgcgt gagagagtcc acctccagtg tagagacata 6180
actgactttt gtaaatgctg ctgaggaaca aatgaaggtc atccgggaga gaaatgaaga 6240
aatctctgga gccagcttct agaggtagga aagagaagat gttcttattc agataatgca 6300
agagaagcaa ttcgtcagtt tcactgggta tctgcaaggc ttattgatta ttctaatcta 6360
ataagacaag tttgtggaaa tgcaagatga atacaagcct tgggtccatg tttactctct 6420
tctatttgga gaataagatg gatgcttatt gaagccccaga cattcttgca gcttggactg 6480
cattttaagc cctgcaggct tctgccatat ccatgagaag attctacact agcgtcctgt 6540
tgggaattat gccctggaat tctgcctgaa ttgacctacg catctcctcc tccttggaca 6600
ttcttttgtc ttcatttggt gcttttggtt ttgcacctct ccgtgattgt agccctacca 6660
gcatgttata gggcaagacc tttgtgcttt tgatcattct ggcccatgaa agcaactttg 6720
gtctcctttc ccctcctgtc ttcccggtat cccttggagt ctcacaaggt ttactttggt 6780
atggttctca gcacaaacct ttcaagtatg ttgtttcttt ggaaaatgga catactgtat 6840
tgtgttctcc tgcatatatc attcctggag agagaagggg agaagaatac ttttcttcaa 6900
caaattttgg gggcaggaga tcccttcaag aggctgcacc ttaatttttc ttgtctgtgt 6960
gcaggtcttc atataaactt taccaggaag aagggtgtga gtttgttgtt tttctgtgta 7020
tgggcctggt cagtgtaaag ttttatcctt gtagtctag ttactatgac cctccccact 7080
tttttaaaac cagaaaaagg tttggaatgt tggaatgacc aagagacaag ttaactcgtg 7140
caagagccag ttacccaccc acaggtcccc ctacttcctg ccaagcattc cattgactgc 7200
ctgtatggaa cacatttgtc ccagatctga gcattctagg cctgtttcac tcactcaccc 7260
agcatatgaa actagtctta actgttgagc ctttcctttc atatccacag aagacactgt 7320
ctcaaatgtt gtacccttgc catttaggac tgaactttcc ttagcccaag ggacccagtg 7380
acagttgtct tccgtttgtc agatgatcag tctctactga ttatcttgct gcttaaaggc 7440
ctgctcacca atctttcttt cacaccgtgt ggtccgtgtt actggtatac ccagtatgtt 7500
ctcactgaag acatggactt tatatgttca agtgcaggaa ttggaaagtt ggacttgttt 7560
tctatgatcc aaaacagccc tataagaagg ttggaaaagg aggaactata tagcagcctt 7620
tgctattttc tgctaccatt tcttttcctc tgaagcggcc atgacattcc ctttggcaac 7680
taacgtagaa actcaacaga aattttcct ttcctagagt caccttttag atgataatgg 7740
acaactatag acttgctcat tgttcagact gattgcccct cacctgaatc cactctctgt 7800
attcatgctc ttggcaattt ctttgacttt cttttaaggg cagaagcatt ttagttaatt 7860
gtagataaag aatagttttc ttcctcttct ccttgggcca gttaataatt ggtccatggc 7920
tacactgcaa cttccgtcca gtgctgtgat gcccatgaca cctgcaaaat aagttctgcc 7980
tgggcatttt gtagatatta acaggtgaat tcccgactct tttggtttga atgacagttc 8040
tcattccttc tatggctgca agtatgcatc agtgcttccc acttacctga tttgtctgtc 8100
ggtggcccca tatggaaacc ctgcgtgtct gttggcataa tagtttacaa atggtttttt 8160
cagtcctatc caaatttatt gaaccaacaa aaataattac ttctgccctg agataagcag 8220
attaagtttg ttcattctct gctttatctct ctccatgtgg caacattctg tcagcctctt 8280
tcatagtgtg caaacatttt atcattctaa atggtgactc tctgcccttg gacccatta 8340
ttattcacag atggggagaa cctatctgca tggacctctg tggaccacag cgtacctgcc 8400
cctttctgcc ctcctgctcc agccccactt ctgaaagtat cagctactga tccagccact 8460
ggatatttta tatcctccct tttccttaag cacaatgtca gaccaaattg cttgtttctt 8520
tttcttggac tactttaatt tggatccttt gggtttggag aaagggaatg tgaaagctgt 8580
cattacagac aacaggtttc agtgatgagg aggacaacac tgcctttcaa actttttact 8640
gatctcttag attttaagaa ctcttgaatt gtgtggtatc taataaaagg gaaggtaaga 8700
tggataatca ctttctcatt tgggttctga attggagact cagtttttat gagacacatc 8760
ttttatgcca tgtatagatc ctcccctgct atttttggtt tattttatt gttataaatg 8820
ctttctttct ttgactcctc ttctgcctgc ctttggggat aggttttttt gtttgtttat 8880
ttgcttcctc tgttttgttt taagcatcat tttcttatgt gaggtgggga agggaaaggt 8940
atgagggaaa gagagtctga gaattaaaat attttagtat aagcaattgg ctgtgatgct 9000
caaatccatt gcatcctctt attgaatttg ccaatttgta atttttgcat aataaagaac 9060
caaaggtgta atgttttgtt gagaggtggt ttagggattt tggccctaac caatacattg 9120
aatgtatgat gactatttgg gaggacacat ttatgtaccc agaggccccc actaataagt 9180
ggtactatgg ttacttcctt gtgtacattt ctcttaaaag tgatattata tctgtttgta 9240
tgagaaaccc agtaaccaat aaaatgaccg catattcctg actaaacgta gtaaggaaaa 9300
tgcacacttt gtttttactt ttccgtttca ttctaaaggt agttaagatg aaatttatat 9360
gaaagcattt ttatcacaaa ataaaaaagg tttgccaagc tcagtggtgt tgtatttttt 9420
attttccaat actgcatcca tggcctggca gtgttacctc atgatgtcat aatttgctga 9480
gagagcaaat tttcttttct ttctgaatcc cacaaagcct agcaccaaac ttcttttttt 9540
cttcctttaa ttagatcata aataaatgat cctggggaaa aagcatctgt caaataggaa 9600
acatcacaaa actgagcact cttctgtgca ctagccatag ctggtgacaa acagatggtt 9660
gctcagggac aaggtgcctt ccaatggaaa tgcgaagtag ttgctatagc aagaattggg 9720
aactgggata taagtcataa tattaattat gctgttatgt aaatgattgg tttgtaacat 9780
tccttaagtg aaatttgtgt agaacttaat atacaggatt ataaaataat attttgtgta 9840
taaatttgtt ataagttcac attcatacat ttattataa agtcagtgag atatttgaac 9900
atgaa 9905
<![CDATA[ <210> 703]]>
<![CDATA[ <211> 11425]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of NOTCH2]]>
<![CDATA[ <400> 703]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatgc attgcagtgt cgagatggct atgaaccctg 360
tgtaaatgaa ggaatgtgtg ttacctacca caatggcaca ggatactgca aatgtccaga 420
aggcttcttg ggggaatatt gtcaacatcg agaccccctgt gagaagaacc gctgccagaa 480
tggtgggact tgtgtggccc aggccatgct ggggaaagcc acgtgccgat gtgcctcagg 540
gtttacagga gaggactgcc agtactcaac atctcatcca tgctttgtgt ctcgaccctg 600
cctgaatggc ggcacatgcc atatgctcag ccgggatacc tatgagtgca cctgtcaagt 660
cgggtttaca ggtaaggagt gccaatggac ggatgcctgc ctgtctcatc cctgtgcaaa 720
tggaagtacc tgtaccactg tggccaacca gttctcctgc aaatgcctca caggcttcac 780
agggcagaaa tgtgagactg atgtcaatga gtgtgacatt ccaggacact gccagcatgg 840
tggcacctgc ctcaacctgc ctggttccta ccagtgccag tgccctcagg gcttcacagg 900
ccagtactgt gacagcctgt atgtgccctg tgcaccctca ccttgtgtca atggaggcac 960
ctgtcggcag actggtgact tcacttttga gtgcaactgc cttccaggtt ttgaagggag 1020
cacctgtgag aggaatattg atgactgccc taaccacagg tgtcagaatg gaggggtttg 1080
tgtggatggg gtcaacactt acaactgccg ctgtccccca caatggacag gacagttctg 1140
cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg ggggcacctg 1200
tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg gagatgactg 1260
cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca cctgcatcga 1320
ccgtgtggcc tccttctctt gcatgtgccc agagggggaag gcaggtctcc tgtgtcatct 1380
ggatgatgca tgcatcagca atccttgcca caagggggca ctgtgtgaca ccaaccccct 1440
aaatgggcaa tatatttgca cctgcccaca aggctacaaa ggggctgact gcacagaaga 1500
tgtggatgaa tgtgccatgg ccaatagcaa tccttgtgag catgcaggaa aatgtgtgaa 1560
cacggatggc gccttccact gtgagtgtct gaagggttat gcaggacctc gttgtgagat 1620
ggacatcaat gagtgccatt cagacccctg ccagaatgat gctacctgtc tggataagat 1680
tggaggcttc acatgtctgt gcatgccagg tttcaaaggt gtgcattgtg aattagaaat 1740
aaatgaatgt cagagcaacc cttgtgtgaa caatgggcag tgtgtggata aagtcaatcg 1800
tttccagtgc ctgtgtcctc ctggtttcac tgggccagtt tgccagattg atattgatga 1860
ctgttccagt actccgtgtc tgaatggggc aaagtgtatc gatcacccga atggctatga 1920
atgccagtgt gccacaggtt tcactggtgt gttgtgtgag gagaacattg acaactgtga 1980
ccccgatccttgccaccatg gtcagtgtca ggatggtatt gattcctaca cctgcatctg 2040
caatcccggg tacatgggcg ccatctgcag tgaccagatt gatgaatgtt acagcagccc 2100
ttgcctgaac gatggtcgct gcattgacct ggtcaatggc taccagtgca actgccagcc 2160
aggcacgtca ggggttaatt gtgaaattaa ttttgatgac tgtgcaagta acccttgtat 2220
ccatggaatc tgtatggatg gcattaatcg ctacagttgt gtctgctcac caggattcac 2280
agggcagaga tgtaacattg acattgatga gtgtgcctcc aatccctgtc gcaagggtgc 2340
aacatgtatc aacggtgtga atggtttccg ctgtatatgc cccgagggac cccatcaccc 2400
cagctgctac tcacaggtga acgaatgcct gagcaatccc tgcatccatg gaaactgtac 2460
tggaggtctc agtggatata agtgtctctg tgatgcaggc tgggttggca tcaactgtga 2520
agtggacaaa aatgaatgcc tttcgaatcc atgccagaat ggaggaactt gtgacaatct 2580
ggtgaatgga tacaggtgta cttgcaagaa gggctttaaa ggctataact gccaggtgaa 2640
tattgatgaa tgtgcctcaa atccatgcct gaaccaagga acctgctttg atgacataag 2700
tggctacact tgccactgtg tgctgccata cacaggcaag aattgtcaga cagtattggc 2760
tccctgttcc ccaaaccctt gtgagaatgc tgctgtttgc aaagagtcac caaattttga 2820
gagttatact tgcttgtgtg ctcctggctg gcaaggtcag cggtgtacca ttgacattga 2880
cgagtgtatc tccaagccct gcatgaacca tggtctctgc cataacaccc agggcagcta 2940
catgtgtgaa tgtccaccag gcttcagtgg tatggactgt gaggaggaca ttgatgactg 3000
ccttgccaat ccttgccaga atggaggttc ctgtatggat ggagtgaata ctttctcctg 3060
cctctgcctt ccgggtttca ctggggataa gtgccagaca gacatgaatg agtgtctgag 3120
tgaaccctgt aagaatggag ggacctgctc tgactacgtc aacagttaca cttgcaagtg 3180
ccaggcagga tttgatggag tccatgtga gaacaacatc aatgagtgca ctgagagctc 3240
ctgtttcaat ggtggcacat gtgttgatgg gattaactcc ttctcttgct tgtgccctgt 3300
gggtttcact ggatccttct gcctccatga gatcaatgaa tgcagctctc atccatgcct 3360
gaatgaggga acgtgtgttg atggcctggg tacctaccgc tgcagctgcc ccctgggcta 3420
cactgggaaa aactgtcaga ccctggtgaa tctctgcagt cggtctccat gtaaaaacaa 3480
aggtacttgc gttcagaaaa aagcagagtc ccagtgccta tgtccatctg gatgggctgg 3540
tgcctattgt gacgtgccca atgtctcttg tgacatagca gcctccagga gaggtgtgct 3600
tgttgaacac ttgtgccagc actcaggtgt ctgcatcaat gctggcaaca cgcattactg 3660
tcagtgcccc ctgggctata ctgggagcta ctgtgaggag caactcgatg agtgtgcgtc 3720
caacccctgc cagcacgggg caacatgcag tgacttcatt ggtggataca gatgcgagtg 3780
tgtcccaggc tatcagggtg tcaactgtga gtatgaagtg gatgagtgcc agaatcagcc 3840
ctgccagaat ggaggcacct gtattgacct tgtgaaccat ttcaagtgct cttgcccacc 3900
aggcactcgg ggccctactct gtgaagagaa cattgatgac tgtgcccggg gtccccattg 3960
ccttaatggt ggtcagtgca tggataggat tggaggctac agttgtcgct gcttgcctgg 4020
ctttgctggg gagcgttgtg agggagacat caacgagtgc ctctccaacc cctgcagctc 4080
tgagggcagc ctggactgta tacagctcac caatgactac ctgtgtgttt gccgtagtgc 4140
ctttactggc cggcactgtg aaaccttcgt cgatgtgtgt ccccagatgc cctgcctgaa 4200
tggagggact tgtgctgtgg ccagtaacat gcctgatggt ttcatttgcc gttgtccccc 4260
gggattttcc ggggcaaggt gccagagcag ctgtggacaa gtgaaatgta ggaaggggga 4320
gcagtgtgtg cacaccgcct ctggaccccg ctgcttctgc cccagtcccc gggactgcga 4380
gtcaggctgt gccagtagcc cctgccagca cgggggcagc tgccaccctc agcgccagcc 4440
tccttaattac tcctgccagt gtgccccacc attctcgggt agccgctgtg aactctacac 4500
ggcacccccc agcaccccctc ctgccacctg tctgagccag tattgtgccg acaaagctcg 4560
ggatggcgtc tgtgatgagg cctgcaacag ccatgcctgc cagtgggatgggggtgactg 4620
ttctctcacc atggagaacc cctgggccaa ctgctcctcc ccacttccct gctgggatta 4680
tatcaacaac cagtgtgatg agctgtgcaa cacggtcgag tgcctgtttg acaactttga 4740
atgccagggg aacagcaaga catgcaagta tgacaaatac tgtgcagacc acttcaaaga 4800
caaccactgt gaccaggggt gcaacagtga ggagtgtggt tgggatgggc tggactgtgc 4860
tgctgaccaa cctgagaacc tggcagaagg taccctggtt attgtggtat tgatgccacc 4920
tgaacaactg ctccaggatg ctcgcagctt cttgcgggca ctgggtaccc tgctccaacac 4980
caacctgcgc attaagcggg actcccaggg ggaactcatg gtgtacccct attatggtga 5040
gaagtcagct gctatgaaga aacagaggat gacacgcaga tcccttcctg gtgaacaaga 5100
acaggaggtg gctggctcta aagtctttct ggaaattgac aaccgccagt gtgttcaaga 5160
ctcagaccac tgcttcaaga acacggatgc agcagcagct ctcctggcct ctcacgccat 5220
acaggggacc ctgtcatacc ctcttgtgtc tgtcgtcagt gaatccctga ctccagaacg 5280
cactcagctc ctctatctcc ttgctgttgc tgttgtcatc attctgttta ttaattctgct 5340
gggggtaatc atggcaaaac gaaagcgtaa gcatggctct ctctggctgc ctgaaggttt 5400
cactcttcgc cgagatgcaa gcaatcacaa gcgtcgtgag ccagtgggac aggatgctgt 5460
ggggctgaaa aatctctcag tgcaagtctc agaagctaac ctaattggta ctggaacaag 5520
tgaacactgg gtcgatgatg aagggcccca gccaaagaaa gtaaaggctg aagatgaggc 5580
cttactctca gaagaagatg accccattga tcgacggcca tggacacagc agcaccttga 5640
agctgcagac atccgtagga caccatcgct ggctctcacc cctcctcagg cagagcagga 5700
ggtggatgtg ttagatgtga atgtccgtgg cccagatggc tgcaccccat tgatgttggc 5760
ttctctccga ggaggcagct cagatttgag tgatgaagat gaagatgcag aggactcttc 5820
tgctaacatc atcacagact tggtctacca gggtgccagc ctccaggccc agacagaccg 5880
gactggtgag atggccctgc accttgcagc ccgctactca cgggctgatg ctgccaagcg 5940
tctcctggat gcaggtgcag atgccaatgc ccaggacaac atgggccgct gtccactcca 6000
tgctgcagtg gcagctgatg cccaaggtgt cttccagatt ctgattcgca accgagtaac 6060
tgatctagat gccaggatga atgatggtac tacacccctg atcctggctg cccgcctggc 6120
tgtggaggga atggtggcag aactgatcaa ctgccaagcg gatgtgaatg cagtggatga 6180
ccatggaaaa tctgctcttc actgggcagc tgctgtcaat aatgtggagg caactctttt 6240
gttgttgaaa aatggggcca accgagacat gcaggacaac aaggaagaga cacctctgtt 6300
tcttgctgcc cgggaggggga gctatgaagc agccaagatc ctgttagacc attttgccaa 6360
tcgagacatc acagaccata tggatcgtct tccccgggat gtggctcggg atcgcatgca 6420
ccatgacatt gtgcgccttc tggatgaata caatgtgacc ccaagccctc caggcaccgt 6480
gttgacttct gctctctcac ctgtcatctg tgggcccaac agatctttcc tcagcctgaa 6540
gcacacccca atgggcaaga agtctagacg gcccagtgcc aagagtacca tgcctactag 6600
cctccctaac cttgccaagg aggcaaagga tgccaagggt agtagggagga agaagtctct 6660
gagtgagaag gtccaactgt ctgagagttc agtaacttta tcccctgttg attccctaga 6720
atctcctcac acgtatgttt ccgacaccac atcctctcca atgattacat cccctgggat 6780
ccttacaggcc tcacccaacc ctatgttggc cactgccgcc cctcctgccc cagtccatgc 6840
ccagcatgca ctatcttttt ctaaccttca tgaaatgcag cctttggcac atggggccag 6900
cactgtgctt ccctcagtga gccagttgct atcccaccac cacattgtgt ctccaggcag 6960
tggcagtgct ggaagcttga gtaggctcca tccagtccca gtcccagcag attggatgaa 7020
ccgcatggag gtgaatgaga cccagtacaa tgagatgttt ggtatggtcc tggctccagc 7080
tgagggcacc catcctggca tagctcccca gagcaggcca cctgaaggga agcacataac 7140
cacccctcgg gagcccttgc cccccattgt gactttccag ctcatcccta aaggcagtat 7200
tgcccaacca gcgggggctc cccagcctca gtccacctgc cctccagctg ttgcgggccc 7260
cctgcccacc atgtaccaga ttccagaaat ggcccgtttg cccagtgtgg ctttccccac 7320
tgccatgatg ccccagcagg acgggcaggt agctcagacc attctcccag cctatcatcc 7380
tttcccagcc tctgtgggca agtaccccac acccccttca cagcacagtt atgcttcctc 7440
aaatgctgct gagcgaacac ccagtcacag tggtcacctc cagggtgagc atccctacct 7500
gacaccatcc ccagagtctc ctgaccagtg gtcaagttca tcaccccact ctgcttctga 7560
ctggtcagat gtgaccacca gccctacccc tgggggtgct ggaggaggtc agcggggacc 7620
tgggacacac atgtctgagc caccacacaa caacatgcag gtttatgcgt gagagagtcc 7680
acctccagtg tagagacata actgactttt gtaaatgctg ctgaggaaca aatgaaggtc 7740
atccggggaga gaaatgaaga aatctctgga gccagcttct agaggtagga aagagaagat 7800
gttcttattc agataatgca agagaagcaa ttcgtcagtt tcactgggta tctgcaaggc 7860
ttattgatta ttctaatcta ataagacaag tttgtggaaa tgcaagatga atacaagcct 7920
tgggtccatg tttactctct tctatttgga gaataagatg gatgcttatt gaagcccaga 7980
cattcttgca gcttggactg cattttaagc cctgcaggct tctgccatat ccatgagaag 8040
attctacact agcgtcctgt tgggaattat gccctggaat tctgcctgaa ttgacctacg 8100
catctcctcc tccttggaca ttcttttgtc ttcatttggt gcttttggtt ttgcacctct 8160
ccgtgattgt agccctacca gcatgttata gggcaagacc tttgtgcttt tgatcattct 8220
ggcccatgaa agcaactttg gtctcctttc ccctcctgtc ttcccggtat cccttggagt 8280
ctcacaaggt ttactttggt atggttctca gcacaaacct ttcaagtatg ttgtttcttt 8340
ggaaaatgga catactgtat tgtgttctcc tgcatatac attcctggag agagaagggg 8400
agaagaatac ttttcttcaa caaattttgg gggcaggaga tcccttcaag aggctgcacc 8460
ttaatttttc ttgtctgtgt gcaggtcttc atataaactt taccaggaag aagggtgtga 8520
gtttgttgtt tttctgtgta tgggcctggt cagtgtaaag ttttatcctt gtagtctag 8580
ttactatgac cctccccact tttttaaaac cagaaaaagg tttggaatgt tggaatgacc 8640
aagagacaag ttaactcgtg caagagccag ttaccaccc acaggtcccc ctacttcctg 8700
ccaagcattc cattgactgc ctgtatggaa cacatttgtc ccagatctga gcattctagg 8760
cctgtttcac tcactcaccc agcatatgaa actagtctta actgttgagc ctttcctttc 8820
atatccacag aagacactgt ctcaaatgtt gtacccttgc catttaggac tgaactttcc 8880
ttagcccaag ggacccagtg acagttgtct tccgtttgtc agatgatcag tctctactga 8940
ttatcttgct gcttaaaggc ctgctcacca atctttcttt cacaccgtgt ggtccgtgtt 9000
actggtatac ccagtatgtt ctcactgaag acatggactt tatatgttca agtgcaggaa 9060
ttggaaagtt ggacttgttt tctatgatcc aaaacagccc tataagaagg ttggaaaagg 9120
aggaactata tagcagcctt tgctattttc tgctaccatt tcttttcctc tgaagcggcc 9180
atgacattcc ctttggcaac taacgtagaa actcaacaga attttcct ttcctagagt 9240
caccttttag atgataatgg acaactatag acttgctcat tgttcagact gattgcccct 9300
cacctgaatc cactctctgt attcatgctc ttggcaattt ctttgacttt cttttaaggg 9360
cagaagcatt ttagttaatt gtagataaag aatagttttc ttcctcttct ccttgggcca 9420
gttaataatt ggtccatggc tacactgcaa cttccgtcca gtgctgtgat gcccatgaca 9480
cctgcaaaat aagttctgcc tgggcatttt gtagatatta acaggtgaat tcccgactct 9540
tttggtttga atgacagttc tcattccttc tatggctgca agtatgcatc agtgcttccc 9600
acttacctga tttgtctgtc ggtggcccca tatggaaacc ctgcgtgtct gttggcataa 9660
tagtttacaa atggtttttt cagtcctatc caaatttatt gaaccaacaa aaataattac 9720
ttctgccctg agataagcag attaagtttg ttcattctct gctttattct ctccatgtgg 9780
caacattctg tcagcctctt tcatagtgtg caaacatttt atcattctaa atggtgactc 9840
tctgcccttg gacccattatta ttatcacag atggggagaa cctatctgca tggacctctg 9900
tggaccacag cgtacctgcc cctttctgcc ctcctgctcc agccccactt ctgaaagtat 9960
cagctactga tccagccact ggatatttta tatcctccct tttccttaag cacaatgtca 10020
gaccaaattg cttgtttctt tttcttggac tactttaatt tggatccttt gggtttggag 10080
aaagggaatg tgaaagctgt cattacagac aacaggtttc agtgatgagg aggacaacac 10140
tgcctttcaa actttttact gatctcttag attttaagaa ctcttgaatt gtgtggtatc 10200
taataaaagg gaaggtaaga tggataatca ctttctcatt tgggttctga attggagact 10260
cagtttttat gagacacatc ttttatgcca tgtatagatc ctcccctgct atttttggtt 10320
tatttttatt gttataaatg ctttctttct ttgactcctc ttctgcctgc ctttggggat 10380
aggttttttt gtttgtttat ttgcttcctc tgttttgttt taagcatcat tttcttatgt 10440
gaggtgggga agggaaaggt atgagggaaa gagagtctga gaattaaaat attttagtat 10500
aagcaattgg ctgtgatgct caaatccatt gcatcctctt attgaatttg ccaatttgta 10560
atttttgcat aataaagaac caaaggtgta atgttttgtt gagaggtggt ttaggggattt 10620
tggccctaac caatacattg aatgtatgat gactatttgg gaggacacat ttatgtaccc 10680
agaggccccc actaataagt ggtactatgg ttacttcctt gtgtacattt ctcttaaaag 10740
tgatattata tctgtttgta tgagaaaccc agtaaccaat aaaatgaccg catattcctg 10800
actaaacgta gtaaggaaaa tgcacacttt gtttttacttttccgtttca ttctaaaggt 10860
agttaagatg aaatttatat gaaagcattt ttatcacaaa ataaaaagg tttgccaagc 10920
tcagtggtgt tgtatttttt attttccaat actgcatcca tggcctggca gtgttacctc 10980
atgatgtcat aatttgctga gagagcaaat tttcttttct ttctgaatcc cacaaagcct 11040
agcaccaaac ttcttttttt cttcctttaa ttagatcata aataaatgat cctggggaaa 11100
aagcatctgt caaataggaa acatcacaaa actgagcact cttctgtgca ctagccatag 11160
ctggtgacaa acagatggtt gctcagggac aaggtgcctt ccaatggaaa tgcgaagtag 11220
ttgctatagc aagaattggg aactgggata taagtcataa tattaattat gctgttatgt 11280
aaatgattgg tttgtaacat tccttaagtg aaatttgtgt agaacttaat atacaggatt 11340
ataaaataat attttgtgta taaatttgtt ataagttcac attcatacat ttattattataa 11400
agtcagtgag atatttgaac atgaa 11425
<![CDATA[ <210> 704]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 704]]>
Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp
1 5 10 15
Leu Cys Cys Ala Ala Pro Ala His
20
<![CDATA[ <210> 705]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 705]]>
Leu Gln Cys Arg Asp Gly Tyr Glu Pro Cys Val Asn Glu Gly Met Cys
1 5 10 15
Val Thr Tyr His Asn Gly Thr Gly Tyr Cys
20 25
<![CDATA[ <210> 706]]>
<![CDATA[ <211> 86]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 706]]>
Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His Arg Asp Pro Cys
1 5 10 15
Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val Ala Gln Ala Met
20 25 30
Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe Thr Gly Glu Asp
35 40 45
Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser Arg Pro Cys Leu
50 55 60
Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr Tyr Glu Cys Thr
65 70 75 80
Cys Gln Val Gly Phe Thr
85
<![CDATA[ <210> 707]]>
<![CDATA[ <211> 111]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 707]]>
Lys Glu Cys Gln Trp Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn
1 5 10 15
Gly Ser Thr Cys Thr Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu
20 25 30
Thr Gly Phe Thr Gly Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp
35 40 45
Ile Pro Gly His Cys Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly
50 55 60
Ser Tyr Gln Cys Gln Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp
65 70 75 80
Ser Leu Tyr Val Pro Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr
85 90 95
Cys Arg Gln Thr Gly Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro
100 105 110
<![CDATA[ <210> 708]]>
<![CDATA[ <211> 40]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 708]]>
Phe Glu Gly Ser Thr Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His
1 5 10 15
Arg Cys Gln Asn Gly Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn
20 25 30
Cys Arg Cys Pro Pro Gln Trp Thr
35 40
<![CDATA[ <210> 709]]>
<![CDATA[ <211> 77]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 709]]>
Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu Gln Pro Asn Ala
1 5 10 15
Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly Gly Tyr Gly Cys
20 25 30
Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser Glu Asn Ile Asp
35 40 45
Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr Cys Ile Asp Arg
50 55 60
Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys Ala
65 70 75
<![CDATA[ <210> 710]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 710]]>
Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys His Lys
1 5 10 15
Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile Cys Thr
20 25 30
Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val Asp Glu
35 40 45
Cys Ala Met
50
<![CDATA[ <210> 711]]>
<![CDATA[ <211> 2049]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 711]]> Asn Ser Asn Pro Cys Glu His Ala Gly Lys Cys Val Asn Thr Asp Gly 1 5 10 15 Ala Phe His Cys Glu Cys Leu Lys Gly Tyr Ala Gly Pro Arg Cys Glu 20 25 30 Met Asp Ile Asn Glu Cys His Ser Asp Pro Cys Gln Asn Asp Ala Thr 35 40 45 Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys Leu Cys Met Pro Gly Phe 50 55 60 Lys Gly Val His Cys Glu Leu Glu Ile Asn Glu Cys Gln Ser Asn Pro 65 70 75 80 Cys Val Asn Asn Gly Gln Cys Val Asp Lys Val Asn Arg Phe Gln Cys 85 90 95 Leu Cys Pro Pro Gly Phe Thr Gly Pro Val Cys Gln Ile Asp Ile Asp 100 105 110 Asp Cys Ser Ser Ser Thr Pro Cys Leu Asn Gly Ala Lys Cys Ile Asp His 115 120 125 Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr Gly Phe Thr Gly Val Leu 130 135 140 Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro Asp Pro Cys His His Gly 145 150 155 16 0 Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr Cys Ile Cys Asn Pro Gly 165 170 175 Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile Asp Glu Cys Tyr Ser Ser 180 185 190 Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp Leu Val Asn Gly Tyr Gln 195 200 205 Cys Asn Cys Gln Pro Gly Thr Ser Gly Val Asn Cys Glu Ile Asn Phe 210 215 220 Asp Asp Cys Ala Ser Asn Pro Cys Ile His Gly Ile Cys Met Asp Gly 225 230 235 240 Ile As n Arg Tyr Ser Cys Val Cys Ser Pro Gly Phe Thr Gly Gln Arg 245 250 255 Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser Asn Pro Cys Arg Lys Gly 260 265 270 Ala Thr Cys Ile Asn Gly Val Asn Gly Phe Arg Cys Ile Cys Pro Glu 275 280 285 Gly Pro His His Pro Ser Cys Tyr Ser Gln Val Asn Glu Cys Leu Ser 290 295 300 Asn Pro Cys Ile His Gly Asn Cys Thr Gly Gly Leu Ser Gly Tyr Lys 305 310 315 320 Cys Leu Cys Asp Ala Gly Trp Val Gly Ile Asn Cys Glu Val Asp Lys 325 330 335 Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn Gly Gly Thr Cys Asp Asn 340 345 350 Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys Lys Gly Phe Lys Gly Tyr 355 360 3 65 Asn Cys Gln Val Asn Ile Asp Glu Cys Ala Ser Asn Pro Cys Leu Asn 370 375 380 Gln Gly Thr Cys Phe Asp Asp Ile Ser Gly Tyr Thr Cys His Cys Val 385 390 395 400 Leu Pro Tyr Thr Gly Lys Asn Cys Gln Thr Val Leu Ala Pro Cys Ser 405 410 415 Pro Asn Pro Cys Glu Asn Ala Ala Val Cys Lys Glu Ser Pro Asn Phe 420 425 430 Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly Trp Gln Gly Gln Arg Cys 435 440 445 Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys Pro Cys Met Asn His Gly 450 455 460 Leu Cys His Asn Thr Gln Gly Ser Tyr Met Cys Glu Cys Pro Gly 465 470 475 480 Phe Ser Gly Met Asp Cys Glu Glu Asp Ile Asp Asp Cys Leu Ala Asn 485 490 495 Pro Cys Gln Asn Gly Gly Ser Cys Met Asp Gly Val Asn Thr Phe Ser 500 505 510 Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp Lys Cys Gln Thr Asp Met 515 520 525 Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn Gly Gly Thr Cys Ser Asp 530 535 540 Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln Ala Gly Phe Asp Gly Val 545 550 555 560 His Cys Glu Asn Asn Ile Asn Glu Cys Thr Glu Ser Ser Cys Phe Asn 565 57 0 575 Gly Gly Thr Cys Val Asp Gly Ile Asn Ser Phe Ser Cys Leu Cys Pro 580 585 590 Val Gly Phe Thr Gly Ser Phe Cys Leu His Glu Ile Asn Glu Cys Ser 595 600 605 Ser His Pro Cys Leu Asn Glu Gly Thr Cys Val Asp Gly Leu Gly Thr 610 615 620 Tyr Arg Cys Ser Cys Pro Leu Gly Tyr Thr Gly Lys Asn Cys Gln Thr 625 630 635 640 Leu Val Asn Leu Cys Ser Arg Ser Pro Cys Lys Asn Lys Gly Thr Cys 645 650 655 Val Gln Lys L ys Ala Glu Ser Gln Cys Leu Cys Pro Ser Gly Trp Ala 660 665 670 Gly Ala Tyr Cys Asp Val Pro Asn Val Ser Cys Asp Ile Ala Ala Ser 675 680 685 Arg Arg Gly Val Leu Val Glu His Leu Cys Gln His Ser Gly Val Cys 690 695 700 Ile Asn Ala Gly Asn Thr His Tyr Cys Gln Cys Pro Leu Gly Tyr Thr 705 710 715 720 Gly Ser Tyr Cys Glu Glu Gln Leu Asp Glu Cys Ala Ser Asn Pro Cys 725 730 735 Gln His Gly Ala Thr Cys Ser A sp Phe Ile Gly Gly Tyr Arg Cys Glu 740 745 750 Cys Val Pro Gly Tyr Gln Gly Val Asn Cys Glu Tyr Glu Val Asp Glu 755 760 765 Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly Thr Cys Ile Asp Leu Val 770 775 780 Asn His Phe Lys Cys Ser Cys Pro Gly Thr Arg Gly Leu Leu Cys 785 790 795 800 Glu Glu Asn Ile Asp Asp Cys Ala Arg Gly Pro His Cys Leu Asn Gly 805 810 815 Gly Gln Cys Met Asp Arg Ile Gly Gly Tyr Ser Cys Arg Cys Leu Pro 820 825 830 Gly Phe Ala Gly Glu Arg Cys Glu Gly Asp Ile Asn Glu Cys Leu Ser 835 840 845 Asn Pro Cys Ser Ser Glu Gly Ser Leu Asp Cys Ile Gln Leu Thr Asn 850 855 860 Asp Tyr Leu Cys Val Cys Arg Ser Ala Phe Thr Gly Arg His Cys Glu 865 870 875 880 Thr Phe Val Asp Val Cys Pro Gln Met Pro Cys Leu Asn Gly Gly Thr 885 890 895 Cys Ala Val Ala Ser Asn Met Pro Asp Gly Phe Ile Cys Arg Cys Pro 900 905 910 Pro Gly Phe Ser Gly Ala Arg Cys Gln Ser Ser Cys Gly Gly Gln Val Lys 915 920 925 Cys Arg Lys Gly Glu Gln Cys Val His Thr Ala Ser Gly Pro Arg Cys 930 935 940 Phe Cys Pro Ser Pro Arg Asp Cys Glu Ser Gly Cys Ala Ser Ser Pro 945 950 955 960 Cys Gln His Gly Gly Ser Cys His Pro Gln Arg Gln Pro Pro Tyr Tyr 965 970 975 Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly Ser Arg Cys Glu Leu Tyr 980 985 990 Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr Cys Leu Ser Gln Tyr Cys 995 1000 1005 Ala Asp Lys Ala Arg Asp Gly Val Cys Asp Glu Ala Cys Asn Ser 1010 1015 1020 His Ala Cys Gln Trp Asp Gly Gly Asp Cys Ser Leu Thr Met Glu 1025 1030 1035 Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys Trp Asp Tyr 1040 1045 1050 Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu Cys Leu 1055 1060 1065 Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys Lys Tyr 1070 1075 1080 Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys Asp Gln 1085 1090 1095 Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp Cys Ala 1100 1105 1110 Ala A sp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val Ile Val 1115 1120 1125 Val Leu Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg Ser Phe 1130 1135 1140 Leu Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg Ile Lys 1145 1150 1155 Arg Asp Ser Gln Gly Glu Leu Met Val Tyr Pro Tyr Tyr Gly Glu 1160 1165 1170 Lys Ser Ala Ala Met Lys Lys Gln Arg Met Thr Arg Arg Ser Leu 1175 1180 1185 Pro Gly Glu Gln Glu Gln Glu Val Ala Gly Ser Lys Val Phe Leu 11 90 1195 1200 Glu Ile Asp Asn Arg Gln Cys Val Gln Asp Ser Asp His Cys Phe 1205 1210 1215 Lys Asn Thr Asp Ala Ala Ala Ala Leu Leu Ala Ser His Ala Ile 1220 1225 1230 Gln Gly Thr Leu Ser Tyr Pro Leu Val Ser Val Val Ser Glu Ser 1235 1240 1245 Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr Leu Leu Ala Val Ala 1250 1255 1260 Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly Val Ile Met Ala 1265 1270 1275 Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro Glu Gly Phe 1280 1285 1290 Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu Pro Val 1295 1300 1305 Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln Val Ser 1310 1315 1320 Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp Val Asp 1325 1330 1335 Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp Glu Ala 1340 1345 1350 Leu Leu Ser Glu Asp Asp Pro Ile Asp Arg Arg Pro Trp Thr 1355 1360 13 65 Gln Gln His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro Ser Leu 1370 1375 1380 Ala Leu Thr Pro Pro Gln Ala Glu Gln Glu Val Asp Val Leu Asp 1385 1390 1395 Val Asn Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met Leu Ala 1400 1405 1410 Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp Glu Asp 1415 1420 1425 Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu Val Tyr Gln 1430 1435 1440 Gly Ala Ser Leu Gln Ala G ln Thr Asp Arg Thr Gly Glu Met Ala 1445 1450 1455 Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala Lys Arg 1460 1465 1470 Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn Met Gly 1475 1480 1485 Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln Gly Val 1490 1495 1500 Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp Ala Arg 1505 1510 1515 Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg Leu Ala 1520 1525 15 30 Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala Asp Val 1535 1540 1545 Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp Ala Ala 1550 1555 1560 Ala Val Asn Asn Val Glu Ala Thr Leu Leu Leu Leu Lys Asn Gly 1565 1570 1575 Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro Leu Phe 1580 1585 1590 Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile Leu Leu 1595 1600 1605 Asp His Phe Ala Asn Arg Asp Ile Thr Asp His M et Asp Arg Leu 1610 1615 1620 Pro Arg Asp Val Ala Arg Asp Arg Met His His Asp Ile Val Arg 1625 1630 1635 Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly Thr Val 1640 1645 1650 Leu Thr Ser Ala Leu Ser Pro Val Ile Cys Gly Pro Asn Arg Ser 1655 1660 1665 Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys Ser Arg Arg 1670 1675 1680 Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn Leu Ala 1685 1690 1695 Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys Ser Leu 1700 1705 1710 Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Val Thr Leu Ser Pro 1715 1720 1725 Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp Thr Thr 1730 1735 1 740 Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala Ser Pro 1745 1750 1755 Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val His Ala 1760 1765 1770 Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln Pro Leu 1775 1780 1785 Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln Leu Leu 1790 1795 1800 Ser His His His Ile Val Ser Pro Gly Ser Gly Ser Ala Gly Ser 1805 1810 1815 Leu Ser Arg Leu His Pro Val Pro Val Pro A la Asp Trp Met Asn 1820 1825 1830 Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe Gly Met 1835 1840 1845 Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala Pro Gln 1850 1855 1860 Ser Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg Glu Pro 1865 1870 1875 Leu Pro Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly Ser Ile 1880 1885 1890 Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys Pro Pro 1895 1900 1905 Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile Pro Glu Met 1910 1915 1920 Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met Pro Gln 1925 1930 1935 Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr His Pro 1940 194 5 1950 Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser Gln His 1955 1960 1965 Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser His Ser 1970 1975 1980 Gly His Leu Gln Gly Glu His Pro Tyr Leu Thr Pro Ser Pro Glu 1985 1990 1995 Ser Pro Asp Gln Trp Ser Ser Ser Ser Ser Pro His Ser Ala Ser Asp 2000 2005 2010 Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala Gly Gly 2015 2020 2025 Gly Gln Arg Gly Pro Gly Thr H is met Ser Glu Pro Pro His Asn 2030 2035 2040 Asn Met Gln Val Tyr Ala 2045 <![CDATA[ <210> 712]]>
<![CDATA[ <211> 2471]]>
<![CDATA[ <212>]]> PRT
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 712]]> Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp 1 5 10 15 Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr 20 25 30 Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr 35 40 45 Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His 50 55 60 Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val 65 70 75 80 Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe 85 90 95 Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser 100 105 110 Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr 115 120 125 Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp 130 135 140 Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr 145 150 155 1 60 Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly 165 170 175 Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys 180 185 190 Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln 195 200 205 Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro 210 215 220 Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly 225 230 235 240 Asp Phe Th r Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr 245 250 255 Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly 260 265 270 Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro 275 280 285 Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu 290 295 300 Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly 305 310 315 320 Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser 325 330 335 Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr 340 345 350 Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys 355 360 365 Ala Gly Leu Leu Cys His Leu Asp Asp Ala Cys Ile Ser Asn Pro Cys 370 375 380 His Lys Gly Ala Leu Cys Asp Thr Asn Pro Leu Asn Gly Gln Tyr Ile 385 390 395 400 Cys Thr Cys Pro Gln Gly Tyr Lys Gly Ala Asp Cys Thr Glu Asp Val 405 410 415 Asp Glu Cys Ala Met Ala Asn Ser Asn Pro Cys Glu His Ala Gly Lys 420 425 430 Cys Val Asn Thr Asp Gly Ala Phe His Cys Glu Cys Leu Lys Gly Tyr 435 440 445 Ala Gly Pro Arg Cys Glu Met Asp Ile Asn Glu Cys His Ser Asp Pro 450 455 460 Cys Gln Asp Ala Thr Cys Leu Asp Lys Ile Gly Gly Phe Thr Cys 465 470 475 480 Leu Cys Met Pro Gly Phe Lys Gly Val His Cys Glu Leu Glu Ile Asn 485 490 495 GLU CLN SER Asn Pro Cys Val Val ASN GLY GLN CYS Val Val ASP LYS 500 510 Val asn ARG PHE GLN CYS PRO Pro Gly PRY Pro Val 515 520525 CY s GLN ILE ASP ILE ASP ASP CYS Ser Ser Thr Pro Cys Leu Asn Gly 530 535 540 Ala Lys Cys Ile Asp His Pro Asn Gly Tyr Glu Cys Gln Cys Ala Thr 545 550 555 560 Gly Phe Thr Gly Val Leu Cys Glu Glu Asn Ile Asp Asn Cys Asp Pro 565 570 575 A sp Pro Cys His His Gly Gln Cys Gln Asp Gly Ile Asp Ser Tyr Thr 580 585 590 Cys Ile Cys Asn Pro Gly Tyr Met Gly Ala Ile Cys Ser Asp Gln Ile 595 600 605 Asp Glu Cys Tyr Ser Ser Pro Cys Leu Asn Asp Gly Arg Cys Ile Asp 610 615 620 Leu Val Asn Gly Tyr Gln Cys Asn Cys Gln Pro Gly Thr Ser Gly Val 625 630 635 640 Asn Cys Glu Ile Asn Phe Asp Asp Asp Cys Ala Ser Asn Pro Cys Ile His 645 650 655 Gly Ile Cys Met Asp Gly Ile Asn Arg Tyr Ser Cys Val Cys Ser Pro 660 665 670 Gly Phe Thr Gly Gln Arg Cys Asn Ile Asp Ile Asp Glu Cys Ala Ser 675 680 685 Asn Pro Cys Arg Lys Gly Ala Thr Cys Ile Asn Gly Val Asn Gly Phe 690 695 700 Arg Cys Ile Cys Pro Glu Gly Pro His His Pro Ser Cys Tyr Ser Gln 705 710 715 720 Val Asn Glu Cys Leu Ser Asn Pro Cys Ile His Gly Asn Cys Thr Gly 725 730 735 Gly Leu Ser Gly Tyr Lys Cys Leu Cys Asp Ala Gly Trp Val Gly Ile 740 745 750 Asn Cys Glu Val Asp Lys Asn Glu Cys Leu Ser Asn Pro Cys Gln Asn 755 760 765 Gly Gly Thr Cys Asp Asn Leu Val Asn Gly Tyr Arg Cys Thr Cys Lys 770 775 780 Lys Gly PHE LYS GLY TYR Asn Cys Gln Val Asn Ile Asp Glu Cys Ala 785 790 795 800 SER ASN GLN GLN GLN GLN GLS PHE ASP ILE Serle Serle Serle Serle Serite 805 815 TYR Cys His His Cys Cys Cys lu project Cys Gln Thr 820 825 830 Val Leu Ala Pro Cys Ser Pro Asn Pro Cys Glu Asn Ala Ala Val Cys 835 840 845 Lys Glu Ser Pro Asn Phe Glu Ser Tyr Thr Cys Leu Cys Ala Pro Gly 850 855 860 Trp Gln Gly Gln Arg Cys Thr Ile Asp Ile Asp Glu Cys Ile Ser Lys 865 870 875 880 Pro Cys Met Asn His Gly Leu Cys His Asn Thr Gln Gly Ser Tyr Met 885 890 895 Cys Glu Cys Pro Gly Phe Ser Gly Met Asp Cys Glu Glu Asp Ile 900 905 910 Asp Asp Cys Leu Ala Asn Pro Cys Gln Asn Gly Gly Ser Cys Met Asp 915 920 925 Gly Val Asn Thr Phe Ser Cys Leu Cys Leu Pro Gly Phe Thr Gly Asp 930 935 940 Lys Cys Gln Thr Asp Met Asn Glu Cys Leu Ser Glu Pro Cys Lys Asn 945 950 955 960 Gly Gly Thr Cys Ser Asp Tyr Val Asn Ser Tyr Thr Cys Lys Cys Gln 965 970 975 Ala Gly Phe Asp Gly Val His Cys Glu Asn Asn Ile Asn Glu Cys Thr 980 985 99 0 Glu Ser Ser Cys Phe Asn Gly Gly Thr Cys Val Asp Gly Ile Asn Ser 995 1000 1005 Phe Ser Cys Leu Cys Pro Val Gly Phe Thr Gly Ser Phe Cys Leu 1010 1015 1020 His Glu Ile Asn Glu Cys Ser Ser His Pro Cys Leu Asn Glu Gly 1025 1030 1035 Thr Cys Val Asp Gly Leu Gly Thr Tyr Arg Cys Ser Cys Pro Leu 1040 1045 1050 Gly Tyr Thr Gly Lys Asn Cys Gln Thr Leu Val Asn Leu Cys Ser 1055 1060 1065 Arg Ser Pro Cys Lys Asn L ys Gly Thr Cys Val Gln Lys Lys Ala 1070 1075 1080 Glu Ser Gln Cys Leu Cys Pro Ser Gly Trp Ala Gly Ala Tyr Cys 1085 1090 1095 Asp Val Pro Asn Val Ser Cys Asp Ile Ala Ala Ser Arg Arg Gly 1100 1105 1110 Val Leu Val Glu His Leu Cys Gln His Ser Gly Val Cys Ile Asn 1115 1120 1125 Ala Gly Asn Thr His Tyr Cys Gln Cys Pro Leu Gly Tyr Thr Gly 1130 1135 1140 Ser Tyr Cys Glu Glu Gln Leu Asp Glu Cys Ala Ser Asn Pro Cys 1145 115 0 1155 Gln His Gly Ala Thr Cys Ser Asp Phe Ile Gly Gly Tyr Arg Cys 1160 1165 1170 Glu Cys Val Pro Gly Tyr Gln Gly Val Asn Cys Glu Tyr Glu Val 1175 1180 1185 Asp Glu Cys Gln Asn Gln Pro Cys Gln Asn Gly Gly Thr Cys Ile 1190 1195 1200 ASP Leu Val As His PHE LYS CYS Ser Cys Pro Pro Gly THR ARG 1205 Gly Leu Leu Leu GLU GLU Asn Ile ASP CYS ALA ARG GLY PRE GLY GLN CYS MET ASP ARG Ile Gly Gly Tyr 1235 1240 1245 Ser Cys Arg Cys Leu Pro Gly Phe Ala Gly Glu Arg Cys Glu Gly 1250 1255 1260 Asp Ile Asn Glu Cys Leu Ser Asn Pro Cys Ser Ser Ser Glu Gly Ser 1265 1270 1275 Leu Asp Cys Ile Gln Le u Thr Asn Asp Tyr Leu Cys Val Cys Arg 1280 1285 1290 Ser Ala Phe Thr Gly Arg His Cys Glu Thr Phe Val Asp Val Cys 1295 1300 1305 Pro Gln Met Pro Cys Leu Asn Gly Gly Thr Cys Ala Val Ala Ser 1310 1315 1320 Asn M et Pro Asp Gly Phe Ile Cys Arg Cys Pro Gly Phe Ser 1325 1330 1335 Gly Ala Arg Cys Gln Ser Ser Cys Gly Gln Val Lys Cys Arg Lys 1340 1345 1350 Gly Glu Gln Cys Val His Thr Ala Ser Gly Pro Arg Cys Phe Cys 1355 1360 1 365 Pro Ser Pro Arg Asp Cys Glu Ser Gly Cys Ala Ser Ser Pro Cys 1370 1375 1380 Gln His Gly Gly Ser Cys His Pro Gln Arg Gln Pro Pro Tyr Tyr 1385 1390 1395 Ser Cys Gln Cys Ala Pro Pro Phe Ser Gly Ser Arg Cys Glu Leu 1400 1405 1410 Tyr Thr Ala Pro Pro Ser Thr Pro Pro Ala Thr Cys Leu Ser Gln 1415 1420 1425 Tyr Cys Ala Asp Lys Ala Arg Asp Gly Val Cys Asp Glu Ala Cys 1430 1435 1440 Asn Ser His Ala Cys Gln Trp A sp Gly Gly Asp Cys Ser Leu Thr 1445 1450 1455 Met Glu Asn Pro Trp Ala Asn Cys Ser Ser Pro Leu Pro Cys Trp 1460 1465 1470 Asp Tyr Ile Asn Asn Gln Cys Asp Glu Leu Cys Asn Thr Val Glu 1475 1480 1485 Cys Le u Phe Asp Asn Phe Glu Cys Gln Gly Asn Ser Lys Thr Cys 1490 1495 1500 Lys Tyr Asp Lys Tyr Cys Ala Asp His Phe Lys Asp Asn His Cys 1505 1510 1515 Asp Gln Gly Cys Asn Ser Glu Glu Cys Gly Trp Asp Gly Leu Asp 1520 15 25 1530 Cys Ala Ala Asp Gln Pro Glu Asn Leu Ala Glu Gly Thr Leu Val 1535 1540 1545 Ile Val Val Leu Met Pro Pro Glu Gln Leu Leu Gln Asp Ala Arg 1550 1555 1560 Ser Phe Leu Arg Ala Leu Gly Thr Leu Leu His Thr Asn Leu Arg 1565 1570 1575 Ile Lys Arg Asp Ser Gln Gly Glu Leu Met Val Tyr Pro Tyr Tyr 1580 1585 1590 Gly Glu Lys Ser Ala Ala Met Lys Lys Gln Arg Met Thr Arg Arg 1595 1600 1605 Ser Leu Pro Gly Glu Gln Glu Gln Glu Val Ala Gly Ser Lys Val 1610 1615 1620 Phe Leu Glu Ile Asp Asn Arg Gln Cys Val Gln Asp Ser Asp His 1625 1630 1635 Cys Phe Lys Asn Thr Asp Ala Ala Ala Ala Leu Leu Ala Ser His 1640 1645 1650 Ala Ile Gln Gly Thr Leu Ser Tyr Pro Leu Val Ser Val Val Ser 1655 1660 1665 Glu Ser Leu Thr Pro Glu Arg Thr Gln Leu Leu Tyr Leu Leu Ala 1670 1675 1680 Val Ala Val Val Ile Ile Leu Phe Ile Ile Leu Leu Gly Val Ile 1685 1690 1695 Met Ala Lys Arg Lys Arg Lys His Gly Ser Leu Trp Leu Pro Glu 1700 1705 1710 Gly Phe Thr Leu Arg Arg Asp Ala Ser Asn His Lys Arg Arg Glu 1715 1720 1725 Pro Val Gly Gln Asp Ala Val Gly Leu Lys Asn Leu Ser Val Gln 1730 173 5 1740 Val Ser Glu Ala Asn Leu Ile Gly Thr Gly Thr Ser Glu His Trp 1745 1750 1755 Val Asp Asp Glu Gly Pro Gln Pro Lys Lys Val Lys Ala Glu Asp 1760 1765 1770 Glu Ala Leu Leu Ser Glu Glu Asp Asp Pro Ile Asp Arg Arg Pro 1775 1780 1785 Trp Thr Gln Gln His Leu Glu Ala Ala Asp Ile Arg Arg Thr Pro 1790 1795 1800 Ser Leu Ala Leu Thr Pro Pro Gln Ala Glu Gln Glu Val Asp Val 1805 1810 1815 Leu Asp Val Asn Val Arg Gly Pro Asp Gly Cys Thr Pro Leu Met 1820 1825 1830 Leu Ala Ser Leu Arg Gly Gly Ser Ser Asp Leu Ser Asp Glu Asp 1835 1840 1845 Glu Asp Ala Glu Asp Ser Ser Ala Asn Ile Ile Thr Asp Leu Val 1850 1855 1860 Tyr Gln Gly A la Ser Leu Gln Ala Gln Thr Asp Arg Thr Gly Glu 1865 1870 1875 Met Ala Leu His Leu Ala Ala Arg Tyr Ser Arg Ala Asp Ala Ala 1880 1885 1890 Lys Arg Leu Leu Asp Ala Gly Ala Asp Ala Asn Ala Gln Asp Asn 1895 1900 1905 Met Gly Arg Cys Pro Leu His Ala Ala Val Ala Ala Asp Ala Gln 1910 1915 1920 Gly Val Phe Gln Ile Leu Ile Arg Asn Arg Val Thr Asp Leu Asp 1925 1930 1935 Ala Arg Met Asn Asp Gly Thr Thr Pro Leu Ile Leu Ala Ala Arg 1940 1945 1950 Leu Ala Val Glu Gly Met Val Ala Glu Leu Ile Asn Cys Gln Ala 1955 1960 1965 Asp Val Asn Ala Val Asp Asp His Gly Lys Ser Ala Leu His Trp 1970 1975 1980 Ala Ala Ala Val Asn Asn Val Glu Ala Thr Le u Leu Leu Leu Lys 1985 1990 1995 Asn Gly Ala Asn Arg Asp Met Gln Asp Asn Lys Glu Glu Thr Pro 2000 2005 2010 Leu Phe Leu Ala Ala Arg Glu Gly Ser Tyr Glu Ala Ala Lys Ile 2015 2020 2025 Leu Leu Asp His P he Ala Asn Arg Asp Ile Thr Asp His Met Asp 2030 2035 2040 Arg Leu Pro Arg Asp Val Ala Arg Asp Arg Met His His Asp Ile 2045 2050 2055 Val Arg Leu Leu Asp Glu Tyr Asn Val Thr Pro Ser Pro Pro Gly 2060 2065 2070 Thr Val Leu Th r Ser Ala Leu Ser Pro Val Ile Cys Gly Pro Asn 2075 2080 2085 Arg Ser Phe Leu Ser Leu Lys His Thr Pro Met Gly Lys Lys Ser 2090 2095 2100 Arg Arg Pro Ser Ala Lys Ser Thr Met Pro Thr Ser Leu Pro Asn 2105 2110 2115 Le u Ala Lys Glu Ala Lys Asp Ala Lys Gly Ser Arg Arg Lys Lys 2120 2125 2130 Ser Leu Ser Glu Lys Val Gln Leu Ser Glu Ser Ser Ser Val Thr Leu 2135 2140 2145 Ser Pro Val Asp Ser Leu Glu Ser Pro His Thr Tyr Val Ser Asp 2150 2155 2160 Thr Thr Ser Ser Pro Met Ile Thr Ser Pro Gly Ile Leu Gln Ala 2165 2170 2175 Ser Pro Asn Pro Met Leu Ala Thr Ala Ala Pro Pro Ala Pro Val 2180 2185 2190 His Ala Gln His Ala Leu Ser Phe Ser Asn Leu His Glu Met Gln 2195 2200 2205 Pro Leu Ala His Gly Ala Ser Thr Val Leu Pro Ser Val Ser Gln 2210 2215 2220 Leu Leu Ser His His His His Ile Val Ser Pro Gly Ser Gly Ser Ala 2225 2230 2235 Gly Ser Leu Ser Arg Leu His Pro Val Pro Val Pro Ala Asp Trp 2240 2245 2250 Met Asn Arg Met Glu Val Asn Glu Thr Gln Tyr Asn Glu Met Phe 2255 2260 2265 Gly Met Val Leu Ala Pro Ala Glu Gly Thr His Pro Gly Ile Ala 2270 2275 2280 Pro Gln Ser Arg Pro Pro Glu Gly Lys His Ile Thr Thr Pro Arg 2285 2290 2295 Glu Pro Leu Pro Pro Ile Val Thr Phe Gln Leu Ile Pro Lys Gly 2300 2305 2310 Ser Ile Ala Gln Pro Ala Gly Ala Pro Gln Pro Gln Ser Thr Cys 2315 2320 2325 Pro Pro Ala Val Ala Gly Pro Leu Pro Thr Met Tyr Gln Ile Pro 2330 2335 2340 Glu Met Ala Arg Leu Pro Ser Val Ala Phe Pro Thr Ala Met Met 2345 2350 2355 Pro Gln Gln Asp Gly Gln Val Ala Gln Thr Ile Leu Pro Ala Tyr 2360 2365 2370 His Pro Phe Pro Ala Ser Val Gly Lys Tyr Pro Thr Pro Pro Ser 2375 2380 2385 Gln His Ser Tyr Ala Ser Ser Asn Ala Ala Glu Arg Thr Pro Ser 2390 2395 2400 His Ser Gly His Leu Gln G Ly Glu His Pro Tyr Leu Thr Pro Ser 2405 2410 2415 Pro Glu Ser Pro Asp Gln Trp Ser Ser Ser Ser Ser Pro His Ser Ala 2420 2425 2430 Ser Asp Trp Ser Asp Val Thr Thr Ser Pro Thr Pro Gly Gly Ala 2435 2440 2445 Gly Gly Gly Gl n Arg Gly Pro Gly Thr His Met Ser Glu Pro Pro 2450 2455 2460 His Asn Asn Met Gln Val Tyr Ala 2465 2470 <![CDATA[ <210> 713]]>
<![CDATA[ <211> 1364]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> person]]> class
<![CDATA[ <400> 713]]>
aggctgcttc gttgcacacc cgagaaagtt tcagccaaac ttcgggcggc ggctgaggcg 60
gcggccgagg agcggcggac tcggggcgcg gggagtcgag gcatttgcgc ctgggcttcg 120
gagcgtagcg ccagggcctg agcctttgaa gcaggaggag gggaggagag agtggggctc 180
ctctatcggg accccctccc catgtggatc tgcccaggcg gcggcggcgg cggcggagga 240
ggaggcgacc gagaagatgc ccgccctgcg ccccgctctg ctgtgggcgc tgctggcgct 300
ctggctgtgc tgcgcggccc ccgcgcatgc attgcagtgt cgagatggct atgaaccctg 360
tgtaaatgaa ggaatgtgtg ttacctacca caatggcaca ggatactgca aatgtccaga 420
aggcttcttg ggggaatatt gtcaacatcg agaccccctgt gagaagaacc gctgccagaa 480
tggtgggact tgtgtggccc aggccatgct ggggaaagcc acgtgccgat gtgcctcagg 540
gtttacagga gaggactgcc agtactcaac atctcatcca tgctttgtgt ctcgaccctg 600
cctgaatggc ggcacatgcc atatgctcag ccgggatacc tatgagtgca cctgtcaagt 660
cgggtttaca ggtaaggagt gccaatggac ggatgcctgc ctgtctcatc cctgtgcaaa 720
tggaagtacc tgtaccactg tggccaacca gttctcctgc aaatgcctca caggcttcac 780
agggcagaaa tgtgagactg atgtcaatga gtgtgacatt ccaggacact gccagcatgg 840
tggcacctgc ctcaacctgc ctggttccta ccagtgccag tgccctcagg gcttcacagg 900
ccagtactgt gacagcctgt atgtgccctg tgcaccctca ccttgtgtca atggaggcac 960
ctgtcggcag actggtgact tcacttttga gtgcaactgc cttccaggtt ttgaagggag 1020
cacctgtgag aggaatattg atgactgccc taaccacagg tgtcagaatg gaggggtttg 1080
tgtggatggg gtcaacactt acaactgccg ctgtccccca caatggacag gacagttctg 1140
cacagaggat gtggatgaat gcctgctgca gcccaatgcc tgtcaaaatg ggggcacctg 1200
tgccaaccgc aatggaggct atggctgtgt atgtgtcaac ggctggagtg gagatgactg 1260
cagtgagaac attgatgatt gtgccttcgc ctcctgtact ccaggctcca cctgcatcga 1320
ccgtgtggcc tccttctctt gcatgtgccc agagggggaag gcag 1364
<![CDATA[ <210> 714]]>
<![CDATA[ <211> 369]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 714]]>
Met Pro Ala Leu Arg Pro Ala Leu Leu Trp Ala Leu Leu Ala Leu Trp
1 5 10 15
Leu Cys Cys Ala Ala Pro Ala His Ala Leu Gln Cys Arg Asp Gly Tyr
20 25 30
Glu Pro Cys Val Asn Glu Gly Met Cys Val Thr Tyr His Asn Gly Thr
35 40 45
Gly Tyr Cys Lys Cys Pro Glu Gly Phe Leu Gly Glu Tyr Cys Gln His
50 55 60
Arg Asp Pro Cys Glu Lys Asn Arg Cys Gln Asn Gly Gly Thr Cys Val
65 70 75 80
Ala Gln Ala Met Leu Gly Lys Ala Thr Cys Arg Cys Ala Ser Gly Phe
85 90 95
Thr Gly Glu Asp Cys Gln Tyr Ser Thr Ser His Pro Cys Phe Val Ser
100 105 110
Arg Pro Cys Leu Asn Gly Gly Thr Cys His Met Leu Ser Arg Asp Thr
115 120 125
Tyr Glu Cys Thr Cys Gln Val Gly Phe Thr Gly Lys Glu Cys Gln Trp
130 135 140
Thr Asp Ala Cys Leu Ser His Pro Cys Ala Asn Gly Ser Thr Cys Thr
145 150 155 160
Thr Val Ala Asn Gln Phe Ser Cys Lys Cys Leu Thr Gly Phe Thr Gly
165 170 175
Gln Lys Cys Glu Thr Asp Val Asn Glu Cys Asp Ile Pro Gly His Cys
180 185 190
Gln His Gly Gly Thr Cys Leu Asn Leu Pro Gly Ser Tyr Gln Cys Gln
195 200 205
Cys Pro Gln Gly Phe Thr Gly Gln Tyr Cys Asp Ser Leu Tyr Val Pro
210 215 220
Cys Ala Pro Ser Pro Cys Val Asn Gly Gly Thr Cys Arg Gln Thr Gly
225 230 235 240
Asp Phe Thr Phe Glu Cys Asn Cys Leu Pro Gly Phe Glu Gly Ser Thr
245 250 255
Cys Glu Arg Asn Ile Asp Asp Cys Pro Asn His Arg Cys Gln Asn Gly
260 265 270
Gly Val Cys Val Asp Gly Val Asn Thr Tyr Asn Cys Arg Cys Pro Pro
275 280 285
Gln Trp Thr Gly Gln Phe Cys Thr Glu Asp Val Asp Glu Cys Leu Leu
290 295 300
Gln Pro Asn Ala Cys Gln Asn Gly Gly Thr Cys Ala Asn Arg Asn Gly
305 310 315 320
Gly Tyr Gly Cys Val Cys Val Asn Gly Trp Ser Gly Asp Asp Cys Ser
325 330 335
Glu Asn Ile Asp Asp Cys Ala Phe Ala Ser Cys Thr Pro Gly Ser Thr
340 345 350
Cys Ile Asp Arg Val Ala Ser Phe Ser Cys Met Cys Pro Glu Gly Lys
355 360 365
Ala
<![CDATA[ <210> 715]]>
<![CDATA[ <211> 7]]>5
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 715]]>
agggccggct ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca 60
tgaagcttcc caagc 75
<![CDATA[ <210> 716]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 716]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[ <210> 717]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 717]]>
agggccggct ggacaaactg acagtcacct cccagaacct gcagctggag aacctgcgca 60
tgaagcttcc caagcccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[ <210> 718]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 718]]>
Gly Arg Leu Asp Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu
1 5 10 15
Asn Leu Arg Met Lys Leu Pro Lys Pro Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[ <210> 719]]>
<![CDATA[ <211> 286]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 719]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagag 286
<![CDATA[ <210> 720]]>
<![CDATA[ <211> 173]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 720]]>
caagtgcagc cgcggagccc tgtacacagg cttttccatc ctggtgactc tgctcctcgc 60
tggccaggcc accaccgcct acttcctgta ccagcagcag ggccggctgg acaaactgac 120
agtcacctcc cagaacctgc agctggagaa cctgcgcatg aagcttccca agc 173
<![CDATA[ <210> 721]]>
<![CDATA[ <211> 80]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 721]]>
ctcccaagcc tgtgagcaag atgcgcatgg ccaccccgct gctgatgcag gcgctgccca 60
tgggagccct gccccagggg 80
<![CDATA[ <210> 722]]>
<![CDATA[ <211> 63]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 722]]>
cccatgcaga atgccaccaa gtatggcaac atgacagagg accatgtgat gcacctgctc 60
cag 63
<![CDATA[ <210> 723]]>
<![CDATA[ <211> 96]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 723]]>
aatgctgacc ccctgaaggt gtacccgcca ctgaagggga gcttcccgga gaacctgaga 60
caccttaaga acaccatgga gaccatagac tggaag 96
<![CDATA[ <210> 724]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 724]]>
gtctttgaga gctggatgca ccattggctc ctgtttgaaa tgagcaggca ctccttggag 60
caaaagccca ctgacgctcc accgaaag 88
<![CDATA[ <210> 725]]>
<![CDATA[ <211> 192]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 725]]>
tactgaccaa gtgccaggaa gaggtcagcc acatccctgc tgtccacccg ggttcattca 60
ggcccaagtg cgacgagaac ggcaactatc tgccactcca gtgctatggg agcatcggct 120
actgctggtg tgtcttcccc aacggcacgg aggtccccaa caccagaagc cgcgggcacc 180
ataactgcag tg 192
<![CDATA[ <210> 726]]>
<![CDATA[ <211> 63]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 726]]>
agtcactgga actggaggac ccgtcttctg ggctgggtgt gaccaagcag gatctgggcc 60
cag 63
<![CDATA[ <210> 727]]>
<![CDATA[ <211> 612]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 727]]>
tccccatgtg agagcagcag aggcggtctt caacatcctg ccagccccac acagctacag 60
ctttcttgct cccttcagcc cccagcccct cccccatctc ccaccctgta cctcatccca 120
tgagaccctg gtgcctggct ctttcgtcac ccttggacaa gacaaaccaa gtcggaacag 180
cagataacaa tgcagcaagg ccctgctgcc caatctccat ctgtcaacag gggcgtgagg 240
tcccaggaag tggccaaaag ctagacagat ccccgttcct gacatcacag cagcctccaa 300
cacaaggctc caagacctag gctcatggac gagatgggaa ggcacaggga gaagggataa 360
ccctacaccc agaccccagg ctggacatgc tgactgtcct ctcccctcca gcctttggcc 420
ttggcttttc tagcctattt acctgcaggc tgagccactc tcttcccttt ccccagcatc 480
actccccaag gaagagccaa tgttttccac ccataatcct ttctgccgac ccctagttcc 540
ctctgctcag ccaagcttgt tatcagcttt cagggccatg gttcacatta gaataaaagg 600
tagtaattag aa 612
<![CDATA[ <210> 728]]>
<![CDATA[ <211> 1653]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> CD74 full mRNA polynucleotide sequence]]>
<![CDATA[ <400> 728]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagagcaag tgcagccgcg 300
gagccctgta cacaggcttt tccatcctgg tgactctgct cctcgctggc caggccacca 360
ccgcctactt cctgtaccag cagcagggcc ggctggaca actgacagtc acctcccaga 420
acctgcagct ggagaacctg cgcatgaagc ttcccaagcc tcccaagcct gtgagcaaga 480
tgcgcatggc caccccgctg ctgatgcagg cgctgcccat gggagccctg ccccaggggc 540
ccatgcagaa tgccaccaag tatggcaaca tgacagagga ccatgtgatg cacctgctcc 600
agaatgctga ccccctgaag gtgtacccgc cactgaaggg gagcttcccg gagaacctga 660
gacaccttaa gaacaccatg gagaccatag actggaaggt ctttgagagc tggatgcacc 720
attggctcct gtttgaaatg agcaggcact ccttggagca aaagcccact gacgctccac 780
cgaaagtact gaccaagtgc caggaagagg tcagccacat ccctgctgtc cacccgggtt 840
cattcaggcc caagtgcgac gagaacggca actatctgcc actccagtgc tatgggagca 900
tcggctactg ctggtgtgtc ttccccaacg gcacggaggt ccccaacacc agaagccgcg 960
ggcaccataa ctgcagtgag tcactggaac tggaggaccc gtcttctggg ctgggtgtga 1020
ccaagcagga tctgggccca gtccccatgt gagagcagca gaggcggtct tcaacatcct 1080
gccagcccca cacagctaca gctttcttgc tcccttcagc ccccagcccc tcccccatct 1140
cccaccctgt acctcatccc atgagaccct ggtgcctggc tctttcgtca cccttggaca 1200
agacaaacca agtcggaaca gcagataaca atgcagcaag gccctgctgc ccaatctcca 1260
tctgtcaaca ggggcgtgag gtcccaggaa gtggccaaaa gctagacaga tccccgttcc 1320
tgacatcaca gcagcctcca acacaaggct ccaagaccta ggctcatgga cgagatggga 1380
aggcacagggg agaagggata accctacacc cagaccccag gctggacatg ctgactgtcc 1440
tctcccctcc agcctttggc cttggctttt ctagcctatt tacctgcagg ctgagccact 1500
ctcttccctt tccccagcat cactccccaa ggaagagcca atgttttcca cccataatcc 1560
tttctgccga cccctagttc cctctgctca gccaagcttg ttatcagctt tcagggccat 1620
ggttcacatt agaataaaag gtagtaatta gaa 1653
<![CDATA[ <210> 729]]>
<![CDATA[ <211> 41]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 729]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu
35 40
<![CDATA[ <210> 730]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 730]]>
Lys Cys Ser Arg Gly Ala Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr
1 5 10 15
Leu Leu Leu Ala Gly Gln Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln
20 25 30
Gln Gly Arg Leu Asp Lys Leu Thr Val Thr Ser Gln Asn Leu Gln Leu
35 40 45
Glu Asn Leu Arg Met Lys Leu Pro Lys
50 55
<![CDATA[ <210> 731]]>
<![CDATA[ <211> 26]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 731]]>
Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro Leu Leu Met Gln
1 5 10 15
Ala Leu Pro Met Gly Ala Leu Pro Gln Gly
20 25
<![CDATA[ <210> 732]]>
<![CDATA[ <211> 21]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 732]]>
Pro Met Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val
1 5 10 15
Met His Leu Leu Gln
20
<![CDATA[ <210> 733]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 733]]>
Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly Ser Phe Pro
1 5 10 15
Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile Asp Trp Lys
20 25 30
<![CDATA[ <210> 734]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 734]]>
Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu Met Ser Arg
1 5 10 15
His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys
20 25
<![CDATA[ <210> 735]]>
<![CDATA[ <211> 63]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 735]]>
Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala Val His Pro
1 5 10 15
Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr Leu Pro Leu
20 25 30
Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe Pro Asn Gly
35 40 45
Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn Cys Ser
50 55 60
<![CDATA[ <210> 736]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 736]]>
Ser Leu Glu Leu Glu Asp Pro Ser Ser Ser Gly Leu Gly Val Thr Lys Gln
1 5 10 15
Asp Leu Gly Pro
20
<![CDATA[ <210> 737]]>
<![CDATA[ <211> 2]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 737]]>
Pro Met
1
<![CDATA[ <210> 738]]>
<![CDATA[ <211> 296]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 738]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala
35 40 45
Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly Gln
50 55 60
Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp Lys
65 70 75 80
Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met Lys
85 90 95
Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro
100 105 110
Leu Leu Met Gln Ala Leu Pro Met Gly Ala Leu Pro Gln Gly Pro Met
115 120 125
Gln Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val Met His
130 135 140
Leu Leu Gln Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly
145 150 155 160
Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr Ile
165 170 175
Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu
180 185 190
Met Ser Arg His Ser Leu Glu Gln Lys Pro Thr Asp Ala Pro Pro Lys
195 200 205
Val Leu Thr Lys Cys Gln Glu Glu Val Ser His Ile Pro Ala Val His
210 215 220
Pro Gly Ser Phe Arg Pro Lys Cys Asp Glu Asn Gly Asn Tyr Leu Pro
225 230 235 240
Leu Gln Cys Tyr Gly Ser Ile Gly Tyr Cys Trp Cys Val Phe Pro Asn
245 250 255
Gly Thr Glu Val Pro Asn Thr Arg Ser Arg Gly His His Asn Cys Ser
260 265 270
Glu Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val Thr Lys
275 280 285
Gln Asp Leu Gly Pro Val Pro Met
290 295
<![CDATA[ <210> 739]]>
<![CDATA[ <211> 459]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 739]]>
atcctgcccc gcaaaaggca gcttcaccaa agtggggtat ttccagcctt tgtagctttc 60
acttccacat ctaccaagtg ggcggagtgg ccttctgtgg acgaatcaga ttcctctcca 120
gcaccgactt taagaggcga gccggggggt cagggtccca gatgcacagg aggagaagca 180
ggagctgtcg ggaagatcag aagccagtca tggatgacca gcgcgacctt atctccaaca 240
atgagcaact gcccatgctg ggccggcgcc ctggggcccc ggagagcaag tgcagccgcg 300
gagccctgta cacaggcttt tccatcctgg tgactctgct cctcgctggc caggccacca 360
ccgcctactt cctgtaccag cagcagggcc ggctggaca actgacagtc acctcccaga 420
acctgcagct ggagaacctg cgcatgaagc ttcccaagc 459
<![CDATA[ <210> 740]]>
<![CDATA[ <211> 99]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 740]]>
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gln Lys Pro Val
1 5 10 15
Met Asp Asp Gln Arg Asp Leu Ile Ser Asn Asn Glu Gln Leu Pro Met
20 25 30
Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala
35 40 45
Leu Tyr Thr Gly Phe Ser Ile Leu Val Thr Leu Leu Leu Ala Gly Gln
50 55 60
Ala Thr Thr Ala Tyr Phe Leu Tyr Gln Gln Gln Gly Arg Leu Asp Lys
65 70 75 80
Leu Thr Val Thr Ser Gln Asn Leu Gln Leu Glu Asn Leu Arg Met Lys
85 90 95
Leu Pro Lys
<![CDATA[ <210> 741]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 741]]>
taccagacga tgaggatgta gtggggcccg ggcaggaatc tgatgacttt gagctgtctg 60
gctctggaga tctgg 75
<![CDATA[ <210> 742]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 742]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[ <210> 743]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 743]]>
taccagacga tgaggatgta gtggggcccg ggcaggaatc tgatgacttt gagctgtctg 60
gctctggaga tctggccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[ <210> 744]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 744]]>
Pro Asp Asp Glu Asp Val Val Gly Pro Gly Gln Glu Ser Asp Asp Phe
1 5 10 15
Glu Leu Ser Gly Ser Gly Asp Leu Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[ <210> 745]]>
<![CDATA[ <211> 100]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 745]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg 100
<![CDATA[ <210> 746]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 746]]>
atccgagaga ctgaggtcat cgacccccag gacctcctag aaggccgata cttctccgga 60
gccctaccag acgatgagga tgtagtgggg cccgggcagg aatctgatga ctttgagctg 120
tctggctctg gagatctgg 139
<![CDATA[ <210> 747]]>
<![CDATA[ <211> 47]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 747]]>
atgacttgga agactccatg atcggccctg aagttgtcca tcccttg 47
<![CDATA[ <210> 748]]>
<![CDATA[ <211> 199]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 748]]>
gtgcctctag ataaccatat ccctgagagg gcagggtctg ggagccaagt ccccaccgaa 60
cccaagaaac tagaggaa tgaggttatc cccaagagaa tctcacccgt tgaagagagt 120
gaggatgtgt ccaacaaggt gtcaatgtcc agcactgtgc agggcagcaa catctttgag 180
agaacggagg tcctggcag 199
<![CDATA[ <210> 749]]>
<![CDATA[ <211> 2128]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 749]]>
ctctgattgt gggtggcatc gtgggcatcc tctttgccgt cttcctgatc ctactgctca 60
tgtaccgtat gaagaagaag gatgaaggca gctatgacct gggcaagaaa cccatctaca 120
agaaagcccc caccaatgag ttctacgcgt gaagcttgct tgtgggcact ggcttggact 180
ttagcgggga gggaagccag gggattttga agggtggaca ttagggtagg gtgaggtcaa 240
cctaatactg acttgtcagt atctccagct ctgattacct ttgaagtgtt cagaagagac 300
attgtcttct actgttctgc caggttcttc ttgagctttg ggcctcagtt gccctggcag 360
aaaaatggat tcaacttggc ctttctgaag gcaagactgg gattggatca cttcttaaac 420
ttccagttaa gaatctaggt ccgccctcaa gcccatactg accatgcctc atccagagct 480
cctctgaagc cagggggcta acggatgttg tgtggagtcc tggctggagg tcctccccca 540
gtggccttcc tcccttcctt tcacagccgg tctctctgcc aggaaatggg ggaaggaact 600
agaaccacct gcaccttgag atgtttctgt aaatgggtac ttgtgatcac actacgggaa 660
tctctgtggt atatacctgg ggccattcta ggctctttca agtgactttt ggaaatcaac 720
cttttttatt tggggggggag gatggggaaa agagctgaga gtttatgctg aaatggattt 780
atagaatatt tgtaaatcta tttttagtgtttgttcgttt ttttaactgt tcattccttt 840
gtgcagagtg tatatctctg cctgggcaag agtgtgggagg tgccgaggtg tcttcattct 900
ctcgcacatt tccacagcac ctgctaagtt tgtatttaat ggtttttgtt tttgtttttg 960
tttgtttctt gaaaatgaga gaagagccgg agagatgatt tttattaatt tttttttttt 1020
tttttttttt ttactattta tagctttaga tagggcctcc cttcccctct tctttctttg 1080
ttctctttca ttaaacccct tccccagttt tttttttata ctttaaaccc cgctcctcat 1140
ggccttggcc ctttctgaag ctgcttcctc ttataaaata gcttttgccg aaacatagtt 1200
tttttttagc agatcccaaa atataatgaa ggggatggtg ggatatttgt gtctgtgttc 1260
ttataatata ttaattattct tccttggttc tagaaaaata gataaatata tttttttcag 1320
gaaatagtgt ggtgtttcca gtttgatgtt gctgggtggt tgagtgagtg aattttcatg 1380
tggctgggtg ggtttttgcc tttttctctt gccctgttcc tggtgccttc tgatggggct 1440
ggaatagttg aggtggatgg ttctaccctt tctgccttct gtttgggacc cagctggtgt 1500
tctttggttt gctttcttca ggctctaggg ctgtgctatc caatacagta accacatgcg 1560
gctgtttaaa gttaagccaa ttaaaatcac ataagattaa aaattccttc ctcagttgca 1620
ctaaccacgt ttctagaggc gtcactgtat gtagttcatg gctactgtac tgacagcgag 1680
agcatgtcca tctgttggac agcactattc tagagaacta aactggctta acgagtcaca 1740
gcctcagctg tgctgggacg acccttgtct ccctgggtag ggggggggga atgggggagg 1800
gctgatgagg ccccagctgg ggcctgttgt ctgggaccct ccctctcctg agagggggagg 1860
cctggtggct tagcctgggc aggtcgtgtc tcctcctgac cccagtggct gcggtgaggg 1920
gaaccaccct cccttgctgc accagtggcc attagctccc gtcaccactg caacccaggg 1980
tcccagctgg ctgggtcctc ttctgccccc agtgcccttc cccttgggct gtgttggagt 2040
gagcacctcc tctgtaggca cctctcacac tgttgtctgt tactgatttt ttttgataaa 2100
aagataataa aacctggtac tttctaaa 2128
<![CDATA[ <210> 750]]>
<![CDATA[ <211> 2613]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Full mRNA polynucleotide sequence of SDC4]]>
<![CDATA[ <400> 750]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgga 240
tgacttggaa gactccatga tcggccctga agttgtccat cccttggtgc ctctagataa 300
ccatatccct gagagggcag ggtctggggag ccaagtcccc accgaaccca agaaactaga 360
ggagaatgag gttatcccca agagaatctc acccgttgaa gagagtgagg atgtgtccaa 420
caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa cggaggtcct 480
ggcagctctg attgtgggtg gcatcgtggg catcctcttt gccgtcttcc tgatcctact 540
gctcatgtac cgtatgaaga agaaggatga aggcagctat gacctgggca agaaacccat 600
ctacaagaaa gccccccacca atgagttcta cgcgtgaagc ttgcttgtgg gcactggctt 660
ggactttagc ggggagggaa gccaggggat tttgaagggt ggacattagg gtagggtgag 720
gtcaacctaa tactgacttg tcagtatctc cagctctgat tacctttgaa gtgttcagaa 780
gagacattgt cttctactgt tctgccaggt tcttcttgag ctttgggcct cagttgccct 840
ggcagaaaaa tggattcaac ttggcctttc tgaaggcaag actgggattg gatcacttct 900
taaacttcca gttaagaatc taggtccgcc ctcaagccca tactgaccat gcctcatcca 960
gagctcctct gaagccagggg ggctaacgga tgttgtgtgg agtcctggct ggaggtcctc 1020
ccccagtggc cttcctccct tcctttcaca gccggtctct ctgccaggaa atgggggaag 1080
gaactagaac cacctgcacc ttgagatgtt tctgtaaatg ggtacttgtg atcacactac 1140
gggaatctct gtggtatata cctggggcca ttctaggctc tttcaagtga cttttggaaa 1200
tcaacctttt ttatttgggg gggaggatgg ggaaaagagc tgagagttta tgctgaaatg 1260
gatttataga atatttgtaa atctattttt agtgtttgtt cgttttttta actgttcatt 1320
cctttgtgca gagtgtatat ctctgcctgg gcaagagtgt ggaggtgccg aggtgtcttc 1380
attctctcgc aatttccac agcacctgct aagtttgtat ttaatggttt ttgtttttgt 1440
ttttgtttgt ttcttgaaaa tgagagaaga gccggagaga tgatttttat taattttttt 1500
tttttttttttttttttact atttatagct ttagataggg cctcccttcc cctcttcttt 1560
ctttgttctc tttcattaaa ccccttcccc agtttttttt ttatacttta aaccccgctc 1620
ctcatggcct tggccctttc tgaagctgct tcctcttata aaatagcttt tgccgaaaca 1680
tagttttttt ttagcagatc ccaaaatata atgaagggga tggtgggata tttgtgtctg 1740
tgttcttata atatattatt attcttcctt ggttctagaa aaatagataa atatattttt 1800
ttcaggaaat agtgtggtgt ttccagtttg atgttgctgg gtggttgagt gagtgaattt 1860
tcatgtggct gggtgggttt ttgccttttt ctcttgccct gttcctggtg ccttctgatg 1920
gggctggaat agttgaggtg gatggttcta ccctttctgc cttctgtttg ggacccagct 1980
ggtgttcttt ggtttgcttt cttcaggctc tagggctgtg ctatccaata cagtaaccac 2040
atgcggctgt ttaaagttaa gccaattaaa atcacataag attaaaaatt ccttcctcag 2100
ttgcactaac cacgtttcta gaggcgtcac tgtatgtagt tcatggctac tgtactgaca 2160
gcgagagcat gtccatctgt tggacagcac tattctagag aactaaactg gcttaacgag 2220
tcacagcctc agctgtgctg ggacgaccct tgtctccctg ggtagggggg ggggaatggg 2280
ggagggctga tgaggcccca gctggggcct gttgtctggg accctccctc tcctgagagg 2340
ggaggcctgg tggcttagcc tgggcaggtc gtgtctcctc ctgaccccag tggctgcggt 2400
gaggggaacc accctccctt gctgcaccag tggccattag ctcccgtcac cactgcaacc 2460
cagggtccca gctggctggg tcctcttctg cccccagtgc ccttcccctt gggctgtgtt 2520
ggagtgagca cctcctctgt aggcacctct cacactgttg tctgttactg attttttttg 2580
ataaaaagat aataaaacct ggtactttct aaa 2613
<![CDATA[ <210> 751]]>
<![CDATA[ <211> 20]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 751]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser
20
<![CDATA[ <210> 752]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 752]]>
Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu Leu Glu Gly Arg
1 5 10 15
Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val Val Gly Pro Gly
20 25 30
Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly Asp Leu
35 40 45
<![CDATA[ <210> 753]]>
<![CDATA[ <211> 15]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 753]]>
Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His Pro Leu
1 5 10 15
<![CDATA[ <210> 754]]>
<![CDATA[ <211> 66]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 754]]>
Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly Ser Gln
1 5 10 15
Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile Pro Lys
20 25 30
Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys Val Ser
35 40 45
Met Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr Glu Val
50 55 60
Leu Ala
65
<![CDATA[ <210> 755]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 755]]>
Leu Ile Val Gly Gly Ile Val Gly Ile Leu Phe Ala Val Phe Leu Ile
1 5 10 15
Leu Leu Leu Met Tyr Arg Met Lys Lys Lys Asp Glu Gly Ser Tyr Asp
20 25 30
Leu Gly Lys Lys Pro Ile Tyr Lys Lys Ala Pro Thr Asn Glu Phe Tyr
35 40 45
Ala
<![CDATA[ <210> 756]]>
<![CDATA[ <211> 198]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 756]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu Asp Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His
65 70 75 80
Pro Leu Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly
85 90 95
Ser Gln Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile
100 105 110
Pro Lys Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys
115 120 125
Val Ser Met Ser Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr
130 135 140
Glu Val Leu Ala Ala Leu Ile Val Gly Gly Ile Val Gly Ile Leu Phe
145 150 155 160
Ala Val Phe Leu Ile Leu Leu Leu Met Tyr Arg Met Lys Lys Lys Lys Asp
165 170 175
Glu Gly Ser Tyr Asp Leu Gly Lys Lys Pro Ile Tyr Lys Lys Ala Pro
180 185 190
Thr Asn Glu Phe Tyr Ala
195
<![CDATA[ <210> 757]]>
<![CDATA[ <211> 239]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 757]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgg 239
<![CDATA[ <210> 758]]>
<![CDATA[ <211> 66]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 758]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu
65
<![CDATA[ <210> 759]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 759]]>
tttctactgt aacaccccatc ccagacgaag acagtccctg gatcaccgac agcacagaca 60
gaatccctgc tacca 75
<![CDATA[ <210> 760]]>
<![CDATA[ <21]]>1> 75]]>
<br/> <![CDATA[ <212>DNA]]>
<br/> <![CDATA[ <213>Human]]>
<br/>
<br/> <![CDATA[ <400>760]]>
<br/> <![CDATA[ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgtgaatg 75
<![CDATA[ <210> 761]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 761]]>
tttctactgt aacaccccatc ccagacgaag acagtccctg gatcaccgac agcacagaca 60
gaatccctgc taccactaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[ <210> 762]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 762]]>
Ser Thr Val His Pro Ile Pro Asp Glu Asp Ser Pro Trp Ile Thr Asp
1 5 10 15
Ser Thr Asp Arg Ile Pro Ala Thr Thr Thr Thr Ser Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[ <210> 763]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212> ]]>DNA
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 763]]>
actaccccat caactccaac ttcaaagtgg gctacaacac tctcttttcc tgtacctgtg 60
tgccacctga cccag 75
<![CDATA[ <210> 764]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 764]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgtgaatg 75
<![CDATA[ <210> 765]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 765]]>
actaccccat caactccaac ttcaaagtgg gctacaacac tctcttttcc tgtacctgtg 60
tgccacctga cccagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[ <210> 766]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 766]]>
Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu Phe Ser
1 5 10 15
Cys Thr Cys Val Pro Pro Asp Pro Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[ <210> 767]]>
<![CDATA[ <211> 143]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 767]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccaacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gag 143
<![CDATA[ <210> 768]]>
<![CDATA[ <211> 74]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 768]]>
gatggaggat gaagctgtcc tggacagagg ggcttccttc ctcaagcatg tgtgtgatga 60
agaagaagta gaag 74
<![CDATA[ <210> 769]]>
<![CDATA[ <211> 180]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 769]]>
gccaccacatac catttacatc ggagtccatg tgccgaagag ttacaggaga aggagacgtc 60
acaagagaaa gacagggcac aaagaaaaga aggaaaagga gagaatctct gagaactact 120
ctgacaaatc agatattgaa aatgctgatg aatccagcag cagcatccta aaacctctca 180
<![CDATA[ <210> 770]]>
<![CDATA[ <211> 136]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 770]]>
tctctcctgc tgcagaacgc atccgattca tcttgggaga ggaggatgac agcccagctc 60
cccctcagct cttcacggaa ctggatgagc tgctggccgt ggatgggcag gagatggagt 120
ggaaggaaac agccag 136
<![CDATA[ <210> 771]]>
<![CDATA[ <211> 161]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 771]]>
gtggatcaag tttgaagaaa aagtggaaca gggtggggaa agatggagca agccccatgt 60
ggccacattg tcccttcata gtttatttga gctgaggaca tgtatggaga aaggatccat 120
catgcttgat cgggaggctt cttctctccc acagttggtg g 161
<![CDATA[ <210> 772]]>
<![CDATA[ <211> 180]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 772]]>
agatgattgt tgaccatcag attgagacag gcctattgaa acctgaactt aaggataagg 60
tgacctatac tttgctccgg aagcaccggc atcaaaccaa gaaatccaac cttcggtccc 120
tggctgacat tgggaagaca gtctccagtg caagtaggat gtttaccaac cctgataatg 180
<![CDATA[ <210> 773]]>
<![CDATA[ <211> 77]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 773]]>
gtagcccagc catgacccat aggaatctga cttcctccag tctgaatgac atttctgata 60
aaccggagaa ggaccag 77
<![CDATA[ <210> 774]]>
<![CDATA[ <211> 158]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 774]]>
ctgaagaata agttcatgaa aaaattgcca cgtgatgcag aagcttccaa cgtgcttgtt 60
gggggaggttg actttttgga tactcctttc attgcctttg ttaggctaca gcaggctgtc 120
atgctgggtg ccctgactga agttcctgtg cccacaag 158
<![CDATA[ <210> 775]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 775]]>
gttcttgttc attctcttag gtcctaaggg gaaagccaag tcctaccacg agattggcag 60
agccattgcc accctgatgt ctgatgag 88
<![CDATA[ <210> 776]]>
<![CDATA[ <211> 155]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 776]]>
gtgttccatg acattgctta taaagcaaaa gacaggcacg acctgattgc tggtattgat 60
gagttcctag atgaagtcat cgtccttcca cctggggaat gggatccagc aattaggata 120
gagcctccta agagtcttcc atcctctgac aaaag 155
<![CDATA[ <210> 777]]>
<![CDATA[ <211> 114]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 777]]>
aaagaatatg tactcaggtg gagagaatgt tcagatgaat ggggatacgc cccatgatgg 60
aggtcacgga ggaggaggac atggggattg tgaagaattg cagcgaactg gacg 114
<![CDATA[ <210> 778]]>
<![CDATA[ <211> 175]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 778]]>
gttctgtggt ggactaatta aagacataaa gaggaaagcg ccattttttg ccagtgattt 60
ttatgatgct ttaaatattc aagctctttc ggcaattctc ttcatttatc tggcaactgt 120
aactaatgct atcacttttg gaggactgct tggggatgcc actgacaaca tgcag 175
<![CDATA[ <210> 779]]>
<![CDATA[ <211> 134]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 779]]>
ggcgtgttgg agagtttcct gggcactgct gtctctggag ccatcttttg cctttttgct 60
ggtcaaccac tcactattct gagcagcacc ggacctgtcc tagtttttga gaggcttcta 120
tttaatttca gcaa 134
<![CDATA[ <210> 780]]>
<![CDATA[ <211> 272]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 780]]>
ggacaataat tttgactatt tggagtttcg cctttggatt ggcctgtggt ccgccttcct 60
atgtctcatt ttggtagcca ctgatgccag cttcttggtt caatacttca cacgtttcac 120
ggaggagggc ttttcctctc tgattagctt catctttatc tatgatgctt tcaagaagat 180
gatcaagctt gcagattact acccccatcaa ctccaacttc aaagtgggct acaacactct 240
cttttcctgt acctgtgtgc cacctgaccc ag 272
<![CDATA[ <210> 781]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 781]]>
ctaatatctc aatatctaat gacaccacac tggccccaga gtatttgcca actatgtctt 60
ctactgacat g 71
<![CDATA[ <210> 782]]>
<![CDATA[ <211> 192]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 782]]>
taccataata ctacctttga ctgggcattt ttgtcgaaga aggagtgttc aaaatacgga 60
ggaaacctcg tcgggaacaa ctgtaatttt gttcctgata tcacactcat gtcttttatc 120
ctcttcttgg gaacctacac ctcttccatg gctctgaaaa aattcaaaac tagtccttat 180
tttccaacca ca 192
<![CDATA[ <210> 783]]>
<![CDATA[ <211> 114]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 783]]>
gcaagaaaac tgatcagtga ttttgccatt atcttgtcca ttctcatctt ttgtgtaata 60
gatgccctag taggcgtgga caccccaaaa ctaattgtgc caagtgagtt caag 114
<![CDATA[ <210> 784]]>
<![CDATA[ <211> 162]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 784]]>
ccaacaagtc caaaccgagg ttggttcgtt ccaccgtttg gagaaaaccc ctggtgggtg 60
tgccttgctg ctgctatccc ggctttgttg gtcactatac tgattttcat ggaccaacaa 120
attacagctg tgattgtaaa caggaaagaa cataaactca ag 162
<![CDATA[ <210> 785]]>
<![CDATA[ <211> 179]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 785]]>
aaaggagcag ggtatcactt ggatctcttt tgggtggcca tcctcatggt tatatgctcc 60
ctcatggctc ttccgtggta tgtagctgct acggtcatct ccattgctca catcgacagt 120
ttgaagatgg agacagagac ttctgcacct ggagaacaac caaagtttct aggagtgag 179
<![CDATA[ <210> 786]]>
<![CDATA[ <211]]>> 73]]>
<br/> <![CDATA[ <212>DNA]]>
<br/> <![CDATA[ <213>Human]]>
<br/>
<br/> <![CDATA[ <400>786]]>
<br/> <![CDATA[ggaacaaaga gtcactggaa cccttgtgtt tattctgact ggtctgtcag tctttatggc 60
tcccatcttg aag 73
<![CDATA[ <210> 787]]>
<![CDATA[ <211> 70]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 787]]>
gtttataccc atgcctgtac tctatggtgt gttcctgtat atgggagtag catcccttaa 60
tggtgtgcag 70
<![CDATA[ <210> 788]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 788]]>
ttcatggatc gtctgaagct gcttctgatg cctctgaagc atcagcctga cttcatctac 60
ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt tcctgcaggt gttgtgtctg 120
gccctgcttt ggatcctcaa gtcaacggtg gctgctatca tttttccagt aatg 174
<![CDATA[ <210> 789]]>
<![CDATA[ <211> 162]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 789]]>
atcttggcac ttgtagctgt cagaaaaggc atggactacc tcttctccca gcacgacctc 60
agcttcctgg atgatgtcat tccagaaaag gacaagaaaa agaaggagga tgagaagaaa 120
aagaaaaaga agaagggaag tctggacagt gacaatgatg at 162
<![CDATA[ <210> 790]]>
<![CDATA[ <211> 97]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 790]]>
tctgactgcc catactcaga aaaagttcca agtattaaaa ttccaatgga catcatggaa 60
cagcaacctt tcctaagcga tagcaaacct tctgaca 97
<![CDATA[ <210> 791]]>
<![CDATA[ <211> 80]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 791]]>
gagaaagatc accaacattc cttgaacgcc acacatcatg ctgataaaat tcctttcctt 60
cagtcactcg gtatgccaag 80
<![CDATA[ <210> 792]]>
<![CDATA[ <211> 4296]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 792]]>
tcctcctaga actccagtaa aagttgtgcc tcaaattaga atagaacttg aacctgaaga 60
caatgattat ttctggagga gcaagggaac agaaactaca ttgtaacctg tttgtctttc 120
ttaaaactga catttgttgt taatgtcatt tgtttttgtt tggctgtttg tttatttttt 180
aacttttatt tcgtctcagt ttttggtcac aggccaaata atacagcgct ctctctgctt 240
ctctcttgca tagacacaat caagacaata gtgcaccgtt ccttaaaaac agcatctgag 300
gaatccccct tttgttctta aactttcaga tgtgtccttt gataaccaaa ttctgtcact 360
caagacacag acacgcacag accctgtcct ttgcctctat taagcagagg atggaagtat 420
taaggatttt gtaacacctt ttatgaaaat gttgaaggaa cttaaaactt tagctttgga 480
gctgtgctta ctggcttgtc tttgtctggt agaacaaacc ttgacctcca gacagagtcc 540
cttctcactt atagagctct ccaggactgg aaaaagtgct gctattttaa cttgctcttg 600
cttgtaaatc ctaatcttag agttatcaaa agaagaaaaa actgaaggta ctttactccc 660
tatagagaaa ccattgccat cattgtagca agtgctggaa tgtccctttt ttcctatgca 720
acttttttta accctttaat gaacttatct gttgagtaca ttgaagaata tttttcttcc 780
tagattttgt tgtttaaatt atggggccta acctgccact tattttttgt caatttttaa 840
aacttttttt taattactgt aaagaaaatg aattttttcc tgcagcagga aacatagttt 900
tgagtagttc tacctcttat ttgtagctgc caggctttct gtaaaaattg tattgtatat 960
aatgtgattt ttacacatac atacacacac aaatacacaa tctctagggt aagccagaag 1020
gcaagatcag attaaaaaca ccatgtttct aagcatccat ttttcccttt ctttaaaaga 1080
aacttaactg ttctatgaag gagattgagg gagaagagac aaactcctat gtcatgagaa 1140
taaccgatgt tctgataata gtagcatcta ggtacagatg ctggttgtat taccacgtca 1200
atgtcctatg cagtattgtt agacattttc tcattttgaa atatttgtgtgtttgtgtat 1260
gtgctctgtg ccatggctgg tgtatatatg tgcaatgtta gaaggcaaaa gagtgatggt 1320
aggcagagggg caaagtcatt gaatctctta tgccagtttt cataaaaccc aaaccacata 1380
tgaaaaaatc cattaagggt ccaagaagtc tgtccatatg aaaatgaggg taaatatagt 1440
ttatttccca ggtatcagtc attataattg atataatagc tctaacatgc aatataaaat 1500
tcataggagt attaatagcc catttacaca tctataaaat gtaatgggat tgcagagctg 1560
cagagtacag tgtaacagta ctctcatgca atttttttca ggatgcaaag gcaattattc 1620
tttgtaagcg ggacatttag aatatatttg tgtacatatt atatgtatgt atatttcaaa 1680
gtaccacact gaaaattaga catttattaa ccaaatttaa cgtggtattt aaaggtaata 1740
tttttaatat gatacattac atattgtgaa tgtatactaa aaaaacattt taaatgttaa 1800
aattataatt tcagattcat ataaccacaa ctgtgatata tcctaactat aaccagttgt 1860
tgaggggtat actagaagca gaatgaaacc aattttttg gtttgataat atgcacttat 1920
tgactcccac tcattgttat gttaattaag ttaattattct gtctccttgt aattttgatt 1980
acaaaaattt tattatcctg agttagctgt tacttttaca gtacctgata ctcctaaaac 2040
ttttaactta tacaaattag tcaataatga ccccaatttt ttcattaaaa taatagtggt 2100
gaattatatg ttaattgtgtt aaaacctcac ttgccaaatt ctggcttcac atttgtattt 2160
agggctatcc ttaaaatgat gagtctatat tatctagctt tctattaccc taatataaac 2220
tggtataaga agactttcct tttttcttta tgcatggaag catcaataaa ttgtttaaaa 2280
accatgtata gtaaattcag cttaacccgt gatcttctta agttaaaggt acttttgttt 2340
tataaaagct ctagataaaa ctttcttttc tgatcatgaa tcaagtatct gtggtttcat 2400
gcccctctct atacctttca aagaactcct gaagcaactt aactcatcat ttcagcctct 2460
gagtagaggt aaaacctatg tgtacttctg tttatgatcc atattgatat ttatgacatg 2520
aacacagaat agtaccttac atttgctaaa cagacagtta atatcaaatc ctttcaatat 2580
tctgggaacc cagggaagtt tttaaaaatg tcattacttt caaaggaaca gaagtagtta 2640
accaaactaa caagcaaaac ctgaggttta cctagtgaca ccaaattatc ggtattttaa 2700
ctgaatttac ccattgacta agaatgaacc agattggtg gtggttttgt ttctatgcaa 2760
actggacaca aattacaaca gtaaattttt ttataagtgc ttctcccttc tccatgatgt 2820
gacttccgga gataaaggat tcaaaagata aagacaaagt acgctcagag ttgttaacca 2880
gaaagtcctg gctgtggttg cagaaacact gttggaagaa aagagatgac taagtcaagt 2940
gtctgcctta tcaaaagagc aaaaatgcct ctggttttgt gtttgggaga aaagtatctt 3000
ggacgcactg ttttcccttga taaaagtcat cttctctact gtgtgaaatg aatacttgga 3060
attctaattg ttttgtgtgc caggggcagt aatgtccctg cctcttctcc caatcaaggt 3120
tgaggagtgg ggctggggag aggacttaac tgacttaaga agtaggaaaa caaaaacctc 3180
tctcctcagc cttccacctc caagagagga ggaaaaacag ttgtctgctg tctgtaattc 3240
agtttgcgtg tattttatgc tcatgcacca acccatacag agtaaatctt ttatcaactg 3300
tatactggtg tttaatagag aatgattgtc ttccgagttt tttggttcct tttttaactg 3360
tgttaaagta cttgaaatgt attgactgct gactatattt taaaaacaaa atgaaataat 3420
ttgagttgta ttacagaggt tgacattgtt cagggatggg acaaagcctt cttcaatcct 3480
tttcatacta cttaatgatt ttggtgcagg aacctgagat tttctgattt atatttcatg 3540
atatttcaca tttgctcttc acagcatgag catgaagccc agtggcacca aatggctggg 3600
tacaatcaag tgatattttg tagcacctca ctatctgaaa ggccatgagt tttcagatga 3660
tttcattgag cttcattgca gcctgaaatt ttaaaaaagt tgtgtaatac gccaaccagt 3720
caagttgtgttttggccaga gatttagata tgtccaattt cctggctcat ttcattgtgc 3780
tctatgggta cgtataaaaa gcaagaattc tgtttcctag gcaaacattg caactcaggg 3840
ctaaagtcat ccagtgaaac ttttagagcc agaagtaact ttgtcccagt cctacaatgt 3900
gaaaagagtg aatagttgcc tctttttagc cattttcatg gctggtacat attcgtacgc 3960
attacktttc agaatcaata cgcactttca gatattctta tttttattct cttaagtctt 4020
tattaacttt ggagagagaa atgatgcatc tttttatttt aaatgaagta gatcaacatg 4080
gtggaacaaa atgataaaga acagaaaaca tttcaatata ttactaataa ctttttccaa 4140
tataaatcct aaaattccta taacatagta ttttacagtt ttatgaagct ttctattgtg 4200
acttttatgg aattaagaga tgaagaagat gagatatttt agcatttata tttttcaaaa 4260
ttatatgtat acttaaaaat aaagtaactt tatgca 4296
<![CDATA[ <210> 793]]>
<![CDATA[ <211> 7716]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Full mRNA polynucleotide sequence of SLC4A4]]>
<![CDATA[ <400> 793]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccaacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gaggatggag gatgaagctg tcctggacag aggggcttcc 180
ttcctcaagc atgtgtgtga tgaagaagaa gtagaaggcc accataccat ttacatcgga 240
gtccatgtgc cgaagagtta caggagaagg agacgtcaca agagaaagac agggcacaaa 300
gaaaagaagg aaaaggagag aatctctgag aactactctg acaaatcaga tattgaaaat 360
gctgatgaat ccagcagcag catcctaaaa cctctcatct ctcctgctgc agaacgcatc 420
cgattcatct tgggagagga ggatgacagc ccagctcccc ctcagctctt cacggaactg 480
gatgagctgc tggccgtgga tgggcaggag atggagtgga aggaaacagc caggtggatc 540
aagtttgaag aaaaagtgga acagggtggg gaaagatgga gcaagcccca tgtggccaca 600
ttgtcccttc atagtttatt tgagctgagg acatgtatgg agaaaggatc catcatgctt 660
gatcgggagg cttcttctct cccacagttg gtggagatga ttgttgacca tcagattgag 720
acaggcctat tgaaacctga acttaaggat aaggtgacct atactttgct ccggaagcac 780
cggcatcaaa ccaagaaatc caaccttcgg tccctggctg aattgggaa gacagtctcc 840
agtgcaagta ggatgtttac caaccctgat aatggtagcc cagccatgac ccataggaat 900
ctgacttcct ccagtctgaa tgacatttct gataaaccgg agaaggacca gctgaagaat 960
aagttcatga aaaaattgcc acgtgatgca gaagcttcca acgtgcttgt tggggaggtt 1020
gactttttgg atactccttt cattgccttt gttaggctac agcaggctgt catgctgggt 1080
gccctgactg aagttcctgt gcccacaagg ttcttgttca ttctcttagg tcctaagggg 1140
aaagccaagt cctaccacga gattggcaga gccattgcca ccctgatgtc tgatgaggtg 1200
ttccatgaca ttgcttataa agcaaaagac aggcacgacc tgattgctgg tattgatgag 1260
ttcctagatg aagtcatcgt ccttccacct ggggaatggg atccagcaat taggatagag 1320
cctcctaaga gtcttccatc ctctgacaaa agaaagaata tgtactcagg tggagagaat 1380
gttcagatga atggggatac gccccatgat ggaggtcacg gaggaggagg acatggggat 1440
tgtgaagaat tgcagcgaac tggacggttc tgtggtggac taattaaaga cataaagagg 1500
aaagcgccat tttttgccag tgatttttat gatgctttaa atattcaagc tctttcggca 1560
attctcttca tttatctggc aactgtaact aatgctatca cttttggagg actgcttggg 1620
gatgccactg acaacatgca gggcgtgttg gagagtttcc tgggcactgc tgtctctgga 1680
gccatctttt gcctttttgc tggtcaacca ctcactattc tgagcagcac cggacctgtc 1740
ctagtttttg agaggcttct atttaatttc agcaaggaca ataattttga ctatttggag 1800
tttcgccttt ggattggcct gtggtccgcc ttcctatgtc tcattttggt agccactgat 1860
gccagcttct tggttcaata cttcacacgt ttcacggagg agggcttttc ctctctgatt 1920
agcttcatct ttatctatga tgctttcaag aagatgatca agcttgcaga ttactaccccc 1980
atcaactcca acttcaaagt gggctacaac actctctttt cctgtacctg tgtgccacct 2040
gacccagcta atatctcaat atctaatgac accacactgg ccccagagta tttgccaact 2100
atgtcttcta ctgacatgta ccataatact acctttgact gggcattttt gtcgaagaag 2160
gagtgttcaa aatacggagg aaacctcgtc gggaacaact gtaattttgt tcctgattc 2220
acactcatgt cttttatcct cttcttggga acctacacct cttccatggc tctgaaaaaa 2280
ttcaaaacta gtccttattt tccaaccaca gcaagaaaac tgatcagtga ttttgccatt 2340
atcttgtcca ttctcatctt ttgtgtaata gatgccctag taggcgtgga caccccaaaa 2400
ctaattgtgc caagtgagtt caagccaaca agtccaaacc gaggttggtt cgttccaccg 2460
tttggagaaa acccctggtgggtgtgcctt gctgctgcta tcccggcttt gttggtcact 2520
atactgattt tcatggacca acaaattaca gctgtgattg taaacaggaa agaacataaa 2580
ctcaagaaag gagcagggta tcacttggat ctcttttggg tggccatcct catggttata 2640
tgctccctca tggctcttcc gtggtatgta gctgctacgg tcatctccat tgctcacatc 2700
gacagtttga agatggagac agagacttct gcacctggag aacaaccaaa gtttctagga 2760
gtgagggaac aaagagtcac tggaaccctt gtgtttattc tgactggtct gtcagtcttt 2820
atggctccca tcttgaagtt tatacccatg cctgtactct atggtgtgtt cctgtatatg 2880
ggagtagcat cccttaatgg tgtgcagttc atggatcgtc tgaagctgct tctgatgcct 2940
ctgaagcatc agcctgactt catctacctg cgtcatgttc ctctgcgcag agtccacctg 3000
ttcactttcc tgcaggtgtt gtgtctggcc ctgctttgga tcctcaagtc aacggtggct 3060
gctatcattt ttccagtaat gatcttggca cttgtagctg tcagaaaagg catggactac 3120
ctcttctccc agcacgacct cagcttcctg gatgatgtca ttccagaaaa ggacaagaaa 3180
aagaaggagg atgagaagaa aaagaaaaag aagaagggaa gtctggacag tgacaatgat 3240
gattctgact gcccatactc agaaaaagtt ccaagtatta aaattccaat ggacatcatg 3300
gaacagcaac ctttcctaag cgatagcaaa ccttctgaca gagaaagatc accaacattc 3360
cttgaacgcc acacatcatg ctgataaaat tcctttcctt cagtcactcg gtatgccaag 3420
tcctcctaga actccagtaa aagttgtgcc tcaaattaga atagaacttg aacctgaaga 3480
caatgattat ttctggagga gcaagggaac agaaactaca ttgtaacctg tttgtctttc 3540
ttaaaactga catttgttgt taatgtcatt tgtttttgtt tggctgtttg tttatttttt 3600
aacttttatt tcgtctcagt ttttggtcac aggccaaata atacagcgct ctctctgctt 3660
ctctcttgca tagacacaat caagacaata gtgcaccgtt ccttaaaaac agcatctgag 3720
gaatccccct tttgttctta aactttcaga tgtgtccttt gataaccaaa ttctgtcact 3780
caagacacag acacgcacag accctgtcct ttgcctctat taagcagagg atggaagtat 3840
taaggatttt gtaacacctt ttatgaaaat gttgaaggaa cttaaaactt tagctttgga 3900
gctgtgctta ctggcttgtc tttgtctggt agaacaaacc ttgacctcca gacagagtcc 3960
cttctcactt atagagctct ccaggactgg aaaaagtgct gctattttaa cttgctcttg 4020
cttgtaaatc ctaatcttag agttatcaaa agaagaaaaa actgaaggta ctttactccc 4080
tatagagaaa ccattgccat cattgtagca agtgctggaa tgtccctttt ttcctatgca 4140
acttttttta accctttaat gaacttatct gttgagtaca ttgaagaata tttttcttcc 4200
tagattttgt tgtttaaatt atggggccta acctgccact tatttttgt caatttttaa 4260
aacttttttt taattactgt aaagaaaatg aattttttcc tgcagcagga aacatagttt 4320
tgagtagttc tacctcttat ttgtagctgc caggctttct gtaaaaattg tattgtatat 4380
aatgtgattt ttacacatac atacacacac aaatacacaa tctctagggt aagccagaag 4440
gcaagatcag attaaaaaca ccatgtttct aagcatccat ttttcccttt ctttaaaaga 4500
aacttaactg ttctatgaag gagattgagg gagaagagac aaactcctat gtcatgagaa 4560
taaccgatgt tctgataata gtagcatcta ggtacagatg ctggttgtat taccacgtca 4620
atgtcctatg cagtattgtt agacattttc tcattttgaa atatttgtgt gtttgtgtat 4680
gtgctctgtg ccatggctgg tgtatatatg tgcaatgtta gaaggcaaaa gagtgatggt 4740
aggcagagggg caaagtcatt gaatctctta tgccagtttt cataaaaccc aaaccacata 4800
tgaaaaaatc cattaagggt ccaagaagtc tgtccatatg aaaatgaggg taaatatagt 4860
ttatttccca ggtatcagtc attataattg atataatagc tctaacatgc aatataaaat 4920
tcataggagt attaatagcc catttacaca tctataaaat gtaatgggat tgcagagctg 4980
cagagtacag tgtaacagta ctctcatgca atttttttca ggatgcaaag gcaattattc 5040
tttgtaagcg ggacatttag aatatatttg tgtacatatt atatgtatgt atatttcaaa 5100
gtaccacact gaaaattaga catttattaa ccaaatttaa cgtggtattt aaaggtaata 5160
tttttaatat gatacattac atattgtgaa tgtatactaa aaaaacattt taaatgttaa 5220
aattataatt tcagattcat ataaccacaa ctgtgatata tcctaactat aaccagttgt 5280
tgaggggtat actagaagca gaatgaaacc aattttttg gtttgataat atgcacttat 5340
tgactcccac tcattgttat gttaattaag ttaattattct gtctccttgt aattttgatt 5400
acaaaaattt tattatcctg agttagctgt tacttttaca gtacctgata ctcctaaaac 5460
ttttaactta tacaaattag tcaataatga ccccaatttt ttcattaaaa taatagtggt 5520
gaattatatg ttaattgtgtt aaaacctcac ttgccaaatt ctggcttcac atttgtattt 5580
agggctatcc ttaaaatgat gagtctatat tatctagctt tctattaccc taatataaac 5640
tggtataaga agactttcct tttttcttta tgcatggaag catcaataaa ttgtttaaaa 5700
accatgtata gtaaattcag cttaacccgt gatcttctta agttaaaggt acttttgttt 5760
tataaaagct ctagataaaa ctttcttttc tgatcatgaa tcaagtatct gtggtttcat 5820
gcccctctct atacctttca aagaactcct gaagcaactt aactcatcat ttcagcctct 5880
gagtagaggt aaaacctatg tgtacttctg tttatgatcc atattgatat ttatgacatg 5940
aacacagaat agtaccttac atttgctaaa cagacagtta atatcaaatc ctttcaatat 6000
tctgggaacc cagggaagtt tttaaaaatg tcattacttt caaaggaaca gaagtagtta 6060
accaaactaa caagcaaaac ctgaggttta cctagtgaca ccaaattatc ggtattttaa 6120
ctgaatttac ccattgacta agaatgaacc agattggtg gtggttttgt ttctatgcaa 6180
actggacaca aattacaaca gtaaattttt ttataagtgc ttctcccttc tccatgatgt 6240
gacttccgga gataaaggat tcaaaagata aagacaaagt acgctcagag ttgttaacca 6300
gaaagtcctg gctgtggttg cagaaacact gttggaagaa aagagatgac taagtcaagt 6360
gtctgcctta tcaaaagagc aaaaatgcct ctggttttgt gtttgggaga aaagtatctt 6420
ggacgcactg ttttcccttga taaaagtcat cttctctact gtgtgaaatg aatacttgga 6480
attctaattg ttttgtgtgc caggggcagt aatgtccctg cctcttctcc caatcaaggt 6540
tgaggagtgg ggctggggag aggacttaac tgacttaaga agtaggaaaa caaaaacctc 6600
tctcctcagc cttccacctc caagagagga ggaaaaacag ttgtctgctg tctgtaattc 6660
agtttgcgtg tattttatgc tcatgcacca acccatacag agtaaatctt ttatcaactg 6720
tatactggtg tttaatagag aatgattgtc ttccgagttt tttggttcct tttttaactg 6780
tgttaaagta cttgaaatgt attgactgct gactatattt taaaaacaaa atgaaataat 6840
ttgagttgta ttacagaggt tgacattgtt cagggatggg acaaagcctt cttcaatcct 6900
tttcatacta cttaatgatt ttggtgcagg aacctgagat tttctgattt atatttcatg 6960
atatttcaca tttgctcttc acagcatgag catgaagccc agtggcacca aatggctggg 7020
tacaatcaag tgatattttg tagcacctca ctatctgaaa ggccatgagt tttcagatga 7080
tttcattgag cttcattgca gcctgaaatt ttaaaaaagt tgtgtaatac gccaaccagt 7140
caagttgtgttttggccaga gatttagata tgtccaattt cctggctcat ttcattgtgc 7200
tctatgggta cgtataaaaa gcaagaattc tgtttcctag gcaaacattg caactcaggg 7260
ctaaagtcat ccagtgaaac ttttagagcc agaagtaact ttgtcccagt cctacaatgt 7320
gaaaagagtg aatagttgcc tctttttagc cattttcatg gctggtacat attcgtacgc 7380
attacktttc agaatcaata cgcactttca gatattctta tttttattct cttaagtctt 7440
tattaacttt ggagagagaa atgatgcatc tttttatttt aaatgaagta gatcaacatg 7500
gtggaacaaa atgataaaga acagaaaaca tttcaatata ttactaataa ctttttccaa 7560
tataaatcct aaaattccta taacatagta ttttacagtt ttatgaagct ttctattgtg 7620
acttttatgg aattaagaga tgaagaagat gagatatttt agcatttata tttttcaaaa 7680
ttatatgtat acttaaaaat aaagtaactt tatgca 7716
<![CDATA[ <210> 794]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 794]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu
20
<![CDATA[ <210> 795]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 795]]>
His His Thr Ile Tyr Ile Gly Val His Val Pro Lys Ser Tyr Arg Arg
1 5 10 15
Arg Arg Arg His Lys Arg Lys Thr Gly His Lys Glu Lys Lys Glu Lys
20 25 30
Glu Arg Ile Ser Glu Asn Tyr Ser Asp Lys Ser Asp Ile Glu Asn Ala
35 40 45
Asp Glu Ser Ser Ser Ser Ile Leu Lys Pro Leu
50 55
<![CDATA[ <210> 796]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 796]]>
Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu Gly Glu Glu Asp Asp
1 5 10 15
Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu Asp Glu Leu Leu Ala
20 25 30
Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr Ala
35 40
<![CDATA[ <210> 797]]>
<![CDATA[ <211> 53]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 797]]>
Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg Trp Ser
1 5 10 15
Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu Leu Arg
20 25 30
Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala Ser Ser
35 40 45
Leu Pro Gln Leu Val
50
<![CDATA[ <210> 798]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 798]]>
Met Ile Val Asp His Gln Ile Glu Thr Gly Leu Leu Lys Pro Glu Leu
1 5 10 15
Lys Asp Lys Val Thr Tyr Thr Leu Leu Arg Lys His His Arg His Gln Thr
20 25 30
Lys Lys Ser Asn Leu Arg Ser Leu Ala Asp Ile Gly Lys Thr Val Ser
35 40 45
Ser Ala Ser Arg Met Phe Thr Asn Pro Asp Asn
50 55
<![CDATA[ <210> 799]]>
<![CDATA[ <211> 25]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 799]]>
Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser Ser Ser Leu Asn Asp
1 5 10 15
Ile Ser Asp Lys Pro Glu Lys Asp Gln
20 25
<![CDATA[ <210> 800]]>
<![CDATA[ <211> 52]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 800]]>
Leu Lys Asn Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser
1 5 10 15
Asn Val Leu Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala
20 25 30
Phe Val Arg Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val
35 40 45
Pro Val Pro Thr
50
<![CDATA[ <210> 801]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 801]]>
Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser Tyr His
1 5 10 15
Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu
20 25
<![CDATA[ <210> 802]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 802]]>
Val Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile
1 5 10 15
Ala Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly
20 25 30
Glu Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser
35 40 45
Ser Asp Lys
50
<![CDATA[ <210> 803]]>
<![CDATA[ <211> 37]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 803]]>
Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn Gly Asp Thr
1 5 10 15
Pro His Asp Gly Gly His Gly Gly Gly Gly Gly His Gly Asp Cys Glu Glu
20 25 30
Leu Gln Arg Thr Gly
35
<![CDATA[ <210> 804]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 804]]>
Phe Cys Gly Gly Leu Ile Lys Asp Ile Lys Arg Lys Ala Pro Phe Phe
1 5 10 15
Ala Ser Asp Phe Tyr Asp Ala Leu Asn Ile Gln Ala Leu Ser Ala Ile
20 25 30
Leu Phe Ile Tyr Leu Ala Thr Val Thr Asn Ala Ile Thr Phe Gly Gly
35 40 45
Leu Leu Gly Asp Ala Thr Asp Asn Met Gln
50 55
<![CDATA[ <210> 805]]>
<![CDATA[ <211> 44]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 805]]>
Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly Ala Ile Phe
1 5 10 15
Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser Thr Gly Pro
20 25 30
Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser
35 40
<![CDATA[ <210> 806]]>
<![CDATA[ <211> 90]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 806]]>
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
1 5 10 15
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
20 25 30
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
35 40 45
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
50 55 60
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
65 70 75 80
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro
85 90
<![CDATA[ <210> 807]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 807]]>
Asn Ile Ser Ile Ser Asn Asp Thr Thr Leu Ala Pro Glu Tyr Leu Pro
1 5 10 15
Thr Met Ser Ser Thr Asp Met
20
<![CDATA[ <210> 808]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 808]]>
Tyr His Asn Thr Thr Phe Asp Trp Ala Phe Leu Ser Lys Lys Glu Cys
1 5 10 15
Ser Lys Tyr Gly Gly Asn Leu Val Gly Asn Asn Cys Asn Phe Val Pro
20 25 30
Asp Ile Thr Leu Met Ser Phe Ile Leu Phe Leu Gly Thr Tyr Thr Ser
35 40 45
Ser Met Ala Leu Lys Lys Lys Phe Lys Thr Ser Pro Tyr Phe Pro Thr Thr
50 55 60
<![CDATA[ <210> 809]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 809]]>
Ala Arg Lys Leu Ile Ser Asp Phe Ala Ile Ile Leu Ser Ile Leu Ile
1 5 10 15
Phe Cys Val Ile Asp Ala Leu Val Gly Val Asp Thr Pro Lys Leu Ile
20 25 30
Val Pro Ser Glu Phe Lys
35
<![CDATA[ <210> 810]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 810]]>
Pro Thr Ser Pro Asn Arg Gly Trp Phe Val Pro Pro Phe Gly Glu Asn
1 5 10 15
Pro Trp Trp Val Cys Leu Ala Ala Ala Ile Pro Ala Leu Leu Val Thr
20 25 30
Ile Leu Ile Phe Met Asp Gln Gln Ile Thr Ala Val Ile Val Asn Arg
35 40 45
Lys Glu His Lys Leu Lys
50
<![CDATA[ <210> 811]]>
<![CDATA[ <211> 59]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 811]]>
Lys Gly Ala Gly Tyr His Leu Asp Leu Phe Trp Val Ala Ile Leu Met
1 5 10 15
Val Ile Cys Ser Leu Met Ala Leu Pro Trp Tyr Val Ala Ala Thr Val
20 25 30
Ile Ser Ile Ala His Ile Asp Ser Leu Lys Met Glu Thr Glu Thr Ser
35 40 45
Ala Pro Gly Glu Gln Pro Lys Phe Leu Gly Val
50 55
<![CDATA[ <210> 812]]>
<![CDATA[ <211> 24]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 812]]>
Glu Gln Arg Val Thr Gly Thr Leu Val Phe Ile Leu Thr Gly Leu Ser
1 5 10 15
Val Phe Met Ala Pro Ile Leu Lys
20
<![CDATA[ <210> 813]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 813]]>
Phe Ile Pro Met Pro Val Leu Tyr Gly Val Phe Leu Tyr Met Gly Val
1 5 10 15
Ala Ser Leu Asn Gly Val Gln
20
<![CDATA[ <210> 814]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 814]]>
Phe Met Asp Arg Leu Lys Leu Leu Leu Met Pro Leu Lys His Gln Pro
1 5 10 15
Asp Phe Ile Tyr Leu Arg His Val Pro Leu Arg Arg Val His Leu Phe
20 25 30
Thr Phe Leu Gln Val Leu Cys Leu Ala Leu Leu Trp Ile Leu Lys Ser
35 40 45
Thr Val Ala Ala Ile Ile Phe Pro Val Met
50 55
<![CDATA[ <210> 815]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 815]]>
Ile Leu Ala Leu Val Ala Val Arg Lys Gly Met Asp Tyr Leu Phe Ser
1 5 10 15
Gln His Asp Leu Ser Phe Leu Asp Asp Val Ile Pro Glu Lys Asp Lys
20 25 30
Lys Lys Lys Glu Asp Glu Lys Lys Lys Lys Lys Lys Lys Lys Lys Gly Ser Leu
35 40 45
Asp Ser Asp Asn Asp Asp
50
<![CDATA[ <210> 816]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 816]]>
Ser Asp Cys Pro Tyr Ser Glu Lys Val Pro Ser Ile Lys Ile Pro Met
1 5 10 15
Asp Ile Met Glu Gln Gln Pro Phe Leu Ser Asp Ser Lys Pro Ser Asp
20 25 30
<![CDATA[ <210> 817]]>
<![CDATA[ <211> 13]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 817]]>
Glu Arg Ser Pro Thr Phe Leu Glu Arg His Thr Ser Cys
1 5 10
<![CDATA[ <210> 818]]>
<![CDATA[ <211> 0]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Skip sequence]]>
<![CDATA[ <400> 818]]>
000
<![CDATA[ <210> 819]]>
<![CDATA[ <211> 1079]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 819]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu Gly His His Thr Ile Tyr Ile Gly
20 25 30
Val His Val Pro Lys Ser Tyr Arg Arg Arg Arg Arg Arg His Lys Arg Lys
35 40 45
Thr Gly His Lys Glu Lys Lys Glu Lys Glu Arg Ile Ser Glu Asn Tyr
50 55 60
Ser Asp Lys Ser Asp Ile Glu Asn Ala Asp Glu Ser Ser Ser Ser Ser Ile
65 70 75 80
Leu Lys Pro Leu Ile Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu
85 90 95
Gly Glu Glu Asp Asp Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu
100 105 110
Asp Glu Leu Leu Ala Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr
115 120 125
Ala Arg Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg
130 135 140
Trp Ser Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu
145 150 155 160
Leu Arg Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala
165 170 175
Ser Ser Leu Pro Gln Leu Val Glu Met Ile Val Asp His Gln Ile Glu
180 185 190
Thr Gly Leu Leu Lys Pro Glu Leu Lys Asp Lys Val Thr Tyr Thr Leu
195 200 205
Leu Arg Lys His Arg His Gln Thr Lys Lys Ser Asn Leu Arg Ser Leu
210 215 220
Ala Asp Ile Gly Lys Thr Val Ser Ser Ala Ser Arg Met Phe Thr Asn
225 230 235 240
Pro Asp Asn Gly Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser
245 250 255
Ser Leu Asn Asp Ile Ser Asp Lys Pro Glu Lys Asp Gln Leu Lys Asn
260 265 270
Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser Asn Val Leu
275 280 285
Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala Phe Val Arg
290 295 300
Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val Pro Val Pro
305 310 315 320
Thr Arg Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser
325 330 335
Tyr His Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu Val
340 345 350
Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile Ala
355 360 365
Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly Glu
370 375 380
Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser Ser
385 390 395 400
Asp Lys Arg Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn
405 410 415
Gly Asp Thr Pro His Asp Gly Gly His Gly Gly Gly Gly His Gly Asp
420 425 430
Cys Glu Glu Leu Gln Arg Thr Gly Arg Phe Cys Gly Gly Leu Ile Lys
435 440 445
Asp Ile Lys Arg Lys Ala Pro Phe Phe Ala Ser Asp Phe Tyr Asp Ala
450 455 460
Leu Asn Ile Gln Ala Leu Ser Ala Ile Leu Phe Ile Tyr Leu Ala Thr
465 470 475 480
Val Thr Asn Ala Ile Thr Phe Gly Gly Leu Leu Gly Asp Ala Thr Asp
485 490 495
Asn Met Gln Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly
500 505 510
Ala Ile Phe Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser
515 520 525
Thr Gly Pro Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser Lys
530 535 540
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
545 550 555 560
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
565 570 575
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
580 585 590
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
595 600 605
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
610 615 620
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro Ala Asn Ile Ser Ile Ser
625 630 635 640
Asn Asp Thr Thr Leu Ala Pro Glu Tyr Leu Pro Thr Met Ser Ser Ser Thr
645 650 655
Asp Met Tyr His Asn Thr Thr Phe Asp Trp Ala Phe Leu Ser Lys Lys
660 665 670
Glu Cys Ser Lys Tyr Gly Gly Asn Leu Val Gly Asn Asn Cys Asn Phe
675 680 685
Val Pro Asp Ile Thr Leu Met Ser Phe Ile Leu Phe Leu Gly Thr Tyr
690 695 700
Thr Ser Ser Met Ala Leu Lys Lys Phe Lys Thr Ser Pro Tyr Phe Pro
705 710 715 720
Thr Thr Ala Arg Lys Leu Ile Ser Asp Phe Ala Ile Ile Leu Ser Ile
725 730 735
Leu Ile Phe Cys Val Ile Asp Ala Leu Val Gly Val Asp Thr Pro Lys
740 745 750
Leu Ile Val Pro Ser Glu Phe Lys Pro Thr Ser Pro Asn Arg Gly Trp
755 760 765
Phe Val Pro Pro Phe Gly Glu Asn Pro Trp Trp Val Cys Leu Ala Ala
770 775 780
Ala Ile Pro Ala Leu Leu Val Thr Ile Leu Ile Phe Met Asp Gln Gln
785 790 795 800
Ile Thr Ala Val Ile Val Asn Arg Lys Glu His Lys Leu Lys Lys Gly
805 810 815
Ala Gly Tyr His Leu Asp Leu Phe Trp Val Ala Ile Leu Met Val Ile
820 825 830
Cys Ser Leu Met Ala Leu Pro Trp Tyr Val Ala Ala Thr Val Ile Ser
835 840 845
Ile Ala His Ile Asp Ser Leu Lys Met Glu Thr Glu Thr Ser Ala Pro
850 855 860
Gly Glu Gln Pro Lys Phe Leu Gly Val Arg Glu Gln Arg Val Thr Gly
865 870 875 880
Thr Leu Val Phe Ile Leu Thr Gly Leu Ser Val Phe Met Ala Pro Ile
885 890 895
Leu Lys Phe Ile Pro Met Pro Val Leu Tyr Gly Val Phe Leu Tyr Met
900 905 910
Gly Val Ala Ser Leu Asn Gly Val Gln Phe Met Asp Arg Leu Lys Leu
915 920 925
Leu Leu Met Pro Leu Lys His Gln Pro Asp Phe Ile Tyr Leu Arg His
930 935 940
Val Pro Leu Arg Arg Val His Leu Phe Thr Phe Leu Gln Val Leu Cys
945 950 955 960
Leu Ala Leu Leu Trp Ile Leu Lys Ser Thr Val Ala Ala Ile Ile Phe
965 970 975
Pro Val Met Ile Leu Ala Leu Val Ala Val Arg Lys Gly Met Asp Tyr
980 985 990
Leu Phe Ser Gln His Asp Leu Ser Phe Leu Asp Asp Val Ile Pro Glu
995 1000 1005
Lys Asp Lys Lys Lys Lys Lys Glu Asp Glu Lys Lys Lys Lys Lys Lys Lys
1010 1015 1020
Lys Gly Ser Leu Asp Ser Asp Asn Asp Asp Ser Asp Cys Pro Tyr
1025 1030 1035
Ser Glu Lys Val Pro Ser Ile Lys Ile Pro Met Asp Ile Met Glu
1040 1045 1050
Gln Gln Pro Phe Leu Ser Asp Ser Lys Pro Ser Asp Arg Glu Arg
1055 1060 1065
Ser Pro Thr Phe Leu Glu Arg His Thr Ser Cys
1070 1075
<![CDATA[ <210> 820]]>
<![CDATA[ <211> 2047]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 820]]>
ggcggcgcgc cgggcagcgc ttcggtggcg gcggcggccg cggtggcagc gaaggcggcg 60
gcggcggcgg cagtggcagt ggccgctgca gccccaacact ccgccgccaa actggaggag 120
cgacggaagc cagaccccag gaggatggag gatgaagctg tcctggacag aggggcttcc 180
ttcctcaagc atgtgtgtga tgaagaagaa gtagaaggcc accataccat ttacatcgga 240
gtccatgtgc cgaagagtta caggagaagg agacgtcaca agagaaagac agggcacaaa 300
gaaaagaagg aaaaggagag aatctctgag aactactctg acaaatcaga tattgaaaat 360
gctgatgaat ccagcagcag catcctaaaa cctctcatct ctcctgctgc agaacgcatc 420
cgattcatct tgggagagga ggatgacagc ccagctcccc ctcagctctt cacggaactg 480
gatgagctgc tggccgtgga tgggcaggag atggagtgga aggaaacagc caggtggatc 540
aagtttgaag aaaaagtgga acagggtggg gaaagatgga gcaagcccca tgtggccaca 600
ttgtcccttc atagtttatt tgagctgagg acatgtatgg agaaaggatc catcatgctt 660
gatcgggagg cttcttctct cccacagttg gtggagatga ttgttgacca tcagattgag 720
acaggcctat tgaaacctga acttaaggat aaggtgacct atactttgct ccggaagcac 780
cggcatcaaa ccaagaaatc caaccttcgg tccctggctg aattgggaa gacagtctcc 840
agtgcaagta ggatgtttac caaccctgat aatggtagcc cagccatgac ccataggaat 900
ctgacttcct ccagtctgaa tgacatttct gataaaccgg agaaggacca gctgaagaat 960
aagttcatga aaaaattgcc acgtgatgca gaagcttcca acgtgcttgt tggggaggtt 1020
gactttttgg atactccttt cattgccttt gttaggctac agcaggctgt catgctgggt 1080
gccctgactg aagttcctgt gcccacaagg ttcttgttca ttctcttagg tcctaagggg 1140
aaagccaagt cctaccacga gattggcaga gccattgcca ccctgatgtc tgatgaggtg 1200
ttccatgaca ttgcttataa agcaaaagac aggcacgacc tgattgctgg tattgatgag 1260
ttcctagatg aagtcatcgt ccttccacct ggggaatggg atccagcaat taggatagag 1320
cctcctaaga gtcttccatc ctctgacaaa agaaagaata tgtactcagg tggagagaat 1380
gttcagatga atggggatac gccccatgat ggaggtcacg gaggaggagg acatggggat 1440
tgtgaagaat tgcagcgaac tggacggttc tgtggtggac taattaaaga cataaagagg 1500
aaagcgccat tttttgccag tgatttttat gatgctttaa atattcaagc tctttcggca 1560
attctcttca tttatctggc aactgtaact aatgctatca cttttggagg actgcttggg 1620
gatgccactg acaacatgca gggcgtgttg gagagtttcc tgggcactgc tgtctctgga 1680
gccatctttt gcctttttgc tggtcaacca ctcactattc tgagcagcac cggacctgtc 1740
ctagtttttg agaggcttct atttaatttc agcaaggaca ataattttga ctatttggag 1800
tttcgccttt ggattggcct gtggtccgcc ttcctatgtc tcattttggt agccactgat 1860
gccagcttct tggttcaata cttcacacgt ttcacggagg agggcttttc ctctctgatt 1920
agcttcatct ttatctatga tgctttcaag aagatgatca agcttgcaga ttactaccccc 1980
atcaactcca acttcaaagt gggctacaac actctctttt cctgtacctg tgtgccacct 2040
gacccag 2047
<![CDATA[ <210> 821]]>
<![CDATA[ <211> 634]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 821]]>
Met Glu Asp Glu Ala Val Leu Asp Arg Gly Ala Ser Phe Leu Lys His
1 5 10 15
Val Cys Asp Glu Glu Glu Val Glu Gly His His Thr Ile Tyr Ile Gly
20 25 30
Val His Val Pro Lys Ser Tyr Arg Arg Arg Arg Arg Arg His Lys Arg Lys
35 40 45
Thr Gly His Lys Glu Lys Lys Glu Lys Glu Arg Ile Ser Glu Asn Tyr
50 55 60
Ser Asp Lys Ser Asp Ile Glu Asn Ala Asp Glu Ser Ser Ser Ser Ser Ile
65 70 75 80
Leu Lys Pro Leu Ile Ser Pro Ala Ala Glu Arg Ile Arg Phe Ile Leu
85 90 95
Gly Glu Glu Asp Asp Ser Pro Ala Pro Pro Gln Leu Phe Thr Glu Leu
100 105 110
Asp Glu Leu Leu Ala Val Asp Gly Gln Glu Met Glu Trp Lys Glu Thr
115 120 125
Ala Arg Trp Ile Lys Phe Glu Glu Lys Val Glu Gln Gly Gly Glu Arg
130 135 140
Trp Ser Lys Pro His Val Ala Thr Leu Ser Leu His Ser Leu Phe Glu
145 150 155 160
Leu Arg Thr Cys Met Glu Lys Gly Ser Ile Met Leu Asp Arg Glu Ala
165 170 175
Ser Ser Leu Pro Gln Leu Val Glu Met Ile Val Asp His Gln Ile Glu
180 185 190
Thr Gly Leu Leu Lys Pro Glu Leu Lys Asp Lys Val Thr Tyr Thr Leu
195 200 205
Leu Arg Lys His Arg His Gln Thr Lys Lys Ser Asn Leu Arg Ser Leu
210 215 220
Ala Asp Ile Gly Lys Thr Val Ser Ser Ala Ser Arg Met Phe Thr Asn
225 230 235 240
Pro Asp Asn Gly Ser Pro Ala Met Thr His Arg Asn Leu Thr Ser Ser
245 250 255
Ser Leu Asn Asp Ile Ser Asp Lys Pro Glu Lys Asp Gln Leu Lys Asn
260 265 270
Lys Phe Met Lys Lys Leu Pro Arg Asp Ala Glu Ala Ser Asn Val Leu
275 280 285
Val Gly Glu Val Asp Phe Leu Asp Thr Pro Phe Ile Ala Phe Val Arg
290 295 300
Leu Gln Gln Ala Val Met Leu Gly Ala Leu Thr Glu Val Pro Val Pro
305 310 315 320
Thr Arg Phe Leu Phe Ile Leu Leu Gly Pro Lys Gly Lys Ala Lys Ser
325 330 335
Tyr His Glu Ile Gly Arg Ala Ile Ala Thr Leu Met Ser Asp Glu Val
340 345 350
Phe His Asp Ile Ala Tyr Lys Ala Lys Asp Arg His Asp Leu Ile Ala
355 360 365
Gly Ile Asp Glu Phe Leu Asp Glu Val Ile Val Leu Pro Pro Gly Glu
370 375 380
Trp Asp Pro Ala Ile Arg Ile Glu Pro Pro Lys Ser Leu Pro Ser Ser
385 390 395 400
Asp Lys Arg Lys Asn Met Tyr Ser Gly Gly Glu Asn Val Gln Met Asn
405 410 415
Gly Asp Thr Pro His Asp Gly Gly His Gly Gly Gly Gly His Gly Asp
420 425 430
Cys Glu Glu Leu Gln Arg Thr Gly Arg Phe Cys Gly Gly Leu Ile Lys
435 440 445
Asp Ile Lys Arg Lys Ala Pro Phe Phe Ala Ser Asp Phe Tyr Asp Ala
450 455 460
Leu Asn Ile Gln Ala Leu Ser Ala Ile Leu Phe Ile Tyr Leu Ala Thr
465 470 475 480
Val Thr Asn Ala Ile Thr Phe Gly Gly Leu Leu Gly Asp Ala Thr Asp
485 490 495
Asn Met Gln Gly Val Leu Glu Ser Phe Leu Gly Thr Ala Val Ser Gly
500 505 510
Ala Ile Phe Cys Leu Phe Ala Gly Gln Pro Leu Thr Ile Leu Ser Ser
515 520 525
Thr Gly Pro Val Leu Val Phe Glu Arg Leu Leu Phe Asn Phe Ser Lys
530 535 540
Asp Asn Asn Phe Asp Tyr Leu Glu Phe Arg Leu Trp Ile Gly Leu Trp
545 550 555 560
Ser Ala Phe Leu Cys Leu Ile Leu Val Ala Thr Asp Ala Ser Phe Leu
565 570 575
Val Gln Tyr Phe Thr Arg Phe Thr Glu Glu Gly Phe Ser Ser Leu Ile
580 585 590
Ser Phe Ile Phe Ile Tyr Asp Ala Phe Lys Lys Met Ile Lys Leu Ala
595 600 605
Asp Tyr Tyr Pro Ile Asn Ser Asn Phe Lys Val Gly Tyr Asn Thr Leu
610 615 620
Phe Ser Cys Thr Cys Val Pro Pro Asp Pro
625 630
<![CDATA[ <210> 822]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 822]]>
atgtgtccaa caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa 60
cggaggtcct ggcag 75
<![CDATA[ <210> 823]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 823]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[ <210> 824]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 824]]>
atgtgtccaa caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa 60
cggaggtcct ggcagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[ <210> 825]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 825]]>
Val Ser Asn Lys Val Ser Met Ser Ser Thr Val Gln Gly Ser Asn Ile
1 5 10 15
Phe Glu Arg Thr Glu Val Leu Ala Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[ <210> 826]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 826]]>
acaggaagca ccctaatgag gagtatgcca acgtgggcac cggggagctg gcagcggagg 60
tgctcatcca gcaag 75
<![CDATA[ <210> 827]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 827]]>
ctacatctac atccaccact gggacaagcc atcttgtaaa atgtgcggag aaggagaaaa 60
ctttctgtgtgaatg 75
<![CDATA[ <210> 828]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 828]]>
acaggaagca ccctaatgag gagtatgcca acgtgggcac cggggagctg gcagcggagg 60
tgctcatcca gcaagctaca tctacatcca ccactgggac aagccatctt gtaaaatgtg 120
cggagaagga gaaaactttc tgtgtgaatg 150
<![CDATA[ <210> 829]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 829]]>
Arg Lys His Pro Asn Glu Glu Tyr Ala Asn Val Gly Thr Gly Glu Leu
1 5 10 15
Ala Ala Glu Val Leu Ile Gln Gln Ala Thr Ser Thr Ser Thr Thr Gly
20 25 30
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
35 40 45
Asn
<![CDATA[ <210> 830]]>
<![CDATA[ <211> 187]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 830]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcag 187
<![CDATA[ <210> 831]]>
<![CDATA[ <211> 177]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 831]]>
aaaacacggc catctttatg tgtaaatgtt gtaacctctt ctcaccaaat cagtcggaac 60
tcctctccca cgtttcagag aagcacatgg aagaaggggt taatgttgat gagattatta 120
ttccccttag gccctctgagt acacctgaac cccccaactc aagcaaaacc ggagatg 177
<![CDATA[ <210> 832]]>
<![CDATA[ <211> 252]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 832]]>
agtttttggt catgaagagg aagagaggca ggcctaaggg gtccacgaag aagtccagca 60
cagaagagga gctggcagaa aacatcgtga gtccgactga ggacagcccg ctggctccgg 120
aggaagggaa cagcctgcct ccaagcagct tggagtgtag caagtgctgt cggaagttct 180
ccaacacgcg ccagctgcgg aagcacatct gcatttatcgt gctgaatttg ggtgaggagg 240
aaggagaagc ag 252
<![CDATA[ <210> 833]]>
<![CDATA[ <211> 186]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 833]]>
gtaacgagtc tgaccttgaa ctagaaaaga agtgtaagga agatgatcgg gaaaaagcct 60
cgaaaagacc acggtcacag aaaacagaga aagtccagaa gatctcagga aaggaggcca 120
gacagctttc tggggcgaag aaacccatca taagtgtggt tttaactgca cacgaagcaa 180
ttccag 186
<![CDATA[ <210> 834]]>
<![CDATA[ <211> 151]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 834]]>
gtgctaccaa gattgtgcca gtggaggctg ggccccctga aacaggagct acaaattctg 60
agaccacttc agcagacctg gtgcctcgga gaggctacca ggaatacgcc attcagcaga 120
caccttatga gcaaccaatg aagtcaagca g 151
<![CDATA[ <210> 835]]>
<![CDATA[ <211> 1457]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 835]]>
gctaggtccc actcagctca aaatcttcac ttgtgaatac tgcaacaagg tcttcaagtt 60
caagcactcg ctgcaggccc acctgaggat ccacaccaat gaaaagccat acaagtgccc 120
ccagtgcagc tatgccagtg ccatcaaggc caacctcaat gtgcacctgc gcaagcacac 180
tggagagaag ttcgcctgcg actattgctc gttcacctgc ctgagcaagg gccacctcaa 240
ggtgcacatc gagcgagtgc acaagaagat caagcagcac tgccgcttct gcaagaagaa 300
gtactctgac gtcaagaacc tcatcaagca catccgagac gcgcatgacc cacaggaca 360
gaaggtcaaa gaggccttgg acgagctctg cctgatgacg aggggagggca agcggcagct 420
gctctatgac tgccacatct gtgagcgcaa gttcaagaac gagctggacc gtgaccgcca 480
tatgctggtc cacggagaca agtggccttt tgcctgtgag ctctgtggcc atggggccac 540
caagtaccag gcgctggaac tgcatgtcag gaagcacccc ttcgtgtacg tctgtgccgt 600
ctgccgcaag aagttcgtca gctccatcag gctgcgcacc cacatcaaag aggtgcacgg 660
ggctgcccag gaggccttgg tcttcaccag ttccatcaac cagagcttct gcctcctgga 720
acctggtggg gacatccagc aagaagctct gggggaccag ctacagctgg tggaagagga 780
gtttgccctc cagggcgtga atgcactcaa ggaagaggcc tgtcctgggg acactcagct 840
ggagggagggc cggaaggagc cggaggcccc tggggaaatg cctgccccag ctgtgcacct 900
ggcctccccg caggccgaaa gcacagccct gccaccctgt gagctggaaa ccaccgtggt 960
ctcctcctca gacctgcatt ctcaagaggt ggtttcagat gattttttgt tgaaaaatga 1020
tacctcctcc gcagaggctc atgctgctcc tgagaagccc ccagacatgc agcacagaag 1080
ctcagtccag acgcaaggtg aagtgatcac actactgctg tccaaggccc agagtgctgg 1140
gtcagatcag gaaagccatg gcgccccagag ccccctaggg gaagggcaga acatggctgt 1200
gctttcagct ggtgacccag atcccagcag gtgtctcagg tcaaacccag ctgaggcctc 1260
agacctcctc cctccagtag ctggtggtgg ggacaccatc acacatcagc ctgactcttg 1320
caaagctgcc cctgagcacc ggtcaggcat caccgctttc atgaaggtcc tgaacagttt 1380
acagaagaag caaatgaaca ccagcttgtg tgagcggatc cggaaggttt atggagacct 1440
ggagtgtgaa tactgtg 1457
<![CDATA[ <210> 836]]>
<![CDATA[ <211> 233]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 836]]>
gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac acccgggaac 60
atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc cttaaacgcc 120
acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc tgtgactact 180
caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca ctg 233
<![CDATA[ <210> 837]]>
<![CDATA[ <211> 88]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 837]]>
gacaaaagga gttattcttg tcctgtttgt gaaaagtctt tttcagagga tcgattgata 60
aagtcacata tcaagaccaa ccatcctg 88
<![CDATA[ <210> 838]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 838]]>
aggtctccat gagcaccatt tctgaggttc tcgggaggag ggttcagctg aaagggctaa 60
ttggaaagag agccatgaaa tgcccatatt gtgactttta tttcatgaag aatggctcag 120
accttcagcg tcatatttgg gctcatgaag 150
<![CDATA[ <210> 839]]>
<![CDATA[ <211> 174]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 839]]>
gtgtgaagcc cttcaagtgt tctttgtgtg agtatgcaac tcgtagcaag agtaacctca 60
aggctcatat gaatcgtcac agcactgaga aaacccacct atgtgacatg tgtggcaaga 120
aattcaaatc aaaagggaca ctgaaaagtc acaaactcct tcacactgca gatg 174
<![CDATA[ <210> 840]]>
<![CDATA[ <211> 89]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 840]]>
ggaagcagtt taagtgcacg gtgtgtgact acacagcggc ccagaagcca cagctgctgc 60
ggcacatgga acagcatgtc tccttcaag 89
<![CDATA[ <210> 841]]>
<![CDATA[ <211> 139]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 841]]>
cctttccgct gtgcccattg ccattactcc tgcaacatat ctggctctct gaagcggcac 60
tacaacagga agcaccttaa tgaggagtat gccaacgtgg gcaccgggga gctggcagcg 120
gaggtgctca tccagcaag 139
<![CDATA[ <210> 842]]>
<![CDATA[ <211> 119]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 842]]>
gtggtttgaa gtgtcctgtt tgcagctttg tatatggcac caaatggggag ttcaataggc 60
acttgaagaa caaacatggc ttgaaggtgg tggaaattga tggagacccc aagtggggag 119
<![CDATA[ <210> 843]]>
<![CDATA[ <211> 127]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 843]]>
acagcaacag aagctcctga ggagccctcc accccagtatc tccacatcac agaggccgaa 60
gaagacgttc aagggacaca ggcagcggtg gccgcgctcc aggacctgag atacacctct 120
gagagtg 127
<![CDATA[ <210> 844]]>
<![CDATA[ <211> 131]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 844]]>
gcgaccgact ggaccccacg gccgtgaaca tcctgcagca gatcattgag ctgggcgccg 60
agacccatga cgccactgcc cttgcctcgg tggttgccat ggcaccaggg acggtgactg 120
tggttaagca g 131
<![CDATA[ <210> 845]]>
<![CDATA[ <211> 934]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 845]]>
gtcaccgagg aggagcccag ctccaaccac acggtcatga tccaggagac ggtccagcaa 60
gcgtccgtgg agcttgccga gcagcaccac ctggtggtgt cctccgacga cgtggagggc 120
attgagacgg tgactgtcta cacgcagggc ggggaggcct cggagttcat cgtctacgtg 180
caggaggcca tgcagcctgt ggaggagcag gctgtggagc agccggccca ggaactctag 240
aggacatgtg gcatcggatg gccacagggc ggggctgcca ggctctgcag gcacccaggg 300
tggggaggcc acccttcctg ccctacccgc agaatggtgc tctcctttgc cctccctgcc 360
cagcagcctg ataggactct cctagtccaa cttggggtgg gcaaggcagt cagcatcacc 420
agcaatacca caggacccctc accccagcat agacacacac cccctgaccc ttaccatctg 480
cttcctgaaa gacttcagtg tcagctcccc tacacacacc ccaacaccttc accccttgct 540
tcaagattca aacagagact cccagtcccc ctcagcatct tccctgaatc acagccccag 600
ctccttgacc cccatctagg tgccaaatgt tcatctgcaa ccgctatgca gtctggtgag 660
aggggagacag ccatcacata gaaagtgacc gtacgggttt ttaatcactg ctgggtgggg 720
tgggggtagg gggattgtcc tggctttgtc gacaaagtcc cacttccccg agtattaagg 780
gcccttggta tcaagtgagg taaattcacc catcacagggg tctcgcccta ccatcctgga 840
attatttcac ttttaagata aatgcactat ttcactgttc gcctcccatt ctaaggaggt 900
gaggtggttg gaataaaaac agttcctgtc tgaa 934
<![CDATA[ <210> 846]]>
<![CDATA[ <211> 4594]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> The complete mRNA polynucleotide sequence of ZFAT]]>
<![CDATA[ <400> 846]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcagaaa acacggccat ctttatgtgt aaatgttgta acctcttctc accaaatcag 240
tcggaactcc tctcccacgt ttcagagaag cacatggaag aaggggttaa tgttgatgag 300
attattattc cccttaggcc tctgagtaca cctgaaccccc ccaactcaag caaaaccgga 360
gatgagtttt tggtcatgaa gaggaagaga ggcaggccta aggggtccac gaagaagtcc 420
agcacagaag aggagctggc agaaaacatc gtgagtccga ctgaggacag cccgctggct 480
ccggaggaag ggaacagcct gcctccaagc agcttggagt gtagcaagtg ctgtcggaag 540
ttctccaaca cgcgccagct gcggaagcac atctgcatta tcgtgctgaa tttgggtgag 600
gaggaaggag aagcaggtaa cgagtctgac cttgaactag aaaagaagtg taaggaagat 660
gatcgggaaa aagcctcgaa aagaccacgg tcacagaaaa cagagaaagt ccagaagatc 720
tcaggaaagg aggccagaca gctttctggg gcgaagaaac ccatcataag tgtggtttta 780
actgcacacg aagcaattcc aggtgctacc aagattgtgc cagtggaggc tgggccccct 840
gaaacaggag ctacaaattc tgagaccact tcagcagacc tggtgcctcg gagaggctac 900
caggaatacg ccattcagca gacaccttat gagcaaccaa tgaagtcaag caggctaggt 960
cccactcagc tcaaaatctt cacttgtgaa tactgcaaca aggtcttcaa gttcaagcac 1020
tcgctgcagg cccacctgag gatccacacc aatgaaaagc catacaagtg cccccagtgc 1080
agctatgcca gtgccatcaa ggccaacctc aatgtgcacc tgcgcaagca cactggagag 1140
aagttcgcct gcgactattg ctcgttcacc tgcctgagca agggccacct caaggtgcac 1200
atcgagcgag tgcacaagaa gatcaagcag cactgccgct tctgcaagaa gaagtactct 1260
gacgtcaaga acctcatcaa gcacatccga gacgcgcatg accccacagga caagaaggtc 1320
aaagaggcct tggacgagct ctgcctgatg acgagggagg gcaagcggca gctgctctat 1380
gactgccaca tctgtgagcg caagttcaag aacgagctgg accgtgaccg ccatatgctg 1440
gtccacggag acaagtggcc ttttgcctgt gagctctgtg gccatggggc caccaagtac 1500
caggcgctgg aactgcatgt caggaagcac cccttcgtgt acgtctgtgc cgtctgccgc 1560
aagaagttcg tcagctccat caggctgcgc acccacatca aagaggtgca cggggctgcc 1620
caggaggcct tggtcttcac cagttccatc aaccagagct tctgcctcct ggaacctggt 1680
ggggacatcc agcaagaagc tctgggggac cagctacagc tggtggaaga ggagtttgcc 1740
ctccagggcg tgaatgcact caaggaagag gcctgtcctg gggacactca gctggaggag 1800
ggccggaagg agccggaggc ccctggggaa atgcctgccc cagctgtgca cctggcctcc 1860
ccgcaggccg aaagcacagc cctgccaccc tgtgagctgg aaaccaccgt ggtctcctcc 1920
tcagacctgc attctcaaga ggtggtttca gatgattttttgttgaaaaa tgatacctcc 1980
tccgcagagg ctcatgctgc tcctgagaag cccccagaca tgcagcacag aagctcagtc 2040
cagacgcaag gtgaagtgat cacactactg ctgtccaagg cccagagtgc tgggtcagat 2100
caggaaagcc atggcgccca gagcccccta ggggaagggc agaacatggc tgtgctttca 2160
gctggtgacc cagatccccag caggtgtctc aggtcaaacc cagctgaggc ctcagacctc 2220
ctccctccag tagctggtgg tggggacacc atcacacatc agcctgactc ttgcaaagct 2280
gcccctgagc accggtcagg catcaccgct ttcatgaagg tcctgaacag tttacagaag 2340
aagcaaatga acaccagctt gtgtgagcgg atccggaagg tttatggaga cctggagtgt 2400
gaatactgtg gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac 2460
acccgggaac atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc 2520
cttaaacgcc acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc 2580
tgtgactact caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca 2640
ctggacaaaa ggagttattc ttgtcctgtt tgtgaaaagt ctttttcaga ggatcgattg 2700
ataaagtcac atatcaagac caaccatcct gaggtctcca tgagcaccat ttctgaggtt 2760
ctcgggagga gggttcagct gaaagggcta attggaaaga gagccatgaa atgcccatat 2820
tgtgactttt atttcatgaa gaatggctca gaccttcagc gtcatatttg ggctcatgaa 2880
ggtgtgaagc ccttcaagtg ttctttgtgtgagtatgcaa ctcgtagcaa gagtaacctc 2940
aaggctcata tgaatcgtca cagcactgag aaaacccacc tatgtgacat gtgtggcaag 3000
aaattcaaat caaaagggac actgaaaagt cacaaactcc ttcacactgc agatgggaag 3060
cagtttaagt gcacggtgtg tgactacaca gcggcccaga agccacagct gctgcggcac 3120
atggaacagc atgtctcctt caagcctttc cgctgtgccc attgccatta ctcctgcaac 3180
atatctggct ctctgaagcg gcactacaac aggaagcacc ctaatgagga gtatgccaac 3240
gtgggcaccg gggagctggc agcggaggtg ctcatccagc aaggtggttt gaagtgtcct 3300
gtttgcagct ttgtatatgg caccaaatgg gagttcaata ggcacttgaa gaacaaacat 3360
ggcttgaagg tggtggaaat tgatggagac cccaagtggg agacagcaac agaagctcct 3420
gaggagccct ccaccccagta tctccacatc acagaggccg aagaagacgt tcaagggaca 3480
caggcagcgg tggccgcgct ccaggacctg agatacacct ctgagagtgg cgaccgactg 3540
gaccccacgg ccgtgaacat cctgcagcag atcattgagc tgggcgccga gacccatgac 3600
gccactgccc ttgcctcggt ggttgccatg gcaccaggga cggtgactgt ggttaagcag 3660
gtcaccgagg aggagcccag ctccaaccac acggtcatga tccaggagac ggtccagcaa 3720
gcgtccgtgg agcttgccga gcagcaccac ctggtggtgt cctccgacga cgtggagggc 3780
attgagacgg tgactgtcta cacgcagggc ggggaggcct cggagttcat cgtctacgtg 3840
caggaggcca tgcagcctgt ggaggagcag gctgtggagc agccggccca ggaactctag 3900
aggacatgtg gcatcggatg gccacagggc ggggctgcca ggctctgcag gcacccaggg 3960
tggggaggcc acccttcctg ccctacccgc agaatggtgc tctcctttgc cctccctgcc 4020
cagcagcctg ataggactct cctagtccaa cttggggtgg gcaaggcagt cagcatcacc 4080
agcaatacca caggacccctc accccagcat agacacacac cccctgaccc ttaccatctg 4140
cttcctgaaa gacttcagtg tcagctcccc tacacacacc ccaacaccttc accccttgct 4200
tcaagattca aacagagact cccagtcccc ctcagcatct tccctgaatc acagccccag 4260
ctccttgacc cccatctagg tgccaaatgt tcatctgcaa ccgctatgca gtctggtgag 4320
aggggagacag ccatcacata gaaagtgacc gtacgggttt ttaatcactg ctgggtgggg 4380
tgggggtagg gggattgtcc tggctttgtc gacaaagtcc cacttccccg agtattaagg 4440
gcccttggta tcaagtgagg taaattcacc catcacaggg tctcgcccta ccatcctgga 4500
attatttcac ttttaagata aatgcactat ttcactgttc gcctcccatt ctaaggaggt 4560
gaggtggttg gaataaaaac agttcctgtc tgaa 4594
<![CDATA[ <210> 847]]>
<![CDATA[ <211> 5]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 847]]>
Met Glu Thr Arg Ala
1 5
<![CDATA[ <210> 848]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 848]]>
Asn Thr Ala Ile Phe Met Cys Lys Cys Cys Asn Leu Phe Ser Pro Asn
1 5 10 15
Gln Ser Glu Leu Leu Ser His Val Ser Glu Lys His Met Glu Glu Gly
20 25 30
Val Asn Val Asp Glu Ile Ile Ile Pro Leu Arg Pro Leu Ser Thr Pro
35 40 45
Glu Pro Pro Asn Ser Ser Lys Thr Gly Asp
50 55
<![CDATA[ <210> 849]]>
<![CDATA[ <211> 83]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 849]]>
Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser Thr Lys
1 5 10 15
Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser Pro Thr
20 25 30
Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro Pro Ser
35 40 45
Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr Arg Gln
50 55 60
Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu Glu Glu Glu
65 70 75 80
Gly Glu Ala
<![CDATA[ <210> 850]]>
<![CDATA[ <211> 61]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 850]]>
Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys Cys Lys Glu Asp Asp Arg
1 5 10 15
Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln Lys Thr Glu Lys Val Gln
20 25 30
Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu Ser Gly Ala Lys Lys Pro
35 40 45
Ile Ile Ser Val Val Leu Thr Ala His Glu Ala Ile Pro
50 55 60
<![CDATA[ <210> 851]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 851]]>
Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro Glu Thr Gly Ala
1 5 10 15
Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro Arg Arg Gly Tyr
20 25 30
Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln Pro Met Lys Ser
35 40 45
Ser
<![CDATA[ <210> 852]]>
<![CDATA[ <211> 485]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> ]]> human
<![CDATA[ <400> 852]]>
Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr Cys Glu Tyr Cys Asn Lys
1 5 10 15
Val Phe Lys Phe Lys His Ser Leu Gln Ala His Leu Arg Ile His Thr
20 25 30
Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys Ser Tyr Ala Ser Ala Ile
35 40 45
Lys Ala Asn Leu Asn Val His Leu Arg Lys His Thr Gly Glu Lys Phe
50 55 60
Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu Ser Lys Gly His Leu Lys
65 70 75 80
Val His Ile Glu Arg Val His Lys Lys Ile Lys Gln His Cys Arg Phe
85 90 95
Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn Leu Ile Lys His Ile Arg
100 105 110
Asp Ala His Asp Pro Gln Asp Lys Lys Val Lys Glu Ala Leu Asp Glu
115 120 125
Leu Cys Leu Met Thr Arg Glu Gly Lys Arg Gln Leu Leu Tyr Asp Cys
130 135 140
His Ile Cys Glu Arg Lys Phe Lys Asn Glu Leu Asp Arg Asp Arg His
145 150 155 160
Met Leu Val His Gly Asp Lys Trp Pro Phe Ala Cys Glu Leu Cys Gly
165 170 175
His Gly Ala Thr Lys Tyr Gln Ala Leu Glu Leu His Val Arg Lys His
180 185 190
Pro Phe Val Tyr Val Cys Ala Val Cys Arg Lys Lys Phe Val Ser Ser
195 200 205
Ile Arg Leu Arg Thr His Ile Lys Glu Val His Gly Ala Ala Gln Glu
210 215 220
Ala Leu Val Phe Thr Ser Ser Ile Asn Gln Ser Phe Cys Leu Leu Glu
225 230 235 240
Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu Gly Asp Gln Leu Gln Leu
245 250 255
Val Glu Glu Glu Phe Ala Leu Gln Gly Val Asn Ala Leu Lys Glu Glu
260 265 270
Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu Gly Arg Lys Glu Pro Glu
275 280 285
Ala Pro Gly Glu Met Pro Ala Pro Ala Val His Leu Ala Ser Pro Gln
290 295 300
Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu Leu Glu Thr Thr Val Val
305 310 315 320
Ser Ser Ser Asp Leu His Ser Gln Glu Val Val Ser Asp Asp Phe Leu
325 330 335
Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala His Ala Ala Pro Glu Lys
340 345 350
Pro Pro Asp Met Gln His Arg Ser Ser Val Gln Thr Gln Gly Glu Val
355 360 365
Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser Ala Gly Ser Asp Gln Glu
370 375 380
Ser His Gly Ala Gln Ser Pro Leu Gly Glu Gly Gln Asn Met Ala Val
385 390 395 400
Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg Cys Leu Arg Ser Asn Pro
405 410 415
Ala Glu Ala Ser Asp Leu Leu Pro Pro Val Ala Gly Gly Gly Asp Thr
420 425 430
Ile Thr His Gln Pro Asp Ser Cys Lys Ala Ala Pro Glu His Arg Ser
435 440 445
Gly Ile Thr Ala Phe Met Lys Val Leu Asn Ser Leu Gln Lys Lys Gln
450 455 460
Met Asn Thr Ser Leu Cys Glu Arg Ile Arg Lys Val Tyr Gly Asp Leu
465 470 475 480
Glu Cys Glu Tyr Cys
485
<![CDATA[ <210> 853]]>
<![CDATA[ <211> 77]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 853]]>
Lys Leu Phe Trp Tyr Gln Val His Phe Asp Met His Val Arg Thr His
1 5 10 15
Thr Arg Glu His Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile
20 25 30
Thr Lys Asn Cys Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile
35 40 45
Leu Leu Lys Cys Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys
50 55 60
Tyr Lys Leu Gln Ala His Leu Lys Val His Thr Ala Leu
65 70 75
<![CDATA[ <210> 854]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 854]]>
Asp Lys Arg Ser Tyr Ser Cys Pro Val Cys Glu Lys Ser Phe Ser Glu
1 5 10 15
Asp Arg Leu Ile Lys Ser His Ile Lys Thr Asn His Pro
20 25
<![CDATA[ <210> 855]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 855]]>
Val Ser Met Ser Thr Ile Ser Glu Val Leu Gly Arg Arg Val Gln Leu
1 5 10 15
Lys Gly Leu Ile Gly Lys Arg Ala Met Lys Cys Pro Tyr Cys Asp Phe
20 25 30
Tyr Phe Met Lys Asn Gly Ser Asp Leu Gln Arg His Ile Trp Ala His
35 40 45
Glu
<![CDATA[ <210> 856]]>
<![CDATA[ <211> 57]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 856]]>
Val Lys Pro Phe Lys Cys Ser Leu Cys Glu Tyr Ala Thr Arg Ser Lys
1 5 10 15
Ser Asn Leu Lys Ala His Met Asn Arg His Ser Thr Glu Lys Thr His
20 25 30
Leu Cys Asp Met Cys Gly Lys Lys Phe Lys Ser Lys Gly Thr Leu Lys
35 40 45
Ser His Lys Leu Leu His Thr Ala Asp
50 55
<![CDATA[ <210> 857]]>
<![CDATA[ <211> 29]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 857]]>
Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala Gln Lys Pro
1 5 10 15
Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
20 25
<![CDATA[ <210> 858]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 858]]>
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
1 5 10 15
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala Asn
20 25 30
Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
35 40 45
<![CDATA[ <210> 859]]>
<![CDATA[ <211> 39]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 859]]>
Gly Leu Lys Cys Pro Val Cys Ser Phe Val Tyr Gly Thr Lys Trp Glu
1 5 10 15
Phe Asn Arg His Leu Lys Asn Lys His Gly Leu Lys Val Val Glu Ile
20 25 30
Asp Gly Asp Pro Lys Trp Glu
35
<![CDATA[ <210> 860]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 860]]>
Thr Ala Thr Glu Ala Pro Glu Glu Pro Ser Thr Gln Tyr Leu His Ile
1 5 10 15
Thr Glu Ala Glu Glu Asp Val Gln Gly Thr Gln Ala Ala Val Ala Ala
20 25 30
Leu Gln Asp Leu Arg Tyr Thr Ser Glu Ser
35 40
<![CDATA[ <210> 861]]>
<![CDATA[ <211> 43]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 861]]>
Asp Arg Leu Asp Pro Thr Ala Val Asn Ile Leu Gln Gln Ile Ile Glu
1 5 10 15
Leu Gly Ala Glu Thr His Asp Ala Thr Ala Leu Ala Ser Val Val Ala
20 25 30
Met Ala Pro Gly Thr Val Thr Val Val Lys Gln
35 40
<![CDATA[ <210> 862]]>
<![CDATA[ <211> 79]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 862]]>
Val Thr Glu Glu Glu Pro Ser Ser Asn His Thr Val Met Ile Gln Glu
1 5 10 15
Thr Val Gln Gln Ala Ser Val Glu Leu Ala Glu Gln His His Leu Val
20 25 30
Val Ser Ser Asp Asp Val Glu Gly Ile Glu Thr Val Thr Val Tyr Thr
35 40 45
Gln Gly Gly Glu Ala Ser Glu Phe Ile Val Tyr Val Gln Glu Ala Met
50 55 60
Gln Pro Val Glu Glu Gln Ala Val Glu Gln Pro Ala Gln Glu Leu
65 70 75
<![CDATA[ <210> 863]]>
<![CDATA[ <211> 1243]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 863]]>
Met Glu Thr Arg Ala Ala Glu Asn Thr Ala Ile Phe Met Cys Lys Cys
1 5 10 15
Cys Asn Leu Phe Ser Pro Asn Gln Ser Glu Leu Leu Ser His Val Ser
20 25 30
Glu Lys His Met Glu Glu Gly Val Asn Val Asp Glu Ile Ile Ile Pro
35 40 45
Leu Arg Pro Leu Ser Thr Pro Glu Pro Pro Asn Ser Ser Lys Thr Gly
50 55 60
Asp Glu Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser
65 70 75 80
Thr Lys Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser
85 90 95
Pro Thr Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro
100 105 110
Pro Ser Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr
115 120 125
Arg Gln Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu
130 135 140
Glu Glu Gly Glu Ala Gly Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys
145 150 155 160
Cys Lys Glu Asp Asp Arg Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln
165 170 175
Lys Thr Glu Lys Val Gln Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu
180 185 190
Ser Gly Ala Lys Lys Pro Ile Ile Ser Val Val Leu Thr Ala His Glu
195 200 205
Ala Ile Pro Gly Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro
210 215 220
Glu Thr Gly Ala Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro
225 230 235 240
Arg Arg Gly Tyr Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln
245 250 255
Pro Met Lys Ser Ser Arg Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr
260 265 270
Cys Glu Tyr Cys Asn Lys Val Phe Lys Phe Lys His Ser Leu Gln Ala
275 280 285
His Leu Arg Ile His Thr Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys
290 295 300
Ser Tyr Ala Ser Ala Ile Lys Ala Asn Leu Asn Val His Leu Arg Lys
305 310 315 320
His Thr Gly Glu Lys Phe Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu
325 330 335
Ser Lys Gly His Leu Lys Val His Ile Glu Arg Val His Lys Lys Ile
340 345 350
Lys Gln His Cys Arg Phe Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn
355 360 365
Leu Ile Lys His Ile Arg Asp Ala His Asp Pro Gln Asp Lys Lys Val
370 375 380
Lys Glu Ala Leu Asp Glu Leu Cys Leu Met Thr Arg Glu Gly Lys Arg
385 390 395 400
Gln Leu Leu Tyr Asp Cys His Ile Cys Glu Arg Lys Phe Lys Asn Glu
405 410 415
Leu Asp Arg Asp Arg His Met Leu Val His Gly Asp Lys Trp Pro Phe
420 425 430
Ala Cys Glu Leu Cys Gly His Gly Ala Thr Lys Tyr Gln Ala Leu Glu
435 440 445
Leu His Val Arg Lys His Pro Phe Val Tyr Val Cys Ala Val Cys Arg
450 455 460
Lys Lys Phe Val Ser Ser Ile Arg Leu Arg Thr His Ile Lys Glu Val
465 470 475 480
His Gly Ala Ala Gln Glu Ala Leu Val Phe Thr Ser Ser Ile Asn Gln
485 490 495
Ser Phe Cys Leu Leu Glu Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu
500 505 510
Gly Asp Gln Leu Gln Leu Val Glu Glu Glu Phe Ala Leu Gln Gly Val
515 520 525
Asn Ala Leu Lys Glu Glu Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu
530 535 540
Gly Arg Lys Glu Pro Glu Ala Pro Gly Glu Met Pro Ala Pro Ala Val
545 550 555 560
His Leu Ala Ser Pro Gln Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu
565 570 575
Leu Glu Thr Thr Val Val Ser Ser Ser Asp Leu His Ser Gln Glu Val
580 585 590
Val Ser Asp Asp Phe Leu Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala
595 600 605
His Ala Ala Pro Glu Lys Pro Pro Asp Met Gln His Arg Ser Ser Val
610 615 620
Gln Thr Gln Gly Glu Val Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser
625 630 635 640
Ala Gly Ser Asp Gln Glu Ser His Gly Ala Gln Ser Pro Leu Gly Glu
645 650 655
Gly Gln Asn Met Ala Val Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg
660 665 670
Cys Leu Arg Ser Asn Pro Ala Glu Ala Ser Asp Leu Leu Pro Pro Val
675 680 685
Ala Gly Gly Gly Asp Thr Ile Thr His Gln Pro Asp Ser Cys Lys Ala
690 695 700
Ala Pro Glu His Arg Ser Gly Ile Thr Ala Phe Met Lys Val Leu Asn
705 710 715 720
Ser Leu Gln Lys Lys Gln Met Asn Thr Ser Leu Cys Glu Arg Ile Arg
725 730 735
Lys Val Tyr Gly Asp Leu Glu Cys Glu Tyr Cys Gly Lys Leu Phe Trp
740 745 750
Tyr Gln Val His Phe Asp Met His Val Arg Thr His Thr Arg Glu His
755 760 765
Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile Thr Lys Asn Cys
770 775 780
Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile Leu Leu Lys Cys
785 790 795 800
Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys Tyr Lys Leu Gln
805 810 815
Ala His Leu Lys Val His Thr Ala Leu Asp Lys Arg Ser Tyr Ser Cys
820 825 830
Pro Val Cys Glu Lys Ser Phe Ser Glu Asp Arg Leu Ile Lys Ser His
835 840 845
Ile Lys Thr Asn His Pro Glu Val Ser Met Ser Thr Ile Ser Glu Val
850 855 860
Leu Gly Arg Arg Val Gln Leu Lys Gly Leu Ile Gly Lys Arg Ala Met
865 870 875 880
Lys Cys Pro Tyr Cys Asp Phe Tyr Phe Met Lys Asn Gly Ser Asp Leu
885 890 895
Gln Arg His Ile Trp Ala His Glu Gly Val Lys Pro Phe Lys Cys Ser
900 905 910
Leu Cys Glu Tyr Ala Thr Arg Ser Lys Ser Asn Leu Lys Ala His Met
915 920 925
Asn Arg His Ser Thr Glu Lys Thr His Leu Cys Asp Met Cys Gly Lys
930 935 940
Lys Phe Lys Ser Lys Gly Thr Leu Lys Ser His Lys Leu Leu His Thr
945 950 955 960
Ala Asp Gly Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala
965 970 975
Gln Lys Pro Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
980 985 990
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
995 1000 1005
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala
1010 1015 1020
Asn Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
1025 1030 1035
Gly Gly Leu Lys Cys Pro Val Cys Ser Phe Val Tyr Gly Thr Lys
1040 1045 1050
Trp Glu Phe Asn Arg His Leu Lys Asn Lys His Gly Leu Lys Val
1055 1060 1065
Val Glu Ile Asp Gly Asp Pro Lys Trp Glu Thr Ala Thr Glu Ala
1070 1075 1080
Pro Glu Glu Pro Ser Thr Gln Tyr Leu His Ile Thr Glu Ala Glu
1085 1090 1095
Glu Asp Val Gln Gly Thr Gln Ala Ala Val Ala Ala Leu Gln Asp
1100 1105 1110
Leu Arg Tyr Thr Ser Glu Ser Gly Asp Arg Leu Asp Pro Thr Ala
1115 1120 1125
Val Asn Ile Leu Gln Gln Ile Ile Glu Leu Gly Ala Glu Thr His
1130 1135 1140
Asp Ala Thr Ala Leu Ala Ser Val Val Ala Met Ala Pro Gly Thr
1145 1150 1155
Val Thr Val Val Lys Gln Val Thr Glu Glu Glu Pro Ser Ser Asn
1160 1165 1170
His Thr Val Met Ile Gln Glu Thr Val Gln Gln Ala Ser Val Glu
1175 1180 1185
Leu Ala Glu Gln His His Leu Val Val Ser Ser Asp Asp Val Glu
1190 1195 1200
Gly Ile Glu Thr Val Thr Val Tyr Thr Gln Gly Gly Glu Ala Ser
1205 1210 1215
Glu Phe Ile Val Tyr Val Gln Glu Ala Met Gln Pro Val Glu Glu
1220 1225 1230
Gln Ala Val Glu Gln Pro Ala Gln Glu Leu
1235 1240
<![CDATA[ <210> 864]]>
<![CDATA[ <211> 3283]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 864]]>
atccgccatg ttggatgccg cagattcgcc ataacctcgc cggctctttt cttaaaaaaa 60
taaaaataaa aagcgaagcg tcagcagggc gccccgcccc ctcggtcggc acgggagggg 120
gcccggaaga gcccgaggct tttttttcct ccgcggtggg gcgttgccat ggagacgcgg 180
gcggcagaaa acacggccat ctttatgtgt aaatgttgta acctcttctc accaaatcag 240
tcggaactcc tctcccacgt ttcagagaag cacatggaag aaggggttaa tgttgatgag 300
attattattc cccttaggcc tctgagtaca cctgaaccccc ccaactcaag caaaaccgga 360
gatgagtttt tggtcatgaa gaggaagaga ggcaggccta aggggtccac gaagaagtcc 420
agcacagaag aggagctggc agaaaacatc gtgagtccga ctgaggacag cccgctggct 480
ccggaggaag ggaacagcct gcctccaagc agcttggagt gtagcaagtg ctgtcggaag 540
ttctccaaca cgcgccagct gcggaagcac atctgcatta tcgtgctgaa tttgggtgag 600
gaggaaggag aagcaggtaa cgagtctgac cttgaactag aaaagaagtg taaggaagat 660
gatcgggaaa aagcctcgaa aagaccacgg tcacagaaaa cagagaaagt ccagaagatc 720
tcaggaaagg aggccagaca gctttctggg gcgaagaaac ccatcataag tgtggtttta 780
actgcacacg aagcaattcc aggtgctacc aagattgtgc cagtggaggc tgggccccct 840
gaaacaggag ctacaaattc tgagaccact tcagcagacc tggtgcctcg gagaggctac 900
caggaatacg ccattcagca gacaccttat gagcaaccaa tgaagtcaag caggctaggt 960
cccactcagc tcaaaatctt cacttgtgaa tactgcaaca aggtcttcaa gttcaagcac 1020
tcgctgcagg cccacctgag gatccacacc aatgaaaagc catacaagtg cccccagtgc 1080
agctatgcca gtgccatcaa ggccaacctc aatgtgcacc tgcgcaagca cactggagag 1140
aagttcgcct gcgactattg ctcgttcacc tgcctgagca agggccacct caaggtgcac 1200
atcgagcgag tgcacaagaa gatcaagcag cactgccgct tctgcaagaa gaagtactct 1260
gacgtcaaga acctcatcaa gcacatccga gacgcgcatg accccacagga caagaaggtc 1320
aaagaggcct tggacgagct ctgcctgatg acgagggagg gcaagcggca gctgctctat 1380
gactgccaca tctgtgagcg caagttcaag aacgagctgg accgtgaccg ccatatgctg 1440
gtccacggag acaagtggcc ttttgcctgt gagctctgtg gccatggggc caccaagtac 1500
caggcgctgg aactgcatgt caggaagcac cccttcgtgt acgtctgtgc cgtctgccgc 1560
aagaagttcg tcagctccat caggctgcgc acccacatca aagaggtgca cggggctgcc 1620
caggaggcct tggtcttcac cagttccatc aaccagagct tctgcctcct ggaacctggt 1680
ggggacatcc agcaagaagc tctgggggac cagctacagc tggtggaaga ggagtttgcc 1740
ctccagggcg tgaatgcact caaggaagag gcctgtcctg gggacactca gctggaggag 1800
ggccggaagg agccggaggc ccctggggaa atgcctgccc cagctgtgca cctggcctcc 1860
ccgcaggccg aaagcacagc cctgccaccc tgtgagctgg aaaccaccgt ggtctcctcc 1920
tcagacctgc attctcaaga ggtggtttca gatgattttttgttgaaaaa tgatacctcc 1980
tccgcagagg ctcatgctgc tcctgagaag cccccagaca tgcagcacag aagctcagtc 2040
cagacgcaag gtgaagtgat cacactactg ctgtccaagg cccagagtgc tgggtcagat 2100
caggaaagcc atggcgccca gagcccccta ggggaagggc agaacatggc tgtgctttca 2160
gctggtgacc cagatccccag caggtgtctc aggtcaaacc cagctgaggc ctcagacctc 2220
ctccctccag tagctggtgg tggggacacc atcacacatc agcctgactc ttgcaaagct 2280
gcccctgagc accggtcagg catcaccgct ttcatgaagg tcctgaacag tttacagaag 2340
aagcaaatga acaccagctt gtgtgagcgg atccggaagg tttatggaga cctggagtgt 2400
gaatactgtg gcaaactttt ttggtaccaa gtgcattttg atatgcatgt ccgcacccac 2460
acccgggaac atctgtatta ttgctctcag tgtcattatt cttccatcac caaaaactgc 2520
cttaaacgcc acgtaattca gaaacacagt aacatcttgc tgaagtgtcc caccgatggc 2580
tgtgactact caactccaga taaatataag ctacaggcac atcttaaagt tcacacagca 2640
ctggacaaaa ggagttattc ttgtcctgtt tgtgaaaagt ctttttcaga ggatcgattg 2700
ataaagtcac atatcaagac caaccatcct gaggtctcca tgagcaccat ttctgaggtt 2760
ctcgggagga gggttcagct gaaagggcta attggaaaga gagccatgaa atgcccatat 2820
tgtgactttt atttcatgaa gaatggctca gaccttcagc gtcatatttg ggctcatgaa 2880
ggtgtgaagc ccttcaagtg ttctttgtgtgagtatgcaa ctcgtagcaa gagtaacctc 2940
aaggctcata tgaatcgtca cagcactgag aaaacccacc tatgtgacat gtgtggcaag 3000
aaattcaaat caaaagggac actgaaaagt cacaaactcc ttcacactgc agatgggaag 3060
cagtttaagt gcacggtgtg tgactacaca gcggcccaga agccacagct gctgcggcac 3120
atggaacagc atgtctcctt caagcctttc cgctgtgccc attgccatta ctcctgcaac 3180
atatctggct ctctgaagcg gcactacaac aggaagcacc ctaatgagga gtatgccaac 3240
gtgggcaccg gggagctggc agcggaggtg ctcatccagc aag 3283
<![CDATA[ <210> 865]]>
<![CDATA[ <211> 1038]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 865]]>
Met Glu Thr Arg Ala Ala Glu Asn Thr Ala Ile Phe Met Cys Lys Cys
1 5 10 15
Cys Asn Leu Phe Ser Pro Asn Gln Ser Glu Leu Leu Ser His Val Ser
20 25 30
Glu Lys His Met Glu Glu Gly Val Asn Val Asp Glu Ile Ile Ile Pro
35 40 45
Leu Arg Pro Leu Ser Thr Pro Glu Pro Pro Asn Ser Ser Lys Thr Gly
50 55 60
Asp Glu Phe Leu Val Met Lys Arg Lys Arg Gly Arg Pro Lys Gly Ser
65 70 75 80
Thr Lys Lys Ser Ser Thr Glu Glu Glu Leu Ala Glu Asn Ile Val Ser
85 90 95
Pro Thr Glu Asp Ser Pro Leu Ala Pro Glu Glu Gly Asn Ser Leu Pro
100 105 110
Pro Ser Ser Leu Glu Cys Ser Lys Cys Cys Arg Lys Phe Ser Asn Thr
115 120 125
Arg Gln Leu Arg Lys His Ile Cys Ile Ile Val Leu Asn Leu Gly Glu
130 135 140
Glu Glu Gly Glu Ala Gly Asn Glu Ser Asp Leu Glu Leu Glu Lys Lys
145 150 155 160
Cys Lys Glu Asp Asp Arg Glu Lys Ala Ser Lys Arg Pro Arg Ser Gln
165 170 175
Lys Thr Glu Lys Val Gln Lys Ile Ser Gly Lys Glu Ala Arg Gln Leu
180 185 190
Ser Gly Ala Lys Lys Pro Ile Ile Ser Val Val Leu Thr Ala His Glu
195 200 205
Ala Ile Pro Gly Ala Thr Lys Ile Val Pro Val Glu Ala Gly Pro Pro
210 215 220
Glu Thr Gly Ala Thr Asn Ser Glu Thr Thr Ser Ala Asp Leu Val Pro
225 230 235 240
Arg Arg Gly Tyr Gln Glu Tyr Ala Ile Gln Gln Thr Pro Tyr Glu Gln
245 250 255
Pro Met Lys Ser Ser Arg Leu Gly Pro Thr Gln Leu Lys Ile Phe Thr
260 265 270
Cys Glu Tyr Cys Asn Lys Val Phe Lys Phe Lys His Ser Leu Gln Ala
275 280 285
His Leu Arg Ile His Thr Asn Glu Lys Pro Tyr Lys Cys Pro Gln Cys
290 295 300
Ser Tyr Ala Ser Ala Ile Lys Ala Asn Leu Asn Val His Leu Arg Lys
305 310 315 320
His Thr Gly Glu Lys Phe Ala Cys Asp Tyr Cys Ser Phe Thr Cys Leu
325 330 335
Ser Lys Gly His Leu Lys Val His Ile Glu Arg Val His Lys Lys Ile
340 345 350
Lys Gln His Cys Arg Phe Cys Lys Lys Lys Tyr Ser Asp Val Lys Asn
355 360 365
Leu Ile Lys His Ile Arg Asp Ala His Asp Pro Gln Asp Lys Lys Val
370 375 380
Lys Glu Ala Leu Asp Glu Leu Cys Leu Met Thr Arg Glu Gly Lys Arg
385 390 395 400
Gln Leu Leu Tyr Asp Cys His Ile Cys Glu Arg Lys Phe Lys Asn Glu
405 410 415
Leu Asp Arg Asp Arg His Met Leu Val His Gly Asp Lys Trp Pro Phe
420 425 430
Ala Cys Glu Leu Cys Gly His Gly Ala Thr Lys Tyr Gln Ala Leu Glu
435 440 445
Leu His Val Arg Lys His Pro Phe Val Tyr Val Cys Ala Val Cys Arg
450 455 460
Lys Lys Phe Val Ser Ser Ile Arg Leu Arg Thr His Ile Lys Glu Val
465 470 475 480
His Gly Ala Ala Gln Glu Ala Leu Val Phe Thr Ser Ser Ile Asn Gln
485 490 495
Ser Phe Cys Leu Leu Glu Pro Gly Gly Asp Ile Gln Gln Glu Ala Leu
500 505 510
Gly Asp Gln Leu Gln Leu Val Glu Glu Glu Phe Ala Leu Gln Gly Val
515 520 525
Asn Ala Leu Lys Glu Glu Ala Cys Pro Gly Asp Thr Gln Leu Glu Glu
530 535 540
Gly Arg Lys Glu Pro Glu Ala Pro Gly Glu Met Pro Ala Pro Ala Val
545 550 555 560
His Leu Ala Ser Pro Gln Ala Glu Ser Thr Ala Leu Pro Pro Cys Glu
565 570 575
Leu Glu Thr Thr Val Val Ser Ser Ser Asp Leu His Ser Gln Glu Val
580 585 590
Val Ser Asp Asp Phe Leu Leu Lys Asn Asp Thr Ser Ser Ala Glu Ala
595 600 605
His Ala Ala Pro Glu Lys Pro Pro Asp Met Gln His Arg Ser Ser Val
610 615 620
Gln Thr Gln Gly Glu Val Ile Thr Leu Leu Leu Ser Lys Ala Gln Ser
625 630 635 640
Ala Gly Ser Asp Gln Glu Ser His Gly Ala Gln Ser Pro Leu Gly Glu
645 650 655
Gly Gln Asn Met Ala Val Leu Ser Ala Gly Asp Pro Asp Pro Ser Arg
660 665 670
Cys Leu Arg Ser Asn Pro Ala Glu Ala Ser Asp Leu Leu Pro Pro Val
675 680 685
Ala Gly Gly Gly Asp Thr Ile Thr His Gln Pro Asp Ser Cys Lys Ala
690 695 700
Ala Pro Glu His Arg Ser Gly Ile Thr Ala Phe Met Lys Val Leu Asn
705 710 715 720
Ser Leu Gln Lys Lys Gln Met Asn Thr Ser Leu Cys Glu Arg Ile Arg
725 730 735
Lys Val Tyr Gly Asp Leu Glu Cys Glu Tyr Cys Gly Lys Leu Phe Trp
740 745 750
Tyr Gln Val His Phe Asp Met His Val Arg Thr His Thr Arg Glu His
755 760 765
Leu Tyr Tyr Cys Ser Gln Cys His Tyr Ser Ser Ile Thr Lys Asn Cys
770 775 780
Leu Lys Arg His Val Ile Gln Lys His Ser Asn Ile Leu Leu Lys Cys
785 790 795 800
Pro Thr Asp Gly Cys Asp Tyr Ser Thr Pro Asp Lys Tyr Lys Leu Gln
805 810 815
Ala His Leu Lys Val His Thr Ala Leu Asp Lys Arg Ser Tyr Ser Cys
820 825 830
Pro Val Cys Glu Lys Ser Phe Ser Glu Asp Arg Leu Ile Lys Ser His
835 840 845
Ile Lys Thr Asn His Pro Glu Val Ser Met Ser Thr Ile Ser Glu Val
850 855 860
Leu Gly Arg Arg Val Gln Leu Lys Gly Leu Ile Gly Lys Arg Ala Met
865 870 875 880
Lys Cys Pro Tyr Cys Asp Phe Tyr Phe Met Lys Asn Gly Ser Asp Leu
885 890 895
Gln Arg His Ile Trp Ala His Glu Gly Val Lys Pro Phe Lys Cys Ser
900 905 910
Leu Cys Glu Tyr Ala Thr Arg Ser Lys Ser Asn Leu Lys Ala His Met
915 920 925
Asn Arg His Ser Thr Glu Lys Thr His Leu Cys Asp Met Cys Gly Lys
930 935 940
Lys Phe Lys Ser Lys Gly Thr Leu Lys Ser His Lys Leu Leu His Thr
945 950 955 960
Ala Asp Gly Lys Gln Phe Lys Cys Thr Val Cys Asp Tyr Thr Ala Ala
965 970 975
Gln Lys Pro Gln Leu Leu Arg His Met Glu Gln His Val Ser Phe Lys
980 985 990
Pro Phe Arg Cys Ala His Cys His Tyr Ser Cys Asn Ile Ser Gly Ser
995 1000 1005
Leu Lys Arg His Tyr Asn Arg Lys His Pro Asn Glu Glu Tyr Ala
1010 1015 1020
Asn Val Gly Thr Gly Glu Leu Ala Ala Glu Val Leu Ile Gln Gln
1025 1030 1035
<![CDATA[ <210> 866]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 866]]>
gcctcatccc ctcttcagtg caggaatatg ttagcgttgt atcttgggag aaagacacag 60
tctccatcat cccag 75
<![CDATA[ <210> 867]]>
<![CDATA[ <211> 75]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 867]]>
ccttgcctcc ccgattgaaa gagatgaaaa gccaggaatc ggctgcaggt tccaaactag 60
tccttcggtg tgaaa 75
<![CDATA[ <210> 868]]>
<![CDATA[ <211> 150]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 868]]>
gcctcatccc ctcttcagtg caggaatatg ttagcgttgt atcttgggag aaagacacag 60
tctccatcat cccagccttg cctccccgat tgaaagagat gaaaagccag gaatcggctg 120
caggttccaa actagtcctt cggtgtgaaa 150
<![CDATA[ <210> 869]]>
<![CDATA[ <211> 49]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 869]]>
Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser Trp Glu
1 5 10 15
Lys Asp Thr Val Ser Ile Ile Pro Ala Leu Pro Pro Arg Leu Lys Glu
20 25 30
Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
35 40 45
Glu
<![CDATA[ <210> 870]]>
<![CDATA[ <211> 183]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 870]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggacccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aag 183
<![CDATA[ <210> 871]]>
<![CDATA[ <211> 318]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 871]]>
cccgccctga agatgttggc accagcctct actttgtaaa tgactccttg cagcaggtga 60
ccttttccag ctccgtgggg gtggtggtgc cctgcccggc cgcgggctcc cccagcgcgg 120
cccttcgatg gtacctggcc acaggggacg acatctacga cgtgccgcac atccggcacg 180
tccacgccaa cgggacgctg cagctctacc ccttctcccc ctccgccttc aatagcttta 240
tccacgacaa tgactacttc tgcaccgcgg agaacgctgc cggcaagatc cggagcccca 300
acatccgcgt caaagcag 318
<![CDATA[ <210> 872]]>
<![CDATA[ <211> 147]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 872]]>
ttttcaggga accttacacc gtccgggtgg aggatcaaag gtcaatgcgt ggcaacgtgg 60
ccgtcttcaa gtgcctcatc ccctcttcag tgcaggaata tgttagcgtt gtatcttggg 120
agaaagacac agtctccatc atcccag 147
<![CDATA[ <210> 873]]>
<![CDATA[ <211> 147]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 873]]>
aacacaggtt ttttattacc taccacggcg ggctgtacat ctctgacgta cagaaggagg 60
acgccctctc cacctatcgc tgcatcacca agcacaagta tagcggggag acccggcaga 120
gcaatggggc acgcctctct gtgacag 147
<![CDATA[ <210> 874]]>
<![CDATA[ <211> 279]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 874]]>
accctgctga gtcgatcccc accatcctgg atggcttcca ctcccaggaa gtgtgggccg 60
gccacaccgt ggagctgccc tgcaccgcct cgggctaccc tatccccgcc atccgctggc 120
tcaaggatgg ccggcccctc ccggctgaca gccgctggac caagcgcatc acagggctga 180
ccatcagcga cttgcggacc gaggacagcg gcacctacat ttgtgaggtc accaacacct 240
tcggttcggc agaggccaca ggcatcctca tggtcattg 279
<![CDATA[ <210> 875]]>
<![CDATA[ <211> 276]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 875]]>
atccccttca tgtgaccctg acaccaaaga agctgaagac cggcattggc agcacggtca 60
tcctctcctg tgccctgacg ggctccccag agttcaccat ccgctggtat cgcaacacgg 120
agctggtgct gcctgacgag gccatctcca tccgcgggct cagcaacgag acgctgctca 180
tcacctcggc ccagaagagc cattccgggg cctaccagtg cttcgctacc cgcaaggccc 240
agaccgccca ggactttgcc atcattgcac ttgagg 276
<![CDATA[ <210> 876]]>
<![CDATA[ <211> 297]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 876]]>
atggcacgcc ccgcatcgtc tcgtccttca gcgagaaggt ggtcaaccccc ggggagcagt 60
tctcactgat gtgtgcggcc aagggcgccc cgccccccac ggtcacctgg gccctcgacg 120
atgagcccat cgtgcgggat ggcagccacc gcaccaacca gtacaccatg tcggacggca 180
ccaccatcag ccacatgaac gtcacaggcc cccagatccg cgacgggggc gtgtaccggt 240
gcacagcgcg gaacttggtg ggcagtgctg aatatcaggc gcgaataaac gtaagag 297
<![CDATA[ <210> 877]]>
<![CDATA[ <211> 273]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 877]]>
gcccacccag catccgggct atgcggaaca tcacagcagt cgccgggcgg gacaccctta 60
tcaactgcag ggtcatcggc tatccctact actccatcaa gtggtacaag gatgccctgc 120
tgctgccaga caaccaccgc caggtggtgt ttgagaatgg gaccctcaag ctgactgacg 180
tgcagaaggg catggatgag ggggagtacc tgtgcagtgt cctcatccag ccccagctct 240
ccatcagcca gagcgttcac gtagccgtca aag 273
<![CDATA[ <210> 878]]>
<![CDATA[ <211> 279]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 878]]>
tgccccctct gatccagccc ttcgaattcc cacccgcctc catcggccag ctgctctaca 60
ttccctgtgt ggtgtcctcg ggggacatgc ccatccgtat cacctggagg aaggacggac 120
aggtgatcat ctcaggctcg ggcgtgacca tcgagagcaa ggaattcatg agctccctgc 180
agatctctag cgtctccctc aagcacaacg gcaactatac atgcatcgcc agcaacgcag 240
ccgccaccgt gagccggggag cgccagctca tcgtgcgtg 279
<![CDATA[ <210> 879]]>
<![CDATA[ <211> 120]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 879]]>
tgccccctcg atttgtggtg caacccaaca accaggatgg catctacggc aaagctggtg 60
tgctcaactg ctcggtggac ggctaccccc cacccaaggt catgtggaag catgccaagg 120
<![CDATA[ <210> 880]]>
<![CDATA[ <211> 177]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 880]]>
ggagcgggaa cccccagcag taccaccctg tgcccctcac tggccgcatc cagatcctgc 60
ccaacagctc gctgctgatc cgccacgtcc tagaagagga catcggctac tacctctgcc 120
aggccagcaa cggcgtaggc accgacatca gcaagtccat gttcctcaca gtcaaga 177
<![CDATA[ <210> 881]]>
<![CDATA[ <211> 200]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 881]]>
tcccggccat gatcacttcc caccccaaca ccaccatcgc catcaagggc catgcgaagg 60
agctaaactg cacggcacgg ggtgagcggc ccatcatcat ccgctggggag aagggggaca 120
cagtcatcga ccctgaccgc gtcatgcggt atgccatcgc caccaaggac aacggcgacg 180
aggtcgtctc cacactgaag 200
<![CDATA[ <210> 882]]>
<![CDATA[ <211> 97]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 882]]>
ctcaagcccg ctgaccgtgg ggactctgtg ttcttcagct gccatgccat caactcgtat 60
ggggaggacc ggggcttgat ccaactcact gtgcaag 97
<![CDATA[ <210> 883]]>
<![CDATA[ <211> 129]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 883]]>
agccccccga ccccccagag ctggagatcc gggaggtgaa ggcccggagc atgaacctgc 60
gctggaccca gcgattcgac gggaacagca tcatcacggg cttcgacatt gaatacaaga 120
acaaatcag 129
<![CDATA[ <210> 884]]>
<![CDATA[ <211> 168]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 884]]>
attcctggga cttcaagcag tccaacacgca acatctcccc caccatcaac caggccaaca 60
ttgtggactt gcacccggca tctgtgtaca gcatccgcat gtactctttc aacaagattg 120
gccgcagtga accaagcaag gagctcacca tcagcactga ggaggccg 168
<![CDATA[ <210> 885]]>
<![CDATA[ <211> 71]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 885]]>
ctcccgatgg gccccccatg gatgttacct tgcagccagt gacctcacag agcatccagg 60
tgacctggaa g 71
<![CDATA[ <210> 886]]>
<![CDATA[ <211> 241]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 886]]>
gcacccaaga aggagctgca gaacggtgtc atccggggct accagattgg ctacagagag 60
aacagccccg gcagcaacgg gcagtacagc atcgtggaga tgaaggccac gggggacagc 120
gaggtctaca ccctggacaa cctcaagaag ttcgcccagt atggggtggt ggtccaagcc 180
ttcaatcggg ctggcacggg gccctcttcc agcgagatca atgccaccac tctggaggat 240
g 241
<![CDATA[ <210> 887]]>
<![CDATA[ <211> 147]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 887]]>
tgcccagcca gccccctgag aacgtccggg ccctgtccat cacttctgac gtggccgtca 60
tctcctggtc agagcccccg cgcagcaccc tcaatggcgt cctcaaaggc tatcgggtca 120
tcttctggtc cctctatgtt gatgggg 147
<![CDATA[ <210> 888]]>
<![CDATA[ <211> 156]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 888]]>
agtggggcga gatgcagaac atcaccacca cgcgggagcg ggtggagctg cggggcatgg 60
agaagttcac caactacagc gtccaggtgc tggcctacac ccaggctggg gacggcgtac 120
gcagcagtgt gctctacatc cagaccaagg aggacg 156
<![CDATA[ <210> 889]]>
<![CDATA[ <211> 140]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 889]]>
ttccaggtcc ccctgctggc atcaaagctg tcccttcatc agctagcagt gtggttgtgt 60
cttggctccc ccctaccaag cccaacgggg tgatccgcaa gtacaccatc ttctgttcca 120
gccccgggtc tggccagccg 140
<![CDATA[ <210> 890]]>
<![CDATA[ <211> 154]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 890]]>
gctcccagcg agtacgagac gagtccagag cagctcttct accggatcgc ccacctaaac 60
cgcggtcagc agtatctgct gtgggtggcc gccgtcacct ctgccggccg gggcaacagc 120
agcgagaagg tgaccatcga gcctgctggc aagg 154
<![CDATA[ <210> 891]]>
<![CDATA[ <211> 118]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 891]]>
ccccagcaaa gatcatctcc tttgggggca ccgtgacaac accttggatg aaagatgttc 60
ggctgccttg caattcagtg ggagatccag cccctgctgt gaagtggacc aaggacag 118
<![CDATA[ <210> 892]]>
<![CDATA[ <211> 164]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 892]]>
tgaagactcg gccattccag tgtccatgga tgggcaccgg ctcatccaca ccaatggcac 60
actgctgctg cgtgcagtga aggctgagga ctctggctac tacacgtgca cggccaccaa 120
cactggtggc tttgacacca tcatcgtcaa ccttctggtg caag 164
<![CDATA[ <210> 893]]>
<![CDATA[ <211> 99]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 893]]>
ttcccccgga ccagccccgc ctcactgtct ccaaaacctc agcttcgtcc atcaccctga 60
cctggattcc aggtgacaat gggggcagct ccatccgag 99
<![CDATA[ <210> 894]]>
<![CDATA[ <211> 189]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 894]]>
gcttcgtgct acagtactcg gtggacaaca gcgaggagtg gaaggatgtg ttcatcagct 60
ccagcgagcg ctccttcaag ctggacagcc tcaagtgtgg cacgtggtac aaggtgaagc 120
tggcagccaa gaacagcgtg ggctctgggc gcatcagcga gatcatcgag gccaagaccc 180
acggggcggg 189
<![CDATA[ <210> 895]]>
<![CDATA[ <211> 288]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 895]]>
agccctcctt cagcaaagac caacacctct tcacccacat caactccacg catgctcggc 60
ttaacctgca gggctggaac aatgggggct gccctatcac agccatcgtt ctggagtacc 120
ggcccaaggg gacctgggcc tggcagggcc tccgggccaa cagctccggg gaggtgtttc 180
tgacggaact gcgagaggcc acgtggtacg agctgcgcat gagggcttgc aacagtgcgg 240
gctgcggcaa tgaaacagcc cagttcgcca ccctggacta cgatggca 288
<![CDATA[ <210> 896]]>
<![CDATA[ <211> 156]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 896]]>
gcaccattcc acccatcaag tctgctcaag gtgaagggga tgatgtgaag aagctgttca 60
ccatcggctg ccctgtcatc ctggccaacac tgggggtggc actgctcttc atcgtacgca 120
agaagaggaa ggagaaacgg ctgaagcgac tccgag 156
<![CDATA[ <210> 897]]>
<![CDATA[ <211> 31]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 897]]>
atgcaaagag tttggcagaa atgttgataa g 31
<![CDATA[ <210> 898]]>
<![CDATA[ <211> 119]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 898]]>
caagaacaat agaagctttg aacccctgt gaaagggcca ccccagggcc cacggctaca 60
cattgacatc cccagggtcc agctgctcat cgaggacaaa gaaggcatca agcaactgg 119
<![CDATA[ <210> 899]]>
<![CDATA[ <211> 153]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 899]]>
gagatgacaa ggccaccatc cctgtgacag atgctgagtt cagccaagct gtcaacccac 60
agagcttctg tactggcgtc tccttgcacc acccaaccct catccagagc acaggaccccc 120
tcatcgacat gtctgacatc cggccaggaa cca 153
<![CDATA[ <210> 900]]>
<![CDATA[ <211> 195]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 900]]>
atccagtgtc caggaagaat gtgaagtcag cccacagcac ccggaaccgg tactcaagcc 60
agtggaccct gaccaagtgc caggcctcca cacctgcccg caccctcacc tccgactggc 120
gcaccgtggg ctcccagcat ggtgtcacgg tcactgagag tgacagctac agtgccagcc 180
tgtcccagga cacag 195
<![CDATA[ <210> 901]]>
<![CDATA[ <211> 312]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 901]]>
acaaaggaag gaacagcatg gtgtccactg agagtgcctc ttccacctac gaggagctgg 60
cccgggccta tgagcatgcc aagctggagg agcagctgca gcacgccaag tttgagatca 120
ccgagtgctt catctctgac agttcctctg accagatgac cacaggcacc aacgagaacg 180
ccgacagcat gacatccatg agcacaccct cagagcctgg catctgccgc tttaccgcct 240
caccacccaa gccccaggat gcggaccggg gcaaaaacgt ggctgtgccc atccctcacc 300
gggccaacaa ga 312
<![CDATA[ <210> 902]]>
<![CDATA[ <211> 1032]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 902]]>
gtgactactg caacctgccc ctgtatgcca agtcagaggc cttctttcga aaggcagatg 60
gacgtgagcc ctgccccgtg gtcccacccc gtgaggcctc catccggaac ctggctcgaa 120
cctaccacac ccaggctcgc cacctgaccc tggaccctgc cagcaagtcc ttgggccttc 180
cccacccagg ggcccccgct gccgcctcca cagccacctt acctcagagg actctggcca 240
tgccagcccc cccagccggc acagcccccc cagcccccgg ccccacccct gctgagccac 300
ccaccgcccc cagcgctgcc cctccggccc ccagcaccga gcctccacga gccgggggcc 360
cacacaccaa aatggggggc tccagggact cgcttctcga gatgagcaca tcggggggtag 420
ggaggtctca gaagcagggg gccggggcct actccaaatc ctacaccctg gtgtagggcc 480
cgcaggaaga gcagccacgc ctggaccgcg ccgcgccgca gccccacacg ccagctcggc 540
tgtttttctg catttattat attcaactga cagacaaaaa ccaaccaacg acaaaacaaa 600
aacccccaat catgaacgcc tgtacataga actcttttgt acaaatgaaa ctattttctt 660
cttctccatg aagccagggc acaaagaatt tgacagtaca agtcaaatcc cccaccccac 720
aaaatatgtg tggagatata tatacatata tagacagaca ggaacgcgtc cacgagctat 780
atatctatat atttctctca ccctattttg agacagaggc acaaagactc agcaattttt 840
ttccctcctc ctcaccttcc ccccagtcta ggtggttttg acaaagacca aaatcccaac 900
tcagagacac tgcatgcgat tttactgttc caagaaaacc aggagttgct tcaatttgca 960
gatgcttatg tgttaatacc tttttctatg aaaaaagacc cagcgccgtg tgcaataaag 1020
gttatgtttc ta 1032
<![CDATA[ <210> 903]]>
<![CDATA[ <211> 6855]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> Full mRNA polynucleotide sequence of DSCAML1]]>
<![CDATA[ <400> 903]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggacccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aagcccgccc tgaagatgtt ggcaccagcc tctactttgt aaatgactcc ttgcagcagg 240
tgaccttttc cagctccgtgggggtggtgg tgccctgccc ggccgcgggc tcccccagcg 300
cggcccttcg atggtacctg gccacagggg acgacatcta cgacgtgccg cacatccggc 360
acgtccacgc caacgggacg ctgcagctct accccttctc cccctccgcc ttcaatagct 420
ttatccacga caatgactac ttctgcaccg cggagaacgc tgccggcaag atccggagcc 480
ccaacatccg cgtcaaagca gttttcaggg aaccctacac cgtccgggtg gaggatcaaa 540
ggtcaatgcg tggcaacgtg gccgtcttca agtgcctcat cccctcttca gtgcaggaat 600
atgttagcgt tgtatcttgg gagaaagaca cagtctccat catcccagaa cacaggtttt 660
ttattaccta ccacggcggg ctgtacatct ctgacgtaca gaaggaggac gccctctcca 720
cctatcgctg catcaccaag cacaagtata gcggggagac ccggcagagc aatggggcac 780
gcctctctgt gacagaccct gctgagtcga tccccaccat cctggatggc ttccactccc 840
aggaagtgtg ggccggccac accgtggagc tgccctgcac cgcctcgggc taccctatcc 900
ccgccatccg ctggctcaag gatggccggc ccctcccggc tgacagccgc tggaccaagc 960
gcatcacagg gctgaccatc agcgacttgc ggaccgagga cagcggcacc tacatttgtg 1020
aggtcaccaa caccttcggt tcggcagagg ccacaggcat cctcatggtc attgatcccc 1080
ttcatgtgac cctgacacca aagaagctga agaccggcat tggcagcacg gtcatcctct 1140
cctgtgccct gacgggctcc ccagagttca ccatccgctg gtatcgcaac acggagctgg 1200
tgctgcctga cgaggccatc tccatccgcg ggctcagcaa cgagacgctg ctcatcacct 1260
cggcccagaa gagccattcc ggggcctacc agtgcttcgc tacccgcaag gcccagaccg 1320
cccaggactt tgccatcatt gcacttgagg atggcacgcc ccgcatcgtc tcgtccttca 1380
gcgagaaggt ggtcaaccccc ggggagcagt tctcactgat gtgtgcggcc aagggcgccc 1440
cgccccccac ggtcacctgg gccctcgacg atgagcccat cgtgcgggat ggcagccacc 1500
gcaccaacca gtacaccatg tcggacggca ccaccatcag ccacatgaac gtcacaggcc 1560
cccagatccg cgacgggggc gtgtaccggt gcacagcgcg gaacttggtg ggcagtgctg 1620
aatatcaggc gcgaataaac gtaagaggcc cacccagcat ccgggctatg cggaacatca 1680
cagcagtcgc cgggcgggac acccttatca actgcagggt catcggctat ccctactact 1740
ccatcaagtg gtacaaggat gccctgctgc tgccagacaa ccaccgccag gtggtgtttg 1800
agaatgggac cctcaagctg actgacgtgc agaagggcat ggatgagggg gagtacctgt 1860
gcagtgtcct catccagccc cagctctcca tcagccagag cgttcacgta gccgtcaaag 1920
tgccccctct gatccagccc ttcgaattcc cacccgcctc catcggccag ctgctctaca 1980
ttccctgtgt ggtgtcctcg ggggacatgc ccatccgtat cacctggagg aaggacggac 2040
aggtgatcat ctcaggctcg ggcgtgacca tcgagagcaa ggaattcatg agctccctgc 2100
agatctctag cgtctccctc aagcacaacg gcaactatac atgcatcgcc agcaacgcag 2160
ccgccaccgt gagccggggag cgccagctca tcgtgcgtgt gccccctcga tttgtggtgc 2220
aacccaacaa ccaggatggc atctacggca aagctggtgt gctcaactgc tcggtggacg 2280
gctaccccccc acccaaggtc atgtggaagc atgccaaggg gagcgggaac ccccaagcagt 2340
accacccctgt gcccctcact ggccgcatcc agatcctgcc caacagctcg ctgctgatcc 2400
gccacgtcct agaagaggac atcggctact acctctgcca ggccagcaac ggcgtaggca 2460
ccgacatcag caagtccatg ttcctcacag tcaagatccc ggccatgatc acttcccacc 2520
ccaacacacac catcgccatc aagggccatg cgaaggagct aaactgcacg gcacggggtg 2580
agcggcccat catcatccgc tgggagaagg gggacacagt catcgaccct gaccgcgtca 2640
tgcggtatgc catcgccacc aaggacaacg gcgacgaggt cgtctccaca ctgaagctca 2700
agcccgctga ccgtggggac tctgtgttct tcagctgcca tgccatcaac tcgtatgggg 2760
aggaccgggg cttgatccaa ctcactgtgc aagagccccc cgacccccca gagctggaga 2820
tccgggaggt gaaggcccgg agcatgaacc tgcgctggac ccagcgattc gacgggaaca 2880
gcatcatcac gggcttcgac attgaataca agaacaaatc agattcctgg gacttcaagc 2940
agtccacacg caacatctcc cccaccatca accaggccaa cattgtggac ttgcacccgg 3000
catctgtgta cagcatccgc atgtactctt tcaacaagat tggccgcagt gaaccaagca 3060
aggagctcac catcagcact gaggaggccg ctcccgatgg gccccccatg gatgttacct 3120
tgcagccagt gacctcacag agcatccagg tgacctggaa ggcacccaag aaggagctgc 3180
agaacggtgt catccggggc taccagattg gctacagaga gaacagcccc ggcagcaacg 3240
ggcagtacag catcgtggag atgaaggcca cgggggacag cgaggtctac accctggaca 3300
acctcaagaa gttcgcccag tatggggtgg tggtccaagc cttcaatcgg gctggcacgg 3360
ggccctcttc cagcgagatc aatgccacca ctctggagga tgtgcccagc cagccccctg 3420
agaacgtccg ggccctgtcc atcacttctg acgtggccgt catctcctgg tcagagcccc 3480
cgcgcagcac cctcaatggc gtcctcaaag gctatcgggt catcttctgg tccctctatg 3540
ttgatgggga gtggggcgag atgcagaaca tcaccaccac gcgggagcgg gtggagctgc 3600
ggggcatgga gaagttcacc aactacagcg tccaggtgct ggcctacacc caggctgggg 3660
acggcgtacg cagcagtgtg ctctacatcc agaccaagga ggacgttcca ggtccccctg 3720
ctggcatcaa agctgtccct tcatcagcta gcagtgtggt tgtgtcttgg ctccccccta 3780
ccaagcccaa cggggtgatc cgcaagtaca ccatcttctg ttccagcccc gggtctggcc 3840
agccggctcc cagcgagtac gagacgagtc cagagcagct cttctaccgg atcgcccacc 3900
taaaccgcgg tcagcagtat ctgctgtggg tggccgccgt cacctctgcc ggccggggca 3960
acagcagcga gaaggtgacc atcgagcctg ctggcaaggc cccagcaaag atcatctcct 4020
ttgggggcac cgtgacaaca ccttggatga aagatgttcg gctgccttgc aattcagtgg 4080
gagatccagc ccctgctgtg aagtggacca aggacagtga agactcggcc attccagtgt 4140
ccatggatgg gcaccggctc atccacacca atggcacact gctgctgcgt gcagtgaagg 4200
ctgaggactc tggctactac acgtgcacgg ccaccaacac tggtggcttt gacaccatca 4260
tcgtcaacct tctggtgcaa gttcccccgg accagccccg cctcactgtc tccaaaacct 4320
cagcttcgtc catcaccctg acctggattc caggtgacaa tgggggcagc tccatccgag 4380
gcttcgtgct acagtactcg gtggacaaca gcgaggagtg gaaggatgtg ttcatcagct 4440
ccagcgagcg ctccttcaag ctggacagcc tcaagtgtgg cacgtggtac aaggtgaagc 4500
tggcagccaa gaacagcgtg ggctctgggc gcatcagcga gatcatcgag gccaagaccc 4560
acgggcggga gccctccttc agcaaagacc aacacctctt cacccacatc aactccacgc 4620
atgctcggct taacctgcag ggctggaaca atgggggctg ccctatcaca gccatcgttc 4680
tggagtaccg gcccaagggg acctgggcct ggcagggcct ccgggccaac agctccgggg 4740
aggtgtttct gacggaactg cgagaggcca cgtggtacga gctgcgcatg agggcttgca 4800
acagtgcggg ctgcggcaat gaaacagccc agttcgccac cctggactac gatggcagca 4860
ccattccacc catcaagtct gctcaaggtg aaggggatga tgtgaagaag ctgttcacca 4920
tcggctgccc tgtcatcctg gccaacactgg gggtggcact gctcttcatc gtacgcaaga 4980
agaggaagga gaaacggctg aagcgactcc gagatgcaaa gagtttggca gaaatgttga 5040
taagcaagaa caatagaagc tttgacaccc ctgtgaaagg gccaccccag ggcccacggc 5100
tacacattga catccccagg gtccagctgc tcatcgagga caaagaaggc atcaagcaac 5160
tgggagatga caaggccacc atccctgtga cagatgctga gttcagccaa gctgtcaacc 5220
cacagagctt ctgtactggc gtctccttgc accacccaac cctcatccag agcacaggac 5280
ccctcatcga catgtctgac atccggccag gaaccaatcc agtgtccagg aagaatgtga 5340
agtcagccca cagcacccgg aaccggtact caagccagtg gaccctgacc aagtgccagg 5400
cctccacacc tgcccgcacc ctcacctccg actggcgcac cgtgggctcc cagcatggtg 5460
tcacggtcac tgagagtgac agctacagtg ccagcctgtc ccaggacaca gacaaaggaa 5520
ggaacagcat ggtgtccact gagagtgcct cttccaccta cgaggagctg gcccgggcct 5580
atgagcatgc caagctggag gagcagctgc agcacgccaa gtttgagatc accgagtgct 5640
tcatctctga cagttcctct gaccagatga ccacaggcac caacgagaac gccgacagca 5700
tgacatccat gagcacacccc tcagagcctg gcatctgccg ctttaccgcc tcaccacccca 5760
agccccagga tgcggaccgg ggcaaaaacg tggctgtgcc catccctcac cgggccaaca 5820
agagtgacta ctgcaacctg cccctgtatg ccaagtcaga ggccttcttt cgaaaggcag 5880
atggacgtga gccctgcccc gtggtcccac cccgtgaggc ctccatccgg aacctggctc 5940
gaacctacca cacccaggct cgccacctga ccctggaccc tgccagcaag tccttgggcc 6000
ttccccaccc aggggccccc gctgccgcct ccacagccac cttacctcag aggactctgg 6060
ccatgccagc ccccccagcc ggcacagccc ccccagcccc cggccccacc cctgctgagc 6120
cacccaccgc ccccagcgct gcccctccgg cccccagcac cgagcctcca cgagccgggg 6180
gcccacacac caaaatgggg ggctccaggg actcgcttct cgagatgagc acatcggggg 6240
tagggaggtc tcagaagcag ggggccgggg cctactccaa atcctacacc ctggtgtagg 6300
gcccgcagga agagcagcca cgcctggacc gcgccgcgcc gcagccccac acgccagctc 6360
ggctgttttt ctgcattatt tatattcaac tgacagacaa aaaccaacca acgacaaaac 6420
aaaaaccccc aatcatgaac gcctgtacat agaactcttt tgtacaaatg aaactatttt 6480
cttcttctcc atgaagccag ggcacaaaga atttgacagt acaagtcaaa tcccccaccc 6540
cacaaaatat gtgtggagat atatatacat atatagacag acaggaacgc gtccacgagc 6600
tatatatcta tatatttctc tcaccctatt ttgagacaga ggcacaaaga ctcagcaatt 6660
tttttccctc ctcctcacct tccccccagt ctaggtggtt ttgacaaaga ccaaaatccc 6720
aactcagaga cactgcatgc gattttactg ttccaagaaa accaggagtt gcttcaattt 6780
gcagatgctt atgtgttaat acctttttct atgaaaaaag accccagcgcc gtgtgcaata 6840
aaggttatgtttcta 6855
<![CDATA[ <210> 904]]>
<![CDATA[ <211> 15]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 904]]>
Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys
1 5 10 15
<![CDATA[ <210> 905]]>
<![CDATA[ <211> 105]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 905]]>
Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu
1 5 10 15
Gln Gln Val Thr Phe Ser Ser Ser Ser Val Gly Val Val Val Pro Cys Pro
20 25 30
Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly
35 40 45
Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly
50 55 60
Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile
65 70 75 80
His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile
85 90 95
Arg Ser Pro Asn Ile Arg Val Lys Ala
100 105
<![CDATA[ <210> 906]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 906]]>
Phe Arg Glu Pro Tyr Thr Val Arg Val Glu Asp Gln Arg Ser Met Arg
1 5 10 15
Gly Asn Val Ala Val Phe Lys Cys Leu Ile Pro Ser Ser Val Gln Glu
20 25 30
Tyr Val Ser Val Val Ser Trp Glu Lys Asp Thr Val Ser Ile Ile Pro
35 40 45
<![CDATA[ <210> 907]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213]]>> Human]]>
<br/>
<br/> <![CDATA[ <400>907]]>
<br/>
<br/> <![CDATA[His Arg Phe Phe Ile Thr Tyr His Gly Gly Leu Tyr Ile Ser Asp Val
1 5 10 15
Gln Lys Glu Asp Ala Leu Ser Thr Tyr Arg Cys Ile Thr Lys His Lys
20 25 30
Tyr Ser Gly Glu Thr Arg Gln Ser Asn Gly Ala Arg Leu Ser Val Thr
35 40 45
<![CDATA[ <210> 908]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 908]]>
Pro Ala Glu Ser Ile Pro Thr Ile Leu Asp Gly Phe His Ser Gln Glu
1 5 10 15
Val Trp Ala Gly His Thr Val Glu Leu Pro Cys Thr Ala Ser Gly Tyr
20 25 30
Pro Ile Pro Ala Ile Arg Trp Leu Lys Asp Gly Arg Pro Leu Pro Ala
35 40 45
Asp Ser Arg Trp Thr Lys Arg Ile Thr Gly Leu Thr Ile Ser Asp Leu
50 55 60
Arg Thr Glu Asp Ser Gly Thr Tyr Ile Cys Glu Val Thr Asn Thr Phe
65 70 75 80
Gly Ser Ala Glu Ala Thr Gly Ile Leu Met Val Ile
85 90
<![CDATA[ <210> 909]]>
<![CDATA[ <211> 91]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 909]]>
Pro Leu His Val Thr Leu Thr Pro Lys Lys Leu Lys Thr Gly Ile Gly
1 5 10 15
Ser Thr Val Ile Leu Ser Cys Ala Leu Thr Gly Ser Pro Glu Phe Thr
20 25 30
Ile Arg Trp Tyr Arg Asn Thr Glu Leu Val Leu Pro Asp Glu Ala Ile
35 40 45
Ser Ile Arg Gly Leu Ser Asn Glu Thr Leu Leu Ile Thr Ser Ala Gln
50 55 60
Lys Ser His Ser Gly Ala Tyr Gln Cys Phe Ala Thr Arg Lys Ala Gln
65 70 75 80
Thr Ala Gln Asp Phe Ala Ile Ile Ala Leu Glu
85 90
<![CDATA[ <210> 910]]>
<![CDATA[ <211> 98]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 910]]>
Gly Thr Pro Arg Ile Val Ser Ser Phe Ser Glu Lys Val Val Asn Pro
1 5 10 15
Gly Glu Gln Phe Ser Leu Met Cys Ala Ala Lys Gly Ala Pro Pro Pro
20 25 30
Thr Val Thr Trp Ala Leu Asp Asp Glu Pro Ile Val Arg Asp Gly Ser
35 40 45
His Arg Thr Asn Gln Tyr Thr Met Ser Asp Gly Thr Thr Ile Ser His
50 55 60
Met Asn Val Thr Gly Pro Gln Ile Arg Asp Gly Gly Val Tyr Arg Cys
65 70 75 80
Thr Ala Arg Asn Leu Val Gly Ser Ala Glu Tyr Gln Ala Arg Ile Asn
85 90 95
Val Arg
<![CDATA[ <210> 911]]>
<![CDATA[ <211> 90]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> ]]> 911
Pro Pro Ser Ile Arg Ala Met Arg Asn Ile Thr Ala Val Ala Gly Arg
1 5 10 15
Asp Thr Leu Ile Asn Cys Arg Val Ile Gly Tyr Pro Tyr Tyr Ser Ile
20 25 30
Lys Trp Tyr Lys Asp Ala Leu Leu Leu Pro Asp Asn His Arg Gln Val
35 40 45
Val Phe Glu Asn Gly Thr Leu Lys Leu Thr Asp Val Gln Lys Gly Met
50 55 60
Asp Glu Gly Glu Tyr Leu Cys Ser Val Leu Ile Gln Pro Gln Leu Ser
65 70 75 80
Ile Ser Gln Ser Val His Val Ala Val Lys
85 90
<![CDATA[ <210> 912]]>
<![CDATA[ <211> 92]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 912]]>
Pro Pro Leu Ile Gln Pro Phe Glu Phe Pro Pro Ala Ser Ile Gly Gln
1 5 10 15
Leu Leu Tyr Ile Pro Cys Val Val Ser Ser Gly Asp Met Pro Ile Arg
20 25 30
Ile Thr Trp Arg Lys Asp Gly Gln Val Ile Ile Ser Gly Ser Gly Val
35 40 45
Thr Ile Glu Ser Lys Glu Phe Met Ser Ser Leu Gln Ile Ser Ser Ser Val
50 55 60
Ser Leu Lys His Asn Gly Asn Tyr Thr Cys Ile Ala Ser Asn Ala Ala
65 70 75 80
Ala Thr Val Ser Arg Glu Arg Gln Leu Ile Val Arg
85 90
<![CDATA[ <210> 913]]>
<![CDATA[ <211> 39]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 913]]>
Pro Pro Arg Phe Val Val Gln Pro Asn Asn Gln Asp Gly Ile Tyr Gly
1 5 10 15
Lys Ala Gly Val Leu Asn Cys Ser Val Asp Gly Tyr Pro Pro Pro Lys
20 25 30
Val Met Trp Lys His Ala Lys
35
<![CDATA[ <210> 914]]>
<![CDATA[ <211> 58]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 914]]>
Ser Gly Asn Pro Gln Gln Tyr His Pro Val Pro Leu Thr Gly Arg Ile
1 5 10 15
Gln Ile Leu Pro Asn Ser Ser Leu Leu Ile Arg His Val Leu Glu Glu
20 25 30
Asp Ile Gly Tyr Tyr Tyr Leu Cys Gln Ala Ser Asn Gly Val Gly Thr Asp
35 40 45
Ile Ser Lys Ser Met Phe Leu Thr Val Lys
50 55
<![CDATA[ <210> 915]]>
<![CDATA[ <211> 66]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 915]]>
Pro Ala Met Ile Thr Ser His Pro Asn Thr Thr Ile Ala Ile Lys Gly
1 5 10 15
His Ala Lys Glu Leu Asn Cys Thr Ala Arg Gly Glu Arg Pro Ile Ile
20 25 30
Ile Arg Trp Glu Lys Gly Asp Thr Val Ile Asp Pro Asp Arg Val Met
35 40 45
Arg Tyr Ala Ile Ala Thr Lys Asp Asn Gly Asp Glu Val Val Ser Thr
50 55 60
Leu Lys
65
<![CDATA[ <210> 916]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 916]]>
Leu Lys Pro Ala Asp Arg Gly Asp Ser Val Phe Phe Ser Cys His Ala
1 5 10 15
Ile Asn Ser Tyr Gly Glu Asp Arg Gly Leu Ile Gln Leu Thr Val Gln
20 25 30
<![CDATA[ <210> 917]]>
<![CDATA[ <211> 42]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 917]]>
Pro Pro Asp Pro Pro Glu Leu Glu Ile Arg Glu Val Lys Ala Arg Ser
1 5 10 15
Met Asn Leu Arg Trp Thr Gln Arg Phe Asp Gly Asn Ser Ile Ile Thr
20 25 30
Gly Phe Asp Ile Glu Tyr Lys Asn Lys Ser
35 40
<![CDATA[ <210> 918]]>
<![CDATA[ <211> 55]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 918]]>
Ser Trp Asp Phe Lys Gln Ser Thr Arg Asn Ile Ser Pro Thr Ile Asn
1 5 10 15
Gln Ala Asn Ile Val Asp Leu His Pro Ala Ser Val Tyr Ser Ile Arg
20 25 30
Met Tyr Ser Phe Asn Lys Ile Gly Arg Ser Glu Pro Ser Lys Glu Leu
35 40 45
Thr Ile Ser Thr Glu Glu Ala
50 55
<![CDATA[ <210> 919]]>
<![CDATA[ <211> 23]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 919]]>
Pro Asp Gly Pro Pro Met Asp Val Thr Leu Gln Pro Val Thr Ser Gln
1 5 10 15
Ser Ile Gln Val Thr Trp Lys
20
<![CDATA[ <210> 920]]>
<![CDATA[ <211> 80]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 920]]>
Ala Pro Lys Lys Glu Leu Gln Asn Gly Val Ile Arg Gly Tyr Gln Ile
1 5 10 15
Gly Tyr Arg Glu Asn Ser Pro Gly Ser Asn Gly Gly Gln Tyr Ser Ile Val
20 25 30
Glu Met Lys Ala Thr Gly Asp Ser Glu Val Tyr Thr Leu Asp Asn Leu
35 40 45
Lys Lys Phe Ala Gln Tyr Gly Val Val Val Gln Ala Phe Asn Arg Ala
50 55 60
Gly Thr Gly Pro Ser Ser Ser Glu Ile Asn Ala Thr Thr Leu Glu Asp
65 70 75 80
<![CDATA[ <210> 921]]>
<![CDATA[ <211> 48]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 921]]>
Pro Ser Gln Pro Pro Glu Asn Val Arg Ala Leu Ser Ile Thr Ser Asp
1 5 10 15
Val Ala Val Ile Ser Trp Ser Glu Pro Pro Arg Ser Thr Leu Asn Gly
20 25 30
Val Leu Lys Gly Tyr Arg Val Ile Phe Trp Ser Leu Tyr Val Asp Gly
35 40 45
<![CDATA[ <210> 922]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 922]]>
Trp Gly Glu Met Gln Asn Ile Thr Thr Thr Arg Glu Arg Val Glu Leu
1 5 10 15
Arg Gly Met Glu Lys Phe Thr Asn Tyr Ser Val Gln Val Leu Ala Tyr
20 25 30
Thr Gln Ala Gly Asp Gly Val Arg Ser Ser Val Leu Tyr Ile Gln Thr
35 40 45
Lys Glu Asp
50
<![CDATA[ <210> 923]]>
<![CDATA[ <211> 46]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 923]]>
Pro Gly Pro Pro Ala Gly Ile Lys Ala Val Pro Ser Ser Ala Ser Ser
1 5 10 15
Val Val Val Ser Trp Leu Pro Pro Thr Lys Pro Asn Gly Val Ile Arg
20 25 30
Lys Tyr Thr Ile Phe Cys Ser Ser Pro Gly Ser Gly Gln Pro
35 40 45
<![CDATA[ <210> 924]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 924]]>
Ala Pro Ser Glu Tyr Glu Thr Ser Pro Glu Gln Leu Phe Tyr Arg Ile
1 5 10 15
Ala His Leu Asn Arg Gly Gln Gln Tyr Leu Leu Trp Val Ala Ala Val
20 25 30
Thr Ser Ala Gly Arg Gly Asn Ser Ser Glu Lys Val Thr Ile Glu Pro
35 40 45
Ala Gly Lys
50
<![CDATA[ <210> 925]]>
<![CDATA[ <211> 38]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 925]]>
Pro Ala Lys Ile Ile Ser Phe Gly Gly Thr Val Thr Thr Pro Trp Met
1 5 10 15
Lys Asp Val Arg Leu Pro Cys Asn Ser Val Gly Asp Pro Ala Pro Ala
20 25 30
Val Lys Trp Thr Lys Asp
35
<![CDATA[ <210> 926]]>
<![CDATA[ <211> 54]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 926]]>
Glu Asp Ser Ala Ile Pro Val Ser Met Asp Gly His Arg Leu Ile His
1 5 10 15
Thr Asn Gly Thr Leu Leu Leu Arg Ala Val Lys Ala Glu Asp Ser Gly
20 25 30
Tyr Tyr Thr Cys Thr Ala Thr Asn Thr Gly Gly Phe Asp Thr Ile Ile
35 40 45
Val Asn Leu Leu Val Gln
50
<![CDATA[ <210> 927]]>
<![CDATA[ <211> 32]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 927]]>
Pro Pro Asp Gln Pro Arg Leu Thr Val Ser Lys Thr Ser Ala Ser Ser
1 5 10 15
Ile Thr Leu Thr Trp Ile Pro Gly Asp Asn Gly Gly Ser Ser Ile Arg
20 25 30
<![CDATA[ <210> 928]]>
<![CDATA[ <211> 62]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 928]]>
Phe Val Leu Gln Tyr Ser Val Asp Asn Ser Glu Glu Trp Lys Asp Val
1 5 10 15
Phe Ile Ser Ser Ser Glu Arg Ser Phe Lys Leu Asp Ser Leu Lys Cys
20 25 30
Gly Thr Trp Tyr Lys Val Lys Leu Ala Ala Lys Asn Ser Val Gly Ser
35 40 45
Gly Arg Ile Ser Glu Ile Ile Glu Ala Lys Thr His Gly Arg
50 55 60
<![CDATA[ <210> 929]]>
<![CDATA[ <211> 95]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 929]]>
Pro Ser Phe Ser Lys Asp Gln His Leu Phe Thr His Ile Asn Ser Thr
1 5 10 15
His Ala Arg Leu Asn Leu Gln Gly Trp Asn Asn Gly Gly Cys Pro Ile
20 25 30
Thr Ala Ile Val Leu Glu Tyr Arg Pro Lys Gly Thr Trp Ala Trp Gln
35 40 45
Gly Leu Arg Ala Asn Ser Ser Gly Glu Val Phe Leu Thr Glu Leu Arg
50 55 60
Glu Ala Thr Trp Tyr Glu Leu Arg Met Arg Ala Cys Asn Ser Ala Gly
65 70 75 80
Cys Gly Asn Glu Thr Ala Gln Phe Ala Thr Leu Asp Tyr Asp Gly
85 90 95
<![CDATA[ <210> 930]]>
<![CDATA[ <211> 51]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 930]]>
Thr Ile Pro Pro Ile Lys Ser Ala Gln Gly Glu Gly Asp Asp Val Lys
1 5 10 15
Lys Leu Phe Thr Ile Gly Cys Pro Val Ile Leu Ala Thr Leu Gly Val
20 25 30
Ala Leu Leu Phe Ile Val Arg Lys Lys Arg Lys Glu Lys Arg Leu Lys
35 40 45
Arg Leu Arg
50
<![CDATA[ <210> 931]]>
<![CDATA[ <211> 9]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 931]]>
Ala Lys Ser Leu Ala Glu Met Leu Ile
1 5
<![CDATA[ <210> 932]]>
<![CDATA[ <211> 39]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 932]]>
Lys Asn Asn Arg Ser Phe Asp Thr Pro Val Lys Gly Pro Pro Gln Gly
1 5 10 15
Pro Arg Leu His Ile Asp Ile Pro Arg Val Gln Leu Leu Ile Glu Asp
20 25 30
Lys Glu Gly Ile Lys Gln Leu
35
<![CDATA[ <210> 933]]>
<![CDATA[ <211> 50]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 933]]>
Asp Asp Lys Ala Thr Ile Pro Val Thr Asp Ala Glu Phe Ser Gln Ala
1 5 10 15
Val Asn Pro Gln Ser Phe Cys Thr Gly Val Ser Leu His His Pro Thr
20 25 30
Leu Ile Gln Ser Thr Gly Pro Leu Ile Asp Met Ser Asp Ile Arg Pro
35 40 45
Gly Thr
50
<![CDATA[ <210> 934]]>
<![CDATA[ <211> 64]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 934]]>
Pro Val Ser Arg Lys Asn Val Lys Ser Ala His Ser Thr Arg Asn Arg
1 5 10 15
Tyr Ser Ser Gln Trp Thr Leu Thr Lys Cys Gln Ala Ser Thr Pro Ala
20 25 30
Arg Thr Leu Thr Ser Asp Trp Arg Thr Val Gly Ser Gln His Gly Val
35 40 45
Thr Val Thr Glu Ser Asp Ser Tyr Ser Ala Ser Leu Ser Gln Asp Thr
50 55 60
<![CDATA[ <210> 935]]>
<![CDATA[ <211> 103]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 935]]>
Lys Gly Arg Asn Ser Met Val Ser Thr Glu Ser Ala Ser Ser Thr Tyr
1 5 10 15
Glu Glu Leu Ala Arg Ala Tyr Glu His Ala Lys Leu Glu Glu Gln Leu
20 25 30
Gln His Ala Lys Phe Glu Ile Thr Glu Cys Phe Ile Ser Asp Ser Ser
35 40 45
Ser Asp Gln Met Thr Thr Gly Thr Asn Glu Asn Ala Asp Ser Met Thr
50 55 60
Ser Met Ser Thr Pro Ser Glu Pro Gly Ile Cys Arg Phe Thr Ala Ser
65 70 75 80
Pro Pro Lys Pro Gln Asp Ala Asp Arg Gly Lys Asn Val Ala Val Pro
85 90 95
Ile Pro His Arg Ala Asn Lys
100
<![CDATA[ <210> 936]]>
<![CDATA[ <211> 157]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 936]]>
Asp Tyr Cys Asn Leu Pro Leu Tyr Ala Lys Ser Glu Ala Phe Phe Arg
1 5 10 15
Lys Ala Asp Gly Arg Glu Pro Cys Pro Val Val Pro Pro Arg Glu Ala
20 25 30
Ser Ile Arg Asn Leu Ala Arg Thr Tyr His Thr Gln Ala Arg His Leu
35 40 45
Thr Leu Asp Pro Ala Ser Lys Ser Leu Gly Leu Pro His Pro Gly Ala
50 55 60
Pro Ala Ala Ala Ser Thr Ala Thr Leu Pro Gln Arg Thr Leu Ala Met
65 70 75 80
Pro Ala Pro Pro Ala Gly Thr Ala Pro Pro Ala Pro Gly Pro Thr Pro
85 90 95
Ala Glu Pro Pro Thr Ala Pro Ser Ala Ala Pro Pro Ala Pro Ser Thr
100 105 110
Glu Pro Pro Arg Ala Gly Gly Pro His Thr Lys Met Gly Gly Ser Arg
115 120 125
Asp Ser Leu Leu Glu Met Ser Thr Ser Gly Val Gly Arg Ser Gln Lys
130 135 140
Gln Gly Ala Gly Ala Tyr Ser Lys Ser Tyr Thr Leu Val
145 150 155
<![CDATA[ <210> 937]]>
<![CDATA[ <211> 2053]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 937]]> Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys Ala 1 5 10 15 Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu 20 25 30 Gln Gln Val Thr Phe Ser Ser Ser Val Gly Val Val Pro Cys Pro 35 40 45 Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly 50 55 60 Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly 65 70 75 80 Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile 85 90 95 His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile 100 105 110 Arg Ser Pro Asn Ile Arg Val Lys Ala Val Phe Arg Glu Pro Tyr Thr 115 120 125 Val Arg Val Glu Asp Gln Arg Ser Met Arg Gly Asn Val Ala Val Phe 130 135 140 Lys Cys Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser 145 150 155 16 0 Trp Glu Lys Asp Thr Val Ser Ile Ile Pro Glu His Arg Phe Phe Ile 165 170 175 Thr Tyr His Gly Gly Leu Tyr Ile Ser Asp Val Gln Lys Glu Asp Ala 180 185 190 Leu Ser Thr Tyr Arg Cys Ile Thr Lys His Lys Tyr Serg Glu Thr 195 ARG Gln Sern GLY Ala Ala ARG Leu Ser Val THR ASP Pro Ala Glu Ser 210 215 220 Ile Pro ThR ILEU ASP GLY PHE HIS GLN Glu Val Tra Gly 225 230 23 23 5 240 His Thr Val Glu Leu Pro Cys Thr Ala Ser Gly Tyr Pro Ile Pro Ala 245 250 255 Ile Arg Trp Leu Lys Asp Gly Arg Pro Leu Pro Ala Asp Ser Arg Trp 260 265 270 Thr Lys Arg Ile Thr Gly Leu Thr Ile Ser Asp Leu Arg Thr Glu Asp 275 280 285 Ser Gly Thr Tyr Ile Cys Glu Val Thr Asn Thr Phe Gly Ser Ala Glu 290 295 300 Ala Thr Gly Ile Leu Met Val Ile Asp Pro Leu His Val Thr Leu Thr 305 310 315 320 Pro Lys Lys Leu Lys Thr Gly Ile Gly Ser Thr Val Ile Leu Ser Cys 325 330 335 Ala Leu Thr Gly Ser Pro Glu Phe Thr Ile Arg Trp Tyr Arg Asn Thr 340 345 350 Glu Leu Val Leu Pro Asp Glu Ala Ile Ser Ile Arg Gly Leu Ser Asn 355 360 365 Glu Thr Leu Leu Ile Thr Ser Ala Gln Lys Ser His Ser Gly Ala Tyr 370 375 380 Gln Cys Phe Ala Thr Arg Lys Ala Gln Thr Ala Gln Asp Phe Ala Ile 385 390 395 400 Ile Ala Leu Glu Asp Gly Thr Pro Arg Ile Val Ser Ser Phe Ser Glu 405 410 415 Lys Val Val Asn Pro Gly Glu Gln Phe Ser Leu Met Cys Ala Ala Lys 420 425 430 Gly Ala Pro Pro Pro Thr Val Thr Trp Ala Leu Asp Asp Glu Pro Ile 435 440 445 Val Arg Asp Gly Ser His Arg Thr Asn Gln Tyr Thr Met Ser Asp Gly 450 455 460 Thr Thr Ile Ser His Met Asn Val Thr Gly Pro Gln Ile Arg Asp Gly 465 470 475 480 Gly Val Tyr Arg Cys Thr Ala Arg Asn Leu Val Gly Ser Ala Glu Tyr 485 490 495 Gln ARG Ile Asn Val ARG GLY Pro Pro Serg Ala Met ARG 50055 510 ASN Ile Thr Val Ala Gly ARG ARG ARL Leu Ile Asn CYS ARG Val 515 525 ILE GLY T. YR Pro Tyr Tyr Serle Lys Trp Tyr Lys Asp Ala Leu Leu 530 535 540 Leu Pro Asp Asn His Arg Gln Val Val Phe Glu Asn Gly Thr Leu Lys 545 550 555 560 Leu Thr Asp Val Gln Lys Gly Met Asp Glu Gly Glu Tyr Leu Cys Ser 565 570 5 75 Val Leu Ile Gln Pro Gln Leu Ser Ile Ser Gln Ser Val His Val Ala 580 585 590 Val Lys Val Pro Pro Leu Ile Gln Pro Phe Glu Phe Pro Pro Ala Ser 595 600 605 Ile Gly Gln Leu Leu Tyr Ile Pro Cys Val Val Ser Ser GLY ASP MET 610 620 Pro ILE ARG Ile Trp ARG LYS ASP GLN Val ILE ILE ILE ILE ILE Serite 625 635 640 Serge Val THR Ile Glu Phe Met Serou Gln Ile 645 650 65 65 65 65 65 65 65 65 65 65 65 65 65 65 65 65 65 65 65 5 Ser Ser Val Ser Leu Lys His Asn Gly Asn Tyr Thr Cys Ile Ala Ser 660 665 670 Asn Ala Ala Ala Thr Val Ser Arg Glu Arg Gln Leu Ile Val Arg Val 675 680 685 Pro Pro Arg Phe Val Val Gln Pro Asn Asn Asn Gln Asp Gly Ile Tyr Gly 690 695 700 Lys Ala Gly Val Leu Asn Cys Ser Val Asp Gly Tyr Pro Pro Pro Lys 705 710 715 720 Val Met Trp Lys His Ala Lys Gly Ser Gly Asn Pro Gln Gln Tyr His 725 730 735 Pro Val Pro Leu Thr Gly Arg Ile Gln Ile Leu Pro Asn Serou 740 745 750 Leu Ile ARG His Val Leu Glu GLU ASP ILE GLY Tyr Leu Cys Gln 755 760 760 Ala Sern Gly Val Gly Thr Lyser Met PHE Leu ThR 770 775 780 Val LYS Ile Pro Ala Met Ile Thr Ser His Pro Asn Thr Thr Ile Ala 785 790 795 800 Ile Lys Gly His Ala Lys Glu Leu Asn Cys Thr Ala Arg Gly Glu Arg 805 810 815 Pro Ile Ile Ile Arg Trp Glu Lys Gly Asp Thr Val Ile Asp Pro Asp 820 825 830 Arg Val Met Arg Tyr Ala Ile Ala Thr Lys Asp Asn Gly Asp Glu Val 835 840 845 Val Ser Thr Leu Lys Leu Lys Pro Ala Asp Arg Gly Asp Ser Val Phe 850 855 860 Phe Ser Cys His Ala Ile Asn Ser Tyr Gly Glu Asp Arg Gly Leu Ile 865 870 875 880 Gln Leu Thr Val Gln Glu Pro Pro Asp Pro Pro Glu Leu Glu Ile Arg 885 890 895 Glu Val Lys Ala Arg Ser Met Asn Leu Arg Trp Thr Gln Arg Phe Asp 900 905 910 Gly Asn Ser Ile Ile Thr Gly Phe Asp Ile Glu Tyr Lys Asn Lys Ser 915 920 925 Asp Ser Trp Asp Phe Lys Gln Ser Thr Arg Asn Ile Ser Pro Thr Ile 930 935 940 Asn Gln Ala Asn Ile Val Asp Leu His Pro Ala Ser Val Tyr Ser Ile 945 950 955 960 Arg Met Tyr Ser Phe Asn Lys Ile Gly Arg Ser Glu Pro Ser Lys Glu 965 970 975 Leu Thr Ile Ser Thr Glu Glu Ala Ala Pro Asp Gly Pro Pro Met Asp 980 985 990 Val Thr Leu Gln Pro Val Thr Ser Gln Ser Ile Gln Val Thr Trp Lys 995 1000 1005 Ala Pro Lys Lys Glu Leu Gln Asn Gly Val Ile Arg Gly Tyr Gln 1010 1015 1020 Ile Gly Tyr Arg Glu Asn Ser Pro Gly Ser Asn Gly Gln Tyr Ser 1025 1030 1035 Ile Val Glu Met Lys Ala Thr Gly Asp Ser Glu Val Tyr Thr Leu 1040 1045 1050 Asp Asn Leu Lys Lys Phe Ala Gln Tyr Gly Val Val Val Gln Ala 1055 1060 1065 Phe Asn Arg Ala Gly Th r Gly Pro Ser Ser Ser Ser Glu Ile Asn Ala 1070 1075 1080 Thr Thr Leu Glu Asp Val Pro Ser Gln Pro Pro Glu Asn Val Arg 1085 1090 1095 Ala Leu Ser Ile Thr Ser Asp Val Ala Val Ile Ser Trp Ser Glu 1100 1105 1110 Pro Pro Arg Ser Thr Leu Asn Gly Val Leu Lys Gly Tyr Arg Val 1115 1120 1125 Ile Phe Trp Ser Leu Tyr Val Asp Gly Glu Trp Gly Glu Met Gln 1130 1135 1140 Asn Ile Thr Thr Thr Arg Glu Arg Val Glu Leu Arg Gly Met Glu 1145 1150 1155 Lys Phe Thr Asn Tyr Ser Val Gln Val Leu Ala Tyr Thr Gln Ala 1160 1165 1170 Gly Asp Gly Val Arg Ser Ser Val Leu Tyr Ile Gln Thr Lys Glu 1175 1180 1185 Asp Val Pro Gly Pro Pro Ala Gly Ile Lys Ala Val Pro Ser Ser 1190 1195 1200 Ala Ser Ser Val Val Val Ser Trp Leu Pro Pro Thr Lys Pro Asn 1205 1210 1215 Gly Val Ile Arg Lys Tyr Thr Ile Phe Cys Ser Ser Pro Gly Ser 1220 1225 1230 Gly Gln Pro Ala Pro Ser Glu Tyr Glu Thr Ser Pro Glu Gln Leu 1235 1240 1245 Phe Tyr Arg Ile Ala His Leu Asn Arg Gly Gln Gln Tyr Leu Leu 1250 1255 1260 Trp Val Ala Ala Val Thr Ser Ala Gly Arg Gly Asn Ser Ser Glu 1265 1270 1275 Lys Val Th r Ile Glu Pro Ala Gly Lys Ala Pro Ala Lys Ile Ile 1280 1285 1290 Ser Phe Gly Gly Thr Val Thr Thr Pro Trp Met Lys Asp Val Arg 1295 1300 1305 Leu Pro Cys Asn Ser Val Gly Asp Pro Ala Pro Ala Val Lys Trp 1310 1315 1320 Thr Lys Asp S er Glu Asp Ser Ala Ile Pro Val Ser Met Asp Gly 1325 1330 1335 His Arg Leu Ile His Thr Asn Gly Thr Leu Leu Leu Arg Ala Val 1340 1345 1350 Lys Ala Glu Asp Ser Gly Tyr Tyr Thr Cys Thr Ala Thr Asn Thr 1355 136 0 1365 Gly Gly Phe Asp Thr Ile Ile Val Asn Leu Leu Val Gln Val Pro 1370 1375 1380 Pro Asp Gln Pro Arg Leu Thr Val Ser Lys Thr Ser Ala Ser Ser 1385 1390 1395 Ile Thr Leu Thr Trp Ile Pro Gly Asp Asn Gly Gly Ser Ser Ile 1400 1405 1410 Arg Gly Phe Val Leu Gln Tyr Ser Val Asp Asn Ser Glu Glu Trp 1415 1420 1425 Lys Asp Val Phe Ile Ser Ser Ser Ser Glu Arg Ser Phe Lys Leu Asp 1430 1435 1440 Ser Leu Lys Cys Gly Thr Trp Tyr Lys Val Lys Leu Ala Ala Lys 1445 1450 1455 Asn Ser Val Gly Ser Gly Arg Ile Ser Glu Ile Ile Glu Ala Lys 1460 1465 1470 Thr His Gly Arg Glu Pro Ser Phe Ser Lys Asp Gln His Leu Phe 1475 1480 1485 Thr His Ile As n Ser Thr His Ala Arg Leu Asn Leu Gln Gly Trp 1490 1495 1500 Asn Asn Gly Gly Cys Pro Ile Thr Ala Ile Val Leu Glu Tyr Arg 1505 1510 1515 Pro Lys Gly Thr Trp Ala Trp Gln Gly Leu Arg Ala Asn Ser Ser 1520 15 25 1530 Gly Glu Val Phe Leu Thr Glu Leu Arg Glu Ala Thr Trp Tyr Glu 1535 1540 1545 Leu Arg Met Arg Ala Cys Asn Ser Ala Gly Cys Gly Asn Glu Thr 1550 1555 1560 Ala Gln Phe Ala Thr Leu Asp Tyr Asp Gly Ser Thr Ile Pro 156 5 1570 1575 Ile Lys Ser Ala Gln Gly Glu Gly Asp Val Lys Lys Leu Phe 1580 1585 1590 Thr Ile Gly Cys Pro Val Ile Leu Ala Thr Leu Gly Val Ala Leu 1595 1600 1605 Leu Phe Val Arg Lys Lys Arg Lys Glu Lys Arg Leu Lys Arg 1610 1615 1620 Leu Arg Asp Ala Lys Ser Leu Ala Glu Met Leu Ile Ser Lys Asn 1625 1630 1635 Asn Arg Ser Phe Asp Thr Pro Val Lys Gly Pro Pro Gln Gly Pro 1640 1645 1650 Arg Leu His Ile Asp Ile Pro Ar g Val Gln Leu Leu Ile Glu Asp 1655 1660 1665 Lys Glu Gly Ile Lys Gln Leu Gly Asp Asp Lys Ala Thr Ile Pro 1670 1675 1680 Val Thr Asp Ala Glu Phe Ser Gln Ala Val Asn Pro Gln Ser Phe 1685 1690 1695 Cys Thr Gly Val Ser Leu His His Pro Thr Leu Ile Gln Ser Thr 1700 1705 1710 Gly Pro Leu Ile Asp Met Ser Asp Ile Arg Pro Gly Thr Asn Pro 1715 1720 1725 Val Ser Arg Lys Asn Val Lys Ser Ala His Ser Thr Arg Asn Arg 1730 1735 1 740 Tyr Ser Ser Gln Trp Thr Leu Thr Lys Cys Gln Ala Ser Thr Pro 1745 1750 1755 Ala Arg Thr Leu Thr Ser Asp Trp Arg Thr Val Gly Ser Gln His 1760 1765 1770 Gly Val Thr Val Thr Glu Ser Asp Ser Tyr Ser Ala Ser Leu S er 1775 1780 1785 Gln Asp Thr Asp Lys Gly Arg Asn Ser Met Val Ser Thr Glu Ser 1790 1795 1800 Ala Ser Ser Thr Tyr Glu Glu Leu Ala Arg Ala Tyr Glu His Ala 1805 1810 1815 Lys Leu Glu Glu Gln Leu Gln His Ala Lys Phe Glu Ile Thr Glu 1820 1825 1830 Cys Phe Ile Ser Asp Ser Ser Ser Asp Gln Met Thr Thr Gly Thr 1835 1840 1845 Asn Glu Asn Ala Asp Ser Met Thr Ser Ser Met Thr Pro Ser Glu 1850 1855 1860 Pro Gly Ile Cys Arg Phe Thr Ala Ser Pro Pro Lys Pro Gln Asp 1865 1870 1875 Ala Asp Arg Gly Lys Asn Val Ala Val Pro Ile Pro His Arg Ala 1880 1885 1890 Asn Lys Ser Asp Tyr Cys Asn Leu Pro Leu Tyr Ala Lys Ser Glu 1895 1900 1905 A la Phe Phe ARG LYS ALA ASP GLY ARG GLU Pro Cys Pro Val Val 1910 1920 Pro Pro ARG GLU Ala Serg Asn Leu Ala Ala ARG THR TYR HIS P Pro Ala Ser Lys Seru 1940 1945 1950 Gly Leu Pro His Pro Gly Ala Pro Ala Ala Ala Ser Thr Ala Thr 1955 1960 1965 Leu Pro Gln Arg Thr Leu Ala Met Pro Ala Pro Pro Ala Gly Thr 1970 1975 1980 Ala Pro Pro Ala Pro Gly Pro Thr Pro Ala Glu Pro Pro Thr Ala 1985 1990 1995 Pro Ser Ala Ala Pro Pro Ala Pro Ser Thr Glu Pro Pro Arg Ala 2000 2005 2010 Gly Gly Pro His Thr Lys Met Gly Gly Ser Arg Asp Ser Leu Leu 2015 2020 2025 Glu Met Ser Thr Ser Gly Val Gly Arg Ser Gln Lys Gln Gly Ala 2030 2035 2040 Gly Ala Tyr Ser Lys Ser Tyr Thr Leu Val 2045 2050 <![CDATA[ <210> 938]]>
<![CDATA[ <211> 648]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 938]]>
agccgagcgc tgggctgagg agcagagaga gcggggcgcc gagtgcgggc ggctgggagc 60
gcgctgagcg ggggagaggc gctgccgcac ggccggccac aggacccacct ccccggagaa 120
tagggcctct ttatggcatg tggctggtaa ctttcctcct gctcctggac tctttacaca 180
aagcccgccc tgaagatgtt ggcaccagcc tctactttgt aaatgactcc ttgcagcagg 240
tgaccttttc cagctccgtgggggtggtgg tgccctgccc ggccgcgggc tcccccagcg 300
cggcccttcg atggtacctg gccacagggg acgacatcta cgacgtgccg cacatccggc 360
acgtccacgc caacgggacg ctgcagctct accccttctc cccctccgcc ttcaatagct 420
ttatccacga caatgactac ttctgcaccg cggagaacgc tgccggcaag atccggagcc 480
ccaacatccg cgtcaaagca gttttcaggg aaccctacac cgtccgggtg gaggatcaaa 540
ggtcaatgcg tggcaacgtg gccgtcttca agtgcctcat cccctcttca gtgcaggaat 600
atgttagcgt tgtatcttgg gagaaagaca cagtctccat catcccag 648
<![CDATA[ <210> 939]]>
<![CDATA[ <211> 170]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 939]]>
Met Trp Leu Val Thr Phe Leu Leu Leu Leu Asp Ser Leu His Lys Ala
1 5 10 15
Arg Pro Glu Asp Val Gly Thr Ser Leu Tyr Phe Val Asn Asp Ser Leu
20 25 30
Gln Gln Val Thr Phe Ser Ser Ser Ser Val Gly Val Val Val Pro Cys Pro
35 40 45
Ala Ala Gly Ser Pro Ser Ala Ala Leu Arg Trp Tyr Leu Ala Thr Gly
50 55 60
Asp Asp Ile Tyr Asp Val Pro His Ile Arg His Val His Ala Asn Gly
65 70 75 80
Thr Leu Gln Leu Tyr Pro Phe Ser Pro Ser Ala Phe Asn Ser Phe Ile
85 90 95
His Asp Asn Asp Tyr Phe Cys Thr Ala Glu Asn Ala Ala Gly Lys Ile
100 105 110
Arg Ser Pro Asn Ile Arg Val Lys Ala Val Phe Arg Glu Pro Tyr Thr
115 120 125
Val Arg Val Glu Asp Gln Arg Ser Met Arg Gly Asn Val Ala Val Phe
130 135 140
Lys Cys Leu Ile Pro Ser Ser Val Gln Glu Tyr Val Ser Val Val Ser
145 150 155 160
Trp Glu Lys Asp Thr Val Ser Ile Ile Pro
165 170
<![CDATA[ <210> 940]]>
<![CDATA[ <211> 485]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 940]]>
actcgccgca gcctgcgcgc cttctccagt ccgcggtgcc atggcccccg cccgtctgtt 60
cgcgctgctg ctgttcttcg taggcggagt cgccgagtcg atccgagaga ctgaggtcat 120
cgacccccag gacctcctag aaggccgata cttctccgga gccctaccag acgatgagga 180
tgtagtgggg cccgggcagg aatctgatga ctttgagctg tctggctctg gagatctgga 240
tgacttggaa gactccatga tcggccctga agttgtccat cccttggtgc ctctagataa 300
ccatatccct gagagggcag ggtctggggag ccaagtcccc accgaaccca agaaactaga 360
ggagaatgag gttatcccca agagaatctc acccgttgaa gagagtgagg atgtgtccaa 420
caaggtgtca atgtccagca ctgtgcaggg cagcaacatc tttgagagaa cggaggtcct 480
ggcag 485
<![CDATA[ <210> 941]]>
<![CDATA[ <211> 148]]>
<![CDATA[ <212> PRT]]>
<![CDATA[ <213> Human]]>
<![CDATA[ <400> 941]]>
Met Ala Pro Ala Arg Leu Phe Ala Leu Leu Leu Phe Phe Val Gly Gly
1 5 10 15
Val Ala Glu Ser Ile Arg Glu Thr Glu Val Ile Asp Pro Gln Asp Leu
20 25 30
Leu Glu Gly Arg Tyr Phe Ser Gly Ala Leu Pro Asp Asp Glu Asp Val
35 40 45
Val Gly Pro Gly Gln Glu Ser Asp Asp Phe Glu Leu Ser Gly Ser Gly
50 55 60
Asp Leu Asp Asp Leu Glu Asp Ser Met Ile Gly Pro Glu Val Val His
65 70 75 80
Pro Leu Val Pro Leu Asp Asn His Ile Pro Glu Arg Ala Gly Ser Gly
85 90 95
Ser Gln Val Pro Thr Glu Pro Lys Lys Leu Glu Glu Asn Glu Val Ile
100 105 110
Pro Lys Arg Ile Ser Pro Val Glu Glu Ser Glu Asp Val Ser Asn Lys
115 120 125
Val Ser Met Ser Ser Ser Thr Val Gln Gly Ser Asn Ile Phe Glu Arg Thr
130 135 140
Glu Val Leu Ala
145
<![CDATA[ <210> 942]]>
<![CDATA[ <211> 33]]>
<![CDATA[ <212>DNA]]>
<![CDATA[ <213> Artificial Sequence]]>
<![CDATA[ <220>]]>
<![CDATA[ <223> T7 promoter]]>
<![CDATA[ <400> 942]]>
cagagatgca taatacgact cactataggg aga 33
(無)(none)
Claims (61)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
NL2028384 | 2021-06-03 | ||
NL2028384 | 2021-06-03 | ||
NL2030006 | 2021-12-03 | ||
NL2030006 | 2021-12-03 |
Publications (1)
Publication Number | Publication Date |
---|---|
TW202313972A true TW202313972A (en) | 2023-04-01 |
Family
ID=82019205
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW111120470A TW202313972A (en) | 2021-06-03 | 2022-06-01 | New nrg1 fusions, fusion junctions and methods for detecting them |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4347893A2 (en) |
TW (1) | TW202313972A (en) |
WO (1) | WO2022255871A2 (en) |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5654442A (en) | 1989-11-14 | 1997-08-05 | The Perkin-Elmer Corporation | 4,7-dichlorofluorescein dyes as molecular probes |
US5945526A (en) | 1996-05-03 | 1999-08-31 | Perkin-Elmer Corporation | Energy transfer dyes with enhanced fluorescence |
US6372907B1 (en) | 1999-11-03 | 2002-04-16 | Apptera Corporation | Water-soluble rhodamine dye peptide conjugates |
DK2059533T3 (en) | 2006-08-30 | 2013-02-25 | Genentech Inc | MULTI-SPECIFIC ANTIBODIES |
AU2010226453B2 (en) | 2009-03-20 | 2013-11-21 | Genentech, Inc. | Bispecific anti-HER antibodies |
US11426409B2 (en) * | 2017-09-08 | 2022-08-30 | The Regents Of The University Of Colorado | Compounds, compositions and methods for treating or preventing HER-driven drug-resistant cancers |
GB201913079D0 (en) * | 2019-09-11 | 2019-10-23 | Hummingbird Bioscience Holdings Pte Ltd | Treatment and prevention of cancer using her3 antigen-binding molecules |
-
2022
- 2022-06-01 WO PCT/NL2022/050302 patent/WO2022255871A2/en active Application Filing
- 2022-06-01 TW TW111120470A patent/TW202313972A/en unknown
- 2022-06-01 EP EP22729828.8A patent/EP4347893A2/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4347893A2 (en) | 2024-04-10 |
WO2022255871A2 (en) | 2022-12-08 |
WO2022255871A3 (en) | 2023-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2017267184B2 (en) | Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy | |
KR102023584B1 (en) | PREDICTING GASTROENTEROPANCREATIC NEUROENDOCRINE NEOPLASMS (GEP-NENs) | |
AU2013358958B2 (en) | Polynucleotides encoding rodent antibodies with human idiotypes and animals comprising same | |
US20190300967A1 (en) | Compositions and methods for predicting response and resistance to ctla4 blockade in melanoma using a gene expression signature | |
KR101421326B1 (en) | Composition for predicting prognosis of breast cancer and kit comprising the same | |
CN107223159A (en) | The detection of DNA from particular cell types and correlation technique | |
KR102110469B1 (en) | Phosphodiesterase 4d7 as marker for malignant, hormone-sensitive prostate cancer | |
KR20180093902A (en) | Detection of fetal chromosomal anomalies using differentially methylated diene regions between fetuses and pregnant women | |
US20090305284A1 (en) | Methods for Identifying Risk of Breast Cancer and Treatments Thereof | |
CN101258249A (en) | Methods and reagents for the detection of melanoma | |
KR20060045950A (en) | Prognostic for hematological malignancy | |
KR20230034198A (en) | Methods for activating and expanding tumor-infiltrating lymphocytes | |
KR20040065524A (en) | Method for assessing and treating leukemia | |
KR102046839B1 (en) | Method for in vitro diagnosis or prognosis of colon cancer | |
AU2018210420A1 (en) | Human antibodies from transgenic rodents with multiple heavy chain immunoglobulin loci | |
US20020137077A1 (en) | Genes regulated in activated T cells | |
CN110541031A (en) | Method for in vitro diagnosis or prognosis of ovarian cancer | |
KR20170116009A (en) | Novel rna-biomarker signature for diagnosis of prostate cancer | |
JP2002017376A (en) | Secretory protein or membrane protein | |
KR20190126812A (en) | Biomarkers for Disease Diagnosis | |
WO2006022636A1 (en) | Methods for identifying risk of type ii diabetes and treatments thereof | |
US20220265798A1 (en) | Cancer vaccine compositions and methods for using same to prevent and/or treat cancer | |
CN113249470A (en) | Method for in vitro diagnosis or prognosis of lung cancer | |
CN100516876C (en) | Methods for diagnosing RCC and other solid tumors | |
TW202313972A (en) | New nrg1 fusions, fusion junctions and methods for detecting them |