CN113727993A - Means and methods for preparing engineered target proteins in a target-protein-selective manner by genetic code expansion - Google Patents
- Publication number
- CN113727993A (application number CN202080028507.1A)
- Authority
- CN
- China
- Prior art keywords
- rna
- poi
- ncaa
- amino acid
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 77
- 108020004705 Codon Proteins 0.000 title claims description 152
- 108090000623 proteins and genes Proteins 0.000 title abstract description 406
- 102000004169 proteins and genes Human genes 0.000 title abstract description 394
- 230000002068 genetic effect Effects 0.000 title description 7
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 186
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 179
- 229920001184 polypeptide Polymers 0.000 claims abstract description 177
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 139
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 130
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 96
- 239000002773 nucleotide Substances 0.000 claims abstract description 82
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 82
- 230000014509 gene expression Effects 0.000 claims abstract description 77
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 74
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 74
- 230000008685 targeting Effects 0.000 claims abstract description 66
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims abstract description 35
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims abstract description 35
- 239000013604 expression vector Substances 0.000 claims abstract description 33
- 210000004027 cell Anatomy 0.000 claims description 192
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 172
- 150000001413 amino acids Chemical class 0.000 claims description 100
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 53
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 47
- 210000000805 cytoplasm Anatomy 0.000 claims description 25
- 230000003834 intracellular effect Effects 0.000 claims description 22
- 230000000295 complement effect Effects 0.000 claims description 21
- 150000003839 salts Chemical class 0.000 claims description 17
- 238000004519 manufacturing process Methods 0.000 claims description 13
- 239000012636 effector Substances 0.000 claims description 7
- 101710123134 Ice-binding protein Proteins 0.000 claims 7
- 238000013519 translation Methods 0.000 abstract description 65
- 125000000539 amino acid group Chemical group 0.000 abstract description 24
- 108091026890 Coding region Proteins 0.000 abstract description 13
- 238000002360 preparation method Methods 0.000 abstract description 11
- 230000009471 action Effects 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 385
- 108020004414 DNA Proteins 0.000 description 357
- 101710125418 Major capsid protein Proteins 0.000 description 135
- 239000005090 green fluorescent protein Substances 0.000 description 124
- 235000001014 amino acid Nutrition 0.000 description 107
- 229940024606 amino acid Drugs 0.000 description 98
- 239000012634 fragment Substances 0.000 description 95
- 108090000740 RNA-binding protein EWS Proteins 0.000 description 84
- 102000004229 RNA-binding protein EWS Human genes 0.000 description 84
- 108020004999 messenger RNA Proteins 0.000 description 77
- 108020005038 Terminator Codon Proteins 0.000 description 66
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Substances CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 61
- 101000798951 Homo sapiens Mitochondrial import receptor subunit TOM20 homolog Proteins 0.000 description 59
- 102100034007 Mitochondrial import receptor subunit TOM20 homolog Human genes 0.000 description 58
- 239000012528 membrane Substances 0.000 description 45
- 101001047681 Homo sapiens Tyrosine-protein kinase Lck Proteins 0.000 description 40
- 102100024036 Tyrosine-protein kinase Lck Human genes 0.000 description 40
- 108020004566 Transfer RNA Proteins 0.000 description 37
- 241000205274 Methanosarcina mazei Species 0.000 description 36
- 238000002474 experimental method Methods 0.000 description 36
- 230000004927 fusion Effects 0.000 description 36
- 102100034894 Kinesin-like protein KIF16B Human genes 0.000 description 31
- -1 PylRS Proteins 0.000 description 30
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 29
- 108010053665 kinesin family member 16B Proteins 0.000 description 29
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 28
- 210000003463 organelle Anatomy 0.000 description 28
- 102000003960 Ligases Human genes 0.000 description 27
- 108090000364 Ligases Proteins 0.000 description 27
- 108700008625 Reporter Genes Proteins 0.000 description 26
- 230000001086 cytosolic effect Effects 0.000 description 26
- 102100039560 Microtubule-associated protein RP/EB family member 1 Human genes 0.000 description 25
- 101710099411 Microtubule-associated protein RP/EB family member 1 Proteins 0.000 description 25
- 101710107943 Trans-activator protein BZLF1 Proteins 0.000 description 24
- 101000598403 Homo sapiens Nucleoporin NUP42 Proteins 0.000 description 22
- 102100037821 Nucleoporin NUP42 Human genes 0.000 description 22
- 230000006870 function Effects 0.000 description 22
- 108020005098 Anticodon Proteins 0.000 description 21
- 101001090172 Homo sapiens Kinectin Proteins 0.000 description 21
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 20
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 20
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 20
- 239000013598 vector Substances 0.000 description 19
- 230000001629 suppression Effects 0.000 description 17
- 101100150006 Caenorhabditis elegans spd-5 gene Proteins 0.000 description 16
- 102000029749 Microtubule Human genes 0.000 description 16
- 108091022875 Microtubule Proteins 0.000 description 16
- 230000003993 interaction Effects 0.000 description 16
- 210000004688 microtubule Anatomy 0.000 description 16
- 101001062222 Homo sapiens Receptor-binding cancer antigen expressed on SiSo cells Proteins 0.000 description 15
- 102100029165 Receptor-binding cancer antigen expressed on SiSo cells Human genes 0.000 description 15
- 210000000170 cell membrane Anatomy 0.000 description 14
- 239000013612 plasmid Substances 0.000 description 14
- 210000003527 eukaryotic cell Anatomy 0.000 description 13
- 241000282414 Homo sapiens Species 0.000 description 12
- 241000205276 Methanosarcina Species 0.000 description 12
- 238000010367 cloning Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- 238000012360 testing method Methods 0.000 description 12
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 11
- 102000010638 Kinesin Human genes 0.000 description 11
- 108010063296 Kinesin Proteins 0.000 description 11
- 238000005191 phase separation Methods 0.000 description 11
- 101001091266 Homo sapiens Kinesin-like protein KIF13A Proteins 0.000 description 10
- 230000027455 binding Effects 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 10
- 239000012071 phase Substances 0.000 description 10
- 239000004475 Arginine Substances 0.000 description 9
- 108090000565 Capsid Proteins Proteins 0.000 description 9
- 102100034865 Kinesin-like protein KIF13A Human genes 0.000 description 9
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 9
- 101150077352 NUP153 gene Proteins 0.000 description 9
- 239000002253 acid Substances 0.000 description 9
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 9
- 235000009697 arginine Nutrition 0.000 description 9
- 238000000684 flow cytometry Methods 0.000 description 9
- 238000010348 incorporation Methods 0.000 description 9
- 210000000633 nuclear envelope Anatomy 0.000 description 9
- 108091033319 polynucleotide Proteins 0.000 description 9
- 102000040430 polynucleotide Human genes 0.000 description 9
- 239000002157 polynucleotide Substances 0.000 description 9
- 210000003705 ribosome Anatomy 0.000 description 9
- 238000001890 transfection Methods 0.000 description 9
- 101710132601 Capsid protein Proteins 0.000 description 8
- 101710094648 Coat protein Proteins 0.000 description 8
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 8
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 8
- 101710141454 Nucleoprotein Proteins 0.000 description 8
- 101710083689 Probable capsid protein Proteins 0.000 description 8
- 230000004570 RNA-binding Effects 0.000 description 8
- 230000009977 dual effect Effects 0.000 description 8
- 238000010166 immunofluorescence Methods 0.000 description 8
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 8
- 229960000310 isoleucine Drugs 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- 101710146427 Probable tyrosine-tRNA ligase, cytoplasmic Proteins 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 7
- 102000018378 Tyrosine-tRNA ligase Human genes 0.000 description 7
- 101710107268 Tyrosine-tRNA ligase, mitochondrial Proteins 0.000 description 7
- 230000006229 amino acid addition Effects 0.000 description 7
- 230000033228 biological regulation Effects 0.000 description 7
- 238000003384 imaging method Methods 0.000 description 7
- JEIPFZHSYJVQDO-UHFFFAOYSA-N iron(III) oxide Inorganic materials O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 7
- YOBAEOGBNPPUQV-UHFFFAOYSA-N iron;trihydrate Chemical compound O.O.O.[Fe].[Fe] YOBAEOGBNPPUQV-UHFFFAOYSA-N 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 238000010186 staining Methods 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- BABTYIKKTLTNRX-QMMMGPOBSA-N (2s)-2-amino-3-(3-iodophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC(I)=C1 BABTYIKKTLTNRX-QMMMGPOBSA-N 0.000 description 6
- 241000701959 Escherichia virus Lambda Species 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 101000658112 Homo sapiens Synaptotagmin-like protein 3 Proteins 0.000 description 6
- 206010039491 Sarcoma Diseases 0.000 description 6
- 102100035001 Synaptotagmin-like protein 3 Human genes 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 239000000427 antigen Substances 0.000 description 6
- 108091007433 antigens Proteins 0.000 description 6
- 102000036639 antigens Human genes 0.000 description 6
- 230000000903 blocking effect Effects 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 238000004587 chromatography analysis Methods 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 108010021843 fluorescent protein 583 Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 238000013508 migration Methods 0.000 description 6
- 230000005012 migration Effects 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 5
- 239000012114 Alexa Fluor 647 Substances 0.000 description 5
- OBMZMSLWNNWEJA-XNCRXQDQSA-N C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 Chemical compound C1=CC=2C(C[C@@H]3NC(=O)[C@@H](NC(=O)[C@H](NC(=O)N(CC#CCN(CCCC[C@H](NC(=O)[C@@H](CC4=CC=CC=C4)NC3=O)C(=O)N)CC=C)NC(=O)[C@@H](N)C)CC3=CNC4=C3C=CC=C4)C)=CNC=2C=C1 OBMZMSLWNNWEJA-XNCRXQDQSA-N 0.000 description 5
- 241000588914 Enterobacter Species 0.000 description 5
- 241000709744 Enterobacterio phage MS2 Species 0.000 description 5
- 108010052285 Membrane Proteins Proteins 0.000 description 5
- 206010028980 Neoplasm Diseases 0.000 description 5
- 241000283973 Oryctolagus cuniculus Species 0.000 description 5
- 101710176384 Peptide 1 Proteins 0.000 description 5
- 229920002873 Polyethylenimine Polymers 0.000 description 5
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 5
- 108091036066 Three prime untranslated region Proteins 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 239000007819 coupling partner Substances 0.000 description 5
- 108091006047 fluorescent proteins Proteins 0.000 description 5
- 102000034287 fluorescent proteins Human genes 0.000 description 5
- 210000001700 mitochondrial membrane Anatomy 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 239000011022 opal Substances 0.000 description 5
- 230000009268 pathologic speech processing Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 208000032207 progressive 1 supranuclear palsy Diseases 0.000 description 5
- 238000001742 protein purification Methods 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 102000035160 transmembrane proteins Human genes 0.000 description 5
- 108091005703 transmembrane proteins Proteins 0.000 description 5
- PRDFBSVERLRRMY-UHFFFAOYSA-N 2'-(4-ethoxyphenyl)-5-(4-methylpiperazin-1-yl)-2,5'-bibenzimidazole Chemical compound C1=CC(OCC)=CC=C1C1=NC2=CC=C(C=3NC4=CC(=CC=C4N=3)N3CCN(C)CC3)C=C2N1 PRDFBSVERLRRMY-UHFFFAOYSA-N 0.000 description 4
- 108020005345 3' Untranslated Regions Proteins 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000206602 Eukaryota Species 0.000 description 4
- 101710121996 Hexon protein p72 Proteins 0.000 description 4
- 101001091229 Homo sapiens Kinesin-like protein KIF16B Proteins 0.000 description 4
- 101000914514 Homo sapiens T-cell-specific surface glycoprotein CD28 Proteins 0.000 description 4
- 102100036721 Insulin receptor Human genes 0.000 description 4
- 108010071170 Leucine-tRNA ligase Proteins 0.000 description 4
- 102100023342 Leucine-tRNA ligase, mitochondrial Human genes 0.000 description 4
- 102100024573 Macrophage-capping protein Human genes 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- 241000709749 Pseudomonas phage PP7 Species 0.000 description 4
- 102100027213 T-cell-specific surface glycoprotein CD28 Human genes 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 238000000429 assembly Methods 0.000 description 4
- 230000000712 assembly Effects 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000012650 click reaction Methods 0.000 description 4
- 230000007547 defect Effects 0.000 description 4
- 229960005156 digoxin Drugs 0.000 description 4
- 102000013035 dynein heavy chain Human genes 0.000 description 4
- 108060002430 dynein heavy chain Proteins 0.000 description 4
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 230000002438 mitochondrial effect Effects 0.000 description 4
- 238000003032 molecular docking Methods 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 229920001223 polyethylene glycol Polymers 0.000 description 4
- 238000010869 super-resolution microscopy Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 239000012110 Alexa Fluor 594 Substances 0.000 description 3
- 239000000592 Artificial Cell Substances 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 102100033787 CMP-sialic acid transporter Human genes 0.000 description 3
- 108091006146 Channels Proteins 0.000 description 3
- 241000205145 Desulfobacterium Species 0.000 description 3
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 3
- 101001040734 Homo sapiens Golgi phosphoprotein 3 Proteins 0.000 description 3
- 101000970403 Homo sapiens Nuclear pore complex protein Nup153 Proteins 0.000 description 3
- 241000203353 Methanococcus Species 0.000 description 3
- 241000205290 Methanosarcina thermophila Species 0.000 description 3
- 108091060545 Nonsense suppressor Proteins 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 102100035071 Vimentin Human genes 0.000 description 3
- 108010065472 Vimentin Proteins 0.000 description 3
- 108020000999 Viral RNA Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000012411 cloning technique Methods 0.000 description 3
- 230000008045 co-localization Effects 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 239000012091 fetal bovine serum Substances 0.000 description 3
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000007654 immersion Methods 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 150000002668 lysine derivatives Chemical class 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 210000004492 nuclear pore Anatomy 0.000 description 3
- 229910052698 phosphorus Inorganic materials 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 210000005048 vimentin Anatomy 0.000 description 3
- 101710145083 ATP-dependent RNA helicase laf-1 Proteins 0.000 description 2
- BPYKTIZUTYGOLE-IFADSCNNSA-N Bilirubin Chemical compound N1C(=O)C(C)=C(C=C)\C1=C\C1=C(C)C(CCC(O)=O)=C(CC2=C(C(C)=C(\C=C/3C(=C(C=C)C(=O)N\3)C)N2)CCC(O)=O)N1 BPYKTIZUTYGOLE-IFADSCNNSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 101710150575 CMP-sialic acid transporter Proteins 0.000 description 2
- 101100386910 Caenorhabditis elegans laf-1 gene Proteins 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- LTMHDMANZUZIPE-AMTYYWEZSA-N Digoxin Natural products O([C@H]1[C@H](C)O[C@H](O[C@@H]2C[C@@H]3[C@@](C)([C@@H]4[C@H]([C@]5(O)[C@](C)([C@H](O)C4)[C@H](C4=CC(=O)OC4)CC5)CC3)CC2)C[C@@H]1O)[C@H]1O[C@H](C)[C@@H](O[C@H]2O[C@@H](C)[C@H](O)[C@@H](O)C2)[C@@H](O)C1 LTMHDMANZUZIPE-AMTYYWEZSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 101710103773 Histone H2B Proteins 0.000 description 2
- 102100021639 Histone H2B type 1-K Human genes 0.000 description 2
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 description 2
- 101000623857 Homo sapiens Serine/threonine-protein kinase mTOR Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- 108090001090 Lectins Proteins 0.000 description 2
- 102000004856 Lectins Human genes 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 241000203407 Methanocaldococcus jannaschii Species 0.000 description 2
- 241000205284 Methanosarcina acetivorans Species 0.000 description 2
- 241000205275 Methanosarcina barkeri Species 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 102100021706 Nuclear pore complex protein Nup153 Human genes 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 230000014632 RNA localization Effects 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 241000711975 Vesicular stomatitis virus Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- DZBUGLKDJFMEHC-UHFFFAOYSA-N acridine Chemical compound C1=CC=CC2=CC3=CC=CC=C3N=C21 DZBUGLKDJFMEHC-UHFFFAOYSA-N 0.000 description 2
- 101150063416 add gene Proteins 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 229940061720 alpha hydroxy acid Drugs 0.000 description 2
- 150000001280 alpha hydroxy acids Chemical class 0.000 description 2
- 235000008206 alpha-amino acids Nutrition 0.000 description 2
- 125000000266 alpha-aminoacyl group Chemical group 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000001588 bifunctional effect Effects 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000006143 cell culture medium Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 210000003850 cellular structure Anatomy 0.000 description 2
- 230000003196 chaotropic effect Effects 0.000 description 2
- 238000012733 comparative method Methods 0.000 description 2
- 238000010226 confocal imaging Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000006352 cycloaddition reaction Methods 0.000 description 2
- CKKWLCWHIOOUMQ-ZSCHJXSPSA-N cyclooctyne;(2s)-2,6-diaminohexanoic acid Chemical compound C1CCCC#CCC1.NCCCC[C@H](N)C(O)=O CKKWLCWHIOOUMQ-ZSCHJXSPSA-N 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 230000003436 cytoskeletal effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- LTMHDMANZUZIPE-PUGKRICDSA-N digoxin Chemical compound C1[C@H](O)[C@H](O)[C@@H](C)O[C@H]1O[C@@H]1[C@@H](C)O[C@@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3C[C@@H]4[C@]([C@@H]5[C@H]([C@]6(CC[C@@H]([C@@]6(C)[C@H](O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)C[C@@H]2O)C)C[C@@H]1O LTMHDMANZUZIPE-PUGKRICDSA-N 0.000 description 2
- LTMHDMANZUZIPE-UHFFFAOYSA-N digoxine Natural products C1C(O)C(O)C(C)OC1OC1C(C)OC(OC2C(OC(OC3CC4C(C5C(C6(CCC(C6(C)C(O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)CC2O)C)CC1O LTMHDMANZUZIPE-UHFFFAOYSA-N 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 229910052731 fluorine Inorganic materials 0.000 description 2
- 239000012737 fresh medium Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 102000053563 human MYC Human genes 0.000 description 2
- 150000004677 hydrates Chemical class 0.000 description 2
- 238000003125 immunofluorescent labeling Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 206010022000 influenza Diseases 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000002523 lectin Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 230000008880 microtubule cytoskeleton organization Effects 0.000 description 2
- 150000007522 mineralic acids Chemical class 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 150000007524 organic acids Chemical class 0.000 description 2
- 150000007530 organic bases Chemical class 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 238000005204 segregation Methods 0.000 description 2
- 238000001338 self-assembly Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 241000712461 unidentified influenza virus Species 0.000 description 2
- VZQHRKZCAZCACO-PYJNHQTQSA-N (2s)-2-[[(2s)-2-[2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]propanoyl]amino]prop-2-enoylamino]-3-methylbutanoyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)C(=C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VZQHRKZCAZCACO-PYJNHQTQSA-N 0.000 description 1
- QGKMIGUHVLGJBR-UHFFFAOYSA-M (4z)-1-(3-methylbutyl)-4-[[1-(3-methylbutyl)quinolin-1-ium-4-yl]methylidene]quinoline;iodide Chemical compound [I-].C12=CC=CC=C2N(CCC(C)C)C=CC1=CC1=CC=[N+](CCC(C)C)C2=CC=CC=C12 QGKMIGUHVLGJBR-UHFFFAOYSA-M 0.000 description 1
- XWNJMSJGJFSGRY-UHFFFAOYSA-N 2-(benzylamino)-3,7-dihydropurin-6-one Chemical compound N1C=2N=CNC=2C(=O)N=C1NCC1=CC=CC=C1 XWNJMSJGJFSGRY-UHFFFAOYSA-N 0.000 description 1
- UXUMHNNOICTMDR-UHFFFAOYSA-N 2-amino-6-(2-cyclooct-2-yn-1-yloxyethoxycarbonylamino)hexanoic acid Chemical compound OC(=O)C(N)CCCCNC(=O)OCCOC1CCCCCC#C1 UXUMHNNOICTMDR-UHFFFAOYSA-N 0.000 description 1
- SCZIJGBDOGSCKY-UHFFFAOYSA-N 2-amino-6-(cyclooct-2-yn-1-yloxycarbonylamino)hexanoic acid Chemical compound OC(=O)C(N)CCCCNC(=O)OC1CCCCCC#C1 SCZIJGBDOGSCKY-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/05—Fusion polypeptide containing a localisation/targetting motif containing a GOLGI retention signal
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/735—Fusion polypeptide containing domain for protein-protein interaction containing a domain for self-assembly, e.g. a viral coat protein (includes phage display)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
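The invention relates to orthogonal translation systems that allow non-canonical amino acid residues to be introduced site-specifically into a target protein (polypeptide of interest, POI) in a POI-mRNA-selective manner. Specifically, the invention relates to assembler fusion proteins that bring an RNA-targeting polypeptide (RNA-TP) segment and an orthogonal aminoacyl-tRNA synthetase (O-RS) segment into spatial proximity, either through direct linkage in an RNA-TP/O-RS fusion protein or through the action of "assemblers" that are fused to each of these segments in assembler fusion proteins (AFPs). The invention also relates to AFP combinations and to nucleic acid molecules comprising a POI-encoding sequence and a targeting nucleotide sequence capable of interacting with the RNA-TP. The invention further relates to nucleic acid molecules, expression cassettes and expression vectors encoding said RNA-TP/O-RS fusion proteins or AFPs, to cells comprising them, and to methods and kits for the translational preparation of a POI.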
Description
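Technical Field
The invention relates to orthogonal translation systems that allow non-canonical amino acid (ncAA) residues to be introduced site-specifically into a polypeptide of interest (POI) in a POI-mRNA-selective manner. Specifically, the invention relates to fusion proteins that bring an RNA-targeting polypeptide (RNA-TP) segment and an orthogonal aminoacyl-tRNA synthetase (O-RS) segment into spatial proximity. This is achieved either by combining the RNA-TP segment and the O-RS segment in one and the same fusion protein (RNA-TP/O-RS fusion protein), or through the action of one or more polypeptide segments that act as "assemblers" (APs) and promote local enrichment of assembler fusion proteins (AFPs) comprising one or more APs together with an RNA-TP segment or an O-RS segment, thereby bringing said RNA-TP and O-RS segments close to each other. The invention also relates to AFP combinations and to nucleic acid molecules comprising a POI-encoding sequence and a targeting nucleotide sequence (TN) capable of interacting with the RNA-TP. The invention further relates to nucleic acid molecules, expression cassettes and expression vectors encoding said RNA-TP/O-RS fusion proteins or AFPs, to cells comprising them, and to methods and kits for the translational preparation of a POI.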
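Background Art
The ability to site-specifically engineer orthogonal (i.e., non-cross-reacting) translation systems into living cells makes it possible to introduce new functions into proteins. This is a formidable task, however, because translation is a complex multi-step process in which at least 20 different aminoacylated tRNAs, their cognate aminoacyl-tRNA synthetases (RSs), ribosomes and various other factors act in concert to synthesize polypeptide chains from RNA transcripts. An ideal orthogonal system does not cross-react with factors of the host system, thereby minimizing its impact on the cell's regular translational activity and normal physiology.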
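Genetic code expansion (GCE) is one way to achieve this goal, as it allows specific codons to be reprogrammed. With GCE, an orthogonal (suppressor) RS (O-RS) can aminoacylate its cognate suppressor tRNA with a non-canonical amino acid (ncAA). These ncAAs are usually custom-designed and carry chemical functionalities that, for example, permit photocontrol of protein function, encode post-translational modifications, or allow fluorescent labels to be introduced via click chemistry for microscopy studies. To introduce the ncAA site-specifically into a polypeptide of interest (POI), the anticodon loop of the tRNA is chosen so that it decodes, and thereby suppresses, one of the stop codons (see, e.g., Liu et al., Annu Rev Biochem 2010, 79:413-444; Lemke, ChemBioChem 2014, 15:1691-1694; Chin, Nature 2017, 550:53-60). To minimize the impact on the host cell machinery, the amber stop codon (corresponding tRNA: tRNACUA) is typically used, because it is used particularly rarely in Escherichia coli (E. coli) to terminate endogenous proteins (<10%). In principle, however, any amber codon in the genome can be suppressed, which may lead to unintended background suppression of non-target host proteins. If the ncAA-modified protein is produced recombinantly for in vitro applications, such background incorporation may be acceptable as long as the yield of purified full-length protein is satisfactory. The challenge is different, however, if the host is more than a bioreactor vessel that can be sacrificed for its protein. When the function of a POI is to be studied in situ in the host cell, the physiological condition of the host cell is an important factor. In that case, minimizing background incorporation of the ncAA is particularly necessary to ensure a well-controlled experiment.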
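As a toy illustration of amber suppression as described above, the following Python sketch translates an mRNA with the standard genetic code and, optionally, reads UAG as an ncAA instead of a stop. The function name `translate`, the `suppress_amber` flag and the placeholder symbol `X` are assumptions made for illustration only; the sketch ignores suppression efficiency and competition with release factors.

```python
# Standard genetic code, built in UCAG order (first, second, third base).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

AMBER = "UAG"


def translate(mrna, suppress_amber=False, ncaa_symbol="X"):
    """Translate an mRNA from its first base, codon by codon.

    If suppress_amber is True, UAG is read as a non-canonical amino acid
    (written as ncaa_symbol, a hypothetical placeholder) instead of
    terminating translation. UAA and UGA always terminate.
    """
    protein = []
    for pos in range(0, len(mrna) - 2, 3):
        codon = mrna[pos:pos + 3]
        if codon == AMBER and suppress_amber:
            protein.append(ncaa_symbol)
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon reached
            break
        protein.append(residue)
    return "".join(protein)


# Hypothetical reporter: Met-Ala-(amber)-Lys-(ochre stop)
reporter = "AUGGCUUAGAAAUAA"
truncated = translate(reporter)                         # amber terminates
full_length = translate(reporter, suppress_amber=True)  # amber decoded as ncAA
```

In this idealized model every amber codon in every transcript is decoded once suppression is switched on, which is exactly the background-suppression problem the paragraph describes.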
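At least three elegant approaches have been developed to achieve orthogonal translation in E. coli, i.e., to decode a specific codon only in the RNA of the POI rather than genome-wide. i) Orthogonal ribosomes that recognize a unique Shine-Dalgarno sequence have been developed to decode quadruplet codons, which are then used instead of a stop codon to encode the ncAA site-specifically in the POI (see, e.g., Neumann et al., Nature 2010, 464:441-444; Orelle et al., Nature 2015, 524:119-124; Fried et al., Angew Chem 2015, 54:12791-12794). ii) More recently, genome engineering has advanced to the point where selected native codons can be removed from E. coli strains, providing a clean (e.g., amber-codon-free) host genetic background for selective decoding of a specific codon in the POI only (see, e.g., Isaacs et al., Science 2011, 333:348-353; Lajoie et al., Science 2013, 342:357-360; Ostrov et al., Science 2016, 353:819-822; Wang et al., Nature 2016, 539:59-64). iii) Artificial base pairs are used to design unique non-canonical codons that are encoded only in the POI coding sequence. This reduces the risk of non-specific decoding elsewhere in the genome (see Zhang et al., Nature 2017, 551:644-647). However, owing to the complexity of their genomes, transferring these orthogonal translation approaches to eukaryotes is not straightforward (see, e.g., Thompson et al., ACS Chem Biol 2018, 13:313-325). In addition, amber codons are highly abundant in eukaryotes (20% in mammalian cells).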
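The stop-codon abundance figures quoted above can be thought of as the share of coding sequences that end in each stop codon. The following Python sketch shows one way such shares could be computed; the function name `stop_codon_usage` and the four toy sequences are hypothetical and are not data from this document.

```python
from collections import Counter

# Conventional names of the three stop codons (RNA alphabet).
STOP_CODONS = {"UAG": "amber", "UAA": "ochre", "UGA": "opal"}


def stop_codon_usage(coding_sequences):
    """Return the fraction of coding sequences terminated by each stop codon.

    Sequences may be given as DNA or RNA; sequences whose last three
    bases are not a stop codon are ignored.
    """
    counts = Counter()
    for cds in coding_sequences:
        last_codon = cds.upper().replace("T", "U")[-3:]
        if last_codon in STOP_CODONS:
            counts[STOP_CODONS[last_codon]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {name: counts[name] / total for name in STOP_CODONS.values()}


# Toy input only: four made-up coding sequences.
toy_cds = ["ATGAAATAG", "ATGCCCTAA", "ATGGGGTGA", "ATGTTTTAA"]
usage = stop_codon_usage(toy_cds)
```

Applied to a real set of annotated coding sequences, the `amber` entry would give the kind of percentage cited in the text for E. coli and for mammalian cells.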
因此,亟需通用的POI选择性正交翻译策略,它不仅仅适用于相对容易处理和操作的良好表征的原核生物(如大肠杆菌),而且也适用于真核细胞。因此,本发明目的是解决这种挑战。
发明内容
本发明人发现能够选择性翻译POI的mRNA的正交翻译系统(OT系统)可以通过使POI的mRNA和O-RS在空间上接近来产生,其允许将ncAA残基翻译引入不断增长的POI的多肽链中。本发明人证明包括膜蛋白在内的多种POI,它们的OT系统允许位点特异性地将ncAA残基引入哺乳动物细胞中的POI中,与细胞质中含有相同终止密码子(用作编码POI的ncAA残基的选择密码子)的其他mRNA相比,对POI的mRNA具有选择性。
在本发明的正交翻译系统中,空间接近是通过在POI的mRNA中包含靶向序列(TN)来实现的,其可以选择性地与靶向RNA的多肽(RNA-TP)相互作用,并将O-RS与这样的RNA-TP进行连接。所述连接可以在包含O-RS和RNA-TP的融合蛋白中(RNA-TP/O-RS融合蛋白)。
在另一种方法中,这可以通过一个或多个多肽区段的作用来实现,所述多肽区段充当“组装器(assembler)”(AP)以促进至少两种组装器融合蛋白(AFP)的局部富集,其中至少一种包含一个或多个AP和RNA-TP区段,并且至少另一种AFP包含一个或多个AP和O-RS区段,从而使所述RNA-TP和O-RS区段(RNA-TP和O-RS也称为“效应物”或“EP”)相互靠近。AFP的局部富集允许形成组装器集合体(assemblies)(OT组装器集合体,在本文中也称“OT细胞器”),其可充当人工正交翻译细胞器。
本发明人证明可使用不同类型的AP。第一种类型包括在(先前已有的)细胞内结构(例如,微管或膜如细胞膜或核膜、ER、线粒体或高尔基体细胞器的细胞质侧)处驱动局部富集的AP,称为细胞内靶向多肽(IC-TP)区段。第二种类型的AP通过细胞质中的自缔合(特别是通过相分离)形成局部高浓度的AFP,本文中称为相分离多肽(PSP)区段。所述AP类型也可与具有形成多聚体结构能力的其他多肽元件组合,特别是,由合成SYNZIP多肽对所形成的卷曲螺旋异二聚体。类似地,所述EP类型也可以与具有形成多聚体结构能力的其他多肽元件组合,特别是,由合成SYNZIP多肽对所形成的卷曲螺旋异二聚体。这种多聚体的形成进一步提高AFP的局部富集。
本发明人进一步发现组合不同AP类型的AFP特别有用。
在另一种方法中,提供包含单个多肽的AFP,即融合在一起的两种类型EP区段,即RNA-TP和O-RS区段,一种或两种类型的AP区段,即IC-TP和/或PSP区段,任选地补充有具有形成多聚体结构能力的所述多肽元件(SYNZIP多肽)。这提供的优点是产生本发明OT系统所需的所有元件都包含在单个AFP中。
因此,在第一方面,本发明涉及一种组装器融合蛋白(AFP),其包含:
(a)充当组装器(AP)的至少一个第一多肽区段,其选自:
(a1)源自细胞内靶向多肽的多肽区段(IC-TP区段),其中所述细胞内靶向多肽靶向细胞内结构元件,并因此在所述细胞内结构元件处局部富集,所述细胞内结构元件在细胞质内或与细胞质直接相邻;和
(a2)源自相分离多肽的多肽区段(PSP区段),其中所述相分离多肽具有在细胞的细胞质中进行自缔合的能力以在细胞质中产生高局部浓度的位点,以及
(b)充当效应物(EP)的至少一个第二多肽区段,其选自:
b1)靶向RNA的多肽(RNA-TP)区段,和
b2)正交氨酰tRNA合成酶(O-RS)区段;
其中所述多肽区段在所述AFP中功能性连接。
在第二方面,本发明涉及一种组装器融合蛋白(AFP)组合,其包含至少两种本发明的AFP。优选地,AFP组合包含至少一种包含RNA-TP区段的AFP和至少一种包含O-RS区段的AFP。在所述组合的至少一种AFP中包括第一SYNZIP元件,并在所述组合的至少另一种AFP中包括第二SYNZIP元件,其中所述第一和所述第二SYNZIP通过形成异二聚体结构共同作用,代表了所述第二方面的另一种优点。
在第三方面,本发明涉及一种融合蛋白(RNA-TP/O-RS融合蛋白),其包含:
(i)至少一个靶向RNA的多肽(RNA-TP)区段;和
(ii)至少一个正交氨酰tRNA合成酶(O-RS)区段,
其中所述多肽区段在所述RNA-TP/O-RS融合蛋白中功能性连接。
在进一步的方面,本发明提供一种核酸分子,或者两种或更多种核酸分子的组合,其包含:
(i)核苷酸序列,其编码至少一种本发明的RNA-TP/O-RS融合蛋白,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的方面,本发明提供一种核酸分子,或者两种或更多种核酸分子的组合,其包含:
(i)核苷酸序列,其编码至少一种本发明的AFP,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的方面,本发明提供一种核酸分子,或者两种或更多种核酸分子的组合,其包含:
(i)核苷酸序列,其编码至少一种本发明的AFP组合,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的方面,本发明提供一种表达盒,其包含本发明的核酸分子或者核酸分子的组合的核苷酸序列。
在具体实施方案中,本发明提供一种表达盒,其包含:
(i)核苷酸序列,其编码至少一种本发明的RNA-TP/O-RS融合蛋白,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的具体实施方案中,本发明提供一种表达盒,其包含:
(i)核苷酸序列,其编码至少一种本发明的AFP,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的具体实施方案中,本发明提供一种表达盒,其包含:
(i)核苷酸序列,其编码至少一种本发明的AFP组合,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
在进一步的方面,本发明提供一种表达载体,其包含至少一种本发明的表达盒。
在进一步的方面,本发明提供一种细胞,其包含至少一种本发明的核酸分子或核酸分子的组合。在具体实施方案中,所述细胞包含至少一种本发明的表达盒或至少一种本发明的表达载体。
在进一步的方面,本发明涉及一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基。所述方法包括在所述一种或多种ncAA的存在下,在本发明的细胞中表达所述POI,其中所述细胞包含:
(i)本文所述的至少一种包含RNA-TP区段的AFP和至少一种包含O-RS区段的AFP;
(ii)编码POI的核苷酸序列(CSPOI),其中所述POI的一种或多种ncAA残基由选择密码子编码,
(iii)靶向核苷酸序列(TN),其功能性连接至所述CSPOI,并且能够与所述细胞中AFP中的至少一种的RNA-TP区段相互作用;
(iv)一种或多种正交tRNAncAA(O-tRNAncAA)分子,其携带与所述CSPOI的选择密码子互补的反密码子,并且其中所述O-tRNAncAA分子与所述细胞中AFP的一个或多个O-RS区段一起形成一个或多个正交O-RS/O-tRNAncAA对,其允许将所述一种或多种ncAA残基引入POI的氨基酸序列中;
并且其中所述方法任选地进一步包括回收表达的POI。
(i)中列举的所述至少一种包含RNA-TP区段的AFP和所述至少一种包含O-RS区段的AFP可以是一种且相同类型的AFP,即包含RNA-TP区段和O-RS区段的AFP。或者,(i)中列举的所述至少一种包含RNA-TP区段的AFP和所述至少一种包含O-RS区段的AFP可以是不同的AFP。
在进一步的方面,本发明涉及一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基。所述方法包括在所述一种或多种ncAA的存在下,在本发明的细胞中表达所述POI,其中所述细胞包含:
(i)本发明的RNA-TP/O-RS融合蛋白;
(ii)编码POI的核苷酸序列(CSPOI),其中所述POI的一种或多种ncAA残基由选择密码子编码,
(iii)靶向核苷酸序列(TN),其功能性连接至所述CSPOI,并且能够与所述细胞中RNA-TP/O-RS融合蛋白中的至少一种的RNA-TP区段相互作用;
(iv)一种或多种正交tRNAncAA(O-tRNAncAA)分子,其携带与所述CSPOI的选择密码子互补的反密码子,并且其中所述O-tRNAncAA分子与所述细胞中RNA-TP/O-RS融合蛋白的一个或多个O-RS区段一起形成一个或多个正交O-RS/O-tRNAncAA对,其允许将所述一种或多种ncAA残基引入POI的氨基酸序列中;
并且其中所述方法任选地进一步包括回收表达的POI。
在进一步的方面,本发明涉及一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基。所述方法包括以下步骤:
(a)在细胞中表达本文所述的一种或多种包含至少一个RNA-TP区段的AFP和一种或多种包含至少一个O-RS区段的AFP;
(b)在所述细胞中表达一种或多种正交tRNAncAA(O-tRNAncAA)分子,其中
-所述正交tRNAncAA分子与细胞中的AFP的一个或多个O-RS区段形成一个或多个正交氨酰tRNA合成酶/tRNAncAA(O-RS/O-tRNAncAA)对,
-所述O-RS/O-tRNAncAA对允许将所述一种或多种ncAA残基引入所述POI的氨基酸序列中,
其中步骤(a)和(b)可以同时或以任何顺序依次进行;
(c)随后,在所述一种或多种ncAA的存在下,在所述细胞中表达所述POI,其中
-编码POI的核苷酸序列(CSPOI)包含编码所述一种或多种ncAA残基的一种或多种选择密码子,
-所述选择密码子与所述一种或多种O-tRNAncAA分子的反密码子匹配;
-所述CSPOI与靶向核苷酸序列(TN)功能性连接,从而形成CSPOI/TN融合序列,
-所述CSPOI/TN融合序列能够通过其TN与所述细胞中AFP中的至少一种的RNA-TP区段相互作用;
以及
(d)任选地回收表达的POI。
在进一步的方面,本发明涉及一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基。所述方法包括以下步骤:
(a)在细胞中表达本发明的RNA-TP/O-RS融合蛋白;
(b)在所述细胞中表达一种或多种正交tRNAncAA(O-tRNAncAA)分子,其中
-所述正交tRNAncAA分子与细胞中的RNA-TP/O-RS融合蛋白的一个或多个O-RS区段形成一个或多个正交氨酰tRNA合成酶/tRNAncAA(O-RS/O-tRNAncAA)对,
-所述O-RS/O-tRNAncAA对允许将所述一种或多种ncAA残基引入所述POI的氨基酸序列中,
其中步骤(a)和(b)可以同时或以任何顺序依次进行;
(c)随后,在所述一种或多种ncAA的存在下,在所述细胞中表达所述POI,
其中
-编码POI的核苷酸序列(CSPOI)包含编码所述一种或多种ncAA残基的一种或多种选择密码子,
-所述选择密码子与所述一种或多种O-tRNAncAA分子的反密码子匹配;
-所述CSPOI与靶向核苷酸序列(TN)功能性连接,从而形成CSPOI/TN融合序列,
-所述CSPOI/TN融合序列能够通过其TN与所述细胞中RNA-TP/O-RS融合蛋白中的至少一种的RNA-TP区段相互作用;
以及
(d)任选地回收表达的POI。
在进一步的方面,本发明涉及一种核酸分子,其包含:
(i)编码感兴趣的多肽(POI)的核苷酸序列(CSPOI),所述POI包含一个或多个、相同或不同的非典型氨基酸(ncAA)残基,所述ncAA残基在CSPOI中由选择密码子编码,和
(ii)靶向核苷酸序列(TN),其中包含所述TN的RNA分子能够通过所述TN与靶向RNA的多肽(RNA-TP)相互作用。
在进一步的方面,本发明涉及一种试剂盒,其用于制备具有至少一个非典型氨基酸(ncAA)残基的感兴趣的多肽(POI),所述试剂盒包含:
-至少一种ncAA或其盐,其对应于所述POI的至少一个ncAA残基;以及
-至少一种本发明的表达载体。
所述表达载体包含至少一个表达盒,所述表达盒包含:
(i)核苷酸序列,其编码至少一种本发明的RNA-TP/O-RS融合蛋白、至少一种本发明的AFP或至少一种本发明的AFP组合,或
(ii)与(i)互补的核酸序列,或
(iii)(i)和(ii)。
附图说明
图1显示组件空间分离的示意图,其允许正交翻译以解码独特标记的mRNA中的特定终止密码子。(A)合成酶PylRS的常规表达导致其同源终止密码子抑制子tRNAPyl由定制设计的ncAA氨酰化。这使得每当各自的终止密码子出现在POI的mRNA中时,就会导致位点特异性的ncAA掺入。鉴于许多内源性mRNA以相同的终止密码子终止,在细胞质中使用这种方法可能会导致ncAA错误掺入不需要的蛋白中(左框)。(B)为了避免这种情况发生,本发明允许通过使用靶向RNA的多肽区段(例如,MCP)和组装器(AP),使编码POI的mRNA和正交氨酰-tRNA合成酶(例如,PylRS)相互紧密靠近。这允许所有组件在空间上富集以产生OT组装器集合体(“OT细胞器”),包括编码POI的mRNA、正交氨酰-tRNA合成酶、tRNA和核糖体(右框)。这里,特别是氨酰tRNAPyl可与OT细胞器直接接近,因此可以特别地发生(POI mRNA的)终止密码子抑制。这导致选择性抑制POI mRNA的终止密码子(并因此表达POI mRNA),而不是非靶向至OT组装器集合体的mRNA中的相应终止密码子。虽然在(A)中,GCE发生终止密码子特异性,但在(B)中它应该发生终止密码子特异性和mRNA特异性。
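Figure 2A shows a schematic of the different assembler types. B = bimolecular MCP::PylRS fusion; P1 = fusion to FUS and EWSR1; P2 = SPD5; K1 = truncation of the kinesin KIF13A (KIF13A(1-411, ΔP390)); K2 = truncation of the kinesin KIF16B (KIF16B(1-400)); and combinations thereof (K1::P1, K1::P2, K2::P1, K2::P2).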
图2B显示双色报告基因的示意图。在容许位点含有终止密码子的编码荧光蛋白GFP和mCherry的mRNA由一个质粒表达,每个都有自己的CMV启动子,确保每次实验中mRNA的比例恒定。mCherry报告基因的mRNA用两个MS2 RNA茎环(“ms2”,本文也称为MS2标签)标记,mRNA(mCherry)::ms2。在ncAA和tRNAPyl的存在下,在细胞质PylRS的情况下,GFP39STOP和mCherry185STOP都会生成,导致荧光流式细胞术(FFC)分析(左框)的对角线产生。然而,在相同条件下,OT细胞器中的正交翻译使得能够选择性地抑制mRNA(mCherry)::ms2的终止密码子,从而产生mCherry阳性和GFP阴性群体(在右框中示意性地绘制为垂直群体)。在这两种方案中,未转染的HEK293T细胞用底部的灰色圆圈表示。
图2C显示各种示例性OT系统的选择性和相对效率。在所有实验中,标示的构建体与tRNAPyl(对应于标示的密码子的反密码子)和双报告基因(GFP39STOP、mCherry185STOP::ms2)共表达。GCE在标示的ncAA的存在下进行,并通过FFC分析细胞。深灰色条(标准化为细胞质PylRS)代表所有测试系统的mCherry与GFP(源自FFC,参见图2D、E)的平均荧光强度比率r的倍数变化。浅灰色条表示相对效率,其定义为在每种条件下,mCherry的平均荧光强度除以细胞质PylRS对照(源自FFC,参见图2D、E)。显示的是至少三个独立实验的平均值;误差线代表SEM。方框突出显示表现得最佳的OT细胞器(OTK2::P1)。
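The two quantities plotted in Figure 2C can be written out as a short calculation. The following is a minimal Python sketch, assuming that per-cell fluorescence intensities are available as plain lists of numbers; the function name and argument names are illustrative and do not come from the specification.

```python
from statistics import mean


def selectivity_and_efficiency(mcherry, gfp, ctrl_mcherry, ctrl_gfp):
    """Return (fold change of r, relative efficiency) for one OT system.

    r is the ratio of mean mCherry to mean GFP fluorescence.
    The fold change normalizes r to the cytoplasmic PylRS control.
    The relative efficiency is the mean mCherry intensity divided by
    the mean mCherry intensity of the cytoplasmic PylRS control.
    """
    r_sample = mean(mcherry) / mean(gfp)
    r_control = mean(ctrl_mcherry) / mean(ctrl_gfp)
    fold_change = r_sample / r_control
    relative_efficiency = mean(mcherry) / mean(ctrl_mcherry)
    return fold_change, relative_efficiency
```

A selective system is expected to show a fold change well above 1, while its relative efficiency indicates how much reporter expression is retained compared with the non-targeted synthetase.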
图2D显示双色报告基因的FFC分析结果图,所述双色报告基因是在ncAA SCO(具有环辛炔侧链的赖氨酸衍生物)的存在下,在转染的HEK293T细胞和tRNAPyl中用四个标示的系统表达。在OT组装器集合体中观察到高选择性和高效的正交翻译(黑色箭头表示明亮的、高mCherry阳性的群体)。点图显示的是至少三个独立实验的总和。坐标轴表示荧光强度,单位为任意单位。
图2E显示OT组装器集合体的FFC图,仅分别选择性地翻译招募的mRNA(mCherry185TGA)::ms2和mRNA(mCherry185TAA)::ms2的乳白和赭石密码子。
图3显示构成以下系统的构建体示意图:PylRS、MCP::PylRS、FUS::MCP::PylRS和LcK::FUS::PylRS·LcK::EWS::MCP。
图4为采用图3中所述的4种不同系统进行双报告基因表达的流式细胞术分析。HEK293T细胞用构建体转染,所述构建体编码双报告基因、tRNA、LcK::FUS::PylRS和LcK::EWS::MCP或PylRS、MCP::PylRS、FUS::MCP::PylRS和pcDNA3.1。显示的是至少三个独立实验的总和。坐标轴表示荧光强度,单位为任意单位。
图5显示所有测试系统的mCherry与GFP荧光的平均荧光强度比率的条形图。条形图表示至少3次生物学重复的平均值,误差线表示平均值的标准误差。
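Figure 6 provides an overview of the generation, by different approaches of the invention, of OT organelles targeted to the surfaces of different intracellular structures. The expression of the different constructs and the results of the respective fluorescence flow cytometry (FFC) analyses are shown. Depicted at the top of the figure are a schematic of the dual-color reporter construct GFP39TAG·mCherry185TAG::ms2 (see also Figure 2B), which was used in each of experiments A to G, and schematics of the different cellular compartments targeted. For each of A to G, a control experiment performed in the absence of the effector polypeptide MCP (-MCP) is also shown:
A: OT organelles targeted to microtubules, obtained by expressing the system KIF16B(1-400)::FUS::PylRS·KIF16B(1-400)::EWSR1::MCP or the construct KIF16B(1-400)::FUS::PylRS (control);
B: OT organelles targeted to microtubule plus ends, obtained by expressing the construct EB1::FUS::MCP::PylRS or EB1::FUS::PylRS (control);
C: OT organelles targeted to the plasma membrane, obtained by expressing the system LcK::FUS::PylRS·LcK::EWSR1::MCP or the construct LcK::FUS::PylRS (control);
D: OT organelles targeted to the mitochondrial membrane, obtained by expressing the system TOM20(1-70)::FUS::PylRS·TOM20(1-70)::EWSR1::MCP or the construct TOM20(1-70)::FUS::PylRS (control);
E: OT organelles targeted to the nuclear membrane, obtained by expressing the system CG1::FUS::PylRS·CG1::EWSR1::MCP or the construct CG1::FUS::PylRS (control);
F (left): OT organelles targeted to the Golgi membrane, obtained by expressing the system EBAG9(1-29)::FUS::PylRS·EBAG9(1-29)::EWSR1::MCP or the construct EBAG9(1-29)::FUS::PylRS (control);
F (right): OT organelles targeted to the Golgi membrane, obtained by expressing the system CMP Sia Tr::FUS::PylRS·CMP Sia Tr::MCP or the construct CMP Sia Tr::FUS::PylRS (control);
G: OT organelles targeted to the ER membrane, obtained by expressing the system P450 2C1(1-27)::FUS::PylRS·P450 2C1(1-27)::EWSR1::MCP or the construct P450 2C1(1-27)::FUS::PylRS (control).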
图7提供本发明招募RNA的不同方法的概述,所述方法利用不同RNA环与各自的靶向RNA的蛋白的相互作用。显示各自荧光流式细胞术(FFC)分析结果,并与仅用非靶向PylRS的相应分析结果进行比较:
A:系统ms-2-MCP将ms2环掺入mRNA分子的UTR中,并用MCP蛋白将所述mRNA招募到人工细胞器中。
B:系统boxB-λN22将boxB环掺入mRNA分子的UTR中,并用λN22蛋白将所述mRNA招募到人工细胞器中
C:系统pp7-PCP将pp7环掺入mRNA分子的UTR中,并用PCP蛋白将所述mRNA招募到人工细胞器中。
图8显示本发明产生OT细胞器的另一种方法,所述OT细胞器在不同细胞结构表面进行作用。本图示例靶向质膜。该具体方法的特征是将所谓的合成异二聚体卷曲螺旋肽SYNZIP1和SYNZIP2成对融合掺入到系统LcK::FUS::SYNZIP1::PylRS·EWSR1::SYNZIP2::MCP中;SYNZIP1和2对表达后,将MCP招募到基于质膜的OT细胞器上,其又使得能对随后招募的包含ms2靶向核苷酸环的mRNA进行选择性正交翻译。选择性翻译用各自的FFC分析结果(A)进行图示说明。在使用系统LcK::FUS::PylRS·EWSR1::SYNZIP2::MCP的比较方法中,其中缺少SYNZIP1,没有观察到翻译的选择性(B)。
具体实施方式
除非本文另有定义,否则本发明上下文中使用的科学和技术术语应具有本领域普通技术人员通常理解的含义。术语的含义和范围应明确。但是,在任何隐含歧义的情况下,本文提供的定义优先于任何字典或外部定义。此外,除非上下文另有要求,单数术语应包括复数,以及复数术语应包括单数。
如果没有另外说明,本文所述的核苷酸序列以5'到3'的方向描述。如果没有另外说明,本文所述的氨基酸序列以从N-端到C-端的方向描述。
如果没有另外说明,本发明的OT系统翻译表达的感兴趣的多肽(POI)包含一种或多种ncAA残基,其通过选择密码子在编码POI的核苷酸序列(CSPOI)中编码。
1.融合蛋白
1.1.总则
本发明的融合蛋白可以用不同方式解释说明。
第一种类型包括融合蛋白,其中至少两种类型的效应多肽(EP)包含于一个且相同的融合蛋白(也称为RNA-TP/O-RS融合蛋白)中,所述至少两种类型的效应多肽(EP)包含至少一个RNA-TP和至少一个O-RS。
第二种类型包括融合蛋白,其包含至少一种组装器多肽(AP)和至少一种类型的EP(也称为AFP),所述EP选自RNA-TP区段和O-RS区段。具体地,除了至少一种类型的AP之外,AFP可以包含RNA-TP和O-RS区段,如任何序列顺序的一个或多个RNA-TP区段以及一个或多个O-RS区段。因此,AFP具体地选自以下融合蛋白类型(多肽链内以任何顺序功能性连接的区段;多肽链内以任何顺序功能性连接的一个或多个相同类型的区段):
(RNA-TP/AP)
(O-RS/AP)
(RNA-TP/O-RS/AP)
AP选自IC-TP和PSP,并且可以包含任何序列顺序的一个或多个IC-TP和/或一个或多个PSP。因此,更为具体地,AFP选自以下融合蛋白类型(多肽链内以任何顺序功能性连接的区段;多肽链内以任何顺序功能性连接的一个或多个相同类型的区段):
(RNA-TP/IC-TP)
(O-RS/IC-TP)
(RNA-TP/O-RS/IC-TP)
(RNA-TP/PSP)
(O-RS/PSP)
(RNA-TP/O-RS/PSP)
(RNA-TP/PSP/IC-TP)
(O-RS/PSP/IC-TP)
(RNA-TP/O-RS/PSP/IC-TP)
AP和/或EP也可以包含(作为融合蛋白的一部分)形成异源寡聚体、特别是形成异二聚体的多肽区段,例如特别是合成卷曲螺旋SYNZIP肽。AFP组合包含这类相互作用的SYNZIP对,其分布在所述AFP组合的成员之间,使得每个AFP仅包含这类相互作用的SYNZIP对的一个成员,如具体实施方案所示。
本文所使用的术语“区段”在融合蛋白的上下文中表示所指定的元件(如,RNA-TP、O-RS、IC-TP、PSP、SYNZIP)是融合蛋白的一部分,即连接到融合蛋白的剩余部分。本发明的融合蛋白区段是功能性连接的,即连接使得它们仍然分别作为RNA-TP、O-RS、IC-TP和PSP或SYNZIP发挥作用。所述连接优选是共价的,特别是肽连接。
例如,本发明的融合蛋白包含的RNA-TP区段是融合蛋白的区段,其源自RNA-TP并在融合蛋白的上下文中作为RNA-TP发挥作用,因此允许融合蛋白与靶向的RNA相互作用(结合),其中所述相互作用有利地是特异性相互作用。因此,RNA-TP区段可以包含本文所述的靶向RNA的多肽的(整个)氨基酸序列或功能片段。
类似地,本发明的融合蛋白所包含的O-RS区段是融合蛋白的区段,其源自O-RS并在融合蛋白的上下文中作为O-RS发挥作用,因此赋予融合蛋白O-RS酶活性,所述酶活性是指用ncAA催化O-tRNA氨酰化的能力。因此,O-RS区段可以包含本文所述的O-RS的(整个)氨基酸序列或功能片段。
本文所述的组装器融合蛋白(AFP)包含至少一个作为组装器(AP)的多肽区段。本文所使用的术语AP是指允许在活细胞内的空间不同位点富集包含所述区段的AFP的任何多肽区段。有利地,所述空间不同位点位于细胞的细胞质内或直接与细胞质相邻,并且易于被细胞的翻译系统(包括典型的氨酰tRNA、翻译因子、核糖体亚基等)以及允许将ncAA残基引入POI的O-tRNA进入使用。
有不同类型的多肽区段可用作本发明中的AP。一种类型的AP是多肽区段,其源自细胞内靶向多肽(IC-TP)并在融合蛋白的上下文中作为细胞内靶向多肽(IC-TP)发挥作用。这些IC-TP区段可以包含IC-TP的(整个)氨基酸序列或功能片段。IC-TP靶向细胞内结构元件,并因此在所述细胞内结构元件处局部富集,所述细胞内结构元件在细胞质内或与细胞质直接相邻。这类结构元件的实例包括微管、膜的细胞质侧,如细胞膜、核膜、线粒体膜、高尔基体膜、内质网膜等。
因此,在具体实施方案中,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在微管,特别是微管的正端或负端的局部富集。例如,动力蛋白和驱动蛋白(动力蛋白或驱动蛋白家族的蛋白)及其功能片段和突变体可用作实现这种功能的IC-TP。
在进一步的具体实施方案中,本发明的融合蛋白包含至少一个IC-TP区段,其源自膜锚(membrane anchor)并作为膜锚发挥作用。例如,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在细胞(内)膜(特别是细胞膜的细胞质侧)处的局部富集。在另一个实例中,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在(外)核膜(特别是核膜的细胞质侧)处的局部富集。在进一步的具体实施方案中,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在线粒体外膜(特别是线粒体膜的细胞质侧)处的局部富集。在进一步的具体实施方案中,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在ER外膜(特别是ER膜的细胞质侧)处的局部富集。在进一步的具体实施方案中,本发明的融合蛋白包含至少一个IC-TP区段,其靶向并促进融合蛋白在高尔基体外膜(特别是高尔基体膜的细胞质侧)处的局部富集。例如,膜蛋白的跨膜结构域及其功能片段和突变体可用作实现这种功能的IC-TP。
靶向并因此在细胞内结构元件处局部富集的多肽是本领域已知的,并可用作本发明中IC-TP。
合适的IC-TP具体实例包括但不限于:
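-optionally truncated kinesin polypeptides that constitutively move toward, and become locally enriched at, microtubule plus ends in living cells, for example an optionally truncated kinesin family member 16B (KIF16B), such as an optionally truncated Homo sapiens KIF16B (Uniprot: Q96L93), in particular a fragment covering amino acid residues 1-400 of KIF16B (KIF16B(1-400)) that comprises the amino acid sequence of SEQ ID NO:20; or an optionally truncated kinesin family member 13A (KIF13A), such as an optionally truncated Homo sapiens KIF13A (Uniprot: Q9H1H9), in particular a KIF13A fragment covering amino acid residues 1-411 in which P390 is deleted (KIF13A(1-411, ΔP390)) and which comprises the amino acid sequence of SEQ ID NO:22; the polypeptide EB1 (Uniprot: Q15691), a microtubule tip-binding protein that binds to growing microtubule plus ends (Nehlig A, Molina A, Rodrigues-Ferreira S, Honoré S, Nahmias C. Regulation of end-binding protein EB1 in the control of microtubule dynamics. Cell Mol Life Sci. 2017;74(13):2381–2393. doi:10.1007/s00018-017-2476-2), which therefore targets the organelle to microtubule plus ends and comprises the amino acid sequence of SEQ ID NO:302;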
-源自跨膜蛋白的靶向线粒体外膜的多肽,例如,任选截短的线粒体外膜易位酶20(TOMM20),如任选截短的智人TOMM20(Uniprot:Q15388),特别是覆盖TOMM20的氨基酸残基1-70的片段(TOMM201-70),其包含SEQ ID NO:24的氨基酸序列;
-源自跨膜蛋白的细胞膜靶向多肽,例如,淋巴细胞特异性蛋白酪氨酸激酶(LcK;如:小家鼠(Mus musculus)LcK,Uniprot:P06240)、CD4(如:小家鼠CD4,Uniprot:P06332)、FRB(类似智人mTOR;Uniprot:P42345)、CD28(如:小家鼠CD28,Uniprot:P31041)及其组合,特别是包含SEQ ID NO:26、SEQ ID NO:28或SEQ ID NO:30的氨基酸序列的多肽;
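-the polypeptide CG1 (also known as Nup42; Uniprot: O15504), a nucleoporin that binds to the cytoplasmic side of the nuclear pore complex (Fernandez-Martinez J, Kim SJ, Shi Y, et al. Structure and Function of the Nuclear Pore Complex Cytoplasmic mRNA Export Platform. Cell. 2016;167(5):1215–1228.e25. doi:10.1016/j.cell.2016.10.028), which targets the cytoplasmic side of the nuclear membrane and comprises the amino acid sequence of SEQ ID NO:304;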
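-the polypeptide EBAG9 (Uniprot: O00559), a Golgi membrane protein with one transmembrane helix (Engelsberg A, Hermosilla R, Karsten U, Schülein R, Dörken B, Rehm A. The Golgi protein RCAS1 controls cell surface expression of tumor-associated O-linked glycan antigens. J Biol Chem. 2003;278(25):22998–23007. doi:10.1074/jbc.M301361200), which targets the cytoplasmic side of the Golgi membrane and comprises the amino acid sequence of SEQ ID NO:292 (full length) or the first 29 N-terminal amino acid residues of SEQ ID NO:294; or the polypeptide CMP Sia Tr (Uniprot: P78382), the CMP-sialic acid transporter, a Golgi protein with 10 transmembrane helices (Eckhardt M, Gotza B, Gerardy-Schahn R. Membrane topology of the mammalian CMP-sialic acid transporter. J Biol Chem. 1999;274(13):8779–8787. doi:10.1074/jbc.274.13.8779), which targets the cytoplasmic side of the Golgi membrane and comprises the amino acid sequence of SEQ ID NO:296;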
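-a polypeptide fragment of P450 2C1 (Uniprot: P78382), an endoplasmic reticulum-resident protein (Fazal FM, Han S, Parker KR, et al. Atlas of Subcellular RNA Localization Revealed by APEX-Seq. Cell. 2019;178(2):473–490.e26. doi:10.1016/j.cell.2019.05.027), which targets the cytoplasmic side of the ER membrane, in particular a fragment comprising the first 27 (SEQ ID NO:298) or the first 29 (SEQ ID NO:300) N-terminal amino acid residues;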
-the transmembrane protein stomatin-like protein 3 (SLP-3) (Homo sapiens, Uniprot: Q8TAV4; amino acids 1-59, comprising the amino acid sequence of SEQ ID NO:310), which localizes to the plasma membrane and to vesicle membranes (Lapatsina L, Jira JA, Smith ES, et al. Regulation of ASIC channels by a stomatin/STOML3 complex located in a mobile vesicle pool in sensory neurons. Open Biol. 2012;2(6):120096. doi:10.1098/rsob.120096);
以及这些多肽的功能片段和突变体。所述功能片段和突变体可以与其来源的多肽的氨基酸具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
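Another type of AP is a polypeptide segment that is derived from a phase separation polypeptide (PSP) and functions as a phase separation polypeptide in the context of the fusion protein. A PSP is a polypeptide that is capable of self-assembly in the cytoplasm of a cell, thereby creating sites of high local concentration in the cytoplasm. Specifically, PSPs are able to drive phase separation (in particular liquid-liquid phase separation), leading to the formation of membraneless compartments in the cytoplasm. Such compartments may take the form of droplets, aggregates, condensates or a dense phase. In particular, PSPs include intrinsically disordered proteins (IDPs), an important class of proteins that drive phase separation (see, e.g., Alberti et al., Bioessays 2016, 38:959-968 and references cited therein, such as Patel et al., Cell 2015, 162:1066-1077; Han et al., Cell 2012, 149:768-779; Kato et al., Cell 2012, 149:753-767). There are three different classes of IDPs, and proteins of each class, or functional fragments or mutants thereof, can be used as PSPs in the present invention. One important class of IDPs contains so-called prion-like domains, which are uncharged and contain polar amino acid residues (Q, N, S, G) interspersed with aromatic residues (F, Y). See, e.g., Malinovska et al., Biochim Biophys Acta 2013, 1834:918-931; Alberti et al., Cell 2009, 137:146-158; Malinovska et al., Prion 2015, 9:339-346. Another class of IDPs is likewise characterized by low sequence complexity but typically contains acidic and basic amino acid side chains, such as IDPs containing RGG repeats, e.g., Ddx4. See Nott et al., Mol Cell 2015, 57:936-947. Specific examples of suitable PSPs include, but are not limited to: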
-纺锤体缺陷蛋白5(SPD5)(如,秀丽隐杆线虫(Caenorhabditis elegans)SPD5;Uniprot:P91349),特别是包含SEQ ID NO:32的氨基酸序列的多肽;
-融合肉瘤(FUS)(如,智人FUS;Uniprot:P35637),特别是包含SEQ ID NO:34的氨基酸序列的多肽;
-尤文肉瘤断点区域1(Ewing sarcoma breakpoint region 1)(EWSR1)(如,智人EWSR1;Uniprot:Q01844),特别是包含SEQ ID NO:36的氨基酸序列的多肽;
-ATP-dependent RNA helicase LAF-1 (Caenorhabditis elegans, Uniprot: D0PV95), in particular its RGG domain (amino acids 1-168), comprising the amino acid sequence of SEQ ID NO:308 (Schuster BS, Reed EH, Parthasarathy R, et al. Controllable protein phase separation and modular recruitment to form responsive membraneless organelles. Nat Commun. 2018;9(1):2985. Published 2018 Jul 30. doi:10.1038/s41467-018-05403-1);
以及这些多肽的功能片段和突变体。所述功能片段和突变体可以包含与其来源的多肽的氨基酸至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的氨基酸序列。
本发明的融合蛋白所包含的AP数量没有特别限制,即融合蛋白可以包含1、2、3、4、5、6、7、8、9、10个或多个相同或不同的AP。特别优选包含至少一个选自IC-TP区段的AP和至少一个选自PSP区段的AP的本发明的融合蛋白。同样地,RNA-TP区段的数量没有特别限制并且可以独立地选自1、2、3、4、5或更多个,例如6、7、8、9或10个,不同或相同的RNA-TP区段。同样地,O-RS区段的数量没有特别限制并且可以独立地选自1、2、3、4、5或更多个,例如6、7、8、9或10个,不同或相同的O-RS区段。这适用于AFP以及RNA-TP/O-RS融合蛋白。本发明的融合蛋白中的区段数量显然影响融合蛋白的大小,其没有特别限制,但通常小于3500个氨基酸残基,例如小于3000个氨基酸残基。
本发明的融合蛋白内的区段顺序也没有特别限制。RNA-TP、O-RS和/或AP区段因此可以以任意顺序进行功能性连接。RNA-TP/O-RS融合蛋白结构(包含两种类型的EP区段)的实例包括但不限于,
[RNA-TP]x-[O-RS]y
[O-RS]y-[RNA-TP]x
其中x和y相互独立,是选自1、2、3、4和5的整数;
“-”表示肽键。
When x≥2, [RNA-TP]x may comprise identical or different RNA-TP segments. When y≥2, [O-RS]y may comprise identical or different O-RS segments.
Examples of AFP structures include, but are not limited to:
[IC-TP]m-[EP]o
[EP]o-[IC-TP]m
[PSP]n-[EP]o
[EP]o-[PSP]n
[IC-TP]m-[EP]o-[PSP]n
[PSP]n-[EP]o-[IC-TP]m
[IC-TP]m-[PSP]n-[EP]o
[EP]o-[PSP]n-[IC-TP]m
[PSP]n-[IC-TP]m-[EP]o
[EP]o-[IC-TP]m-[PSP]n
其中m、n和o相互独立,是选自1、2、3、4或5的整数,或选自1、2、3、4、5、6,“-”表示肽键。
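The ten block orders listed above are exactly the permutations of the block sets {IC-TP, EP}, {PSP, EP} and {IC-TP, PSP, EP}. The following Python sketch regenerates them; it is only an illustration of the combinatorics, and the function name and the string notation are assumptions made here rather than part of the specification.

```python
from itertools import permutations

# Index letter used in the text for each block type.
BLOCK_INDEX = {"IC-TP": "m", "PSP": "n", "EP": "o"}

# Block sets that occur in the list above; every set contains an EP block
# and at least one assembler (AP) block.
BLOCK_SETS = [("IC-TP", "EP"), ("PSP", "EP"), ("IC-TP", "PSP", "EP")]


def enumerate_block_orders():
    """Return every N-to-C order of the blocks in each block set."""
    orders = []
    for block_set in BLOCK_SETS:
        for order in permutations(block_set):
            orders.append("-".join(f"[{b}]{BLOCK_INDEX[b]}" for b in order))
    return orders
```

Calling `enumerate_block_orders()` yields 2 + 2 + 6 = 10 strings such as `[IC-TP]m-[EP]o-[PSP]n`, matching the list in the text.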
在一个优选的实施方案中,“m”是整数1。
在另一个优选的实施方案中,“n”是选自1和2的整数。
在另一个优选的实施方案中,如果EP选自RNA-TP,则“o”是选自1、2、3、4、5或6的整数。
在另一个优选的实施方案中,如果EP选自O-RS,则“o”是选自1或2的整数。
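In a further preferred embodiment of the AFP structure, at least one IC-TP occupies the C- or N-terminal position within the polypeptide chain.
In a further preferred embodiment of the AFP structure, at least one EP occupies the C- or N-terminal position within the polypeptide chain.
In a further preferred embodiment of the AFP structure, at least one IC-TP occupies the C- or N-terminal position within the polypeptide chain while at least one EP occupies the N- or C-terminal position, respectively. Any PSP, if present in such a structure, is located internally within the polypeptide chain.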
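When m≥2, [IC-TP]m may comprise identical or different IC-TP segments. Preferably, IC-TPs of the same function are used (i.e., IC-TPs targeting the same type of cellular structure, e.g., the same type of membrane or organelle). When n≥2, [PSP]n may comprise identical or different PSP segments. When o≥2, [EP]o may comprise identical or different EPs. When [EP]o comprises different EPs, at least one EP may, for example, be an RNA-TP segment and at least one may be an O-RS segment.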
本发明的融合蛋白提供一种正交翻译(OT)系统,其中将一种或多种ncAA残基引入POI中所需的一个或多个O-RS(区段)与至少一个靶向RNA的多肽(RNA-TP)区段在空间上接近。POI的mRNA包含至少一种靶向核苷酸序列(TN),其能够与OT系统的融合蛋白中的至少一种的RNA-TP区段相互作用。所述的相互作用有利地是特异性相互作用。本发明的融合蛋白的RNA-TP区段优选是靶向mRNA的多肽区段。有利地选择融合蛋白的RNA-TP区段和POI mRNA的TN以便其特异性地相互作用(结合)。适用于此目的的RNA-TP区段和TN对可以选自RNA病毒的外壳蛋白和所述外壳蛋白结合的核酸基序。这类病毒外壳蛋白和蛋白结合的RNA基序是本领域已知的。
合适的RNA-TP的具体实例包括但不限于:
-MCP(肠杆菌噬菌体MS2的外壳蛋白),特别是包含SEQ ID NO:14的氨基酸序列的多肽;
-λN22(λ噬菌体抗终止子蛋白N的22个氨基酸的RNA结合结构域),特别是包含SEQID NO:16的氨基酸序列的多肽;
-PCP(细菌噬菌体PP7的外壳蛋白,Wu B,Chao JA,Singer RH.Fluorescencefluctuation spectroscopy enables quantitative imaging of single mRNAs inliving cells.Biophys J.2012;102(12):2936–2944.doi:10.1016/j.bpj.2012.05.017),特别是包含SEQ ID NO:306的氨基酸序列的多肽;
以及这些多肽的功能片段和突变体。所述功能片段和突变体可以包含与其来源的多肽的氨基酸至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的氨基酸序列。
合适的TN的具体实例包括但不限于:
-肠杆菌噬菌体MS2 RNA茎环,特别是具有对应于SEQ ID NO:17的核苷酸(DNA)序列(由SEQ ID NO:17的核苷酸(DNA)序列编码)的RNA序列的多核苷酸;
-BoxB(λ噬菌体RNA茎环,λN22的特异性结合位点),特别是具有对应于SEQ ID NO:18的核苷酸(DNA)序列(由SEQ ID NO:18的核苷酸(DNA)序列编码)的RNA序列的多核苷酸;
-细菌噬菌体pp7 RNA茎环(Wu B,Chao JA,Singer RH.Fluorescencefluctuation spectroscopy enables quantitative imaging of single mRNAs inliving cells.Biophys J.2012;102(12):2936–2944.doi:10.1016/j.bpj.2012.05.017),特别是具有对应于SEQ ID NO:289或SEQ ID NO:290的核苷酸(DNA)序列(由SEQ ID NO:289或SEQ ID NO:290的核苷酸(DNA)序列编码)的RNA序列的多核苷酸;
以及它们的功能片段和突变体。所述功能片段和突变体可以包含与其来源的多核苷酸序列至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的核苷酸序列。
这类TN可用作单拷贝区段或者用作包含多于一个,例如两个、三个、四个、五个、六个或多个TN的重复单元的多拷贝区段。
MCP与MS2 RNA茎环特异性地相互作用。因此,当融合蛋白的RNA-TP区段包含(由其组成)源自MCP的区段,并作为MCP发挥作用时,POI的mRNA适宜地包含一个或多个MS2 RNA茎环,如两个、三个、四个、五个或六个MS2 RNA茎环。λN22与BoxB特异性地相互作用。因此,当融合蛋白的RNA-TP区段包含(由其组成)源自λN22的区段,并作为λN22发挥作用时,POI的mRNA适宜地包含一个或多个BoxB基序,如一个、两个、三个、四个、五个或六个或更多个BoxB基序。PCP与pp7 RNA茎环特异性地相互作用。因此,当融合蛋白的RNA-TP区段包含(由其组成)源自PCP的区段,并作为PCP发挥作用时,POI的mRNA适宜地包含一个或多个pp7 RNA茎环,如两个、三个、四个、五个或六个或更多个pp7 RNA茎环。
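The cognate pairings described above (MCP with MS2, λN22 with BoxB, PCP with pp7) can be captured in a small lookup that guards against mismatched designs. This is a minimal Python sketch; the dictionary, the function names and the "4xMS2"-style label are hypothetical conveniences, not notation used in the specification.

```python
# Cognate RNA-TP -> TN motif pairs named in the text.
COGNATE_TN = {
    "MCP": "MS2",
    "λN22": "BoxB",
    "PCP": "pp7",
}


def is_cognate_pair(rna_tp: str, tn: str) -> bool:
    """True if the TN motif is the one specifically bound by the RNA-TP."""
    return COGNATE_TN.get(rna_tp) == tn


def describe_tag(rna_tp: str, copies: int = 2) -> str:
    """Label a multi-copy TN tag for the mRNA of the POI, e.g. '2xMS2'."""
    if copies < 1:
        raise ValueError("at least one TN copy is required")
    return f"{copies}x{COGNATE_TN[rna_tp]}"
```

For example, `describe_tag("MCP", 2)` gives `2xMS2`, corresponding to the two MS2 stem-loops used on the mCherry reporter mRNA in Figure 2B.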
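Several RSs have been used for genetic code expansion, including the Methanococcus jannaschii tyrosyl-tRNA synthetase, the E. coli tyrosyl-tRNA synthetase, the E. coli leucyl-tRNA synthetase, and pyrrolysyl-tRNA synthetases from certain Methanosarcina species (e.g., M. mazei, M. barkeri, M. acetivorans, M. thermophila), from Methanococcoides (M. burtonii) or from Desulfitobacterium (D. hafniense). The corresponding orthogonal RS/tRNA pairs have been used to genetically encode a variety of functionalities in polypeptides (Chin, Annu Rev Biochem 2014, 83:379-408; Chin et al., J Am Chem Soc 2001, 124:9026; Chin et al., Science 2003, 301:964; Nguyen et al., J Am Chem Soc 2009, 131:8720; Yanagisawa et al., Chem Biol 2008, 15:1187). Depending on the cell used for translating the POI, these RSs can be used as the O-RS of the invention.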
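The pyrrolysyl-tRNA synthetase (PylRS) that can be used in the methods and fusion proteins of the invention may be a wild-type or a genetically engineered PylRS. Examples of wild-type PylRS include, but are not limited to, PylRS from archaea and eubacteria, such as Methanosarcina mazei, Methanosarcina barkeri, Methanococcoides burtonii, Methanosarcina acetivorans, Methanosarcina thermophila and Desulfitobacterium hafniense. Genetically engineered PylRS have been described, for example, by Neumann et al. (Nat Chem Biol 2008, 4:232), Yanagisawa et al. (Chem Biol 2008, 15:1187) and in EP2192185A1. The efficiency of genetic code expansion with PylRS can be increased by modifying the amino acid sequence of the PylRS so that it is not directed to the nucleus. To this end, the nuclear localization signal (NLS) can be deleted from the PylRS or masked by introducing a suitable nuclear export signal (NES). The PylRS used in the fusion proteins and methods of the invention may be a PylRS that lacks the NLS and/or comprises an NES, as described in WO 2018/069481.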
因此,可用于本发明的融合蛋白的O-RS区段的实例包括但不限于:
-詹氏甲烷球菌酪氨酰-tRNA合成酶;
-大肠杆菌酪氨酰-tRNA合成酶;
-大肠杆菌亮氨酰-tRNA合成酶;
-马氏甲烷八叠球菌(Methanosarcina mazei)吡咯赖氨酰-tRNA合成酶;
-巴氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-乙酸甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-嗜热甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-布氏拟甲烷球菌吡咯赖氨酰-tRNA合成酶;
-Desulfitobacterium hafniense吡咯赖氨酰-tRNA合成酶;
以及这些多肽的功能(即酶活性)片段和突变体。所述功能片段和突变体可以包含与其来源的氨酰tRNA合成酶至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的氨基酸序列。
源自马氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶的用于本发明的O-RS区段的具体实例包括但不限于:
-源自PylRSAF(马氏甲烷八叠球菌吡咯赖氨酰tRNA合成酶双突变体:Y306A、Y384F;Uniprot:Q8PWY1)的O-RS区段,例如包含SEQ ID NO:8的氨基酸序列的O-RS区段;
-源自PylRSAA(马氏甲烷八叠球菌吡咯赖氨酰tRNA合成酶双突变体:N346A、C348A;Uniprot:Q8PWY1)的O-RS区段,例如包含SEQ ID NO:10的氨基酸序列的O-RS区段;
-源自PylRSAAAF(马氏甲烷八叠球菌吡咯赖氨酰tRNA合成酶四重突变体:Y306A、N346A、C348A、Y384F;Uniprot:Q8PWY1)的O-RS区段,例如包含SEQ ID NO:12的氨基酸序列的O-RS区段;
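-O-RS segments derived from IFRS1 (M. mazei pyrrolysyl-tRNA synthetase mutant (L305M, Y306L, L309S, N346S, C348M)), for example an O-RS segment comprising the amino acid sequence of SEQ ID NO:224;
-O-RS segments derived from CbzRS (M. mazei pyrrolysyl-tRNA synthetase mutant (Y306M, L309G, C348T)), for example an O-RS segment comprising the amino acid sequence of SEQ ID NO:226;
-O-RS segments derived from CpkRS (M. mazei pyrrolysyl-tRNA synthetase mutant (A302S)), for example an O-RS segment comprising the amino acid sequence of SEQ ID NO:228;
-O-RS segments derived from OMeRS (M. mazei pyrrolysyl-tRNA synthetase mutant (A302T, Y384F, N346V, C348W, V401L)), for example an O-RS segment comprising the amino acid sequence of SEQ ID NO:236;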
以及这些多肽区段的功能(即酶活性)片段和突变体。所述功能片段和突变体可以包含与其来源的氨酰tRNA合成酶至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的氨基酸序列。
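According to particular embodiments, the wild-type and mutant M. mazei PylRS described herein are used to aminoacylate tRNA with ncAAs as described in WO2012/104422 or WO2015/107064. Exemplary ncAAs for this purpose include, but are not limited to, 2-amino-6-(cyclooct-2-yn-1-yloxycarbonylamino)hexanoic acid (SCO), 2-amino-6-(cyclooct-2-yn-1-yloxyethoxycarbonylamino)hexanoic acid, 2-amino-6-[(4E-cyclooct-4-en-1-yl)oxycarbonylamino]hexanoic acid (TCO), 2-amino-6-[(2E-cyclooct-2-en-1-yl)oxycarbonylamino]hexanoic acid (TCO*), 2-amino-6-(prop-2-ynoxycarbonylamino)hexanoic acid (PrK) and 2-amino-6-(9-bicyclo[6.1.0]non-4-ynylmethoxycarbonylamino)hexanoic acid (BCN).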
在本发明的另一实施方案中,上述AP(IC-TP和PSP)区段和/或上述EP(RNA-TP和O-RS)区段相互独立,可以进一步与天然的或者,尤其是,合成的蛋白区段结合,其诱导和控制大分子的相互作用。特别地,这类进一步的蛋白区段可操作地融合到本发明的AFP的多肽链中。一个或多个,如2、3、4、5、6、7、8、9或10个,然而优选一个这样的蛋白区段可操作地融合到本发明的单个AFP中。融合到AFP多肽链中应当使得其他多肽区段(AP和EP)的活性基本不受影响,特别是不被抑制(即AP和EP保持可操作),同时保留其他多肽区段诱导和控制大分子相互作用的能力。文献中描述的是所谓的SYNZIP肽,其形成多聚体结构。本发明的上下文中特别感兴趣的是具有形成特定异二聚体卷曲螺旋蛋白结构能力的SYNZIP。这类SYNZIP是成对的人工合成肽,能够相互作用,用于诱导和控制大分子相互作用。非限制性示例是成对的SYNZIP 1:2;SYNZIP 3:4和SYNZIP 5:6。根据本发明特别优选的是如Reinke,A.W.,Grant,R.A.,Keating,A.E.(2010)J Am Chem Soc 132 6025-6031所述的异源特异性卷曲螺旋对SYNZIP2:SYNZIP1(SYNZIP 1:SEQ ID NO:312;SYNZIP 2:SEQ ID NO:314;SYNZIP 3:SEQ ID NO:316;SYNZIP 4:SEQ ID NO:318,以及这些SYNZIP多肽的功能片段和突变体。所述功能片段和突变体可以包含与其来源的多肽的氨基酸至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%相同的氨基酸序列)。由于需要成对使用以诱导大分子相互作用,在本文所述的AFP组合中,这些SYNZIP优选成对使用。通过整合在不同AFP融合蛋白中的这类SYNZIP对的相互作用,可进一步支持根据本发明的OT细胞器的形成。
在本发明的另一实施方案中,本发明的融合蛋白可通过引入(融合)至少一个所谓的“表位标签”,即充当抗体结合位点的短寡肽序列来进一步修饰,可用于检测/定量表达的本发明的融合产物。这类标签的非限制性实例如下:
VSV-G:水泡性口炎病毒糖蛋白表位标签(SEQ ID NO:680)
HA:人流感病毒血凝素表位标签(SEQ ID NO:682)
Myc:人c-Myc原癌基因表位标签(SEQ ID NO:684)
1.2本发明的AFP构建体的具体实例
每个单独示例的构建体可以以N->C或C->N方向理解。所绘制的方案是在N->C方向上给出的。
在区段块[IC-TP]m、[PSP]n、[O-RS]y和[RNA-TP]x的情况下,其中m、n、y或x是>1的整数,这类块中的重复区段可以相同或不同,优选相同。
其中所应用的区段[IC-TP]、[PSP]、[O-RS]、[RNA-TP]x和[SYNZIP]可以从上文第1.1节中描述的区段的各个实例制备。
1.2.1.靶向细胞内结构的单功能AFP
1.2.1.1靶向细胞内结构的单功能AFP(即包含一种类型的EP)
其中个别优选的实例有:
[IC-TP]m-[O-RS]y,其中m=1或2,优选1;y=1或2,优选1;
[IC-TP]m-[RNA-TP]x,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;
[IC-TP]m-[PSP]n-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;y=1或2,优选1;
[IC-TP]m-[PSP]n-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;
[IC-TP]m-[O-RS1]y-[PSP]n-[O-RS2]y,其中m=1或2,优选1;n=1、2或3,优选1或2;y相互独立地=1或2,优选1;并且O-RS1和O-RS2相同或不同,优选相同;
[IC-TP]m-[PSP1]n-[O-RS]y-[PSP2]n,其中m=1或2,优选1;n相互独立地为1、2或3,优选1或2;y相互独立地=1或2,优选1;并且PSP1和PSP2相同或不同;
[IC-TP]m-[RNA-TP1]x-[PSP]n-[RNA-TP2]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x相互独立地=1、2、3、4、5或6,优选2、3或4;并且RNA-TP1和RNA-TP2相同或不同,优选相同;
[IC-TP]m-[PSP1]n-[O-RS1]y-[PSP2]n-[O-RS2]y,其中m=1或2,优选1;n相互独立地为1、2或3,优选1或2;y相互独立地=1或2,优选1;O-RS1和O-RS2相同或不同,优选相同;并且PSP1和PSP2相同或不同;
[IC-TP]m-[PSP1]n-[RNA-TP1]x-[PSP2]n-[RNA-TP2]x,其中m=1或2,优选1;n相互独立地=1、2或3,优选1或2;x相互独立地=1、2、3、4、5或6,优选2、3或4;RNA-TP1和RNA-TP2相同或不同;并且PSP1和PSP2相同或不同。
1.2.1.2靶向细胞内结构的双功能AFP(包含两种类型的EP)
其中个别优选的实例有:
[IC-TP]m-[O-RS]y-[RNA-TP]x,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[RNA-TP]x-[O-RS]y,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP]n-[O-RS]y-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP]n-[RNA-TP]x-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[O-RS]y-[PSP]n-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[RNA-TP]x-[PSP]n-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP1]n-[O-RS]y-[PSP2]n-[RNA-TP]x,其中m=1或2,优选1;n每个独立,n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;并且PSP1和PSP2相同或不同;
[IC-TP]m-[PSP1]n-[RNA-TP]x-[O-RS1]y-[PSP2]n-[O-RS2]y,其中m=1或2,优选1;每个独立的n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y相互独立地=1或2,优选1;并且PSP1和PSP2相同或不同;O-RS1和O-RS2相同或不同,优选相同;
[IC-TP]m-[PSP1]n-[O-RS1]y-[PSP2]n-[O-RS2]y-[RNA-TP]x,其中m=1或2,优选1;每个独立的n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y相互独立地=1或2,优选1;并且PSP1和PSP2相同或不同;O-RS1和O-RS2相同或不同,优选相同。
1.2.2.不靶向细胞内结构的单功能AFP
这些是与上文第1.2.1节中所列相同的AFP,唯一例外的是区段[IC-TP]缺失,而区段[PSP]保留。
1.2.3.SYNZIP变体
这些是与上文第1.2.1和1.2.2节中所列相同的AFP,唯一例外的是区段[IC-TP]、[PSP]、[O-RS2]或[RNA-TP]中的至少一个在N-或C-末端补充有SYNZIP元件。AFP可以包含1、2、3、4或5个,优选1或2个,相同或不同,优选相同的SYNZIP。这类分子的非限制性实例有:
1.2.3.1单功能SYNZIP AFP
其中个别优选的实例有:
[PSP]n-[SYNZIP]-[O-RS]y,y=1或2,优选1;n=1、2或3,优选1或2;
[PSP]n-[SYNZIP]-[RNA-TP]x,x=1、2、3、4、5或6,优选2、3或4;n=1、2或3,优选1或2;
[IC-TP]m-[SYNZIP]-[O-RS]y,其中m=1或2,优选1;y=1或2,优选1;
[IC-TP]m-[SYNZIP]-[RNA-TP]x,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;
[IC-TP]m-[PSP]n-[SYNZIP]-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;y=1或2,优选1;
[IC-TP]m-[PSP]n-[SYNZIP]-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4。
1.2.3.2双功能SYNZIP AFP
其中个别优选的实例有:
[IC-TP]m-[O-RS]y-[SYNZIP]-[RNA-TP]x,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[RNA-TP]x-[SYNZIP]-[O-RS]y,其中m=1或2,优选1;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP]n-[SYNZIP]-[O-RS]y-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP]n-[SYNZIP]-[RNA-TP]x-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;
[IC-TP]m-[PSP]n-[SYNZIPa]-[O-RS]y-[SYNZIPb]-[RNA-TP]x,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;并且SYNZIPa和SYNZIPb相同或不同,优选相同;
[IC-TP]m-[PSP]n-[SYNZIPa]-[RNA-TP]x-[SYNZIPb]-[O-RS]y,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;并且SYNZIPa和SYNZIPb相同或不同,优选相同;
[IC-TP]m-[PSP1]n-[SYNZIP]-[RNA-TP]x-[O-RS1]y-[PSP2]n-[O-RS2]y,其中m=1或2,优选1;n=1、2或3,优选1或2;x=1、2、3、4、5或6,优选2、3或4;y=1或2,优选1;并且PSP1和PSP2相同或不同;O-RS1和O-RS2相同或不同,优选相同。
1.2.4.单功能融合蛋白
其个别优选的实例有:
[SYNZIP]-[O-RS]y,其中y=1或2,优选1;
[SYNZIP]-[RNA-TP]x,其中x=1、2、3、4、5或6,优选2、3或4;
Since IC-TP and PSP are absent here, these are preferably used in combination with AFP molecules that comprise at least one IC-TP and/or PSP segment.
1.3单个融合蛋白的实例
本发明的融合蛋白的非常具体的实例及其特定组合列于下表1、2和3中。这个表1、2和3的内容也构成说明书一般公开的一部分,其内容在一般方面不做明确和字面重复。表1和表2中标明为“包含O-RS和RNA-TP区段的融合蛋白”的相应列披露应被视为独立于表1和表2中涉及具体报告和宿主细胞系的其他列内容披露。
2.功能片段和突变体
本文所述的是特定RNA-TP、O-RS、IC-TP、PSP、TN以及SYNZIP的片段和突变体,其具有功能性(即分别具有亲本RNA-TP的RNA结合活性、亲本IC-TP的靶向细胞内结构的活性、亲本PSP的自组装活性、亲本TN对RNA-TP的结合活性、亲本O-RS的酶活性或亲本SYNZIP的异二聚体卷曲螺旋形成能力)。这类片段和突变体可用本文所述的最小程度的序列相同性进行表征。所述氨基酸或核苷酸序列相同性分别指所表征的氨基酸或核苷酸序列在整个长度上的相同性。百分比相同性值可根据本领域已知的BLAST比对、blastp算法(蛋白-蛋白BLAST),或使用Clustal方法(Higgins et al.,Comput Appl.Biosci.1989,5(2):151-1)进行确定。
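To illustrate what a minimum percent identity over the entire length of the characterized sequence means, the following Python sketch computes identity from two sequences that have already been aligned (gaps written as "-"). It is a deliberately simplified stand-in and not the BLAST or Clustal procedure named above; the function names and the choice to count gap positions as mismatches are assumptions made for this example.

```python
def percent_identity(aligned_variant: str, aligned_parent: str) -> float:
    """Percent identity of a variant relative to the full parent length.

    Both inputs must come from the same pairwise alignment and therefore
    have equal length. Positions where the variant carries a gap or a
    different residue count as mismatches.
    """
    if len(aligned_variant) != len(aligned_parent):
        raise ValueError("aligned sequences must have equal length")
    parent_length = sum(1 for p in aligned_parent if p != "-")
    if parent_length == 0:
        raise ValueError("parent sequence is empty")
    matches = sum(
        1
        for v, p in zip(aligned_variant, aligned_parent)
        if p != "-" and v == p
    )
    return 100.0 * matches / parent_length


def meets_identity_threshold(identity: float, minimum: float = 60.0) -> bool:
    """Check an identity value against a minimum such as 60, 80 or 95."""
    return identity >= minimum
```

With a parent of eight residues and a variant lacking one of them, the sketch returns 87.5%, which would satisfy the 60%, 70% and 80% levels but not the 90% level.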
本发明可用的特定RNA-TP、O-RS、IC-TP、SYNZIP或PSP的片段和突变体保留亲本多肽的相关功能(分别为结合、自组装或酶活性),并且可以例如通过本领域已知的保守氨基酸取代获得,即用具有相似生化特性(例如电荷、疏水性和大小)的不同氨基酸残基置换氨基酸残基。典型的实例是用Ile取代Leu或反之,用Glu取代Asp或反之,用Gln替换Asn或反之,等等。
3.正交翻译、tRNA和POI编码序列
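The term “translation system” generally refers to the set of components required to incorporate naturally occurring amino acids into a growing polypeptide chain (protein). Components of a translation system may include, for example, ribosomes, tRNAs, aminoacyl-tRNA synthetases, mRNA and the like. An aminoacyl-tRNA synthetase (RS) is an enzyme capable of aminoacylating a tRNA with an amino acid or an amino acid analogue. The RS used in the processes of the invention is capable of aminoacylating a tRNA with the corresponding ncAA, i.e., of aminoacylating a tRNAncAA. The term “orthogonal” as used herein refers to an element of a translation system (e.g., an orthogonal tRNA (O-tRNA) and/or an orthogonal aminoacyl-tRNA synthetase (O-RS)) that is used with reduced efficiency by the translation system of interest (e.g., a cell). “Orthogonal” means that the O-tRNA or O-RS cannot function, or functions only with reduced efficiency (e.g., less than 20%, less than 10%, less than 5% or, for example, less than 1% efficiency), together with the endogenous RSs or the endogenous tRNAs, respectively, of the translation system of interest. For example, an O-tRNA in the translation system of interest is aminoacylated by any endogenous RS of that system with reduced or even zero efficiency compared with the aminoacylation of an endogenous tRNA by the endogenous RS. In another example, an O-RS aminoacylates any endogenous tRNA of the translation system of interest with reduced or even zero efficiency compared with the aminoacylation of the endogenous tRNA by an endogenous RS. Specifically, the term “orthogonal translation system” or “OT system” is used herein to refer to a translation system that uses an O-RS/O-tRNAncAA pair, which allows an ncAA residue to be introduced into a growing polypeptide chain.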
本发明中使用的O-RS/O-tRNAncAA对优选具有以下特性:O-tRNAncAA优先通过O-RS用ncAA进行氨酰化。此外,正交对在感兴趣的翻译系统(例如细胞)中发挥作用,因此O-tRNAncAA用于将ncAA残基掺入生长的POI多肽链中。掺入以位点特异性方式进行。具体地,O-tRNAncAA识别编码POI的mRNA中的选择密码子(例如,琥珀、赭石或乳白终止密码子)。
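The term “preferentially aminoacylates” means that the O-RS aminoacylates the O-tRNA with the unnatural amino acid with an efficiency of, for example, about 50%, about 70%, about 75%, about 85%, about 90%, about 95%, or about 99% or more, compared with an endogenous tRNA or amino acid of the translation system of interest (e.g., a cell). The unnatural amino acid is then incorporated into the growing polypeptide chain with high fidelity, for example with an efficiency of greater than about 75%, greater than about 80%, greater than about 90%, greater than about 95%, or greater than about 99% or more for a given selector codon.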
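tRNAs that can be aminoacylated by fusion proteins of the invention comprising at least one O-RS segment derived from the M. mazei pyrrolysyl-tRNA synthetase include, but are not limited to, the pyrrolysyl tRNA of M. mazei and functional mutants thereof in which the anticodon is the anticodon of the selector codon, for example the anticodon CUA for the amber stop codon TAG, the anticodon UCA for the opal stop codon TGA and the anticodon UUA for the ochre stop codon TAA. Examples of such pyrrolysyl tRNAs include, but are not limited to, those encoded by the nucleotide sequences SEQ ID NO:4 (tRNAPyl,CUA), SEQ ID NO:5 (tRNAPyl,UCA) or SEQ ID NO:6 (tRNAPyl,UUA). Non-limiting examples of other suitable tRNAs are the following tRNAs derived from the pyrrolysyl tRNA of M. mazei: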
tRNApyl,CGA吡咯赖氨酰tRNA(用于丝氨酸密码子),SEQ ID NO:229
tRNApyl,CGG吡咯赖氨酰tRNA(用于脯氨酸密码子),SEQ ID NO:230
tRNApyl,UAA吡咯赖氨酰tRNA(用于亮氨酸密码子),SEQ ID NO:231
tRNApyl,UAG吡咯赖氨酰tRNA(用于亮氨酸密码子),SEQ ID NO:232
tRNApyl,CCG吡咯赖氨酰tRNA(用于精氨酸密码子),SEQ ID NO:233
tRNApyl,AUA吡咯赖氨酰tRNA(用于异亮氨酸密码子),SEQ ID NO:234
如本文所用的术语“选择密码子”是指在翻译过程中被O-tRNAncAA识别(即结合)的密码子。该术语还用于不是信使RNA(mRNA)的多核苷酸(例如DNA质粒)的多肽编码序列中的相应密码子。本文描述的新OT系统允许以与细胞的细胞质中存在的其他mRNA相比对所述POI的mRNA具有选择性的方式进行POI的正交翻译。然而,优选选择密码子是所选择用于表达的细胞中的低丰度密码子,例如天然存在的真核细胞中的低丰度密码子。新OT系统使POI的mRNA、O-RS和tRNAncAA相互靠近,从而支持在POI的选择密码子编码的氨基酸位置引入ncAA(而不是引入可能与选择密码子结合的不同tRNA的氨基酸)。因此,所述选择密码子可以是有义密码子。然而,在优选的实施例中,选择密码子是不被用于制备POI的细胞的内源性tRNA识别的密码子。
O-tRNAncAA的反密码子与mRNA(POI的mRNA)内的选择密码子结合,从而将ncAA位点特异性地掺入由所述mRNA编码的生长多肽链(POI)中。可用于本文所述新OT系统的选择密码子的实例包括但不限于:
-无义密码子,如终止密码子,例如,琥珀(UAG)、赭石(UAA)和乳白(UGA)密码子;
-由三个以上碱基组成的密码子(例如,四碱基密码子);
-源自天然或非天然碱基对的密码子;和
-有义密码子。
当使用的选择密码子是有义密码子(即,天然的三碱基密码子)时,优选根据本发明的方法用于POI表达的细胞内源翻译系统不(或几乎不)使用所述天然三碱基密码子,例如,其中识别天然三碱基密码子的tRNA缺乏或丰度减少的细胞,或者其中天然三碱基密码子是稀有密码子的细胞。特别优选使用一种或多种终止密码子,例如琥珀、赭石和乳白中的一种或多种作为本发明的选择密码子。
可将多个选择密码子引入编码所需多肽(靶多肽,POI)的多核苷酸,例如,一个或多个、两个或更多个、超过三个等选择密码子。一个POI可携带两个或更多个ncAA残基。所述ncAA残基可以相同并且由相同类型的选择密码子编码,或者可以不同并且由不同的选择密码子编码。
反密码子具有相应密码子的反向互补序列。
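The reverse-complement relationship between a selector codon and its anticodon can be checked with a few lines of Python. This is a minimal sketch assuming plain Watson-Crick pairing without wobble or modified bases; the function name is illustrative.

```python
# Watson-Crick complements for RNA bases.
_RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def anticodon_for(codon: str) -> str:
    """Return the anticodon (5'->3') for a codon given 5'->3'.

    The codon may be written with DNA (T) or RNA (U) letters and may be
    longer than three bases (e.g., a quadruplet codon).
    """
    rna_codon = codon.upper().replace("T", "U")
    return "".join(_RNA_COMPLEMENT[base] for base in reversed(rna_codon))
```

Applied to the three stop codons, the sketch reproduces the anticodons named in the text: TAG (amber) gives CUA, TGA (opal) gives UCA and TAA (ochre) gives UUA.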
抑制子tRNA是一种改变给定翻译系统(例如,细胞)中信使RNA(mRNA)阅读的tRNA(如O-tRNAncAA)。抑制子tRNA可以通读,例如,终止密码子、四碱基密码子或稀有密码子。
如本文所述,O-tRNA优先通过O-RS(而不是内源性合成酶)氨酰化并且能够解码选择密码子。O-RS识别O-tRNA,例如,具有扩展的反密码子环,并且优先用ncAA氨酰化O-tRNA。
本发明的方法和/或融合蛋白中使用的O-tRNA和O-RS可以是天然存在的,或者可以从各种生物体通过天然存在的tRNA和/或RS的突变衍生。在各种实施方案中,tRNA和RS源自至少一种生物体。在另一实施方案中,tRNA从第一种生物体由天然存在的tRNA或突变的天然存在的tRNA衍生,而RS从第二种生物体由天然存在的RS或突变的天然存在的RS衍生。
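Suitable (orthogonal) tRNA/RS pairs can be selected from libraries of mutant tRNAs and RSs, for example on the basis of library screening results. Alternatively, a suitable tRNA/RS pair may be a heterologous tRNA/synthetase pair that is imported into the translation system from a source species. Preferably, the cell used as the translation system is different from said source species. Methods for evolving tRNA/RS pairs are described, for example, in WO 02/085923 and WO 02/086075.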
常规的定点诱变可用于将选择密码子引入POI的编码序列中。
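At the sequence level, introducing a selector codon by site-directed mutagenesis amounts to replacing one codon of the POI coding sequence. The following Python sketch shows that replacement in silico; it assumes an in-frame coding sequence and 1-based residue numbering, and the function name and the toy sequence in the usage note are hypothetical.

```python
def introduce_selector_codon(cds: str, residue_position: int,
                             selector: str = "TAG") -> str:
    """Replace the codon at a 1-based residue position with a selector codon.

    The coding sequence (CDS) must be in frame, i.e., its length must be
    a multiple of three. The default selector is the amber codon TAG.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if not 1 <= residue_position <= len(codons):
        raise ValueError("residue position is outside the coding sequence")
    codons[residue_position - 1] = selector
    return "".join(codons)
```

For the toy sequence `ATGGCCAAATAA`, replacing residue 2 gives `ATGTAGAAATAA`. A reporter such as GFP39TAG would correspond to calling the function with position 39 on the GFP coding sequence.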
4.核酸分子
本发明还涉及核酸分子(单链或双链DNA和RNA序列,例如cDNA、mRNA),或这类核酸分子的组合,其包含编码至少一种本发明的融合蛋白的核苷酸序列,和/或与其互补的核苷酸序列。
此外,本发明涉及核酸分子(单链或双链DNA和RNA序列,例如cDNA、mRNA),或这类核酸分子的组合,其包含(i)核苷酸序列(CSPOI),其编码至少一种POI,所述POI包含一种或多种ncAA残基,所述ncAA残基在CSPOI中由选择密码子编码;和(ii)如本文所述的靶向核苷酸序列(TN),其中包含所述TN(的RNA形式)的RNA分子能够通过所述TN与靶向RNA的多肽(RNA-TP)相互作用。
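The nucleic acid molecules of the invention may additionally comprise untranslated sequences at the 3' and/or 5' end of the coding gene region. The TN is preferably located at the 3' end of the nucleic acid molecule encoding the POI. For example, a POI-encoding nucleic acid molecule of the invention can be prepared by introducing at least one TN into an untranslated region (in particular the 3' untranslated region) using common cloning techniques known in the art.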
本发明进一步涉及,特别是重组体、表达构建体或表达盒,其包含在调控核酸序列的遗传控制下的如本文所述的本发明的核酸分子或核酸分子组合的核酸序列。因此本发明的表达盒包含编码至少一种POI(加TN)或至少一种本发明的融合蛋白的核酸序列,和/或与其互补的核酸序列。本发明还涉及,特别是重组体、载体,其包含这些表达构建体(表达载体)中的至少一种。
表达盒通常包含位于编码待表达POI或融合蛋白的核酸序列5'(上游)并与其功能性连接的启动子序列、所述编码序列的终止子序列3'(下游)和任选存在的其他调控元件。这类其他调控元件的实例包括但不限于靶向序列、增强子、聚腺苷酸化信号、选择标记、扩增信号、复制起点等。合适的调控序列描述于例如Goeddel,Gene Expression Technology:Methods in Enzymology 185,Academic Press,San Diego,CA(1990)。
除了这些调控序列外,这些序列的天然调控仍可在实际结构基因之前存在,并任选地可在遗传上改变,从而天然调控被关闭且基因表达增加。然而,核酸构建体也可以是更为简单的构建体,即在编码序列之前没有插入额外的调控信号,并且天然启动子及其调控没有被去除。相反,天然调控序列发生突变,从而不再发生调控,基因表达增加。
核酸分子的元件的“功能性”连接,如启动子、多肽编码序列、终止子、调节子,表示排列这些元件,使得可以转录编码序列并且任选的调节元件可对所述转录进行调节。这可通过同一个核酸分子中的元件直接连接来实现。然而,这种直接连接不是必须的。基因控制序列,例如增强子序列,甚至可以从更远的位置或甚至其他DNA分子对靶序列发挥作用。待转录的核酸序列位于启动子序列下游(即在其3'-末端)的排列是优选的,从而使两个序列共价连接在一起。启动子序列与待表达的核酸序列之间的距离可以小于200个碱基对,或小于100个碱基对或小于50个碱基对。
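For expression in a cell, the expression cassette is advantageously inserted into an expression vector. The expression vector is selected according to the cell used for expression, so as to allow optimal expression of the coding nucleotide sequence in that cell. Vectors are well known to those skilled in the art and are described, for example, in “Cloning Vectors” (Pouwels P.H. et al., Eds., Elsevier, Amsterdam-New York-Oxford, 1985, ISBN 0 444 904018). Examples of expression vectors include, but are not limited to, plasmids, viral vectors and phages (e.g., SV40, CMV, baculovirus and adenovirus), transposons, IS elements, cosmids, and linear or circular DNA. These vectors may replicate autonomously in the (host) cell or may be replicated chromosomally. An expression vector comprising at least one expression cassette of the invention represents a further aspect of the invention.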
为了在根据本发明的细胞中表达POI,例如,可以将编码POI的核酸分子(例如本发明的表达载体)引入细胞中。或者,可以修饰细胞的现有基因,以便在POI意图携带上ncAA残基的那些氨基酸位置包含选择密码子。用于将编码(重组)多肽的核酸分子引入细胞或修饰细胞的现有基因的方法是本领域已知的。
在本发明上下文中,术语“表达”描述细胞中由相应核酸序列编码的多肽的产生。术语“表达”也用于细胞中由核酸序列编码的tRNA分子的产生。
The nucleic acid molecules of the invention, including the expression cassettes and expression vectors of the invention, can be prepared using common recombination and cloning techniques known in the art, as described, for example, in T. Maniatis, E.F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989); T.J. Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989); and Ausubel, F.M. et al., Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience (1987).
本发明的核酸分子或核酸分子的组合,包括本发明的表达盒和表达载体,可以通过例如本领域已知的方法分离。
“分离的”核酸分子是从核酸的天然来源中存在的其他核酸分子分离的,而且当它通过重组技术产生时,可基本上不含其他细胞材料或培养基,或者当它是化学合成时,可不含化学前体或其他化学品。
根据本发明的核酸分子可通过分子生物学的标准技术和本发明所提供的序列信息来分离。例如,cDNA可从合适的cDNA库中分离,使用具体公开的完整序列之一或其区段作为杂交探针以及标准杂交技术(例如Sambrook,J.,Fritsch,EF and Maniatis,T.Molecular Cloning:A Laboratory Manual.2nd edition,Cold Spring HarborLaboratory,Cold Spring Harbor,NY,1989所述)。此外,包含公开序列之一或其区段的核酸分子,可使用基于这个序列构建的寡核苷酸引物通过聚合酶链式反应分离。可以将由此扩增的核酸克隆到合适的载体中并可通过DNA序列分析来表征。此外,根据本发明的寡核苷酸可通过标准的合成方法来生产,例如用自动DNA合成仪。
5.ncAA和翻译后POI修饰
缩写“ncAA”通常是指任何非典型或非天然氨基酸,或者氨基酸残基,其不属于22种天然存在的蛋白原氨基酸。许多ncAA是本领域公知的(参见,例如,Liu et al.,Annu RevBiochem 2010,79:413-444;Lemke,ChemBioChem 2014,15:1691-1694)。术语“ncAA”也指氨基酸衍生物,例如α-羟基酸(而不是α-氨基酸)。这类衍生物也已被证明是可翻译掺入的。参见,例如,Ohta et al.,2008,ChemBioChem 9:2773-2778。因此,本文使用的术语如“氨酰化(aminoacylate)”或“氨酰化(aminoacylation)”的含义不限于tRNA和α-氨基酸的RS催化连接,还包括tRNA和ncAA衍生物如α-羟基酸的RS催化连接。
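Particularly preferred ncAAs for use in the invention are those that can be further modified post-translationally, for example by click chemistry. Such click reactions include strain-promoted inverse-electron-demand Diels-Alder cycloaddition (SPIEDAC; see, e.g., Devaraj et al., Angew Chem Int Ed Engl 2009, 48:7013) as well as cycloadditions between a strained cycloalkynyl group, or a strained cycloalkynyl analog group in which one or more ring atoms not bound by the triple bond are substituted by amino groups, and azides, nitrile oxides, nitrones or diazocarbonyl reagents (see, e.g., Sanders et al., J Am Chem Soc 2010, 133:949; Agard et al., J Am Chem Soc 2004, 126:15046), for example strain-promoted alkyne-azide cycloaddition (SPAAC). Such click reactions allow ultrafast, bioorthogonal, covalent, site-specific coupling of the ncAA labeling group of the target polypeptide to a suitable group of a coupling partner molecule. Pairs of docking and labeling groups that can react via the above click reactions are known in the art. Examples of suitable ncAAs comprising docking groups for use in the invention include, but are not limited to, the ncAAs ("unnatural amino acids", "UAAs") described in, e.g., WO 2012/104422 and WO 2015/107064. Optionally substituted strained alkynyl groups include, but are not limited to, optionally substituted cyclooctynyl groups, such as those described in WO 2012/104422 and WO 2015/107064. Optionally substituted strained alkenyl groups include, but are not limited to, optionally substituted trans-cyclooctenyl groups, such as those described in the above documents. Optionally substituted tetrazinyl groups include, but are not limited to, those described in WO 2012/104422 and WO 2015/107064.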
本发明的上下文中使用的ncAA可以其盐的形式使用。如本文所述的ncAA的盐是指酸或碱加成盐,特别是生理上可耐受的酸或碱的加成盐。生理上耐受的酸加成盐可以通过用适当的有机或无机酸处理ncAA的碱形式来形成。通过用适当的有机和无机碱处理可以将含有酸质子的ncAA转化为它们的无毒金属或胺加成盐形式。ncAA的羧基盐可以以本领域已知的方式生产,并且包含无机盐,例如钠盐、钙盐、铵盐、铁盐和锌盐,以及与有机碱形成的盐,例如胺,如三乙醇胺、精氨酸、赖氨酸、哌啶等。ncAA也可以以酸加成盐的形式使用,例如与无机酸形成的盐,如盐酸或硫酸,以及与有机酸形成的盐,如乙酸和草酸。可用于本发明的ncAA及其盐还包括其水合物和溶剂加成形式,如水合物、醇化物等。
生理上耐受的酸或碱特别是用于制备具有ncAA残基的POI的翻译系统所耐受的酸或碱,例如对活的真核细胞基本上无毒。
在本发明上下文中可用的ncAA及其盐可以通过类似于本领域公知并且例如本文引用的各种出版物描述的方法来制备。
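The nature of the coupling partner molecule depends on the intended use. For example, the target polypeptide can be coupled to a molecule suitable for imaging methods, or can be functionalized by coupling to a bioactive molecule. For example, in addition to the docking group, the coupling partner molecule may comprise a group selected from, but not limited to, dyes (e.g. fluorescent, luminescent or phosphorescent dyes such as dansyl, coumarin, fluorescein, acridine, rhodamine, silicon-rhodamine, BODIPY or cyanine dyes), molecules that fluoresce upon contact with a reagent, chromophores (e.g. phytochrome, phycobilin, bilirubin, etc.), radiolabels (e.g. radioactive forms of hydrogen, fluorine, carbon, phosphorus, sulfur or iodine, such as tritium, 18F, 11C, 14C, 32P, 33P, 33S, 35S, 111In, 125I, 123I, 131I, 212Bi, 90Y or 186Re), MRI-sensitive spin labels, affinity tags (e.g. biotin, His-tag, Flag-tag, Strep-tag, sugars, lipids, sterols, PEG linker molecules, benzylguanine, benzylcytosine or cofactors), polyethylene glycol groups (e.g. branched PEG, linear PEG, PEG of different molecular weights, etc.), photocrosslinkers (such as p-azidoiodoacetanilide), NMR probes, X-ray probes, pH probes, IR probes, resins, solid supports and bioactive compounds (e.g. synthetic drugs). Suitable bioactive compounds include, but are not limited to, cytotoxic compounds (e.g. cancer chemotherapeutic compounds), antiviral compounds, biological response modifiers (e.g. hormones, chemokines, cytokines, interleukins, etc.), microtubule-affecting agents, hormone modulators and steroidal compounds. Specific examples of useful coupling partner molecules include, but are not limited to, members of receptor/ligand pairs; members of antibody/antigen pairs; members of lectin/carbohydrate pairs; members of enzyme/substrate pairs; biotin/avidin; biotin/streptavidin; and digoxin/anti-digoxin.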
某些ncAA残基(的标记基团)与偶联伙伴分子(的对接基团)原位共价偶联的能力,特别是通过本文所述的点击反应,可用于在表达靶多肽的真核细胞或组织中检测具有这类ncAA残基的靶多肽,以及用于研究靶多肽的分布和命运。具体地,本发明通过在(例如真核)细胞中表达制备POI的方法可以与超分辨率显微镜术(SRM)结合以检测细胞内或这类细胞的组织内的POI。数种SRM方法是本领域已知的,并且可以将其修改以利用点击化学来检测由本发明的真核细胞表达的靶多肽。这类SRM方法的具体实例包括DNA-PAINT(用于纳米级成像的DNA点积累;例如Jungmann et al.,Nat Methods 11:313-318,2014描述)、dSTORM(直接随机光学重建显微术)和STED(受激发射损耗)显微术。
6.细胞内POI的翻译制备
本发明提供的OT系统允许在细胞中翻译制备POI。
根据本发明用于制备POI的细胞可以是原核细胞。或者,根据本发明用于制备POI的细胞可以是真核细胞。根据本发明用于制备POI的细胞可以是单个细胞,例如单细胞微生物或源自多细胞生物体细胞的细胞系。或者,根据本发明用于制备POI的细胞可以存在于组织、器官、身体部位(及其部分)或整个多细胞生物体中。因此,本发明用于制备POI的方法可以用单个细胞或细胞培养物,或者用组织或组织培养物、器官、身体部分或者(整个多细胞)生物体进行。
与原核生物(例如大肠杆菌)相比,真核细胞通常更难处理和操作,因此无法或仅很难使用已知的POI选择性正交翻译方法,例如上文“发明背景”一节中描述的方法。因此,当用于真核细胞(包括,例如,单细胞和多细胞真核生物体以及真核细胞系)中的POI表达时,本发明的OT系统和方法是特别有利的。
原则上,根据本发明的方法,所有原核或真核细胞均可用于制备POI。可以使用微生物,例如细菌、真菌或酵母,以及真核细胞,例如哺乳动物细胞、昆虫细胞、酵母细胞和植物细胞。特别优选真核细胞,特别是哺乳动物细胞。
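The cell used according to the invention for preparing the POI carries a nucleotide sequence encoding the POI (CSPOI), in which the ncAA residues of the POI are encoded by selector codons. The CSPOI is functionally linked to one or more targeting nucleotide sequences (TN). Transcription yields an mRNA comprising the CSPOI and the TN. The cell further comprises one or more fusion proteins of the invention, which comprise at least one O-RS segment and at least one RNA-TP segment. The O-RS and the RNA-TP may be located on separate fusion proteins of the invention (e.g. AFPs). Alternatively, the O-RS and the RNA-TP may be located on one and the same fusion protein of the invention (e.g. on an RNA-TP/O-RS fusion protein or an AFP). Via (at least one of) its TN, the mRNA can interact with (bind to) at least one RNA-TP segment of a fusion protein of the invention in the cell. The cell further comprises one or more orthogonal tRNAncAA molecules (O-tRNAncAA) carrying the anticodon for the selector codon of the CSPOI. Together with one or more O-RS segments of the fusion proteins in the cell, the O-tRNAncAA molecules form one or more orthogonal O-RS/O-tRNAncAA pairs, which allow ncAA residues to be introduced into the amino acid sequence of the (translationally prepared) POI.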
包含CSPOI和TN的mRNA与RNA-TP区段的相互作用,通过O-RS区段用ncAA对O-tRNAncAA的氨酰化,以及包括引入ncAA残基的POI的翻译制备,据认为发生在细胞质中,更特别是在ncAA存在下的细胞的OT组装器集合体(OT细胞器)中。
包含CSPOI和TN的mRNA(mRNAPOI)可以由引入细胞的重组构建体(例如表达载体)产生。或者,可以修饰细胞的一个或多个内源基因以包含一种或多种选择密码子以及一种或多种TN。将重组构建体引入细胞的技术以及修饰细胞内源基因的方法是本领域公知的。
本发明的tRNAncAA分子和融合蛋白可以由引入细胞的重组构建体(例如表达载体)产生。
使用本发明的表达载体,可以产生重组细胞,其可用于使用本发明的方法制备POI。有利地,将上述根据本发明的重组载体引入合适的细胞中并表达。
如本文所述用于制备POI的细胞可以通过将编码融合蛋白、tRNAncAA分子和POI的核苷酸序列引入细胞来制备。所述核苷酸序列可以以任何组合位于不同的核酸分子(载体)或同一核酸分子(例如,载体)上,并且可以组合或顺序方式引入细胞中。
优选地,使用本领域技术人员已知的常用克隆和转染技术,例如共沉淀、原生质体融合、电穿孔、病毒介导的基因递送、脂质转染、显微注射或其他,将所述核酸分子引入相应的细胞中。合适的技术描述于例如Current Protocols in Molecular Biology,F.Ausubelet al.,Ed.,Wiley Interscience,New York 1997,或者Sambrook et al.MolecularCloning:A Laboratory Manual.2nd edition,Cold Spring Harbor Laboratory,ColdSpring Harbor Laboratory Press,Cold Spring Harbor,NY,1989。
对于本发明的方法,用于POI表达的细胞以本领域技术人员已知的方式进行生长或培养。取决于细胞的类型,可以使用液体培养基进行培养。培养可以是分批、半分批或连续的。营养物质可以在培养开始时存在,或者可以在后续培养中半连续或连续提供。
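The expressed POI can be purified by known techniques, for example chromatographic methods such as molecular sieve chromatography (gel filtration), Q-Sepharose chromatography, ion-exchange chromatography and hydrophobic chromatography, as well as other common protein purification techniques such as ultrafiltration, crystallization, salting-out, dialysis and native gel electrophoresis. Suitable methods are described, for example, in Cooper, T.G., Biochemische Arbeitsmethoden [Biochemistry processes], Verlag Walter de Gruyter, Berlin, New York, or in Scopes, R., Protein Purification, Springer Verlag, New York, Heidelberg, Berlin.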
为了分离POI,将POI与可以用于更容易纯化的标签相连接可能是有利的。这可以通过将相应的标签编码序列引入CSPOI来实现。用于蛋白纯化的合适标签是本领域公知的,并且包括例如组氨酸标签(例如,His6标签)和可被识别为抗体抗原的表位(描述于例如Harlow,E.and Lane,D.,1988,Antibodies:A Laboratory Manual.Cold Spring Harbor(NY)Press)。这些标签可以用于将蛋白连接至固体载体,例如聚合物基质,其可以例如用作色谱柱中的填料,或者可以用于微量滴定板或一些其他载体上。
连接到POI的标签也可以用于检测POI。用于蛋白检测的标签是本领域公知的,并且包括例如荧光染料,酶标记物,其在与底物反应后形成可检测的反应产物,等等。
为了根据本发明的方法制备POI,可以通过在对应于POI的ncAA残基的一种或多种ncAA存在下培养细胞(其中所述ncAA可以方便地包含在培养基中)一段适合翻译POI的时间来实现表达。取决于编码POI的核酸(以及任选存在的本发明的融合蛋白和/或tRNAncAA分子),可能需要通过添加诱导转录的化合物来诱导表达,例如允许转录的阿拉伯糖、异丙基β-D-硫代半乳糖苷(IPTG)或四环素。
翻译后,可以任选地从翻译系统中回收POI。为此目的,根据本领域技术人员已知和使用的流程,POI可以部分或基本上被回收和纯化至均质。除非靶多肽被分泌到培养基中,否则回收通常需要细胞破碎。细胞破碎的方法是本领域公知的,包括物理破碎,例如,通过(超声)声波作用,液体剪切破碎(例如,通过弗氏压碎器),机械方法(如使用搅拌器或研磨机)或冻融循环,也包括化学裂解,所述化学裂解使用破坏脂质-脂质、蛋白-蛋白和/或蛋白-脂质相互作用的试剂(如去污剂),以及物理破碎技术和化学裂解的组合。从细胞裂解液或培养基中纯化多肽的标准流程也是本领域公知的,并且包括例如硫酸铵或乙醇沉淀、酸或碱提取、柱层析、亲和柱层析、阴离子或阳离子交换层析、磷酸纤维素层析、疏水相互作用色谱、羟基磷灰石色谱、凝集素色谱、凝胶电泳等。根据需要,可以使用蛋白重折叠步骤来制备正确折叠的成熟蛋白。在需要高纯度的最终纯化步骤中,可以采用高效液相色谱(HPLC)、亲和色谱或其他合适的方法。针对本发明的多肽制备的抗体可用作纯化试剂,即用于多肽的基于亲和的纯化。多种纯化/蛋白折叠方法是本领域公知的,包括例如Scopes,ProteinPurification,Springer,Berlin(1993);和Deutscher,Methods in Enzymology Vol.182:Guide to Protein Purification,Academic Press(1990);以及其中引用的参考文献中示出的那些方法。
如上所述,本领域技术人员会认识到,在合成、表达和/或纯化之后,多肽可以具有与相关多肽的期望构象不同的构象。例如,由原核系统产生的多肽通常通过暴露于离液剂中以实现正确折叠来优化。在从例如细胞裂解液纯化期间,表达的多肽任选地变性然后复性。这是通过例如将蛋白溶解在离液剂如盐酸胍中来实现的。通常,有时需要使表达的多肽变性和还原,然后使多肽重新折叠成优选的构象。例如,胍、尿素、DTT、DTE和/或伴侣蛋白可以添加到感兴趣的翻译产物中。还原、变性和复性蛋白的方法是本领域技术人员公知的。多肽可以在含有例如氧化谷胱甘肽和L-精氨酸的氧化还原缓冲液中重新折叠。
本文还描述通过本发明的方法产生的多肽。这类多肽可通过本发明的方法制备,所述方法利用本文所述的OT系统。
7.试剂盒
本发明还提供用于制备POI的试剂盒,所述POI具有至少一个非典型氨基酸(ncAA)残基。本发明的试剂盒可以包含至少一种用于本发明的至少一种融合蛋白的表达载体。试剂盒中的表达载体编码的融合蛋白可以包含至少一个O-RS区段和至少一个RNA-TP区段。所述试剂盒可以进一步包含至少一种ncAA或其盐,对应于所述POI的至少一个ncAA残基。有利地,所述O-RS区段能够用至少一种ncAA氨酰化tRNA。所述试剂盒可以进一步包含至少一种用于正交tRNAncAA(O-tRNAncAA)分子的表达载体。试剂盒的其他成分可以包括至少一种表达载体,其包含多克隆位点和靶向核苷酸序列(TN),其中包含所述TN的RNA分子能够通过所述TN与靶向RNA的多肽(RNA-TP)相互作用。有利地,所述TN是这样的序列,当存在于RNA分子中时,其能够与试剂盒包含的表达载体所编码的融合蛋白中的至少一种的RNA-TP区段相互作用。所述试剂盒可以进一步包含至少一种报告构建体,其编码易于检测(例如荧光)的报告多肽,所述报告多肽具有至少一个非典型氨基酸(ncAA)残基,使得从所述构建体翻译的mRNA包含如本文所述的TN。
本发明的试剂盒可在本发明的方法中用于制备本文所述的含有ncAA残基的POI。
具体实施方案
本发明还提供以下非限制性实施方案E1至E50。
E1:一种组装器融合蛋白(AFP),其包含:
(a)充当组装器(AP)的至少一个第一多肽区段,其选自:
(a1)源自细胞内靶向多肽的多肽区段(IC-TP区段),其中所述细胞内靶向多肽靶向细胞内结构元件,并因此在所述细胞内结构元件处局部富集,所述细胞内结构元件在细胞质内或与细胞质直接相邻;和
(a2)源自相分离多肽的多肽区段(PSP区段),其中所述相分离多肽具有在细胞的细胞质中进行自缔合的能力以在细胞质中产生高局部浓度的位点,以及
(b)充当效应物(EP)的至少一个第二多肽区段,其选自:
b1)靶向RNA的多肽(RNA-TP)区段,和
b2)正交氨酰tRNA合成酶(O-RS)区段;
其中所述多肽区段在所述AFP中功能性连接。
E2:E1的AFP,其包含至少两种AP,优选至少一个IC-TP区段和至少一个PSP区段。
E3:E1或E2的AFP,其具有以下结构之一(从N端到C端):
[IC-TP]m-[EP]o
[EP]o-[IC-TP]m
[PSP]n-[EP]o
[EP]o-[PSP]n
[IC-TP]m-[EP]o-[PSP]n
[PSP]n-[EP]o-[IC-TP]m
[IC-TP]m-[PSP]n-[EP]o
[EP]o-[PSP]n-[IC-TP]m
[PSP]n-[IC-TP]m-[EP]o
[EP]o-[IC-TP]m-[PSP]n
其中m、n和o相互独立地是选自1、2、3、4或5的整数,并且“-”表示肽键。
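The ten N-to-C arrangements listed in E3 are the four two-part orders (one AP type plus EP) together with all six orderings of IC-TP, PSP and EP. The following Python sketch enumerates them and renders a concrete architecture for chosen repeat numbers; the function names are hypothetical and serve only to illustrate the combinatorics.

```python
from itertools import permutations

ALLOWED_COUNTS = range(1, 6)  # m, n and o are each 1, 2, 3, 4 or 5


def e3_segment_orders():
    """All segment orders of embodiment E3, written from N- to C-terminus."""
    two_part = [("IC-TP", "EP"), ("EP", "IC-TP"), ("PSP", "EP"), ("EP", "PSP")]
    three_part = list(permutations(("IC-TP", "PSP", "EP")))
    return two_part + three_part


def render_architecture(order, counts):
    """Expand e.g. [PSP]n into n repeated PSP segments joined by '-' (peptide bond)."""
    segments = []
    for name in order:
        if counts[name] not in ALLOWED_COUNTS:
            raise ValueError(f"repeat number for {name} must be 1 to 5")
        segments.extend([name] * counts[name])
    return "-".join(segments)


orders = e3_segment_orders()
example = render_architecture(("IC-TP", "PSP", "EP"), {"IC-TP": 1, "PSP": 2, "EP": 1})
```

Here `example` corresponds to the structure [IC-TP]m-[PSP]n-[EP]o with m = 1, n = 2 and o = 1.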
E4:E1-E3中任一项的AFP,其中至少一个EP选自RNA-TP区段。
E5:E1-E3中任一项的AFP,其中至少一个EP选自O-RS区段。
E6:E1-E3中任一项的AFP,其包含至少一个选自RNA-TP区段的EP和至少一个选自O-RS区段的EP。
E7:E1-E6中任一项的AFP,其包含至少一个IC-TP区段,所述IC-TP区段选自动力蛋白和驱动蛋白以及动力蛋白和驱动蛋白的片段和突变体,其保留靶向微管的正末端或负末端并在微管的正末端或负末端富集的能力。
E8:E1-E6中任一项的AFP,其包含至少一个IC-TP区段,所述IC-TP区段选自膜蛋白的跨膜结构域以及跨膜结构域的功能片段和突变体,其保留靶向膜的细胞质侧并在膜的细胞质侧富集的能力,特别是选自细胞膜、核膜和线粒体膜的膜。
E9:E1-E8中任一项的AFP,其包含至少一个IC-TP区段,所述IC-TP区段选自:
-KIF16B1-400,其包含SEQ ID NO:20的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:20的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-KIF13A1-411,Δ390,其包含SEQ ID NO:22的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:22的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-TOMM201-70,其包含SEQ ID NO:24的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:24的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-LcK,其包含SEQ ID NO:26的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:26的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-FRB-CD28,其包含SEQ ID NO:28的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:28的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-FUS-CD28,其包含SEQ ID NO:30的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:30的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
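-EB1, comprising the amino acid sequence of SEQ ID NO:302, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% amino acid sequence identity to the amino acid sequence of SEQ ID NO:302;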
-CG1,其包含SEQ ID NO:304的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:304的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-EBAG9,其包含SEQ ID NO:292的氨基酸序列(全长),或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:292具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;或者包含SEQ IDNO:294的前29个N端氨基酸残基;或者其功能片段或突变体,所述功能片段或突变体与SEQID NO:294具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-CMP Sia Tr,其包含SEQ ID NO:296的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:296的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-P450 2C1,其靶向ER膜的细胞质侧,或者其功能片段或突变体,所述功能片段或突变体与其具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性,特别是包含N端前27个(SEQ ID NO:298)氨基酸残基的片段;或包含前29个(SEQ ID NO:300)氨基酸残基的片段;或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:298或300具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
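The identity thresholds used throughout these embodiments (at least 60% up to at least 99%) can be illustrated with a small Python sketch. It assumes that the two sequences have already been aligned (gaps written as "-") and that identity is counted as identical positions divided by aligned columns; this counting convention, the function names and the toy sequences are assumptions made for illustration and are not taken from this section.

```python
THRESHOLDS = (60, 70, 80, 90, 95, 96, 97, 98, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Largest listed threshold that `identity` reaches, or None below 60%."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Toy sequences (made up): one substitution in eight aligned positions
identity = percent_identity("MKTAYIAK", "MKTGYIAK")
```

In practice the alignment itself would come from an established alignment program; only the final counting step is shown.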
E10:E1-E9中任一项的AFP,其包含至少一个PSP区段,所述PSP区段选自天然无序蛋白(IDP),特别是朊病毒样结构域,以及IDP或朊病毒样结构域的功能片段和突变体,其保留在细胞的细胞质中自缔合的能力,从而在细胞质中产生局部高浓度位点。
E11:E1-E10中任一项的AFP,其包含至少一个PSP区段,所述PSP区段选自:
-SPD5,其包含SEQ ID NO:32的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:32的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-FUS,其包含SEQ ID NO:34的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:34的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-EWSR1,其包含SEQ ID NO:36的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:36的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
E12:E1-E11中任一项的AFP,其包含至少一个RNA-TP区段,所述RNA-TP区段选自病毒外壳蛋白的RNA结合区段以及病毒外壳蛋白的RNA结合区段的功能片段和突变体,其保留与病毒的RNA基序特异性相互作用的能力。
E13:E1-E12中任一项的AFP,其包含至少一个RNA-TP区段,所述RNA-TP区段选自:
-MCP,其包含SEQ ID NO:14的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:14的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-λN22,其包含SEQ ID NO:16的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:16的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-PCP,其包含SEQ ID NO:306的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:306的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
E14:E1-E13中任一项的AFP,其包含至少一个O-RS区段,所述O-RS区段选自:
-詹氏甲烷球菌酪氨酰-tRNA合成酶;
-大肠杆菌酪氨酰-tRNA合成酶;
-大肠杆菌亮氨酰-tRNA合成酶;
-马氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-巴氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-乙酸甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-嗜热甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-布氏拟甲烷球菌吡咯赖氨酰-tRNA合成酶;
-Desulfitobacterium hafniense pyrrolysyl-tRNA synthetase;
and functional fragments and mutants thereof that retain aminoacyl-tRNA synthetase enzymatic activity.
E15:E1-E14中任一项的AFP,其包含至少一个O-RS区段,所述O-RS区段选自:
-PylRSAF,其包含SEQ ID NO:8的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:8的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-PylRSAA,其包含SEQ ID NO:10的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:10的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-PylRSAAAF,其包含SEQ ID NO:12的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:12的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-IFRS1,其包含SEQ ID NO:224的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:224的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-CbzRS,其包含SEQ ID NO:226的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:226的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-CpkRS,其包含SEQ ID NO:228的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:228的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-OMeRS,其包含SEQ ID NO:236的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:236的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
E16:一种组装器融合蛋白(AFP)组合,其包含至少两种E1-E15中任一项的AFP。
E17:E16的AFP组合,其包含至少一个第一AFP,以及至少一个第二AFP,所述第一AFP包含至少一个RNA-TP区段,所述第二AFP包含至少一个O-RS区段。
E18:一种融合蛋白(RNA-TP/O-RS融合蛋白),其包含:
(ⅰ)至少一个靶向RNA的多肽(RNA-TP)区段;和
(ⅱ)至少一个正交氨酰tRNA合成酶(O-RS)区段,
其中所述多肽区段在所述RNA-TP/O-RS融合蛋白中功能性连接。
E19:E18的RNA-TP/O-RS融合蛋白,其具有以下结构之一(从N端到C端):
[RNA-TP]x-[O-RS]y
[O-RS]y-[RNA-TP]x
其中x和y相互独立地是选自1、2、3、4和5的整数;并且“-”表示肽键。
E20:E18或E19的RNA-TP/O-RS融合蛋白,其包含至少一个RNA-TP区段,所述RNA-TP区段选自病毒外壳蛋白的RNA结合区段以及病毒外壳蛋白的RNA结合区段的功能片段和突变体,其保留与病毒的RNA基序特异性相互作用的能力。
E21:E18-E20中任一项的RNA-TP/O-RS融合蛋白,其包含至少一个RNA-TP区段,所述RNA-TP区段选自:
-MCP,其包含SEQ ID NO:14的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:14的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-λN22,其包含SEQ ID NO:16的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:16的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-PCP,其包含SEQ ID NO:306的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:306的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
E22:E18-E21中任一项的RNA-TP/O-RS融合蛋白,其包含至少一个O-RS区段,所述O-RS区段选自:
-詹氏甲烷球菌酪氨酰-tRNA合成酶;
-大肠杆菌酪氨酰-tRNA合成酶;
-大肠杆菌亮氨酰-tRNA合成酶;
-马氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-巴氏甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-乙酸甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-嗜热甲烷八叠球菌吡咯赖氨酰-tRNA合成酶;
-布氏拟甲烷球菌吡咯赖氨酰-tRNA合成酶;
-Desulfitobacterium hafniense pyrrolysyl-tRNA synthetase;
and functional fragments and mutants thereof that retain aminoacyl-tRNA synthetase enzymatic activity.
E23:E18-E22中任一项的RNA-TP/O-RS融合蛋白,其包含至少一个O-RS区段,所述O-RS区段选自:
-PylRSAF,其包含SEQ ID NO:8的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:8的氨基酸序列具有至少60%序列相同性;
-PylRSAA,其包含SEQ ID NO:10的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:10的氨基酸序列具有至少60%序列相同性;
-PylRSAAAF,其包含SEQ ID NO:12的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:12的氨基酸序列具有至少60%序列相同性;
-IFRS1,其包含SEQ ID NO:224的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:224的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-CbzRS,其包含SEQ ID NO:226的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:226的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;
-CpkRS,其包含SEQ ID NO:228的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:228的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性;以及
-OMeRS,其包含SEQ ID NO:236的氨基酸序列,或者其功能片段或突变体,所述功能片段或突变体与SEQ ID NO:236的氨基酸序列具有至少60%、至少70%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%氨基酸序列相同性。
E24:一种核酸分子,或者两种或更多种核酸分子的组合,其包含:
(i)核苷酸序列,其编码至少一种E1-E15中任一项的AFP,或者至少一种E16或E17的AFP组合,或
(ii) a nucleic acid sequence complementary to the nucleotide sequence of (i), or
(ⅲ)(i)和(ii)。
E25:一种核酸分子,或者两种或更多种核酸分子的组合,其包含:
(i)核苷酸序列,其编码至少一种E18-E23中任一项的RNA-TP/O-RS融合蛋白,或
(ii)与(i)互补的核酸序列,或
(ⅲ)(i)和(ii)。
E26:一种表达盒,其包含E24或E25的核酸分子或者核酸分子的组合的核苷酸序列。
E27:一种表达载体,其包含至少一种E26的表达盒。
E28:一种细胞,其包含至少一种E24或E25的核酸分子或核酸分子的组合、至少一种E26的表达盒或者至少一种E27的表达载体。
E29:E28的细胞,其为真核细胞。
E30:E28的细胞,其为哺乳动物细胞。
E31:E28-E30中任一项的细胞,其包含至少一种E24的核酸分子或核酸分子的组合,或者至少一种表达盒,所述表达盒包含所述核酸分子或核酸分子的组合的核苷酸序列,或者至少一种包含所述表达盒的表达载体。
E32:E31的细胞,其包含核苷酸序列,所述核苷酸序列编码至少一种E1-E3和E7-E15中任一项的AFP或者与编码至少一种E1-E3和E7-E15中任一项的AFP的核苷酸序列互补,所述AFP包含至少一个选自RNA-TP区段的EP和至少一个选自O-RS区段的EP。
E33:E31的细胞,其包含核苷酸序列,所述核苷酸序列编码以下AFP或与编码以下AFP的核苷酸序列互补:
至少一种E1-E3和E7-E15中任一项的AFP,所述AFP包含至少一个选自RNA-TP区段的EP;以及
至少一种E1-E3和E7-E15中任一项的AFP,所述AFP包含至少一个选自O-RS区段的EP。
E34:E28-E30中任一项的细胞,其包含至少一种E25的核酸分子或核酸分子的组合,或者至少一种表达盒,所述表达盒包含所述核酸分子或核酸分子的组合的核苷酸序列,或者至少一种包含所述表达盒的表达载体。
E35:E28-E34中任一项的细胞,其中所述细胞分别表达所述至少一种AFP、所述至少一种AFP组合或者所述至少一种RNA-TP/O-RS融合蛋白,其由所述核酸分子或核酸分子的组合的核苷酸序列编码。
E36:一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基,其中所述方法包括在所述一种或多种ncAA的存在下,在E31-E33中任一项的细胞中表达所述POI,其中所述细胞包含:
(ⅰ)编码POI的核苷酸序列(CSPOI),其中所述POI的一种或多种ncAA残基由选择密码子编码,
(ⅱ)靶向核苷酸序列(TN),其功能性连接至所述CSPOI,并且能够与所述细胞中AFP中的至少一种的RNA-TP区段相互作用;
(ⅲ)一种或多种正交tRNAncAA(O-tRNAncAA)分子,其携带与所述CSPOI的选择密码子互补的反密码子,并且其中所述O-tRNAncAA分子与所述细胞中AFP中的至少一种的一个或多个O-RS区段一起形成一个或多个正交O-RS/O-tRNAncAA对,其允许将所述一种或多种ncAA残基引入POI的氨基酸序列中;
并且其中所述方法任选地进一步包括回收表达的POI。
E37:一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基,其中所述方法包括在所述一种或多种ncAA的存在下,在E35的细胞中表达所述POI,其中所述细胞包含:
(ⅰ)编码POI的核苷酸序列(CSPOI),其中所述POI的一种或多种ncAA残基由选择密码子编码,
(ⅱ)靶向核苷酸序列(TN),其功能性连接至所述CSPOI,并且能够与所述细胞中RNA-TP/O-RS融合蛋白中的至少一种的RNA-TP区段相互作用;
(ⅲ)一种或多种正交tRNAncAA(O-tRNAncAA)分子,其携带与所述CSPOI的选择密码子互补的反密码子,并且其中所述O-tRNAncAA分子与所述细胞中RNA-TP/O-RS融合蛋白的一个或多个O-RS区段一起形成一个或多个正交O-RS/O-tRNAncAA对,其允许将所述一种或多种ncAA残基引入所述POI的氨基酸序列中;
并且其中所述方法任选地进一步包括回收表达的POI。
E38:一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基,其中所述方法包括以下步骤:
(a)在细胞中表达一种或多种E1-E3和E7-E15中任一项的AFP,所述AFP包含至少一个RNA-TP区段,以及一种或多种E1-E3和E7-E15中任一项的AFP,所述AFP包含至少一个O-RS区段;
(b)在所述细胞中表达一种或多种正交tRNAncAA(O-tRNAncAA)分子,其中
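-the orthogonal tRNAncAA molecules form, together with one or more O-RS segments of the AFPs in the cell, one or more orthogonal aminoacyl-tRNA synthetase/tRNAncAA (O-RS/O-tRNAncAA) pairs,
-the O-RS/O-tRNAncAA pairs allow the one or more ncAA residues to be introduced into the amino acid sequence of the POI,
wherein steps (a) and (b) can be carried out simultaneously or sequentially in any order;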
(c)随后,在所述一种或多种ncAA的存在下,在所述细胞中表达所述POI,其中
-编码POI的核苷酸序列(CSPOI)包含编码所述一种或多种ncAA残基的一种或多种选择密码子,
-所述选择密码子与所述一种或多种O-tRNAncAA分子的反密码子匹配;
-所述CSPOI与靶向核苷酸序列(TN)功能性连接,从而形成CSPOI/TN融合序列,
-所述CSPOI/TN融合序列能通过其TN与所述细胞中AFP中的至少一种的RNA-TP区段相互作用;以及
(d)任选地回收表达的POI。
E39:一种制备感兴趣的多肽(POI)的方法,所述POI在其氨基酸序列中包含一种或多种非典型氨基酸(ncAA)残基,所述方法包括以下步骤:
(a)在细胞中表达E18-E23中任一项的RNA-TP/O-RS融合蛋白;
(b)在所述细胞中表达一种或多种正交tRNAncAA(O-tRNAncAA)分子,其中
-所述正交tRNAncAA分子与细胞中RNA-TP/O-RS融合蛋白的一个或多个O-RS区段形成一个或多个正交氨酰tRNA合成酶/tRNAncAA(O-RS/O-tRNAncAA)对,
-所述O-RS/O-tRNAncAA对允许将所述一种或多种ncAA残基引入所述POI的氨基酸序列中,
其中步骤(a)和(b)可以同时或以任意顺序依次进行;
(c)在所述一种或多种ncAA的存在下,在所述细胞中表达所述POI,其中
-编码POI的核苷酸序列(CSPOI)包含编码所述一种或多种ncAA残基的一种或多种选择密码子,
-所述选择密码子与所述一种或多种O-tRNAncAA分子的反密码子匹配;
-所述CSPOI与靶向核苷酸序列(TN)功能性连接,从而形成CSPOI/TN融合序列,
-所述CSPOI/TN融合序列能够通过其TN与所述细胞中RNA-TP/O-RS融合蛋白中的至少一种的RNA-TP区段相互作用;
以及
(d)任选地回收表达的POI。
E40:E36-E39中任一项的方法,其中所述TN选自病毒外壳蛋白结合的病毒RNA基序,及其保留与病毒外壳蛋白结合能力的功能片段和突变体。
E41:E36-E40中任一项的方法,其中所述TN选自:
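-an MS2 RNA stem loop, comprising the RNA sequence encoded by the nucleotide sequence SEQ ID NO:17, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:17;
-BoxB, comprising the RNA sequence encoded by the nucleotide sequence SEQ ID NO:18, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:18; and
-a pp7 RNA stem loop, which exists in at least two different forms and comprises the RNA sequence encoded by the nucleotide sequence of SEQ ID NO:289 or SEQ ID NO:290, in particular a polynucleotide having the RNA sequence that corresponds to (is encoded by) the nucleotide (DNA) sequence of SEQ ID NO:289 or SEQ ID NO:290, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:289 or 290.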
E42:E36-E41中任一项的方法,其中编码POI的ncAA残基的选择密码子选自琥珀、赭石和乳白终止密码子。
E43:一种核酸分子,其包含:
(i)编码感兴趣的多肽(POI)的核苷酸序列(CSPOI),所述POI包含一种或多种非典型氨基酸(ncAA)残基,所述ncAA残基在CSPOI中由选择密码子编码,和
(ii)靶向核苷酸序列(TN),其中包含所述TN的RNA分子能够通过所述TN与靶向RNA的多肽(RNA-TP)相互作用。
E44:E43的核酸分子,其中所述TN选自病毒外壳蛋白结合的病毒RNA基序,及其保留与病毒外壳蛋白结合能力的功能片段和突变体。
E45:E43或E44的核酸分子,其中所述TN选自:
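-an MS2 RNA stem loop, comprising the RNA sequence encoded by the nucleotide sequence SEQ ID NO:17, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:17;
-BoxB, comprising the RNA sequence encoded by the nucleotide sequence SEQ ID NO:18, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:18; and
-a pp7 RNA stem loop, which exists in at least two different forms and comprises the RNA sequence encoded by the nucleotide sequence of SEQ ID NO:289 or SEQ ID NO:290, in particular a polynucleotide having the RNA sequence that corresponds to (is encoded by) the nucleotide (DNA) sequence of SEQ ID NO:289 or SEQ ID NO:290, or a functional fragment or mutant thereof having at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% nucleotide sequence identity to the sequence of SEQ ID NO:289 or 290.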
E46:E43-E45中任一项的核酸分子,其中编码POI的ncAA残基的选择密码子选自琥珀、赭石和乳白终止密码子。
E47:一种试剂盒,其用于制备具有至少一个非典型氨基酸(ncAA)残基的感兴趣的多肽(POI),所述试剂盒包含:
-至少一种ncAA或其盐,其对应于所述POI的至少一个ncAA残基;以及
-至少一种E27的表达载体。
E48:E47的试剂盒,其中所述表达载体编码包含至少一个O-RS区段和至少一个RNA-TP区段的融合蛋白。
E49:E47或E48的试剂盒,其进一步包含至少一种正交tRNAncAA(O-tRNAncAA)分子的表达载体。
E50:E47-E49中任一项的试剂盒,其进一步包含至少一种表达载体,所述表达载体包含多克隆位点和靶向核苷酸序列(TN),其中包含所述TN的RNA分子能够通过所述TN与靶向RNA的多肽(RNA-TP)相互作用。
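Any of the above embodiments also encompasses the following modification: the above AP (i.e. IC-TP and PSP) segments and/or EP (RNA-TP or O-RS) segments may further be combined with synthetic protein segments that induce and control macromolecular interactions. One or more, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10, preferably one, such protein segment may be operably fused into a single AFP of the invention. Of particular interest in the context of the invention are SYNZIPs, which have the ability to form heterodimeric coiled-coil protein structures. Such SYNZIPs are pairs of synthetic peptides that are able to interact with each other and are used to induce and control macromolecular interactions. Non-limiting examples are the pairs SYNZIP 1:2, SYNZIP 3:4 and SYNZIP 5:6. Particularly preferred according to the invention is the heterospecific coiled-coil pair SYNZIP2:SYNZIP1 as described in Reinke, A.W., Grant, R.A., Keating, A.E. (2010) J Am Chem Soc 132, 6025-6031 (SYNZIP 1: SEQ ID NO:312; SYNZIP 2: SEQ ID NO:314; SYNZIP 3: SEQ ID NO:316; SYNZIP 4: SEQ ID NO:318), as well as functional fragments and mutants of these SYNZIP polypeptides. The functional fragments and mutants may comprise an amino acid sequence that is at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the amino acid sequence of the polypeptide from which they are derived.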
本发明通过以下非限制性实施例进一步说明。
实施例
方法
(A)细胞培养、转染和用ncAA补料
将HEK293T细胞(ATCC CRL-3216)和COS-7细胞(ATCC,CRL-1651)维持在Dulbecco改良的Eagle培养基(Life Technologies,41965-039)中,并补充有1%青霉素-链霉素(Sigma,10,000U/ml青霉素、10mg/ml链霉素、0.9%NaCl)、2mM L-谷氨酰胺(Sigma)、1mM丙酮酸钠(Life Technologies)和10%FBS(Sigma)。将细胞在37℃和5%CO2气氛中培养,每2-3天传代一次,最多传代15-20次。
在所有情况下,转染前15-20小时,将细胞以转染时导致70-80%汇合的密度接种。使用具有塑料底的24孔板(Nunclon Delta Surface ThermoScientific)进行流式细胞术。免疫荧光标记和FISH在具有玻璃底的24孔板(Greiner Bio-One)或四孔Lab-Tek#1.0硼硅盖玻片(ThermoFisher)上进行。
HEK293T细胞的转染用聚乙烯亚胺(PEI,Sigma-Aldrich)进行,每1μg DNA使用3μgPEI。根据制造商的建议,使用JetPrime试剂(PeqLab)以1:2的比例转染COS-7细胞。
对于琥珀型抑制系统测试,用POITAG载体、tRNAPyl、合成酶和MCP或模拟构建体以1:1:1:1的比例转染细胞。转染4-6小时后,将培养基换成含有ncAA的新鲜培养基。
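Stock and working solutions of all ncAAs used were prepared as described in Nikic et al. (Nat Protoc 10(5):780-791, 2015). SCO (cyclooctyne lysine, SiChem SC-8000) was used at a final concentration of 250 μM. 3-Iodophenylalanine (Chem-Impex International Inc.) was used at a final concentration of 1 mM. SCO is efficiently recognized by PylRSAF (Y306A, Y384F) (see Plass et al., Angew Chem 2011, 50:3878-3881). 3-Iodophenylalanine is recognized by PylRSAA (N346A, C348A) (see Wang et al., ACS Chem Biol 2013, 8:405-415).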
(B)流式细胞术
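HEK293T cells were harvested one day after transfection, resuspended in 1x PBS and passed through a 100 μm nylon mesh. Co-transfections for flow cytometry used a total of 1.2 μg DNA, made up of the following four plasmids at a ratio of 1:1:1:1: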
-编码POI的报告质粒(终止密码子编码待被ncAA占据的氨基酸位置),
-编码具有反密码子的tRNAPyl的质粒,所述反密码子与POI编码序列中的终止密码子匹配(即,反向互补)(以下简称为tRNAPyl),
-分别编码PylRS或其功能突变体的质粒,和
-编码MCP融合多肽的质粒或模拟质粒。
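The amounts implied by this co-transfection can be worked out with a short Python sketch: 1.2 μg total DNA split 1:1:1:1 over four plasmids, combined with the 3 μg PEI per 1 μg DNA stated above for HEK293T cells. The function name and the dictionary keys are hypothetical labels chosen for illustration.

```python
def transfection_mix(total_dna_ug=1.2, ratios=None, pei_ug_per_ug_dna=3.0):
    """Split the total DNA according to `ratios` and compute the PEI amount."""
    if ratios is None:
        ratios = {"reporter": 1, "tRNAPyl": 1, "PylRS": 1, "MCP_fusion_or_mock": 1}
    ratio_sum = sum(ratios.values())
    dna_ug = {name: total_dna_ug * part / ratio_sum for name, part in ratios.items()}
    pei_ug = total_dna_ug * pei_ug_per_ug_dna
    return dna_ug, pei_ug


dna_per_plasmid, pei_total = transfection_mix()
# Each plasmid receives 0.3 ug DNA; the mix needs about 3.6 ug PEI in total.
```

Changing the `ratios` argument gives the corresponding split for other plasmid combinations, should a different ratio be used.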
在转染后4-6小时将细胞培养基更换为含有待掺入POI的ncAA的新鲜培养基,并放置至收集时间。
数据采集和分析使用LSRFortessa SORP细胞分析仪(Becton,Dickinson andCompany)和FlowJo软件(FlowJo)进行。首先使用前向散射区域(FSC-A)和侧向散射区域(SSC-A)参数按细胞类型对细胞进行门控。随后,根据SSC-A和侧向散射宽度(SSC-W)鉴定单细胞。每个显示的FFC图是三个独立的生物学重复的总和,并从中计算平均值和SEM。每个条件至少分析130,000个单细胞。GFP荧光在488-530/30通道中获得,mCherry荧光在561-610/20通道中获得。
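The mean and SEM over independent biological replicates can be computed as in the following Python sketch. The replicate values are made up for illustration, and the use of the sample standard deviation (n - 1 in the denominator) is an assumption, since the text does not state which estimator was used.

```python
from math import sqrt
from statistics import mean, stdev


def mean_and_sem(values):
    """Mean and standard error of the mean (sample standard deviation / sqrt(n))."""
    if len(values) < 2:
        raise ValueError("at least two replicates are needed for an SEM")
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical per-replicate mean fluorescence values (arbitrary units)
replicates = [1.0, 2.0, 3.0]
avg, sem = mean_and_sem(replicates)
```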
(C)PylRS免疫染色和成像,荧光原位杂交(FISH)
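For immunolabeling experiments, cells were rinsed with 1x PBS, fixed in 2% paraformaldehyde in 1x PBS for 10 minutes at room temperature, rinsed again with 1x PBS and then permeabilized in 0.5% Triton X in 1x PBS for 15 minutes at room temperature. After rinsing the permeabilized cell samples twice with 1x PBS, the samples were incubated in blocking solution (3% BSA in 1x PBS) for 90 minutes at room temperature, and then incubated overnight at 4°C in blocking solution with 1 μg/ml primary antibody (polyclonal rat anti-PylRS, prepared as described in Nikic et al. (Angew Chem Int Ed Engl 2016, 55(52):16172-16176), and/or polyclonal rabbit anti-MCP (Merck, ABE76) and/or monoclonal rabbit anti-RPL26L1 antibody (EPR8478, Abcam, ab137046)). The next day, the cell samples were rinsed with 1x PBS and incubated for 60 minutes at room temperature in blocking solution with 2 μg/ml secondary antibody (chicken anti-rat IgG (H+L) cross-adsorbed Alexa Fluor 594 conjugated antibody (Thermo Fisher Scientific, A-21471) and/or goat anti-rabbit IgG (H+L) cross-adsorbed Alexa Fluor 647 conjugated F(ab')2 (Thermo Fisher Scientific, A-21246)). DNA was stained with Hoechst 33342 (1 μg/ml in 1x PBS) for 10 minutes at room temperature. If only DNA was stained, cells were fixed and permeabilized as described above and then stained with Hoechst 33342 (1 μg/ml in 1x PBS) for 10 minutes at room temperature. Finally, the cells were rinsed twice with 1x PBS.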
FISH实验在转染后一天进行,类似于Nikic等人(Angew Chem Int Ed Engl 2016,55(52):16172-16176)中描述的FISH实验。杂交方案适用于24孔板,来自Pierce等人(Methods Cell Biol122:415-436,2014)。
对于仅tRNAPyl的成像,使用0.25μM的杂交探针5'-CTAACCCGGCTGAACGGATTTAGAGTCCATTCGATC-3'(在5'末端用Cy5标记;SEQ ID NO:1)。用SSC洗涤四次和用TN缓冲液(0.1MTrisHCl,150mM NaCl)洗涤一次后,在上述标准免疫荧光标记前,将细胞用TN缓冲液中的3%BSA在室温下孵育1小时。
对于tRNAPyl和MS2 RNA茎环序列的成像,在5'末端用地高辛标记的tRNAPyl的杂交探针(5'-CTAACCCGGCTGAACGGATTTAGAGTCCATTCGATC-3';SEQ ID NO:2)以0.16μM使用,在5'末端用Alexa Fluor 647标记的MS2 RNA茎环序列的杂交探针(5'-CTGCAGACATGGGTGATCCTCATGTTTTCTA-3';SEQ ID NO:3)以0.75μM使用。用SSC洗涤四次后,将细胞在封闭缓冲液(0.1M TrisHCl、150mM NaCl、1x封闭试剂(Sigma 11096176001))中于室温下孵育1小时。然后,将细胞与荧光素缀合的绵羊抗地高辛Fab(Sigma 11207741910)在封闭缓冲液中以1:200稀释度在4℃下孵育过夜。第二天,在吐温缓冲液(0.1M TrisHCl、150mM NaCl、0.5%Tween20)中洗涤3次,每次5分钟。DNA用Hoechst 33342(1xPBS中1μg/ml)在室温下染色10分钟。
在配备63x/1.40油浸物镜的Leica SP8 STED 3X显微镜上获得共聚焦图像,使用以下激光线进行激发:Hoechst 33342为405nm,荧光素和GFP为488nm,mOrange为548nm,Alexa Fluor 594为594nm,Alexa Fluor 647和Cy5为647nm。用HyD检测器分别收集420-500nm和605-680nm的发射光。
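Ribosome immunofluorescence images were taken on an Olympus Fluoview FV3000 microscope equipped with a 60x/1.40 oil immersion objective, using the following laser lines for excitation: 488 nm for GFP, 594 nm for Alexa Fluor 594 and 640 nm for Alexa Fluor 647.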
使用FIJI软件处理图像。
(D)构建体、克隆和诱变
将两种不同的荧光蛋白报告基因(双色报告基因)克隆到pBI-CMV1载体(Clontech631630)中,一种蛋白位于一个多克隆位点,另一种报告基因位于另一个多克隆位点。其中一个报告基因的CDS编码携带两个MS2 RNA茎环的mRNA,其融合到3'非翻译区(“MS2-标签”),而另一个报告基因编码的mRNA没有MS2标签。
为了检查琥珀型抑制,报告基因GFP39TAG和mCherry185TAG以与NLS的N端融合物使用。为了检查赭石型和乳白型抑制,(分别用GFP39TAA和mCherry185TAA、GFP39TGA和mCherry185TGA)制备类似的构建体。
NLS::GFP39TAG::MS2-标签报告基因:将NLS::GFP39TAG与两个拷贝的MS2 RNA茎环一起克隆到pBI-CMV1载体中,作为成像实验中琥珀型抑制成功的报告基因。
为了检查多个琥珀密码子抑制,制备GFP39,149TAG和GFP39,149,182TAG的pBI-CMV构建体,其在第二多克隆位点中不包括第二(例如mCherry)报告基因。
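Further non-limiting examples of GFPs suitable in the context of the invention are:
GFP66TAG: GFP with an amber site (SEQ ID NO:238)
GFP66TCG: GFP with a serine site (SEQ ID NO:240)
GFP66CCG: GFP with a proline site (SEQ ID NO:242)
GFP66CTA: GFP with a leucine site (SEQ ID NO:244)
GFP66TTA: GFP with a leucine site (SEQ ID NO:246)
GFP66ATA: GFP with an isoleucine site (SEQ ID NO:248)
GFP66CGG: GFP with an arginine site (SEQ ID NO:250)
GFP39TCG: GFP with a serine site (SEQ ID NO:252)
GFP39CCG: GFP with a proline site (SEQ ID NO:254)
GFP39CTA: GFP with a leucine site (SEQ ID NO:256)
GFP39CGG: GFP with an arginine site (SEQ ID NO:258)
GFP39TCG: LCK-GFP with a serine site (SEQ ID NO:278)
GFP39CCG: LCK-GFP with a proline site (SEQ ID NO:280)
GFP39CTA: LCK-GFP with a leucine site (SEQ ID NO:282)
Extended GFP39TCG: GFP with a serine site at position 39, genetically fused to GFP66CCG (SEQ ID NO:284)
Extended GFP39CCG: GFP with a proline site at position 39, genetically fused to GFP66TCG (SEQ ID NO:286)
Extended GFP39CTA: GFP with a leucine site at position 39, genetically fused to GFP66TCG (SEQ ID NO:288)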
Further non-limiting examples of mCherry variants suitable in the context of the invention are:
mCherry72TAG: mCherry with an amber site (SEQ ID NO:260)
mCherry72TCG: mCherry with a serine site (SEQ ID NO:262)
mCherry72CCG: mCherry with a proline site (SEQ ID NO:264)
mCherry72CTA: mCherry with a leucine site (SEQ ID NO:266)
mCherry72TTA: mCherry with a leucine site (SEQ ID NO:268)
mCherry72ATA: mCherry with an isoleucine site (SEQ ID NO:270)
mCherry185TCG: mCherry with a serine site (SEQ ID NO:272)
mCherry185CCG: mCherry with a proline site (SEQ ID NO:274)
mCherry185CTA: mCherry with a leucine site (SEQ ID NO:276)
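The reporter names in the two lists above follow one pattern: protein name, residue position, and the codon placed at that position. The Python sketch below parses such a name and looks up what the codon encodes under the standard genetic code. The parser and its name are hypothetical, and the lookup table only covers the codons that occur in these lists.

```python
import re

# Standard genetic code entries for the codons used in the reporter lists
CODON_MEANING = {
    "TAG": "amber stop",
    "TCG": "Ser",
    "CCG": "Pro",
    "CTA": "Leu",
    "TTA": "Leu",
    "ATA": "Ile",
    "CGG": "Arg",
}

NAME_PATTERN = re.compile(r"(GFP|mCherry)(\d+)([ACGT]{3})")


def parse_reporter(name: str):
    """Split e.g. 'mCherry72ATA' into (protein, position, codon, meaning)."""
    match = NAME_PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized reporter name: {name}")
    protein, position, codon = match.groups()
    return protein, int(position), codon, CODON_MEANING[codon]


parsed = parse_reporter("mCherry72ATA")
```

Names carrying several positions (such as GFP39,149TAG) are deliberately not handled by this minimal pattern.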
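Further non-limiting examples of mCherry constructs comprising different TN loops that are suitable in the context of the invention are:
mCherry190TAG-2xPP7: mCherry with an amber site and 2x pp7 loops (SEQ ID NO:216)
mCherry190TAG-4xPP7: mCherry with an amber site and 4x pp7 loops (SEQ ID NO:218)
mCherry190TAG-6xPP7: mCherry with an amber site and 6x pp7 loops (SEQ ID NO:220)
H2B-mCherry190TAG-2xMS2: human histone H2B type 1-J (Uniprot: P06899) fused to mCherry with an amber site and 2x ms2 loops (SEQ ID NO:222)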
可以将它们融合到本文所述的任何AFP分子的多肽链中,特别是融合分子内不抑制AFP分子的任何其他多肽区段(AP和EP)的功能的位置。下面给出这类含有AFP分子的表位-标签的实例。
OT组装器集合体的构建体制备如下:tRNAPyl在人U6启动子的控制下进行克隆,所有其他构建体在pcDNA3.1(Invitrogen V86020)载体中克隆的CMV启动子下。从addgene质粒#31230克隆MCP蛋白,而FUS来自Addgene质粒#26374。在所有FUS融合物中,使用氨基酸1-478(S108N),用Flag标签替换C末端NLS区域。在所有RS融合物中,使用先前报道的高效NES::PylRSAF(Y306A,Y384F)序列(参见,例如,Nikic et al.,Angew Chem Int Ed Engl2016,55(52):16172-16176)。从野生型PylRS开始,通过定点诱变克隆PylRS突变体PylRSAA(N346A,C348A)。SPD5基因从Genewiz订购并通过限制性克隆与MCP和PylRSAF融合。KIF13A1-411和KIF16B1-400从人cDNA克隆,并通过限制性克隆插入pcDNA3.1。通过定点诱变去除KIF13A1-411的P390。KIF13A1-411,ΔP390和KIF16B1-400与MCP、PylRSAF、EWSR1::MCP、FUS::PylRSAF、FUS::PylRSAA、SPD5::MCP和SPD5::PylRSAF的融合物通过Gibson组装进行组装(参见Gibson et al.,Nat Methods 2009,6:343-345)。
用于差分成像实验的构建体:为了选择性地表达Nup153-EGFP149TAG和Vim116TAG-mOrange,首先将一个基因与MS2标签一起插入pBI-CMV1(比较Nikic et al.,Angew ChemInt Ed Engl 2016,55(52):16172-16176)。随后,在没有MS2标签的情况下插入另一个基因。通过替换包含Nup153::EGFP149TAG和Vim116TAG::mOrange::MS2-标签的pBI载体中的Vim116TAG-mOrange将INSR676TAG::mOrange融合到MS2标签上,以产生一种双顺反子载体,其中一个盒具有INSR676TAG::mOrange,而另一个盒具有Nup153::EGFP149TAG。
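Polycistronic amber-suppression vectors for COS-7 cell experiments: because COS-7 cells are transfected with lower efficiency, polycistronic vectors containing the components of the OT assembler assembly were generated. To assemble the polycistronic amber-suppression vector, one copy of tRNAPyl under the control of the human U6 promoter was first inserted into the pBI-CMV1 vector by Gibson assembly. Subsequently, the AFP CDS KIF16B::FUS::PylRSAF was inserted first, and finally the AFP CDS KIF16B::EWSR1::MCP, both by Gibson assembly. Alternatively, the previously published pcDNA3.1-based construct expressing NES::PylRSAF under the CMV promoter and tRNAPyl under the human U6 promoter was used (see Nikic et al., Angew Chem Int Ed Engl 2016, 55(52):16172-16176). Alternatively, constructs carrying U6-tRNAPyl, KIF16B::FUS::PylRSAF and KIF16B::EWSR1::MCP, or NES::PylRSAF, were inserted into the pDonor vector (GeneCopoeia).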
以下实验中使用的AFP的相应序列信息可以从下面给出的序列表中获取。
实施例1–RNA-TP/O-RS融合物和包含单个AP的AFP
OT组装器集合体(“OT细胞器”,图1)设计为具有以下组件:
i)mRNA靶向系统,其中将两个MS2 RNA茎环(MS2标签)与编码POI的所选mRNA融合,形成mRNA::ms2融合物。MS2标签与MS2细菌噬菌体外壳蛋白(MCP)特异性地结合(参见Bertrand et al.,Mol Cell 1998,2:437-445),从而在细胞中形成稳定且特异性的mRNA::ms2–MCP复合物。MS2标签始终与mRNA的3'非翻译区(3'UTR)融合,这确保翻译以产生无痕的最终POI。
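ii) tRNA/RS suppressor pair. The orthogonal tRNA/RS pair of the Methanosarcina mazei pyrrolysyl system (tRNAPyl/PylRS) was chosen because it allows more than 200 ncAAs with different functionalities to be encoded into proteins by GCE in a variety of cell types and species (including E. coli, mammalian cells and even living mice) (see, e.g., Liu et al., Annu Rev Biochem 2010, 79:413-444; Lemke, ChemBioChem 2014, 15:1691-1694; Chin, Nature 2017, 550:53-60).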
iii)组装器(AP)是形成OT组装器集合体所需的关键组件。组装器的目的是以致密相、聚集体、液滴或冷凝物的形式产生无膜结构,其中mRNA::ms2–MCP复合物与tRNAPyl/PylRS对紧密相邻。
测试的最简单策略是MCP::PylRS的双分子融合物(称为B,图2)。此外,还测试预期产生更大组装器集合体的策略。所有这些组装器集合体系统都包含与PylRS融合的组装器和与MCP融合的组装器的共同表达。组装器::PylRS·组装器::MCP预期形成大的聚集体(本文中的共表达用“·”表示)。一种测试的组装器集合体策略是基于蛋白的相分离,另一种是基于驱动蛋白的组装器集合体,在本文中分别缩写为P和K(图2A)。此外,对于每种P和K方法,测试两种不同的分子设计(分别为P1、P2和K1、K2),总结如下:
P1.先前的研究已经确定蛋白融合肉瘤(FUS)和尤文肉瘤断点区域1(EWSR1)通过相分离形成混合液滴状结构的能力。它们都包含类似朊病毒的无序结构域,可促进相分离成液体、凝胶和固体状态(参见,例如,Altmeyer et al.,Nat Commun 2015,6:8088;Patelet al.,Cell 2015,162:1066-1077)。在相分离状态下,与细胞质中剩余的可溶性部分相比,这些蛋白在局部高度浓缩(几个数量级)。FUS与PylRS融合,并且EWSR1与MCP融合。预期这会导致形成其中高度富集MCP和PylRS的液滴。P1表示为FUS::PylRS·EWSR1::MCP。
P2.已显示秀丽隐杆线虫蛋白纺锤体缺陷蛋白5(SPD5)相分离成特别大(几个微米大小)的液滴(参见Woodruff et al.,Cell 2017,169:1066-1077,e1010)。在相分离状态下,与细胞质中剩余的可溶性部分相比,SPD5局部高度浓缩(几个数量级)。预期与SPD5融合的蛋白将凝结成液滴。与FUS-EWSR1液滴类似,预期与SPD5融合的PylRS和与SPD5融合的MCP会高度富集。P2表示为SPD5::PylRS·SPD5::MCP。
K1.某些截短驱动蛋白组成性地向活细胞中的微管正末端移动(Soppina et al.,Proc Natl Acad Sci U.S.A.2014,111:5562-5567)。一种这样的截短驱动蛋白为KIF13A1-411,ΔP390,预期分别与这种截短驱动蛋白融合并共表达的PylRS和MCP会局部富集,由于空间靶向微管正末端。K1表示为KIF13A1-411,ΔP390::PylRS·KIF13A1-411,ΔP390::MCP。
K2.通过与K1类比,还测试了截短驱动蛋白KIF16B1-400。K2表示为KIF16B1-400::PylRS·KIF16B1-400::MCP。
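To assess how well these assemblers promote functional orthogonal translation of MS2-tagged mRNA, a dual reporter construct was designed in which GFP and mCherry mutants are expressed simultaneously from two different expression cassettes of one plasmid, ensuring that the ratio between their mRNAs is constant across all experiments. A stop codon was introduced at a permissive site at position 39 of GFP (GFP39STOP) and at position 185 of mCherry (mCherry185STOP; Figure 2B). The corresponding green or red fluorescent protein is produced only if stop codon suppression is successful. Transfected cells were analyzed by fluorescence flow cytometry (FFC) (tRNAPyl and ncAA were always present unless specifically stated otherwise); the settings were adjusted so that an approximately diagonal line is produced in the FFC plot when GFP and mCherry are expressed from this plasmid using the conventional cytoplasmic PylRS system, which cannot discriminate between the mRNAs. Only when the MS2 tag is fused to the 3'UTR of the mCherry mRNA should a selective and functional OT organelle express mCherry selectively, resulting in a vertical line in the cytometry plot (Figure 2B). Unless otherwise reported, all experiments were carried out in the presence of tRNAPyl and the ncAA SCO, a widely used and well-characterized lysine derivative whose side chain carries a cyclooctynyl group that can be used in various click chemistry reactions to install different chemical groups on proteins. As reported previously, this ncAA is efficiently encoded by the Y306A, Y384F double mutant of PylRS (for simplicity, this mutant is referred to herein as PylRS unless otherwise stated) (see Nikic et al., Angew Chem 2014, 53:2245-2249; Plass, Angew Chem 2012, 51:4166-4170; Plass et al., Angew Chem 2011, 50:3878-3881). Omitting the ncAA served as the standard negative control and resulted in no expression of GFP or mCherry.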
根据其选择性和相对效率评估每个OT系统的性能。选择性定义为平均mCherryFFC信号除以平均GFP信号的比率r。最终值表示为相对于细胞质PylRS的选择性倍数。相对效率定义为每个系统的平均mCherry信号除以作为参考的细胞质PylRS系统的平均mCherry信号(此处定义为100%)。关于选择性(深灰色正值条)和效率(浅灰色负值条)的所有结果总结在图2C的条形图中。选定的FFC数据也显示在图2D中。
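The two figures of merit defined above translate directly into a short calculation. In the Python sketch below, the selectivity ratio r is the mean mCherry signal divided by the mean GFP signal, the fold selectivity is r of the OT system divided by r of the cytoplasmic PylRS reference, and the relative efficiency is the mean mCherry signal of the system as a percentage of the reference. All numbers are invented for illustration and are not measured data; the function names are hypothetical.

```python
from statistics import mean


def selectivity_ratio(mcherry, gfp):
    """r = mean mCherry FFC signal / mean GFP signal."""
    return mean(mcherry) / mean(gfp)


def fold_selectivity(sys_mcherry, sys_gfp, ref_mcherry, ref_gfp):
    """Selectivity of a system relative to the cytoplasmic PylRS reference."""
    return selectivity_ratio(sys_mcherry, sys_gfp) / selectivity_ratio(ref_mcherry, ref_gfp)


def relative_efficiency(sys_mcherry, ref_mcherry):
    """Mean mCherry signal in percent of the reference (reference = 100%)."""
    return 100.0 * mean(sys_mcherry) / mean(ref_mcherry)


# Invented signals (arbitrary units); the reference is tuned to r = 1
ref_mcherry, ref_gfp = [1000.0, 1000.0], [1000.0, 1000.0]
sys_mcherry, sys_gfp = [400.0, 400.0], [50.0, 50.0]

fold = fold_selectivity(sys_mcherry, sys_gfp, ref_mcherry, ref_gfp)
efficiency = relative_efficiency(sys_mcherry, ref_mcherry)
```

With these invented inputs the system would show an eightfold selectivity at 40% relative efficiency, which illustrates why both numbers are reported together: selectivity alone cannot distinguish selective translation from a plain loss of GFP and mCherry signal.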
最简单的策略B(MCP与PylRS融合)显示大约1.5倍的选择性增益(图2C)。OT系统P1(基于FUS/EWSR1的相分离)具有较低的选择性增益(图2C、D)。P2系统(基于SPD5)显示大约两倍的选择性增益(图2C)。对于K1,观察到选择性增加两倍(图2C)。K2系统的表现类似(图2C、D)。总体而言,选择性增益相对较小,但被可靠地检测到并与简单的效率下降区分开来。所观察到的选择性效应(数据未显示)在琥珀型抑制效率的滴定中是可靠的(具体地,分别使用0.48ng、2.4ng、12ng、60ng或300ng tRNAPyl构建体),表明使ncAA氨酰化活性(即在ncAA存在下的tRNAPyl/PylRS)直接接近靶mRNA是一种更具选择性的密码子抑制途径。
实施例2–包含两种AP的组合的AFP
以类似方式测试包含实施例1中描述的AP的组合的AFP,它们是:
K1::P1=KIF13A1–411,ΔP390::FUS::PylRS·KIF13A1–411,ΔP390::EWSR1::MCP,
K2::P1=KIF16B1–400::FUS::PylRS·KIF16B1–400::EWSR1::MCP,
K1::P2=KIF13A1–411,ΔP390::SPD5::PylRS·KIF13A1–411,ΔP390::SPD5::MCP,
K2::P2=KIF16B1–400::SPD5::PylRS·KIF16B1–400::SPD5::MCP。
对于所有组合,观察到至少五倍的选择性增益,表明发生正交翻译。这些系统中表现最好的是基于FUS/EWSR1与KIF16B1-400的融合,K2::P1,并表现出八倍的选择性(图2C中的框)。这在FFC数据中也很明显,其中清晰地保留明亮的mCherry阳性细胞群,而GFP表达极少(图2D中的箭头)。
实施例3–包含AP的组合的AFP,所述AP包括膜靶向AP
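AFPs comprising a combination of APs were tested in a manner analogous to Example 2. The AFPs combine the phase separation polypeptide (PSP)-derived APs FUS and EWSR1 (also referred to herein as EWS), optionally fused to a SYNZIP segment, with one of the following different targeting APs: LcK, EB1, CG1, full-length EBAG9, EBAG9(1-29), CMP Sia Tr, P450 2C1(1-27) and P450 2C1(1-29). Most of these are membrane targeting signals; EB1 targets microtubule plus ends.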
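LcK is a plasma membrane targeting signal (Resh, Bba-Mol Cell Res 1999, 1451:1-16) that post-translationally adds an amphipathic helix to the POI. For these experiments, the AFPs LcK::FUS::PylRS and LcK::EWSR1::MCP were co-expressed in HEK293T cells (see Figures 3 and 6C). Compared with the PylRS control, testing this system with the same dual reporter resulted in a marked shift of the signal and in strong selectivity for expression of only the MS2-tagged mCherry. See Figures 4 and 5, which show a 26-fold selectivity gain compared with the control. IF and FISH for MCP, PylRS and tRNA showed a clear membrane signal with occasional droplet-like structures, and all components co-localized perfectly.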
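Without wishing to be bound by theory, it is hypothesized that targeting the OT system to a membrane confines its components to a 2D surface (i.e. the membrane), which provides greater spatial separation than cytoplasmic droplets. Regarding the cumulative effect of the two combined assembler strategies (LcK for membrane targeting and FUS/EWSR1 for droplet formation), it was found that the FUS/EWSR1 "assemblers" are not required in the LcK fusions (and thus in the resulting membrane-anchored system) to obtain selective amber suppression (data not shown). Nevertheless, combining LcK targeting with FUS/EWSR1 resulted in higher selectivity of the system. In addition, swapping the MS2 tag between the fluorescent reporters was found to swap the selectivity in the FFC data, underlining the selective (orthogonal) translation of MS2-tagged mRNA.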
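For further LcK-based experiments, the AFP constructs LcK::FUS::SYNZIP1::PylRS and EWSR1::SYNZIP2::MCP were co-expressed in HEK293T cells (see Figure 8A). Testing this system with the same dual reporter resulted in a marked shift of the signal and in strong selectivity for expression of only the MS2-tagged mCherry. After expression, SYNZIP1 and SYNZIP2 pair and recruit MCP to the plasma membrane-based OT organelle. In a comparative approach in which the AFP constructs LcK::FUS::PylRS and EWSR1::SYNZIP2::MCP were co-expressed, i.e. in the absence of SYNZIP1, no selectivity of translation was observed (see Figure 8B).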
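EB1 is a microtubule plus-end targeting signal (Nehlig A, Molina A, Rodrigues-Ferreira S, Honoré S, Nahmias C. Regulation of end-binding protein EB1 in the control of microtubule dynamics. Cell Mol Life Sci. 2017;74(13):2381–2393. doi:10.1007/s00018-017-2476-2). For these experiments, the AFP constructs EB1::PylRS with EB1::MCP, EB1::FUS::PylRS with EB1::EWSR1::MCP, or EB1::FUS::MCP::PylRS were expressed in HEK293T cells. Compared with the PylRS control, testing this system with the same dual reporter resulted in a shift of the signal and in strong selectivity for expression of only the MS2-tagged mCherry. See Figure 6B.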
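CG1 is a nuclear membrane targeting signal (Kim SJ, Fernandez-Martinez J, Nudelman I, et al. Integrative structure and functional anatomy of a nuclear pore complex. Nature. 2018;555(7697):475–482. doi:10.1038/nature26003). For these experiments, the AFP constructs CG1::FUS::PylRS and CG1::EWSR1::MCP were co-expressed in HEK293T cells. Compared with the PylRS control, testing this system with the same dual reporter resulted in a shift of the signal and in strong selectivity for expression of only the MS2-tagged mCherry. See Figure 6E.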
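Full-length EBAG9 and EBAG9(1-29) are Golgi membrane targeting signals (Engelsberg A, Hermosilla R, Karsten U, Schülein R, Dörken B, Rehm A. The Golgi protein RCAS1 controls cell surface expression of tumor-associated O-linked glycan antigens. J Biol Chem. 2003;278(25):22998–23007. doi:10.1074/jbc.M301361200). For these experiments, the AFP constructs EBAG9(1-29)::FUS::PylRS and EBAG9(1-29)::EWSR1::MCP were co-expressed in HEK293T cells. Compared with the PylRS control, testing this system with the same dual reporter resulted in a shift of the signal and in strong selectivity for expression of only the MS2-tagged mCherry. See Figure 6F (left).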
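CMP Sia Tr is a Golgi-membrane targeting signal (Eckhardt M, Gotza B, Gerardy-Schahn R. Membrane topology of the mammalian CMP-sialic acid transporter. J Biol Chem. 1999;274(13):8779–8787. doi:10.1074/jbc.274.13.8779). For these experiments, the AFP constructs CMP Sia Tr::FUS::PylRS and CMP Sia Tr::MCP were co-expressed in HEK293T cells. Compared with the PylRS control, testing this system with the same dual reporter resulted in a shift of the signal and strong selectivity for expression of only the MS2-tagged mCherry. See Figure 6F (right).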
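P450 2C1(1-27) is an ER-membrane targeting signal (Fazal FM, Han S, Parker KR, et al. Atlas of Subcellular RNA Localization Revealed by APEX-Seq. Cell. 2019;178(2):473–490.e26. doi:10.1016/j.cell.2019.05.027). For these experiments, the AFP constructs P450 2C1(1-27)::FUS::PylRS and P450 2C1(1-27)::EWSR1::MCP, or P450 2C1(1-29)::FUS::MCP::PylRS, were co-expressed in HEK293T cells. Compared with the PylRS control, testing this system with the same dual reporter resulted in a shift of the signal and strong selectivity for expression of only the MS2-tagged mCherry. See Figure 6G.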
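Example 4 – Verification that the selectivity gain is specific to the interaction between the mRNA MS2 tag and MCP
To verify that the observed selectivity gain is specific to the interaction of the MCP segment with the MS2 tag of the mRNA, all OT systems were characterized by expressing the RS-assembler fusion of each OT system in the absence of MCP. As expected, no selective orthogonal translation of MS2-tagged mRNA was observed in these cases (see Figures 6A to G). In addition, a reporter inversion was performed by moving the MS2 tag from mCherry to the GFP cassette of the dual-color reporter; as expected, this inverted the selectivity of the system toward predominant GFP expression (data not shown). This establishes that the OT systems act selectively on MS2-tagged RNA.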
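The fold gains in selectivity quoted in these examples are ratios of ratios. A minimal sketch of that arithmetic follows, assuming that selectivity is taken as the signal of the MS2-tagged reporter divided by the signal of the untagged reporter, and that the gain is this ratio normalized to the cytoplasmic PylRS control. That definition, the function names and all intensity values are illustrative assumptions, not data or methods stated in this document.

```python
def selectivity(tagged_signal: float, untagged_signal: float) -> float:
    """Ratio of MS2-tagged to untagged reporter signal (assumed definition)."""
    if untagged_signal <= 0:
        raise ValueError("untagged signal must be positive")
    return tagged_signal / untagged_signal


def fold_gain(ot_system: tuple[float, float], control: tuple[float, float]) -> float:
    """Selectivity of an OT system divided by the selectivity of the control."""
    return selectivity(*ot_system) / selectivity(*control)


# Hypothetical mean fluorescence intensities (tagged mCherry, untagged GFP),
# in arbitrary units; these numbers are placeholders, not measurements.
control = (1000.0, 1000.0)   # cytoplasmic PylRS: both reporters suppressed
ot_system = (800.0, 100.0)   # OT system: untagged reporter largely silent

print(f"fold gain in selectivity: {fold_gain(ot_system, control):.1f}")
```

Under this assumed definition, a reporter inversion (moving the MS2 tag to the GFP cassette) simply swaps which channel counts as "tagged", so the same function applies with the tuple order reversed.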
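Example 5 – Introducing multiple ncAAs into the same POI
GCE can also be used to introduce multiple ncAAs into the same POI (see, e.g., Liu et al., Annu Rev Biochem 2010, 79:413-444; Lemke, ChemBioChem 2014, 15:1691-1694; Chin, Nature 2017, 550:53-60). However, only very few publications report suppression of more than one codon, i.e., two or three, in the same protein in eukaryotes, because yields typically suffer compared with single-codon suppression (see Xiao et al., Angew Chem 2013, 52:14080-14083; Schmied et al., J Am Chem Soc 2014, 136:15577-15583; Zhang et al., Biochem Biophys Res Commun 2017, 489:490-496). Notably, even double and triple amber proteins were still suppressed by the OT organelles (data not shown).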
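Example 6 – OT with 3-iodophenylalanine
To confirm that other ncAAs can also be translated by the OT assembler assemblies, another, structurally distinct ncAA (3-iodophenylalanine) was tested. It is a phenylalanine derivative rather than a lysine derivative (such as SCO) and is encoded by a different PylRS mutant (N346A, C348A) (see Wang et al., ACS Chem Biol 2013, 8:405-415). Consistent results were observed for this system as well (Figure 2C).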
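Example 7 – OT with different selector codons
Because opal and ochre codons are highly abundant in eukaryotic genomes (opal 52% and ochre 28% in the human genome), the amber codon is by far the most commonly used codon for GCE in eukaryotes. Moreover, genomic approaches to orthogonal translation that remove those codons throughout an entire eukaryotic genome are even more challenging than for the amber codon and are currently beyond the state of the art. In the OT system of the invention, however, simple mutations in the anticodon loop of tRNAPyl and in the corresponding codon of the MS2-tagged POI-encoding mRNA allow orthogonal translation of these codons. FFC analysis shows that the OT system of the invention provides freedom of choice with respect to the stop (selector) codon (Figures 2C, E). In fact, opal suppression was the best-performing system, with an 11-fold increase in selectivity. Ochre suppression still showed a 5-fold increase in selectivity and a 20% increase in efficiency.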
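Switching the selector codon only requires the tRNAPyl anticodon to be the reverse complement of the chosen stop codon. The short sketch below derives the three anticodons from the three stop codons; the helper name `reverse_complement_rna` is illustrative and not taken from the document.

```python
# Watson-Crick complement table for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(seq: str) -> str:
    """Return the reverse complement of an RNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# The three stop codons as they appear in the mRNA.
STOP_CODONS = {"amber": "UAG", "ochre": "UAA", "opal": "UGA"}

# Anticodon that a suppressor tRNA needs in order to read each stop codon.
ANTICODONS = {name: reverse_complement_rna(codon)
              for name, codon in STOP_CODONS.items()}

for name, anticodon in ANTICODONS.items():
    print(f"{name}: codon {STOP_CODONS[name]} -> anticodon {anticodon}")
```

The resulting anticodons (CUA, UUA, UCA) match the tRNAPyl variants listed in the abbreviations.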
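Example 8 – Orthogonal translation of proteins of various cellular compartments
To visualize the capabilities of the OTK2::P1 system (the amber-suppression OT system performing best in terms of selectivity and efficiency) beyond "simple" reporters, the aim was to show differential expression of human nucleoporin 153 (Nup153) versus the cytoskeletal protein vimentin. Nup153 is located in the nuclear pore complex and is more than 1500 amino acids long. Its mRNA is therefore about six times larger than that of the fluorescent protein reporters used above. For this experiment, a previously described C-terminal GFP fusion carrying an amber mutation (Nup153::EGFP149TAG) was used, which yields the characteristic nuclear envelope staining in confocal imaging only if amber suppression is successful (see Nikic et al., Angew Chem 2016, 55:16172-16176). This Nup153::EGFP149TAG was now tagged at the mRNA level with an MS2 tag (nup153::egfp149TAG::ms2) and co-expressed from the same plasmid with vimentin (a cytoskeletal protein) containing an amber codon at position 116 and fused to mOrange (Vim116TAG::mOrange). In the presence of cytoplasmic PylRS, expression in HEK293T cells led to production of both proteins, showing the characteristic nuclear envelope and cytoskeletal staining, respectively. With the OTK2::P1 assembler assembly, only Nup153::GFP was visible (selective nuclear rim staining in confocal imaging of co-transfected HEK293T cells). Consistent results were also observed in COS-7 cells. Moving the MS2 tag to vimentin reversed the effect, so that only Vim116TAG::mOrange was visible (observed in experiments with both COS-7 and HEK293T cells). This shows that OTK2::P1 acts on very different mRNAs.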
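Example 9 – Orthogonal translation of transmembrane proteins
It was also shown that transmembrane proteins can be selectively expressed using the OTK2::P1 assembler assembly. Membrane protein expression represents another level of translational complexity, because during translation the ribosome must associate with the endoplasmic reticulum, where the protein is co-translationally inserted into the membrane. In this experiment, a fusion of insulin receptor 1 carrying an amber codon at position 676 with mOrange (INSR676TAG::mOrange) was used, which localizes to the plasma membrane and yields the characteristic plasma membrane staining in HEK293T cells (see Nikic et al., Angew Chem 2014, 53:2245-2249). This construct was tagged with the MS2 tag in the 3'UTR and cloned together with Nup153::EGFP149TAG into a dual-cassette plasmid. The construct was then expressed in HEK293T cells in the presence of either the cytoplasmic PylRS system or the OTK2::P1 assembler assembly. In the presence of the OTK2::P1 assembler assembly, selective expression of the MS2-tagged protein and the expected plasma membrane localization of INSR676TAG::mOrange were observed (data not shown), indicating the potential of the OT system of the invention to engage in the more complex, membrane-associated translation process.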
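Example 10 – Spatial distribution of OT system components in cells
Immunofluorescence (IF) was used to assess the spatial distribution of the AFPs, in particular PylRS, in cells. In addition, fluorescence in situ hybridization (FISH) was used to detect tRNAPyl. In contrast to the dual-color reporter used in the FFC experiments above, all IF/FISH experiments used a single-color NLS-GFP39TAG reporter fused to the MS2 tag (nls-gfp39TAG::ms2) to identify cells active in amber suppression (successful amber suppression yields a green nucleus, which also helps to keep the color channels distinguishable). IF and FISH staining showed that, in contrast to cytoplasmic PylRS, the P1 system forms small intracellular assembler::PylRS droplets (data not shown). This indicates that phase separation occurs. tRNAPyl co-localized well with the highly dispersed assembler::PylRS droplets, indicating that tRNAPyl partitions well into the assembler::PylRS phase. Additional staining showed further co-localization with assembler::MCP (data not shown). Compared with P1, the P2 system showed larger but still polydisperse droplet-like structures (data not shown). With combinations of the two assembly strategies (K1::P1, K2::P1, K1::P2, K2::P2), large micrometer-sized organelle-like structures were observed in the cytoplasm; in most cases these structures were confined to a few or even a single location per cell. For the combined assemblers, mRNA::ms2, tRNAPyl, assembler::PylRS and assembler::MCP all co-localized to the organelle-like structures. As determined by FISH and IF, the combination of the two assembler strategies, i.e., phase separation paired with spatial targeting by truncated kinesins, yielded the best spatial confinement and the highest increase in selectivity. This is consistent with the hypothesis that higher spatial segregation of tRNAPyl, PylRS and mRNA, and the resulting higher local concentrations, correlate with higher selectivity.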
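Ribosomes were stained to see whether they co-localize with the OTK2::P1 assembler assembly. IF staining of the ribosomal protein RPL26L1 showed strong co-localization with the OTK2::P1 organelles (data not shown), indicating that ribosomes are recruited transiently through their binding to mRNA::ms2 during translation. High ribosome mobility could also explain why the membrane protein INSR (construct: INSR676TAG::mOrange::ms2) could be expressed successfully.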
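Without wishing to be bound by theory, the experimental results strongly suggest that selective orthogonal translation takes place close to, or possibly even inside, the OT assembler assembly, through a set of recruited ribosomes that are near to or fully immersed in the concentrated tRNAPyl pool. Owing to the affinity of tRNAPyl for assembler::PylRS, tRNAPyl itself is recruited into the OTK2::P1 assembler assembly and can readily co-partition into the droplets to be aminoacylated with its cognate ncAA, while assembler::MCP recruits the MS2-tagged mRNA. This in turn draws ribosomes to co-partition into the dense phase formed by the two-assembler system (K2::P1 = KIF16B::FUS::PylRS and KIF16B::EWSR1::MCP), so that the other translation factors that sustain translation can enter and act. Ribosomes elsewhere in the cytoplasm that are not exposed to tRNAPyl perform their normal function of terminating translation when they encounter a stop codon.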
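Example 11 – Further OT systems
In addition to the OT systems described in the preceding examples, various further OT systems were tested and found to allow selective orthogonal translation of the reporter (i.e., the POI). These experiments are summarized in Table 1 below. Unless stated otherwise, the cytoplasmic NES-PylRS system previously described by Nikic et al. (Angew Chem Int Ed Engl 2016, 55(52):16172-16176), but carrying the corresponding AF, AA or AAAF mutations, was used as the non-specific reference (negative control). All experiments were performed in the presence of the codon-specific tRNAPyl and the ncAA corresponding to the PylRS mutant.
Table 1: OT systems tested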
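Example 12 – Further OT systems
In addition to the OT systems described in the preceding examples, various analogous OT systems that differ in the mRNA-targeting element were tested and found to allow selective orthogonal translation of the reporter (i.e., the POI). These experiments are summarized in Table 2 below. The results are shown in Figures 7A, B and C. The cytoplasmic NES-PylRS system previously described by Nikic et al. (Angew Chem Int Ed Engl 2016, 55(52):16172-16176) was used as the non-specific reference (negative control). All experiments were performed in the presence of the codon-specific tRNAPyl and the ncAA corresponding to the PylRS mutant.
Table 2: OT systems tested
The results are shown in Figures 7A, B and C.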
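Example 13 – Further OT fusion constructs tested
In addition to the OT systems described in the preceding examples, various further OT fusion constructs were prepared and tested and found to allow selective orthogonal translation of the reporter (i.e., the POI). The constructs tested are summarized in Table 3 below. Unless stated otherwise, the cytoplasmic NES-PylRS system previously described by Nikic et al. (Angew Chem Int Ed Engl 2016, 55(52):16172-16176), but carrying the corresponding AF, AA or AAAF mutations or one of the PylRS mutants CpkRS, CbzRS, IFRS1 and OMeRS, was used as the non-specific reference (negative control).
All experiments were performed in the presence of the codon-specific tRNAPyl and the non-canonical amino acid corresponding to the PylRS mutant [e.g., CpkRS with cyclopropene-L-lysine, CbzRS with N(ε)-benzyloxycarbonyl-L-lysine, IFRS1 with 3-iodo-L-phenylalanine, OMeRS with 4-methoxy-L-phenylalanine].
All constructs were tested with their respective reporters: ms2 loops for MCP, boxB loops for λN22 and pp7 loops for PCP.
In all fusion constructs, the synthetases should be freely interchangeable.
For the SYNZIP constructs, it is important to note that SYNZIP1 pairs with SYNZIP2 and SYNZIP3 pairs with SYNZIP4. In principle, all other SYNZIPs described should work similarly (https://pubs.acs.org/doi/pdf/10.1021/ja907617a).
Table 3: OT fusion constructs tested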
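AA: amino acid sequence
Abbreviations
“-” or “::”: symbol denoting a peptide linkage
“·”: symbol denoting a combination of polypeptides
AP: polypeptide segment acting as an assembler
AFP: assembler fusion protein
BSA: bovine serum albumin
BoxB: bacteriophage λ RNA stem-loop, specific binding site of λN22
CbzRS: Methanosarcina mazei PylRS (Y306M, L309G, C348T)
CDS: coding sequence
CG1: CG1 (Nup42) nucleoporin targeting the nuclear membrane
CMPSiaTr: CMP-sialic acid transporter targeting the Golgi membrane
CpkRS: Methanosarcina mazei PylRS (A302S
EB1: EB1 protein targeting microtubule plus ends
EBAG9: receptor-binding cancer antigen expressed on SiSo cells
EBAG9FL: full-length EBAG9 protein targeting the Golgi membrane
EBAG9(1-29): amino acid residues 1-29 (N-terminal) of EBAG9, targeting the Golgi membrane
EGFP149TAG: enhanced green fluorescent protein in which amino acid position 149 is encoded by an amber codon (TAG)
EP: polypeptide segment acting as an effector
ER: endoplasmic reticulum
EWSR1: Ewing sarcoma breakpoint region 1 (also referred to herein as EWS)
FBS: fetal bovine serum
FFC: fluorescence flow cytometry
FISH: fluorescence in situ hybridization
FRB-CD28: synthetic membrane-targeting domain derived from the transmembrane proteins CD4, FRB (similar to mTOR) and CD28
FSC-A: forward scatter area
FUS: fused in sarcoma
FUS-CD28: synthetic membrane-targeting fusion polypeptide derived from CD4, FUS and CD28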
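GCE: genetic code expansion
GFP: green fluorescent protein
GFP39TAA: green fluorescent protein in which amino acid position 39 is encoded by an ochre codon (TAA)
GFP39TAG: green fluorescent protein in which amino acid position 39 is encoded by an amber codon (TAG)
GFP39TGA: green fluorescent protein in which amino acid position 39 is encoded by an opal codon (TGA)
GFP39,149TAG: green fluorescent protein in which amino acid positions 39 and 149 are each encoded by an amber codon (TAG)
GFP39,149,182TAG: green fluorescent protein in which amino acid positions 39, 149 and 182 are each encoded by an amber codon (TAG)
IC-TP: intracellular targeting polypeptide
IDP: intrinsically disordered protein
IFRS1: Methanosarcina mazei PylRS (L305M, Y306L, L309S, N346S, C348M)
INSR: insulin receptor
INSR676TAG: insulin receptor in which amino acid position 676 is encoded by an amber codon (TAG)
iRFP: near-infrared fluorescent protein
KIF13A: kinesin family member 13A; unless stated otherwise herein, "KIF13A" specifically refers to the fragment covering amino acid residues 1-411 of KIF13A in which P390 is deleted (KIF13A(1-411, ΔP390))
KIF16B: kinesin family member 16B; unless stated otherwise herein, "KIF16B" specifically refers to the fragment covering amino acid residues 1-400 of KIF16B (KIF16B(1-400))
λN22: 22-amino-acid RNA-binding domain of the bacteriophage λ antiterminator protein N
LcK: post-translational modification site for plasma membrane targeting of lymphocyte-specific protein tyrosine kinase
mCherry185TAG: mCherry in which amino acid position 185 is encoded by an amber codon (TAG)
MCP: MS2 bacteriophage coat protein
MLC: membraneless compartment
MS2: Enterobacteria phage MS2
MS2-tag: two MS2 RNA stem-loops fused to the 3' untranslated region of the mRNA (or its coding sequence)
ms2: MS2 tag
ncAA: non-canonical amino acid
NLS: nuclear localization sequence
Nup153: nucleoporin 153
O-RS: orthogonal aminoacyl-tRNA synthetase
OMeRS: Methanosarcina mazei PylRS (A302T, Y384F, N346V, C348W, V401L)
OT assembler assembly: GCE system components spatially enriched in a membraneless assembler assembly that is capable of acting as an artificial orthogonal translation (OT) organelle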
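P450 2C1(1-27): residues 1-27 (N-terminal) of P450 2C1, targeting the ER membrane
PBS: phosphate-buffered saline
PCP: bacteriophage coat protein targeting the pp7 loop tag
PEI: polyethylenimine
POI: polypeptide of interest (= target polypeptide)
POITAG: POI (or its coding sequence) comprising an amber- (TAG-) encoded amino acid residue
pp7: pp7 loop tag from the RNA bacteriophage pp7
PSP: phase-separating polypeptide
PylRS: pyrrolysyl-tRNA synthetase
PylRSAA: mutant Methanosarcina mazei pyrrolysyl-tRNA synthetase comprising the amino acid substitutions N346A and C348A
PylRSAF: mutant Methanosarcina mazei pyrrolysyl-tRNA synthetase comprising the amino acid substitutions Y306A and Y384F
PylRSAAAF: mutant Methanosarcina mazei pyrrolysyl-tRNA synthetase comprising the amino acid substitutions Y306A, N346A, C348A and Y384F
RNA-TP: RNA-targeting polypeptide
RS: aminoacyl-tRNA synthetase
RT: room temperature
SCO: cyclooctyne lysine
SEM: standard error of the mean
SSC: saline-sodium citrate (buffer)
SSC-A: side scatter area
SSC-W: side scatter width
SPD5: spindle-defective protein 5
SYNZIP1: synthetic coiled-coil peptide 1
SYNZIP2: synthetic coiled-coil peptide 2
SYNZIP3: synthetic coiled-coil peptide 3
SYNZIP4: synthetic coiled-coil peptide 4
TOMM20: translocase of outer mitochondrial membrane 20
TOMM20(1-70): fragment covering amino acid residues 1-70 of TOMM20
tRNAPyl: tRNA that is coupled by wild-type or modified PylRS to a pyrrolysyl residue or another non-canonical amino acid residue and that has an anticodon for site-specific incorporation of the (non-canonical) amino acid residue into the POI, preferably the reverse complement of the selector codon. The tRNAPyl used in the examples carries an anticodon for the stop codon amber (tRNAPyl,CUA), ochre (tRNAPyl,UUA) or opal (tRNAPyl,UCA), depending on which of these is used as the selector codon in the POI coding sequence.
3'UTR: 3' untranslated region
Vim116TAG: vimentin in which amino acid position 116 is encoded by an amber codon (TAG)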
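Sequences
The following section sets out the sequences of the polypeptides and polynucleotides described herein.
Nucleic acid sequences are given in the 5' to 3' direction; protein sequences are given from the N-terminus to the C-terminus.
Sequences – Set 1
1. Hybridization probes
Hybridization probe for tRNAPyl, labeled with Cy5 at the 5' end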
CTAACCCGGCTGAACGGATTTAGAGTCCATTCGATC(SEQ ID NO:1)
Hybridization probe for tRNAPyl, labeled with digoxigenin at the 5' end
CTAACCCGGCTGAACGGATTTAGAGTCCATTCGATC(SEQ ID NO:2)
Hybridization probe for the MS2 RNA stem-loop sequence, labeled with Alexa Fluor 647 at the 5' end
CTGCAGACATGGGTGATCCTCATGTTTTCTA(SEQ ID NO:3)
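A FISH probe hybridizes to its target only if it is the reverse complement of (part of) that target. The sketch below checks this for the tRNAPyl probe against the tRNAPyl,CUA DNA sequence (SEQ ID NO:4) and for the MS2 probe against the MS2 stem-loop DNA sequence (SEQ ID NO:17), both copied from this sequence listing. The function names are illustrative, and exact Watson-Crick matching is a simplifying assumption.

```python
# Watson-Crick complement table for DNA.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_antisense(probe: str, target: str) -> bool:
    """True if the probe's reverse complement contains, or is contained in, the target."""
    rc = reverse_complement(probe)
    return rc in target or target in rc


PROBE_TRNA = "CTAACCCGGCTGAACGGATTTAGAGTCCATTCGATC"   # SEQ ID NO:1 / NO:2
PROBE_MS2 = "CTGCAGACATGGGTGATCCTCATGTTTTCTA"         # SEQ ID NO:3
TRNA_PYL_CUA = ("GGAAACCTGATCATGTAGATCGAATGGACTCTAAATCCGT"
                "TCAGCCGGGTTAGATTCCCGGGGTTTCCG")        # SEQ ID NO:4
MS2_STEM_LOOP = "ACATGAGGATCACCCATGT"                   # SEQ ID NO:17

print("tRNA probe matches tRNAPyl,CUA:", is_antisense(PROBE_TRNA, TRNA_PYL_CUA))
print("MS2 probe matches MS2 stem-loop:", is_antisense(PROBE_MS2, MS2_STEM_LOOP))
```

The tRNAPyl probe covers 36 nucleotides of the tRNA, including the anticodon, while the MS2 probe spans the complete 19-nucleotide stem-loop plus flanking bases.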
2. tRNA
DNA sequence of tRNAPyl,CUA (Methanosarcina mazei pyrrolysyl tRNA for the amber codon; the anticodon is underlined)
GGAAACCTGATCATGTAGATCGAATGGACTCTAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG(SEQ ID NO:4)
DNA sequence of tRNAPyl,UCA (Methanosarcina mazei pyrrolysyl tRNA for the opal codon; the anticodon is underlined)
GGAAACCTGATCATGTAGATCGAATGGACTTCAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG(SEQ ID NO:5)
DNA sequence of tRNAPyl,UUA (Methanosarcina mazei pyrrolysyl tRNA for the ochre codon; the anticodon is underlined)
GGAAACCTGATCATGTAGATCGAATGGACTTTAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG(SEQ ID NO:6)
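The three tRNAPyl sequences differ only in the anticodon. As a sketch, the code below aligns SEQ ID NO:4 to NO:6 position by position, reports where they differ, and checks that each anticodon triplet is the reverse complement of its stop codon. The 0-based offset of the triplet is inferred from the comparison itself; variable and function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


TRNA_GENES = {
    "amber": "GGAAACCTGATCATGTAGATCGAATGGACTCTAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG",  # SEQ ID NO:4
    "opal":  "GGAAACCTGATCATGTAGATCGAATGGACTTCAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG",  # SEQ ID NO:5
    "ochre": "GGAAACCTGATCATGTAGATCGAATGGACTTTAAATCCGTTCAGCCGGGTTAGATTCCCGGGGTTTCCG",  # SEQ ID NO:6
}
STOP_CODONS_DNA = {"amber": "TAG", "opal": "TGA", "ochre": "TAA"}

# Positions (0-based) at which the three sequences do not all agree.
differing = [i for i, column in enumerate(zip(*TRNA_GENES.values()))
             if len(set(column)) > 1]

# The anticodon triplet starts at the first differing position; its third base
# is shared because it pairs with the first base (T/U) of every stop codon.
start = differing[0]
anticodons = {name: seq[start:start + 3] for name, seq in TRNA_GENES.items()}

print("differing positions:", differing)
for name, anticodon in anticodons.items():
    ok = reverse_complement(anticodon) == STOP_CODONS_DNA[name]
    print(f"{name}: anticodon {anticodon} reads {STOP_CODONS_DNA[name]}: {ok}")
```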
3. O-RS
PylRSAF (Methanosarcina mazei pyrrolysyl-tRNA synthetase double mutant: Y306A, Y384F; Uniprot: Q8PWY1)
DNA:
ATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:7)
蛋白:MACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:8)
PylRSAA (Methanosarcina mazei pyrrolysyl-tRNA synthetase double mutant: N346A, C348A; Uniprot: Q8PWY1)
DNA:
ATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:9)
Protein:
MACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:10)
PylRSAAAF (Methanosarcina mazei pyrrolysyl-tRNA synthetase quadruple mutant: Y306A, N346A, C348A, Y384F; Uniprot: Q8PWY1)
DNA:
GCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:11)
Protein:
ACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:12)
4. RNA-TP
MCP (coat protein of Enterobacteria phage MS2)
DNA:
GCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTAC(SEQ ID NO:13)
Protein:
ASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:14)
λN22 (22-amino-acid RNA-binding domain of the bacteriophage λ antiterminator protein N)
DNA:
ATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAAC(SEQ ID NO:15)
Protein:
MDAQTRRRERRAEKQAQWKAAN(SEQ ID NO:16)
5.TN
DNA sequence of the Enterobacteria phage MS2 RNA stem-loop
ACATGAGGATCACCCATGT(SEQ ID NO:17)
DNA sequence of BoxB (bacteriophage λ RNA stem-loop, specific binding site of λN22)
GCCCTGAAAAAGGGC(SEQ ID NO:18)
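Both RNA tags are hairpins: the 5' and 3' ends are reverse complements of each other and form the stem, while the middle forms the loop. The sketch below measures the length of the perfectly paired terminal stem for the MS2 and BoxB sequences given above. It considers only uninterrupted Watson-Crick pairs from the two ends, which is a simplification; bulges and G-U pairs are ignored, and the function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def terminal_stem_length(seq: str) -> int:
    """Number of uninterrupted base pairs formed between the two ends of seq."""
    seq = seq.upper()
    k = 0
    # Extend the stem one base pair at a time from the outside inward.
    while k < len(seq) // 2 and seq[:k + 1] == reverse_complement(seq[-(k + 1):]):
        k += 1
    return k


MS2_STEM_LOOP = "ACATGAGGATCACCCATGT"   # SEQ ID NO:17
BOXB = "GCCCTGAAAAAGGGC"                # SEQ ID NO:18

for name, seq in (("MS2", MS2_STEM_LOOP), ("BoxB", BOXB)):
    k = terminal_stem_length(seq)
    print(f"{name}: stem {seq[:k]} / {seq[-k:]}, inner region {seq[k:len(seq) - k]}")
```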
6. IC-TP
KIF16B(1-400) (Homo sapiens kinesin family member 16B fragment covering amino acid residues 1-400; Uniprot: Q96L93)
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACA(SEQ IDNO:19)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPT(SEQ ID NO:20)
KIF13A(1-411, ΔP390) (Homo sapiens kinesin family member 13A fragment covering amino acid residues 1-411, in which P390 is deleted; Uniprot: Q9H1H9)
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAA(SEQ ID NO:21)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWE(SEQ ID NO:22)
TOMM20(1-70) (Homo sapiens translocase of outer mitochondrial membrane 20 fragment covering amino acid residues 1-70; Uniprot: Q15388)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTC(SEQ ID NO:23)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFF(SEQ ID NO:24)
LcK (post-translational modification site for plasma membrane targeting of Mus musculus lymphocyte-specific protein tyrosine kinase; Uniprot: P06240)
DNA:
GGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTC(SEQ ID NO:25)
Protein: (the portion identical to P06240 is underlined)
GCVCSSNPEGTEL(SEQ ID NO:26)
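FRB-CD28 (synthetic membrane-targeting fusion polypeptide derived from Mus musculus CD4 (Uniprot: P06332), FRB (similar to Homo sapiens mTOR; Uniprot: P42345) and Mus musculus CD28 (Uniprot: P31041))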
DNA:ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTT(SEQID NO:27)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCV(SEQ ID NO:28)
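FUS-CD28 (synthetic membrane-targeting fusion polypeptide derived from Mus musculus CD4 (Uniprot: P06332), Homo sapiens fused in sarcoma (Uniprot: P35637) and Mus musculus CD28 (Uniprot: P31041))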
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTT(SEQ ID NO:29)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCV(SEQ ID NO:30)
7. PSP
SPD5 (Caenorhabditis elegans spindle-defective protein 5; Uniprot: P91349)
DNA:
ATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAG
CAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAG(SEQ ID NO:31)
Protein:
MEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKK(SEQ ID NO:32)
FUS (Homo sapiens fused in sarcoma; Uniprot: P35637)
DNA:
ATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGC(SEQ ID NO:33)
Protein:
MASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGG(SEQ ID NO:34)
EWSR1 (Homo sapiens Ewing sarcoma breakpoint region 1; Uniprot: Q01844)
DNA:
ATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCCACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAG(SEQ IDNO:35)
Protein:
MASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQS SYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQ(SEQ I D NO:36)
8. AFP
EWSR1-MCP
DNA:
ATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCCACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCA
GTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:37)
Protein:
MASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:38)
FUS-MCP
DNA:
ATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ IDNO:39)
Protein:
MASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:40)
FUS-PylRSAF
DNA:
ATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCG
TACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:41)
Protein:
MASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:42)
MCP-PylRSAF
DNA:
ATGGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ IDNO:43)
Protein:
MASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:44)
SPD5-MCP
DNA:
ATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAG
CAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAG
CAAACTCCGGCATCTACTAA(SEQ ID NO:45)
Protein:
MEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:46)
SPD5-PylRSAF
DNA:
ATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAG
CAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTA
AAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:47)
Protein:
MEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:48)
KIF16B-FUS-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACG
GCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:49)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:50)
KIF16B-VSV-G-FUS-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTG
CCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:51)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:52)
KIF16B-FUS-PylRSAA
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTG
TTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:53)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:54)
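As a consistency check on the listing, each DNA entry can be translated in frame and compared with the protein entry printed after it. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` and the codon table are illustrative and not part of the original listing. The two test fragments are the first 30 and the last 24 nucleotides of SEQ ID NO:53.

```python
# Standard genetic code in TCAG order (an assumption; the listing does not name a code table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are removed first,
    because long sequences in the listing are wrapped over several lines.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the open reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# 5' end and 3' end of SEQ ID NO:53 (both fragments are in frame).
head = translate("ATGGCATCGGTCAAGGTGGCCGTGAGGGTC")
tail = translate("AACGGGATCTCTACGAACCTGTAA")
print(head, tail)
```

The translated fragments can then be compared with the start and end of the corresponding protein entry (SEQ ID NO:54); applying the same helper to a full, concatenated DNA entry would check the whole sequence.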
KIF16B-FUS-PylRSAAAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACG
GCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:55)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:56)
KIF16B-EWSR1-MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATT
CCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:57)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:58)
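The fusion constructs in this listing share short recurring peptide motifs between their domains. The sketch below shows one way to locate such motifs in a protein string. The helper `find_motifs` and the motif labels are illustrative assumptions, not annotations taken from the listing: DYKDDDDK and EQKLISEEDL are the commonly used FLAG and c-Myc epitope sequences, and GAPGSAGSAAGSG is the stretch that recurs between domains in the protein entries here. The test fragment is an excerpt of SEQ ID NO:58.

```python
# Motif labels are assumptions added for illustration; the listing itself does not annotate them.
MOTIFS = {
    "FLAG": "DYKDDDDK",
    "Myc": "EQKLISEEDL",
    "linker": "GAPGSAGSAAGSG",
}


def find_motifs(protein, motifs=MOTIFS):
    """Return {label: [0-based start offsets]} for every motif occurrence."""
    protein = "".join(protein.split()).upper()
    hits = {}
    for label, motif in motifs.items():
        positions = []
        start = protein.find(motif)
        while start != -1:
            positions.append(start)
            # Search again one residue further on, so overlapping hits are kept.
            start = protein.find(motif, start + 1)
        hits[label] = positions
    return hits


# Excerpt of SEQ ID NO:58 spanning the junction before the MCP domain.
fragment = "PGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLV"
hits = find_motifs(fragment)
print(hits)
```

Offsets are relative to the string passed in, so running the helper on a full protein entry gives positions within that entry.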
KIF16B-FUS-4xλN22-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCG
AAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:59)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:60)
KIF16B-FUS-MCP-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGA
TCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:61)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:62)
KIF16B-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACT
GCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:63)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:64)
KIF16B-MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:65)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:66)
KIF16B-MCP-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGT
GAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:67)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:68)
KIF16B-SPD5-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTcGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAA
CGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGA
TCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGC
ATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:69)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLG
IDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:70)
KIF16B-SPD5-MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAA
CGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGA
TCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:71)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:72)
KIF16B-SPD5-MCP-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAA
CGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGA
TCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGAC
CTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:73)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKD
LQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:74)
KIF16B-SPD5-4xλN22-PylRSAF
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAA
CGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGA
TCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGTCCGGATATCCCTATGATGTGCCGGATTATGCTTCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCG
AAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:75)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKSGYPYDVPDYASTMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNP
KDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:76)
KIF13A-FUS-PylRSAF
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAA
CCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGG
TGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:77)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:78)
KIF13A-FUS-PylRSAA
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAA
CCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGC
ACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:79)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:80)
KIF13A-FUS-PylRSAAAF
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAA
CCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGG
TGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:81)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:82)
KIF13A-EWSR1-MCP
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACA
GAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:83)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:84)
KIF13A-SPD5-PylRSAF
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCAC
CGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCA
GCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGC
ATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:85)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLES
IITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:86)
KIF13A-SPD5-MCP
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAAATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCAC
CGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAGCAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCA
GCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:87)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEMEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPIIQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:88)
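Each construct in this listing is given as a DNA sequence followed by the protein it encodes. As a minimal illustrative sketch (not part of the original disclosure), the correspondence between a DNA entry and its protein entry could be checked by translating the open reading frame with the standard genetic code. The function name `translate` and the table construction below are assumptions chosen for illustration; the sketch assumes the sequence starts in frame at the first base and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper for illustration: trailing bases that do not
    form a complete codon are ignored.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons shared by the KIF13A constructs in this listing.
print(translate("ATGTCGGATACCAAGGTAAAAGTTGCC"))
# Last codons of the MCP-terminated constructs, including the TAA stop.
print(translate("GGCATCTACTAA"))
```

Applied to a full DNA entry with its line breaks removed, the result can be compared with the corresponding protein entry to confirm that the pair is consistent.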
KIF13A-FUS-MCP-PylRSAF
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAA
CCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCC
GCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:89)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:90)
KIF13A-PylRSAF
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTT
TCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:91)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:92)
KIF13A-MCP
DNA:
ATGTCGGATACCAAGGTAAAAGTTGCCGTCCGGGTCCGGCCCATGAACCGACGAGAACTGGAACTGAACACCAAGTGCGTGGTGGAGATGGAAGGGAATCAAACGGTCCTGCACCCTCCTCCTTCTAACACCAAACAGGGAGAAAGGAAACCTCCCAAGGTATTTGCCTTTGATTATTGCTTTTGGTCCATGGATGAATCTAACACTACAAAATACGCTGGTCAAGAAGTGGTTTTCAAGTGCCTTGGGGAAGGAATTCTTGAAAAAGCCTTTCAGGGGTATAATGCGTGTATTTTTGCATATGGACAGACAGGTTCGGGAAAATCCTTTTCCATGATGGGCCATGCTGAGCAGCTGGGCCTTATTCCAAGGCTCTGCTGTGCTTTATTTAAAAGGATCTCTTTGGAGCAAAATGAGTCACAGACCTTTAAAGTTGAAGTGTCCTATATGGAAATTTATAATGAGAAAGTTCGGGATCTTTTAGACCCCAAAGGGAGTAGACAGTCTCTTAAAGTTCGAGAACATAAAGTTTTGGGACCATATGTAGATGGTTTATCTCAACTAGCTGTCACTAGTTTTGAGGATATTGAGTCATTGATGTCTGAGGGAAATAAGTCTCGAACGGTAGCTGCTACCAACATGAACGAAGAAAGCAGCCGCTCCCATGCTGTGTTCAACATCATAATCACACAGACACTTTATGACCTGCAGTCTGGGAATTCCGGGGAGAAAGTCAGTAAGGTCAGCTTGGTAGACCTGGCGGGTAGCGAAAGAGTATCTAAAACAGGAGCTGCAGGAGAGCGACTGAAAGAAGGCAGCAACATTAACAAATCGCTTACAACCTTGGGGTTGGTTATATCATCACTGGCTGACCAGGCAGCTGGCAAGGGTAAAAGCAAATTTGTGCCTTATCGAGATTCAGTCCTCACTTGGCTGCTTAAGGACAACTTGGGGGGCAACAGCCAAACCTCTATGATAGCCACAATCAGCCCAGCCGCAGACAACTATGAAGAGACCCTCTCCACATTAAGATATGCAGACCGAGCCAAAAGGATTGTGAACCATGCTGTTGTGAATGAGGACCCCAACGCAAAAGTGATCCGAGAACTGCGGGAGGAAGTCGAGAAACTGAGAGAGCAGCTCTCTCAGGCAGAGGCCATGAAGGCCGAACTGAAGGAGAAGCTCGAAGAGTCTGAAAAGCTGATAAAAGAACTAACAGTGACTTGGGAATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ IDNO:93)
Protein:
MSDTKVKVAVRVRPMNRRELELNTKCVVEMEGNQTVLHPPPSNTKQGERKPPKVFAFDYCFWSMDESNTTKYAGQEVVFKCLGEGILEKAFQGYNACIFAYGQTGSGKSFSMMGHAEQLGLIPRLCCALFKRISLEQNESQTFKVEVSYMEIYNEKVRDLLDPKGSRQSLKVREHKVLGPYVDGLSQLAVTSFEDIESLMSEGNKSRTVAATNMNEESSRSHAVFNIIITQTLYDLQSGNSGEKVSKVSLVDLAGSERVSKTGAAGERLKEGSNINKSLTTLGLVISSLADQAAGKGKSKFVPYRDSVLTWLLKDNLGGNSQTSMIATISPAADNYEETLSTLRYADRAKRIVNHAVVNEDPNAKVIRELREEVEKLREQLSQAEAMKAELKEKLEESEKLIKELTVTWEYTDIEMNRLGKGAPGSAGSAAGSGMASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:94)
TOMM20-EWSR1-MCP
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:95)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:96)
TOMM20-EWSR1-HA-MCP
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:97)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:98)
TOMM20-FUS-PylRSAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAA
CAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:99)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:100)
TOMM20-FUS-V5-PylRSAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:101)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:102)
TOMM20-FUS-PylRSAA
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGA
CCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:103)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:104)
TOMM20-FUS-V5-PylRSAA
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:105)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:106)
TOMM20-FUS-PylRSAAAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAA
CAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:107)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:108)
TOMM20-FUS-V5-PylRSAAAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTT
GCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:109)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:110)
TOMM20-EWSR1-λN22
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:111)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV(SEQ ID NO:112)
TOMM20-EWSR1-Myc-λN22
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:113)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV(SEQ ID NO:114)
TOMM20-3xMCP-PylRSAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCTATACAGATATTGAAATGAACAGATTGGGAAAGGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTGCCCCAGGCTCCGCAGGAAGCGCAGCGGGGTCCGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCACCTGGTAGTGCTGGTTCTGCTGCTGGATCAGGTGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAG
TATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:115)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFYTDIEMNRLGKEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:116)
TOMM20-FUS-3xMCP-PylRSAF
DNA:ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAA
TGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTGCCCCAGGCTCCGCAGGAAGCGCAGCGGGGTCCGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCACCTGGTAGTGCTGGTTCTGCTGCTGGATCAGGTGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGT
GAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:117)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ IDNO:118)
TOMM20-FUS-3xMCP-PylRSAAAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCA
AGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTGCCCCAGGCTCCGCAGGAAGCGCAGCGGGGTCCGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCACCTGGTAGTGCTGGTTCTGCTGCTGGATCAGGTGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGA
ACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:119)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ IDNO:120)
TOMM20-FUS-4xλN22-PylRSAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGG
CGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:121)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:122)
TOMM20-FUS-4xλN22-PylRSAAAF
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGG
CGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:123)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:124)
LcK-EWSR1-MCP
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCCACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGC
CGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:125)
Protein:
MGCVCSSNPEGTELASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:126)
LcK-EWSR1-4xλN22
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGC
ACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTAAACCCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGA(SEQ ID NO:127)
Protein:
MGCVCSSNPEGTELASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVKPADQPRLCLLVASHLLFAPPPCLP(SEQ ID NO:128)
LcK-FUS-PylRSAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGC
CATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:129)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:130)
LcK-FUS-MCP-PylRSAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCT
GAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:131)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:132)
LcK-FUS-3xMCP-PylRSAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTGCCCCAGGCTCCGCAGGAAGCGCAGCGGGGTCCGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAA
CTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCACCTGGTAGTGCTGGTTCTGCTGCTGGATCAGGTGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACA
AACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:133)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:134)
LcK-FUS-PylRSAAAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAACACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGC
CATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:135)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSNTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:136)
LcK-PylRSAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:137)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:138)
LcK-PylRSAAAF
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:139)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:140)
LcK-MCP
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:141)
Protein:
MGCVCSSNPEGTELEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:142)
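Each construct in this listing is given as a DNA sequence followed by the protein it encodes, so a pair can be cross-checked by translating the DNA with the standard genetic code. The following is a minimal sketch of such a check, not part of the original listing. It assumes the standard codon table, and the names `CODON_TABLE` and `translate` are illustrative. It uses only the first 14 codons and the last four codons of the LcK-MCP DNA (SEQ ID NO:141) as copied from the listing above.

```python
# Illustrative check: translate a DNA fragment with the standard genetic code.
# CODON_TABLE and translate are hypothetical helper names, not from the patent.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard genetic code, '*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 14 codons of SEQ ID NO:141 (LcK-MCP DNA).
start_fragment = "ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTC"
# Last four codons of SEQ ID NO:141, including the TAA stop codon.
end_fragment = "GGCATCTACTAA"

print(translate(start_fragment))
print(translate(end_fragment))
```

Under these assumptions, the two fragments translate to the beginning and the end of the protein given as SEQ ID NO:142. The same function could be applied to a full DNA entry once any line breaks inside the sequence are removed.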
FRB-CD28-FUS-PylRSAF
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCT
GAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:143)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:144)
FRB-CD28-FUS-PylRSAA
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCT
GAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTTCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:145)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:146)
FRB-CD28-EWSR1-MCP
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGT
CCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:147)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:148)
FRB-CD28-EWSR1-4xλN22
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGT
CCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:149)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV(SEQ ID NO:150)
FUS-CD28-FUS-PylRSAF
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCA
GAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGA
GCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:151)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ IDNO:152)
FUS-CD28-FUS-PylRSAA
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCA
GAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGA
GCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:153)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ IDNO:154)
FUS-CD28-FUS-MCP-PylRSAF
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCA
GAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGT
CCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:155)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:156)
FUS-CD28-FUS-MCP-PylRSAA
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCA
GAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGA
GCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:157)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ IDNO:158)
FUS-CD28-EWSR1-MCP
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTA
TGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCCACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:159)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY(SEQ ID NO:160)
FUS-CD28-EWSR1-4xλN22
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTA
TGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATTACAAGGATGACGACGATAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGA
AACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:161)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDYKDDDDKGTEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV(SEQ IDNO:162)
FRB-CD28-FUS-MCP-PylRSAA
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGG
CGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:163)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFNQAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:164)
FRB-CD28-FUS-MCP-PylRSAF
DNA:
ATGTGCCGAGCCATCTCTCTTAGGCGCTTGCTGCTGCTGCTGCTGCAGCTGTCACAACTCCTAGCTGTCACTCAAGGGATGCTCGAGATGTGGCATGAAGGCCTGGAAGAGGCATCTCGTTTGTACTTTGGGGAAAGGAACGTGAAAGGCATGTTTGAGGTGCTGGAGCCCTTGCATGCTATGATGGAACGGGGCCCCCAGACTCTGAAGGAAACATCCTTTAATCAGGCCTATGGTCGAGATTTAATGGAGGCCCAAGAGTGGTGCAGGAAGTACATGAAATCAGGGAATGTCAAGGACCTCCTCCAAGCCTGGGACCTCTATTATCATGTGTTCCGACGAATCTCAAAGACTAGAACCGGTAAGCTTTTTTGGGCACTGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCTTGCTAGTGACAGTGGCTCTTTGTGTTATCTGGGTAAGATCTGGTATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGG
CGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:165)
Protein:
MCRAISLRRLLLLLLQLSQLLAVTQGMLEMWHEGLEEASRLYFGERNVKGMFEVLEPLHAMMERGPQTLKETSFN
QAYGRDLMEAQEWCRKYMKSGNVKDLLQAWDLYYHVFRRISKTRTGKLFWALVVVAGVLFCYGLLVTVALCVIWVRSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:166)
FUS-MCP-PylRSAF
DNA:
ATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCG
TACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:167)
Protein:
MASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:168)
SPD5-MCP-PylRSAF
DNA:
ATGGAGGACAACAGCGTGCTGAACGAGGACAGCAACCTGGAGCACGTGGAGGGCCAGCCCAGAAGAAGCATGAGCCAGCCCGTGCTGAACGTGGAGGGCGACAAGAGAACCAGCAGCACCAGCGCCACCCAGCAGCAGGTGCTGAGCGGCGCCTTCAGCAGCGCCGACGTGAGAAGCATCCCCATCATCCAGACCTGGGAGGAGAACAAGGCCCTGAAGACCAAGATCACCATCCTGAGAGGCGAGCTGCAGATGTACCAGAGAAGATACAGCGAGGCCAAGGAGGCCAGCCAGAAGAGAGTGAAGGAGGTGATGGACGACTACGTGGACCTGAAGCTGGGCCAGGAGAACGTGCAGGAGAAGATGGAGCAGTACAAGCTGATGGAGGAGGACCTGCTGGCCATGCAGAGCAGAATCGAGACCAGCGAGGACAACTTCGCCAGACAGATGAAGGAGTTCGAGGCCCAGAAGCACGCCATGGAGGAGAGAATCAAGGAGCTGGAGCTGAGCGCCACCGACGCCAACAACACCACCGTGGGCAGCTTCAGAGGCACCCTGGACGACATCCTGAAGAAGAACGACCCCGACTTCACCCTGACCAGCGGCTACGAGGAGAGAAAGATCAACGACCTGGAGGCCAAGCTGCTGAGCGAGATCGACAAGGTGGCCGAGCTGGAGGACCACATCCAGCAGCTGAGACAGGAGCTGGACGACCAGAGCGCCAGACTGGCCGACAGCGAGAACGTGAGAGCCCAGCTGGAGGCCGCCACCGGCCAGGGCATCCTGGGCGCCGCCGGCAACGCCATGGTGCCCAACAGCACCTTCATGATCGGCAACGGCAGAGAGAGCCAGACCAGAGACCAGCTGAACTACATCGACGACCTGGAGACCAAGCTGGCCGACGCCAAGAAGGAGAACGACAAGGCCAGACAGGCCCTGGTGGAGTACATGAACAAGTGCAGCAAGCTGGAGCACGAGATCAGAACCATGGTGAAGAACAGCACCTTCGACAGCAGCAGCATGCTGCTGGGCGGCCAGACCAGCGACGAGCTGAAGATCCAGATCGGCAAGGTGAACGGCGAGCTGAACGTGCTGAGAGCCGAGAACAGAGAGCTGAGAATCAGATGCGACCAGCTGACCGGCGGCGACGGCAACCTGAGCATCAGCCTGGGCCAGAGCAGACTGATGGCCGGCATCGCCACCAACGACGTGGACAGCATCGGCCAGGGCAACGAGACCGGCGGCACCAGCATGAGAATCCTGCCCAGAGAGAGCCAGCTGGACGACCTGGAGGAGAGCAAGCTGCCCCTGATGGACACCAGCAGCGCCGTGAGAAACCAGCAGCAGTTCGCCAGCATGTGGGAGGACTTCGAGAGCGTGAAGGACAGCCTGCAGAACAACCACAACGACACCCTGGAGGGCAGCTTCAACAGCAGCATGCCCCCCCCCGGCAGAGACGCCACCCAGAGCTTCCTGAGCCAGAAGAGCTTCAAGAACAGCCCCATCGTGATGCAGAAGCCCAAGAGCCTGCACCTGCACCTGAAGAGCCACCAGAGCGAGGGCGCCGGCGAGCAGATCCAGAACAACAGCTTCAGCACCAAGACCGCCAGCCCCCACGTGAGCCAGAGCCACATCCCCATCCTGCACGACATGCAGCAGATCCTGGACAGCAGCGCCATGTTCCTGGAGGGCCAGCACGACGTGGCCGTGAACGTGGAGCAGATGCAGGAGAAGATGAGCCAGATCAGAGAGGCCCTGGCCAGACTGTTCGAGAGACTGAAGAGCAGCGCCGCCCTGTTCGAGGAGATCCTGGAGAGAATGGGCAGCAGCGACCCCAACGCCGACAAGATCAAGAAGATGAAGCTGGCCTTCGAGACCAGCATCAACGACAAGCTGAACGTGAGCGCCATCCTGGAGGCCGCCGAGAAGGACCTGCACAACATGAGCCTGAACTTCAGCATCCTGGAGAAGAGCATCGTGAGCCAGGCCGCCGAGGCCAG
CAGAAGATTCACCATCGCCCCCGACGCCGAGGACGTGGCCAGCAGCAGCCTGCTGAACGCCAGCTACAGCCCCCTGTTCAAGTTCACCAGCAACAGCGACATCGTGGAGAAGCTGCAGAACGAGGTGAGCGAGCTGAAGAACGAGCTGGAGATGGCCAGAACCAGAGACATGAGAAGCCCCCTGAACGGCAGCAGCGGCAGACTGAGCGACGTGCAGATCAACACCAACAGAATGTTCGAGGACCTGGAGGTGAGCGAGGCCACCCTGCAGAAGGCCAAGGAGGAGAACAGCACCCTGAAGAGCCAGTTCGCCGAGCTGGAGGCCAACCTGCACCAGGTGAACAGCAAGCTGGGCGAGGTGAGATGCGAGCTGAACGAGGCCCTGGCCAGAGTGGACGGCGAGCAGGAGACCAGAGTGAAGGCCGAGAACGCCCTGGAGGAGGCCAGACAGCTGATCAGCAGCCTGAAGCACGAGGAGAACGAGCTGAAGAAGACCATCACCGACATGGGCATGAGACTGAACGAGGCCAAGAAGAGCGACGAGTTCCTGAAGAGCGAGCTGAGCACCGCCCTGGAGGAGGAGAAGAAGAGCCAGAACCTGGCCGACGAGCTGAGCGAGGAGCTGAACGGCTGGAGAATGAGAACCAAGGAGGCCGAGAACAAGGTGGAGCACGCCAGCAGCGAGAAGAGCGAGATGCTGGAGAGAATCGTGCACCTGGAGACCGAGATGGAGAAGCTGAGCACCAGCGAGATCGCCGCCGACTACTGCAGCACCAAGATGACCGAGAGAAAGAAGGAGATCGAGCTGGCCAAGTACAGAGAGGACTTCGAGAACGCCGCCATCGTGGGCCTGGAGAGAATCAGCAAGGAGATCAGCGAGCTGACCAAGAAGACCCTGAAGGCCAAGATCATCCCCAGCAACATCAGCAGCATCCAGCTGGTGTGCGACGAGCTGTGCAGAAGACTGAGCAGAGAGAGAGAGCAGCAGCACGAGTACGCCAAGGTGATGAGAGACGTGAACGAGAAGATCGAGAAGCTGCAGCTGGAGAAGGACGCCCTGGAGCACGAGCTGAAGATGATGAGCAGCAACAACGAGAACGTGCCCCCCGTGGGCACCAGCGTGAGCGGCATGCCCACCAAGACCAGCAACCAGAAGTGCGCCCAGCCCCACTACACCAGCCCCACCAGACAGCTGCTGCACGAGAGCACCATGGCCGTGGACGCCATCGTGCAGAAGCTGAAGAAGACCCACAACATGAGCGGCATGGGCCCCGAGCTGAAGGAGACCATCGGCAACGTGATCAACGAGAGCAGAGTGCTGAGAGACTTCCTGCACCAGAAGCTGATCCTGTTCAAGGGCATCGACATGAGCAACTGGAAGAACGAGACCGTGGACCAGCTGATCACCGACCTGGGCCAGCTGCACCAGGACAACCTGATGCTGGAGGAGCAGATCAAGAAGTACAAGAAGGAGCTGAAGCTGACCAAGAGCGCCATCCCCACCCTGGGCGTGGAGTTCCAGGACAGAATCAAGACCGAGATCGGCAAGATCGCCACCGACATGGGCGGCGCCGTGAAGGAGATCAGAAAGAAGGGTACCGAGCAGAAGCTGATCTCAGAGGAGGACCTGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAG
CAAACTCCGGCATCTACGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:169)
Protein:
MEDNSVLNEDSNLEHVEGQPRRSMSQPVLNVEGDKRTSSTSATQQQVLSGAFSSADVRSIPI IQTWEENKALKTKITILRGELQMYQRRYSEAKEASQKRVKEVMDDYVDLKLGQENVQEKMEQYKLMEEDLLAMQSRIETSEDNFARQMKEFEAQKHAMEERIKELELSATDANNTTVGSFRGTLDDILKKNDPDFTLTSGYEERKINDLEAKLLSEIDKVAELEDHIQQLRQELDDQSARLADSENVRAQLEAATGQGILGAAGNAMVPNSTFMIGNGRESQTRDQLNYIDDLETKLADAKKENDKARQALVEYMNKCSKLEHEIRTMVKNSTFDSSSMLLGGQTSDELKIQIGKVNGELNVLRAENRELRIRCDQLTGGDGNLSISLGQSRLMAGIATNDVDSIGQGNETGGTSMRILPRESQLDDLEESKLPLMDTSSAVRNQQQFASMWEDFESVKDSLQNNHNDTLEGSFNSSMPPPGRDATQSFLSQKSFKNSPIVMQKPKSLHLHLKSHQSEGAGEQIQNNSFSTKTASPHVSQSHIPILHDMQQILDSSAMFLEGQHDVAVNVEQMQEKMSQIREALARLFERLKSSAALFEEILERMGSSDPNADKIKKMKLAFETSINDKLNVSAILEAAEKDLHNMSLNFSILEKSIVSQAAEASRRFTIAPDAEDVASSSLLNASYSPLFKFTSNSDIVEKLQNEVSELKNELEMARTRDMRSPLNGSSGRLSDVQINTNRMFEDLEVSEATLQKAKEENSTLKSQFAELEANLHQVNSKLGEVRCELNEALARVDGEQETRVKAENALEEARQLISSLKHEENELKKTITDMGMRLNEAKKSDEFLKSELSTALEEEKKSQNLADELSEELNGWRMRTKEAENKVEHASSEKSEMLERIVHLETEMEKLSTSEIAADYCSTKMTERKKEIELAKYREDFENAAIVGLERISKEISELTKKTLKAKIIPSNISSIQLVCDELCRRLSREREQQHEYAKVMRDVNEKIEKLQLEKDALEHELKMMSSNNENVPPVGTSVSGMPTKTSNQKCAQPHYTSPTRQLLHESTMAVDAIVQKLKKTHNMSGMGPELKETIGNVINESRVLRDFLHQKLILFKGIDMSNWKNETVDQLITDLGQLHQDNLMLEEQIKKYKKELKLTKSAIPTLGVEFQDRIKTEIGKIATDMGGAVKEIRKKGTEQKLISEEDLGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:170)
9. POI (reporter genes) and controls
GFP39TAG (GFP with amino acid position 39 encoded by an amber codon)
DNA: (the amber codon is underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGATTACAAAGATGATGATGATAAAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAGGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGCCAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCATCACCATCACCATCACTAA(SEQ ID NO:171)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASDYKDDDDKVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATXGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKANFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH(SEQ ID NO:172)
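The relationship between the amber-codon DNA above and the protein line containing X can be checked with a short script. The following is a minimal sketch, not part of the original listing: the function names `translate_with_amber` and `amber_positions` are hypothetical, the standard genetic code is assumed, and an in-frame TAG is assumed to be decoded as the non-canonical amino acid (written X) rather than as a stop.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_with_amber(dna: str, ncaa_symbol: str = "X") -> str:
    """Translate a coding sequence, writing in-frame TAG codons as ncaa_symbol.

    Assumption: TAG is suppressed (decoded as the ncAA); TAA and TGA terminate.
    """
    seq = "".join(dna.split()).upper()  # drop whitespace left by text extraction
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon == "TAG":
            protein.append(ncaa_symbol)
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # TAA or TGA: end of the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


def amber_positions(dna: str) -> list[int]:
    """Return 1-based residue positions of in-frame TAG codons in the translated product."""
    protein = translate_with_amber(dna, ncaa_symbol="X")
    return [i + 1 for i, residue in enumerate(protein) if residue == "X"]


if __name__ == "__main__":
    # Short in-frame fragment surrounding the amber codon of the DNA sequence above.
    fragment = "GGCGATGCCACCTAGGGCAAGCTG"
    print(translate_with_amber(fragment))
    print(amber_positions(fragment))
```

Positions returned by `amber_positions` count residues of the sequence that is passed in, including any N-terminal tag, so they will generally differ from the GFP numbering used in the construct name.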
GFP39TAG-2xMS2 (GFP39TAG with 2 MS2 stem-loops)
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGATTACAAAGATGATGATGATAAAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAGGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGCCAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCATCACCATCACCATCACTAAGGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:173)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASDYKDDDDKVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATXGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKANFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH(SEQ ID NO:174)
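Because the underlining that marked the MS2 stem-loops is lost in plain text, the loops can be located by searching for the repeated motif. The sketch below is illustrative only: the 19-nt motif `ACATGAGGATCACCCATGT` is taken to be the stem-loop because it is the element that occurs twice downstream of the stop codon in the GFP39TAG-2xMS2 DNA above, and the names `MS2_LOOP` and `find_ms2_loops` are hypothetical.

```python
import re

# Assumed MS2 stem-loop motif (DNA form), inferred from the repeated 3' element
# of the GFP39TAG-2xMS2 sequence in this listing.
MS2_LOOP = "ACATGAGGATCACCCATGT"


def find_ms2_loops(dna: str, motif: str = MS2_LOOP) -> list[int]:
    """Return 0-based start offsets of each motif occurrence in the cleaned sequence."""
    seq = "".join(dna.split()).upper()  # remove stray spaces and line breaks first
    return [m.start() for m in re.finditer(re.escape(motif), seq)]


if __name__ == "__main__":
    # 3' region of the GFP39TAG-2xMS2 DNA, starting at the TAA stop codon.
    tail = (
        "TAAGGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCACCCATGTCTGCAGG"
        "TCGACTCTAGAAAACATGAGGATCACCCATGT"
    )
    offsets = find_ms2_loops(tail)
    print(len(offsets), offsets)
```

Stripping whitespace before searching matters for text-extracted listings, where a stray space inside a sequence would otherwise hide an occurrence of the motif.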
mOrange
DNA:
ATGGTGAGCAAGGGCGAGGAGAATAATATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCTTTCAGACCGCTAAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCTCTTCACCTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTCAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTACGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCACTGCAGGACGGCGAGTTCATCTACAAGGTGAAGATGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAGGATGAGGCTGAAGCTGAAGGACGGCGGCCACTACACCTCCGAGGTCAAGACCACCTACAAGGCCAAGAAGTCCGTGCAGCTGCCCGGCGCCTACATCGTCGGCATCAAGCTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGTAA(SEQ ID NO:175)
Protein:
MVSKGEENNMAIIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGFQTAKLKVTKGGPLPFAWDILSPLFTYGSKAYVKHPADIPDYFKLSFPEGFKWERVMNYEDGGVVTVTQDSSLQDGEFIYKVKMRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIRMRLKLKDGGHYTSEVKTTYKAKKSVQLPGAYIVGIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK(SEQ ID NO:176)
iRFP (near-infrared fluorescent protein)
DNA:
GAAGGATCCGTCGCCAGGCAGCCTGACCTCTTGACCTGCGACGATGAGCCGATCCATATCCCCGGTGCCATCCAACCGCATGGACTGCTGCTCGCCCTCGCCGCCGACATGACGATCGTTGCCGGCAGCGACAACCTTCCCGAACTCACCGGACTGGCGATCGGCGCCCTGATCGGCCGCTCTGCGGCCGATGTCTTCGACTCGGAGACGCACAACCGTCTGACGATCGCCTTGGCCGAGCCCGGGGCGGCCGTCGGAGCACCGATCACTGTCGGCTTCACGATGCGAAAGGACGCAGGCTTCATCGGCTCCTGGCATCGCCATGATCAGCTCATCTTCCTCGAGCTCGAGCCTCCCCAGCGGGACGTCGCCGAGCCGCAGGCGTTCTTCCGCCGCACCAACAGCGCCATCCGCCGCCTGCAGGCCGCCGAAACCTTGGAAAGCGCCTGCGCCGCCGCGGCGCAAGAGGTGCGGAAGATTACCGGCTTCGATCGGGTGATGATCTATCGCTTCGCCTCCGACTTCAGCGGCGAAGTGATCGCAGAGGATCGGTGCGCCGAGGTCGAGTCAAAACTAGGCCTGCACTATCCTGCCTCAACCGTGCCGGCGCAGGCCCGTCGGCTCTATACCATCAACCCGGTACGGATCATTCCCGATATCAATTATCGGCCGGTGCCGGTCACCCCAGACCTCAATCCGGTCACCGGGCGGCCGATTGATCTTAGCTTCGCCATCCTGCGCAGCGTCTCGCCCGTCCATCTGGAATTCATGCGCAACATAGGCATGCACGGCACGATGTCGATCTCGATTTTGCGCGGCGAGCGACTGTGGGGATTGATCGTTTGCCATCACCGAACGCCGTACTACGTCGATCTCGATGGCCGCCAAGCCTGCGAGCTAGTCGCCCAGGTTCTGGCCTGGCAGATCGGCGTGATGGAAGAG(SEQ ID NO:177)
Protein:
EGSVARQPDLLTCDDEPIHIPGAIQPHGLLLALAADMTIVAGSDNLPELTGLAIGALIGRSAADVFDSETHNRLTIALAEPGAAVGAPITVGFTMRKDAGFIGSWHRHDQLIFLELEPPQRDVAEPQAFFRRTNSAIRRLQAAETLESACAAAAQEVRKITGFDRVMIYRFASDFSGEVIAEDRCAEVESKLGLHYPASTVPAQARRLYTINPVRIIPDINYRPVPVTPDLNPVTGRPIDLSFAILRSVSPVHLEFMRNIGMHGTMSISILRGERLWGLIVCHHRTPYYVDLDGRQACELVAQVLAWQIGVMEE(SEQ ID NO:178)
mCherry185TAG (mCherry with amino acid position 185 encoded by an amber codon)
DNA: (the amber codon is underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGTAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGCATCATCATCATCATCATTAA(SEQ ID NO:179)
Protein: (X is the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKXPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKHHHHHH(SEQ ID NO:180)
mCherry185TAG-2xMS2 (mCherry185TAG with 2 MS2 RNA stem-loops)
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGTAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGCATCATCATCATCATCATTAAAGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCAC CCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:181)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKXPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKHHHHHH(SEQ ID NO:182)
mCherry185TAG-4xBoxB (mCherry185TAG with 4 BoxB loops)
DNA: (the BoxB stem-loops and the amber codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGTAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGCATCATCATCATCATCATTAAAGATCCTAAGGTACCGCCCTGAAAAAGGGCTCGAGCCCTGAA AAAGGGCAATTGCCCTGAAAAAGGGCGTCGACGCCCTGAAAAAGGGC(SEQ ID NO:183)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKXPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKHHHHHH(SEQ ID NO:184)
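The construct names state how many RNA stem-loops follow the coding sequence (2xMS2, 4xBoxB). A minimal Python sketch for checking that count against a DNA entry is given below. The helper `count_motif` is hypothetical. The two motif strings are assumptions inferred from the repeated segments at the 3' ends of the sequences in this listing, since the underlining that marked the stem-loops is not preserved in this text. Whitespace is removed before counting because some entries contain stray spaces.

```python
# Assumed stem-loop motifs, inferred from the repeated 3' segments of the
# listed DNA sequences (not stated explicitly in the text).
MS2_MOTIF = "ACATGAGGATCACCCATGT"
BOXB_MOTIF = "GCCCTGAAAAAGGGC"


def count_motif(dna, motif):
    """Count non-overlapping copies of motif in dna, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return seq.count(motif)


if __name__ == "__main__":
    # 3' tail copied from the mCherry185TAG-4xBoxB DNA entry, including
    # the stray space present in the text.
    boxb_tail = ("GGTACCGCCCTGAAAAAGGGCTCGAGCCCTGAA AAAGGGCAATTGCCCTGAAAAAGGGC"
                 "GTCGACGCCCTGAAAAAGGGC")
    print(count_motif(boxb_tail, BOXB_MOTIF))
```

The same call with `MS2_MOTIF` can be applied to the 2xMS2 entries.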
GFP39TAA-2xMS2 (GFP with amino acid position 39 encoded by an ochre codon and with 2 MS2 RNA stem-loops)
DNA: (the MS2 stem-loops and the ochre codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGATTACAAAGATGATGATGATAAAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAAGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGCCAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCATCACCATCACCATCACTGAGGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCAT GT(SEQ ID NO:185)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASDYKDDDDKVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATXGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKANFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH(SEQ ID NO:186)
mCherry185TAA-2xMS2 (mCherry with amino acid position 185 encoded by an ochre codon and with 2 MS2 RNA stem-loops)
DNA: (the MS2 stem-loops and the ochre codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGTAACCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGCATCATCATCATCATCATTGAAGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCAC CCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:187)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKXPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKHHHHHH(SEQ ID NO:188)
GFP39TGA-2xMS2 (GFP with amino acid position 39 encoded by an opal codon and with 2 MS2 RNA stem-loops)
DNA: (the MS2 stem-loops and the opal codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGATTACAAAGATGATGATGATAAAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTGAGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGCCAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCATCACCATCACCATCACTAAGGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCAT GT(SEQ ID NO:189)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASDYKDDDDKVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATXGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKANFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH(SEQ ID NO:190)
mCherry185TGA-2xMS2 (mCherry with amino acid position 185 encoded by an opal codon and with 2 MS2 RNA stem-loops)
DNA: (the MS2 stem-loops and the opal codon are underlined)
ATGGGCCGCCTGGAAAGCACCCCGCCGAAAAAAAAACGCAAAGTGGAAGATAGCGCGAGCGTGAGCAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCCTGCAGGACGGCGAGTTCATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGGTCAAGACCACCTACAAGGCCAAGTGACCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGCATCATCATCATCATCATTAAAGATCCTAAGGTACCTAATTGCCTAGAAAACATGAGGATCAC CCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:191)
Protein: (X denotes the non-canonical amino acid)
MGRLESTPPKKKRKVEDSASVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKXPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKHHHHHH(SEQ ID NO:192)
Nup153 (Homo sapiens nucleoporin 153; UniProt: P49790)
DNA:
ATGGCGTCTGGTGCTGGCGGTGTTGGTGGAGGAGGTGGGGGTAAAATTCGTACTCGTCGCTGTCATCAAGGTCCGATTAAACCGTATCAGCAGGGACGTCAGCAACATCAGGGTATTCTGAGCCGTGTGACCGAAAGCGTGAAAAACATTGTGCCGGGTTGGCTGCAACGTTATTTCAACAAAAATGAGGATGTGTGTTCGTGTTCTACCGATACCAGTGAAGTTCCTCGTTGGCCGGAAAACAAAGAAGATCACCTGGTGTATGCCGATGAAGAATCGAGCAATATCACCGATGGCCGTATTACTCCTGAACCGGCGGTTAGTAACACTGAAGAACCGTCAACCACAAGCACAGCATCGAACTATCCAGATGTCCTGACTCGCCCTTCTCTGCACCGTTCTCACCTGAACTTTAGCATGCTGGAATCACCAGCTCTGCATTGTCAGCCGTCTACCAGTAGTGCCTTCCCGATTGGCTCTAGTGGCTTTTCGCTGGTCAAAGAGATCAAAGACTCGACCTCTCAACATGACGATGATAACATTAGCACGACCTCGGGTTTTAGTAGCCGTGCCTCCGATAAAGACATTACCGTGAGCAAAAACACCTCTCTGCCGCCTCTGTGGAGTCCTGAAGCCGAACGCTCTCATAGTCTGTCTCAGCACACAGCCACCAGTTCCAAAAAACCAGCCTTCAACCTGAGCGCCTTTGGTACACTGTCACCGAGCCTGGGAAATTCCTCTATCCTGAAAACATCACAGCTGGGCGATAGTCCGTTTTATCCGGGCAAAACGACGTATGGTGGTGCCGCTGCTGCTGTTCGCCAGTCTAAACTGCGTAACACTCCGTATCAAGCTCCAGTCCGTCGCCAAATGAAAGCAAAACAACTGTCGGCCCAGTCTTATGGTGTGACAAGCTCTACAGCTCGTCGTATCCTGCAAAGTCTGGAGAAAATGTCATCTCCGCTGGCAGATGCCAAACGTATTCCGTCCATTGTGAGCAGTCCGCTGAATAGCCCGCTGGACCGTAGTGGGATCGATATCACCGACTTCCAAGCCAAACGTGAGAAAGTGGATAGCCAGTATCCGCCTGTACAACGTCTGATGACCCCGAAACCGGTTTCAATTGCCACGAATCGTAGCGTGTATTTCAAACCGTCACTGACCCCTAGTGGTGAGTTTCGTAAAACAAATCAGCGTATCGACAACAAATGCTCTACCGGGTATGAAAAAAACATGACGCCGGGACAGAATCGTGAACAACGTGAATCTGGCTTCTCTTATCCGAACTTTAGTCTGCCGGCAGCAAATGGTCTGAGTAGCGGTGTAGGAGGTGGTGGGGGCAAAATGCGCCGTGAACGTCACGCCTTTGTGGCCTCTAAACCTCTGGAAGAAGAAGAGATGGAGGTTCCTGTACTGCCGAAAATCAGTCTGCCTATCACCTCTTCAAGTCTGCCGACCTTCAACTTTTCTAGTCCGGAAATCACAACCTCTAGCCCGTCACCGATTAATAGCAGTCAAGCACTGACGAATAAAGTCCAAATGACCTCACCGAGTTCTACGGGTTCTCCGATGTTCAAATTCTCTAGTCCTATCGTGAAATCAACCGAAGCGAACGTCCTGCCTCCTTCTAGTATTGGGTTCACCTTTAGCGTCCCAGTGGCCAAAACAGCTGAACTGAGCGGTAGCAGTAGTACTCTGGAACCGATTATCAGCTCAAGCGCCCATCATGTCACTACCGTGAATAGCACAAACTGTAAAAAAACGCCGCCTGAGGACTGTGAAGGACCGTTTCGTCCTGCCGAAATCCTGAAAGAAGGTTCCGTCCTGGACATTCTGAAATCTCCGGGATTTGCCTCTCCTAAAATCGACTCTGTTGCCGCTCAACCAACTGCCACATCACCGGTGGTTTATACTCGTCCGGCGATTAGCAGTTTTAGCAGTAGTGGCATCGGTTTTGGTGAATCCCTGAAAGCTGGCTCATCTTGGCAGTGTGACACCTG
CCTGCTGCAAAACAAAGTGACCGATAACAAATGTATTGCCTGTCAGGCCGCCAAACTGTCTCCTCGTGATACAGCCAAACAGACCGGCATCGAAACCCCTAATAAAAGCGGGAAAACGACCCTGTCAGCAAGTGGTACGGGATTTGGGGACAAATTCAAACCTGTGATCGGCACATGGGACTGTGACACTTGTCTGGTACAGAACAAACCAGAAGCGATCAAATGTGTGGCCTGTGAAACGCCTAAACCTGGAACATGTGTGAAACGTGCCCTGACTCTGACTGTTGTGTCAGAAAGCGCCGAAACCATGACGGCAAGCAGCTCATCCTGTACTGTGACTACCGGGACTCTGGGATTTGGTGACAAATTCAAACGCCCGATTGGTTCCTGGGAATGCTCCGTGTGTTGTGTGAGCAATAATGCCGAGGACAACAAATGTGTGTCCTGTATGAGCGAGAAACCTGGCAGCTCTGTTCCTGCTAGCAGCTCTAGCACAGTTCCTGTTAGTCTGCCTAGTGGTGGTTCTCTGGGTCTGGAAAAATTCAAAAAACCTGAAGGAAGCTGGGATTGTGAGCTGTGCCTGGTACAGAATAAAGCGGATAGCACGAAATGTCTGGCCTGTGAGTCAGCCAAACCAGGGACTAAAAGCGGCTTTAAAGGCTTCGACACGTCGAGCAGTTCTAGTAACAGCGCCGCCTCATCATCTTTCAAATTTGGGGTGAGCAGCTCCTCTAGTGGTCCTAGTCAAACACTGACCTCTACCGGAAACTTCAAATTCGGCGATCAGGGTGGCTTCAAAATTGGTGTCTCCTCTGATTCGGGTAGCATTAACCCGATGAGTGAGGGGTTCAAATTCAGCAAACCAATTGGCGATTTCAAATTCGGTGTGTCGTCTGAATCCAAACCTGAAGAAGTCAAAAAAGACAGCAAAAACGACAATTTCAAATTCGGCCTGTCTAGTGGTCTGTCTAATCCGGTTAGCCTGACCCCGTTTCAGTTCGGGGTGTCTAATCTGGGTCAGGAAGAGAAAAAAGAGGAGCTGCCTAAAAGTTCATCTGCCGGGTTCAGTTTTGGTACAGGCGTGATCAATAGCACTCCAGCACCAGCCAATACAATCGTGACGAGCGAGAACAAATCGAGCTTCAACCTGGGGACAATCGAAACGAAAAGCGCCAGTGTAGCGCCATTCACGTGTAAAACCTCCGAGGCAAAAAAAGAAGAGATGCCGGCCACAAAAGGTGGATTCTCATTCGGCAACGTGGAACCGGCTAGCCTGCCATCAGCAAGCGTGTTTGTACTGGGCCGTACCGAGGAGAAACAGCAGGAACCTGTTACTAGCACCAGTCTGGTCTTTGGTAAAAAAGCCGACAACGAAGAACCGAAATGTCAGCCAGTGTTCAGCTTCGGCAATAGCGAACAGACGAAAGACGAAAACAGCAGCAAATCGACGTTCAGCTTCAGTATGACGAAACCGAGCGAAAAAGAAAGTGAGCAGCCAGCAAAAGCAACGTTCGCCTTTGGAGCACAGACATCAACCACAGCCGATCAAGGAGCAGCGAAACCAGTTTTCAGTTTTCTGAATAACAGCTCAAGCAGCAGTTCTACACCAGCAACCTCAGCAGGTGGTGGGATCTTTGGATCAAGCACCTCATCCAGCAATCCGCCAGTGGCAACATTCGTGTTTGGCCAGAGCAGTAATCCGGTGTCATCTTCAGCATTTGGGAATACCGCCGAGAGTAGCACATCACAGTCTCTGCTGTTCTCACAGGACTCTAAACTGGCAACCACCTCTTCTACTGGTACAGCGGTTACCCCGTTTGTGTTCGGTCCGGGAGCATCATCCAATAATACCACGACGTCGGGCTTTGGGTTTGGTGCCACGACAACAAGCAGTAGCGCTGGTAGCAGCTTTGTCTTTGGCACAGGTCCTTCAGCACCTTCTGCTTCACCAGCTTTCGGAGCCAATCAGACTCCGACATTCGGACAGTCACAGGGTGCCT
CTCAACCAAATCCTCCGGGTTTTGGCAGTATTAGCAGTAGTACCGCCCTGTTCCCGACCGGTAGTCAACCGGCACCGCCAACATTTGGAACGGTTAGCAGTAGTAGTCAGCCTCCGGTGTTTGGACAACAACCGAGCCAGAGCGCCTTCGGATCAGGAACGACCCCTAATAGTAGCAGTGCCTTCCAGTTCGGTAGCAGTACCACCAACTTCAACTTCACGAACAATAGCCCGTCAGGTGTGTTCACGTTTGGCGCCAATTCTTCTACCCCAGCGGCAAGTGCTCAACCTTCAGGCTCAGGTGGATTTCCTTTCAACCAGTCACCAGCAGCGTTTACTGTTGGTTCTAACGGGAAAAACGTTTTCAGTAGCAGCGGCACCTCGTTTTCTGGTCGTAAAATCAAAACGGCCGTTCGTCGCCGTAAA(SEQ ID NO:193)
Protein:
MASGAGGVGGGGGGKIRTRRCHQGPIKPYQQGRQQHQGILSRVTESVKNIVPGWLQRYFNKNEDVCSCSTDTSEVPRWPENKEDHLVYADEESSNITDGRITPEPAVSNTEEPSTTSTASNYPDVLTRPSLHRSHLNFSMLESPALHCQPSTSSAFPIGSSGFSLVKEIKDSTSQHDDDNISTTSGFSSRASDKDITVSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFNLSAFGTLSPSLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQSKLRNTPYQAPVRRQMKAKQLSAQSYGVTSSTARRILQSLEKMSSPLADAKRIPSIVSSPLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLMTPKPVSIATNRSVYFKPSLTPSGEFRKTNQRIDNKCSTGYEKNMTPGQNREQRESGFSYPNFSLPAANGLSSGVGGGGGKMRRERHAFVASKPLEEEEMEVPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSSQALTNKVQMTSPSSTGSPMFKFSSPIVKSTEANVLPPSSIGFTFSVPVAKTAELSGSSSTLEPIISSSAHHVTTVNSTNCKKTPPEDCEGPFRPAEILKEGSVLDILKSPGFASPKIDSVAAQPTATSPVVYTRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCIACQAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGTWDCDTCLVQNKPEAIKCVACETPKPGTCVKRALTLTVVSESAETMTASSSSCTVTTGTLGFGDKFKRPIGSWECSVCCVSNNAEDNKCVSCMSEKPGSSVPASSSSTVPVSLPSGGSLGLEKFKKPEGSWDCELCLVQNKADSTKCLACESAKPGTKSGFKGFDTSSSSSNSAASSSFKFGVSSSSSGPSQTLTSTGNFKFGDQGGFKIGVSSDSGSINPMSEGFKFSKPIGDFKFGVSSESKPEEVKKDSKNDNFKFGLSSGLSNPVSLTPFQFGVSNLGQEEKKEELPKSSSAGFSFGTGVINSTPAPANTIVTSENKSSFNLGTIETKSASVAPFTCKTSEAKKEEMPATKGGFSFGNVEPASLPSASVFVLGRTEEKQQEPVTSTSLVFGKKADNEEPKCQPVFSFGNSEQTKDENSSKSTFSFSMTKPSEKESEQPAKATFAFGAQTSTTADQGAAKPVFSFLNNSSSSSSTPATSAGGGIFGSSTSSSNPPVATFVFGQSSNPVSSSAFGNTAESSTSQSLLFSQDSKLATTSSTGTAVTPFVFGPGASSNNTTTSGFGFGATTTSSSAGSSFVFGTGPSAPSASPAFGANQTPTFGQSQGASQPNPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGQQPSQSAFGSGTTPNSSSAFQFGSSTTNFNFTNNSPSGVFTFGANSSTPAASAQPSGSGGFPFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIKTAVRRRK(SEQ ID NO:194)
Vim116TAG (Homo sapiens vimentin with amino acid position 116 encoded by an amber codon; UniProt: P08670)
DNA: (the amber codon is underlined)
ATGTCCACCAGGTCCGTGTCCTCGTCCTCCTACCGCAGGATGTTCGGCGGCCCGGGCACCGCGAGCCGGCCGAGCTCCAGCCGGAGCTACGTGACTACGTCCACCCGCACCTACAGCCTGGGCAGCGCGCTGCGCCCCAGCACCAGCCGCAGCCTCTACGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCCTCTGCCGTGCGCCTGCGGAGCAGCGTGCCCGGGGTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGACGCCATCAACACCGAGTTCAAGAACACCCGCACCAACGAGAAGGTGGAGCTGCAGGAGCTGAATGACCGCTTCGCCTAGTACATCGACAAGGTGCGCTTCCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTCGAGCAGCTCAAGGGCCAAGGCAAGTCGCGCCTGGGGGACCTCTACGAGGAGGAGATGCGGGAGCTGCGCCGGCAGGTGGACCAGCTAACCAACGACAAAGCCCGCGTCGAGGTGGAGCGCGACAACCTGGCCGAGGACATCATGCGCCTCCGGGAGAAATTGCAGGAGGAGATGCTTCAGAGAGAGGAAGCCGAAAACACCCTGCAATCTTTCAGACAGGATGTTGACAATGCGTCTCTGGCACGTCTTGACCTTGAACGCAAAGTGGAATCTTTGCAAGAAGAGATTGCCTTTTTGAAGAAACTCCACGAAGAGGAAATCCAGGAGCTGCAGGCTCAGATTCAGGAACAGCATGTCCAAATCGATGTGGATGTTTCCAAGCCTGACCTCACGGCTGCCCTGCGTGACGTACGTCAGCAATATGAAAGTGTGGCTGCCAAGAACCTGCAGGAGGCAGAAGAATGGTACAAATCCAAGTTTGCTGACCTCTCTGAGGCTGCCAACCGGAACAATGACGCCCTGCGCCAGGCAAAGCAGGAGTCCACTGAGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCCCTTAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCCGTTGAAGCTGCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATGAAGGAGGAAATGGCTCGTCACCTTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCCCTTGACATTGAGATTGCCACCTACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCTTCCAAACTTTTCCTCCCTGAACCTGAGGGAAACTAATCTGGATTCACTCCCTCTGGTTGATACCCACTCAAAAAGGACACTTCTGATTAAGACGGTTGAAACTAGAGATGGACAGGTTATCAACGAAACTTCTCAGCATCACGATGACCTTGAA(SEQ ID NO:195)
Protein: (X denotes the non-canonical amino acid)
MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAINTEFKNTRTNEKVELQELNDRFAXYIDKVRFLEQQNKILLAELEQLKGQGKSRLGDLYEEEMRELRRQVDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAENTLQSFRQDVDNASLARLDLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDVSKPDLTAALRDVRQQYESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYRRQVQSLTCEVDALKGTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHLREYQDLLNVKMALDIEIATYRKLLEGEESRISLPLPNFSSLNLRETNLDSLPLVDTHSKRTLLIKTVETRDGQVINETSQHHDDLE(SEQ ID NO:196)
INSR676TAG (Homo sapiens insulin receptor; UniProt: P06213)
DNA: (the amber codon is underlined)
ATGGGCACCGGGGGCCGGCGGGGGGCGGCGGCCGCGCCGCTGCTGGTGGCGGTGGCCGCGCTGCTACTGGGCGCCGCGGGCCACCTGTACCCCGGAGAGGTGTGTCCCGGCATGGATATCCGGAACAACCTCACTAGGTTGCATGAGCTGGAGAATTGCTCTGTCATCGAAGGACACTTGCAGATACTCTTGATGTTCAAAACGAGGCCCGAAGATTTCCGAGACCTCAGTTTCCCCAAACTCATCATGATCACTGATTACTTGCTGCTCTTCCGGGTCTATGGGCTCGAGAGCCTGAAGGACCTGTTCCCCAACCTCACGGTCATCCGGGGATCACGACTGTTCTTTAACTACGCGCTGGTCATCTTCGAGATGGTTCACCTCAAGGAACTCGGCCTCTACAACCTGATGAACATCACCCGGGGTTCTGTCCGCATCGAGAAGAACAATGAGCTCTGTTACTTGGCCACTATCGACTGGTCCCGTATCCTGGATTCCGTGGAGGATAATTACATCGTGTTGAACAAAGATGACAACGAGGAGTGTGGAGACATCTGTCCGGGTACCGCGAAGGGCAAGACCAACTGCCCCGCCACCGTCATCAACGGGCAGTTTGTCGAACGATGTTGGACTCATAGTCACTGCCAGAAAGTTTGCCCGACCATCTGTAAGTCACACGGCTGCACCGCCGAAGGCCTCTGTTGCCACAGCGAGTGCCTGGGCAACTGTTCTCAGCCCGACGACCCCACCAAGTGCGTGGCCTGCCGCAACTTCTACCTGGATGGCAGGTGTGTGGAGACCTGCCCGCCCCCGTACTACCACTTCCAGGACTGGCGCTGTGTGAACTTCAGCTTCTGCCAGGACCTGCACCACAAATGCAAGAACTCGCGGAGGCAGGGCTGCCACCAGTACGTCATTCACAACAACAAGTGCATCCCTGAGTGTCCCTCCGGGTACACGATGAATTCCAGCAACTTGCTGTGCACCCCATGCCTGGGTCCCTGTCCCAAGGTGTGCCACCTCCTAGAAGGCGAGAAGACCATCGACTCGGTGACGTCTGCCCAGGAGCTCCGAGGATGCACCGTCATCAACGGGAGTCTGATCATCAACATTCGAGGAGGCAACAATCTGGCAGCTGAGCTAGAAGCCAACCTCGGCCTCATTGAAGAAATTTCAGGGTATCTAAAAATCCGCCGATCCTACGCTCTGGTGTCACTTTCCTTCTTCCGGAAGTTACGTCTGATTCGAGGAGAGACCTTGGAAATTGGGAACTACTCCTTCTATGCCTTGGACAACCAGAACCTAAGGCAGCTCTGGGACTGGAGCAAACACAACCTCACCATCACTCAGGGGAAACTCTTCTTCCACTATAACCCCAAACTCTGCTTGTCAGAAATCCACAAGATGGAAGAAGTTTCAGGAACCAAGGGGCGCCAGGAGAGAAACGACATTGCCCTGAAGACCAATGGGGACCAGGCATCCTGTGAAAATGAGTTACTTAAATTTTCTTACATTCGGACATCTTTTGACAAGATCTTGCTGAGATGGGAGCCGTACTGGCCCCCCGACTTCCGAGACCTCTTGGGGTTCATGCTGTTCTACAAAGAGGCCCCTTATCAGAATGTGACGGAGTTCGACGGGCAGGATGCATGTGGTTCCAACAGTTGGACGGTGGTAGACATTGACCCACCCCTGAGGTCCAACGACCCCAAATCACAGAACCACCCAGGGTGGCTGATGCGGGGTCTCAAGCCCTGGACCCAGTATGCCATCTTTGTGAAGACCCTGGTCACCTTTTCGGATGAACGCCGGACCTATGGGGCCAAGAGTGACATCATTTATGTCCAGACAGATGCCACCAACCCCTCTGTGCCCCTGGATCCAATCTCAGTGTCTAACTCATCATCCCAGATTATTCTGAAGTGGAAACCACCCTCCGACCCCAATGGCAACATCACCCACTACCTGGTTTTCTGGGAGAGGCAGGCGGAAGACAGTGA
GCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCCTCGAGGACCTGGTCTCCACCATTCGAGTCTGAAGATTCTCAGAAGCACAACCAGAGTGAGTATGAGGATTCGGCCGGCGAATGCTGCTCCTGTCCAAAGACAGACTCTCAGATCCTGAAGGAGCTGGAGGAGTCCTCGTTTAGGAAGACGTTTGAGGATTACCTGCACAACGTGGTTTTCGTCCCCAGGCCATCTCGGAAACGCAGGTCCCTTGGCGATGTTGGGAATGTGACGGTGGCCGTGCCCACGGTGGCAGCTTTCCCCAACACTTCCTCGACCAGCGTGCCCACGAGTCCGGAGGAGCACAGGCCTTTTGAGAAGGTGGTGAACAAGGAGTCGCTGGTCATCTCCGGCTTGCGACACTTCACGGGCTATCGCATCGAGCTGCAGGCTTGCAACCAGGACACCCCTGAGGAACGGTGCAGTGTGGCAGCCTACGTCAGTGCGAGGACCATGCCTGAAGCCAAGGCTGATGACATTGTTGGCCCTGTGACGCATGAAATCTTTGAGAACAACGTCGTCCACTTGATGTGGCAGGAGCCGAAGGAGCCCAATGGTCTGATCGTGCTGTATGAAGTGAGTTATCGGCGATATGGTGATGAGGAGCTGCATCTCTGCGTCTCCCGCAAGCACTTCGCTCTGGAACGGGGCTGCAGGCTGCGTGGGCTGTCACCGGGGAACTACAGCGTGCGAATCCGGGCCACCTCCCTTGCGGGCAACGGCTCTTGGACGGAACCCACCTATTTCTACGTGACAGACTATTTAGACGTCCCGTCAAATATTGCAAAAATTATCATCGGCCCCCTCATCTTTGTCTTTCTCTTCAGTGTTGTGATTGGAAGTATTTATCTATTCCTGAGAAAGAGGCAGCCAGATGGGCCGCTGGGACCGCTTTACGCTTCTTCAAACCCTGAGTATCTCAGTGCCAGTGATGTGTTTCCATGCTCTGTGTACGTGCCGGACGAGTGGGAGGTGTCTCGAGAGAAGATCACCCTCCTTCGAGAGCTGGGGCAGGGCTCCTTCGGCATGGTGTATGAGGGCAATGCCAGGGACATCATCAAGGGTGAGGCAGAGACCCGCGTGGCGGTGAAGACGGTCAACGAGTCAGCCAGTCTCCGAGAGCGGATTGAGTTCCTCAATGAGGCCTCGGTCATGAAGGGCTTCACCTGCCATCACGTGGTGCGCCTCCTGGGAGTGGTGTCCAAGGGCCAGCCCACGCTGGTGGTGATGGAGCTGATGGCTCACGGAGACCTGAAGAGCTACCTCCGTTCTCTGCGGCCAGAGGCTGAGAATAATCCTGGCCGCCCTCCCCCTACCCTTCAAGAGATGATTCAGATGGCGGCAGAGATTGCTGACGGGATGGCCTACCTGAACGCCAAGAAGTTTGTGCATCGGGACCTGGCAGCGAGAAACTGCATGGTCGCCCATGATTTTACTGTCAAAATTGGAGACTTTGGAATGACCAGAGACATCTATGAAACGGATTACTACCGGAAAGGGGGCAAGGGTCTGCTCCCTGTACGGTGGATGGCACCGGAGTCCCTGAAGGATGGGGTCTTCACCACTTCTTCTGACATGTGGTCCTTTGGCGTGGTCCTTTGGGAAATCACCAGCTTGGCAGAACAGCCTTACCAAGGCCTGTCTAATGAACAGGTGTTGAAATTTGTCATGGATGGAGGGTATCTGGATCAACCCGACAACTGTCCAGAGAGAGTCACTGACCTCATGCGCATGTGCTGGCAATTCAACCCCAACATGAGGCCAACCTTCCTGGAGATTGTCAACCTGCTCAAGGACGACCTGCACCCCAGCTTTCCAGAGGTGTCGTTCTTCCACAGCGAGGAGAACAAGGCTCCCGAGAGTGAGGAGCTGGAGATGGAGTTTGAGGACATGGAGAATGTGCCCCTGGACCGTTCCTCGCACTGTCAGAGGGAGGAGGCGGGGGGCCGGGATGGAG
GGTCCTCGCTGGGTTTCAAGCGGAGCTACGAGGAACACATCCCTTACACACACATGAACGGAGGCAAGAAAAACGGGCGGATTCTGACCTTGCCTCGGTCCAATCCTTCCT(SEQ ID NO:197)
Protein: (X denotes the non-canonical amino acid)
MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLXGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPS(SEQ ID NO:198)
Nup153-EGFP149TAG
DNA: (the amber codon is underlined)
ATGGCGTCTGGTGCTGGCGGTGTTGGTGGAGGAGGTGGGGGTAAAATTCGTACTCGTCGCTGTCATCAAGGTCCGATTAAACCGTATCAGCAGGGACGTCAGCAACATCAGGGTATTCTGAGCCGTGTGACCGAAAGCGTGAAAAACATTGTGCCGGGTTGGCTGCAACGTTATTTCAACAAAAATGAGGATGTGTGTTCGTGTTCTACCGATACCAGTGAAGTTCCTCGTTGGCCGGAAAACAAAGAAGATCACCTGGTGTATGCCGATGAAGAATCGAGCAATATCACCGATGGCCGTATTACTCCTGAACCGGCGGTTAGTAACACTGAAGAACCGTCAACCACAAGCACAGCATCGAACTATCCAGATGTCCTGACTCGCCCTTCTCTGCACCGTTCTCACCTGAACTTTAGCATGCTGGAATCACCAGCTCTGCATTGTCAGCCGTCTACCAGTAGTGCCTTCCCGATTGGCTCTAGTGGCTTTTCGCTGGTCAAAGAGATCAAAGACTCGACCTCTCAACATGACGATGATAACATTAGCACGACCTCGGGTTTTAGTAGCCGTGCCTCCGATAAAGACATTACCGTGAGCAAAAACACCTCTCTGCCGCCTCTGTGGAGTCCTGAAGCCGAACGCTCTCATAGTCTGTCTCAGCACACAGCCACCAGTTCCAAAAAACCAGCCTTCAACCTGAGCGCCTTTGGTACACTGTCACCGAGCCTGGGAAATTCCTCTATCCTGAAAACATCACAGCTGGGCGATAGTCCGTTTTATCCGGGCAAAACGACGTATGGTGGTGCCGCTGCTGCTGTTCGCCAGTCTAAACTGCGTAACACTCCGTATCAAGCTCCAGTCCGTCGCCAAATGAAAGCAAAACAACTGTCGGCCCAGTCTTATGGTGTGACAAGCTCTACAGCTCGTCGTATCCTGCAAAGTCTGGAGAAAATGTCATCTCCGCTGGCAGATGCCAAACGTATTCCGTCCATTGTGAGCAGTCCGCTGAATAGCCCGCTGGACCGTAGTGGGATCGATATCACCGACTTCCAAGCCAAACGTGAGAAAGTGGATAGCCAGTATCCGCCTGTACAACGTCTGATGACCCCGAAACCGGTTTCAATTGCCACGAATCGTAGCGTGTATTTCAAACCGTCACTGACCCCTAGTGGTGAGTTTCGTAAAACAAATCAGCGTATCGACAACAAATGCTCTACCGGGTATGAAAAAAACATGACGCCGGGACAGAATCGTGAACAACGTGAATCTGGCTTCTCTTATCCGAACTTTAGTCTGCCGGCAGCAAATGGTCTGAGTAGCGGTGTAGGAGGTGGTGGGGGCAAAATGCGCCGTGAACGTCACGCCTTTGTGGCCTCTAAACCTCTGGAAGAAGAAGAGATGGAGGTTCCTGTACTGCCGAAAATCAGTCTGCCTATCACCTCTTCAAGTCTGCCGACCTTCAACTTTTCTAGTCCGGAAATCACAACCTCTAGCCCGTCACCGATTAATAGCAGTCAAGCACTGACGAATAAAGTCCAAATGACCTCACCGAGTTCTACGGGTTCTCCGATGTTCAAATTCTCTAGTCCTATCGTGAAATCAACCGAAGCGAACGTCCTGCCTCCTTCTAGTATTGGGTTCACCTTTAGCGTCCCAGTGGCCAAAACAGCTGAACTGAGCGGTAGCAGTAGTACTCTGGAACCGATTATCAGCTCAAGCGCCCATCATGTCACTACCGTGAATAGCACAAACTGTAAAAAAACGCCGCCTGAGGACTGTGAAGGACCGTTTCGTCCTGCCGAAATCCTGAAAGAAGGTTCCGTCCTGGACATTCTGAAATCTCCGGGATTTGCCTCTCCTAAAATCGACTCTGTTGCCGCTCAACCAACTGCCACATCACCGGTGGTTTATACTCGTCCGGCGATTAGCAGTTTTAGCAGTAGTGGCATCGGTTTTGGTGAATCCCTGAAAGCTGGCTCATCTTGGCAGTGTGACACCTG
CCTGCTGCAAAACAAAGTGACCGATAACAAATGTATTGCCTGTCAGGCCGCCAAACTGTCTCCTCGTGATACAGCCAAACAGACCGGCATCGAAACCCCTAATAAAAGCGGGAAAACGACCCTGTCAGCAAGTGGTACGGGATTTGGGGACAAATTCAAACCTGTGATCGGCACATGGGACTGTGACACTTGTCTGGTACAGAACAAACCAGAAGCGATCAAATGTGTGGCCTGTGAAACGCCTAAACCTGGAACATGTGTGAAACGTGCCCTGACTCTGACTGTTGTGTCAGAAAGCGCCGAAACCATGACGGCAAGCAGCTCATCCTGTACTGTGACTACCGGGACTCTGGGATTTGGTGACAAATTCAAACGCCCGATTGGTTCCTGGGAATGCTCCGTGTGTTGTGTGAGCAATAATGCCGAGGACAACAAATGTGTGTCCTGTATGAGCGAGAAACCTGGCAGCTCTGTTCCTGCTAGCAGCTCTAGCACAGTTCCTGTTAGTCTGCCTAGTGGTGGTTCTCTGGGTCTGGAAAAATTCAAAAAACCTGAAGGAAGCTGGGATTGTGAGCTGTGCCTGGTACAGAATAAAGCGGATAGCACGAAATGTCTGGCCTGTGAGTCAGCCAAACCAGGGACTAAAAGCGGCTTTAAAGGCTTCGACACGTCGAGCAGTTCTAGTAACAGCGCCGCCTCATCATCTTTCAAATTTGGGGTGAGCAGCTCCTCTAGTGGTCCTAGTCAAACACTGACCTCTACCGGAAACTTCAAATTCGGCGATCAGGGTGGCTTCAAAATTGGTGTCTCCTCTGATTCGGGTAGCATTAACCCGATGAGTGAGGGGTTCAAATTCAGCAAACCAATTGGCGATTTCAAATTCGGTGTGTCGTCTGAATCCAAACCTGAAGAAGTCAAAAAAGACAGCAAAAACGACAATTTCAAATTCGGCCTGTCTAGTGGTCTGTCTAATCCGGTTAGCCTGACCCCGTTTCAGTTCGGGGTGTCTAATCTGGGTCAGGAAGAGAAAAAAGAGGAGCTGCCTAAAAGTTCATCTGCCGGGTTCAGTTTTGGTACAGGCGTGATCAATAGCACTCCAGCACCAGCCAATACAATCGTGACGAGCGAGAACAAATCGAGCTTCAACCTGGGGACAATCGAAACGAAAAGCGCCAGTGTAGCGCCATTCACGTGTAAAACCTCCGAGGCAAAAAAAGAAGAGATGCCGGCCACAAAAGGTGGATTCTCATTCGGCAACGTGGAACCGGCTAGCCTGCCATCAGCAAGCGTGTTTGTACTGGGCCGTACCGAGGAGAAACAGCAGGAACCTGTTACTAGCACCAGTCTGGTCTTTGGTAAAAAAGCCGACAACGAAGAACCGAAATGTCAGCCAGTGTTCAGCTTCGGCAATAGCGAACAGACGAAAGACGAAAACAGCAGCAAATCGACGTTCAGCTTCAGTATGACGAAACCGAGCGAAAAAGAAAGTGAGCAGCCAGCAAAAGCAACGTTCGCCTTTGGAGCACAGACATCAACCACAGCCGATCAAGGAGCAGCGAAACCAGTTTTCAGTTTTCTGAATAACAGCTCAAGCAGCAGTTCTACACCAGCAACCTCAGCAGGTGGTGGGATCTTTGGATCAAGCACCTCATCCAGCAATCCGCCAGTGGCAACATTCGTGTTTGGCCAGAGCAGTAATCCGGTGTCATCTTCAGCATTTGGGAATACCGCCGAGAGTAGCACATCACAGTCTCTGCTGTTCTCACAGGACTCTAAACTGGCAACCACCTCTTCTACTGGTACAGCGGTTACCCCGTTTGTGTTCGGTCCGGGAGCATCATCCAATAATACCACGACGTCGGGCTTTGGGTTTGGTGCCACGACAACAAGCAGTAGCGCTGGTAGCAGCTTTGTCTTTGGCACAGGTCCTTCAGCACCTTCTGCTTCACCAGCTTTCGGAGCCAATCAGACTCCGACATTCGGACAGTCACAGGGTGCCT
CTCAACCAAATCCTCCGGGTTTTGGCAGTATTAGCAGTAGTACCGCCCTGTTCCCGACCGGTAGTCAACCGGCACCGCCAACATTTGGAACGGTTAGCAGTAGTAGTCAGCCTCCGGTGTTTGGACAACAACCGAGCCAGAGCGCCTTCGGATCAGGAACGACCCCTAATAGTAGCAGTGCCTTCCAGTTCGGTAGCAGTACCACCAACTTCAACTTCACGAACAATAGCCCGTCAGGTGTGTTCACGTTTGGCGCCAATTCTTCTACCCCAGCGGCAAGTGCTCAACCTTCAGGCTCAGGTGGATTTCCTTTCAACCAGTCACCAGCAGCGTTTACTGTTGGTTCTAACGGGAAAAACGTTTTCAGTAGCAGCGGCACCTCGTTTTCTGGTCGTAAAATCAAAACGGCCGTTCGTCGCCGTAAAGCGGATCCACCGGTCGCCACGAGAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACTAGGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA(SEQ ID NO:199)
Protein: (X denotes the non-canonical amino acid)
MASGAGGVGGGGGGKIRTRRCHQGPIKPYQQGRQQHQGILSRVTESVKNIVPGWLQRYFNKNEDVCSCSTDTSEVPRWPENKEDHLVYADEESSNITDGRITPEPAVSNTEEPSTTSTASNYPDVLTRPSLHRSHLNFSMLESPALHCQPSTSSAFPIGSSGFSLVKEIKDSTSQHDDDNISTTSGFSSRASDKDITVSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFNLSAFGTLSPSLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQSKLRNTPYQAPVRRQMKAKQLSAQSYGVTSSTARRILQSLEKMSSPLADAKRIPSIVSSPLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLMTPKPVSIATNRSVYFKPSLTPSGEFRKTNQRIDNKCSTGYEKNMTPGQNREQRESGFSYPNFSLPAANGLSSGVGGGGGKMRRERHAFVASKPLEEEEMEVPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSSQALTNKVQMTSPSSTGSPMFKFSSPIVKSTEANVLPPSSIGFTFSVPVAKTAELSGSSSTLEPIISSSAHHVTTVNSTNCKKTPPEDCEGPFRPAEILKEGSVLDILKSPGFASPKIDSVAAQPTATSPVVYTRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCIACQAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGTWDCDTCLVQNKPEAIKCVACETPKPGTCVKRALTLTVVSESAETMTASSSSCTVTTGTLGFGDKFKRPIGSWECSVCCVSNNAEDNKCVSCMSEKPGSSVPASSSSTVPVSLPSGGSLGLEKFKKPEGSWDCELCLVQNKADSTKCLACESAKPGTKSGFKGFDTSSSSSNSAASSSFKFGVSSSSSGPSQTLTSTGNFKFGDQGGFKIGVSSDSGSINPMSEGFKFSKPIGDFKFGVSSESKPEEVKKDSKNDNFKFGLSSGLSNPVSLTPFQFGVSNLGQEEKKEELPKSSSAGFSFGTGVINSTPAPANTIVTSENKSSFNLGTIETKSASVAPFTCKTSEAKKEEMPATKGGFSFGNVEPASLPSASVFVLGRTEEKQQEPVTSTSLVFGKKADNEEPKCQPVFSFGNSEQTKDENSSKSTFSFSMTKPSEKESEQPAKATFAFGAQTSTTADQGAAKPVFSFLNNSSSSSSTPATSAGGGIFGSSTSSSNPPVATFVFGQSSNPVSSSAFGNTAESSTSQSLLFSQDSKLATTSSTGTAVTPFVFGPGASSNNTTTSGFGFGATTTSSSAGSSFVFGTGPSAPSASPAFGANQTPTFGQSQGASQPNPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGQQPSQSAFGSGTTPNSSSAFQFGSSTTNFNFTNNSPSGVFTFGANSSTPAASAQPSGSGGFPFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIKTAVRRRKADPPVATRVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHXVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK(SEQ ID NO:200)
Nup153-EGFP149TAG-MS2
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGCGTCTGGTGCTGGCGGTGTTGGTGGAGGAGGTGGGGGTAAAATTCGTACTCGTCGCTGTCATCAAGGTCCGATTAAACCGTATCAGCAGGGACGTCAGCAACATCAGGGTATTCTGAGCCGTGTGACCGAAAGCGTGAAAAACATTGTGCCGGGTTGGCTGCAACGTTATTTCAACAAAAATGAGGATGTGTGTTCGTGTTCTACCGATACCAGTGAAGTTCCTCGTTGGCCGGAAAACAAAGAAGATCACCTGGTGTATGCCGATGAAGAATCGAGCAATATCACCGATGGCCGTATTACTCCTGAACCGGCGGTTAGTAACACTGAAGAACCGTCAACCACAAGCACAGCATCGAACTATCCAGATGTCCTGACTCGCCCTTCTCTGCACCGTTCTCACCTGAACTTTAGCATGCTGGAATCACCAGCTCTGCATTGTCAGCCGTCTACCAGTAGTGCCTTCCCGATTGGCTCTAGTGGCTTTTCGCTGGTCAAAGAGATCAAAGACTCGACCTCTCAACATGACGATGATAACATTAGCACGACCTCGGGTTTTAGTAGCCGTGCCTCCGATAAAGACATTACCGTGAGCAAAAACACCTCTCTGCCGCCTCTGTGGAGTCCTGAAGCCGAACGCTCTCATAGTCTGTCTCAGCACACAGCCACCAGTTCCAAAAAACCAGCCTTCAACCTGAGCGCCTTTGGTACACTGTCACCGAGCCTGGGAAATTCCTCTATCCTGAAAACATCACAGCTGGGCGATAGTCCGTTTTATCCGGGCAAAACGACGTATGGTGGTGCCGCTGCTGCTGTTCGCCAGTCTAAACTGCGTAACACTCCGTATCAAGCTCCAGTCCGTCGCCAAATGAAAGCAAAACAACTGTCGGCCCAGTCTTATGGTGTGACAAGCTCTACAGCTCGTCGTATCCTGCAAAGTCTGGAGAAAATGTCATCTCCGCTGGCAGATGCCAAACGTATTCCGTCCATTGTGAGCAGTCCGCTGAATAGCCCGCTGGACCGTAGTGGGATCGATATCACCGACTTCCAAGCCAAACGTGAGAAAGTGGATAGCCAGTATCCGCCTGTACAACGTCTGATGACCCCGAAACCGGTTTCAATTGCCACGAATCGTAGCGTGTATTTCAAACCGTCACTGACCCCTAGTGGTGAGTTTCGTAAAACAAATCAGCGTATCGACAACAAATGCTCTACCGGGTATGAAAAAAACATGACGCCGGGACAGAATCGTGAACAACGTGAATCTGGCTTCTCTTATCCGAACTTTAGTCTGCCGGCAGCAAATGGTCTGAGTAGCGGTGTAGGAGGTGGTGGGGGCAAAATGCGCCGTGAACGTCACGCCTTTGTGGCCTCTAAACCTCTGGAAGAAGAAGAGATGGAGGTTCCTGTACTGCCGAAAATCAGTCTGCCTATCACCTCTTCAAGTCTGCCGACCTTCAACTTTTCTAGTCCGGAAATCACAACCTCTAGCCCGTCACCGATTAATAGCAGTCAAGCACTGACGAATAAAGTCCAAATGACCTCACCGAGTTCTACGGGTTCTCCGATGTTCAAATTCTCTAGTCCTATCGTGAAATCAACCGAAGCGAACGTCCTGCCTCCTTCTAGTATTGGGTTCACCTTTAGCGTCCCAGTGGCCAAAACAGCTGAACTGAGCGGTAGCAGTAGTACTCTGGAACCGATTATCAGCTCAAGCGCCCATCATGTCACTACCGTGAATAGCACAAACTGTAAAAAAACGCCGCCTGAGGACTGTGAAGGACCGTTTCGTCCTGCCGAAATCCTGAAAGAAGGTTCCGTCCTGGACATTCTGAAATCTCCGGGATTTGCCTCTCCTAAAATCGACTCTGTTGCCGCTCAACCAACTGCCACATCACCGGTGGTTTATACTCGTCCGGCGATTAGCAGTTTTAGCAGTAGTGGCATCGGTTTTGGTGAATCCCTGAAAGCTGGCTCATCTTGGCAGTGTGACACCTG
CCTGCTGCAAAACAAAGTGACCGATAACAAATGTATTGCCTGTCAGGCCGCCAAACTGTCTCCTCGTGATACAGCCAAACAGACCGGCATCGAAACCCCTAATAAAAGCGGGAAAACGACCCTGTCAGCAAGTGGTACGGGATTTGGGGACAAATTCAAACCTGTGATCGGCACATGGGACTGTGACACTTGTCTGGTACAGAACAAACCAGAAGCGATCAAATGTGTGGCCTGTGAAACGCCTAAACCTGGAACATGTGTGAAACGTGCCCTGACTCTGACTGTTGTGTCAGAAAGCGCCGAAACCATGACGGCAAGCAGCTCATCCTGTACTGTGACTACCGGGACTCTGGGATTTGGTGACAAATTCAAACGCCCGATTGGTTCCTGGGAATGCTCCGTGTGTTGTGTGAGCAATAATGCCGAGGACAACAAATGTGTGTCCTGTATGAGCGAGAAACCTGGCAGCTCTGTTCCTGCTAGCAGCTCTAGCACAGTTCCTGTTAGTCTGCCTAGTGGTGGTTCTCTGGGTCTGGAAAAATTCAAAAAACCTGAAGGAAGCTGGGATTGTGAGCTGTGCCTGGTACAGAATAAAGCGGATAGCACGAAATGTCTGGCCTGTGAGTCAGCCAAACCAGGGACTAAAAGCGGCTTTAAAGGCTTCGACACGTCGAGCAGTTCTAGTAACAGCGCCGCCTCATCATCTTTCAAATTTGGGGTGAGCAGCTCCTCTAGTGGTCCTAGTCAAACACTGACCTCTACCGGAAACTTCAAATTCGGCGATCAGGGTGGCTTCAAAATTGGTGTCTCCTCTGATTCGGGTAGCATTAACCCGATGAGTGAGGGGTTCAAATTCAGCAAACCAATTGGCGATTTCAAATTCGGTGTGTCGTCTGAATCCAAACCTGAAGAAGTCAAAAAAGACAGCAAAAACGACAATTTCAAATTCGGCCTGTCTAGTGGTCTGTCTAATCCGGTTAGCCTGACCCCGTTTCAGTTCGGGGTGTCTAATCTGGGTCAGGAAGAGAAAAAAGAGGAGCTGCCTAAAAGTTCATCTGCCGGGTTCAGTTTTGGTACAGGCGTGATCAATAGCACTCCAGCACCAGCCAATACAATCGTGACGAGCGAGAACAAATCGAGCTTCAACCTGGGGACAATCGAAACGAAAAGCGCCAGTGTAGCGCCATTCACGTGTAAAACCTCCGAGGCAAAAAAAGAAGAGATGCCGGCCACAAAAGGTGGATTCTCATTCGGCAACGTGGAACCGGCTAGCCTGCCATCAGCAAGCGTGTTTGTACTGGGCCGTACCGAGGAGAAACAGCAGGAACCTGTTACTAGCACCAGTCTGGTCTTTGGTAAAAAAGCCGACAACGAAGAACCGAAATGTCAGCCAGTGTTCAGCTTCGGCAATAGCGAACAGACGAAAGACGAAAACAGCAGCAAATCGACGTTCAGCTTCAGTATGACGAAACCGAGCGAAAAAGAAAGTGAGCAGCCAGCAAAAGCAACGTTCGCCTTTGGAGCACAGACATCAACCACAGCCGATCAAGGAGCAGCGAAACCAGTTTTCAGTTTTCTGAATAACAGCTCAAGCAGCAGTTCTACACCAGCAACCTCAGCAGGTGGTGGGATCTTTGGATCAAGCACCTCATCCAGCAATCCGCCAGTGGCAACATTCGTGTTTGGCCAGAGCAGTAATCCGGTGTCATCTTCAGCATTTGGGAATACCGCCGAGAGTAGCACATCACAGTCTCTGCTGTTCTCACAGGACTCTAAACTGGCAACCACCTCTTCTACTGGTACAGCGGTTACCCCGTTTGTGTTCGGTCCGGGAGCATCATCCAATAATACCACGACGTCGGGCTTTGGGTTTGGTGCCACGACAACAAGCAGTAGCGCTGGTAGCAGCTTTGTCTTTGGCACAGGTCCTTCAGCACCTTCTGCTTCACCAGCTTTCGGAGCCAATCAGACTCCGACATTCGGACAGTCACAGGGTGCCT
CTCAACCAAATCCTCCGGGTTTTGGCAGTATTAGCAGTAGTACCGCCCTGTTCCCGACCGGTAGTCAACCGGCACCGCCAACATTTGGAACGGTTAGCAGTAGTAGTCAGCCTCCGGTGTTTGGACAACAACCGAGCCAGAGCGCCTTCGGATCAGGAACGACCCCTAATAGTAGCAGTGCCTTCCAGTTCGGTAGCAGTACCACCAACTTCAACTTCACGAACAATAGCCCGTCAGGTGTGTTCACGTTTGGCGCCAATTCTTCTACCCCAGCGGCAAGTGCTCAACCTTCAGGCTCAGGTGGATTTCCTTTCAACCAGTCACCAGCAGCGTTTACTGTTGGTTCTAACGGGAAAAACGTTTTCAGTAGCAGCGGCACCTCGTTTTCTGGTCGTAAAATCAAAACGGCCGTTCGTCGCCGTAAAGCGGATCCACCGGTCGCCACGAGAGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACTAGGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACTCTAGATCATAATCAGCACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAAC ATGAGGATCACCCATGT(SEQ ID NO:201)
Protein: (X denotes the non-canonical amino acid)
MASGAGGVGGGGGGKIRTRRCHQGPIKPYQQGRQQHQGILSRVTESVKNIVPGWLQRYFNKNEDVCSCSTDTSEVPRWPENKEDHLVYADEESSNITDGRITPEPAVSNTEEPSTTSTASNYPDVLTRPSLHRSHLNFSMLESPALHCQPSTSSAFPIGSSGFSLVKEIKDSTSQHDDDNISTTSGFSSRASDKDITVSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFNLSAFGTLSPSLGNSSILKTSQLGDSPFYPGKTTYGGAAAAVRQSKLRNTPYQAPVRRQMKAKQLSAQSYGVTSSTARRILQSLEKMSSPLADAKRIPSIVSSPLNSPLDRSGIDITDFQAKREKVDSQYPPVQRLMTPKPVSIATNRSVYFKPSLTPSGEFRKTNQRIDNKCSTGYEKNMTPGQNREQRESGFSYPNFSLPAANGLSSGVGGGGGKMRRERHAFVASKPLEEEEMEVPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSSQALTNKVQMTSPSSTGSPMFKFSSPIVKSTEANVLPPSSIGFTFSVPVAKTAELSGSSSTLEPIISSSAHHVTTVNSTNCKKTPPEDCEGPFRPAEILKEGSVLDILKSPGFASPKIDSVAAQPTATSPVVYTRPAISSFSSSGIGFGESLKAGSSWQCDTCLLQNKVTDNKCIACQAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGTWDCDTCLVQNKPEAIKCVACETPKPGTCVKRALTLTVVSESAETMTASSSSCTVTTGTLGFGDKFKRPIGSWECSVCCVSNNAEDNKCVSCMSEKPGSSVPASSSSTVPVSLPSGGSLGLEKFKKPEGSWDCELCLVQNKADSTKCLACESAKPGTKSGFKGFDTSSSSSNSAASSSFKFGVSSSSSGPSQTLTSTGNFKFGDQGGFKIGVSSDSGSINPMSEGFKFSKPIGDFKFGVSSESKPEEVKKDSKNDNFKFGLSSGLSNPVSLTPFQFGVSNLGQEEKKEELPKSSSAGFSFGTGVINSTPAPANTIVTSENKSSFNLGTIETKSASVAPFTCKTSEAKKEEMPATKGGFSFGNVEPASLPSASVFVLGRTEEKQQEPVTSTSLVFGKKADNEEPKCQPVFSFGNSEQTKDENSSKSTFSFSMTKPSEKESEQPAKATFAFGAQTSTTADQGAAKPVFSFLNNSSSSSSTPATSAGGGIFGSSTSSSNPPVATFVFGQSSNPVSSSAFGNTAESSTSQSLLFSQDSKLATTSSTGTAVTPFVFGPGASSNNTTTSGFGFGATTTSSSAGSSFVFGTGPSAPSASPAFGANQTPTFGQSQGASQPNPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGQQPSQSAFGSGTTPNSSSAFQFGSSTTNFNFTNNSPSGVFTFGANSSTPAASAQPSGSGGFPFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIKTAVRRRKADPPVATRVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHXVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK(SEQ ID NO:202)
Vim116TAG-mOrange
DNA: (the amber codon is underlined)
ATGTCCACCAGGTCCGTGTCCTCGTCCTCCTACCGCAGGATGTTCGGCGGCCCGGGCACCGCGAGCCGGCCGAGCTCCAGCCGGAGCTACGTGACTACGTCCACCCGCACCTACAGCCTGGGCAGCGCGCTGCGCCCCAGCACCAGCCGCAGCCTCTACGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCCTCTGCCGTGCGCCTGCGGAGCAGCGTGCCCGGGGTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGACGCCATCAACACCGAGTTCAAGAACACCCGCACCAACGAGAAGGTGGAGCTGCAGGAGCTGAATGACCGCTTCGCCTAGTACATCGACAAGGTGCGCTTCCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTCGAGCAGCTCAAGGGCCAAGGCAAGTCGCGCCTGGGGGACCTCTACGAGGAGGAGATGCGGGAGCTGCGCCGGCAGGTGGACCAGCTAACCAACGACAAAGCCCGCGTCGAGGTGGAGCGCGACAACCTGGCCGAGGACATCATGCGCCTCCGGGAGAAATTGCAGGAGGAGATGCTTCAGAGAGAGGAAGCCGAAAACACCCTGCAATCTTTCAGACAGGATGTTGACAATGCGTCTCTGGCACGTCTTGACCTTGAACGCAAAGTGGAATCTTTGCAAGAAGAGATTGCCTTTTTGAAGAAACTCCACGAAGAGGAAATCCAGGAGCTGCAGGCTCAGATTCAGGAACAGCATGTCCAAATCGATGTGGATGTTTCCAAGCCTGACCTCACGGCTGCCCTGCGTGACGTACGTCAGCAATATGAAAGTGTGGCTGCCAAGAACCTGCAGGAGGCAGAAGAATGGTACAAATCCAAGTTTGCTGACCTCTCTGAGGCTGCCAACCGGAACAATGACGCCCTGCGCCAGGCAAAGCAGGAGTCCACTGAGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCCCTTAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCCGTTGAAGCTGCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATGAAGGAGGAAATGGCTCGTCACCTTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCCCTTGACATTGAGATTGCCACCTACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCTTCCAAACTTTTCCTCCCTGAACCTGAGGGAAACTAATCTGGATTCACTCCCTCTGGTTGATACCCACTCAAAAAGGACACTTCTGATTAAGACGGTTGAAACTAGAGATGGACAGGTTATCAACGAAACTTCTCAGCATCACGATGACCTTGAAGGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGAATAATATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCTTTCAGACCGCTAAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCTCTTCACCTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTCAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTACGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCACTGCAGGACGGCGAGTTCATCTACAAGGTGAAGATGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAGGATGAGGCTGAAGCTGAAGGACGGCGGCCACTACACCTCCGAGGTCAAGACCACCTACAAGGCCAAGAAGTCCGTGCAGCT
GCCCGGCGCCTACATCGTCGGCATCAAGCTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGTAA(SEQ ID NO:203)
Protein: (X denotes the non-canonical amino acid)
MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAINTEFKNTRTNEKVELQELNDRFAXYIDKVRFLEQQNKILLAELEQLKGQGKSRLGDLYEEEMRELRRQVDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAENTLQSFRQDVDNASLARLDLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDVSKPDLTAALRDVRQQYESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYRRQVQSLTCEVDALKGTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHLREYQDLLNVKMALDIEIATYRKLLEGEESRISLPLPNFSSLNLRETNLDSLPLVDTHSKRTLLIKTVETRDGQVINETSQHHDDLEGDPPVATMVSKGEENNMAIIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGFQTAKLKVTKGGPLPFAWDILSPLFTYGSKAYVKHPADIPDYFKLSFPEGFKWERVMNYEDGGVVTVTQDSSLQDGEFIYKVKMRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIRMRLKLKDGGHYTSEVKTTYKAKKSVQLPGAYIVGIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK(SEQ ID NO:204)
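The relationship between an amber (TAG) codon in a listed DNA sequence and the X in the corresponding protein sequence can be checked with a short script. The following is a minimal sketch, not part of the original disclosure: the function names `translate_with_amber` and `amber_codon_positions` are hypothetical, the standard genetic code is assumed, and the reading frame is assumed to start at the first base supplied. The example fragment is the stretch around the amber codon of the Vim116TAG sequence above.

```python
# Standard genetic code, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def _clean(dna):
    """Remove whitespace (e.g. line breaks in a listing) and upper-case."""
    return "".join(dna.split()).upper()


def translate_with_amber(dna, amber="TAG"):
    """Translate in frame 0, writing X for every in-frame amber codon.

    Translation ends at the first in-frame TAA or TGA stop codon.
    Assumption: the amber codon is suppressed (read through) everywhere.
    """
    dna = _clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon == amber:
            protein.append("X")  # position of the non-canonical amino acid
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def amber_codon_positions(dna, amber="TAG"):
    """Return 1-based codon indices of in-frame amber codons in frame 0."""
    dna = _clean(dna)
    return [
        i // 3 + 1
        for i in range(0, len(dna) - 2, 3)
        if dna[i:i + 3] == amber
    ]


# Fragment surrounding the amber codon in the Vim116TAG coding sequence.
fragment = "GACCGCTTCGCCTAGTACATCGACAAGGTG"
print(translate_with_amber(fragment))   # DRFAXYIDKV
print(amber_codon_positions(fragment))  # [5]
```

Applied to a full coding sequence, the same functions would be expected to reproduce the listed protein sequence with X at the amber position, provided the sequence lines are first joined into a single string.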
Vim116TAG-mOrange-MS2
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGTCCACCAGGTCCGTGTCCTCGTCCTCCTACCGCAGGATGTTCGGCGGCCCGGGCACCGCGAGCCGGCCGAGCTCCAGCCGGAGCTACGTGACTACGTCCACCCGCACCTACAGCCTGGGCAGCGCGCTGCGCCCCAGCACCAGCCGCAGCCTCTACGCCTCGTCCCCGGGCGGCGTGTATGCCACGCGCTCCTCTGCCGTGCGCCTGCGGAGCAGCGTGCCCGGGGTGCGGCTCCTGCAGGACTCGGTGGACTTCTCGCTGGCCGACGCCATCAACACCGAGTTCAAGAACACCCGCACCAACGAGAAGGTGGAGCTGCAGGAGCTGAATGACCGCTTCGCCTAGTACATCGACAAGGTGCGCTTCCTGGAGCAGCAGAATAAGATCCTGCTGGCCGAGCTCGAGCAGCTCAAGGGCCAAGGCAAGTCGCGCCTGGGGGACCTCTACGAGGAGGAGATGCGGGAGCTGCGCCGGCAGGTGGACCAGCTAACCAACGACAAAGCCCGCGTCGAGGTGGAGCGCGACAACCTGGCCGAGGACATCATGCGCCTCCGGGAGAAATTGCAGGAGGAGATGCTTCAGAGAGAGGAAGCCGAAAACACCCTGCAATCTTTCAGACAGGATGTTGACAATGCGTCTCTGGCACGTCTTGACCTTGAACGCAAAGTGGAATCTTTGCAAGAAGAGATTGCCTTTTTGAAGAAACTCCACGAAGAGGAAATCCAGGAGCTGCAGGCTCAGATTCAGGAACAGCATGTCCAAATCGATGTGGATGTTTCCAAGCCTGACCTCACGGCTGCCCTGCGTGACGTACGTCAGCAATATGAAAGTGTGGCTGCCAAGAACCTGCAGGAGGCAGAAGAATGGTACAAATCCAAGTTTGCTGACCTCTCTGAGGCTGCCAACCGGAACAATGACGCCCTGCGCCAGGCAAAGCAGGAGTCCACTGAGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCCCTTAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCCGTTGAAGCTGCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATGAAGGAGGAAATGGCTCGTCACCTTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCCCTTGACATTGAGATTGCCACCTACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCTCTGCCTCTTCCAAACTTTTCCTCCCTGAACCTGAGGGAAACTAATCTGGATTCACTCCCTCTGGTTGATACCCACTCAAAAAGGACACTTCTGATTAAGACGGTTGAAACTAGAGATGGACAGGTTATCAACGAAACTTCTCAGCATCACGATGACCTTGAAGGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGAATAATATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCTTTCAGACCGCTAAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCTCTTCACCTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTCAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTACGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCACTGCAGGACGGCGAGTTCATCTACAAGGTGAAGATGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAGGATGAGGCTGAAGCTGAAGGACGGCGGCCACTACACCTCCGAGGTCAAGACCACCTACAAGGCCAAGAAGTCCGTGCAGCT
GCCCGGCGCCTACATCGTCGGCATCAAGCTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACTCTAGATCATAATCAGCACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:205)
Protein: (X denotes the non-canonical amino acid)
MSTRSVSSSSYRRMFGGPGTASRPSSSRSYVTTSTRTYSLGSALRPSTSRSLYASSPGGVYATRSSAVRLRSSVPGVRLLQDSVDFSLADAINTEFKNTRTNEKVELQELNDRFAXYIDKVRFLEQQNKILLAELEQLKGQGKSRLGDLYEEEMRELRRQVDQLTNDKARVEVERDNLAEDIMRLREKLQEEMLQREEAENTLQSFRQDVDNASLARLDLERKVESLQEEIAFLKKLHEEEIQELQAQIQEQHVQIDVDVSKPDLTAALRDVRQQYESVAAKNLQEAEEWYKSKFADLSEAANRNNDALRQAKQESTEYRRQVQSLTCEVDALKGTNESLERQMREMEENFAVEAANYQDTIGRLQDEIQNMKEEMARHLREYQDLLNVKMALDIEIATYRKLLEGEESRISLPLPNFSSLNLRETNLDSLPLVDTHSKRTLLIKTVETRDGQVINETSQHHDDLEGDPPVATMVSKGEENNMAIIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGFQTAKLKVTKGGPLPFAWDILSPLFTYGSKAYVKHPADIPDYFKLSFPEGFKWERVMNYEDGGVVTVTQDSSLQDGEFIYKVKMRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIRMRLKLKDGGHYTSEVKTTYKAKKSVQLPGAYIVGIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK(SEQ ID NO:206)
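Because the underlining of the original listing is not preserved in plain text, the MS2 stem-loop copies can be located by searching for the repeated motif in the 3' tail of the DNA sequence. The sketch below is illustrative only. It assumes that the stem-loop corresponds to the 19-nucleotide motif `ACATGAGGATCACCCATGT`, which occurs twice after the stop codon in the sequence above; this assignment is an assumption and is not stated in the listing. The function name `find_motif` is hypothetical.

```python
# Assumed MS2 stem-loop motif (DNA form); an assumption, not taken from the listing.
MS2_MOTIF = "ACATGAGGATCACCCATGT"


def find_motif(dna, motif=MS2_MOTIF):
    """Return 0-based start positions of every occurrence of motif in dna.

    Whitespace is removed first, so sequences broken across lines or
    containing stray spaces can be passed in directly.
    """
    dna = "".join(dna.split()).upper()
    hits = []
    start = dna.find(motif)
    while start != -1:
        hits.append(start)
        start = dna.find(motif, start + 1)
    return hits


# 3' tail following the stop codon of the Vim116TAG-mOrange-MS2 sequence.
tail = (
    "AGCGGCCGCGACTCTAGATCATAATCAGCACATGAGGATCACCCATGT"
    "CTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT"
)
hits = find_motif(tail)
print(len(hits))  # 2
```

Each returned position marks the start of one motif copy, so `tail[p:p + len(MS2_MOTIF)]` recovers the stretch that would have been underlined under the stated assumption.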
INSR676TAG-EGFP
DNA: (the amber codon is underlined)
ATGGGCACCGGGGGCCGGCGGGGGGCGGCGGCCGCGCCGCTGCTGGTGGCGGTGGCCGCGCTGCTACTGGGCGCCGCGGGCCACCTGTACCCCGGAGAGGTGTGTCCCGGCATGGATATCCGGAACAACCTCACTAGGTTGCATGAGCTGGAGAATTGCTCTGTCATCGAAGGACACTTGCAGATACTCTTGATGTTCAAAACGAGGCCCGAAGATTTCCGAGACCTCAGTTTCCCCAAACTCATCATGATCACTGATTACTTGCTGCTCTTCCGGGTCTATGGGCTCGAGAGCCTGAAGGACCTGTTCCCCAACCTCACGGTCATCCGGGGATCACGACTGTTCTTTAACTACGCGCTGGTCATCTTCGAGATGGTTCACCTCAAGGAACTCGGCCTCTACAACCTGATGAACATCACCCGGGGTTCTGTCCGCATCGAGAAGAACAATGAGCTCTGTTACTTGGCCACTATCGACTGGTCCCGTATCCTGGATTCCGTGGAGGATAATTACATCGTGTTGAACAAAGATGACAACGAGGAGTGTGGAGACATCTGTCCGGGTACCGCGAAGGGCAAGACCAACTGCCCCGCCACCGTCATCAACGGGCAGTTTGTCGAACGATGTTGGACTCATAGTCACTGCCAGAAAGTTTGCCCGACCATCTGTAAGTCACACGGCTGCACCGCCGAAGGCCTCTGTTGCCACAGCGAGTGCCTGGGCAACTGTTCTCAGCCCGACGACCCCACCAAGTGCGTGGCCTGCCGCAACTTCTACCTGGATGGCAGGTGTGTGGAGACCTGCCCGCCCCCGTACTACCACTTCCAGGACTGGCGCTGTGTGAACTTCAGCTTCTGCCAGGACCTGCACCACAAATGCAAGAACTCGCGGAGGCAGGGCTGCCACCAGTACGTCATTCACAACAACAAGTGCATCCCTGAGTGTCCCTCCGGGTACACGATGAATTCCAGCAACTTGCTGTGCACCCCATGCCTGGGTCCCTGTCCCAAGGTGTGCCACCTCCTAGAAGGCGAGAAGACCATCGACTCGGTGACGTCTGCCCAGGAGCTCCGAGGATGCACCGTCATCAACGGGAGTCTGATCATCAACATTCGAGGAGGCAACAATCTGGCAGCTGAGCTAGAAGCCAACCTCGGCCTCATTGAAGAAATTTCAGGGTATCTAAAAATCCGCCGATCCTACGCTCTGGTGTCACTTTCCTTCTTCCGGAAGTTACGTCTGATTCGAGGAGAGACCTTGGAAATTGGGAACTACTCCTTCTATGCCTTGGACAACCAGAACCTAAGGCAGCTCTGGGACTGGAGCAAACACAACCTCACCATCACTCAGGGGAAACTCTTCTTCCACTATAACCCCAAACTCTGCTTGTCAGAAATCCACAAGATGGAAGAAGTTTCAGGAACCAAGGGGCGCCAGGAGAGAAACGACATTGCCCTGAAGACCAATGGGGACCAGGCATCCTGTGAAAATGAGTTACTTAAATTTTCTTACATTCGGACATCTTTTGACAAGATCTTGCTGAGATGGGAGCCGTACTGGCCCCCCGACTTCCGAGACCTCTTGGGGTTCATGCTGTTCTACAAAGAGGCCCCTTATCAGAATGTGACGGAGTTCGACGGGCAGGATGCATGTGGTTCCAACAGTTGGACGGTGGTAGACATTGACCCACCCCTGAGGTCCAACGACCCCAAATCACAGAACCACCCAGGGTGGCTGATGCGGGGTCTCAAGCCCTGGACCCAGTATGCCATCTTTGTGAAGACCCTGGTCACCTTTTCGGATGAACGCCGGACCTATGGGGCCAAGAGTGACATCATTTATGTCCAGACAGATGCCACCAACCCCTCTGTGCCCCTGGATCCAATCTCAGTGTCTAACTCATCATCCCAGATTATTCTGAAGTGGAAACCACCCTCCGACCCCAATGGCAACATCACCCACTACCTGGTTTTCTGGGAGAGGCAGGCGGAAGACAGTGA
GCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCCTCGAGGACCTGGTCTCCACCATTCGAGTCTGAAGATTCTCAGAAGCACAACCAGAGTGAGTATGAGGATTCGGCCGGCGAATGCTGCTCCTGTCCAAAGACAGACTCTCAGATCCTGAAGGAGCTGGAGGAGTCCTCGTTTAGGAAGACGTTTGAGGATTACCTGCACAACGTGGTTTTCGTCCCCAGAAAAACCTCTTCAGGCACTGGTGCCGAGGACCCTAGGCCATCTCGGAAACGCAGGTCCCTTGGCGATGTTGGGAATGTGACGGTGGCCGTGCCCACGGTGGCAGCTTTCCCCAACACTTCCTCGACCAGCGTGCCCACGAGTCCGGAGGAGCACAGGCCTTTTGAGAAGGTGGTGAACAAGGAGTCGCTGGTCATCTCCGGCTTGCGACACTTCACGGGCTATCGCATCGAGCTGCAGGCTTGCAACCAGGACACCCCTGAGGAACGGTGCAGTGTGGCAGCCTACGTCAGTGCGAGGACCATGCCTGAAGCCAAGGCTGATGACATTGTTGGCCCTGTGACGCATGAAATCTTTGAGAACAACGTCGTCCACTTGATGTGGCAGGAGCCGAAGGAGCCCAATGGTCTGATCGTGCTGTATGAAGTGAGTTATCGGCGATATGGTGATGAGGAGCTGCATCTCTGCGTCTCCCGCAAGCACTTCGCTCTGGAACGGGGCTGCAGGCTGCGTGGGCTGTCACCGGGGAACTACAGCGTGCGAATCCGGGCCACCTCCCTTGCGGGCAACGGCTCTTGGACGGAACCCACCTATTTCTACGTGACAGACTATTTAGACGTCCCGTCAAATATTGCAAAAATTATCATCGGCCCCCTCATCTTTGTCTTTCTCTTCAGTGTTGTGATTGGAAGTATTTATCTATTCCTGAGAAAGAGGCAGCCAGATGGGCCGCTGGGACCGCTTTACGCTTCTTCAAACCCTGAGTATCTCAGTGCCAGTGATGTGTTTCCATGCTCTGTGTACGTGCCGGACGAGTGGGAGGTGTCTCGAGAGAAGATCACCCTCCTTCGAGAGCTGGGGCAGGGCTCCTTCGGCATGGTGTATGAGGGCAATGCCAGGGACATCATCAAGGGTGAGGCAGAGACCCGCGTGGCGGTGAAGACGGTCAACGAGTCAGCCAGTCTCCGAGAGCGGATTGAGTTCCTCAATGAGGCCTCGGTCATGAAGGGCTTCACCTGCCATCACGTGGTGCGCCTCCTGGGAGTGGTGTCCAAGGGCCAGCCCACGCTGGTGGTGATGGAGCTGATGGCTCACGGAGACCTGAAGAGCTACCTCCGTTCTCTGCGGCCAGAGGCTGAGAATAATCCTGGCCGCCCTCCCCCTACCCTTCAAGAGATGATTCAGATGGCGGCAGAGATTGCTGACGGGATGGCCTACCTGAACGCCAAGAAGTTTGTGCATCGGGACCTGGCAGCGAGAAACTGCATGGTCGCCCATGATTTTACTGTCAAAATTGGAGACTTTGGAATGACCAGAGACATCTATGAAACGGATTACTACCGGAAAGGGGGCAAGGGTCTGCTCCCTGTACGGTGGATGGCACCGGAGTCCCTGAAGGATGGGGTCTTCACCACTTCTTCTGACATGTGGTCCTTTGGCGTGGTCCTTTGGGAAATCACCAGCTTGGCAGAACAGCCTTACCAAGGCCTGTCTAATGAACAGGTGTTGAAATTTGTCATGGATGGAGGGTATCTGGATCAACCCGACAACTGTCCAGAGAGAGTCACTGACCTCATGCGCATGTGCTGGCAATTCAACCCCAACATGAGGCCAACCTTCCTGGAGATTGTCAACCTGCTCAAGGACGACCTGCACCCCAGCTTTCCAGAGGTGTCGTTCTTCCACAGCGAGGAGAACAAGGCTCCCGAGAGTGAGGAGCTGGAGATGGAGTTTGAGGACATGGAGAATGTGCCCCTGGACCGTTCCTCGC
ACTGTCAGAGGGAGGAGGCGGGGGGCCGGGATGGAGGGTCCTCGCTGGGTTTCAAGCGGAGCTACGAGGAACACATCCCTTACACACACATGAACGGAGGCAAGAAAAACGGGCGGATTCTGACCTTGCCTCGGTCCAATCCTTCCTGGGCCCGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATG
GACGAGCTGTACAAGTAA(SEQ ID NO:207)
Protein: (X denotes the non-canonical amino acid)
MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLXGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRKTSSGTGAEDPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPSWARDPPVATMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK(SEQ ID NO:208)
INSR676TAG-EGFP-MS2
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGGCACCGGGGGCCGGCGGGGGGCGGCGGCCGCGCCGCTGCTGGTGGCGGTGGCCGCGCTGCTACTGGGCGCCGCGGGCCACCTGTACCCCGGAGAGGTGTGTCCCGGCATGGATATCCGGAACAACCTCACTAGGTTGCATGAGCTGGAGAATTGCTCTGTCATCGAAGGACACTTGCAGATACTCTTGATGTTCAAAACGAGGCCCGAAGATTTCCGAGACCTCAGTTTCCCCAAACTCATCATGATCACTGATTACTTGCTGCTCTTCCGGGTCTATGGGCTCGAGAGCCTGAAGGACCTGTTCCCCAACCTCACGGTCATCCGGGGATCACGACTGTTCTTTAACTACGCGCTGGTCATCTTCGAGATGGTTCACCTCAAGGAACTCGGCCTCTACAACCTGATGAACATCACCCGGGGTTCTGTCCGCATCGAGAAGAACAATGAGCTCTGTTACTTGGCCACTATCGACTGGTCCCGTATCCTGGATTCCGTGGAGGATAATTACATCGTGTTGAACAAAGATGACAACGAGGAGTGTGGAGACATCTGTCCGGGTACCGCGAAGGGCAAGACCAACTGCCCCGCCACCGTCATCAACGGGCAGTTTGTCGAACGATGTTGGACTCATAGTCACTGCCAGAAAGTTTGCCCGACCATCTGTAAGTCACACGGCTGCACCGCCGAAGGCCTCTGTTGCCACAGCGAGTGCCTGGGCAACTGTTCTCAGCCCGACGACCCCACCAAGTGCGTGGCCTGCCGCAACTTCTACCTGGATGGCAGGTGTGTGGAGACCTGCCCGCCCCCGTACTACCACTTCCAGGACTGGCGCTGTGTGAACTTCAGCTTCTGCCAGGACCTGCACCACAAATGCAAGAACTCGCGGAGGCAGGGCTGCCACCAGTACGTCATTCACAACAACAAGTGCATCCCTGAGTGTCCCTCCGGGTACACGATGAATTCCAGCAACTTGCTGTGCACCCCATGCCTGGGTCCCTGTCCCAAGGTGTGCCACCTCCTAGAAGGCGAGAAGACCATCGACTCGGTGACGTCTGCCCAGGAGCTCCGAGGATGCACCGTCATCAACGGGAGTCTGATCATCAACATTCGAGGAGGCAACAATCTGGCAGCTGAGCTAGAAGCCAACCTCGGCCTCATTGAAGAAATTTCAGGGTATCTAAAAATCCGCCGATCCTACGCTCTGGTGTCACTTTCCTTCTTCCGGAAGTTACGTCTGATTCGAGGAGAGACCTTGGAAATTGGGAACTACTCCTTCTATGCCTTGGACAACCAGAACCTAAGGCAGCTCTGGGACTGGAGCAAACACAACCTCACCATCACTCAGGGGAAACTCTTCTTCCACTATAACCCCAAACTCTGCTTGTCAGAAATCCACAAGATGGAAGAAGTTTCAGGAACCAAGGGGCGCCAGGAGAGAAACGACATTGCCCTGAAGACCAATGGGGACCAGGCATCCTGTGAAAATGAGTTACTTAAATTTTCTTACATTCGGACATCTTTTGACAAGATCTTGCTGAGATGGGAGCCGTACTGGCCCCCCGACTTCCGAGACCTCTTGGGGTTCATGCTGTTCTACAAAGAGGCCCCTTATCAGAATGTGACGGAGTTCGACGGGCAGGATGCATGTGGTTCCAACAGTTGGACGGTGGTAGACATTGACCCACCCCTGAGGTCCAACGACCCCAAATCACAGAACCACCCAGGGTGGCTGATGCGGGGTCTCAAGCCCTGGACCCAGTATGCCATCTTTGTGAAGACCCTGGTCACCTTTTCGGATGAACGCCGGACCTATGGGGCCAAGAGTGACATCATTTATGTCCAGACAGATGCCACCAACCCCTCTGTGCCCCTGGATCCAATCTCAGTGTCTAACTCATCATCCCAGATTATTCTGAAGTGGAAACCACCCTCCGACCCCAATGGCAACATCACCCACTACCTGGTTTTCTGGGAGAGGCAGGCGGAAGACAGTGA
GCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCCTCGAGGACCTGGTCTCCACCATTCGAGTCTGAAGATTCTCAGAAGCACAACCAGAGTGAGTATGAGGATTCGGCCGGCGAATGCTGCTCCTGTCCAAAGACAGACTCTCAGATCCTGAAGGAGCTGGAGGAGTCCTCGTTTAGGAAGACGTTTGAGGATTACCTGCACAACGTGGTTTTCGTCCCCAGAAAAACCTCTTCAGGCACTGGTGCCGAGGACCCTAGGCCATCTCGGAAACGCAGGTCCCTTGGCGATGTTGGGAATGTGACGGTGGCCGTGCCCACGGTGGCAGCTTTCCCCAACACTTCCTCGACCAGCGTGCCCACGAGTCCGGAGGAGCACAGGCCTTTTGAGAAGGTGGTGAACAAGGAGTCGCTGGTCATCTCCGGCTTGCGACACTTCACGGGCTATCGCATCGAGCTGCAGGCTTGCAACCAGGACACCCCTGAGGAACGGTGCAGTGTGGCAGCCTACGTCAGTGCGAGGACCATGCCTGAAGCCAAGGCTGATGACATTGTTGGCCCTGTGACGCATGAAATCTTTGAGAACAACGTCGTCCACTTGATGTGGCAGGAGCCGAAGGAGCCCAATGGTCTGATCGTGCTGTATGAAGTGAGTTATCGGCGATATGGTGATGAGGAGCTGCATCTCTGCGTCTCCCGCAAGCACTTCGCTCTGGAACGGGGCTGCAGGCTGCGTGGGCTGTCACCGGGGAACTACAGCGTGCGAATCCGGGCCACCTCCCTTGCGGGCAACGGCTCTTGGACGGAACCCACCTATTTCTACGTGACAGACTATTTAGACGTCCCGTCAAATATTGCAAAAATTATCATCGGCCCCCTCATCTTTGTCTTTCTCTTCAGTGTTGTGATTGGAAGTATTTATCTATTCCTGAGAAAGAGGCAGCCAGATGGGCCGCTGGGACCGCTTTACGCTTCTTCAAACCCTGAGTATCTCAGTGCCAGTGATGTGTTTCCATGCTCTGTGTACGTGCCGGACGAGTGGGAGGTGTCTCGAGAGAAGATCACCCTCCTTCGAGAGCTGGGGCAGGGCTCCTTCGGCATGGTGTATGAGGGCAATGCCAGGGACATCATCAAGGGTGAGGCAGAGACCCGCGTGGCGGTGAAGACGGTCAACGAGTCAGCCAGTCTCCGAGAGCGGATTGAGTTCCTCAATGAGGCCTCGGTCATGAAGGGCTTCACCTGCCATCACGTGGTGCGCCTCCTGGGAGTGGTGTCCAAGGGCCAGCCCACGCTGGTGGTGATGGAGCTGATGGCTCACGGAGACCTGAAGAGCTACCTCCGTTCTCTGCGGCCAGAGGCTGAGAATAATCCTGGCCGCCCTCCCCCTACCCTTCAAGAGATGATTCAGATGGCGGCAGAGATTGCTGACGGGATGGCCTACCTGAACGCCAAGAAGTTTGTGCATCGGGACCTGGCAGCGAGAAACTGCATGGTCGCCCATGATTTTACTGTCAAAATTGGAGACTTTGGAATGACCAGAGACATCTATGAAACGGATTACTACCGGAAAGGGGGCAAGGGTCTGCTCCCTGTACGGTGGATGGCACCGGAGTCCCTGAAGGATGGGGTCTTCACCACTTCTTCTGACATGTGGTCCTTTGGCGTGGTCCTTTGGGAAATCACCAGCTTGGCAGAACAGCCTTACCAAGGCCTGTCTAATGAACAGGTGTTGAAATTTGTCATGGATGGAGGGTATCTGGATCAACCCGACAACTGTCCAGAGAGAGTCACTGACCTCATGCGCATGTGCTGGCAATTCAACCCCAACATGAGGCCAACCTTCCTGGAGATTGTCAACCTGCTCAAGGACGACCTGCACCCCAGCTTTCCAGAGGTGTCGTTCTTCCACAGCGAGGAGAACAAGGCTCCCGAGAGTGAGGAGCTGGAGATGGAGTTTGAGGACATGGAGAATGTGCCCCTGGACCGTTCCTCGC
ACTGTCAGAGGGAGGAGGCGGGGGGCCGGGATGGAGGGTCCTCGCTGGGTTTCAAGCGGAGCTACGAGGAACACATCCCTTACACACACATGAACGGAGGCAAGAAAAACGGGCGGATTCTGACCTTGCCTCGGTCCAATCCTTCCTGGGCCCGGGATCCACCGGTCGCCACCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGCGGCCGCGACTCTAGATCATAATCAGCAC ATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:209)
Protein: (X denotes the non-canonical amino acid)
MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLXGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRKTSSGTGAEDPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPSWARDPPVATMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK(SEQ ID NO:210)
INSR676TAG-mOrange-MS2
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGGCACCGGGGGCCGGCGGGGGGCGGCGGCCGCGCCGCTGCTGGTGGCGGTGGCCGCGCTGCTACTGGGCGCCGCGGGCCACCTGTACCCCGGAGAGGTGTGTCCCGGCATGGATATCCGGAACAACCTCACTAGGTTGCATGAGCTGGAGAATTGCTCTGTCATCGAAGGACACTTGCAGATACTCTTGATGTTCAAAACGAGGCCCGAAGATTTCCGAGACCTCAGTTTCCCCAAACTCATCATGATCACTGATTACTTGCTGCTCTTCCGGGTCTATGGGCTCGAGAGCCTGAAGGACCTGTTCCCCAACCTCACGGTCATCCGGGGATCACGACTGTTCTTTAACTACGCGCTGGTCATCTTCGAGATGGTTCACCTCAAGGAACTCGGCCTCTACAACCTGATGAACATCACCCGGGGTTCTGTCCGCATCGAGAAGAACAATGAGCTCTGTTACTTGGCCACTATCGACTGGTCCCGTATCCTGGATTCCGTGGAGGATAATTACATCGTGTTGAACAAAGATGACAACGAGGAGTGTGGAGACATCTGTCCGGGTACCGCGAAGGGCAAGACCAACTGCCCCGCCACCGTCATCAACGGGCAGTTTGTCGAACGATGTTGGACTCATAGTCACTGCCAGAAAGTTTGCCCGACCATCTGTAAGTCACACGGCTGCACCGCCGAAGGCCTCTGTTGCCACAGCGAGTGCCTGGGCAACTGTTCTCAGCCCGACGACCCCACCAAGTGCGTGGCCTGCCGCAACTTCTACCTGGATGGCAGGTGTGTGGAGACCTGCCCGCCCCCGTACTACCACTTCCAGGACTGGCGCTGTGTGAACTTCAGCTTCTGCCAGGACCTGCACCACAAATGCAAGAACTCGCGGAGGCAGGGCTGCCACCAGTACGTCATTCACAACAACAAGTGCATCCCTGAGTGTCCCTCCGGGTACACGATGAATTCCAGCAACTTGCTGTGCACCCCATGCCTGGGTCCCTGTCCCAAGGTGTGCCACCTCCTAGAAGGCGAGAAGACCATCGACTCGGTGACGTCTGCCCAGGAGCTCCGAGGATGCACCGTCATCAACGGGAGTCTGATCATCAACATTCGAGGAGGCAACAATCTGGCAGCTGAGCTAGAAGCCAACCTCGGCCTCATTGAAGAAATTTCAGGGTATCTAAAAATCCGCCGATCCTACGCTCTGGTGTCACTTTCCTTCTTCCGGAAGTTACGTCTGATTCGAGGAGAGACCTTGGAAATTGGGAACTACTCCTTCTATGCCTTGGACAACCAGAACCTAAGGCAGCTCTGGGACTGGAGCAAACACAACCTCACCATCACTCAGGGGAAACTCTTCTTCCACTATAACCCCAAACTCTGCTTGTCAGAAATCCACAAGATGGAAGAAGTTTCAGGAACCAAGGGGCGCCAGGAGAGAAACGACATTGCCCTGAAGACCAATGGGGACCAGGCATCCTGTGAAAATGAGTTACTTAAATTTTCTTACATTCGGACATCTTTTGACAAGATCTTGCTGAGATGGGAGCCGTACTGGCCCCCCGACTTCCGAGACCTCTTGGGGTTCATGCTGTTCTACAAAGAGGCCCCTTATCAGAATGTGACGGAGTTCGACGGGCAGGATGCATGTGGTTCCAACAGTTGGACGGTGGTAGACATTGACCCACCCCTGAGGTCCAACGACCCCAAATCACAGAACCACCCAGGGTGGCTGATGCGGGGTCTCAAGCCCTGGACCCAGTATGCCATCTTTGTGAAGACCCTGGTCACCTTTTCGGATGAACGCCGGACCTATGGGGCCAAGAGTGACATCATTTATGTCCAGACAGATGCCACCAACCCCTCTGTGCCCCTGGATCCAATCTCAGTGTCTAACTCATCATCCCAGATTATTCTGAAGTGGAAACCACCCTCCGACCCCAATGGCAACATCACCCACTACCTGGTTTTCTGGGAGAGGCAGGCGGAAGACAGTGA
GCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCCTCGAGGACCTGGTCTCCACCATTCGAGTCTGAAGATTCTCAGAAGCACAACCAGAGTGAGTATGAGGATTCGGCCGGCGAATGCTGCTCCTGTCCAAAGACAGACTCTCAGATCCTGAAGGAGCTGGAGGAGTCCTCGTTTAGGAAGACGTTTGAGGATTACCTGCACAACGTGGTTTTCGTCCCCAGGCCATCTCGGAAACGCAGGTCCCTTGGCGATGTTGGGAATGTGACGGTGGCCGTGCCCACGGTGGCAGCTTTCCCCAACACTTCCTCGACCAGCGTGCCCACGAGTCCGGAGGAGCACAGGCCTTTTGAGAAGGTGGTGAACAAGGAGTCGCTGGTCATCTCCGGCTTGCGACACTTCACGGGCTATCGCATCGAGCTGCAGGCTTGCAACCAGGACACCCCTGAGGAACGGTGCAGTGTGGCAGCCTACGTCAGTGCGAGGACCATGCCTGAAGCCAAGGCTGATGACATTGTTGGCCCTGTGACGCATGAAATCTTTGAGAACAACGTCGTCCACTTGATGTGGCAGGAGCCGAAGGAGCCCAATGGTCTGATCGTGCTGTATGAAGTGAGTTATCGGCGATATGGTGATGAGGAGCTGCATCTCTGCGTCTCCCGCAAGCACTTCGCTCTGGAACGGGGCTGCAGGCTGCGTGGGCTGTCACCGGGGAACTACAGCGTGCGAATCCGGGCCACCTCCCTTGCGGGCAACGGCTCTTGGACGGAACCCACCTATTTCTACGTGACAGACTATTTAGACGTCCCGTCAAATATTGCAAAAATTATCATCGGCCCCCTCATCTTTGTCTTTCTCTTCAGTGTTGTGATTGGAAGTATTTATCTATTCCTGAGAAAGAGGCAGCCAGATGGGCCGCTGGGACCGCTTTACGCTTCTTCAAACCCTGAGTATCTCAGTGCCAGTGATGTGTTTCCATGCTCTGTGTACGTGCCGGACGAGTGGGAGGTGTCTCGAGAGAAGATCACCCTCCTTCGAGAGCTGGGGCAGGGCTCCTTCGGCATGGTGTATGAGGGCAATGCCAGGGACATCATCAAGGGTGAGGCAGAGACCCGCGTGGCGGTGAAGACGGTCAACGAGTCAGCCAGTCTCCGAGAGCGGATTGAGTTCCTCAATGAGGCCTCGGTCATGAAGGGCTTCACCTGCCATCACGTGGTGCGCCTCCTGGGAGTGGTGTCCAAGGGCCAGCCCACGCTGGTGGTGATGGAGCTGATGGCTCACGGAGACCTGAAGAGCTACCTCCGTTCTCTGCGGCCAGAGGCTGAGAATAATCCTGGCCGCCCTCCCCCTACCCTTCAAGAGATGATTCAGATGGCGGCAGAGATTGCTGACGGGATGGCCTACCTGAACGCCAAGAAGTTTGTGCATCGGGACCTGGCAGCGAGAAACTGCATGGTCGCCCATGATTTTACTGTCAAAATTGGAGACTTTGGAATGACCAGAGACATCTATGAAACGGATTACTACCGGAAAGGGGGCAAGGGTCTGCTCCCTGTACGGTGGATGGCACCGGAGTCCCTGAAGGATGGGGTCTTCACCACTTCTTCTGACATGTGGTCCTTTGGCGTGGTCCTTTGGGAAATCACCAGCTTGGCAGAACAGCCTTACCAAGGCCTGTCTAATGAACAGGTGTTGAAATTTGTCATGGATGGAGGGTATCTGGATCAACCCGACAACTGTCCAGAGAGAGTCACTGACCTCATGCGCATGTGCTGGCAATTCAACCCCAACATGAGGCCAACCTTCCTGGAGATTGTCAACCTGCTCAAGGACGACCTGCACCCCAGCTTTCCAGAGGTGTCGTTCTTCCACAGCGAGGAGAACAAGGCTCCCGAGAGTGAGGAGCTGGAGATGGAGTTTGAGGACATGGAGAATGTGCCCCTGGACCGTTCCTCGCACTGTCAGAGGGAGGAGGCGGGGGGCCGGGATGGAG
GGTCCTCGCTGGGTTTCAAGCGGAGCTACGAGGAACACATCCCTTACACACACATGAACGGAGGCAAGAAAAACGGGCGGATTCTGACCTTGCCTCGGTCCAATCCTTCCTGGGCCCGGGATCCACCGGTCGCCACCGTGAGCAAGGGCGAGGAGAATAATATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCGCATGGAGGGCACCGTGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCTTTCAGACCGCTAAGCTGAAGGTGACCAAGGGCGGCCCCCTGCCCTTCGCCTGGGACATCCTGTCCCCTCTCTTCACCTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTCAAGCTGTCCTTCCCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTACGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCACTGCAGGACGGCGAGTTCATCTACAAGGTGAAGATGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTGATGCAGAAGAAGACCATGGGCTGGGAGGCCTCCTCCGAGCGGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAGGATGAGGCTGAAGCTGAAGGACGGCGGCCACTACACCTCCGAGGTCAAGACCACCTACAAGGCCAAGAAGTCCGTGCAGCTGCCCGGCGCCTACATCGTCGGCATCAAGCTGGACATCACCTCCCACAACGAGGACTACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGTAAAGCGGCCGCGACTCTAGATCATAATCAGCACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGGATCACCCATGT(SEQ ID NO:211)
Protein: (X denotes the non-canonical amino acid)
MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLXGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPSWARDPPVATVSKGEENNMAIIKEFMRFKVRMEGTVNGHEFEIEGEGEGRPYEGFQTAKLKVTKGGPLPFAWDILSPLFTYGSKAYVKHPADIPDYFKLSFPEGFKWERVMNYEDGGVVTVTQDSSLQDGEFIYKVKMRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIRMRLKLKDGGHYTSEVKTTYKAKKSVQLPGAYIVGIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK(SEQ ID NO:212)
INSR676TAG-iRFP-MS2
DNA: (the MS2 stem-loops and the amber codon are underlined)
ATGGGCACCGGGGGCCGGCGGGGGGCGGCGGCCGCGCCGCTGCTGGTGGCGGTGGCCGCGCTGCTACTGGGCGCCGCGGGCCACCTGTACCCCGGAGAGGTGTGTCCCGGCATGGATATCCGGAACAACCTCACTAGGTTGCATGAGCTGGAGAATTGCTCTGTCATCGAAGGACACTTGCAGATACTCTTGATGTTCAAAACGAGGCCCGAAGATTTCCGAGACCTCAGTTTCCCCAAACTCATCATGATCACTGATTACTTGCTGCTCTTCCGGGTCTATGGGCTCGAGAGCCTGAAGGACCTGTTCCCCAACCTCACGGTCATCCGGGGATCACGACTGTTCTTTAACTACGCGCTGGTCATCTTCGAGATGGTTCACCTCAAGGAACTCGGCCTCTACAACCTGATGAACATCACCCGGGGTTCTGTCCGCATCGAGAAGAACAATGAGCTCTGTTACTTGGCCACTATCGACTGGTCCCGTATCCTGGATTCCGTGGAGGATAATTACATCGTGTTGAACAAAGATGACAACGAGGAGTGTGGAGACATCTGTCCGGGTACCGCGAAGGGCAAGACCAACTGCCCCGCCACCGTCATCAACGGGCAGTTTGTCGAACGATGTTGGACTCATAGTCACTGCCAGAAAGTTTGCCCGACCATCTGTAAGTCACACGGCTGCACCGCCGAAGGCCTCTGTTGCCACAGCGAGTGCCTGGGCAACTGTTCTCAGCCCGACGACCCCACCAAGTGCGTGGCCTGCCGCAACTTCTACCTGGATGGCAGGTGTGTGGAGACCTGCCCGCCCCCGTACTACCACTTCCAGGACTGGCGCTGTGTGAACTTCAGCTTCTGCCAGGACCTGCACCACAAATGCAAGAACTCGCGGAGGCAGGGCTGCCACCAGTACGTCATTCACAACAACAAGTGCATCCCTGAGTGTCCCTCCGGGTACACGATGAATTCCAGCAACTTGCTGTGCACCCCATGCCTGGGTCCCTGTCCCAAGGTGTGCCACCTCCTAGAAGGCGAGAAGACCATCGACTCGGTGACGTCTGCCCAGGAGCTCCGAGGATGCACCGTCATCAACGGGAGTCTGATCATCAACATTCGAGGAGGCAACAATCTGGCAGCTGAGCTAGAAGCCAACCTCGGCCTCATTGAAGAAATTTCAGGGTATCTAAAAATCCGCCGATCCTACGCTCTGGTGTCACTTTCCTTCTTCCGGAAGTTACGTCTGATTCGAGGAGAGACCTTGGAAATTGGGAACTACTCCTTCTATGCCTTGGACAACCAGAACCTAAGGCAGCTCTGGGACTGGAGCAAACACAACCTCACCATCACTCAGGGGAAACTCTTCTTCCACTATAACCCCAAACTCTGCTTGTCAGAAATCCACAAGATGGAAGAAGTTTCAGGAACCAAGGGGCGCCAGGAGAGAAACGACATTGCCCTGAAGACCAATGGGGACCAGGCATCCTGTGAAAATGAGTTACTTAAATTTTCTTACATTCGGACATCTTTTGACAAGATCTTGCTGAGATGGGAGCCGTACTGGCCCCCCGACTTCCGAGACCTCTTGGGGTTCATGCTGTTCTACAAAGAGGCCCCTTATCAGAATGTGACGGAGTTCGACGGGCAGGATGCATGTGGTTCCAACAGTTGGACGGTGGTAGACATTGACCCACCCCTGAGGTCCAACGACCCCAAATCACAGAACCACCCAGGGTGGCTGATGCGGGGTCTCAAGCCCTGGACCCAGTATGCCATCTTTGTGAAGACCCTGGTCACCTTTTCGGATGAACGCCGGACCTATGGGGCCAAGAGTGACATCATTTATGTCCAGACAGATGCCACCAACCCCTCTGTGCCCCTGGATCCAATCTCAGTGTCTAACTCATCATCCCAGATTATTCTGAAGTGGAAACCACCCTCCGACCCCAATGGCAACATCACCCACTACCTGGTTTTCTGGGAGAGGCAGGCGGAAGACAGTGA
GCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCCTCGAGGACCTGGTCTCCACCATTCGAGTCTGAAGATTCTCAGAAGCACAACCAGAGTGAGTATGAGGATTCGGCCGGCGAATGCTGCTCCTGTCCAAAGACAGACTCTCAGATCCTGAAGGAGCTGGAGGAGTCCTCGTTTAGGAAGACGTTTGAGGATTACCTGCACAACGTGGTTTTCGTCCCCAGGCCATCTCGGAAACGCAGGTCCCTTGGCGATGTTGGGAATGTGACGGTGGCCGTGCCCACGGTGGCAGCTTTCCCCAACACTTCCTCGACCAGCGTGCCCACGAGTCCGGAGGAGCACAGGCCTTTTGAGAAGGTGGTGAACAAGGAGTCGCTGGTCATCTCCGGCTTGCGACACTTCACGGGCTATCGCATCGAGCTGCAGGCTTGCAACCAGGACACCCCTGAGGAACGGTGCAGTGTGGCAGCCTACGTCAGTGCGAGGACCATGCCTGAAGCCAAGGCTGATGACATTGTTGGCCCTGTGACGCATGAAATCTTTGAGAACAACGTCGTCCACTTGATGTGGCAGGAGCCGAAGGAGCCCAATGGTCTGATCGTGCTGTATGAAGTGAGTTATCGGCGATATGGTGATGAGGAGCTGCATCTCTGCGTCTCCCGCAAGCACTTCGCTCTGGAACGGGGCTGCAGGCTGCGTGGGCTGTCACCGGGGAACTACAGCGTGCGAATCCGGGCCACCTCCCTTGCGGGCAACGGCTCTTGGACGGAACCCACCTATTTCTACGTGACAGACTATTTAGACGTCCCGTCAAATATTGCAAAAATTATCATCGGCCCCCTCATCTTTGTCTTTCTCTTCAGTGTTGTGATTGGAAGTATTTATCTATTCCTGAGAAAGAGGCAGCCAGATGGGCCGCTGGGACCGCTTTACGCTTCTTCAAACCCTGAGTATCTCAGTGCCAGTGATGTGTTTCCATGCTCTGTGTACGTGCCGGACGAGTGGGAGGTGTCTCGAGAGAAGATCACCCTCCTTCGAGAGCTGGGGCAGGGCTCCTTCGGCATGGTGTATGAGGGCAATGCCAGGGACATCATCAAGGGTGAGGCAGAGACCCGCGTGGCGGTGAAGACGGTCAACGAGTCAGCCAGTCTCCGAGAGCGGATTGAGTTCCTCAATGAGGCCTCGGTCATGAAGGGCTTCACCTGCCATCACGTGGTGCGCCTCCTGGGAGTGGTGTCCAAGGGCCAGCCCACGCTGGTGGTGATGGAGCTGATGGCTCACGGAGACCTGAAGAGCTACCTCCGTTCTCTGCGGCCAGAGGCTGAGAATAATCCTGGCCGCCCTCCCCCTACCCTTCAAGAGATGATTCAGATGGCGGCAGAGATTGCTGACGGGATGGCCTACCTGAACGCCAAGAAGTTTGTGCATCGGGACCTGGCAGCGAGAAACTGCATGGTCGCCCATGATTTTACTGTCAAAATTGGAGACTTTGGAATGACCAGAGACATCTATGAAACGGATTACTACCGGAAAGGGGGCAAGGGTCTGCTCCCTGTACGGTGGATGGCACCGGAGTCCCTGAAGGATGGGGTCTTCACCACTTCTTCTGACATGTGGTCCTTTGGCGTGGTCCTTTGGGAAATCACCAGCTTGGCAGAACAGCCTTACCAAGGCCTGTCTAATGAACAGGTGTTGAAATTTGTCATGGATGGAGGGTATCTGGATCAACCCGACAACTGTCCAGAGAGAGTCACTGACCTCATGCGCATGTGCTGGCAATTCAACCCCAACATGAGGCCAACCTTCCTGGAGATTGTCAACCTGCTCAAGGACGACCTGCACCCCAGCTTTCCAGAGGTGTCGTTCTTCCACAGCGAGGAGAACAAGGCTCCCGAGAGTGAGGAGCTGGAGATGGAGTTTGAGGACATGGAGAATGTGCCCCTGGACCGTTCCTCGCACTGTCAGAGGGAGGAGGCGGGGGGCCGGGATGGAG
GGTCCTCGCTGGGTTTCAAGCGGAGCTACGAGGAACACATCCCTTACACACACATGAACGGAGGCAAGAAAAACGGGCGGATTCTGACCTTGCCTCGGTCCAATCCTTCCTGGGCCCGGGATCCACCGGTCGCCACCGCGGAAGGATCCGTCGCCAGGCAGCCTGACCTCTTGACCTGCGACGATGAGCCGATCCATATCCCCGGTGCCATCCAACCGCATGGACTGCTGCTCGCCCTCGCCGCCGACATGACGATCGTTGCCGGCAGCGACAACCTTCCCGAACTCACCGGACTGGCGATCGGCGCCCTGATCGGCCGCTCTGCGGCCGATGTCTTCGACTCGGAGACGCACAACCGTCTGACGATCGCCTTGGCCGAGCCCGGGGCGGCCGTCGGAGCACCGATCACTGTCGGCTTCACGATGCGAAAGGACGCAGGCTTCATCGGCTCCTGGCATCGCCATGATCAGCTCATCTTCCTCGAGCTCGAGCCTCCCCAGCGGGACGTCGCCGAGCCGCAGGCGTTCTTCCGCCGCACCAACAGCGCCATCCGCCGCCTGCAGGCCGCCGAAACCTTGGAAAGCGCCTGCGCCGCCGCGGCGCAAGAGGTGCGGAAGATTACCGGCTTCGATCGGGTGATGATCTATCGCTTCGCCTCCGACTTCAGCGGCGAAGTGATCGCAGAGGATCGGTGCGCCGAGGTCGAGTCAAAACTAGGCCTGCACTATCCTGCCTCAACCGTGCCGGCGCAGGCCCGTCGGCTCTATACCATCAACCCGGTACGGATCATTCCCGATATCAATTATCGGCCGGTGCCGGTCACCCCAGACCTCAATCCGGTCACCGGGCGGCCGATTGATCTTAGCTTCGCCATCCTGCGCAGCGTCTCGCCCGTCCATCTGGAATTCATGCGCAACATAGGCATGCACGGCACGATGTCGATCTCGATTTTGCGCGGCGAGCGACTGTGGGGATTGATCGTTTGCCATCACCGAACGCCGTACTACGTCGATCTCGATGGCCGCCAAGCCTGCGAGCTAGTCGCCCAGGTTCTGGCCTGGCAGATCGGCGTGATGGAAGAGTAAGCGGCCGCGACTCTAGATCATAATCAGCACATGAGGATCACCCATGTCTGCAGGTCGACTCTAGAAAACATGAGG ATCACCCATGT(SEQ ID NO:213)
Protein: (X denotes the non-canonical amino acid)
MGTGGRRGAAAAPLLVAVAALLLGAAGHLYPGEVCPGMDIRNNLTRLHELENCSVIEGHLQILLMFKTRPEDFRDLSFPKLIMITDYLLLFRVYGLESLKDLFPNLTVIRGSRLFFNYALVIFEMVHLKELGLYNLMNITRGSVRIEKNNELCYLATIDWSRILDSVEDNYIVLNKDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCCHSECLGNCSQPDDPTKCVACRNFYLDGRCVETCPPPYYHFQDWRCVNFSFCQDLHHKCKNSRRQGCHQYVIHNNKCIPECPSGYTMNSSNLLCTPCLGPCPKVCHLLEGEKTIDSVTSAQELRGCTVINGSLIINIRGGNNLAAELEANLGLIEEISGYLKIRRSYALVSLSFFRKLRLIRGETLEIGNYSFYALDNQNLRQLWDWSKHNLTITQGKLFFHYNPKLCLSEIHKMEEVSGTKGRQERNDIALKTNGDQASCENELLKFSYIRTSFDKILLRWEPYWPPDFRDLLGFMLFYKEAPYQNVTEFDGQDACGSNSWTVVDIDPPLRSNDPKSQNHPGWLMRGLKPWTQYAIFVKTLVTFSDERRTYGAKSDIIYVQTDATNPSVPLDPISVSNSSSQIILKWKPPSDPNGNITHYLVFWERQAEDSELFELDYCLXGLKLPSRTWSPPFESEDSQKHNQSEYEDSAGECCSCPKTDSQILKELEESSFRKTFEDYLHNVVFVPRPSRKRRSLGDVGNVTVAVPTVAAFPNTSSTSVPTSPEEHRPFEKVVNKESLVISGLRHFTGYRIELQACNQDTPEERCSVAAYVSARTMPEAKADDIVGPVTHEIFENNVVHLMWQEPKEPNGLIVLYEVSYRRYGDEELHLCVSRKHFALERGCRLRGLSPGNYSVRIRATSLAGNGSWTEPTYFYVTDYLDVPSNIAKIIIGPLIFVFLFSVVIGSIYLFLRKRQPDGPLGPLYASSNPEYLSASDVFPCSVYVPDEWEVSREKITLLRELGQGSFGMVYEGNARDIIKGEAETRVAVKTVNESASLRERIEFLNEASVMKGFTCHHVVRLLGVVSKGQPTLVVMELMAHGDLKSYLRSLRPEAENNPGRPPPTLQEMIQMAAEIADGMAYLNAKKFVHRDLAARNCMVAHDFTVKIGDFGMTRDIYETDYYRKGGKGLLPVRWMAPESLKDGVFTTSSDMWSFGVVLWEITSLAEQPYQGLSNEQVLKFVMDGGYLDQPDNCPERVTDLMRMCWQFNPNMRPTFLEIVNLLKDDLHPSFPEVSFFHSEENKAPESEELEMEFEDMENVPLDRSSHCQREEAGGRDGGSSLGFKRSYEEHIPYTHMNGGKKNGRILTLPRSNPSWARDPPVATAEGSVARQPDLLTCDDEPIHIPGAIQPHGLLLALAADMTIVAGSDNLPELTGLAIGALIGRSAADVFDSETHNRLTIALAEPGAAVGAPITVGFTMRKDAGFIGSWHRHDQLIFLELEPPQRDVAEPQAFFRRTNSAIRRLQAAETLESACAAAAQEVRKITGFDRVMIYRFASDFSGEVIAEDRCAEVESKLGLHYPASTVPAQARRLYTINPVRIIPDINYRPVPVTPDLNPVTGRPIDLSFAILRSVSPVHLEFMRNIGMHGTMSISILRGERLWGLIVCHHRTPYYVDLDGRQACELVAQVLAWQIGVMEE(SEQ ID NO:214)
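As an illustration of the X notation, the following minimal Python sketch translates an in-frame excerpt around the TAG codon of SEQ ID NO:213 and writes the amber position as X. The function name `translate_with_amber` and the excerpt boundaries are choices made for this example and are not part of the disclosure; the sketch assumes the standard genetic code for all other codons.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_with_amber(dna: str, amber_symbol: str = "X") -> str:
    """Translate an in-frame DNA string, writing every TAG codon as amber_symbol.

    Hypothetical helper: it mimics the listing's convention that the amber
    (TAG) codon is decoded as a non-canonical amino acid, shown as X.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        residues.append(amber_symbol if codon == "TAG" else CODON_TABLE[codon])
    return "".join(residues)


# In-frame excerpt of SEQ ID NO:213 spanning the single TAG codon.
excerpt = "GAGCTGTTCGAGCTGGATTATTGCCTCTAGGGGCTGAAGCTGCCC"
print(translate_with_amber(excerpt))  # ELFELDYCLXGLKLP
```

The printed peptide corresponds to the stretch ELFELDYCLXGLKLP in SEQ ID NO:214.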
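Sequences - Set 2
1. Additional components:
mCherry190TAG-2xPP7: mCherry with an amber site and 2 PP7 loops (TAG codon, PP7 loops)
DNA:
Protein:
mCherry190TAG-4xPP7: mCherry with an amber site and 4 PP7 loops (TAG codon, PP7 loops)
DNA:
Protein:
mCherry190TAG-6xPP7: mCherry with an amber site and 6 PP7 loops (TAG codon, PP7 loops)
DNA:
Protein:
H2B-mCherry190TAG-2xMS2: human histone H2B type 1-J (UniProt: P06899) fused to mCherry with an amber site and 2 MS2 loops (TAG codon, MS2 loops)
Protein:
IFRS1: Methanosarcina mazei PylRS (L305M, Y306L, L309S, N346S, C348M)
DNA: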
GACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:223)
Protein:
DKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:224)
CbzRS: Methanosarcina mazei PylRS (Y306M, L309G, C348T)
DNA:
GACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:225)
Protein:
DKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:226)
CpkRS: Methanosarcina mazei PylRS (A302S)
DNA:
GATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:227)
Protein:
DKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:228)
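The substitution labels of the three PylRS variants above can be cross-checked against the listed protein sequences. The sketch below compares the residue 302-309 window of each variant with a parental window. The parental window `APNLYNYL` is an assumption inferred from the mutation labels themselves (A302, L305, Y306, L309) and from the residues that are identical in all three variants; it is not a sequence given in this listing. The helper name `list_substitutions` is likewise hypothetical.

```python
# Residue numbering follows the mutation labels used in the listing.
REF_START = 302
REF_WINDOW = "APNLYNYL"  # assumed parental residues 302-309 (inferred, see lead-in)

# Windows copied from the variant protein sequences (SEQ ID NO:224, 226, 228).
VARIANT_WINDOWS = {
    "IFRS1": "APNMLNYS",
    "CbzRS": "APNLMNYG",
    "CpkRS": "SPNLYNYL",
}


def list_substitutions(ref: str, variant: str, start: int) -> list[str]:
    """Return substitutions such as 'L305M' for two aligned, equal-length windows."""
    if len(ref) != len(variant):
        raise ValueError("windows must be aligned and of equal length")
    return [
        f"{r}{start + i}{v}"
        for i, (r, v) in enumerate(zip(ref, variant))
        if r != v
    ]


for name, window in VARIANT_WINDOWS.items():
    print(name, list_substitutions(REF_WINDOW, window, REF_START))
```

Substitutions outside this window (N346S and C348M in IFRS1, C348T in CbzRS) are not covered by the example and would need a second window around positions 346-348.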
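tRNApyl,CGA: pyrrolysyl tRNA (for the serine codon; anticodon shown in bold) (Methanosarcina mazei)
tRNApyl,CGG: pyrrolysyl tRNA (for the proline codon; anticodon shown in bold) (Methanosarcina mazei)
tRNApyl,UAA: pyrrolysyl tRNA (for the leucine codon; anticodon shown in bold) (Methanosarcina mazei)
tRNApyl,UAG: pyrrolysyl tRNA (for the leucine codon; anticodon shown in bold) (Methanosarcina mazei)
tRNApyl,CCG: pyrrolysyl tRNA (for the arginine codon; anticodon shown in bold) (Methanosarcina mazei)
tRNApyl,AUA: pyrrolysyl tRNA (for the isoleucine codon; anticodon shown in purple) (Methanosarcina mazei)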
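The relationship between a tRNA anticodon and the codon it reads can be sketched as a reverse complement, assuming strict Watson-Crick pairing and both sequences written 5' to 3'. The function name `codon_for_anticodon` is hypothetical. The example covers the CGA, CGG, UAA, UAG and CCG entries; the AUA entry is left out because its strict reverse complement is UAU rather than AUA.

```python
# RNA complement table for strict Watson-Crick pairing (no wobble).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def codon_for_anticodon(anticodon: str) -> str:
    """Return the mRNA codon (5'->3') read by an anticodon given 5'->3'."""
    return anticodon.upper().translate(COMPLEMENT)[::-1]


# Anticodons from the tRNA entries above and the amino acid whose codon they read.
ANTICODONS = {"CGA": "Ser", "CGG": "Pro", "UAA": "Leu", "UAG": "Leu", "CCG": "Arg"}

for anticodon, amino_acid in ANTICODONS.items():
    codon = codon_for_anticodon(anticodon)
    # The DNA form of the codon is what appears in reporter names such as GFP66TCG.
    print(anticodon, codon, codon.replace("U", "T"), amino_acid)
```

The DNA forms obtained this way (TCG, CCG, TTA, CTA, CGG) are the codons named in the GFP and mCherry reporter constructs of this set.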
OMeRS: pyrrolysyl-tRNA synthetase mutant: A302T, Y384F, N346V, C348W, V401L (Methanosarcina mazei)
DNA:
ATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ IDNO:235)
Protein:
MACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL(SEQ ID NO:236)
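GFP66TAG: GFP with an amber site
DNA: (MS2 stem-loop, amber codon)
GFP66TCG: GFP with a serine site
DNA: (MS2 stem-loop, serine codon)
GFP66CCG: GFP with a proline site
DNA: (MS2 stem-loop, proline codon)
GFP66CTA: GFP with a leucine site
DNA: (MS2 stem-loop, leucine codon)
GFP66TTA: GFP with a leucine site
DNA: (MS2 stem-loop, leucine codon)
GFP66ATA: GFP with an isoleucine site
DNA: (MS2 stem-loop, isoleucine codon)
GFP66CGG: GFP with an arginine site
DNA: (MS2 stem-loop, arginine codon)
GFP39TCG: GFP with a serine site
DNA: (MS2 stem-loop, serine codon)
GFP39CCG: GFP with a proline site
DNA: (MS2 stem-loop, proline codon)
GFP39CTA: GFP with a leucine site
DNA: (MS2 stem-loop, leucine codon)
GFP39CGG: GFP with an arginine site
DNA: (MS2 stem-loop, arginine codon)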
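mCherry72TAG: mCherry with an amber site
DNA: (MS2 stem-loop, amber codon)
mCherry72TCG: mCherry with a serine site
DNA: (MS2 stem-loop, serine codon)
mCherry72CCG: mCherry with a proline site
DNA: (MS2 stem-loop, proline codon)
mCherry72CTA: mCherry with a leucine site
DNA: (MS2 stem-loop, leucine codon)
mCherry72TTA: mCherry with a leucine site
DNA: (MS2 stem-loop, leucine codon)
mCherry72ATA: mCherry with an isoleucine site
DNA: (MS2 stem-loop, isoleucine codon)
mCherry185TCG: mCherry with a serine site
DNA: (serine codon)
mCherry185CCG: mCherry with a proline site
DNA: (proline codon)
mCherry185CTA: mCherry with a leucine site
DNA: (leucine codon)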
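GFP39TCG: LCK-GFP with a serine site
GFP39CCG: LCK-GFP with a proline site
GFP39CTA: LCK-GFP with a leucine site
Extended GFP39TCG: GFP with a serine site at position 39, genetically fused to GFP66CCG
Extended GFP39CCG: GFP with a proline site at position 39, genetically fused to GFP66TCG
Extended GFP39CTA: GFP with a leucine site at position 39, fused to GFP66TCG
pp7: bacteriophage PP7 RNA stem-loop (form 1)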
DNA:
GGAGCAGACGATATGGCGTCGCTCC(SEQ ID NO:289)
pp7: DNA sequence of the bacteriophage PP7 RNA stem-loop (form 2)
DNA:
CCAGCAGAGCATATGGGCTCGCTGG(SEQ ID NO:290)
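Both PP7 forms are written as DNA that encodes an RNA hairpin, so the two ends of each sequence should be reverse complements of one another. The short sketch below checks this for the outermost base pairs. The stem length of 5 nucleotides and the function names are assumptions made for the example; the listing does not annotate stem or loop boundaries.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def outer_stem_pairs(hairpin: str, stem_len: int = 5) -> bool:
    """True if the first stem_len bases can pair with the last stem_len bases."""
    return hairpin[:stem_len].upper() == reverse_complement(hairpin[-stem_len:])


PP7_FORM_1 = "GGAGCAGACGATATGGCGTCGCTCC"  # SEQ ID NO:289
PP7_FORM_2 = "CCAGCAGAGCATATGGGCTCGCTGG"  # SEQ ID NO:290

for name, seq in (("form 1", PP7_FORM_1), ("form 2", PP7_FORM_2)):
    print(name, len(seq), outer_stem_pairs(seq))
```

Both forms are 25 nucleotides long and pass the check, while differing in the stem bases themselves, which is consistent with two variants of the same hairpin.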
EBAG9: receptor-binding cancer antigen expressed on SiSo cells (Homo sapiens, UniProt: O00559)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTA
ATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCA
GTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGA
GGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCA
ACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACA
GGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGAT
ACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAG
CAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCA(SEQ ID NO:291)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEG
GNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLS(SEQ ID NO:292)
EBAG9(1-29): receptor-binding cancer antigen expressed on SiSo cells, amino acids 1-29 (Homo sapiens, UniProt: O00559)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCT(SEQ ID NO:293)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRS(SEQ ID NO:294)
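A DNA/protein pair in this listing can be checked for internal consistency by translating the DNA with the standard genetic code. The sketch below does this for SEQ ID NO:293 and SEQ ID NO:294 and also confirms that the fragment is the N-terminal stretch of full-length EBAG9 (SEQ ID NO:292). The function name `translate` and the variable names are hypothetical; only the first 39 residues of SEQ ID NO:292 are copied in.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


SEQ_293_DNA = (
    "ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTC"
    "TCATTCCTAAAGAGATTAATATGCAGATCT"
)
SEQ_294_PROTEIN = "MAITQFRLFKFCTCLATVFSFLKRLICRS"
SEQ_292_START = "MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQ"  # first 39 residues only

translated = translate(SEQ_293_DNA)
print(translated == SEQ_294_PROTEIN)          # DNA and protein entries agree
print(SEQ_292_START.startswith(translated))   # fragment is the EBAG9 N-terminus
print(len(translated))                        # number of residues in the fragment
```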
CMP-SaTr/SLC35A1: CMP-sialic acid transporter (Homo sapiens, UniProt: P78382)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACGACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAATCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAACATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTCACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTG(SEQ ID NO:295)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGV(SEQ ID NO:296)
P450 2C1(1-27): cytochrome P450 2C1, amino acids 1-27 (Oryctolagus cuniculus, UniProt: P00180)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGC(SEQ ID NO:297)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGG(SEQ ID NO:298)
P450 2C1(1-29): cytochrome P450 2C1, amino acids 1-29 (Oryctolagus cuniculus, UniProt: P00180)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCAAGCTG(SEQ ID NO:299)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGKL(SEQ ID NO:300)
EB1: microtubule-associated protein RP/EB family member 1 (Homo sapiens, UniProt: Q8WQ86)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTAT(SEQ ID NO:301)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEY(SEQ ID NO:302)
CG1: nucleoporin NUP42 (Homo sapiens, UniProt: O15504)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTT(SEQ ID NO:303)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNV(SEQ ID NO:304)
PCP: cytochrome P450 2C1 (Oryctolagus cuniculus, UniProt: P00180)
DNA:
TCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGT(SEQ ID NO:305)
Protein:
SKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGR(SEQ ID NO:306)
LAF-1: ATP-dependent RNA helicase laf-1 (RGG domain, 1-168) (Caenorhabditis elegans, UniProt: D0PV95)
DNA:
ATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGC(SEQ ID NO:307)
Protein:
MESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDG(SEQ ID NO:308)
SLP3: stomatin-like protein 3, aa 1-59 (Homo sapiens, UniProt: Q8TAV4)
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGT(SEQ ID NO:309)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYER(SEQ ID NO:310)
SYNZIP1: synthetic coiled-coil peptide 1 (synthetic)
DNA:
AATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAA(SEQ ID NO:311)
Protein:
NLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEE(SEQ ID NO:312)
SYNZIP2: synthetic coiled-coil peptide 2 (synthetic)
DNA:
GCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAG(SEQ ID NO:313)
Protein:
ARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQ(SEQ ID NO:314)
SYNZIP3: synthetic coiled-coil peptide 3 (synthetic)
DNA:
AATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAG(SEQ ID NO:315)
Protein:
NEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKK(SEQ ID NO:316)
SYNZIP4: synthetic coiled-coil peptide 4 (synthetic)
DNA:
CAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAG(SEQ ID NO:317)
Protein:
QKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAE(SEQ ID NO:318)
2. Additional fusion proteins:
EBAG9(1-29)::FUS::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:319)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:320)
EBAG9::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAA
ACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:321)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:322)
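The segment boundaries of such a fusion can be located by simple substring search. The sketch below works on an excerpt of SEQ ID NO:322 around the junction between the EBAG9 part and the PylRS part and reports 1-based positions within that excerpt. Reading `DYKDDDDK` as a FLAG epitope tag and the flanking residues as linkers is an interpretation made for this example, and the helper name `locate` is hypothetical; the listing itself does not annotate these elements.

```python
# Excerpt of SEQ ID NO:322 spanning the EBAG9 C-terminus and the PylRS N-terminus.
JUNCTION = "NKIGVKLSGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLN"

EBAG9_END = "NKIGVKLS"      # last residues of EBAG9 (SEQ ID NO:292)
FLAG_TAG = "DYKDDDDK"       # assumed to be a FLAG epitope tag
PYLRS_START = "ACPVPLQ"     # PylRS residues following the initiator methionine


def locate(sequence: str, motif: str):
    """Return the 1-based (start, end) of the first match, or None if absent."""
    index = sequence.find(motif)
    if index < 0:
        return None
    return index + 1, index + len(motif)


for label, motif in (("EBAG9 end", EBAG9_END),
                     ("FLAG tag", FLAG_TAG),
                     ("PylRS start", PYLRS_START)):
    print(label, locate(JUNCTION, motif))
```

Positions are relative to the excerpt only; running the same search on the full SEQ ID NO:322 string would give coordinates in the complete fusion protein.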
EBAG9::FUS::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAG
GAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:323)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ IDNO:324)
EBAG9::MCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:325)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:326)
EBAG9::EWSR1::MCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTAT
GCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ IDNO:327)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:328)
EBAG9(1-29)::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:329)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:330)
EBAG9(1-29)::FUS::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:331)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPY
GQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:332)
EBAG9(1-29)::MCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:333)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:334)
EBAG9(1-29)::EWSR1::MCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGA
TTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:335)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:336)
EBAG9::EWSR1::4xλN22
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTAT
GCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:337)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:338)
EBAG9(1-29)::EWSR1::4xλN22
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGG
TTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:339)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:340)
EBAG9::PylRS(AA)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAA
ACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:341)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:342)
EBAG9::PylRS(AAAF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAA
ACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:343)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:344)
EBAG9::FUS::PylRS(AA)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAG
GAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:345)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ IDNO:346)
EBAG9::FUS::PylRS(AAAF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGCAGAGGACGGAAATTAAGTGGAGACCAAATAACTTTGCCAACTACAGTTGATTATTCATCAGTTCCTAAGCAGACAGATGTTGAAGAGTGGACTTCCTGGGATGAAGATGCACCCACCAGTGTAAAGATCGAAGGAGGGAATGGGAATGTGGCAACACAACAAAATTCTTTGGAACAACTGGAACCTGACTATTTTAAGGACATGACACCAACTATTAGGAAAACTCAGAAAATTGTTATTAAGAAGAGAGAACCATTGAATTTTGGCATCCCAGATGGGAGCACAGGTTTCTCTAGTAGATTAGCAGCTACACAAGATCTGCCTTTTATTCATCAGTCTTCTGAATTAGGTGACTTAGATACCTGGCAGGAAAATACCAATGCATGGGAAGAAGAAGAAGATGCAGCCTGGCAAGCAGAAGAAGTTCTGAGACAGCAGAAACTAGCAGACAGAGAAAAGAGAGCAGCCGAACAACAAAGGAAGAAAATGGAAAAGGAAGCACAACGGCTAATGAAGAAGGAACAAAACAAAATTGGTGTGAAACTTTCAGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAG
GAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:347)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGRGRKLSGDQITLPTTVDYSSVPKQTDVEEWTSWDEDAPTSVKIEGGNGNVATQQNSLEQLEPDYFKDMTPTIRKTQKIVIKKREPLNFGIPDGSTGFSSRLAATQDLPFIHQSSELGDLDTWQENTNAWEEEEDAAWQAEEVLRQQKLADREKRAAEQQRKKMEKEAQRLMKKEQNKIGVKLSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ IDNO:348)
EBAG9(1-29)::FUS::PylRS(AA)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:349)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:350)
EBAG9(1-29)::FUS::PylRS(AAAF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:351)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:352)
EBAG9(1-29)::FUS::MCP::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGA
CGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:353)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:354)
CG1::PylRS(AF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAA
AGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:355)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:356)
CG1::PylRS(AA)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAA
AGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:357)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:358)
CG1::PylRS(AAAF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAA
AGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:359)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:360)
CG1::FUS::PylRS(AA)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGG
GCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:361)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:362)
CG1::FUS::PylRS(AAAF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGG
GCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:363)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:364)
CG1::MCP
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:365)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:366)
CG1::EWSR1::MCP
DNA:ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCC
AAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:367)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:368)
CG1::FUS::PylRS(AF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGG
GCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:369)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:370)
CG1::FUS::MCP::PylRS(AF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTG
AGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:371)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:372)
CMP-SaTr::PylRS(AF)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGT
GGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:373)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:374)
CMP-SaTr::PylRS(AA)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGT
GGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:375)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:376)
CMP-SaTr::PylRS(AAAF)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGT
GGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:377)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:378)
CMP-SaTr::FUS::PylRS(AA)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAAC
GGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:379)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:380)
CMP-SaTr::FUS::PylRS(AAAF)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAAC
GGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:381)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:382)
CMP-SaTr::MCP
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:383)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:384)
CMP-SaTr::EWSR1::MCP
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCG
AGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:385)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:386)
CMP-SaTr::PylRS(AF)EWSR1::4xλN22
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCG
AGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:387)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:388)
CMP-SaTr::FUS::PylRS(AF)
DNA:
ATGGCTGCCCCGAGAGACAATGTCACTTTATTATTCAAGTTATACTGCTTGGCAGTGATGACCCTGATGGCTGCAGTCTATACCATAGCTTTAAGATACACAAGGACATCAGACAAAGAACTCTACTTTTCAACCACAGCCGTGTGTATCACAGAAGTTATAAAGTTATTGCTAAGTGTGGGAATTTTAGCTAAAGAAACTGGTAGTCTGGGTAGATTCAAAGCATCTTTAAGAGAAAATGTCTTGGGGAGCCCCAAGGAACTGTTGAAGTTAAGTGTGCCATCGTTAGTGTATGCTGTTCAGAACAACATGGCTTTCCTAGCTCTTAGCAATCTGGATGCAGCAGTGTACCAGGTGACCTACCAGTTGAAGATTCCGTGTACTGCTTTATGCACTGTTTTAATGTTAAACCGGACACTCAGCAAATTACAGTGGGTTTCAGTTTTTATGCTGTGTGCTGGAGTTACGCTTGTACAGTGGAAACCAGCCCAAGCTACAAAAGTGGTGGTGGAACAAAATCCATTATTAGGGTTTGGCGCTATAGCTATTGCTGTATTGTGCTCAGGATTTGCAGGAGTATATTTTGAAAAAGTTTTAAAGAGTTCAGATACTTCTCTTTGGGTGAGAAACATTCAAATGTATCTATCAGGGATTATTGTGACATTAGCTGGCGTCTACTTGTCAGATGGAGCTGAAATTAAAGAAAAAGGATTTTTCTATGGTTACACATATTATGTCTGGTTTGTCATCTTTCTTGCAAGTGTTGGTGGCCTCTACACTTCTGTTGTGGTTAAGTACACAGACAATATCATGAAAGGCTTTTCTGCAGCAGCGGCCATTGTCCTTTCCACCATTGCTTCAGTAATGCTGTTTGGATTACAGATAACACTTACCTTTGCCCTGGGTACTCTTCTTGTATGTGTTTCCATATATCTCTATGGATTACCCAGACAAGACACTACATCCATCCAACAAGGAGAAACAGCTTCAAAGGAGAGAGTTATTGGTGTGGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAAC
GGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:389)
Protein:
MAAPRDNVTLLFKLYCLAVMTLMAAVYTIALRYTRTSDKELYFSTTAVCITEVIKLLLSVGILAKETGSLGRFKASLRENVLGSPKELLKLSVPSLVYAVQNNMAFLALSNLDAAVYQVTYQLKIPCTALCTVLMLNRTLSKLQWVSVFMLCAGVTLVQWKPAQATKVVVEQNPLLGFGAIAIAVLCSGFAGVYFEKVLKSSDTSLWVRNIQMYLSGIIVTLAGVYLSDGAEIKEKGFFYGYTYYVWFVIFLASVGGLYTSVVVKYTDNIMKGFSAAAAIVLSTIASVMLFGLQITLTFALGTLLVCVSIYLYGLPRQDTTSIQQGETASKERVIGVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:390)
P450 2C11-27::PylRS(AF)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQID NO:391)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:392)
P450 2C11-27::MCP
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:393)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:394)
P450 2C11-27::FUS::PylRS(AF)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCC
TAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:395)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:396)
P450 2C11-27::EWSR1::MCP
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGC
TGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:397)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:398)
P450 2C11-27::FUS::MCP::PylRS(AF)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGG
TAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:399)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:400)
P450 2C11-29::FUS::MCP::PylRS(AF)
DNA:
ATGGACCCCGTGGTCGTGCTGGGCCTGTGCCTGTCATGCCTGCTGCTGCTGAGCCTGTGGAAGCAGAGCTACGGCGGAGGCAAGCTGATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGC
TGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:401)
Protein:
MDPVVVLGLCLSCLLLLSLWKQSYGGGKLMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:402)
EB1::PylRS(AF)
DNA:ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACC
TGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:403)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:404)
EB1::PylRS(AA)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGA
AAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:405)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:406)
EB1::PylRS(AAAF)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGA
AAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:407)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:408)
EB1::FUS::PylRS(AA)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCG
AGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:409)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:410)
EB1::FUS::PylRS(AAAF)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCG
AGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:411)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:412)
EB1::MCP
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:413)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:414)
EB1::EWSR1::MCP
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCA
CATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:415)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:416)
EB1::EWSR1::4xλN22
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCA
CATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:417)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:418)
EB1::FUS::PylRS(AF)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCG
AGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:419)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:420)
EB1::FUS::MCP::PylRS(AF)
DNA:
ATGGCAGTGAACGTATACTCAACGTCAGTGACCAGTGATAACCTAAGTCGACATGACATGCTGGCCTGGATCAATGAGTCTCTGCAGTTGAATCTGACAAAGATCGAACAGTTGTGCTCAGGGGCTGCGTATTGTCAGTTTATGGACATGCTGTTCCCTGGCTCCATTGCCTTGAAGAAAGTGAAATTCCAAGCTAAGCTAGAACACGAGTACATCCAGAACTTCAAAATACTACAAGCAGGTTTTAAGAGAATGGGTGTTGACAAAATAATTCCTGTGGACAAATTAGTAAAAGGAAAGTTTCAGGACAATTTTGAATTCGTTCAGTGGTTCAAGAAGTTTTTCGATGCAAACTATGATGGAAAAGACTATGACCCTGTGGCTGCCAGACAAGGTCAAGAAACTGCAGTGGCTCCTTCCCTTGTTGCTCCAGCTCTGAATAAACCGAAGAAACCTCTCACTTCTAGCAGTGCAGCTCCCCAGAGGCCCATCTCAACACAGAGAACCGCTGCGGCTCCTAAGGCTGGCCCTGGTGTGGTGCGAAAGAACCCTGGTGTGGGCAACGGAGATGACGAGGCAGCTGAGTTGATGCAGCAGGTCAACGTATTGAAACTTACTGTTGAAGACTTGGAGAAAGAGAGGGATTTCTACTTCGGAAAGCTACGGAACATTGAATTGATTTGCCAGGAGAACGAGGGGGAAAACGACCCTGTATTGCAGAGGATTGTAGACATTCTGTATGCCACAGATGAAGGCTTTGTGATACCTGATGAAGGGGGCCCACAGGAGGAGCAAGAAGAGTATGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCG
AGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACC
TGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:421)
Protein:
MAVNVYSTSVTSDNLSRHDMLAWINESLQLNLTKIEQLCSGAAYCQFMDMLFPGSIALKKVKFQAKLEHEYIQNFKILQAGFKRMGVDKIIPVDKLVKGKFQDNFEFVQWFKKFFDANYDGKDYDPVAARQGQETAVAPSLVAPALNKPKKPLTSSSAAPQRPISTQRTAAAPKAGPGVVRKNPGVGNGDDEAAELMQQVNVLKLTVEDLEKERDFYFGKLRNIELICQENEGENDPVLQRIVDILYATDEGFVIPDEGGPQEEQEEYGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:422)
TOM20::FUS::PCP::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGG
ACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:423)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:424)
TOM20::FUS::2xPCP::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGG
ACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACC
TGTAA(SEQ ID NO:425)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVATGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:426)
TOM20::FUS::4xλN22::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:427)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:428)
LCK::FUS::2xPCP::CbzRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAAC
CATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:429)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVATGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:430)
LCK::FUS::PCP::CbzRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCC
TGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:431)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:432)
TOM20::FUS::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGG
ACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:433)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:434)
TOM20::FUS::2xPCP::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGG
ACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACC
TGTAA(SEQ ID NO:435)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVATGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:436)
TOM20::FUS::4xλN22::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:437)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:438)
EBAG91-29::FUS::PCP::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGAC
CAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:439)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:440)
EBAG91-29::FUS::4xλN22::IFRS1
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAA
AGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:441)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:442)
KIF16B::EWSR1::Myc::2xPCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATT
CCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:443)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:444)
KIF16B::EWSR1::HA::2xPCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATT
CCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATACCCCTACGACGTGCCCGACTACGCCATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:445)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:446)
EBAG91-29::EWSR1::Myc::2xPCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGAGCAGAAGCTGATCTCAGA
GGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:447)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:448)
EBAG91-29::EWSR1::HA::2xPCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATACCCCTACGACGTGCCCGA
CTACGCCATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:449)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:450)
LCK::CbzRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:451)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:452)
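A paired DNA and protein entry such as the LCK::CbzRS pair can be cross-checked by translating the DNA with the standard genetic code and comparing the result with the listed protein. The following is a minimal sketch and not part of the original disclosure: the function name `translate` and the variable `prefix_451` are hypothetical, and only the first 42 nucleotides of SEQ ID NO:451 are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in reading frame 1; stop codons become '*'.

    Hypothetical helper for illustration. Trailing bases that do not
    fill a complete codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 14 codons of SEQ ID NO:451 as listed above.
prefix_451 = "ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTC"
print(translate(prefix_451))  # MGCVCSSNPEGTEL
```

The printed prefix matches the first 14 residues of SEQ ID NO:452. The same check can be run on a full-length entry once any line breaks inside the DNA listing have been removed.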
LCK::FUS::CbzRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:453)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:454)
TOM20::FUS::SYNZIP1::CpkRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAAT
CAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:455)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:456)
KIF16B::FUS::CbzRS
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACG
GCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:457)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:458)
EBAG91-29::FUS::CpkRS
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:459)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:460)
TOM20::FUS::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:461)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:462)
EBAG91-29::FUS::CbzRS
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:463)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:464)
TOM20::FUS::SYNZIP1::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAAT
CAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:465)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:466)
KIF16B::FUS::CpkRS
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACG
GCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:467)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:468)
LCK::FUS::CpkRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:469)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQID NO:470)
LCK::CpkRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:471)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:472)
TOM20::FUS::SYNZIP3::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:473)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:474)
TOM20::FUS::SYNZIP3::CpkRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTTCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:475)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLSPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:476)
TOM20::EWSR1::PylRS(AA)::FUS::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCCCCGACCGCGTTAGAGCCGTATCCCACTGGTCTTCCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGGCCTCAAACGATTATACCCAACAAGCAA
CCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTT
CTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:477)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSPDRVRAVSHWSSACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEV
LLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:478)
LCK::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCAT
GAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACT
TCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:479)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:480)
LCK::FUS::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTG
ACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCT
AGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:481)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELS
SAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:482)
LCK::FUS::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCT
CCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:483)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:484)
TOM20::FUS::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTT
CCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATC
GGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:485)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEI
GPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:486)
TOM20::FUS::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCA
TCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:487)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:488)
TOM20::EWSR1::4xλN22::PylRS(AA)::FUS::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGGGCTAAGCCCCGACCGCGTTAGAGCCGTATCCCACTGGTCTTCCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGAT
TCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGT
ATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:489)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLGLSPDRVRAVSHWSSACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISS
ISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:490)
LCK::EWSR1::MCP::PylRS(AA)::FUS::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGG
AGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCCCCGACCGCGTTAGAGCCGTATCCCACTGGTCTTCCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATG
GAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGAT
CGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:491)
Protein:
MGCVCSSNPEGTELMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSPDRVRAVSHWSSACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVD
RGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:492)
LCK::PylRS(AA)::FUS::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCAT
GAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACT
TCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:493)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:494)
LCK::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCC
AGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAAC
TGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:495)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:496)
TOM20::FUS::MCP::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGC
AATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTC
AATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACA
AAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:497)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALT
KSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:498)
TOM20::FUS::4xλN22::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTG
CTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCG
GCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:499)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSP
AIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:500)
TOM20::FUS::SYNZIP1::MCP::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGT
AACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCA
CTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCT
ACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:501)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVS
TQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:502)
TOM20::FUS::SYNZIP2::MCP::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGC
TTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATG
ATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATT
CCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:503)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAI
PVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:504)
LCK::FUS::SYNZIP1::MCP::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGA
TGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACA
ACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTG
GAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:505)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRL
EVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:506)
LCK::FUS::SYNZIP2::MCP::PylRS(AF)::EWSR1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCT
CCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAG
GGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACC
GATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:507)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQT
DRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:508)
LCK::PylRS(AA)::EWSR1::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCC
AGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAAC
TGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:509)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:510)
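Each construct in this listing pairs a DNA sequence with the protein it encodes. The following is a minimal sketch of how such a pair could be cross-checked by translating the DNA in frame with the standard genetic code. It is not part of the disclosure: the function name `translate` and the table names are hypothetical, and the sketch assumes that the open reading frame starts at the first nucleotide and that the first stop codon ends the protein, written as `*` as in the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping after the first stop codon ('*')."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# The first 14 codons shared by the LCK-prefixed constructs in this listing.
lck_prefix = "ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTC"
print(translate(lck_prefix))
```

Applied to a complete DNA entry (with the wrapped lines joined), the result can be compared character by character with the corresponding protein entry.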
LCK::FUS::PylRS(AA)::EWSR1::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTG
ACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCT
AGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:511)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELS
SAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:512)
TOM20::FUS::PylRS(AA)::EWSR1::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTT
CCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATC
GGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:513)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEI
GPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:514)
TOM20::FUS::MCP::PylRS(AA)::EWSR1::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGC
AATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTC
AATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACA
AAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:515)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALT
KSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:516)
TOM20::FUS::4xλN22::PylRS(AA)::EWSR1::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTG
CTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCG
GCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:517)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSP
AIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:518)
LCK::EWSR1::MCP::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGG
AGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCTATACAGATATTGAAATGAACAGATTGGGAAAGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAA
CTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGT
GGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:519)
Protein:
MGCVCSSNPEGTELMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSYTDIEMNRLGKACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDR
GFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:520)
TOM20::EWSR1::4xλN22::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGGGCTAAGCTATACAGATATTGAAATGAACAGATTGGGAAAGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTG
GTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATT
AGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:521)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLGLSYTDIEMNRLGKACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSI
STGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:522)
TOM20::EWSR1::MCP::PylRS(AF)::FUS::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCTATACAGATATTGAAATGAACAGATTGGGAAAGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGGCCTCAAACGATTATACCCAACAAGCAACCC
AAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTG
CTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:523)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSYTDIEMNRLGKACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVL
LNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:524)
LCK::PylRS(AA)::FUS::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCAT
GAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACT
TCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:525)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNLASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:526)
EBAG91-29::EWSR1::SYNZIP4::4xλN22
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAA
AAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:527)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:528)
KIF16B::FUS::SYNZIP1::PylRS(AF)
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAG
GTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:529)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:530)
KIF16B::FUS::SYNZIP1::PylRS(AA)
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAG
GTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:531)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:532)
EBAG91-29::EWSR1::SYNZIP2::MCP
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAG
AAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:533)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:534)
TOM20::EWSR1::SYNZIP2::MCP
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:535)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:536)
TOM20::FUS::SYNZIP4::4xλN22::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGTTAATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:537)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAELIAGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:538)
KIF16B::EWSR1::SYNZIP4::4xλN22
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATT
CCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:539)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:540)
TOM20::EWSR1::SYNZIP4::4xλN22
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:541)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:542)
TOM20::FUS::SYNZIP1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAAT
CAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:543)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:544)
TOM20::FUS::SYNZIP3::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:545)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:546)
EBAG91-29::FUS::SYNZIP1::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACG
TGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:547)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:548)
EBAG91-29::FUS::SYNZIP3::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCA
CAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:549)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:550)
TOM20::FUS::SYNZIP1::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAAT
CAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:551)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:552)
TOM20::FUS::SYNZIP3::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:553)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:554)
TOM20::FUS::SYNZIP3::PylRS(AAAF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:555)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:556)
LCK::FUS::SYNZIP3::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAAC
AAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:557)
Protein:
MGCVCSSNPEGTELMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:558)
LCK::SYNZIP1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:559)
Protein:
MGCVCSSNPEGTELAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:560)
LCK::SYNZIP3::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:561)
Protein:
MGCVCSSNPEGTELAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:562)
SYNZIP2::MCP
DNA:
ATGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:563)
Protein:
MAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:564)
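Each construct in this listing is given as a DNA sequence followed by the protein it encodes. A minimal sketch of how such a pair could be cross-checked is shown below, assuming the standard genetic code and a reading frame that starts at the first nucleotide. The function name `translate` and the table layout are illustrative choices, not part of the original disclosure; the two test fragments are the first 30 and last 9 nucleotides of SEQ ID NO:563.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the constructs use the standard code; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of SEQ ID NO:563 (SYNZIP2::MCP), copied from the listing.
start_fragment = "ATGGCGATCGCAGCTAGAAACGCCTACCTG"
# Last three codons of SEQ ID NO:563, including the TAA stop codon.
end_fragment = "ATCTACTAA"

print(translate(start_fragment))
print(translate(end_fragment))
```

Applied to a full DNA entry, the result can be compared character by character with the protein entry that follows it; a mismatch would point to a transcription error in one of the two sequences.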
LCK::EWSR1::SYNZIP2::MCP
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAG
AGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:565)
Protein:
MGCVCSSNPEGTELMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:566)
LCK::EWSR1::SYNZIP4::4xλN22
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAA
CAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:567)
Protein:
MGCVCSSNPEGTELMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:568)
LCK::SYNZIP2::MCP
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:569)
Protein:
MGCVCSSNPEGTELAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:570)
EWSR1::SYNZIP2::MCP
DNA:
ATGATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAG
AGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:571)
Protein:
MMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:572)
LCK::FUS::SYNZIP1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATT
CCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:573)
Protein:
MGCVCSSNPEGTELMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:574)
LCK::FUS::SYNZIP3::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAA
TGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:575)
Protein:
MGCVCSSNPEGTELMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:576)
TOM20::EWSR1::SYNZIP4::2xPCP
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGTTAATCGCAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:577)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAELIAEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:578)
TOM20::EWSR1::SYNZIP2::2xPCP
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTTAATCGCATACCCCTACGACGTGCCCGACTACGCCATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:579)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQLIAYPYDVPDYAIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:580)
KIF16B::EWSR1::SYNZIP2::MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATT
CCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:581)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:582)
LCK::SYNZIP1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:583)
Protein:
MGCVCSSNPEGTELAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:584)
LCK::FUS::SYNZIP1::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGC
CAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:585)
Protein:
MGCVCSSNPEGTELMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:586)
SYNZIP4::4xλN22
DNA:
ATGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:587)
Protein:
MAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:588)
TOM20::FUS::SYNZIP1::MCP::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGT
AACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:589)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:590)
TOM20::FUS::SYNZIP2::MCP::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGC
TTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ IDNO:591)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:592)
TOM20::FUS::SYNZIP1::MCP::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGT
AACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:593)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:594)
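Each construct in this listing is given as a DNA entry followed by its protein entry, with a trailing `*` marking the stop codon. The following is a minimal sketch, not part of the original disclosure, of how such a DNA/protein pair could be checked for agreement by translating the DNA with the standard genetic code. The function name `translate` and the table construction are illustrative assumptions. Whitespace is stripped first because several DNA entries in this listing are broken across two lines.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 1; stop codons become '*'.

    Hypothetical helper for illustration only. Line breaks and spaces are
    removed so that a sequence split across lines can be passed as-is.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First five codons and last three codons of the DNA entry above.
print(translate("ATGGTGGGTCGGAAC"))  # start of the protein entry
print(translate("AACCTGTAA"))        # end of the protein entry, incl. stop
```

For a full check, the complete DNA entry (both lines joined) would be passed to `translate` and the result compared with the protein entry with the `(SEQ ID NO:…)` suffix removed.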
TOM20::FUS::SYNZIP2::MCP::PylRS(AA)
DNA:ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCAC
AGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQID NO:595)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:596)
TOM20::FUS::SYNZIP::MCP::IFRS1
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGT
AACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:597)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:598)
TOM20::FUS::SYNZIP2::MCP::IFRS1
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGC
TTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ IDNO:599)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:600)
TOM20::FUS::SYNZIP3::4xλN22::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGA
CGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:601)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:602)
EBAG91-29::FUS::SYNZIP3::PCP::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGG
ACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTTTAA(SEQ ID NO:603)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:604)
EBAG91-29::FUS::SYNZIP4::PylRS(AF)
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGTTAATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTA
TCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:605)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAELIAGAPGSAGSAAGSGASIEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLAGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:606)
EBAG91-29::FUS::SYNZIP3::4xλN22::IFRS1
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:607)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:608)
LCK::FUS::SYNZIP1::MCP::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGA
TGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:609)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:610)
LCK::FUS::SYNZIP2::MCP::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCT
CCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:611)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:612)
CG1::FUS::SYNZIP1::MCP::PylRS(AF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCAC
TGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:613)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:614)
CG1::FUS::SYNZIP2::MCP::PylRS(AF)
DNA:
ATGGCCATTTGTCAATTCTTCCTTCAAGGCCGGTGCCGCTTTGGAGATCGGTGCTGGAACGAACATCCCGGTGCTAGGGGTGCAGGAGGAGGACGGCAGCAACCGCAGCAGCAGCCTTCAGGTAATAATAGACGTGGATGGAATACAACTAGCCAGAGATATTCCAATGTCATCCAGCCATCCAGTTTCTCCAAATCCACACCATGGGGGGGCAGCAGAGATCAAGAAAAGCCATATTTCAGTTCTTTTGATTCTGGAGCTTCAACTAACAGGAAGGAAGGCTTTGGATTGTCTGAGAACCCATTTGCTTCACTTAGTCCTGATGAGCAGAAAGATGAAAAGAAACTTCTGGAAGGAATTGTAAAAGATATGGAGGTTTGGGAATCATCAGGGCAGTGGATGTTTTCTGTTTATTCACCAGTGAAAAAGAAACCTAATATTTCAGGTTTTACAGACATTTCACCAGAGGAATTGAGGCTTGAATACCATAACTTCTTAACCAGCAATAACTTACAGAGTTATCTAAATTCTGTCCAACGTTTAATAAATCAATGGAGGAACAGGGTAAATGAACTGAAAAGTCTAAATATATCAACTAAAGTAGCTTTGCTCTCTGATGTAAAGGATGGAGTAAATCAAGCAGCACCTGCATTTGGATTTGGCAGCAGTCAAGCAGCAACATTTATGTCGCCAGGCTTTCCAGTCAATAACAGCAGCAGTGATAATGCTCAGAACTTTAGTTTTAAAACAAACTCTGGATTTGCTGCTGCCTCTTCTGGAAGCCCTGCTGGTTTTGGGAGTTCCCCAGCATTTGGAGCTGCAGCCTCTACCAGTTCAGGTATCTCTACTTCTGCTCCAGCTTTTGGATTTGGGAAGCCTGAAGTCACATCGGCTGCATCATTTTCATTCAAAAGCCCTGCAGCTTCCAGTTTTGGATCACCTGGATTTTCAGGACTTCCAGCTTCCTTGGCAACAGGTCCTGTCAGAGCTCCAGTGGCCCCAGCCTTTGGAGGTGGCAGTTCTGTGGCTGGTTTTGGTAGTCCGGGCTCACATTCTCACACTGCTTTTTCTAAGCCATCCAGTGACACTTTTGGAAATAGCAGCATATCCACTTCTCTGTCAGCCTCAAGCAGCATCATTGCAACAGATAATGTGTTATTCACACCCAGAGATAAACTAACAGTAGAAGAACTGGAACAATTTCAATCCAAGAAATTTACTCTGGGAAAAATTCCATTAAAGCCTCCACCTCTGGAACTTCTAAATGTTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGG
TTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAG
CTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:615)
Protein:
MAICQFFLQGRCRFGDRCWNEHPGARGAGGGRQQPQQQPSGNNRRGWNTTSQRYSNVIQPSSFSKSTPWGGSRDQEKPYFSSFDSGASTNRKEGFGLSENPFASLSPDEQKDEKKLLEGIVKDMEVWESSGQWMFSVYSPVKKKPNISGFTDISPEELRLEYHNFLTSNNLQSYLNSVQRLINQWRNRVNELKSLNISTKVALLSDVKDGVNQAAPAFGFGSSQAATFMSPGFPVNNSSSDNAQNFSFKTNSGFAAASSGSPAGFGSSPAFGAAASTSSGISTSAPAFGFGKPEVTSAASFSFKSPAASSFGSPGFSGLPASLATGPVRAPVAPAFGGGSSVAGFGSPGSHSHTAFSKPSSDTFGNSSISTSLSASSSIIATDNVLFTPRDKLTVEELEQFQSKKFTLGKIPLKPPPLELLNVGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:616)
TOM20::FUS::SYNZIP4::λN22::CbzRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGTTAATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGC
TGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATCTGATGAACTATGGACGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTACACAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:617)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAELIAGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLMNYGRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFTQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:618)
EBAG9(1-29)::FUS::SYNZIP4::4xλN22::IFRS1
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGTTAATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACA
AGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTTGCACCAAATATGCTGAACTATAGCCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGTCTTTTATGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:619)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAELIAGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNMLNYSRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLSFMQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:620)
TOM20::FUS::SYNZIP3::4xλN22::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGA
CGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTGGTGCTCCTGGTTCAGCAGGAAGCGCAGCAGGATCAGGTGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:621)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:622)
LCK::FUS::SYNZIP1::MCP::PylRS(AA)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGA
TGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:623)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:624)
TOM20::EWSR1::SYNZIP4::4xλN22::SYNZIP4::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTCTGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTCAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCG
ACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:625)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVSGLSGAPGSAGSAAGSGQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:626)
TOM20::FUS::SYNZIP1::MCP::SYNZIP1::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGCATCGATATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGT
AACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:627)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEASIYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSGAPGSAGSAAGSGNLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:628)
TOM20::EWSR1::SYNZIP4::4xλN22::SYNZIP4::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTCTGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTCAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCG
ACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACCTGTAA(SEQ ID NO:629)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVSGLSGAPGSAGSAAGSGQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:630)
TOM20::FUS::SYNZIP3::4xλN22::SYNZIP3::PylRS(AA)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGA
CGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTGGCACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGCCTTTGCCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTATGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTTGGACCAATTCCGCTGGACCGTGAGTGGGGTATCGACAAACCGTGGATCGGAGCAGGATTCGGTCTGGAACGCCTGCTGAAAGTGAAACACGACTTCAAAAACATCAAACGTGCCGCCCGTTCTGAATCGTATTATAACGGGATCTCTACGAACC
TGTAA(SEQ ID NO:631)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGLSGAPGSAGSAAGSGNEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLAFAQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVYGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:632)
LCK::EWSR1::SYNZIP4::4xλN22::SYNZIP4::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCACAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAA
CAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTCTGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTCAAAAGGTGGCTGAACTGAAAAATAGAGTGGCCGTGAAGCTGAACCGGAACGAGCAGCTGAAGAACAAGGTGGAAGAGCTGAAGAACAGAAACGCCTACCTGAAGAATGAGCTGGCCACCCTGGAAAACGAGGTGGCCAGACTGGAAAACGACGTGGCCGAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACG
GTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:633)
Protein:
MGCVCSSNPEGTELMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPVSGLSGAPGSAGSAAGSGQKVAELKNRVAVKLNRNEQLKNKVEELKNRNAYLKNELATLENEVARLENDVAEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:634)
LCK::FUS::SYNZIP3::4xλN22::SYNZIP3::PylRS(AF)
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGC
TCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:635)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGLSGAPGSAGSAAGSGNEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:636)
TOM20::FUS::SYNZIP3::4xλN22::SYNZIP3::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCATCGATAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGA
CGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACC
TGTAA(SEQ ID NO:637)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGAPGSAGSAAGSGASIEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGLSGAPGSAGSAAGSGNEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:638)
TOM20::EWSR1::SYNZIP2::MCP::SYNZIP2::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAGTTCTTCATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGG
TGGCTTCCGTGGTGGCCGGGGCATGGACCGAGGTGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGTATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGGCTAAGCGGTGCTCCGGGGTCAGCCGGAAGTGCAGCAGGATCAGGTGCTAGAAACGCCTACCTGAGAAAGAAAATCGCCAGACTGAAGAAGGACAACCTGCAGCTGGAAAGAGACGAGCAGAACCTGGAAAAGATCATCGCCAACCTCAGAGATGAGATCGCCAGACTGGAAAACGAGGTGGCCAGCCACGAGCAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGACAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTA
AAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:639)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGLSGAPGSAGSAAGSGARNAYLRKKIARLKKDNLQLERDEQNLEKIIANLRDEIARLENEVASHEQGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:640)
LCK::OMeRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:641)
Protein:
MGCVCSSNPEGTELACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:642)
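Each construct in this listing is given as a DNA sequence followed by the protein it encodes, so a pair can be spot-checked by translating the DNA with the standard genetic code. The sketch below is an illustration only and not part of the filed listing. It assumes the standard (nuclear) codon table and uses a hypothetical helper named `translate`. It checks the first 14 codons and the last 5 codons of the LCK::OMeRS DNA (SEQ ID NO:641) against the corresponding ends of the protein (SEQ ID NO:642).

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: the constructs are translated with the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon (hypothetical helper)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 42 nt and last 15 nt of the LCK::OMeRS DNA (SEQ ID NO:641), copied from the listing.
dna_start = "ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTC"
dna_end = "TCTACTAACCTGTAA"

# Matching ends of the LCK::OMeRS protein (SEQ ID NO:642).
protein_start = "MGCVCSSNPEGTEL"
protein_end = "STNL*"

assert translate(dna_start) == protein_start
assert translate(dna_end) == protein_end
```

The same check can be run on a full DNA/protein pair once a DNA sequence that is wrapped over several lines in this listing has been joined into one string.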
TOM20::FUS::OMeRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:643)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:644)
KIF16B::FUS::OMeRS
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACATATACAGATATTGAAATGAACAGATTGGGAAAGGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGG
TCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGATTACAAGGATGACGACGATAAGGGTACCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACG
GCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:645)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTYTDIEMNRLGKGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGDYKDDDDKGTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:646)
LCK::FUS::OMeRS
DNA:
ATGGGCTGCGTGTGCAGCAGCAACCCCGAGGGTACCGAGCTCGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGGCGCCCCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGT
TTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:647)
Protein:
MGCVCSSNPEGTELASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGGAPGSAGSAAGSGMACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:648)
EBAG91-29::FUS::OMeRS
DNA:
ATGGCCATCACCCAGTTTCGGTTATTTAAATTTTGTACCTGCCTAGCAACAGTATTCTCATTCCTAAAGAGATTAATATGCAGATCTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAA
AAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAGCAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:649)
Protein:
MAITQFRLFKFCTCLATVFSFLKRLICRSGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:650)
TOM20::FUS::SYNZIP1::OMeRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATCTGGTGGCCCAGCTGGAAAACGAGGTGGCCAGCCTGGAAAACGAGAACGAAACCCTGAAGAAAAAGAACCTGCACAAGAAGGACCTGATCGCCTACCTGGAAAAGGAAATCGCCAACCTGAGAAAGAAGATCGAGGAAGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAAT
CAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:651)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANLVAQLENEVASLENENETLKKKNLHKKDLIAYLEKEIANLRKKIEEGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:652)
TOM20::FUS::SYNZIP3::OMeRS
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCAAATGAGGTGACCACCCTGGAAAACGACGCCGCCTTCATCGAGAACGAGAACGCCTACCTGGAAAAAGAGATCGCCAGACTGAGAAAGGAAAAGGCCGCTCTGCGGAACAGACTGGCCCACAAGAAGGGCAAGCCTATTCCCAACCCCCTGCTGGGCCTGGATAGCACCGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGT
TAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAACACCAAATCTGTATAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGGTCTTTTGGCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCCTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:653)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIANEVTTLENDAAFIENENAYLEKEIARLRKEKAALRNRLAHKKGKPIPNPLLGLDSTGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLTPNLYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLVFWQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSALVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:654)
SLP3::FUS::PylRS(AF)
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGTGGAGCACCCGGCTCCGCCGGCTCCGCCGCCGGCTCCGGCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGTGGTGCGATCGCAGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTC
CGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:655)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYERGAPGSAGSAAGSGMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:656)
SLP3::MCP
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGTGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:657)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYERAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:658)
SLP3::EWSR1::MCP
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGG
TGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:659)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYERMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:660)
SLP3::EWSR1::4xλN22
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGTATGGCGTCCACGGATTACAGTACCTATAGCCAAGCTGCAGCGCAGCAGGGCTACAGTGCTTACACCGCCCAGCCCACTCAAGGATATGCACAGACCACCCAGGCATATGGGCAACAAAGCTATGGAACCTATGGACAGCCCACTGATGTCAGCTATACCCAGGCTCAGACCACTGCAACCTATGGGCAGACCGCCTATGCAACTTCTTATGGACAGCCTCCCACTGGTTATACTACTCCAACTGCCCCCCAGGCATACAGCCAGCCTGTCCAGGGGTATGGCACTGGTGCTTATGATACCACCACTGCTACAGTCACCACCACCCAGGCCTCCTATGCAGCTCAGTCTGCATATGGCACTCAGCCTGCTTATCCAGCCTATGGGCAGCAGCCAGCAGCCACTGCACCTACAAGACCGCAGGATGGAAACAAGCCCACTGAGACTAGTCAACCTCAATCTAGCACAGGGGGTTACAACCAGCCCAGCCTAGGATATGGACAGAGTAACTACAGTTATCCCCAGGTACCTGGGAGCTACCCCATGCAGCCAGTCACTGCACCTCCATCCTACCCTCCTACCAGCTATTCCTCTACACAGCCGACTAGTTATGATCAGAGCAGTTACTCTCAGCAGAACACCTATGGGCAACCGAGCAGCTATGGACAGCAGAGTAGCTATGGTCAACAAAGCAGCTATGGGCAGCAGCCTCCCACTAGTTACCCACCCCAAACTGGATCCTACAGCCAAGCTCCAAGTCAATATAGCCAACAGAGCAGCAGCTACGGGCAGCAGAGTTCATTCCGACAGGACCACCCCAGTAGCATGGGTGTTTATGGGCAGGAGTCTGGAGGATTTTCCGGACCAGGAGAGAACCGGAGCATGAGTGGCCCTGATAACCGGGGCAGGGGAAGAGGGGGATTTGATCGTGGAGGCATGAGCAGAGGTGGGCGGGGAGGAGGACGCGGTGGAATGGGCAGCGCTGGAGAGCGAGGTGGCTTCAATAAGCCTGGTGGACCCATGGATGAAGGACCAGATCTTGATCTAGGCCCACCTGTAGATCCAGATGAAGACTCTGACAACAGTGCAATTTATGTACAAGGATTAAATGACAGTGTGACTCTAGATGATCTGGCAGACTTCTTTAAGCAGTGTGGGGTTGTTAAGATGAACAAGAGAACTGGGCAACCCATGATCCACATCTACCTGGACAAGGAAACAGGAAAGCCCAAAGGCGATGCCACAGTGTCCTATGAAGACCCACCTACTGCCAAGGCTGCCGTGGAATGGTTTGATGGGAAAGATTTTCAAGGGAGCAAACTTAAAGTCTCCCTTGCTCGGAAGAAGCCTCCAATGAACAGTATGCGGGGTGGTCTGCCACCCCGTGAGGGCAGAGGCATGCCACCACCACTCCGTGGAGGTCCAGGAGGCCCAGGAGGTCCTGGGGGACCCATGGGTCGCATGGGAGGCCGTGGAGGAGATAGAGGAGGCTTCCCTCCAAGAGGACCCCGGGGTTCCCGAGGGAACCCCTCTGGAGGAGGAAACGTCCAGCACCGAGCTGGAGACTGGCAGTGTCCCAATCCGGGTTGTGGAAACCAGAACTTCGCCTGGAGAACAGAGTGCAACCAGTGTAAGGCCCCAAAGCCTGAAGGCTTCCTCCCGCCACCCTTTCCGCCCCCGGGTGGTGATCGTGGCAGAGGTGGCCCTGGTGGCATGCGGGGAGGAAGAGGTGGCCTCATGGATCGTGGTGGTCCCGGTGGAATGTTCAGAGGTGGCCGTGGTGGAGACAGAGGTGGCTTCCGTGGTGGCCGGGGCATGGACCGAGG
TGGCTTTGGTGGAGGAAGACGAGGTGGCCCTGGGGGGCCCCCTGGACCTTTGATGGAACAGGCGATCGCAGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGAGCAGAAGCTGATCTCAGAGGAGGACCTGCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGACGGAGCCGGAGCTGGCGCTGGAGCTGGAGCCGGAGCTGGCGGTCTAGCCACCATGGACGCACAAACACGACGACGTGAGCGTCGCGCTGAGAAACAAGCTCAATGGAAAGCTGCAAACCCACCGCTCGAGTCTAGAGGGCCCGTTTAA(SEQ ID NO:661)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYERMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQSSFRQDHPSSMGVYGQESGGFSGPGENRSMSGPDNRGRGRGGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIYVQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAKAAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHRAGDWQCPNPGCGNQNFAWRTECNQCKAPKPEGFLPPPFPPPGGDRGRGGPGGMRGGRGGLMDRGGPGGMFRGGRGGDRGGFRGGRGMDRGGFGGGRRGGPGGPPGPLMEQAIAGAPGSAGSAAGSGEQKLISEEDLLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLDGAGAGAGAGAGAGGLATMDAQTRRRERRAEKQAQWKAANPPLESRGPV*(SEQ ID NO:662)
SLP3::PylRS(AF)
DNA:
ATGGATTCTAGGGTGTCTTCACCTGAGAAGCAAGATAAAGAGAATTTCGTGGGTGTCAACAATAAACGGCTTGGTGTATGTGGCTGGATCCTGTTTTCCCTCTCTTTCCTGTTGGTGATCATTACCTTCCCCATCTCCATATGGATGTGCTTGAAGATCATTAAGGAGTATGAACGTGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQID NO:663)
Protein:
MDSRVSSPEKQDKENFVGVNNKRLGVCGWILFSLSFLLVIITFPISIWMCLKIIKEYERGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:664)
TOM20::FUS::MCP::PylRS(AF)
DNA:
ATGGTGGGTCGGAACAGCGCCATCGCCGCCGGTGTATGCGGGGCCCTTTTCATTGGGTACTGCATCTACTTCGACCGCAAAAGACGAAGTGACCCCAACTTCAAGAACAGGCTTCGAGAACGAAGAAAGAAACAGAAGCTTGCCAAGGAGAGAGCTGGGCTTTCCAAGTTACCTGACCTTAAAGATGCTGAAGCTGTTCAGAAATTCTTCATGGCCTCAAACGATTATACCCAACAAGCAACCCAAAGCTATGGGGCCTACCCCACCCAGCCCGGGCAGGGCTATTCCCAGCAGAGCAGTCAGCCCTACGGACAGCAGAGTTACAGTGGTTATAGCCAGTCCACGGACACTTCAGGATATGGCCAGAGCAGCTATTCTTCTTATGGCCAGAGCCAGAACACAGGCTATGGAACTCAGTCAACTCCCCAGGGATATGGCTCGACTGGCGGCTATGGCAGTAGCCAGAGCTCCCAATCGTCTTACGGGCAGCAGTCCTCCTACCCTGGCTATGGCCAGCAGCCAGCTCCCAGCAGCACCTCGGGAAGTTACGGTAGCAGTTCTCAGAGCAGCAGCTATGGGCAGCCCCAGAGTGGGAGCTACAGCCAGCAGCCTAGCTATGGTGGACAGCAGCAAAGCTATGGACAGCAGCAAAGCTATAATCCCCCTCAGGGCTATGGACAGCAGAACCAGTACAACAGCAGCAGTGGTGGTGGAGGTGGAGGTGGAGGTGGAGGTAACTATGGCCAAGATCAATCCTCCATGAGTAGTGGTGGTGGCAGTGGTGGCGGTTATGGCAATCAAGACCAGAGTGGTGGAGGTGGCAGCGGTGGCTATGGACAGCAGGACCGTGGAGGCCGCGGCAGGGGTGGCAGTGGTGGCGGCGGCGGCGGCGGCGGTGGTGGTTACAACCGCAGCAGTGGTGGCTATGAACCCAGAGGTCGTGGAGGTGGCCGTGGAGGCAGAGGTGGCATGGGCGGAAGTGACCGTGGTGGCTTCAATAAATTTGGTGGCCCTCGGGACCAAGGATCACGTCATGACTCCGAACAGGATAATTCAGACAACAACACCATCTTTGTGCAAGGCCTGGGTGAGAATGTTACAATTGAGTCTGTGGCTGATTACTTCAAGCAGATTGGTATTATTAAGACAAACAAGAAAACGGGACAGCCCATGATTAATTTGTACACAGACAGGGAAACTGGCAAGCTGAAGGGAGAGGCAACGGTCTCTTTTGATGACCCACCTTCAGCTAAAGCAGCTATTGACTGGTTTGATGGTAAAGAATTCTCCGGAAATCCTATCAAGGTCTCATTTGCTACTCGCCGGGCAGACTTTAATCGGGGTGGTGGCAATGGTCGTGGAGGCCGAGGGCGAGGAGGACCCATGGGCCGTGGAGGCTATGGAGGTGGTGGCAGTGGTGGTGGTGGCCGAGGAGGATTTCCCAGTGGAGGTGGTGGCGGTGGAGGACAGCAGCGAGCTGGTGACTGGAAGTGTCCTAATCCCACCTGTGAGAATATGAACTTCTCTTGGAGGAATGAATGCAACCAGTGTAAGGCCCCTAAACCAGATGGCCCAGGAGGGGGACCAGGTGGCTCTCACATGGGGGGTAACTACGGGGATGATCGTCGTGGTGGCAGAGGAGGCGCGATCGCATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGC
AATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACGGCGCCGATTACAAGGACGATGATGACAAGGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:665)
Protein:
MVGRNSAIAAGVCGALFIGYCIYFDRKRRSDPNFKNRLRERRKKQKLAKERAGLSKLPDLKDAEAVQKFFMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDNNTIFVQGLGENVTIESVADYFKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGNPIKVSFATRRADFNRGGGNGRGGRGRGGPMGRGGYGGGGSGGGGRGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPDGPGGGPGGSHMGGNYGDDRRGGRGGAIAYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIYGADYKDDDDKGAPGSAGSAAGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:666)
KIF16B::1xLAF-1::PylRS(AF)
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCG
TGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:667)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ IDNO:668)
KIF16B::1xLAF-1::MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGG
CGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:669)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:670)
KIF16B::1xLAF-1::2xPCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGCGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAA
AGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:671)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:672)
KIF16B::2xLAF-1::2xPCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGCGGAATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGG
ATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGCGAGCAGAAGCTGATCTCAGAGGAGGACCTGATCGAAGGCCGCCATATGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTGCGGATCCGCTAGCCTCCAAAACCATCGTTCTTTCGGTCGGCGAGGCTACTCGCACTCTGACTGAGATCCAGTCCACCGCAGACCGTCAGATCTTCGAAGAGAAGGTCGGGCCTCTGGTGGGTCGGCTGCGCCTCACGGCTTCGCTCCGTCAAAACGGAGCCAAGACCGCGTATCGCGTCAACCTAAAACTGGATCAGGCGGACGTCGTTGATTCCGGACTTCCGAAAGTGCGCTACACTCAGGTATGGTCGCACGACGTGACAATCGTTGCGAATAGCACCGAGGCCTCGCGCAAATCGTTGTACGATTTGACCAAGTCCCTCGTCGCGACCTCGCAGGTCGAAGATCTTGTCGTCAACCTTGTGCCGCTGGGCCGTCCACCGGTCGCCACCTAA(SEQ ID NO:673)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGGMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGEQKLISEEDLIEGRHMLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRADPLASKTIVLSVGEATRTLTEIQSTADRQIFEEKVGPLVGRLRLTASLRQNGAKTAYRVNLKLDQADVVDSGLPKVRYTQVWSHDVTIVANSTEASRKSLYDLTKSLVATSQVEDLVVNLVPLGRPPVAT*(SEQ ID NO:674)
KIF16B::2xLAF-1::PylRS(AF)
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGCGGAATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGG
ATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGAGCGTGCCCGGTGCCGCTGCAGCTGCCGCCGCTGGAACGCCTGACCCTGGATGATAAAAAACCGCTGAATACCCTGATCTCTGCTACTGGTCTGTGGATGAGTCGTACCGGAACCATTCATAAAATCAAACACCACGAGGTTAGCCGTTCGAAAATCTATATTGAGATGGCGTGTGGCGATCATCTGGTTGTGAACAATAGCCGCTCTTCTCGTACAGCACGTGCACTGCGTCACCACAAATATCGTAAAACCTGTAAACGTTGCCGTGTGTCCGATGAGGATCTGAACAAATTCCTGACAAAAGCCAATGAGGACCAAACAAGCGTGAAAGTGAAAGTCGTTAGCGCTCCTACCCGTACTAAAAAAGCAATGCCGAAATCCGTTGCTCGTGCCCCTAAACCACTGGAAAACACTGAAGCAGCACAGGCACAGCCGTCTGGAAGCAAATTCTCTCCGGCCATTCCTGTTTCTACCCAGGAGTCCGTTTCTGTTCCAGCAAGTGTGAGCACCAGCATTAGCAGTATTAGCACCGGTGCCACCGCTAGCGCCCTGGTTAAAGGCAATACCAATCCGATTACAAGCATGTCTGCCCCGGTTCAAGCATCAGCTCCAGCACTGACAAAATCCCAAACCGATCGTCTGGAGGTTCTGCTGAATCCGAAAGACGAAATCAGCCTGAATTCCGGCAAACCGTTTCGTGAACTGGAGAGCGAACTGCTGTCACGTCGTAAAAAAGACCTGCAACAAATCTATGCCGAAGAACGTGAGAACTATCTGGGGAAACTGGAACGTGAAATCACCCGCTTTTTCGTGGATCGTGGCTTTCTGGAGATCAAATCCCCGATTCTGATTCCTCTGGAGTATATCGAGCGTATGGGCATCGACAATGATACCGAACTGAGCAAACAAATTTTCCGTGTGGATAAAAACTTCTGTCTGCGCCCTATGCTAGCACCAAATCTGGCTAACTATCTGCGCAAACTGGACCGTGCCCTGCCTGATCCTATCAAAATCTTCGAGATCGGCCCGTGTTATCGTAAAGAGTCCGACGGTAAAGAACATCTGGAGGAGTTTACCATGCTGAACTTTTGCCAAATGGGTTCAGGTTGTACTCGTGAGAACCTGGAAAGCATCATCACCGATTTTCTGAACCACCTGGGCATTGACTTCAAAATTGTGGGCGACAGCTGTATGGTGTTTGGCGACACCCTGGATGTCATGCACGGCGACCTGGAACTGTCTAGTGCCGTTGTGGGCCCAATCCCGCTGGATCGTGAGTGGGGTATCGACAAACCTTGGATCGGTGCGGGTTTTGGTCTGGAGCGTCTGCTGAAAGTAAAACACGACTTCAAGAACATCAAACGTGCTGCACGTTCCGAGTCCTATTACAATGGTATTTCTACTAACCTGTAA(SEQ ID NO:675)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGGMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGACPVPLQLPPLERLTLDDKKPLNTLISATGLWMSRTGTIHKIKHHEVSRSKIYIEMACGDHLVVNNSRSSRTARALRHHKYRKTCKRCRVSDEDLNKFLTKANEDQTSVKVKVVSAPTRTKKAMPKSVARAPKPLENTEAAQAQPSGSKFSPAIPVSTQESVSVPASVSTSISSISTGATASALVKGNTNPITSMSAPVQASAPALTKSQTDRLEVLLNPKDEISLNSGKPFRELESELLSRRKKDLQQIYAEERENYLGKLEREITRFFVDRGFLEIKSPILIPLEYIERMGIDNDTELSKQIFRVDKNFCLRPMLAPNLANYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMLNFCQMGSGCTRENLESIITDFLNHLGIDFKIVGDSCMVFGDTLDVMHGDLELSSAVVGPIPLDREWGIDKPWIGAGFGLERLLKVKHDFKNIKRAARSESYYNGISTNL*(SEQ ID NO:676)
KIF16B::2xLAF-1::MCP
DNA:
ATGGCATCGGTCAAGGTGGCCGTGAGGGTCCGGCCCATGAATCGCAGGGAAAAGGACTTGGAGGCCAAGTTCATTATTCAGATGGAGAAAAGCAAAACGACAATCACAAACTTAAAGATACCAGAAGGAGGCACTGGGGACTCAGGAAGAGAACGGACCAAGACCTTCACCTATGACTTTTCTTTTTATTCTGCTGATACAAAAAGCCCAGATTACGTTTCACAAGAAATGGTTTTCAAAACCCTCGGCACAGATGTCGTGAAGTCTGCATTTGAAGGTTATAATGCTTGTGTCTTTGCATATGGGCAAACTGGATCTGGAAAGTCATACACTATGATGGGAAATTCTGGAGATTCTGGCTTAATACCTCGGATCTGTGAAGGACTCTTCAGTCGGATAAATGAAACCACCAGATGGGATGAAGCTTCTTTTCGAACTGAAGTCAGCTACTTAGAAATTTATAACGAACGTGTGAGAGATCTACTTCGGCGGAAGTCATCTAAAACCTTCAATTTGAGAGTCCGTGAGCATCCCAAAGAAGGCCCTTATGTTGAGGATTTATCCAAACATTTAGTACAGAATTATGGTGACGTAGAAGAACTTATGGATGCGGGCAATATCAACCGGACCACCGCAGCGACTGGGATGAACGACGTCAGTAGCAGGTCTCATGCCATCTTCACCATCAAGTTCACTCAGGCTAAATTTGATTCTGAAATGCCATGTGAAACCGTCAGTAAGATCCACTTGGTTGATCTTGCCGGAAGTGAGCGTGCAGATGCCACCGGAGCCACCGGGGTTAGGCTAAAGGAAGGGGGAAATATTAACAAGTCCCTCGTGACTCTGGGGAACGTCATTTCTGCCTTAGCTGATTTATCTCAGGATGCTGCAAATACTCTTGCAAAGAAGAAGCAAGTTTTCGTGCCTTACAGGGATTCTGTGTTGACTTGGTTGTTAAAAGATAGCCTTGGAGGAAACTCTAAAACTATCATGATTGCCACCATTTCACCTGCTGATGTCAATTATGGAGAAACCCTAAGTACTCTTCGCTATGCAAATAGAGCCAAAAACATCATCAACAAGCCTACCATTAATGAGGATGCCAACGTCAAACTTATCCGTGAGCTGCGAGCTGAAATAGCCAGACTGAAAACGCTGCTTGCTCAAGGGAATCAGATTGCCCTCTTAGACTCCCCCACAGACTACAAGGACGACGATGATAAGATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGGATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGCGGAATGGAAAGCAACCAGAGCAACAACGGCGGCTCTGGCAACGCCGCTCTGAACAGAGGCGGCAGATACGTGCCCCCCCACCTGAGAGGAGGCGACGGCGGCGCCGCCGCCGCTGCATCTGCCGGCGGAGATGACAGAAGAGGCGGAGCCGGAGGCGGCGGCTATAGACGGGGAGGCGGAAACAGCGGCGGCGGAGGCGGAGGCGGCTACGACAGAGGCTACAACGACAACCGGGACGACCGGGACAACAGAGGCGGCAGCGGCGG
ATACGGCAGAGATCGAAACTACGAGGACAGAGGCTACAATGGCGGAGGCGGAGGCGGCGGCAACCGGGGCTACAACAACAACAGAGGAGGCGGCGGCGGCGGCTACAACCGCCAGGACAGAGGCGATGGCGGATCTAGCAATTTCAGCAGAGGCGGCTACAACAACCGGGACGAGGGCAGCGACAACAGAGGCAGCGGAAGAAGCTACAACAATGACCGGAGAGATAATGGCGGAGATGGCTCCGGATATCCCTATGATGTGCCGGATTATGCTGGAGCACCAGGAAGTGCTGGTTCTGCTGCTGGTAGTGGAGCTTCTAACTTTACTCAGTTCGTTCTCGTCGACAATGGCGGAACTGGCGACGTGACTGTCGCCCCAAGCAACTTCGCTAACGGGATCGCTGAATGGATCAGCTCTAACTCGCGTTCACAGGCTTACAAAGTAACCTGTAGCGTTCGTCAGAGCTCTGCGCAGAATCGCAAATACACCATCAAAGTCGAGGTGCCTAAAGGCGCCTGGCGTTCGTACTTAAATATGGAACTAACCATTCCAATTTTCGCCACGAATTCCGACTGCGAGCTTATTGTTAAGGCAATGCAAGGTCTCCTAAAAGATGGAAACCCGATTCCCTCAGCAATCGCAGCAAACTCCGGCATCTACTAA(SEQ ID NO:677)
Protein:
MASVKVAVRVRPMNRREKDLEAKFIIQMEKSKTTITNLKIPEGGTGDSGRERTKTFTYDFSFYSADTKSPDYVSQEMVFKTLGTDVVKSAFEGYNACVFAYGQTGSGKSYTMMGNSGDSGLIPRICEGLFSRINETTRWDEASFRTEVSYLEIYNERVRDLLRRKSSKTFNLRVREHPKEGPYVEDLSKHLVQNYGDVEELMDAGNINRTTAATGMNDVSSRSHAIFTIKFTQAKFDSEMPCETVSKIHLVDLAGSERADATGATGVRLKEGGNINKSLVTLGNVISALADLSQDAANTLAKKKQVFVPYRDSVLTWLLKDSLGGNSKTIMIATISPADVNYGETLSTLRYANRAKNIINKPTINEDANVKLIRELRAEIARLKTLLAQGNQIALLDSPTDYKDDDDKMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGGMESNQSNNGGSGNAALNRGGRYVPPHLRGGDGGAAAAASAGGDDRRGGAGGGGYRRGGGNSGGGGGGGYDRGYNDNRDDRDNRGGSGGYGRDRNYEDRGYNGGGGGGGNRGYNNNRGGGGGGYNRQDRGDGGSSNFSRGGYNNRDEGSDNRGSGRSYNNDRRDNGGDGSGYPYDVPDYAGAPGSAGSAAGSGASNFTQFVLVDNGGTGDVTVAPSNFANGIAEWISSNSRSQAYKVTCSVRQSSAQNRKYTIKVEVPKGAWRSYLNMELTIPIFATNSDCELIVKAMQGLLKDGNPIPSAIAANSGIY*(SEQ ID NO:678)
3. Epitope tags:
VSV-G: vesicular stomatitis virus glycoprotein epitope tag
DNA:
TATACAGATATTGAAATGAACAGATTGGGAAAG(SEQ ID NO:679)
蛋白:
YTDIEMNRLGK(SEQ ID NO:680)
HA:人流感病毒血凝素表位标签
DNA:
TACCCCTACGACGTGCCCGACTACGCC(SEQ ID NO:681)
蛋白:
YPYDVPDYA(SEQ ID NO:682)
Myc:人c-Myc原癌基因表位标签
DNA:
GAGCAGAAGCTGATCTCAGAGGAGGACCTG(SEQ ID NO:683)
蛋白:
EQKLISEEDL(SEQ ID NO:684)
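The correspondence between each epitope tag's DNA and protein sequence can be checked by translating the DNA with the standard genetic code. The following is a minimal sketch, not part of the patent: the names `CODON_TABLE`, `translate` and `EPITOPE_TAGS` are illustrative, and the sequences are copied from SEQ ID NO:679 to 684 above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids.

    Stop codons are rendered as '*', matching the notation used in the
    sequence listing.
    """
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# DNA / protein pairs as listed for the three epitope tags.
EPITOPE_TAGS = {
    "VSV-G": ("TATACAGATATTGAAATGAACAGATTGGGAAAG", "YTDIEMNRLGK"),
    "HA": ("TACCCCTACGACGTGCCCGACTACGCC", "YPYDVPDYA"),
    "Myc": ("GAGCAGAAGCTGATCTCAGAGGAGGACCTG", "EQKLISEEDL"),
}

for name, (dna, protein) in EPITOPE_TAGS.items():
    # Report whether the listed DNA encodes the listed peptide.
    print(name, translate(dna) == protein)
```

The same `translate` helper could be applied to longer coding sequences in the listing, provided the full sequence is supplied as one in-frame string.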
Claims (15)
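1. An assembler fusion protein (AFP) comprising:
(a) at least one first polypeptide segment serving as an assembler (AP), selected from:
(a1) a polypeptide segment derived from an intracellular targeting polypeptide (IC-TP segment), wherein the intracellular targeting polypeptide targets an intracellular structural element and thereby becomes locally enriched at the intracellular structural element, the intracellular structural element being located within, or directly adjacent to, the cytoplasm; and
(a2) a polypeptide segment derived from a phase separation polypeptide (PSP segment), wherein the phase separation polypeptide has the ability to self-associate in the cytoplasm of a cell so as to generate sites of high local concentration in the cytoplasm, and
(b) at least one second polypeptide segment serving as an effector (EP), selected from:
(b1) an RNA-targeting polypeptide (RNA-TP) segment, and
(b2) an orthogonal aminoacyl tRNA synthetase (O-RS) segment;
wherein said polypeptide segments are functionally linked in the AFP.
2. An assembler fusion protein (AFP) combination comprising at least two AFPs of claim 1.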
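3. A fusion protein (RNA-TP/O-RS fusion protein) comprising:
(i) at least one RNA-targeting polypeptide (RNA-TP) segment; and
(ii) at least one orthogonal aminoacyl tRNA synthetase (O-RS) segment,
wherein said polypeptide segments are functionally linked in the RNA-TP/O-RS fusion protein.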
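4. A nucleic acid molecule, or a combination of two or more nucleic acid molecules, comprising:
(i) a nucleotide sequence encoding at least one AFP of claim 1 or at least one AFP combination of claim 2, or
(ii) a nucleic acid sequence complementary to the nucleotide sequence of (i), or
(iii) both (i) and (ii).
5. A nucleic acid molecule, or a combination of two or more nucleic acid molecules, comprising:
(i) a nucleotide sequence encoding at least one RNA-TP/O-RS fusion protein of claim 3, or
(ii) a nucleic acid sequence complementary to (i), or
(iii) both (i) and (ii).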
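6. An expression cassette comprising the nucleotide sequence of the nucleic acid molecule or combination of nucleic acid molecules of claim 4 or claim 5.
7. An expression vector comprising at least one expression cassette of claim 6.
8. A cell comprising at least one nucleic acid molecule or combination of nucleic acid molecules of claim 4 or claim 5, at least one expression cassette of claim 6, or at least one expression vector of claim 7.
9. The cell of claim 8, comprising a nucleotide sequence that encodes at least one AFP of claim 1 or is complementary to a nucleotide sequence encoding at least one AFP of claim 1,
said AFP comprising (i) at least one EP selected from RNA-TP segments, and (ii) at least one EP selected from O-RS segments.
10. The cell of claim 8, comprising a nucleotide sequence that encodes a combination of at least two AFPs of claim 1 or is complementary to a nucleotide sequence encoding a combination of at least two AFPs of claim 1,
wherein one of said at least two AFPs comprises at least one RNA-TP segment, and another of said at least two AFPs comprises at least one O-RS segment.
11. The cell of claim 8, comprising a nucleotide sequence that encodes at least one RNA-TP/O-RS fusion protein of claim 3 or is complementary to a nucleotide sequence encoding at least one RNA-TP/O-RS fusion protein of claim 3.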
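12. A method for preparing a polypeptide of interest (POI) comprising one or more non-canonical amino acid (ncAA) residues in its amino acid sequence, wherein the method comprises expressing the POI in the cell of claim 9 or claim 10 in the presence of the one or more ncAAs, wherein the cell comprises:
(i) a nucleotide sequence encoding the POI (CSPOI), wherein the one or more ncAA residues of the POI are encoded by selector codons,
(ii) a targeting nucleotide sequence (TN) that is functionally linked to the CSPOI and is capable of interacting with the RNA-TP segment of at least one of the AFPs in the cell;
(iii) one or more orthogonal tRNAncAA (O-tRNAncAA) molecules carrying an anticodon complementary to the selector codon of the CSPOI, wherein the O-tRNAncAA molecules, together with one or more O-RS segments of at least one of the AFPs in the cell, form one or more orthogonal O-RS/O-tRNAncAA pairs that allow the one or more ncAA residues to be introduced into the amino acid sequence of the POI;
and wherein the method optionally further comprises recovering the expressed POI.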
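13. A method for preparing a polypeptide of interest (POI) comprising one or more non-canonical amino acid (ncAA) residues in its amino acid sequence, wherein the method comprises expressing the POI in the cell of claim 11 in the presence of the one or more ncAAs, wherein the cell comprises:
(iv) a nucleotide sequence encoding the POI (CSPOI), wherein the one or more ncAA residues of the POI are encoded by selector codons,
(v) a targeting nucleotide sequence (TN) that is functionally linked to the CSPOI and is capable of interacting with the RNA-TP segment of at least one of the RNA-TP/O-RS fusion proteins in the cell;
(vi) one or more orthogonal tRNAncAA (O-tRNAncAA) molecules carrying an anticodon complementary to the selector codon of the CSPOI, wherein the O-tRNAncAA molecules, together with one or more O-RS segments of the RNA-TP/O-RS fusion protein in the cell, form one or more orthogonal O-RS/O-tRNAncAA pairs that allow the one or more ncAA residues to be introduced into the amino acid sequence of the POI;
and wherein the method optionally further comprises recovering the expressed POI.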
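14. A nucleic acid molecule comprising:
(i) a nucleotide sequence (CSPOI) encoding a polypeptide of interest (POI), the POI comprising one or more non-canonical amino acid (ncAA) residues, the ncAA residues being encoded in the CSPOI by selector codons, and
(ii) a targeting nucleotide sequence (TN), wherein an RNA molecule comprising the TN is capable of interacting, via the TN, with an RNA-targeting polypeptide (RNA-TP).
15. A kit for preparing a polypeptide of interest (POI) having at least one non-canonical amino acid (ncAA) residue, the kit comprising:
- at least one ncAA, or a salt thereof, corresponding to the at least one ncAA residue of the POI; and
- at least one expression vector of claim 7.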
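The method claims rely on two sequence-level relationships: the coding sequence of the POI contains selector codons at the ncAA positions, and the orthogonal tRNA carries an anticodon complementary to that selector codon. The sketch below illustrates both under stated assumptions. The claims do not name a particular selector codon, so the amber stop codon (TAG in DNA, UAG in RNA) is used here purely as an assumed example, and the toy coding sequence and the function names `anticodon_for` and `selector_positions` are hypothetical.

```python
from __future__ import annotations

# Watson-Crick pairing for RNA bases.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def anticodon_for(codon: str) -> str:
    """Return the anticodon (written 5'->3') that pairs with an mRNA codon
    (written 5'->3'), i.e. the reverse complement in RNA letters."""
    codon = codon.upper().replace("T", "U")
    return "".join(COMPLEMENT[base] for base in reversed(codon))


def selector_positions(cds: str, selector: str = "TAG") -> list[int]:
    """Return the 1-based residue positions of in-frame selector codons.

    The final codon is skipped because it is assumed to be the natural
    terminator of the coding sequence rather than an ncAA site.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    return [n for n, codon in enumerate(codons[:-1], start=1) if codon == selector]


# Hypothetical toy coding sequence: ATG GCA TAG AAA TAA
toy_cds = "ATGGCATAGAAATAA"
print(selector_positions(toy_cds))  # in-frame amber codon positions
print(anticodon_for("UAG"))         # anticodon matching the amber codon
```

In this toy case the amber codon sits at residue position 3 and the matching anticodon is CUA. Which selector codon and which O-RS/O-tRNA pair are actually used is defined by the patent description, not by this sketch.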
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP19157257.7 | 2019-02-14 | ||
EP19157257.7A EP3696189A1 (en) | 2019-02-14 | 2019-02-14 | Means and methods for preparing engineered target proteins by genetic code expansion in a target protein selective manner |
PCT/EP2020/053883 WO2020165408A1 (en) | 2019-02-14 | 2020-02-14 | Means and methods for preparing engineered target proteins by genetic code expansion in a target protein-selective manner |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113727993A (zh) | 2021-11-30 |
Family
ID=65685108
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080028507.1A Pending CN113727993A (zh) | Means and methods for preparing engineered target proteins by genetic code expansion in a target protein-selective manner | 2019-02-14 | 2020-02-14 |
Country Status (8)
Country | Link |
---|---|
US (1) | US20230098002A1 (zh) |
EP (2) | EP3696189A1 (zh) |
JP (1) | JP2022521049A (zh) |
CN (1) | CN113727993A (zh) |
CA (1) | CA3129336A1 (zh) |
IL (1) | IL285405A (zh) |
MA (1) | MA54934A (zh) |
WO (1) | WO2020165408A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
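CN116804187A (zh) * | 2023-07-24 | 2023-09-26 | Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences | Replication-defective foot-and-mouth disease virus, construction method and application |
CN118271417A (zh) * | 2024-04-08 | 2024-07-02 | Peking University Sixth Hospital | Use of the Zkscan4 1-133 peptide segment in antidepressant treatment |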
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111304234A (zh) * | 2020-02-27 | 2020-06-19 | Jiangnan University | Unnatural amino acid utilization tool suitable for Bacillus subtilis |
EP4186529A1 (en) | 2021-11-25 | 2023-05-31 | Veraxa Biotech GmbH | Improved antibody-payload conjugates (apcs) prepared by site-specific conjugation utilizing genetic code expansion |
WO2023094525A1 (en) | 2021-11-25 | 2023-06-01 | Veraxa Biotech Gmbh | Improved antibody-payload conjugates (apcs) prepared by site-specific conjugation utilizing genetic code expansion |
CN115896144B (zh) * | 2022-10-17 | 2024-01-02 | 湖南诺合新生物科技有限公司 | Use of FUS protein as a fusion tag, recombinant protein and expression method thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2008133028A (ru) * | 2006-03-09 | 2010-04-20 | The Scripps Research Institute (US) | System for the expression of orthogonal translation components in a eubacterial host cell |
WO2016066995A1 (en) * | 2014-10-27 | 2016-05-06 | Medical Research Council | Incorporation of unnatural amino acids into proteins |
CN108368499A (zh) * | 2015-11-30 | 2018-08-03 | European Molecular Biology Laboratory | Means and methods for preparing engineered proteins by genetic code expansion in insect cells |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6412611B1 (en) | 2000-07-17 | 2002-07-02 | Magnetar Technologies, Ltd | Eddy current brake system with dual use conductor fin |
IL158418A0 (en) | 2001-04-19 | 2004-05-12 | Scripps Research Inst | In vivo incorporation of unnatural amino acids |
JP5590649B2 (ja) | 2007-09-20 | 2014-09-17 | RIKEN | Mutant pyrrolysyl-tRNA synthetase and method for producing a protein incorporating an unnatural amino acid using the same |
EP2619305A1 (en) * | 2010-09-24 | 2013-07-31 | Medical Research Council | Methods for incorporating unnatural amino acids in eukaryotic cells |
WO2012103496A2 (en) * | 2011-01-28 | 2012-08-02 | Medimmune, Llc | Expression of soluble viral fusion glycoproteins in mammalian cells |
EP3279211B1 (en) | 2011-02-03 | 2020-04-29 | Embl | Unnatural amino acids comprising a norbornenyl group and uses thereof |
WO2015006294A2 (en) * | 2013-07-10 | 2015-01-15 | President And Fellows Of Harvard College | Orthogonal cas9 proteins for rna-guided gene regulation and editing |
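JP6479022B2 (ja) | 2014-01-14 | 2019-03-06 | European Molecular Biology Laboratory | Multiple cycloaddition reactions for labeling of molecules |
WO2017160118A2 (ko) * | 2016-03-18 | 2017-09-21 | Industry-Academic Cooperation Foundation, Yonsei University | Novel peptide for enhancing the expression efficiency of a target protein, and fusion protein comprising the same |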
US11371059B2 (en) * | 2016-05-13 | 2022-06-28 | Flash Therapeutics | Viral particle for the transfer of RNAs, especially into cells involved in immune response |
EP3309260A1 (en) | 2016-10-14 | 2018-04-18 | European Molecular Biology Laboratory | Archaeal pyrrolysyl trna synthetases for orthogonal use |
2019
- 2019-02-14 EP EP19157257.7A patent/EP3696189A1/en active Pending
2020
- 2020-02-14 EP EP20703782.1A patent/EP3924365A1/en active Pending
- 2020-02-14 JP JP2021545719A patent/JP2022521049A/ja active Pending
- 2020-02-14 WO PCT/EP2020/053883 patent/WO2020165408A1/en unknown
- 2020-02-14 MA MA054934A patent/MA54934A/fr unknown
- 2020-02-14 CN CN202080028507.1A patent/CN113727993A/zh active Pending
- 2020-02-14 US US17/426,338 patent/US20230098002A1/en active Pending
- 2020-02-14 CA CA3129336A patent/CA3129336A1/en active Pending
2021
- 2021-08-05 IL IL285405A patent/IL285405A/en unknown
Non-Patent Citations (3)
Title |
---|
CHRISTOPHER D. REINKEMEIER: "Designer membraneless organelles enable codon reassignment of selected mRNAs in eukaryotes", 《SCIENCE. AUTHOR MANUSCRIPT》, vol. 363, 29 September 2021 (2021-09-29), pages 1 - 24 * |
JASON W. CHIN: "Expanding and Reprogramming the Genetic Code of Cells and Animals", 《ANNU REV BIOCHEM》, vol. 83, 10 February 2014 (2014-02-10), pages 379 - 408, XP002753743, DOI: 10.1146/annurev-biochem-060713-035737 * |
WANG ZHIPENG (王志鹏): "Preparation of proteins containing novel lysine post-translational modifications by unnatural amino acid incorporation", 《化学教育》, vol. 39, no. 18, 31 December 2018 (2018-12-31), pages 9 - 13 * |
Also Published As
Publication number | Publication date |
---|---|
JP2022521049A (ja) | 2022-04-05 |
WO2020165408A1 (en) | 2020-08-20 |
US20230098002A1 (en) | 2023-03-30 |
IL285405A (en) | 2021-09-30 |
MA54934A (fr) | 2021-12-22 |
EP3696189A1 (en) | 2020-08-19 |
CA3129336A1 (en) | 2020-08-20 |
EP3924365A1 (en) | 2021-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113727993A (zh) | Means and methods for preparing engineered target proteins by genetic code expansion in a target protein-selective manner | |
JP4188909B2 (ja) | Cytoplasmic transduction peptides and uses thereof | |
JP7277361B2 (ja) | Archaeal pyrrolysyl tRNA synthetases for orthogonal use | |
Howarth et al. | Imaging proteins in live mammalian cells with biotin ligase and monovalent streptavidin | |
JP2023514384A (ja) | Archaeal pyrrolysyl tRNA synthetases for orthogonal use | |
WO2009066964A1 (en) | Method for screening an inhibitory agent of hbv proliferation by using the interaction between hbv capsid and surface proteins based on cellular imaging | |
AU2003244303A1 (en) | Methods for transducing fusion molecules | |
JP2011211983A (ja) | Protein molecule pair, gene encoding the protein molecule pair, gene transfer vector, and cell producing the protein molecule pair | |
EP3334755B1 (en) | Improved cell-permeable reprogramming factor (icp-rf) recombinant protein and use thereof | |
JPWO2007132555A1 (ja) | Cell membrane-permeable peptide and its use in cells | |
KR102351041B1 (ko) | Cell membrane-penetrating domain derived from human LRRC24 protein | |
JP6304683B2 (ja) | Agent for intracellular delivery of a target protein and method for intracellular delivery of a target protein | |
Toby et al. | Vectors to target protein domains to different cellular compartments | |
TWI515203B (zh) | Nuclear localization signal peptides derived from chicken anemia virus (CAV) VP2 protein and their uses | |
KR20200076603A (ko) | Cell membrane-penetrating domain derived from human CLK2 protein | |
KR101505697B1 (ko) | Membrane protein expression vector comprising envelope protein P9 of cystovirus phi12 as a fusion partner, and method for producing a membrane protein using the same | |
Rothe et al. | Expression and purification of ZEBRA fusion proteins and applications for the delivery of macromolecules into mammalian cells | |
WO2020096020A1 (ja) | Liposome-binding peptide, construct for producing a liposome-binding peptide, and liposome | |
KR20200076604A (ko) | Cell membrane-penetrating domain derived from human GPATCH4 protein | |
Moncivais | Novel tools for the study of protein-protein interactions in pluripotent cells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||