KR20230134488A - Rna 구조체 - Google Patents
Rna 구조체 Download PDFInfo
- Publication number
- KR20230134488A KR20230134488A KR1020237024303A KR20237024303A KR20230134488A KR 20230134488 A KR20230134488 A KR 20230134488A KR 1020237024303 A KR1020237024303 A KR 1020237024303A KR 20237024303 A KR20237024303 A KR 20237024303A KR 20230134488 A KR20230134488 A KR 20230134488A
- Authority
- KR
- South Korea
- Prior art keywords
- seq
- rna
- fragment
- variant
- accordingly
- Prior art date
Links
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 692
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 121
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 47
- 108020004999 messenger RNA Proteins 0.000 claims abstract description 40
- 238000000034 method Methods 0.000 claims abstract description 33
- 239000003814 drug Substances 0.000 claims abstract description 13
- 229960005486 vaccine Drugs 0.000 claims abstract description 12
- 108091029810 SaRNA Proteins 0.000 claims abstract 3
- 229940078677 sarna Drugs 0.000 claims abstract 3
- 108090000623 proteins and genes Proteins 0.000 claims description 212
- 230000014509 gene expression Effects 0.000 claims description 202
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 194
- 102000004169 proteins and genes Human genes 0.000 claims description 174
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 106
- 239000013598 vector Substances 0.000 claims description 87
- 230000001225 therapeutic effect Effects 0.000 claims description 65
- 102000034356 gene-regulatory proteins Human genes 0.000 claims description 54
- 108091006104 gene-regulatory proteins Proteins 0.000 claims description 54
- 239000000427 antigen Substances 0.000 claims description 49
- 108091007433 antigens Proteins 0.000 claims description 49
- 102000036639 antigens Human genes 0.000 claims description 49
- 102100036981 Interferon regulatory factor 1 Human genes 0.000 claims description 46
- 101000598002 Homo sapiens Interferon regulatory factor 1 Proteins 0.000 claims description 44
- 102100038251 Interferon regulatory factor 9 Human genes 0.000 claims description 43
- 108010050904 Interferons Proteins 0.000 claims description 43
- 102000014150 Interferons Human genes 0.000 claims description 43
- 101001032341 Homo sapiens Interferon regulatory factor 9 Proteins 0.000 claims description 42
- 230000003612 virological effect Effects 0.000 claims description 42
- 229940079322 interferon Drugs 0.000 claims description 40
- 101710196510 BRCA1-associated protein Proteins 0.000 claims description 39
- 102100027106 BRCA1-associated protein Human genes 0.000 claims description 39
- 101710144867 Inositol monophosphatase Proteins 0.000 claims description 39
- 235000013902 inosinic acid Nutrition 0.000 claims description 39
- 102000043138 IRF family Human genes 0.000 claims description 34
- 108091054729 IRF family Proteins 0.000 claims description 34
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 31
- 102100029843 Interferon regulatory factor 3 Human genes 0.000 claims description 29
- 101001011382 Homo sapiens Interferon regulatory factor 3 Proteins 0.000 claims description 24
- 206010028980 Neoplasm Diseases 0.000 claims description 24
- 101001011441 Homo sapiens Interferon regulatory factor 4 Proteins 0.000 claims description 22
- 102100033703 Mitofusin-2 Human genes 0.000 claims description 22
- 101001011442 Homo sapiens Interferon regulatory factor 5 Proteins 0.000 claims description 21
- 102100038070 Interferon regulatory factor 7 Human genes 0.000 claims description 21
- 241000710959 Venezuelan equine encephalitis virus Species 0.000 claims description 20
- 241000700605 Viruses Species 0.000 claims description 20
- 101001032342 Homo sapiens Interferon regulatory factor 7 Proteins 0.000 claims description 19
- 102100030131 Interferon regulatory factor 5 Human genes 0.000 claims description 19
- 102100030126 Interferon regulatory factor 4 Human genes 0.000 claims description 18
- 102000004265 STAT2 Transcription Factor Human genes 0.000 claims description 17
- 108010081691 STAT2 Transcription Factor Proteins 0.000 claims description 17
- 230000001580 bacterial effect Effects 0.000 claims description 17
- 230000000694 effects Effects 0.000 claims description 17
- 230000011664 signaling Effects 0.000 claims description 16
- 101001018717 Homo sapiens Mitofusin-2 Proteins 0.000 claims description 15
- 102100029408 Interferon-inducible double-stranded RNA-dependent protein kinase activator A Human genes 0.000 claims description 15
- 238000004519 manufacturing process Methods 0.000 claims description 15
- 241000710929 Alphavirus Species 0.000 claims description 14
- -1 DDX-56 Proteins 0.000 claims description 14
- 101001125123 Homo sapiens Interferon-inducible double-stranded RNA-dependent protein kinase activator A Proteins 0.000 claims description 14
- 102100026965 RISC-loading complex subunit TARBP2 Human genes 0.000 claims description 13
- 201000011510 cancer Diseases 0.000 claims description 13
- 230000002538 fungal effect Effects 0.000 claims description 13
- 230000037361 pathway Effects 0.000 claims description 13
- 102100038069 Interferon regulatory factor 8 Human genes 0.000 claims description 12
- 238000011282 treatment Methods 0.000 claims description 12
- 101001032345 Homo sapiens Interferon regulatory factor 8 Proteins 0.000 claims description 11
- 230000028993 immune response Effects 0.000 claims description 11
- 102100035389 2'-5'-oligoadenylate synthase 3 Human genes 0.000 claims description 10
- 102100034051 Heat shock protein HSP 90-alpha Human genes 0.000 claims description 10
- 101000597332 Homo sapiens 2'-5'-oligoadenylate synthase 3 Proteins 0.000 claims description 10
- 101001011393 Homo sapiens Interferon regulatory factor 2 Proteins 0.000 claims description 10
- 102100029838 Interferon regulatory factor 2 Human genes 0.000 claims description 10
- 208000036142 Viral infection Diseases 0.000 claims description 10
- 102100022912 ADP-ribosylation factor-like protein 16 Human genes 0.000 claims description 9
- 102100022870 ADP-ribosylation factor-like protein 5B Human genes 0.000 claims description 9
- 208000035143 Bacterial infection Diseases 0.000 claims description 9
- 206010017533 Fungal infection Diseases 0.000 claims description 9
- 101710113864 Heat shock protein 90 Proteins 0.000 claims description 9
- 101000974510 Homo sapiens ADP-ribosylation factor-like protein 16 Proteins 0.000 claims description 9
- 101000974439 Homo sapiens ADP-ribosylation factor-like protein 5B Proteins 0.000 claims description 9
- 101000644847 Homo sapiens Ubl carboxyl-terminal hydrolase 18 Proteins 0.000 claims description 9
- 208000031888 Mycoses Diseases 0.000 claims description 9
- 102100020726 Ubl carboxyl-terminal hydrolase 18 Human genes 0.000 claims description 9
- 208000022362 bacterial infectious disease Diseases 0.000 claims description 9
- 230000009385 viral infection Effects 0.000 claims description 9
- 108010086291 Deubiquitinating Enzyme CYLD Proteins 0.000 claims description 8
- 101000634987 Homo sapiens E3 ubiquitin-protein ligase TRIM35 Proteins 0.000 claims description 8
- 101000777670 Homo sapiens Hsp90 co-chaperone Cdc37 Proteins 0.000 claims description 8
- 101001011446 Homo sapiens Interferon regulatory factor 6 Proteins 0.000 claims description 8
- 101000864662 Homo sapiens Probable ATP-dependent RNA helicase DHX58 Proteins 0.000 claims description 8
- 101000607872 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 21 Proteins 0.000 claims description 8
- 101000807540 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 25 Proteins 0.000 claims description 8
- 101000939135 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 27 Proteins 0.000 claims description 8
- 102100031568 Hsp90 co-chaperone Cdc37 Human genes 0.000 claims description 8
- 102100023727 Mitochondrial antiviral-signaling protein Human genes 0.000 claims description 8
- 101710142315 Mitochondrial antiviral-signaling protein Proteins 0.000 claims description 8
- 102100030090 Probable ATP-dependent RNA helicase DHX58 Human genes 0.000 claims description 8
- 102100029736 Ubiquitin carboxyl-terminal hydrolase 27 Human genes 0.000 claims description 8
- 230000016396 cytokine production Effects 0.000 claims description 8
- 241000894006 Bacteria Species 0.000 claims description 7
- 241000710831 Flavivirus Species 0.000 claims description 7
- 101000914654 Homo sapiens FAS-associated factor 1 Proteins 0.000 claims description 7
- 241000709664 Picornaviridae Species 0.000 claims description 7
- 102100029904 Signal transducer and activator of transcription 1-alpha/beta Human genes 0.000 claims description 7
- 108700027336 Suppressor of Cytokine Signaling 1 Proteins 0.000 claims description 7
- 108700027337 Suppressor of Cytokine Signaling 3 Proteins 0.000 claims description 7
- 102100024779 Suppressor of cytokine signaling 1 Human genes 0.000 claims description 7
- 102100024250 Ubiquitin carboxyl-terminal hydrolase CYLD Human genes 0.000 claims description 7
- 238000002560 therapeutic procedure Methods 0.000 claims description 7
- 241000711573 Coronaviridae Species 0.000 claims description 6
- 102100029501 E3 ubiquitin-protein ligase TRIM35 Human genes 0.000 claims description 6
- 102100030130 Interferon regulatory factor 6 Human genes 0.000 claims description 6
- 241000710778 Pestivirus Species 0.000 claims description 6
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 claims description 6
- 102220356672 c.201_202delCGinsTC Human genes 0.000 claims description 6
- 230000002265 prevention Effects 0.000 claims description 6
- 239000011701 zinc Substances 0.000 claims description 6
- 229910052725 zinc Inorganic materials 0.000 claims description 6
- 102100027279 FAS-associated factor 1 Human genes 0.000 claims description 5
- 101000763328 Homo sapiens RISC-loading complex subunit TARBP2 Proteins 0.000 claims description 5
- 108010044012 STAT1 Transcription Factor Proteins 0.000 claims description 5
- 241001493065 dsRNA viruses Species 0.000 claims description 5
- 230000004936 stimulating effect Effects 0.000 claims description 5
- 230000000638 stimulation Effects 0.000 claims description 5
- 241000233866 Fungi Species 0.000 claims description 4
- 101000984746 Homo sapiens BRCA1-associated protein Proteins 0.000 claims description 4
- 101150045565 Socs1 gene Proteins 0.000 claims description 4
- 102000050209 human BRAP Human genes 0.000 claims description 4
- 241000711557 Hepacivirus Species 0.000 claims description 3
- 230000006872 improvement Effects 0.000 claims description 3
- 101150043341 Socs3 gene Proteins 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 239000010979 ruby Substances 0.000 claims description 2
- 229910001750 ruby Inorganic materials 0.000 claims description 2
- 101000650589 Mus musculus Roundabout homolog 3 Proteins 0.000 claims 1
- 206010037075 Protozoal infections Diseases 0.000 claims 1
- 102000058015 Suppressor of Cytokine Signaling 3 Human genes 0.000 claims 1
- 102100037179 Ubiquitin carboxyl-terminal hydrolase 25 Human genes 0.000 claims 1
- 102000039446 nucleic acids Human genes 0.000 abstract description 72
- 108020004707 nucleic acids Proteins 0.000 abstract description 72
- 201000010099 disease Diseases 0.000 abstract description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 5
- 239000013604 expression vector Substances 0.000 abstract description 2
- 239000012634 fragment Substances 0.000 description 281
- 239000002773 nucleotide Substances 0.000 description 272
- 125000003729 nucleotide group Chemical group 0.000 description 272
- 108020004705 Codon Proteins 0.000 description 227
- 108020004414 DNA Proteins 0.000 description 198
- 235000018102 proteins Nutrition 0.000 description 150
- 210000004027 cell Anatomy 0.000 description 135
- 241000282414 Homo sapiens Species 0.000 description 93
- 102000004196 processed proteins & peptides Human genes 0.000 description 77
- 229920001184 polypeptide Polymers 0.000 description 71
- 125000003275 alpha amino acid group Chemical group 0.000 description 61
- 210000002230 centromere Anatomy 0.000 description 48
- 238000005457 optimization Methods 0.000 description 47
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 41
- 108091027544 Subgenomic mRNA Proteins 0.000 description 34
- 239000003112 inhibitor Substances 0.000 description 32
- 230000014616 translation Effects 0.000 description 23
- 238000013519 translation Methods 0.000 description 23
- 101710172711 Structural protein Proteins 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 22
- 108060001084 Luciferase Proteins 0.000 description 18
- 239000005089 Luciferase Substances 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 18
- 229940024606 amino acid Drugs 0.000 description 18
- 150000001413 amino acids Chemical class 0.000 description 18
- 239000000203 mixture Substances 0.000 description 18
- 239000003981 vehicle Substances 0.000 description 18
- 230000027455 binding Effects 0.000 description 16
- 102100037435 Antiviral innate immune response receptor RIG-I Human genes 0.000 description 15
- 102100034170 Interferon-induced, double-stranded RNA-activated protein kinase Human genes 0.000 description 15
- 101710089751 Interferon-induced, double-stranded RNA-activated protein kinase Proteins 0.000 description 15
- 238000013518 transcription Methods 0.000 description 15
- 230000035897 transcription Effects 0.000 description 15
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 14
- 101710127675 Antiviral innate immune response receptor RIG-I Proteins 0.000 description 13
- 238000000338 in vitro Methods 0.000 description 13
- 230000019491 signal transduction Effects 0.000 description 13
- 238000001890 transfection Methods 0.000 description 13
- 102000004961 Furin Human genes 0.000 description 12
- 108090001126 Furin Proteins 0.000 description 12
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 description 12
- 239000007788 liquid Substances 0.000 description 12
- 230000000840 anti-viral effect Effects 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 230000015788 innate immune response Effects 0.000 description 11
- KZNQNBZMBZJQJO-UHFFFAOYSA-N 1-(2-azaniumylacetyl)pyrrolidine-2-carboxylate Chemical group NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 10
- 230000004568 DNA-binding Effects 0.000 description 10
- 102000009524 Vascular Endothelial Growth Factor A Human genes 0.000 description 10
- 101710138767 Non-structural glycoprotein 4 Proteins 0.000 description 9
- 102100031056 Serine protease 57 Human genes 0.000 description 9
- 101710197596 Serine protease 57 Proteins 0.000 description 9
- 201000001441 melanoma Diseases 0.000 description 9
- 108020005345 3' Untranslated Regions Proteins 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 230000004913 activation Effects 0.000 description 8
- 239000013543 active substance Substances 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 8
- 230000009977 dual effect Effects 0.000 description 8
- 229940088598 enzyme Drugs 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 108020003589 5' Untranslated Regions Proteins 0.000 description 7
- 208000026310 Breast neoplasm Diseases 0.000 description 7
- 206010009944 Colon cancer Diseases 0.000 description 7
- 241000282412 Homo Species 0.000 description 7
- 101001077300 Homo sapiens E3 ubiquitin-protein ligase RBBP6 Proteins 0.000 description 7
- 101000974349 Homo sapiens Nuclear receptor coactivator 6 Proteins 0.000 description 7
- 102100027353 Interferon-induced helicase C domain-containing protein 1 Human genes 0.000 description 7
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 7
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 7
- 101710144127 Non-structural protein 1 Proteins 0.000 description 7
- 101710144128 Non-structural protein 2 Proteins 0.000 description 7
- 102100022648 Reticulon-2 Human genes 0.000 description 7
- 102100031776 SH2 domain-containing protein 3A Human genes 0.000 description 7
- 102100039918 Ubiquitin carboxyl-terminal hydrolase 21 Human genes 0.000 description 7
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 7
- 230000006698 induction Effects 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 238000002255 vaccination Methods 0.000 description 7
- 206010006187 Breast cancer Diseases 0.000 description 6
- 241000710188 Encephalomyocarditis virus Species 0.000 description 6
- 241000709661 Enterovirus Species 0.000 description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- 101000623543 Homo sapiens 40S ribosomal protein S15 Proteins 0.000 description 6
- 101000951342 Homo sapiens Dickkopf-related protein 3 Proteins 0.000 description 6
- 101000951240 Homo sapiens GTP-binding protein Di-Ras1 Proteins 0.000 description 6
- 108060003951 Immunoglobulin Proteins 0.000 description 6
- 108010032038 Interferon Regulatory Factor-3 Proteins 0.000 description 6
- 102100024283 Suppressor of cytokine signaling 3 Human genes 0.000 description 6
- 102100028882 Zinc finger CCCH-type antiviral protein 1 Human genes 0.000 description 6
- 101710087130 Zinc finger CCCH-type antiviral protein 1 Proteins 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 230000005860 defense response to virus Effects 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 102000018358 immunoglobulin Human genes 0.000 description 6
- 230000002401 inhibitory effect Effects 0.000 description 6
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 210000001082 somatic cell Anatomy 0.000 description 6
- 239000000725 suspension Substances 0.000 description 6
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 5
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 5
- 102000004127 Cytokines Human genes 0.000 description 5
- 108090000695 Cytokines Proteins 0.000 description 5
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- 101710144111 Non-structural protein 3 Proteins 0.000 description 5
- 102100021798 SH2 domain-containing protein 3C Human genes 0.000 description 5
- 101710196623 Stimulator of interferon genes protein Proteins 0.000 description 5
- 241000223238 Trichophyton Species 0.000 description 5
- GFFGJBXGBJISGV-UHFFFAOYSA-N adenyl group Chemical group N1=CN=C2N=CNC2=C1N GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 5
- 239000002671 adjuvant Substances 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 230000000903 blocking effect Effects 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 239000000839 emulsion Substances 0.000 description 5
- 230000002519 immonomodulatory effect Effects 0.000 description 5
- 201000005202 lung cancer Diseases 0.000 description 5
- 208000020816 lung neoplasm Diseases 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 239000000843 powder Substances 0.000 description 5
- 208000028172 protozoa infectious disease Diseases 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 230000008672 reprogramming Effects 0.000 description 5
- 239000007787 solid Substances 0.000 description 5
- 210000000130 stem cell Anatomy 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- 241000228212 Aspergillus Species 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 4
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 4
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 4
- 101001082073 Homo sapiens Interferon-induced helicase C domain-containing protein 1 Proteins 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- 102100031413 L-dopachrome tautomerase Human genes 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- 241000351643 Metapneumovirus Species 0.000 description 4
- 241000712045 Morbillivirus Species 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 102000035195 Peptidases Human genes 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 229940088597 hormone Drugs 0.000 description 4
- 239000005556 hormone Substances 0.000 description 4
- 210000000987 immune system Anatomy 0.000 description 4
- 230000002163 immunogen Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- 150000002632 lipids Chemical class 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 210000004940 nucleus Anatomy 0.000 description 4
- 238000007911 parenteral administration Methods 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 125000006850 spacer group Chemical group 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 102100027769 2'-5'-oligoadenylate synthase 1 Human genes 0.000 description 3
- 102100027621 2'-5'-oligoadenylate synthase 2 Human genes 0.000 description 3
- 102100035473 2'-5'-oligoadenylate synthase-like protein Human genes 0.000 description 3
- LKKMLIBUAXYLOY-UHFFFAOYSA-N 3-Amino-1-methyl-5H-pyrido[4,3-b]indole Chemical compound N1C2=CC=CC=C2C2=C1C=C(N)N=C2C LKKMLIBUAXYLOY-UHFFFAOYSA-N 0.000 description 3
- WEVYNIUIFUYDGI-UHFFFAOYSA-N 3-[6-[4-(trifluoromethoxy)anilino]-4-pyrimidinyl]benzamide Chemical compound NC(=O)C1=CC=CC(C=2N=CN=C(NC=3C=CC(OC(F)(F)F)=CC=3)C=2)=C1 WEVYNIUIFUYDGI-UHFFFAOYSA-N 0.000 description 3
- 229930024421 Adenine Natural products 0.000 description 3
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 3
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 108090000331 Firefly luciferases Proteins 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 3
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 3
- 241000711549 Hepacivirus C Species 0.000 description 3
- 241000709721 Hepatovirus A Species 0.000 description 3
- 101001008907 Homo sapiens 2'-5'-oligoadenylate synthase 1 Proteins 0.000 description 3
- 101001008910 Homo sapiens 2'-5'-oligoadenylate synthase 2 Proteins 0.000 description 3
- 101000597360 Homo sapiens 2'-5'-oligoadenylate synthase-like protein Proteins 0.000 description 3
- 101000721661 Homo sapiens Cellular tumor antigen p53 Proteins 0.000 description 3
- 108010044240 IFIH1 Interferon-Induced Helicase Proteins 0.000 description 3
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 3
- 108010002586 Interleukin-7 Proteins 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 3
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 102100028389 Melanoma antigen recognized by T-cells 1 Human genes 0.000 description 3
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010057466 NF-kappa B Proteins 0.000 description 3
- 102000003945 NF-kappa B Human genes 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- 206010033128 Ovarian cancer Diseases 0.000 description 3
- 206010061535 Ovarian neoplasm Diseases 0.000 description 3
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 3
- 241000711902 Pneumovirus Species 0.000 description 3
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 3
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 3
- 241000725643 Respiratory syncytial virus Species 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- 208000001203 Smallpox Diseases 0.000 description 3
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 3
- 208000004006 Tick-borne encephalitis Diseases 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- 229960000643 adenine Drugs 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 210000000481 breast Anatomy 0.000 description 3
- 238000002659 cell therapy Methods 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 235000010980 cellulose Nutrition 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 108010054812 diprotin A Proteins 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 201000010536 head and neck cancer Diseases 0.000 description 3
- 208000014829 head and neck neoplasm Diseases 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000010468 interferon response Effects 0.000 description 3
- 229940047124 interferons Drugs 0.000 description 3
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 3
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- 201000002528 pancreatic cancer Diseases 0.000 description 3
- 208000008443 pancreatic carcinoma Diseases 0.000 description 3
- 230000007030 peptide scission Effects 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 239000003755 preservative agent Substances 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000008174 sterile solution Substances 0.000 description 3
- 239000000375 suspending agent Substances 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 230000029812 viral genome replication Effects 0.000 description 3
- 239000011534 wash buffer Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- RQFCJASXJCIDSX-UHFFFAOYSA-N 14C-Guanosin-5'-monophosphat Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(O)=O)C(O)C1O RQFCJASXJCIDSX-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 102100030310 5,6-dihydroxyindole-2-carboxylic acid oxidase Human genes 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 241000710189 Aphthovirus Species 0.000 description 2
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 2
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 2
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241001519465 Avian metapneumovirus Species 0.000 description 2
- 102100035526 B melanoma antigen 1 Human genes 0.000 description 2
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 2
- 241000193738 Bacillus anthracis Species 0.000 description 2
- 108060000903 Beta-catenin Proteins 0.000 description 2
- 102000015735 Beta-catenin Human genes 0.000 description 2
- 206010005003 Bladder cancer Diseases 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- 241000588832 Bordetella pertussis Species 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 241000714198 Caliciviridae Species 0.000 description 2
- 241000222122 Candida albicans Species 0.000 description 2
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 2
- 241000710190 Cardiovirus Species 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 102000019034 Chemokines Human genes 0.000 description 2
- 108010012236 Chemokines Proteins 0.000 description 2
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 2
- 208000001726 Classical Swine Fever Diseases 0.000 description 2
- 102000007644 Colony-Stimulating Factors Human genes 0.000 description 2
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 2
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 2
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 2
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 2
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 2
- 238000007702 DNA assembly Methods 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241001466953 Echovirus Species 0.000 description 2
- 241000991587 Enterovirus C Species 0.000 description 2
- 102000003951 Erythropoietin Human genes 0.000 description 2
- 108090000394 Erythropoietin Proteins 0.000 description 2
- 108010054218 Factor VIII Proteins 0.000 description 2
- 102000001690 Factor VIII Human genes 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 2
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 2
- JTWZNMUVQWWGOX-SOUVJXGZSA-N Gln-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JTWZNMUVQWWGOX-SOUVJXGZSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 2
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 102100041003 Glutamate carboxypeptidase 2 Human genes 0.000 description 2
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 108010051696 Growth Hormone Proteins 0.000 description 2
- 102100032510 Heat shock protein HSP 90-beta Human genes 0.000 description 2
- 102100021519 Hemoglobin subunit beta Human genes 0.000 description 2
- 108091005904 Hemoglobin subunit beta Proteins 0.000 description 2
- 241000724675 Hepatitis E virus Species 0.000 description 2
- 241000724709 Hepatitis delta virus Species 0.000 description 2
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- 101000952099 Homo sapiens Antiviral innate immune response receptor RIG-I Proteins 0.000 description 2
- 101000874316 Homo sapiens B melanoma antigen 1 Proteins 0.000 description 2
- 101001016856 Homo sapiens Heat shock protein HSP 90-beta Proteins 0.000 description 2
- 101000578784 Homo sapiens Melanoma antigen recognized by T-cells 1 Proteins 0.000 description 2
- 101000880770 Homo sapiens Protein SSX2 Proteins 0.000 description 2
- 101000984042 Homo sapiens Protein lin-28 homolog A Proteins 0.000 description 2
- 101000585484 Homo sapiens Signal transducer and activator of transcription 1-alpha/beta Proteins 0.000 description 2
- 101000687905 Homo sapiens Transcription factor SOX-2 Proteins 0.000 description 2
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 2
- 241000701041 Human betaherpesvirus 7 Species 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- 241001502974 Human gammaherpesvirus 8 Species 0.000 description 2
- 241000701027 Human herpesvirus 6 Species 0.000 description 2
- 241000342334 Human metapneumovirus Species 0.000 description 2
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 2
- GRSZFWQUAKGDAV-UHFFFAOYSA-N Inosinic acid Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(NC=NC2=O)=C2N=C1 GRSZFWQUAKGDAV-UHFFFAOYSA-N 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 108010032036 Interferon Regulatory Factor-7 Proteins 0.000 description 2
- 102100026720 Interferon beta Human genes 0.000 description 2
- 108090000890 Interferon regulatory factor 1 Proteins 0.000 description 2
- 108090000467 Interferon-beta Proteins 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 208000008839 Kidney Neoplasms Diseases 0.000 description 2
- 241000710912 Kunjin virus Species 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- 101710093778 L-dopachrome tautomerase Proteins 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- 101000988090 Leishmania donovani Heat shock protein 83 Proteins 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- 239000006137 Luria-Bertani broth Substances 0.000 description 2
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 2
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 2
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 201000005505 Measles Diseases 0.000 description 2
- 102100034216 Melanocyte-stimulating hormone receptor Human genes 0.000 description 2
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 2
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 2
- 241001480037 Microsporum Species 0.000 description 2
- 241001260008 Microsporum equinum Species 0.000 description 2
- 208000025370 Middle East respiratory syndrome Diseases 0.000 description 2
- 241000711466 Murine hepatitis virus Species 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 2
- 102000003505 Myosin Human genes 0.000 description 2
- 108060008487 Myosin Proteins 0.000 description 2
- 239000012124 Opti-MEM Substances 0.000 description 2
- 241000713112 Orthobunyavirus Species 0.000 description 2
- 241000150218 Orthonairovirus Species 0.000 description 2
- 108060006580 PRAME Proteins 0.000 description 2
- 102000036673 PRAME Human genes 0.000 description 2
- 102100034640 PWWP domain-containing DNA repair factor 3A Human genes 0.000 description 2
- 108050007154 PWWP domain-containing DNA repair factor 3A Proteins 0.000 description 2
- 241000711504 Paramyxoviridae Species 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 2
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 241000713137 Phlebovirus Species 0.000 description 2
- 241000700625 Poxviridae Species 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- 206010060862 Prostate cancer Diseases 0.000 description 2
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 102100037686 Protein SSX2 Human genes 0.000 description 2
- 102100025460 Protein lin-28 homolog A Human genes 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- 208000010362 Protozoan Infections Diseases 0.000 description 2
- 206010038389 Renal cancer Diseases 0.000 description 2
- 241000702263 Reovirus sp. Species 0.000 description 2
- 241000235527 Rhizopus Species 0.000 description 2
- 241000710801 Rubivirus Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- 101710173694 Short transient receptor potential channel 2 Proteins 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- 102100038803 Somatotropin Human genes 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 description 2
- 108091008874 T cell receptors Proteins 0.000 description 2
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 2
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 2
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- 102000005876 Tissue Inhibitor of Metalloproteinases Human genes 0.000 description 2
- 108010005246 Tissue Inhibitor of Metalloproteinases Proteins 0.000 description 2
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 description 2
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102100024270 Transcription factor SOX-2 Human genes 0.000 description 2
- 241000711484 Transmissible gastroenteritis virus Species 0.000 description 2
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 2
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 2
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 2
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 2
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 2
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 2
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 2
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- 108010047933 Tumor Necrosis Factor alpha-Induced Protein 3 Proteins 0.000 description 2
- 102100024596 Tumor necrosis factor alpha-induced protein 3 Human genes 0.000 description 2
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 2
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 241000700618 Vaccinia virus Species 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 241000711970 Vesiculovirus Species 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- SIIZPVYVXNXXQG-KGXOGWRBSA-N [(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-4-[[(3s,4r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-3-hydroxyoxolan-2-yl]methyl [(2r,4r,5r)-2-(6-aminopurin-9-yl)-4-hydroxy-5-(phosphonooxymethyl)oxolan-3-yl] hydrogen phosphate Polymers C1=NC2=C(N)N=CN=C2N1[C@@H]1O[C@H](COP(O)(=O)OC2[C@@H](O[C@H](COP(O)(O)=O)[C@H]2O)N2C3=NC=NC(N)=C3N=C2)[C@@H](O)[C@H]1OP(O)(=O)OCC([C@@H](O)[C@H]1O)OC1N1C(N=CN=C2N)=C2N=C1 SIIZPVYVXNXXQG-KGXOGWRBSA-N 0.000 description 2
- 108010081404 acein-2 Proteins 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 2
- 229910052782 aluminium Inorganic materials 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000006071 cream Substances 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 108010051081 dopachrome isomerase Proteins 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 239000002532 enzyme inhibitor Substances 0.000 description 2
- 229940105423 erythropoietin Drugs 0.000 description 2
- 229960000301 factor viii Drugs 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 239000000796 flavoring agent Substances 0.000 description 2
- 235000013355 food flavoring agent Nutrition 0.000 description 2
- 235000003599 food sweetener Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 206010017758 gastric cancer Diseases 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 239000000122 growth hormone Substances 0.000 description 2
- 108010072285 growth inhibitory proteins Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000003481 heat shock protein 90 inhibitor Substances 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 230000006882 induction of apoptosis Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 210000005007 innate immune system Anatomy 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 239000003456 ion exchange resin Substances 0.000 description 2
- 229920003303 ion-exchange polymer Polymers 0.000 description 2
- 201000010982 kidney cancer Diseases 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 201000007270 liver cancer Diseases 0.000 description 2
- 208000014018 liver neoplasm Diseases 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 2
- PUPNJSIFIXXJCH-UHFFFAOYSA-N n-(4-hydroxyphenyl)-2-(1,1,3-trioxo-1,2-benzothiazol-2-yl)acetamide Chemical compound C1=CC(O)=CC=C1NC(=O)CN1S(=O)(=O)C2=CC=CC=C2C1=O PUPNJSIFIXXJCH-UHFFFAOYSA-N 0.000 description 2
- 239000002105 nanoparticle Substances 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000009097 single-agent therapy Methods 0.000 description 2
- 239000008247 solid mixture Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000009168 stem cell therapy Methods 0.000 description 2
- 238000009580 stem-cell therapy Methods 0.000 description 2
- 201000011549 stomach cancer Diseases 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000003765 sweetening agent Substances 0.000 description 2
- 239000006188 syrup Substances 0.000 description 2
- 235000020357 syrup Nutrition 0.000 description 2
- 239000003826 tablet Substances 0.000 description 2
- 229960000187 tissue plasminogen activator Drugs 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 230000004614 tumor growth Effects 0.000 description 2
- 102000003390 tumor necrosis factor Human genes 0.000 description 2
- 241001529453 unidentified herpesvirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 201000005112 urinary bladder cancer Diseases 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- JNYAEWCLZODPBN-JGWLITMVSA-N (2r,3r,4s)-2-[(1r)-1,2-dihydroxyethyl]oxolane-3,4-diol Chemical compound OC[C@@H](O)[C@H]1OC[C@H](O)[C@H]1O JNYAEWCLZODPBN-JGWLITMVSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 1
- LIWOHUSRWUWRSX-ZJZGAYNASA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-methylbutanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LIWOHUSRWUWRSX-ZJZGAYNASA-N 0.000 description 1
- SSOORFWOBGFTHL-OTEJMHTDSA-N (4S)-5-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[2-[(2S)-2-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S,3S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-6-amino-1-[[(2S)-6-amino-1-[[(2S)-1-[[(2S)-1-[[(2S)-5-amino-1-[[(2S)-5-carbamimidamido-1-[[(2S)-5-carbamimidamido-1-[[(1S)-4-carbamimidamido-1-carboxybutyl]amino]-1-oxopentan-2-yl]amino]-1-oxopentan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-1-oxohexan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1,5-dioxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-hydroxy-1-oxopropan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-methyl-1-oxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]carbamoyl]pyrrolidin-1-yl]-2-oxoethyl]amino]-3-(1H-indol-3-yl)-1-oxopropan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxohexan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-5-carbamimidamido-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]amino]-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-3-methyl-1-oxobutan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-4-[[(2S)-2-[[(2S)-2-[[(2S)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-5-oxopentanoic acid Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SSOORFWOBGFTHL-OTEJMHTDSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- SNKAWJBJQDLSFF-NVKMUCNASA-N 1,2-dioleoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC SNKAWJBJQDLSFF-NVKMUCNASA-N 0.000 description 1
- WEYNBWVKOYCCQT-UHFFFAOYSA-N 1-(3-chloro-4-methylphenyl)-3-{2-[({5-[(dimethylamino)methyl]-2-furyl}methyl)thio]ethyl}urea Chemical compound O1C(CN(C)C)=CC=C1CSCCNC(=O)NC1=CC=C(C)C(Cl)=C1 WEYNBWVKOYCCQT-UHFFFAOYSA-N 0.000 description 1
- ZIIUUSVHCHPIQD-UHFFFAOYSA-N 2,4,6-trimethyl-N-[3-(trifluoromethyl)phenyl]benzenesulfonamide Chemical compound CC1=CC(C)=CC(C)=C1S(=O)(=O)NC1=CC=CC(C(F)(F)F)=C1 ZIIUUSVHCHPIQD-UHFFFAOYSA-N 0.000 description 1
- 108010000834 2-5A-dependent ribonuclease Proteins 0.000 description 1
- KRQUFUKTQHISJB-YYADALCUSA-N 2-[(E)-N-[2-(4-chlorophenoxy)propoxy]-C-propylcarbonimidoyl]-3-hydroxy-5-(thian-3-yl)cyclohex-2-en-1-one Chemical compound CCC\C(=N/OCC(C)OC1=CC=C(Cl)C=C1)C1=C(O)CC(CC1=O)C1CCCSC1 KRQUFUKTQHISJB-YYADALCUSA-N 0.000 description 1
- GTVAUHXUMYENSK-RWSKJCERSA-N 2-[3-[(1r)-3-(3,4-dimethoxyphenyl)-1-[(2s)-1-[(2s)-2-(3,4,5-trimethoxyphenyl)pent-4-enoyl]piperidine-2-carbonyl]oxypropyl]phenoxy]acetic acid Chemical compound C1=C(OC)C(OC)=CC=C1CC[C@H](C=1C=C(OCC(O)=O)C=CC=1)OC(=O)[C@H]1N(C(=O)[C@@H](CC=C)C=2C=C(OC)C(OC)=C(OC)C=2)CCCC1 GTVAUHXUMYENSK-RWSKJCERSA-N 0.000 description 1
- QCDWFXQBSFUVSP-UHFFFAOYSA-N 2-phenoxyethanol Chemical compound OCCOC1=CC=CC=C1 QCDWFXQBSFUVSP-UHFFFAOYSA-N 0.000 description 1
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 1
- HIQIXEFWDLTDED-UHFFFAOYSA-N 4-hydroxy-1-piperidin-4-ylpyrrolidin-2-one Chemical compound O=C1CC(O)CN1C1CCNCC1 HIQIXEFWDLTDED-UHFFFAOYSA-N 0.000 description 1
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- 239000013607 AAV vector Substances 0.000 description 1
- 102100022423 ATP-dependent RNA helicase DHX33 Human genes 0.000 description 1
- 241000235389 Absidia Species 0.000 description 1
- 101710159080 Aconitate hydratase A Proteins 0.000 description 1
- 101710159078 Aconitate hydratase B Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 101710137115 Adenylyl cyclase-associated protein 1 Proteins 0.000 description 1
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 1
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- 241000223600 Alternaria Species 0.000 description 1
- 102000052587 Anaphase-Promoting Complex-Cyclosome Apc3 Subunit Human genes 0.000 description 1
- 108700004606 Anaphase-Promoting Complex-Cyclosome Apc3 Subunit Proteins 0.000 description 1
- 102100023003 Ankyrin repeat domain-containing protein 30A Human genes 0.000 description 1
- 102000004411 Antithrombin III Human genes 0.000 description 1
- 108090000935 Antithrombin III Proteins 0.000 description 1
- 101100288313 Arabidopsis thaliana KTI4 gene Proteins 0.000 description 1
- 101000719121 Arabidopsis thaliana Protein MEI2-like 1 Proteins 0.000 description 1
- 235000003911 Arachis Nutrition 0.000 description 1
- 244000105624 Arachis hypogaea Species 0.000 description 1
- 241000712891 Arenavirus Species 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- DGFGDPVSDQPANQ-XGEHTFHBSA-N Arg-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)O DGFGDPVSDQPANQ-XGEHTFHBSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- 102000014654 Aromatase Human genes 0.000 description 1
- 108010078554 Aromatase Proteins 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 241000132177 Aspergillus glaucus Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000711404 Avian avulavirus 1 Species 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 108091032955 Bacterial small RNA Proteins 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- 102100021663 Baculoviral IAP repeat-containing protein 5 Human genes 0.000 description 1
- 241000235579 Basidiobolus Species 0.000 description 1
- 241000608319 Bebaru virus Species 0.000 description 1
- 102100023995 Beta-nerve growth factor Human genes 0.000 description 1
- 241000335423 Blastomyces Species 0.000 description 1
- 241000589968 Borrelia Species 0.000 description 1
- 101001129912 Bos taurus Leptin Proteins 0.000 description 1
- 241000711895 Bovine orthopneumovirus Species 0.000 description 1
- 241000712005 Bovine respirovirus 3 Species 0.000 description 1
- 241000710780 Bovine viral diarrhea virus 1 Species 0.000 description 1
- 241000614861 Brachiola Species 0.000 description 1
- 241000589567 Brucella abortus Species 0.000 description 1
- 241001148106 Brucella melitensis Species 0.000 description 1
- 241001148112 Brucella neotomae Species 0.000 description 1
- 241000589568 Brucella ovis Species 0.000 description 1
- 241000508772 Brucella sp. Species 0.000 description 1
- 241001148111 Brucella suis Species 0.000 description 1
- 241001453380 Burkholderia Species 0.000 description 1
- 241000589513 Burkholderia cepacia Species 0.000 description 1
- 241001136175 Burkholderia pseudomallei Species 0.000 description 1
- 241001508395 Burkholderia sp. Species 0.000 description 1
- 101150108242 CDC27 gene Proteins 0.000 description 1
- 241001678559 COVID-19 virus Species 0.000 description 1
- 102100027674 CTD small phosphatase-like protein Human genes 0.000 description 1
- 101710156847 CTD small phosphatase-like protein Proteins 0.000 description 1
- 241000232908 Cabassous Species 0.000 description 1
- 101100438971 Caenorhabditis elegans mat-1 gene Proteins 0.000 description 1
- 101100314454 Caenorhabditis elegans tra-1 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241001493160 California encephalitis virus Species 0.000 description 1
- 241000282836 Camelus dromedarius Species 0.000 description 1
- 102100025570 Cancer/testis antigen 1 Human genes 0.000 description 1
- 102100039510 Cancer/testis antigen 2 Human genes 0.000 description 1
- 241000222173 Candida parapsilosis Species 0.000 description 1
- 241000222178 Candida tropicalis Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 102000003846 Carbonic anhydrases Human genes 0.000 description 1
- 108090000209 Carbonic anhydrases Proteins 0.000 description 1
- 102100027668 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Human genes 0.000 description 1
- 101710134395 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Proteins 0.000 description 1
- 102100027667 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Human genes 0.000 description 1
- 101710134389 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Proteins 0.000 description 1
- 108010022366 Carcinoembryonic Antigen Proteins 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 102100026548 Caspase-8 Human genes 0.000 description 1
- 108090000538 Caspase-8 Proteins 0.000 description 1
- 102000011727 Caspases Human genes 0.000 description 1
- 108010076667 Caspases Proteins 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102000016951 Chemokine CXCL2 Human genes 0.000 description 1
- 108010014414 Chemokine CXCL2 Proteins 0.000 description 1
- 241001502567 Chikungunya virus Species 0.000 description 1
- 241001647372 Chlamydia pneumoniae Species 0.000 description 1
- 241000606153 Chlamydia trachomatis Species 0.000 description 1
- 108090000746 Chymosin Proteins 0.000 description 1
- 241001668502 Cladophialophora carrionii Species 0.000 description 1
- 241000222290 Cladosporium Species 0.000 description 1
- 102100039518 Claudin-12 Human genes 0.000 description 1
- 101710197000 Claudin-12 Proteins 0.000 description 1
- 241001508813 Clavispora lusitaniae Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 241000193449 Clostridium tetani Species 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 102100023804 Coagulation factor VII Human genes 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 241000223205 Coccidioides immitis Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 241000702669 Coltivirus Species 0.000 description 1
- 241001480517 Conidiobolus Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 208000001528 Coronaviridae Infections Diseases 0.000 description 1
- 241000606678 Coxiella burnetii Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 241000150230 Crimean-Congo hemorrhagic fever orthonairovirus Species 0.000 description 1
- 241001289493 Cripavirus Species 0.000 description 1
- 201000007336 Cryptococcosis Diseases 0.000 description 1
- 241000221204 Cryptococcus neoformans Species 0.000 description 1
- 241000223936 Cryptosporidium parvum Species 0.000 description 1
- 241000235555 Cunninghamella Species 0.000 description 1
- 241000223208 Curvularia Species 0.000 description 1
- 241000371644 Curvularia ravenelii Species 0.000 description 1
- 108010025464 Cyclin-Dependent Kinase 4 Proteins 0.000 description 1
- 102100036252 Cyclin-dependent kinase 4 Human genes 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- 241000179197 Cyclospora Species 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- MBPKYKSYUAPLMY-DCAQKATOSA-N Cys-Arg-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MBPKYKSYUAPLMY-DCAQKATOSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- XSELZJJGSKZZDO-UBHSHLNASA-N Cys-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XSELZJJGSKZZDO-UBHSHLNASA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 101100216227 Dictyostelium discoideum anapc3 gene Proteins 0.000 description 1
- 208000002699 Digestive System Neoplasms Diseases 0.000 description 1
- 102100037070 Doublecortin domain-containing protein 2 Human genes 0.000 description 1
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000008157 ELISA kit Methods 0.000 description 1
- 241000710945 Eastern equine encephalitis virus Species 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 108010014258 Elastin Proteins 0.000 description 1
- 102000016942 Elastin Human genes 0.000 description 1
- 241000243234 Encephalitozoon Species 0.000 description 1
- 241000596569 Encephalitozoon intestinalis Species 0.000 description 1
- 102100030013 Endoribonuclease Human genes 0.000 description 1
- 108010093099 Endoribonucleases Proteins 0.000 description 1
- 241000224432 Entamoeba histolytica Species 0.000 description 1
- 241000194032 Enterococcus faecalis Species 0.000 description 1
- 241000194031 Enterococcus faecium Species 0.000 description 1
- 241001442406 Enterocytozoon bieneusi Species 0.000 description 1
- 241001529459 Enterovirus A71 Species 0.000 description 1
- 241000146324 Enterovirus D68 Species 0.000 description 1
- 206010066919 Epidemic polyarthritis Diseases 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- 241001480035 Epidermophyton Species 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000214054 Equine rhinitis A virus Species 0.000 description 1
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 1
- 241000223682 Exophiala Species 0.000 description 1
- 101710187301 FAS-associated factor 1 Proteins 0.000 description 1
- 108010076282 Factor IX Proteins 0.000 description 1
- 108010023321 Factor VII Proteins 0.000 description 1
- 108010049003 Fibrinogen Proteins 0.000 description 1
- 102000008946 Fibrinogen Human genes 0.000 description 1
- 102100028043 Fibroblast growth factor 3 Human genes 0.000 description 1
- 108010022355 Fibroins Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 241000122862 Fonsecaea Species 0.000 description 1
- 241000589601 Francisella Species 0.000 description 1
- 241000589602 Francisella tularensis Species 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 102000000805 Galectin 4 Human genes 0.000 description 1
- 108010001515 Galectin 4 Proteins 0.000 description 1
- 102100031351 Galectin-9 Human genes 0.000 description 1
- 101710121810 Galectin-9 Proteins 0.000 description 1
- 102100031416 Gastric triacylglycerol lipase Human genes 0.000 description 1
- 102400000921 Gastrin Human genes 0.000 description 1
- 108010052343 Gastrins Proteins 0.000 description 1
- 206010017993 Gastrointestinal neoplasms Diseases 0.000 description 1
- 241000178292 Geotrichum clavatum Species 0.000 description 1
- 241000608297 Getah virus Species 0.000 description 1
- 241000224466 Giardia Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- 102000004547 Glucosylceramidase Human genes 0.000 description 1
- 108010017544 Glucosylceramidase Proteins 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 108010086677 Gonadotropins Proteins 0.000 description 1
- 102000006771 Gonadotropins Human genes 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- 108010078321 Guanylate Cyclase Proteins 0.000 description 1
- 102000014469 Guanylate cyclase Human genes 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241001466963 Hawaii calicivirus Species 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- 102100027685 Hemoglobin subunit alpha Human genes 0.000 description 1
- 108091005902 Hemoglobin subunit alpha Proteins 0.000 description 1
- 241000035314 Henipavirus Species 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 208000037262 Hepatitis delta Diseases 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- 241000228404 Histoplasma capsulatum Species 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 102100030234 Homeobox protein cut-like 1 Human genes 0.000 description 1
- 101000901934 Homo sapiens ATP-dependent RNA helicase DHX33 Proteins 0.000 description 1
- 101000757191 Homo sapiens Ankyrin repeat domain-containing protein 30A Proteins 0.000 description 1
- 101000856237 Homo sapiens Cancer/testis antigen 1 Proteins 0.000 description 1
- 101000889345 Homo sapiens Cancer/testis antigen 2 Proteins 0.000 description 1
- 101000859758 Homo sapiens Cartilage-associated protein Proteins 0.000 description 1
- 101000916686 Homo sapiens Cytohesin-interacting protein Proteins 0.000 description 1
- 101000954709 Homo sapiens Doublecortin domain-containing protein 2 Proteins 0.000 description 1
- 101000892862 Homo sapiens Glutamate carboxypeptidase 2 Proteins 0.000 description 1
- 101001016865 Homo sapiens Heat shock protein HSP 90-alpha Proteins 0.000 description 1
- 101000985516 Homo sapiens Hermansky-Pudlak syndrome 5 protein Proteins 0.000 description 1
- 101000726740 Homo sapiens Homeobox protein cut-like 1 Proteins 0.000 description 1
- 101000959820 Homo sapiens Interferon alpha-1/13 Proteins 0.000 description 1
- 101000852865 Homo sapiens Interferon alpha/beta receptor 2 Proteins 0.000 description 1
- 101001082065 Homo sapiens Interferon-induced protein with tetratricopeptide repeats 1 Proteins 0.000 description 1
- 101000614481 Homo sapiens Kidney-associated antigen 1 Proteins 0.000 description 1
- 101001051093 Homo sapiens Low-density lipoprotein receptor Proteins 0.000 description 1
- 101001134060 Homo sapiens Melanocyte-stimulating hormone receptor Proteins 0.000 description 1
- 101001133056 Homo sapiens Mucin-1 Proteins 0.000 description 1
- 101000598921 Homo sapiens Orexin Proteins 0.000 description 1
- 101001126234 Homo sapiens Phospholipid phosphatase 3 Proteins 0.000 description 1
- 101000874141 Homo sapiens Probable ATP-dependent RNA helicase DDX43 Proteins 0.000 description 1
- 101000761460 Homo sapiens Protein CASP Proteins 0.000 description 1
- 101000679365 Homo sapiens Putative tyrosine-protein phosphatase TPTE Proteins 0.000 description 1
- 101001109419 Homo sapiens RNA-binding protein NOB1 Proteins 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 101000857677 Homo sapiens Runt-related transcription factor 1 Proteins 0.000 description 1
- 101000821981 Homo sapiens Sarcoma antigen 1 Proteins 0.000 description 1
- 101000665137 Homo sapiens Scm-like with four MBT domains protein 1 Proteins 0.000 description 1
- 101000739178 Homo sapiens Secretoglobin family 3A member 2 Proteins 0.000 description 1
- 101000831496 Homo sapiens Toll-like receptor 3 Proteins 0.000 description 1
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 1
- 101000808011 Homo sapiens Vascular endothelial growth factor A Proteins 0.000 description 1
- 108010000521 Human Growth Hormone Proteins 0.000 description 1
- 102000002265 Human Growth Hormone Human genes 0.000 description 1
- 239000000854 Human Growth Hormone Substances 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 241000702617 Human parvovirus B19 Species 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- 108010043496 Immunoglobulin Idiotypes Proteins 0.000 description 1
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 1
- 108050002021 Integrator complex subunit 2 Proteins 0.000 description 1
- 102000007578 Interferon Regulatory Factor-3 Human genes 0.000 description 1
- 108010014726 Interferon Type I Proteins 0.000 description 1
- 102000002227 Interferon Type I Human genes 0.000 description 1
- 102100040019 Interferon alpha-1/13 Human genes 0.000 description 1
- 102100036718 Interferon alpha/beta receptor 2 Human genes 0.000 description 1
- 102100027355 Interferon-induced protein with tetratricopeptide repeats 1 Human genes 0.000 description 1
- 101710154084 Interferon-inducible double-stranded RNA-dependent protein kinase activator A Proteins 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 108090000172 Interleukin-15 Proteins 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 241000710842 Japanese encephalitis virus Species 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- 241000588754 Klebsiella sp. Species 0.000 description 1
- 244000285963 Kluyveromyces fragilis Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- LFXSPAIBSZSTEM-PMVMPFDFSA-N Leu-Trp-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LFXSPAIBSZSTEM-PMVMPFDFSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 108010028921 Lipopeptides Proteins 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 1
- 208000016604 Lyme disease Diseases 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- CTJUSALVKAWFFU-CIUDSAMLSA-N Lys-Ser-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N CTJUSALVKAWFFU-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 241000711828 Lyssavirus Species 0.000 description 1
- 108010010995 MART-1 Antigen Proteins 0.000 description 1
- 108700005089 MHC Class I Genes Proteins 0.000 description 1
- 108010000410 MSH receptor Proteins 0.000 description 1
- 241000701076 Macacine alphaherpesvirus 1 Species 0.000 description 1
- 241001539803 Magnusiomyces capitatus Species 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241000555676 Malassezia Species 0.000 description 1
- 241000555688 Malassezia furfur Species 0.000 description 1
- 241000608292 Mayaro virus Species 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 101000761459 Mesocricetus auratus Calcium-dependent serine proteinase Proteins 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HGKJFNCLOHKEHS-FXQIFTODSA-N Met-Cys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(O)=O HGKJFNCLOHKEHS-FXQIFTODSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- 241000235048 Meyerozyma guilliermondii Species 0.000 description 1
- 108700011259 MicroRNAs Proteins 0.000 description 1
- 241000243190 Microsporidia Species 0.000 description 1
- 241001295810 Microsporidium Species 0.000 description 1
- 241000893980 Microsporum canis Species 0.000 description 1
- 241001460074 Microsporum distortum Species 0.000 description 1
- 241000588655 Moraxella catarrhalis Species 0.000 description 1
- 241000235575 Mortierella Species 0.000 description 1
- 241000868135 Mucambo virus Species 0.000 description 1
- 102100034256 Mucin-1 Human genes 0.000 description 1
- 241000235395 Mucor Species 0.000 description 1
- 208000005647 Mumps Diseases 0.000 description 1
- 241000711408 Murine respirovirus Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241000893976 Nannizzia gypsea Species 0.000 description 1
- 241000264375 Nannizzia nana Species 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 108010025020 Nerve Growth Factor Proteins 0.000 description 1
- 102000007530 Neurofibromin 1 Human genes 0.000 description 1
- 108010085793 Neurofibromin 1 Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 241000526636 Nipah henipavirus Species 0.000 description 1
- 101800000515 Non-structural protein 3 Proteins 0.000 description 1
- 241001263478 Norovirus Species 0.000 description 1
- 241000714209 Norwalk virus Species 0.000 description 1
- 241001126829 Nosema Species 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 241000702259 Orbivirus Species 0.000 description 1
- 241000702244 Orthoreovirus Species 0.000 description 1
- 102400000050 Oxytocin Human genes 0.000 description 1
- XNOPRXBHLZRZKH-UHFFFAOYSA-N Oxytocin Natural products N1C(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CC(C)C)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C(C(C)CC)NC(=O)C1CC1=CC=C(O)C=C1 XNOPRXBHLZRZKH-UHFFFAOYSA-N 0.000 description 1
- 101800000989 Oxytocin Proteins 0.000 description 1
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241000526686 Paracoccidioides brasiliensis Species 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 241000991583 Parechovirus Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 206010057249 Phagocytosis Diseases 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- 102000015439 Phospholipases Human genes 0.000 description 1
- 108010064785 Phospholipases Proteins 0.000 description 1
- 102100030450 Phospholipid phosphatase 3 Human genes 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241000235645 Pichia kudriavzevii Species 0.000 description 1
- 241000305299 Pithomyces Species 0.000 description 1
- 241000868134 Pixuna virus Species 0.000 description 1
- 206010035148 Plague Diseases 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 108010001014 Plasminogen Activators Proteins 0.000 description 1
- 102000001938 Plasminogen Activators Human genes 0.000 description 1
- 241001492488 Pleistophora Species 0.000 description 1
- 241000233872 Pneumocystis carinii Species 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- 241001672814 Porcine teschovirus 1 Species 0.000 description 1
- 108010013381 Porins Proteins 0.000 description 1
- 241001300940 Porphyromonas sp. Species 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 201000009754 Powassan encephalitis Diseases 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 102100035724 Probable ATP-dependent RNA helicase DDX43 Human genes 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 102000003946 Prolactin Human genes 0.000 description 1
- 108010057464 Prolactin Proteins 0.000 description 1
- 102000006437 Proprotein Convertases Human genes 0.000 description 1
- 108010044159 Proprotein Convertases Proteins 0.000 description 1
- 101800000980 Protease nsP2 Proteins 0.000 description 1
- 101800004937 Protein C Proteins 0.000 description 1
- 102000017975 Protein C Human genes 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 102100020932 Putative protein RIG Human genes 0.000 description 1
- 102100022578 Putative tyrosine-protein phosphatase TPTE Human genes 0.000 description 1
- 241000197220 Pythium insidiosum Species 0.000 description 1
- 101710162430 RISC-loading complex subunit TARBP2 Proteins 0.000 description 1
- 108020005161 RNA Caps Proteins 0.000 description 1
- 102000004409 RNA Helicases Human genes 0.000 description 1
- 108090000944 RNA Helicases Proteins 0.000 description 1
- 229940022005 RNA vaccine Drugs 0.000 description 1
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 1
- 101710105008 RNA-binding protein Proteins 0.000 description 1
- 102100022491 RNA-binding protein NOB1 Human genes 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 101150066717 Rara gene Proteins 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 206010051497 Rhinotracheitis Diseases 0.000 description 1
- 241001361634 Rhizoctonia Species 0.000 description 1
- 241000936948 Rhopalosiphum padi virus Species 0.000 description 1
- 240000000528 Ricinus communis Species 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000713124 Rift Valley fever virus Species 0.000 description 1
- 241000710942 Ross River virus Species 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 102100025373 Runt-related transcription factor 1 Human genes 0.000 description 1
- 241000907329 Russian Spring-Summer encephalitis virus Species 0.000 description 1
- 241000315672 SARS coronavirus Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000293026 Saksenaea Species 0.000 description 1
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 1
- 101800001700 Saposin-D Proteins 0.000 description 1
- 102100021466 Sarcoma antigen 1 Human genes 0.000 description 1
- 241000132889 Scedosporium Species 0.000 description 1
- 102100038689 Scm-like with four MBT domains protein 1 Human genes 0.000 description 1
- 102100037269 Secretoglobin family 3A member 2 Human genes 0.000 description 1
- 102000003800 Selectins Human genes 0.000 description 1
- 108090000184 Selectins Proteins 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 101710173693 Short transient receptor potential channel 1 Proteins 0.000 description 1
- 241000710960 Sindbis virus Species 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 241000509413 Snow Mountain virus Species 0.000 description 1
- 102000013275 Somatomedins Human genes 0.000 description 1
- 241001149962 Sporothrix Species 0.000 description 1
- 241001149963 Sporothrix schenckii Species 0.000 description 1
- 241000713675 Spumavirus Species 0.000 description 1
- 102100035748 Squamous cell carcinoma antigen recognized by T-cells 3 Human genes 0.000 description 1
- 101710185775 Squamous cell carcinoma antigen recognized by T-cells 3 Proteins 0.000 description 1
- 241000710888 St. Louis encephalitis virus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241001147691 Staphylococcus saprophyticus Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 241000193985 Streptococcus agalactiae Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 108010023197 Streptokinase Proteins 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 101800001271 Surface protein Proteins 0.000 description 1
- 108010002687 Survivin Proteins 0.000 description 1
- 101710143177 Synaptonemal complex protein 1 Proteins 0.000 description 1
- 102100036234 Synaptonemal complex protein 1 Human genes 0.000 description 1
- 208000027585 T-cell non-Hodgkin lymphoma Diseases 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 108700019889 TEL-AML1 fusion Proteins 0.000 description 1
- 102100033082 TNF receptor-associated factor 3 Human genes 0.000 description 1
- 101150033985 TPI gene Proteins 0.000 description 1
- 101150032817 TPI1 gene Proteins 0.000 description 1
- 241001523006 Talaromyces marneffei Species 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 241001648840 Thosea asigna virus Species 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 1
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 description 1
- 102000002689 Toll-like receptor Human genes 0.000 description 1
- 108020000411 Toll-like receptor Proteins 0.000 description 1
- 102100024324 Toll-like receptor 3 Human genes 0.000 description 1
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 1
- 102100033117 Toll-like receptor 9 Human genes 0.000 description 1
- 241000223996 Toxoplasma Species 0.000 description 1
- 241000223997 Toxoplasma gondii Species 0.000 description 1
- 241001249162 Trachipleistophora Species 0.000 description 1
- 241000589884 Treponema pallidum Species 0.000 description 1
- 241000893963 Trichophyton concentricum Species 0.000 description 1
- 241000893962 Trichophyton equinum Species 0.000 description 1
- 241001045770 Trichophyton mentagrophytes Species 0.000 description 1
- 241001609979 Trichophyton quinckeanum Species 0.000 description 1
- 241000223229 Trichophyton rubrum Species 0.000 description 1
- 241001480048 Trichophyton tonsurans Species 0.000 description 1
- 241000893966 Trichophyton verrucosum Species 0.000 description 1
- 241001480050 Trichophyton violaceum Species 0.000 description 1
- 241000223231 Trichosporon beigelii Species 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- OKAMOYTUQMIFJO-JBACZVJFSA-N Trp-Glu-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 OKAMOYTUQMIFJO-JBACZVJFSA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- OWSRIUBVJOQHNY-IHPCNDPISA-N Trp-Lys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N OWSRIUBVJOQHNY-IHPCNDPISA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 1
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 1
- 208000037386 Typhoid Diseases 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- 102100027244 U4/U6.U5 tri-snRNP-associated protein 1 Human genes 0.000 description 1
- 101710155955 U4/U6.U5 tri-snRNP-associated protein 1 Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000870995 Variola Species 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 208000002687 Venezuelan Equine Encephalomyelitis Diseases 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000607626 Vibrio cholerae Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 206010051511 Viral diarrhoea Diseases 0.000 description 1
- 241000144556 Vittaforma Species 0.000 description 1
- 201000006449 West Nile encephalitis Diseases 0.000 description 1
- 208000005466 Western Equine Encephalomyelitis Diseases 0.000 description 1
- 201000005806 Western equine encephalitis Diseases 0.000 description 1
- 241000710772 Yellow fever virus Species 0.000 description 1
- 241000607447 Yersinia enterocolitica Species 0.000 description 1
- 229920000392 Zymosan Polymers 0.000 description 1
- GUIRUWRHBDQCQJ-UHFFFAOYSA-N [(6-oxo-1,7-dihydropurin-2-yl)amino]phosphonic acid Chemical compound P(=O)(O)(O)NC=1NC(C=2NC=NC=2N=1)=O GUIRUWRHBDQCQJ-UHFFFAOYSA-N 0.000 description 1
- 241000222126 [Candida] glabrata Species 0.000 description 1
- 241000606834 [Haemophilus] ducreyi Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000033289 adaptive immune response Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 1
- 108060000200 adenylate cyclase Proteins 0.000 description 1
- 102000030621 adenylate cyclase Human genes 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 102000013529 alpha-Fetoproteins Human genes 0.000 description 1
- 108010026331 alpha-Fetoproteins Proteins 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical class [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000033115 angiogenesis Effects 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 229960004977 anhydrous lactose Drugs 0.000 description 1
- 230000001952 anti-alphavirus Effects 0.000 description 1
- 230000001028 anti-proliverative effect Effects 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 229960005348 antithrombin iii Drugs 0.000 description 1
- 230000007416 antiviral immune response Effects 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 229940091771 aspergillus fumigatus Drugs 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- GLMQHZPGHAPYIO-UHFFFAOYSA-L azanium;2-hydroxypropane-1,2,3-tricarboxylate;iron(2+) Chemical compound [NH4+].[Fe+2].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O GLMQHZPGHAPYIO-UHFFFAOYSA-L 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- VEZXCJBBBCKRPI-UHFFFAOYSA-N beta-propiolactone Chemical compound O=C1CCO1 VEZXCJBBBCKRPI-UHFFFAOYSA-N 0.000 description 1
- 239000003833 bile salt Substances 0.000 description 1
- 229940093761 bile salts Drugs 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229940126587 biotherapeutics Drugs 0.000 description 1
- 229910021538 borax Inorganic materials 0.000 description 1
- 208000008921 border disease Diseases 0.000 description 1
- 108010006025 bovine growth hormone Proteins 0.000 description 1
- 206010006451 bronchitis Diseases 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- PPBOKXIGFIBOGK-BDTUAEFFSA-N bvdv Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)C(C)C)[C@@H](C)CC)C1=CN=CN1 PPBOKXIGFIBOGK-BDTUAEFFSA-N 0.000 description 1
- RFCBNSCSPXMEBK-INFSMZHSSA-N c-GMP-AMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]3[C@@H](O)[C@H](N4C5=NC=NC(N)=C5N=C4)O[C@@H]3COP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 RFCBNSCSPXMEBK-INFSMZHSSA-N 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 208000032343 candida glabrata infection Diseases 0.000 description 1
- 229940055022 candida parapsilosis Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 150000003943 catecholamines Chemical class 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- AOXOCDRNSPFDPE-UKEONUMOSA-N chembl413654 Chemical compound C([C@H](C(=O)NCC(=O)N[C@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@H](CCSC)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC=1C=CC=CC=1)C(N)=O)NC(=O)[C@@H](C)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@@H](N)CCC(O)=O)C1=CC=C(O)C=C1 AOXOCDRNSPFDPE-UKEONUMOSA-N 0.000 description 1
- 229940038705 chlamydia trachomatis Drugs 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 229940080701 chymosin Drugs 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 235000019864 coconut oil Nutrition 0.000 description 1
- 239000003240 coconut oil Substances 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 101150047356 dec-1 gene Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000003413 degradative effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- WZISDKTXHMETKG-UHFFFAOYSA-H dimagnesium;dipotassium;trisulfate Chemical compound [Mg+2].[Mg+2].[K+].[K+].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O WZISDKTXHMETKG-UHFFFAOYSA-H 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- WDRWZVWLVBXVOI-QTNFYWBSSA-L dipotassium;(2s)-2-aminopentanedioate Chemical compound [K+].[K+].[O-]C(=O)[C@@H](N)CCC([O-])=O WDRWZVWLVBXVOI-QTNFYWBSSA-L 0.000 description 1
- 230000006806 disease prevention Effects 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 230000007783 downstream signaling Effects 0.000 description 1
- 230000005014 ectopic expression Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 229920002549 elastin Polymers 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000001159 endocytotic effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 229940007078 entamoeba histolytica Drugs 0.000 description 1
- 229940032049 enterococcus faecalis Drugs 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 229960004222 factor ix Drugs 0.000 description 1
- 229940012413 factor vii Drugs 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 229940012952 fibrinogen Drugs 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010067667 gamma Subunit Interferon-Stimulated Gene Factor 3 Proteins 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 108010091264 gastric triacylglycerol lipase Proteins 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 239000002622 gonadotropin Substances 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 102000028718 growth factor binding proteins Human genes 0.000 description 1
- 108091009353 growth factor binding proteins Proteins 0.000 description 1
- 239000010440 gypsum Substances 0.000 description 1
- 229910052602 gypsum Inorganic materials 0.000 description 1
- 150000008282 halocarbons Chemical class 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 108010080417 hemozoin Proteins 0.000 description 1
- 208000010710 hepatitis C virus infection Diseases 0.000 description 1
- 208000029570 hepatitis D virus infection Diseases 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000013632 homeostatic process Effects 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 102000058223 human VEGFA Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 239000000017 hydrogel Substances 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 208000037797 influenza A Diseases 0.000 description 1
- 208000037798 influenza B Diseases 0.000 description 1
- 208000037799 influenza C Diseases 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 108010051621 interferon regulatory factor-8 Proteins 0.000 description 1
- 230000011542 interferon-beta production Effects 0.000 description 1
- 108010074108 interleukin-21 Proteins 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 239000004313 iron ammonium citrate Substances 0.000 description 1
- 235000000011 iron ammonium citrate Nutrition 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 229960001375 lactose Drugs 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- GZQKNULLWNGMCW-PWQABINMSA-N lipid A (E. coli) Chemical compound O1[C@H](CO)[C@@H](OP(O)(O)=O)[C@H](OC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCCCC)[C@@H](NC(=O)C[C@@H](CCCCCCCCCCC)OC(=O)CCCCCCCCCCC)[C@@H]1OC[C@@H]1[C@@H](O)[C@H](OC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](NC(=O)C[C@H](O)CCCCCCCCCCC)[C@@H](OP(O)(O)=O)O1 GZQKNULLWNGMCW-PWQABINMSA-N 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 238000011866 long-term treatment Methods 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108700021021 mRNA Vaccine Proteins 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 235000013919 monopotassium glutamate Nutrition 0.000 description 1
- LPUQAYUQRXPFSQ-DFWYDOINSA-M monosodium L-glutamate Chemical compound [Na+].[O-]C(=O)[C@@H](N)CCC(O)=O LPUQAYUQRXPFSQ-DFWYDOINSA-M 0.000 description 1
- 235000013923 monosodium glutamate Nutrition 0.000 description 1
- 239000004223 monosodium glutamate Substances 0.000 description 1
- 208000010805 mumps infectious disease Diseases 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- GNOLWGAJQVLBSM-UHFFFAOYSA-N n,n,5,7-tetramethyl-1,2,3,4-tetrahydronaphthalen-1-amine Chemical group C1=C(C)C=C2C(N(C)C)CCCC2=C1C GNOLWGAJQVLBSM-UHFFFAOYSA-N 0.000 description 1
- 239000007922 nasal spray Substances 0.000 description 1
- 229940097496 nasal spray Drugs 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229940053128 nerve growth factor Drugs 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000007764 o/w emulsion Substances 0.000 description 1
- 239000002674 ointment Substances 0.000 description 1
- 229940049964 oleate Drugs 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 229960001723 oxytocin Drugs 0.000 description 1
- XNOPRXBHLZRZKH-DSZYJQQASA-N oxytocin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@H](N)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(N)=O)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 XNOPRXBHLZRZKH-DSZYJQQASA-N 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 102000007863 pattern recognition receptors Human genes 0.000 description 1
- 108010089193 pattern recognition receptors Proteins 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108010011903 peptide receptors Proteins 0.000 description 1
- 102000014187 peptide receptors Human genes 0.000 description 1
- 230000008782 phagocytosis Effects 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 229960005323 phenoxyethanol Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 229940127126 plasminogen activator Drugs 0.000 description 1
- 229920000747 poly(lactic acid) Polymers 0.000 description 1
- 229920001606 poly(lactic acid-co-glycolic acid) Polymers 0.000 description 1
- 229920001481 poly(stearyl methacrylate) Polymers 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 102000007739 porin activity proteins Human genes 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 235000007686 potassium Nutrition 0.000 description 1
- GRLPQNLYRHEGIJ-UHFFFAOYSA-J potassium aluminium sulfate Chemical compound [Al+3].[K+].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O GRLPQNLYRHEGIJ-UHFFFAOYSA-J 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000000770 proinflammatory effect Effects 0.000 description 1
- 229940097325 prolactin Drugs 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010007513 prolyl-glycyl-prolyl-leucine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- AAEVYOVXGOFMJO-UHFFFAOYSA-N prometryn Chemical compound CSC1=NC(NC(C)C)=NC(NC(C)C)=N1 AAEVYOVXGOFMJO-UHFFFAOYSA-N 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 239000003380 propellant Substances 0.000 description 1
- 229960000380 propiolactone Drugs 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 229960000856 protein c Drugs 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000007076 release of cytoplasmic sequestered NF-kappaB Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 229960002181 saccharomyces boulardii Drugs 0.000 description 1
- 235000002020 sage Nutrition 0.000 description 1
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108091006024 signal transducing proteins Proteins 0.000 description 1
- 102000034285 signal transducing proteins Human genes 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- HRZFUMHJMZEROT-UHFFFAOYSA-L sodium disulfite Chemical compound [Na+].[Na+].[O-]S(=O)S([O-])(=O)=O HRZFUMHJMZEROT-UHFFFAOYSA-L 0.000 description 1
- 229940001584 sodium metabisulfite Drugs 0.000 description 1
- 235000010262 sodium metabisulphite Nutrition 0.000 description 1
- 235000010339 sodium tetraborate Nutrition 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 108020003113 steroid hormone receptors Proteins 0.000 description 1
- 102000005969 steroid hormone receptors Human genes 0.000 description 1
- 230000000365 steroidogenetic effect Effects 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 229960005202 streptokinase Drugs 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 210000003523 substantia nigra Anatomy 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000005846 sugar alcohols Polymers 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 208000006379 syphilis Diseases 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 239000005495 thyroid hormone Substances 0.000 description 1
- 229940036555 thyroid hormone Drugs 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 229940055035 trichophyton verrucosum Drugs 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- BSVBQGMMJUBVOD-UHFFFAOYSA-N trisodium borate Chemical compound [Na+].[Na+].[Na+].[O-]B([O-])[O-] BSVBQGMMJUBVOD-UHFFFAOYSA-N 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 201000008297 typhoid fever Diseases 0.000 description 1
- 108010014402 tyrosinase-related protein-1 Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 230000006663 ubiquitin-proteasome pathway Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 201000006266 variola major Diseases 0.000 description 1
- 201000000627 variola minor Diseases 0.000 description 1
- 208000014016 variola minor infection Diseases 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 239000004034 viscosity adjusting agent Substances 0.000 description 1
- 108010047303 von Willebrand Factor Proteins 0.000 description 1
- 102100036537 von Willebrand factor Human genes 0.000 description 1
- 229960001134 von willebrand factor Drugs 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 229940051021 yellow-fever virus Drugs 0.000 description 1
- 229940098232 yersinia enterocolitica Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/39—Medicinal preparations containing antigens or antibodies characterised by the immunostimulating additives, e.g. chemical adjuvants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/0008—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
- A61K48/0025—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition wherein the non-active part clearly interacts with the delivered nucleic acid
- A61K48/0033—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition wherein the non-active part clearly interacts with the delivered nucleic acid the non-active part being non-polymeric
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/60—Medicinal preparations containing antigens or antibodies characteristics by the carrier linked to the antigen
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/24011—Poxviridae
- C12N2710/24111—Orthopoxvirus, e.g. vaccinia virus, variola
- C12N2710/24122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/36011—Togaviridae
- C12N2770/36111—Alphavirus, e.g. Sindbis virus, VEE, EEE, WEE, Semliki
- C12N2770/36121—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/36011—Togaviridae
- C12N2770/36111—Alphavirus, e.g. Sindbis virus, VEE, EEE, WEE, Semliki
- C12N2770/36122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/36011—Togaviridae
- C12N2770/36111—Alphavirus, e.g. Sindbis virus, VEE, EEE, WEE, Semliki
- C12N2770/36134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- Genetics & Genomics (AREA)
- Epidemiology (AREA)
- Microbiology (AREA)
- Mycology (AREA)
- Engineering & Computer Science (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- General Engineering & Computer Science (AREA)
- Virology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
본 발명은 RNA 구조체에 관한 것이며, 특히, 배타적인 것은 아니지만, mRNA 구조체 및 saRNA 레플리콘, 및 이러한 RNA 구조체를 암호화하는 핵산 및 발현 벡터에 관한 것이다. 본 발명은 이러한 RNA 구조체를 치료법에, 예를 들면 질환의 치료 및/또는 백신 운반에 사용하는 용도로 확장된다. 본 발명은 이러한 RNA 구조체를 포함하는 약학적 조성물, 및 그의 방법 및 용도로 확장된다.
Description
본 발명은 RNA 구조체(construct)에 관한 것이며, 특히, 배타적인 것은 아니지만, mRNA 구조체 및 saRNA 레플리콘(replicon), 및 이러한 RNA 구조체를 암호화하는 핵산 및 발현 벡터(expression vector)에 관한 것이다. 본 발명은 이러한 RNA 구조체를 치료법에, 예를 들면 질환의 치료 및/또는 백신 운반에 사용하는 용도로 확장된다. 본 발명은 이러한 RNA 구조체를 포함하는 약학적 조성물, 및 그의 방법 및 용도로 확장된다.
메신저 RNA(mRNA)는 생물치료제(biotherapeutics)를 위한 유망한 도구이다. 그러나, mRNA 치료제는 소동물에서 매우 효과적인 것으로 나타났지만, 그 결과는 상기 제형이 인간에서 용량-증량 연구로 번역될 때 선형으로 규모가 커지지는 않는다. 아울러, 인터페론 반응의 유도와 연관된 부작용은 증가된 용량의 RNA가 인간에서 효과적일 가능성과 관련한 속도-제한요인(rate-limiting)인 것으로 나타나 있다. 이러한 불일치의 이유는 명확하지 않지만, 본 발명자들은 인간 선천적 감지(innate sensing)에서의 내재하는 차이가 실험실로부터 임상으로 RNA 치료제를 번역하기 위한 장애물을 제공한다는 가설을 세운다. 아울러, RNA의 선천적 감지는 단백질 발현의 억제와 연관된다. 지금까지, 외인성(exogenous) RNA의 선천적 인식을 극복하기 위한 주된 접근법은 선천적 감지 메커니즘에 의해 검출이 덜되는 변형된 리보뉴클레오티드를 사용하는 것이다. 그러나, 변형된 mRNA는 완전히 검출불가능하지는 않으며, 인간에게 사용할 때 여전히 인터페론의 일부 유도, 단백질 사일런싱(silencing) 및 감소된 내약성(tolerability)을 야기한다(도 2 참조).
다른 접근법은 전형적으로 그 비-구조 단백질 내에 폴리머라아제(polymerase) 활성을 암호화함으로써 그 자신의 RNA를 자가-증폭하는 능력을 갖고 있는 알파바이러스 백본(alphavirus backbone)에 기반하는 자가-증폭형(self-amplifying) 또는 saRNA 벡터를 이용하는 것이다. 종래 기술의 방법은 상기 벡터의 구조 단백질을, 예를 들면 관심있는 항원을 암호화하는 관심있는 유전자(GOI)로 교체하여 이를 백신 구조체로 하거나 치료 단백질을 암호화하는 것을 수반한다. 다른 버전의 saRNA는 피코르나바이러스(picornavirus), 플라비바이러스(flavivirus), 및 코로나바이러스(coronavirus)에 기반한다. saRNA가 표적 세포의 세포질 내로 들어갈 때, 이것은 상기 암호화된 폴리머라아제 기구(machinery)에 의한 RNA의 증폭 및 GOI의 매우 높은 발현 레벨(level)을 유도한다. 결과적으로, saRNA는 mRNA보다 낮은(10 내지 100배 낮은) 용량의 saRNA로 면역 반응을 유도하는 것으로 나타났고, 마우스에서 최대 60일까지 연장된 단백질 발현을 야기한다.
그러나, 도 3에 나타낸 것과 같이, saRNA를 이용하는 결점은 이것은 또한 선천적 감지 패턴 인식 수용체에 의해 감지되어 상기 종래 기술의 saRNA의 단백질 발현 및 자가-증폭을 제한하는 항-바이러스(인터페론) 반응을 촉발하게 된다는 것이다. saRNA의 선천적 감지는 그 큰 크기(전형적으로 >5,000 염기) 및 이중 가닥 영역(dsRNA)을 포함하는 고도한 2차 구조로 인해 mRNA의 경우와 상이하다. 길고 이중 가닥인 RNA는 다른 센서들 중에서도 DA5(흑색종 분화-연관 단백질 5) 경로를 통해 선천적 반응을 촉발한다. 이것은 길고 dsRNA인 RNA에 대한 PACT(PKR 활성화 단백질)의 결합에 의해 MDA5의 올리고머화(oligomerization)를 촉진하고, 이어서 saRNA의 복제 및 발현을 억제하는 하류의 신호전달 캐스케이드(cascade)를 촉발함으로써 용이하게 된다.
따라서, 본 기술분야에는 RNA 치료제가 mRNA- 또는 saRNA 기반이고, 환자에게 운반 및 발현되어 선천적 면역 시스템 감지를 극복할 수 있는 새로운 수단을 생성하기 위한 필요성이 있다.
본 발명자들은 면역 시스템 기구의 활성을 차단 또는 감소시키는 비-바이러스(예컨대, 포유동물) 면역 조절성 단백질을 발현함으로써 RNA를 감지하는 선천적 면역 시스템을 유리하게 극복하고, 숙주 세포에서 개선된 번역(mRNA의 경우) 및 증가된 자가-증폭 및 이후의 번역(saRNA의 경우)과, 이에 따른 관심있는 유전자, 예컨대 항원의 더 큰 단백질 발현 레벨을 야기하는 신규한 RNA 구조체(saRNA 및 mRNA)를 개발하였다.
따라서, 본 발명의 제1 측면에서, (ⅰ) 적어도 하나의 치료 생체분자(biomolecule); 및 (ⅱ) 적어도 하나의 비-바이러스 선천적 조절 단백질(innate modulatory protein, IMP);을 암호화하는 RNA 구조체가 제공된다.
RNA 구조체, 예컨대 mRNA 및 saRNA 레플리콘은 백신 및 치료제에 대한 관심있는 유전자의 운반 및 발현을 위한 잠재적인 도구로 상정되어 왔다. 그러나, 단일 가닥 mRNA(ssRNA) 및 이중 가닥 RNA(dsRNA)는 단백질 번역을 억제하는 반응을 촉발하는 선천적 감지 메커니즘에 의해 세포내에서 검출된다. 결과적으로, RNA 구조체에 의해 암호화되는 관심있는 유전자의 발현은 현저하게 손상되고, 따라서 saRNA 및 mRNA를 포함하는 RNA 구조체의 면역원성 또는 치료 잠재력은 제한된다. 유리하게는, 본 발명의 RNA 구조체는 숙주 세포 내에서 전이유전자(transgene) 발현의 하류 선천적 억제를 감소시키거나 제거하는 하나 이상의 비-바이러스 선천적 조절 단백질(IMP)을 암호화하기 때문에 이러한 문제점을 극복한다.
인터페론의 유도는 선천적 인식의 하류 결과 중 하나이지만, 아래에 논의된 것과 같이, 다른 분자 및 경로가 유도되거나 유도될 수 있고, 이들 중 임의의 것은 RNA 구조체가 갖고 있는 하나 이상의 비-바이러스 면역 조절성 단백질에 의해 억제될 것임이 인식될 것이다. 따라서, 바람직하게는 적어도 하나의 선천적 조절 단백질(IMP)은 본 발명의 RNA 구조체로 처리된 대상체에서 RNA에 대한 선천적 면역 반응을 조절할 수 있다. 따라서, 상기 IMP는 선천적 면역의 조절제로서 기술될 수 있다. 이것은 또한 일부 구현예에서 인터페론 억제 분자로 기술될 수 있다.
saRNA를 이용해 인터페론 반응을 제거하는 이전에 공개된 한 접근법은 백시니아(vaccinia) 바이러스 유래의 인터페론 억제 단백질인 E3, K3 및 B18을 이용하였다. 그러나, 상기 연구에서, 인터페론 억제 단백질은 saRNA와 조합된 별도의 mRNA 분자로서 운반 및 제형화되었다. 이것은 saRNA 및 mRNA 모두의 제조를 필요로 하고, 단백질 발현에서 임의의 관찰가능한 향상을 제공하기 위하여 본 발명에 따른 saRNA 레플리콘 구조체보다 적어도 3-6배 더 많은 백시니아 mRNA를 사용하는 것을 필요하게 하였다.
유리하게는, 제1 측면의 RNA 구조체에서, 하나 이상의 비-바이러스 선천적 조절 단백질의 존재는 관심있는 펩티드 또는 단백질, 즉 생물치료제 분자를 이용한 이중 단백질 발현을 가능하게 한다. 종래 기술에서 기술된 것과 같이 하나는 관심있는 펩티드/단백질을 암호화하고 하나는 선천적 조절 단백질을 암호화하는 2개의 상이한 가닥의 RNA를 운반하는 것과 대조적으로, 본 발명의 RNA 구조체를 이용할 때 단지 하나의 단일 가닥만이 표적 세포로 운반되고, 이로 인해 RNA 분자 및 비-바이러스 면역 조절성 단백질의 동시국소화(colocalization)를 보장한다. 상기 비-바이러스 면역 조절성 단백질은 숙주 세포에서 RNA의 선천적 감지를 억제하고, 이로 인해 더 높은 단백질 발현 및 번역을 가능하게 하며, 상기 비-바이러스 면역 조절성 단백질의 발현 자체는 치료 생체분자와 동일한 RNA 분자로부터 공동-발현 및 번역된다.
실시예에 기술된 것과 같이, (GOI로서) ("스틸티콘(Stealthicon)"으로도 알려져 있는) 예시적인 루시퍼라아제(luciferase)를 암호화하는 본 발명의 RNA 구조체는 놀랍게도 온전한 선천적 감지 시스템을 갖는 인간 세포주에서 시험관내 루시퍼라아제 단백질 발현 레벨을 상기 구조체에 IMP가 없는 대조군과 비교하여 적어도 2 자리수(order) 규모 이상으로 증가시키고, 또한 BL/6 마우스에서 종래의 VEEV RNA 레플리콘과 비교하여 생체내에서 루시퍼라아제의 단백질 발현의 규모 및 기간 모두를 증가시키는 것으로 나타났다. 또한, VEGF-A(도 10 참조)는 GOI로서 루시퍼라아제에 대한 대안적 예를 나타낸다.
상기 RNA 구조체는 본 발명의 RNA 분자가 갖고 있는 유전자를 생체내에서 발현할 수 있는 것으로 판명되었기 때문에, 숙련된 기술자는 루시퍼라아제 리포터(reporter)가 치료 생체분자를 정확히 대표함을 용이하게 인식할 것이다. 이와 같이, 상기 루시퍼라아제는 본 발명의 RNA 구조체가 임의의 치료적으로 활성인 생체분자, 예컨대 면역 반응을 촉발하기 위한 항원을 발현하기 위해 사용될 수 있다는 개념 증명(proof of concept)의 강력한 증거를 제공한다.
제1 측면의 RNA 구조체는 단일 가닥 RNA 또는 이중 가닥 RNA일 수 있다.
상기 RNA 구조체는 mRNA 또는 saRNA를 포함할 수 있다.
한 구현예에서, 상기 RNA 구조체는 mRNA를 포함한다. 도 1(오른쪽)은 mRNA 분자로서 RNA 구조체의 다양한 구현예를 실증한다.
그러나, 바람직한 구현예에서, 상기 RNA 구조체는 자가-증폭형 RNA(saRNA)를 포함한다. 도 1(왼쪽)은 saRNA 분자로서 RNA 구조체의 다양한 구현예를 실증한다. 숙련된 기술자는 이러한 RNA 구조체가 또한 자가-복제(self-replicating) RNA 바이러스 벡터, 또는 RNA 레플리콘으로 나타낼 수 있음을 이해할 것이다.
바람직하게는, 상기 saRNA 구조체는 알파바이러스; 피코르나바이러스; 플라비바이러스; 루비바이러스(rubivirus); 페스티바이러스(pestivirus); 헤파시바이러스(hepacivirus); 칼리시바이러스(calicivirus); 및 코로나바이러스;로 이루어진 속(genus)의 군으로부터 선택되는 양성 가닥(positive stranded) RNA 바이러스를 포함하거나 이로부터 유래된다.
바람직하게는, 상기 RNA 구조체는 알파바이러스를 포함하거나 이로부터 유래된다. 적합한 야생형 알파바이러스 서열은 잘 알려져 있다. 적합한 알파바이러스의 대표적인 예는 아우라(Aura), 베바루(Bebaru) 바이러스, 카바소우(Cabassou), 치쿤군야(Chikungunya) 바이러스, 동부 말 뇌척수염 바이러스, 포트 모건(Fort Morgan), 게타(Getah) 바이러스, 키질라가치(Kyzylagach), 마야로(Mayaro), 마야로 바이러스, 미들버그(Middleburg), 무캄보(Mucambo) 바이러스, 은두무(Ndumu), 픽수나(Pixuna) 바이러스, 로스 리버(Ross River) 바이러스, 셈리키 포레스트(Semliki Forest), 신드비스(Sindbis) 바이러스, 토나테(Tonate), 트리니티(Triniti), 우나(Una), 베네주엘라 말 뇌척수염, 서부 말 뇌척수염, 와타로아(Whataroa Whataroa), 및 Y-62-33을 포함한다. 따라서, 바람직하게는 상기 RNA 구조체는 임의의 상기 알파바이러스를 포함하거나 이로부터 유래된다.
바람직하게는, 상기 RNA 구조체는 베네주엘라 말 뇌염 바이러스(VEEV); 엔테로바이러스(enterovirus) 71; 뇌심근염 바이러스; 쿤진(Kunjin) 바이러스; 및 중동 호흡기 증후군 바이러스;로 이루어진 종의 군으로부터 선택되는 바이러스를 포함하거나 이로부터 유래된다. 한 바람직한 구현예에서, 상기 RNA 구조체는 쿤진 바이러스를 포함하거나 이로부터 유래된다. 바람직하게는, 상기 RNA 구조체는 VEEV로부터 유래된다.
바람직하게는, 상기 RNA 구조체는 RNA에 대한 선천적 면역 반응을 감소, 제거, 또는 차단할 수 있는 적어도 하나의 선천적 조절 단백질(IMP)을 암호화하는 뉴클레오티드 서열을 포함한다. 따라서, 상기 IMP는 선천적 면역의 조절제이다. 이것은 또한 인터페론 신호전달의 인터페론 억제제일 수 있다.
상기 IMP는 바람직하게는 포유동물 IMP이다. 보다 바람직하게는, 상기 IMP는 영장류 IMP이다. 가장 바람직하게는, 상기 IMP는 인간 IMP이다.
해당 RNA 분자(즉, 비-내인성으로 생산된 RNA)로 형질전환된 숙주 세포에서 RNA에 대한 선천적 면역 반응의 감소, 제거, 또는 차단은 선천적 신호전달 경로를 억제하고, 및/또는 RNA 인식을 억제하는 IMP 조절성 인터페론 생산에 의해 달성될 수 있다. 인터페론 생산의 조절은 선천적 신호전달의 억제로서 기술될 수 있음이 인식될 것이다. 따라서, 선천적 감지 및 선천적 신호전달 시스템은 (a) RNA 인식 시스템, (b) 인터페론 생산을 유도하고 인터페론-자극 유전자의 자극을 야기하는 경로, 및 (c) 인터페론 신호전달 시스템을 포함한다.
따라서, 상기 IMP는 다음의 4가지 넓은 카테고리 중 하나에 속할 수 있다:
(ⅰ) 카테고리 1: 인터페론 조절 인자 활성의 억제제;
(ⅱ) 카테고리 2: 인터페론 생산을 유도하고 인터페론-자극 유전자의 자극을 야기하는 경로의 억제제;
(ⅲ) 카테고리 3: 인터페론 신호전달의 억제제; 및/또는
(ⅳ) 카테고리 4: RNA 인식 시스템의 억제제.
일부 IMP는 다수의 작용을 가질 수 있음이 인식될 것이다. 예를 들어, 카테고리 4의 IMP는 또한 카테고리 2의 IMP(예컨대, IRF3, IRF7) 및 카테고리 3의 IMP(예컨대, IRF9)로 분류될 수 있다.
카테고리 1: 인터페론 조절 인자 활성의 억제제
한 구현예에서, 상기 IMP는 인터페론 조절 인자 활성을 억제하도록 구성될 수 있다.
RNA에 대한 선천적 면역 반응의 감소, 제거, 또는 차단은 바람직하게는 다양한 항-바이러스 유전자(예컨대, RNA 발현을 저해하는 것으로 알려진 IFIT1-3, Mx1, Mx2), 그 생성물이 선천적 면역 반응을 조직하는 전염증성(proinflammatory) 유전자, 및 임의의 인터페론 의존성 캐스케이드 상류의 정규적(canonically) IFN-자극 유전자(ISG)의 직접적인 활성화를 직접 촉발하는 인터페론 조절 인자(예컨대, IRF3 및 IRF7), NF-κB 전사 인자, 및 다른 신호전달 단백질의 활성화를 감소 또는 방지함으로써 IMP에 의해 달성된다. 상기 경로는 많은 항-바이러스 반응을 추가로 증폭하는 양성 피드백 루프(feedback loop)를 제공하는 타입 I & III 인터페론을 유도함으로써 향상될 수 있다.
따라서, 바람직하게는 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질은 돌연변이된 또는 비-기능적 인터페론 조절 인자(IRF), 또는 그의 우성 음성(dominant negative) 작용 형태를 포함한다. 상기 IRF, 또는 그의 우성 음성 형태는 바람직하게는 숙주 세포에서 자연형 IRF에 대한 RNA와 경쟁하거나 이에 대한 결합을 방지하도록 돌연변이된다.
상기 돌연변이된 또는 비-기능적 인터페론 조절 인자, 또는 그의 우성 음성 작용 형태는 IRF1, IRF2, IRF3, IRF4, IRF5, IRF6, IRF7, IRF8, 또는 IRF9 중 어느 하나일 수 있다.
한 구현예에서, 본 명세서에 기술된 임의의 IRF는 그 DNA 결합 도메인(DBD), 및/또는 그 핵 위치화 신호(NLS)에 결실 또는 돌연변이가 있어서 상기 DBD 및/또는 NLS가 비-기능적이거나 없는 것을 제외하고는 전체 단백질을 포함할 수 있다. 따라서, 바람직하게는 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질은 그DNA 결합 도메인(DBD) 및/또는 핵 위치화 신호(NLS)가 비-기능적이거나 결실되어서, 이것이 세포질에서 우성 음성 형태가 되게 하는 인터페론 조절 인자(IRF)를 포함한다.
또 다른 구현예에서, 바람직하게는 상기 IRF의 DBD 및/또는 NLS를 (개별적으로 또는 함께 융합되어) 포함하는 RNA 구조체에 의해 암호화되는 선천적 조절 단백질은 하나 이상의 인터페론 자극 유전자(ISG)의 프로모터에 대한 대응하는 자연형 IRF의 결합을 경쟁적으로 차단한다.
따라서, 상기 돌연변이된 또는 비-기능적 인터페론 조절 인자, 또는 그의 우성 음성 작용 형태는 인터페론 조절 인자(IRF)의 DNA 결합 도메인(DBD) 및/또는 핵 위치화 신호(NLS)를 포함하거나 이로 이루어질 수 있다.
상기 IMP 각각에 대한 단백질, DNA 및 RNA 서열은 아래에 제공된다. 성공적인 발현을 위하여, 상기 RNA 구조체는 바람직하게는 RNA가 대응하는 단백질로 번역되도록 개시 코돈(start codon)을 포함함이 인식될 것이다. 아래에 제공된 일부 IMP는 자연적으로는 개시 코돈을 갖지 않을 수 있으며, 따라서 이 경우, 상기 RNA 구조체는 번역을 보장하기 위하여 그 5' 말단에 이것이 하나 부가될 필요가 있을 것이다. 유사하게, RNA의 단백질로의 성공적인 번역을 보장하기 위하여, 종결 코돈(stop codon)이 필요로 하며, 다시, 아래에 제공된 일부 IMP의 경우, 상기 RNA 구조체는 그 3' 말단에 종결 코돈이 필요할 것이다.
구현예에서, IRF는 그 DBD 및/또는 NLS 섹션(section)이 결실되어서, 상기 IRF의 우성 음성 형태가 핵으로 진입할 수 없을 수 있다. 상기 적어도 하나의 IMP는 IRF1 우성 음성; IRF3 우성 음성; IRF7 우성 음성; 및 IRF9 우성 음성;으로 이루어진 군으로부터 선택될 수 있는 IRF의 우성 음성 형태일 수 있다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF1 우성 음성 작용성 폴리펩티드(IRF1(141-325)), 즉 DBD 및 NLS이 결실된 IRF1(접근 번호 - NCBI 참조 서열: NM_002198.3; UniProtKB - P10914(IRF1_HUMAN)), 또는 그의 동원체일 수 있다. 상기 IRF1 우성 음성 형태의 폴리펩티드 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 1로 나타낸다:
GDSSPDTFSDGLSSSTLPDDHSSYTVPGYMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVVPDSTSDLYNFQVSPMPSTSEATTDEDEEGKLPEDIMKLLEQSEWQPTNVDGKGYLLNEPGVQPTSVYGDFSCKEEPEIDSPGGDIGLSLQRVFTDLKNMDATWLDSLLTPVRLPSIQAIPCAP [서열번호 1]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 1에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다. 서열번호 1에 나타낸 것과 같이, 위치 299 및 275에서 2개의 (굵게) 강조된 리신(K) 잔기는 아래에 논의된 것과 같이 아르기닌(R)으로 돌연변이되어서 돌연변이체 IRF1 우성 음성 작용성 폴리펩티드를 형성할 수 있다.
한 구현예에서, 상기 IRF1 우성 음성 작용성 폴리펩티드는 다음과 같은 서열번호 2의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GGGGATTCCAGCCCTGATACCTTCTCTGATGGACTCAGCAGCTCCACTCTGCCTGATGACCACAGCAGCTACACAGTTCCAGGCTACATGCAGGACTTGGAGGTGGAGCAGGCCCTGACTCCAGCACTGTCGCCATGTGCTGTCAGCAGCACTCTCCCCGACTGGCACATCCCAGTGGAAGTTGTGCCGGACAGCACCAGTGATCTGTACAACTTCCAGGTGTCACCCATGCCCTCCACCTCTGAAGCTACAACAGATGAGGATGAGGAAGGGAAATTACCTGAGGACATCATGAAGCTCTTGGAGCAGTCGGAGTGGCAGCCAACAAACGTGGATGGGAAGGGGTACCTACTCAATGAACCTGGAGTCCAGCCCACCTCTGTCTATGGAGACTTTAGCTGTAAGGAGGAGCCAGAAATTGACAGCCCAGGGGGGGATATTGGGCTGAGTCTACAGCGTGTCTTCACAGATCTGAAGAACATGGATGCCACCTGGCTGGACAGCCTGCTGACCCCAGTCCGGTTGCCCTCCATCCAGGCCATTCCCTGTGCACCGTAG [서열번호 2]
따라서, 바람직하게는 상기 IRF1 우성 음성 작용성 폴리펩티드는 실질적으로 서열번호 2에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 3의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GGGGAUUCCAGCCCUGAUACCUUCUCUGAUGGACUCAGCAGCUCCACUCUGCCUGAUGACCACAGCAGCUACACAGUUCCAGGCUACAUGCAGGACUUGGAGGUGGAGCAGGCCCUGACUCCAGCACUGUCGCCAUGUGCUGUCAGCAGCACUCUCCCCGACUGGCACAUCCCAGUGGAAGUUGUGCCGGACAGCACCAGUGAUCUGUACAACUUCCAGGUGUCACCCAUGCCCUCCACCUCUGAAGCUACAACAGAUGAGGAUGAGGAAGGGAAAUUACCUGAGGACAUCAUGAAGCUCUUGGAGCAGUCGGAGUGGCAGCCAACAAACGUGGAUGGGAAGGGGUACCUACUCAAUGAACCUGGAGUCCAGCCCACCUCUGUCUAUGGAGACUUUAGCUGUAAGGAGGAGCCAGAAAUUGACAGCCCAGGGGGGGAUAUUGGGCUGAGUCUACAGCGUGUCUUCACAGAUCUGAAGAACAUGGAUGCCACCUGGCUGGACAGCCUGCUGACCCCAGUCCGGUUGCCCUCCAUCCAGGCCAUUCCCUGUGCACCGUAG [서열번호 3]
아울러, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 3에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 1의 변형된 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 4로 제공된다:
ATGGGCGATAGCAGCCCCGATACCTTTTCCGATGGCCTGAGCAGCAGCACCCTGCCTGATGATCACAGCAGCTACACCGTGCCTGGCTACATGCAGGACCTGGAAGTGGAACAGGCCCTGACACCAGCTCTGAGCCCTTGTGCTGTGTCCAGCACACTGCCCGATTGGCACATCCCTGTGGAAGTGGTGCCTGACAGCACCAGCGACCTGTACAACTTCCAAGTGTCCCCTATGCCTAGCACCTCCGAGGCCACCACCGATGAGGATGAAGAGGGAAAGCTGCCCGAGGACATCATGAAGCTGCTGGAACAGAGCGAGTGGCAGCCCACCAATGTGGATGGCAAGGGCTACCTGCTGAACGAGCCTGGCGTTCAGCCTACAAGCGTGTACGGCGACTTCAGCTGCAAAGAGGAACCCGAGATCGATAGCCCTGGCGGCGATATCGGACTGAGCCTGCAGAGAGTGTTCACCGACCTGAAGAACATGGACGCCACCTGGCTGGACAGCCTGCTGACACCTGTTAGACTGCCCTCTATCCAGGCTATCCCCTGCGCTCCTTGA [서열번호 4]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 4에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 4의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 5로 제공된다:
AUGGGCGAUAGCAGCCCCGAUACCUUUUCCGAUGGCCUGAGCAGCAGCACCCUGCCUGAUGAUCACAGCAGCUACACCGUGCCUGGCUACAUGCAGGACCUGGAAGUGGAACAGGCCCUGACACCAGCUCUGAGCCCUUGUGCUGUGUCCAGCACACUGCCCGAUUGGCACAUCCCUGUGGAAGUGGUGCCUGACAGCACCAGCGACCUGUACAACUUCCAAGUGUCCCCUAUGCCUAGCACCUCCGAGGCCACCACCGAUGAGGAUGAAGAGGGAAAGCUGCCCGAGGACAUCAUGAAGCUGCUGGAACAGAGCGAGUGGCAGCCCACCAAUGUGGAUGGCAAGGGCUACCUGCUGAACGAGCCUGGCGUUCAGCCUACAAGCGUGUACGGCGACUUCAGCUGCAAAGAGGAACCCGAGAUCGAUAGCCCUGGCGGCGAUAUCGGACUGAGCCUGCAGAGAGUGUUCACCGACCUGAAGAACAUGGACGCCACCUGGCUGGACAGCCUGCUGACACCUGUUAGACUGCCCUCUAUCCAGGCUAUCCCCUGCGCUCCUUGA [서열번호 5]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 5에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 서열번호 1의 IRF1 우성 음성 작용성 폴리펩티드(접근 번호 - NCBI 참조 서열: NM_002198.3; UniProtKB - P10914(IRF1_HUMAN), 또는 그의 동원체는 299 및/또는 275에서 K에서 R 돌연변이로 돌연변이될 수 있다(상기 강조됨). [Panda D, Gjinaj E, Bachu M, Squire E, Novatt H, Ozato K, Rabin RL. IRF1 Maintains Optimal Constitutive Expression of Antiviral Genes and Regulates the Early Antiviral Response. Front Immunol. 2019 May 15;10:1019. doi: 10.3389/fimmu.2019.01019). 상기 돌연변이된 IRF1 우성 음성 작용성 폴리펩티드의 한 구현예는 본 명세서에서 다음과 같은 서열번호 6으로 나타낸다:
GDSSPDTFSDGLSSSTLPDDHSSYTVPGYMQDLEVEQALTPALSPCAVSSTLPDWHIPVEVVPDSTSDLYNFQVSPMPSTSEATTDEDEEGKLPEDIMKLLEQSEWQPTNVDGKGYLLNEPGVQPTSVYGDFSCREEPEIDSPGGDIGLSLQRVFTDLRNMDATWLDSLLTPVRLPSIQAIPCAP [서열번호 6]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 6에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 돌연변이된 IRF1 우성 음성 작용성 폴리펩티드는 다음과 같은 서열번호 7의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GGGGATTCCAGCCCTGATACCTTCTCTGATGGACTCAGCAGCTCCACTCTGCCTGATGACCACAGCAGCTACACAGTTCCAGGCTACATGCAGGACTTGGAGGTGGAGCAGGCCCTGACTCCAGCACTGTCGCCATGTGCTGTCAGCAGCACTCTCCCCGACTGGCACATCCCAGTGGAAGTTGTGCCGGACAGCACCAGTGATCTGTACAACTTCCAGGTGTCACCCATGCCCTCCACCTCTGAAGCTACAACAGATGAGGATGAGGAAGGGAAATTACCTGAGGACATCATGAAGCTCTTGGAGCAGTCGGAGTGGCAGCCAACAAACGTGGATGGGAAGGGGTACCTACTCAATGAACCTGGAGTCCAGCCCACCTCTGTCTATGGAGACTTTAGCTGTCGGGAGGAGCCAGAAATTGACAGCCCAGGGGGGGATATTGGGCTGAGTCTACAGCGTGTCTTCACAGATCTGCGGAACATGGATGCCACCTGGCTGGACAGCCTGCTGACCCCAGTCCGGTTGCCCTCCATCCAGGCCATTCCCTGTGCACCG [서열번호 7]
따라서, 바람직하게는 상기 돌연변이된 IRF1 우성 음성 작용성 폴리펩티드는 실질적으로 서열번호 7에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다. 상기 아미노산 변화를 유도하는 코돈은 위에 굵게 강조되어 있음이 인식될 것이다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 8의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GGGGAUUCCAGCCCUGAUACCUUCUCUGAUGGACUCAGCAGCUCCACUCUGCCUGAUGACCACAGCAGCUACACAGUUCCAGGCUACAUGCAGGACUUGGAGGUGGAGCAGGCCCUGACUCCAGCACUGUCGCCAUGUGCUGUCAGCAGCACUCUCCCCGACUGGCACAUCCCAGUGGAAGUUGUGCCGGACAGCACCAGUGAUCUGUACAACUUCCAGGUGUCACCCAUGCCCUCCACCUCUGAAGCUACAACAGAUGAGGAUGAGGAAGGGAAAUUACCUGAGGACAUCAUGAAGCUCUUGGAGCAGUCGGAGUGGCAGCCAACAAACGUGGAUGGGAAGGGGUACCUACUCAAUGAACCUGGAGUCCAGCCCACCUCUGUCUAUGGAGACUUUAGCUGUCGGGAGGAGCCAGAAAUUGACAGCCCAGGGGGGGAUAUUGGGCUGAGUCUACAGCGUGUCUUCACAGAUCUGCGGAACAUGGAUGCCACCUGGCUGGACAGCCUGCUGACCCCAGUCCGGUUGCCCUCCAUCCAGGCCAUUCCCUGUGCACCG [서열번호 8]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로에 나타낸 것과 같은 서열번호 8의 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 6의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 9로 제공된다:
ATGGGCGATAGCAGCCCCGATACCTTTTCCGATGGCCTGAGCAGCAGCACCCTGCCTGATGATCACAGCAGCTACACCGTGCCTGGCTACATGCAGGACCTGGAAGTGGAACAGGCCCTGACACCAGCTCTGAGCCCTTGTGCTGTGTCCAGCACACTGCCCGATTGGCACATCCCTGTGGAAGTGGTGCCTGACAGCACCAGCGACCTGTACAACTTCCAAGTGTCCCCTATGCCTAGCACCTCCGAGGCCACCACCGATGAGGATGAAGAGGGAAAGCTGCCCGAGGACATCATGAAGCTGCTGGAACAGAGCGAGTGGCAGCCCACCAATGTGGATGGCAAGGGCTACCTGCTGAACGAGCCTGGCGTTCAGCCTACAAGCGTGTACGGCGACTTCAGCTGCAGAGAGGAACCCGAGATCGATAGCCCTGGCGGCGATATCGGACTGAGTCTGCAGAGGGTGTTCACCGACCTGAGAAACATGGACGCCACCTGGCTGGACAGCCTGCTGACACCTGTTAGACTGCCCTCTATCCAGGCTATCCCCTGCGCTCCTTGA [서열번호 9]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 9에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 9의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 10으로 제공된다:
AUGGGCGAUAGCAGCCCCGAUACCUUUUCCGAUGGCCUGAGCAGCAGCACCCUGCCUGAUGAUCACAGCAGCUACACCGUGCCUGGCUACAUGCAGGACCUGGAAGUGGAACAGGCCCUGACACCAGCUCUGAGCCCUUGUGCUGUGUCCAGCACACUGCCCGAUUGGCACAUCCCUGUGGAAGUGGUGCCUGACAGCACCAGCGACCUGUACAACUUCCAAGUGUCCCCUAUGCCUAGCACCUCCGAGGCCACCACCGAUGAGGAUGAAGAGGGAAAGCUGCCCGAGGACAUCAUGAAGCUGCUGGAACAGAGCGAGUGGCAGCCCACCAAUGUGGAUGGCAAGGGCUACCUGCUGAACGAGCCUGGCGUUCAGCCUACAAGCGUGUACGGCGACUUCAGCUGCAGAGAGGAACCCGAGAUCGAUAGCCCUGGCGGCGAUAUCGGACUGAGUCUGCAGAGGGUGUUCACCGACCUGAGAAACAUGGACGCCACCUGGCUGGACAGCCUGCUGACACCUGUUAGACUGCCCUCUAUCCAGGCUAUCCCCUGCGCUCCUUGA [서열번호 10]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 10에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IFN 유도 캐스케이드에 핵심인 IRF3 우성 음성 작용 형태, 즉 DBD 결실된 IRF3의 우성 음성 작용 버전(version), IRF3(191-427)(NCBI 참조 서열: NM_001571.6; UniProtKB - Q14653(IRF3_HUMAN)), 또는 그의 동원체일 수 있다. [Ysebrant de Lendonck L, Martinet V, Goriely S. Interferon regulatory factor 3 in adaptive immune responses. Cell Mol Life Sci. 2014 Oct;71(20):3873-83. doi: 10.1007/s00018-014-1653-9]. 상기 IRF3 우성 음성 작용 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 11로 나타낸다:
PLKRLLVPGEEWEFEVTAFYRGRQVFQQTISCPEGLRLVGSEVGDRTLPGWPVTLPDPGMSLTDRGVMSYVRHVLSCLGGGLALWRAGQWLWAQRLGHCHTYWAVSEELLPNSGHGPDGEVPKDKEGGVFDLGPFIVDLITFTEGSGRSPRYALWFCVGESWPQDQPWTKRLVMVKVVPTCLRALVEMARVGGASSLENTVDLHISNSHPLSLTSDQYKAYLQDLVEGMDFQGPGES [서열번호 11]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 11에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 IRF3 우성 음성 작용 형태 폴리펩티드는 다음과 같은 서열번호 12의 DNA 뉴클레오티드 서열에 의해 암호화된다:
CCACTGAAGCGGCTGTTGGTGCCGGGGGAAGAGTGGGAGTTCGAGGTGACAGCCTTCTACCGGGGCCGCCAAGTCTTCCAGCAGACCATCTCCTGCCCGGAGGGCCTGCGGCTGGTGGGGTCCGAAGTGGGAGACAGGACGCTGCCTGGATGGCCAGTCACACTGCCAGACCCTGGCATGTCCCTGACAGACAGGGGAGTGATGAGCTACGTGAGGCATGTGCTGAGCTGCCTGGGTGGGGGACTGGCTCTCTGGCGGGCCGGGCAGTGGCTCTGGGCCCAGCGGCTGGGGCACTGCCACACATACTGGGCAGTGAGCGAGGAGCTGCTCCCCAACAGCGGGCATGGGCCTGATGGCGAGGTCCCCAAGGACAAGGAAGGAGGCGTGTTTGACCTGGGGCCCTTCATTGTAGATCTGATTACCTTCACGGAAGGAAGCGGACGCTCACCACGCTATGCCCTCTGGTTCTGTGTGGGGGAGTCATGGCCCCAGGACCAGCCGTGGACCAAGAGGCTCGTGATGGTCAAGGTTGTGCCCACGTGCCTCAGGGCCTTGGTAGAAATGGCCCGGGTAGGGGGTGCCTCCTCCCTGGAGAATACTGTGGACCTGCACATTTCCAACAGCCACCCACTCTCCCTCACCTCCGACCAGTACAAGGCCTACCTGCAGGACTTGGTGGAGGGCATGGATTTCCAGGGCCCTGGGGAGAGC [서열번호 12]
따라서, 바람직하게는 상기 IRF3 우성 음성 작용 형태 폴리펩티드는 실질적으로 서열번호 12에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 13의 RNA 뉴클레오티드 서열을 포함할 수 있다:
CCACUGAAGCGGCUGUUGGUGCCGGGGGAAGAGUGGGAGUUCGAGGUGACAGCCUUCUACCGGGGCCGCCAAGUCUUCCAGCAGACCAUCUCCUGCCCGGAGGGCCUGCGGCUGGUGGGGUCCGAAGUGGGAGACAGGACGCUGCCUGGAUGGCCAGUCACACUGCCAGACCCUGGCAUGUCCCUGACAGACAGGGGAGUGAUGAGCUACGUGAGGCAUGUGCUGAGCUGCCUGGGUGGGGGACUGGCUCUCUGGCGGGCCGGGCAGUGGCUCUGGGCCCAGCGGCUGGGGCACUGCCACACAUACUGGGCAGUGAGCGAGGAGCUGCUCCCCAACAGCGGGCAUGGGCCUGAUGGCGAGGUCCCCAAGGACAAGGAAGGAGGCGUGUUUGACCUGGGGCCCUUCAUUGUAGAUCUGAUUACCUUCACGGAAGGAAGCGGACGCUCACCACGCUAUGCCCUCUGGUUCUGUGUGGGGGAGUCAUGGCCCCAGGACCAGCCGUGGACCAAGAGGCUCGUGAUGGUCAAGGUUGUGCCCACGUGCCUCAGGGCCUUGGUAGAAAUGGCCCGGGUAGGGGGUGCCUCCUCCCUGGAGAAUACUGUGGACCUGCACAUUUCCAACAGCCACCCACUCUCCCUCACCUCCGACCAGUACAAGGCCUACCUGCAGGACUUGGUGGAGGGCAUGGAUUUCCAGGGCCCUGGGGAGAGC [서열번호 13]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 13에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 11의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 번호 14로 제공된다:
ATGCCCCTGAAGAGACTGCTGGTGCCTGGCGAGGAATGGGAGTTTGAAGTGACCGCCTTCTACCGGGGCAGACAGGTGTTCCAGCAGACCATCTCTTGCCCCGAGGGACTGAGACTCGTGGGCTCTGAAGTGGGCGATAGAACACTGCCTGGCTGGCCTGTGACACTGCCAGATCCTGGAATGAGCCTGACCGACAGAGGCGTGATGAGCTATGTGCGGCACGTGCTGTCTTGTCTCGGCGGAGGACTTGCCCTTTGGAGAGCTGGACAATGGCTGTGGGCTCAGAGACTGGGCCACTGTCACACATACTGGGCCGTGTCTGAGGAACTGCTGCCCAATTCTGGCCACGGACCTGATGGCGAGGTGCCCAAAGACAAAGAAGGCGGCGTTTTCGATCTGGGCCCCTTCATCGTGGACCTGATCACCTTTACCGAAGGCAGCGGCAGAAGCCCCAGATACGCCCTGTGGTTTTGTGTGGGCGAGAGCTGGCCTCAGGATCAGCCTTGGACCAAGAGACTGGTCATGGTCAAGGTGGTGCCTACCTGCCTGAGAGCCCTGGTGGAAATGGCTAGAGTTGGCGGAGCCAGCAGCCTGGAAAACACCGTGGATCTGCACATCAGCAACTCTCACCCTCTGTCTCTGACCAGCGACCAGTACAAGGCCTATCTGCAGGACCTGGTCGAAGGCATGGACTTTCAAGGCCCTGGCGAGTCCTGA [서열번호 14]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 14에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 14의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 15로 제공된다:
AUGCCCCUGAAGAGACUGCUGGUGCCUGGCGAGGAAUGGGAGUUUGAAGUGACCGCCUUCUACCGGGGCAGACAGGUGUUCCAGCAGACCAUCUCUUGCCCCGAGGGACUGAGACUCGUGGGCUCUGAAGUGGGCGAUAGAACACUGCCUGGCUGGCCUGUGACACUGCCAGAUCCUGGAAUGAGCCUGACCGACAGAGGCGUGAUGAGCUAUGUGCGGCACGUGCUGUCUUGUCUCGGCGGAGGACUUGCCCUUUGGAGAGCUGGACAAUGGCUGUGGGCUCAGAGACUGGGCCACUGUCACACAUACUGGGCCGUGUCUGAGGAACUGCUGCCCAAUUCUGGCCACGGACCUGAUGGCGAGGUGCCCAAAGACAAAGAAGGCGGCGUUUUCGAUCUGGGCCCCUUCAUCGUGGACCUGAUCACCUUUACCGAAGGCAGCGGCAGAAGCCCCAGAUACGCCCUGUGGUUUUGUGUGGGCGAGAGCUGGCCUCAGGAUCAGCCUUGGACCAAGAGACUGGUCAUGGUCAAGGUGGUGCCUACCUGCCUGAGAGCCCUGGUGGAAAUGGCUAGAGUUGGCGGAGCCAGCAGCCUGGAAAACACCGUGGAUCUGCACAUCAGCAACUCUCACCCUCUGUCUCUGACCAGCGACCAGUACAAGGCCUAUCUGCAGGACCUGGUCGAAGGCAUGGACUUUCAAGGCCCUGGCGAGUCCUGA [서열번호 15]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 15에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IFN 유도 캐스케이드에 핵심이고 IFN 알파 및 베타 유도에 영향을 미치는 IRF7 우성 음성 작용 형태(NCBI 참조 서열: NM_001572.5; UniProtKB - Q92985(IRF7_HUMAN)), 또는 그의 동원체일 수 있다. [Au WC, Yeow WS, Pitha PM. Analysis of functional domains of interferon regulatory factor 7 and its association with IRF-3. Virology. 2001;280(2):273-282. doi:10.1006/viro.2000.0782). 상기 IRF7 우성 음성 작용 형태의 한 구현예는 IRF-7(238-503)로 나타내고, 본 명세서에서 다음과 같은 서열번호 16으로 나타낸다:
WAVETTPSPGPQPAALTTGEAAAPESPHQAEPYLSPSPSACTAVQEPSPGALDVTIMYKGRTVLQKVVGHPSCTFLYGPPDPAVRATDPQQVAFPSPAELPDQKQLRYTEELLRHVAPGLHLELRGPQLWARRMGKCKVYWEVGGPPGSASPSTPACLLPRNCDTPIFDFRVFFQELVEFRARQRRGSPRYTIYLGFGQDLSAGRPKEKSLVLVKLEPWLCRVHLEGTQREGVSSLDSSSLSLCLSSANSLYDDIECFLMELEQPA [서열번호 16]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 16에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 IRF7 우성 음성 작용 형태 폴리펩티드는 다음과 같은 서열번호 17의 DNA 뉴클레오티드 서열에 의해 암호화된다:
TGGGCAGTAGAGACGACCCCCAGCCCCGGGCCCCAGCCCGCGGCACTAACGACAGGCGAGGCCGCGGCCCCAGAGTCCCCGCACCAGGCAGAGCCGTACCTGTCACCCTCCCCAAGCGCCTGCACCGCGGTGCAAGAGCCCAGCCCAGGGGCGCTGGACGTGACCATCATGTACAAGGGCCGCACGGTGCTGCAGAAGGTGGTGGGACACCCGAGCTGCACGTTCCTATACGGCCCCCCAGACCCAGCTGTCCGGGCCACAGACCCCCAGCAGGTAGCATTCCCCAGCCCTGCCGAGCTCCCGGACCAGAAGCAGCTGCGCTACACGGAGGAACTGCTGCGGCACGTGGCCCCTGGGTTGCACCTGGAGCTTCGGGGGCCACAGCTGTGGGCCCGGCGCATGGGCAAGTGCAAGGTGTACTGGGAGGTGGGCGGACCCCCAGGCTCCGCCAGCCCCTCCACCCCAGCCTGCCTGCTGCCTCGGAACTGTGACACCCCCATCTTCGACTTCAGAGTCTTCTTCCAAGAGCTGGTGGAATTCCGGGCACGGCAGCGCCGTGGCTCCCCACGCTATACCATCTACCTGGGCTTCGGGCAGGACCTGTCAGCTGGGAGGCCCAAGGAGAAGAGCCTGGTCCTGGTGAAGCTGGAACCCTGGCTGTGCCGAGTGCACCTAGAGGGCACGCAGCGTGAGGGTGTGTCTTCCCTGGATAGCAGCAGCCTCAGCCTCTGCCTGTCCAGCGCCAACAGCCTCTATGACGACATCGAGTGCTTCCTTATGGAGCTGGAGCAGCCCGCC [서열번호 17]
따라서, 바람직하게는 상기 IRF7 우성 음성 작용 형태 폴리펩티드는 실질적으로 서열번호 17에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
아울러, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 18에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
UGGGCAGUAGAGACGACCCCCAGCCCCGGGCCCCAGCCCGCGGCACUAACGACAGGCGAGGCCGCGGCCCCAGAGUCCCCGCACCAGGCAGAGCCGUACCUGUCACCCUCCCCAAGCGCCUGCACCGCGGUGCAAGAGCCCAGCCCAGGGGCGCUGGACGUGACCAUCAUGUACAAGGGCCGCACGGUGCUGCAGAAGGUGGUGGGACACCCGAGCUGCACGUUCCUAUACGGCCCCCCAGACCCAGCUGUCCGGGCCACAGACCCCCAGCAGGUAGCAUUCCCCAGCCCUGCCGAGCUCCCGGACCAGAAGCAGCUGCGCUACACGGAGGAACUGCUGCGGCACGUGGCCCCUGGGUUGCACCUGGAGCUUCGGGGGCCACAGCUGUGGGCCCGGCGCAUGGGCAAGUGCAAGGUGUACUGGGAGGUGGGCGGACCCCCAGGCUCCGCCAGCCCCUCCACCCCAGCCUGCCUGCUGCCUCGGAACUGUGACACCCCCAUCUUCGACUUCAGAGUCUUCUUCCAAGAGCUGGUGGAAUUCCGGGCACGGCAGCGCCGUGGCUCCCCACGCUAUACCAUCUACCUGGGCUUCGGGCAGGACCUGUCAGCUGGGAGGCCCAAGGAGAAGAGCCUGGUCCUGGUGAAGCUGGAACCCUGGCUGUGCCGAGUGCACCUAGAGGGCACGCAGCGUGAGGGUGUGUCUUCCCUGGAUAGCAGCAGCCUCAGCCUCUGCCUGUCCAGCGCCAACAGCCUCUAUGACGACAUCGAGUGCUUCCUUAUGGAGCUGGAGCAGCCCGCC [서열번호 18]
따라서, 본 발명자들은 서열번호 16의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 19로 제공된다:
ATGTGGGCCGTCGAGACAACACCTTCTCCAGGACCTCAACCTGCCGCTCTGACAACAGGCGAAGCTGCTGCTCCTGAGTCTCCACATCAGGCCGAGCCTTACCTGTCTCCATCTCCTAGCGCCTGTACCGCCGTGCAAGAACCTTCTCCTGGTGCTCTGGACGTGACCATCATGTACAAGGGCAGAACCGTGCTGCAGAAAGTCGTGGGACACCCCAGCTGCACCTTTCTGTATGGCCCTCCAGATCCTGCCGTGCGGGCTACAGATCCTCAGCAGGTTGCATTCCCATCTCCAGCCGAGCTGCCCGATCAGAAGCAGCTGAGATACACCGAGGAACTGCTGAGACACGTGGCCCCTGGACTGCACCTGGAACTGAGAGGACCACAACTGTGGGCCAGACGGATGGGCAAGTGCAAGGTGTACTGGGAAGTTGGCGGCCCTCCTGGATCTGCCTCTCCATCTACACCAGCCTGCCTGCTGCCTCGGAATTGCGACACCCCTATCTTCGACTTCCGGGTGTTCTTCCAAGAGCTGGTGGAATTCCGGGCCAGACAGAGAAGAGGCAGCCCCAGATACACCATCTACCTCGGCTTTGGCCAGGACCTGTCTGCCGGACGGCCTAAAGAAAAGTCCCTGGTGCTGGTCAAGCTGGAACCCTGGCTGTGTAGAGTGCATCTGGAAGGCACCCAGAGAGAGGGCGTCAGCAGCCTGGATAGCAGCTCTCTGAGCCTGTGTCTGAGCAGCGCCAACAGCCTGTACGACGATATCGAGTGCTTCCTGATGGAACTGGAACAGCCCGCCTGA [서열번호 19]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 19에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 19의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 20으로 제공된다:
AUGUGGGCCGUCGAGACAACACCUUCUCCAGGACCUCAACCUGCCGCUCUGACAACAGGCGAAGCUGCUGCUCCUGAGUCUCCACAUCAGGCCGAGCCUUACCUGUCUCCAUCUCCUAGCGCCUGUACCGCCGUGCAAGAACCUUCUCCUGGUGCUCUGGACGUGACCAUCAUGUACAAGGGCAGAACCGUGCUGCAGAAAGUCGUGGGACACCCCAGCUGCACCUUUCUGUAUGGCCCUCCAGAUCCUGCCGUGCGGGCUACAGAUCCUCAGCAGGUUGCAUUCCCAUCUCCAGCCGAGCUGCCCGAUCAGAAGCAGCUGAGAUACACCGAGGAACUGCUGAGACACGUGGCCCCUGGACUGCACCUGGAACUGAGAGGACCACAACUGUGGGCCAGACGGAUGGGCAAGUGCAAGGUGUACUGGGAAGUUGGCGGCCCUCCUGGAUCUGCCUCUCCAUCUACACCAGCCUGCCUGCUGCCUCGGAAUUGCGACACCCCUAUCUUCGACUUCCGGGUGUUCUUCCAAGAGCUGGUGGAAUUCCGGGCCAGACAGAGAAGAGGCAGCCCCAGAUACACCAUCUACCUCGGCUUUGGCCAGGACCUGUCUGCCGGACGGCCUAAAGAAAAGUCCCUGGUGCUGGUCAAGCUGGAACCCUGGCUGUGUAGAGUGCAUCUGGAAGGCACCCAGAGAGAGGGCGUCAGCAGCCUGGAUAGCAGCUCUCUGAGCCUGUGUCUGAGCAGCGCCAACAGCCUGUACGACGAUAUCGAGUGCUUCCUGAUGGAACUGGAACAGCCCGCCUGA [서열번호 20]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 20에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF9 우성 음성 작용 형태인 IRF9(142-393)(NCBI 참조 서열: NM_006084.5; UniProtKB - Q00978(IRF9_HUMAN)), 또는 그의 동원체일 수 있다. [Paul A, Tang TH, Ng SK. Interferon Regulatory Factor 9 Structure and Regulation. Front Immunol. 2018 Aug 10;9:1831. doi: 10.3389/fimmu.2018.01831. PMID: 30147694; PMCID: PMC6095977). 상기 IRF9 우성 음성 작용 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 21로 나타낸다:
RKEEEDAMQNCTLSPSVLQDSLNNEEEGASGGAVHSDIGSSSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQQAAILSLV [서열번호 21]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 21에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 22에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
AAGGAGGAAGAGGATGCCATGCAGAACTGCACACTCAGTCCCTCTGTGCTCCAGGACTCCCTCAATAATGAGGAGGAGGGGGCCAGTGGGGGAGCAGTCCATTCAGACATTGGGAGCAGCAGCAGCAGCAGCAGCCCTGAGCCACAGGAAGTTACAGACACAACTGAGGCCCCCTTTCAAGGGGATCAGAGGTCCCTGGAGTTTCTGCTTCCTCCAGAGCCAGACTACTCACTGCTGCTCACCTTCATCTACAACGGGCGCGTGGTGGGCGAGGCCCAGGTGCAAAGCCTGGATTGCCGCCTTGTGGCTGAGCCCTCAGGCTCTGAGAGCAGCATGGAGCAGGTGCTGTTCCCCAAGCCTGGCCCACTGGAGCCCACGCAGCGCCTGCTGAGCCAGCTTGAGAGGGGCATCCTAGTGGCCAGCAACCCCCGAGGCCTCTTCGTGCAGCGCCTTTGCCCCATCCCCATCTCCTGGAATGCACCCCAGGCTCCACCTGGGCCAGGCCCGCATCTGCTGCCCAGCAACGAGTGCGTGGAGCTCTTCAGAACCGCCTACTTCTGCAGAGACTTGGTCAGGTACTTTCAGGGCCTGGGCCCCCCACCGAAGTTCCAGGTAACACTGAATTTCTGGGAAGAGAGCCATGGCTCCAGCCATACTCCACAGAATCTTATCACAGTGAAGATGGAGCAGGCCTTTGCCCGATACTTGCTGGAGCAGACTCCAGAGCAGCAGGCAGCCATTCTGTCCCTGGTG [서열번호 22]
따라서, 바람직하게는 상기 IRF9 우성 음성 작용 형태 폴리펩티드는 실질적으로 서열번호 22에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 23의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AAGGAGGAAGAGGAUGCCAUGCAGAACUGCACACUCAGUCCCUCUGUGCUCCAGGACUCCCUCAAUAAUGAGGAGGAGGGGGCCAGUGGGGGAGCAGUCCAUUCAGACAUUGGGAGCAGCAGCAGCAGCAGCAGCCCUGAGCCACAGGAAGUUACAGACACAACUGAGGCCCCCUUUCAAGGGGAUCAGAGGUCCCUGGAGUUUCUGCUUCCUCCAGAGCCAGACUACUCACUGCUGCUCACCUUCAUCUACAACGGGCGCGUGGUGGGCGAGGCCCAGGUGCAAAGCCUGGAUUGCCGCCUUGUGGCUGAGCCCUCAGGCUCUGAGAGCAGCAUGGAGCAGGUGCUGUUCCCCAAGCCUGGCCCACUGGAGCCCACGCAGCGCCUGCUGAGCCAGCUUGAGAGGGGCAUCCUAGUGGCCAGCAACCCCCGAGGCCUCUUCGUGCAGCGCCUUUGCCCCAUCCCCAUCUCCUGGAAUGCACCCCAGGCUCCACCUGGGCCAGGCCCGCAUCUGCUGCCCAGCAACGAGUGCGUGGAGCUCUUCAGAACCGCCUACUUCUGCAGAGACUUGGUCAGGUACUUUCAGGGCCUGGGCCCCCCACCGAAGUUCCAGGUAACACUGAAUUUCUGGGAAGAGAGCCAUGGCUCCAGCCAUACUCCACAGAAUCUUAUCACAGUGAAGAUGGAGCAGGCCUUUGCCCGAUACUUGCUGGAGCAGACUCCAGAGCAGCAGGCAGCCAUUCUGUCCCUGGUG [서열번호 23]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 23에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 21의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 24로 제공된다:
ATGAAGGAAGAAGAGGACGCCATGCAGAACTGCACACTGAGCCCAAGCGTGCTGCAGGACAGCCTGAACAATGAGGAAGAAGGCGCCTCTGGCGGAGCCGTGCACTCTGATATTGGCAGCAGCAGCTCTAGCAGCAGCCCCGAGCCTCAAGAAGTGACCGATACAACAGAGGCCCCATTCCAGGGCGACCAGCGGAGTCTGGAATTTCTGCTGCCTCCTGAGCCTGACTACAGCCTGCTGCTGACCTTCATCTACAACGGCAGAGTCGTGGGCGAAGCCCAGGTGCAGTCTCTGGATTGCAGACTGGTGGCCGAGCCTAGCGGAAGCGAGTCTAGTATGGAACAGGTGCTGTTCCCCAAGCCTGGACCTCTGGAACCCACACAGAGGCTGCTGTCTCAACTGGAAAGGGGCATCCTGGTGGCCAGCAATCCTAGAGGCCTGTTCGTGCAGAGACTGTGCCCTATTCCTATCAGCTGGAACGCCCCTCAGGCTCCTCCTGGACCTGGACCACATCTGCTGCCCAGCAATGAGTGCGTGGAACTGTTCCGGACCGCCTACTTCTGCAGAGATCTCGTGCGGTACTTCCAAGGCCTGGGACCTCCTCCAAAGTTCCAAGTGACCCTGAACTTCTGGGAAGAGAGCCACGGCAGCAGCCACACACCTCAGAATCTGATCACCGTGAAGATGGAACAAGCCTTCGCCAGATACCTGCTGGAACAGACCCCTGAACAGCAGGCCGCCATCCTGTCTCTGGTGTGA [서열번호 24]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 24에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 24의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 25로 제공된다:
AUGAAGGAAGAAGAGGACGCCAUGCAGAACUGCACACUGAGCCCAAGCGUGCUGCAGGACAGCCUGAACAAUGAGGAAGAAGGCGCCUCUGGCGGAGCCGUGCACUCUGAUAUUGGCAGCAGCAGCUCUAGCAGCAGCCCCGAGCCUCAAGAAGUGACCGAUACAACAGAGGCCCCAUUCCAGGGCGACCAGCGGAGUCUGGAAUUUCUGCUGCCUCCUGAGCCUGACUACAGCCUGCUGCUGACCUUCAUCUACAACGGCAGAGUCGUGGGCGAAGCCCAGGUGCAGUCUCUGGAUUGCAGACUGGUGGCCGAGCCUAGCGGAAGCGAGUCUAGUAUGGAACAGGUGCUGUUCCCCAAGCCUGGACCUCUGGAACCCACACAGAGGCUGCUGUCUCAACUGGAAAGGGGCAUCCUGGUGGCCAGCAAUCCUAGAGGCCUGUUCGUGCAGAGACUGUGCCCUAUUCCUAUCAGCUGGAACGCCCCUCAGGCUCCUCCUGGACCUGGACCACAUCUGCUGCCCAGCAAUGAGUGCGUGGAACUGUUCCGGACCGCCUACUUCUGCAGAGAUCUCGUGCGGUACUUCCAAGGCCUGGGACCUCCUCCAAAGUUCCAAGUGACCCUGAACUUCUGGGAAGAGAGCCACGGCAGCAGCCACACACCUCAGAAUCUGAUCACCGUGAAGAUGGAACAAGCCUUCGCCAGAUACCUGCUGGAACAGACCCCUGAACAGCAGGCCGCCAUCCUGUCUCUGGUGUGA [서열번호 25]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 25에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 서열번호 21의 IRF9 우성 음성 작용 형태는 이것을 서열번호 26의 아미노산 잔기 182-385 또는 그의 단편 또는 변이체(NCBI 참조 서열: NM_006084.5; UniProtKB - Q00978(IRF9_HUMAN)), 또는 그의 동원체로 줄임으로써 돌연변이될 수 있다.
SSSSSSPEPQEVTDTTEAPFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPGPGPHLLPSNECVELFRTAYFCRDLVRYFQGLGPPPKFQVTLNFWEESHGSSHTPQNLITVKMEQAFARYLLEQTPEQ [서열번호 26]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 26에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 DNA 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 돌연변이된 IRF9 우성 음성 작용 형태 폴리펩티드(IRF9(182-235))는 다음과 같은 서열번호 27의 DNA 뉴클레오티드 서열에 의해 암호화된다:
AGCAGCAGCAGCAGCAGCCCTGAGCCACAGGAAGTTACAGACACAACTGAGGCCCCCTTTCAAGGGGATCAGAGGTCCCTGGAGTTTCTGCTTCCTCCAGAGCCAGACTACTCACTGCTGCTCACCTTCATCTACAACGGGCGCGTGGTGGGCGAGGCCCAGGTGCAAAGCCTGGATTGCCGCCTTGTGGCTGAGCCCTCAGGCTCTGAGAGCAGCATGGAGCAGGTGCTGTTCCCCAAGCCTGGCCCACTGGAGCCCACGCAGCGCCTGCTGAGCCAGCTTGAGAGGGGCATCCTAGTGGCCAGCAACCCCCGAGGCCTCTTCGTGCAGCGCCTTTGCCCCATCCCCATCTCCTGGAATGCACCCCAGGCTCCACCTGGGCCAGGCCCGCATCTGCTGCCCAGCAACGAGTGCGTGGAGCTCTTCAGAACCGCCTACTTCTGCAGAGACTTGGTCAGGTACTTTCAGGGCCTGGGCCCCCCACCGAAGTTCCAGGTAACACTGAATTTCTGGGAAGAGAGCCATGGCTCCAGCCATACTCCACAGAATCTTATCACAGTGAAGATGGAGCAGGCCTTTGCCCGATACTTGCTGGAGCAGACTCCAGAGCAG [서열번호 27]
따라서, 바람직하게는 상기 돌연변이된 IRF9 우성 음성 작용 형태는 실질적으로 서열번호 27에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 28의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AGCAGCAGCAGCAGCAGCCCUGAGCCACAGGAAGUUACAGACACAACUGAGGCCCCCUUUCAAGGGGAUCAGAGGUCCCUGGAGUUUCUGCUUCCUCCAGAGCCAGACUACUCACUGCUGCUCACCUUCAUCUACAACGGGCGCGUGGUGGGCGAGGCCCAGGUGCAAAGCCUGGAUUGCCGCCUUGUGGCUGAGCCCUCAGGCUCUGAGAGCAGCAUGGAGCAGGUGCUGUUCCCCAAGCCUGGCCCACUGGAGCCCACGCAGCGCCUGCUGAGCCAGCUUGAGAGGGGCAUCCUAGUGGCCAGCAACCCCCGAGGCCUCUUCGUGCAGCGCCUUUGCCCCAUCCCCAUCUCCUGGAAUGCACCCCAGGCUCCACCUGGGCCAGGCCCGCAUCUGCUGCCCAGCAACGAGUGCGUGGAGCUCUUCAGAACCGCCUACUUCUGCAGAGACUUGGUCAGGUACUUUCAGGGCCUGGGCCCCCCACCGAAGUUCCAGGUAACACUGAAUUUCUGGGAAGAGAGCCAUGGCUCCAGCCAUACUCCACAGAAUCUUAUCACAGUGAAGAUGGAGCAGGCCUUUGCCCGAUACUUGCUGGAGCAGACUCCAGAGCAG [서열번호 28]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 28에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 26의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 29로 제공된다
ATGAGCAGCTCTAGCAGCAGCCCCGAGCCTCAAGAAGTGACCGATACAACAGAGGCCCCATTCCAGGGCGACCAGCGGAGTCTGGAATTTCTGCTGCCTCCTGAGCCTGACTACAGCCTGCTGCTGACCTTCATCTACAACGGCAGAGTCGTGGGCGAAGCCCAGGTGCAGTCTCTGGATTGCAGACTGGTGGCCGAGCCTAGCGGAAGCGAGTCTAGTATGGAACAGGTGCTGTTCCCCAAGCCTGGACCTCTGGAACCCACACAGAGGCTGCTGTCTCAACTGGAAAGGGGCATCCTGGTGGCCAGCAATCCTAGAGGCCTGTTCGTGCAGAGACTGTGCCCTATTCCTATCAGCTGGAACGCCCCTCAGGCTCCTCCTGGACCTGGACCACATCTGCTGCCCAGCAATGAGTGCGTGGAACTGTTCCGGACCGCCTACTTCTGCAGAGATCTCGTGCGGTACTTCCAAGGCCTGGGACCTCCTCCAAAGTTCCAAGTGACCCTGAACTTCTGGGAAGAGAGCCACGGCAGCAGCCACACACCTCAGAATCTGATCACCGTGAAGATGGAACAAGCCTTCGCCAGATACCTGCTGGAACAGACCCCTGAACAGTGA [서열번호 29]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 29에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 29의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 30으로 제공된다:
AUGAGCAGCUCUAGCAGCAGCCCCGAGCCUCAAGAAGUGACCGAUACAACAGAGGCCCCAUUCCAGGGCGACCAGCGGAGUCUGGAAUUUCUGCUGCCUCCUGAGCCUGACUACAGCCUGCUGCUGACCUUCAUCUACAACGGCAGAGUCGUGGGCGAAGCCCAGGUGCAGUCUCUGGAUUGCAGACUGGUGGCCGAGCCUAGCGGAAGCGAGUCUAGUAUGGAACAGGUGCUGUUCCCCAAGCCUGGACCUCUGGAACCCACACAGAGGCUGCUGUCUCAACUGGAAAGGGGCAUCCUGGUGGCCAGCAAUCCUAGAGGCCUGUUCGUGCAGAGACUGUGCCCUAUUCCUAUCAGCUGGAACGCCCCUCAGGCUCCUCCUGGACCUGGACCACAUCUGCUGCCCAGCAAUGAGUGCGUGGAACUGUUCCGGACCGCCUACUUCUGCAGAGAUCUCGUGCGGUACUUCCAAGGCCUGGGACCUCCUCCAAAGUUCCAAGUGACCCUGAACUUCUGGGAAGAGAGCCACGGCAGCAGCCACACACCUCAGAAUCUGAUCACCGUGAAGAUGGAACAAGCCUUCGCCAGAUACCUGCUGGAACAGACCCCUGAACAGUGA [서열번호 30]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 30에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
또 다른 구현예에서, 서열번호 21의 IRF9 우성 음성 작용 형태는 이것을 서열번호 31의 아미노산 잔기 200-308 또는 그의 단편 또는 변이체(NCBI 참조 서열: NM_006084.5; UniProtKB - Q00978(IRF9_HUMAN)), 또는 그의 동원체로 줄임으로써 돌연변이될 수 있다.
PFQGDQRSLEFLLPPEPDYSLLLTFIYNGRVVGEAQVQSLDCRLVAEPSGSESSMEQVLFPKPGPLEPTQRLLSQLERGILVASNPRGLFVQRLCPIPISWNAPQAPPG [서열번호 31]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 31에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 DNA 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 돌연변이된 IRF9 우성 음성 작용 형태(IRF9(200-308))는 다음과 같은 서열번호 32의 DNA 뉴클레오티드 서열에 의해 암호화된다:
CCCTTTCAAGGGGATCAGAGGTCCCTGGAGTTTCTGCTTCCTCCAGAGCCAGACTACTCACTGCTGCTCACCTTCATCTACAACGGGCGCGTGGTGGGCGAGGCCCAGGTGCAAAGCCTGGATTGCCGCCTTGTGGCTGAGCCCTCAGGCTCTGAGAGCAGCATGGAGCAGGTGCTGTTCCCCAAGCCTGGCCCACTGGAGCCCACGCAGCGCCTGCTGAGCCAGCTTGAGAGGGGCATCCTAGTGGCCAGCAACCCCCGAGGCCTCTTCGTGCAGCGCCTTTGCCCCATCCCCATCTCCTGGAATGCACCCCAGGCTCCACCTGGG [서열번호 32]
따라서, 바람직하게는 상기 돌연변이된 IRF9 우성 음성 작용 형태 폴리펩티드는 실질적으로 서열번호 32에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 33의 RNA 뉴클레오티드 서열을 포함할 수 있다:
CCCUUUCAAGGGGAUCAGAGGUCCCUGGAGUUUCUGCUUCCUCCAGAGCCAGACUACUCACUGCUGCUCACCUUCAUCUACAACGGGCGCGUGGUGGGCGAGGCCCAGGUGCAAAGCCUGGAUUGCCGCCUUGUGGCUGAGCCCUCAGGCUCUGAGAGCAGCAUGGAGCAGGUGCUGUUCCCCAAGCCUGGCCCACUGGAGCCCACGCAGCGCCUGCUGAGCCAGCUUGAGAGGGGCAUCCUAGUGGCCAGCAACCCCCGAGGCCUCUUCGUGCAGCGCCUUUGCCCCAUCCCCAUCUCCUGGAAUGCACCCCAGGCUCCACCUGGG [서열번호 33]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 33에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 31의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 34로 제공된다:
ATGCCATTCCAGGGCGACCAGCGGAGTCTGGAATTTCTGCTGCCTCCTGAGCCTGACTACAGCCTGCTGCTGACCTTCATCTACAACGGCAGAGTCGTGGGCGAAGCCCAGGTGCAGTCTCTGGATTGCAGACTGGTGGCCGAGCCTAGCGGAAGCGAGTCTAGTATGGAACAGGTGCTGTTCCCCAAGCCTGGACCTCTGGAACCCACACAGAGGCTGCTGTCTCAACTGGAAAGGGGCATCCTGGTGGCCAGCAATCCTAGAGGCCTGTTCGTGCAGAGACTGTGCCCTATTCCTATCAGCTGGAACGCCCCTCAGGCTCCTCCTGGATGA [서열번호 34]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 34에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 34의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 35로 제공된다:
AUGCCAUUCCAGGGCGACCAGCGGAGUCUGGAAUUUCUGCUGCCUCCUGAGCCUGACUACAGCCUGCUGCUGACCUUCAUCUACAACGGCAGAGUCGUGGGCGAAGCCCAGGUGCAGUCUCUGGAUUGCAGACUGGUGGCCGAGCCUAGCGGAAGCGAGUCUAGUAUGGAACAGGUGCUGUUCCCCAAGCCUGGACCUCUGGAACCCACACAGAGGCUGCUGUCUCAACUGGAAAGGGGCAUCCUGGUGGCCAGCAAUCCUAGAGGCCUGUUCGUGCAGAGACUGUGCCCUAUUCCUAUCAGCUGGAACGCCCCUCAGGCUCCUCCUGGAUGA [서열번호 35]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 35에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
따라서, 상기 적어도 하나의 IMP는 IRF1; IRF4; IRF5; IRF8; 및 IRF9;로 이루어진 군으로부터 선택되는 IRF의 DBD일 수 있다. 다음은 전체 IRF의 결합을 방지하고 신호전달을 방지하며, 이로 인해 선천적 감지 시스템을 조정하게 되는 DNA 결합 도메인(DBD)의 예이다. 또한, 상기 적어도 하나의 IMP는 IRF의 스플라이스 변이체(splice variant)일 수 있다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF1의 DBD, 즉 DBD - DNA 결합 도메인(DBD)에 기반한 IRF1의 우성 음성 형태, IRF1(1-164)(NCBI 참조 서열: NM_002198.3; UniProtKB - P10914(IRF1_HUMAN)), 또는 그의 동원체일 수 있다. [Bouker KB, et al. Interferon regulatory factor-1 (IRF-1) exhibits tumor suppressor activities in breast cancer associated with caspase activation and induction of apoptosis. Carcinogenesis. 2005 Sep;26(9):1527-35. doi: 10.1093/carcin/bgi113]; 및 [Panda D, Gjinaj E, Bachu M, Squire E, Novatt H, Ozato K, Rabin RL. IRF1 Maintains Optimal Constitutive Expression of Antiviral Genes and Regulates the Early Antiviral Response. Front Immunol. 2019 May 15;10:1019. doi: 10.3389/fimmu.2019.01019]. IRF1의 DBD 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 36으로 나타낸다:
MPITRMRMRPWLEMQINSNQIPGLIWINKEEMIFQIPWKHAAKHGWDINKDACLFRSWAIHTGRYKAGEKEPDPKTWKANFRCAMNSLPDIEEVKDQSRNKGSSAVRVYRMLPPLTKNQRKERKSKSSRDAKSKAKRKSCGDSSPDTFSDGLSSSTLPDDHSSY [서열번호 36]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 36에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 DBD - IRF1 폴리펩티드의 우성 음성 작용 형태는 다음과 같은 서열번호 37의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGCCCATCACTCGGATGCGCATGAGACCCTGGCTAGAGATGCAGATTAATTCCAACCAAATCCCGGGGCTCATCTGGATTAATAAAGAGGAGATGATCTTCCAGATCCCATGGAAGCATGCTGCCAAGCATGGCTGGGACATCAACAAGGATGCCTGTTTGTTCCGGAGCTGGGCCATTCACACAGGCCGATACAAAGCAGGGGAAAAGGAGCCAGATCCCAAGACGTGGAAGGCCAACTTTCGCTGTGCCATGAACTCCCTGCCAGATATCGAGGAGGTGAAAGACCAGAGCAGGAACAAGGGCAGCTCAGCTGTGCGAGTGTACCGGATGCTTCCACCTCTCACCAAGAACCAGAGAAAAGAAAGAAAGTCGAAGTCCAGCCGAGATGCTAAGAGCAAGGCCAAGAGGAAGTCATGTGGGGATTCCAGCCCTGATACCTTCTCTGATGGACTCAGCAGCTCCACTCTGCCTGATGACCACAGCAGCTAC [서열번호 37]
따라서, 바람직하게는 상기 DBD - IRF1 폴리펩티드의 우성 음성 작용 형태는 실질적으로 서열번호 37에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 38의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGCCCAUCACUCGGAUGCGCAUGAGACCCUGGCUAGAGAUGCAGAUUAAUUCCAACCAAAUCCCGGGGCUCAUCUGGAUUAAUAAAGAGGAGAUGAUCUUCCAGAUCCCAUGGAAGCAUGCUGCCAAGCAUGGCUGGGACAUCAACAAGGAUGCCUGUUUGUUCCGGAGCUGGGCCAUUCACACAGGCCGAUACAAAGCAGGGGAAAAGGAGCCAGAUCCCAAGACGUGGAAGGCCAACUUUCGCUGUGCCAUGAACUCCCUGCCAGAUAUCGAGGAGGUGAAAGACCAGAGCAGGAACAAGGGCAGCUCAGCUGUGCGAGUGUACCGGAUGCUUCCACCUCUCACCAAGAACCAGAGAAAAGAAAGAAAGUCGAAGUCCAGCCGAGAUGCUAAGAGCAAGGCCAAGAGGAAGUCAUGUGGGGAUUCCAGCCCUGAUACCUUCUCUGAUGGACUCAGCAGCUCCACUCUGCCUGAUGACCACAGCAGCUAC [서열번호 38]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 38에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 36의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 39로 제공된다:
ATGCCCATCACCAGAATGAGAATGCGGCCCTGGCTGGAAATGCAGATCAACAGCAATCAGATCCCCGGCCTGATCTGGATCAACAAAGAAGAGATGATCTTTCAGATCCCGTGGAAGCACGCCGCCAAGCACGGATGGGACATCAACAAGGACGCCTGCCTGTTCAGAAGCTGGGCCATCCACACCGGCAGATACAAGGCCGGCGAGAAAGAGCCCGATCCTAAGACCTGGAAGGCCAACTTCAGATGCGCCATGAACAGCCTGCCTGACATCGAGGAAGTGAAGGACCAGAGCCGGAACAAGGGATCTTCTGCCGTGCGGGTGTACCGGATGTTGCCTCCTCTGACCAAGAACCAGCGCAAAGAGCGGAAGTCCAAGAGCAGCAGAGATGCCAAGAGCAAGGCCAAGAGAAAGTCCTGCGGCGACAGCAGCCCTGACACCTTTTCTGATGGCCTGAGCAGCAGCACCCTGCCAGATGATCACAGCAGCTACTGA [서열번호 39]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 39에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 39의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 40으로 제공된다:
AUGCCCAUCACCAGAAUGAGAAUGCGGCCCUGGCUGGAAAUGCAGAUCAACAGCAAUCAGAUCCCCGGCCUGAUCUGGAUCAACAAAGAAGAGAUGAUCUUUCAGAUCCCGUGGAAGCACGCCGCCAAGCACGGAUGGGACAUCAACAAGGACGCCUGCCUGUUCAGAAGCUGGGCCAUCCACACCGGCAGAUACAAGGCCGGCGAGAAAGAGCCCGAUCCUAAGACCUGGAAGGCCAACUUCAGAUGCGCCAUGAACAGCCUGCCUGACAUCGAGGAAGUGAAGGACCAGAGCCGGAACAAGGGAUCUUCUGCCGUGCGGGUGUACCGGAUGUUGCCUCCUCUGACCAAGAACCAGCGCAAAGAGCGGAAGUCCAAGAGCAGCAGAGAUGCCAAGAGCAAGGCCAAGAGAAAGUCCUGCGGCGACAGCAGCCCUGACACCUUUUCUGAUGGCCUGAGCAGCAGCACCCUGCCAGAUGAUCACAGCAGCUACUGA [서열번호 40]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 40에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF2의 DBD, 즉 DBD - DNA 결합 도메인(DBD)에 기반한 IRF2의 우성 음성 작용 형태, IRF2(1-113)(NCBI 참조 서열: NM_002199.3; UniProtKB - P14316(IRF2_HUMAN), 또는 그의 동원체일 수 있다.
IRF2는 타입 I IFN 및 IFN-유도성 MHC 클래스 I 유전자(인터페론 공통 서열(ICS))의 상류 조절 영역에 특이적으로 결합하고, 상기 유전자를 억제한다. 이것은 또한 H4 및 IL7을 포함하는 몇 가지 유전자에 대한 활성화제로 작용하고, ISRE 프로모터에 구성적으로 결합하여 IL7을 활성화한다(Oshima S., et al., Mol. Cell. Biol. 24:6298-6310(2004)). IRF2의 DBD 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 232로 나타낸다:
MPVERMRMRPWLEEQINSNTIPGLKWLNKEKKIFQIPWMHAARHGWDVEKDAPLFRNWAIHTGKHQPGVDKPDPKTWKANFRCAMNSLPDIEEVKDKSIKKGNNAFRVYRMLP [서열번호 232]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 232에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 DBD - IRF2 폴리펩티드의 우성 음성 작용 형태는 다음과 같은 서열번호 233의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGCCGGTGGAAAGGATGCGCATGCGCCCGTGGCTGGAGGAGCAGATAAACTCCAACACGATCCCGGGGCTCAAGTGGCTTAACAAGGAAAAGAAGATTTTTCAGATCCCCTGGATGCATGCGGCTAGACATGGGTGGGATGTGGAAAAAGATGCACCACTCTTTAGAAACTGGGCAATCCATACAGGAAAGCATCAACCAGGAGTAGATAAACCTGATCCCAAAACATGGAAGGCGAATTTCAGATGCGCCATGAATTCCTTGCCTGATATTGAAGAAGTCAAGGATAAAAGCATAAAGAAAGGAAATAATGCCTTCAGGGTCTACCGAATGCTGCCC [서열번호 233]
따라서, 바람직하게는 상기 DBD - IRF2 폴리펩티드의 우성 음성 작용 형태는 실질적으로 서열번호 233에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 234의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGCCGGUGGAAAGGAUGCGCAUGCGCCCGUGGCUGGAGGAGCAGAUAAACUCCAACACGAUCCCGGGGCUCAAGUGGCUUAACAAGGAAAAGAAGAUUUUUCAGAUCCCCUGGAUGCAUGCGGCUAGACAUGGGUGGGAUGUGGAAAAAGAUGCACCACUCUUUAGAAACUGGGCAAUCCAUACAGGAAAGCAUCAACCAGGAGUAGAUAAACCUGAUCCCAAAACAUGGAAGGCGAAUUUCAGAUGCGCCAUGAAUUCCUUGCCUGAUAUUGAAGAAGUCAAGGAUAAAAGCAUAAAGAAAGGAAAUAAUGCCUUCAGGGUCUACCGAAUGCUGCCC [서열번호 234]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 234에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 232의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 235로 제공된다:
ATGCCCGTGGAACGGATGAGAATGAGGCCCTGGCTGGAAGAACAGATCAACAGCAACACAATCCCCGGCCTGAAGTGGCTGAACAAAGAGAAGAAGATCTTTCAGATCCCCTGGATGCACGCCGCCAGACACGGATGGGATGTCGAGAAAGATGCCCCTCTGTTCAGAAACTGGGCCATCCACACCGGCAAACACCAGCCTGGCGTGGACAAGCCTGATCCTAAGACCTGGAAGGCCAACTTCAGATGCGCCATGAACAGCCTGCCTGACATCGAGGAAGTGAAGGACAAGAGCATCAAGAAGGGCAACAACGCCTTCCGGGTGTACAGAATGCTGCCCTGA [서열번호 235]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 235에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 235의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 236으로 제공된다:
AUGCCCGUGGAACGGAUGAGAAUGAGGCCCUGGCUGGAAGAACAGAUCAACAGCAACACAAUCCCCGGCCUGAAGUGGCUGAACAAAGAGAAGAAGAUCUUUCAGAUCCCCUGGAUGCACGCCGCCAGACACGGAUGGGAUGUCGAGAAAGAUGCCCCUCUGUUCAGAAACUGGGCCAUCCACACCGGCAAACACCAGCCUGGCGUGGACAAGCCUGAUCCUAAGACCUGGAAGGCCAACUUCAGAUGCGCCAUGAACAGCCUGCCUGACAUCGAGGAAGUGAAGGACAAGAGCAUCAAGAAGGGCAACAACGCCUUCCGGGUGUACAGAAUGCUGCCCUGA [서열번호 236]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 236에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 상기 적어도 하나의 IMP는 IRF4의 DBD, 즉 IRF1을 차단하는 DBD(NCBI 참조 서열: NM_002460.4; UniProtKB - Q15306(IRF4_HUMAN)), 또는 그의 동원체일 수 있다. IRF1은 인터페론 유도 캐스케이드의 핵심 조절제로 알려질 것이다(Yoshida K et al, International Immunology, Vol. 17, No. 11, pp. 1463-1471, IRF4 binding domain, blocks IRF1). IRF4의 DBD 단백질 서열의 한 구현예(IRF4(21-129))는 본 명세서에서 다음과 같은 서열번호 41로 나타낸다:
NGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQDYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISDPYKVYRIVP [서열번호 41]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 41에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, IRF4 폴리펩티드의 DBD는 다음과 같은 서열번호 42의 DNA 뉴클레오티드 서열에 의해 암호화된다:
AACGGGAAGCTCCGCCAGTGGCTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCATCTTCCGCATCCCCTGGAAGCACGCGGGCAAGCAGGACTACAACCGCGAGGAGGACGCCGCGCTCTTCAAGGCTTGGGCACTGTTTAAAGGAAAGTTCCGAGAAGGCATCGACAAGCCGGACCCTCCCACCTGGAAGACGCGCCTGCGGTGCGCTTTGAACAAGAGCAATGACTTTGAGGAACTGGTTGAGCGGAGCCAGCTGGACATCTCAGACCCGTACAAAGTGTACAGGATTGTTCCT [서열번호 42]
따라서, 바람직하게는 상기 IRF4 폴리펩티드의 DBD는 실질적으로 서열번호 42에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 43의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AACGGGAAGCUCCGCCAGUGGCUGAUCGACCAGAUCGACAGCGGCAAGUACCCCGGGCUGGUGUGGGAGAACGAGGAGAAGAGCAUCUUCCGCAUCCCCUGGAAGCACGCGGGCAAGCAGGACUACAACCGCGAGGAGGACGCCGCGCUCUUCAAGGCUUGGGCACUGUUUAAAGGAAAGUUCCGAGAAGGCAUCGACAAGCCGGACCCUCCCACCUGGAAGACGCGCCUGCGGUGCGCUUUGAACAAGAGCAAUGACUUUGAGGAACUGGUUGAGCGGAGCCAGCUGGACAUCUCAGACCCGUACAAAGUGUACAGGAUUGUUCCU [서열번호 43]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 43에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 41의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 44로 제공된다:
ATGAACGGCAAGCTGCGGCAGTGGCTGATCGACCAGATCGACAGCGGCAAGTATCCTGGCCTCGTGTGGGAGAACGAGGAAAAGTCTATCTTCAGAATCCCCTGGAAGCACGCCGGCAAGCAGGACTACAACAGAGAAGAGGACGCCGCTCTGTTCAAGGCCTGGGCTCTGTTTAAGGGCAAGTTCAGAGAGGGCATCGACAAGCCCGATCCTCCAACCTGGAAAACCAGACTGAGATGCGCCCTGAACAAGAGCAACGACTTCGAGGAACTGGTGGAAAGAAGCCAGCTGGACATCAGCGACCCCTACAAGGTGTACCGGATCGTGCCTTGA [서열번호 44]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 44에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 44의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 45로 제공된다:
AUGAACGGCAAGCUGCGGCAGUGGCUGAUCGACCAGAUCGACAGCGGCAAGUAUCCUGGCCUCGUGUGGGAGAACGAGGAAAAGUCUAUCUUCAGAAUCCCCUGGAAGCACGCCGGCAAGCAGGACUACAACAGAGAAGAGGACGCCGCUCUGUUCAAGGCCUGGGCUCUGUUUAAGGGCAAGUUCAGAGAGGGCAUCGACAAGCCCGAUCCUCCAACCUGGAAAACCAGACUGAGAUGCGCCCUGAACAAGAGCAACGACUUCGAGGAACUGGUGGAAAGAAGCCAGCUGGACAUCAGCGACCCCUACAAGGUGUACCGGAUCGUGCCUUGA [서열번호 45]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 45에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 상기 적어도 하나의 IMP는 본 명세서에서 다음과 같은 서열번호 257로 나타내는 IRF4(1-129)일 수 있다:
MNLEGGGRGGEFGMSAVSCGNGKLRQWLIDQIDSGKYPGLVWENEEKSIFRIPWKHAGKQDYNREEDAALFKAWALFKGKFREGIDKPDPPTWKTRLRCALNKSNDFEELVERSQLDISDPYKVYRIVP [서열번호 257]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 257에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, IRF4 폴리펩티드의 DBD는 다음과 같은 서열번호 258의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAACCTGGAGGGCGGCGGCCGAGGCGGAGAGTTCGGCATGAGCGCGGTGAGCTGCGGCAACGGGAAGCTCCGCCAGTGGCTGATCGACCAGATCGACAGCGGCAAGTACCCCGGGCTGGTGTGGGAGAACGAGGAGAAGAGCATCTTCCGCATCCCCTGGAAGCACGCGGGCAAGCAGGACTACAACCGCGAGGAGGACGCCGCGCTCTTCAAGGCTTGGGCACTGTTTAAAGGAAAGTTCCGAGAAGGCATCGACAAGCCGGACCCTCCCACCTGGAAGACGCGCCTGCGGTGCGCTTTGAACAAGAGCAATGACTTTGAGGAACTGGTTGAGCGGAGCCAGCTGGACATCTCAGACCCGTACAAAGTGTACAGGATTGTTCCT [서열번호 258]
따라서, 바람직하게는 IRF4 폴리펩티드의 DBD는 실질적으로 서열번호 258에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 259의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAACCUGGAGGGCGGCGGCCGAGGCGGAGAGUUCGGCAUGAGCGCGGUGAGCUGCGGCAACGGGAAGCUCCGCCAGUGGCUGAUCGACCAGAUCGACAGCGGCAAGUACCCCGGGCUGGUGUGGGAGAACGAGGAGAAGAGCAUCUUCCGCAUCCCCUGGAAGCACGCGGGCAAGCAGGACUACAACCGCGAGGAGGACGCCGCGCUCUUCAAGGCUUGGGCACUGUUUAAAGGAAAGUUCCGAGAAGGCAUCGACAAGCCGGACCCUCCCACCUGGAAGACGCGCCUGCGGUGCGCUUUGAACAAGAGCAAUGACUUUGAGGAACUGGUUGAGCGGAGCCAGCUGGACAUCUCAGACCCGUACAAAGUGUACAGGAUUGUUCCU [서열번호 259]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 259에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 257의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 260으로 제공된다:
ATGAATCTGGAAGGCGGCGGAAGAGGCGGCGAGTTTGGAATGTCTGCCGTGTCCTGTGGCAACGGCAAGCTGAGACAGTGGCTGATCGACCAGATCGACAGCGGCAAGTATCCTGGCCTCGTGTGGGAGAACGAGGAAAAGTCTATCTTCAGAATCCCCTGGAAGCACGCCGGCAAGCAGGACTACAACAGAGAAGAGGACGCCGCTCTGTTCAAGGCCTGGGCTCTGTTTAAGGGCAAGTTCAGAGAGGGCATCGACAAGCCCGATCCTCCAACCTGGAAAACCAGACTGAGATGCGCCCTGAACAAGAGCAACGACTTCGAGGAACTGGTGGAAAGAAGCCAGCTGGACATCAGCGACCCCTACAAGGTGTACCGGATCGTGCCCTGA [서열번호 260]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 260에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 260의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 261로 제공된다:
AUGAAUCUGGAAGGCGGCGGAAGAGGCGGCGAGUUUGGAAUGUCUGCCGUGUCCUGUGGCAACGGCAAGCUGAGACAGUGGCUGAUCGACCAGAUCGACAGCGGCAAGUAUCCUGGCCUCGUGUGGGAGAACGAGGAAAAGUCUAUCUUCAGAAUCCCCUGGAAGCACGCCGGCAAGCAGGACUACAACAGAGAAGAGGACGCCGCUCUGUUCAAGGCCUGGGCUCUGUUUAAGGGCAAGUUCAGAGAGGGCAUCGACAAGCCCGAUCCUCCAACCUGGAAAACCAGACUGAGAUGCGCCCUGAACAAGAGCAACGACUUCGAGGAACUGGUGGAAAGAAGCCAGCUGGACAUCAGCGACCCCUACAAGGUGUACCGGAUCGUGCCCUGA [서열번호 261]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 261에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 상기 적어도 하나의 IMP는 IRF5의 DBD(Yang L, Zhao T, Shi X, Nakhaei P, Wang Y, Sun Q, Hiscott J, Lin R. Functional analysis of a dominant negative acting mutation of interferon regulatory factor 5. PLoS One. 2009;4(5):e5500)(NCBI 참조 서열: NM_032643.5; UniProtKB - Q13568(IRF5_HUMAN)), 또는 그의 동원체일 수 있다. IRF5 및 7은 모두 TLR7/8의 하류를 촉발한다. IRF5의 DBD 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 46으로 나타낸다:
MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGDNTIFKAWAKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEVCSNGPAPTDSQPPEDYSFGA [서열번호 46]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 46에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다. 서열번호 46에서 굵게 강조된 68번째 아미노산은 상기 야생형 서열에서는 알라닌이며, 프롤린으로 돌연변이되어 상기 단백질의 우성 음성 작용 형태를 형성할 수 있다(서열번호 51 참조).
한 구현예에서, IRF5 폴리펩티드의 DBD(IRF5(1-140))는 다음과 같은 서열번호 47의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAACCAGTCCATCCCAGTGGCTCCCACCCCACCCCGCCGCGTGCGGCTGAAGCCCTGGCTGGTGGCCCAGGTGAACAGCTGCCAGTACCCAGGGCTTCAATGGGTCAACGGGGAAAAGAAATTATTCTGCATCCCCTGGAGGCATGCCACAAGGCATGGTCCCAGCCAGGACGGAGATAACACCATCTTCAAGGCCTGGGCCAAGGAGACAGGGAAATACACCGAAGGCGTGGATGAAGCCGATCCGGCCAAGTGGAAGGCCAACCTGCGCTGTGCCCTTAACAAGAGCCGGGACTTCCGCCTCATCTACGACGGGCCCCGGGACATGCCACCTCAGCCCTACAAGATCTACGAGGTCTGCTCCAATGGCCCTGCTCCCACAGACTCCCAGCCCCCTGAGGATTACTCTTTTGGTGCA [서열번호 47]
따라서, 바람직하게는 IRF5 폴리펩티드의 DBD는 실질적으로 서열번호 47에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 48의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAACCAGUCCAUCCCAGUGGCUCCCACCCCACCCCGCCGCGUGCGGCUGAAGCCCUGGCUGGUGGCCCAGGUGAACAGCUGCCAGUACCCAGGGCUUCAAUGGGUCAACGGGGAAAAGAAAUUAUUCUGCAUCCCCUGGAGGCAUGCCACAAGGCAUGGUCCCAGCCAGGACGGAGAUAACACCAUCUUCAAGGCCUGGGCCAAGGAGACAGGGAAAUACACCGAAGGCGUGGAUGAAGCCGAUCCGGCCAAGUGGAAGGCCAACCUGCGCUGUGCCCUUAACAAGAGCCGGGACUUCCGCCUCAUCUACGACGGGCCCCGGGACAUGCCACCUCAGCCCUACAAGAUCUACGAGGUCUGCUCCAAUGGCCCUGCUCCCACAGACUCCCAGCCCCCUGAGGAUUACUCUUUUGGUGCA [서열번호 48]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 48에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 46의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 49로 제공된다:
ATGAACCAGAGCATCCCCGTGGCTCCCACACCTCCTAGAAGAGTGCGACTGAAGCCTTGGCTGGTGGCCCAAGTGAACAGCTGTCAGTATCCTGGCCTGCAGTGGGTCAACGGCGAGAAGAAGCTGTTCTGCATCCCTTGGAGACACGCCACCAGACACGGCCCTTCTCAGGACGGCGACAACACCATCTTTAAGGCCTGGGCCAAAGAGACAGGCAAGTACACCGAAGGCGTGGACGAAGCCGATCCTGCCAAGTGGAAGGCCAATCTGAGATGCGCCCTGAACAAGAGCCGGGACTTCCGGCTGATCTACGACGGCCCTAGAGACATGCCTCCTCAGCCTTACAAGATCTACGAAGTGTGCAGCAACGGCCCTGCTCCTACCGATTCTCAGCCTCCTGAGGACTACAGCTTCGGCGCTTGA [서열번호 49]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 49에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 49의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 50으로 제공된다:
AUGAACCAGAGCAUCCCCGUGGCUCCCACACCUCCUAGAAGAGUGCGACUGAAGCCUUGGCUGGUGGCCCAAGUGAACAGCUGUCAGUAUCCUGGCCUGCAGUGGGUCAACGGCGAGAAGAAGCUGUUCUGCAUCCCUUGGAGACACGCCACCAGACACGGCCCUUCUCAGGACGGCGACAACACCAUCUUUAAGGCCUGGGCCAAAGAGACAGGCAAGUACACCGAAGGCGUGGACGAAGCCGAUCCUGCCAAGUGGAAGGCCAAUCUGAGAUGCGCCCUGAACAAGAGCCGGGACUUCCGGCUGAUCUACGACGGCCCUAGAGACAUGCCUCCUCAGCCUUACAAGAUCUACGAAGUGUGCAGCAACGGCCCUGCUCCUACCGAUUCUCAGCCUCCUGAGGACUACAGCUUCGGCGCUUGA [서열번호 50]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 50에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
추가 구현예에서, 전체 단백질은 서열번호 51(NCBI 참조 서열: NM_032643.5; UniProtKB - Q13568(IRF5_HUMAN))에 강조된 것과 같이 돌연변이된 전사체가 68번째 아미노산인 알라닌이 프롤린에 의해 치환된 버전(IRF5 A68P)을 암호화할 때 우성 음성 작용 형태, 또는 그의 동원체로 작용한다. 따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 68번째 아미노산인 알라닌이 프롤린에 의해 치환된 서열번호 51에 나타낸 것과 같은 아미노산 서열(IRF5 A68P), 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
MNQSIPVAPTPPRRVRLKPWLVAQVNSCQYPGLQWVNGEKKLFCIPWRHATRHGPSQDGDNTIFKAWPKETGKYTEGVDEADPAKWKANLRCALNKSRDFRLIYDGPRDMPPQPYKIYEVCSNGPAPTDSQPPEDYSFGA [서열번호 51]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 51에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 돌연변이된 폴리펩티드는 다음과 같은 서열번호 52의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAACCAGTCCATCCCAGTGGCTCCCACCCCACCCCGCCGCGTGCGGCTGAAGCCCTGGCTGGTGGCCCAGGTGAACAGCTGCCAGTACCCAGGGCTTCAATGGGTCAACGGGGAAAAGAAATTATTCTGCATCCCCTGGAGGCATGCCACAAGGCATGGTCCCAGCCAGGACGGAGATAACACCATCTTCAAGGCCTGGCCCAAGGAGACAGGGAAATACACCGAAGGCGTGGATGAAGCCGATCCGGCCAAGTGGAAGGCCAACCTGCGCTGTGCCCTTAACAAGAGCCGGGACTTCCGCCTCATCTACGACGGGCCCCGGGACATGCCACCTCAGCCCTACAAGATCTACGAGGTCTGCTCCAATGGCCCTGCTCCCACAGACTCCCAGCCCCCTGAGGATTACTCTTTTGGTGCA [서열번호 52]
따라서, 바람직하게는 상기 돌연변이된 폴리펩티드는 실질적으로 서열번호 52에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 53의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAACCAGUCCAUCCCAGUGGCUCCCACCCCACCCCGCCGCGUGCGGCUGAAGCCCUGGCUGGUGGCCCAGGUGAACAGCUGCCAGUACCCAGGGCUUCAAUGGGUCAACGGGGAAAAGAAAUUAUUCUGCAUCCCCUGGAGGCAUGCCACAAGGCAUGGUCCCAGCCAGGACGGAGAUAACACCAUCUUCAAGGCCUGGCCCAAGGAGACAGGGAAAUACACCGAAGGCGUGGAUGAAGCCGAUCCGGCCAAGUGGAAGGCCAACCUGCGCUGUGCCCUUAACAAGAGCCGGGACUUCCGCCUCAUCUACGACGGGCCCCGGGACAUGCCACCUCAGCCCUACAAGAUCUACGAGGUCUGCUCCAAUGGCCCUGCUCCCACAGACUCCCAGCCCCCUGAGGAUUACUCUUUUGGUGCA [서열번호 53]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 53에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 51의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 54로 제공된다:
ATGAACCAGAGCATCCCCGTGGCTCCCACACCTCCTAGAAGAGTGCGACTGAAGCCTTGGCTGGTGGCCCAAGTGAACAGCTGTCAGTATCCTGGCCTGCAGTGGGTCAACGGCGAGAAGAAGCTGTTCTGCATCCCTTGGAGACACGCCACCAGACACGGCCCTTCTCAGGACGGCGACAACACCATCTTTAAGGCCTGGcCCAAAGAGACAGGCAAGTACACCGAAGGCGTGGACGAAGCCGATCCTGCCAAGTGGAAGGCCAATCTGAGATGCGCCCTGAACAAGAGCCGGGACTTCCGGCTGATCTACGACGGCCCTAGAGACATGCCTCCTCAGCCTTACAAGATCTACGAAGTGTGCAGCAACGGCCCTGCTCCTACCGATTCTCAGCCTCCTGAGGACTACAGCTTCGGCGCTTGA [서열번호 54]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 54에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 54의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 55로 제공된다:
AUGAACCAGAGCAUCCCCGUGGCUCCCACACCUCCUAGAAGAGUGCGACUGAAGCCUUGGCUGGUGGCCCAAGUGAACAGCUGUCAGUAUCCUGGCCUGCAGUGGGUCAACGGCGAGAAGAAGCUGUUCUGCAUCCCUUGGAGACACGCCACCAGACACGGCCCUUCUCAGGACGGCGACAACACCAUCUUUAAGGCCUGGcCCAAAGAGACAGGCAAGUACACCGAAGGCGUGGACGAAGCCGAUCCUGCCAAGUGGAAGGCCAAUCUGAGAUGCGCCCUGAACAAGAGCCGGGACUUCCGGCUGAUCUACGACGGCCCUAGAGACAUGCCUCCUCAGCCUUACAAGAUCUACGAAGUGUGCAGCAACGGCCCUGCUCCUACCGAUUCUCAGCCUCCUGAGGACUACAGCUUCGGCGCUUGA [서열번호 55]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 55에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF6의 DBD, 즉 DBD - DNA 결합 도메인(DBD)에 기반한 IRF6의 우성 음성 작용 형태(1-115)(NCBI 참조 서열: NM_006147.3; UniProtKB - O14896(IRF6_HUMAN) 또는 그의 동원체일 수 있다. IRF6의 DBD 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 237로 나타낸다:
MALHPRRVRLKPWLVAQVDSGLYPGLIWLHRDSKRFQIPWKHATRHSPQQEEENTIFKAWAVETGKYQEGVDDPDPAKWKAQLRCALNKSREFNLMYDGTKEVPMNPVKIYQVCD [서열번호 237]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 237에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편에 의해 암호화되는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 DBD - IRF6 폴리펩티드의 우성 음성 작용 형태는 다음과 같은 서열번호 238의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCCCTCCACCCCCGCAGAGTCCGGCTAAAGCCCTGGCTGGTGGCCCAGGTGGATAGTGGCCTCTACCCTGGGCTCATCTGGCTACACAGGGACTCTAAACGCTTCCAGATTCCCTGGAAACATGCCACCCGGCATAGCCCTCAACAAGAAGAGGAAAATACCATTTTTAAGGCCTGGGCTGTAGAGACAGGGAAGTACCAGGAAGGGGTGGATGACCCTGACCCAGCTAAATGGAAGGCCCAGCTGCGCTGTGCTCTCAATAAGAGCAGAGAATTCAACCTGATGTATGATGGCACCAAGGAGGTGCCCATGAACCCAGTGAAGATATATCAAGTGTGTGAC [서열번호 238]
따라서, 바람직하게는 상기 DBD - IRF6 폴리펩티드의 우성 음성 작용 형태는 실질적으로 서열번호 238에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 239의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCCCUCCACCCCCGCAGAGUCCGGCUAAAGCCCUGGCUGGUGGCCCAGGUGGAUAGUGGCCUCUACCCUGGGCUCAUCUGGCUACACAGGGACUCUAAACGCUUCCAGAUUCCCUGGAAACAUGCCACCCGGCAUAGCCCUCAACAAGAAGAGGAAAAUACCAUUUUUAAGGCCUGGGCUGUAGAGACAGGGAAGUACCAGGAAGGGGUGGAUGACCCUGACCCAGCUAAAUGGAAGGCCCAGCUGCGCUGUGCUCUCAAUAAGAGCAGAGAAUUCAACCUGAUGUAUGAUGGCACCAAGGAGGUGCCCAUGAACCCAGUGAAGAUAUAUCAAGUGUGUGAC [서열번호 239]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 239에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 237의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 240으로 제공된다:
ATGGCCCTGCATCCTAGAAGAGTGCGGCTGAAGCCTTGGCTGGTGGCTCAAGTGGATAGCGGCCTGTATCCTGGCCTGATCTGGCTGCACAGAGACAGCAAGCGGTTTCAGATCCCCTGGAAGCACGCCACCAGACACAGCCCTCAGCAAGAGGAAGAGAACACCATCTTCAAGGCCTGGGCCGTCGAGACAGGCAAGTACCAAGAAGGCGTGGACGACCCCGATCCTGCCAAATGGAAAGCCCAGCTGAGATGCGCCCTGAACAAGAGCCGCGAGTTCAACCTGATGTACGACGGCACCAAAGAGGTGCCCATGAATCCCGTGAAGATCTACCAAGTGTGCGACTGA 서열번호 240]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 240에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 240의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 241로 제공된다:
AUGGCCCUGCAUCCUAGAAGAGUGCGGCUGAAGCCUUGGCUGGUGGCUCAAGUGGAUAGCGGCCUGUAUCCUGGCCUGAUCUGGCUGCACAGAGACAGCAAGCGGUUUCAGAUCCCCUGGAAGCACGCCACCAGACACAGCCCUCAGCAAGAGGAAGAGAACACCAUCUUCAAGGCCUGGGCCGUCGAGACAGGCAAGUACCAAGAAGGCGUGGACGACCCCGAUCCUGCCAAAUGGAAAGCCCAGCUGAGAUGCGCCCUGAACAAGAGCCGCGAGUUCAACCUGAUGUACGACGGCACCAAAGAGGUGCCCAUGAAUCCCGUGAAGAUCUACCAAGUGUGCGACUGA [서열번호 241]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 241에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 상기 적어도 하나의 IMP는 IRF8의 DBD, 즉 IRF-8 DBD(1-140)(DNA 결합 모티프, IRG 프로모터에 대한 다른 IRF의 결합을 방지함. [Thornton AM, et al. A dominant negative mutant of an IFN regulatory factor family protein inhibits both type I and type II IFN-stimulated gene expression and antiproliferative activity of IFNs. J Immunol. 1996 Dec 1;157(11):5145-54])(NCBI 참조 서열: NM_002163; UniProtKB - Q02556(IRF8_HUMAN)), 또는 그의 동원체일 수 있다. IRF8의 DBD 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 56으로 나타낸다:
MCDRNGGRRLRQWLIEQIDSSMYPGLIWENEEKSMFRIPWKHAGKQDYNQEVDASIFKAWAVFKGKFKEGDKAEPATWKTRLRCALNKSPDFEEVTDRSQLDISEPYKVYRIVPEEEQKCKLGVATAGCVNEVTEMECGR [서열번호 56]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 56에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 IRF8 DBD 폴리펩티드는 다음과 같은 서열번호 57의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGTGTGACCGGAATGGTGGTCGGCGGCTTCGACAGTGGCTGATCGAGCAGATTGACAGTAGCATGTATCCAGGACTGATTTGGGAGAATGAGGAGAAGAGCATGTTCCGGATCCCTTGGAAACACGCTGGCAAGCAAGATTATAATCAGGAAGTGGATGCCTCCATTTTTAAGGCCTGGGCAGTTTTTAAAGGGAAGTTTAAAGAAGGGGACAAAGCTGAACCAGCCACTTGGAAGACGAGGTTACGCTGTGCTTTGAATAAGAGCCCAGATTTTGAGGAAGTGACGGACCGGTCCCAACTGGACATTTCCGAGCCATACAAAGTTTACCGAATTGTTCCTGAGGAAGAGCAAAAATGCAAACTAGGCGTGGCAACTGCTGGCTGCGTGAATGAAGTTACAGAGATGGAGTGCGGTCGC [서열번호 57]
따라서, 바람직하게는 상기 IRF8 DBD 폴리펩티드는 실질적으로 서열번호 57에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 58의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGUGUGACCGGAAUGGUGGUCGGCGGCUUCGACAGUGGCUGAUCGAGCAGAUUGACAGUAGCAUGUAUCCAGGACUGAUUUGGGAGAAUGAGGAGAAGAGCAUGUUCCGGAUCCCUUGGAAACACGCUGGCAAGCAAGAUUAUAAUCAGGAAGUGGAUGCCUCCAUUUUUAAGGCCUGGGCAGUUUUUAAAGGGAAGUUUAAAGAAGGGGACAAAGCUGAACCAGCCACUUGGAAGACGAGGUUACGCUGUGCUUUGAAUAAGAGCCCAGAUUUUGAGGAAGUGACGGACCGGUCCCAACUGGACAUUUCCGAGCCAUACAAAGUUUACCGAAUUGUUCCUGAGGAAGAGCAAAAAUGCAAACUAGGCGUGGCAACUGCUGGCUGCGUGAAUGAAGUUACAGAGAUGGAGUGCGGUCGC [서열번호 58]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 58에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 56의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 59로 제공된다:
ATGTGCGACAGAAATGGCGGCAGACGGCTGAGACAGTGGCTGATCGAGCAGATCGACAGCAGCATGTACCCCGGCCTGATCTGGGAGAACGAAGAGAAGTCTATGTTCAGGATCCCCTGGAAGCACGCCGGCAAGCAGGACTACAATCAAGAGGTGGACGCCAGCATCTTCAAGGCCTGGGCCGTGTTCAAGGGCAAGTTCAAAGAGGGCGACAAGGCCGAGCCTGCCACCTGGAAAACCAGACTGAGATGCGCCCTGAACAAGAGCCCCGACTTCGAGGAAGTGACCGACAGAAGCCAGCTGGACATCAGCGAGCCCTACAAGGTGTACCGGATCGTGCCCGAAGAGGAACAGAAATGCAAGCTGGGCGTTGCCACCGCCGGCTGTGTGAATGAAGTGACAGAGATGGAATGCGGCCGGTGA [서열번호 59]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 59에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 59의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 60으로 제공된다:
AUGUGCGACAGAAAUGGCGGCAGACGGCUGAGACAGUGGCUGAUCGAGCAGAUCGACAGCAGCAUGUACCCCGGCCUGAUCUGGGAGAACGAAGAGAAGUCUAUGUUCAGGAUCCCCUGGAAGCACGCCGGCAAGCAGGACUACAAUCAAGAGGUGGACGCCAGCAUCUUCAAGGCCUGGGCCGUGUUCAAGGGCAAGUUCAAAGAGGGCGACAAGGCCGAGCCUGCCACCUGGAAAACCAGACUGAGAUGCGCCCUGAACAAGAGCCCCGACUUCGAGGAAGUGACCGACAGAAGCCAGCUGGACAUCAGCGAGCCCUACAAGGUGUACCGGAUCGUGCCCGAAGAGGAACAGAAAUGCAAGCUGGGCGUUGCCACCGCCGGCUGUGUGAAUGAAGUGACAGAGAUGGAAUGCGGCCGGUGA [서열번호 60]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 60에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 IRF9의 DBD, 즉 IRF9 DBD(1-120)일 수 있다. IRF9의 DBD 단백질 서열의 한 구현예는 NCBI 참조 서열: NM_006084.5; UniProtKB - Q00978(IRF9_HUMAN), 또는 그의 동원체로 나타내며, 본 명세서에서 다음과 같은 서열번호 61로 나타낸다:
MASGRARCTRKLRNWVVEQVESGQFPGVCWDDTAKTMFRIPWKHAGKQDFREDQDAAFFKAWAIFKGKYKEGDTGGPAVWKTRLRCALNKSSEFKEVPERGRMDVAEPYKVYQLLPPGIV [서열번호 61]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 61에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 IRF9 DBD 폴리펩티드는 다음과 같은 서열번호 62의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCATCAGGCAGGGCACGCTGCACCCGAAAACTCCGGAACTGGGTGGTGGAGCAAGTGGAGAGTGGGCAGTTTCCCGGAGTGTGCTGGGATGATACAGCTAAGACCATGTTCCGGATTCCCTGGAAACATGCAGGCAAGCAGGACTTCCGGGAGGACCAGGATGCTGCCTTCTTCAAGGCCTGGGCAATATTTAAGGGAAAGTATAAGGAGGGGGACACAGGAGGTCCAGCTGTCTGGAAGACTCGCCTGCGCTGTGCACTCAACAAGAGTTCTGAATTTAAGGAGGTTCCTGAGAGGGGCCGCATGGATGTTGCTGAGCCCTACAAGGTGTATCAGTTGCTGCCACCAGGAATCGTC [서열번호 62]
따라서, 바람직하게는 상기 IRF9 DBD 폴리펩티드는 실질적으로 서열번호 62에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 63의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCAUCAGGCAGGGCACGCUGCACCCGAAAACUCCGGAACUGGGUGGUGGAGCAAGUGGAGAGUGGGCAGUUUCCCGGAGUGUGCUGGGAUGAUACAGCUAAGACCAUGUUCCGGAUUCCCUGGAAACAUGCAGGCAAGCAGGACUUCCGGGAGGACCAGGAUGCUGCCUUCUUCAAGGCCUGGGCAAUAUUUAAGGGAAAGUAUAAGGAGGGGGACACAGGAGGUCCAGCUGUCUGGAAGACUCGCCUGCGCUGUGCACUCAACAAGAGUUCUGAAUUUAAGGAGGUUCCUGAGAGGGGCCGCAUGGAUGUUGCUGAGCCCUACAAGGUGUAUCAGUUGCUGCCACCAGGAAUCGUC [서열번호 63]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 63에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 61의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 64로 제공된다:
ATGGCTTCTGGCAGAGCCAGATGCACCCGGAAGCTGAGAAACTGGGTCGTCGAACAGGTGGAAAGCGGACAGTTCCCTGGCGTGTGCTGGGATGATACCGCCAAGACAATGTTCAGAATCCCCTGGAAGCACGCCGGCAAGCAGGACTTCAGAGAAGATCAGGACGCCGCCTTCTTCAAGGCCTGGGCCATCTTCAAGGGCAAGTACAAAGAGGGCGACACAGGCGGACCTGCCGTGTGGAAAACCAGACTGAGATGCGCCCTGAACAAGAGCAGCGAGTTCAAAGAGGTGCCCGAGCGGGGCAGAATGGATGTGGCCGAACCTTACAAGGTGTACCAGCTGCTGCCTCCTGGCATCGTGTGA [서열번호 64]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 64에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 64의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 65로 제공된다:
AUGGCUUCUGGCAGAGCCAGAUGCACCCGGAAGCUGAGAAACUGGGUCGUCGAACAGGUGGAAAGCGGACAGUUCCCUGGCGUGUGCUGGGAUGAUACCGCCAAGACAAUGUUCAGAAUCCCCUGGAAGCACGCCGGCAAGCAGGACUUCAGAGAAGAUCAGGACGCCGCCUUCUUCAAGGCCUGGGCCAUCUUCAAGGGCAAGUACAAAGAGGGCGACACAGGCGGACCUGCCGUGUGGAAAACCAGACUGAGAUGCGCCCUGAACAAGAGCAGCGAGUUCAAAGAGGUGCCCGAGCGGGGCAGAAUGGAUGUGGCCGAACCUUACAAGGUGUACCAGCUGCUGCCUCCUGGCAUCGUGUGA [서열번호 65]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 65에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
카테고리 2: 인터페론 생산을 유도하고 인터페론-자극 유전자의 자극을 야기하는 경로의 억제제
한 구현예에서, 상기 IMP는 인터페론 생산을 유도하고 인터페론-자극 유전자의 자극을 야기하는 경로를 억제하도록 구성될 수 있다.
그러므로, 선천적 신호전달 경로의 억제제 또는 우성 음성 억제제는 HSP90의 C-말단이 절단된 돌연변이체일 수 있다. 상기 HSP90 돌연변이체는 HSP90(CDC37)(1-232)(NCBI 참조 서열: NM_007065.4; UniProtKB - Q16543(CDC37_HUMAN)), 또는 그의 동원체, IRF3 활성화, 즉 IRF3-TBK1 신호전달의 우성 음성 억제제일 수 있다. [Yang et al. Hsp90 Regulates Activation of Interferon Regulatory Factor 3 and TBK-1 Stabilization in Sendai Virus-infected Cells, Molecular Biology of the Cell Vol. 17, 1461-1471, March 2006]. 상기 HSP90 우성 음성 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 81로 나타낸다:
MVDYSVWDHIEVSDDEDETHPNIDTASLFRWRHQARVERMEQFQKEKEELDRGCRECKRKVAECQRKLKELEVAEGGKAELERLQAEAQQLRKEERSWEQKLEEMRKKEKSMPWNVDTLSKDGFSKSMVNTKPEKTEEDSEEVREQKHKTFVEKYEKQIKHFGMLRRWDDSQKYLSDNVHLVCEETANYLVIWCIDLEVEEKCALMEQVAHQTIVMQFILELAKSLKVDPRA [서열번호 81]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 81에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 HSP90 억제제 또는 우성 음성 작용 형태 폴리펩티드는 다음과 같은 서열번호 82의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGTGGACTACAGCGTGTGGGACCACATTGAGGTGTCTGATGATGAAGACGAGACGCACCCCAACATCGACACGGCCAGTCTCTTCCGCTGGCGGCATCAGGCCCGGGTGGAACGCATGGAGCAGTTCCAGAAGGAGAAGGAGGAACTGGACAGGGGCTGCCGCGAGTGCAAGCGCAAGGTGGCCGAGTGCCAGAGGAAACTGAAGGAGCTGGAGGTGGCCGAGGGCGGCAAGGCAGAGCTGGAGCGCCTGCAGGCCGAGGCACAGCAGCTGCGCAAGGAGGAGCGGAGCTGGGAGCAGAAGCTGGAGGAGATGCGCAAGAAGGAGAAGAGCATGCCCTGGAACGTGGACACGCTCAGCAAAGACGGCTTCAGCAAGAGCATGGTAAATACCAAGCCCGAGAAGACGGAGGAGGACTCAGAGGAGGTGAGGGAGCAGAAACACAAGACCTTCGTGGAAAAATACGAGAAACAGATCAAGCACTTTGGCATGCTTCGCCGCTGGGATGACAGCCAAAAGTACCTGTCAGACAACGTCCACCTGGTGTGCGAGGAGACAGCCAATTACCTGGTCATTTGGTGCATTGACCTAGAGGTGGAGGAGAAATGTGCACTCATGGAGCAGGTGGCCCACCAGACAATCGTCATGCAATTTATCCTGGAGCTGGCCAAGAGCCTAAAGGTGGACCCCCGGGCC [서열번호 82]
따라서, 바람직하게는 상기 HSP90 억제제 또는 우성 음성 작용 형태 폴리펩티드는 실질적으로 서열번호 82에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 83의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGUGGACUACAGCGUGUGGGACCACAUUGAGGUGUCUGAUGAUGAAGACGAGACGCACCCCAACAUCGACACGGCCAGUCUCUUCCGCUGGCGGCAUCAGGCCCGGGUGGAACGCAUGGAGCAGUUCCAGAAGGAGAAGGAGGAACUGGACAGGGGCUGCCGCGAGUGCAAGCGCAAGGUGGCCGAGUGCCAGAGGAAACUGAAGGAGCUGGAGGUGGCCGAGGGCGGCAAGGCAGAGCUGGAGCGCCUGCAGGCCGAGGCACAGCAGCUGCGCAAGGAGGAGCGGAGCUGGGAGCAGAAGCUGGAGGAGAUGCGCAAGAAGGAGAAGAGCAUGCCCUGGAACGUGGACACGCUCAGCAAAGACGGCUUCAGCAAGAGCAUGGUAAAUACCAAGCCCGAGAAGACGGAGGAGGACUCAGAGGAGGUGAGGGAGCAGAAACACAAGACCUUCGUGGAAAAAUACGAGAAACAGAUCAAGCACUUUGGCAUGCUUCGCCGCUGGGAUGACAGCCAAAAGUACCUGUCAGACAACGUCCACCUGGUGUGCGAGGAGACAGCCAAUUACCUGGUCAUUUGGUGCAUUGACCUAGAGGUGGAGGAGAAAUGUGCACUCAUGGAGCAGGUGGCCCACCAGACAAUCGUCAUGCAAUUUAUCCUGGAGCUGGCCAAGAGCCUAAAGGUGGACCCCCGGGCC [서열번호 83]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 83에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 81의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 84로 제공된다:
ATGGTGGACTACAGCGTGTGGGACCACATCGAGGTGTCCGACGACGAGGATGAGACACACCCCAACATCGATACCGCCAGCCTGTTCAGATGGCGGCACCAGGCTAGAGTGGAACGGATGGAACAGTTCCAGAAAGAGAAAGAGGAACTGGACCGGGGCTGCCGCGAGTGCAAAAGAAAAGTGGCCGAGTGCCAGCGGAAGCTGAAAGAACTGGAAGTGGCTGAAGGCGGCAAGGCCGAGCTGGAAAGACTGCAGGCTGAAGCCCAGCAGCTGCGCAAAGAGGAAAGAAGCTGGGAGCAGAAACTGGAAGAGATGCGCAAGAAAGAAAAATCCATGCCGTGGAACGTGGACACCCTGAGCAAGGACGGCTTCAGCAAGAGCATGGTCAACACCAAGCCTGAGAAAACCGAAGAGGACAGCGAGGAAGTGCGGGAACAGAAACACAAGACCTTCGTCGAGAAGTACGAGAAGCAGATCAAGCACTTCGGCATGCTGCGGAGATGGGACGACAGCCAGAAGTACCTGAGCGACAACGTGCACCTCGTGTGCGAGGAAACCGCCAACTACCTGGTCATCTGGTGCATCGATCTCGAGGTGGAAGAGAAGTGCGCCCTCATGGAACAGGTGGCCCACCAGACAATCGTGATGCAGTTCATCCTGGAACTGGCCAAGAGCCTGAAGGTGGACCCTAGAGCTTGA [서열번호 84]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 84에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 84의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 85로 제공된다:
AUGGUGGACUACAGCGUGUGGGACCACAUCGAGGUGUCCGACGACGAGGAUGAGACACACCCCAACAUCGAUACCGCCAGCCUGUUCAGAUGGCGGCACCAGGCUAGAGUGGAACGGAUGGAACAGUUCCAGAAAGAGAAAGAGGAACUGGACCGGGGCUGCCGCGAGUGCAAAAGAAAAGUGGCCGAGUGCCAGCGGAAGCUGAAAGAACUGGAAGUGGCUGAAGGCGGCAAGGCCGAGCUGGAAAGACUGCAGGCUGAAGCCCAGCAGCUGCGCAAAGAGGAAAGAAGCUGGGAGCAGAAACUGGAAGAGAUGCGCAAGAAAGAAAAAUCCAUGCCGUGGAACGUGGACACCCUGAGCAAGGACGGCUUCAGCAAGAGCAUGGUCAACACCAAGCCUGAGAAAACCGAAGAGGACAGCGAGGAAGUGCGGGAACAGAAACACAAGACCUUCGUCGAGAAGUACGAGAAGCAGAUCAAGCACUUCGGCAUGCUGCGGAGAUGGGACGACAGCCAGAAGUACCUGAGCGACAACGUGCACCUCGUGUGCGAGGAAACCGCCAACUACCUGGUCAUCUGGUGCAUCGAUCUCGAGGUGGAAGAGAAGUGCGCCCUCAUGGAACAGGUGGCCCACCAGACAAUCGUGAUGCAGUUCAUCCUGGAACUGGCCAAGAGCCUGAAGGUGGACCCUAGAGCUUGA [서열번호 85]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 85에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 선천적 신호전달 경로의 억제제는 STING의 활성을 차단하고 또한 선천적 감지 캐스케이드의 핵심인 STING-베타(진뱅크: MF360993.1; UniProtKB - A0A3G1PSE3(A0A3G1PSE3_HUMAN)), 또는 그의 동원체이다. [Wang PH, et al. A novel transcript isoform of STING that sequesters cGAMP and dominantly inhibits innate nucleic acid sensing. Nucleic Acids Res. 2018 May 4;46(8):4054-4071. doi: 10.1093/nar/gky186]. STING은 IRF3 활성화를 유도하는 dsRNA 인식의 하류의 경로에 관여된다. 상기 STING-베타의 한 구현예는 본 명세서에서 다음과 같은 서열번호 86으로 나타낸다:
MTWVSLLNQVGDRVSRNNFLGFPASELQARIRTYNQHYNNLLRGAVSQRLYILLPLDCGVPDNLSMADPNIRFLDKLPQQTGDHAGIKDRVYSNSIYELLENGQRAGTCVLEYATPLQTLFAMSQYSQAGFSREDRLEQAKLFCRTLEDILADAPESQNNCRLIAYQEPADDSSFSLSQEVLRHLRQEEKEEVTVGSLKTSAVPSTSTMSQEPELLISGMEKPLPLRTDFS [서열번호 86]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 86에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 STING-베타 폴리펩티드는 다음과 같은 서열번호 87의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGACCTGGGTCTCACTCCTGAATCAGGTGGGAGATAGGGTTAGCAGGAATAACTTCTTGGGCTTCCCTGCCTCAGAGCTCCAGGCCCGGATTCGAACTTACAATCAGCATTACAACAACCTGCTACGGGGTGCAGTGAGCCAGCGGCTGTATATTCTCCTCCCATTGGACTGTGGGGTGCCTGATAACCTGAGTATGGCTGACCCCAACATTCGCTTCCTGGATAAACTGCCCCAGCAGACCGGTGACCATGCTGGCATCAAGGATCGGGTTTACAGCAACAGCATCTATGAGCTTCTGGAGAACGGGCAGCGGGCGGGCACCTGTGTCCTGGAGTACGCCACCCCCTTGCAGACTTTGTTTGCCATGTCACAATACAGTCAAGCTGGCTTTAGCCGGGAGGATAGGCTTGAGCAGGCCAAACTCTTCTGCCGGACACTTGAGGACATCCTGGCAGATGCCCCTGAGTCTCAGAACAACTGCCGCCTCATTGCCTACCAGGAACCTGCAGATGACAGCAGCTTCTCGCTGTCCCAGGAGGTTCTCCGGCACCTGCGGCAGGAGGAAAAGGAAGAGGTTACTGTGGGCAGCTTGAAGACCTCAGCGGTGCCCAGTACCTCCACGATGTCCCAAGAGCCTGAGCTCCTCATCAGTGGAATGGAAAAGCCCCTCCCTCTCCGCACGGATTTCTCT [서열번호 87]
따라서, 바람직하게는 상기 STING-베타 폴리펩티드는 실질적으로 서열번호 87에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 88의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGACCUGGGUCUCACUCCUGAAUCAGGUGGGAGAUAGGGUUAGCAGGAAUAACUUCUUGGGCUUCCCUGCCUCAGAGCUCCAGGCCCGGAUUCGAACUUACAAUCAGCAUUACAACAACCUGCUACGGGGUGCAGUGAGCCAGCGGCUGUAUAUUCUCCUCCCAUUGGACUGUGGGGUGCCUGAUAACCUGAGUAUGGCUGACCCCAACAUUCGCUUCCUGGAUAAACUGCCCCAGCAGACCGGUGACCAUGCUGGCAUCAAGGAUCGGGUUUACAGCAACAGCAUCUAUGAGCUUCUGGAGAACGGGCAGCGGGCGGGCACCUGUGUCCUGGAGUACGCCACCCCCUUGCAGACUUUGUUUGCCAUGUCACAAUACAGUCAAGCUGGCUUUAGCCGGGAGGAUAGGCUUGAGCAGGCCAAACUCUUCUGCCGGACACUUGAGGACAUCCUGGCAGAUGCCCCUGAGUCUCAGAACAACUGCCGCCUCAUUGCCUACCAGGAACCUGCAGAUGACAGCAGCUUCUCGCUGUCCCAGGAGGUUCUCCGGCACCUGCGGCAGGAGGAAAAGGAAGAGGUUACUGUGGGCAGCUUGAAGACCUCAGCGGUGCCCAGUACCUCCACGAUGUCCCAAGAGCCUGAGCUCCUCAUCAGUGGAAUGGAAAAGCCCCUCCCUCUCCGCACGGAUUUCUCU [서열번호 88]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 88에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 88의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 89로 제공된다:
ATGACATGGGTGTCCCTGCTGAATCAAGTGGGCGACAGAGTGTCCCGGAACAACTTCCTGGGATTCCCTGCCAGCGAACTGCAGGCCAGAATCCGGACCTACAACCAGCACTACAACAACCTGCTGAGAGGCGCCGTGTCTCAGCGGCTGTATATTCTGCTGCCTCTGGATTGCGGCGTGCCCGACAATCTGTCTATGGCCGATCCTAATATCCGGTTCCTGGACAAGCTGCCCCAGCAGACAGGCGATCACGCCGGCATTAAGGACCGGGTGTACAGCAACAGCATCTACGAGCTGCTGGAAAACGGCCAGCGAGCCGGAACATGCGTGCTGGAATATGCCACACCTCTGCAGACCCTGTTCGCCATGAGCCAGTATAGCCAGGCCGGCTTCAGCAGAGAGGACAGACTGGAACAGGCCAAGCTGTTCTGCCGGACACTGGAAGATATCCTGGCCGACGCTCCTGAGAGCCAGAACAACTGTAGACTGATCGCCTACCAAGAGCCTGCCGACGACAGCAGCTTTAGCCTGTCTCAAGAGGTGCTGCGGCACCTGAGACAAGAGGAAAAAGAGGAAGTCACCGTCGGCAGCCTGAAAACCTCTGCCGTGCCTAGCACCAGCACCATGAGTCAAGAACCTGAGCTGCTGATCTCCGGCATGGAAAAGCCCCTGCCTCTGAGAACCGACTTCAGCTGA [서열번호 89]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 89에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 89의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 90으로 제공된다:
AUGACAUGGGUGUCCCUGCUGAAUCAAGUGGGCGACAGAGUGUCCCGGAACAACUUCCUGGGAUUCCCUGCCAGCGAACUGCAGGCCAGAAUCCGGACCUACAACCAGCACUACAACAACCUGCUGAGAGGCGCCGUGUCUCAGCGGCUGUAUAUUCUGCUGCCUCUGGAUUGCGGCGUGCCCGACAAUCUGUCUAUGGCCGAUCCUAAUAUCCGGUUCCUGGACAAGCUGCCCCAGCAGACAGGCGAUCACGCCGGCAUUAAGGACCGGGUGUACAGCAACAGCAUCUACGAGCUGCUGGAAAACGGCCAGCGAGCCGGAACAUGCGUGCUGGAAUAUGCCACACCUCUGCAGACCCUGUUCGCCAUGAGCCAGUAUAGCCAGGCCGGCUUCAGCAGAGAGGACAGACUGGAACAGGCCAAGCUGUUCUGCCGGACACUGGAAGAUAUCCUGGCCGACGCUCCUGAGAGCCAGAACAACUGUAGACUGAUCGCCUACCAAGAGCCUGCCGACGACAGCAGCUUUAGCCUGUCUCAAGAGGUGCUGCGGCACCUGAGACAAGAGGAAAAAGAGGAAGUCACCGUCGGCAGCCUGAAAACCUCUGCCGUGCCUAGCACCAGCACCAUGAGUCAAGAACCUGAGCUGCUGAUCUCCGGCAUGGAAAAGCCCCUGCCUCUGAGAACCGACUUCAGCUGA [서열번호 90]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 90에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 선천적 신호전달 경로의 억제제는 IFN-베타 전사의 TLR3 유도를 억제하는 절단된 또는 우성 음성 작용 형태인 A20 또는 TNFAIP3_HUMAN(NCBI 참조 서열: NM_006290.4; UniProtKB - P21580(TNAP3_HUMAN)), 또는 그의 동원체이다. [Saitoh T, et al. A20 is a negative regulator of IFN regulatory factor 3 signaling. J Immunol. 2005 Feb 1;174(3):1507-12. doi:10.4049/jimmunol.174.3.1507]. 상기 A20 또는 TNFAIP3_HUMAN의 한 구현예는 본 명세서에서 다음과 같은 서열번호 91로 나타낸다:
AQNPMEPSVPQLSLMDVKCETPNCPFFMSVNTQPLCHECSERRQKNQNKLPKLNSKPGPEGLPGMALGASRGEAYEPLAWNPEESTGGPHSAPPTAPSPFLFSETTAMKCRSPGCPFTLNVQHNGFCERCHNARQLHASHAPDHTRHLDPGKCQACLQDVTRTFNGICSTCFKRTTAEASSSLSTSLPPSCHQRSKSDPSRLVRSPSPHSCHRAGNDAPAGCLSQAARTPGDRTGTSKCRKAGCVYFGTPENKGFCTLCFIEYRENKHFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQNQRFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPNQRMGPGAHRGEPAPEDPPKQRCRAPACDHFGNAKC [서열번호 91]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 91에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 절단된 또는 우성 음성 작용 형태 폴리펩티드인 A20(369-775) 또는 TNFAIP3_HUMAN은 다음과 같은 서열번호 92의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GCCCAGAATCCCATGGAACCTTCCGTGCCCCAGCTTTCTCTCATGGATGTAAAATGTGAAACGCCCAACTGCCCCTTCTTCATGTCTGTGAACACCCAGCCTTTATGCCATGAGTGCTCAGAGAGGCGGCAAAAGAATCAAAACAAACTCCCAAAGCTGAACTCCAAGCCGGGCCCTGAGGGGCTCCCTGGCATGGCGCTCGGGGCCTCTCGGGGAGAAGCCTATGAGCCCTTGGCGTGGAACCCTGAGGAGTCCACTGGGGGGCCTCATTCGGCCCCACCGACAGCACCCAGCCCTTTTCTGTTCAGTGAGACCACTGCCATGAAGTGCAGGAGCCCCGGCTGCCCCTTCACACTGAATGTGCAGCACAACGGATTTTGTGAACGTTGCCACAACGCCCGGCAACTTCACGCCAGCCACGCCCCAGACCACACAAGGCACTTGGATCCCGGGAAGTGCCAAGCCTGCCTCCAGGATGTTACCAGGACATTTAATGGGATCTGCAGTACTTGCTTCAAAAGGACTACAGCAGAGGCCTCCTCCAGCCTCAGCACCAGCCTCCCTCCTTCCTGTCACCAGCGTTCCAAGTCAGATCCCTCGCGGCTCGTCCGGAGCCCCTCCCCGCATTCTTGCCACAGAGCTGGAAACGACGCCCCTGCTGGCTGCCTGTCTCAAGCTGCACGGACTCCTGGGGACAGGACGGGGACGAGCAAGTGCAGAAAAGCCGGCTGCGTGTATTTTGGGACTCCAGAAAACAAGGGCTTTTGCACACTGTGTTTCATCGAGTACAGAGAAAACAAACATTTTGCTGCTGCCTCAGGGAAAGTCAGTCCCACAGCGTCCAGGTTCCAGAACACCATTCCGTGCCTGGGGAGGGAATGCGGCACCCTTGGAAGCACCATGTTTGAAGGATACTGCCAGAAGTGTTTCATTGAAGCTCAGAATCAGAGATTTCATGAGGCCAAAAGGACAGAAGAGCAACTGAGATCGAGCCAGCGCAGAGATGTGCCTCGAACCACACAAAGCACCTCAAGGCCCAAGTGCGCCCGGGCCTCCTGCAAGAACATCCTGGCCTGCCGCAGCGAGGAGCTCTGCATGGAGTGTCAGCATCCCAACCAGAGGATGGGCCCTGGGGCCCACCGGGGTGAGCCTGCCCCCGAAGACCCCCCCAAGCAGCGTTGCCGGGCCCCCGCCTGTGATCATTTTGGCAATGCCAAGTGC [서열번호 92]
따라서, 바람직하게는 절단된 또는 우성 음성 작용 형태 폴리펩티드인 A20 또는 TNFAIP3_HUMAN은 실질적으로 서열번호 92에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 93의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GCCCAGAAUCCCAUGGAACCUUCCGUGCCCCAGCUUUCUCUCAUGGAUGUAAAAUGUGAAACGCCCAACUGCCCCUUCUUCAUGUCUGUGAACACCCAGCCUUUAUGCCAUGAGUGCUCAGAGAGGCGGCAAAAGAAUCAAAACAAACUCCCAAAGCUGAACUCCAAGCCGGGCCCUGAGGGGCUCCCUGGCAUGGCGCUCGGGGCCUCUCGGGGAGAAGCCUAUGAGCCCUUGGCGUGGAACCCUGAGGAGUCCACUGGGGGGCCUCAUUCGGCCCCACCGACAGCACCCAGCCCUUUUCUGUUCAGUGAGACCACUGCCAUGAAGUGCAGGAGCCCCGGCUGCCCCUUCACACUGAAUGUGCAGCACAACGGAUUUUGUGAACGUUGCCACAACGCCCGGCAACUUCACGCCAGCCACGCCCCAGACCACACAAGGCACUUGGAUCCCGGGAAGUGCCAAGCCUGCCUCCAGGAUGUUACCAGGACAUUUAAUGGGAUCUGCAGUACUUGCUUCAAAAGGACUACAGCAGAGGCCUCCUCCAGCCUCAGCACCAGCCUCCCUCCUUCCUGUCACCAGCGUUCCAAGUCAGAUCCCUCGCGGCUCGUCCGGAGCCCCUCCCCGCAUUCUUGCCACAGAGCUGGAAACGACGCCCCUGCUGGCUGCCUGUCUCAAGCUGCACGGACUCCUGGGGACAGGACGGGGACGAGCAAGUGCAGAAAAGCCGGCUGCGUGUAUUUUGGGACUCCAGAAAACAAGGGCUUUUGCACACUGUGUUUCAUCGAGUACAGAGAAAACAAACAUUUUGCUGCUGCCUCAGGGAAAGUCAGUCCCACAGCGUCCAGGUUCCAGAACACCAUUCCGUGCCUGGGGAGGGAAUGCGGCACCCUUGGAAGCACCAUGUUUGAAGGAUACUGCCAGAAGUGUUUCAUUGAAGCUCAGAAUCAGAGAUUUCAUGAGGCCAAAAGGACAGAAGAGCAACUGAGAUCGAGCCAGCGCAGAGAUGUGCCUCGAACCACACAAAGCACCUCAAGGCCCAAGUGCGCCCGGGCCUCCUGCAAGAACAUCCUGGCCUGCCGCAGCGAGGAGCUCUGCAUGGAGUGUCAGCAUCCCAACCAGAGGAUGGGCCCUGGGGCCCACCGGGGUGAGCCUGCCCCCGAAGACCCCCCCAAGCAGCGUUGCCGGGCCCCCGCCUGUGAUCAUUUUGGCAAUGCCAAGUGC [서열번호 93]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 93에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 91의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 94로 제공된다:
ATGGCCCAGAATCCTATGGAACCTAGCGTGCCCCAGCTGAGCCTGATGGACGTGAAGTGCGAAACCCCTAACTGCCCCTTCTTCATGTCCGTGAACACCCAGCCTCTGTGCCACGAGTGTAGCGAGCGGAGACAGAAGAACCAGAACAAGCTGCCCAAGCTGAACAGCAAGCCCGGACCTGAAGGACTGCCTGGAATGGCTCTGGGAGCTTCTAGAGGCGAGGCCTATGAACCCCTGGCCTGGAATCCTGAGGAAAGCACAGGCGGACCTCACAGCGCTCCTCCAACAGCACCTTCTCCATTTCTGTTCAGCGAGACAACCGCCATGAAGTGCAGAAGCCCTGGCTGCCCTTTCACACTGAACGTGCAGCACAACGGCTTTTGCGAGAGATGCCACAACGCCAGACAGCTGCACGCTTCTCACGCCCCTGATCACACCAGACACCTGGATCCTGGAAAGTGCCAGGCCTGCCTGCAGGATGTGACCAGAACCTTCAACGGCATCTGCAGCACCTGTTTCAAGCGGACAACAGCCGAGGCCAGCAGCAGCCTGTCTACATCTCTGCCTCCAAGCTGCCACCAGCGGAGCAAGAGCGATCCTTCTAGACTTGTGCGGAGCCCCTCTCCTCACTCCTGTCACAGAGCCGGAAATGATGCCCCTGCCGGATGTCTGTCTCAGGCCGCTAGAACACCTGGCGATAGAACCGGCACCAGCAAGTGTAGAAAGGCCGGCTGCGTGTACTTCGGCACCCCTGAGAACAAGGGATTCTGCACCCTGTGCTTCATCGAGTACAGAGAGAACAAGCACTTCGCCGCTGCCTCCGGAAAGGTGTCACCTACCGCTAGCCGGTTCCAGAACACAATCCCTTGCCTGGGCAGAGAGTGTGGCACACTGGGCAGCACAATGTTCGAGGGCTACTGCCAGAAGTGCTTTATCGAGGCCCAGAACCAGCGGTTCCACGAGGCCAAGAGAACCGAGGAACAGCTGAGAAGCAGCCAGAGAAGGGACGTGCCCAGAACAACCCAGAGCACCAGCAGACCTAAGTGCGCCAGAGCCAGCTGCAAGAACATCCTGGCCTGTCGGAGCGAGGAACTGTGCATGGAATGCCAGCATCCTAACCAGAGAATGGGCCCTGGCGCTCACAGAGGCGAACCTGCTCCAGAAGATCCTCCTAAGCAGCGGTGTAGAGCCCCTGCCTGTGACCACTTTGGCAACGCCAAGTGCTGA [서열번호 94]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 94에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 94의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 95로 제공된다:
AUGGCCCAGAAUCCUAUGGAACCUAGCGUGCCCCAGCUGAGCCUGAUGGACGUGAAGUGCGAAACCCCUAACUGCCCCUUCUUCAUGUCCGUGAACACCCAGCCUCUGUGCCACGAGUGUAGCGAGCGGAGACAGAAGAACCAGAACAAGCUGCCCAAGCUGAACAGCAAGCCCGGACCUGAAGGACUGCCUGGAAUGGCUCUGGGAGCUUCUAGAGGCGAGGCCUAUGAACCCCUGGCCUGGAAUCCUGAGGAAAGCACAGGCGGACCUCACAGCGCUCCUCCAACAGCACCUUCUCCAUUUCUGUUCAGCGAGACAACCGCCAUGAAGUGCAGAAGCCCUGGCUGCCCUUUCACACUGAACGUGCAGCACAACGGCUUUUGCGAGAGAUGCCACAACGCCAGACAGCUGCACGCUUCUCACGCCCCUGAUCACACCAGACACCUGGAUCCUGGAAAGUGCCAGGCCUGCCUGCAGGAUGUGACCAGAACCUUCAACGGCAUCUGCAGCACCUGUUUCAAGCGGACAACAGCCGAGGCCAGCAGCAGCCUGUCUACAUCUCUGCCUCCAAGCUGCCACCAGCGGAGCAAGAGCGAUCCUUCUAGACUUGUGCGGAGCCCCUCUCCUCACUCCUGUCACAGAGCCGGAAAUGAUGCCCCUGCCGGAUGUCUGUCUCAGGCCGCUAGAACACCUGGCGAUAGAACCGGCACCAGCAAGUGUAGAAAGGCCGGCUGCGUGUACUUCGGCACCCCUGAGAACAAGGGAUUCUGCACCCUGUGCUUCAUCGAGUACAGAGAGAACAAGCACUUCGCCGCUGCCUCCGGAAAGGUGUCACCUACCGCUAGCCGGUUCCAGAACACAAUCCCUUGCCUGGGCAGAGAGUGUGGCACACUGGGCAGCACAAUGUUCGAGGGCUACUGCCAGAAGUGCUUUAUCGAGGCCCAGAACCAGCGGUUCCACGAGGCCAAGAGAACCGAGGAACAGCUGAGAAGCAGCCAGAGAAGGGACGUGCCCAGAACAACCCAGAGCACCAGCAGACCUAAGUGCGCCAGAGCCAGCUGCAAGAACAUCCUGGCCUGUCGGAGCGAGGAACUGUGCAUGGAAUGCCAGCAUCCUAACCAGAGAAUGGGCCCUGGCGCUCACAGAGGCGAACCUGCUCCAGAAGAUCCUCCUAAGCAGCGGUGUAGAGCCCCUGCCUGUGACCACUUUGGCAACGCCAAGUGCUGA [서열번호 95]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 95에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 선천적 신호전달 경로의 억제제, 그의 절단된 또는 우성 음성 작용 형태는 NF-kB 활성화를 방지하는 A20의 더 작은 단편(606-790), NCBI 참조 서열: NM_006290.4; UniProtKB - P21580(TNAP3_HUMAN), 또는 그의 동원체이다. 상기 A20의 더 작은 단편의 한 구현예는 본 명세서에서 다음과 같은 서열번호 96으로 나타낸다:
KCRKAGCVYFGTPENKGFCTLCFIEYRENKHFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQNQRFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPNQRMGPGAHRGEPAPEDPPKQRCRAPACDHFGNAKCNGYCNECFQFKQMYG [서열번호 96]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 96에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 A20의 더 작은 단편 폴리펩티드는 다음과 같은 서열번호 97의 DNA 뉴클레오티드 서열에 의해 암호화된다:
AAGTGCAGAAAAGCCGGCTGCGTGTATTTTGGGACTCCAGAAAACAAGGGCTTTTGCACACTGTGTTTCATCGAGTACAGAGAAAACAAACATTTTGCTGCTGCCTCAGGGAAAGTCAGTCCCACAGCGTCCAGGTTCCAGAACACCATTCCGTGCCTGGGGAGGGAATGCGGCACCCTTGGAAGCACCATGTTTGAAGGATACTGCCAGAAGTGTTTCATTGAAGCTCAGAATCAGAGATTTCATGAGGCCAAAAGGACAGAAGAGCAACTGAGATCGAGCCAGCGCAGAGATGTGCCTCGAACCACACAAAGCACCTCAAGGCCCAAGTGCGCCCGGGCCTCCTGCAAGAACATCCTGGCCTGCCGCAGCGAGGAGCTCTGCATGGAGTGTCAGCATCCCAACCAGAGGATGGGCCCTGGGGCCCACCGGGGTGAGCCTGCCCCCGAAGACCCCCCCAAGCAGCGTTGCCGGGCCCCCGCCTGTGATCATTTTGGCAATGCCAAGTGCAACGGCTACTGCAACGAATGCTTTCAGTTCAAGCAGATGTATGGC [서열번호 97]
따라서, 바람직하게는 상기 A20의 더 작은 단편 폴리펩티드는 실질적으로 서열번호 97에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 98의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AAGUGCAGAAAAGCCGGCUGCGUGUAUUUUGGGACUCCAGAAAACAAGGGCUUUUGCACACUGUGUUUCAUCGAGUACAGAGAAAACAAACAUUUUGCUGCUGCCUCAGGGAAAGUCAGUCCCACAGCGUCCAGGUUCCAGAACACCAUUCCGUGCCUGGGGAGGGAAUGCGGCACCCUUGGAAGCACCAUGUUUGAAGGAUACUGCCAGAAGUGUUUCAUUGAAGCUCAGAAUCAGAGAUUUCAUGAGGCCAAAAGGACAGAAGAGCAACUGAGAUCGAGCCAGCGCAGAGAUGUGCCUCGAACCACACAAAGCACCUCAAGGCCCAAGUGCGCCCGGGCCUCCUGCAAGAACAUCCUGGCCUGCCGCAGCGAGGAGCUCUGCAUGGAGUGUCAGCAUCCCAACCAGAGGAUGGGCCCUGGGGCCCACCGGGGUGAGCCUGCCCCCGAAGACCCCCCCAAGCAGCGUUGCCGGGCCCCCGCCUGUGAUCAUUUUGGCAAUGCCAAGUGCAACGGCUACUGCAACGAAUGCUUUCAGUUCAAGCAGAUGUAUGGC [서열번호 98]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 98에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 96의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 99로 제공된다:
ATGAAGTGCAGAAAGGCCGGCTGCGTGTACTTCGGCACCCCTGAGAACAAGGGCTTCTGCACCCTGTGCTTCATCGAGTACAGAGAGAACAAGCACTTCGCTGCCGCCAGCGGAAAGGTGTCACCTACCGCCAGCAGATTCCAGAACACAATCCCCTGCCTGGGCAGAGAGTGTGGCACACTGGGCAGCACAATGTTCGAGGGCTACTGCCAGAAGTGCTTTATCGAGGCCCAGAACCAGCGGTTCCACGAGGCCAAGAGAACCGAGGAACAGCTGAGAAGCAGCCAGAGAAGGGACGTGCCCAGAACAACCCAGAGCACCAGCAGACCTAAGTGCGCCAGAGCCAGCTGCAAGAACATCCTGGCCTGCAGATCCGAGGAACTGTGCATGGAATGCCAGCATCCTAACCAGAGAATGGGCCCTGGCGCTCACAGAGGCGAACCTGCTCCAGAAGATCCTCCTAAGCAGCGGTGTAGAGCCCCAGCCTGTGACCACTTTGGCAACGCCAAGTGCAACGGCTACTGCAACGAGTGCTTCCAGTTCAAGCAGATGTACGGCTGA [서열번호 99]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 99에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 99의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 100으로 제공된다:
AUGAAGUGCAGAAAGGCCGGCUGCGUGUACUUCGGCACCCCUGAGAACAAGGGCUUCUGCACCCUGUGCUUCAUCGAGUACAGAGAGAACAAGCACUUCGCUGCCGCCAGCGGAAAGGUGUCACCUACCGCCAGCAGAUUCCAGAACACAAUCCCCUGCCUGGGCAGAGAGUGUGGCACACUGGGCAGCACAAUGUUCGAGGGCUACUGCCAGAAGUGCUUUAUCGAGGCCCAGAACCAGCGGUUCCACGAGGCCAAGAGAACCGAGGAACAGCUGAGAAGCAGCCAGAGAAGGGACGUGCCCAGAACAACCCAGAGCACCAGCAGACCUAAGUGCGCCAGAGCCAGCUGCAAGAACAUCCUGGCCUGCAGAUCCGAGGAACUGUGCAUGGAAUGCCAGCAUCCUAACCAGAGAAUGGGCCCUGGCGCUCACAGAGGCGAACCUGCUCCAGAAGAUCCUCCUAAGCAGCGGUGUAGAGCCCCAGCCUGUGACCACUUUGGCAACGCCAAGUGCAACGGCUACUGCAACGAGUGCUUCCAGUUCAAGCAGAUGUACGGCUGA [서열번호 100]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 100에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 선천적 신호전달 경로의 억제제/우성 음성 이펙터(effector)는 MFN2 완전 폴리펩티드(MFN2(1-757)), 또는 그의 절단된 버전(NCBI 참조 서열: NM_001127660.2; UniProtKB - O95140(MFN2_HUMAN)), 또는 그의 동원체이다. [Yasukawa K, Oshiumi H, Takeda M, Ishihara N, Yanagi Y, Seya T, Kawabata S, Koshiba T. Mitofusin 2 inhibits mitochondrial antiviral signaling. Sci Signal. 2009 Aug 18;2(84):ra47. doi: 10.1126/scisignal.2000287. PMID: 19690333).
상기MFN2 폴리펩티드(MFN2(369-598))의 한 구현예는 본 명세서에서 다음과 같은 서열번호 242로 나타낸다:
EAVRLIMDSLHMAAREQQVYCEEMREERQDRLKFIDKQLELLAQDYKLRIKQITEEVERQVSTAMAEEIRRLSVLVDDYQMDFHPSPVVLKVYKNELHRHIEEGLGRNMSDRCSTAITNSLQTMQQDMIDGLKPLLPVSVRSQIDMLVPRQCFSLNYDLNCDKLCADFQEDIEFHFSLGWTMLVNRFLGPKNSRRALMGYNDQVQRPIPLTPANPSMPPLPQGSLTQEE [서열번호 242]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 242에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 MFN2 폴리펩티드는 다음과 같은 서열번호 243의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GAGGCGGTTCGACTCATCATGGACTCCCTGCACATGGCGGCTCGGGAGCAGCAGGTTTACTGCGAGGAAATGCGTGAAGAGCGGCAAGACCGACTGAAATTTATTGACAAACAGCTGGAGCTCTTGGCTCAAGACTATAAGCTGCGAATTAAGCAGATTACGGAGGAAGTGGAGAGGCAGGTGTCGACTGCAATGGCCGAGGAGATCAGGCGCCTCTCTGTACTGGTGGACGATTACCAGATGGACTTCCACCCTTCTCCAGTAGTCCTCAAGGTTTATAAGAATGAGCTGCACCGCCACATAGAGGAAGGACTGGGTCGAAACATGTCTGACCGCTGCTCCACGGCCATCACCAACTCCCTGCAGACCATGCAGCAGGACATGATAGATGGCTTGAAACCCCTCCTTCCTGTGTCTGTGCGGAGTCAGATAGACATGCTGGTCCCACGCCAGTGCTTCTCCCTCAACTATGACCTAAACTGTGACAAGCTGTGTGCTGACTTCCAGGAAGACATTGAGTTCCATTTCTCTCTCGGATGGACCATGCTGGTGAATAGGTTCCTGGGCCCCAAGAACAGCCGTCGGGCCTTGATGGGCTACAATGACCAGGTCCAGCGTCCCATCCCTCTGACGCCAGCCAACCCCAGCATGCCCCCACTGCCACAGGGCTCGCTCACCCAGGAGGAG [서열번호 243]
따라서, 바람직하게는 상기 MFN2 폴리펩티드는 실질적으로 서열번호 243에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 244의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GAGGCGGUUCGACUCAUCAUGGACUCCCUGCACAUGGCGGCUCGGGAGCAGCAGGUUUACUGCGAGGAAAUGCGUGAAGAGCGGCAAGACCGACUGAAAUUUAUUGACAAACAGCUGGAGCUCUUGGCUCAAGACUAUAAGCUGCGAAUUAAGCAGAUUACGGAGGAAGUGGAGAGGCAGGUGUCGACUGCAAUGGCCGAGGAGAUCAGGCGCCUCUCUGUACUGGUGGACGAUUACCAGAUGGACUUCCACCCUUCUCCAGUAGUCCUCAAGGUUUAUAAGAAUGAGCUGCACCGCCACAUAGAGGAAGGACUGGGUCGAAACAUGUCUGACCGCUGCUCCACGGCCAUCACCAACUCCCUGCAGACCAUGCAGCAGGACAUGAUAGAUGGCUUGAAACCCCUCCUUCCUGUGUCUGUGCGGAGUCAGAUAGACAUGCUGGUCCCACGCCAGUGCUUCUCCCUCAACUAUGACCUAAACUGUGACAAGCUGUGUGCUGACUUCCAGGAAGACAUUGAGUUCCAUUUCUCUCUCGGAUGGACCAUGCUGGUGAAUAGGUUCCUGGGCCCCAAGAACAGCCGUCGGGCCUUGAUGGGCUACAAUGACCAGGUCCAGCGUCCCAUCCCUCUGACGCCAGCCAACCCCAGCAUGCCCCCACUGCCACAGGGCUCGCUCACCCAGGAGGAG [서열번호 244]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 244에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 242의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 245로 제공된다:
ATGGAGGCCGTCAGACTGATCATGGACAGCCTGCATATGGCCGCCAGAGAGCAGCAGGTCTACTGCGAGGAAATGCGGGAAGAGAGACAGGACCGGCTGAAGTTCATCGACAAGCAGCTGGAACTGCTGGCCCAGGACTACAAGCTGCGGATCAAGCAGATCACCGAAGAGGTGGAAAGACAGGTGTCCACCGCCATGGCCGAGGAAATCAGACGACTGAGCGTGCTGGTGGACGACTACCAGATGGACTTTCACCCCTCTCCAGTGGTGCTGAAGGTGTACAAGAACGAGCTGCACCGGCACATCGAGGAAGGCCTGGGCAGAAACATGAGCGACAGATGCAGCACCGCCATCACCAATAGCCTGCAGACCATGCAGCAGGACATGATCGACGGCCTGAAACCTCTGCTGCCTGTGTCCGTCAGATCCCAGATCGACATGCTGGTGCCCAGACAGTGCTTCAGCCTGAACTACGACCTGAACTGCGACAAGCTGTGCGCCGACTTCCAAGAGGACATCGAGTTCCACTTCAGCCTCGGCTGGACAATGCTGGTCAACAGATTTCTGGGCCCCAAGAACAGCAGACGGGCCCTGATGGGCTACAACGATCAGGTGCAGAGGCCCATTCCTCTGACACCCGCCAATCCTAGCATGCCTCCACTGCCTCAGGGCAGCCTGACACAAGAAGAATGA [서열번호 245]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 245에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 245의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 246으로 제공된다:
AUGGAGGCCGUCAGACUGAUCAUGGACAGCCUGCAUAUGGCCGCCAGAGAGCAGCAGGUCUACUGCGAGGAAAUGCGGGAAGAGAGACAGGACCGGCUGAAGUUCAUCGACAAGCAGCUGGAACUGCUGGCCCAGGACUACAAGCUGCGGAUCAAGCAGAUCACCGAAGAGGUGGAAAGACAGGUGUCCACCGCCAUGGCCGAGGAAAUCAGACGACUGAGCGUGCUGGUGGACGACUACCAGAUGGACUUUCACCCCUCUCCAGUGGUGCUGAAGGUGUACAAGAACGAGCUGCACCGGCACAUCGAGGAAGGCCUGGGCAGAAACAUGAGCGACAGAUGCAGCACCGCCAUCACCAAUAGCCUGCAGACCAUGCAGCAGGACAUGAUCGACGGCCUGAAACCUCUGCUGCCUGUGUCCGUCAGAUCCCAGAUCGACAUGCUGGUGCCCAGACAGUGCUUCAGCCUGAACUACGACCUGAACUGCGACAAGCUGUGCGCCGACUUCCAAGAGGACAUCGAGUUCCACUUCAGCCUCGGCUGGACAAUGCUGGUCAACAGAUUUCUGGGCCCCAAGAACAGCAGACGGGCCCUGAUGGGCUACAACGAUCAGGUGCAGAGGCCCAUUCCUCUGACACCCGCCAAUCCUAGCAUGCCUCCACUGCCUCAGGGCAGCCUGACACAAGAAGAAUGA [서열번호 246]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 246에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
상기 절단된 MFN2(MFN2(369-490))의 한 구현예는 본 명세서에서 다음과 같은 서열번호 101로 나타낸다:
EAVRLIMDSLHMAAREQQVYCEEMREERQDRLKFIDKQLELLAQDYKLRIKQITEEVERQVSTAMAEEIRRLSVLVDDYQMDFHPSPVVLKVYKNELHRHIEEGLGRNMSDRCSTAITNSLQTMQQDMIDG [서열번호 101]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 101에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 절단된 MFN2 폴리펩티드는 다음과 같은 서열번호 102의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GAGGCGGTTCGACTCATCATGGACTCCCTGCACATGGCGGCTCGGGAGCAGCAGGTTTACTGCGAGGAAATGCGTGAAGAGCGGCAAGACCGACTGAAATTTATTGACAAACAGCTGGAGCTCTTGGCTCAAGACTATAAGCTGCGAATTAAGCAGATTACGGAGGAAGTGGAGAGGCAGGTGTCGACTGCAATGGCCGAGGAGATCAGGCGCCTCTCTGTACTGGTGGACGATTACCAGATGGACTTCCACCCTTCTCCAGTAGTCCTCAAGGTTTATAAGAATGAGCTGCACCGCCACATAGAGGAAGGACTGGGTCGAAACATGTCTGACCGCTGCTCCACGGCCATCACCAACTCCCTGCAGACCATGCAGCAGGACATGATAGATGGC [서열번호 102]
따라서, 바람직하게는 상기 절단된 MFN2 폴리펩티드는 실질적으로 서열번호 102에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 103의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GAGGCGGUUCGACUCAUCAUGGACUCCCUGCACAUGGCGGCUCGGGAGCAGCAGGUUUACUGCGAGGAAAUGCGUGAAGAGCGGCAAGACCGACUGAAAUUUAUUGACAAACAGCUGGAGCUCUUGGCUCAAGACUAUAAGCUGCGAAUUAAGCAGAUUACGGAGGAAGUGGAGAGGCAGGUGUCGACUGCAAUGGCCGAGGAGAUCAGGCGCCUCUCUGUACUGGUGGACGAUUACCAGAUGGACUUCCACCCUUCUCCAGUAGUCCUCAAGGUUUAUAAGAAUGAGCUGCACCGCCACAUAGAGGAAGGACUGGGUCGAAACAUGUCUGACCGCUGCUCCACGGCCAUCACCAACUCCCUGCAGACCAUGCAGCAGGACAUGAUAGAUGGC [서열번호 103]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 103에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 101의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 104로 제공된다:
ATGGAAGCCGTGCGGCTGATCATGGACAGCCTGCATATGGCCGCCAGAGAGCAGCAGGTCTACTGCGAGGAAATGCGGGAAGAGAGACAGGACCGGCTGAAGTTCATCGACAAGCAGCTGGAACTGCTGGCCCAGGACTACAAGCTGCGGATCAAGCAGATCACCGAAGAGGTGGAAAGACAGGTGTCCACCGCCATGGCCGAGGAAATCAGACGACTGAGCGTGCTGGTGGACGACTACCAGATGGACTTTCACCCCTCTCCAGTGGTGCTGAAGGTGTACAAGAACGAGCTGCACCGGCACATCGAGGAAGGCCTGGGCAGAAACATGAGCGACAGATGCAGCACCGCCATCACCAATAGCCTGCAGACCATGCAGCAGGACATGATCGACGGCTGA [서열번호 104]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 104에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 104의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 105로 제공된다:
AUGGAAGCCGUGCGGCUGAUCAUGGACAGCCUGCAUAUGGCCGCCAGAGAGCAGCAGGUCUACUGCGAGGAAAUGCGGGAAGAGAGACAGGACCGGCUGAAGUUCAUCGACAAGCAGCUGGAACUGCUGGCCCAGGACUACAAGCUGCGGAUCAAGCAGAUCACCGAAGAGGUGGAAAGACAGGUGUCCACCGCCAUGGCCGAGGAAAUCAGACGACUGAGCGUGCUGGUGGACGACUACCAGAUGGACUUUCACCCCUCUCCAGUGGUGCUGAAGGUGUACAAGAACGAGCUGCACCGGCACAUCGAGGAAGGCCUGGGCAGAAACAUGAGCGACAGAUGCAGCACCGCCAUCACCAAUAGCCUGCAGACCAUGCAGCAGGACAUGAUCGACGGCUGA [서열번호 105]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 105에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 서열번호 101의 MFN2 우성 음성 작용 형태(NCBI 참조 서열: NM_001127660.2; UniProtKB - O95140(MFN2_HUMAN)), 또는 그의 동원체는 이것을 서열번호 106의 아미노산 잔기 400-480 또는 그의 단편 또는 변이체로 줄임으로써 돌연변이될 수 있다.
RLKFIDKQLELLAQDYKLRIKQITEEVERQVSTAMAEEIRRLSVLVDDYQMDFHPSPVVLKVYKNELHRHIEEGLGRNMSD [서열번호 106]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 106에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 절단된 MFN2 폴리펩티드(MFN2(400-480))는 다음과 같은 서열번호 107의 DNA 뉴클레오티드 서열에 의해 암호화된다:
CGACTGAAATTTATTGACAAACAGCTGGAGCTCTTGGCTCAAGACTATAAGCTGCGAATTAAGCAGATTACGGAGGAAGTGGAGAGGCAGGTGTCGACTGCAATGGCCGAGGAGATCAGGCGCCTCTCTGTACTGGTGGACGATTACCAGATGGACTTCCACCCTTCTCCAGTAGTCCTCAAGGTTTATAAGAATGAGCTGCACCGCCACATAGAGGAAGGACTGGGTCGAAACATGTCTGAC [서열번호 107]
따라서, 바람직하게는 상기 절단된 MFN2 폴리펩티드는 실질적으로 서열번호 107에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 108의 RNA 뉴클레오티드 서열을 포함할 수 있다:
CGACUGAAAUUUAUUGACAAACAGCUGGAGCUCUUGGCUCAAGACUAUAAGCUGCGAAUUAAGCAGAUUACGGAGGAAGUGGAGAGGCAGGUGUCGACUGCAAUGGCCGAGGAGAUCAGGCGCCUCUCUGUACUGGUGGACGAUUACCAGAUGGACUUCCACCCUUCUCCAGUAGUCCUCAAGGUUUAUAAGAAUGAGCUGCACCGCCACAUAGAGGAAGGACUGGGUCGAAACAUGUCUGAC [서열번호 108]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 108에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 106의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 109로 제공된다:
ATGCGGCTGAAGTTCATCGACAAGCAGCTGGAACTGCTGGCCCAGGACTACAAGCTGCGGATCAAGCAGATCACCGAAGAGGTGGAAAGACAGGTGTCCACCGCCATGGCCGAGGAAATCAGACGACTGAGCGTGCTGGTGGACGACTACCAGATGGACTTTCACCCCTCTCCAGTGGTGCTGAAGGTGTACAAGAACGAGCTGCACCGGCACATCGAGGAAGGCCTGGGCAGAAACATGAGCGACTGA [서열번호 109]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 109에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 109의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 110으로 제공된다:
AUGCGGCUGAAGUUCAUCGACAAGCAGCUGGAACUGCUGGCCCAGGACUACAAGCUGCGGAUCAAGCAGAUCACCGAAGAGGUGGAAAGACAGGUGUCCACCGCCAUGGCCGAGGAAAUCAGACGACUGAGCGUGCUGGUGGACGACUACCAGAUGGACUUUCACCCCUCUCCAGUGGUGCUGAAGGUGUACAAGAACGAGCUGCACCGGCACAUCGAGGAAGGCCUGGGCAGAAACAUGAGCGACUGA [서열번호 110]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 110에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 may be FAF1 폴리펩티드(접근 번호 - NCBI 참조 서열: NM_007051.3; UniProtKB - Q9UNN5(FAF1_HUMAN)), 또는 그의 절단된 버전 또는 동원체일 수 있다. FAF1은 핵의로의 인터페론 조절 인자 3의 전위(translocation)를 억제하고, IFNβ 생산을 감소시킨다. [Song S, Lee J-J, Kim H-J, Lee J-Y et al. Fas-Associated Factor 1 Negatively Regulates the Antiviral Immune Response by Inhibiting Translocation of Interferon Regulatory Factor 3 to the Nucleus. 2016 Jan 25;36(7):1136-51. doi: 10.1128/ MCB.00744-15). FAF1의 한 구현예는 본 명세서에서 다음과 같은 서열번호 146으로 나타낸다:
MASNMDREMILADFQACTGIENIDEAITLLEQNNWDLVAAINGVIPQENGILQSEYGGETIPGPAFNPASHPASAPTSSSSSAFRPVMPSRQIVERQPRMLDFRVEYRDRNVDVVLEDTCTVGEIKQILENELQIPVSKMLLKGWKTGDVEDSTVLKSLHLPKNNSLYVLTPDLPPPSSSSHAGALQESLNQNFMLIITHREVQREYNLNFSGSSTIQEVKRNVYDLTSIPVRHQLWEGWPTSATDDSMCLAESGLSYPCHRLTVGRRSSPAQTREQSEEQITDVHMVSDSDGDDFEDATEFGVDDGEVFGMASSALRKSPMMPENAENEGDALLQFTAEFSSRYGDCHPVFFIGSLEAAFQEAFYVKARDRKLLAIYLHHDESVLTNVFCSQMLCAESIVSYLSQNFITWAWDLTKDSNRARFLTMCNRHFGSVVAQTIRTQKTDQFPLFLIIMGKRSSNEVLNVIQGNTTVDELMMRLMAAMEIFTAQQQEDIKDEDEREARENVKREQDEAYRLSLEADRAKREAHEREMAEQFRLEQIRKEQEEEREAIRLSLEQALPPEPKEENAEPVSKLRIRTPSGEFLERRFLASNKLQIVFDFVASKGFPWDEYKLLSTFPRRDVTQLDPNKSLLEVKLFPQETLFLEAKE [서열번호 146]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 146에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 FAF1 폴리펩티드는 다음과 같은 서열번호 147의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCGTCCAACATGGACCGGGAGATGATCCTGGCGGATTTTCAGGCATGTACTGGCATTGAAAACATTGACGAAGCTATTACATTGCTTGAACAAAATAATTGGGACTTAGTGGCAGCTATCAATGGTGTAATACCACAGGAAAATGGCATTCTACAAAGTGAATATGGAGGTGAGACCATACCAGGACCTGCATTTAATCCAGCAAGTCATCCAGCTTCAGCTCCTACTTCCTCTTCTTCTTCAGCGTTTCGACCTGTAATGCCATCCAGGCAGATTGTAGAAAGGCAACCTCGGATGCTGGACTTCAGGGTTGAATACAGAGACAGAAATGTTGATGTGGTACTTGAAGACACCTGTACTGTTGGAGAGATTAAACAGATTCTAGAAAATGAACTTCAGATACCTGTGTCCAAAATGCTGTTAAAAGGCTGGAAGACGGGAGATGTGGAAGACAGTACGGTCCTAAAATCTCTACACTTGCCAAAAAACAACAGTCTTTATGTCCTTACACCAGATTTGCCACCACCTTCATCATCTAGTCATGCTGGTGCCCTGCAGGAGTCATTAAATCAAAACTTCATGCTGATCATCACCCACCGAGAAGTCCAGCGGGAGTACAACCTGAACTTCTCAGGAAGCAGTACTATTCAAGAGGTAAAGAGAAATGTGTATGACCTTACAAGTATCCCCGTTCGCCACCAATTATGGGAGGGCTGGCCAACTTCTGCTACAGACGACTCAATGTGTCTTGCTGAATCAGGGCTCTCTTATCCCTGCCATCGACTTACAGTGGGAAGAAGATCTTCACCTGCACAGACCCGGGAACAGTCGGAAGAACAAATCACCGATGTTCATATGGTTAGTGATAGCGATGGAGATGACTTTGAAGATGCTACAGAATTTGGGGTGGATGATGGAGAAGTATTTGGCATGGCGTCATCTGCCTTGAGAAAATCTCCAATGATGCCAGAAAACGCAGAAAATGAAGGAGATGCCTTATTACAATTTACAGCAGAGTTTTCTTCAAGATATGGTGATTGCCATCCTGTATTTTTTATTGGCTCATTAGAAGCTGCTTTTCAAGAGGCCTTCTATGTGAAAGCCCGAGATAGAAAGCTTCTTGCTATCTACCTCCACCATGATGAAAGTGTGTTAACCAACGTGTTCTGCTCACAAATGCTTTGTGCTGAATCCATTGTTTCTTATCTGAGTCAAAATTTTATAACCTGGGCTTGGGATCTGACAAAGGACTCCAACAGAGCAAGATTTCTCACTATGTGCAATAGACACTTTGGCAGTGTTGTGGCACAAACCATTCGGACTCAAAAAACGGATCAGTTTCCGCTTTTCCTGATTATTATGGGAAAGCGATCATCTAATGAAGTGTTGAATGTGATACAAGGGAACACAACAGTAGATGAGTTAATGATGAGACTCATGGCTGCAATGGAGATCTTCACAGCCCAACAACAGGAAGATATAAAGGACGAGGATGAACGTGAAGCCAGAGAAAATGTGAAGAGAGAGCAAGATGAGGCCTATCGCCTTTCACTTGAGGCTGACAGAGCAAAGAGGGAAGCTCACGAGAGAGAGATGGCAGAACAGTTTCGTTTGGAGCAGATTCGCAAAGAACAAGAAGAGGAACGTGAGGCCATCCGGCTGTCCTTAGAGCAAGCCCTGCCTCCTGAGCCAAAGGAAGAAAATGCTGAGCCTGTGAGCAAACTGCGGATCCGGACCCCCAGTGGCGAGTTCTTGGAGCGGCGTTTCCTGGCCAGCAACAAGCTCCAGATTGTCTTTGATTTTGTAGCTTCCAAAGGATTTCCATGGGATGAGTACAAGTTACTGAGCACCTTTCCTAGGAGAGACGTAACTCAACTGGACCCAAATAAATCATTATTGGAGGTAAAGTTGTTCCCTCAAGAAACCCTTTTCCTTGAAGCAAAAGAG [서열번호 147]
따라서, 바람직하게는 상기 FAF1 폴리펩티드는 실질적으로 서열번호 147에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 148의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCGUCCAACAUGGACCGGGAGAUGAUCCUGGCGGAUUUUCAGGCAUGUACUGGCAUUGAAAACAUUGACGAAGCUAUUACAUUGCUUGAACAAAAUAAUUGGGACUUAGUGGCAGCUAUCAAUGGUGUAAUACCACAGGAAAAUGGCAUUCUACAAAGUGAAUAUGGAGGUGAGACCAUACCAGGACCUGCAUUUAAUCCAGCAAGUCAUCCAGCUUCAGCUCCUACUUCCUCUUCUUCUUCAGCGUUUCGACCUGUAAUGCCAUCCAGGCAGAUUGUAGAAAGGCAACCUCGGAUGCUGGACUUCAGGGUUGAAUACAGAGACAGAAAUGUUGAUGUGGUACUUGAAGACACCUGUACUGUUGGAGAGAUUAAACAGAUUCUAGAAAAUGAACUUCAGAUACCUGUGUCCAAAAUGCUGUUAAAAGGCUGGAAGACGGGAGAUGUGGAAGACAGUACGGUCCUAAAAUCUCUACACUUGCCAAAAAACAACAGUCUUUAUGUCCUUACACCAGAUUUGCCACCACCUUCAUCAUCUAGUCAUGCUGGUGCCCUGCAGGAGUCAUUAAAUCAAAACUUCAUGCUGAUCAUCACCCACCGAGAAGUCCAGCGGGAGUACAACCUGAACUUCUCAGGAAGCAGUACUAUUCAAGAGGUAAAGAGAAAUGUGUAUGACCUUACAAGUAUCCCCGUUCGCCACCAAUUAUGGGAGGGCUGGCCAACUUCUGCUACAGACGACUCAAUGUGUCUUGCUGAAUCAGGGCUCUCUUAUCCCUGCCAUCGACUUACAGUGGGAAGAAGAUCUUCACCUGCACAGACCCGGGAACAGUCGGAAGAACAAAUCACCGAUGUUCAUAUGGUUAGUGAUAGCGAUGGAGAUGACUUUGAAGAUGCUACAGAAUUUGGGGUGGAUGAUGGAGAAGUAUUUGGCAUGGCGUCAUCUGCCUUGAGAAAAUCUCCAAUGAUGCCAGAAAACGCAGAAAAUGAAGGAGAUGCCUUAUUACAAUUUACAGCAGAGUUUUCUUCAAGAUAUGGUGAUUGCCAUCCUGUAUUUUUUAUUGGCUCAUUAGAAGCUGCUUUUCAAGAGGCCUUCUAUGUGAAAGCCCGAGAUAGAAAGCUUCUUGCUAUCUACCUCCACCAUGAUGAAAGUGUGUUAACCAACGUGUUCUGCUCACAAAUGCUUUGUGCUGAAUCCAUUGUUUCUUAUCUGAGUCAAAAUUUUAUAACCUGGGCUUGGGAUCUGACAAAGGACUCCAACAGAGCAAGAUUUCUCACUAUGUGCAAUAGACACUUUGGCAGUGUUGUGGCACAAACCAUUCGGACUCAAAAAACGGAUCAGUUUCCGCUUUUCCUGAUUAUUAUGGGAAAGCGAUCAUCUAAUGAAGUGUUGAAUGUGAUACAAGGGAACACAACAGUAGAUGAGUUAAUGAUGAGACUCAUGGCUGCAAUGGAGAUCUUCACAGCCCAACAACAGGAAGAUAUAAAGGACGAGGAUGAACGUGAAGCCAGAGAAAAUGUGAAGAGAGAGCAAGAUGAGGCCUAUCGCCUUUCACUUGAGGCUGACAGAGCAAAGAGGGAAGCUCACGAGAGAGAGAUGGCAGAACAGUUUCGUUUGGAGCAGAUUCGCAAAGAACAAGAAGAGGAACGUGAGGCCAUCCGGCUGUCCUUAGAGCAAGCCCUGCCUCCUGAGCCAAAGGAAGAAAAUGCUGAGCCUGUGAGCAAACUGCGGAUCCGGACCCCCAGUGGCGAGUUCUUGGAGCGGCGUUUCCUGGCCAGCAACAAGCUCCAGAUUGUCUUUGAUUUUGUAGCUUCCAAAGGAUUUCCAUGGGAUGAGUACAAGUUACUGAGCACCUUUCCUAGGAGAGACGUAACUCAACUGGACCCAAAUAAAUCAUUAUUGGAGGUAAAGUUGUUCCCUCAAGAAACCCUUUUCCUUGAAGCAAAAGAG [서열번호 148]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 148에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 146의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 149로 제공된다:
ATGGCCAGCAACATGGACAGAGAGATGATCCTGGCCGACTTCCAGGCCTGTACCGGCATCGAGAACATCGACGAGGCCATCACACTGCTGGAACAGAACAACTGGGATCTCGTGGCCGCCATCAACGGCGTGATCCCTCAAGAGAATGGCATCCTGCAGAGCGAGTACGGCGGCGAGACAATTCCTGGACCTGCCTTCAATCCCGCCAGCCATCCTGCATCTGCCCCTACATCTAGCAGCAGCAGCGCCTTCAGACCCGTGATGCCTAGCAGACAGATCGTGGAACGGCAGCCCAGAATGCTGGACTTCAGAGTCGAGTACCGGGACAGAAACGTGGACGTGGTGCTGGAAGATACCTGCACCGTGGGCGAGATCAAGCAGATCCTGGAAAACGAGCTGCAGATCCCCGTGTCCAAGATGCTGCTGAAAGGCTGGAAAACCGGCGACGTGGAAGATAGCACCGTGCTGAAGTCCCTGCATCTCCCTAAGAACAACAGCCTGTACGTGCTGACCCCTGACCTGCCTCCTCCAAGCTCTAGTTCTCATGCTGGCGCCCTGCAAGAGAGCCTGAACCAGAACTTCATGCTGATCATCACCCACCGCGAGGTGCAGAGAGAGTATAACCTGAACTTCAGCGGCAGCAGCACCATCCAAGAAGTGAAGCGGAACGTCTACGACCTGACCAGCATTCCTGTGCGGCACCAGCTTTGGGAAGGCTGGCCTACAAGCGCCACCGACGATTCTATGTGTCTGGCCGAGAGCGGCCTGAGCTACCCTTGTCACAGACTGACCGTGGGCAGAAGAAGCAGCCCTGCTCAGACAAGAGAGCAGTCCGAGGAACAGATCACCGACGTGCACATGGTGTCCGATAGCGACGGCGACGATTTCGAGGATGCCACCGAGTTTGGAGTGGACGACGGCGAGGTTTTCGGCATGGCTAGCAGCGCCCTGAGAAAGTCCCCTATGATGCCCGAGAACGCCGAGAATGAAGGCGACGCCCTGCTGCAGTTTACCGCCGAGTTTAGCAGCAGATACGGCGACTGTCACCCCGTGTTCTTCATCGGATCTCTGGAAGCCGCCTTCCAAGAGGCCTTTTACGTGAAGGCCAGAGACAGAAAGCTGCTGGCTATCTATCTGCACCACGACGAGAGCGTGCTGACAAACGTGTTCTGCAGCCAGATGCTGTGCGCCGAGAGCATCGTGTCTTACCTGTCTCAGAATTTCATCACCTGGGCCTGGGATCTGACCAAGGACAGCAACAGAGCCCGGTTCCTGACCATGTGTAACCGGCACTTTGGCAGCGTGGTGGCCCAGACCATCAGAACCCAGAAAACCGATCAGTTCCCTCTGTTCCTGATCATTATGGGCAAGCGCAGCAGCAACGAGGTGCTGAATGTGATCCAGGGCAACACCACCGTGGACGAGCTGATGATGAGACTGATGGCCGCTATGGAAATCTTCACAGCCCAGCAGCAAGAAGATATCAAGGACGAGGACGAGCGCGAGGCCCGCGAGAATGTGAAAAGAGAACAGGACGAAGCCTACCGGCTGAGCCTGGAAGCTGACAGAGCCAAGAGAGAGGCCCACGAGAGAGAGATGGCCGAGCAGTTCAGACTCGAGCAGATCCGCAAAGAGCAAGAGGAAGAGAGAGAAGCCATCCGGCTGTCCCTGGAACAAGCCTTGCCTCCTGAGCCTAAAGAAGAGAACGCTGAGCCAGTGTCCAAGCTGCGGATCAGAACTCCTAGCGGCGAGTTCCTGGAAAGACGGTTCCTGGCCTCCAACAAACTGCAGATCGTGTTCGACTTCGTGGCCTCTAAGGGCTTCCCCTGGGACGAGTACAAGCTGCTGAGCACATTCCCCAGACGGGACGTGACACAGCTGGACCCTAACAAGAGCCTGCTGGAAGTGAAACTGTTTCCCCAAGAGACACTGTTTCTCGAGGCCAAAGAGTGA [서열번호 149]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 149에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 149의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 150으로 제공된다:
AUGGCCAGCAACAUGGACAGAGAGAUGAUCCUGGCCGACUUCCAGGCCUGUACCGGCAUCGAGAACAUCGACGAGGCCAUCACACUGCUGGAACAGAACAACUGGGAUCUCGUGGCCGCCAUCAACGGCGUGAUCCCUCAAGAGAAUGGCAUCCUGCAGAGCGAGUACGGCGGCGAGACAAUUCCUGGACCUGCCUUCAAUCCCGCCAGCCAUCCUGCAUCUGCCCCUACAUCUAGCAGCAGCAGCGCCUUCAGACCCGUGAUGCCUAGCAGACAGAUCGUGGAACGGCAGCCCAGAAUGCUGGACUUCAGAGUCGAGUACCGGGACAGAAACGUGGACGUGGUGCUGGAAGAUACCUGCACCGUGGGCGAGAUCAAGCAGAUCCUGGAAAACGAGCUGCAGAUCCCCGUGUCCAAGAUGCUGCUGAAAGGCUGGAAAACCGGCGACGUGGAAGAUAGCACCGUGCUGAAGUCCCUGCAUCUCCCUAAGAACAACAGCCUGUACGUGCUGACCCCUGACCUGCCUCCUCCAAGCUCUAGUUCUCAUGCUGGCGCCCUGCAAGAGAGCCUGAACCAGAACUUCAUGCUGAUCAUCACCCACCGCGAGGUGCAGAGAGAGUAUAACCUGAACUUCAGCGGCAGCAGCACCAUCCAAGAAGUGAAGCGGAACGUCUACGACCUGACCAGCAUUCCUGUGCGGCACCAGCUUUGGGAAGGCUGGCCUACAAGCGCCACCGACGAUUCUAUGUGUCUGGCCGAGAGCGGCCUGAGCUACCCUUGUCACAGACUGACCGUGGGCAGAAGAAGCAGCCCUGCUCAGACAAGAGAGCAGUCCGAGGAACAGAUCACCGACGUGCACAUGGUGUCCGAUAGCGACGGCGACGAUUUCGAGGAUGCCACCGAGUUUGGAGUGGACGACGGCGAGGUUUUCGGCAUGGCUAGCAGCGCCCUGAGAAAGUCCCCUAUGAUGCCCGAGAACGCCGAGAAUGAAGGCGACGCCCUGCUGCAGUUUACCGCCGAGUUUAGCAGCAGAUACGGCGACUGUCACCCCGUGUUCUUCAUCGGAUCUCUGGAAGCCGCCUUCCAAGAGGCCUUUUACGUGAAGGCCAGAGACAGAAAGCUGCUGGCUAUCUAUCUGCACCACGACGAGAGCGUGCUGACAAACGUGUUCUGCAGCCAGAUGCUGUGCGCCGAGAGCAUCGUGUCUUACCUGUCUCAGAAUUUCAUCACCUGGGCCUGGGAUCUGACCAAGGACAGCAACAGAGCCCGGUUCCUGACCAUGUGUAACCGGCACUUUGGCAGCGUGGUGGCCCAGACCAUCAGAACCCAGAAAACCGAUCAGUUCCCUCUGUUCCUGAUCAUUAUGGGCAAGCGCAGCAGCAACGAGGUGCUGAAUGUGAUCCAGGGCAACACCACCGUGGACGAGCUGAUGAUGAGACUGAUGGCCGCUAUGGAAAUCUUCACAGCCCAGCAGCAAGAAGAUAUCAAGGACGAGGACGAGCGCGAGGCCCGCGAGAAUGUGAAAAGAGAACAGGACGAAGCCUACCGGCUGAGCCUGGAAGCUGACAGAGCCAAGAGAGAGGCCCACGAGAGAGAGAUGGCCGAGCAGUUCAGACUCGAGCAGAUCCGCAAAGAGCAAGAGGAAGAGAGAGAAGCCAUCCGGCUGUCCCUGGAACAAGCCUUGCCUCCUGAGCCUAAAGAAGAGAACGCUGAGCCAGUGUCCAAGCUGCGGAUCAGAACUCCUAGCGGCGAGUUCCUGGAAAGACGGUUCCUGGCCUCCAACAAACUGCAGAUCGUGUUCGACUUCGUGGCCUCUAAGGGCUUCCCCUGGGACGAGUACAAGCUGCUGAGCACAUUCCCCAGACGGGACGUGACACAGCUGGACCCUAACAAGAGCCUGCUGGAAGUGAAACUGUUUCCCCAAGAGACACUGUUUCUCGAGGCCAAAGAGUGA [서열번호 150]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 150에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 USP21(NCBI 참조 서열: NM_012475.5; UniProtKB - Q9UK80(UBP21_HUMAN), 또는 그의 동원체일 수 있다. [Fan Y, Mao R, Yu Y, Liu S, Shi Z, Cheng J, Zhang H, An L, Zhao Y, Xu X, Chen Z, Kogiso M, Zhang D, Zhang H, Xhang P, Jung JU, LI, X, Xu G, Yang J. USP21 negatively regulates antiviral response by acting as a RIG-1 deubiquitinase. J Exp Med.; 211(2): 313-328]. 상기 USP21은 우성 음성이 아니다; 이것은 RIG-I에 결합하여 탈유비퀴틴화하는 그 능력을 통해 항-바이러스 반응에서 음성 조절자로 작용하는 온전한 단백질이다. USP21의 과발현은 RNA 바이러스-유도성 RIG-I 폴리유비퀴틴화 및 RIG-I-매개 인터페론(IFN) 신호전달을 억제한다. 상기 USP21의 한 구현예는 다음과 같은 서열번호 166으로 제공된다:
MPQASEHRLGRTREPPVNIQPRVGSKLPFAPRARSKERRNPASGPNPMLRPLPPRPGLPDERLKKLELGRGRTSGPRPRGPLRADHGVPLPGSPPPTVALPLPSRTNLARSKSVSSGDLRPMGIALGGHRGTGELGAALSRLALRPEPPTLRRSTSLRRLGGFPGPPTLFSIRTEPPASHGSFHMISARSSEPFYSDDKMAHHTLLLGSGHVGLRNLGNTCFLNAVLQCLSSTRPLRDFCLRRDFRQEVPGGGRAQELTEAFADVIGALWHPDSCEAVNPTRFRAVFQKYVPSFSGYSQQDAQEFLKLLMERLHLEINRRGRRAPPILANGPVPSPPRRGGALLEEPELSDDDRANLMWKRYLEREDSKIVDLFVGQLKSCLKCQACGYRSTTFEVFCDLSLPIPKKGFAGGKVSLRDCFNLFTKEEELESENAPVCDRCRQKTRSTKKLTVQRFPRILVLHLNRFSASRGSIKKSSVGVDFPLQRLSLGDFASDKAGSPVYQLYALCNHSGSVHYGHYTALCRCQTGWHVYNDSRVSPVSENQVASSEGYVLFYQLMQEPPRCL [서열번호 166]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 166에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 USP21 폴리펩티드는 다음과 같은 서열번호 167의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGCCCCAGGCCTCTGAGCACCGCCTGGGCCGTACCCGAGAGCCACCTGTTAATATCCAGCCCCGAGTGGGATCCAAGCTACCATTTGCCCCCAGGGCCCGCAGCAAGGAGCGCAGAAACCCAGCCTCTGGGCCAAACCCCATGTTACGACCTCTGCCTCCCCGGCCAGGTCTGCCTGATGAACGGCTCAAGAAACTGGAGCTGGGACGGGGACGGACCTCAGGCCCTCGTCCCAGAGGCCCCCTTCGAGCAGATCATGGGGTTCCCCTGCCTGGCTCACCACCCCCAACAGTGGCTTTGCCTCTCCCATCTCGGACCAACTTAGCCCGTTCCAAGTCTGTGAGCAGTGGGGACTTGCGTCCAATGGGGATTGCCTTGGGAGGGCACCGTGGCACCGGAGAGCTTGGGGCTGCACTGAGCCGCTTGGCCCTCCGGCCTGAGCCACCCACTTTGAGACGTAGCACTTCTCTCCGCCGCCTAGGGGGCTTTCCTGGACCCCCTACCCTGTTCAGCATACGGACAGAGCCCCCTGCTTCCCATGGCTCCTTCCACATGATATCCGCCCGGTCCTCTGAGCCTTTCTACTCTGATGACAAGATGGCTCATCACACACTCCTTCTGGGCTCTGGTCATGTTGGCCTTCGAAACCTGGGAAACACGTGCTTCCTGAATGCTGTGCTGCAGTGTCTGAGCAGCACTCGACCTCTTCGGGACTTCTGTCTGAGAAGGGACTTCCGGCAAGAGGTGCCTGGAGGAGGCCGAGCCCAAGAGCTCACTGAAGCCTTTGCAGATGTGATTGGTGCCCTCTGGCACCCTGACTCCTGCGAAGCTGTGAATCCTACTCGATTCCGAGCTGTCTTCCAGAAATATGTTCCCTCCTTCTCTGGATACAGCCAGCAGGATGCCCAAGAGTTCCTGAAGCTCCTCATGGAGCGGCTACACCTTGAAATCAACCGCCGAGGCCGCCGGGCTCCACCGATACTTGCCAATGGTCCAGTTCCCTCTCCACCCCGCCGAGGAGGGGCTCTGCTAGAAGAACCTGAGTTAAGTGATGATGACCGAGCCAACCTAATGTGGAAACGTTACCTGGAGCGAGAGGACAGCAAGATTGTGGACCTGTTTGTGGGCCAGTTGAAAAGTTGTCTCAAGTGCCAGGCCTGTGGGTATCGCTCCACGACCTTCGAGGTTTTTTGTGACCTGTCCCTGCCCATCCCCAAGAAAGGATTTGCTGGGGGCAAGGTGTCTCTGCGGGATTGTTTCAACCTTTTCACTAAGGAAGAAGAGCTAGAGTCGGAGAATGCCCCAGTGTGTGACCGATGTCGGCAGAAAACTCGAAGTACCAAAAAGTTGACAGTACAAAGATTCCCTCGAATCCTCGTGCTCCATCTGAATCGATTTTCTGCCTCCCGAGGCTCCATCAAAAAAAGTTCAGTAGGTGTAGACTTTCCACTGCAGCGACTGAGCCTAGGGGACTTTGCCAGTGACAAAGCCGGAAGTCCTGTATACCAGCTGTATGCCCTTTGCAACCACTCAGGCAGCGTCCACTATGGCCACTACACAGCCCTGTGCCGGTGCCAGACTGGTTGGCATGTCTACAATGACTCTCGTGTCTCCCCTGTCAGTGAAAACCAGGTGGCATCCAGCGAGGGCTACGTGCTGTTCTACCAACTGATGCAGGAGCCACCCCGGTGCCTG [서열번호 167]
따라서, 바람직하게는 상기 USP21 폴리펩티드는 실질적으로 서열번호 167에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 168의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGCCCCAGGCCUCUGAGCACCGCCUGGGCCGUACCCGAGAGCCACCUGUUAAUAUCCAGCCCCGAGUGGGAUCCAAGCUACCAUUUGCCCCCAGGGCCCGCAGCAAGGAGCGCAGAAACCCAGCCUCUGGGCCAAACCCCAUGUUACGACCUCUGCCUCCCCGGCCAGGUCUGCCUGAUGAACGGCUCAAGAAACUGGAGCUGGGACGGGGACGGACCUCAGGCCCUCGUCCCAGAGGCCCCCUUCGAGCAGAUCAUGGGGUUCCCCUGCCUGGCUCACCACCCCCAACAGUGGCUUUGCCUCUCCCAUCUCGGACCAACUUAGCCCGUUCCAAGUCUGUGAGCAGUGGGGACUUGCGUCCAAUGGGGAUUGCCUUGGGAGGGCACCGUGGCACCGGAGAGCUUGGGGCUGCACUGAGCCGCUUGGCCCUCCGGCCUGAGCCACCCACUUUGAGACGUAGCACUUCUCUCCGCCGCCUAGGGGGCUUUCCUGGACCCCCUACCCUGUUCAGCAUACGGACAGAGCCCCCUGCUUCCCAUGGCUCCUUCCACAUGAUAUCCGCCCGGUCCUCUGAGCCUUUCUACUCUGAUGACAAGAUGGCUCAUCACACACUCCUUCUGGGCUCUGGUCAUGUUGGCCUUCGAAACCUGGGAAACACGUGCUUCCUGAAUGCUGUGCUGCAGUGUCUGAGCAGCACUCGACCUCUUCGGGACUUCUGUCUGAGAAGGGACUUCCGGCAAGAGGUGCCUGGAGGAGGCCGAGCCCAAGAGCUCACUGAAGCCUUUGCAGAUGUGAUUGGUGCCCUCUGGCACCCUGACUCCUGCGAAGCUGUGAAUCCUACUCGAUUCCGAGCUGUCUUCCAGAAAUAUGUUCCCUCCUUCUCUGGAUACAGCCAGCAGGAUGCCCAAGAGUUCCUGAAGCUCCUCAUGGAGCGGCUACACCUUGAAAUCAACCGCCGAGGCCGCCGGGCUCCACCGAUACUUGCCAAUGGUCCAGUUCCCUCUCCACCCCGCCGAGGAGGGGCUCUGCUAGAAGAACCUGAGUUAAGUGAUGAUGACCGAGCCAACCUAAUGUGGAAACGUUACCUGGAGCGAGAGGACAGCAAGAUUGUGGACCUGUUUGUGGGCCAGUUGAAAAGUUGUCUCAAGUGCCAGGCCUGUGGGUAUCGCUCCACGACCUUCGAGGUUUUUUGUGACCUGUCCCUGCCCAUCCCCAAGAAAGGAUUUGCUGGGGGCAAGGUGUCUCUGCGGGAUUGUUUCAACCUUUUCACUAAGGAAGAAGAGCUAGAGUCGGAGAAUGCCCCAGUGUGUGACCGAUGUCGGCAGAAAACUCGAAGUACCAAAAAGUUGACAGUACAAAGAUUCCCUCGAAUCCUCGUGCUCCAUCUGAAUCGAUUUUCUGCCUCCCGAGGCUCCAUCAAAAAAAGUUCAGUAGGUGUAGACUUUCCACUGCAGCGACUGAGCCUAGGGGACUUUGCCAGUGACAAAGCCGGAAGUCCUGUAUACCAGCUGUAUGCCCUUUGCAACCACUCAGGCAGCGUCCACUAUGGCCACUACACAGCCCUGUGCCGGUGCCAGACUGGUUGGCAUGUCUACAAUGACUCUCGUGUCUCCCCUGUCAGUGAAAACCAGGUGGCAUCCAGCGAGGGCUACGUGCUGUUCUACCAACUGAUGCAGGAGCCACCCCGGUGCCUG [서열번호 168]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 168에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 166의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 169로 제공된다:
ATGCCTCAGGCCTCTGAGCACAGACTGGGCAGAACCAGAGAACCTCCTGTGAACATCCAGCCTAGAGTGGGCAGCAAGCTGCCCTTCGCTCCTAGAGCCAGAAGCAAAGAGCGGAGAAACCCTGCCAGCGGACCCAATCCTATGCTGAGGCCTTTGCCTCCTAGACCTGGCCTGCCTGACGAGAGACTGAAGAAGCTGGAACTCGGCAGAGGCAGAACAAGCGGCCCTAGACCTAGAGGACCTCTGAGAGCTGATCACGGCGTTCCACTGCCTGGAAGCCCTCCACCTACAGTTGCTCTGCCACTGCCTAGCAGGACCAACCTGGCCAGATCTAAGAGCGTGTCCAGCGGCGATCTGCGGCCTATGGGAATTGCCCTCGGAGGCCATAGAGGAACAGGCGAACTTGGAGCCGCTCTGAGCAGACTGGCCCTCAGACCTGAACCTCCTACACTGAGAAGAAGCACCAGCCTGAGAAGGCTCGGCGGCTTTCCTGGACCACCAACACTGTTCAGCATCCGGACAGAGCCTCCAGCCAGCCACGGCAGCTTTCACATGATCAGCGCCAGATCCAGCGAGCCCTTCTACAGCGACGACAAGATGGCCCACCACACACTGCTGCTCGGCTCTGGACATGTGGGCCTGAGAAACCTGGGCAATACCTGCTTCCTGAATGCCGTGCTGCAGTGCCTGAGCAGCACAAGACCCCTGAGAGACTTCTGCCTGCGGCGGGACTTTAGACAAGAAGTGCCTGGCGGAGGCAGAGCCCAAGAACTGACAGAGGCTTTCGCCGATGTGATCGGAGCCCTGTGGCACCCTGATTCTTGCGAGGCCGTGAATCCCACCAGATTCCGGGCCGTGTTCCAGAAATACGTGCCCAGCTTTAGCGGCTACAGCCAGCAGGATGCCCAAGAGTTCCTGAAGCTGCTGATGGAACGGCTGCACCTGGAAATCAACAGAAGAGGCAGACGGGCCCCTCCTATCCTGGCTAATGGACCTGTTCCTAGTCCTCCTAGAAGAGGCGGCGCTCTGCTGGAAGAACCTGAGCTGAGCGACGACGACAGAGCCAACCTGATGTGGAAGAGATACCTGGAACGCGAGGACAGCAAGATCGTGGATCTGTTCGTGGGCCAGCTGAAGTCCTGCCTGAAGTGTCAGGCCTGTGGCTACAGGTCCACCACCTTCGAGGTGTTCTGCGACCTGTCTCTGCCCATTCCTAAGAAGGGCTTTGCCGGCGGAAAGGTGTCCCTGAGGGACTGCTTCAACCTGTTCACCAAAGAGGAAGAACTCGAGAGCGAGAACGCCCCTGTGTGCGACAGATGCCGGCAGAAAACCCGGTCCACCAAGAAACTGACCGTGCAGCGGTTCCCCAGAATCCTGGTGCTGCATCTGAACAGATTCTCCGCCAGCCGGGGCAGCATCAAGAAAAGCTCTGTGGGCGTCGACTTCCCACTGCAGCGACTGAGCCTGGGCGATTTCGCCTCTGATAAGGCCGGCTCTCCTGTGTACCAGCTGTACGCCCTGTGTAACCACAGCGGCTCTGTGCACTACGGCCACTACACCGCTCTGTGTAGATGCCAGACAGGCTGGCACGTGTACAACGACAGCAGAGTGTCCCCTGTGTCCGAGAATCAGGTGGCCAGCTCTGAGGGCTACGTGCTGTTCTACCAGCTGATGCAAGAGCCTCCTCGGTGCCTGTGA [서열번호 169]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 169에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 169의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 170으로 제공된다:
AUGCCUCAGGCCUCUGAGCACAGACUGGGCAGAACCAGAGAACCUCCUGUGAACAUCCAGCCUAGAGUGGGCAGCAAGCUGCCCUUCGCUCCUAGAGCCAGAAGCAAAGAGCGGAGAAACCCUGCCAGCGGACCCAAUCCUAUGCUGAGGCCUUUGCCUCCUAGACCUGGCCUGCCUGACGAGAGACUGAAGAAGCUGGAACUCGGCAGAGGCAGAACAAGCGGCCCUAGACCUAGAGGACCUCUGAGAGCUGAUCACGGCGUUCCACUGCCUGGAAGCCCUCCACCUACAGUUGCUCUGCCACUGCCUAGCAGGACCAACCUGGCCAGAUCUAAGAGCGUGUCCAGCGGCGAUCUGCGGCCUAUGGGAAUUGCCCUCGGAGGCCAUAGAGGAACAGGCGAACUUGGAGCCGCUCUGAGCAGACUGGCCCUCAGACCUGAACCUCCUACACUGAGAAGAAGCACCAGCCUGAGAAGGCUCGGCGGCUUUCCUGGACCACCAACACUGUUCAGCAUCCGGACAGAGCCUCCAGCCAGCCACGGCAGCUUUCACAUGAUCAGCGCCAGAUCCAGCGAGCCCUUCUACAGCGACGACAAGAUGGCCCACCACACACUGCUGCUCGGCUCUGGACAUGUGGGCCUGAGAAACCUGGGCAAUACCUGCUUCCUGAAUGCCGUGCUGCAGUGCCUGAGCAGCACAAGACCCCUGAGAGACUUCUGCCUGCGGCGGGACUUUAGACAAGAAGUGCCUGGCGGAGGCAGAGCCCAAGAACUGACAGAGGCUUUCGCCGAUGUGAUCGGAGCCCUGUGGCACCCUGAUUCUUGCGAGGCCGUGAAUCCCACCAGAUUCCGGGCCGUGUUCCAGAAAUACGUGCCCAGCUUUAGCGGCUACAGCCAGCAGGAUGCCCAAGAGUUCCUGAAGCUGCUGAUGGAACGGCUGCACCUGGAAAUCAACAGAAGAGGCAGACGGGCCCCUCCUAUCCUGGCUAAUGGACCUGUUCCUAGUCCUCCUAGAAGAGGCGGCGCUCUGCUGGAAGAACCUGAGCUGAGCGACGACGACAGAGCCAACCUGAUGUGGAAGAGAUACCUGGAACGCGAGGACAGCAAGAUCGUGGAUCUGUUCGUGGGCCAGCUGAAGUCCUGCCUGAAGUGUCAGGCCUGUGGCUACAGGUCCACCACCUUCGAGGUGUUCUGCGACCUGUCUCUGCCCAUUCCUAAGAAGGGCUUUGCCGGCGGAAAGGUGUCCCUGAGGGACUGCUUCAACCUGUUCACCAAAGAGGAAGAACUCGAGAGCGAGAACGCCCCUGUGUGCGACAGAUGCCGGCAGAAAACCCGGUCCACCAAGAAACUGACCGUGCAGCGGUUCCCCAGAAUCCUGGUGCUGCAUCUGAACAGAUUCUCCGCCAGCCGGGGCAGCAUCAAGAAAAGCUCUGUGGGCGUCGACUUCCCACUGCAGCGACUGAGCCUGGGCGAUUUCGCCUCUGAUAAGGCCGGCUCUCCUGUGUACCAGCUGUACGCCCUGUGUAACCACAGCGGCUCUGUGCACUACGGCCACUACACCGCUCUGUGUAGAUGCCAGACAGGCUGGCACGUGUACAACGACAGCAGAGUGUCCCCUGUGUCCGAGAAUCAGGUGGCCAGCUCUGAGGGCUACGUGCUGUUCUACCAGCUGAUGCAAGAGCCUCCUCGGUGCCUGUGA [서열번호 170]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 170에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 USP27(1-438)(NCBI 참조 서열: NM_001145073.3; UniProtKB - A6NNY8(UBP27_HUMAN), 또는 그의 동원체일 수 있다. 상기 USP27은 우성 음성이 아니다; 이것은 RIG-I에 결합하여 탈유비퀴틴화하는 그 능력을 통해 항-바이러스 반응에서 음성 조절자로 작용하는 온전한 단백질이다. USP27의 과발현은 RNA 바이러스-유도성 RIG-I 폴리유비퀴틴화 및 IFN 생산을 유도하는 RIG-I-매개 경로를 억제한다. 상기 USP27 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 171로 나타낸다:
MCKDYVYDKDIEQIAKEEQGEALKLQASTSTEVSHQQCSVPGLGEKFPTWETTKPELELLGHNPRRRRITSSFTIGLRGLINLGNTCFMNCIVQALTHTPILRDFFLSDRHRCEMPSPELCLVCEMSSLFRELYSGNPSPHVPYKLLHLVWIHARHLAGYRQQDAHEFLIAALDVLHRHCKGDDVGKAANNPNHCNCIIDQIFTGGLQSDVTCQACHGVSTTIDPCWDISLDLPGSCTSFWPMSPGRESSVNGESHIPGITTLTDCLRRFTRPEHLGSSAKIKCGSCQSYQESTKQLTMNKLPVVACFHFKRFEHSAKQRRKITTYISFPLELDMTPFMASSKESRMNGQLQLPTNSGNNENKYSLFAVVNHQGTLESGHYTSFIRHHKDQWFKCDDAVITKASIKDVLDSEGYLLFYHKQVLEHESEKVKEMNTQAY [서열번호 171]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 171에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 USP27 폴리펩티드는 다음과 같은 서열번호 172의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGTGTAAGGACTATGTATATGACAAAGACATTGAGCAAATTGCCAAAGAAGAGCAAGGAGAAGCTTTGAAATTACAAGCCTCCACCTCAACAGAGGTTTCTCACCAGCAGTGTTCAGTGCCAGGCCTTGGTGAGAAATTCCCAACCTGGGAAACAACCAAACCAGAATTAGAACTGCTGGGGCACAACCCGAGGAGAAGAAGAATCACCTCCAGCTTTACGATCGGTTTAAGAGGACTCATCAATCTTGGCAACACGTGCTTTATGAACTGCATTGTCCAGGCCCTCACCCACACGCCGATACTGAGAGATTTCTTTCTCTCTGACAGGCACCGATGTGAGATGCCGAGTCCCGAGTTGTGTCTGGTCTGTGAGATGTCGTCGCTGTTTCGGGAGTTGTATTCTGGAAACCCGTCTCCTCATGTGCCCTATAAGTTACTGCACCTGGTGTGGATACATGCCCGCCATTTAGCAGGGTACAGGCAACAGGATGCCCACGAGTTCCTCATTGCAGCGTTAGATGTCCTGCACAGGCACTGCAAAGGTGATGATGTCGGGAAGGCGGCCAACAATCCCAACCACTGTAACTGCATCATAGACCAAATCTTCACAGGTGGCCTGCAGTCTGATGTCACCTGTCAAGCCTGCCATGGCGTCTCCACCACGATAGACCCATGCTGGGACATTAGTTTGGACTTGCCTGGCTCTTGCACCTCCTTCTGGCCCATGAGCCCAGGGAGGGAGAGCAGTGTGAACGGGGAAAGCCACATACCAGGAATCACCACCCTCACGGACTGCTTGCGGAGGTTTACGAGGCCAGAGCACTTAGGAAGCAGTGCCAAAATCAAATGTGGTAGTTGCCAAAGCTACCAGGAATCTACCAAACAGCTCACAATGAATAAATTACCTGTCGTTGCCTGTTTTCATTTCAAACGGTTTGAACATTCAGCGAAACAGAGGCGCAAGATCACTACATACATTTCCTTTCCTCTGGAGCTGGATATGACGCCGTTTATGGCCTCAAGTAAAGAGAGCAGAATGAATGGACAATTGCAGCTGCCAACCAATAGTGGAAACAACGAAAATAAGTATTCCTTGTTTGCTGTGGTTAATCACCAAGGAACCTTGGAGAGTGGCCACTATACCAGCTTCATCCGGCACCACAAGGACCAGTGGTTCAAGTGTGATGATGCCGTCATCACTAAGGCCAGTATTAAGGACGTACTGGACAGTGAAGGGTATTTACTGTTCTATCACAAACAGGTGCTAGAACATGAGTCAGAAAAAGTGAAAGAAATGAACACACAAGCCTAC [서열번호 172]
따라서, 바람직하게는 상기 USP27 폴리펩티드는 실질적으로 서열번호 172에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 173의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGUGUAAGGACUAUGUAUAUGACAAAGACAUUGAGCAAAUUGCCAAAGAAGAGCAAGGAGAAGCUUUGAAAUUACAAGCCUCCACCUCAACAGAGGUUUCUCACCAGCAGUGUUCAGUGCCAGGCCUUGGUGAGAAAUUCCCAACCUGGGAAACAACCAAACCAGAAUUAGAACUGCUGGGGCACAACCCGAGGAGAAGAAGAAUCACCUCCAGCUUUACGAUCGGUUUAAGAGGACUCAUCAAUCUUGGCAACACGUGCUUUAUGAACUGCAUUGUCCAGGCCCUCACCCACACGCCGAUACUGAGAGAUUUCUUUCUCUCUGACAGGCACCGAUGUGAGAUGCCGAGUCCCGAGUUGUGUCUGGUCUGUGAGAUGUCGUCGCUGUUUCGGGAGUUGUAUUCUGGAAACCCGUCUCCUCAUGUGCCCUAUAAGUUACUGCACCUGGUGUGGAUACAUGCCCGCCAUUUAGCAGGGUACAGGCAACAGGAUGCCCACGAGUUCCUCAUUGCAGCGUUAGAUGUCCUGCACAGGCACUGCAAAGGUGAUGAUGUCGGGAAGGCGGCCAACAAUCCCAACCACUGUAACUGCAUCAUAGACCAAAUCUUCACAGGUGGCCUGCAGUCUGAUGUCACCUGUCAAGCCUGCCAUGGCGUCUCCACCACGAUAGACCCAUGCUGGGACAUUAGUUUGGACUUGCCUGGCUCUUGCACCUCCUUCUGGCCCAUGAGCCCAGGGAGGGAGAGCAGUGUGAACGGGGAAAGCCACAUACCAGGAAUCACCACCCUCACGGACUGCUUGCGGAGGUUUACGAGGCCAGAGCACUUAGGAAGCAGUGCCAAAAUCAAAUGUGGUAGUUGCCAAAGCUACCAGGAAUCUACCAAACAGCUCACAAUGAAUAAAUUACCUGUCGUUGCCUGUUUUCAUUUCAAACGGUUUGAACAUUCAGCGAAACAGAGGCGCAAGAUCACUACAUACAUUUCCUUUCCUCUGGAGCUGGAUAUGACGCCGUUUAUGGCCUCAAGUAAAGAGAGCAGAAUGAAUGGACAAUUGCAGCUGCCAACCAAUAGUGGAAACAACGAAAAUAAGUAUUCCUUGUUUGCUGUGGUUAAUCACCAAGGAACCUUGGAGAGUGGCCACUAUACCAGCUUCAUCCGGCACCACAAGGACCAGUGGUUCAAGUGUGAUGAUGCCGUCAUCACUAAGGCCAGUAUUAAGGACGUACUGGACAGUGAAGGGUAUUUACUGUUCUAUCACAAACAGGUGCUAGAACAUGAGUCAGAAAAAGUGAAAGAAAUGAACACACAAGCCUAC [서열번호 173]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 173에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 171의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 174로 제공된다:
ATGTGCAAGGACTACGTGTACGACAAGGACATCGAGCAGATCGCCAAAGAGGAACAGGGCGAAGCCCTGAAGCTGCAGGCCAGCACATCTACAGAGGTGTCCCACCAGCAGTGTAGCGTGCCAGGACTGGGCGAGAAGTTCCCTACCTGGGAAACCACCAAGCCTGAGCTGGAACTGCTGGGCCACAATCCTCGGCGGAGAAGAATCACCAGCAGCTTCACCATCGGCCTGCGGGGCCTGATCAATCTGGGCAATACCTGCTTCATGAACTGCATCGTGCAGGCCCTGACACACACCCCTATCCTGAGAGACTTCTTCCTGTCCGACCGGCACAGATGCGAGATGCCTTCTCCAGAGCTGTGCCTCGTGTGCGAGATGAGCAGCCTGTTCCGGGAACTGTACAGCGGCAACCCTTCTCCTCACGTGCCCTACAAACTGCTGCACCTCGTGTGGATTCACGCCAGACACCTGGCCGGCTACAGACAGCAGGATGCCCACGAGTTTCTGATCGCCGCTCTGGACGTGCTGCACAGACACTGCAAAGGCGACGATGTGGGCAAAGCCGCCAACAATCCCAACCACTGCAACTGCATCATCGACCAGATCTTCACAGGCGGCCTGCAGAGCGACGTTACCTGTCAAGCTTGTCACGGCGTGTCCACCACCATCGATCCCTGCTGGGATATCAGCCTGGATCTGCCTGGCAGCTGCACCAGCTTTTGGCCTATGAGCCCTGGCAGAGAAAGCAGCGTGAACGGCGAGTCTCACATCCCCGGCATCACCACACTGACCGACTGCCTGCGGAGATTCACCAGACCTGAGCACCTGGGAAGCAGCGCCAAGATCAAGTGTGGCTCCTGCCAGAGCTACCAAGAGAGCACCAAGCAGCTGACCATGAACAAGCTGCCTGTGGTGGCCTGCTTCCACTTCAAGAGATTCGAGCACTCCGCCAAGCAGCGGCGGAAGATCACAACCTACATCAGCTTCCCTCTGGAACTGGACATGACCCCTTTCATGGCCAGCAGCAAAGAAAGCCGGATGAACGGCCAGCTCCAGCTGCCTACAAATAGCGGCAACAACGAGAACAAGTACTCCCTGTTCGCCGTGGTCAACCACCAGGGCACACTGGAAAGCGGCCACTACACCAGCTTCATCAGACACCACAAGGACCAGTGGTTCAAGTGCGACGACGCCGTGATCACCAAGGCCAGCATCAAGGATGTCCTGGACAGCGAGGGCTACCTGCTGTTCTACCACAAACAGGTGCTGGAACACGAGAGCGAGAAAGTGAAAGAGATGAACACCCAGGCCTACTGA [서열번호 174]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 174에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 174의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 175로 제공된다:
AUGUGCAAGGACUACGUGUACGACAAGGACAUCGAGCAGAUCGCCAAAGAGGAACAGGGCGAAGCCCUGAAGCUGCAGGCCAGCACAUCUACAGAGGUGUCCCACCAGCAGUGUAGCGUGCCAGGACUGGGCGAGAAGUUCCCUACCUGGGAAACCACCAAGCCUGAGCUGGAACUGCUGGGCCACAAUCCUCGGCGGAGAAGAAUCACCAGCAGCUUCACCAUCGGCCUGCGGGGCCUGAUCAAUCUGGGCAAUACCUGCUUCAUGAACUGCAUCGUGCAGGCCCUGACACACACCCCUAUCCUGAGAGACUUCUUCCUGUCCGACCGGCACAGAUGCGAGAUGCCUUCUCCAGAGCUGUGCCUCGUGUGCGAGAUGAGCAGCCUGUUCCGGGAACUGUACAGCGGCAACCCUUCUCCUCACGUGCCCUACAAACUGCUGCACCUCGUGUGGAUUCACGCCAGACACCUGGCCGGCUACAGACAGCAGGAUGCCCACGAGUUUCUGAUCGCCGCUCUGGACGUGCUGCACAGACACUGCAAAGGCGACGAUGUGGGCAAAGCCGCCAACAAUCCCAACCACUGCAACUGCAUCAUCGACCAGAUCUUCACAGGCGGCCUGCAGAGCGACGUUACCUGUCAAGCUUGUCACGGCGUGUCCACCACCAUCGAUCCCUGCUGGGAUAUCAGCCUGGAUCUGCCUGGCAGCUGCACCAGCUUUUGGCCUAUGAGCCCUGGCAGAGAAAGCAGCGUGAACGGCGAGUCUCACAUCCCCGGCAUCACCACACUGACCGACUGCCUGCGGAGAUUCACCAGACCUGAGCACCUGGGAAGCAGCGCCAAGAUCAAGUGUGGCUCCUGCCAGAGCUACCAAGAGAGCACCAAGCAGCUGACCAUGAACAAGCUGCCUGUGGUGGCCUGCUUCCACUUCAAGAGAUUCGAGCACUCCGCCAAGCAGCGGCGGAAGAUCACAACCUACAUCAGCUUCCCUCUGGAACUGGACAUGACCCCUUUCAUGGCCAGCAGCAAAGAAAGCCGGAUGAACGGCCAGCUCCAGCUGCCUACAAAUAGCGGCAACAACGAGAACAAGUACUCCCUGUUCGCCGUGGUCAACCACCAGGGCACACUGGAAAGCGGCCACUACACCAGCUUCAUCAGACACCACAAGGACCAGUGGUUCAAGUGCGACGACGCCGUGAUCACCAAGGCCAGCAUCAAGGAUGUCCUGGACAGCGAGGGCUACCUGCUGUUCUACCACAAACAGGUGCUGGAACACGAGAGCGAGAAAGUGAAAGAGAUGAACACCCAGGCCUACUGA [서열번호 175]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 175에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 CYLD(NCBI 참조 서열: NM_015247.3; UniProtKB - Q9NQC7(CYLD_HUMAN), 또는 그의 동원체일 수 있다. [Friedman CS, O'Donell MA, Legarda-Addision D, Ng A, Cardenas WB, Young JS, Moran TM, Basler CF, Komuro A, Horvath CM, Xavier R, Ting AT. The tumour suppressor CYLD is a negative regulator of RIG-I-mediated antiviral response. EMBO Rep. 2008; 9(9): 930-93]. CYLD의 이소성(ectopic) 발현은 IRF3 신호전달 경로를 억제하고, RIG-I에 의해 IFN 생산이 촉발된다. 상기 CYLD의 한 구현예는 본 명세서에서 다음과 같은 서열번호 176으로 나타낸다:
MSSGLWSQEKVTSPYWEERIFYLLLQECSVTDKQTQKLLKVPKGSIGQYIQDRSVGHSRIPSAKGKKNQIGLKILEQPHAVLFVDEKDVVEINEKFTELLLAITNCEERFSLFKNRNRLSKGLQIDVGCPVKVQLRSGEEKFPGVVRFRGPLLAERTVSGIFFGVELLEEGRGQGFTDGVYQGKQLFQCDEDCGVFVALDKLELIEDDDTALESDYAGPGDTMQVELPPLEINSRVSLKVGETIESGTVIFCDVLPGKESLGYFVGVDMDNPIGNWDGRFDGVQLCSFACVESTILLHINDIIPALSESVTQERRPPKLAFMSRGVGDKGSSSHNKPKATGSTSDPGNRNRSELFYTLNGSSVDSQPQSKSKNTWYIDEVAEDPAKSLTEISTDFDRSSPPLQPPPVNSLTTENRFHSLPFSLTKMPNTNGSIGHSPLSLSAQSVMEELNTAPVQESPPLAMPPGNSHGLEVGSLAEVKENPPFYGVIRWIGQPPGLNEVLAGLELEDECAGCTDGTFRGTRYFTCALKKALFVKLKSCRPDSRFASLQPVSNQIERCNSLAFGGYLSEVVEENTPPKMEKEGLEIMIGKKKGIQGHYNSCYLDSTLFCLFAFSSVLDTVLLRPKEKNDVEYYSETQELLRTEIVNPLRIYGYVCATKIMKLRKILEKVEAASGFTSEEKDPEEFLNILFHHILRVEPLLKIRSAGQKVQDCYFYQIFMEKNEKVGVPTIQQLLEWSFINSNLKFAEAPSCLIIQMPRFGKDFKLFKKIFPSLELNITDLLEDTPRQCRICGGLAMYECRECYDDPDISAGKIKQFCKTCNTQVHLHPKRLNHKYNPVSLPKDLPDWDWRHGCIPCQNMELFAVLCIETSHYVAFVKYGKDDSAWLFFDSMADRDGGQNGFNIPQVTPCPEVGEYLKMSLEDLHSLDSRRIQGCARRLLCDAYMCMYQSPTMSLYK [서열번호 176]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 176에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 CYLD 폴리펩티드는 다음과 같은 서열번호 177의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAGTTCAGGCTTATGGAGCCAAGAAAAAGTCACTTCACCCTACTGGGAAGAGCGGATTTTTTACTTGCTTCTTCAAGAATGCAGCGTTACAGACAAACAAACACAAAAGCTCCTTAAAGTACCGAAGGGAAGTATAGGACAGTATATTCAAGATCGTTCTGTGGGGCATTCAAGGATTCCTTCTGCAAAAGGCAAGAAAAATCAGATTGGATTAAAAATTCTAGAGCAACCTCATGCAGTTCTCTTTGTTGATGAAAAGGATGTTGTAGAGATAAATGAAAAGTTCACAGAGTTACTTTTGGCAATTACCAATTGTGAGGAGAGGTTCAGCCTGTTTAAAAACAGAAACAGACTAAGTAAAGGCCTCCAAATAGACGTGGGCTGTCCTGTGAAAGTACAGCTGAGATCTGGGGAAGAAAAATTTCCTGGAGTTGTACGCTTCAGAGGACCCCTGTTAGCAGAGAGGACAGTCTCCGGAATATTCTTTGGAGTTGAATTGCTGGAAGAAGGTCGTGGTCAAGGTTTCACTGACGGGGTGTACCAAGGGAAACAGCTTTTTCAGTGTGATGAAGATTGTGGCGTGTTTGTTGCATTGGACAAGCTAGAACTCATAGAAGATGATGACACTGCATTGGAAAGTGATTACGCAGGTCCTGGGGACACAATGCAGGTCGAACTTCCTCCTTTGGAAATAAACTCCAGAGTTTCTTTGAAGGTTGGAGAAACAATAGAATCTGGAACAGTTATATTCTGTGATGTTTTGCCAGGAAAAGAAAGCTTAGGATATTTTGTTGGTGTGGACATGGATAACCCTATTGGCAACTGGGATGGAAGATTTGATGGAGTGCAGCTTTGTAGTTTTGCGTGTGTTGAAAGTACAATTCTATTGCACATCAATGATATCATCCCAGCTTTATCAGAGAGTGTGACGCAGGAAAGGAGGCCTCCCAAACTTGCCTTTATGTCAAGAGGTGTTGGGGACAAAGGTTCATCCAGTCATAATAAACCAAAGGCTACAGGATCTACCTCAGACCCTGGAAATAGAAACAGATCTGAATTATTTTATACCTTAAATGGGTCTTCTGTTGACTCACAACCACAATCCAAATCAAAAAATACATGGTACATTGATGAAGTTGCAGAAGACCCTGCAAAATCTCTTACAGAGATATCTACAGACTTTGACCGTTCTTCACCACCACTCCAGCCTCCTCCTGTGAACTCACTGACCACCGAGAACAGATTCCACTCTTTACCATTCAGTCTCACCAAGATGCCCAATACCAATGGAAGTATTGGCCACAGTCCACTTTCTCTGTCAGCCCAGTCTGTAATGGAAGAGCTAAACACTGCACCCGTCCAAGAGAGTCCACCCTTGGCCATGCCTCCTGGGAACTCACATGGTCTAGAAGTGGGCTCATTGGCTGAAGTTAAGGAGAACCCTCCTTTCTATGGGGTAATCCGTTGGATCGGTCAGCCACCAGGACTGAATGAAGTGCTCGCTGGACTGGAACTGGAAGATGAGTGTGCAGGCTGTACGGATGGAACCTTCAGAGGCACTCGGTATTTCACCTGTGCCCTGAAGAAGGCGCTGTTTGTGAAACTGAAGAGCTGCAGGCCTGACTCTAGGTTTGCATCATTGCAGCCGGTTTCCAATCAGATTGAGCGCTGTAACTCTTTAGCATTTGGAGGCTACTTAAGTGAAGTAGTAGAAGAAAATACTCCACCAAAAATGGAAAAAGAAGGCTTGGAGATAATGATTGGGAAGAAGAAAGGCATCCAGGGTCATTACAATTCTTGTTACTTAGACTCAACCTTATTCTGCTTATTTGCTTTTAGTTCTGTTCTGGACACTGTGTTACTTAGACCCAAAGAAAAGAACGATGTAGAATATTATAGTGAAACCCAAGAGCTACTGAGGACAGAAATTGTTAATCCTCTGAGAATATATGGATATGTGTGTGCCACAAAAATTATGAAACTGAGGAAAATACTTGAAAAGGTGGAGGCTGCATCAGGATTTACCTCTGAAGAAAAAGATCCTGAGGAATTCTTGAATATTCTGTTTCATCATATTTTAAGGGTAGAACCTTTGCTAAAAATAAGATCAGCAGGTCAAAAGGTACAAGATTGTTACTTCTATCAAATTTTTATGGAAAAAAATGAGAAAGTTGGCGTTCCCACAATTCAGCAGTTGTTAGAATGGTCTTTTATCAACAGTAACCTGAAATTTGCAGAGGCACCATCATGTCTGATTATTCAGATGCCTCGATTTGGAAAAGACTTTAAACTATTTAAAAAAATTTTTCCTTCTCTGGAATTAAATATAACAGATTTACTTGAAGACACTCCCAGACAGTGCCGGATATGTGGAGGGCTTGCAATGTATGAGTGTAGAGAATGCTACGACGATCCGGACATCTCAGCTGGAAAAATCAAGCAGTTTTGTAAAACCTGCAACACTCAAGTCCACCTTCATCCGAAGAGGCTGAATCATAAATATAACCCAGTGTCACTTCCCAAAGACTTACCCGACTGGGACTGGAGACACGGCTGCATCCCTTGCCAGAATATGGAGTTATTTGCTGTTCTCTGCATAGAAACAAGCCACTATGTTGCTTTTGTGAAGTATGGGAAGGACGATTCTGCCTGGCTCTTCTTTGACAGCATGGCCGATCGGGATGGTGGTCAGAATGGCTTCAACATTCCTCAAGTCACCCCATGCCCAGAAGTAGGAGAGTACTTGAAGATGTCTCTGGAAGACCTGCATTCCTTGGACTCCAGGAGAATCCAAGGCTGTGCACGAAGACTGCTTTGTGATGCATATATGTGCATGTACCAGAGTCCAACAATGAGTTTGTACAAA [서열번호 177]
따라서, 바람직하게는 상기 CYLD 폴리펩티드는 실질적으로 서열번호 177에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 178의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAGUUCAGGCUUAUGGAGCCAAGAAAAAGUCACUUCACCCUACUGGGAAGAGCGGAUUUUUUACUUGCUUCUUCAAGAAUGCAGCGUUACAGACAAACAAACACAAAAGCUCCUUAAAGUACCGAAGGGAAGUAUAGGACAGUAUAUUCAAGAUCGUUCUGUGGGGCAUUCAAGGAUUCCUUCUGCAAAAGGCAAGAAAAAUCAGAUUGGAUUAAAAAUUCUAGAGCAACCUCAUGCAGUUCUCUUUGUUGAUGAAAAGGAUGUUGUAGAGAUAAAUGAAAAGUUCACAGAGUUACUUUUGGCAAUUACCAAUUGUGAGGAGAGGUUCAGCCUGUUUAAAAACAGAAACAGACUAAGUAAAGGCCUCCAAAUAGACGUGGGCUGUCCUGUGAAAGUACAGCUGAGAUCUGGGGAAGAAAAAUUUCCUGGAGUUGUACGCUUCAGAGGACCCCUGUUAGCAGAGAGGACAGUCUCCGGAAUAUUCUUUGGAGUUGAAUUGCUGGAAGAAGGUCGUGGUCAAGGUUUCACUGACGGGGUGUACCAAGGGAAACAGCUUUUUCAGUGUGAUGAAGAUUGUGGCGUGUUUGUUGCAUUGGACAAGCUAGAACUCAUAGAAGAUGAUGACACUGCAUUGGAAAGUGAUUACGCAGGUCCUGGGGACACAAUGCAGGUCGAACUUCCUCCUUUGGAAAUAAACUCCAGAGUUUCUUUGAAGGUUGGAGAAACAAUAGAAUCUGGAACAGUUAUAUUCUGUGAUGUUUUGCCAGGAAAAGAAAGCUUAGGAUAUUUUGUUGGUGUGGACAUGGAUAACCCUAUUGGCAACUGGGAUGGAAGAUUUGAUGGAGUGCAGCUUUGUAGUUUUGCGUGUGUUGAAAGUACAAUUCUAUUGCACAUCAAUGAUAUCAUCCCAGCUUUAUCAGAGAGUGUGACGCAGGAAAGGAGGCCUCCCAAACUUGCCUUUAUGUCAAGAGGUGUUGGGGACAAAGGUUCAUCCAGUCAUAAUAAACCAAAGGCUACAGGAUCUACCUCAGACCCUGGAAAUAGAAACAGAUCUGAAUUAUUUUAUACCUUAAAUGGGUCUUCUGUUGACUCACAACCACAAUCCAAAUCAAAAAAUACAUGGUACAUUGAUGAAGUUGCAGAAGACCCUGCAAAAUCUCUUACAGAGAUAUCUACAGACUUUGACCGUUCUUCACCACCACUCCAGCCUCCUCCUGUGAACUCACUGACCACCGAGAACAGAUUCCACUCUUUACCAUUCAGUCUCACCAAGAUGCCCAAUACCAAUGGAAGUAUUGGCCACAGUCCACUUUCUCUGUCAGCCCAGUCUGUAAUGGAAGAGCUAAACACUGCACCCGUCCAAGAGAGUCCACCCUUGGCCAUGCCUCCUGGGAACUCACAUGGUCUAGAAGUGGGCUCAUUGGCUGAAGUUAAGGAGAACCCUCCUUUCUAUGGGGUAAUCCGUUGGAUCGGUCAGCCACCAGGACUGAAUGAAGUGCUCGCUGGACUGGAACUGGAAGAUGAGUGUGCAGGCUGUACGGAUGGAACCUUCAGAGGCACUCGGUAUUUCACCUGUGCCCUGAAGAAGGCGCUGUUUGUGAAACUGAAGAGCUGCAGGCCUGACUCUAGGUUUGCAUCAUUGCAGCCGGUUUCCAAUCAGAUUGAGCGCUGUAACUCUUUAGCAUUUGGAGGCUACUUAAGUGAAGUAGUAGAAGAAAAUACUCCACCAAAAAUGGAAAAAGAAGGCUUGGAGAUAAUGAUUGGGAAGAAGAAAGGCAUCCAGGGUCAUUACAAUUCUUGUUACUUAGACUCAACCUUAUUCUGCUUAUUUGCUUUUAGUUCUGUUCUGGACACUGUGUUACUUAGACCCAAAGAAAAGAACGAUGUAGAAUAUUAUAGUGAAACCCAAGAGCUACUGAGGACAGAAAUUGUUAAUCCUCUGAGAAUAUAUGGAUAUGUGUGUGCCACAAAAAUUAUGAAACUGAGGAAAAUACUUGAAAAGGUGGAGGCUGCAUCAGGAUUUACCUCUGAAGAAAAAGAUCCUGAGGAAUUCUUGAAUAUUCUGUUUCAUCAUAUUUUAAGGGUAGAACCUUUGCUAAAAAUAAGAUCAGCAGGUCAAAAGGUACAAGAUUGUUACUUCUAUCAAAUUUUUAUGGAAAAAAAUGAGAAAGUUGGCGUUCCCACAAUUCAGCAGUUGUUAGAAUGGUCUUUUAUCAACAGUAACCUGAAAUUUGCAGAGGCACCAUCAUGUCUGAUUAUUCAGAUGCCUCGAUUUGGAAAAGACUUUAAACUAUUUAAAAAAAUUUUUCCUUCUCUGGAAUUAAAUAUAACAGAUUUACUUGAAGACACUCCCAGACAGUGCCGGAUAUGUGGAGGGCUUGCAAUGUAUGAGUGUAGAGAAUGCUACGACGAUCCGGACAUCUCAGCUGGAAAAAUCAAGCAGUUUUGUAAAACCUGCAACACUCAAGUCCACCUUCAUCCGAAGAGGCUGAAUCAUAAAUAUAACCCAGUGUCACUUCCCAAAGACUUACCCGACUGGGACUGGAGACACGGCUGCAUCCCUUGCCAGAAUAUGGAGUUAUUUGCUGUUCUCUGCAUAGAAACAAGCCACUAUGUUGCUUUUGUGAAGUAUGGGAAGGACGAUUCUGCCUGGCUCUUCUUUGACAGCAUGGCCGAUCGGGAUGGUGGUCAGAAUGGCUUCAACAUUCCUCAAGUCACCCCAUGCCCAGAAGUAGGAGAGUACUUGAAGAUGUCUCUGGAAGACCUGCAUUCCUUGGACUCCAGGAGAAUCCAAGGCUGUGCACGAAGACUGCUUUGUGAUGCAUAUAUGUGCAUGUACCAGAGUCCAACAAUGAGUUUGUACAAA [서열번호 178]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 178에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 176의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 179로 제공된다:
ATGTCTAGCGGCCTGTGGTCCCAAGAGAAAGTGACAAGCCCCTACTGGGAAGAGAGGATCTTCTACCTGCTGCTGCAAGAGTGCAGCGTGACCGACAAGCAGACCCAGAAACTGCTGAAGGTGCCCAAGGGCAGCATCGGCCAGTACATCCAGGATAGAAGCGTGGGCCACAGCAGAATCCCTAGCGCCAAGGGCAAGAAGAACCAGATCGGCCTGAAGATCCTGGAACAGCCTCACGCCGTGCTGTTCGTGGACGAGAAGGACGTGGTGGAAATCAACGAGAAGTTCACCGAGCTGCTGCTGGCCATCACCAACTGCGAGGAACGGTTCAGCCTGTTCAAGAACCGGAACCGGCTGAGCAAGGGCCTGCAGATCGATGTGGGATGCCCTGTGAAGGTGCAGCTGAGAAGCGGCGAAGAGAAGTTCCCTGGCGTCGTGCGGTTTAGAGGACCTCTGCTGGCCGAGAGAACCGTGTCCGGCATCTTCTTTGGCGTGGAACTGCTGGAAGAAGGCAGAGGCCAGGGCTTTACCGATGGCGTGTACCAGGGCAAGCAGCTGTTTCAGTGCGACGAGGATTGCGGCGTGTTCGTGGCCCTGGATAAGCTGGAACTGATCGAGGACGACGACACAGCCCTGGAAAGCGATTATGCCGGACCTGGCGATACCATGCAGGTCGAACTGCCTCCACTCGAGATCAACAGCCGGGTGTCCCTGAAAGTGGGCGAGACAATCGAGAGCGGCACCGTGATCTTTTGCGACGTGCTGCCTGGCAAAGAGTCCCTGGGCTATTTTGTGGGCGTCGACATGGACAACCCCATCGGCAATTGGGACGGCAGATTTGACGGCGTGCAGCTGTGCAGCTTCGCCTGTGTGGAAAGCACCATCCTGCTGCACATCAACGACATCATCCCCGCTCTGAGCGAGAGCGTGACCCAAGAAAGACGGCCTCCTAAGCTGGCCTTCATGTCTAGAGGCGTGGGCGATAAGGGCAGCTCCAGCCACAACAAGCCTAAGGCCACAGGCTCCACAAGCGACCCCGGCAACAGAAACAGAAGCGAGCTGTTCTACACCCTGAACGGCAGCAGCGTGGACAGCCAGCCTCAGAGCAAGAGCAAGAACACCTGGTACATCGACGAGGTGGCCGAGGATCCTGCCAAGAGCCTGACAGAGATCAGCACCGACTTCGACAGAAGCAGCCCTCCACTGCAGCCTCCACCTGTGAATAGCCTGACCACCGAGAACAGATTCCACAGCCTGCCTTTCAGCCTGACTAAGATGCCCAACACCAACGGCTCCATCGGGCACTCTCCACTGTCTCTGTCTGCCCAGAGCGTGATGGAAGAACTGAACACAGCCCCTGTGCAAGAGTCCCCTCCTCTGGCTATGCCTCCTGGCAATTCTCACGGCCTGGAAGTGGGATCTCTGGCCGAAGTGAAAGAGAACCCTCCTTTCTACGGCGTGATCCGGTGGATCGGACAACCTCCTGGACTGAATGAAGTGCTGGCCGGACTGGAACTGGAAGATGAGTGTGCCGGCTGCACCGACGGCACCTTTAGAGGCACCAGATACTTCACATGCGCCCTGAAGAAAGCCCTGTTCGTGAAGCTGAAGTCCTGCAGACCCGACAGCAGATTCGCTAGCCTGCAGCCTGTGTCCAATCAGATCGAGCGGTGCAACTCCCTGGCCTTTGGCGGCTATCTGTCCGAGGTGGTGGAAGAGAACACCCCTCCTAAGATGGAAAAAGAGGGCCTCGAGATTATGATCGGGAAGAAGAAGGGCATCCAGGGGCACTACAATAGCTGCTACCTGGACAGCACCCTGTTCTGCCTGTTCGCCTTTAGCAGCGTGCTGGACACTGTGCTGCTGCGGCCCAAAGAGAAGAACGACGTCGAGTACTACAGCGAGACACAAGAGCTGCTGAGAACCGAGATCGTGAACCCTCTGCGGATCTACGGCTACGTGTGCGCCACCAAGATCATGAAGCTGCGGAAGATTCTGGAAAAGGTGGAAGCCGCCTCCGGCTTCACCAGCGAGGAAAAGGATCCCGAAGAGTTCCTGAACATCCTGTTTCACCACATCCTGAGAGTGGAACCCCTGCTGAAGATCAGATCCGCCGGACAGAAAGTGCAGGACTGCTACTTCTACCAGATCTTCATGGAAAAGAACGAGAAAGTCGGCGTGCCCACCATCCAGCAACTGCTCGAGTGGTCCTTCATCAACAGCAACCTGAAGTTCGCCGAGGCTCCCAGCTGCCTGATCATCCAGATGCCTAGATTCGGCAAGGACTTCAAGCTGTTCAAAAAGATCTTCCCCAGCCTCGAGCTGAACATCACCGACCTGCTCGAGGACACCCCTCGGCAGTGTAGAATTTGTGGCGGCCTGGCTATGTACGAGTGCAGAGAGTGCTACGACGACCCCGATATCAGCGCCGGCAAGATCAAGCAGTTCTGCAAGACCTGCAACACCCAAGTGCATCTGCACCCCAAGCGGCTGAACCACAAGTACAACCCCGTGTCTCTGCCCAAGGACCTGCCTGACTGGGATTGGAGACACGGCTGTATCCCTTGCCAGAACATGGAACTGTTCGCTGTGCTGTGCATCGAGACAAGCCACTACGTGGCCTTCGTGAAGTACGGCAAGGATGACAGCGCCTGGCTGTTCTTCGACAGCATGGCCGATAGAGATGGCGGCCAGAACGGCTTCAACATCCCTCAAGTGACCCCTTGTCCTGAAGTGGGAGAGTACCTGAAGATGAGCCTGGAAGATCTGCACAGCCTGGACTCCAGACGGATCCAGGGATGTGCTAGAAGGCTGCTGTGCGACGCCTACATGTGCATGTATCAGAGCCCCACCATGAGCCTGTACAAGTGA [서열번호 179]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 179에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 179의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 180으로 제공된다:
AUGUCUAGCGGCCUGUGGUCCCAAGAGAAAGUGACAAGCCCCUACUGGGAAGAGAGGAUCUUCUACCUGCUGCUGCAAGAGUGCAGCGUGACCGACAAGCAGACCCAGAAACUGCUGAAGGUGCCCAAGGGCAGCAUCGGCCAGUACAUCCAGGAUAGAAGCGUGGGCCACAGCAGAAUCCCUAGCGCCAAGGGCAAGAAGAACCAGAUCGGCCUGAAGAUCCUGGAACAGCCUCACGCCGUGCUGUUCGUGGACGAGAAGGACGUGGUGGAAAUCAACGAGAAGUUCACCGAGCUGCUGCUGGCCAUCACCAACUGCGAGGAACGGUUCAGCCUGUUCAAGAACCGGAACCGGCUGAGCAAGGGCCUGCAGAUCGAUGUGGGAUGCCCUGUGAAGGUGCAGCUGAGAAGCGGCGAAGAGAAGUUCCCUGGCGUCGUGCGGUUUAGAGGACCUCUGCUGGCCGAGAGAACCGUGUCCGGCAUCUUCUUUGGCGUGGAACUGCUGGAAGAAGGCAGAGGCCAGGGCUUUACCGAUGGCGUGUACCAGGGCAAGCAGCUGUUUCAGUGCGACGAGGAUUGCGGCGUGUUCGUGGCCCUGGAUAAGCUGGAACUGAUCGAGGACGACGACACAGCCCUGGAAAGCGAUUAUGCCGGACCUGGCGAUACCAUGCAGGUCGAACUGCCUCCACUCGAGAUCAACAGCCGGGUGUCCCUGAAAGUGGGCGAGACAAUCGAGAGCGGCACCGUGAUCUUUUGCGACGUGCUGCCUGGCAAAGAGUCCCUGGGCUAUUUUGUGGGCGUCGACAUGGACAACCCCAUCGGCAAUUGGGACGGCAGAUUUGACGGCGUGCAGCUGUGCAGCUUCGCCUGUGUGGAAAGCACCAUCCUGCUGCACAUCAACGACAUCAUCCCCGCUCUGAGCGAGAGCGUGACCCAAGAAAGACGGCCUCCUAAGCUGGCCUUCAUGUCUAGAGGCGUGGGCGAUAAGGGCAGCUCCAGCCACAACAAGCCUAAGGCCACAGGCUCCACAAGCGACCCCGGCAACAGAAACAGAAGCGAGCUGUUCUACACCCUGAACGGCAGCAGCGUGGACAGCCAGCCUCAGAGCAAGAGCAAGAACACCUGGUACAUCGACGAGGUGGCCGAGGAUCCUGCCAAGAGCCUGACAGAGAUCAGCACCGACUUCGACAGAAGCAGCCCUCCACUGCAGCCUCCACCUGUGAAUAGCCUGACCACCGAGAACAGAUUCCACAGCCUGCCUUUCAGCCUGACUAAGAUGCCCAACACCAACGGCUCCAUCGGGCACUCUCCACUGUCUCUGUCUGCCCAGAGCGUGAUGGAAGAACUGAACACAGCCCCUGUGCAAGAGUCCCCUCCUCUGGCUAUGCCUCCUGGCAAUUCUCACGGCCUGGAAGUGGGAUCUCUGGCCGAAGUGAAAGAGAACCCUCCUUUCUACGGCGUGAUCCGGUGGAUCGGACAACCUCCUGGACUGAAUGAAGUGCUGGCCGGACUGGAACUGGAAGAUGAGUGUGCCGGCUGCACCGACGGCACCUUUAGAGGCACCAGAUACUUCACAUGCGCCCUGAAGAAAGCCCUGUUCGUGAAGCUGAAGUCCUGCAGACCCGACAGCAGAUUCGCUAGCCUGCAGCCUGUGUCCAAUCAGAUCGAGCGGUGCAACUCCCUGGCCUUUGGCGGCUAUCUGUCCGAGGUGGUGGAAGAGAACACCCCUCCUAAGAUGGAAAAAGAGGGCCUCGAGAUUAUGAUCGGGAAGAAGAAGGGCAUCCAGGGGCACUACAAUAGCUGCUACCUGGACAGCACCCUGUUCUGCCUGUUCGCCUUUAGCAGCGUGCUGGACACUGUGCUGCUGCGGCCCAAAGAGAAGAACGACGUCGAGUACUACAGCGAGACACAAGAGCUGCUGAGAACCGAGAUCGUGAACCCUCUGCGGAUCUACGGCUACGUGUGCGCCACCAAGAUCAUGAAGCUGCGGAAGAUUCUGGAAAAGGUGGAAGCCGCCUCCGGCUUCACCAGCGAGGAAAAGGAUCCCGAAGAGUUCCUGAACAUCCUGUUUCACCACAUCCUGAGAGUGGAACCCCUGCUGAAGAUCAGAUCCGCCGGACAGAAAGUGCAGGACUGCUACUUCUACCAGAUCUUCAUGGAAAAGAACGAGAAAGUCGGCGUGCCCACCAUCCAGCAACUGCUCGAGUGGUCCUUCAUCAACAGCAACCUGAAGUUCGCCGAGGCUCCCAGCUGCCUGAUCAUCCAGAUGCCUAGAUUCGGCAAGGACUUCAAGCUGUUCAAAAAGAUCUUCCCCAGCCUCGAGCUGAACAUCACCGACCUGCUCGAGGACACCCCUCGGCAGUGUAGAAUUUGUGGCGGCCUGGCUAUGUACGAGUGCAGAGAGUGCUACGACGACCCCGAUAUCAGCGCCGGCAAGAUCAAGCAGUUCUGCAAGACCUGCAACACCCAAGUGCAUCUGCACCCCAAGCGGCUGAACCACAAGUACAACCCCGUGUCUCUGCCCAAGGACCUGCCUGACUGGGAUUGGAGACACGGCUGUAUCCCUUGCCAGAACAUGGAACUGUUCGCUGUGCUGUGCAUCGAGACAAGCCACUACGUGGCCUUCGUGAAGUACGGCAAGGAUGACAGCGCCUGGCUGUUCUUCGACAGCAUGGCCGAUAGAGAUGGCGGCCAGAACGGCUUCAACAUCCCUCAAGUGACCCCUUGUCCUGAAGUGGGAGAGUACCUGAAGAUGAGCCUGGAAGAUCUGCACAGCCUGGACUCCAGACGGAUCCAGGGAUGUGCUAGAAGGCUGCUGUGCGACGCCUACAUGUGCAUGUAUCAGAGCCCCACCAUGAGCCUGUACAAGUGA [서열번호 180]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 180에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 LGP2(NCBI 참조 서열: NM_024119.3; UniProtKB - Q96C10(DHX58_HUMAN), 또는 그의 동원체일 수 있다. [Rothenfusser, S., N. Goutagny, G. DiPerna, M. Gong, B. G. Monks, A. Schoenemeyer, M. Yamamoto, S. Akira, K. A. Fitzgerald. 2005. The RNA helicase LGP2 inhibits TLR-independent sensing of viral replication by retinoic acid-inducible gene-I. J. Immunol. 175: 5260-5268]; [Komuro, A., C. M. Horvath. 2006. RNA and virus-independent inhibition of antiviral signaling by RNA helicase LGP2. J. Virol. 80: 12332-12342].
상기 LGP2의 한 구현예는 본 명세서에서 다음과 같은 서열번호 181로 나타낸다:
MELRSYQWEVIMPALEGKNIIIWLPTGAGKTRAAAYVAKRHLETVDGAKVVVLVNRVHLVTQHGEEFRRMLDGRWTVTTLSGDMGPRAGFGHLARCHDLLICTAELLQMALTSPEEEEHVELTVFSLIVVDECHHTHKDTVYNVIMSQYLELKLQRAQPLPQVLGLTASPGTGGASKLDGAINHVLQLCANLDTWCIMSPQNCCPQLQEHSQQPCKQYNLCHRRSQDPFGDLLKKLMDQIHDHLEMPELSRKFGTQMYEQQVVKLSEAAALAGLQEQRVYALHLRRYNDALLIHDTVRAVDALAALQDFYHREHVTKTQILCAERRLLALFDDRKNELAHLATHGPENPKLEMLEKILQRQFSSSNSPRGIIFTRTRQSAHSLLLWLQQQQGLQTVDIRAQLLIGAGNSSQSTHMTQRDQQEVIQKFQDGTLNLLVATSVAEEGLDIPHCNVVVRYGLLTNEISMVQARGRARADQSVYAFVATEGSRELKRELINEALETLMEQAVAAVQKMDQAEYQAKIRDLQQAALTKRAAQAAQRENQRQQFPVEHVQLLCINCMVAVGHGSDLRKVEGTHHVNVNPNFSNYYNVSRDPVVINKVFKDWKPGGVISCRNCGEVWGLQMIYKSVKLPVLKVRSMLLETPQGRIQAKKWSRVPFSVPDFDFLQHCAENLSDLSLD [서열번호 181]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 181에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 LGP2 폴리펩티드는 다음과 같은 서열번호 182의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGAGCTTCGGTCCTACCAATGGGAGGTGATCATGCCTGCCCTGGAGGGCAAGAATATCATCATCTGGCTGCCCACGGGTGCCGGGAAGACCCGGGCGGCTGCTTATGTGGCCAAGCGGCACCTAGAGACTGTGGATGGAGCCAAGGTGGTTGTATTGGTCAACAGGGTGCACCTGGTGACCCAGCATGGTGAAGAGTTCAGGCGCATGCTGGATGGACGCTGGACCGTGACAACCCTGAGTGGGGACATGGGACCACGTGCTGGCTTTGGCCACCTGGCCCGGTGCCATGACCTGCTCATCTGCACAGCAGAGCTTCTGCAGATGGCACTGACCAGCCCCGAGGAGGAGGAGCACGTGGAGCTCACTGTCTTCTCCCTGATCGTGGTGGATGAGTGCCACCACACGCACAAGGACACCGTCTACAACGTCATCATGAGCCAGTACCTAGAACTTAAACTCCAGAGGGCACAGCCGCTACCCCAGGTGCTGGGTCTCACAGCCTCCCCAGGCACTGGCGGGGCCTCCAAACTCGATGGGGCCATCAACCACGTCCTGCAGCTCTGTGCCAACTTGGACACGTGGTGCATCATGTCACCCCAGAACTGCTGCCCCCAGCTGCAGGAGCACAGCCAACAGCCTTGCAAACAGTACAACCTCTGCCACAGGCGCAGCCAGGATCCGTTTGGGGACTTGCTGAAGAAGCTCATGGACCAAATCCATGACCACCTGGAGATGCCTGAGTTGAGCCGGAAATTTGGGACGCAAATGTATGAGCAGCAGGTGGTGAAGCTGAGTGAGGCTGCGGCTTTGGCTGGGCTTCAGGAGCAACGGGTGTATGCGCTTCACCTGAGGCGCTACAATGACGCGCTGCTCATCCATGACACCGTCCGCGCCGTGGATGCCTTGGCTGCGCTGCAGGATTTCTATCACAGGGAGCACGTCACTAAAACCCAGATCCTGTGTGCCGAGCGCCGGCTGCTGGCCCTGTTCGATGACCGCAAGAATGAGCTGGCCCACTTGGCAACTCATGGCCCAGAGAATCCAAAACTGGAGATGCTGGAAAAGATCCTGCAAAGGCAGTTCAGTAGCTCTAACAGCCCTCGGGGTATCATCTTCACCCGCACCCGCCAAAGCGCACACTCCCTCCTGCTCTGGCTCCAGCAGCAGCAGGGCCTGCAGACTGTGGACATCCGGGCCCAGCTACTGATTGGGGCTGGGAACAGCAGCCAGAGCACCCACATGACCCAGAGGGACCAGCAAGAAGTGATCCAGAAGTTCCAAGATGGAACCCTGAACCTTCTGGTGGCCACGAGTGTGGCGGAGGAGGGGCTGGACATCCCACATTGCAATGTGGTGGTGCGTTATGGGCTCTTGACCAATGAAATCTCCATGGTCCAGGCCAGGGGCCGTGCCCGGGCCGATCAGAGTGTATACGCGTTTGTAGCAACTGAAGGTAGCCGGGAGCTGAAGCGGGAGCTGATCAACGAGGCGCTGGAGACGCTGATGGAGCAGGCAGTGGCTGCTGTGCAGAAAATGGACCAGGCCGAGTACCAGGCCAAGATCCGGGATCTGCAGCAGGCAGCCTTGACCAAGCGGGCGGCCCAGGCAGCCCAGCGGGAGAACCAGCGGCAGCAGTTCCCAGTGGAGCACGTGCAGCTACTCTGCATCAACTGCATGGTGGCTGTGGGCCATGGCAGCGACCTGCGGAAGGTGGAGGGCACCCACCATGTCAATGTGAACCCCAACTTCTCGAACTACTATAATGTCTCCAGGGATCCTGTGGTCATCAACAAAGTCTTCAAGGACTGGAAGCCTGGGGGTGTCATCAGCTGCAGGAACTGTGGGGAGGTCTGGGGTCTGCAGATGATCTACAAGTCAGTGAAGCTGCCAGTGCTCAAAGTCCGCAGCATGCTGCTGGAGACCCCTCAGGGGCGGATCCAGGCCAAAAAGTGGTCCCGCGTGCCCTTCTCCGTGCCTGACTTTGACTTCCTGCAGCATTGTGCCGAGAACTTGTCGGACCTCTCCCTGGAC [서열번호 182]
따라서, 바람직하게는 상기 LGP2 폴리펩티드는 실질적으로 서열번호 182에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 183의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGAGCUUCGGUCCUACCAAUGGGAGGUGAUCAUGCCUGCCCUGGAGGGCAAGAAUAUCAUCAUCUGGCUGCCCACGGGUGCCGGGAAGACCCGGGCGGCUGCUUAUGUGGCCAAGCGGCACCUAGAGACUGUGGAUGGAGCCAAGGUGGUUGUAUUGGUCAACAGGGUGCACCUGGUGACCCAGCAUGGUGAAGAGUUCAGGCGCAUGCUGGAUGGACGCUGGACCGUGACAACCCUGAGUGGGGACAUGGGACCACGUGCUGGCUUUGGCCACCUGGCCCGGUGCCAUGACCUGCUCAUCUGCACAGCAGAGCUUCUGCAGAUGGCACUGACCAGCCCCGAGGAGGAGGAGCACGUGGAGCUCACUGUCUUCUCCCUGAUCGUGGUGGAUGAGUGCCACCACACGCACAAGGACACCGUCUACAACGUCAUCAUGAGCCAGUACCUAGAACUUAAACUCCAGAGGGCACAGCCGCUACCCCAGGUGCUGGGUCUCACAGCCUCCCCAGGCACUGGCGGGGCCUCCAAACUCGAUGGGGCCAUCAACCACGUCCUGCAGCUCUGUGCCAACUUGGACACGUGGUGCAUCAUGUCACCCCAGAACUGCUGCCCCCAGCUGCAGGAGCACAGCCAACAGCCUUGCAAACAGUACAACCUCUGCCACAGGCGCAGCCAGGAUCCGUUUGGGGACUUGCUGAAGAAGCUCAUGGACCAAAUCCAUGACCACCUGGAGAUGCCUGAGUUGAGCCGGAAAUUUGGGACGCAAAUGUAUGAGCAGCAGGUGGUGAAGCUGAGUGAGGCUGCGGCUUUGGCUGGGCUUCAGGAGCAACGGGUGUAUGCGCUUCACCUGAGGCGCUACAAUGACGCGCUGCUCAUCCAUGACACCGUCCGCGCCGUGGAUGCCUUGGCUGCGCUGCAGGAUUUCUAUCACAGGGAGCACGUCACUAAAACCCAGAUCCUGUGUGCCGAGCGCCGGCUGCUGGCCCUGUUCGAUGACCGCAAGAAUGAGCUGGCCCACUUGGCAACUCAUGGCCCAGAGAAUCCAAAACUGGAGAUGCUGGAAAAGAUCCUGCAAAGGCAGUUCAGUAGCUCUAACAGCCCUCGGGGUAUCAUCUUCACCCGCACCCGCCAAAGCGCACACUCCCUCCUGCUCUGGCUCCAGCAGCAGCAGGGCCUGCAGACUGUGGACAUCCGGGCCCAGCUACUGAUUGGGGCUGGGAACAGCAGCCAGAGCACCCACAUGACCCAGAGGGACCAGCAAGAAGUGAUCCAGAAGUUCCAAGAUGGAACCCUGAACCUUCUGGUGGCCACGAGUGUGGCGGAGGAGGGGCUGGACAUCCCACAUUGCAAUGUGGUGGUGCGUUAUGGGCUCUUGACCAAUGAAAUCUCCAUGGUCCAGGCCAGGGGCCGUGCCCGGGCCGAUCAGAGUGUAUACGCGUUUGUAGCAACUGAAGGUAGCCGGGAGCUGAAGCGGGAGCUGAUCAACGAGGCGCUGGAGACGCUGAUGGAGCAGGCAGUGGCUGCUGUGCAGAAAAUGGACCAGGCCGAGUACCAGGCCAAGAUCCGGGAUCUGCAGCAGGCAGCCUUGACCAAGCGGGCGGCCCAGGCAGCCCAGCGGGAGAACCAGCGGCAGCAGUUCCCAGUGGAGCACGUGCAGCUACUCUGCAUCAACUGCAUGGUGGCUGUGGGCCAUGGCAGCGACCUGCGGAAGGUGGAGGGCACCCACCAUGUCAAUGUGAACCCCAACUUCUCGAACUACUAUAAUGUCUCCAGGGAUCCUGUGGUCAUCAACAAAGUCUUCAAGGACUGGAAGCCUGGGGGUGUCAUCAGCUGCAGGAACUGUGGGGAGGUCUGGGGUCUGCAGAUGAUCUACAAGUCAGUGAAGCUGCCAGUGCUCAAAGUCCGCAGCAUGCUGCUGGAGACCCCUCAGGGGCGGAUCCAGGCCAAAAAGUGGUCCCGCGUGCCCUUCUCCGUGCCUGACUUUGACUUCCUGCAGCAUUGUGCCGAGAACUUGUCGGACCUCUCCCUGGAC [서열번호 183]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 183에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 181의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 184로 제공된다:
ATGGAACTGCGGAGCTACCAGTGGGAAGTGATCATGCCTGCTCTGGAAGGCAAGAACATCATCATCTGGCTGCCCACCGGCGCTGGCAAAACAAGAGCTGCTGCCTACGTGGCCAAGCGGCACCTGGAAACAGTGGATGGCGCTAAGGTGGTGGTGCTGGTCAACAGAGTGCACCTGGTTACCCAGCACGGCGAGGAATTCAGAAGAATGCTGGACGGCCGGTGGACCGTGACAACACTGTCTGGCGATATGGGCCCTAGAGCCGGCTTTGGACACCTGGCCAGATGCCACGATCTGCTGATCTGTACAGCCGAACTGCTGCAGATGGCCCTGACAAGCCCTGAGGAAGAGGAACACGTCGAGCTGACCGTGTTCAGCCTGATCGTGGTGGACGAGTGCCACCACACACACAAGGACACCGTGTACAACGTGATCATGAGCCAGTACCTGGAACTGAAGCTGCAGAGAGCCCAGCCTCTGCCTCAAGTGCTGGGACTGACAGCCTCTCCTGGAACAGGCGGAGCCTCTAAACTGGACGGCGCCATCAATCACGTGCTGCAGCTGTGCGCCAACCTGGATACCTGGTGCATCATGTCCCCACAGAACTGCTGTCCCCAGCTGCAAGAGCACTCTCAGCAGCCCTGCAAGCAGTACAACCTGTGCCACAGAAGATCTCAGGACCCCTTCGGCGACCTGCTGAAGAAACTGATGGACCAGATCCACGACCACCTCGAGATGCCCGAGCTGAGCAGAAAGTTCGGCACCCAGATGTACGAGCAGCAGGTTGTGAAGCTGAGCGAAGCCGCTGCTCTGGCCGGACTGCAAGAACAGAGAGTGTACGCCCTGCACCTGAGGCGGTACAATGATGCCCTGCTGATCCACGATACCGTGCGCGCTGTTGATGCTCTGGCTGCTCTGCAGGATTTCTACCACCGCGAGCACGTGACCAAGACACAGATCCTGTGTGCCGAGAGAAGGCTGCTGGCCCTGTTCGACGACAGAAAGAATGAGCTGGCCCACCTGGCTACACACGGCCCCGAAAATCCCAAGCTGGAAATGCTGGAAAAGATCCTGCAGCGGCAGTTCAGCAGCAGCAACAGCCCTAGAGGCATCATCTTCACCCGGACCAGACAGAGCGCCCACTCTCTGCTGCTGTGGCTGCAGCAACAACAGGGACTGCAGACCGTGGACATTAGGGCCCAGCTGCTGATCGGAGCCGGCAATAGCTCTCAGAGCACCCACATGACCCAGCGGGACCAGCAAGAAGTGATCCAGAAGTTCCAGGACGGCACCCTGAATCTGCTGGTGGCCACATCTGTGGCTGAGGAAGGCCTGGATATCCCTCACTGCAACGTGGTCGTCAGATACGGCCTGCTGACCAACGAGATCAGCATGGTGCAGGCCAGAGGCAGAGCCAGAGCCGATCAGTCTGTGTACGCCTTCGTGGCTACAGAGGGCTCCAGAGAGCTGAAGCGCGAGCTGATCAATGAGGCCCTGGAAACCCTGATGGAACAAGCCGTGGCCGCCGTGCAGAAAATGGATCAGGCCGAGTACCAGGCCAAGATCAGGGATCTGCAACAGGCCGCTCTGACCAAGAGAGCTGCTCAGGCTGCCCAGAGAGAGAACCAGAGACAGCAATTCCCCGTGGAACACGTGCAGCTGCTGTGTATCAACTGCATGGTGGCCGTCGGACACGGCAGCGATCTGAGAAAAGTGGAAGGCACCCACCACGTGAACGTGAACCCCAACTTCAGCAACTACTACAACGTGTCCAGAGATCCCGTGGTCATCAACAAGGTGTTCAAGGACTGGAAGCCTGGCGGCGTGATCAGCTGCAGAAATTGCGGAGAAGTGTGGGGCCTGCAGATGATCTACAAGAGCGTGAAGCTGCCCGTGCTGAAAGTGCGGAGCATGCTGCTGGAAACACCCCAGGGAAGAATCCAGGCCAAAAAGTGGTCCAGAGTGCCCTTCAGCGTGCCCGACTTCGATTTCCTGCAGCACTGCGCCGAGAACCTGAGCGATCTGTCCCTGGATTGA [서열번호 184]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 184에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 184의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 185로 제공된다:
AUGGAACUGCGGAGCUACCAGUGGGAAGUGAUCAUGCCUGCUCUGGAAGGCAAGAACAUCAUCAUCUGGCUGCCCACCGGCGCUGGCAAAACAAGAGCUGCUGCCUACGUGGCCAAGCGGCACCUGGAAACAGUGGAUGGCGCUAAGGUGGUGGUGCUGGUCAACAGAGUGCACCUGGUUACCCAGCACGGCGAGGAAUUCAGAAGAAUGCUGGACGGCCGGUGGACCGUGACAACACUGUCUGGCGAUAUGGGCCCUAGAGCCGGCUUUGGACACCUGGCCAGAUGCCACGAUCUGCUGAUCUGUACAGCCGAACUGCUGCAGAUGGCCCUGACAAGCCCUGAGGAAGAGGAACACGUCGAGCUGACCGUGUUCAGCCUGAUCGUGGUGGACGAGUGCCACCACACACACAAGGACACCGUGUACAACGUGAUCAUGAGCCAGUACCUGGAACUGAAGCUGCAGAGAGCCCAGCCUCUGCCUCAAGUGCUGGGACUGACAGCCUCUCCUGGAACAGGCGGAGCCUCUAAACUGGACGGCGCCAUCAAUCACGUGCUGCAGCUGUGCGCCAACCUGGAUACCUGGUGCAUCAUGUCCCCACAGAACUGCUGUCCCCAGCUGCAAGAGCACUCUCAGCAGCCCUGCAAGCAGUACAACCUGUGCCACAGAAGAUCUCAGGACCCCUUCGGCGACCUGCUGAAGAAACUGAUGGACCAGAUCCACGACCACCUCGAGAUGCCCGAGCUGAGCAGAAAGUUCGGCACCCAGAUGUACGAGCAGCAGGUUGUGAAGCUGAGCGAAGCCGCUGCUCUGGCCGGACUGCAAGAACAGAGAGUGUACGCCCUGCACCUGAGGCGGUACAAUGAUGCCCUGCUGAUCCACGAUACCGUGCGCGCUGUUGAUGCUCUGGCUGCUCUGCAGGAUUUCUACCACCGCGAGCACGUGACCAAGACACAGAUCCUGUGUGCCGAGAGAAGGCUGCUGGCCCUGUUCGACGACAGAAAGAAUGAGCUGGCCCACCUGGCUACACACGGCCCCGAAAAUCCCAAGCUGGAAAUGCUGGAAAAGAUCCUGCAGCGGCAGUUCAGCAGCAGCAACAGCCCUAGAGGCAUCAUCUUCACCCGGACCAGACAGAGCGCCCACUCUCUGCUGCUGUGGCUGCAGCAACAACAGGGACUGCAGACCGUGGACAUUAGGGCCCAGCUGCUGAUCGGAGCCGGCAAUAGCUCUCAGAGCACCCACAUGACCCAGCGGGACCAGCAAGAAGUGAUCCAGAAGUUCCAGGACGGCACCCUGAAUCUGCUGGUGGCCACAUCUGUGGCUGAGGAAGGCCUGGAUAUCCCUCACUGCAACGUGGUCGUCAGAUACGGCCUGCUGACCAACGAGAUCAGCAUGGUGCAGGCCAGAGGCAGAGCCAGAGCCGAUCAGUCUGUGUACGCCUUCGUGGCUACAGAGGGCUCCAGAGAGCUGAAGCGCGAGCUGAUCAAUGAGGCCCUGGAAACCCUGAUGGAACAAGCCGUGGCCGCCGUGCAGAAAAUGGAUCAGGCCGAGUACCAGGCCAAGAUCAGGGAUCUGCAACAGGCCGCUCUGACCAAGAGAGCUGCUCAGGCUGCCCAGAGAGAGAACCAGAGACAGCAAUUCCCCGUGGAACACGUGCAGCUGCUGUGUAUCAACUGCAUGGUGGCCGUCGGACACGGCAGCGAUCUGAGAAAAGUGGAAGGCACCCACCACGUGAACGUGAACCCCAACUUCAGCAACUACUACAACGUGUCCAGAGAUCCCGUGGUCAUCAACAAGGUGUUCAAGGACUGGAAGCCUGGCGGCGUGAUCAGCUGCAGAAAUUGCGGAGAAGUGUGGGGCCUGCAGAUGAUCUACAAGAGCGUGAAGCUGCCCGUGCUGAAAGUGCGGAGCAUGCUGCUGGAAACACCCCAGGGAAGAAUCCAGGCCAAAAAGUGGUCCAGAGUGCCCUUCAGCGUGCCCGACUUCGAUUUCCUGCAGCACUGCGCCGAGAACCUGAGCGAUCUGUCCCUGGAUUGA [서열번호 185]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 185에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 DDX-56(NCBI 참조 서열: NM_019082.4; UniProtKB - Q9NY93(DDX56_HUMAN), 또는 그의 동원체일 수 있다. [Li D, Fu S, Wu Z, Yang W, Ru Y, Shu H, Liu X, Zheng H. DDX56 inhibits type I interferon by disrupting assembly of IRF3-IPO5 to inhibit IRF3 nucleus import. J Cell Sci. 2020; 133(5): jcs230409). 상기 DDX-56의 한 구현예는 본 명세서에서 다음과 같은 서열번호 191로 나타낸다:
MEDSEALGFEHMGLDPRLLQAVTDLGWSRPTLIQEKAIPLALEGKDLLARARTGSGKTAAYAIPMLQLLLHRKATGPVVEQAVRGLVLVPTKELARQAQSMIQQLATYCARDVRVANVSAAEDSVSQRAVLMEKPDVVVGTPSRILSHLQQDSLKLRDSLELLVVDEADLLFSFGFEEELKSLLCHLPRIYQAFLMSATFNEDVQALKELILHNPVTLKLQESQLPGPDQLQQFQVVCETEEDKFLLLYALLKLSLIRGKSLLFVNTLERSYRLRLFLEQFSIPTCVLNGELPLRSRCHIISQFNQGFYDCVIATDAEVLGAPVKGKRRGRGPKGDKASDPEAGVARGIDFHHVSAVLNFDLPPTPEAYIHRAGRTARANNPGIVLTFVLPTEQFHLGKIEELLSGENRGPILLPYQFRMEEIEGFRYRCRDAMRSVTKQAIREARLKEIKEELLHSEKLKTYFEDNPRDLQLLRHDLPLHPAVVKPHLGHVPDYLVPPALRGLVRPHKKRKKLSSSCRKAKRAKSQNPLRSFKHKGKKFRPTAKPS [서열번호 191]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 191에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 DDX-56 폴리펩티드는 다음과 같은 서열번호 192의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGAGGACTCTGAAGCACTGGGCTTCGAACACATGGGCCTCGATCCCCGGCTCCTTCAGGCTGTCACCGATCTGGGCTGGTCGCGACCTACGCTGATCCAGGAGAAGGCCATCCCACTGGCCCTAGAAGGGAAGGACCTCCTGGCTCGGGCCCGCACGGGCTCCGGGAAGACGGCCGCTTATGCTATTCCGATGCTGCAGCTGTTGCTCCATAGGAAGGCGACAGGTCCGGTGGTAGAACAGGCAGTGAGAGGCCTTGTTCTTGTTCCTACCAAGGAGCTGGCACGGCAAGCACAGTCCATGATTCAGCAGCTGGCTACCTACTGTGCTCGGGATGTCCGAGTGGCCAATGTCTCAGCTGCTGAAGACTCAGTCTCTCAGAGAGCTGTGCTGATGGAGAAGCCAGATGTGGTAGTAGGGACCCCATCTCGCATATTAAGCCACTTGCAGCAAGACAGCCTGAAACTTCGTGACTCCCTGGAGCTTTTGGTGGTGGACGAAGCTGACCTTCTTTTTTCCTTTGGCTTTGAAGAAGAGCTCAAGAGTCTCCTCTGTCACTTGCCCCGGATTTACCAGGCTTTTCTCATGTCAGCTACTTTTAACGAGGACGTACAAGCACTCAAGGAGCTGATATTACATAACCCGGTTACCCTTAAGTTACAGGAGTCCCAGCTGCCTGGGCCAGACCAGTTACAGCAGTTTCAGGTGGTCTGTGAGACTGAGGAAGACAAATTCCTCCTGCTGTATGCCCTGCTCAAGCTGTCATTGATTCGGGGCAAGTCTCTGCTCTTTGTCAACACTCTAGAACGGAGTTACCGGCTACGCCTGTTCTTGGAACAGTTCAGCATCCCCACCTGTGTGCTCAATGGAGAGCTTCCACTGCGCTCCAGGTGCCACATCATCTCACAGTTCAACCAAGGCTTCTACGACTGTGTCATAGCAACTGATGCTGAAGTCCTGGGGGCCCCAGTCAAGGGCAAGCGTCGGGGCCGAGGGCCCAAAGGGGACAAGGCCTCTGATCCGGAAGCAGGTGTGGCCCGGGGCATAGACTTCCACCATGTGTCTGCTGTGCTCAACTTTGATCTTCCCCCAACCCCTGAGGCCTACATCCATCGAGCTGGCAGGACAGCACGCGCTAACAACCCAGGCATAGTCTTAACCTTTGTGCTTCCCACGGAGCAGTTCCACTTAGGCAAGATTGAGGAGCTTCTCAGTGGAGAGAACAGGGGCCCCATTCTGCTCCCCTACCAGTTCCGGATGGAGGAGATCGAGGGCTTCCGCTATCGCTGCAGGGATGCCATGCGCTCAGTGACTAAGCAGGCCATTCGGGAGGCAAGATTGAAGGAGATCAAGGAAGAGCTTCTGCATTCTGAGAAGCTTAAGACATACTTTGAAGACAACCCTAGGGACCTCCAGCTGCTGCGGCATGACCTACCTTTGCACCCCGCAGTGGTGAAGCCCCACCTGGGCCATGTTCCTGACTACCTGGTTCCTCCTGCTCTCCGTGGCCTGGTGCGCCCTCACAAGAAGCGGAAGAAGCTGTCTTCCTCTTGTAGGAAGGCCAAGAGAGCAAAGTCCCAGAACCCACTGCGCAGCTTCAAGCACAAAGGAAAGAAATTCAGACCCACAGCCAAGCCCTCC [서열번호 192]
따라서, 바람직하게는 상기 DDX-56 폴리펩티드는 실질적으로 서열번호 192에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 193의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGAGGACUCUGAAGCACUGGGCUUCGAACACAUGGGCCUCGAUCCCCGGCUCCUUCAGGCUGUCACCGAUCUGGGCUGGUCGCGACCUACGCUGAUCCAGGAGAAGGCCAUCCCACUGGCCCUAGAAGGGAAGGACCUCCUGGCUCGGGCCCGCACGGGCUCCGGGAAGACGGCCGCUUAUGCUAUUCCGAUGCUGCAGCUGUUGCUCCAUAGGAAGGCGACAGGUCCGGUGGUAGAACAGGCAGUGAGAGGCCUUGUUCUUGUUCCUACCAAGGAGCUGGCACGGCAAGCACAGUCCAUGAUUCAGCAGCUGGCUACCUACUGUGCUCGGGAUGUCCGAGUGGCCAAUGUCUCAGCUGCUGAAGACUCAGUCUCUCAGAGAGCUGUGCUGAUGGAGAAGCCAGAUGUGGUAGUAGGGACCCCAUCUCGCAUAUUAAGCCACUUGCAGCAAGACAGCCUGAAACUUCGUGACUCCCUGGAGCUUUUGGUGGUGGACGAAGCUGACCUUCUUUUUUCCUUUGGCUUUGAAGAAGAGCUCAAGAGUCUCCUCUGUCACUUGCCCCGGAUUUACCAGGCUUUUCUCAUGUCAGCUACUUUUAACGAGGACGUACAAGCACUCAAGGAGCUGAUAUUACAUAACCCGGUUACCCUUAAGUUACAGGAGUCCCAGCUGCCUGGGCCAGACCAGUUACAGCAGUUUCAGGUGGUCUGUGAGACUGAGGAAGACAAAUUCCUCCUGCUGUAUGCCCUGCUCAAGCUGUCAUUGAUUCGGGGCAAGUCUCUGCUCUUUGUCAACACUCUAGAACGGAGUUACCGGCUACGCCUGUUCUUGGAACAGUUCAGCAUCCCCACCUGUGUGCUCAAUGGAGAGCUUCCACUGCGCUCCAGGUGCCACAUCAUCUCACAGUUCAACCAAGGCUUCUACGACUGUGUCAUAGCAACUGAUGCUGAAGUCCUGGGGGCCCCAGUCAAGGGCAAGCGUCGGGGCCGAGGGCCCAAAGGGGACAAGGCCUCUGAUCCGGAAGCAGGUGUGGCCCGGGGCAUAGACUUCCACCAUGUGUCUGCUGUGCUCAACUUUGAUCUUCCCCCAACCCCUGAGGCCUACAUCCAUCGAGCUGGCAGGACAGCACGCGCUAACAACCCAGGCAUAGUCUUAACCUUUGUGCUUCCCACGGAGCAGUUCCACUUAGGCAAGAUUGAGGAGCUUCUCAGUGGAGAGAACAGGGGCCCCAUUCUGCUCCCCUACCAGUUCCGGAUGGAGGAGAUCGAGGGCUUCCGCUAUCGCUGCAGGGAUGCCAUGCGCUCAGUGACUAAGCAGGCCAUUCGGGAGGCAAGAUUGAAGGAGAUCAAGGAAGAGCUUCUGCAUUCUGAGAAGCUUAAGACAUACUUUGAAGACAACCCUAGGGACCUCCAGCUGCUGCGGCAUGACCUACCUUUGCACCCCGCAGUGGUGAAGCCCCACCUGGGCCAUGUUCCUGACUACCUGGUUCCUCCUGCUCUCCGUGGCCUGGUGCGCCCUCACAAGAAGCGGAAGAAGCUGUCUUCCUCUUGUAGGAAGGCCAAGAGAGCAAAGUCCCAGAACCCACUGCGCAGCUUCAAGCACAAAGGAAAGAAAUUCAGACCCACAGCCAAGCCCUCC [서열번호 193]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 193에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 191의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 194로 제공된다:
ATGGAAGATTCTGAGGCCCTGGGCTTCGAGCACATGGGCCTTGATCCTAGACTGCTGCAGGCCGTGACAGATCTCGGATGGTCCAGACCTACACTGATCCAAGAGAAGGCCATTCCTCTGGCTCTGGAAGGCAAGGACCTGCTGGCCAGAGCTAGAACAGGCTCTGGCAAGACAGCCGCCTACGCTATCCCTATGCTGCAGCTGCTGCTGCACAGAAAGGCCACAGGACCAGTGGTGGAACAGGCCGTTAGAGGACTGGTGCTGGTGCCCACAAAAGAGCTGGCTAGACAGGCCCAGAGCATGATCCAGCAGCTGGCCACATACTGCGCCAGAGATGTGCGAGTGGCCAATGTGTCTGCCGCCGAGGATTCTGTGTCTCAGAGGGCCGTGCTGATGGAAAAGCCCGATGTGGTCGTGGGCACCCCTAGCAGAATCCTGTCTCATCTGCAGCAGGACAGCCTGAAGCTGAGAGACAGCCTGGAACTGCTGGTGGTGGATGAGGCCGATCTGCTGTTCAGCTTCGGCTTCGAGGAAGAACTGAAGTCCCTGCTGTGCCATCTGCCTCGGATCTACCAGGCCTTCCTGATGAGCGCCACCTTCAACGAAGATGTGCAGGCCCTGAAAGAGCTGATCCTGCACAACCCCGTGACACTGAAGCTGCAAGAGAGCCAGCTGCCAGGACCTGATCAGCTCCAGCAGTTTCAAGTCGTGTGCGAGACAGAAGAGGACAAGTTCCTGCTGCTGTACGCCCTGCTGAAGCTGTCCCTGATCAGAGGCAAGAGCCTGCTGTTCGTGAACACCCTGGAAAGAAGCTACCGGCTGCGGCTGTTTCTGGAACAGTTCAGCATCCCTACCTGCGTGCTGAACGGCGAGCTGCCTCTGAGAAGCAGATGCCACATCATCAGCCAGTTCAACCAGGGCTTCTACGACTGCGTGATCGCCACAGATGCCGAAGTGCTGGGAGCACCCGTGAAGGGCAAAAGAAGAGGCAGAGGCCCCAAGGGCGATAAGGCCAGTGATCCTGAAGCAGGCGTGGCCAGAGGCATCGATTTTCACCATGTGTCCGCTGTGCTGAACTTCGACCTGCCACCTACACCTGAGGCCTACATCCACAGAGCCGGCAGAACAGCCAGAGCCAACAATCCTGGCATCGTGCTGACCTTCGTGCTGCCTACCGAACAGTTCCACCTGGGCAAGATCGAAGAACTGCTGTCCGGCGAGAACAGGGGCCCTATCCTGCTGCCTTACCAGTTCCGGATGGAAGAGATCGAGGGCTTCAGATACAGATGCAGGGACGCCATGCGGAGCGTGACAAAGCAGGCCATTAGAGAGGCCCGGCTGAAAGAGATCAAAGAGGAACTGCTCCACAGCGAGAAGCTCAAGACCTACTTCGAGGACAACCCCAGGGACCTGCAGCTCCTGAGACATGATCTGCCTCTGCACCCTGCCGTGGTCAAACCTCATCTGGGACACGTGCCCGACTACCTGGTTCCTCCTGCTCTGAGAGGCCTTGTGCGCCCTCACAAGAAGCGGAAGAAGCTGAGCAGCTCTTGTCGGAAGGCCAAGCGGGCCAAGAGCCAGAATCCACTGAGAAGCTTCAAGCACAAGGGCAAGAAGTTCAGACCCACCGCCAAGCCTAGCTGA [서열번호 194]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 194에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 194의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 195로 제공된다:
AUGGAAGAUUCUGAGGCCCUGGGCUUCGAGCACAUGGGCCUUGAUCCUAGACUGCUGCAGGCCGUGACAGAUCUCGGAUGGUCCAGACCUACACUGAUCCAAGAGAAGGCCAUUCCUCUGGCUCUGGAAGGCAAGGACCUGCUGGCCAGAGCUAGAACAGGCUCUGGCAAGACAGCCGCCUACGCUAUCCCUAUGCUGCAGCUGCUGCUGCACAGAAAGGCCACAGGACCAGUGGUGGAACAGGCCGUUAGAGGACUGGUGCUGGUGCCCACAAAAGAGCUGGCUAGACAGGCCCAGAGCAUGAUCCAGCAGCUGGCCACAUACUGCGCCAGAGAUGUGCGAGUGGCCAAUGUGUCUGCCGCCGAGGAUUCUGUGUCUCAGAGGGCCGUGCUGAUGGAAAAGCCCGAUGUGGUCGUGGGCACCCCUAGCAGAAUCCUGUCUCAUCUGCAGCAGGACAGCCUGAAGCUGAGAGACAGCCUGGAACUGCUGGUGGUGGAUGAGGCCGAUCUGCUGUUCAGCUUCGGCUUCGAGGAAGAACUGAAGUCCCUGCUGUGCCAUCUGCCUCGGAUCUACCAGGCCUUCCUGAUGAGCGCCACCUUCAACGAAGAUGUGCAGGCCCUGAAAGAGCUGAUCCUGCACAACCCCGUGACACUGAAGCUGCAAGAGAGCCAGCUGCCAGGACCUGAUCAGCUCCAGCAGUUUCAAGUCGUGUGCGAGACAGAAGAGGACAAGUUCCUGCUGCUGUACGCCCUGCUGAAGCUGUCCCUGAUCAGAGGCAAGAGCCUGCUGUUCGUGAACACCCUGGAAAGAAGCUACCGGCUGCGGCUGUUUCUGGAACAGUUCAGCAUCCCUACCUGCGUGCUGAACGGCGAGCUGCCUCUGAGAAGCAGAUGCCACAUCAUCAGCCAGUUCAACCAGGGCUUCUACGACUGCGUGAUCGCCACAGAUGCCGAAGUGCUGGGAGCACCCGUGAAGGGCAAAAGAAGAGGCAGAGGCCCCAAGGGCGAUAAGGCCAGUGAUCCUGAAGCAGGCGUGGCCAGAGGCAUCGAUUUUCACCAUGUGUCCGCUGUGCUGAACUUCGACCUGCCACCUACACCUGAGGCCUACAUCCACAGAGCCGGCAGAACAGCCAGAGCCAACAAUCCUGGCAUCGUGCUGACCUUCGUGCUGCCUACCGAACAGUUCCACCUGGGCAAGAUCGAAGAACUGCUGUCCGGCGAGAACAGGGGCCCUAUCCUGCUGCCUUACCAGUUCCGGAUGGAAGAGAUCGAGGGCUUCAGAUACAGAUGCAGGGACGCCAUGCGGAGCGUGACAAAGCAGGCCAUUAGAGAGGCCCGGCUGAAAGAGAUCAAAGAGGAACUGCUCCACAGCGAGAAGCUCAAGACCUACUUCGAGGACAACCCCAGGGACCUGCAGCUCCUGAGACAUGAUCUGCCUCUGCACCCUGCCGUGGUCAAACCUCAUCUGGGACACGUGCCCGACUACCUGGUUCCUCCUGCUCUGAGAGGCCUUGUGCGCCCUCACAAGAAGCGGAAGAAGCUGAGCAGCUCUUGUCGGAAGGCCAAGCGGGCCAAGAGCCAGAAUCCACUGAGAAGCUUCAAGCACAAGGGCAAGAAGUUCAGACCCACCGCCAAGCCUAGCUGA [서열번호 195]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 195에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 ARL16(NCBI 참조 서열: NM_001040025.3; UniProtKB - Q0P5N6(ARL16_HUMAN), 또는 그의 동원체일 수 있다. [Yang Y-K, Qu H, Gao D, Di W, Chen H-W, Guo X, He Z_H, Chen D-Y. ARF-like protein 16 (ARL16) inhibits RIG-I by binding with its C-terminal domain in a GTP-dependent manner. J Biol Chem 2011;286(12):10568-10580]. 상기 ARL16의 한 구현예는 본 명세서에서 다음과 같은 서열번호 196으로 나타낸다:
MCLLLGATGVGKTLLVKRLQEVSSRDGKGDLGEPPPTRPTVGTNLTDIVAQRKITIRELGGCMGPIWSSYYGNCRSLLFVMDASDPTQLSASCVQLLGLLSAEQLAEASVLILFNKIDLPCYMSTEEMKSLIRLPDIIACAKQNITTAEISAREGTGLAGVLAWLQATHRAND [서열번호 196]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 196에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하느 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 ARL16 폴리펩티드는 다음과 같은 서열번호 197의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGTGTCTCCTGCTGGGGGCCACGGGCGTCGGGAAGACGCTGCTGGTGAAACGGCTGCAGGAGGTGAGCTCCCGGGATGGGAAAGGCGACCTGGGGGAGCCGCCCCCGACACGGCCCACGGTGGGCACCAATCTTACTGACATCGTGGCACAGAGAAAGATCACCATCCGGGAGCTTGGGGGGTGCATGGGCCCCATCTGGTCCAGTTACTATGGAAACTGCCGTTCTCTCCTGTTTGTGATGGACGCCTCTGACCCCACCCAGCTCTCTGCATCCTGTGTGCAGCTCTTAGGTCTCCTTTCTGCAGAACAACTTGCAGAAGCATCGGTGCTGATACTCTTCAATAAAATCGACCTACCCTGTTACATGTCCACGGAGGAGATGAAGTCATTAATCAGGCTTCCAGACATCATTGCTTGTGCCAAGCAGAACATCACCACGGCAGAAATCAGCGCCCGTGAAGGCACTGGCTTAGCAGGGGTGCTGGCCTGGCTCCAGGCCACCCACAGAGCCAACGAT [서열번호 197]
따라서, 바람직하게는 상기 ARL16 폴리펩티드는 실질적으로 서열번호 197에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 198의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGUGUCUCCUGCUGGGGGCCACGGGCGUCGGGAAGACGCUGCUGGUGAAACGGCUGCAGGAGGUGAGCUCCCGGGAUGGGAAAGGCGACCUGGGGGAGCCGCCCCCGACACGGCCCACGGUGGGCACCAAUCUUACUGACAUCGUGGCACAGAGAAAGAUCACCAUCCGGGAGCUUGGGGGGUGCAUGGGCCCCAUCUGGUCCAGUUACUAUGGAAACUGCCGUUCUCUCCUGUUUGUGAUGGACGCCUCUGACCCCACCCAGCUCUCUGCAUCCUGUGUGCAGCUCUUAGGUCUCCUUUCUGCAGAACAACUUGCAGAAGCAUCGGUGCUGAUACUCUUCAAUAAAAUCGACCUACCCUGUUACAUGUCCACGGAGGAGAUGAAGUCAUUAAUCAGGCUUCCAGACAUCAUUGCUUGUGCCAAGCAGAACAUCACCACGGCAGAAAUCAGCGCCCGUGAAGGCACUGGCUUAGCAGGGGUGCUGGCCUGGCUCCAGGCCACCCACAGAGCCAACGAU [서열번호 198]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 198에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 196의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 262로 제공된다:
ATGTGTCTGCTGCTGGGAGCTACAGGCGTGGGCAAGACACTGCTGGTCAAGCGGCTGCAAGAGGTGTCCAGCAGAGATGGCAAAGGCGATCTGGGAGAGCCTCCTCCAACCAGACCTACCGTGGGCACCAACCTGACAGATATCGTGGCCCAGCGGAAGATCACCATCAGAGAACTCGGCGGCTGCATGGGCCCTATCTGGTCTAGCTACTACGGCAACTGCCGCAGCCTGCTGTTCGTGATGGATGCCAGCGATCCCACACAGCTGAGCGCCTCTTGTGTGCAACTGCTGGGACTGCTGTCTGCCGAACAACTGGCCGAAGCCTCTGTGCTGATCCTGTTCAACAAGATCGACCTGCCTTGCTACATGAGCACCGAGGAAATGAAGTCCCTGATCAGACTGCCCGACATCATTGCCTGCGCCAAGCAGAATATCACCACAGCCGAGATCAGCGCCAGAGAAGGCACAGGACTTGCTGGCGTTCTGGCATGGCTGCAGGCCACACACAGAGCCAACGATTGA [서열번호 262]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 262에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 262의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 263으로 제공된다:
AUGUGUCUGCUGCUGGGAGCUACAGGCGUGGGCAAGACACUGCUGGUCAAGCGGCUGCAAGAGGUGUCCAGCAGAGAUGGCAAAGGCGAUCUGGGAGAGCCUCCUCCAACCAGACCUACCGUGGGCACCAACCUGACAGAUAUCGUGGCCCAGCGGAAGAUCACCAUCAGAGAACUCGGCGGCUGCAUGGGCCCUAUCUGGUCUAGCUACUACGGCAACUGCCGCAGCCUGCUGUUCGUGAUGGAUGCCAGCGAUCCCACACAGCUGAGCGCCUCUUGUGUGCAACUGCUGGGACUGCUGUCUGCCGAACAACUGGCCGAAGCCUCUGUGCUGAUCCUGUUCAACAAGAUCGACCUGCCUUGCUACAUGAGCACCGAGGAAAUGAAGUCCCUGAUCAGACUGCCCGACAUCAUUGCCUGCGCCAAGCAGAAUAUCACCACAGCCGAGAUCAGCGCCAGAGAAGGCACAGGACUUGCUGGCGUUCUGGCAUGGCUGCAGGCCACACACAGAGCCAACGAUUGA [서열번호 263]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 263에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 ARL5B(NCBI 참조 서열: NM_178815.5; UniProtKB - Q96KC2(ARL5B_HUMAN), 또는 그의 동원체일 수 있다. [Kitai Y, Takeuchi O, Kawasaki T, Ori D, Suevoshi T, Murase M, Akira S, Kawai T. Negative Regulation of Melanoma Differentiation-associated Gene 5 (MDA5)-dependent Antiviral Innate Immune Responses by Arf-like Protein 5B. J Bio Chem 2015; 290(2): 1269-1280]. 상기 ARL5B의 한 구현예는 본 명세서에서 다음과 같은 서열번호 199로 나타낸다:
MGLIFAKLWSLFCNQEHKVIIVGLDNAGKTTIYQFLMNEVVHTSPTIGSNVEEIVVKNTHFLMWDIGGQESLRSSWNTYYSNTEFIILVVDSIDRERLAITKEELYRMLAHEDLRKAAVLIFANKQDMKGCMTAAEISKYLTLSSIKDHPWHIQSCCALTGEGLCQGLEWMTSRIGVR [서열번호 199]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 199에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 ARL5B 폴리펩티드는 다음과 같은 서열번호 200의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGGGCTGATCTTCGCCAAACTGTGGAGCCTCTTCTGTAACCAAGAACACAAAGTAATTATAGTGGGACTGGATAATGCAGGGAAAACCACCATTCTTTACCAATTCTTAATGAATGAAGTGGTTCATACTTCTCCAACCATAGGAAGCAATGTTGAAGAAATAGTTGTGAAGAACACTCATTTTCTTATGTGGGATATTGGTGGTCAGGAGTCTCTGCGATCATCCTGGAACACATATTACTCAAATACAGAGTTCATCATTCTTGTTGTTGATAGCATTGACAGGGAACGACTAGCTATTACAAAAGAAGAATTATACAGAATGTTGGCTCATGAGGATTTACGGAAGGCTGCAGTCCTTATCTTTGCAAATAAACAGGATATGAAAGGGTGTATGACAGCAGCTGAAATCTCGAAATACCTCACCCTTAGTTCAATTAAGGATCATCCATGGCACATTCAATCCTGCTGTGCTCTCACAGGAGAAGGGTTATGCCAAGGTCTAGAGTGGATGACCTCCCGGATTGGTGTGAGA [서열번호 200]
따라서, 바람직하게는 상기 ARL5B 폴리펩티드는 실질적으로 서열번호 200에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 201의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGGGCUGAUCUUCGCCAAACUGUGGAGCCUCUUCUGUAACCAAGAACACAAAGUAAUUAUAGUGGGACUGGAUAAUGCAGGGAAAACCACCAUUCUUUACCAAUUCUUAAUGAAUGAAGUGGUUCAUACUUCUCCAACCAUAGGAAGCAAUGUUGAAGAAAUAGUUGUGAAGAACACUCAUUUUCUUAUGUGGGAUAUUGGUGGUCAGGAGUCUCUGCGAUCAUCCUGGAACACAUAUUACUCAAAUACAGAGUUCAUCAUUCUUGUUGUUGAUAGCAUUGACAGGGAACGACUAGCUAUUACAAAAGAAGAAUUAUACAGAAUGUUGGCUCAUGAGGAUUUACGGAAGGCUGCAGUCCUUAUCUUUGCAAAUAAACAGGAUAUGAAAGGGUGUAUGACAGCAGCUGAAAUCUCGAAAUACCUCACCCUUAGUUCAAUUAAGGAUCAUCCAUGGCACAUUCAAUCCUGCUGUGCUCUCACAGGAGAAGGGUUAUGCCAAGGUCUAGAGUGGAUGACCUCCCGGAUUGGUGUGAGA [서열번호 201]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 201에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 199의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 202로 제공된다:
ATGGGCCTGATCTTCGCCAAACTGTGGTCCCTGTTCTGCAATCAAGAGCACAAAGTGATCATCGTCGGCCTGGACAACGCCGGCAAGACAACAATCCTGTACCAGTTCCTGATGAACGAGGTGGTGCACACAAGCCCCACCATCGGCAGCAACGTGGAAGAGATCGTGGTCAAGAATACCCACTTCCTGATGTGGGACATCGGCGGCCAAGAGAGCCTGAGAAGCAGCTGGAACACCTACTACAGCAACACCGAGTTCATCATCCTGGTGGTGGACAGCATCGACAGAGAGAGACTGGCCATCACCAAAGAGGAACTGTACCGGATGCTGGCCCACGAGGATCTGAGAAAAGCCGCCGTGCTGATTTTTGCCAACAAGCAGGACATGAAGGGCTGCATGACAGCCGCCGAGATCAGCAAGTACCTGACACTGAGCAGCATCAAGGATCACCCCTGGCACATCCAGAGCTGCTGTGCATTGACAGGCGAGGGCCTGTGTCAGGGACTCGAGTGGATGACAAGCAGAATCGGCGTGCGGTGA [서열번호 202]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 202에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 202의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 203으로 제공된다:
AUGGGCCUGAUCUUCGCCAAACUGUGGUCCCUGUUCUGCAAUCAAGAGCACAAAGUGAUCAUCGUCGGCCUGGACAACGCCGGCAAGACAACAAUCCUGUACCAGUUCCUGAUGAACGAGGUGGUGCACACAAGCCCCACCAUCGGCAGCAACGUGGAAGAGAUCGUGGUCAAGAAUACCCACUUCCUGAUGUGGGACAUCGGCGGCCAAGAGAGCCUGAGAAGCAGCUGGAACACCUACUACAGCAACACCGAGUUCAUCAUCCUGGUGGUGGACAGCAUCGACAGAGAGAGACUGGCCAUCACCAAAGAGGAACUGUACCGGAUGCUGGCCCACGAGGAUCUGAGAAAAGCCGCCGUGCUGAUUUUUGCCAACAAGCAGGACAUGAAGGGCUGCAUGACAGCCGCCGAGAUCAGCAAGUACCUGACACUGAGCAGCAUCAAGGAUCACCCCUGGCACAUCCAGAGCUGCUGUGCAUUGACAGGCGAGGGCCUGUGUCAGGGACUCGAGUGGAUGACAAGCAGAAUCGGCGUGCGGUGA [서열번호 203]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 203에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
또 다른 구현예에서, 상기 IMP는 MAVS의 우성 음성 작용 형태(ΔCARD 도메인)(NCBI 참조 서열: NM_020746.4; UniProtKB - Q7Z434(MAVS_HUMAN)) 또는 그의 동원체일 수 있다. MAVS는 바이러스 복제 동안에 생산된 세포내 dsRNA를 검출하는 DHX33, DDX58/RIG-I 및 IFIH1/MDA5의 하류에 작용하여서, NF-kappa-B, IRF3 및 IRF7의 활성화를 유도하고, 후속해서 IFN을 유도하는 경로를 조직화한다. [Seth RB, Sun L, Zhijian C-K, Chen K. Identification and Characterization of MAVS, a mitochondrial antiviral Signaling Protein that Activates NF-κB and IRF3. Cell, 122, 5, 9, 669-682]. MAVS의 우성 음성 작용 형태의 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 247로 나타낸다:
GCELVDLADEVASVYQSYQPRTSDRPPDPLEPPSLPAERPGPPTPAAAHSIPYNSCREKEPSYPMPVQETQAPESPGENSEQALQTLSPRAIPRNPDGGPLESSSDLAALSPLTSSGHQEQDTELGSTHTAGATSSLTPSRGPVSPSVSFQPLARSTPRASRLPGPTGSVVSTGTSFSSSSPGLASAGAAEGKQGAESDQAEPIICSSGAEAPANSLPSKVPTTLMPVNTVALKVPANPASVSTVPSKLPTSSKPPGAVPSNALTNPAPSKLPINSTRAGMVPSKVPTSMVLTKVSASTVPTDGSSRNEETPAAPTPAGATGGSSAWLDSSSENRGLGSELSKPGVLASQVDSPFSGCFEDLAISASTSLGMGPCHGPEENEYKSEGTFGIHVAENPSIQLLEGNPGPPADPDGGPRPQADRKFQEREVPCHRPSPGALWLQVAVTGVLVVTLLVVLYRRRLH [서열번호 247]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 247에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, MAVS 폴리펩티드의 우성 음성 작용 형태는 다음과 같은 서열번호 248의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GGCTGTGAGCTAGTTGATCTCGCGGACGAAGTGGCCTCTGTCTACCAGAGCTACCAGCCTCGGACCTCGGACCGTCCCCCAGACCCACTGGAGCCACCGTCACTTCCTGCTGAGAGGCCAGGGCCCCCCACACCTGCTGCGGCCCACAGCATCCCCTACAACAGCTGCAGAGAGAAGGAGCCAAGTTACCCCATGCCTGTCCAGGAGACCCAGGCGCCAGAGTCCCCAGGAGAGAATTCAGAGCAAGCCCTGCAGACGCTCAGCCCCAGAGCCATCCCAAGGAATCCAGATGGTGGCCCCCTGGAGTCCTCCTCTGACCTGGCAGCCCTCAGCCCTCTGACCTCCAGCGGGCATCAGGAGCAGGACACAGAACTGGGCAGTACCCACACAGCAGGTGCGACCTCCAGCCTCACACCATCCCGTGGGCCTGTGTCTCCATCTGTCTCCTTCCAGCCCCTGGCCCGTTCCACCCCCAGGGCAAGCCGCTTGCCTGGACCCACAGGGTCAGTTGTATCTACTGGCACCTCCTTCTCCTCCTCATCCCCTGGCTTGGCCTCTGCAGGGGCTGCAGAGGGTAAACAGGGTGCAGAGAGTGACCAGGCCGAGCCTATCATCTGCTCCAGTGGGGCAGAGGCACCTGCCAACTCTCTGCCCTCCAAAGTGCCTACCACCTTGATGCCTGTGAACACAGTGGCCCTGAAAGTGCCTGCCAACCCAGCATCTGTCAGCACAGTGCCCTCCAAGTTGCCAACTAGCTCAAAGCCCCCTGGTGCAGTGCCTTCTAATGCGCTCACCAATCCAGCACCATCCAAATTGCCCATCAACTCAACCCGTGCTGGCATGGTGCCATCCAAAGTGCCTACTAGCATGGTGCTCACCAAGGTGTCTGCCAGCACAGTCCCCACTGACGGGAGCAGCAGAAATGAGGAGACCCCAGCAGCTCCAACACCCGCCGGCGCCACTGGAGGCAGCTCAGCCTGGCTAGACAGCAGCTCTGAGAATAGGGGCCTTGGGTCGGAGCTGAGTAAGCCTGGCGTGCTGGCATCCCAGGTAGACAGCCCGTTCTCGGGCTGCTTCGAGGATCTTGCCATCAGTGCCAGCACCTCCTTGGGCATGGGGCCCTGCCATGGCCCAGAGGAGAATGAGTATAAGTCCGAGGGCACCTTTGGGATCCACGTGGCTGAGAACCCCAGCATCCAGCTCCTGGAGGGCAACCCTGGGCCACCTGCGGACCCGGATGGCGGCCCCAGGCCACAAGCCGACCGGAAGTTCCAGGAGAGGGAGGTGCCATGCCACAGGCCCTCACCTGGGGCTCTGTGGCTCCAGGTGGCTGTGACAGGGGTGCTGGTAGTCACACTCCTGGTGGTGCTGTACCGGCGGCGTCTGCAC [서열번호 248]
따라서, 바람직하게는 MAVS 폴리펩티드의 우성 음성 작용 형태는 실질적으로 서열번호 248에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 249의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GGCUGUGAGCUAGUUGAUCUCGCGGACGAAGUGGCCUCUGUCUACCAGAGCUACCAGCCUCGGACCUCGGACCGUCCCCCAGACCCACUGGAGCCACCGUCACUUCCUGCUGAGAGGCCAGGGCCCCCCACACCUGCUGCGGCCCACAGCAUCCCCUACAACAGCUGCAGAGAGAAGGAGCCAAGUUACCCCAUGCCUGUCCAGGAGACCCAGGCGCCAGAGUCCCCAGGAGAGAAUUCAGAGCAAGCCCUGCAGACGCUCAGCCCCAGAGCCAUCCCAAGGAAUCCAGAUGGUGGCCCCCUGGAGUCCUCCUCUGACCUGGCAGCCCUCAGCCCUCUGACCUCCAGCGGGCAUCAGGAGCAGGACACAGAACUGGGCAGUACCCACACAGCAGGUGCGACCUCCAGCCUCACACCAUCCCGUGGGCCUGUGUCUCCAUCUGUCUCCUUCCAGCCCCUGGCCCGUUCCACCCCCAGGGCAAGCCGCUUGCCUGGACCCACAGGGUCAGUUGUAUCUACUGGCACCUCCUUCUCCUCCUCAUCCCCUGGCUUGGCCUCUGCAGGGGCUGCAGAGGGUAAACAGGGUGCAGAGAGUGACCAGGCCGAGCCUAUCAUCUGCUCCAGUGGGGCAGAGGCACCUGCCAACUCUCUGCCCUCCAAAGUGCCUACCACCUUGAUGCCUGUGAACACAGUGGCCCUGAAAGUGCCUGCCAACCCAGCAUCUGUCAGCACAGUGCCCUCCAAGUUGCCAACUAGCUCAAAGCCCCCUGGUGCAGUGCCUUCUAAUGCGCUCACCAAUCCAGCACCAUCCAAAUUGCCCAUCAACUCAACCCGUGCUGGCAUGGUGCCAUCCAAAGUGCCUACUAGCAUGGUGCUCACCAAGGUGUCUGCCAGCACAGUCCCCACUGACGGGAGCAGCAGAAAUGAGGAGACCCCAGCAGCUCCAACACCCGCCGGCGCCACUGGAGGCAGCUCAGCCUGGCUAGACAGCAGCUCUGAGAAUAGGGGCCUUGGGUCGGAGCUGAGUAAGCCUGGCGUGCUGGCAUCCCAGGUAGACAGCCCGUUCUCGGGCUGCUUCGAGGAUCUUGCCAUCAGUGCCAGCACCUCCUUGGGCAUGGGGCCCUGCCAUGGCCCAGAGGAGAAUGAGUAUAAGUCCGAGGGCACCUUUGGGAUCCACGUGGCUGAGAACCCCAGCAUCCAGCUCCUGGAGGGCAACCCUGGGCCACCUGCGGACCCGGAUGGCGGCCCCAGGCCACAAGCCGACCGGAAGUUCCAGGAGAGGGAGGUGCCAUGCCACAGGCCCUCACCUGGGGCUCUGUGGCUCCAGGUGGCUGUGACAGGGGUGCUGGUAGUCACACUCCUGGUGGUGCUGUACCGGCGGCGUCUGCAC [서열번호 249]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 249에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 247의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 250으로 제공된다:
ATGGGCTGTGAACTGGTGGATCTGGCCGATGAAGTGGCCAGCGTGTACCAGAGCTACCAGCCTAGAACCAGCGACCGGCCTCCTGATCCTCTGGAACCTCCATCTCTGCCCGCCGAAAGACCTGGACCTCCTACACCAGCTGCCGCTCACAGCATCCCTTACAACAGCTGCAGAGAGAAAGAACCTAGCTACCCCATGCCTGTGCAAGAGACACAGGCCCCAGAAAGCCCTGGCGAGAATTCTGAACAGGCCCTGCAGACACTGAGCCCCAGAGCCATTCCTAGAAACCCTGATGGCGGCCCTCTGGAAAGCAGCAGTGATCTGGCTGCTCTGAGCCCTCTGACAAGCTCTGGACACCAAGAGCAGGATACCGAGCTGGGCAGCACACATACAGCCGGCGCTACAAGCAGCCTGACACCTTCTAGAGGCCCCGTGTCTCCCAGCGTGTCATTTCAGCCTCTGGCCAGGTCTACCCCTAGAGCCTCTAGACTGCCTGGACCTACAGGCAGCGTGGTGTCTACCGGCACAAGCTTCAGCTCTAGCTCTCCTGGACTGGCCTCTGCTGGTGCCGCTGAGGGAAAACAAGGCGCCGAATCTGATCAGGCCGAGCCTATCATCTGTAGCAGCGGAGCAGAAGCCCCTGCCAATAGCCTGCCTAGCAAGGTGCCAACCACACTGATGCCCGTGAACACAGTGGCCCTGAAGGTGCCAGCTAATCCTGCCTCCGTGTCCACCGTGCCTTCTAAGCTGCCAACCAGCTCTAAGCCACCTGGCGCCGTGCCATCTAACGCCCTGACAAATCCTGCTCCAAGCAAGCTGCCCATCAACAGCACAAGAGCCGGCATGGTGCCCTCTAAGGTGCCCACATCTATGGTGCTGACCAAGGTGTCCGCCAGCACCGTGCCAACAGATGGCAGCAGCAGAAACGAGGAAACCCCTGCCGCTCCTACTCCTGCTGGCGCTACAGGCGGATCTTCTGCCTGGCTGGATAGCAGCTCCGAGAATAGAGGCCTGGGCAGCGAGCTGTCTAAACCTGGCGTTCTGGCAAGCCAGGTGGACAGCCCTTTCAGCGGCTGCTTTGAGGACCTGGCTATCAGCGCCTCTACAAGCCTCGGCATGGGACCTTGTCACGGCCCCGAGGAAAACGAGTACAAGAGCGAGGGCACCTTCGGCATCCACGTGGCCGAGAATCCTAGCATCCAACTGCTGGAAGGCAACCCCGGACCTCCTGCTGATCCAGATGGTGGACCTAGACCTCAGGCCGACCGGAAGTTCCAAGAAAGAGAGGTGCCCTGCCACCGGCCATCTCCAGGTGCACTTTGGCTGCAAGTGGCTGTGACAGGCGTGCTGGTGGTTACACTGCTGGTCGTGCTGTACAGAAGGCGGCTGCATTGA [서열번호 250]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 250에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 250의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 251로 제공된다:
AUGGGCUGUGAACUGGUGGAUCUGGCCGAUGAAGUGGCCAGCGUGUACCAGAGCUACCAGCCUAGAACCAGCGACCGGCCUCCUGAUCCUCUGGAACCUCCAUCUCUGCCCGCCGAAAGACCUGGACCUCCUACACCAGCUGCCGCUCACAGCAUCCCUUACAACAGCUGCAGAGAGAAAGAACCUAGCUACCCCAUGCCUGUGCAAGAGACACAGGCCCCAGAAAGCCCUGGCGAGAAUUCUGAACAGGCCCUGCAGACACUGAGCCCCAGAGCCAUUCCUAGAAACCCUGAUGGCGGCCCUCUGGAAAGCAGCAGUGAUCUGGCUGCUCUGAGCCCUCUGACAAGCUCUGGACACCAAGAGCAGGAUACCGAGCUGGGCAGCACACAUACAGCCGGCGCUACAAGCAGCCUGACACCUUCUAGAGGCCCCGUGUCUCCCAGCGUGUCAUUUCAGCCUCUGGCCAGGUCUACCCCUAGAGCCUCUAGACUGCCUGGACCUACAGGCAGCGUGGUGUCUACCGGCACAAGCUUCAGCUCUAGCUCUCCUGGACUGGCCUCUGCUGGUGCCGCUGAGGGAAAACAAGGCGCCGAAUCUGAUCAGGCCGAGCCUAUCAUCUGUAGCAGCGGAGCAGAAGCCCCUGCCAAUAGCCUGCCUAGCAAGGUGCCAACCACACUGAUGCCCGUGAACACAGUGGCCCUGAAGGUGCCAGCUAAUCCUGCCUCCGUGUCCACCGUGCCUUCUAAGCUGCCAACCAGCUCUAAGCCACCUGGCGCCGUGCCAUCUAACGCCCUGACAAAUCCUGCUCCAAGCAAGCUGCCCAUCAACAGCACAAGAGCCGGCAUGGUGCCCUCUAAGGUGCCCACAUCUAUGGUGCUGACCAAGGUGUCCGCCAGCACCGUGCCAACAGAUGGCAGCAGCAGAAACGAGGAAACCCCUGCCGCUCCUACUCCUGCUGGCGCUACAGGCGGAUCUUCUGCCUGGCUGGAUAGCAGCUCCGAGAAUAGAGGCCUGGGCAGCGAGCUGUCUAAACCUGGCGUUCUGGCAAGCCAGGUGGACAGCCCUUUCAGCGGCUGCUUUGAGGACCUGGCUAUCAGCGCCUCUACAAGCCUCGGCAUGGGACCUUGUCACGGCCCCGAGGAAAACGAGUACAAGAGCGAGGGCACCUUCGGCAUCCACGUGGCCGAGAAUCCUAGCAUCCAACUGCUGGAAGGCAACCCCGGACCUCCUGCUGAUCCAGAUGGUGGACCUAGACCUCAGGCCGACCGGAAGUUCCAAGAAAGAGAGGUGCCCUGCCACCGGCCAUCUCCAGGUGCACUUUGGCUGCAAGUGGCUGUGACAGGCGUGCUGGUGGUUACACUGCUGGUCGUGCUGUACAGAAGGCGGCUGCAUUGA [서열번호 251]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 251에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, 상기 IMP는 TRIM35 또는 그의 동원체이다(NCBI 참조 서열: NM_171982.4; UniProtKB - Q9UPQ4(TRI35_HUMAN)).
TRIM35는 IRF7과 상호작용하여서, K48-연관 유비퀴틴-프로테아좀 경로를 통해 그 분해를 유도하는 것으로 나타났다. [Wang Y, Yan S, Yang B, Wang Y, Zhou H, Lian Q, Sun B (2015). TRIM35 negatively regulates TLR7- and TLR9-mediated type 1 interferon production by targeting IRF7. FEBS Lett, 589, 12, 1322-1330]. TRIM35의 단백질 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 252로 나타낸다:
MERSPDVSPGPSRSFKEELLCAVCYDPFRDAVTLRCGHNFCRGCVSRCWEVQVSPTCPVCKDRASPADLRTNHTLNNLVEKLLREEAEGARWTSYRFSRVCRLHRGQLSLFCLEDKELLCCSCQADPRHQGHRVQPVKDTAHDFRAKCRNMEHALREKAKAFWAMRRSYEAIAKHNQVEAAWLEGRIRQEFDKLREFLRVEEQAILDAMAEETRQKQLLADEKMKQLTEETEVLAHEIERLQMEMKEDDVSFLMKHKSRKRRLFCTMEPEPVQPGMLIDVCKYLGSLQYRVWKKMLASVESVPFSFDPNTAAGWLSVSDDLTSVTNHGYRVQVENPERFSSAPCLLGSRVFSQGSHAWEVALGGLQSWRVGVVRVRQDSGAEGHSHSCYHDTRSGFWYVCRTQGVEGDHCVTSDPATSPLVLAIPRRLRVELECEEGELSFYDAERHCHLYTFHARFGEVRPYFYLGGARGAGPPEPLRICPLHISVKEELDG [서열번호 252]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 252에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 TRIM35 폴리펩티드는 다음과 같은 서열번호 253의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGAGCGGAGTCCCGACGTGTCCCCCGGGCCTTCCCGCTCCTTCAAGGAGGAGTTGCTCTGCGCCGTCTGCTACGACCCCTTCCGCGACGCAGTCACTCTGCGCTGCGGCCACAACTTCTGCCGCGGGTGCGTGAGCCGCTGCTGGGAGGTGCAGGTGTCGCCCACCTGCCCAGTGTGCAAAGACCGCGCGTCACCCGCCGACCTGCGCACCAACCACACCCTCAACAACCTGGTGGAGAAGCTGCTGCGCGAGGAGGCCGAGGGCGCGCGCTGGACCAGCTACCGCTTCTCGCGTGTCTGCCGCCTGCACCGCGGACAGCTCAGCCTCTTCTGCCTCGAGGACAAGGAGCTGCTGTGCTGCTCCTGCCAGGCCGACCCCCGACACCAGGGGCACCGCGTGCAGCCGGTGAAGGACACTGCCCACGACTTTCGGGCCAAGTGCAGGAACATGGAGCATGCACTGCGGGAGAAGGCCAAGGCCTTCTGGGCCATGCGGCGCTCCTATGAGGCCATCGCCAAGCACAATCAGGTGGAGGCTGCATGGCTGGAAGGCCGGATCCGGCAGGAGTTTGATAAGCTTCGCGAGTTCTTGAGAGTGGAGGAGCAGGCCATTCTGGATGCCATGGCCGAGGAGACAAGGCAGAAGCAACTTCTGGCCGACGAGAAGATGAAGCAGCTCACAGAGGAGACGGAGGTGCTGGCACATGAGATCGAGCGGCTGCAGATGGAGATGAAGGAGGACGACGTTTCTTTTCTCATGAAACACAAGAGCCGAAAACGCCGACTCTTCTGCACCATGGAGCCAGAGCCAGTCCAGCCCGGCATGCTTATCGATGTCTGCAAGTACCTGGGCTCCCTGCAGTACCGCGTCTGGAAGAAGATGCTTGCATCTGTGGAATCTGTACCCTTCAGCTTTGACCCCAACACCGCAGCTGGCTGGCTCTCCGTGTCTGACGACCTCACCAGCGTCACCAACCATGGCTACCGCGTGCAGGTGGAGAACCCGGAACGCTTCTCCTCGGCGCCCTGCCTGCTGGGCTCCCGTGTCTTCTCACAGGGCTCGCACGCCTGGGAGGTGGCCCTTGGGGGGCTGCAGAGCTGGAGGGTGGGCGTGGTACGTGTGCGCCAGGACTCGGGCGCTGAGGGCCACTCACACAGCTGCTACCACGACACACGCTCGGGCTTCTGGTATGTCTGCCGCACGCAGGGCGTGGAGGGGGACCACTGCGTGACCTCGGACCCAGCCACGTCGCCCCTGGTCCTGGCCATCCCACGCCGCCTGCGTGTGGAGCTGGAGTGTGAGGAGGGCGAGCTGTCTTTCTATGACGCGGAGCGCCACTGCCACCTGTACACCTTCCACGCCCGCTTTGGGGAGGTTCGCCCCTACTTCTACCTGGGGGGTGCACGGGGCGCCGGGCCTCCAGAGCCTTTGCGCATCTGCCCCTTGCACATCAGTGTCAAGGAAGAACTGGATGGC [서열번호 253]
따라서, 바람직하게는 상기 TRIM35 폴리펩티드는 실질적으로 서열번호 253에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 254의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGAGCGGAGUCCCGACGUGUCCCCCGGGCCUUCCCGCUCCUUCAAGGAGGAGUUGCUCUGCGCCGUCUGCUACGACCCCUUCCGCGACGCAGUCACUCUGCGCUGCGGCCACAACUUCUGCCGCGGGUGCGUGAGCCGCUGCUGGGAGGUGCAGGUGUCGCCCACCUGCCCAGUGUGCAAAGACCGCGCGUCACCCGCCGACCUGCGCACCAACCACACCCUCAACAACCUGGUGGAGAAGCUGCUGCGCGAGGAGGCCGAGGGCGCGCGCUGGACCAGCUACCGCUUCUCGCGUGUCUGCCGCCUGCACCGCGGACAGCUCAGCCUCUUCUGCCUCGAGGACAAGGAGCUGCUGUGCUGCUCCUGCCAGGCCGACCCCCGACACCAGGGGCACCGCGUGCAGCCGGUGAAGGACACUGCCCACGACUUUCGGGCCAAGUGCAGGAACAUGGAGCAUGCACUGCGGGAGAAGGCCAAGGCCUUCUGGGCCAUGCGGCGCUCCUAUGAGGCCAUCGCCAAGCACAAUCAGGUGGAGGCUGCAUGGCUGGAAGGCCGGAUCCGGCAGGAGUUUGAUAAGCUUCGCGAGUUCUUGAGAGUGGAGGAGCAGGCCAUUCUGGAUGCCAUGGCCGAGGAGACAAGGCAGAAGCAACUUCUGGCCGACGAGAAGAUGAAGCAGCUCACAGAGGAGACGGAGGUGCUGGCACAUGAGAUCGAGCGGCUGCAGAUGGAGAUGAAGGAGGACGACGUUUCUUUUCUCAUGAAACACAAGAGCCGAAAACGCCGACUCUUCUGCACCAUGGAGCCAGAGCCAGUCCAGCCCGGCAUGCUUAUCGAUGUCUGCAAGUACCUGGGCUCCCUGCAGUACCGCGUCUGGAAGAAGAUGCUUGCAUCUGUGGAAUCUGUACCCUUCAGCUUUGACCCCAACACCGCAGCUGGCUGGCUCUCCGUGUCUGACGACCUCACCAGCGUCACCAACCAUGGCUACCGCGUGCAGGUGGAGAACCCGGAACGCUUCUCCUCGGCGCCCUGCCUGCUGGGCUCCCGUGUCUUCUCACAGGGCUCGCACGCCUGGGAGGUGGCCCUUGGGGGGCUGCAGAGCUGGAGGGUGGGCGUGGUACGUGUGCGCCAGGACUCGGGCGCUGAGGGCCACUCACACAGCUGCUACCACGACACACGCUCGGGCUUCUGGUAUGUCUGCCGCACGCAGGGCGUGGAGGGGGACCACUGCGUGACCUCGGACCCAGCCACGUCGCCCCUGGUCCUGGCCAUCCCACGCCGCCUGCGUGUGGAGCUGGAGUGUGAGGAGGGCGAGCUGUCUUUCUAUGACGCGGAGCGCCACUGCCACCUGUACACCUUCCACGCCCGCUUUGGGGAGGUUCGCCCCUACUUCUACCUGGGGGGUGCACGGGGCGCCGGGCCUCCAGAGCCUUUGCGCAUCUGCCCCUUGCACAUCAGUGUCAAGGAAGAACUGGAUGGC [서열번호 254]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 254에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 254의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 255로 제공된다:
ATGGAAAGATCCCCTGACGTGTCCCCTGGACCTAGCAGAAGCTTCAAAGAGGAACTGCTCTGCGCCGTGTGCTACGACCCCTTCAGAGATGCCGTGACACTGAGATGCGGCCACAACTTCTGCAGAGGCTGCGTGTCCAGATGCTGGGAAGTGCAGGTTTCCCCTACATGCCCCGTGTGCAAGGACAGAGCCTCTCCTGCCGATCTGCGGACCAATCACACCCTGAACAACCTGGTGGAAAAGCTGCTGAGAGAAGAGGCCGAAGGCGCCAGATGGACCAGCTACAGATTCAGCAGAGTGTGCCGGCTGCACAGAGGCCAGCTGAGCCTGTTCTGTCTCGAGGACAAAGAACTGCTGTGCTGCAGCTGCCAGGCCGATCCTAGACACCAGGGACATAGAGTGCAGCCCGTGAAGGACACAGCCCACGACTTCAGAGCCAAGTGCCGGAACATGGAACACGCCCTGAGAGAGAAGGCCAAAGCCTTCTGGGCCATGCGGAGAAGCTATGAGGCCATTGCCAAGCACAATCAGGTGGAAGCCGCCTGGCTGGAAGGCCGGATCAGACAAGAGTTCGACAAGCTGCGCGAGTTCCTGAGAGTGGAAGAACAGGCCATCCTGGACGCCATGGCCGAGGAAACAAGACAGAAACAGCTGCTGGCCGACGAGAAGATGAAGCAGCTGACCGAAGAGACAGAGGTGCTGGCCCACGAAATCGAGCGGCTGCAGATGGAAATGAAGGAAGATGATGTGTCCTTTCTGATGAAGCACAAGAGCCGGAAGCGGCGGCTGTTCTGCACAATGGAACCTGAGCCAGTGCAGCCTGGCATGCTGATCGATGTGTGCAAGTACCTGGGCAGCCTGCAGTACAGAGTGTGGAAGAAAATGCTGGCCTCCGTGGAAAGCGTGCCCTTCAGCTTCGACCCTAATACTGCCGCTGGCTGGCTGAGCGTGTCCGATGATCTGACCAGCGTGACCAACCACGGCTACAGAGTGCAGGTCGAGAACCCCGAGAGATTCAGCTCTGCCCCTTGTCTGCTGGGCTCCAGAGTGTTTTCTCAGGGCTCTCACGCCTGGGAAGTTGCCCTTGGAGGACTCCAGTCTTGGAGAGTGGGCGTTGTCAGAGTGCGGCAGGATTCTGGCGCCGAAGGACACTCTCACAGCTGCTACCACGATACCCGCAGCGGCTTTTGGTACGTGTGTAGAACACAGGGCGTCGAGGGCGACCACTGTGTGACATCTGACCCTGCCACATCTCCTCTGGTGCTGGCTATCCCTCGGAGACTGAGAGTCGAGCTGGAATGCGAGGAAGGCGAGCTGAGCTTCTACGACGCCGAGAGACACTGCCACCTGTACACCTTCCACGCCAGATTTGGCGAAGTGCGGCCCTACTTTTATCTCGGCGGAGCTAGAGGTGCCGGACCTCCTGAACCTCTGAGAATCTGCCCTCTGCACATCAGCGTGAAAGAGGAATTGGACGGCTGA [서열번호 255]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 255에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 255의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 256으로 제공된다:
AUGGAAAGAUCCCCUGACGUGUCCCCUGGACCUAGCAGAAGCUUCAAAGAGGAACUGCUCUGCGCCGUGUGCUACGACCCCUUCAGAGAUGCCGUGACACUGAGAUGCGGCCACAACUUCUGCAGAGGCUGCGUGUCCAGAUGCUGGGAAGUGCAGGUUUCCCCUACAUGCCCCGUGUGCAAGGACAGAGCCUCUCCUGCCGAUCUGCGGACCAAUCACACCCUGAACAACCUGGUGGAAAAGCUGCUGAGAGAAGAGGCCGAAGGCGCCAGAUGGACCAGCUACAGAUUCAGCAGAGUGUGCCGGCUGCACAGAGGCCAGCUGAGCCUGUUCUGUCUCGAGGACAAAGAACUGCUGUGCUGCAGCUGCCAGGCCGAUCCUAGACACCAGGGACAUAGAGUGCAGCCCGUGAAGGACACAGCCCACGACUUCAGAGCCAAGUGCCGGAACAUGGAACACGCCCUGAGAGAGAAGGCCAAAGCCUUCUGGGCCAUGCGGAGAAGCUAUGAGGCCAUUGCCAAGCACAAUCAGGUGGAAGCCGCCUGGCUGGAAGGCCGGAUCAGACAAGAGUUCGACAAGCUGCGCGAGUUCCUGAGAGUGGAAGAACAGGCCAUCCUGGACGCCAUGGCCGAGGAAACAAGACAGAAACAGCUGCUGGCCGACGAGAAGAUGAAGCAGCUGACCGAAGAGACAGAGGUGCUGGCCCACGAAAUCGAGCGGCUGCAGAUGGAAAUGAAGGAAGAUGAUGUGUCCUUUCUGAUGAAGCACAAGAGCCGGAAGCGGCGGCUGUUCUGCACAAUGGAACCUGAGCCAGUGCAGCCUGGCAUGCUGAUCGAUGUGUGCAAGUACCUGGGCAGCCUGCAGUACAGAGUGUGGAAGAAAAUGCUGGCCUCCGUGGAAAGCGUGCCCUUCAGCUUCGACCCUAAUACUGCCGCUGGCUGGCUGAGCGUGUCCGAUGAUCUGACCAGCGUGACCAACCACGGCUACAGAGUGCAGGUCGAGAACCCCGAGAGAUUCAGCUCUGCCCCUUGUCUGCUGGGCUCCAGAGUGUUUUCUCAGGGCUCUCACGCCUGGGAAGUUGCCCUUGGAGGACUCCAGUCUUGGAGAGUGGGCGUUGUCAGAGUGCGGCAGGAUUCUGGCGCCGAAGGACACUCUCACAGCUGCUACCACGAUACCCGCAGCGGCUUUUGGUACGUGUGUAGAACACAGGGCGUCGAGGGCGACCACUGUGUGACAUCUGACCCUGCCACAUCUCCUCUGGUGCUGGCUAUCCCUCGGAGACUGAGAGUCGAGCUGGAAUGCGAGGAAGGCGAGCUGAGCUUCUACGACGCCGAGAGACACUGCCACCUGUACACCUUCCACGCCAGAUUUGGCGAAGUGCGGCCCUACUUUUAUCUCGGCGGAGCUAGAGGUGCCGGACCUCCUGAACCUCUGAGAAUCUGCCCUCUGCACAUCAGCGUGAAAGAGGAAUUGGACGGCUGA [서열번호 256]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 256에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
카테고리 3: 인터페론 신호전달의 억제제
다른 구현예에서, 상기 IMP는 인터페론 신호전달을 억제하도록 구성될 수 있다.
따라서, RNA에 대한 선천적 면역 반응의 감소, 제거 또는 차단은 RNA의 활성에 영향을 미치는 인터페론 자극 유전자(예컨대, IFIT1)의 생산을 유도하는 인터페론의 신호전달을 억제함으로써 상기 IMP에 의해 달성된다. 따라서, 바람직하게는 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질은 인터페론 신호전달 경로의 단백질/억제제 또는 돌연변이된 또는 비-기능적 단백질, 또는 그의 우성 음성 작용 형태를 포함한다.
한 구현예에서, 선천적 신호전달 경로의 억제제, 또는 그의 우성 음성 작용 형태는 STAT1 우성 음성 형태이다. STAT1(NCBI 참조 서열: NM_007315.4; UniProtKB - P42224(STAT1_HUMAN)), 또는 그의 동원체는 우성 음성 방식으로 작용해 ISGF-3 복합체 형성을 차단할 수 있는 Y701F 돌연변이에 의해 우성 음성으로 될 수 있으며, 본 명세서에서 다음과 같은 서열번호 66으로 나타낸다:
MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEHAANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMIIYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSLEDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLNVTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLEELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQFTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNMEESTNGSLAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVISNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSVTKRGLNVDQLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHLLPLWNDGCIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYTKKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELDGPKGTGFIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV [서열번호 66]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 66에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 STAT1 우성 음성 형태 폴리펩티드는 다음과 같은 서열번호 67의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGTCTCAGTGGTACGAACTTCAGCAGCTTGACTCAAAATTCCTGGAGCAGGTTCACCAGCTTTATGATGACAGTTTTCCCATGGAAATCAGACAGTACCTGGCACAGTGGTTAGAAAAGCAAGACTGGGAGCACGCTGCCAATGATGTTTCATTTGCCACCATCCGTTTTCATGACCTCCTGTCACAGCTGGATGATCAATATAGTCGCTTTTCTTTGGAGAATAACTTCTTGCTACAGCATAACATAAGGAAAAGCAAGCGTAATCTTCAGGATAATTTTCAGGAAGACCCAATCCAGATGTCTATGATCATTTACAGCTGTCTGAAGGAAGAAAGGAAAATTCTGGAAAACGCCCAGAGATTTAATCAGGCTCAGTCGGGGAATATTCAGAGCACAGTGATGTTAGACAAACAGAAAGAGCTTGACAGTAAAGTCAGAAATGTGAAGGACAAGGTTATGTGTATAGAGCATGAAATCAAGAGCCTGGAAGATTTACAAGATGAATATGACTTCAAATGCAAAACCTTGCAGAACAGAGAACACGAGACCAATGGTGTGGCAAAGAGTGATCAGAAACAAGAACAGCTGTTACTCAAGAAGATGTATTTAATGCTTGACAATAAGAGAAAGGAAGTAGTTCACAAAATAATAGAGTTGCTGAATGTCACTGAACTTACCCAGAATGCCCTGATTAATGATGAACTAGTGGAGTGGAAGCGGAGACAGCAGAGCGCCTGTATTGGGGGGCCGCCCAATGCTTGCTTGGATCAGCTGCAGAACTGGTTCACTATAGTTGCGGAGAGTCTGCAGCAAGTTCGGCAGCAGCTTAAAAAGTTGGAGGAATTGGAACAGAAATACACCTACGAACATGACCCTATCACAAAAAACAAACAAGTGTTATGGGACCGCACCTTCAGTCTTTTCCAGCAGCTCATTCAGAGCTCGTTTGTGGTGGAAAGACAGCCCTGCATGCCAACGCACCCTCAGAGGCCGCTGGTCTTGAAGACAGGGGTCCAGTTCACTGTGAAGTTGAGACTGTTGGTGAAATTGCAAGAGCTGAATTATAATTTGAAAGTCAAAGTCTTATTTGATAAAGATGTGAATGAGAGAAATACAGTAAAAGGATTTAGGAAGTTCAACATTTTGGGCACGCACACAAAAGTGATGAACATGGAGGAGTCCACCAATGGCAGTCTGGCGGCTGAATTTCGGCACCTGCAATTGAAAGAACAGAAAAATGCTGGCACCAGAACGAATGAGGGTCCTCTCATCGTTACTGAAGAGCTTCACTCCCTTAGTTTTGAAACCCAATTGTGCCAGCCTGGTTTGGTAATTGACCTCGAGACGACCTCTCTGCCCGTTGTGGTGATCTCCAACGTCAGCCAGCTCCCGAGCGGTTGGGCCTCCATCCTTTGGTACAACATGCTGGTGGCGGAACCCAGGAATCTGTCCTTCTTCCTGACTCCACCATGTGCACGATGGGCTCAGCTTTCAGAAGTGCTGAGTTGGCAGTTTTCTTCTGTCACCAAAAGAGGTCTCAATGTGGACCAGCTGAACATGTTGGGAGAGAAGCTTCTTGGTCCTAACGCCAGCCCCGATGGTCTCATTCCGTGGACGAGGTTTTGTAAGGAAAATATAAATGATAAAAATTTTCCCTTCTGGCTTTGGATTGAAAGCATCCTAGAACTCATTAAAAAACACCTGCTCCCTCTCTGGAATGATGGGTGCATCATGGGCTTCATCAGCAAGGAGCGAGAGCGTGCCCTGTTGAAGGACCAGCAGCCGGGGACCTTCCTGCTGCGGTTCAGTGAGAGCTCCCGGGAAGGGGCCATCACATTCACATGGGTGGAGCGGTCCCAGAACGGAGGCGAACCTGACTTCCATGCGGTTGAACCCTACACGAAGAAAGAACTTTCTGCTGTTACTTTCCCTGACATCATTCGCAATTACAAAGTCATGGCTGCTGAGAATATTCCTGAGAATCCCCTGAAGTATCTGTATCCAAATATTGACAAAGACCATGCCTTTGGAAAGTATTACTCCAGGCCAAAGGAAGCACCAGAGCCAATGGAACTTGATGGCCCTAAAGGAACTGGATTTATCAAGACTGAGTTGATTTCTGTGTCTGAAGTTCACCCTTCTAGACTTCAGACCACAGACAACCTGCTCCCCATGTCTCCTGAGGAGTTTGACGAGGTGTCTCGGATAGTGGGCTCTGTAGAATTCGACAGTATGATGAACACAGTA [서열번호 67]
따라서, 바람직하게는 상기 STAT1 우성 음성 형태 폴리펩티드는 실질적으로 서열번호 67에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 68의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGUCUCAGUGGUACGAACUUCAGCAGCUUGACUCAAAAUUCCUGGAGCAGGUUCACCAGCUUUAUGAUGACAGUUUUCCCAUGGAAAUCAGACAGUACCUGGCACAGUGGUUAGAAAAGCAAGACUGGGAGCACGCUGCCAAUGAUGUUUCAUUUGCCACCAUCCGUUUUCAUGACCUCCUGUCACAGCUGGAUGAUCAAUAUAGUCGCUUUUCUUUGGAGAAUAACUUCUUGCUACAGCAUAACAUAAGGAAAAGCAAGCGUAAUCUUCAGGAUAAUUUUCAGGAAGACCCAAUCCAGAUGUCUAUGAUCAUUUACAGCUGUCUGAAGGAAGAAAGGAAAAUUCUGGAAAACGCCCAGAGAUUUAAUCAGGCUCAGUCGGGGAAUAUUCAGAGCACAGUGAUGUUAGACAAACAGAAAGAGCUUGACAGUAAAGUCAGAAAUGUGAAGGACAAGGUUAUGUGUAUAGAGCAUGAAAUCAAGAGCCUGGAAGAUUUACAAGAUGAAUAUGACUUCAAAUGCAAAACCUUGCAGAACAGAGAACACGAGACCAAUGGUGUGGCAAAGAGUGAUCAGAAACAAGAACAGCUGUUACUCAAGAAGAUGUAUUUAAUGCUUGACAAUAAGAGAAAGGAAGUAGUUCACAAAAUAAUAGAGUUGCUGAAUGUCACUGAACUUACCCAGAAUGCCCUGAUUAAUGAUGAACUAGUGGAGUGGAAGCGGAGACAGCAGAGCGCCUGUAUUGGGGGGCCGCCCAAUGCUUGCUUGGAUCAGCUGCAGAACUGGUUCACUAUAGUUGCGGAGAGUCUGCAGCAAGUUCGGCAGCAGCUUAAAAAGUUGGAGGAAUUGGAACAGAAAUACACCUACGAACAUGACCCUAUCACAAAAAACAAACAAGUGUUAUGGGACCGCACCUUCAGUCUUUUCCAGCAGCUCAUUCAGAGCUCGUUUGUGGUGGAAAGACAGCCCUGCAUGCCAACGCACCCUCAGAGGCCGCUGGUCUUGAAGACAGGGGUCCAGUUCACUGUGAAGUUGAGACUGUUGGUGAAAUUGCAAGAGCUGAAUUAUAAUUUGAAAGUCAAAGUCUUAUUUGAUAAAGAUGUGAAUGAGAGAAAUACAGUAAAAGGAUUUAGGAAGUUCAACAUUUUGGGCACGCACACAAAAGUGAUGAACAUGGAGGAGUCCACCAAUGGCAGUCUGGCGGCUGAAUUUCGGCACCUGCAAUUGAAAGAACAGAAAAAUGCUGGCACCAGAACGAAUGAGGGUCCUCUCAUCGUUACUGAAGAGCUUCACUCCCUUAGUUUUGAAACCCAAUUGUGCCAGCCUGGUUUGGUAAUUGACCUCGAGACGACCUCUCUGCCCGUUGUGGUGAUCUCCAACGUCAGCCAGCUCCCGAGCGGUUGGGCCUCCAUCCUUUGGUACAACAUGCUGGUGGCGGAACCCAGGAAUCUGUCCUUCUUCCUGACUCCACCAUGUGCACGAUGGGCUCAGCUUUCAGAAGUGCUGAGUUGGCAGUUUUCUUCUGUCACCAAAAGAGGUCUCAAUGUGGACCAGCUGAACAUGUUGGGAGAGAAGCUUCUUGGUCCUAACGCCAGCCCCGAUGGUCUCAUUCCGUGGACGAGGUUUUGUAAGGAAAAUAUAAAUGAUAAAAAUUUUCCCUUCUGGCUUUGGAUUGAAAGCAUCCUAGAACUCAUUAAAAAACACCUGCUCCCUCUCUGGAAUGAUGGGUGCAUCAUGGGCUUCAUCAGCAAGGAGCGAGAGCGUGCCCUGUUGAAGGACCAGCAGCCGGGGACCUUCCUGCUGCGGUUCAGUGAGAGCUCCCGGGAAGGGGCCAUCACAUUCACAUGGGUGGAGCGGUCCCAGAACGGAGGCGAACCUGACUUCCAUGCGGUUGAACCCUACACGAAGAAAGAACUUUCUGCUGUUACUUUCCCUGACAUCAUUCGCAAUUACAAAGUCAUGGCUGCUGAGAAUAUUCCUGAGAAUCCCCUGAAGUAUCUGUAUCCAAAUAUUGACAAAGACCAUGCCUUUGGAAAGUAUUACUCCAGGCCAAAGGAAGCACCAGAGCCAAUGGAACUUGAUGGCCCUAAAGGAACUGGAUUUAUCAAGACUGAGUUGAUUUCUGUGUCUGAAGUUCACCCUUCUAGACUUCAGACCACAGACAACCUGCUCCCCAUGUCUCCUGAGGAGUUUGACGAGGUGUCUCGGAUAGUGGGCUCUGUAGAAUUCGACAGUAUGAUGAACACAGUA [서열번호 68]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 68에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 66의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 69로 제공된다:
ATGAGCCAGTGGTACGAGCTGCAGCAGCTGGACAGCAAGTTCCTGGAACAGGTGCACCAGCTGTACGACGACAGCTTCCCCATGGAAATCCGGCAGTATCTGGCCCAGTGGCTGGAAAAGCAGGATTGGGAACACGCCGCCAACGACGTGTCCTTCGCCACCATCAGATTCCACGACCTGCTGAGCCAGCTGGACGACCAGTACAGCAGATTCAGCCTGGAAAACAACTTCCTGCTCCAGCACAACATCCGGAAGTCCAAGCGGAACCTGCAGGACAACTTCCAAGAGGACCCCATCCAGATGTCCATGATCATCTACAGCTGCCTGAAAGAGGAACGGAAGATCCTGGAAAATGCCCAGCGGTTCAATCAGGCCCAGAGCGGCAATATCCAGAGCACCGTGATGCTGGACAAGCAGAAAGAACTGGACTCCAAAGTGCGGAACGTCAAGGACAAAGTGATGTGCATCGAGCACGAGATCAAGAGCCTGGAAGATCTGCAGGACGAGTACGACTTCAAGTGCAAGACCCTGCAGAACCGGGAACACGAGACAAACGGCGTGGCCAAGAGCGACCAGAAGCAAGAACAGCTGCTCCTGAAGAAAATGTACCTGATGCTCGACAACAAACGGAAAGAGGTGGTCCACAAGATCATCGAGCTGCTGAACGTGACCGAGCTGACCCAGAACGCCCTGATCAACGACGAGCTGGTGGAATGGAAGCGGAGACAGCAGTCTGCCTGTATCGGCGGACCTCCTAATGCCTGCCTGGACCAGCTGCAGAACTGGTTCACAATCGTGGCCGAGAGCCTGCAGCAAGTGCGCCAGCAGCTGAAGAAGCTGGAAGAACTCGAGCAGAAGTACACCTACGAGCACGACCCCATCACCAAGAACAAACAGGTGCTGTGGGACAGAACCTTCAGCCTGTTCCAACAGCTGATCCAGTCCAGCTTCGTGGTGGAAAGACAGCCCTGCATGCCTACACACCCTCAGAGGCCACTGGTGCTGAAAACCGGCGTGCAGTTCACCGTGAAGCTGCGGCTGCTGGTCAAGCTGCAAGAGCTGAACTACAACCTGAAAGTGAAGGTGCTGTTCGACAAGGACGTGAACGAGCGGAACACCGTGAAAGGCTTCCGCAAGTTCAACATCCTGGGCACCCACACAAAAGTGATGAACATGGAAGAGAGCACCAACGGCAGCCTGGCCGCCGAGTTTAGACACCTCCAGCTGAAAGAGCAGAAGAACGCCGGCACCAGGACCAATGAGGGACCTCTGATCGTGACAGAGGAACTGCACAGCCTGAGCTTCGAAACCCAGCTGTGTCAGCCAGGCCTCGTGATCGATCTGGAAACCACAAGCCTGCCTGTGGTGGTCATCAGCAATGTGTCCCAGCTGCCTTCTGGCTGGGCCAGCATCCTGTGGTACAACATGCTGGTGGCCGAGCCTCGGAACCTGTCCTTCTTTCTGACCCCTCCATGTGCCAGATGGGCCCAGCTGTCTGAAGTGCTGAGCTGGCAGTTTAGCAGCGTGACCAAGAGGGGCCTGAATGTCGACCAGCTGAATATGCTGGGCGAGAAGCTGCTGGGCCCCAACGCTTCTCCTGATGGACTGATCCCTTGGACCAGATTCTGCAAAGAGAATATCAACGACAAGAACTTCCCGTTCTGGCTGTGGATCGAGAGCATCCTGGAACTGATCAAGAAACATCTGCTGCCCCTGTGGAACGACGGCTGCATCATGGGCTTCATCTCCAAAGAGAGAGAGCGGGCCCTGCTGAAGGATCAGCAGCCAGGCACATTCCTGCTGCGGTTTAGCGAGTCTAGCAGAGAGGGCGCCATCACCTTTACCTGGGTCGAGAGATCTCAGAACGGCGGCGAGCCTGATTTTCACGCCGTGGAACCCTACACCAAAAAAGAACTGAGCGCCGTGACATTCCCCGACATCATCCGGAACTACAAAGTCATGGCCGCTGAGAATATCCCCGAGAATCCCCTGAAGTATCTGTACCCCAACATCGATAAGGACCACGCCTTCGGCAAGTACTACAGCAGACCCAAAGAGGCCCCTGAGCCTATGGAACTGGATGGCCCTAAAGGCACCGGCTTCATCAAGACAGAGCTGATCTCCGTGTCCGAGGTGCACCCTAGCAGACTGCAGACCACCGATAACCTGCTGCCTATGAGCCCCGAGGAATTCGACGAGGTGTCCAGAATCGTGGGCAGCGTGGAATTCGATAGCATGATGAATACCGTGTGA [서열번호 69]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 69에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 69의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 70으로 제공된다:
AUGAGCCAGUGGUACGAGCUGCAGCAGCUGGACAGCAAGUUCCUGGAACAGGUGCACCAGCUGUACGACGACAGCUUCCCCAUGGAAAUCCGGCAGUAUCUGGCCCAGUGGCUGGAAAAGCAGGAUUGGGAACACGCCGCCAACGACGUGUCCUUCGCCACCAUCAGAUUCCACGACCUGCUGAGCCAGCUGGACGACCAGUACAGCAGAUUCAGCCUGGAAAACAACUUCCUGCUCCAGCACAACAUCCGGAAGUCCAAGCGGAACCUGCAGGACAACUUCCAAGAGGACCCCAUCCAGAUGUCCAUGAUCAUCUACAGCUGCCUGAAAGAGGAACGGAAGAUCCUGGAAAAUGCCCAGCGGUUCAAUCAGGCCCAGAGCGGCAAUAUCCAGAGCACCGUGAUGCUGGACAAGCAGAAAGAACUGGACUCCAAAGUGCGGAACGUCAAGGACAAAGUGAUGUGCAUCGAGCACGAGAUCAAGAGCCUGGAAGAUCUGCAGGACGAGUACGACUUCAAGUGCAAGACCCUGCAGAACCGGGAACACGAGACAAACGGCGUGGCCAAGAGCGACCAGAAGCAAGAACAGCUGCUCCUGAAGAAAAUGUACCUGAUGCUCGACAACAAACGGAAAGAGGUGGUCCACAAGAUCAUCGAGCUGCUGAACGUGACCGAGCUGACCCAGAACGCCCUGAUCAACGACGAGCUGGUGGAAUGGAAGCGGAGACAGCAGUCUGCCUGUAUCGGCGGACCUCCUAAUGCCUGCCUGGACCAGCUGCAGAACUGGUUCACAAUCGUGGCCGAGAGCCUGCAGCAAGUGCGCCAGCAGCUGAAGAAGCUGGAAGAACUCGAGCAGAAGUACACCUACGAGCACGACCCCAUCACCAAGAACAAACAGGUGCUGUGGGACAGAACCUUCAGCCUGUUCCAACAGCUGAUCCAGUCCAGCUUCGUGGUGGAAAGACAGCCCUGCAUGCCUACACACCCUCAGAGGCCACUGGUGCUGAAAACCGGCGUGCAGUUCACCGUGAAGCUGCGGCUGCUGGUCAAGCUGCAAGAGCUGAACUACAACCUGAAAGUGAAGGUGCUGUUCGACAAGGACGUGAACGAGCGGAACACCGUGAAAGGCUUCCGCAAGUUCAACAUCCUGGGCACCCACACAAAAGUGAUGAACAUGGAAGAGAGCACCAACGGCAGCCUGGCCGCCGAGUUUAGACACCUCCAGCUGAAAGAGCAGAAGAACGCCGGCACCAGGACCAAUGAGGGACCUCUGAUCGUGACAGAGGAACUGCACAGCCUGAGCUUCGAAACCCAGCUGUGUCAGCCAGGCCUCGUGAUCGAUCUGGAAACCACAAGCCUGCCUGUGGUGGUCAUCAGCAAUGUGUCCCAGCUGCCUUCUGGCUGGGCCAGCAUCCUGUGGUACAACAUGCUGGUGGCCGAGCCUCGGAACCUGUCCUUCUUUCUGACCCCUCCAUGUGCCAGAUGGGCCCAGCUGUCUGAAGUGCUGAGCUGGCAGUUUAGCAGCGUGACCAAGAGGGGCCUGAAUGUCGACCAGCUGAAUAUGCUGGGCGAGAAGCUGCUGGGCCCCAACGCUUCUCCUGAUGGACUGAUCCCUUGGACCAGAUUCUGCAAAGAGAAUAUCAACGACAAGAACUUCCCGUUCUGGCUGUGGAUCGAGAGCAUCCUGGAACUGAUCAAGAAACAUCUGCUGCCCCUGUGGAACGACGGCUGCAUCAUGGGCUUCAUCUCCAAAGAGAGAGAGCGGGCCCUGCUGAAGGAUCAGCAGCCAGGCACAUUCCUGCUGCGGUUUAGCGAGUCUAGCAGAGAGGGCGCCAUCACCUUUACCUGGGUCGAGAGAUCUCAGAACGGCGGCGAGCCUGAUUUUCACGCCGUGGAACCCUACACCAAAAAAGAACUGAGCGCCGUGACAUUCCCCGACAUCAUCCGGAACUACAAAGUCAUGGCCGCUGAGAAUAUCCCCGAGAAUCCCCUGAAGUAUCUGUACCCCAACAUCGAUAAGGACCACGCCUUCGGCAAGUACUACAGCAGACCCAAAGAGGCCCCUGAGCCUAUGGAACUGGAUGGCCCUAAAGGCACCGGCUUCAUCAAGACAGAGCUGAUCUCCGUGUCCGAGGUGCACCCUAGCAGACUGCAGACCACCGAUAACCUGCUGCCUAUGAGCCCCGAGGAAUUCGACGAGGUGUCCAGAAUCGUGGGCAGCGUGGAAUUCGAUAGCAUGAUGAAUACCGUGUGA [서열번호 70]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 70에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 선천적 신호전달 경로의 억제제, 또는 그의 우성 음성 작용 형태는 IRF9에 결합하는 STAT2 짧은 형태(short form)이다. 상기 STAT2 우성 음성 짧은 형태의 한 구현예는 STAT2(133-315)(NCBI 참조 서열: NM_005419.4; UniProtKB - P52630(STAT2_HUMAN)), 또는 그의 동원체로 나타내며, 본 명세서에서 다음과 같은 서열번호 71로 나타낸다:
VLETPVESQQHEIESRILDLRAMMEKLVKSISQLKDQQDVFCFRYKIQAKGKTPSLDPHQTKEQKILQETLNELDKRRKEVLDASKALLGRLTTLIELLLPKLEEWKAQQQKACIRAPIDHGLEQLETWFTAGAKLLFHLRQLLKELKGLSCLVSYQDDPLTKGVDLRNAQVTELLQRLLHRA [서열번호 71]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 71에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 STAT2 짧은 형태 폴리펩티드는 다음과 같은 서열번호 72의 DNA 뉴클레오티드 서열에 의해 암호화된다:
GTTCTCGAAACACCTGTGGAGAGCCAGCAACATGAGATTGAATCCCGGATCCTGGATTTAAGGGCTATGATGGAGAAGCTGGTAAAATCCATCAGCCAACTGAAAGACCAGCAGGATGTCTTCTGCTTCCGATATAAGATCCAGGCCAAAGGGAAGACACCCTCTCTGGACCCCCATCAGACCAAAGAGCAGAAGATTCTGCAGGAAACTCTCAATGAACTGGACAAAAGGAGAAAGGAGGTGCTGGATGCCTCCAAAGCACTGCTAGGCCGATTAACTACCCTAATCGAGCTACTGCTGCCAAAGTTGGAGGAGTGGAAGGCCCAGCAGCAAAAAGCCTGCATCAGAGCTCCCATTGACCACGGGTTGGAACAGCTGGAGACATGGTTCACAGCTGGAGCAAAGCTGTTGTTTCACCTGAGGCAGCTGCTGAAGGAGCTGAAGGGACTGAGTTGCCTGGTTAGCTATCAGGATGACCCTCTGACCAAAGGGGTGGACCTACGCAACGCCCAGGTCACAGAGTTGCTACAGCGTCTGCTCCACAGAGCC [서열번호 72]
따라서, 바람직하게는 상기 STAT2 짧은 형태 폴리펩티드는 실질적으로 서열번호 72에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 73의 RNA 뉴클레오티드 서열을 포함할 수 있다:
GUUCUCGAAACACCUGUGGAGAGCCAGCAACAUGAGAUUGAAUCCCGGAUCCUGGAUUUAAGGGCUAUGAUGGAGAAGCUGGUAAAAUCCAUCAGCCAACUGAAAGACCAGCAGGAUGUCUUCUGCUUCCGAUAUAAGAUCCAGGCCAAAGGGAAGACACCCUCUCUGGACCCCCAUCAGACCAAAGAGCAGAAGAUUCUGCAGGAAACUCUCAAUGAACUGGACAAAAGGAGAAAGGAGGUGCUGGAUGCCUCCAAAGCACUGCUAGGCCGAUUAACUACCCUAAUCGAGCUACUGCUGCCAAAGUUGGAGGAGUGGAAGGCCCAGCAGCAAAAAGCCUGCAUCAGAGCUCCCAUUGACCACGGGUUGGAACAGCUGGAGACAUGGUUCACAGCUGGAGCAAAGCUGUUGUUUCACCUGAGGCAGCUGCUGAAGGAGCUGAAGGGACUGAGUUGCCUGGUUAGCUAUCAGGAUGACCCUCUGACCAAAGGGGUGGACCUACGCAACGCCCAGGUCACAGAGUUGCUACAGCGUCUGCUCCACAGAGCC [서열번호 73]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 73에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 71의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 74로 제공된다:
ATGGTGCTGGAAACCCCTGTGGAAAGCCAGCAGCACGAGATCGAGAGCAGAATCCTGGACCTGCGGGCCATGATGGAAAAGCTGGTCAAGAGCATCAGCCAGCTGAAGGACCAGCAGGACGTGTTCTGCTTCCGGTACAAGATCCAGGCCAAGGGCAAGACCCCTAGCCTGGATCCTCACCAGACCAAAGAGCAGAAGATCCTGCAAGAGACACTGAACGAGCTGGACAAGCGGCGGAAAGAAGTGCTGGACGCCTCTAAAGCTCTGCTGGGCAGACTGACCACTCTGATCGAACTGCTGCTGCCCAAGCTGGAAGAGTGGAAGGCCCAGCAACAGAAGGCCTGCATCAGAGCCCCTATCGACCACGGACTGGAACAGCTGGAAACATGGTTTACCGCTGGCGCCAAGCTGCTGTTCCACCTGAGACAGCTGCTGAAAGAGCTGAAGGGCCTGAGCTGCCTGGTGTCCTACCAGGATGACCCTCTGACCAAAGGCGTGGACCTGAGAAACGCCCAAGTGACCGAACTGCTCCAGCGGCTGCTGCATAGAGCTTGA [서열번호 74]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 74에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 74의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 75로 제공된다:
AUGGUGCUGGAAACCCCUGUGGAAAGCCAGCAGCACGAGAUCGAGAGCAGAAUCCUGGACCUGCGGGCCAUGAUGGAAAAGCUGGUCAAGAGCAUCAGCCAGCUGAAGGACCAGCAGGACGUGUUCUGCUUCCGGUACAAGAUCCAGGCCAAGGGCAAGACCCCUAGCCUGGAUCCUCACCAGACCAAAGAGCAGAAGAUCCUGCAAGAGACACUGAACGAGCUGGACAAGCGGCGGAAAGAAGUGCUGGACGCCUCUAAAGCUCUGCUGGGCAGACUGACCACUCUGAUCGAACUGCUGCUGCCCAAGCUGGAAGAGUGGAAGGCCCAGCAACAGAAGGCCUGCAUCAGAGCCCCUAUCGACCACGGACUGGAACAGCUGGAAACAUGGUUUACCGCUGGCGCCAAGCUGCUGUUCCACCUGAGACAGCUGCUGAAAGAGCUGAAGGGCCUGAGCUGCCUGGUGUCCUACCAGGAUGACCCUCUGACCAAAGGCGUGGACCUGAGAAACGCCCAAGUGACCGAACUGCUCCAGCGGCUGCUGCAUAGAGCUUGA [서열번호 75]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 75에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 선천적 신호전달 경로의 억제제, 또는 그의 우성 음성 형태는 IRF9에 결합하는 STAT2 우성 음성 긴 형태(long form)이다. STAT2(NCBI 참조 서열: NM_005419.4; UniProtKB - P52630(STAT2_HUMAN)), 또는 그의 동원체는 우성 음성 방식으로 작용해 ISGF-3 형성을 차단할 수 있는 F175D Y701F 돌연변이(STAT2(1-851-F175DY701F))에 의해 우성 음성으로 될 수 있으며(Rengachari S, Groiss S, Devos JM, Caron E, Grandvaux N, Panne D. Structure of the STAT2-IRF9 complex. PNAS. 2018, 115 (4) E601-E609; DOI: 10.1073/pnas.1718426115), 본 명세서에서 서열번호 76으로 나타낸다:
MAQWEMLQNLDSPFQDQLHQLYSHSLLPVDIRQYLAVWIEDQNWQEAALGSDDSKATMLFFHFLDQLNYECGRCSQDPESLLLQHNLRKFCRDIQPFSQDPTQLAEMIFNLLLEEKRILIQAQRAQLEQGEPVLETPVESQQHEIESRILDLRAMMEKLVKSISQLKDQQDVFCDRYKIQAKGKTPSLDPHQTKEQKILQETLNELDKRRKEVLDASKALLGRLTTLIELLLPKLEEWKAQQQKACIRAPIDHGLEQLETWFTAGAKLLFHLRQLLKELKGLSCLVSYQDDPLTKGVDLRNAQVTELLQRLLHRAFVVETQPCMPQTPHRPLILKTGSKFTVRTRLLVRLQEGNESLTVEVSIDRNPPQLQGFRKFNILTSNQKTLTPEKGQSQGLIWDFGYLTLVEQRSGGSGKGSNKGPLGVTEELHIISFTVKYTYQGLKQELKTDTLPVVIISNMNQLSIAWASVLWFNLLSPNLQNQQFFSNPPKAPWSLLGPALSWQFSSYVGRGLNSDQLSMLRNKLFGQNCRTEDPLLSWADFTKRESPPGKLPFWTWLDKILELVHDHLKDLWNDGRIMGFVSRSQERRLLKKTMSGTFLLRFSESSEGGITCSWVEHQDDDKVLIYSVQPYTKEVLQSLPLTEIIRHYQLLTEENIPENPLRFLYPRIPRDEAFGCYYQEKVNLQERRKYLKHRLIVVSNRQVDELQQPLELKPEPELESLELELGLVPEPELSLDLEPLLKAGLDLGPELESVLESTLEPVIEPTLCMVSQTVPEPDQGPVSQPVPEPDLPCDLRHLNTEPMEIFRNCVKIEEIMPNGDPLLAGQNTVDEVYVSRPSHFYTDGPLMPSDF [서열번호 76]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 76에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 STAT2 우성 음성 긴 형태 폴리펩티드는 다음과 같은 서열번호 77의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCGCAGTGGGAAATGCTGCAGAATCTTGACAGCCCCTTTCAGGATCAGCTGCACCAGCTTTACTCGCACAGCCTCCTGCCTGTGGACATTCGACAGTACTTGGCTGTCTGGATTGAAGACCAGAACTGGCAGGAAGCTGCACTTGGGAGTGATGATTCCAAGGCTACCATGCTATTCTTCCACTTCTTGGATCAGCTGAACTATGAGTGTGGCCGTTGCAGCCAGGACCCAGAGTCCTTGTTGCTGCAGCACAATTTGCGGAAATTCTGCCGGGACATTCAGCCCTTTTCCCAGGATCCTACCCAGTTGGCTGAGATGATCTTTAACCTCCTTCTGGAAGAAAAAAGAATTTTGATCCAGGCTCAGAGGGCCCAATTGGAACAAGGAGAGCCAGTTCTCGAAACACCTGTGGAGAGCCAGCAACATGAGATTGAATCCCGGATCCTGGATTTAAGGGCTATGATGGAGAAGCTGGTAAAATCCATCAGCCAACTGAAAGACCAGCAGGATGTCTTCTGCGACCGATATAAGATCCAGGCCAAAGGGAAGACACCCTCTCTGGACCCCCATCAGACCAAAGAGCAGAAGATTCTGCAGGAAACTCTCAATGAACTGGACAAAAGGAGAAAGGAGGTGCTGGATGCCTCCAAAGCACTGCTAGGCCGATTAACTACCCTAATCGAGCTACTGCTGCCAAAGTTGGAGGAGTGGAAGGCCCAGCAGCAAAAAGCCTGCATCAGAGCTCCCATTGACCACGGGTTGGAACAGCTGGAGACATGGTTCACAGCTGGAGCAAAGCTGTTGTTTCACCTGAGGCAGCTGCTGAAGGAGCTGAAGGGACTGAGTTGCCTGGTTAGCTATCAGGATGACCCTCTGACCAAAGGGGTGGACCTACGCAACGCCCAGGTCACAGAGTTGCTACAGCGTCTGCTCCACAGAGCCTTTGTGGTAGAAACCCAGCCCTGCATGCCCCAAACTCCCCATCGACCCCTCATCCTCAAGACTGGCAGCAAGTTCACCGTCCGAACAAGGCTGCTGGTGAGACTCCAGGAAGGCAATGAGTCACTGACTGTGGAAGTCTCCATTGACAGGAATCCTCCTCAATTACAAGGCTTCCGGAAGTTCAACATTCTGACTTCAAACCAGAAAACTTTGACCCCCGAGAAGGGGCAGAGTCAGGGTTTGATTTGGGACTTTGGTTACCTGACTCTGGTGGAGCAACGTTCAGGTGGTTCAGGAAAGGGCAGCAATAAGGGGCCACTAGGTGTGACAGAGGAACTGCACATCATCAGCTTCACGGTCAAATATACCTACCAGGGTCTGAAGCAGGAGCTGAAAACGGACACCCTCCCTGTGGTGATTATTTCCAACATGAACCAGCTCTCAATTGCCTGGGCTTCAGTTCTCTGGTTCAATTTGCTCAGCCCAAACCTTCAGAACCAGCAGTTCTTCTCCAACCCCCCCAAGGCCCCCTGGAGCTTGCTGGGCCCTGCTCTCAGTTGGCAGTTCTCCTCCTATGTTGGCCGAGGCCTCAACTCAGACCAGCTGAGCATGCTGAGAAACAAGCTGTTCGGGCAGAACTGTAGGACTGAGGATCCATTATTGTCCTGGGCTGACTTCACTAAGCGAGAGAGCCCTCCTGGCAAGTTACCATTCTGGACATGGCTGGACAAAATTCTGGAGTTGGTACATGACCACCTGAAGGATCTCTGGAATGATGGACGCATCATGGGCTTTGTGAGTCGGAGCCAGGAGCGCCGGCTGCTGAAGAAGACCATGTCTGGCACCTTTCTACTGCGCTTCAGTGAATCGTCAGAAGGGGGCATTACCTGCTCCTGGGTGGAGCACCAGGATGATGACAAGGTGCTCATCTACTCTGTGCAACCGTACACGAAGGAGGTGCTGCAGTCACTCCCGCTGACTGAAATCATCCGCCATTACCAGTTGCTCACTGAGGAGAATATACCTGAAAACCCACTGCGCTTCCTCTATCCCCGAATCCCCCGGGATGAAGCTTTTGGGTGCTACTACCAGGAGAAAGTTAATCTCCAGGAACGGAGGAAATACCTGAAACACAGGCTCATTGTGGTCTCTAATAGACAGGTGGATGAACTGCAACAACCGCTGGAGCTTAAGCCAGAGCCAGAGCTGGAGTCATTAGAGCTGGAACTAGGGCTGGTGCCAGAGCCAGAGCTCAGCCTGGACTTAGAGCCACTGCTGAAGGCAGGGCTGGATCTGGGGCCAGAGCTAGAGTCTGTGCTGGAGTCCACTCTGGAGCCTGTGATAGAGCCCACACTATGCATGGTATCACAAACAGTGCCAGAGCCAGACCAAGGACCTGTATCACAGCCAGTGCCAGAGCCAGATTTGCCCTGTGATCTGAGACATTTGAACACTGAGCCAATGGAAATCTTCAGAAACTGTGTAAAGATTGAAGAAATCATGCCGAATGGTGACCCACTGTTGGCTGGCCAGAACACCGTGGATGAGGTTTACGTCTCCCGCCCCAGCCACTTCTACACTGATGGACCCTTGATGCCTTCTGACTTC [서열번호 77]
따라서, 바람직하게는 상기 STAT2 우성 음성 긴 형태 폴리펩티드는 실질적으로 서열번호 77에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 78의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCGCAGUGGGAAAUGCUGCAGAAUCUUGACAGCCCCUUUCAGGAUCAGCUGCACCAGCUUUACUCGCACAGCCUCCUGCCUGUGGACAUUCGACAGUACUUGGCUGUCUGGAUUGAAGACCAGAACUGGCAGGAAGCUGCACUUGGGAGUGAUGAUUCCAAGGCUACCAUGCUAUUCUUCCACUUCUUGGAUCAGCUGAACUAUGAGUGUGGCCGUUGCAGCCAGGACCCAGAGUCCUUGUUGCUGCAGCACAAUUUGCGGAAAUUCUGCCGGGACAUUCAGCCCUUUUCCCAGGAUCCUACCCAGUUGGCUGAGAUGAUCUUUAACCUCCUUCUGGAAGAAAAAAGAAUUUUGAUCCAGGCUCAGAGGGCCCAAUUGGAACAAGGAGAGCCAGUUCUCGAAACACCUGUGGAGAGCCAGCAACAUGAGAUUGAAUCCCGGAUCCUGGAUUUAAGGGCUAUGAUGGAGAAGCUGGUAAAAUCCAUCAGCCAACUGAAAGACCAGCAGGAUGUCUUCUGCGACCGAUAUAAGAUCCAGGCCAAAGGGAAGACACCCUCUCUGGACCCCCAUCAGACCAAAGAGCAGAAGAUUCUGCAGGAAACUCUCAAUGAACUGGACAAAAGGAGAAAGGAGGUGCUGGAUGCCUCCAAAGCACUGCUAGGCCGAUUAACUACCCUAAUCGAGCUACUGCUGCCAAAGUUGGAGGAGUGGAAGGCCCAGCAGCAAAAAGCCUGCAUCAGAGCUCCCAUUGACCACGGGUUGGAACAGCUGGAGACAUGGUUCACAGCUGGAGCAAAGCUGUUGUUUCACCUGAGGCAGCUGCUGAAGGAGCUGAAGGGACUGAGUUGCCUGGUUAGCUAUCAGGAUGACCCUCUGACCAAAGGGGUGGACCUACGCAACGCCCAGGUCACAGAGUUGCUACAGCGUCUGCUCCACAGAGCCUUUGUGGUAGAAACCCAGCCCUGCAUGCCCCAAACUCCCCAUCGACCCCUCAUCCUCAAGACUGGCAGCAAGUUCACCGUCCGAACAAGGCUGCUGGUGAGACUCCAGGAAGGCAAUGAGUCACUGACUGUGGAAGUCUCCAUUGACAGGAAUCCUCCUCAAUUACAAGGCUUCCGGAAGUUCAACAUUCUGACUUCAAACCAGAAAACUUUGACCCCCGAGAAGGGGCAGAGUCAGGGUUUGAUUUGGGACUUUGGUUACCUGACUCUGGUGGAGCAACGUUCAGGUGGUUCAGGAAAGGGCAGCAAUAAGGGGCCACUAGGUGUGACAGAGGAACUGCACAUCAUCAGCUUCACGGUCAAAUAUACCUACCAGGGUCUGAAGCAGGAGCUGAAAACGGACACCCUCCCUGUGGUGAUUAUUUCCAACAUGAACCAGCUCUCAAUUGCCUGGGCUUCAGUUCUCUGGUUCAAUUUGCUCAGCCCAAACCUUCAGAACCAGCAGUUCUUCUCCAACCCCCCCAAGGCCCCCUGGAGCUUGCUGGGCCCUGCUCUCAGUUGGCAGUUCUCCUCCUAUGUUGGCCGAGGCCUCAACUCAGACCAGCUGAGCAUGCUGAGAAACAAGCUGUUCGGGCAGAACUGUAGGACUGAGGAUCCAUUAUUGUCCUGGGCUGACUUCACUAAGCGAGAGAGCCCUCCUGGCAAGUUACCAUUCUGGACAUGGCUGGACAAAAUUCUGGAGUUGGUACAUGACCACCUGAAGGAUCUCUGGAAUGAUGGACGCAUCAUGGGCUUUGUGAGUCGGAGCCAGGAGCGCCGGCUGCUGAAGAAGACCAUGUCUGGCACCUUUCUACUGCGCUUCAGUGAAUCGUCAGAAGGGGGCAUUACCUGCUCCUGGGUGGAGCACCAGGAUGAUGACAAGGUGCUCAUCUACUCUGUGCAACCGUACACGAAGGAGGUGCUGCAGUCACUCCCGCUGACUGAAAUCAUCCGCCAUUACCAGUUGCUCACUGAGGAGAAUAUACCUGAAAACCCACUGCGCUUCCUCUAUCCCCGAAUCCCCCGGGAUGAAGCUUUUGGGUGCUACUACCAGGAGAAAGUUAAUCUCCAGGAACGGAGGAAAUACCUGAAACACAGGCUCAUUGUGGUCUCUAAUAGACAGGUGGAUGAACUGCAACAACCGCUGGAGCUUAAGCCAGAGCCAGAGCUGGAGUCAUUAGAGCUGGAACUAGGGCUGGUGCCAGAGCCAGAGCUCAGCCUGGACUUAGAGCCACUGCUGAAGGCAGGGCUGGAUCUGGGGCCAGAGCUAGAGUCUGUGCUGGAGUCCACUCUGGAGCCUGUGAUAGAGCCCACACUAUGCAUGGUAUCACAAACAGUGCCAGAGCCAGACCAAGGACCUGUAUCACAGCCAGUGCCAGAGCCAGAUUUGCCCUGUGAUCUGAGACAUUUGAACACUGAGCCAAUGGAAAUCUUCAGAAACUGUGUAAAGAUUGAAGAAAUCAUGCCGAAUGGUGACCCACUGUUGGCUGGCCAGAACACCGUGGAUGAGGUUUACGUCUCCCGCCCCAGCCACUUCUACACUGAUGGACCCUUGAUGCCUUCUGACUUC [서열번호 78]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 78에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 76의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 79로 제공된다:
ATGGCCCAGTGGGAGATGCTGCAGAACCTGGACAGCCCCTTCCAGGATCAGCTGCACCAGCTGTACTCCCACTCTCTGCTGCCCGTGGACATCAGACAGTATCTGGCCGTGTGGATCGAGGACCAGAACTGGCAAGAAGCCGCTCTGGGCAGCGACGATAGCAAGGCCACAATGCTGTTCTTCCACTTCCTGGACCAGCTGAACTACGAGTGCGGCAGATGCAGCCAGGATCCAGAAAGTCTGCTGCTCCAGCACAACCTGCGGAAGTTCTGCAGAGACATCCAGCCATTCTCTCAGGACCCCACACAGCTGGCCGAGATGATCTTCAACCTGCTGCTGGAAGAGAAGCGGATCCTGATTCAGGCCCAGAGAGCCCAGCTGGAACAGGGCGAACCTGTCCTGGAAACCCCTGTGGAATCTCAGCAGCACGAGATCGAGAGCCGGATCCTGGATCTGCGGGCCATGATGGAAAAGCTGGTCAAGAGCATCAGCCAGCTGAAGGACCAGCAGGACGTGTTCTGCGACCGGTACAAGATCCAGGCCAAGGGCAAGACCCCTAGCCTGGATCCTCACCAGACCAAAGAGCAGAAGATCCTGCAAGAGACACTGAACGAGCTGGACAAGCGGCGGAAAGAAGTGCTGGACGCCTCTAAAGCTCTGCTGGGCAGACTGACCACTCTGATCGAACTGCTGCTGCCCAAGCTGGAAGAATGGAAGGCACAGCAGCAGAAGGCCTGCATCAGAGCCCCTATCGATCACGGCCTGGAACAGCTGGAAACCTGGTTTACAGCCGGCGCTAAGCTGCTGTTCCACCTGAGACAGCTGCTGAAAGAGCTGAAGGGCCTGAGCTGCCTGGTGTCCTACCAGGATGACCCTCTGACCAAAGGCGTGGACCTGAGAAACGCCCAAGTGACCGAACTGCTCCAGAGACTGCTGCACAGAGCCTTCGTGGTGGAAACCCAGCCTTGCATGCCCCAGACACCTCACAGACCCCTGATCCTGAAAACCGGCAGCAAGTTCACCGTGCGGACCAGACTGCTCGTGCGACTGCAAGAGGGCAATGAGAGCCTGACCGTGGAAGTGTCCATCGACAGAAACCCTCCACAGCTGCAGGGCTTCAGAAAGTTCAACATCCTGACCAGCAACCAGAAAACCCTGACACCTGAGAAGGGCCAGAGCCAGGGACTGATCTGGGACTTCGGCTACCTGACACTGGTCGAGCAGAGATCTGGCGGCTCTGGCAAGGGCTCTAACAAGGGACCTCTGGGCGTGACCGAGGAACTGCACATCATCAGCTTCACCGTGAAGTACACCTACCAGGGCCTGAAGCAAGAACTCAAGACCGACACACTGCCCGTCGTGATCATCAGCAACATGAACCAGCTGTCTATCGCCTGGGCCAGCGTGCTGTGGTTCAATCTGCTGAGCCCCAACCTGCAGAATCAGCAGTTCTTCAGCAACCCTCCTAAGGCTCCTTGGAGCCTGCTGGGACCTGCTCTGAGCTGGCAGTTTAGCAGCTATGTCGGCAGAGGCCTGAACAGCGATCAGCTGAGCATGCTGCGGAACAAGCTGTTCGGCCAGAACTGCAGGACCGAGGATCCACTGCTGAGCTGGGCCGACTTCACCAAGAGAGAGAGCCCTCCAGGCAAGCTGCCCTTCTGGACTTGGCTGGACAAAATCCTGGAACTGGTGCACGACCACCTGAAGGATCTGTGGAACGACGGCCGGATCATGGGCTTCGTGTCCAGATCTCAAGAGCGCAGACTGCTGAAAAAGACAATGAGCGGCACCTTCCTGCTGCGGTTCAGCGAATCTAGCGAAGGCGGCATCACCTGTAGCTGGGTCGAACACCAGGACGACGACAAGGTGCTGATCTACAGCGTGCAGCCCTACACCAAAGAGGTGCTGCAAAGCCTGCCTCTGACCGAGATCATCCGGCACTACCAGCTGCTCACCGAGGAAAACATCCCCGAGAATCCTCTGCGGTTTCTGTACCCTCGGATCCCCAGAGATGAGGCCTTTGGCTGCTACTACCAAGAGAAAGTGAATCTGCAAGAGCGGCGCAAGTACCTGAAGCACAGACTGATCGTGGTGTCCAACAGACAGGTGGACGAGCTGCAGCAGCCACTGGAACTGAAGCCTGAGCCAGAGCTGGAAAGCCTCGAGCTGGAACTTGGACTGGTGCCCGAGCCTGAACTGTCTCTGGATCTGGAACCTCTGCTGAAGGCCGGACTGGACCTCGGACCTGAACTGGAAAGCGTGCTGGAATCCACACTGGAACCTGTGATCGAGCCCACACTGTGCATGGTGTCTCAGACCGTGCCTGAACCAGATCAGGGCCCAGTGTCTCAGCCTGTTCCTGAGCCTGATCTGCCCTGCGATCTGAGGCACCTGAACACCGAGCCTATGGAAATCTTCCGGAACTGCGTGAAGATCGAGGAAATCATGCCCAACGGCGACCCTCTGCTGGCCGGACAGAATACCGTGGATGAAGTGTACGTGTCCCGGCCTAGCCACTTCTACACAGACGGACCTCTGATGCCCAGCGACTTCTGA [서열번호 79]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 79에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 79의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 80으로 제공된다:
AUGGCCCAGUGGGAGAUGCUGCAGAACCUGGACAGCCCCUUCCAGGAUCAGCUGCACCAGCUGUACUCCCACUCUCUGCUGCCCGUGGACAUCAGACAGUAUCUGGCCGUGUGGAUCGAGGACCAGAACUGGCAAGAAGCCGCUCUGGGCAGCGACGAUAGCAAGGCCACAAUGCUGUUCUUCCACUUCCUGGACCAGCUGAACUACGAGUGCGGCAGAUGCAGCCAGGAUCCAGAAAGUCUGCUGCUCCAGCACAACCUGCGGAAGUUCUGCAGAGACAUCCAGCCAUUCUCUCAGGACCCCACACAGCUGGCCGAGAUGAUCUUCAACCUGCUGCUGGAAGAGAAGCGGAUCCUGAUUCAGGCCCAGAGAGCCCAGCUGGAACAGGGCGAACCUGUCCUGGAAACCCCUGUGGAAUCUCAGCAGCACGAGAUCGAGAGCCGGAUCCUGGAUCUGCGGGCCAUGAUGGAAAAGCUGGUCAAGAGCAUCAGCCAGCUGAAGGACCAGCAGGACGUGUUCUGCGACCGGUACAAGAUCCAGGCCAAGGGCAAGACCCCUAGCCUGGAUCCUCACCAGACCAAAGAGCAGAAGAUCCUGCAAGAGACACUGAACGAGCUGGACAAGCGGCGGAAAGAAGUGCUGGACGCCUCUAAAGCUCUGCUGGGCAGACUGACCACUCUGAUCGAACUGCUGCUGCCCAAGCUGGAAGAAUGGAAGGCACAGCAGCAGAAGGCCUGCAUCAGAGCCCCUAUCGAUCACGGCCUGGAACAGCUGGAAACCUGGUUUACAGCCGGCGCUAAGCUGCUGUUCCACCUGAGACAGCUGCUGAAAGAGCUGAAGGGCCUGAGCUGCCUGGUGUCCUACCAGGAUGACCCUCUGACCAAAGGCGUGGACCUGAGAAACGCCCAAGUGACCGAACUGCUCCAGAGACUGCUGCACAGAGCCUUCGUGGUGGAAACCCAGCCUUGCAUGCCCCAGACACCUCACAGACCCCUGAUCCUGAAAACCGGCAGCAAGUUCACCGUGCGGACCAGACUGCUCGUGCGACUGCAAGAGGGCAAUGAGAGCCUGACCGUGGAAGUGUCCAUCGACAGAAACCCUCCACAGCUGCAGGGCUUCAGAAAGUUCAACAUCCUGACCAGCAACCAGAAAACCCUGACACCUGAGAAGGGCCAGAGCCAGGGACUGAUCUGGGACUUCGGCUACCUGACACUGGUCGAGCAGAGAUCUGGCGGCUCUGGCAAGGGCUCUAACAAGGGACCUCUGGGCGUGACCGAGGAACUGCACAUCAUCAGCUUCACCGUGAAGUACACCUACCAGGGCCUGAAGCAAGAACUCAAGACCGACACACUGCCCGUCGUGAUCAUCAGCAACAUGAACCAGCUGUCUAUCGCCUGGGCCAGCGUGCUGUGGUUCAAUCUGCUGAGCCCCAACCUGCAGAAUCAGCAGUUCUUCAGCAACCCUCCUAAGGCUCCUUGGAGCCUGCUGGGACCUGCUCUGAGCUGGCAGUUUAGCAGCUAUGUCGGCAGAGGCCUGAACAGCGAUCAGCUGAGCAUGCUGCGGAACAAGCUGUUCGGCCAGAACUGCAGGACCGAGGAUCCACUGCUGAGCUGGGCCGACUUCACCAAGAGAGAGAGCCCUCCAGGCAAGCUGCCCUUCUGGACUUGGCUGGACAAAAUCCUGGAACUGGUGCACGACCACCUGAAGGAUCUGUGGAACGACGGCCGGAUCAUGGGCUUCGUGUCCAGAUCUCAAGAGCGCAGACUGCUGAAAAAGACAAUGAGCGGCACCUUCCUGCUGCGGUUCAGCGAAUCUAGCGAAGGCGGCAUCACCUGUAGCUGGGUCGAACACCAGGACGACGACAAGGUGCUGAUCUACAGCGUGCAGCCCUACACCAAAGAGGUGCUGCAAAGCCUGCCUCUGACCGAGAUCAUCCGGCACUACCAGCUGCUCACCGAGGAAAACAUCCCCGAGAAUCCUCUGCGGUUUCUGUACCCUCGGAUCCCCAGAGAUGAGGCCUUUGGCUGCUACUACCAAGAGAAAGUGAAUCUGCAAGAGCGGCGCAAGUACCUGAAGCACAGACUGAUCGUGGUGUCCAACAGACAGGUGGACGAGCUGCAGCAGCCACUGGAACUGAAGCCUGAGCCAGAGCUGGAAAGCCUCGAGCUGGAACUUGGACUGGUGCCCGAGCCUGAACUGUCUCUGGAUCUGGAACCUCUGCUGAAGGCCGGACUGGACCUCGGACCUGAACUGGAAAGCGUGCUGGAAUCCACACUGGAACCUGUGAUCGAGCCCACACUGUGCAUGGUGUCUCAGACCGUGCCUGAACCAGAUCAGGGCCCAGUGUCUCAGCCUGUUCCUGAGCCUGAUCUGCCCUGCGAUCUGAGGCACCUGAACACCGAGCCUAUGGAAAUCUUCCGGAACUGCGUGAAGAUCGAGGAAAUCAUGCCCAACGGCGACCCUCUGCUGGCCGGACAGAAUACCGUGGAUGAAGUGUACGUGUCCCGGCCUAGCCACUUCUACACAGACGGACCUCUGAUGCCCAGCGACUUCUGA [서열번호 80]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 80에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 USP18(NCBI 참조 서열: NM_017414.4; UniProtKB - Q9UMW8(UBP18_HUMAN)), 또는 그의 동원체일 수 있다. USP18은 IFNAR2 및 STAT2과 상호작용해 타입 I 인터페론 신호전달을 차단하는 것으로 여겨진다. [Basters A, Knobeloch K-P, Fritz G. USP18 - a multifunctional component in the interferon response. Bioscience Reports 2018; 38; doi.org/10.1042/BSR20180250]; [Randall G, Chen L, Panis M, Fischer AK, Lindenbach BD, Sun J, Heathcote J, Rice CM, Edwards AM, McGilyray ID. Silencing of USP18 potentiates the antiviral activity of interferon against hepatitis C virus infection. Gastroenterol 2006; 1331(5): 1584-1591].
상기 USP18의 한 구현예는 본 명세서에서 다음과 같은 서열번호 161로 나타낸다:
MSKAFGLLRQICQSILAESSQSPADLEEKKEEDSNMKREQPRERPRAWDYPHGLVGLHNIGQTCCLNSLIQVFVMNVDFTRILKRITVPGADEQRRSVPFQMLLLLEKMQDSRQKAVRPLELAYCLQKCNVPLFVQHDAAQLYLKLWNLIKDQITDVHLVERLQALYTIRVKDSLICVDCAMESSRNSSMLTLPLSLFDVDSKPLKTLEDALHCFFQPRELSSKSKCFCENCGKKTRGKQVLKLTHLPQTLTIHLMRFSIRNSQTRKICHSLYFPQSLDFSQILPMKRESCDAEEQSGGQYELFAVIAHVGMADSGHYCVYIRNAVDGKWFCFNDSNICLVSWEDIQCTYGNPNYHWQETAYLLVYMKMEC [서열번호 161]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 161에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 USP18 폴리펩티드는 다음과 같은 서열번호 162의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAGCAAGGCGTTTGGGCTCCTGAGGCAAATCTGTCAGTCCATCCTGGCTGAGTCCTCGCAGTCCCCGGCAGATCTTGAAGAAAAGAAGGAAGAAGACAGCAACATGAAGAGAGAGCAGCCCAGAGAGCGTCCCAGGGCCTGGGACTACCCTCATGGCCTGGTTGGTTTACACAACATTGGACAGACCTGCTGCCTTAACTCCTTGATTCAGGTGTTCGTAATGAATGTGGACTTCACCAGGATATTGAAGAGGATCACGGTGCCCAGGGGAGCTGACGAGCAGAGGAGAAGCGTCCCTTTCCAGATGCTTCTGCTGCTGGAGAAGATGCAGGACAGCCGGCAGAAAGCAGTGCGGCCCCTGGAGCTGGCCTACTGCCTGCAGAAGTGCAACGTGCCCTTGTTTGTCCAACATGATGCTGCCCAACTGTACCTCAAACTCTGGAACCTGATTAAGGACCAGATCACTGATGTGCACTTGGTGGAGAGACTGCAGGCCCTGTATACGATCCGGGTGAAGGACTCCTTGATTTGCGTTGACTGTGCCATGGAGAGTAGCAGAAACAGCAGCATGCTCACCCTCCCACTTTCTCTTTTTGATGTGGACTCAAAGCCCCTGAAGACACTGGAGGACGCCCTGCACTGCTTCTTCCAGCCCAGGGAGTTATCAAGCAAAAGCAAGTGCTTCTGTGAGAACTGTGGGAAGAAGACCCGTGGGAAACAGGTCTTGAAGCTGACCCATTTGCCCCAGACCCTGACAATCCACCTCATGCGATTCTCCATCAGGAATTCACAGACGAGAAAGATCTGCCACTCCCTGTACTTCCCCCAGAGCTTGGATTTCAGCCAGATCCTTCCAATGAAGCGAGAGTCTTGTGATGCTGAGGAGCAGTCTGGAGGGCAGTATGAGCTTTTTGCTGTGATTGCGCACGTGGGAATGGCAGACTCCGGTCATTACTGTGTCTACATCCGGAATGCTGTGGATGGAAAATGGTTCTGCTTCAATGACTCCAATATTTGCTTGGTGTCCTGGGAAGACATCCAGTGTACCTACGGAAATCCTAACTACCACTGGCAGGAAACTGCATATCTTCTGGTTTACATGAAGATGGAGTGC [서열번호 162]
따라서, 바람직하게는 상기 USP18 폴리펩티드는 실질적으로 서열번호 162에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 163의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAGCAAGGCGUUUGGGCUCCUGAGGCAAAUCUGUCAGUCCAUCCUGGCUGAGUCCUCGCAGUCCCCGGCAGAUCUUGAAGAAAAGAAGGAAGAAGACAGCAACAUGAAGAGAGAGCAGCCCAGAGAGCGUCCCAGGGCCUGGGACUACCCUCAUGGCCUGGUUGGUUUACACAACAUUGGACAGACCUGCUGCCUUAACUCCUUGAUUCAGGUGUUCGUAAUGAAUGUGGACUUCACCAGGAUAUUGAAGAGGAUCACGGUGCCCAGGGGAGCUGACGAGCAGAGGAGAAGCGUCCCUUUCCAGAUGCUUCUGCUGCUGGAGAAGAUGCAGGACAGCCGGCAGAAAGCAGUGCGGCCCCUGGAGCUGGCCUACUGCCUGCAGAAGUGCAACGUGCCCUUGUUUGUCCAACAUGAUGCUGCCCAACUGUACCUCAAACUCUGGAACCUGAUUAAGGACCAGAUCACUGAUGUGCACUUGGUGGAGAGACUGCAGGCCCUGUAUACGAUCCGGGUGAAGGACUCCUUGAUUUGCGUUGACUGUGCCAUGGAGAGUAGCAGAAACAGCAGCAUGCUCACCCUCCCACUUUCUCUUUUUGAUGUGGACUCAAAGCCCCUGAAGACACUGGAGGACGCCCUGCACUGCUUCUUCCAGCCCAGGGAGUUAUCAAGCAAAAGCAAGUGCUUCUGUGAGAACUGUGGGAAGAAGACCCGUGGGAAACAGGUCUUGAAGCUGACCCAUUUGCCCCAGACCCUGACAAUCCACCUCAUGCGAUUCUCCAUCAGGAAUUCACAGACGAGAAAGAUCUGCCACUCCCUGUACUUCCCCCAGAGCUUGGAUUUCAGCCAGAUCCUUCCAAUGAAGCGAGAGUCUUGUGAUGCUGAGGAGCAGUCUGGAGGGCAGUAUGAGCUUUUUGCUGUGAUUGCGCACGUGGGAAUGGCAGACUCCGGUCAUUACUGUGUCUACAUCCGGAAUGCUGUGGAUGGAAAAUGGUUCUGCUUCAAUGACUCCAAUAUUUGCUUGGUGUCCUGGGAAGACAUCCAGUGUACCUACGGAAAUCCUAACUACCACUGGCAGGAAACUGCAUAUCUUCUGGUUUACAUGAAGAUGGAGUGC [서열번호 163]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 163에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 161의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 164로 제공된다:
ATGAGCAAGGCCTTCGGCCTGCTGAGACAGATCTGCCAGTCTATCCTGGCCGAGAGCAGCCAGTCTCCTGCCGATCTGGAAGAGAAGAAAGAAGAGGACTCCAACATGAAGCGCGAGCAGCCCAGAGAAAGACCCAGAGCCTGGGATTATCCTCACGGCCTCGTGGGCCTGCACAATATCGGCCAGACCTGCTGCCTGAACAGCCTGATCCAGGTGTTCGTGATGAACGTGGACTTCACCCGGATCCTGAAGCGGATCACAGTGCCTAGAGGCGCCGACGAGCAGAGAAGATCCGTGCCTTTTCAGATGCTGCTGCTCCTGGAAAAGATGCAGGACAGCCGGCAGAAGGCCGTCAGACCTCTGGAACTGGCCTACTGCCTGCAGAAATGCAACGTGCCCCTGTTCGTGCAGCACGATGCCGCTCAGCTGTACCTGAAGCTGTGGAACCTGATCAAGGACCAGATCACCGACGTGCACCTGGTGGAAAGACTGCAGGCCCTGTACACCATCAGAGTGAAGGACTCCCTGATCTGCGTGGACTGCGCCATGGAAAGCAGCCGGAATAGCTCCATGCTGACCCTGCCTCTGAGCCTGTTCGACGTGGACAGCAAGCCCCTGAAAACCCTGGAAGATGCCCTGCACTGCTTCTTCCAGCCTAGAGAGCTGAGCAGCAAGAGCAAGTGCTTCTGCGAGAACTGCGGCAAGAAAACCCGGGGCAAACAGGTGCTGAAGCTGACCCATCTGCCTCAGACACTGACCATCCACCTGATGCGGTTCAGCATCCGGAACAGCCAGACCAGAAAGATCTGTCACTCCCTGTACTTCCCTCAGTCTCTGGACTTCAGCCAGATTCTGCCCATGAAGAGAGAGAGCTGCGACGCCGAAGAACAGTCTGGCGGACAGTACGAGCTGTTCGCCGTGATTGCCCACGTTGGCATGGCCGATAGCGGCCACTACTGCGTGTACATCAGAAACGCCGTGGACGGCAAGTGGTTCTGTTTCAACGACAGCAATATCTGCCTGGTGTCCTGGGAAGATATCCAGTGCACCTACGGCAACCCCAACTACCACTGGCAAGAGACAGCCTACCTGCTGGTGTACATGAAGATGGAATGCTGA [서열번호 164]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 164에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 164의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 165로 제공된다:
AUGAGCAAGGCCUUCGGCCUGCUGAGACAGAUCUGCCAGUCUAUCCUGGCCGAGAGCAGCCAGUCUCCUGCCGAUCUGGAAGAGAAGAAAGAAGAGGACUCCAACAUGAAGCGCGAGCAGCCCAGAGAAAGACCCAGAGCCUGGGAUUAUCCUCACGGCCUCGUGGGCCUGCACAAUAUCGGCCAGACCUGCUGCCUGAACAGCCUGAUCCAGGUGUUCGUGAUGAACGUGGACUUCACCCGGAUCCUGAAGCGGAUCACAGUGCCUAGAGGCGCCGACGAGCAGAGAAGAUCCGUGCCUUUUCAGAUGCUGCUGCUCCUGGAAAAGAUGCAGGACAGCCGGCAGAAGGCCGUCAGACCUCUGGAACUGGCCUACUGCCUGCAGAAAUGCAACGUGCCCCUGUUCGUGCAGCACGAUGCCGCUCAGCUGUACCUGAAGCUGUGGAACCUGAUCAAGGACCAGAUCACCGACGUGCACCUGGUGGAAAGACUGCAGGCCCUGUACACCAUCAGAGUGAAGGACUCCCUGAUCUGCGUGGACUGCGCCAUGGAAAGCAGCCGGAAUAGCUCCAUGCUGACCCUGCCUCUGAGCCUGUUCGACGUGGACAGCAAGCCCCUGAAAACCCUGGAAGAUGCCCUGCACUGCUUCUUCCAGCCUAGAGAGCUGAGCAGCAAGAGCAAGUGCUUCUGCGAGAACUGCGGCAAGAAAACCCGGGGCAAACAGGUGCUGAAGCUGACCCAUCUGCCUCAGACACUGACCAUCCACCUGAUGCGGUUCAGCAUCCGGAACAGCCAGACCAGAAAGAUCUGUCACUCCCUGUACUUCCCUCAGUCUCUGGACUUCAGCCAGAUUCUGCCCAUGAAGAGAGAGAGCUGCGACGCCGAAGAACAGUCUGGCGGACAGUACGAGCUGUUCGCCGUGAUUGCCCACGUUGGCAUGGCCGAUAGCGGCCACUACUGCGUGUACAUCAGAAACGCCGUGGACGGCAAGUGGUUCUGUUUCAACGACAGCAAUAUCUGCCUGGUGUCCUGGGAAGAUAUCCAGUGCACCUACGGCAACCCCAACUACCACUGGCAAGAGACAGCCUACCUGCUGGUGUACAUGAAGAUGGAAUGCUGA [서열번호 165]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 165에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 SOCS1 폴리펩티드(NCBI 참조 서열: NM_003745.2; UniProtKB - O15524(SOCS1_HUMAN)), 그의 절단된 버전 또는 동원체일 수 있다. [Shao RX, Zhang L, Hong Z, Goto K, Cheng D, Chen WC, Jilg N, Kumthip K, Fusco DN, Peng LF, Chung RT. SOCS1 abrogates IFN's antiviral effect on hepatitis C virus replication. Antiviral Research, 2012, 97(2):101-107].
SOCS1의 한 구현예는 본 명세서에서 다음과 같은 서열번호 151로 나타낸다:
MVAHNQVAADNAVSTAAEPRRRPEPSSSSSSSPAAPARPRPCPAVPAPAPGDTHFRTFRSHADYRRITRASALLDACGFYWGPLSVHGAHERLRAEPVGTFLVRDSRQRNCFFALSVKMASGPTSIRVHFQAGRFHLDGSRESFDCLFELLEHYVAAPRRMLGAPLRQRRVRPLQELCRQRIVATVGRENLARIPLNPVLRDYLSSFPFQI [서열번호 151]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 151에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 SOCS1 폴리펩티드는 다음과 같은 서열번호 152의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGTAGCACACAACCAGGTGGCAGCCGACAATGCAGTCTCCACAGCAGCAGAGCCCCGACGGCGGCCAGAACCTTCCTCCTCTTCCTCCTCCTCGCCCGCGGCCCCCGCGCGCCCGCGGCCGTGCCCCGCGGTCCCGGCCCCGGCCCCCGGCGACACGCACTTCCGCACATTCCGTTCGCACGCCGATTACCGGCGCATCACGCGCGCCAGCGCGCTCCTGGACGCCTGCGGATTCTACTGGGGGCCCCTGAGCGTGCACGGGGCGCACGAGCGGCTGCGCGCCGAGCCCGTGGGCACCTTCCTGGTGCGCGACAGCCGCCAGCGGAACTGCTTTTTCGCCCTTAGCGTGAAGATGGCCTCGGGACCCACGAGCATCCGCGTGCACTTTCAGGCCGGCCGCTTTCACCTGGATGGCAGCCGCGAGAGCTTCGACTGCCTCTTCGAGCTGCTGGAGCACTACGTGGCGGCGCCGCGCCGCATGCTGGGGGCCCCGCTGCGCCAGCGCCGCGTGCGGCCGCTGCAGGAGCTGTGCCGCCAGCGCATCGTGGCCACCGTGGGCCGCGAGAACCTGGCTCGCATCCCCCTCAACCCCGTCCTCCGCGACTACCTGAGCTCCTTCCCCTTCCAGATT [서열번호 152]
따라서, 바람직하게는 상기 SOCS1 폴리펩티드는 실질적으로 서열번호 152에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 153의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGUAGCACACAACCAGGUGGCAGCCGACAAUGCAGUCUCCACAGCAGCAGAGCCCCGACGGCGGCCAGAACCUUCCUCCUCUUCCUCCUCCUCGCCCGCGGCCCCCGCGCGCCCGCGGCCGUGCCCCGCGGUCCCGGCCCCGGCCCCCGGCGACACGCACUUCCGCACAUUCCGUUCGCACGCCGAUUACCGGCGCAUCACGCGCGCCAGCGCGCUCCUGGACGCCUGCGGAUUCUACUGGGGGCCCCUGAGCGUGCACGGGGCGCACGAGCGGCUGCGCGCCGAGCCCGUGGGCACCUUCCUGGUGCGCGACAGCCGCCAGCGGAACUGCUUUUUCGCCCUUAGCGUGAAGAUGGCCUCGGGACCCACGAGCAUCCGCGUGCACUUUCAGGCCGGCCGCUUUCACCUGGAUGGCAGCCGCGAGAGCUUCGACUGCCUCUUCGAGCUGCUGGAGCACUACGUGGCGGCGCCGCGCCGCAUGCUGGGGGCCCCGCUGCGCCAGCGCCGCGUGCGGCCGCUGCAGGAGCUGUGCCGCCAGCGCAUCGUGGCCACCGUGGGCCGCGAGAACCUGGCUCGCAUCCCCCUCAACCCCGUCCUCCGCGACUACCUGAGCUCCUUCCCCUUCCAGAUU [서열번호 153]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 153에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 151의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 154로 제공된다:
ATGGTGGCCCATAATCAGGTGGCCGCCGATAACGCCGTGTCTACAGCTGCCGAACCTAGAAGAAGGCCCGAGCCTAGCAGCAGCAGCTCTAGTTCTCCTGCCGCTCCTGCCAGACCTAGACCTTGTCCTGCTGTTCCTGCTCCAGCTCCTGGCGACACCCACTTCAGAACCTTTAGAAGCCACGCCGACTACCGGCGGATCACAAGAGCATCTGCTCTGCTGGATGCCTGCGGCTTTTATTGGGGCCCTCTGTCTGTGCACGGCGCCCACGAAAGACTGAGAGCTGAACCTGTGGGCACCTTCCTCGTGCGGGATAGCAGACAGCGGAACTGCTTCTTTGCCCTGAGCGTGAAGATGGCCAGCGGACCCACATCCATCAGAGTGCACTTTCAGGCCGGCAGATTCCACCTGGATGGCAGCAGAGAGAGCTTCGACTGCCTGTTCGAGCTGCTGGAACACTACGTGGCCGCTCCTAGAAGGATGCTGGGAGCACCCCTGAGACAGAGAAGAGTGCGGCCTCTGCAAGAGCTGTGCCGGCAGAGAATCGTGGCCACAGTGGGCAGAGAGAACCTGGCCAGAATTCCTCTGAACCCCGTGCTGAGAGACTACCTGAGCAGCTTCCCCTTCCAAATCTGA [서열번호 154]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 154에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 154의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 155로 제공된다:
AUGGUGGCCCAUAAUCAGGUGGCCGCCGAUAACGCCGUGUCUACAGCUGCCGAACCUAGAAGAAGGCCCGAGCCUAGCAGCAGCAGCUCUAGUUCUCCUGCCGCUCCUGCCAGACCUAGACCUUGUCCUGCUGUUCCUGCUCCAGCUCCUGGCGACACCCACUUCAGAACCUUUAGAAGCCACGCCGACUACCGGCGGAUCACAAGAGCAUCUGCUCUGCUGGAUGCCUGCGGCUUUUAUUGGGGCCCUCUGUCUGUGCACGGCGCCCACGAAAGACUGAGAGCUGAACCUGUGGGCACCUUCCUCGUGCGGGAUAGCAGACAGCGGAACUGCUUCUUUGCCCUGAGCGUGAAGAUGGCCAGCGGACCCACAUCCAUCAGAGUGCACUUUCAGGCCGGCAGAUUCCACCUGGAUGGCAGCAGAGAGAGCUUCGACUGCCUGUUCGAGCUGCUGGAACACUACGUGGCCGCUCCUAGAAGGAUGCUGGGAGCACCCCUGAGACAGAGAAGAGUGCGGCCUCUGCAAGAGCUGUGCCGGCAGAGAAUCGUGGCCACAGUGGGCAGAGAGAACCUGGCCAGAAUUCCUCUGAACCCCGUGCUGAGAGACUACCUGAGCAGCUUCCCCUUCCAAAUCUGA [서열번호 155]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 155에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 SOCS3 폴리펩티드(NCBI 참조 서열: NM_003955.5; UniProtKB - O14543(SOCS3_HUMAN)), 그의 절단된 버전 또는 동원체일 수 있다. [Akhtar LN, Qin H, Muldowney MT, Yanagisawa LL, Kutsch O, Clements JE, Benveniste EN. Suppressor of cytokine signaling 3 inhibits antiviral IFN-beta signaling to enhance HIV-1 replication in macrophages. J Immunol 2010; 185(4):2393-404]. 상기 SOCS3 폴리펩티드의 한 구현예는 본 명세서에서 다음과 같은 서열번호 156으로 나타낸다:
MVTHSKFPAAGMSRPLDTSLRLKTFSSKSEYQLVVNAVRKLQESGFYWSAVTGGEANLLLSAEPAGTFLIRDSSDQRHFFTLSVKTQSGTKNLRIQCEGGSFSLQSDPRSTQPVPRFDCVLKLVHHYMPPPGAPSFPSPPTEPSSEVPEQPSAQPLPGSPPRRAYYIYSGGEKIPLVLSRPLSSNVATLQHLCRKTVNGHLDSYEKVTQLPGPIREFLDQYDAPL [서열번호 156]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 156에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 SOCS3 폴리펩티드는 다음과 같은 서열번호 157의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGTCACCCACAGCAAGTTTCCCGCCGCCGGGATGAGCCGCCCCCTGGACACCAGCCTGCGCCTCAAGACCTTCAGCTCCAAGAGCGAGTACCAGCTGGTGGTGAACGCAGTGCGCAAGCTGCAGGAGAGCGGCTTCTACTGGAGCGCAGTGACCGGCGGCGAGGCGAACCTGCTGCTCAGTGCCGAGCCCGCCGGCACCTTTCTGATCCGCGACAGCTCGGACCAGCGCCACTTCTTCACGCTCAGCGTCAAGACCCAGTCTGGGACCAAGAACCTGCGCATCCAGTGTGAGGGGGGCAGCTTCTCTCTGCAGAGCGATCCCCGGAGCACGCAGCCCGTGCCCCGCTTCGACTGCGTGCTCAAGCTGGTGCACCACTACATGCCGCCCCCTGGAGCCCCCTCCTTCCCCTCGCCACCTACTGAACCCTCCTCCGAGGTGCCCGAGCAGCCGTCTGCCCAGCCACTCCCTGGGAGTCCCCCCAGAAGAGCCTATTACATCTACTCCGGGGGCGAGAAGATCCCCCTGGTGTTGAGCCGGCCCCTCTCCTCCAACGTGGCCACTCTTCAGCATCTCTGTCGGAAGACCGTCAACGGCCACCTGGACTCCTATGAGAAAGTCACCCAGCTGCCGGGGCCCATTCGGGAGTTCCTGGACCAGTACGATGCCCCGCTT [서열번호 157]
따라서, 바람직하게는 상기 SOCS3 폴리펩티드는 실질적으로 서열번호 157에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 158의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGUCACCCACAGCAAGUUUCCCGCCGCCGGGAUGAGCCGCCCCCUGGACACCAGCCUGCGCCUCAAGACCUUCAGCUCCAAGAGCGAGUACCAGCUGGUGGUGAACGCAGUGCGCAAGCUGCAGGAGAGCGGCUUCUACUGGAGCGCAGUGACCGGCGGCGAGGCGAACCUGCUGCUCAGUGCCGAGCCCGCCGGCACCUUUCUGAUCCGCGACAGCUCGGACCAGCGCCACUUCUUCACGCUCAGCGUCAAGACCCAGUCUGGGACCAAGAACCUGCGCAUCCAGUGUGAGGGGGGCAGCUUCUCUCUGCAGAGCGAUCCCCGGAGCACGCAGCCCGUGCCCCGCUUCGACUGCGUGCUCAAGCUGGUGCACCACUACAUGCCGCCCCCUGGAGCCCCCUCCUUCCCCUCGCCACCUACUGAACCCUCCUCCGAGGUGCCCGAGCAGCCGUCUGCCCAGCCACUCCCUGGGAGUCCCCCCAGAAGAGCCUAUUACAUCUACUCCGGGGGCGAGAAGAUCCCCCUGGUGUUGAGCCGGCCCCUCUCCUCCAACGUGGCCACUCUUCAGCAUCUCUGUCGGAAGACCGUCAACGGCCACCUGGACUCCUAUGAGAAAGUCACCCAGCUGCCGGGGCCCAUUCGGGAGUUCCUGGACCAGUACGAUGCCCCGCUU [서열번호 158]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 158에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 156의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 159로 제공된다:
ATGGTCACCCACAGCAAGTTTCCAGCCGCCGGAATGAGCAGACCCCTGGATACAAGCCTGCGGCTGAAAACCTTCAGCAGCAAGAGCGAGTATCAGCTGGTGGTCAACGCCGTGCGGAAGCTGCAAGAGAGCGGCTTTTATTGGAGCGCCGTGACAGGCGGAGAGGCCAATCTTCTGCTGTCTGCCGAACCTGCCGGCACCTTCCTGATCAGAGATAGCAGCGACCAGCGGCACTTCTTCACCCTGAGCGTGAAAACCCAGAGCGGCACCAAGAACCTGCGGATCCAATGTGAAGGCGGCAGCTTCAGCCTGCAGAGCGACCCTAGATCTACCCAGCCTGTGCCTAGATTCGACTGCGTGCTGAAGCTCGTGCACCACTACATGCCTCCACCTGGCGCTCCTAGCTTCCCATCTCCTCCAACAGAGCCTAGCAGCGAGGTGCCAGAACAGCCTTCTGCTCAACCTCTGCCTGGCAGCCCTCCTAGAAGGGCCTACTACATCTATTCTGGCGGCGAGAAGATCCCTCTGGTGCTGTCTAGACCCCTGAGCAGCAATGTGGCCACTCTGCAGCACCTGTGCAGAAAGACCGTGAACGGCCACCTGGACAGCTACGAGAAAGTGACCCAACTGCCTGGACCTATCAGAGAGTTCCTGGACCAGTACGACGCCCCTCTTTGA [서열번호 159]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 159에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 159의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 160으로 제공된다:
AUGGUCACCCACAGCAAGUUUCCAGCCGCCGGAAUGAGCAGACCCCUGGAUACAAGCCUGCGGCUGAAAACCUUCAGCAGCAAGAGCGAGUAUCAGCUGGUGGUCAACGCCGUGCGGAAGCUGCAAGAGAGCGGCUUUUAUUGGAGCGCCGUGACAGGCGGAGAGGCCAAUCUUCUGCUGUCUGCCGAACCUGCCGGCACCUUCCUGAUCAGAGAUAGCAGCGACCAGCGGCACUUCUUCACCCUGAGCGUGAAAACCCAGAGCGGCACCAAGAACCUGCGGAUCCAAUGUGAAGGCGGCAGCUUCAGCCUGCAGAGCGACCCUAGAUCUACCCAGCCUGUGCCUAGAUUCGACUGCGUGCUGAAGCUCGUGCACCACUACAUGCCUCCACCUGGCGCUCCUAGCUUCCCAUCUCCUCCAACAGAGCCUAGCAGCGAGGUGCCAGAACAGCCUUCUGCUCAACCUCUGCCUGGCAGCCCUCCUAGAAGGGCCUACUACAUCUAUUCUGGCGGCGAGAAGAUCCCUCUGGUGCUGUCUAGACCCCUGAGCAGCAAUGUGGCCACUCUGCAGCACCUGUGCAGAAAGACCGUGAACGGCCACCUGGACAGCUACGAGAAAGUGACCCAACUGCCUGGACCUAUCAGAGAGUUCCUGGACCAGUACGACGCCCCUCUUUGA [서열번호 160]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 160에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
카테고리 4: RNA 인식 시스템의 억제제
또 다른 구현예에서, 상기 IMP는 RNA 인식 시스템을 억제하도록 구성될 수 있다.
그러므로, RNA에 대한 선천적 면역 반응의 감소, 제거, 또는 차단은 바람직하게는 본 발명의 RNA 구조체를 갖고 있는 숙주 세포에 의한 RNA(바람직하게는, 긴 RNA 분자)의 인식을 감소 또는 차단하는 IMP에 의해 달성된다. 긴 RNA는 ssRNA 또는 dsRNA일 수 있고 길이가 적어도 1 kb인 RNA를 의미하는 것으로 숙련된 기술자에 의해 이해될 수 있다. 따라서, 바람직하게는 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질은 RNA 인식의 돌연변이된 또는 비-기능적 억제제, 또는 그의 우성 음성 형태를 포함한다.
구현예에서, RNA 인식의 억제제는 TRBP dsRNA이다. TRBP는 RISC-로딩 복합체 서브유닛 TARBP2이고, PKR(NCBI 참조 서열: NM_134323.2; UniProtKB - Q15633(TRBP2_HUMAN)), 또는 그의 동원체를 억제한다. [Heyam A, Lagos D, Plevin M. Dissecting the roles of TRBP and PACT in double-stranded RNA recognition and processing of noncoding RNAs. Wiley Interdiscip Rev RNA. 2015 May-Jun;6(3):271-89. doi: 10.1002/wrna.1272]. 상기 TRBP dsRNA 우성 음성 형태(TARBP2(1-234))의 한 구현예는 본 명세서에서 다음과 같은 서열번호 111로 나타낸다:
MSEEEQGSGTTTGCGLPSIEQMLAANPGKTPISLLQEYGTRIGKTPVYDLLKAEGQAHQPNFTFRVTVGDTSCTGQGPSKKAAKHKAAEVALKHLKGGSMLEPALEDSSSFSPLDSSLPEDIPVFTAAAAATPVPSVVLTRSPPMELQPPVSPQQSECNPVGALQELVVQKGWRLPEYTVTQESGPAHRKEFTMTCRVERFIEIGSGTSKKLAKRNAAAKMLLRVHTVPLDARD [서열번호 111]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 111에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 TRBP dsRNA 우성 음성 형태 폴리펩티드는 다음과 같은 서열번호 112의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGAGTGAAGAGGAGCAAGGCTCCGGCACTACCACGGGCTGCGGGCTGCCTAGTATAGAGCAAATGCTGGCCGCCAACCCAGGCAAGACCCCGATCAGCCTTCTGCAGGAGTATGGGACCAGAATAGGGAAGACGCCTGTGTACGACCTTCTCAAAGCCGAGGGCCAAGCCCACCAGCCTAATTTCACCTTCCGGGTCACCGTTGGCGACACCAGCTGCACTGGTCAGGGCCCCAGCAAGAAGGCAGCCAAGCACAAGGCAGCTGAGGTGGCCCTCAAACACCTCAAAGGGGGGAGCATGCTGGAGCCGGCCCTGGAGGACAGCAGTTCTTTTTCTCCCCTAGACTCTTCACTGCCTGAGGACATTCCGGTTTTTACTGCTGCAGCAGCTGCTACCCCAGTTCCATCTGTAGTCCTAACCAGGAGCCCCCCCATGGAACTGCAGCCCCCTGTCTCCCCTCAGCAGTCTGAGTGCAACCCCGTTGGTGCTCTGCAGGAGCTGGTGGTGCAGAAAGGCTGGCGGTTGCCGGAGTACACAGTGACCCAGGAGTCTGGGCCAGCCCACCGCAAAGAATTCACCATGACCTGTCGAGTGGAGCGTTTCATTGAGATTGGGAGTGGCACTTCCAAAAAATTGGCAAAGCGGAATGCGGCGGCCAAAATGCTGCTTCGAGTGCACACGGTGCCTCTGGATGCCCGGGAT [서열번호 112]
따라서, 바람직하게는 상기 TRBP dsRNA 우성 음성 형태 폴리펩티드는 실질적으로 서열번호 112에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 113의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGAGUGAAGAGGAGCAAGGCUCCGGCACUACCACGGGCUGCGGGCUGCCUAGUAUAGAGCAAAUGCUGGCCGCCAACCCAGGCAAGACCCCGAUCAGCCUUCUGCAGGAGUAUGGGACCAGAAUAGGGAAGACGCCUGUGUACGACCUUCUCAAAGCCGAGGGCCAAGCCCACCAGCCUAAUUUCACCUUCCGGGUCACCGUUGGCGACACCAGCUGCACUGGUCAGGGCCCCAGCAAGAAGGCAGCCAAGCACAAGGCAGCUGAGGUGGCCCUCAAACACCUCAAAGGGGGGAGCAUGCUGGAGCCGGCCCUGGAGGACAGCAGUUCUUUUUCUCCCCUAGACUCUUCACUGCCUGAGGACAUUCCGGUUUUUACUGCUGCAGCAGCUGCUACCCCAGUUCCAUCUGUAGUCCUAACCAGGAGCCCCCCCAUGGAACUGCAGCCCCCUGUCUCCCCUCAGCAGUCUGAGUGCAACCCCGUUGGUGCUCUGCAGGAGCUGGUGGUGCAGAAAGGCUGGCGGUUGCCGGAGUACACAGUGACCCAGGAGUCUGGGCCAGCCCACCGCAAAGAAUUCACCAUGACCUGUCGAGUGGAGCGUUUCAUUGAGAUUGGGAGUGGCACUUCCAAAAAAUUGGCAAAGCGGAAUGCGGCGGCCAAAAUGCUGCUUCGAGUGCACACGGUGCCUCUGGAUGCCCGGGAU [서열번호 113]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 113에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 111의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 114로 제공된다:
ATGAGCGAGGAAGAACAAGGCAGCGGCACCACCACAGGATGTGGCCTGCCTTCTATCGAGCAGATGCTGGCCGCCAATCCTGGCAAGACACCTATCAGCCTGCTGCAAGAGTACGGCACCCGGATCGGAAAGACCCCTGTGTACGATCTGCTGAAGGCCGAAGGCCAGGCTCACCAGCCTAACTTCACCTTCAGAGTGACCGTGGGCGACACCAGCTGTACAGGACAGGGCCCTTCTAAGAAGGCCGCCAAACACAAAGCCGCCGAGGTGGCCCTGAAACACCTGAAAGGCGGCTCCATGCTGGAACCCGCTCTGGAAGATAGCAGCAGCTTCAGCCCTCTGGACAGCAGCCTGCCTGAGGACATCCCTGTGTTTACAGCCGCTGCCGCTGCTACACCTGTGCCATCTGTGGTGCTGACCAGATCTCCTCCAATGGAACTGCAGCCTCCTGTGTCTCCTCAGCAGAGCGAGTGTAATCCTGTGGGCGCCCTGCAAGAACTGGTGGTGCAAAAAGGATGGCGGCTGCCCGAGTACACCGTGACACAAGAATCTGGCCCCGCTCACCGGAAAGAATTCACCATGACCTGCAGAGTGGAACGGTTCATCGAGATCGGCTCCGGCACCTCTAAGAAGCTGGCCAAGAGAAACGCCGCTGCCAAGATGCTGCTGCGGGTGCACACAGTTCCTCTGGACGCCAGAGATTGA [서열번호 114]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 114에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 114의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 115로 제공된다:
AUGAGCGAGGAAGAACAAGGCAGCGGCACCACCACAGGAUGUGGCCUGCCUUCUAUCGAGCAGAUGCUGGCCGCCAAUCCUGGCAAGACACCUAUCAGCCUGCUGCAAGAGUACGGCACCCGGAUCGGAAAGACCCCUGUGUACGAUCUGCUGAAGGCCGAAGGCCAGGCUCACCAGCCUAACUUCACCUUCAGAGUGACCGUGGGCGACACCAGCUGUACAGGACAGGGCCCUUCUAAGAAGGCCGCCAAACACAAAGCCGCCGAGGUGGCCCUGAAACACCUGAAAGGCGGCUCCAUGCUGGAACCCGCUCUGGAAGAUAGCAGCAGCUUCAGCCCUCUGGACAGCAGCCUGCCUGAGGACAUCCCUGUGUUUACAGCCGCUGCCGCUGCUACACCUGUGCCAUCUGUGGUGCUGACCAGAUCUCCUCCAAUGGAACUGCAGCCUCCUGUGUCUCCUCAGCAGAGCGAGUGUAAUCCUGUGGGCGCCCUGCAAGAACUGGUGGUGCAAAAAGGAUGGCGGCUGCCCGAGUACACCGUGACACAAGAAUCUGGCCCCGCUCACCGGAAAGAAUUCACCAUGACCUGCAGAGUGGAACGGUUCAUCGAGAUCGGCUCCGGCACCUCUAAGAAGCUGGCCAAGAGAAACGCCGCUGCCAAGAUGCUGCUGCGGGUGCACACAGUUCCUCUGGACGCCAGAGAUUGA [서열번호 115]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 115에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 징크 핑거(Zinc finger) 항-바이러스 단백질(Zinc AVP), 즉 우성 음성 억제제(NCBI 참조 서열: NM_020119.4; UniProtKB - Q7Z2W4(ZCCHV_HUMAN)), 또는 그의 동원체이다. [Karki S, et al. Multiple interferon stimulated genes synergize with the zinc finger antiviral protein to mediate anti-alphavirus activity. PLoS One. 2012;7(5):e37398. doi: 10.1371/journal.pone.0037398, and Meagher JL, et al. Structure of the zinc-finger antiviral protein in complex with RNA reveals a mechanism for selective targeting of CG-rich viral sequences. Proc Natl Acad Sci U S A. 2019 Nov 26;116(48):24303-24309. doi: 10.1073/pnas.1913232116]. 상기 징크 핑거 항-바이러스 단백질 우성 음성 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 116으로 나타내는 Zinc AVP(1-200)이다:
MADPEVCCFITKILCAHGGRMALDALLQEIALSEPQLCEVLQVAGPDRFVVLETGGEAGITRSVVATTRARVCRRKYCQRPCDNLHLCKLNLLGRCNYSQSERNLCKYSHEVLSEENFKVLKNHELSGLNKEELAVLLLQSDPFFMPEICKSYKGEGRQQICNQQPPCSRLHICDHFTRGNCRFPNCLRSHNLMDRKVLA [서열번호 116]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 116에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 징크 핑거 항-바이러스 단백질 우성 음성 형태 폴리펩티드는 다음과 같은 서열번호 117의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCGGACCCGGAGGTGTGCTGCTTCATCACCAAAATCCTGTGCGCCCACGGGGGCCGCATGGCCCTGGACGCGCTGCTCCAGGAGATCGCGCTGTCTGAGCCGCAGCTCTGTGAGGTGCTGCAGGTGGCCGGGCCCGACCGCTTTGTGGTGTTGGAGACCGGCGGCGAGGCCGGGATCACCCGATCGGTGGTGGCCACCACTCGAGCCCGGGTCTGCCGTCGCAAGTACTGCCAGAGACCCTGCGATAACCTGCATCTCTGCAAACTCAACTTGCTGGGCCGGTGCAACTATTCGCAGTCCGAGCGGAATTTATGCAAATATTCTCATGAGGTTCTCTCAGAAGAGAACTTCAAAGTCCTGAAAAATCACGAACTCTCTGGACTGAACAAAGAGGAATTAGCAGTGCTCCTCCTCCAAAGTGATCCTTTTTTTATGCCCGAGATATGCAAAAGTTATAAGGGAGAGGGTCGGCAGCAGATTTGTAACCAGCAGCCACCGTGTTCAAGACTCCACATCTGTGACCACTTCACCCGAGGGAACTGTCGTTTTCCCAACTGCCTCCGGTCCCATAACCTGATGGACAGAAAGGTGCTGGCC [서열번호 117]
따라서, 바람직하게는 상기 징크 핑거 항-바이러스 단백질 우성 음성 형태 폴리펩티드는 실질적으로 서열번호 117에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 118의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCGGACCCGGAGGUGUGCUGCUUCAUCACCAAAAUCCUGUGCGCCCACGGGGGCCGCAUGGCCCUGGACGCGCUGCUCCAGGAGAUCGCGCUGUCUGAGCCGCAGCUCUGUGAGGUGCUGCAGGUGGCCGGGCCCGACCGCUUUGUGGUGUUGGAGACCGGCGGCGAGGCCGGGAUCACCCGAUCGGUGGUGGCCACCACUCGAGCCCGGGUCUGCCGUCGCAAGUACUGCCAGAGACCCUGCGAUAACCUGCAUCUCUGCAAACUCAACUUGCUGGGCCGGUGCAACUAUUCGCAGUCCGAGCGGAAUUUAUGCAAAUAUUCUCAUGAGGUUCUCUCAGAAGAGAACUUCAAAGUCCUGAAAAAUCACGAACUCUCUGGACUGAACAAAGAGGAAUUAGCAGUGCUCCUCCUCCAAAGUGAUCCUUUUUUUAUGCCCGAGAUAUGCAAAAGUUAUAAGGGAGAGGGUCGGCAGCAGAUUUGUAACCAGCAGCCACCGUGUUCAAGACUCCACAUCUGUGACCACUUCACCCGAGGGAACUGUCGUUUUCCCAACUGCCUCCGGUCCCAUAACCUGAUGGACAGAAAGGUGCUGGCC [서열번호 118]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 118에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 116의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 119로 제공된다:
ATGGCCGATCCTGAAGTGTGCTGCTTCATCACCAAGATCCTGTGCGCCCACGGCGGAAGAATGGCTCTGGATGCTCTGCTGCAAGAGATCGCCCTGTCTGAGCCTCAGCTGTGCGAAGTGCTGCAAGTGGCCGGACCTGACAGATTCGTGGTGCTGGAAACAGGCGGAGAGGCCGGCATTACCAGATCCGTGGTGGCTACCACAAGAGCCAGAGTGTGCCGGCGGAAGTACTGCCAGAGGCCTTGCGATAATCTGCACCTGTGCAAGCTGAACCTGCTGGGCAGATGCAACTACAGCCAGAGCGAGCGGAATCTGTGCAAGTACTCCCACGAGGTGCTGAGCGAAGAGAACTTCAAGGTGCTGAAGAACCACGAGCTGAGCGGCCTGAACAAAGAGGAACTGGCCGTTCTGCTGCTGCAGAGCGACCCATTCTTCATGCCCGAGATCTGCAAGAGCTACAAAGGCGAGGGCAGACAGCAGATCTGTAACCAGCAGCCTCCATGCAGCAGACTGCACATCTGCGACCACTTCACCCGGGGCAACTGCAGATTCCCCAACTGCCTGAGAAGCCACAACCTGATGGACCGGAAGGTGCTGGCTTGA [서열번호 119]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 119에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 119의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 120으로 제공된다:
AUGGCCGAUCCUGAAGUGUGCUGCUUCAUCACCAAGAUCCUGUGCGCCCACGGCGGAAGAAUGGCUCUGGAUGCUCUGCUGCAAGAGAUCGCCCUGUCUGAGCCUCAGCUGUGCGAAGUGCUGCAAGUGGCCGGACCUGACAGAUUCGUGGUGCUGGAAACAGGCGGAGAGGCCGGCAUUACCAGAUCCGUGGUGGCUACCACAAGAGCCAGAGUGUGCCGGCGGAAGUACUGCCAGAGGCCUUGCGAUAAUCUGCACCUGUGCAAGCUGAACCUGCUGGGCAGAUGCAACUACAGCCAGAGCGAGCGGAAUCUGUGCAAGUACUCCCACGAGGUGCUGAGCGAAGAGAACUUCAAGGUGCUGAAGAACCACGAGCUGAGCGGCCUGAACAAAGAGGAACUGGCCGUUCUGCUGCUGCAGAGCGACCCAUUCUUCAUGCCCGAGAUCUGCAAGAGCUACAAAGGCGAGGGCAGACAGCAGAUCUGUAACCAGCAGCCUCCAUGCAGCAGACUGCACAUCUGCGACCACUUCACCCGGGGCAACUGCAGAUUCCCCAACUGCCUGAGAAGCCACAACCUGAUGGACCGGAAGGUGCUGGCUUGA [서열번호 120]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 120에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
다른 구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 PKR 활성화를 차단하고 또한 NF-카파 B 활성화에 대한 차단자로 작용하는 PKR dsRNA 결합 도메인(NCBI 참조 서열: NM_002759.4; UniProtKB - P19525(E2AK2_HUMAN)), 또는 그의 동원체이다. [Bou-Nader C, et al. The search for a PKR code-differential regulation of protein kinase R activity by diverse RNA and protein regulators. RNA. 2019 May; 25(5):539-556. doi:10.1261/rna.070169.118]. 상기 PKR dsRNA 결합 도메인(PKR dsRNA DB(1-170))의 한 구현예는 본 명세서에서 다음과 같은 서열번호 121로 나타낸다:
MAGDLSAGFFMEELNTYRQKQGVVLKYQELPNSGPPHDRRFTFQVIIDGREFPEGEGRSKKEAKNAAAKLAVEILNKEKKAVSPLLLTTTNSSEGLSMGNYIGLINRIAQKKRLTVNYEQCASGVHGPEGFHYKCKMGQKEYSIGTGSTKQEAKQLAAKLAYLQILSEET [서열번호 121]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 121에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 PKR dsRNA 결합 도메인 폴리펩티드는 다음과 같은 서열번호 122의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGCTGGTGATCTTTCAGCAGGTTTCTTCATGGAGGAACTTAATACATACCGTCAGAAGCAGGGAGTAGTACTTAAATATCAAGAACTGCCTAATTCAGGACCTCCACATGATAGGAGGTTTACATTTCAAGTTATAATAGATGGAAGAGAATTTCCAGAAGGTGAAGGTAGATCAAAGAAGGAAGCAAAAAATGCCGCAGCCAAATTAGCTGTTGAGATACTTAATAAGGAAAAGAAGGCAGTTAGTCCTTTATTATTGACAACAACGAATTCTTCAGAAGGATTATCCATGGGGAATTACATAGGCCTTATCAATAGAATTGCCCAGAAGAAAAGACTAACTGTAAATTATGAACAGTGTGCATCGGGGGTGCATGGGCCAGAAGGATTTCATTATAAATGCAAAATGGGACAGAAAGAATATAGTATTGGTACAGGTTCTACTAAACAGGAAGCAAAACAATTGGCCGCTAAACTTGCATATCTTCAGATATTATCAGAAGAAACC [서열번호 122]
따라서, 바람직하게는 상기 PKR dsRNA 결합 도메인 형태 폴리펩티드는 실질적으로 서열번호 122에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 123의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGCUGGUGAUCUUUCAGCAGGUUUCUUCAUGGAGGAACUUAAUACAUACCGUCAGAAGCAGGGAGUAGUACUUAAAUAUCAAGAACUGCCUAAUUCAGGACCUCCACAUGAUAGGAGGUUUACAUUUCAAGUUAUAAUAGAUGGAAGAGAAUUUCCAGAAGGUGAAGGUAGAUCAAAGAAGGAAGCAAAAAAUGCCGCAGCCAAAUUAGCUGUUGAGAUACUUAAUAAGGAAAAGAAGGCAGUUAGUCCUUUAUUAUUGACAACAACGAAUUCUUCAGAAGGAUUAUCCAUGGGGAAUUACAUAGGCCUUAUCAAUAGAAUUGCCCAGAAGAAAAGACUAACUGUAAAUUAUGAACAGUGUGCAUCGGGGGUGCAUGGGCCAGAAGGAUUUCAUUAUAAAUGCAAAAUGGGACAGAAAGAAUAUAGUAUUGGUACAGGUUCUACUAAACAGGAAGCAAAACAAUUGGCCGCUAAACUUGCAUAUCUUCAGAUAUUAUCAGAAGAAACC [서열번호 123]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 123에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 121의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 124로 제공된다:
ATGGCTGGCGATCTGAGCGCCGGCTTCTTCATGGAAGAACTGAACACCTACCGGCAGAAACAGGGCGTCGTGCTGAAGTACCAAGAGCTGCCTAATAGCGGCCCTCCTCACGACCGGCGGTTCACCTTTCAAGTGATCATCGACGGCAGAGAGTTCCCCGAAGGCGAGGGCAGATCTAAGAAAGAGGCCAAGAACGCCGCTGCCAAGCTGGCCGTGGAAATCCTGAACAAAGAGAAGAAGGCCGTTTCTCCCCTGCTGCTGACCACCACCAATAGCTCTGAGGGCCTGAGCATGGGCAACTACATCGGCCTGATCAACCGGATCGCCCAGAAAAAGCGGCTGACCGTGAACTACGAGCAGTGTGCCAGCGGAGTGCACGGCCCTGAGGGCTTTCACTACAAGTGCAAGATGGGCCAGAAAGAGTACAGCATCGGCACCGGCAGCACCAAGCAAGAAGCCAAACAGCTGGCCGCCAAACTGGCCTACCTGCAGATCCTGAGCGAGGAAACCTGA [서열번호 124]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 124에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 124의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 125로 제공된다:
AUGGCUGGCGAUCUGAGCGCCGGCUUCUUCAUGGAAGAACUGAACACCUACCGGCAGAAACAGGGCGUCGUGCUGAAGUACCAAGAGCUGCCUAAUAGCGGCCCUCCUCACGACCGGCGGUUCACCUUUCAAGUGAUCAUCGACGGCAGAGAGUUCCCCGAAGGCGAGGGCAGAUCUAAGAAAGAGGCCAAGAACGCCGCUGCCAAGCUGGCCGUGGAAAUCCUGAACAAAGAGAAGAAGGCCGUUUCUCCCCUGCUGCUGACCACCACCAAUAGCUCUGAGGGCCUGAGCAUGGGCAACUACAUCGGCCUGAUCAACCGGAUCGCCCAGAAAAAGCGGCUGACCGUGAACUACGAGCAGUGUGCCAGCGGAGUGCACGGCCCUGAGGGCUUUCACUACAAGUGCAAGAUGGGCCAGAAAGAGUACAGCAUCGGCACCGGCAGCACCAAGCAAGAAGCCAAACAGCUGGCCGCCAAACUGGCCUACCUGCAGAUCCUGAGCGAGGAAACCUGA [서열번호 125]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 125에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
구현예에서, RNA 인식의 억제제는 OAS 패밀리 멤버(family member)이다. 인간 게놈은 4가지 OAS 패밀리 멤버, 다시 말해 OAS1, OAS2, OAS3 및 OASL1을 갖고 있다. OAS1/OASL1, OAS2, 및 OAS3은 각각 1, 2, 및 3개의 OAS 유닛으로 구성되고, 긴 dsRNA에 결합한다. 따라서, 다른 구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 OAS1, OAS2, OAS3 또는 OASL1이다.
그러나, OAS3은 다른 것들과 대비하여 긴 dsRNA에 우세하게 결합하고, 따라서 바람직하다. 따라서, 구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 OAS3이고, 가장 바람직하게는 dsRNA 결합 도메인을 함유하는 OAS3 도메인 I(NCBI 참조 서열: NM_006187.4; UniProtKB - Q9Y6K5(OAS3_HUMAN)), 또는 그의 동원체이다. [Donovan J, Whitney G, Rath S, Korennykh A. Structural mechanism of sensing long dsRNA via a noncatalytic domain in human oligoadenylate synthetase 3. Proc Natl Acad Sci U S A. 2015 Mar 31;112(13):3949-54. doi: 10.1073/pnas.1419409112]. OAS3 도메인 I의 한 구현예는 UniProtKB - Q9Y6K5(1-343)으로 나타내며, 본 명세서에서 다음과 같은 서열번호 136으로 나타낸다:
MDLYSTPAAALDRFVARRLQPRKEFVEKARRALGALAAALRERGGRLGAAAPRVLKTVKGGSSGRGTALKGGCDSELVIFLDCFKSYVDQRARRAEILSEMRASLESWWQNPVPGLRLTFPEQSVPGALQFRLTSVDLEDWMDVSLVPAFNVLGQAGSGVKPKPQVYSTLLNSGCQGGEHAACFTELRRNFVNIRPAKLKNLILLVKHWYHQVCLQGLWKETLPPVYALELLTIFAWEQGCKKDAFSLAEGLRTVLGLIQQHQHLCVFWTVNYGFEDPAVGQFLQRQLKRPRPVILDPADPTWDLGNGAAWHWDLLAQEAASCYDHPCFLRGMGDPVQSWKGP [서열번호 136]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 136에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 OAS3 도메인 I 폴리펩티드는 다음과 같은 서열번호 137의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGACTTGTACAGCACCCCGGCCGCTGCGCTGGACAGGTTCGTGGCCAGAAGGCTGCAGCCGCGGAAGGAGTTCGTAGAGAAGGCGCGGCGCGCTCTGGGCGCCCTGGCCGCTGCCCTGAGGGAGCGCGGGGGCCGCCTCGGTGCTGCTGCCCCGCGGGTGCTGAAAACTGTCAAGGGAGGCTCCTCGGGCCGGGGCACAGCTCTCAAGGGTGGCTGTGATTCTGAACTTGTCATCTTCCTCGACTGCTTCAAGAGCTATGTGGACCAGAGGGCCCGCCGTGCAGAGATCCTCAGTGAGATGCGGGCATCGCTGGAATCCTGGTGGCAGAACCCAGTCCCTGGTCTGAGACTCACGTTTCCTGAGCAGAGCGTGCCTGGGGCCCTGCAGTTCCGCCTGACATCCGTAGATCTTGAGGACTGGATGGATGTTAGCCTGGTGCCTGCCTTCAATGTCCTGGGTCAGGCCGGCTCCGGCGTCAAACCCAAGCCACAAGTCTACTCTACCCTCCTCAACAGTGGCTGCCAAGGGGGCGAGCATGCGGCCTGCTTCACAGAGCTGCGGAGGAACTTTGTGAACATTCGCCCAGCCAAGTTGAAGAACCTAATCTTGCTGGTGAAGCACTGGTACCACCAGGTGTGCCTACAGGGGTTGTGGAAGGAGACGCTGCCCCCGGTCTATGCCCTGGAATTGCTGACCATCTTCGCCTGGGAGCAGGGCTGTAAGAAGGATGCTTTCAGCCTAGCCGAAGGCCTCCGAACTGTCCTGGGCCTGATCCAACAGCATCAGCACCTGTGTGTTTTCTGGACTGTCAACTATGGCTTCGAGGACCCTGCAGTTGGGCAGTTCTTGCAGCGGCAGCTTAAGAGACCCAGGCCTGTGATCCTGGACCCAGCTGACCCCACATGGGACCTGGGGAATGGGGCAGCCTGGCACTGGGATTTGCTAGCCCAGGAGGCAGCATCCTGCTATGACCACCCATGCTTTCTGAGGGGGATGGGGGACCCAGTGCAGTCTTGGAAGGGGCCG [서열번호 137]
따라서, 바람직하게는 상기 OAS3 도메인 I 폴리펩티드는 실질적으로 서열번호 137에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 138의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGACUUGUACAGCACCCCGGCCGCUGCGCUGGACAGGUUCGUGGCCAGAAGGCUGCAGCCGCGGAAGGAGUUCGUAGAGAAGGCGCGGCGCGCUCUGGGCGCCCUGGCCGCUGCCCUGAGGGAGCGCGGGGGCCGCCUCGGUGCUGCUGCCCCGCGGGUGCUGAAAACUGUCAAGGGAGGCUCCUCGGGCCGGGGCACAGCUCUCAAGGGUGGCUGUGAUUCUGAACUUGUCAUCUUCCUCGACUGCUUCAAGAGCUAUGUGGACCAGAGGGCCCGCCGUGCAGAGAUCCUCAGUGAGAUGCGGGCAUCGCUGGAAUCCUGGUGGCAGAACCCAGUCCCUGGUCUGAGACUCACGUUUCCUGAGCAGAGCGUGCCUGGGGCCCUGCAGUUCCGCCUGACAUCCGUAGAUCUUGAGGACUGGAUGGAUGUUAGCCUGGUGCCUGCCUUCAAUGUCCUGGGUCAGGCCGGCUCCGGCGUCAAACCCAAGCCACAAGUCUACUCUACCCUCCUCAACAGUGGCUGCCAAGGGGGCGAGCAUGCGGCCUGCUUCACAGAGCUGCGGAGGAACUUUGUGAACAUUCGCCCAGCCAAGUUGAAGAACCUAAUCUUGCUGGUGAAGCACUGGUACCACCAGGUGUGCCUACAGGGGUUGUGGAAGGAGACGCUGCCCCCGGUCUAUGCCCUGGAAUUGCUGACCAUCUUCGCCUGGGAGCAGGGCUGUAAGAAGGAUGCUUUCAGCCUAGCCGAAGGCCUCCGAACUGUCCUGGGCCUGAUCCAACAGCAUCAGCACCUGUGUGUUUUCUGGACUGUCAACUAUGGCUUCGAGGACCCUGCAGUUGGGCAGUUCUUGCAGCGGCAGCUUAAGAGACCCAGGCCUGUGAUCCUGGACCCAGCUGACCCCACAUGGGACCUGGGGAAUGGGGCAGCCUGGCACUGGGAUUUGCUAGCCCAGGAGGCAGCAUCCUGCUAUGACCACCCAUGCUUUCUGAGGGGGAUGGGGGACCCAGUGCAGUCUUGGAAGGGGCCG [서열번호 138]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 138에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 136의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 139로 제공된다:
ATGGACCTGTACAGCACACCAGCCGCCGCTCTGGATAGATTCGTGGCTAGACGACTGCAGCCCCGGAAAGAATTCGTGGAAAAGGCTCGGAGAGCCCTGGGAGCACTTGCTGCTGCTCTGAGAGAAAGAGGCGGCAGACTTGGAGCCGCTGCTCCCAGAGTGCTGAAAACAGTGAAAGGCGGCAGCAGCGGCAGAGGCACAGCTCTTAAAGGCGGCTGCGATAGCGAGCTGGTCATCTTCCTGGACTGCTTCAAGAGCTACGTGGACCAGAGAGCCAGACGGGCCGAGATCCTGTCTGAGATGAGAGCCAGCCTGGAAAGCTGGTGGCAGAATCCTGTGCCTGGCCTGAGACTGACATTCCCCGAACAGTCTGTTCCCGGCGCTCTGCAGTTTAGACTGACCTCCGTGGACCTGGAAGATTGGATGGATGTGTCCCTGGTGCCTGCCTTCAATGTGCTGGGACAAGCTGGCTCTGGCGTGAAGCCTAAGCCTCAGGTGTACTCTACCCTGCTGAACTCCGGCTGTCAAGGCGGAGAACACGCCGCCTGTTTTACCGAGCTGCGGCGGAACTTCGTGAACATCAGACCCGCCAAGCTGAAGAACCTGATCCTGCTGGTCAAGCACTGGTATCACCAAGTGTGCCTGCAAGGCCTGTGGAAAGAAACCCTGCCTCCTGTGTACGCCCTGGAACTGCTGACCATCTTCGCCTGGGAACAGGGCTGCAAGAAGGACGCCTTTAGCCTGGCCGAGGGCCTGAGAACAGTTCTGGGACTGATTCAGCAGCACCAGCACCTGTGCGTGTTCTGGACCGTGAACTACGGCTTCGAGGATCCTGCCGTGGGCCAGTTTCTGCAGAGACAGCTGAAGAGGCCCAGACCTGTGATCCTGGATCCTGCAGACCCTACATGGGACCTCGGAAATGGCGCTGCCTGGCATTGGGATCTGCTGGCCCAAGAAGCCGCCAGCTGTTACGATCACCCCTGCTTTCTGAGAGGCATGGGCGATCCTGTGCAGAGCTGGAAGGGACCTTGA [서열번호 139]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 139에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 139의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 140으로 제공된다:
AUGGACCUGUACAGCACACCAGCCGCCGCUCUGGAUAGAUUCGUGGCUAGACGACUGCAGCCCCGGAAAGAAUUCGUGGAAAAGGCUCGGAGAGCCCUGGGAGCACUUGCUGCUGCUCUGAGAGAAAGAGGCGGCAGACUUGGAGCCGCUGCUCCCAGAGUGCUGAAAACAGUGAAAGGCGGCAGCAGCGGCAGAGGCACAGCUCUUAAAGGCGGCUGCGAUAGCGAGCUGGUCAUCUUCCUGGACUGCUUCAAGAGCUACGUGGACCAGAGAGCCAGACGGGCCGAGAUCCUGUCUGAGAUGAGAGCCAGCCUGGAAAGCUGGUGGCAGAAUCCUGUGCCUGGCCUGAGACUGACAUUCCCCGAACAGUCUGUUCCCGGCGCUCUGCAGUUUAGACUGACCUCCGUGGACCUGGAAGAUUGGAUGGAUGUGUCCCUGGUGCCUGCCUUCAAUGUGCUGGGACAAGCUGGCUCUGGCGUGAAGCCUAAGCCUCAGGUGUACUCUACCCUGCUGAACUCCGGCUGUCAAGGCGGAGAACACGCCGCCUGUUUUACCGAGCUGCGGCGGAACUUCGUGAACAUCAGACCCGCCAAGCUGAAGAACCUGAUCCUGCUGGUCAAGCACUGGUAUCACCAAGUGUGCCUGCAAGGCCUGUGGAAAGAAACCCUGCCUCCUGUGUACGCCCUGGAACUGCUGACCAUCUUCGCCUGGGAACAGGGCUGCAAGAAGGACGCCUUUAGCCUGGCCGAGGGCCUGAGAACAGUUCUGGGACUGAUUCAGCAGCACCAGCACCUGUGCGUGUUCUGGACCGUGAACUACGGCUUCGAGGAUCCUGCCGUGGGCCAGUUUCUGCAGAGACAGCUGAAGAGGCCCAGACCUGUGAUCCUGGAUCCUGCAGACCCUACAUGGGACCUCGGAAAUGGCGCUGCCUGGCAUUGGGAUCUGCUGGCCCAAGAAGCCGCCAGCUGUUACGAUCACCCCUGCUUUCUGAGAGGCAUGGGCGAUCCUGUGCAGAGCUGGAAGGGACCUUGA [서열번호 140]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로) 서열번호 140에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
추가 구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 RNAse L, 또는 그의 동원체이다. RNAse L은 RNA 자체는 인식하지 않는다; dsRNA는 ATP로부터 2',5'-연관 올리고아데닐레이트를 생성하도록 활성화되는 OAS에 의해 인식된다. 이것이 RNAse L에 결합할 때, 이것은 RNA를 분해하는 엔도리보뉴클레아제(endoribonuclease)(NCBI 참조 서열: NM_021133.4; UniProtKB - Q05823(RN5A_HUMAN))를 활성화하게 된다. [Tanaka N, Nakanishi M, Kusakabe Y, Goto Y, Kitade Y, Nakamura KT. Structural basis for recognition of 2',5'-linked oligoadenylates by human ribonuclease L. EMBO J. 2004 Oct 13;23(20):3929-38. doi: 10.1038/sj.emboj.7600420]. RNAse L 우성 음성의 한 구현예는 본 명세서에서 다음과 같은 서열번호 131로 나타낸다:
MESRDHNNPQEGPTSSSGRRAAVEDNHLLIKAVQNEDVDLVQQLLEGGANVNFQEEEGGWTPLHNAVQMSREDIVELLLRHGADPVLRKKNGATPFILAAIAGSVKLLKLFLSKGADVNECDFYGFTAFMEAAVYGKVKALKFLYKRGANVNLRRKTKEDQERLRKGGATALMDAAEKGHVEVLKILLDEMGADVNACDNMGRNALIHALLSSDDSDVEAITHLLLDHGADVNVRGERGKTPLILAVEKKHLGLVQRLLEQEHIEINDTDSDGKTALLLAVELKLKKIAELLCKRGASTDCGDLVMTARRNYDHSLVKVLLSHGAKEDFH [서열번호 131]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 131에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 RNAse L 폴리펩티드는 다음과 같은 서열번호 132의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGGAGAGCAGGGATCATAACAACCCCCAGGAGGGACCCACGTCCTCCAGCGGTAGAAGGGCTGCAGTGGAAGACAATCACTTGCTGATTAAAGCTGTTCAAAACGAAGATGTTGACCTGGTCCAGCAATTGCTGGAAGGTGGAGCCAATGTTAATTTCCAGGAAGAGGAAGGGGGCTGGACACCTCTGCATAACGCAGTACAAATGAGCAGGGAGGACATTGTGGAACTTCTGCTTCGTCATGGTGCTGACCCTGTTCTGAGGAAGAAGAATGGGGCCACGCCTTTTATCCTCGCAGCGATTGCGGGGAGCGTGAAGCTGCTGAAACTTTTCCTTTCTAAAGGAGCAGATGTCAATGAGTGTGATTTTTATGGCTTCACAGCCTTCATGGAAGCCGCTGTGTATGGTAAGGTCAAAGCCCTAAAATTCCTTTATAAGAGAGGAGCAAATGTGAATTTGAGGCGAAAGACAAAGGAGGATCAAGAGCGGCTGAGGAAAGGAGGGGCCACAGCTCTCATGGACGCTGCTGAAAAAGGACACGTAGAGGTCTTGAAGATTCTCCTTGATGAGATGGGGGCAGATGTAAACGCCTGTGACAATATGGGCAGAAATGCCTTGATCCATGCTCTCCTGAGCTCTGACGATAGTGATGTGGAGGCTATTACGCATCTGCTGCTGGACCATGGGGCTGATGTCAATGTGAGGGGAGAAAGAGGGAAGACTCCCCTGATCCTGGCAGTGGAGAAGAAGCACTTGGGTTTGGTGCAGAGGCTTCTGGAGCAAGAGCACATAGAGATTAATGACACAGACAGTGATGGCAAAACAGCACTGCTGCTTGCTGTTGAACTCAAACTGAAGAAAATCGCCGAGTTGCTGTGCAAACGTGGAGCCAGTACAGATTGTGGGGATCTTGTTATGACAGCGAGGCGGAATTATGACCATTCCCTTGTGAAGGTTCTTCTCTCTCATGGAGCCAAAGAAGATTTTCAC [서열번호 132]
따라서, 바람직하게는 상기 RNAse L 형태 폴리펩티드는 실질적으로 서열번호 132에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 133의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGGAGAGCAGGGAUCAUAACAACCCCCAGGAGGGACCCACGUCCUCCAGCGGUAGAAGGGCUGCAGUGGAAGACAAUCACUUGCUGAUUAAAGCUGUUCAAAACGAAGAUGUUGACCUGGUCCAGCAAUUGCUGGAAGGUGGAGCCAAUGUUAAUUUCCAGGAAGAGGAAGGGGGCUGGACACCUCUGCAUAACGCAGUACAAAUGAGCAGGGAGGACAUUGUGGAACUUCUGCUUCGUCAUGGUGCUGACCCUGUUCUGAGGAAGAAGAAUGGGGCCACGCCUUUUAUCCUCGCAGCGAUUGCGGGGAGCGUGAAGCUGCUGAAACUUUUCCUUUCUAAAGGAGCAGAUGUCAAUGAGUGUGAUUUUUAUGGCUUCACAGCCUUCAUGGAAGCCGCUGUGUAUGGUAAGGUCAAAGCCCUAAAAUUCCUUUAUAAGAGAGGAGCAAAUGUGAAUUUGAGGCGAAAGACAAAGGAGGAUCAAGAGCGGCUGAGGAAAGGAGGGGCCACAGCUCUCAUGGACGCUGCUGAAAAAGGACACGUAGAGGUCUUGAAGAUUCUCCUUGAUGAGAUGGGGGCAGAUGUAAACGCCUGUGACAAUAUGGGCAGAAAUGCCUUGAUCCAUGCUCUCCUGAGCUCUGACGAUAGUGAUGUGGAGGCUAUUACGCAUCUGCUGCUGGACCAUGGGGCUGAUGUCAAUGUGAGGGGAGAAAGAGGGAAGACUCCCCUGAUCCUGGCAGUGGAGAAGAAGCACUUGGGUUUGGUGCAGAGGCUUCUGGAGCAAGAGCACAUAGAGAUUAAUGACACAGACAGUGAUGGCAAAACAGCACUGCUGCUUGCUGUUGAACUCAAACUGAAGAAAAUCGCCGAGUUGCUGUGCAAACGUGGAGCCAGUACAGAUUGUGGGGAUCUUGUUAUGACAGCGAGGCGGAAUUAUGACCAUUCCCUUGUGAAGGUUCUUCUCUCUCAUGGAGCCAAAGAAGAUUUUCAC [서열번호 133]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 133에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 133의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 134로 제공된다:
ATGGAAAGCCGGGACCACAACAACCCTCAAGAGGGCCCTACAAGCAGCTCTGGTAGAAGGGCCGCTGTGGAAGATAACCATCTGCTGATCAAGGCCGTGCAGAACGAGGACGTGGACCTGGTGCAACAACTGCTGGAAGGCGGAGCCAACGTGAACTTCCAAGAGGAAGAAGGCGGCTGGACCCCTCTGCATAACGCTGTGCAGATGAGCAGAGAGGACATCGTCGAGCTGCTGCTGAGACATGGCGCTGACCCTGTGCTGAGAAAGAAGAACGGCGCCACACCTTTCATCCTGGCCGCCATTGCCGGAAGCGTGAAGCTGCTGAAGCTGTTCCTGAGCAAGGGCGCCGATGTGAACGAGTGCGACTTCTACGGCTTCACCGCCTTCATGGAAGCCGCCGTGTACGGCAAAGTGAAGGCCCTGAAGTTCCTGTACAAGAGGGGCGCTAACGTGAACCTGCGGAGAAAGACCAAAGAGGACCAAGAGCGGCTGCGGAAAGGTGGCGCTACAGCTCTTATGGATGCCGCCGAGAAGGGACACGTGGAAGTGCTGAAGATCCTGCTGGATGAGATGGGCGCAGACGTGAACGCCTGCGACAACATGGGAAGAAACGCCCTGATTCACGCCCTGCTGAGCAGCGACGATAGCGACGTGGAAGCCATCACACATCTGCTGCTGGATCACGGGGCTGATGTGAATGTGCGGGGCGAGAGAGGAAAGACCCCACTGATTCTGGCCGTGGAAAAGAAACACCTGGGCCTCGTGCAGAGGCTGCTGGAACAAGAGCACATCGAGATCAACGACACCGACAGCGACGGCAAGACAGCCCTGCTGCTTGCCGTGGAACTGAAGCTGAAGAAGATCGCCGAACTGCTGTGCAAGAGAGGCGCCAGCACAGATTGTGGCGACCTCGTGATGACCGCCAGACGGAACTACGATCACAGCCTGGTCAAGGTGCTGCTGTCCCATGGCGCTAAAGAGGACTTCCACTGA [서열번호 134]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 134에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 134의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 135로 제공된다:
AUGGAGAGCAGGGAUCAUAACAACCCCCAGGAGGGACCCACGUCCUCCAGCGGUAGAAGGGCUGCAGUGGAAGACAAUCACUUGCUGAUUAAAGCUGUUCAAAACGAAGAUGUUGACCUGGUCCAGCAAUUGCUGGAAGGUGGAGCCAAUGUUAAUUUCCAGGAAGAGGAAGGGGGCUGGACACCUCUGCAUAACGCAGUACAAAUGAGCAGGGAGGACAUUGUGGAACUUCUGCUUCGUCAUGGUGCUGACCCUGUUCUGAGGAAGAAGAAUGGGGCCACGCCUUUUAUCCUCGCAGCGAUUGCGGGGAGCGUGAAGCUGCUGAAACUUUUCCUUUCUAAAGGAGCAGAUGUCAAUGAGUGUGAUUUUUAUGGCUUCACAGCCUUCAUGGAAGCCGCUGUGUAUGGUAAGGUCAAAGCCCUAAAAUUCCUUUAUAAGAGAGGAGCAAAUGUGAAUUUGAGGCGAAAGACAAAGGAGGAUCAAGAGCGGCUGAGGAAAGGAGGGGCCACAGCUCUCAUGGACGCUGCUGAAAAAGGACACGUAGAGGUCUUGAAGAUUCUCCUUGAUGAGAUGGGGGCAGAUGUAAACGCCUGUGACAAUAUGGGCAGAAAUGCCUUGAUCCAUGCUCUCCUGAGCUCUGACGAUAGUGAUGUGGAGGCUAUUACGCAUCUGCUGCUGGACCAUGGGGCUGAUGUCAAUGUGAGGGGAGAAAGAGGGAAGACUCCCCUGAUCCUGGCAGUGGAGAAGAAGCACUUGGGUUUGGUGCAGAGGCUUCUGGAGCAAGAGCACAUAGAGAUUAAUGACACAGACAGUGAUGGCAAAACAGCACUGCUGCUUGCUGUUGAACUCAAACUGAAGAAAAUCGCCGAGUUGCUGUGCAAACGUGGAGCCAGUACAGAUUGUGGGGAUCUUGUUAUGACAGCGAGGCGGAAUUAUGACCAUUCCCUUGUGAAGGUUCUUCUCUCUCAUGGAGCCAAAGAAGAUUUUCAC [서열번호 135]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 135에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
구현예에서, RNA 인식의 억제제, 또는 그의 우성 음성 형태는 PACT, 즉 dsRNA 결합 도메인(1&2)은 갖지만 PKR 활성화를 방지하는 c-말단(도메인 3)은 결실된 우성 음성 형태(NCBI 참조 서열: NM_003690.5; UniProtKB - O75569(PRKRA_HUMAN)), 또는 그의 동원체이다. [Heyam A, Lagos D, Plevin M. Dissecting the roles of TRBP and PACT in double-stranded RNA recognition and processing of noncoding RNAs. Wiley Interdiscip Rev RNA. 2015 May-Jun;6(3):271-89. doi: 10.1002/wrna.1272]. 상기 PACT 우성 음성 형태의 한 구현예는 [>sp|O75569|PRKRA_HUMAN|1-194 Interferon-inducible double-stranded RNA-dependent protein kinase activator A OS=Homo sapiens OX=9606 GN=PRKRA PE=1 SV=1(PACT PRKRA BD(1-194))]로 나타내며, 본 명세서에서 다음과 같은 서열번호 126으로 나타낸다:
MSQSRHRAEAPPLEREDSGTFSLGKMITAKPGKTPIQVLHEYGMKTKNIPVYECERSDVQIHVPTFTFRVTVGDITCTGEGTSKKLAKHRAAEAAINILKANASICFAVPDPLMPDPSKQPKNQLNPIGSLQELAIHHGWRLPEYTLSQEGGPAHKREYTTICRLESFMETGKGASKKQAKRNAAEKFLAKFSN [서열번호 126]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 126에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 PACT 우성 음성 형태 폴리펩티드(PACT PRKRA BD(1-194))는 다음과 같은 서열번호 127의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGTCCCAGAGCAGGCACCGCGCCGAGGCCCCGCCGCTGGAGCGCGAGGACAGTGGGACCTTCAGTTTGGGGAAGATGATAACAGCTAAGCCAGGGAAAACACCGATTCAGGTATTACACGAATACGGCATGAAGACCAAGAACATCCCAGTTTATGAATGTGAAAGATCTGATGTGCAAATACACGTGCCCACTTTCACCTTCAGAGTAACCGTTGGTGACATAACCTGCACAGGTGAAGGTACAAGTAAGAAGCTGGCGAAACATAGAGCTGCAGAGGCTGCCATAAACATTTTGAAAGCCAATGCAAGTATTTGCTTTGCAGTTCCTGACCCCTTAATGCCTGACCCTTCCAAGCAACCAAAGAACCAGCTTAATCCTATTGGTTCATTACAGGAATTGGCTATTCATCATGGCTGGAGACTTCCTGAATATACCCTTTCCCAGGAGGGAGGACCTGCTCATAAGAGAGAATATACTACAATTTGCAGGCTAGAGTCATTTATGGAAACTGGAAAGGGGGCATCAAAAAAGCAAGCCAAAAGGAATGCTGCTGAGAAATTTCTTGCCAAATTTAGTAAT [서열번호 127]
따라서, 바람직하게는 상기 PACT 우성 음성 형태 폴리펩티드는 실질적으로 서열번호 127에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 128의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGUCCCAGAGCAGGCACCGCGCCGAGGCCCCGCCGCUGGAGCGCGAGGACAGUGGGACCUUCAGUUUGGGGAAGAUGAUAACAGCUAAGCCAGGGAAAACACCGAUUCAGGUAUUACACGAAUACGGCAUGAAGACCAAGAACAUCCCAGUUUAUGAAUGUGAAAGAUCUGAUGUGCAAAUACACGUGCCCACUUUCACCUUCAGAGUAACCGUUGGUGACAUAACCUGCACAGGUGAAGGUACAAGUAAGAAGCUGGCGAAACAUAGAGCUGCAGAGGCUGCCAUAAACAUUUUGAAAGCCAAUGCAAGUAUUUGCUUUGCAGUUCCUGACCCCUUAAUGCCUGACCCUUCCAAGCAACCAAAGAACCAGCUUAAUCCUAUUGGUUCAUUACAGGAAUUGGCUAUUCAUCAUGGCUGGAGACUUCCUGAAUAUACCCUUUCCCAGGAGGGAGGACCUGCUCAUAAGAGAGAAUAUACUACAAUUUGCAGGCUAGAGUCAUUUAUGGAAACUGGAAAGGGGGCAUCAAAAAAGCAAGCCAAAAGGAAUGCUGCUGAGAAAUUUCUUGCCAAAUUUAGUAAU [서열번호 128]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 129에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 126의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 129로 제공된다:
ATGGCTGGCGATCTGAGCGCCGGCTTCTTCATGGAAGAACTGAACACCTACCGGCAGAAACAGGGCGTCGTGCTGAAGTACCAAGAGCTGCCTAATAGCGGCCCTCCTCACGACCGGCGGTTCACCTTTCAAGTGATCATCGACGGCAGAGAGTTCCCCGAAGGCGAGGGCAGATCTAAGAAAGAGGCCAAGAACGCCGCTGCCAAGCTGGCCGTGGAAATCCTGAACAAAGAGAAGAAGGCCGTTTCTCCCCTGCTGCTGACCACCACCAATAGCTCTGAGGGCCTGAGCATGGGCAACTACATCGGCCTGATCAACCGGATCGCCCAGAAAAAGCGGCTGACCGTGAACTACGAGCAGTGTGCCAGCGGAGTGCACGGCCCTGAGGGCTTTCACTACAAGTGCAAGATGGGCCAGAAAGAGTACAGCATCGGCACCGGCAGCACCAAGCAAGAAGCCAAACAGCTGGCCGCCAAACTGGCCTACCTGCAGATCCTGAGCGAGGAAACCTGA [서열번호 129]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 129에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 129의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 130으로 제공된다:
AUGAGCCAGAGCAGACACAGAGCCGAAGCUCCUCCACUGGAAAGAGAGGACAGCGGCACCUUUAGCCUGGGCAAGAUGAUCACAGCCAAGCCUGGCAAGACCCCUAUCCAGGUGCUGCACGAGUACGGCAUGAAGACCAAGAACAUCCCCGUGUACGAGUGCGAGAGAAGCGACGUGCAGAUCCACGUGCCAACCUUCACCUUCAGAGUGACCGUGGGCGACAUCACCUGUACCGGCGAGGGCACAUCUAAGAAGCUGGCCAAACAUAGAGCCGCCGAGGCCGCCAUCAAUAUCCUGAAGGCCAAUGCCAGCAUCUGCUUCGCCGUGCCUGAUCCUCUGAUGCCCGAUCCUAGCAAGCAGCCCAAGAACCAGCUGAACCCUAUCGGCAGCCUGCAAGAGCUGGCCAUUCAUCAUGGAUGGCGGCUGCCUGAGUACACCCUGUCUCAAGAAGGCGGCCCUGCUCACAAGAGAGAGUACACCACCAUCUGCCGGCUGGAAAGCUUCAUGGAAACAGGCAAGGGCGCCAGCAAGAAACAGGCCAAGAGAAACGCCGCCGAGAAGUUCCUGGCCAAGUUCAGCAACUGA [서열번호 130]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 130에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 RIG-1(DDX 58) RNA 결합 단백질 C-말단 도메인, 또는 그의 우성 음성 형태(NCBI 참조 서열: NM_014314.4; UniProtKB - O95786(DDX58_HUMAN)), 또는 그의 동원체일 수 있다. >sp|O95786|794-925. 상기 RIG-1 우성 음성 형태의 한 구현예는 본 명세서에서 다음과 같은 서열번호 141로 나타낸다:
(M)QEKPKPVPDKENKKLLCRKCKALACYTADVRVIEECHYTVLGDAFKECFVSRPHPKPKQFSSFEKRAKIFCARQNCSHDWGIHVKYKTFEIPVIKIESFVVEDIATGVQTLYSKWKDFHFEKIPFDPAEMSK [서열번호 141]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 141에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 RIG-1 우성 음성 형태 폴리펩티드는 다음과 같은 서열번호 142의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGCAAGAAAAACCAAAACCTGTACCTGATAAGGAAAATAAAAAACTGCTCTGCAGAAAGTGCAAAGCCTTGGCATGTTACACAGCTGACGTAAGAGTGATAGAGGAATGCCATTACACTGTGCTTGGAGATGCTTTTAAGGAATGCTTTGTGAGTAGACCACATCCCAAGCCAAAGCAGTTTTCAAGTTTTGAAAAAAGAGCAAAGATATTCTGTGCCCGACAGAACTGCAGCCATGACTGGGGAATCCATGTGAAGTACAAGACATTTGAGATTCCAGTTATAAAAATTGAAAGTTTTGTGGTGGAGGATATTGCAACTGGAGTTCAGACACTGTACTCGAAGTGGAAGGACTTTCATTTTGAGAAGATACCATTTGATCCAGCAGAAATGTCCAAA [서열번호 142]
따라서, 바람직하게는 상기 RIG-1 우성 음성 형태 폴리펩티드는 실질적으로 서열번호 142에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 143의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGCAAGAAAAACCAAAACCUGUACCUGAUAAGGAAAAUAAAAAACUGCUCUGCAGAAAGUGCAAAGCCUUGGCAUGUUACACAGCUGACGUAAGAGUGAUAGAGGAAUGCCAUUACACUGUGCUUGGAGAUGCUUUUAAGGAAUGCUUUGUGAGUAGACCACAUCCCAAGCCAAAGCAGUUUUCAAGUUUUGAAAAAAGAGCAAAGAUAUUCUGUGCCCGACAGAACUGCAGCCAUGACUGGGGAAUCCAUGUGAAGUACAAGACAUUUGAGAUUCCAGUUAUAAAAAUUGAAAGUUUUGUGGUGGAGGAUAUUGCAACUGGAGUUCAGACACUGUACUCGAAGUGGAAGGACUUUCAUUUUGAGAAGAUACCAUUUGAUCCAGCAGAAAUGUCCAAA [서열번호 143]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 143에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 141의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 144로 제공된다:
ATGCAAGAGAAGCCCAAGCCTGTGCCTGACAAAGAGAACAAGAAACTGCTGTGCCGGAAGTGCAAGGCCCTGGCCTGTTATACAGCCGACGTGCGCGTGATCGAGGAATGCCACTATACAGTGCTGGGCGACGCCTTCAAAGAATGCTTCGTGTCCCGGCCTCATCCTAAGCCTAAGCAGTTCAGCAGCTTCGAGAAGCGGGCCAAGATCTTCTGCGCCAGACAGAACTGCAGCCACGACTGGGGAATCCACGTGAAGTACAAGACCTTCGAGATCCCCGTGATCAAGATCGAGAGCTTCGTGGTGGAAGATATCGCCACCGGCGTGCAGACCCTGTACAGCAAGTGGAAGGATTTCCACTTTGAGAAGATCCCTTTCGACCCCGCCGAGATGAGCAAGTGA [서열번호 144]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 144에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 144의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 145로 제공된다:
AUGCAAGAGAAGCCCAAGCCUGUGCCUGACAAAGAGAACAAGAAACUGCUGUGCCGGAAGUGCAAGGCCCUGGCCUGUUAUACAGCCGACGUGCGCGUGAUCGAGGAAUGCCACUAUACAGUGCUGGGCGACGCCUUCAAAGAAUGCUUCGUGUCCCGGCCUCAUCCUAAGCCUAAGCAGUUCAGCAGCUUCGAGAAGCGGGCCAAGAUCUUCUGCGCCAGACAGAACUGCAGCCACGACUGGGGAAUCCACGUGAAGUACAAGACCUUCGAGAUCCCCGUGAUCAAGAUCGAGAGCUUCGUGGUGGAAGAUAUCGCCACCGGCGUGCAGACCCUGUACAGCAAGUGGAAGGAUUUCCACUUUGAGAAGAUCCCUUUCGACCCCGCCGAGAUGAGCAAGUGA [서열번호 145]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 145에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 구현예에서, 상기 적어도 하나의 IMP는 RIG 스플라이스 변이체(DDX58_HUMAN_ISOFORM_2)(NCBI 참조 서열: NM_014314.4; UniProtKB - O95786(DDX58_HUMAN) AA 36 - 80 deletion), 또는 그의 동원체일 수 있다. 상기 RIG 스플라이스 변이체의 한 구현예는 본 명세서에서 다음과 같은 서열번호 186으로 나타낸다:
MTTEQRRSLQAFQDYIRKTLDPTYILSYMAPWFREGYSGLYEAIESWDFKKIEKLEEYRLLLKRLQPEFKTRIIPTDIISDLSECLINQECEEILQICSTKGMMAGAEKLVECLLRSDKENWPKTLKLALEKERNKFSELWIVEKGIKDVETEDLEDKMETSDIQIFYQEDPECQNLSENSCPPSEVSDTNLYSPFKPRNYQLELALPAMKGKNTIICAPTGCGKTFVSLLICEHHLKKFPQGQKGKVVFFANQIPVYEQQKSVFSKYFERHGYRVTGISGATAENVPVEQIVENNDIIILTPQILVNNLKKGTIPSLSIFTLMIFDECHNTSKQHPYNMIMFNYLDQKLGGSSGPLPQVIGLTASVGVGDAKNTDEALDYICKLCASLDASVIATVKHNLEELEQVVYKPQKFFRKVESRISDKFKYIIAQLMRDTESLAKRICKDLENLSQIQNREFGTQKYEQWIVTVQKACMVFQMPDKDEESRICKALFLYTSHLRKYNDALIISEHARMKDALDYLKDFFSNVRAAGFDEIEQDLTQRFEEKLQELESVSRDPSNENPKLEDLCFILQEEYHLNPETITILFVKTRALVDALKNWIEGNPKLSFLKPGILTGRGKTNQNTGMTLPAQKCILDAFKASGDHNILIATSVADEGIDIAQCNLVILYEYVGNVIKMIQTRGRGRARGSKCFLLTSNAGVIEKEQINMYKEKMMNDSILRLQTWDEAVFREKILHIQTHEKFIRDSQEKPKPVPDKENKKLLCRKCKALACYTADVRVIEECHYTVLGDAFKECFVSRPHPKPKQFSSFEKRAKIFCARQNCSHDWGIHVKYKTFEIPVIKIESFVVEDIATGVQTLYSKWKDFHFEKIPFDPAEMSK [서열번호 186]
따라서, 바람직하게는 제1 측면의 RNA 구조체는 실질적으로 서열번호 186에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 암호화하는 뉴클레오티드 서열을 포함한다.
한 구현예에서, 상기 RIG 스플라이스 변이체는 다음과 같은 서열번호 187의 DNA 뉴클레오티드 서열에 의해 암호화된다:
ATGACCACCGAGCAGCGACGCAGCCTGCAAGCCTTCCAGGATTATATCCGGAAGACCCTGGACCCTACCTACATCCTGAGCTACATGGCCCCCTGGTTTAGGGAGGGTTATTCTGGACTTTATGAAGCCATTGAAAGTTGGGATTTCAAAAAAATTGAAAAGTTGGAGGAGTATAGATTACTTTTAAAACGTTTACAACCAGAATTTAAAACCAGAATTATCCCAACCGATATCATTTCTGATCTGTCTGAATGTTTAATTAATCAGGAATGTGAAGAAATTCTACAGATTTGCTCTACTAAGGGGATGATGGCAGGTGCAGAGAAATTGGTGGAATGCCTTCTCAGATCAGACAAGGAAAACTGGCCCAAAACTTTGAAACTTGCTTTGGAGAAAGAAAGGAACAAGTTCAGTGAACTGTGGATTGTAGAGAAAGGTATAAAAGATGTTGAAACAGAAGATCTTGAGGATAAGATGGAAACTTCTGACATACAGATTTTCTACCAAGAAGATCCAGAATGCCAGAATCTTAGTGAGAATTCATGTCCACCTTCAGAAGTGTCTGATACAAACTTGTACAGCCCATTTAAACCAAGAAATTACCAATTAGAGCTTGCTTTGCCTGCTATGAAAGGAAAAAACACAATAATATGTGCTCCTACAGGTTGTGGAAAAACCTTTGTTTCACTGCTTATATGTGAACATCATCTTAAAAAATTCCCACAAGGACAAAAGGGGAAAGTTGTCTTTTTTGCGAATCAGATCCCAGTGTATGAACAGCAGAAATCTGTATTCTCAAAATACTTTGAAAGACATGGGTATAGAGTTACAGGCATTTCTGGAGCAACAGCTGAGAATGTCCCAGTGGAACAGATTGTTGAGAACAATGACATCATCATTTTAACTCCACAGATTCTTGTGAACAACCTTAAAAAGGGAACGATTCCATCACTATCCATCTTTACTTTGATGATATTTGATGAATGCCACAACACTAGTAAACAACACCCGTACAATATGATCATGTTTAATTATCTAGATCAGAAACTTGGAGGATCTTCAGGCCCACTGCCCCAGGTCATTGGGCTGACTGCCTCGGTTGGTGTTGGGGATGCCAAAAACACAGATGAAGCCTTGGATTATATCTGCAAGCTGTGTGCTTCTCTTGATGCGTCAGTGATAGCAACAGTCAAACACAATCTGGAGGAACTGGAGCAAGTTGTTTATAAGCCCCAGAAGTTTTTCAGGAAAGTGGAATCACGGATTAGCGACAAATTTAAATACATCATAGCTCAGCTGATGAGGGACACAGAGAGTCTGGCAAAGAGAATCTGCAAAGACCTCGAAAACTTATCTCAAATTCAAAATAGGGAATTTGGAACACAGAAATATGAACAATGGATTGTTACAGTTCAGAAAGCATGCATGGTGTTCCAGATGCCAGACAAAGATGAAGAGAGCAGGATTTGTAAAGCCCTGTTTTTATACACTTCACATTTGCGGAAATATAATGATGCCCTCATTATCAGTGAGCATGCACGAATGAAAGATGCTCTGGATTACTTGAAAGACTTCTTCAGCAATGTCCGAGCAGCAGGATTCGATGAGATTGAGCAAGATCTTACTCAGAGATTTGAAGAAAAGCTGCAGGAACTAGAAAGTGTTTCCAGGGATCCCAGCAATGAGAATCCTAAACTTGAAGACCTCTGCTTCATCTTACAAGAAGAGTACCACTTAAACCCAGAGACAATAACAATTCTCTTTGTGAAAACCAGAGCACTTGTGGACGCTTTAAAAAATTGGATTGAAGGAAATCCTAAACTCAGTTTTCTAAAACCTGGCATATTGACTGGACGTGGCAAAACAAATCAGAACACAGGAATGACCCTCCCGGCACAGAAGTGTATATTGGATGCATTCAAAGCCAGTGGAGATCACAATATTCTGATTGCCACCTCAGTTGCTGATGAAGGCATTGACATTGCACAGTGCAATCTTGTCATCCTTTATGAGTATGTGGGCAATGTCATCAAAATGATCCAAACCAGAGGCAGAGGAAGAGCAAGAGGTAGCAAGTGCTTCCTTCTGACTAGTAATGCTGGTGTAATTGAAAAAGAACAAATAAACATGTACAAAGAAAAAATGATGAATGACTCTATTTTACGCCTTCAGACATGGGACGAAGCAGTATTTAGGGAAAAGATTCTGCATATACAGACTCATGAAAAATTCATCAGAGATAGTCAAGAAAAACCAAAACCTGTACCTGATAAGGAAAATAAAAAACTGCTCTGCAGAAAGTGCAAAGCCTTGGCATGTTACACAGCTGACGTAAGAGTGATAGAGGAATGCCATTACACTGTGCTTGGAGATGCTTTTAAGGAATGCTTTGTGAGTAGACCACATCCCAAGCCAAAGCAGTTTTCAAGTTTTGAAAAAAGAGCAAAGATATTCTGTGCCCGACAGAACTGCAGCCATGACTGGGGAATCCATGTGAAGTACAAGACATTTGAGATTCCAGTTATAAAAATTGAAAGTTTTGTGGTGGAGGATATTGCAACTGGAGTTCAGACACTGTACTCGAAGTGGAAGGACTTTCATTTTGAGAAGATACCATTTGATCCAGCAGAAATGTCCAAA [서열번호 187]
따라서, 바람직하게는 상기 RIG 스플라이스 변이체는 실질적으로 서열번호 187에 나타낸 것과 같은 DNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 188의 RNA 뉴클레오티드 서열을 포함할 수 있다:
AUGACCACCGAGCAGCGACGCAGCCUGCAAGCCUUCCAGGAUUAUAUCCGGAAGACCCUGGACCCUACCUACAUCCUGAGCUACAUGGCCCCCUGGUUUAGGGAGGGUUAUUCUGGACUUUAUGAAGCCAUUGAAAGUUGGGAUUUCAAAAAAAUUGAAAAGUUGGAGGAGUAUAGAUUACUUUUAAAACGUUUACAACCAGAAUUUAAAACCAGAAUUAUCCCAACCGAUAUCAUUUCUGAUCUGUCUGAAUGUUUAAUUAAUCAGGAAUGUGAAGAAAUUCUACAGAUUUGCUCUACUAAGGGGAUGAUGGCAGGUGCAGAGAAAUUGGUGGAAUGCCUUCUCAGAUCAGACAAGGAAAACUGGCCCAAAACUUUGAAACUUGCUUUGGAGAAAGAAAGGAACAAGUUCAGUGAACUGUGGAUUGUAGAGAAAGGUAUAAAAGAUGUUGAAACAGAAGAUCUUGAGGAUAAGAUGGAAACUUCUGACAUACAGAUUUUCUACCAAGAAGAUCCAGAAUGCCAGAAUCUUAGUGAGAAUUCAUGUCCACCUUCAGAAGUGUCUGAUACAAACUUGUACAGCCCAUUUAAACCAAGAAAUUACCAAUUAGAGCUUGCUUUGCCUGCUAUGAAAGGAAAAAACACAAUAAUAUGUGCUCCUACAGGUUGUGGAAAAACCUUUGUUUCACUGCUUAUAUGUGAACAUCAUCUUAAAAAAUUCCCACAAGGACAAAAGGGGAAAGUUGUCUUUUUUGCGAAUCAGAUCCCAGUGUAUGAACAGCAGAAAUCUGUAUUCUCAAAAUACUUUGAAAGACAUGGGUAUAGAGUUACAGGCAUUUCUGGAGCAACAGCUGAGAAUGUCCCAGUGGAACAGAUUGUUGAGAACAAUGACAUCAUCAUUUUAACUCCACAGAUUCUUGUGAACAACCUUAAAAAGGGAACGAUUCCAUCACUAUCCAUCUUUACUUUGAUGAUAUUUGAUGAAUGCCACAACACUAGUAAACAACACCCGUACAAUAUGAUCAUGUUUAAUUAUCUAGAUCAGAAACUUGGAGGAUCUUCAGGCCCACUGCCCCAGGUCAUUGGGCUGACUGCCUCGGUUGGUGUUGGGGAUGCCAAAAACACAGAUGAAGCCUUGGAUUAUAUCUGCAAGCUGUGUGCUUCUCUUGAUGCGUCAGUGAUAGCAACAGUCAAACACAAUCUGGAGGAACUGGAGCAAGUUGUUUAUAAGCCCCAGAAGUUUUUCAGGAAAGUGGAAUCACGGAUUAGCGACAAAUUUAAAUACAUCAUAGCUCAGCUGAUGAGGGACACAGAGAGUCUGGCAAAGAGAAUCUGCAAAGACCUCGAAAACUUAUCUCAAAUUCAAAAUAGGGAAUUUGGAACACAGAAAUAUGAACAAUGGAUUGUUACAGUUCAGAAAGCAUGCAUGGUGUUCCAGAUGCCAGACAAAGAUGAAGAGAGCAGGAUUUGUAAAGCCCUGUUUUUAUACACUUCACAUUUGCGGAAAUAUAAUGAUGCCCUCAUUAUCAGUGAGCAUGCACGAAUGAAAGAUGCUCUGGAUUACUUGAAAGACUUCUUCAGCAAUGUCCGAGCAGCAGGAUUCGAUGAGAUUGAGCAAGAUCUUACUCAGAGAUUUGAAGAAAAGCUGCAGGAACUAGAAAGUGUUUCCAGGGAUCCCAGCAAUGAGAAUCCUAAACUUGAAGACCUCUGCUUCAUCUUACAAGAAGAGUACCACUUAAACCCAGAGACAAUAACAAUUCUCUUUGUGAAAACCAGAGCACUUGUGGACGCUUUAAAAAAUUGGAUUGAAGGAAAUCCUAAACUCAGUUUUCUAAAACCUGGCAUAUUGACUGGACGUGGCAAAACAAAUCAGAACACAGGAAUGACCCUCCCGGCACAGAAGUGUAUAUUGGAUGCAUUCAAAGCCAGUGGAGAUCACAAUAUUCUGAUUGCCACCUCAGUUGCUGAUGAAGGCAUUGACAUUGCACAGUGCAAUCUUGUCAUCCUUUAUGAGUAUGUGGGCAAUGUCAUCAAAAUGAUCCAAACCAGAGGCAGAGGAAGAGCAAGAGGUAGCAAGUGCUUCCUUCUGACUAGUAAUGCUGGUGUAAUUGAAAAAGAACAAAUAAACAUGUACAAAGAAAAAAUGAUGAAUGACUCUAUUUUACGCCUUCAGACAUGGGACGAAGCAGUAUUUAGGGAAAAGAUUCUGCAUAUACAGACUCAUGAAAAAUUCAUCAGAGAUAGUCAAGAAAAACCAAAACCUGUACCUGAUAAGGAAAAUAAAAAACUGCUCUGCAGAAAGUGCAAAGCCUUGGCAUGUUACACAGCUGACGUAAGAGUGAUAGAGGAAUGCCAUUACACUGUGCUUGGAGAUGCUUUUAAGGAAUGCUUUGUGAGUAGACCACAUCCCAAGCCAAAGCAGUUUUCAAGUUUUGAAAAAAGAGCAAAGAUAUUCUGUGCCCGACAGAACUGCAGCCAUGACUGGGGAAUCCAUGUGAAGUACAAGACAUUUGAGAUUCCAGUUAUAAAAAUUGAAAGUUUUGUGGUGGAGGAUAUUGCAACUGGAGUUCAGACACUGUACUCGAAGUGGAAGGACUUUCAUUUUGAGAAGAUACCAUUUGAUCCAGCAGAAAUGUCCAAA [서열번호 188]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 188에 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
따라서, 본 발명자들은 서열번호 186의 단백질 서열에 인간 발현을 위한 코돈 최적화를 하였고, 개시(ATG) 및 종결(TGA) 코돈을 포함하는 코돈 최적화된 핵산(DNA) 서열의 한 구현예는 본 명세서에서 다음과 같은 서열번호 189로 제공된다:
ATGACCACCGAGCAGAGAAGATCCCTGCAGGCCTTCCAGGACTACATCAGAAAGACACTGGACCCCACCTACATCCTGAGCTACATGGCCCCATGGTTCAGAGAGGGCTACAGCGGACTGTACGAGGCCATCGAGAGCTGGGACTTCAAGAAGATCGAGAAGCTGGAAGAGTACCGGCTGCTGCTGAAGAGACTGCAGCCCGAGTTCAAGACCCGGATCATCCCCACCGACATCATCAGCGATCTGAGCGAGTGCCTGATCAATCAAGAGTGCGAGGAAATCCTGCAGATCTGTAGCACCAAGGGCATGATGGCTGGCGCCGAGAAACTGGTGGAATGCCTGCTGAGAAGCGACAAAGAGAACTGGCCCAAGACACTGAAGCTGGCCCTGGAAAAAGAGCGGAACAAGTTCAGCGAGCTGTGGATCGTGGAAAAGGGCATCAAGGACGTGGAAACCGAGGACCTGGAAGATAAGATGGAAACCAGCGACATCCAGATCTTCTACCAAGAGGACCCCGAGTGCCAGAACCTGAGCGAGAATAGCTGCCCTCCTAGCGAGGTGTCCGACACCAATCTGTACAGCCCCTTCAAGCCCCGGAACTACCAGCTGGAACTTGCCCTGCCTGCCATGAAGGGCAAGAACACCATCATCTGTGCCCCAACCGGCTGCGGCAAGACCTTTGTGTCTCTGCTGATCTGCGAGCACCACCTGAAGAAGTTCCCTCAGGGCCAGAAAGGCAAGGTGGTGTTTTTCGCCAATCAGATCCCCGTGTACGAGCAGCAGAAAAGCGTGTTCAGCAAGTACTTCGAGCGGCACGGCTACAGAGTGACAGGCATTTCTGGCGCCACCGCCGAGAATGTGCCTGTGGAACAGATTGTGGAAAACAACGATATCATCATCCTGACGCCTCAGATCCTGGTCAACAATCTGAAGAAGGGCACAATCCCCAGCCTGAGCATCTTCACCCTGATGATCTTCGACGAGTGCCACAACACCAGCAAGCAGCACCCCTACAATATGATCATGTTCAACTACCTGGACCAGAAGCTCGGCGGCAGCTCTGGACCTCTGCCTCAAGTGATTGGCCTGACAGCCTCTGTCGGAGTGGGCGACGCCAAGAATACTGACGAGGCCCTGGATTACATCTGCAAGCTGTGCGCCAGCCTGGACGCCTCTGTGATTGCCACCGTGAAGCACAACCTCGAGGAACTGGAACAGGTGGTGTACAAGCCCCAGAAATTCTTTCGGAAGGTGGAAAGCCGGATCAGCGACAAGTTCAAGTACATCATTGCCCAGCTGATGCGGGACACCGAGAGCCTGGCTAAGAGAATCTGCAAGGATCTGGAAAACCTGAGCCAGATCCAGAACAGAGAGTTCGGCACCCAGAAATACGAGCAGTGGATTGTGACCGTGCAGAAAGCCTGCATGGTGTTCCAGATGCCTGACAAGGACGAAGAGAGCCGGATCTGCAAAGCCCTGTTCCTGTACACCAGCCACCTGAGAAAGTACAACGACGCCCTGATCATCTCCGAGCACGCCAGAATGAAGGACGCCCTGGACTACCTGAAGGACTTCTTCTCCAATGTGCGCGCTGCCGGCTTCGATGAGATCGAGCAAGATCTGACCCAGCGCTTCGAGGAAAAGCTGCAAGAGCTGGAAAGCGTGTCCAGAGATCCCAGCAACGAGAACCCCAAACTGGAAGATCTGTGCTTCATCCTGCAAGAGGAATACCATCTGAACCCCGAGACAATCACCATCCTGTTCGTGAAAACAAGAGCCCTGGTGGATGCCCTGAAGAACTGGATCGAGGGCAACCCCAAGCTGAGCTTCCTGAAGCCTGGCATCCTGACCGGCAGAGGCAAGACAAACCAGAACACCGGCATGACCCTGCCAGCTCAGAAGTGCATCCTGGACGCTTTTAAGGCCAGCGGCGACCACAACATCCTGATCGCCACATCTGTGGCCGACGAGGGCATCGATATCGCCCAGTGCAATCTGGTCATCCTGTACGAGTACGTGGGCAACGTGATCAAGATGATCCAGACAAGAGGCAGGGGCAGAGCCAGAGGCAGCAAGTGCTTTCTGCTGACCTCTAATGCCGGCGTGATCGAGAAAGAACAGATCAACATGTACAAAGAAAAGATGATGAACGACAGCATCCTGCGGCTGCAGACCTGGGATGAAGCCGTGTTCCGGGAAAAGATCCTGCACATCCAGACACACGAGAAGTTCATCCGGGACAGCCAAGAGAAGCCCAAGCCTGTGCCTGACAAAGAAAACAAGAAACTGCTGTGCCGGAAGTGCAAGGCCCTGGCCTGTTATACAGCCGACGTGCGAGTGATCGAGGAATGCCACTATACCGTGCTCGGCGACGCCTTCAAAGAATGCTTCGTGTCCCGGCCTCATCCTAAGCCTAAGCAGTTCAGCAGCTTCGAGAAGCGGGCCAAGATCTTCTGCGCCAGACAGAACTGCAGCCACGACTGGGGAATCCACGTGAAGTACAAGACCTTCGAGATCCCGGTCATCAAGATCGAGTCCTTCGTGGTGGAAGATATCGCCACCGGCGTGCAGACCCTGTACAGCAAGTGGAAGGATTTCCACTTCGAGAAAATCCCTTTCGACCCCGCCGAGATGAGCAAGTGA [서열번호 189]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 189에 나타낸 것과 같은 DNA 서열, 또는 그의 단편 또는 변이체에 의해 암호화된다.
구현예에서, 개시(AUG) 및 종결(UGA) 코돈을 포함하는 서열번호 189의 코돈 최적화된 DNA 서열에 대응하는 RNA 서열은 본 명세서에서 다음과 같은 서열번호 190으로 제공된다:
AUGACCACCGAGCAGAGAAGAUCCCUGCAGGCCUUCCAGGACUACAUCAGAAAGACACUGGACCCCACCUACAUCCUGAGCUACAUGGCCCCAUGGUUCAGAGAGGGCUACAGCGGACUGUACGAGGCCAUCGAGAGCUGGGACUUCAAGAAGAUCGAGAAGCUGGAAGAGUACCGGCUGCUGCUGAAGAGACUGCAGCCCGAGUUCAAGACCCGGAUCAUCCCCACCGACAUCAUCAGCGAUCUGAGCGAGUGCCUGAUCAAUCAAGAGUGCGAGGAAAUCCUGCAGAUCUGUAGCACCAAGGGCAUGAUGGCUGGCGCCGAGAAACUGGUGGAAUGCCUGCUGAGAAGCGACAAAGAGAACUGGCCCAAGACACUGAAGCUGGCCCUGGAAAAAGAGCGGAACAAGUUCAGCGAGCUGUGGAUCGUGGAAAAGGGCAUCAAGGACGUGGAAACCGAGGACCUGGAAGAUAAGAUGGAAACCAGCGACAUCCAGAUCUUCUACCAAGAGGACCCCGAGUGCCAGAACCUGAGCGAGAAUAGCUGCCCUCCUAGCGAGGUGUCCGACACCAAUCUGUACAGCCCCUUCAAGCCCCGGAACUACCAGCUGGAACUUGCCCUGCCUGCCAUGAAGGGCAAGAACACCAUCAUCUGUGCCCCAACCGGCUGCGGCAAGACCUUUGUGUCUCUGCUGAUCUGCGAGCACCACCUGAAGAAGUUCCCUCAGGGCCAGAAAGGCAAGGUGGUGUUUUUCGCCAAUCAGAUCCCCGUGUACGAGCAGCAGAAAAGCGUGUUCAGCAAGUACUUCGAGCGGCACGGCUACAGAGUGACAGGCAUUUCUGGCGCCACCGCCGAGAAUGUGCCUGUGGAACAGAUUGUGGAAAACAACGAUAUCAUCAUCCUGACGCCUCAGAUCCUGGUCAACAAUCUGAAGAAGGGCACAAUCCCCAGCCUGAGCAUCUUCACCCUGAUGAUCUUCGACGAGUGCCACAACACCAGCAAGCAGCACCCCUACAAUAUGAUCAUGUUCAACUACCUGGACCAGAAGCUCGGCGGCAGCUCUGGACCUCUGCCUCAAGUGAUUGGCCUGACAGCCUCUGUCGGAGUGGGCGACGCCAAGAAUACUGACGAGGCCCUGGAUUACAUCUGCAAGCUGUGCGCCAGCCUGGACGCCUCUGUGAUUGCCACCGUGAAGCACAACCUCGAGGAACUGGAACAGGUGGUGUACAAGCCCCAGAAAUUCUUUCGGAAGGUGGAAAGCCGGAUCAGCGACAAGUUCAAGUACAUCAUUGCCCAGCUGAUGCGGGACACCGAGAGCCUGGCUAAGAGAAUCUGCAAGGAUCUGGAAAACCUGAGCCAGAUCCAGAACAGAGAGUUCGGCACCCAGAAAUACGAGCAGUGGAUUGUGACCGUGCAGAAAGCCUGCAUGGUGUUCCAGAUGCCUGACAAGGACGAAGAGAGCCGGAUCUGCAAAGCCCUGUUCCUGUACACCAGCCACCUGAGAAAGUACAACGACGCCCUGAUCAUCUCCGAGCACGCCAGAAUGAAGGACGCCCUGGACUACCUGAAGGACUUCUUCUCCAAUGUGCGCGCUGCCGGCUUCGAUGAGAUCGAGCAAGAUCUGACCCAGCGCUUCGAGGAAAAGCUGCAAGAGCUGGAAAGCGUGUCCAGAGAUCCCAGCAACGAGAACCCCAAACUGGAAGAUCUGUGCUUCAUCCUGCAAGAGGAAUACCAUCUGAACCCCGAGACAAUCACCAUCCUGUUCGUGAAAACAAGAGCCCUGGUGGAUGCCCUGAAGAACUGGAUCGAGGGCAACCCCAAGCUGAGCUUCCUGAAGCCUGGCAUCCUGACCGGCAGAGGCAAGACAAACCAGAACACCGGCAUGACCCUGCCAGCUCAGAAGUGCAUCCUGGACGCUUUUAAGGCCAGCGGCGACCACAACAUCCUGAUCGCCACAUCUGUGGCCGACGAGGGCAUCGAUAUCGCCCAGUGCAAUCUGGUCAUCCUGUACGAGUACGUGGGCAACGUGAUCAAGAUGAUCCAGACAAGAGGCAGGGGCAGAGCCAGAGGCAGCAAGUGCUUUCUGCUGACCUCUAAUGCCGGCGUGAUCGAGAAAGAACAGAUCAACAUGUACAAAGAAAAGAUGAUGAACGACAGCAUCCUGCGGCUGCAGACCUGGGAUGAAGCCGUGUUCCGGGAAAAGAUCCUGCACAUCCAGACACACGAGAAGUUCAUCCGGGACAGCCAAGAGAAGCCCAAGCCUGUGCCUGACAAAGAAAACAAGAAACUGCUGUGCCGGAAGUGCAAGGCCCUGGCCUGUUAUACAGCCGACGUGCGAGUGAUCGAGGAAUGCCACUAUACCGUGCUCGGCGACGCCUUCAAAGAAUGCUUCGUGUCCCGGCCUCAUCCUAAGCCUAAGCAGUUCAGCAGCUUCGAGAAGCGGGCCAAGAUCUUCUGCGCCAGACAGAACUGCAGCCACGACUGGGGAAUCCACGUGAAGUACAAGACCUUCGAGAUCCCGGUCAUCAAGAUCGAGUCCUUCGUGGUGGAAGAUAUCGCCACCGGCGUGCAGACCCUGUACAGCAAGUGGAAGGAUUUCCACUUCGAGAAAAUCCCUUUCGACCCCGCCGAGAUGAGCAAGUGA [서열번호 190]
그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 190에 나타낸 것과 같은 서열, 또는 그의 단편 또는 변이체를 포함한다.
상기 RNA 구조체는 적어도 하나의 치료 생체분자를 암호화하는 뉴클레오티드 서열을 포함한다. 이것은 도 1에서 관심있는 유전자(GOI)로 나타낸다.
상기 적어도 하나의 치료 생체분자는 치료 단백질을 포함할 수 있다. 숙련된 기술자는 치료 단백질이 바람직하게는 인간에서 치료 적용분야를 갖는 임의의 단백질에 관한 것임을 이해할 것이다. 상기 RNA 분자에 의해 암호화될 수 있는 예시적인 치료 생체분자는 박테리아, 바이러스, 진균, 원생동물/또는 기생충과 같은 병원체로부터 유래되는 단백질 또는 펩티드를 포함한다. 상기 단백질 또는 펩티드는 항원, 즉 숙주에서 면역 반응을 자극 또는 촉발할 수 있는 항원일 수 있다. 그러므로, 상기 적어도 하나의 치료 생체분자가 항원인 구현예에서, 제1 측면의 RNA 구조체는 백신으로 간주될 수 있다.
바이러스로부터 유래되는 단백질 또는 펩티드는 바이러스 항원일 수 있다. 상기 바이러스 항원은 오르토믹소바이러스(Orthomyxovirus); 파라믹소비리대(Paramyxoviridae) 바이러스; 메타뉴모바이러스 및 모르빌리바이러스(Morbillivirus); 뉴모바이러스; 파라믹소바이러스(Paramyxovirus); 폭스비리대(Poxviridae); 메타뉴모바이러스; 모르빌리바이러스; 피코르나바이러스; 엔테로바이러스; 분야바이러스(Bunyavirus); 플레보바이러스(Phlebovirus); 나이로바이러스(Nairovirus); 헤파르나바이러스(Heparnavirus); 토가바이러스(Togavirus); 알파바이러스; 아르테리바이러스(Arterivirus); 플라비바이러스; 페스티바이러스; 헤파드나바이러스(Hepadnavirus); 라브도바이러스(Rhabdovirus); 칼리시비리대(Caliciviridae); 코로나바이러스; 레트로바이러스(Retroviruses); 레오바이러스(Reovirus); 파르보바이러스(Parvovirus); 델타 간염 바이러스(HDV); E형 간염 바이러스(HEV); 인간 헤르페스바이러스 및 파포바바이러스(Papovavirus);로 이루어진 군으로부터 선택되는 바이러스로부터 유래될 수 있다.
상기 오르토믹소바이러스는 인플루엔자(인플루엔자) A, B 및 C일 수 있다. 상기 파라믹소비리대 바이러스는 뉴모바이러스(RSV), 파라믹소바이러스(PIV)일 수 있다. 상기 메타뉴모바이러스는 모르빌리바이러스(예컨대, 홍역)일 수 있다. 상기 뉴모바이러스는 호흡기 세포융합 바이러스(RSV), 소 호흡기 세포융합 바이러스, 마우스의 폐렴 바이러스, 또는 터키 비기관염(rhinotracheitis) 바이러스일 수 있다. 상기 파라믹소바이러스는 파라인플루엔자(Parainfluenza) 바이러스 타입 1 - 4(PIV), 볼거리, 센다이 바이러스, 원숭이 바이러스 5, 소 파라인플루엔자 바이러스, 니파바이러스, 헤니파바이러스(Henipavirus) 또는 뉴캐슬(Newcastle) 질환 바이러스일 수 있다. 상기 폭스비리대는 바리올라 베라(Variola vera), 예를 들면 바리올라 메이저(major) 및 바리올라 마이너(minor)일 수 있다. 상기 메타뉴모바이러스는 인간 메타뉴모바이러스(hMPV) 또는 조류 메타뉴모바이러스(aMPV)일 수 있다. 상기 모르빌리바이러스는 홍역일 수 있다. 상기 피코르나바이러스는 엔테로바이러스, 리노바이러스, 헤파르나바이러스, 파레코바이러스(Parechovirus), 카디오바이러스(Cardiovirus) 및 아프토바이러스(Aphthovirus)일 수 있다. 상기 엔테로바이러스는 폴리오바이러스 타입 1, 2 또는 3, 콕사키 A 바이러스 타입 1 내지 22 및 24, 콕사키 B 바이러스 타입 1 내지 6, 에코바이러스(Echovirus)(ECHO) 바이러스) 타입 1 내지 9, 11 내지 27 및 29 내지 34 또는 엔테로바이러스 68 내지 71일 수 있다. 상기 분야바이러스는 캘리포니아 뇌염 바이러스일 수 있다. 상기 플레보바이러스는 리프트 계곡열 바이러스일 수 있다. 상기 나이로바이러스는 크림-콩고(Crimean-Congo) 출혈열 바이러스일 수 있다. 상기 헤파르나바이러스는 A형 간염 바이러스(HAV)일 수 있다. 상기 토가바이러스는 루비바이러스일 수 있다. 상기 플라비바이러스는 진드기-매개 뇌염(TBE) 바이러스, 뎅기(타입 1, 2, 3 또는 4) 바이러스, 황열 바이러스, 일본 뇌염 바이러스, 키아사누 삼림 바이러스(Kyasanur Forest Virus), 웨스트 나일 뇌염 바이러스, 세인트 루이스 뇌염 바이러스, 러시아 봄-여름 뇌염 바이러스 또는 포와산(Powassan) 뇌염 바이러스일 수 있다. 상기 페스티바이러스는 소 바이러스 설사(BVDV), 고전적 돼지 열병(CSFV) 또는 보더 질환(BDV)일 수 있다. 상기 헤파드나바이러스는 B형 간염 바이러스 또는 C형 간염 바이러스일 수 있다. 상기 라브도바이러스는 리사바이러스(Lyssavirus)(광견병 바이러스) 또는 수포성바이러스(Vesiculovirus)(VSV)일 수 있다. 상기 칼리시비리대는 노워크(Norwalk) 바이러스, 또는 노워크-유사 바이러스, 예컨대 하와이 바이러스 및 스노우 마운틴 바이러스(Snow Mountain Virus)일 수 있다. 상기 코로나바이러스는 SARS CoV-1, SARS-CoV-2, MERS, 인간 호흡기 코로나바이러스, 조류 전염성 기관지염(IBV), 마우스 간염 바이러스(MHV), 또는 돼지 전염성 위장염 바이러스(TGEV)일 수 있다. 상기 레트로바이러스는 온코바이러스(Oncovirus), 렌티바이러스(Lentivirus) 또는 스푸마바이러스(Spumavirus)일 수 있다. 상기 레오바이러스는 오르토레오(Orthoreo) 바이러스, 로타바이러스, 오르비바이러스(Orbivirus), 또는 콜티바이러스(Coltivirus)일 수 있다. 상기 파르보바이러스는 파르보바이러스 B 19일 수 있다. 상기 인간 헤르페스바이러스는 헤르페스 심플렉스 바이러스(HSV), 바리셀라-조스터(Varicella-zoster) 바이러스(VZV), 엡스테인-바 바이러스(EBV), 사이토메갈로바이러스(CMV), 인간 헤르페스바이러스 6(HHV6), 인간 헤르페스바이러스 7(HHV7), 또는 인간 헤르페스바이러스 8(HHV8)일 수 있다. 상기 파포바바이러스는 유두종 바이러스, 폴리오마바이러스(Polyomavirus), 아데노바이러스(Adenovirus) 또는 아레나바이러스일 수 있다.
박테리아로부터 유래되는 단백질 또는 펩티드는 박테리아 항원일 수 있다.
상기 박테리아 항원은 나이세리아 메닝기티데스(Neisseria meningitides), 스트렙토코커스 뉴모니애(Streptococcus pneumoniae), 스트렙토코커스 피로게네스(Streptococcus pyogenes), 모락셀라 카타랄리스(Moraxella catarrhalis), 보르데텔라 페르투시스(Bordetella pertussis), 부르크홀데리아 속(Burkholderia sp.)(예컨대, 부르크홀데리아 말레이, 부르크홀데리아 슈도말레이 및 부르크홀데리아 세파시아), 스타필로코커스 아우레우스(Staphylococcus aureus), 해모필러스 인쿠엔자에(Haemophilus influenzae), 클로스트리디움 테타니(Clostridium tetani)(파상풍), 클로스트리디움 퍼프린젠스(Clostridium perfringens), 클로스트리디움 보툴리눔스(Clostridium botulinums), 코리네박테리움 디프테리애(Cornynebacterium diphtheriae)(디프테리아), 슈도모나스 애루기노사(Pseudomonas aeruginosa), 레지오넬라 뉴모필라(Legionella pneumophila), 콕시엘라 부르네티이(Coxiella burnetii), 브루셀라 속(Brucella sp.)(예컨대, B. 아보르투스, B. 카니스, B. 멜리텐시스, B. 네오토마에, B. 오비스, B. 수이스 및 B. 피니페디애), 프란시셀라 속(Francisella sp.)(예컨대, F. 노비시다, F. 필로미라지아 및 F. 툴라렌시스), 스트렙토코커스 아갈락티애(Streptococcus agalactiae), 나이세리아 고노레애(Neiserria gonorrhoeae), 클라미디아 트라코마티스(Chlamydia trachomatis), 트레포네마 팔리둠(Treponema pallidum)(매독), 해모필러스 두크레이이(Haemophilus ducreyi), 엔테로코커스 패칼리스(Enterococcus faecalis), 엔테로코커스 패시움(Enterococcus faecium), 헬리코박터 파일로리(Helicobacter pylori), 스타필로코커스 사프로피티쿠스(Staphylococcus saprophyticus), 예르시니아 엔테로콜리티카(Yersinia enterocolitica), 대장균(E. coli), 바실러스 안트라시스(Bacillus anthracis)(탄저병), 예르시니아 페스티스(Yersinia pestis)(페스트), 미코박테리움 투베르쿨로시스(Mycobacterium tuberculosis), 리케챠(Rickettsia), 리스테리아(Listeria), 클라미디아 뉴모니애(Chlamydia pneumoniae), 비브리오 콜레라에(Vibrio cholerae), 살모넬라 타이피(Salmonella typhi)(장티푸스), 보렐리아 부르그도르퍼(Borrelia burgdorfer), 포르필라모나스 속(Porphyromonas sp.) 및 클레브시엘라 속(Klebsiella sp.)으로 이루어진 군으로부터 선택되는 박테리아로부터 유래될 수 있다.
진균으로부터 유래되는 단백질 또는 펩티드는 진균 항원일 수 있다.
상기 진균 항원은 에피더모피톤 코쿠숨(Epidermophyton koccusum), 마이크로스포룸 아우도우이니(Microsporum audouini), 마이크로스로품 카니스(Microsporum canis), 마이크로스포룸 디스토르툼(Microsporum distortum), 마이크로스포룸 에퀴눔(Microsporum equinum), 마이크로스포룸 기프숨(Microsporum gypsum), 마이크로스포룸 나눔(Microsporum nanum), 트리코피톤 콘센트리쿰(Trichophyton concentricum), 트리코피톤 에퀴눔(Trichophyton equinum), 트리코피톤 갈리나에(Trichophyton gallinae), 트리코피톤 기프세움(Trichophyton gypseum), 트리코피톤 메그니니(Trichophyton megnini), 트리코피톤 멘타그로피테스(Trichophyton mentagrophytes), 트리코피톤 퀸케아눔(Trichophyton quinckeanum), 트리코피톤루브룸(Trichophyton rubrum), 트리코피톤 쇼엔레이니(Trichophyton schoenleini), 트리코피톤 톤수란스(Trichophyton tonsurans), 트리코피톤 베루코숨(Trichophyton verrucosum), T 베루코숨 var. album, var. 디스코이데스, var. 오크라세움, 트리코피톤 비올라세움(Trichophyton violaceum), 및/또는 트리코피톤 파비포르메(Trichophyton faviforme)을 포함하는 피부사상균(Dermatophytres); 또는 아스페르길루스 푸미가투스(Aspergillus fumigatus), 아스페르길루스 카부스(Aspergillus kavus), 아스페르길루스 나이거(Aspergillus niger), 아스페르길루스 니둘란스(Aspergillus nidulans), 아스페르길루스 테레우스(Aspergillus terreus), 아스페르길루스 시도위(Aspergillus sydowi), 아스페르길루스 카바투스(Aspergillus kavatus), 아스페르길루스 글라우쿠스(Aspergillus glaucus), 블라스토키조마이세스 카피타투스(Blastoschizomyces capitatus), 칸디다 알비칸스(Candida albicans), 칸디다 에놀라세(Candida enolase), 칸디다 트로피칼리스(Candida tropicalis), 칸디다 글라브라타(Candida glabrata), 칸디다 크루세이(Candida krusei), 칸디다 파라프실로시스(Candida parapsilosis), 칸디다 스텔라토이데아(Candida stellatoidea), 칸디다 쿠세이(Candida kusei), 칸디다 파라쿠세이(Candida parakwsei), 칸디다 루시타니애(Candida lusitaniae), 칸디다 슈도트로피칼리스(Candida pseudotropicalis), 칸디다 구일리에르몬디(Candida guilliermondi), 클라도스포리움 카리오니이(Cladosporium carrionii), 코시디오이데스 이미티스(Coccidioides immitis), 블라스토마이세스 더마티디스(Blastomyces dermatidis), 크립토코커스 네오포르만스(Cryptococcus neoformans), 게오트리쿰 클라바툼(Geotrichum clavatum), 히스토플라스마 카프술라툼(Histoplasma capsulatum), 클레브시엘라 뉴모니애(Klebsiella pneumoniae), 마이크로스포리디아(Microsporidia), 엔세팔리토준 속(Encephalitozoon spp.), 세프타타 인테스티날리스(Septata intestinalis) 및 엔테로시토준 비에네우시(Enterocytozoon bieneusi); 브라키올라 속(Brachiola spp.), 마이크로스포룸 속(Microsporidium spp.), 노세마 속(Nosema spp.), 플레이스토포라 속(Pleistophora spp.), 트라키플레이스토포라 속(Trachipleistophora spp.), 비타포르마 속(Vittaforma spp.), 파라코시디오이데스 브라실리엔시스(Paracoccidioides 브라질iensis), 뉴모시스티스 카리니이(Pneumocystis carinii), 피티움 인시디오숨(Pythiumn insidiosum), 피티로스포룸 오발레(Pityrosporum ovale), 사카로마이세스 세레비지애(Sacharomyces cerevisiae), 사카로마이세스 보울라르디이(Saccharomyces boulardii), 사카로마이세스 폼베(Saccharomyces pombe), 세도스포리움 아디오스페룸(Scedosporium apiosperum), 스포로트릭스 센키이(Sporothrix schenckii), 트리코스포론 베이겔리이(Trichosporon beigelii), 톡소플라스마 곤디이(Toxoplasma gondii), 페니실리움 마르네페이(Penicillium marneffei), 말라세지아 속(Malassezia spp.), 폰세카애 속(Fonsecaea spp.), 완지엘라 속(Wangiella spp.), 스포로트릭스 속(Sporothrix spp.), 바시디오볼루스 속(Basidiobolus spp.), 코니디오볼루스 속(Conidiobolus spp.), 리조푸스 속(Rhizopus spp.), 무코르 속(Mucor spp.), 아브시디아 속(Absidia spp.), 모르티에렐라 속(Mortierella spp.), 쿠닝가멜라 속(Cunninghamella spp.), 사크세나애 속(Saksenaea spp.), 알테르나리아 속(Alternaria spp.), 쿠르불라리아 속(Curvularia spp.), 헬민토스포리움 속(Helminthosporium spp.), 푸사리움 속(Fusarium spp.), 아스페르길루스 속(Aspergillus spp.), 페니실리움 속(Penicillium spp.), 모놀리니아 속(Monolinia spp.), 리조크토니아 속(Rhizoctonia spp.), 패실로마이세스 속(Paecilomyces spp.), 피토마이세스 속(Pithomyces spp.), 및 클라도스포리움 속(Cladosporium spp.)으로 이루어진 군으로부터 선택되는 진균으로부터 유래될 수 있다.
원생동물로부터 유래되는 단백질 또는 펩티드는 원생동물 항원일 수 있다.
상기 원생동물 항원은 엔타모에바 히스톨리티카(Entamoeba histolytica), 지아르디아 람블리(Giardia lambli), 크립토스포리디움 파르붐(Cryptosporidium parvum), 시클로스포라 카야타넨시스(Cyclospora cayatanensis) 및 톡소플라스마(Toxoplasma)로 이루어진 군으로부터 선택되는 원생동물로부터 유래될 수 있다.
상기 치료 생체분자는 식물로부터 유래되는 단백질 또는 펩티드일 수 있다. 바람직하게는, 상기 단백질 또는 펩티드는 식물 항원이다. 예를 들면, 상기 식물 항원은 리시누스 코무니스(Ricinus communis)로부터 유래될 수 있다.
다른 구현예에서, 상기 치료 생체분자는 면역원 또는 항원일 수 있다. 바람직하게는, 상기 면역원 또는 항원은 종양 면역원 또는 항원, 또는 암 면역원 또는 항원이다. 상기 종양 면역원 및 항원은 펩티드-함유 종양 항원, 예컨대 폴리펩티드 종양 항원 또는 당단백질 종양 항원일 수 있다.
상기 종양 항원은 (a) 암 세포와 연관된 전장(full length) 분자, (b) 결실, 부가 및/또는 치환된 부분을 갖는 분자를 포함하는 상동체(homolog) 및 그의 변형된 형태, 및 (c) 그의 단편일 수 있다.
적합한 종양 면역원은 CD8+ 림프구에 의해 인식되는 클래스(class) I-제한(restricted) 항원 또는 CD4+ 림프구에 의해 인식되는 클래스 II-제한 항원을 포함한다.
상기 종양 항원은 고환암, 흑색종, 폐암, 두경부암, NSCLC, 유방암, 위장암, 방광암, 결장직장암, 췌장암, 림프종, 백혈병, 신장암, 간암, 난소암, 위암 및 전립선암으로 이루어진 군으로부터 선택되는 암과 연관된 항원일 수 있다.
상기 종양 항원은 다음으로부터 선택될 수 있다:
(a) 암-고환 항원, 예컨대 NY-ESO-I, SSX2, SCP-1뿐만 아니라 RAGE, BAGE, GAGE 및 MAGE 패밀리 폴리펩티드, 예를 들면, (예를 들면, 흑색종, 폐, 두경부, NSCLC, 유방, 위장, 및 방광 종양을 다루기 위해 사용될 수 있는) GAGE-I, GAGE-2, MAGE-I, MAGE-2, MAGE-3, MAGE-4, MAGE-5, MAGE-6, 및 MAGE-12;
(b) 돌연변이된 항원, 예를 들면, (다양한 고형 종양, 예컨대, 결장직장암, 폐암, 두경부암과 연관되는) p53, (예컨대, 흑색종, 췌장암 및 결장직장암과 연관되는) p21/Ras, (예컨대, 흑색종과 연관되는) CDK4, (예컨대, 흑색종과 연관되는) MUM-1, (예컨대, 두경부암과 연관되는) 카스파아제(caspase)-8, (예컨대, 방광암과 연관되는) CIA 0205, HLA-A2-R1701, (예컨대, 흑색종과 연관되는) 베타 카테닌, (예컨대, T-세포 비-호지킨 림프종과 연관되는) TCR, (예컨대, 만성 골수성 백혈병과 연관되는) BCR-abl, 트리오스포스페이트 아이소머라아제(triosephosphate isomerase), KIA 0205, CDC-27, 및 LDLR-FUT;
(c) 과발현된 항원, 예를 들면, (예컨대, 결장직장암과 연관되는) 갈렉틴(Galectin) 4, (예컨대, 호지킨 질환과 연관되는) 갈렉틴 9, (예컨대, 만성 골수성 백혈병과 연관되는) 프로티나아제 3, (예컨대, 다양한 백혈병과 연관되는) WT 1, (예컨대, 신장암과 연관되는) 탄산 탈수효소(carbonic anhydrase), (예컨대, 폐암과 연관되는) 알돌라아제(aldolase) A, (예컨대, 흑색종과 연관되는) PRAME, (예컨대, 유방암, 결장암, 폐암 및 난소암과 연관되는) HER-2/neu, (예컨대, 간암과 연관되는) 알파-페토단백질, (예컨대, 결장직장암과 연관되는) KSA, (예컨대, 췌장암 및 위암과 연관되는) 가스트린(gastrin), 텔로머라아제 촉매화 단백질, (예컨대, 유방암 및 난소암과 연관되는) MUC-I, (예컨대, 신장 세포 암종과 연관되는) G-250, (예컨대, 유방암, 결장암과 연관되는) p53, 및 (예컨대, 유방암, 폐암, 및 결장직장암과 같은 위장관의 암과 연관되는) 암태아성 항원;
(d) 공유된 항원, 예를 들면, (예컨대, 흑색종과 연관되는) 흑색종-멜라닌세포 분화 항원, 예컨대 MART-1/멜란(Melan) A, gp100, MClR, 멜라닌세포-자극 호르몬 수용체, 티로시나아제, 티로시나아제 관련 단백질-1/TRPl 및 티로시나아제 관련 단백질-2/TRP2;
(e) 예컨대, 전립선 암과 연관되는 전립선-연관 항원, 예컨대 PAP, PSA, PSMA, PSH-Pl, PSM-Pl, PSM-P2; 및/또는
(f) (예를 들면, 골수종 및 B 세포 림프종과 연관되는) 면역글로불린 이디오타입(idiotype).
상기 치료 생체분자는 진핵생물 단백질 또는 펩티드일 수 있다. 한 구현예에서, 상기 진핵생물 단백질 또는 펩티드는 포유동물 단백질 또는 펩티드이다. 상기 포유동물 단백질 또는 펩티드는 효소; 효소 억제제; 호르몬; 면역 시스템 단백질; 수용체; 결합 단백질; 전사 인자; 번역 인자; 종양 성장 저해 단백질; 구조 단백질 및 혈액 단백질;로 이루어진 군으로부터 선택될 수 있다.
상기 면역 시스템 단백질은 항체 또는 그의 항원 결합 단편일 수 있다. 따라서, 상기 치료 생체분자는 항체 또는 그의 항원 결합 단편일 수 있다. 상기 항원 결합 단편은 개별 중쇄 또는 경쇄, 또는 그의 단편, 예컨대 VL, VH 및 Fd; 1가(monovalent) 단편, 예컨대 Fv, Fab, 및 Fab'; 2가(bivalent) 단편, 예컨대 F(ab')2; 단일쇄 Fv(scFv); 하나 이상의 상보성 결정 영역(CDR); 또는 Fc 단편;을 포함할 수 있다.
상기 효소는 키모신; 위 리파아제; 조직 플라스미노겐 활성화제; 스트렙토키나아제; 콜레스테롤 생합성 또는 분해성 스테로이드생성 효소; 키나아제; 포스포디에스테라아제; 메틸라아제; 탈-메틸라아제; 탈수소효소; 셀룰라아제; 프로테아제; 리파아제; 포스포리파아제; 아로마타아제; 시토크롬; 아데닐레이트 또는 구아닐레이트 시클라아제 및 뉴라미다아제;로 이루어진 군으로부터 선택될 수 있다.
상기 효소 억제제는 메탈로프로티나아제(TIMP)의 조직 억제제일 수 있다. 상기 호르몬 성장 호르몬일 수 있다.
상기 면역 시스템 단백질은 사이토카인; 케모카인; 림포카인; 에리트로포이에틴; 인테그린; 아드레신; 셀렉틴; 귀소(homing) 수용체; T 세포 수용체 및 면역글로불린;으로 이루어진 군으로부터 선택될 수 있다.
상기 사이토카인은 인터루킨, 예를 들면 IL-2, IL-4 및/또는 IL-6, 콜로니 자극 인자(CSF), 과립구 콜로니 자극 인자(G-CSF), 과립구-대식세포 콜로니 자극 인자(GM-CSF) 또는 종양 괴사 인자(TNF)일 수 있다.
상기 케모카인은 대식세포 염증성 단백질-2 및/또는 플라스미노겐 활성화제일 수 있다.
상기 림포카인은 인터페론일 수 있다.
상기 면역글로불린은 자연형, 변형된 또는 키메라 면역글로불린 또는 그의 단편일 수 있다. 바람직하게는, 상기 면역글로불린은 이중 활성을 갖는 키메라 면역글로불린, 예컨대 항체 효소 또는 항체-독소 키메라이다.
상기 호르몬은 인슐린, 갑상선 호르몬, 카테콜아민, 고나도트로핀, 자극 호르몬, 프로락틴, 옥시토신, 도파민, 소 소마토트로핀, 렙틴; 성장 호르몬(예컨대, 인간 성장 호르몬), 성장 인자(예컨대, 표피 성장 인자, 신경 성장 인자, 인슐린-유사 성장 인자 등);로 이루어진 군으로부터 선택될 수 있다.
상기 수용체는 스테로이드 호르몬 수용체 또는 펩티드 수용체일 수 있다. 바람직하게는, 상기 수용체는 성장 인자 수용체이다.
상기 결합 단백질은 성장 인자 결합 단백질일 수 있다.
상기 종양 성장 저해 단백질은 혈관형성을 억제하는 단백질일 수 있다.
상기 구조 단백질은 콜라겐; 피브로인; 피브리노겐; 엘라스틴; 튜불린; 액틴; 및 미오신;으로 이루어진 군으로부터 선택될 수 있다.
상기 혈액 단백질은 트롬빈; 혈청 알부민; 인자 VII; 인자 VIII; 인슐린; 인자 IX; 인자 X; 조직 플라스미노겐 활성화제; 단백질 C; 본 빌레브란트 인자; 항-트롬빈 III; 글루코세레브로시다아제; 에리트로포이에틴 과립구 콜로니 자극 인자(GCSF) 또는 변형된 인자 VIII; 및 항응고제;로 이루어진 군으로부터 선택될 수 있다.
한 바람직한 구현예에서, 상기 치료 생체분자는 림프 항상성을 조절할 수 있는 사이토카인, 바람직하게는 T 세포의 발생, 프라이밍, 확장, 분화 및/또는 생존에 수반되고, 바람직하게는 이를 유도 또는 향상시키는 사이토카인이다. 따라서, 바람직하게는 상기 사이토카인은 인터루킨이다. 가장 바람직하게는, IL-2, IL-7, IL-12, IL-15, 또는 IL-21이다.
상기 치료 생체분자는 줄기 세포 특징을 갖는 세포로 체세포의 재프로그래밍(reprogramming)을 향상시킬 수 있는 단백질일 수 있다. 줄기 세포 특징을 갖는 세포로 체세포의 재프로그래밍을 향상시킬 수 있는 단백질은 OCT4, SOX2, NANOG, LIN28, p53, ART-4, BAGE, ss-카테닌/m, Bcr-abL CAMEL, CAP-1, CASP-8, CDC27/m, CD 4/m, CEA, CLAUDIN-12, c-MYC, CT, Cyp-B, DAM, ELF2M, ETV6-AML1, G250, GAGE, GnT-V, Gap100, HAGE, HER-2/neu, HPV-E7, HPV-E6, HAST-2, hTERT(또는 hTRT), LAGE, LDLR/FUT, MAGE-A, MAGE-B, MAGE-C, MART-1/멜란-A, MC1R, 마이오신/m, MUC1, MUM-1, -2, -3, NA88-A, NF1, NY-ESO-1, NY-BR-1, pl90 마이너 BCR-abL, Plac-1, Pml/RARa, PRAME, 프로티나아제 3, PSA, PSM, RAGE, RU1 또는 RU2, SAGE, SART-1 또는 SART-3, SCGB3A2, SCP1, SCP2, SCP3, SSX, SURVIVIN, TEL/AML1, TPI/m, TRP-1, TRP-2, TRP-2/INT2, TPTE 및 WT로 이루어진 군으로부터 선택될 수 있고, 바람직하게는 WT-1이다.
바람직하게는, MAGE-A는 MAGE-A1, MAGE-A2, MAGE-A3, MAGE-A4, MAGE-A5, MAGE-A6, MAGE-A7, MAGE-A8, MAGE-A9, MAGE-A10, MAGE-A11, 또는 MAGE-A12로 이루어진 군으로부터 선택된다.
바람직하게는, 줄기 세포 특징을 갖는 세포로 체세포의 재프로그래밍을 향상시킬 수 있는 단백질은 OCT4, SOX2, LF4; c-MYC; NANOG; LIN28;이다.
상기 치료 생체분자는 세포-치료법 적응증을 위해 생체외(ex vivo)에서 세포를 변형하기 위해 이용될 수 있는 생체분자일 수 있다. 따라서, 바람직하게는 상기 치료 생체분자는 면역글로불린, T-세포 수용체 및 NK 수용체로 이루어진 군으로부터 선택될 수 있다.
상기 치료 생체분자는 내인성(endogenous) 숙주 유전자의 발현을 조절할 수 있는 RNA 분자, 예를 들면 간섭 RNA, 예컨대 작은 RNA, siRNA 또는 microRNA일 수 있다.
상기 적어도 하나의 비-바이러스 선천적 조절 단백질(IMP)을 암호화하는 서열은 제1 측면의 RNA 구조체 내의 임의의 곳에 배치될 수 있으며, 예컨대 상기 치료 생체분자(즉, 도 1에서의 GOI)를 암호화하는 서열은 상기 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 5' 또는 3'에 배치될 수 있다.
예를 들면, 한 구현예에서, 상기 치료 생체분자를 암호화하는 서열은 바람직하게는 상기 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 5'에 배치된다. 예를 들면, 도 1에 나타낸 saRNA 구현예 2a, 3a, 4a, 및 mRNA 구현예 6a 및 7a 참조.
그러나, 다른 구현예에서, 상기 치료 생체분자를 암호화하는 서열은 바람직하게는 상기 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 3'에 배치된다. 예를 들면, 도 1에 나타낸 saRNA 구현예 2b, 3b, 4b, 및 mRNA 구현예 6b 및 7b 참조.
바람직하게는, 제1 측면에 따른 RNA 구조체는 게놈성 또는 서브게놈성일 수 있는 적어도 하나의 프로모터(promoter)를 포함한다. 그러나, 바람직하게는 상기 프로모터는 도 1(구현예 1-4b)에 나타낸 것과 같이 서브게놈성 프로모터이다. 따라서, 바람직하게는 본 발명의 saRNA 구조체는 프로모터를 포함한다. 숙련된 기술자는 상기 서브게놈성 프로모터가 적어도 하나의 치료 생체분자 및 적어도 하나의 선천적 억제제 단백질을 암호화하는 서열에 작동가능하게 연결되어서, 상기 치료 생체분자 및 적어도 하나의 선천적 조절 단백질을 암호화하는 뉴클레오티드 서열을 전사할 수 있는 프로모터에 관한 것임을 이해할 것이다.
바람직하게는, 상기 서브게놈성 프로모터는 본 명세서에서 다음과 같은 서열번호 204로 제공되는 26S이다:
GGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACAT [서열번호 204]
따라서, 바람직하게는 (바람직하게는, 서브게놈성 프로모터일 수 있는) 상기 프로모터는 실질적으로 서열번호 204에 나타낸 것, 또는 그의 변이체 또는 단편이다.
한 구현예에서, 동일한 프로모터가 관심있는 적어도 하나의 치료 생체분자를 암호화하는 서열 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열에 작동가능하게 연결된다.
상기 치료 생체분자(즉, GOI) 및 IMP가 모두 RNA의 단일 가닥에 의해 암호화되는 본 발명자들의 디자인은 유리하게는 훨씬 적은 용량의 RNA를 사용하는 것을 가능하게 하는데, 그 이유는 단백질이 상기 RNA에 의해 감지되는 동일한 세포에서 발현되고, 또한 복제될 수 있으며, 이에 따라 선천적 조절 성분이 발현 및 증폭되는 부가적인 측면을 갖기 때문이다.
따라서, 상기 RNA 구조체의 한 구현예에서, 상기 프로모터는 적어도 하나의 치료 생체분자를 암호화하는 서열 및 적어도 하나의 선천적 억제제 단백질을 암호화하는 서열의 5'에 배치되어서, 상기 프로모터는 양쪽 서열 모두에 작동가능하게 연결되고, 이로 인해 양쪽 모두의 발현을 구동한다.
그러나, 다른 구현예에서, 제1 프로모터는 관심있는 적어도 하나의 치료 생체분자를 암호화하는 서열에 작동가능하게 연결되고, 제2 프로모터는 적어도 하나의 선천적 억제제 단백질을 암호화하는 서열에 작동가능하게 연결된다.
상기 RNA 구조체는 적어도 2개, 3개, 4개 또는 5개 IMP를 암호화할 수 있다. 선천적 조절 단백질을 암호화하는 서열이 1개 이상인 구현예에서, 단일 프로모터는 선천적 조절 단백질을 암호화하는 모든 서열에 작동가능하게 연결될 수 있다. 대안적으로, 프로모터는 선천적 조절 단백질을 암호화하는 각각의 서열에 연결되어서, 각각의 선천적 조절 단백질이 별도의 프로모터에 작동가능하게 연결될 수 있다. 상기 구현예에서, 별도의 프로모터는 동일한 프로모터 서열 또는 상이한 프로모터 서열을 포함할 수 있다. 다른 구현예에서, 상이한 프로모터는 선천적 조절 단백질을 암호화하는 각각의 서열에 작동가능하게 연결된다.
상기 RNA 구조체는 관심있는 적어도 하나의 치료 생체분자를 암호화하는 서열 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열 사이에 배치된 링커(linker) 서열을 추가로 포함할 수 있다. 상기 링커 서열은 단일 프로모터로부터 IMP의 생산 및 치료 분자의 생산을 가능하게 한다. 한 구현예에서, 상기 링커 서열은 번역 후에 소화되거나 절단됨으로써 숙주 세포에서 적어도 하나의 치료 생체분자 및 적어도 하나의 선천적 조절 단백질을 분리하도록 구성되는 펩티드 링커를 암호화한다. 이와 같이, 상기 링커 서열은 바람직하게는 절단 부위를 형성할 수 있는 절단가능한 펩티드, 예를 들면 2A 펩티드이다. [Furler S, Paterna J-C, Weibel M and Bueler H Recombinant AAV vectors containing the foot and mouth disease virus 2A sequence confer efficient bicistronic gene expression in cultured cells and rat substantia nigra neurons Gene Ther. 2001, vol. 8, PP: 864-873].
바람직하게는, 상기 2A 펩티드 서열을 암호화하는 링커 서열은 2개의 코딩 서열을 함께 연결한다. 이것은 상기 RNA 구조체가 다양한 벡터에서의 발현시에 일어날 수 있는 크기 제한을 극복할 수 있게 하고, 제1 측면의 RNA 구조체에 의해 암호화되는 모든 펩티드의 발현 및 번역이 단일 프로모터의 제어 하에 단일 단백질로서 일어날 수 있게 한다. 따라서, 상기 IMP, 2A 펩티드, 및 치료 생체분자의 서열을 포함하는 단일 단백질의 번역 이후에, 말단 글리신-프롤린 연결부에서 바이러스 2A 펩티드 서열의 절단이 일어나고, 이로 인해 2개의 폴리펩티드가 자유롭게 된다.
상기 2A 스페이서(spacer) 서열은 [Wang Y et al. Scientific Reports 2015, 5]에 개시된 것과 같이 E2A, F2A, P2A 및 T2A로 나타내는 서열을 포함하는 임의의 공지된 변이체일 수 있으며, 즉 적합한 2A 펩티드는 돼지 테스코바이러스(teschovirus)-1 2A(P2A) - ATNFSLLKQAGDVEENPGP(서열번호 205), 토세아 아시그나(thosea asigna) 바이러스 2A(T2A) - QCTNYALLKLAGDVESNPGP(서열번호 206), 말 비염 A 바이러스 2A(E2A), 및 구제역 바이러스 2A(F2A) - VKQTLNFDLLKLAGDVESNPGP(서열번호 207)를 포함한다. 바람직하게는, 상기 2A 펩티드는 토세아 아시그나 바이러스 2A(T2A)이다.
다른 구현예에서, 상기 절단가능한 펩티드는 자가-절단 펩티드이다. 구현예에서, 상기 링커는 바이러스 2A 펩티드 스페이서를 포함하고, 푸린(furin) 절단 부위를 추가로 포함한다. 바람직하게는, 상기 자가-절단 펩티드는 푸린/2A 펩티드이다. 상류에 푸린 절단 부위를 삽입하면, 그렇지 않을 경우 상류 단백질에 부착되어 남아있게 되는 2A 잔기의 제거를 가능하게 한다.
상기 푸린 서열은 상기 2A 서열의 3' 또는 5'에 배치될 수 있다. 그러나, 바람직하게는 상기 푸린 서열은 상기 2A 서열의 5'에 배치되고, 바람직하게는 GSG 스페이서가 상기 푸린 및 2A 서열 사이에 배치된다.
숙련된 기술자는 푸린이 특이적 인식 서열(정규적으로 R-X-R/K/X-R(서열번호 208))에서 전구체 단백질을 절단하고, 최종 R 후에 전구단백질을 절단하는 (주로 골지 및 트랜스-골지 네트워크 내의) 분비 경로에 위치하는 편재성(ubiquitous) 칼슘-의존형 전구단백질 컨버타아제(convertase)임을 인식할 것이다. 따라서, 한 구현예에서, 상기 푸린 서열은 R-X-R/K/X-R이다. 그러나, 바람직하게는 상기 푸린 서열은 최적화된 서열 RRRRRR(서열번호 209) GSG 서열이다. 5개의 R 변이체 구현예가 또한 예측된다. 바람직하게는, 상기 GSG 상기 푸린 서열의 3' 및 상기 2A 서열의 5'에 배치된다.
따라서, 바람직하게는 상기 스페이서 서열은 NCBI 참조 서열: 진뱅크: AAC97195.1에 의해 제공되고, 본 명세서에서 다음과 같은 서열번호 210으로 제공되는 푸린/T2A이다:
RRRRRRGSGEGRGSLLTCGDVEENPGP [서열번호 210]
그러므로, 바람직하게는 상기 스페이서 서열은 서열번호 210에 나타낸 것과 같은 아미노산 서열, 또는 그의 변이체 또는 단편을 포함한다. 도 1은 상기 GOI 및 IMP가 푸린-T2a 절단 부위를 암호화하는 뉴클레오티드 서열에 의해 연결되는 구현예 2a, 2b 및 6a, 6b를 보여준다. 한 구현예에서, 도 1에서 2a 또는 6a로 나타낸 것과 같이, 상기 F-T2a 절단 부위는 5' GOI 및 3' IMP를 분리시킨다. 한 구현예에서, 도 1에서 2b 또는 6b로 나타낸 것과 같이, 상기 F-T2a 절단 부위는 3' GOI 및 5' IMP를 분리시킨다.
RNA 구조체 또는 레플리콘이 선천적 조절 단백질을 암호화하는 하나 이상의 서열을 포함하는 구현예에서, 상기 구조체는 선천적 조절 단백질을 암호화하는 각각의 서열 사이, 또는 일부 IMP 사이에만 배치되는 링커 서열을 포함할 수 있다.
한 구현예에서, 상기 적어도 하나의 치료 생체분자를 암호화하는 서열 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열은, 그 서열이 무엇이든 간에(즉, 도 1에서 구현예 3a, 3b, 7a 또는 7b에 나타낸 것과 같은 GOI 또는 IMP), 종결 코돈에 의해 분리되고, 이어서 하류 서열의 번역을 시작할 수 있는 내부 리보솜 도입 부위(IRES) 서열이 뒤따를 수 있으며, 그 서열은 어느 것이나 될 수 있다. 따라서, 바람직하게는 상기 IRES 서열은 상기 적어도 하나의 치료 생체분자를 암호화하는 서열 및 상기 적어도 하나의 선천적 조절 단백질을 암호화하는 서열 사이에 배치된다. 적어도 하나의 선천적 조절 단백질을 암호화하는 서열이 다수 사용된다면, 링커 서열은 공지된 절단 서열 및/또는 IRES 서열의 조합을 포함할 수 있다. 한 구현예에서, 도 1에서 3a 또는 7a로 나타낸 것과 같이, 상기 IRES 분위는 5' GOI 및 3' IMP를 분리한다. 한 구현예에서, 도 1에서 3b 또는 7b로 나타낸 것과 같이, 상기 IRES 부위는 3' GOI 및 5' IMP를 분리한다.
구현예에서, 상기 IRES는 피코르나바이러스 IRES이다. 다른 전형적인 IRES 서열은 뇌심근염 바이러스(EMCV) 또는 혈관 내피 성장 인자 및 타입 1 콜라겐-유도성 단백질(VCIP)의 IRES 서열과 같은 것을 포함하며, 본 기술분야의 기술자에게 알려져 있다.
다른 구현예에서, 상기 IRES는 리노바이러스 IRES, A형 간염 바이러스 IRES, C형 간염 바이러스 IRES, 폴리오바이러스 IRES, 엔테로바이러스 IRES, 카디오바이러스(cardiovirus) IRES, 아프토바이러스(aphthovirus) IRES, 플라비바이러스 IRES, 페스티바이러스 IRES, 크리파바이러스(cripavirus) IRES, 로팔로시품 파디 바이러스(rhopalosiphum padi virus) IRES, 또는 임의의 적합한 IRES로부터 선택될 수 있다. 특히, 상기 IRES는 실험적으로 검증된 IRES 구조의 데이터베이스를 제공하는 "IRESite"(http://www.iresite.org/)에 의해 기술되거나, "New Messenger RNA Research Communications" (ISBN: 1-60021-488-6)에 개시된 것과 같은 임의의 IRES일 수 있다.
바람직한 구현예에서, 상기 IRES는 다음과 같은 서열번호 211에 나타낸 것과 같을 수 있는 구제역 바이러스(FMDV) IRES, 또는 그의 단편 또는 변이체이다:
AGCAGGTTTCCCCAACTGACACAAAACGTGCAACTTGAAACTCCGCCTGGTCTTTCCAGGTCTAGAGGGGTAACACTTTGTACTGCGTTTGGCTCCACGCTCGATCCACTGGCGAGTGTTAGTAACAGCACTGTTGCTTCGTAGCGGAGCATGACGGCCGTGGGAACTCCTCCTTGGTAACAAGGACCCACGGGGCCAAAAGCCACGCCCACACGGGCCCGTCATGTGTGCAACCCCAGCACGGCGACTTTACTGCGAAACCCACTTTAAAGTGACATTGAAACTGGTACCCACACACTGGTGACAGGCTAAGGATGCCCTTCAGGTACCCCGAGGTAACACGCGACACTCGGGATCTGAGAAGGGGACTGGGGCTTCTATAAAAGCGCTCGGTTTAAAAAGCTTCTATGCCTGAATAGGTGACCGGAGGTCGGCACCTTTCCTTTGCAATTACTGACCAC [서열번호 211]
다른 바람직한 구현예에서, 상기 IRES는 뇌심근염 바이러스(EMCV) IRES이다. 상기 EMCV IRES는 다음과 같은 서열번호 212에 나타낸 것, 또는 그의 단편 또는 변이체일 수 있다:
CGTTACTGGCCGAAGCCGCTTGGAATAAGGCCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATGTGAGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAGCAGTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTGGCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCCCCTCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTTCATGTGTTTAGTCGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGATAATA [서열번호 212]
따라서, 바람직하게는 상기 IRES는 실질적으로 서열번호 211 또는 212에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
대안적으로, IRES 또는 2A 링커 대신에, 상기 링커 서열은 상기 치료 생체분자 및 IMP 모두를 단일 폴리펩티드 사슬로서 발현하게 하는 유연성 링커를 암호화하는 서열을 포함할 수 있지만, 상기 치료 생체분자 및 IMP는 독립적인 단백질로서 작용한다. 그러므로, 상기 단백질들은 이들이 개별적으로 발현된 것과 동일한 방식으로 그 효과를 발휘한다.
상기 유연성 링커 서열은 WO 2013/061076 A1(Oxford Biomedica)에 개시된 것과 같을 수 있다. 상기 유연성 링커 서열은 본 명세서에서 다음과 같은 서열번호 213, 또는 그의 단편 또는 변이체로 나타낼 수 있다:
GGAGGTGGCGGGTCCGGGGGCGGGGGTAGCGGTGGCGGGGGCTCC [서열번호 213]
따라서, 바람직하게는 상기 유연성 링커 서열은 실질적으로 서열번호 213에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
한 바람직한 구현예에서, 상기 유연성 링커 서열은 본 명세서에서 아래에 나타낸 것과 같은 서열번호 214로 나타낸 아미노산 서열, 또는 그의 단편 또는 변이체를 암호화하는 뉴클레오티드 서열을 포함한다:
GGGGSGGGGSGGGGS [서열번호 214]
따라서, 바람직하게는 상기 유연성 링커 서열은 실질적으로 서열번호 214에 나타낸 것과 같은 아미노산 서열, 또는 그의 단편 또는 변이체를 암호화한다.
또 다른 구현예에서, 상기 적어도 하나의 치료 생체분자 및 적어도 하나의 선천적 억제제 단백질을 암호화하는 서열은 종결 코돈에 의해 분리되고, 이어서 하류 서열의 전사를 시작할 수 있는 제2 서브게놈성 프로모터 서열이 뒤따를 수 있다. 상기 구현예의 예는 도 1, 구현예 4a 및 4b에서 실증된다.
(바람직하게는 saRNA 구조체일 때) 상기 RNA 구조체는 적어도 하나의 치료 생체분자 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 5' 또는 3'에 배치된 적어도 하나의 비-구조 단백질(NSP)을 암호화할 수 있다. 바람직하게는, 상기 적어도 하나의 NSP를 암호화하는 서열은 치료 생체분자 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 5'에 배치된다. 따라서, 바람직하게는 상기 적어도 하나의 NSP를 암호화하는 서열은 상기 RNA 구조체의 5' 말단에 배치된다.
상기 RNA 구조체에 의해 암호화되는 적어도 하나의 비-구조 단백질은 RNA 폴리머라아제 NSP4일 수 있다. 상기 하나 이상의 비-구조 단백질은 바람직하게는 레플리카아제르 암호화한다. 바람직하게는, 상기 구조체는 NSP1, NSP2, NSP3 및 NSP4를 암호화한다. 숙련된 기술자는 nsP1은 바이러스 캡핑 효소 및 복제 복합체(RC)의 막 앵커(anchor)이지만, NSP2는 ns 폴리단백질 처리를 담당하는 RNA 헬리카아제 및 프로테아제임을 이해할 것이다. NSP3은 몇 가지 숙주 단백질과 상호작용하고, 단백질 폴리- 및 모노-ADP-리보실화를 조정할 수 있으며, NSP4는 코어 바이러스 RNA-의존형 RNA 폴리머라아제이다.
한 구현예에서, NSP1은 본 명세서에서 다음과 같은 서열번호 215로 제공된다:
MEKVHVDIEEDSPFLRALQRSFPQFEVEAKQVTDNDHANARAFSHLASKLIETEVDPSDTILDIGSAPARRMYSKHKYHCICPMRCAEDPDRLYKYATKLKKNCKEITDKELDKKMKELAAVMSDPDLETETMCLHDDESCRYEGQVAVYQDVYAVDGPTSLYHQANKGVRVAYWIGFDTTPFMFKNLAGAYPSYSTNWADETVLTARNIGLCSSDVMERSRRGMSILRKKYLKPSNNVLFSVGSTIYHEKRDLLRSWHLPSVFHLRGKQNYTCRCETIVSCDGYVVKRIAISPGLYGKPSGYAATMHREGFLCCKVTDTLNGERVSFPVCTYVPATLCDQMTGILATDVSADDAQKLLVGLNQRIVVNGRTQRNTNTMKNYLLPVVAQAFARWAKEYKEDQEDERPLGLRDRQLVMGCCWAFRRHKITSIYKRPDTQTIIKVNSDFHSFVLPRIGSNTLEIGLRTRIRKMLEEHKEPSPLITAEDVQEAKCAADEAKEVREAEELRAALPPLAADVEEPTLEADVDLMLQEAGA [서열번호 215]
따라서, NSP1은 바람직하게는 실질적으로 서열번호 215에 나타낸 것과 같은 아미노산 서열, 또는 그의 생물학적 활성 변이체 또는 단편을 포함한다.
한 구현예에서, NSP1은 다음과 같은 서열번호 216에 정의된 것과 같은 뉴클레오티드 서열에 의해 암호화된다:
ATGGAGAAAGTTCACGTTGACATCGAGGAAGACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAATGACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACGATCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGATGTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAGGAATTGGACAAGAAAATGAAGGAGCTGGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTCCACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACAAGTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTTAAGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATAGGCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCATCCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTGCCGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTACGTCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAGGGATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCAGCTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTTGGGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCCGTAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTACGAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCGGATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTGGAGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAGGACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTACCACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAAGCCGATGTCGACTTGATGTTACAAGAGGCTGGGGCC [서열번호 216]
따라서, NSP1은 바람직하게는 실질적으로 서열번호 216에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 217로 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
AUGGAGAAAGUUCACGUUGACAUCGAGGAAGACAGCCCAUUCCUCAGAGCUUUGCAGCGGAGCUUCCCGCAGUUUGAGGUAGAAGCCAAGCAGGUCACUGAUAAUGACCAUGCUAAUGCCAGAGCGUUUUCGCAUCUGGCUUCAAAACUGAUCGAAACGGAGGUGGACCCAUCCGACACGAUCCUUGACAUUGGAAGUGCGCCCGCCCGCAGAAUGUAUUCUAAGCACAAGUAUCAUUGUAUCUGUCCGAUGAGAUGUGCGGAAGAUCCGGACAGAUUGUAUAAGUAUGCAACUAAGCUGAAGAAAAACUGUAAGGAAAUAACUGAUAAGGAAUUGGACAAGAAAAUGAAGGAGCUGGCCGCCGUCAUGAGCGACCCUGACCUGGAAACUGAGACUAUGUGCCUCCACGACGACGAGUCGUGUCGCUACGAAGGGCAAGUCGCUGUUUACCAGGAUGUAUACGCGGUUGACGGACCGACAAGUCUCUAUCACCAAGCCAAUAAGGGAGUUAGAGUCGCCUACUGGAUAGGCUUUGACACCACCCCUUUUAUGUUUAAGAACUUGGCUGGAGCAUAUCCAUCAUACUCUACCAACUGGGCCGACGAAACCGUGUUAACGGCUCGUAACAUAGGCCUAUGCAGCUCUGACGUUAUGGAGCGGUCACGUAGAGGGAUGUCCAUUCUUAGAAAGAAGUAUUUGAAACCAUCCAACAAUGUUCUAUUCUCUGUUGGCUCGACCAUCUACCACGAGAAGAGGGACUUACUGAGGAGCUGGCACCUGCCGUCUGUAUUUCACUUACGUGGCAAGCAAAAUUACACAUGUCGGUGUGAGACUAUAGUUAGUUGCGACGGGUACGUCGUUAAAAGAAUAGCUAUCAGUCCAGGCCUGUAUGGGAAGCCUUCAGGCUAUGCUGCUACGAUGCACCGCGAGGGAUUCUUGUGCUGCAAAGUGACAGACACAUUGAACGGGGAGAGGGUCUCUUUUCCCGUGUGCACGUAUGUGCCAGCUACAUUGUGUGACCAAAUGACUGGCAUACUGGCAACAGAUGUCAGUGCGGACGACGCGCAAAAACUGCUGGUUGGGCUCAACCAGCGUAUAGUCGUCAACGGUCGCACCCAGAGAAACACCAAUACCAUGAAAAAUUACCUUUUGCCCGUAGUGGCCCAGGCAUUUGCUAGGUGGGCAAAGGAAUAUAAGGAAGAUCAAGAAGAUGAAAGGCCACUAGGACUACGAGAUAGACAGUUAGUCAUGGGGUGUUGUUGGGCUUUUAGAAGGCACAAGAUAACAUCUAUUUAUAAGCGCCCGGAUACCCAAACCAUCAUCAAAGUGAACAGCGAUUUCCACUCAUUCGUGCUGCCCAGGAUAGGCAGUAACACAUUGGAGAUCGGGCUGAGAACAAGAAUCAGGAAAAUGUUAGAGGAGCACAAGGAGCCGUCACCUCUCAUUACCGCCGAGGACGUACAAGAAGCUAAGUGCGCAGCCGAUGAGGCUAAGGAGGUGCGUGAAGCCGAGGAGUUGCGCGCAGCUCUACCACCUUUGGCAGCUGAUGUUGAGGAGCCCACUCUGGAAGCCGAUGUCGACUUGAUGUUACAAGAGGCUGGGGCC [서열번호 217]
한 구현예에서, NSP2는 본 명세서에서 다음과 같은 서열번호 218로 제공된다:
GSVETPRGLIKVTSYDGEDKIGSYAVLSPQAVLKSEKLSCIHPLAEQVIVITHSGRKGRYAVEPYHGKVVVPEGHAIPVQDFQALSESATIVYNEREFVNRYLHHIATHGGALNTDEEYYKTVKPSEHDGEYLYDIDRKQCVKKELVTGLGLTGELVDPPFHEFAYESLRTRPAAPYQVPTIGVYGVPGSGKSGIIKSAVTKKDLVVSAKKENCAEIIRDVKKMKGLDVNARTVDSVLLNGCKHPVETLYIDEAFACHAGTLRALIAIIRPKKAVLCGDPKQCGFFNMMCLKVHFNHEICTQVFHKSISRRCTKSVTSVVSTLFYDKKMRTTNPKETKIVIDTTGSTKPKQDDLILTCFRGWVKQLQIDYKGNEIMTAAASQGLTRKGVYAVRYKVNENPLYAPTSEHVNVLLTRTEDRIVWKTLAGDPWIKTLTAKYPGNFTATIEEWQAEHDAIMRHILERPDPTDVFQNKANVCWAKALVPVLKTAGIDMTTEQWNTVDYFETDKAHSAEIVLNQLCVRFFGLDLDSGLFSAPTVPLSIRNNHWDNSPSPNMYGLNKEVVRQLSRRYPQLPRAVATGRVYDMNTGTLRNYDPRINLVPVNRRLPHALVLHHNEHPQSDFSSFVSKLKGRTVLVVGEKLSVPGKMVDWLSDRPEATFRARLDLGIPGDVPKYDIIFVNVRTPYKYHHYQQCEDHAIKLSMLTKKACLHLNPGGTCVSIGYGYADRASESIIGAIARQFKFSRVCKPKSSLEETEVLFVFIGYDRKARTHNSYKLSSTLTNIYTGSRLHEAGC [서열번호 218]
따라서, nsP2는 바람직하게는 실질적으로 서열번호 218에 나타낸 것과 같은 아미노산 서열, 또는 그의 생물학적 활성 변이체 또는 단편을 포함한다.
한 구현예에서, NSP2는 다음과 같은 서열번호 219에 정의된 것과 같은 뉴클레오티드 서열에 의해 암호화된다:
GGCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCTGTGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTGATAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACATGCAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAACAGGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCCAGCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTAGGGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCTCCTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTCACCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAAGGGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTATATTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTGCTCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGCACACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTACGACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAGCAGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATAATGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCTCTGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTAGCCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAAGCAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAACGTGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACTGTGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGACTCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCCCCGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGGGCAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTACCTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTCGTCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGGTTGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGACATAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTTAGCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCTGACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCCTCACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATTCTTACAAGCTTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGT [서열번호 219]
따라서, 바람직하게는 NSP2는 실질적으로 서열번호 219에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 220을 포함할 수 있다:
GGCUCAGUGGAGACACCUCGUGGCUUGAUAAAGGUUACCAGCUACGAUGGCGAGGACAAGAUCGGCUCUUACGCUGUGCUUUCUCCGCAGGCUGUACUCAAGAGUGAAAAAUUAUCUUGCAUCCACCCUCUCGCUGAACAAGUCAUAGUGAUAACACACUCUGGCCGAAAAGGGCGUUAUGCCGUGGAACCAUACCAUGGUAAAGUAGUGGUGCCAGAGGGACAUGCAAUACCCGUCCAGGACUUUCAAGCUCUGAGUGAAAGUGCCACCAUUGUGUACAACGAACGUGAGUUCGUAAACAGGUACCUGCACCAUAUUGCCACACAUGGAGGAGCGCUGAACACUGAUGAAGAAUAUUACAAAACUGUCAAGCCCAGCGAGCACGACGGCGAAUACCUGUACGACAUCGACAGGAAACAGUGCGUCAAGAAAGAACUAGUCACUGGGCUAGGGCUCACAGGCGAGCUGGUGGAUCCUCCCUUCCAUGAAUUCGCCUACGAGAGUCUGAGAACACGACCAGCCGCUCCUUACCAAGUACCAACCAUAGGGGUGUAUGGCGUGCCAGGAUCAGGCAAGUCUGGCAUCAUUAAAAGCGCAGUCACCAAAAAAGAUCUAGUGGUGAGCGCCAAGAAAGAAAACUGUGCAGAAAUUAUAAGGGACGUCAAGAAAAUGAAAGGGCUGGACGUCAAUGCCAGAACUGUGGACUCAGUGCUCUUGAAUGGAUGCAAACACCCCGUAGAGACCCUGUAUAUUGACGAAGCUUUUGCUUGUCAUGCAGGUACUCUCAGAGCGCUCAUAGCCAUUAUAAGACCUAAAAAGGCAGUGCUCUGCGGGGAUCCCAAACAGUGCGGUUUUUUUAACAUGAUGUGCCUGAAAGUGCAUUUUAACCACGAGAUUUGCACACAAGUCUUCCACAAAAGCAUCUCUCGCCGUUGCACUAAAUCUGUGACUUCGGUCGUCUCAACCUUGUUUUACGACAAAAAAAUGAGAACGACGAAUCCGAAAGAGACUAAGAUUGUGAUUGACACUACCGGCAGUACCAAACCUAAGCAGGACGAUCUCAUUCUCACUUGUUUCAGAGGGUGGGUGAAGCAGUUGCAAAUAGAUUACAAAGGCAACGAAAUAAUGACGGCAGCUGCCUCUCAAGGGCUGACCCGUAAAGGUGUGUAUGCCGUUCGGUACAAGGUGAAUGAAAAUCCUCUGUACGCACCCACCUCAGAACAUGUGAACGUCCUACUGACCCGCACGGAGGACCGCAUCGUGUGGAAAACACUAGCCGGCGACCCAUGGAUAAAAACACUGACUGCCAAGUACCCUGGGAAUUUCACUGCCACGAUAGAGGAGUGGCAAGCAGAGCAUGAUGCCAUCAUGAGGCACAUCUUGGAGAGACCGGACCCUACCGACGUCUUCCAGAAUAAGGCAAACGUGUGUUGGGCCAAGGCUUUAGUGCCGGUGCUGAAGACCGCUGGCAUAGACAUGACCACUGAACAAUGGAACACUGUGGAUUAUUUUGAAACGGACAAAGCUCACUCAGCAGAGAUAGUAUUGAACCAACUAUGCGUGAGGUUCUUUGGACUCGAUCUGGACUCCGGUCUAUUUUCUGCACCCACUGUUCCGUUAUCCAUUAGGAAUAAUCACUGGGAUAACUCCCCGUCGCCUAACAUGUACGGGCUGAAUAAAGAAGUGGUCCGUCAGCUCUCUCGCAGGUACCCACAACUGCCUCGGGCAGUUGCCACUGGAAGAGUCUAUGACAUGAACACUGGUACACUGCGCAAUUAUGAUCCGCGCAUAAACCUAGUACCUGUAAACAGAAGACUGCCUCAUGCUUUAGUCCUCCACCAUAAUGAACACCCACAGAGUGACUUUUCUUCAUUCGUCAGCAAAUUGAAGGGCAGAACUGUCCUGGUGGUCGGGGAAAAGUUGUCCGUCCCAGGCAAAAUGGUUGACUGGUUGUCAGACCGGCCUGAGGCUACCUUCAGAGCUCGGCUGGAUUUAGGCAUCCCAGGUGAUGUGCCCAAAUAUGACAUAAUAUUUGUUAAUGUGAGGACCCCAUAUAAAUACCAUCACUAUCAGCAGUGUGAAGACCAUGCCAUUAAGCUUAGCAUGUUGACCAAGAAAGCUUGUCUGCAUCUGAAUCCCGGCGGAACCUGUGUCAGCAUAGGUUAUGGUUACGCUGACAGGGCCAGCGAAAGCAUCAUUGGUGCUAUAGCGCGGCAGUUCAAGUUUUCCCGGGUAUGCAAACCGAAAUCCUCACUUGAAGAGACGGAAGUUCUGUUUGUAUUCAUUGGGUACGAUCGCAAGGCCCGUACGCACAAUUCUUACAAGCUUUCAUCAACCUUGACCAACAUUUAUACAGGUUCCAGACUCCACGAAGCCGGAUGU [서열번호 220]
따라서, 그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 220으로 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
한 구현예에서, NSP3은 본 명세서에서 다음과 같은 서열번호 221로 제공된다:
APSYHVVRGDIATATEGVIINAANSKGQPGGGVCGALYKKFPESFDLQPIEVGKARLVKGAAKHIIHAVGPNFNKVSEVEGDKQLAEAYESIAKIVNDNNYKSVAIPLLSTGIFSGNKDRLTQSLNHLLTALDTTDADVAIYCRDKKWEMTLKEAVARREAVEEICISDDSSVTEPDAELVRVHPKSSLAGRKGYSTSDGKTFSYLEGTKFHQAAKDIAEINAMWPVATEANEQVCMYILGESMSSIRSKCPVEESEASTPPSTLPCLCIHAMTPERVQRLKASRPEQITVCSSFPLPKYRITGVQKIQCSQPILFSPKVPAYIHPRKYLVETPPVDETPEPSAENQSTEGTPEQPPLITEDETRTRTPEPIIIEEEEEDSISLLSDGPTHQVLQVEADIHGPPSVSSSSWSIPHASDFDVDSLSILDTLEGASVTSGATSAETNSYFAKSMEFLARPVPAPRTVFRNPPHPAPRTRTPSLAPSRACSRTSLVSTPPGVNRVITREELEALTPSRTPSRSVSRTSLVSNPPGVNRVITREEFEAFVAQQQRFDAGA [서열번호 221]
따라서, 바람직하게는 nsP3은 실질적으로 서열번호 221에 나타낸 것과 같은 아미노산 서열, 또는 그의 생물학적 활성 변이체 또는 단편을 포함한다.
한 구현예에서, NSP3은 다음과 같은 서열번호 222에 정의된 것과 같은 뉴클레오티드 서열에 의해 암호화된다:
GCACCCTCATATCATGTGGTGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGAGGGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGACTGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGTGACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATTCCACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCTTTAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTGGCTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTGAGGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTGGAAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCCAATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCGGAAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTAAAAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAGAAGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTGGAAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCACCACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGCATAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTATCTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAGGGAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCGCGACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTTGCACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAGGAGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCAGGCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGTGCA [서열번호 222]
따라서, 바람직하게는 NSP3은 실질적으로 서열번호 222에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 223을 포함할 수 있다:
GCACCCUCAUAUCAUGUGGUGCGAGGGGAUAUUGCCACGGCCACCGAAGGAGUGAUUAUAAAUGCUGCUAACAGCAAAGGACAACCUGGCGGAGGGGUGUGCGGAGCGCUGUAUAAGAAAUUCCCGGAAAGCUUCGAUUUACAGCCGAUCGAAGUAGGAAAAGCGCGACUGGUCAAAGGUGCAGCUAAACAUAUCAUUCAUGCCGUAGGACCAAACUUCAACAAAGUUUCGGAGGUUGAAGGUGACAAACAGUUGGCAGAGGCUUAUGAGUCCAUCGCUAAGAUUGUCAACGAUAACAAUUACAAGUCAGUAGCGAUUCCACUGUUGUCCACCGGCAUCUUUUCCGGGAACAAAGAUCGACUAACCCAAUCAUUGAACCAUUUGCUGACAGCUUUAGACACCACUGAUGCAGAUGUAGCCAUAUACUGCAGGGACAAGAAAUGGGAAAUGACUCUCAAGGAAGCAGUGGCUAGGAGAGAAGCAGUGGAGGAGAUAUGCAUAUCCGACGACUCUUCAGUGACAGAACCUGAUGCAGAGCUGGUGAGGGUGCAUCCGAAGAGUUCUUUGGCUGGAAGGAAGGGCUACAGCACAAGCGAUGGCAAAACUUUCUCAUAUUUGGAAGGGACCAAGUUUCACCAGGCGGCCAAGGAUAUAGCAGAAAUUAAUGCCAUGUGGCCCGUUGCAACGGAGGCCAAUGAGCAGGUAUGCAUGUAUAUCCUCGGAGAAAGCAUGAGCAGUAUUAGGUCGAAAUGCCCCGUCGAAGAGUCGGAAGCCUCCACACCACCUAGCACGCUGCCUUGCUUGUGCAUCCAUGCCAUGACUCCAGAAAGAGUACAGCGCCUAAAAGCCUCACGUCCAGAACAAAUUACUGUGUGCUCAUCCUUUCCAUUGCCGAAGUAUAGAAUCACUGGUGUGCAGAAGAUCCAAUGCUCCCAGCCUAUAUUGUUCUCACCGAAAGUGCCUGCGUAUAUUCAUCCAAGGAAGUAUCUCGUGGAAACACCACCGGUAGACGAGACUCCGGAGCCAUCGGCAGAGAACCAAUCCACAGAGGGGACACCUGAACAACCACCACUUAUAACCGAGGAUGAGACCAGGACUAGAACGCCUGAGCCGAUCAUCAUCGAAGAGGAAGAAGAGGAUAGCAUAAGUUUGCUGUCAGAUGGCCCGACCCACCAGGUGCUGCAAGUCGAGGCAGACAUUCACGGGCCGCCCUCUGUAUCUAGCUCAUCCUGGUCCAUUCCUCAUGCAUCCGACUUUGAUGUGGACAGUUUAUCCAUACUUGACACCCUGGAGGGAGCUAGCGUGACCAGCGGGGCAACGUCAGCCGAGACUAACUCUUACUUCGCAAAGAGUAUGGAGUUUCUGGCGCGACCGGUGCCUGCGCCUCGAACAGUAUUCAGGAACCCUCCACAUCCCGCUCCGCGCACAAGAACACCGUCACUUGCACCCAGCAGGGCCUGCUCGAGAACCAGCCUAGUUUCCACCCCGCCAGGCGUGAAUAGGGUGAUCACUAGAGAGGAGCUCGAGGCGCUUACCCCGUCACGCACUCCUAGCAGGUCGGUCUCGAGAACCAGCCUGGUCUCCAACCCGCCAGGCGUAAAUAGGGUGAUUACAAGAGAGGAGUUUGAGGCGUUCGUAGCACAACAACAAUGACGGUUUGAUGCGGGUGCA [서열번호 223]
따라서, 그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 223으로 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
한 구현예에서, NSP4는 본 명세서에서 다음과 같은 서열번호 224로 제공된다:
YIFSSDTGQGHLQQKSVRQTVLSEVVLERTELEISYAPRLDQEKEELLRKKLQLNPTPANRSRYQSRKVENMKAITARRILQGLGHYLKAEGKVECYRTLHPVPLYSSSVNRAFSSPKVAVEACNAMLKENFPTVASYCIIPEYDAYLDMVDGASCCLDTASFCPAKLRSFPKKHSYLEPTIRSAVPSAIQNTLQNVLAAATKRNCNVTQMRELPVLDSAAFNVECFKKYACNNEYWETFKENPIRLTEENVVNYITKLKGPKAAALFAKTHNLNMLQDIPMDRFVMDLKRDVKVTPGTKHTEERPKVQVIQAADPLATAYLCGIHRELVRRLNAVLLPNIHTLFDMSAEDFDAIIAEHFQPGDCVLETDIASFDKSEDDAMALTALMILEDLGVDAELLTLIEAAFGEISSIHLPTKTKFKFGAMMKSGMFLTLFVNTVINIVIASRVLRERLTGSPCAAFIGDDNIVKGVKSDKLMADRCATWLNMEVKIIDAVVGEKAPYFCGGFILCDSVTGTACRVADPLKRLFKLGKPLAADDEHDDDRRRALHEESTRWNRVGILSELCKAVESRYETVGTSIIVMAMTTLASSVKSFSYLRGAPITLYG [서열번호 224]
따라서, 바람직하게는 NSP4는 실질적으로 서열번호 224에 나타낸 것과 같은 아미노산 서열, 또는 그의 생물학적 활성 변이체 또는 단편을 포함한다.
한 구현예에서, NSP4는 다음과 같은 서열번호 225에 정의된 것과 같은 뉴클레오티드 서열에 의해 암호화된다:
TACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAAGTGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGCAAGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCCATAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACCCTGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGTAACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGACATGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAACACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCAGCTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTGGAATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAAGAAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTGAATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACAAAACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGAATCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCTGAAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTTGATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTGTTGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGAGCCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTGTTGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCGGACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAGAAAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCCCTAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTGCATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAAACCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGAGGGGCCCCTATAACTCTCTACGGC [서열번호 225]
따라서, 바람직하게는 NSP4는 실질적으로 서열번호 225에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 변이체 또는 단편에 의해 암호화된다.
따라서, 상기 RNA 구조체는 다음과 같은 서열번호 226을 포함할 수 있다:
UACAUCUUUUCCUCCGACACCGGUCAAGGGCAUUUACAACAAAAAUCAGUAAGGCAAACGGUGCUAUCCGAAGUGGUGUUGGAGAGGACCGAAUUGGAGAUUUCGUAUGCCCCGCGCCUCGACCAAGAAAAAGAAGAAUUACUACGCAAGAAAUUACAGUUAAAUCCCACACCUGCUAACAGAAGCAGAUACCAGUCCAGGAAGGUGGAGAACAUGAAAGCCAUAACAGCUAGACGUAUUCUGCAAGGCCUAGGGCAUUAUUUGAAGGCAGAAGGAAAAGUGGAGUGCUACCGAACCCUGCAUCCUGUUCCUUUGUAUUCAUCUAGUGUGAACCGUGCCUUUUCAAGCCCCAAGGUCGCAGUGGAAGCCUGUAACGCCAUGUUGAAAGAGAACUUUCCGACUGUGGCUUCUUACUGUAUUAUUCCAGAGUACGAUGCCUAUUUGGACAUGGUUGACGGAGCUUCAUGCUGCUUAGACACUGCCAGUUUUUGCCCUGCAAAGCUGCGCAGCUUUCCAAAGAAACACUCCUAUUUGGAACCCACAAUACGAUCGGCAGUGCCUUCAGCGAUCCAGAACACGCUCCAGAACGUCCUGGCAGCUGCCACAAAAAGAAAUUGCAAUGUCACGCAAAUGAGAGAAUUGCCCGUAUUGGAUUCGGCGGCCUUUAAUGUGGAAUGCUUCAAGAAAUAUGCGUGUAAUAAUGAAUAUUGGGAAACGUUUAAAGAAAACCCCAUCAGGCUUACUGAAGAAAACGUGGUAAAUUACAUUACCAAAUUAAAAGGACCAAAAGCUGCUGCUCUUUUUGCGAAGACACAUAAUUUGAAUAUGUUGCAGGACAUACCAAUGGACAGGUUUGUAAUGGACUUAAAGAGAGACGUGAAAGUGACUCCAGGAACAAAACAUACUGAAGAACGGCCCAAGGUACAGGUGAUCCAGGCUGCCGAUCCGCUAGCAACAGCGUAUCUGUGCGGAAUCCACCGAGAGCUGGUUAGGAGAUUAAAUGCGGUCCUGCUUCCGAACAUUCAUACACUGUUUGAUAUGUCGGCUGAAGACUUUGACGCUAUUAUAGCCGAGCACUUCCAGCCUGGGGAUUGUGUUCUGGAAACUGACAUCGCGUCGUUUGAUAAAAGUGAGGACGACGCCAUGGCUCUGACCGCGUUAAUGAUUCUGGAAGACUUAGGUGUGGACGCAGAGCUGUUGACGCUGAUUGAGGCGGCUUUCGGCGAAAUUUCAUCAAUACAUUUGCCCACUAAAACUAAAUUUAAAUUCGGAGCCAUGAUGAAAUCUGGAAUGUUCCUCACACUGUUUGUGAACACAGUCAUUAACAUUGUAAUCGCAAGCAGAGUGUUGAGAGAACGGCUAACCGGAUCACCAUGUGCAGCAUUCAUUGGAGAUGACAAUAUCGUGAAAGGAGUCAAAUCGGACAAAUUAAUGGCAGACAGGUGCGCCACCUGGUUGAAUAUGGAAGUCAAGAUUAUAGAUGCUGUGGUGGGCGAGAAAGCGCCUUAUUUCUGUGGAGGGUUUAUUUUGUGUGACUCCGUGACCGGCACAGCGUGCCGUGUGGCAGACCCCCUAAAAAGGCUGUUUAAGCUUGGCAAACCUCUGGCAGCAGACGAUGAACAUGAUGAUGACAGGAGAAGGGCAUUGCAUGAAGAGUCAACACGCUGGAACCGAGUGGGUAUUCUUUCAGAGCUGUGCAAGGCAGUAGAAUCAAGGUAUGAAACCGUAGGAACUUCCAUCAUAGUUAUGGCCAUGACUACUCUAGCUAGCAGUGUUAAAUCAUUCAGCUACCUGAGAGGGGCCCCUAUAACUCUCUACGGC [서열번호 226]
따라서, 그러므로, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 226으로 나타낸 것과 같은 RNA 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
바람직하게는, 숙주 세포에 존재하는 단백질과 함께, 본 발명의 RNA 구조체에 의해 암호화되는 비-구조 단백질은 적어도 하나의 치료 생체분자 및 적어도 하나의 선천적 조절 단백질을 암호화하는 서열의 게놈 복제 및 전사를 위해 필요한 효소 복합체(즉, 레플리카아제)를 형성한다. 예를 들면, 상기 하나 이상의 비-구조 단백질은 상기 구조체가 관심있는 적어도 하나의 펩티드 또는 단백질(즉, 치료 생체분자) 및 적어도 하나의 선천적 조절 단백질을 암호화하는 뉴클레오티드 서열을 증폭하게 할 수 있는 폴리머라아제를 암호화할 수 있다.
상기 숙주 세포는 진핵생물 또는 원핵생물 숙주 세포일 수 있다. 바람직하게는, 상기 숙주 세포는 진핵생물 숙주 세포이다. 보다 바람직하게는, 상기 숙주 세포는 포유동물 숙주 세포이다.
상기 RNA 구조체는 적어도 하나의 비-구조 단백질의 5'에 배치된 프로모터를 추가로 포함할 수 있으며, 상기 프로모터는 적어도 하나의 비-구조 단백질을 암호화하는 서열에 작동가능하게 연결되고, 숙주 세포에서 적어도 하나의 비-구조 단백질을 발현할 수 있다.
바람직하게는, 상기 RNA 구조체는 본 명세서에서 다음과 같은 서열번호 227로 나타낼 수 있는 5' UTR 보존된 서열 요소를 포함한다:
AUGGGCGGCGCAUGAGAGAAGCCCAGACCAAUUACCUACCCAAA [서열번호 227]
따라서, 바람직하게는 상기 UTR은 적어도 하나의 비-구조 단백질의 5'에 배치되고, 실질적으로 서열번호 227에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
바람직하게는, 상기 RNA 구조체는 본 명세서에서 다음과 같은 서열번호 228로 나타낼 수 있는 3' UTR 보존된 서열 요소를 포함한다:
AAUUGGCAAGCUGCUUACAUAGAACUCGCGGCGAUUGGCAUGCCGCCUUAAAAUUUUUAUUUUAUUUUUCUUUUCUUUUCCGAAUCGGAUUUUGUUUUUAAUAUUUCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA [서열번호 228]
따라서, 바람직하게는 상기 3' UTR은 상기 적어도 하나의 비-구조 단백질의 3'에 배치되고, 실질적으로 서열번호 228에 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
바람직하게는, 상기 RNA 구조체는 폴리A 꼬리(polyA tail)를 포함한다. 바람직하게는, 상기 폴리A 꼬리는 상기 구조체의 3' 말단에 배치된다. 상기 폴리A 꼬리는 적어도 35개 nt, 또는 적어도 40개 nt, 또는 적어도 45개 nt, 또는 적어도 50개 nt를 포함할 수 있으며, 여기서 각각의 nt는 아데닌이다. 다른 구현예에서, 상기 폴리A 꼬리는 적어도 55개 nt 또는 적어도 60개 nt를 포함할 수 있으며, 여기서 각각의 nt는 아데닌이다. 또 다른 구현예에서, 상기 폴리A 꼬리는 적어도 60개 아데닌과, 이어서 하나 이상의 비-아데닌 뉴클레오티드(즉, G, C 또는 T, 바람직하게는 구아닌), 이후 다른 적어도 35개 nt, 또는 적어도 40개 nt, 또는 적어도 45개 nt, 또는 적어도 50개 nt, 또는 적어도 55개 nt, 또는 적어도 60개 nt를 포함할 수 있으며, 여기서 각각의 nt는 아데닌이다.
상기 RNA 구조체는 5' 캡을 추가로 포함할 수 있다. 본 발명의 문맥에서, 용어 "5'-캡"은 RNA 캡 구조를 닮은 5'-캡 유사체를 포함하며, 바람직하게는 생체내 및/또는 세포에서 여기에 부착될 경우 RNA를 안정화하거나, 및/또는 RNA의 번역을 향상시키는 능력을 갖도록 변형된다.
5'-캡을 갖는 RNA는 상기 5'-캡의 존재시에 DNA 주형의 시험관내 전사에 의해 달성될 수 있으며, 여기서 상기 5'-캡은 생성된 RNA 가닥 내로 공동-전사적으로 통합되거나, 상기 RNA는, 예를 들면, 시험관내 전사에 의해 생성될 수 있고, 상기 5' -캡은 캡핑 효소, 예를 들면, 백시니아 바이러스의 캡핑 효소를 이용하여 사후-전사적으로 RNA에 부착될 수 있다. 캡핑된 RNA에서, (캡핑된) RNA 분자의 제1 염기의 3' 위치는 포스포디에스테르 결합을 통해 RNA 분자의 후속 염기("제2 염기")의 5' 위치에 연결된다.
한 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 치료 생체분자를 암호화하는 서열, 링커 서열, 및 비-바이러스 선천적 조절 단백질을 암호화하는 적어도 하나의 서열을 포함한다. 한 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-바이러스 선천적 조절 단백질을 암호화하는 서열, 링커 서열, 및 적어도 하나의 치료 생체분자를 암호화하는 서열을 포함한다. 양쪽 구현예에서, 상기 링커는 F-T2a 또는 IRES일 수 있다.
다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 서브게놈성 프로모터, 적어도 하나의 치료 생체분자를 암호화하는 서열, 링커 서열, 및 적어도 하나의 비-바이러스 선천적 조절 단백질을 암호화하는 서열을 포함한다. 다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 서브게놈성 프로모터, 적어도 하나의 비-바이러스 선천적 조절 단백질을 암호화하는 서열, 링커 서열, 및 적어도 하나의 치료 생체분자를 암호화하는 서열을 포함한다. 양쪽 구현예에서, 상기 링커는 F-T2a 또는 IRES일 수 있다.
또 다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 서브게놈성 프로모터, 적어도 하나의 치료 생체분자를 암호화하는 서열, 링커 서열, 및 적어도 하나의 비-바이러스 선천적 조절 단백질을 암호화하는 서열, 및 폴리A 꼬리를 포함한다. 또 다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 서브게놈성 프로모터, 적어도 하나의 비-바이러스 선천적 조절 단백질을 암호화하는 서열, 링커 서열, 적어도 하나의 치료 생체분자를 암호화하는 서열, 및 폴리A 꼬리를 포함한다. 양쪽 구현예에서, 상기 링커는 F-T2a 또는 IRES일 수 있다.
다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 제1 서브게놈성 프로모터, 적어도 하나의 치료 생체분자를 암호화하는 서열, 제2 서브게놈성 프로모터, 적어도 하나의 선천적 조절 단백질을 암호화하는 서열, 및 폴리A 꼬리를 포함한다. 다른 구현예에서, 상기 RNA 구조체는, 바람직하게는 5'에서 3'으로, 프로모터, 적어도 하나의 비-구조 단백질을 암호화하는 서열, 제1 서브게놈성 프로모터, 적어도 하나의 선천적 조절 단백질을 암호화하는 서열, 제2 서브게놈성 프로모터, 적어도 하나의 치료 생체분자를 암호화하는 서열, 및 폴리A 꼬리를 포함한다.
가장 바람직하게는, 상기 RNA 구조체는, 5'에서 3'으로, 5' 캡, 프로모터, NSP1, NSP2, NSP3v, NSP4, 서브게놈성 프로모터 26S, 치료 생체분자를 암호화하는 서열, 링커 서열, 비-바이러스 IMP를 암호화하는 서열, 및 폴리A 꼬리를 포함한다. 가장 바람직하게는, 상기 RNA 구조체는, 5'에서 3'으로, 5' 캡, 프로모터, NSP1, NSP2, NSP3v, NSP4, 서브게놈성 프로모터 26S, 비-바이러스 IMP를 암호화하는 서열, 링커 서열, 치료 생체분자를 암호화하는 서열; 및 폴리A 꼬리를 포함한다.
따라서, 한 구현예에서, 상기 RNA 구조체는 T7 프로모터, 5'UTR, NSP1-4, 서브게놈성 프로모터, GOI(관심있는 유전자는 치료 생체분자임), 푸린 T2A, IRF1인 IMP(ATG 및 종결 코돈으로 코돈 최적화됨 - 서열번호 5), 3'UTR, 및 폴리A 꼬리를 포함할 수 있다. 따라서, 상기 RNA 구조체는 단일 RNA 구조체 내에 서열번호 229, GOI, 및 서열번호 264를 포함하거나 이로 이루어질 수 있다. 서열번호 229 및 서열번호 264는 다음과 같다:
UAAUACGACUCACUAUAGAUGGGCGGCGCAUGAGAGAAGCCCAGACCAAUUACCUACCCAAAAUGGAGAAAGUUCACGUUGACAUCGAGGAAGACAGCCCAUUCCUCAGAGCUUUGCAGCGGAGCUUCCCGCAGUUUGAGGUAGAAGCCAAGCAGGUCACUGAUAAUGACCAUGCUAAUGCCAGAGCGUUUUCGCAUCUGGCUUCAAAACUGAUCGAAACGGAGGUGGACCCAUCCGACACGAUCCUUGACAUUGGAAGUGCGCCCGCCCGCAGAAUGUAUUCUAAGCACAAGUAUCAUUGUAUCUGUCCGAUGAGAUGUGCGGAAGAUCCGGACAGAUUGUAUAAGUAUGCAACUAAGCUGAAGAAAAACUGUAAGGAAAUAACUGAUAAGGAAUUGGACAAGAAAAUGAAGGAGCUGGCCGCCGUCAUGAGCGACCCUGACCUGGAAACUGAGACUAUGUGCCUCCACGACGACGAGUCGUGUCGCUACGAAGGGCAAGUCGCUGUUUACCAGGAUGUAUACGCGGUUGACGGACCGACAAGUCUCUAUCACCAAGCCAAUAAGGGAGUUAGAGUCGCCUACUGGAUAGGCUUUGACACCACCCCUUUUAUGUUUAAGAACUUGGCUGGAGCAUAUCCAUCAUACUCUACCAACUGGGCCGACGAAACCGUGUUAACGGCUCGUAACAUAGGCCUAUGCAGCUCUGACGUUAUGGAGCGGUCACGUAGAGGGAUGUCCAUUCUUAGAAAGAAGUAUUUGAAACCAUCCAACAAUGUUCUAUUCUCUGUUGGCUCGACCAUCUACCACGAGAAGAGGGACUUACUGAGGAGCUGGCACCUGCCGUCUGUAUUUCACUUACGUGGCAAGCAAAAUUACACAUGUCGGUGUGAGACUAUAGUUAGUUGCGACGGGUACGUCGUUAAAAGAAUAGCUAUCAGUCCAGGCCUGUAUGGGAAGCCUUCAGGCUAUGCUGCUACGAUGCACCGCGAGGGAUUCUUGUGCUGCAAAGUGACAGACACAUUGAACGGGGAGAGGGUCUCUUUUCCCGUGUGCACGUAUGUGCCAGCUACAUUGUGUGACCAAAUGACUGGCAUACUGGCAACAGAUGUCAGUGCGGACGACGCGCAAAAACUGCUGGUUGGGCUCAACCAGCGUAUAGUCGUCAACGGUCGCACCCAGAGAAACACCAAUACCAUGAAAAAUUACCUUUUGCCCGUAGUGGCCCAGGCAUUUGCUAGGUGGGCAAAGGAAUAUAAGGAAGAUCAAGAAGAUGAAAGGCCACUAGGACUACGAGAUAGACAGUUAGUCAUGGGGUGUUGUUGGGCUUUUAGAAGGCACAAGAUAACAUCUAUUUAUAAGCGCCCGGAUACCCAAACCAUCAUCAAAGUGAACAGCGAUUUCCACUCAUUCGUGCUGCCCAGGAUAGGCAGUAACACAUUGGAGAUCGGGCUGAGAACAAGAAUCAGGAAAAUGUUAGAGGAGCACAAGGAGCCGUCACCUCUCAUUACCGCCGAGGACGUACAAGAAGCUAAGUGCGCAGCCGAUGAGGCUAAGGAGGUGCGUGAAGCCGAGGAGUUGCGCGCAGCUCUACCACCUUUGGCAGCUGAUGUUGAGGAGCCCACUCUGGAgGCaGAcGUCGACUUGAUGUUACAAGAGGCUGGGGCCGGCUCAGUGGAGACACCUCGUGGCUUGAUAAAGGUUACCAGCUACGAUGGCGAGGACAAGAUCGGCUCUUACGCUGUGCUUUCUCCGCAGGCUGUACUCAAGAGUGAAAAAUUAUCUUGCAUCCACCCUCUCGCUGAACAAGUCAUAGUGAUAACACACUCUGGCCGAAAAGGGCGUUAUGCCGUGGAACCAUACCAUGGUAAAGUAGUGGUGCCAGAGGGACAUGCAAUACCCGUCCAGGACUUUCAAGCUCUGAGUGAAAGUGCCACCAUUGUGUACAACGAACGUGAGUUCGUAAACAGGUACCUGCACCAUAUUGCCACACAUGGAGGAGCGCUGAACACUGAUGAAGAAUAUUACAAAACUGUCAAGCCCAGCGAGCACGACGGCGAAUACCUGUACGACAUCGACAGGAAACAGUGCGUCAAGAAAGAACUAGUCACUGGGCUAGGGCUCACAGGCGAGCUGGUGGAUCCUCCCUUCCAUGAAUUCGCCUACGAGAGUCUGAGAACACGACCAGCCGCUCCUUACCAAGUACCAACCAUAGGGGUGUAUGGCGUGCCAGGAUCAGGCAAGUCUGGCAUCAUUAAAAGCGCAGUCACCAAAAAAGAUCUAGUGGUGAGCGCCAAGAAAGAAAACUGUGCAGAAAUUAUAAGGGACGUCAAGAAAAUGAAAGGGCUGGACGUCAAUGCCAGAACUGUGGACUCAGUGCUCUUGAAUGGAUGCAAACACCCCGUAGAGACCCUGUAUAUUGACGAAGCUUUUGCUUGUCAUGCAGGUACUCUCAGAGCGCUCAUAGCCAUUAUAAGACCUAAAAAGGCAGUGCUCUGCGGGGAUCCCAAACAGUGCGGUUUUUUUAACAUGAUGUGCCUGAAAGUGCAUUUUAACCACGAGAUUUGCACACAAGUCUUCCACAAAAGCAUCUCUCGCCGUUGCACUAAAUCUGUGACUUCGGUCGUCUCAACCUUGUUUUACGACAAAAAAAUGAGAACGACGAAUCCGAAAGAGACUAAGAUUGUGAUUGACACUACCGGCAGUACCAAACCUAAGCAGGACGAUCUCAUUCUCACUUGUUUCAGAGGGUGGGUGAAGCAGUUGCAAAUAGAUUACAAAGGCAACGAAAUAAUGACGGCAGCUGCCUCUCAAGGGCUGACCCGUAAAGGUGUGUAUGCCGUUCGGUACAAGGUGAAUGAAAAUCCUCUGUACGCACCCACCUCAGAACAUGUGAACGUCCUACUGACCCGCACGGAGGACCGCAUCGUGUGGAAAACACUAGCCGGCGACCCAUGGAUAAAAACACUGACUGCCAAGUACCCUGGGAAUUUCACUGCCACGAUAGAGGAGUGGCAAGCAGAGCAUGAUGCCAUCAUGAGGCACAUCUUGGAGAGACCGGACCCUACCGACGUCUUCCAGAAUAAGGCAAACGUGUGUUGGGCCAAGGCUUUAGUGCCGGUGCUGAAGACCGCUGGCAUAGACAUGACCACUGAACAAUGGAACACUGUGGAUUAUUUUGAAACGGACAAAGCUCACUCAGCAGAGAUAGUAUUGAACCAACUAUGCGUGAGGUUCUUUGGACUCGAUCUGGACUCCGGUCUAUUUUCUGCACCCACUGUUCCGUUAUCCAUUAGGAAUAAUCACUGGGAUAACUCCCCGUCGCCUAACAUGUACGGGCUGAAUAAAGAAGUGGUCCGUCAGCUCUCUCGCAGGUACCCACAACUGCCUCGGGCAGUUGCCACUGGAAGAGUCUAUGACAUGAACACUGGUACACUGCGCAAUUAUGAUCCGCGCAUAAACCUAGUACCUGUAAACAGAAGACUGCCUCAUGCUUUAGUCCUCCACCAUAAUGAACACCCACAGAGUGACUUUUCUUCAUUCGUCAGCAAAUUGAAGGGCAGAACUGUCCUGGUGGUCGGGGAAAAGUUGUCCGUCCCAGGCAAAAUGGUUGACUGGUUGUCAGACCGGCCUGAGGCUACCUUCAGAGCUCGGCUGGAUUUAGGCAUCCCAGGUGAUGUGCCCAAAUAUGACAUAAUAUUUGUUAAUGUGAGGACCCCAUAUAAAUACCAUCACUAUCAGCAGUGUGAAGACCAUGCCAUUAAGCUUAGCAUGUUGACCAAGAAAGCUUGUCUGCAUCUGAAUCCCGGCGGAACCUGUGUCAGCAUAGGUUAUGGUUACGCUGACAGGGCCAGCGAAAGCAUCAUUGGUGCUAUAGCGCGGCAGUUCAAGUUUUCCCGGGUAUGCAAACCGAAAUCCUCACUUGAAGAGACGGAAGUUCUGUUUGUAUUCAUUGGGUACGAUCGCAAGGCCCGUACGCACAAUUCUUACAAGCUUUCAUCAACCUUGACCAACAUUUAUACAGGUUCCAGACUCCACGAAGCCGGAUGUGCACCCUCAUAUCAUGUGGUGCGAGGGGAUAUUGCCACGGCCACCGAAGGAGUGAUUAUAAAUGCUGCUAACAGCAAAGGACAACCUGGCGGAGGGGUGUGCGGAGCGCUGUAUAAGAAAUUCCCGGAAAGCUUCGAUUUACAGCCGAUCGAAGUAGGAAAAGCGCGACUGGUCAAAGGUGCAGCUAAACAUAUCAUUCAUGCCGUAGGACCAAACUUCAACAAAGUUUCGGAGGUUGAAGGUGACAAACAGUUGGCAGAGGCUUAUGAGUCCAUCGCUAAGAUUGUCAACGAUAACAAUUACAAGUCAGUAGCGAUUCCACUGUUGUCCACCGGCAUCUUUUCCGGGAACAAAGAUCGACUAACCCAAUCAUUGAACCAUUUGCUGACAGCUUUAGACACCACUGAUGCAGAUGUAGCCAUAUACUGCAGGGACAAGAAAUGGGAAAUGACUCUCAAGGAAGCAGUGGCUAGGAGAGAAGCAGUGGAGGAGAUAUGCAUAUCCGACGACUCUUCAGUGACAGAACCUGAUGCAGAGCUGGUGAGGGUGCAUCCGAAGAGUUCUUUGGCUGGAAGGAAGGGCUACAGCACAAGCGAUGGCAAAACUUUCUCAUAUUUGGAAGGGACCAAGUUUCACCAGGCGGCCAAGGAUAUAGCAGAAAUUAAUGCCAUGUGGCCCGUUGCAACGGAGGCCAAUGAGCAGGUAUGCAUGUAUAUCCUCGGAGAAAGCAUGAGCAGUAUUAGGUCGAAAUGCCCCGUCGAAGAGUCGGAAGCCUCCACACCACCUAGCACGCUGCCUUGCUUGUGCAUCCAUGCCAUGACUCCAGAAAGAGUACAGCGCCUAAAAGCCUCACGUCCAGAACAAAUUACUGUGUGCUCAUCCUUUCCAUUGCCGAAGUAUAGAAUCACUGGUGUGCAGAAGAUCCAAUGCUCCCAGCCUAUAUUGUUCUCACCGAAAGUGCCUGCGUAUAUUCAUCCAAGGAAGUAUCUCGUGGAAACACCACCGGUAGACGAGACUCCGGAGCCAUCGGCAGAGAACCAAUCCACAGAGGGGACACCUGAACAACCACCACUUAUAACCGAGGAUGAGACCAGGACUAGAACGCCUGAGCCGAUCAUCAUCGAAGAGGAAGAAGAGGAUAGCAUAAGUUUGCUGUCAGAUGGCCCGACCCACCAGGUGCUGCAAGUCGAGGCAGACAUUCACGGGCCGCCCUCUGUAUCUAGCUCAUCCUGGUCCAUUCCUCAUGCAUCCGACUUUGAUGUGGACAGUUUAUCCAUACUUGACACCCUGGAGGGAGCUAGCGUGACCAGCGGGGCAACGUCAGCCGAGACUAACUCUUACUUCGCAAAGAGUAUGGAGUUUCUGGCGCGACCGGUGCCUGCGCCUCGAACAGUAUUCAGGAACCCUCCACAUCCCGCUCCGCGCACAAGAACACCGUCACUUGCACCCAGCAGGGCCUGCUCGAGAACCAGCCUAGUUUCCACCCCGCCAGGCGUGAAUAGGGUGAUCACUAGAGAGGAGCUCGAGGCGCUUACCCCGUCACGCACUCCUAGCAGGUCGGUCUCGAGAACCAGCCUGGUCUCCAACCCGCCAGGCGUAAAUAGGGUGAUUACAAGAGAGGAGUUUGAGGCGUUCGUAGCACAACAACAAUGACGGUUUGAUGCGGGUGCAUACAUCUUUUCCUCCGACACCGGUCAAGGGCAUUUACAACAAAAAUCAGUAAGGCAAACGGUGCUAUCCGAAGUGGUGUUGGAGAGGACCGAAUUGGAGAUUUCGUAUGCCCCGCGCCUCGACCAAGAAAAAGAAGAAUUACUACGCAAGAAAUUACAGUUAAAUCCCACACCUGCUAACAGAAGCAGAUACCAGUCCAGGAAGGUGGAGAACAUGAAAGCCAUAACAGCUAGACGUAUUCUGCAAGGCCUAGGGCAUUAUUUGAAGGCAGAAGGAAAAGUGGAGUGCUACCGAACCCUGCAUCCUGUUCCUUUGUAUUCAUCUAGUGUGAACCGUGCCUUUUCAAGCCCCAAGGUCGCAGUGGAAGCCUGUAACGCCAUGUUGAAAGAGAACUUUCCGACUGUGGCUUCUUACUGUAUUAUUCCAGAGUACGAUGCCUAUUUGGACAUGGUUGACGGAGCUUCAUGCUGCUUAGACACUGCCAGUUUUUGCCCUGCAAAGCUGCGCAGCUUUCCAAAGAAACACUCCUAUUUGGAACCCACAAUACGAUCGGCAGUGCCUUCAGCGAUCCAGAACACGCUCCAGAACGUCCUGGCAGCUGCCACAAAAAGAAAUUGCAAUGUCACGCAAAUGAGAGAAUUGCCCGUAUUGGAUUCGGCGGCCUUUAAUGUGGAAUGCUUCAAGAAAUAUGCGUGUAAUAAUGAAUAUUGGGAAACGUUUAAAGAAAACCCCAUCAGGCUUACUGAAGAAAACGUGGUAAAUUACAUUACCAAAUUAAAAGGACCAAAAGCUGCUGCUCUUUUUGCGAAGACACAUAAUUUGAAUAUGUUGCAGGACAUACCAAUGGACAGGUUUGUAAUGGACUUAAAGAGAGACGUGAAAGUGACUCCAGGAACAAAACAUACUGAAGAACGGCCCAAGGUACAGGUGAUCCAGGCUGCCGAUCCGCUAGCAACAGCGUAUCUGUGCGGAAUCCACCGAGAGCUGGUUAGGAGAUUAAAUGCGGUCCUGCUUCCGAACAUUCAUACACUGUUUGAUAUGUCGGCUGAAGACUUUGACGCUAUUAUAGCCGAGCACUUCCAGCCUGGGGAUUGUGUUCUGGAAACUGACAUCGCGUCGUUUGAUAAAAGUGAGGACGACGCCAUGGCUCUGACCGCGUUAAUGAUUCUGGAAGACUUAGGUGUGGACGCAGAGCUGUUGACGCUGAUUGAGGCGGCUUUCGGCGAAAUUUCAUCAAUACAUUUGCCCACUAAAACUAAAUUUAAAUUCGGAGCCAUGAUGAAAUCUGGAAUGUUCCUCACACUGUUUGUGAACACAGUCAUUAACAUUGUAAUCGCAAGCAGAGUGUUGAGAGAACGGCUAACCGGAUCACCAUGUGCAGCAUUCAUUGGAGAUGACAAUAUCGUGAAAGGAGUCAAAUCGGACAAAUUAAUGGCAGACAGGUGCGCCACCUGGUUGAAUAUGGAAGUCAAGAUUAUAGAUGCUGUGGUGGGCGAGAAAGCGCCUUAUUUCUGUGGAGGGUUUAUUUUGUGUGACUCCGUGACCGGCACAGCGUGCCGUGUGGCAGACCCCCUAAAAAGGCUGUUUAAGCUUGGCAAACCUCUGGCAGCAGACGAUGAACAUGAUGAUGACAGGAGAAGGGCAUUGCAUGAAGAGUCAACACGCUGGAACCGAGUGGGUAUUCUUUCAGAGCUGUGCAAGGCAGUAGAAUCAAGGUAUGAAACCGUAGGAACUUCCAUCAUAGUUAUGGCCAUGACUACUCUAGCUAGCAGUGUUAAAUCAUUCAGCUACCUGAGAGGGGCCCCUAUAACUCUCUACGGCUAACCUGAAUGGACUACGACAUAGUCUAGUCCGCCAAGUCUAGCAU [서열번호 229]-----GOI---------------------CGGAGACGGCGCAGAAGAAGAGGAUCUGGCGAAGGCAGAGGCAGCCUGCUuACAUGuGGcGAcGUGGAAGAGAACCCCGGACCUAUGGGCGAUAGCAGCCCCGAUACCUUUUCCGAUGGCCUGAGCAGCAGCACCCUGCCUGAUGAUCACAGCAGCUACACCGUGCCUGGCUACAUGCAGGACCUGGAAGUGGAACAGGCCCUGACACCAGCUCUGAGCCCUUGUGCUGUGUCCAGCACACUGCCCGAUUGGCACAUCCCUGUGGAAGUGGUGCCUGACAGCACCAGCGACCUGUACAACUUCCAAGUGUCCCCUAUGCCUAGCACCUCCGAGGCCACCACCGAUGAGGAUGAAGAGGGAAAGCUGCCCGAGGACAUCAUGAAGCUGCUGGAACAGAGCGAGUGGCAGCCCACCAAUGUGGAUGGCAAGGGCUACCUGCUGAACGAGCCUGGCGUUCAGCCUACAAGCGUGUACGGCGACUUCAGCUGCAAAGAGGAACCCGAGAUCGAUAGCCCUGGCGGCGAUAUCGGACUGAGCCUGCAGAGAGUGUUCACCGACCUGAAGAACAUGGACGCCACCUGGCUGGACAGCCUGCUGACACCUGUUAGACUGCCCUCUAUCCAGGCUAUCCCCUGCGCUCCUUGAGCGGCCGCGAAUUGGCAAGCUGCUUACAUAGAACUCGCGGCGAUUGGCAUGCCGCCUUAAAAUUUUUAUUUUAUUUUUCUUUUCUUUUCCGAAUCGGAUUUUGUUUUUAAUAUUUCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA [서열번호 264]
따라서, 바람직하게는 상기 RNA 구조체는 실질적으로 서열번호 229, GOI, 및 서열번호 264를 포함하거나 이로 이루어지는 상기 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
본 발명의 제2 측면에서, 제1 측면의 RNA 구조체를 암호화하는 핵산 서열이 제공된다.
따라서, 한 구현예에서, 상기 핵산 서열은 T7 프로모터, 5'UTR, NSP1-4, 서브게놈성 프로모터, GOI(관심있는 유전자는 치료 생체분자임), 푸린 T2A, IRF1인 IMP(ATG 및 종결 코돈으로 코돈 최적화됨 - 서열번호 4), 3'UTR, 및 폴리A 꼬리를 포함할 수 있다. 따라서, 한 구현예에서, 상기 핵산 서열은 서열번호 230, GOI, 및 서열번호 265를 포함하거나 이로 이루어질 수 있다. 서열번호 230 및 서열번호 265는 다음과 같다:
TAATACGACTCACTATAGATGGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAAGACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAATGACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACGATCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGATGTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAGGAATTGGACAAGAAAATGAAGGAGCTGGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTCCACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACAAGTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTTAAGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATAGGCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCATCCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTGCCGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTACGTCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAGGGATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCAGCTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTTGGGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCCGTAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTACGAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCGGATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTGGAGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAGGACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTACCACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAgGCaGAcGTCGACTTGATGTTACAAGAGGCTGGGGCCGGCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCTGTGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTGATAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACATGCAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAACAGGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCCAGCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTAGGGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCTCCTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTCACCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAAGGGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTATATTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTGCTCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGCACACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTACGACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAGCAGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATAATGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCTCTGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTAGCCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAAGCAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAACGTGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACTGTGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGACTCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCCCCGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGGGCAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTACCTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTCGTCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGGTTGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGACATAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTTAGCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCTGACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCCTCACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATTCTTACAAGCTTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTGGTGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGAGGGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGACTGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGTGACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATTCCACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCTTTAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTGGCTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTGAGGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTGGAAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCCAATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCGGAAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTAAAAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAGAAGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTGGAAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCACCACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGCATAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTATCTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAGGGAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCGCGACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTTGCACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAGGAGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCAGGCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGTGCATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAAGTGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGCAAGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCCATAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACCCTGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGTAACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGACATGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAACACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCAGCTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTGGAATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAAGAAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTGAATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACAAAACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGAATCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCTGAAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTTGATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTGTTGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGAGCCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTGTTGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCGGACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAGAAAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCCCTAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTGCATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAAACCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGAGGGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGTCTAGCAT [서열번호 230]---GOI---CGGAGACGGCGCAGAAGAAGAGGATCTGGCGAAGGCAGAGGCAGCCTGCTtACATGtGGcGAcGTGGAAGAGAACCCCGGACCTATGGGCGATAGCAGCCCCGATACCTTTTCCGATGGCCTGAGCAGCAGCACCCTGCCTGATGATCACAGCAGCTACACCGTGCCTGGCTACATGCAGGACCTGGAAGTGGAACAGGCCCTGACACCAGCTCTGAGCCCTTGTGCTGTGTCCAGCACACTGCCCGATTGGCACATCCCTGTGGAAGTGGTGCCTGACAGCACCAGCGACCTGTACAACTTCCAAGTGTCCCCTATGCCTAGCACCTCCGAGGCCACCACCGATGAGGATGAAGAGGGAAAGCTGCCCGAGGACATCATGAAGCTGCTGGAACAGAGCGAGTGGCAGCCCACCAATGTGGATGGCAAGGGCTACCTGCTGAACGAGCCTGGCGTTCAGCCTACAAGCGTGTACGGCGACTTCAGCTGCAAAGAGGAACCCGAGATCGATAGCCCTGGCGGCGATATCGGACTGAGCCTGCAGAGAGTGTTCACCGACCTGAAGAACATGGACGCCACCTGGCTGGACAGCCTGCTGACACCTGTTAGACTGCCCTCTATCCAGGCTATCCCCTGCGCTCCTTGAGCGGCCGCGAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA [서열번호 265]
따라서, 바람직하게는 상기 핵산 서열은 실질적으로 서열번호 230, GOI, 및 서열번호 265을 포함하거나 이로 이루어지는 상기 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 단편 또는 변이체를 포함한다.
제3 측면에서, 제2 측면에 따른 핵산 서열을 포함하는 발현 카세트(cassette)가 제공된다.
본 발명의 핵산 서열은 바람직하게는 재조합 벡터, 예를 들면 상기 RNA 구조체를 생산할 수 있도록 관심있는 숙주 세포 내로 운반하기 위한 재조합 벡터 내에 포함된다.
따라서, 제4 측면에서, 제3 측면에 따른 발현 카세트를 포함하는 재조합 벡터가 제공된다.
따라서, 한 구현예에서, 상기 벡터는 T7 프로모터, 5'UTR, NSP1-4, 서브게놈성 프로모터, GOI(관심있는 유전자는 치료 생체분자임), 푸린 T2A, IRF1인 IMP(ATG 및 종결 코돈으로 코돈 최적화됨 - 서열번호 5), 3'UTR, 및 폴리A 꼬리를 포함할 수 있다. 한 구현예에서, 상기 벡터는 단일 벡터 내에 서열번호 231의 핵산 서열, GOI, 및 서열번호 266의 핵산 서열을 포함할 수 있다. 서열번호 231 및 서열번호 266은 다음과 같으며, 여기서 "GOI"는 치료 생체분자를 암호화하는 서열의 위치를 나타낸다:
TAATACGACTCACTATAGATGGGCGGCGCATGAGAGAAGCCCAGACCAATTACCTACCCAAAATGGAGAAAGTTCACGTTGACATCGAGGAAGACAGCCCATTCCTCAGAGCTTTGCAGCGGAGCTTCCCGCAGTTTGAGGTAGAAGCCAAGCAGGTCACTGATAATGACCATGCTAATGCCAGAGCGTTTTCGCATCTGGCTTCAAAACTGATCGAAACGGAGGTGGACCCATCCGACACGATCCTTGACATTGGAAGTGCGCCCGCCCGCAGAATGTATTCTAAGCACAAGTATCATTGTATCTGTCCGATGAGATGTGCGGAAGATCCGGACAGATTGTATAAGTATGCAACTAAGCTGAAGAAAAACTGTAAGGAAATAACTGATAAGGAATTGGACAAGAAAATGAAGGAGCTGGCCGCCGTCATGAGCGACCCTGACCTGGAAACTGAGACTATGTGCCTCCACGACGACGAGTCGTGTCGCTACGAAGGGCAAGTCGCTGTTTACCAGGATGTATACGCGGTTGACGGACCGACAAGTCTCTATCACCAAGCCAATAAGGGAGTTAGAGTCGCCTACTGGATAGGCTTTGACACCACCCCTTTTATGTTTAAGAACTTGGCTGGAGCATATCCATCATACTCTACCAACTGGGCCGACGAAACCGTGTTAACGGCTCGTAACATAGGCCTATGCAGCTCTGACGTTATGGAGCGGTCACGTAGAGGGATGTCCATTCTTAGAAAGAAGTATTTGAAACCATCCAACAATGTTCTATTCTCTGTTGGCTCGACCATCTACCACGAGAAGAGGGACTTACTGAGGAGCTGGCACCTGCCGTCTGTATTTCACTTACGTGGCAAGCAAAATTACACATGTCGGTGTGAGACTATAGTTAGTTGCGACGGGTACGTCGTTAAAAGAATAGCTATCAGTCCAGGCCTGTATGGGAAGCCTTCAGGCTATGCTGCTACGATGCACCGCGAGGGATTCTTGTGCTGCAAAGTGACAGACACATTGAACGGGGAGAGGGTCTCTTTTCCCGTGTGCACGTATGTGCCAGCTACATTGTGTGACCAAATGACTGGCATACTGGCAACAGATGTCAGTGCGGACGACGCGCAAAAACTGCTGGTTGGGCTCAACCAGCGTATAGTCGTCAACGGTCGCACCCAGAGAAACACCAATACCATGAAAAATTACCTTTTGCCCGTAGTGGCCCAGGCATTTGCTAGGTGGGCAAAGGAATATAAGGAAGATCAAGAAGATGAAAGGCCACTAGGACTACGAGATAGACAGTTAGTCATGGGGTGTTGTTGGGCTTTTAGAAGGCACAAGATAACATCTATTTATAAGCGCCCGGATACCCAAACCATCATCAAAGTGAACAGCGATTTCCACTCATTCGTGCTGCCCAGGATAGGCAGTAACACATTGGAGATCGGGCTGAGAACAAGAATCAGGAAAATGTTAGAGGAGCACAAGGAGCCGTCACCTCTCATTACCGCCGAGGACGTACAAGAAGCTAAGTGCGCAGCCGATGAGGCTAAGGAGGTGCGTGAAGCCGAGGAGTTGCGCGCAGCTCTACCACCTTTGGCAGCTGATGTTGAGGAGCCCACTCTGGAgGCaGAcGTCGACTTGATGTTACAAGAGGCTGGGGCCGGCTCAGTGGAGACACCTCGTGGCTTGATAAAGGTTACCAGCTACGATGGCGAGGACAAGATCGGCTCTTACGCTGTGCTTTCTCCGCAGGCTGTACTCAAGAGTGAAAAATTATCTTGCATCCACCCTCTCGCTGAACAAGTCATAGTGATAACACACTCTGGCCGAAAAGGGCGTTATGCCGTGGAACCATACCATGGTAAAGTAGTGGTGCCAGAGGGACATGCAATACCCGTCCAGGACTTTCAAGCTCTGAGTGAAAGTGCCACCATTGTGTACAACGAACGTGAGTTCGTAAACAGGTACCTGCACCATATTGCCACACATGGAGGAGCGCTGAACACTGATGAAGAATATTACAAAACTGTCAAGCCCAGCGAGCACGACGGCGAATACCTGTACGACATCGACAGGAAACAGTGCGTCAAGAAAGAACTAGTCACTGGGCTAGGGCTCACAGGCGAGCTGGTGGATCCTCCCTTCCATGAATTCGCCTACGAGAGTCTGAGAACACGACCAGCCGCTCCTTACCAAGTACCAACCATAGGGGTGTATGGCGTGCCAGGATCAGGCAAGTCTGGCATCATTAAAAGCGCAGTCACCAAAAAAGATCTAGTGGTGAGCGCCAAGAAAGAAAACTGTGCAGAAATTATAAGGGACGTCAAGAAAATGAAAGGGCTGGACGTCAATGCCAGAACTGTGGACTCAGTGCTCTTGAATGGATGCAAACACCCCGTAGAGACCCTGTATATTGACGAAGCTTTTGCTTGTCATGCAGGTACTCTCAGAGCGCTCATAGCCATTATAAGACCTAAAAAGGCAGTGCTCTGCGGGGATCCCAAACAGTGCGGTTTTTTTAACATGATGTGCCTGAAAGTGCATTTTAACCACGAGATTTGCACACAAGTCTTCCACAAAAGCATCTCTCGCCGTTGCACTAAATCTGTGACTTCGGTCGTCTCAACCTTGTTTTACGACAAAAAAATGAGAACGACGAATCCGAAAGAGACTAAGATTGTGATTGACACTACCGGCAGTACCAAACCTAAGCAGGACGATCTCATTCTCACTTGTTTCAGAGGGTGGGTGAAGCAGTTGCAAATAGATTACAAAGGCAACGAAATAATGACGGCAGCTGCCTCTCAAGGGCTGACCCGTAAAGGTGTGTATGCCGTTCGGTACAAGGTGAATGAAAATCCTCTGTACGCACCCACCTCAGAACATGTGAACGTCCTACTGACCCGCACGGAGGACCGCATCGTGTGGAAAACACTAGCCGGCGACCCATGGATAAAAACACTGACTGCCAAGTACCCTGGGAATTTCACTGCCACGATAGAGGAGTGGCAAGCAGAGCATGATGCCATCATGAGGCACATCTTGGAGAGACCGGACCCTACCGACGTCTTCCAGAATAAGGCAAACGTGTGTTGGGCCAAGGCTTTAGTGCCGGTGCTGAAGACCGCTGGCATAGACATGACCACTGAACAATGGAACACTGTGGATTATTTTGAAACGGACAAAGCTCACTCAGCAGAGATAGTATTGAACCAACTATGCGTGAGGTTCTTTGGACTCGATCTGGACTCCGGTCTATTTTCTGCACCCACTGTTCCGTTATCCATTAGGAATAATCACTGGGATAACTCCCCGTCGCCTAACATGTACGGGCTGAATAAAGAAGTGGTCCGTCAGCTCTCTCGCAGGTACCCACAACTGCCTCGGGCAGTTGCCACTGGAAGAGTCTATGACATGAACACTGGTACACTGCGCAATTATGATCCGCGCATAAACCTAGTACCTGTAAACAGAAGACTGCCTCATGCTTTAGTCCTCCACCATAATGAACACCCACAGAGTGACTTTTCTTCATTCGTCAGCAAATTGAAGGGCAGAACTGTCCTGGTGGTCGGGGAAAAGTTGTCCGTCCCAGGCAAAATGGTTGACTGGTTGTCAGACCGGCCTGAGGCTACCTTCAGAGCTCGGCTGGATTTAGGCATCCCAGGTGATGTGCCCAAATATGACATAATATTTGTTAATGTGAGGACCCCATATAAATACCATCACTATCAGCAGTGTGAAGACCATGCCATTAAGCTTAGCATGTTGACCAAGAAAGCTTGTCTGCATCTGAATCCCGGCGGAACCTGTGTCAGCATAGGTTATGGTTACGCTGACAGGGCCAGCGAAAGCATCATTGGTGCTATAGCGCGGCAGTTCAAGTTTTCCCGGGTATGCAAACCGAAATCCTCACTTGAAGAGACGGAAGTTCTGTTTGTATTCATTGGGTACGATCGCAAGGCCCGTACGCACAATTCTTACAAGCTTTCATCAACCTTGACCAACATTTATACAGGTTCCAGACTCCACGAAGCCGGATGTGCACCCTCATATCATGTGGTGCGAGGGGATATTGCCACGGCCACCGAAGGAGTGATTATAAATGCTGCTAACAGCAAAGGACAACCTGGCGGAGGGGTGTGCGGAGCGCTGTATAAGAAATTCCCGGAAAGCTTCGATTTACAGCCGATCGAAGTAGGAAAAGCGCGACTGGTCAAAGGTGCAGCTAAACATATCATTCATGCCGTAGGACCAAACTTCAACAAAGTTTCGGAGGTTGAAGGTGACAAACAGTTGGCAGAGGCTTATGAGTCCATCGCTAAGATTGTCAACGATAACAATTACAAGTCAGTAGCGATTCCACTGTTGTCCACCGGCATCTTTTCCGGGAACAAAGATCGACTAACCCAATCATTGAACCATTTGCTGACAGCTTTAGACACCACTGATGCAGATGTAGCCATATACTGCAGGGACAAGAAATGGGAAATGACTCTCAAGGAAGCAGTGGCTAGGAGAGAAGCAGTGGAGGAGATATGCATATCCGACGACTCTTCAGTGACAGAACCTGATGCAGAGCTGGTGAGGGTGCATCCGAAGAGTTCTTTGGCTGGAAGGAAGGGCTACAGCACAAGCGATGGCAAAACTTTCTCATATTTGGAAGGGACCAAGTTTCACCAGGCGGCCAAGGATATAGCAGAAATTAATGCCATGTGGCCCGTTGCAACGGAGGCCAATGAGCAGGTATGCATGTATATCCTCGGAGAAAGCATGAGCAGTATTAGGTCGAAATGCCCCGTCGAAGAGTCGGAAGCCTCCACACCACCTAGCACGCTGCCTTGCTTGTGCATCCATGCCATGACTCCAGAAAGAGTACAGCGCCTAAAAGCCTCACGTCCAGAACAAATTACTGTGTGCTCATCCTTTCCATTGCCGAAGTATAGAATCACTGGTGTGCAGAAGATCCAATGCTCCCAGCCTATATTGTTCTCACCGAAAGTGCCTGCGTATATTCATCCAAGGAAGTATCTCGTGGAAACACCACCGGTAGACGAGACTCCGGAGCCATCGGCAGAGAACCAATCCACAGAGGGGACACCTGAACAACCACCACTTATAACCGAGGATGAGACCAGGACTAGAACGCCTGAGCCGATCATCATCGAAGAGGAAGAAGAGGATAGCATAAGTTTGCTGTCAGATGGCCCGACCCACCAGGTGCTGCAAGTCGAGGCAGACATTCACGGGCCGCCCTCTGTATCTAGCTCATCCTGGTCCATTCCTCATGCATCCGACTTTGATGTGGACAGTTTATCCATACTTGACACCCTGGAGGGAGCTAGCGTGACCAGCGGGGCAACGTCAGCCGAGACTAACTCTTACTTCGCAAAGAGTATGGAGTTTCTGGCGCGACCGGTGCCTGCGCCTCGAACAGTATTCAGGAACCCTCCACATCCCGCTCCGCGCACAAGAACACCGTCACTTGCACCCAGCAGGGCCTGCTCGAGAACCAGCCTAGTTTCCACCCCGCCAGGCGTGAATAGGGTGATCACTAGAGAGGAGCTCGAGGCGCTTACCCCGTCACGCACTCCTAGCAGGTCGGTCTCGAGAACCAGCCTGGTCTCCAACCCGCCAGGCGTAAATAGGGTGATTACAAGAGAGGAGTTTGAGGCGTTCGTAGCACAACAACAATGACGGTTTGATGCGGGTGCATACATCTTTTCCTCCGACACCGGTCAAGGGCATTTACAACAAAAATCAGTAAGGCAAACGGTGCTATCCGAAGTGGTGTTGGAGAGGACCGAATTGGAGATTTCGTATGCCCCGCGCCTCGACCAAGAAAAAGAAGAATTACTACGCAAGAAATTACAGTTAAATCCCACACCTGCTAACAGAAGCAGATACCAGTCCAGGAAGGTGGAGAACATGAAAGCCATAACAGCTAGACGTATTCTGCAAGGCCTAGGGCATTATTTGAAGGCAGAAGGAAAAGTGGAGTGCTACCGAACCCTGCATCCTGTTCCTTTGTATTCATCTAGTGTGAACCGTGCCTTTTCAAGCCCCAAGGTCGCAGTGGAAGCCTGTAACGCCATGTTGAAAGAGAACTTTCCGACTGTGGCTTCTTACTGTATTATTCCAGAGTACGATGCCTATTTGGACATGGTTGACGGAGCTTCATGCTGCTTAGACACTGCCAGTTTTTGCCCTGCAAAGCTGCGCAGCTTTCCAAAGAAACACTCCTATTTGGAACCCACAATACGATCGGCAGTGCCTTCAGCGATCCAGAACACGCTCCAGAACGTCCTGGCAGCTGCCACAAAAAGAAATTGCAATGTCACGCAAATGAGAGAATTGCCCGTATTGGATTCGGCGGCCTTTAATGTGGAATGCTTCAAGAAATATGCGTGTAATAATGAATATTGGGAAACGTTTAAAGAAAACCCCATCAGGCTTACTGAAGAAAACGTGGTAAATTACATTACCAAATTAAAAGGACCAAAAGCTGCTGCTCTTTTTGCGAAGACACATAATTTGAATATGTTGCAGGACATACCAATGGACAGGTTTGTAATGGACTTAAAGAGAGACGTGAAAGTGACTCCAGGAACAAAACATACTGAAGAACGGCCCAAGGTACAGGTGATCCAGGCTGCCGATCCGCTAGCAACAGCGTATCTGTGCGGAATCCACCGAGAGCTGGTTAGGAGATTAAATGCGGTCCTGCTTCCGAACATTCATACACTGTTTGATATGTCGGCTGAAGACTTTGACGCTATTATAGCCGAGCACTTCCAGCCTGGGGATTGTGTTCTGGAAACTGACATCGCGTCGTTTGATAAAAGTGAGGACGACGCCATGGCTCTGACCGCGTTAATGATTCTGGAAGACTTAGGTGTGGACGCAGAGCTGTTGACGCTGATTGAGGCGGCTTTCGGCGAAATTTCATCAATACATTTGCCCACTAAAACTAAATTTAAATTCGGAGCCATGATGAAATCTGGAATGTTCCTCACACTGTTTGTGAACACAGTCATTAACATTGTAATCGCAAGCAGAGTGTTGAGAGAACGGCTAACCGGATCACCATGTGCAGCATTCATTGGAGATGACAATATCGTGAAAGGAGTCAAATCGGACAAATTAATGGCAGACAGGTGCGCCACCTGGTTGAATATGGAAGTCAAGATTATAGATGCTGTGGTGGGCGAGAAAGCGCCTTATTTCTGTGGAGGGTTTATTTTGTGTGACTCCGTGACCGGCACAGCGTGCCGTGTGGCAGACCCCCTAAAAAGGCTGTTTAAGCTTGGCAAACCTCTGGCAGCAGACGATGAACATGATGATGACAGGAGAAGGGCATTGCATGAAGAGTCAACACGCTGGAACCGAGTGGGTATTCTTTCAGAGCTGTGCAAGGCAGTAGAATCAAGGTATGAAACCGTAGGAACTTCCATCATAGTTATGGCCATGACTACTCTAGCTAGCAGTGTTAAATCATTCAGCTACCTGAGAGGGGCCCCTATAACTCTCTACGGCTAACCTGAATGGACTACGACATAGTCTAGTCCGCCAAGTCTAGCAT [서열번호 231]---GOI----CGGAGACGGCGCAGAAGAAGAGGATCTGGCGAAGGCAGAGGCAGCCTGCTtACATGtGGcGAcGTGGAAGAGAACCCCGGACCTATGGGCGATAGCAGCCCCGATACCTTTTCCGATGGCCTGAGCAGCAGCACCCTGCCTGATGATCACAGCAGCTACACCGTGCCTGGCTACATGCAGGACCTGGAAGTGGAACAGGCCCTGACACCAGCTCTGAGCCCTTGTGCTGTGTCCAGCACACTGCCCGATTGGCACATCCCTGTGGAAGTGGTGCCTGACAGCACCAGCGACCTGTACAACTTCCAAGTGTCCCCTATGCCTAGCACCTCCGAGGCCACCACCGATGAGGATGAAGAGGGAAAGCTGCCCGAGGACATCATGAAGCTGCTGGAACAGAGCGAGTGGCAGCCCACCAATGTGGATGGCAAGGGCTACCTGCTGAACGAGCCTGGCGTTCAGCCTACAAGCGTGTACGGCGACTTCAGCTGCAAAGAGGAACCCGAGATCGATAGCCCTGGCGGCGATATCGGACTGAGCCTGCAGAGAGTGTTCACCGACCTGAAGAACATGGACGCCACCTGGCTGGACAGCCTGCTGACACCTGTTAGACTGCCCTCTATCCAGGCTATCCCCTGCGCTCCTTGAGCGGCCGCGAATTGGCAAGCTGCTTACATAGAACTCGCGGCGATTGGCATGCCGCCTTAAAATTTTTATTTTATTTTTCTTTTCTTTTCCGAATCGGATTTTGTTTTTAATATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACGCGTCGAGGGGAATTAATTCTTGAAGACGAAAGGGCCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCATTCTAGAATGGCGCGCCCTTAAGGGGAGAATAGGAGCCGCAACACACAAGCAACGCGAGGTCGTTTAAAC [서열번호 266]
따라서, 바람직하게는 상기 벡터는 실질적으로 서열번호 231, GOI, 및 서열번호 266을 포함하거나 이로 이루어지는 상기 나타낸 것과 같은 뉴클레오티드 서열, 또는 그의 변이체 또는 단편을 포함한다.
본 발명의 saRNA 구조체는 주형으로서 DNA 플라스미드를 이용해 만들어질 수 있다. 이후, RNA 사본(copy)은 폴리머라아제, 예컨대 T7 폴리머라아제를 이용한 시험관내 전사에 의해 만들어질 수 있고, 상기 T7 프로모터는 saRNA의 상류일 수 있다. 그러므로, 본 발명의 saRNA 구조체는 주형으로서 서열번호 1 내지 서열번호 266 중 어느 하나에 나타낸 것과 같은 핵산 서열, 예컨대 실질적으로 서열번호 231, GOI, 및 서열번호 266을 포함하거나 이로 이루어지는 상기 나타낸 것과 같은 서열, 또는 그의 변이체 또는 단편을 갖는 DNA 플라스미드를 이용해 만들어질 수 있다. 물론, 다른 RNA 폴리머라아제, 예를 들면 SP6 또는 T3 폴리머라아제가 T7 폴리머라아제 대신에 사용될 수 있음이 인식될 것이며, 이 경우 상기 saRNA 구조체는 대신에 SP6 또는 T3 프로모터를 포함할 수 있다.
제1 측면의 RNA 구조체를 암호화하는 제4 측면의 벡터는 예를 들면 플라스미드, 코스미드(cosmid) 또는 파지(phage)이거나, 및/또는 바이러스 벡터일 수 있다. 이러한 재조합 벡터는 세포를 뉴클레오티드 서열로 형질전환하기 위한 본 발명의 운반 시스템에서 매우 유용하다. 상기 뉴클레오티드 서열은 바람직하게는 DNA 서열일 수 있고, 상기 DNA 서열이 제1 측면의 RNA 구조체를 형성하는 RNA 서열을 암호화한다.
제1 측면의 RNA 구조체를 암호화하는 재조합 벡터는 또한 다른 기능적 요소를 포함할 수 있다. 예를 들면, 재조합 벡터는 숙주 세포에서 벡터의 도입시 전이유전자 발현을 시작하기 위한 적합한 프로모터를 포함하는 다양한 다른 기능적 요소를 추가로 포함할 수 있다. 예를 들어, 상기 벡터는 바람직하게는 숙주 세포, 예컨대 박테리아 세포의 핵에서 자발적으로 복제할 수 있다. 이 경우, DNA 복제를 유도 또는 조절하는 요소가 상기 재조합 벡터에 필요할 수 있다. 대안적으로, 상기 재조합 벡터는 숙주 세포의 게놈 내로 통합되도록 디자인될 수 있다. 이 경우, (예컨대 상동성 재조합에 의한) 표적화된 통합에 유리한 DNA 서열이 예측된다. 적합한 프로모터는 예로서 SV40 프로모터, CMV, EF1a, PGK, 바이러스 긴 말단 반복뿐만 아니라 유도가능한 프로모터, 예컨대 테트라사이클린 유도가능한 시스템을 포함할 수 있다. 상기 카세트 또는 벡터는 또한 종결자(terminator), 예컨대 베타 글로빈, SV40 폴리아데닐화 서열 또는 합성 폴리아데닐화 서열을 포함할 수 있다. 상기 재조합 벡터는 또한 필요시 핵산의 발현을 제어하기 위한 프로모터 또는 조절자 또는 인핸서(enhancer)를 포함할 수 있다.
상기 벡터는 또한 형질감염(transfection) 또는 형질전환(transformation)된 세포의 선택을 가능하게 하고, 이종성 DNA가 통합된 벡터를 갖고 있는 세포의 선택을 가능하게 하는 클로닝 공정에서 선택가능한 마커로서 사용될 수 있는 유전자를 코딩하는 DNA를 포함할 수 있다. 예를 들면, 암피실린, 네오마이신, 푸로마이신 또는 클로람페니콜 저항성이 예측된다. 대안적으로, 상기 선택가능한 마커 유전자는 전이유전자(들)를 함유하는 벡터와 동시에 사용될 수 있는 상이한 벡터에 있을 수 있다. 상기 카세트 또는 벡터는 또한 뉴클레오티드 서열의 발현을 조절하거나, 발현된 폴리펩티드를 숙주 세포의 소정 부분으로 표적화하는데 관여되는 DNA를 포함할 수 있다.
정제된 벡터는 적합한 수단, 예컨대 직접 식작용 섭취(endocytotic uptake)에 의해 숙주 세포 내로 직접적으로 삽입될 수 있다. 상기 벡터는 형질감염, 감염, 전기천공, 미세주사, 세포 융합, 원형질체 융합 또는 탄도적 충격(ballistic bombardment)에 의해 숙주 세포(예컨대, 진핵생물 또는 원핵생물 세포) 내로 직접 도입될 수 있다. 대안적으로, 본 발명의 벡터는 입자 총을 이용해 숙주 세포 내로 직접 도입될 수 있다.
상기 핵산 분자는 (필수적인 것은 아니지만) 숙주 세포의 DNA에 통합되게 되는 것일 수 있다. 미분화된 세포는 인정적으로 형질전환되어 유전적으로 변형된 딸세포의 생산을 유도할 수 있다(이 경우, 예컨대 특정 전사 인자 또는 유전자 활성화제를 이용한 대상체에서의 발현의 조절이 필요할 수 있다). 대안적으로, 상기 운반 시스템은 분화된 세포의 불안정하거나 일시적인 형질전환을 선호하도록 디자인될 수 있다. 이 경우일 때는 발현의 조절이 덜 중요할 수 있는데, 그 이유는 형질전환된 세포가 죽거나 단백질의 발현을 중단할 때 상기 DNA 분자의 발현이 중단될 것이기 때문이다.
대안적으로, 상기 운반 시스템은 벡터에 통합되지 않고 숙주 세포에 핵산 분자를 제공할 수 있다. 예를 들어, 상기 핵산 분자는 리포좀 또는 바이러스 입자 내에 통합될 수 있다. 대안적으로, "네이키드(naked)" 핵산 분자가 적합한 수단, 예컨대 직접 식작용 섭취에 의해 숙주 세포 내로 삽입될 수 있다.
제5 측면에서, 제1 측면의 RNA 구조체, 제2 측면의 핵산 서열, 제3 측면의 발현 카세트 또는 제4 측면의 벡터, 및 약학적으로 허용가능한 비히클(vehicle)을 포함하는 약학적 조성물이 제공된다.
제6 측면에서, 제5 측면에 따른 약학적 조성물을 제조하기 위한 공정이 제공되며, 상기 방법은 제1 측면의 RNA 구조체, 제2 측면의 핵산 서열, 제3 측면의 발현 카세트 또는 제4 측면의 벡터를 약학적으로 허용가능한 비히클과 접촉시키는 단계를 포함한다.
제7 측면에서, 제1 측면의 RNA 구조체의 제조 방법이 제공되며, 상기 방법은
a) ⅰ) 숙주 세포 내로 제4 측면의 벡터를 도입하는 단계; 및
ⅱ) 제1 측면의 RNA 구조체의 생산을 야기하기 위한 조건 하에 상기 숙주 세포를 배양하는 단계를 포함하거나; 또는
b) 제4 측면에 따른 벡터로부터 RNA 구조체를 전사하는 단계를 포함한다.
단계 a)의 숙주 세포는 진핵생물 또는 원핵생물 숙주 세포일 수 있다. 바람직하게는, 상기 숙주 세포는 진핵생물 숙주 세포이다. 보다 바람직하게는, 상기 숙주 세포는 포유동물 숙주 세포, 예컨대 인간 배아 신장 293 세포 또는 중국 햄스터 난소(CHO) 세포이다. 단계 (b)는 시험관내 또는 생체내에서, 바람직하게는 시험관내에서 수행될 수 있다.
시험관내 전사의 적합한 방법은 본 기술분야에 잘 알려져 있으며, 본 기술분야의 기술자에게 알려져 있을 것이다. 예를 들면, [Molecular Cloning, A Laboratory Manual, 2nd edition. (1989) editor C Nolan, Cold Spring Harbor Laboratory Press]에 기술되어 있다.
제1 측면의 RNA 레플리콘은 치료법용으로 특히 적합하다.
본 발명자들은 제1 측면의 RNA 구조체가 치료법에서 생체내에서 사용하기 위해 시험관내 전사에 의해 생성될 것으로 예측했지만, 본 기술분야의 숙련자는 상기 RNA 구조체는 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터를 대상체에게 생체내 운반함으로써 치료법을 위해 대상체에서 생체내에서 생성될 수 있음을 인식할 것이다.
그러므로, 제8 측면에 따르면, 약제로서 또는 치료법에서 사용하기 위한 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물이 제공된다.
본 발명의 제9 측면에서, 원생동물, 진균, 박테리아 또는 바이러스 감염의 예방, 개선 또는 치료에 사용하기 위한 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물이 제공된다.
상기 원생동물, 진균, 박테리아 또는 바이러스 감염은 제1 측면에서 정의된 것과 같은 원생동물, 진균, 박테리아 또는 바이러스의 감염일 수 있다.
본 발명의 제10 측면에서, 암의 예방, 개선 또는 치료에 사용하기 위한 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물이 제공된다.
상기 암은 제1 측면에서 정의된 것일 수 있다.
본 발명의 제11 측면에서, 원생동물, 진균, 박테리아 또는 바이러스 감염을 치료하기 위한 방법이 제공되며, 상기 방법은 이를 필요로 하는 대상체에게, 치료적 유효량의 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물을 투여하는 단계를 포함한다.
치료되어야 하는 원생동물, 진균, 박테리아 또는 바이러스 감염은 제1 측면에서 정의된 것과 같은 원생동물, 진균, 박테리아 또는 바이러스의 감염일 수 있다.
본 발명의 제12 측면에서, 암을 치료하기 위한 방법이 제공되며, 상기 방법은 이를 필요로 하는 대상체에게 치료적 유효량의 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물을 투여하는 단계를 포함한다.
치료되어야 하는 암은 제1 측면에서 정의된 것일 수 있다.
본 명세서에 기술된 RNA 구조체는 (예컨대, 바이러스, 박테리아 또는 진균 감염에 대해) 대상체 및 암을 백신화하는 효과적인 수단을 제공한다.
따라서, 본 발명의 제13 측면에서, 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물를 포함하는 백신이 제공된다.
운반 제형 내로 통합된 어주번트(adjuvant)는 박테리아 지질펩티드, 지질단백질 및 리포테이코산; 마이코박테리아 지질글리칸; 효모 자이모산, 포린, 지질폴리사카라이드, 지질 A, 모노포스포릴 지질 A(MPL), 플라젤린, CpG DNA, 헤모조인, 토마틴, ISCOM, ISCOMATRIXTM, 스쿠알렌계 에멀전, 폴리머, 예컨대 PEI, 카르보폴, 지질 나노입자 및 박테리아 독소(CT, LT)로 이루어진 군으로부터 선택될 수 있다. 운반 제형 내로 통합된 어주번트의 추가 예는 알루미늄 염, 합성 형태의 DNA, 탄수화물, 정제 결합제, 이온 교환 수지, 보존제, 폴리머, 에멀전 및/또는 지질을 포함할 수 있다. 어주번트의 예는 일나트륨 글루타메이트, 수크로오스, 덱스트로오스, 알루미늄 소, 인간 혈청 알부민, 시토신 포스포구아닌, 인산칼륨, 플라스돈 C, 무수 락토오스, 셀룰로오스, 폴라크릴린 칼륨, 글리세린, 아스파라긴, 시트르산, 인산칼륨 황산마그네슘, 철 암모늄 시트레이트, 2-페녹시에탄올, 알루미늄, 베타-프로피오락톤, 소 추출물, DOPC, EDTA, 포름알데히드, 티메로살, 페놀, 칼륨 알루미늄 설페이트, 글루탐산칼륨, 붕산나트륨, 나트륨 메타비설파이트, 우레아, PLGA, PVA, PLA, PVP, 시클로덱스트린계 안정화제, 수중유(oil in water) 에멀전 어주번트 및/또는 지질계 어주번트를 포함할 수 있다.
본 발명의 제14 측면에서, 대상체에서 면역 반응을 자극하는데 사용하기 위한 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물이 제공된다.
상기 면역 반응은 제1 측면에서 정의된 항원에 다른 원생동물, 박테리아, 바이러스, 진균 또는 암에 대해 자극될 수 있다.
제15 측면에 따르면, 줄기 세포 치료법에서 사용하기 위한 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물이 제공된다.
줄기 세포 치료법은 체세포를 줄기 세포 특징을 갖는 세포로 재프로그래밍하는 것에 관한 것일 수 있다.
체세포는 제1 측면에서 정의된 것과 같은 줄기 세포 특징을 갖는 세포로 체세포를 재프로그래밍 하는 것을 향상시킬 수 있는 하나 이상의 단백질을 운반함으로써 재프로그래밍될 수 있다.
제16 측면에 따르면, 제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물을 세포로 운반하는 단계를 포함하는 생체외 또는 시험관내에서 세포를 변형시키는 방법이 제공된다.
바람직하게는, 상기 방법은 생체외에서 수행된다.
상기 세포는 진핵생물 또는 원핵생물 세포일 수 있다. 바람직하게는, 상기 세포는 진핵생물 세포이다. 보다 바람직하게는, 상기 세포는 포유동물 숙주 세포이다. 가장 바람직하게는, 상기 세포는 인간 세포이다.
바람직하게는, 상기 변형된 세포는 세포-치료법 적응증을 위해 적합하다.
제17 측면에서, 제16 측면의 방법으로부터 수득되거나 이것에 의해 수득가능한 변형된 세포가 제공된다.
제18 측면에서, 치료법, 선택적으로 세포 치료법에서 사용하기 위한 제17 측면의 변형된 세포가 제공된다.
제1 측면에 따른 RNA 구조체, 제2 측면에 따른 핵산, 제3 측면에 따른 발현 카세트, 제4 측면에 따른 벡터 또는 제5 측면에 따른 약학적 조성물(본 명세서에서 활성제로 알려짐)이 질환을 치료, 개선, 또는 예방하거나 백신화하기 위한 단일치료법(즉, 상기 활성제의 사용)으로서 사용될 수 있는 약제에서 사용될 수 있음이 인식될 것이다. 대안적으로, 본 발명에 따른 활성제는 질환의 치료, 개선, 또는 예방을 위해 알려진 치료법에 부가하여, 또는 이와 조합하여 사용될 수 있다.
본 발명의 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 특히 조성물이 사용되어야 하는 방식에 따라 다수의 상이한 형태를 갖는 조성물에 조합될 수 있다. 따라서, 예를 들면, 상기 조성물은 분말, 정제, 캡슐, 액체, 연고, 크림, 겔, 히드로겔, 에어로졸, 스프레이, 미셀 용액, 경피 패치, 리포좀 현탁액, 폴리플렉스, 에멀전, (표면에 RNA를 갖거나 캡슐화된) 지질 나노입자의 형태 또는 치료 또는 백신화를 필요로 하는 인간 또는 동물에게 투여될 수 있는 임의의 다른 적합한 형태일 수 있다. 본 발명에 따른 약제의 비히클은 이것이 제공되는 대상체에 의해 잘-용인되는 것이어야 함이 인식될 것이다.
본 발명의 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 또한 느린-또는 지연된-방출 장치 내에 통합될 수 있다. 이러한 장치는, 예를 들면, 피부 위 또는 아래에 삽입될 수 있고, 상기 약제는 수 주 또는 심지어 수 개월에 걸쳐 방출될 수 있다. 상기 장치는 치료 부위와 적어도 인접하게 위치될 수 있다. 이러한 장치는 상기 유전자 구조체 또는 재조합 벡터를 이용한 장기간의 치료가 필요하고 정상적으로 빈번한 투여(예컨대, 적어도 매일 주사)가 필요할 때 특히 유리할 수 있다.
그러나, 바람직한 구현예에서, 본 발명에 따른 약제는 혈류, 근육, 피부 내로, 또는 치료를 필요로 하는 부위 내로 직접 주사함으로써 대상체에게 투여될 수 있다. 가장 바람직하게는, 상기 RNA 구조체를 포함하는 약제는 근육 내로 주사된다. 주사는 정맥내(볼루스 또는 수액) 또는 피하(볼루스 또는 수액), 또는 진피내(볼루스 또는 수액), 또는 근육내(볼루스 또는 수액)일 수 있다.
필요한 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물의 양은 그 생물학적 활성 및 생체이용성에 의해 결정되고, 차례로 상기 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물의 투여 방식, 물리화학적 특성 및 단일치료법 또는 조합 치료법으로 사용되는지 여부에 의존할 것임이 인식될 것이다. 투여의 빈도는 또한 치료되는 대상체 내에서 상기 활성제의 반감기에 의해 영향을 받을 것이다. 투여되어야 하는 최적의 복용량은 본 기술분야의 기술자에 의해 결정될 수 있고, 사용되는 특정 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물, 약학적 조성물의 강도, 투여 방식, 및 바이러스 감염의 타입 및 진행에 따라 변할 것이다. 대상체의 연령, 중량, 성별, 식이, 및 투여 시간을 포함하는 치료되는 특정 대상체에 의존하는 부가적인 인자는 복용량을 조정할 필요성을 야기할 것이다.
일반적으로, 0.001 ㎍/㎏의 체중 내지 10 ㎎/㎏의 체중, 또는 0.01 ㎍/㎏의 체중 내지 1 ㎎/㎏의 체중의 본 발명의 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물의 일일 용량이 사용되는 활성제에 따라 질환을 치료, 개선, 또는 예방하기 위해 사용될 수 있다.
일일 용량은 단일 투여(예컨대 단일 일일 주사 또는 코 스프레이의 흡입)로서 제공될 수 있다. 대안적으로, 상기 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 하루 동안에 2회 이상의 투여가 필요할 수 있다. 예로서, 상기 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 0.07 ㎍ 내지 700 ㎎(즉, 70 ㎏의 체중으로 추정함)의 2회(또는 치료되는 질환의 증세에 따라 그 이상)의 일일 용량으로 투여될 수 있다. 치료를 받는 환자는 기상시 제1 용량을, 이후 저녁(2회 용량 요법이라면)에 또는 이후에 3 또는 4시간 간격으로 제2 용량을 취할 수 있다. 대안적으로, 느린 방출 장치는 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물을 반복된 용량을 투여할 필요 없이 환자에게 최적의 용량을 제공하기 위해 사용될 수 있다.
그러나, 바람직하게는 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 매주 용량, 보다 바람직하게는 격주 용량으로 제공될 수 있다.
제약 산업에 의해 종래 도입되는 것들과 같은 공지된 절차(예컨대 생체내 실험, 임상 시험, 등)는 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트 또는 벡터의 구체적인 제형 및 정확한 치료 요법(예컨대, 제제의 일일 용량 및 투여 빈도)을 형성하기 위해 사용될 수 있다.
"대상체"는 척추동물, 포유동물, 또는 가축일 수 있다. 따라서, 본 발명에 따른 조성물 및 약제는 임의의 포유동물, 예를 들면 가축(예컨대, 말), 애완동물을 치료하기 dnlgo 사용될 수 있거나, 다른 수의과 적용분야에 사용될 수 있다. 그러나, 가장 바람직하게는, 상기 대상체는 인간이다.
상기 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물의 "치료적 유효량"은 대상체에게 투여될 때 임의의 해당 질환을 개선, 예방 또는 치료하기 위해 필요한 전술한 양인 임의의 양이다.
예를 들면, 본 발명의 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 약 0.0001 ㎎ 내지 약 800 ㎎, 바람직하게는 약 0.001 ㎎ 내지 약 500 ㎎으로 사용될 수 있다. 상기 레플리콘, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물의 양은 약 0.01 ㎎ 내지 약 250 ㎎, 가장 바람직하게는 약 0.01 ㎎ 내지 약 1 ㎎의 양인 것이 바람직하다. 바람직하게는, 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 1-200 ㎍의 용량으로 투여된다.
본 명세서에서 나타낼 때, "약학적으로 허용가능한 비히클"은 약학적 조성물을 제형화하는데 유용하다고 본 기술분야의 기술자에게 알려진 임의의 공지된 화합물 또는 공지된 화합물들의 조합이다.
한 구현예에서, 상기 약학적으로 허용가능한 비히클은 고체일 수 있고, 상기 조성물은 분말 또는 정제의 형태일 수 있다. 고체인 약학적으로 허용가능한 비히클은 향미제, 윤활제, 가용화제, 현탁화제, 염료, 충진제, 활택제, 압착 보조제, 불활성 결합제, 감미제, 보존제, 염료, 코팅물, 또는 정제-붕괴제로서도 작용할 수 있는 하나 이상의 물질을 포함할 수 있다. 상기 비히클은 또한 캡슐화 물질일 수 있다. 분말에서, 상기 비히클은 본 발명에 따른 세분된 활성제와 혼합되는 세분된 고체이다. 정제에서, 상기 활성제(예컨대, 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물)는 필요한 압착 특성을 갖는 비히클과 적합한 비율로 혼합되고 원하는 형태 및 크기로 압착될 수 있다. 상기 분말 및 정제는 바람직하게는 상기 활성제를 최대 99%까지 함유한다. 적합한 고체 비히클은, 예를 들면 인산칼슘, 스테아르산마그네슘, 탈크, 당, 락토오스, 덱스트린, 전분, 젤라틴, 셀룰로오스, 폴리비닐피롤리돈, 저융점 왁스 및 이온 교환 수지를 포함한다. 다른 구현예에서, 상기 약학적 비히클은 겔일 수 있고, 상기 조성물은 크림 등의 형태일 수 있다.
그러나, 상기 약학적 비히클은 액체일 수 있고, 상기 약학적 조성물은 용액의 형태이다. 액체 비히클은 용액, 현탁액, 에멀전, 시럽, 엘릭시르 및 가압 조성물의 제조에 사용된다. 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 약학적으로 허용가능한 액체 비히클, 예컨대 물, 유기 용매, 이들의 혼합물 또는 약학적으로 허용가능한 오일 또는 지방에 용해 또는 현탁될 수 있다. 상기 액체 비히클은 다른 적합한 약학적 첨가제, 예컨대 가용화제, 에멀전화제, 버퍼, 보존제, 감미제, 향미제, 현탁화제, 증점제, 착색제, 점도 조절제, 안정화제 또는 삼투-조절제를 함유할 수 있다. 경구 및 비경구 투여를 위한 액체 비히클의 적합한 예는 (상기와 같은 첨가제, 예컨대 셀룰로오스 유도체, 바람직하게는 나트륨 카르복시메틸 셀룰로오스 용액을 부분적으로 함유하는) 물, (1가 알코올 및 다가 알코올, 예컨대 글리콜을 포함하는) 알코올 및 그의 유도체, 및 오일(예컨대 분획화된 코코넛 오일 및 아라키스 오일)을 포함한다. 비경구 투여의 경우, 상기 비히클은 또한 오일성 에스테르, 예컨대 에틸 올레에이트 및 이소프로필 미리스테이트일 수 있다. 멸균 액체 비히클은 비경구 투여용 멸균 액체 형태 조성물에서 유용하다. 가압 조성물을 위한 액체 비히클은 할로겐화 탄화수소 또는 다른 약학적으로 허용가능한 추진제(propellant)일 수 있다.
멸균 용액 또는 현탁액인 액체 약학적 조성물은, 예를 들면, 피하, 진피내, 척추강내, 경막외, 복강내, 정맥내, 특히 근육내 주사에 의해 이용될 수 있다. 본 발명의 핵산 서열, 또는 발현 카세트는 멸균수, 식염수, 또는 다른 적절한 멸균 주사가능한 매체를 이용하여 투여 시점에 용해 또는 현탁될 수 있는 멸균 고체 조성물로서 제조될 수 있다.
본 발명의 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 다른 용질 또는 현탁화제(예를 들면, 용액이 등장이 되게 하기에 충분한 식염수 또는 글루코오스), 담즙 염, 아카시아, 젤라틴, 소르비탄 모노올레이이트, 폴리소르베이트 80(소르비톨의 올레에이트 에스테르 및 에틸렌 옥사이드로 공중합된 그의 무수물) 등을 함유하는 멸균 용액 또는 현탁액의 형태로 경구 투여될 수 있다. 본 발명에 따른 RNA 구조체, 핵산 서열, 발현 카세트, 벡터 또는 약학적 조성물은 또한 액체 또는 고체 조성물 형태로 경구 투여될 수 있다. 경구 투여용으로 적합한 조성물은 알약, 캡슐, 과립, 정제, 및 분말과 같은 고체 형태, 및 용액, 시럽, 엘릭시르, 및 현탁액과 같은 액체 형태를 포함한다. 비경구 투여를 위해 유용한 형태는 멸균 용액, 에멀전, 및 현탁액을 포함한다.
본 발명은 그의 변이체 또는 단편을 포함하여 실질적으로 본 명세서에 나타낸 임의의 서열의 아미노산 또는 핵산 서열을 포함하는 임의의 핵산 또는 펩티드 또는 그의 변이체, 유도체 또는 유사체로 확장됨이 인식될 것이다. 용어 "실질적으로 아미노산/뉴클레오티드/펩티드 서열", "변이체" 및 "단편"은 본 명세서에 나타낸 어느 한 서열의 아미노산/뉴클레오티드/펩티드 서열과 적어도 40% 서열 동일성(identity), 예를 들면 본명세서에서 확인된 임의의 서열과 40% 동일성을 갖는 서열일 수 있다.
나타낸 임의의 서열과 65% 이상, 보다 바람직하게는 70% 이상, 보다 더 바람직하게는 75% 이상의 서열 동일성, 보다 더 바람직하게는 80% 이상의 서열 동일성을 갖는 아미노산/폴리뉴클레오티드/폴리펩티드 서열이 또한 예측된다. 바람직하게는, 상기 아미노산/폴리뉴클레오티드/폴리펩티드 서열은 나타낸 임의의 서열과 적어도 85% 동일성, 보다 바람직하게는 적어도 90% 동일성, 보다 더 바람직하게는 적어도 92% 동일성, 보다 더 바람직하게는 적어도 95% 동일성, 보다 더 바람직하게는 적어도 97% 동일성, 보다 더 바람직하게는 적어도 98% 동일성, 가장 바람직하게는 본 명세서에 나타낸 임의의 서열과 적어도 99% 동일성을 갖는다.
숙련된 기술자는 2개의 아미노산/폴리뉴클레오티드/폴리펩티드 서열 사이에 동일성 백분율을 계산하는 방법을 인식할 것이다. 2개의 아미노산/폴리뉴클레오티드/폴리펩티드 서열 사이의 동일성 백분율을 계산하기 위하여, 상기 2개의 서열의 정렬이 먼저 준비되어야 하고, 이어서 서열 동일성 값을 계산한다. 2개의 서열에 대한 동일성 백분율은 다음에 따라 상이한 값을 취할 수 있다: (ⅰ) 서열을 정렬하기 위해 사용된 방법, 예를 들면, ClustalW, BLAST, FASTA, 스미스-워터맨(상이한 프로그램에서 실행됨), 또는 3D 비교로부터의 구조적 정렬; 및 (ⅱ) 정렬 방법에 의해 사용된 파라미터, 예를 들면, 국소 대 전체 정렬, 사용된 쌍별-점수 매트릭스(예컨대 BLOSUM62, PAM250, Gonnet 등), 및 갭-페널티, 예컨대 기능적 형태 및 상수.
정렬을 만든 후, 2개의 서열 사이에 동일성 백분율을 계산하는 많은 다양한 방식이 있다. 예를 들면, 동일성의 수를 다음에 의해 나눌 수 있다: (ⅰ) 가장 짧은 서열의 길이; (ⅱ) 정렬의 길이; (ⅲ) 서열의 평균 길이; (ⅳ) 비-갭 위치의 수; 또는 (ⅴ) 돌출부(overhang)를 배제한 등가물 위치의 수. 아울러, 동일성 백분율은 또한 매우 길이 의존성임이 인식될 것이다. 따라서, 서열의 쌍이 짧을수록 더 높은 서열 동일성이 우연히 일어날 것으로 예상할 수 있다.
그러므로, 단백질 또는 DNA 서열의 정확한 정렬은 복잡한 공정임이 인식될 것이다. 대중적인 다중 정렬 프로그램 ClustalW(Thompson et al., 1994, Nucleic Acids Research, 22, 4673-4680; Thompson et al., 1997, Nucleic Acids Research, 24, 4876-4882)는 본 발명에 따른 단백질 또는 DNA의 다중 정렬을 생성하기 위한 바람직한 방법이다. ClustalW에 대한 적합한 파라미터는 다음과 같을 수 있다: DNA 정렬의 경우: 갭 개방 페널티 = 15.0, 갭 연장 패널티 = 6.66, 및 매트릭스 = 동일성. 단백질 정렬의 경우: 갭 개방 페널티 = 10.0, 갭 연장 페널티 = 0.2, 및 매트릭스 = Gonnet. DNA 및 단백질 정렬의 경우: ENDGAP = -1, 및 GAPDIST = 4. 본 기술분야의 기술자는 최적의 서열 정렬을 위해서는 상기 및 다른 파라미터들을 변화시킬 필요가 있을 수 있음을 인식할 것이다.
바람직하게는, 2개의 아미노산/폴리뉴클레오티드/폴리펩티드 서열 사이의 동일성 백분율의 계산은 이후 이러한 정렬로부터 (N/T)×100으로서 계산될 수 있으며, 여기서 N은 서열이 동일한 잔기를 공유하는 위치의 수이고, T는 갭을 포함하고 돌출부를 포함하거나 배제한 비교된 위치의 총 수이다. 바람직하게는, 돌출부는 상기 계산에 포함된다. 따라서, 2개의 서열 사이의 동일성 백분율을 계산하기 위한 가장 바람직한 방법은 (ⅰ) 예를 들면, 상기에 나타낸 것과 같은 적합한 세트의 파라미터를 이용하여 ClustalW 프로그램을 이용하여 서열 정렬을 준비하는 단계; 및 (ⅱ) N 및 T의 값을 다음의 식: 서열 동일성 = (N/T)×100 내로 삽입하는 단계를 포함한다.
유사한 서열을 확인하기 위한 대안적인 방법은 본 기술분야의 기술자에게 알려질 것이다. 예를 들면, 실질적으로 유사한 뉴클레오티드 서열은 엄격한 조건 하에서 DNA 서열 또는 그의 상보체(complement)와 교잡하는 서열에 의해 암호화될 것이다. 엄격한 조건이란 것은, 본 발명자들은 뉴클레오티드가 대략 45℃에서 3× 염화나트륨/시트르산나트륨(SSC)과, 이어서 대략 20-65℃에서 0.2× SSC/0.1% SDS에서 적어도 1회의 세척 후 필터-결합된 DNA 또는 RNA에 혼성화하는 것을 의미한다. 대안적으로, 실질적으로 유사한 폴리펩티드는 본 명세서에 확인된 임의의 서열과 적어도 1개이지만, 5, 10, 20, 50 또는 100개 미만의 아미노산이 상이할 수 있다.
유전자 코드의 축퇴(degeneracy)로 인하여, 본 명세서에 기술된 임의의 핵산 서열은 이것에 의해 암호화된 단백질의 서열에 실질적으로 영향을 미치지 않으면서 다양해지거나 변화되어 그의 기능적 변이체를 제공할 수 있음이 명확하다. 적합한 뉴클레오티드 변이체는 서열 내에서 동일한 아미노산을 암호화하여 침묵적(동의적) 변화를 생산하는 상이한 코돈의 치환에 의해 변경된 서열을 갖는 것들이다. 다른 적합한 변이체는 상동성 뉴클레오티드 서열을 갖지만 치환되는 아미노산과 유사한 생물물리학적 특성의 측쇄를 갖는 아미노산을 암호화하는 상이한 코돈의 치환에 의해 변경되어 보존된 변화를 생산하는 서열의 전부 또는 일부를 포함하는 것들이다. 예를 들면, 작은 비극성의 소수성 아미노산은 글리신, 알라닌, 루이신, 이소루이신, 발린, 프롤린, 및 메티오닝을 포함한다. 큰 비극성의 소수성 아미노산은 페닐알라닌, 트립토판 및 티로신을 포함한다. 극성의 중성 아미노산은 세린, 트레오닌, 시스테인, 아스파라긴 및 글루타민을 포함한다. 양을 대전된(염기성) 아미노산은 리신, 아르기닌 및 히스티딘을 포함한다. 음으로 대전된(산성) 아미노산은 아스파르트산 및 글루탐산을 포함한다. 따라서, 어느 아미노산이 유사한 생물물리학적 특성을 갖는 아미노산으로 교체될 수 있음이 인식될 것이고, 숙련된 기술자는 상기 아미노산을 암호화하는 뉴클레오티드 서열을 알 것이다.
(임의의 부속 청구항, 요약 및 도면을 포함하는) 본 명세서에 기술된 모든 특성들 및/또는 여기에 개시된 임의의 방법 또는 공정의 모든 단계들은, 이러한 특성 및/또는 단계의 적어도 일부가 상호 배타적인 조합을 제외하고는, 임의의 조합으로 임의의 상기 측면들과 조합될 수 있다.
본 발명을 보다 잘 이해하고, 본 발명의 구현예가 효과를 내기 위해 수행될 수 있는 방식을 보이기 위하여, 이제 실시예로서 부속하는 도면에 대한 참조가 행해질 것이다:
도 1은 본 발명의 RNA 구조체의 다양한 구현예(1-7로 나타냄)의 개략도를 보여준다(예컨대, 왼쪽의 saRNA 레플리콘, 또는 mRNA 구조체). 상기 saRNA 레플리콘(1-4)은 알파 바이러스 백본에 기반하고 있다. 상기 소위 '스틸티콘' 벡터는 5' UTR과, 이어지는 알파바이러스, 예컨대 VEEV로부터의 비-구조 단백질(NSP1-4)을 암호화하는 핵산, 서브게놈성 프로모터(SGP), GOI(관심있는 유전자), 예컨대 바이러스, 박테리아, 진균 또는 포유동물 단백질 또는 항원, 비-바이러스 선천적 조절 단백질(IMP), 3' UTR 및 폴리A 꼬리를 포함한다. 상기 mRNA 구조체(5-7)는 5' UTR, GOI(관심있는 유전자), 예컨대 바이러스, 박테리아, 진균 또는 포유동물 단백질 또는 항원, 비-바이러스 선천적 조절 단백질(IMP), 3' UTR 및 3' 폴리A 꼬리를 포함한다. 상기 IMP 및 GOI의 순서는 상이한 실증적 구현예에 나타낸 것과 같이 saRNA 및 mRNA 모두에 대해 다양할 수 있다;
도 2는 메신저 RNA(mRNA) 백신으로 백신화된 대상체(최초 프라이머(primer) 예방접종(jab)과, 이어지는 후속 부스트(boost) 예방접종)에서의 면역 반응을 실증한다;
도 3은 표준 자가-증폭형 (saRNA) 백신으로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 면역 반응을 실증한다;
도 4는 본 발명의 RNA 구조체의 한 구현예, 예를 들면 도 1에 나타낸 스틸티콘 벡터로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 면역 반응을 실증한다;
도 5는 본 발명의 RNA 구조체의 한 구현예, 즉 도 1에 나타낸 스틸티콘 벡터로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 항원 발현 레벨을 실증한다;
도 6은 F-T2A 형상(configuration)에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 7은 F-T2A 형상에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 8은 F-T2A 형상에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 9는 이중 서브게놈성 프로모터(DSGP) 형상에서 IMP, HSP 90 CDC37을 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 10은 F-T2A 형상에서 IMP가 없는 saRNA와 비교하여 IMP를 함유하는 saRNA로 형질감염된 이후 HeLa 세포에서 생성된 VEGF-A 발현에서의 증가를 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 분비된 리포터 단백질로서 VEGF-A를 함유하는 RNA(100 ng)로 형질감염되었고, 48시간 후 ELISA에 의해 배양 배지에서 단백질 발현에 대해 평가하였다; 및
도 11은 F-T2A 형상에서 IMP를 함유하는 RNA로 형질감염된 이후 HeLa 세포에서의 n-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 RNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다.
도 1은 본 발명의 RNA 구조체의 다양한 구현예(1-7로 나타냄)의 개략도를 보여준다(예컨대, 왼쪽의 saRNA 레플리콘, 또는 mRNA 구조체). 상기 saRNA 레플리콘(1-4)은 알파 바이러스 백본에 기반하고 있다. 상기 소위 '스틸티콘' 벡터는 5' UTR과, 이어지는 알파바이러스, 예컨대 VEEV로부터의 비-구조 단백질(NSP1-4)을 암호화하는 핵산, 서브게놈성 프로모터(SGP), GOI(관심있는 유전자), 예컨대 바이러스, 박테리아, 진균 또는 포유동물 단백질 또는 항원, 비-바이러스 선천적 조절 단백질(IMP), 3' UTR 및 폴리A 꼬리를 포함한다. 상기 mRNA 구조체(5-7)는 5' UTR, GOI(관심있는 유전자), 예컨대 바이러스, 박테리아, 진균 또는 포유동물 단백질 또는 항원, 비-바이러스 선천적 조절 단백질(IMP), 3' UTR 및 3' 폴리A 꼬리를 포함한다. 상기 IMP 및 GOI의 순서는 상이한 실증적 구현예에 나타낸 것과 같이 saRNA 및 mRNA 모두에 대해 다양할 수 있다;
도 2는 메신저 RNA(mRNA) 백신으로 백신화된 대상체(최초 프라이머(primer) 예방접종(jab)과, 이어지는 후속 부스트(boost) 예방접종)에서의 면역 반응을 실증한다;
도 3은 표준 자가-증폭형 (saRNA) 백신으로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 면역 반응을 실증한다;
도 4는 본 발명의 RNA 구조체의 한 구현예, 예를 들면 도 1에 나타낸 스틸티콘 벡터로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 면역 반응을 실증한다;
도 5는 본 발명의 RNA 구조체의 한 구현예, 즉 도 1에 나타낸 스틸티콘 벡터로 백신화된 대상체(최초 프라이머 예방접종과, 이어지는 부스트 예방접종)에서의 항원 발현 레벨을 실증한다;
도 6은 F-T2A 형상(configuration)에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 7은 F-T2A 형상에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 8은 F-T2A 형상에서 선택된 IMP를 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 9는 이중 서브게놈성 프로모터(DSGP) 형상에서 IMP, HSP 90 CDC37을 함유하는 VEEV 레플리콘으로 형질감염된 이후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다;
도 10은 F-T2A 형상에서 IMP가 없는 saRNA와 비교하여 IMP를 함유하는 saRNA로 형질감염된 이후 HeLa 세포에서 생성된 VEGF-A 발현에서의 증가를 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 분비된 리포터 단백질로서 VEGF-A를 함유하는 RNA(100 ng)로 형질감염되었고, 48시간 후 ELISA에 의해 배양 배지에서 단백질 발현에 대해 평가하였다; 및
도 11은 F-T2A 형상에서 IMP를 함유하는 RNA로 형질감염된 이후 HeLa 세포에서의 n-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 RNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다.
실시예
본 발명자들은 saRNA 또는 mRNA의 선천적 인식을 억제하는 것으로 알려져 있는 비-바이러스 공급원, 예컨대 인간 및 다른 포유동물 유래의 시스 암호화 단백질이 숙주 세포에서 선천적 감지를 약화시키고 RNA 백신의 단백질 발현 및 면역원성 모두를 향상시킬 것임을 가정하였다. 따라서, 본 발명자들은 선천적 조절 단백질(IMP) 및 관심있는 유전자(GOI)를 함유하는 다양한 RNA 구조체(saRNA 및 mRNA)를 디자인 및 테스트한 후, 상기 구조체가 (관심있는 유전자에 의해 암호화된) 세포내 및 분비된 단백질 발현 모두를 향상시키는지 여부를 특징분석하였다.
재료 및 방법
IMP를 함유하는 saRNA 레플리콘 플라스미드의 클로닝
반딧불이 루시퍼라아제(fLuc) 및 베네주엘라 말 뇌염 바이러스(VEEV)로부터 유래되는 레플리카아제를 암호화하는 saRNA를 종래 기술된 것과 같이 플라스미드 벡터 내로 클로닝하였다(1). IMP(반딧불이 루시퍼라아제 f-Luc; Uniprot: Q27758)에 이어서 리포터 유전자를 함유하는 레플리콘 플라스미드를 푸린-T2A 또는 이중 서브게놈성 프로모터를 이용해 생성하였다. 이중 서브게놈성(DSG) 구조체는 상기 fLuc 및 IMP를 암호화하는 별도의 RNA 분자의 전사를 시작하도록 디자인하였고, Gibson 조립체 및 뉴클레오티드 염기 중첩부를 이용해 기본 이중 서브게놈성 벡터 내로 클로닝함으로써 생산하였다. 간략하게, 플라스미드 DNA를 37℃에서 2시간 동안 제한 소화시켰고, 제조사의 프로토콜(New England BioLabs, UK)에 따라 GeneArt(Regensburg, Germany) 또는 Integrated DNA Technologies(IDT)(Iowa, USA)에 의해 합성된 유전자 단편 스트링(string)을 이용한 NEB Builder HiFi DNA 조립 반응물에 사용하였다. fLuc 번역을 위하여 종결 코돈 없이 VEEV 일차적(primary) 서브게놈성 프로모터로부터 단일 RNA 전사체를 생성하기 위해 디자인된 푸린-T2A(F-T2A) 구조체는 F-T2A 서열을 갖는 IMP를 대응하는 DSG 플라스미드 벡터의 제한 효소 부위 내로 클로닝함으로써 생산하였다. 50℃에서 30분 동안 인큐베이션한 후, 2 ㎕의 NEB Builder HiFi 조립 반응물을 사용하여 NEB 10-알파 박테리아를 형질전환시켰고, 상기 형질전환체를 LB 아가 플레이트에 플레이팅한 후, 밤새 인큐베이션하였다. 콜로니(colony)를 선택하였고, 밤새 확장시켰고, 퀴아젠 플라스미드 미니프렙 키트(Qiagen, UK)를 이용해 재조합 플라스미드를 정제하였다. 정제된 클론성 플라스미드를 진단적 제한 효소 소화를 이용해 분석하였고, 올바른 소화 패턴을 나타내는 것을 완전히 시퀀싱하여 뉴클레오티드 동일성을 확인하였다(Eurofins, Germany).
통합된 인터페론 억제 단백질(IMP)은 다음의 데이터베이스 식별자/접근 번호를 이용해 발견될 수 있다:
IRF1 DBD(1-164) - NCBI 참조 서열: NM_002198.3, UniProtKB - P10914(IRF1_HUMAN); IRF3(191-427) - NCBI 참조 서열: NM_001571.6, UniProtKB - Q14653(IRF3_HUMAN); IRF7(238-503) - NCBI 참조 서열: NM_001572.5, UniProtKB - Q92985(IRF7_HUMAN); IRF9(142-393), IRF4(1-129) - NCBI 참조 서열: NM_002460.4, UniProtKB - Q15306(IRF4_HUMAN); IRF5 A68P - NCBI 참조 서열: NM_032643.5, UniProtKB - Q13568(IRF5_HUMAN); STAT2(133-315) - NCBI 참조 서열: NM_005419.4, UniProtKB - P52630(STAT2_HUMAN); HSP90(CDC37)(1-232) - NCBI 참조 서열: NM_007065.4, UniProtKB - Q16543(CDC37_HUMAN); STING-베타 - 진뱅크: MF360993.1, UniProtKB - A0A3G1PSE3(A0A3G1PSE3_HUMAN); A20 또는 TNFAIP3(369-775), A20 또는 TNFAIP3(606-790) NCBI 참조 서열: NM_006290.4, UniProtKB - P21580(TNAP3_HUMAN); MFN2(369-598) - NCBI 참조 서열: NM_001127660.2, UniProtKB - O95140(MFN2_HUMAN); TARBP2(1-234) - NCBI 참조 서열: NM_134323.2, UniProtKB - Q15633(TRBP2_HUMAN); 징크 핑거 AVP(1-200) - NCBI 참조 서열: NM_020119.4, UniProtKB - Q7Z2W4(ZCCHV_HUMAN); PKR dsRNA BD(1-170) - NCBI 참조 서열: NM_002759.4, UniProtKB - P19525(E2AK2_HUMAN); PACT PRKRA DBD(1-194) - NCBI 참조 서열: NM_003690.5, UniProtKB - O75569(PRKRA_HUMAN); ARL5B - NCBI 참조 서열: NM_178815.5, UniProtKB - Q96KC2(ARL5B_HUMAN); ARL16 - NCBI 참조 서열: NM_001040025.3, UniProtKB - Q0P5N6(ARL16_HUMAN), TRIM35 - NCBI 참조 서열 NM_171982.4, UniProtKB - Q9UPQ4(TRI35_HUMAN).
(1) A. K. Blakney, P. F. McKay, R. J. Shattock, Structural Components for Amplification of Positive and Negative Strand VEEV Splitzicons. Frontiers in Molecular Biosciences 5, 71 (2018).
RNA 전사를 위한 IMP를 함유하는 플라스미드의 클로닝
제한 소화와, 이어서 뉴클레오티드 염기 중첩부 영역을 갖고 F-T2A 서열을 포함하는 Gibson 조립체를 이용함으로써 IMP를 n-Luc과, 이어서 IMP의 단일 전사체 발현을 가능하게 하는 기본 플라스미드 내로 삽입하였다. 상기 기본 플라스미드는 T7 프로모터, 알파-글로빈 5' UTR 및 베타-글로빈 3' UTR을 갖는 발광성 새우 나노루시퍼라아제(n-Luc) 발현 카세트를 암호화하는 mRNA로 이루어졌다. 간략하게, 상기 n-Luc 플라스미드 구조체를 제한 효소를 이용해 37℃에서 2시간 동안 선형화한 후, NEB Builder HiFi 조립체 프로토콜(New England BioLabs, UK)에 본질적으로 기술된 것과 같은 NEB Builder HiFi DNA 조립 반응물에서 사용하였다. 50℃에서 30분 동안 인큐베이션한 후, 2 ㎕의 조립 반응물을 사용하여 프로토콜에 따라 NEB 10-알파 박테리아를 형질전환시켰고, 상기 형질전환체를 LB 아가 플레이트에 플레이팅하였고, 콜로니 성장을 위해 밤새 인큐베이션하였다. 콜로니를 선택하였고, 밤새 확장하였고, 재조합 플라스미드를 퀴아젠 플라스미드 미니프렙 키트(Qiagen, UK)를 이용해 박테리아로부터 정제하였고, 정제된 클론성 플라스미드를 처음에 진단적 제한 효소 소화물을 이용해 분석하였고, 올바른 소화 패턴을 나타내는 것을 완전히 시퀀싱하여 뉴클레오티드 동일성을 확인하였다(Eurofins, Germany).
saRNA의 시험관내 전사
플라스미드 DNA(pDNA)를 대장균(E. coli)(New England BioLabs, UK) 내로 형질전환시켰고, 100 ㎍/㎖의 카르베니실린(Sigma Aldrich, UK)을 갖는 100 ㎖의 Luria Broth(LB)에서 배양하였다. pDNA를 플라스미드 플러스 맥시프렙 키트(QIAGEN, UK)를 이용해 단리하였고, 최종 농도를 NanoDrop One(ThermoFisher, UK)에서 측정하였다. CleanCap Reagent AG(Tebu-bio, France)를 이용해 pDNA 주형으로부터 saRNA를 전사하여 자연 발생형 Cap 1 구조를 갖는 RNA 전사체를 생산하였다. 간략하게, 상기 pDNA 주형을 37℃에서 3시간 동안 선형화시킨 후, 1 ㎍의 선형화된 pDNA 주형을 제조사의 프로토콜에 따라 표준 CleanCap 전사 프로토콜(Tebu-bio, France)에 사용하였다. -20℃에서 적어도 30분 동안 LiCl 침전함으로써 전사체를 정제하였고, 20,000 g로 4℃에서 20분 동안 원심분리하여 RNA를 펠렛화하였고, 70% EtOH로 1회 세정하였고, 다시 20,000 g로 4℃에서 5분 동안 원심분리하였고, UltraPure H2O(Ambion, UK)에 재부유시켰고, 이후 사용할 때까지 -80℃에 보관하였다.
RNA의 시험관내 전사
pDNA를 대장균(New England BioLabs, UK) 내로 형질전환시켰고, 100 ㎍/㎖의 카르베니실린(Sigma Aldrich, UK)을 갖는 100 ㎖의 Luria Broth(LB)에서 배양하였다. 플라스미드 플러스 맥시프렙 키트(QIAGEN, UK)를 이용해 플라스미드를 정제하였고, 농도 및 순도를 NanoDrop One(ThermoFisher, UK)으로 측정하였다. MEGAscript™ T7 전사 프로토콜(ThermoFisher, UK)을 이용하고, 이어서 ScriptCap™ m7G 캡핑 시스템 포스트 번역(Cambio, UK)에 의해 상기 플라스미드 DNA 주형으로부터 RNA를 전사하였다. 간략하게, pDNA를 37℃에서 3시간 동안 선형화하였고, 1 ㎍의 선형화된 pDNA 주형을 표준 반응 프로토콜에 사용하였다. MEGAscript™ T7 전사 후, 상기 전사체를 -20℃에서 적어도 30분 동안 LiCl 침전에 의해 정제한 후, 20,000 g로 4℃에서 20분 동안 원심분리하여 RNA를 펠렛화하였고, 70% EtOH로 1회 세정하였고, 다시 20,000 g로 4℃에서 5분 동안 원심분리하였고, UltraPure H2O(Ambion, UK)에 재부유시켰다. 이후, 상기 전사체를 ScriptCap™ m7G 캡핑 시스템 표준 프로토콜을 이용해 전사-후 캡핑시켰고, 마지막으로 상기 기술된 것과 같이 LiCl 침전시켰다. 이후, 정제되고 Cap 1 캡핑된 RNA를 UltraPure H2O(Ambion, UK)에 재부유시켰고, 이후 사용할 때까지 -80℃에 보관하였다.
IMP 활성의 측정
바이러스 IMP를 함유하는 saRNA가 IMP가 없는 saRNA에 대비하여 saRNA f-luc 발현을 증가시키는 능력; IMP를 함유하는 mRNA가 IMP가 없는 mRNA에 대비하여 mRNA n-luc 발현을 증가시키는 능력, 및 IMP를 함유하는 mRNA가 IMP가 없는 saRNA로부터 f-luc 발현을 증가시키는 능력을 확립하기 위하여, 구조체를 인터페론 적격(competent) HeLa 세포에서 테스트하였고, 기능적 항-바이러스 신호전달 경로를 갖지 않는 HEK293T/17 세포에서 얻어진 경우와 그 발현을 비교하였다. 양쪽 세포주 모두 10%(v/v) 소 태아 혈청(FBS), 5 ㎎/㎖ L-글루타민(Gibco, ThermoFisher, UK) 및 5 ㎎/㎖ 페니실린/스트렙토마이신(Sigma-Aldrich, Merck, UK)을 함유하는 높은 글루코오스의 둘베코 변형 이글 배지(cDMEM)(Sigma-Aldrich, Merck, UK)에서 배양하였다.
saRNA 반딧불이 루시퍼라아제(f-Luc) 발현에 대한 IMP의 평가
평평한 투명 바닥 96-웰 플레이트(Corning Costar) 내로 HEK293T/17 세포를 웰 당 25,000 세포의 밀도로, 그리고 HeLa 세포를 웰 당 10,000 세포의 밀도로 플레이팅한 후, 24시간 동안 인큐베이션하였다. 0.15 ㎕의 리포펙타민(lipofectamine) MessengerMAX(ThermoFisher, UK) 및 100 ng의 saRNA IMP 구조체 또는 saRNA 대조군(IMP 없음)을 함유하는 10 ㎕의 OptiMEM(ThermoFisher, UK)을 웰에 삼중으로 첨가하였고, 추가로 24시간 후, 플레이트를 630 g로 실온에서 5분 동안 원심분리하였고, 50 ㎕의 배지를 각각의 웰로부터 제거하였고, 50 ㎕의 ONE-Glo™ Ex Reagent D-루시페린 시약(Promega, UK)을 첨가하였고, 피펫팅(pipetting)에 의해 혼합하였다. 이후, 각각의 웰로부터의 총 부피를 평평 바닥 불투명 백색 96-웰 플레이트(Corning Costar)로 전달하였고, 10분 이내에 FLUOstar OMEGA 플레이트 판독기(BMG LABTECH, UK)에서 형광을 측정하였다. saRNA를 함유하지 않은 대조군 웰로부터의 배경 형광을 saRNA를 함유하는 각각의 웰에 대한 신호로부터 뺐다. 이후, HeLa 세포에서 IMP를 함유하는 saRNA에 대해 수득한 신호를 대조군 saRNA를 이용해 수득된 신호 및 HEK293T/17 세포에서 수득된 신호로부터의 배수 변화로 표현하였다.
RNA 나노-루시퍼라아제(n-luc) 발현에 대한 IMP의 평가
평평한 투명 바닥 96-웰 플레이트(Corning Costar) 내로 HEK293T/17 세포를 웰 당 25,000 세포의 밀도로, 그리고 HeLa 세포를 웰 당 10,000 세포의 밀도로 플레이팅한 후, 24시간 동안 인큐베이션하였다. 0.15 ㎕의 리포펙타민 및 100 ng의 saRNA IMP 구조체 또는 saRNA 대조군(IMP 없음)을 함유하는 10 ㎕의 OptiMEM(ThermoFisher, UK)을 웰에 삼중으로 첨가하였고, 추가로 24시간 후, 플레이트를 630 g로 실온에서 5분 동안 원심분리하였고, 50 ㎕의 배지를 각각의 웰로부터 제거하였고, 50 ㎕의 NanoDLR™ Stop & Glo® 시약(Promega, UK)을 첨가하였고, 피펫팅에 의해 혼합하였다. 이후, 각각의 웰로부터의 총 부피를 평평 바닥 불투명 백색 96-웰 플레이트(Corning Costar)로 전달하였고, 10분 이내에 FLUOstar® OMEGA 플레이트 판독기(BMG LABTECH, UK)에서 형광을 측정하였다. RNA를 함유하지 않은 대조군 웰로부터의 배경 형광을 RNA를 함유하는 각각의 웰에 대한 신호로부터 뺐다. 이후, HeLa 세포에서 IMP를 함유하는 RNA에 대해 수득한 신호를 대조군 RNA를 이용해 수득한 신호 및 HEK293T/17 세포에서 수득한 신호로부터의 배수 변화로 표현하였다.
saRNA VEGF-A 발현에 대한 IMP의 평가
HEK293T/17 또는 Hela 세포를 f-Luc를 발현하는 구조체의 테스트에 대해 기술된 것과 동일한 방법을 이용하여 VEGF-A 유전자를 함유하는 100 ng의 saRNA로 형질감염시켰다. 48시간 후, 세포 배양 배지 내의 VEGF-A를 인간 VEGF-A ELISA 키트(Invitrogen, UK)를 이용해 측정하였다. 간략하게, 분석 플레이트 웰을 400 ㎕의 세척 버퍼로 2회 세척한 후, 테스트 샘플 또는 VEGF-A 표준물(15.6 pg/㎖ 내지 1,000 pg/㎖)을 첨가하였다. 이후, 플레이트를 마이크로플레이트 교반기(300 rpm; Jencons Scientific Ltd, UK)에서 실온에서 2시간 동안 인큐베이션한 후, 400 ㎕의 세척 버퍼로 6회 세척하였고, 100 ㎕의 바이오틴-접합체 검출 항체(1:100 희석)를 각각의 웰에 첨가하였고, 플레이트를 마이크로플레이트 교반기(실온에서 1시간, 300 rpm)에서 인큐베이션하였다. 400 ㎕의 세척 버퍼로 6회 세척한 후, 스트렙트아비딘-HRP(1:100 희석) 제2 층 접합체(100 ㎕)를 첨가하였고, 이후 추가로 1시간 인큐베이션하였고, 6회 추가로 세척하였고, 100 ㎕의 TMB 기질을 각각의 웰에 첨가하였다. 암소에서 실온에서 30분 동안 인큐베이션한 후, 100 ㎕의 중단 용액을 첨가하였고, 각각의 웰의 흡광도를 VersaMax 마이크로플레이트 분광광도계(Molecular Devices, UK)로 450 nm에서 판독하였다. 표준 곡선에 내삽(interpolation)함으로써 샘플 내의 VEGF-A 레벨을 결정하였다.
실시예 1 - 선천적 조절 단백질(IMP) 구조체의 구조 디자인
인간 선천적 조절 단백질(IMP)은 자가-증폭형 RNA(saRNA) 또는 메신저 RNA(mRNA) 시스템일 수 있는 본 발명의 RNA 구조체 내로 통합되어서, 단백질 발현 및 번역, 즉 임의의 치료 생체분자일 수 있는 관심있는 유전자(GOI)에 의해 암호화되는 단백질을 변형 또는 감소시킬 수 있는 선천적 인식 및 반응을 감소 또는 제거할 수 있다.
본 발명의 RNA 구조체에 대한 디자인 형상의 다양한 구현예는 도 1에 나타나 있다. saRNA 발현 구조체는 알파바이러스 백본을 기반으로 하며, 여기서 비-구조 단백질은 유지되지만, 관심있는 유전자(GOI)는 서브게놈성 프로모터(SGP)의 하류에 삽입되어 바이러스의 구조 유전자를 교체한다(도 1에서 구현예 "1" 참조). 상기 GOI는 임의의 단백질일 수 있으며, 바이러스, 박테리아, 진균 또는 포유동물 단백질, 즉 생물치료제 단백질일 수 있다. 그러나, 본 발명자들은 본 발명의 RNA 구조체가 백신 공간에서 현저한 유용성을 실증할 것으로 예측하며, 따라서 상기 GOI는 백신 항원, 예컨대 바이러스, 박테리아 또는 진균 단백질, 예컨대 코트 단백질을 암호화할 것이다.
saRNA 구조체(도 1의 좌측)
임의의 IMP가 다음의 디자인 접근법을 이용하여 saRNA 내에 암호화될 수 있다:
- 도 1에서 구현예 "2a"는 펩티드 절단 모티프(motif)(예컨대, 푸린-T2A)를 포함하는 융합 단백질을 암호화하는 saRNA 구조체를 보여주며, GOI(예컨대, 관심있는 항원) 및 IMP에 의해 암호화되는 단백질은 숙주 세포에서 번역시 별개의 단백질로 절단된다;
- 도 1의 구현예 "2b"에서, GOI 및 IMP의 순서가 반대이며, IMP는 GOI의 5'에 있고, 다시 상기 IMP와 GOI 사이에 펩티드 절단 모티프가 있어서, saRNA 구조체가 번역된 후 숙주 세포에서 2개의 별개의 단백질이 생산된다;
- 구현예 "3a"에서, IMP는 GOI 종결 코돈의 하류에 삽입된다. 서브게놈성 프로모터는 GOI의 번역을 구동하고, IMP의 발현/번역은 내부 리보솜 도입 부위(IRES)를 포함시킴으로써 구동된다;
- 구현예 "3b"에서, GOI 및 IMP의 순서가 반대이며, IMP의 번역은 서브게놈성 프로모터에 의해, 그리고 GOI는 IRES에 의해 촉진된다;
- 구현예 "4a"에서, IMP는 GOI 종결 코돈의 하류에 삽입된다. GOI의 번역은 제1 서브게놈성 프로모터에 의해 촉진되고, IMP의 번역은 제2 서브게놈성 프로모터를 포함시킴으로써 구동된다;
- 구현예 "4b"에서, IMP 및 GOI의 위치가 바뀌었으며, 즉 IMP가 GOI의 앞에 있다.
mRNA 구조체(도 1의 오른쪽)
도 1을 참조하면, 임의의 IMP가 다음의 디자인 접근법을 이용하여 mRNA 내에 암호화될 수 있다(구현예 "5" 참조):
- 구현예 "6a"에서, mRNA 구조체는 펩티드 절단 모티프(예컨대, F-T2a)를 포함하는 융합 단백질을 암호화하며, GOI 및 IMP는 번역시 별개의 단백질로 절단된다;
- 구현예 "6b"에서, GOI 및 IMP의 순서가 반대이며, IMP는 GOI의 5'에 있다;
- 구현예 "7a"에서, IMP는 GOI 종결 코돈의 하류에 삽입되며, 여기서 번역은 내부 리보솜 도입 부위(IRES)를 포함시킴으로써 구동된다;
- 구현예 "7b"에서, GOI 및 IMP의 순서가 반대이며, 번역은 서브게놈성 프로모터에 의해, 그리고 GOI는 IRES에 의해 촉진된다.
본 발명자들은 도 1에 실증된 RNA 구조체의 다양한 구현예에서 많은 수의 인간 IMP를 테스트하였으며, 이들이 각각 saRNA 및/또는 mRNA에 대한 발현 및 반응을 변형할 잠재력을 갖는 것으로 여긴다.
실시예 2 - 비-바이러스 선천적 조절 단백질(IMP)을 포함하는 saRNA 구조체의 구축 및 테스트
본 발명자들은 일련의 다양한 비-바이러스 IMP를 디자인하고, 구축하고, 이후 테스트하였으며, 발현 연구의 결과는 도 6 내지 도 10에 나타나 있다.
도 6을 참조하면, F-T2A 형상에서 IMP; IRF4(1-129), IRF1 DBD(1-164), IRF3(191-427), IRF7(238-503), STING 베타 및 HSP90(CDC37)(1-232)을 함유하는 VEEV 레플리콘으로 형질감염된 후 HeLa 세포에서 f-Luc 발현에서의 배수 증가를 보여준다. HEK293T/17 및 HeLa 세포는 리포터 단백질로서 루시퍼라아제를 함유하는 saRNA(100 ng)로 형질감염되었고, 24시간 후 단백질 발현에 대해 평가하였다. HeLa 세포는 HEK와 비교하여 보다 온전한 IFN 발현 경로를 갖는 것으로 알려져 있고, 따라서 대조군(리포터 단백질로서 루시퍼라아제를 함유하고 IMP는 없는 saRNA)과 대비하여 증가된 발현(배수 증가)은 상기 IMP가 saRNA 발현을 증가시키는 것을 시사한다. 상기 IMP 중에서, IRF1 DBD(1-164) 및 IRF4(1-129)는 f-Luc 발현에서 가장 큰 증가를 생성하였다. 나타낸 데이터는 HeLa 세포에서의 루시퍼라아제 발현이 HEK293T/17 세포에서의 발현과 대비하여 ∼2배 이상 증가를 제공하는 구조체이며, 3개의 별개의 saRNA의 배치를 이용하여 3번의 독립적 실험에서 수득된 데이터의 평균±SEM이다.
도 7을 참조하면, F-T2A 형상에서 A20(606-790), STAT2(133-315), MFN2(369-598), 징크 핑거 AVP(1-200) 및 TARBP2(1-234)를 함유하는 VEEV 레플리콘으로 형질감염된 후 HeLa 세포에서 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. 실험 방법의 상세 내용은 도 6에 제공된다. 이들 중에서, STAT2(133-315), MFN2(369-598)는 f-Luc 발현에서 가장 큰 증가를 생성하였다. 나타낸 데이터는 HeLa 세포에서의 루시퍼라아제 발현이 HEK293T/17 세포에서의 발현과 대비하여 ∼2배 이상 증가를 제공하는 구조체이며, 3개의 별개의 saRNA의 배치를 이용하여 3번의 독립적 실험에서 수득된 데이터의 평균±SEM이다.
도 8을 참조하면, F-T2A 형상에서 IRF5 A68P, IRF9(142-393), PKR dsRNA BD(1-170) 및 PACT PRKRA DBD(1-194), ARL5B 및 ARL16을 함유하는 VEEV 레플리콘으로 형질감염된 후 HeLa 세포에서 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. 실험 방법의 상세 내용은 도 6에 제공된다. 이들 중에서, IRF9(142-393)는 f-Luc 발현에서 가장 큰 증가를 생성하였다. 나타낸 데이터는 HeLa 세포에서의 루시퍼라아제 발현이 HEK293T/17 세포에서의 발현과 대비하여 ∼2배 이상 증가를 제공하는 구조체이며, 3개의 별개의 saRNA의 배치를 이용하여 3번의 독립적 실험에서 수득된 데이터의 평균±SEM이다.
도 9를 참조하면, 이중 서브게놈성 프로모터(DSGP) 형상에서 IMP, HSP 90 CDC37을 함유하는 VEEV 레플리콘으로 형질감염된 후 HeLa 세포에서의 f-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. 실험 방법의 상세 내용은 도 6에 제공된다. 나타낸 데이터는 HEK293T/17 세포에서의 발현과 대비한 HeLa 세포에서의 루시퍼라아제 발현이며, 3개의 별개의 saRNA의 배치를 이용하여 3번의 독립적 실험에서 수득된 데이터의 평균±SEM이다.
도 10은 IMP가 없는 saRNA와 비교하고 HEK293T/17 세포에서의 발현과 대비하여 F-T2A 형상에서 IRF1 DBD(1-164) 또는 PKR dsRNA BD(1-170)를 함유하는 saRNA로 형질감염된 후 HeLa 세포에서 생성된 VEGF-A 발현에서의 증가를 보여준다. HEK293T/17 및 HeLa 세포는 분비된 리포터 단백질로서 VEGF-A를 함유하는 RNA(100 ng)로 형질감염되었고, 48시간 후 배양 배지에서의 단백질 발현을 ELISA에 의해 평가하였다. HeLa 세포는 HEK와 비교하여 보다 온전한 IFN 발현 경로를 갖는 것으로 알려져 있으며, 따라서 대조군(GOI로서 VEGF-A를 함유하고 IMP는 없는 RNA)과 대비하여 증가된 발현은 상기 IMP가 RNA 발현을 증가시켰음을 시사한다. 데이터는 1회 실험으로부터 유래되며, 3번의 반복 측정의 평균±SEM을 나타낸다.
실시예 3 - 비-바이러스 선천적 조절 단백질(IMP)을 포함하는 RNA 구조체의 구축 및 테스트
본 발명자들은 일련의 다양한 비-바이러스 IMP를 디자인하고, 구축하고, 이후 테스트하였으며, 발현 연구의 결과는 도 11에 나타나 있다.
도 11을 참조하면, F-T2A 형상에서 IMP: IRF1 DBD(1-164), HSP90(CDC37)(1-232), IRF3(191-427), A20(369-775), A20(606-790), STING 베타 및 PKR dsRNA BD(1-170)를 함유하는 RNA로 형질감염된 후 HeLa 세포에서의 n-Luc 발현을 HEK293T/17 세포에서의 발현과 대비하여 보여준다. 실험 방법의 상세 내용은 도 6에 제공된다. 나타낸 데이터는 루시퍼라아제 발현이 ∼2배 이상 증가를 제공하는 구조체이며, 3개의 별개의 RNA의 배치를 이용하여 3번의 독립적 실험에서 수득된 데이터의 평균±SEM이다.
결론
본 발명자들은 본 명세서에 기술된 구조체가 다음을 포함하여 종래 기술에 기술된 것들보다 많은 이점을 나타낸다고 여긴다:
ⅰ) RNA 구조체, 예컨대 mRNA 또는 saRNA 내로 직접 임의의 선천적 조절 단백질을 암호화하는 뉴클레오티드 서열을 삽입하고, IMP 단백질 및 관심있는 유전자에 의해 암호화되는 생물치료제 분자의 이중 단백질 발현을 가능하게 한다;
ⅱ) 하나는 관심있는 유전자(GOI), 즉 치료 생체분자를 암호화하고 하나는 IMP를 암호화하는 2개의 상이한 별개 가닥의 RNA를 운반하는 것과 대조적으로, 단일 가닥만 운반된다;
ⅲ) IMP는 RNA의 선천적 감지를 억제하고, 이에 따라 더 높은 단백질 발현을 가능하게 한다;
ⅳ) RNA 구조체가 saRNA일 때, IMP 발현 자체는 서브게놈성 가닥에서 GOI와 공동 발현됨으로 인해 자가-증폭된다; 및/또는
ⅴ) 종래의 VEEV RNA 레플리콘 구조체와 비교하여 단백질 발현의 규모 및 기간 모두 증가된다.
번호가 매겨진 문단
다음의 문단은 청구항이 아니라 발명의 설명의 일부를 형성한다:
1. (ⅰ) 적어도 하나의 치료 생체분자; 및 (ⅱ) 적어도 하나의 비-바이러스 선천적 조절 단백질(IMP);을 암호화하는 RNA 구조체.
2. 상기 구조체가 mRNA, saRNA 또는 트랜스-레플리콘 시스템, 바람직하게는 saRNA를 포함하는, 문단 1에 따른 RNA 구조체.
3. 상기 saRNA 구조체가 알파바이러스; 피코르나바이러스; 플라비바이러스; 루비바이러스; 페스티바이러스; 헤파시바이러스; 칼리시바이러스 및 코로나바이러스;로 이루어진 속의 군으로부터 선택되는 양성 가닥 RNA 바이러스, 바람직하게는 알파바이러스, 임의적으로는 VEEV를 포함하거나 이로부터 유래되는, 문단 1 또는 문단 2에 따른 RNA 구조체.
4. 상기 IMP가 포유동물 IMP이고, 바람직하게는 인간 IMP인, 임의의 선행한 문단에 따른 RNA 구조체.
5. 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질이 돌연변이된 또는 비-기능적 인터페론 조절 인자(IRF), 또는 그의 우성 음성 형태를 포함하는, 임의의 선행한 문단에 따른 RNA 구조체.
6. 상기 돌연변이된 또는 비-기능적 인터페론 조절 인자, 또는 그의 우성 음성 형태가 IRF1, IRF2, IRF3, IRF4, IRF5, IRF6, IRF7, IRF8, 또는 IRF9 중 어느 하나, 또는 그의 동원체인, 문단 5에 따른 RNA 구조체.
7. 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질이 그 DNA 결합 도메인(DBD) 및/또는 핵 위치화 신호(NLS)가 비-기능적이거나 결실되어서, 이것이 세포질에서 우성 음성 형태가 되게 하는 인터페론 조절 인자(IRF)를 포함하는, 문단 5 또는 문단 6에 따른 RNA 구조체.
8. 상기 돌연변이된 또는 비-기능적 인터페론 조절 인자, 또는 그의 우성 음성 형태가 인터페론 조절 인자(IRF)의 DNA 결합 도메인(DBD) 및/또는 핵 위치화 신호(NLS)를 포함하거나 이로 이루어질 수 있는, 임의의 선행한 문단에 따른 RNA 구조체.
9. 상기 적어도 하나의 IMP가 IRF의 우성 음성 형태이고, IRF1 우성 음성; IRF3 우성 음성; IRF7 우성 음성; 및 IRF9 우성 음성;으로 이루어진 군으로부터 선택되는, 임의의 선행한 문단에 따른 RNA 구조체.
10. 상기 적어도 하나의 IMP가 IRF1; IRF4; IRF5; IRF8; 및 IRF9;로 이루어진 군으로부터 선택되는 IRF, 또는 그의 동원체의 DBD인, 임의의 선행한 문단에 따른 RNA 구조체.
11. 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질이 선천적 신호전달 경로의 돌연변이된 또는 비-기능적 억제제, 또는 그의 우성 음성 형태를 포함하는, 문단 1 내지 문단 4 중 어느 하나에 따른 RNA 구조체.
12. 상기 RNA 구조체에 의해 암호화되는 선천적 조절 단백질이 RNA 인식의 돌연변이된 또는 비-기능적 억제제, 또는 그의 우성 음성 형태를 포함하는, 문단 1 내지 문단 4 중 어느 하나에 따른 RNA 구조체.
13. 상기 적어도 하나의 IMP가 RIG-1, FAF1, SOCS1, SOCS3, USP18, USP21 및 USP27, 또는 그의 동원체로부터 선택될 수 있는, 문단 1 내지 문단 4 중 어느 하나에 따른 RNA 구조체.
14. 상기 적어도 하나의 IMP가 CYLD, LGP2, RIG 스플라이스 변이체, DDX-56, ARL16 및 ARL5B, 또는 그의 동원체로부터 선택될 수 있는, 문단 1 내지 문단 4 중 어느 하나에 따른 RNA 구조체.
15. 상기 치료 생체분자가 치료 단백질을 포함하고, 바람직하게는 상기 단백질 또는 펩티드가 항원, 보다 바람직하게는 바이러스 항원인, 임의의 선행한 문단에 따른 RNA 구조체.
16. 임의의 선행한 문단에 따른 RNA 구조체를 암호화하는 핵산 서열.
17. 문단 16에 따른 핵산 서열을 포함하는 발현 카세트.
18. 문단 17에 따른 발현 카세트를 포함하는 재조합 벡터.
19. 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터, 및 약학적으로 허용가능한 비히클을 포함하는 약학적 조성물.
20. 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체의 제조 방법으로서, 상기 방법은
a) ⅰ) 숙주 세포 내로 문단 18에 따른 벡터를 도입하는 단계; 및
ⅱ) 상기 숙주 세포를 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체의 생산을 야기하는 조건 하에 배양하는 단계; 또는
b) 문단 18에 따른 벡터로부터 RNA 구조체를 전사하는 단계;를 포함하는 방법.
21. 약제로서 또는 치료법에 사용하기 위한 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터 또는 문단 19에 따른 약학적 조성물.
22. 원생동물, 진균, 박테리아 또는 바이러스 감염의 예방, 개선 또는 치료에 사용하기 위한 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터 또는 문단 19에 따른 약학적 조성물.
23. 암의 예방, 개선 또는 치료에 사용하기 위한 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터 또는 문단 19에 따른 약학적 조성물.
24. 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터 또는 문단 19에 따른 약학적 조성물을 포함하는 백신.
25. 대상체에서 면역 반응의 자극에 사용하기 위한 문단 1 내지 문단 15 중 어느 하나에 따른 RNA 구조체, 문단 16에 따른 핵산 서열, 문단 17에 따른 발현 카세트 또는 문단 18에 따른 벡터 또는 문단 19에 따른 약학적 조성물.
<110> IMPERIAL COLLEGE INNOVATIONS LIMITED
VaxEquity Ltd
<120> RNA construct
<130> 2023-FPA-2156
<150> 2020063.0
<151> 2020-12-17
<160> 266
<170> PatentIn version 3.5
<210> 1
<211> 185
<212> PRT
<213> Homo sapiens
<400> 1
Gly Asp Ser Ser Pro Asp Thr Phe Ser Asp Gly Leu Ser Ser Ser Thr
1 5 10 15
Leu Pro Asp Asp His Ser Ser Tyr Thr Val Pro Gly Tyr Met Gln Asp
20 25 30
Leu Glu Val Glu Gln Ala Leu Thr Pro Ala Leu Ser Pro Cys Ala Val
35 40 45
Ser Ser Thr Leu Pro Asp Trp His Ile Pro Val Glu Val Val Pro Asp
50 55 60
Ser Thr Ser Asp Leu Tyr Asn Phe Gln Val Ser Pro Met Pro Ser Thr
65 70 75 80
Ser Glu Ala Thr Thr Asp Glu Asp Glu Glu Gly Lys Leu Pro Glu Asp
85 90 95
Ile Met Lys Leu Leu Glu Gln Ser Glu Trp Gln Pro Thr Asn Val Asp
100 105 110
Gly Lys Gly Tyr Leu Leu Asn Glu Pro Gly Val Gln Pro Thr Ser Val
115 120 125
Tyr Gly Asp Phe Ser Cys Lys Glu Glu Pro Glu Ile Asp Ser Pro Gly
130 135 140
Gly Asp Ile Gly Leu Ser Leu Gln Arg Val Phe Thr Asp Leu Lys Asn
145 150 155 160
Met Asp Ala Thr Trp Leu Asp Ser Leu Leu Thr Pro Val Arg Leu Pro
165 170 175
Ser Ile Gln Ala Ile Pro Cys Ala Pro
180 185
<210> 2
<211> 558
<212> DNA
<213> Homo sapiens
<400> 2
ggggattcca gccctgatac cttctctgat ggactcagca gctccactct gcctgatgac 60
cacagcagct acacagttcc aggctacatg caggacttgg aggtggagca ggccctgact 120
ccagcactgt cgccatgtgc tgtcagcagc actctccccg actggcacat cccagtggaa 180
gttgtgccgg acagcaccag tgatctgtac aacttccagg tgtcacccat gccctccacc 240
tctgaagcta caacagatga ggatgaggaa gggaaattac ctgaggacat catgaagctc 300
ttggagcagt cggagtggca gccaacaaac gtggatggga aggggtacct actcaatgaa 360
cctggagtcc agcccacctc tgtctatgga gactttagct gtaaggagga gccagaaatt 420
gacagcccag ggggggatat tgggctgagt ctacagcgtg tcttcacaga tctgaagaac 480
atggatgcca cctggctgga cagcctgctg accccagtcc ggttgccctc catccaggcc 540
attccctgtg caccgtag 558
<210> 3
<211> 558
<212> RNA
<213> Homo sapiens
<400> 3
ggggauucca gcccugauac cuucucugau ggacucagca gcuccacucu gccugaugac 60
cacagcagcu acacaguucc aggcuacaug caggacuugg agguggagca ggcccugacu 120
ccagcacugu cgccaugugc ugucagcagc acucuccccg acuggcacau cccaguggaa 180
guugugccgg acagcaccag ugaucuguac aacuuccagg ugucacccau gcccuccacc 240
ucugaagcua caacagauga ggaugaggaa gggaaauuac cugaggacau caugaagcuc 300
uuggagcagu cggaguggca gccaacaaac guggauggga agggguaccu acucaaugaa 360
ccuggagucc agcccaccuc ugucuaugga gacuuuagcu guaaggagga gccagaaauu 420
gacagcccag ggggggauau ugggcugagu cuacagcgug ucuucacaga ucugaagaac 480
auggaugcca ccuggcugga cagccugcug accccagucc gguugcccuc cauccaggcc 540
auucccugug caccguag 558
<210> 4
<211> 561
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF1
<400> 4
atgggcgata gcagccccga taccttttcc gatggcctga gcagcagcac cctgcctgat 60
gatcacagca gctacaccgt gcctggctac atgcaggacc tggaagtgga acaggccctg 120
acaccagctc tgagcccttg tgctgtgtcc agcacactgc ccgattggca catccctgtg 180
gaagtggtgc ctgacagcac cagcgacctg tacaacttcc aagtgtcccc tatgcctagc 240
acctccgagg ccaccaccga tgaggatgaa gagggaaagc tgcccgagga catcatgaag 300
ctgctggaac agagcgagtg gcagcccacc aatgtggatg gcaagggcta cctgctgaac 360
gagcctggcg ttcagcctac aagcgtgtac ggcgacttca gctgcaaaga ggaacccgag 420
atcgatagcc ctggcggcga tatcggactg agcctgcaga gagtgttcac cgacctgaag 480
aacatggacg ccacctggct ggacagcctg ctgacacctg ttagactgcc ctctatccag 540
gctatcccct gcgctccttg a 561
<210> 5
<211> 561
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF4
<400> 5
augggcgaua gcagccccga uaccuuuucc gauggccuga gcagcagcac ccugccugau 60
gaucacagca gcuacaccgu gccuggcuac augcaggacc uggaagugga acaggcccug 120
acaccagcuc ugagcccuug ugcugugucc agcacacugc ccgauuggca caucccugug 180
gaaguggugc cugacagcac cagcgaccug uacaacuucc aagugucccc uaugccuagc 240
accuccgagg ccaccaccga ugaggaugaa gagggaaagc ugcccgagga caucaugaag 300
cugcuggaac agagcgagug gcagcccacc aauguggaug gcaagggcua ccugcugaac 360
gagccuggcg uucagccuac aagcguguac ggcgacuuca gcugcaaaga ggaacccgag 420
aucgauagcc cuggcggcga uaucggacug agccugcaga gaguguucac cgaccugaag 480
aacauggacg ccaccuggcu ggacagccug cugacaccug uuagacugcc cucuauccag 540
gcuauccccu gcgcuccuug a 561
<210> 6
<211> 185
<212> PRT
<213> Homo sapiens
<400> 6
Gly Asp Ser Ser Pro Asp Thr Phe Ser Asp Gly Leu Ser Ser Ser Thr
1 5 10 15
Leu Pro Asp Asp His Ser Ser Tyr Thr Val Pro Gly Tyr Met Gln Asp
20 25 30
Leu Glu Val Glu Gln Ala Leu Thr Pro Ala Leu Ser Pro Cys Ala Val
35 40 45
Ser Ser Thr Leu Pro Asp Trp His Ile Pro Val Glu Val Val Pro Asp
50 55 60
Ser Thr Ser Asp Leu Tyr Asn Phe Gln Val Ser Pro Met Pro Ser Thr
65 70 75 80
Ser Glu Ala Thr Thr Asp Glu Asp Glu Glu Gly Lys Leu Pro Glu Asp
85 90 95
Ile Met Lys Leu Leu Glu Gln Ser Glu Trp Gln Pro Thr Asn Val Asp
100 105 110
Gly Lys Gly Tyr Leu Leu Asn Glu Pro Gly Val Gln Pro Thr Ser Val
115 120 125
Tyr Gly Asp Phe Ser Cys Arg Glu Glu Pro Glu Ile Asp Ser Pro Gly
130 135 140
Gly Asp Ile Gly Leu Ser Leu Gln Arg Val Phe Thr Asp Leu Arg Asn
145 150 155 160
Met Asp Ala Thr Trp Leu Asp Ser Leu Leu Thr Pro Val Arg Leu Pro
165 170 175
Ser Ile Gln Ala Ile Pro Cys Ala Pro
180 185
<210> 7
<211> 555
<212> DNA
<213> Homo sapiens
<400> 7
ggggattcca gccctgatac cttctctgat ggactcagca gctccactct gcctgatgac 60
cacagcagct acacagttcc aggctacatg caggacttgg aggtggagca ggccctgact 120
ccagcactgt cgccatgtgc tgtcagcagc actctccccg actggcacat cccagtggaa 180
gttgtgccgg acagcaccag tgatctgtac aacttccagg tgtcacccat gccctccacc 240
tctgaagcta caacagatga ggatgaggaa gggaaattac ctgaggacat catgaagctc 300
ttggagcagt cggagtggca gccaacaaac gtggatggga aggggtacct actcaatgaa 360
cctggagtcc agcccacctc tgtctatgga gactttagct gtcgggagga gccagaaatt 420
gacagcccag ggggggatat tgggctgagt ctacagcgtg tcttcacaga tctgcggaac 480
atggatgcca cctggctgga cagcctgctg accccagtcc ggttgccctc catccaggcc 540
attccctgtg caccg 555
<210> 8
<211> 555
<212> RNA
<213> Homo sapiens
<400> 8
ggggauucca gcccugauac cuucucugau ggacucagca gcuccacucu gccugaugac 60
cacagcagcu acacaguucc aggcuacaug caggacuugg agguggagca ggcccugacu 120
ccagcacugu cgccaugugc ugucagcagc acucuccccg acuggcacau cccaguggaa 180
guugugccgg acagcaccag ugaucuguac aacuuccagg ugucacccau gcccuccacc 240
ucugaagcua caacagauga ggaugaggaa gggaaauuac cugaggacau caugaagcuc 300
uuggagcagu cggaguggca gccaacaaac guggauggga agggguaccu acucaaugaa 360
ccuggagucc agcccaccuc ugucuaugga gacuuuagcu gucgggagga gccagaaauu 420
gacagcccag ggggggauau ugggcugagu cuacagcgug ucuucacaga ucugcggaac 480
auggaugcca ccuggcugga cagccugcug accccagucc gguugcccuc cauccaggcc 540
auucccugug caccg 555
<210> 9
<211> 561
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF1
<400> 9
atgggcgata gcagccccga taccttttcc gatggcctga gcagcagcac cctgcctgat 60
gatcacagca gctacaccgt gcctggctac atgcaggacc tggaagtgga acaggccctg 120
acaccagctc tgagcccttg tgctgtgtcc agcacactgc ccgattggca catccctgtg 180
gaagtggtgc ctgacagcac cagcgacctg tacaacttcc aagtgtcccc tatgcctagc 240
acctccgagg ccaccaccga tgaggatgaa gagggaaagc tgcccgagga catcatgaag 300
ctgctggaac agagcgagtg gcagcccacc aatgtggatg gcaagggcta cctgctgaac 360
gagcctggcg ttcagcctac aagcgtgtac ggcgacttca gctgcagaga ggaacccgag 420
atcgatagcc ctggcggcga tatcggactg agtctgcaga gggtgttcac cgacctgaga 480
aacatggacg ccacctggct ggacagcctg ctgacacctg ttagactgcc ctctatccag 540
gctatcccct gcgctccttg a 561
<210> 10
<211> 561
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF1
<400> 10
augggcgaua gcagccccga uaccuuuucc gauggccuga gcagcagcac ccugccugau 60
gaucacagca gcuacaccgu gccuggcuac augcaggacc uggaagugga acaggcccug 120
acaccagcuc ugagcccuug ugcugugucc agcacacugc ccgauuggca caucccugug 180
gaaguggugc cugacagcac cagcgaccug uacaacuucc aagugucccc uaugccuagc 240
accuccgagg ccaccaccga ugaggaugaa gagggaaagc ugcccgagga caucaugaag 300
cugcuggaac agagcgagug gcagcccacc aauguggaug gcaagggcua ccugcugaac 360
gagccuggcg uucagccuac aagcguguac ggcgacuuca gcugcagaga ggaacccgag 420
aucgauagcc cuggcggcga uaucggacug agucugcaga ggguguucac cgaccugaga 480
aacauggacg ccaccuggcu ggacagccug cugacaccug uuagacugcc cucuauccag 540
gcuauccccu gcgcuccuug a 561
<210> 11
<211> 237
<212> PRT
<213> Homo sapiens
<400> 11
Pro Leu Lys Arg Leu Leu Val Pro Gly Glu Glu Trp Glu Phe Glu Val
1 5 10 15
Thr Ala Phe Tyr Arg Gly Arg Gln Val Phe Gln Gln Thr Ile Ser Cys
20 25 30
Pro Glu Gly Leu Arg Leu Val Gly Ser Glu Val Gly Asp Arg Thr Leu
35 40 45
Pro Gly Trp Pro Val Thr Leu Pro Asp Pro Gly Met Ser Leu Thr Asp
50 55 60
Arg Gly Val Met Ser Tyr Val Arg His Val Leu Ser Cys Leu Gly Gly
65 70 75 80
Gly Leu Ala Leu Trp Arg Ala Gly Gln Trp Leu Trp Ala Gln Arg Leu
85 90 95
Gly His Cys His Thr Tyr Trp Ala Val Ser Glu Glu Leu Leu Pro Asn
100 105 110
Ser Gly His Gly Pro Asp Gly Glu Val Pro Lys Asp Lys Glu Gly Gly
115 120 125
Val Phe Asp Leu Gly Pro Phe Ile Val Asp Leu Ile Thr Phe Thr Glu
130 135 140
Gly Ser Gly Arg Ser Pro Arg Tyr Ala Leu Trp Phe Cys Val Gly Glu
145 150 155 160
Ser Trp Pro Gln Asp Gln Pro Trp Thr Lys Arg Leu Val Met Val Lys
165 170 175
Val Val Pro Thr Cys Leu Arg Ala Leu Val Glu Met Ala Arg Val Gly
180 185 190
Gly Ala Ser Ser Leu Glu Asn Thr Val Asp Leu His Ile Ser Asn Ser
195 200 205
His Pro Leu Ser Leu Thr Ser Asp Gln Tyr Lys Ala Tyr Leu Gln Asp
210 215 220
Leu Val Glu Gly Met Asp Phe Gln Gly Pro Gly Glu Ser
225 230 235
<210> 12
<211> 711
<212> DNA
<213> Homo sapiens
<400> 12
ccactgaagc ggctgttggt gccgggggaa gagtgggagt tcgaggtgac agccttctac 60
cggggccgcc aagtcttcca gcagaccatc tcctgcccgg agggcctgcg gctggtgggg 120
tccgaagtgg gagacaggac gctgcctgga tggccagtca cactgccaga ccctggcatg 180
tccctgacag acaggggagt gatgagctac gtgaggcatg tgctgagctg cctgggtggg 240
ggactggctc tctggcgggc cgggcagtgg ctctgggccc agcggctggg gcactgccac 300
acatactggg cagtgagcga ggagctgctc cccaacagcg ggcatgggcc tgatggcgag 360
gtccccaagg acaaggaagg aggcgtgttt gacctggggc ccttcattgt agatctgatt 420
accttcacgg aaggaagcgg acgctcacca cgctatgccc tctggttctg tgtgggggag 480
tcatggcccc aggaccagcc gtggaccaag aggctcgtga tggtcaaggt tgtgcccacg 540
tgcctcaggg ccttggtaga aatggcccgg gtagggggtg cctcctccct ggagaatact 600
gtggacctgc acatttccaa cagccaccca ctctccctca cctccgacca gtacaaggcc 660
tacctgcagg acttggtgga gggcatggat ttccagggcc ctggggagag c 711
<210> 13
<211> 711
<212> RNA
<213> Homo sapiens
<400> 13
ccacugaagc ggcuguuggu gccgggggaa gagugggagu ucgaggugac agccuucuac 60
cggggccgcc aagucuucca gcagaccauc uccugcccgg agggccugcg gcuggugggg 120
uccgaagugg gagacaggac gcugccugga uggccaguca cacugccaga cccuggcaug 180
ucccugacag acaggggagu gaugagcuac gugaggcaug ugcugagcug ccuggguggg 240
ggacuggcuc ucuggcgggc cgggcagugg cucugggccc agcggcuggg gcacugccac 300
acauacuggg cagugagcga ggagcugcuc cccaacagcg ggcaugggcc ugauggcgag 360
guccccaagg acaaggaagg aggcguguuu gaccuggggc ccuucauugu agaucugauu 420
accuucacgg aaggaagcgg acgcucacca cgcuaugccc ucugguucug ugugggggag 480
ucauggcccc aggaccagcc guggaccaag aggcucguga uggucaaggu ugugcccacg 540
ugccucaggg ccuugguaga aauggcccgg guagggggug ccuccucccu ggagaauacu 600
guggaccugc acauuuccaa cagccaccca cucucccuca ccuccgacca guacaaggcc 660
uaccugcagg acuuggugga gggcauggau uuccagggcc cuggggagag c 711
<210> 14
<211> 717
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF3
<400> 14
atgcccctga agagactgct ggtgcctggc gaggaatggg agtttgaagt gaccgccttc 60
taccggggca gacaggtgtt ccagcagacc atctcttgcc ccgagggact gagactcgtg 120
ggctctgaag tgggcgatag aacactgcct ggctggcctg tgacactgcc agatcctgga 180
atgagcctga ccgacagagg cgtgatgagc tatgtgcggc acgtgctgtc ttgtctcggc 240
ggaggacttg ccctttggag agctggacaa tggctgtggg ctcagagact gggccactgt 300
cacacatact gggccgtgtc tgaggaactg ctgcccaatt ctggccacgg acctgatggc 360
gaggtgccca aagacaaaga aggcggcgtt ttcgatctgg gccccttcat cgtggacctg 420
atcaccttta ccgaaggcag cggcagaagc cccagatacg ccctgtggtt ttgtgtgggc 480
gagagctggc ctcaggatca gccttggacc aagagactgg tcatggtcaa ggtggtgcct 540
acctgcctga gagccctggt ggaaatggct agagttggcg gagccagcag cctggaaaac 600
accgtggatc tgcacatcag caactctcac cctctgtctc tgaccagcga ccagtacaag 660
gcctatctgc aggacctggt cgaaggcatg gactttcaag gccctggcga gtcctga 717
<210> 15
<211> 717
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF3
<400> 15
augccccuga agagacugcu ggugccuggc gaggaauggg aguuugaagu gaccgccuuc 60
uaccggggca gacagguguu ccagcagacc aucucuugcc ccgagggacu gagacucgug 120
ggcucugaag ugggcgauag aacacugccu ggcuggccug ugacacugcc agauccugga 180
augagccuga ccgacagagg cgugaugagc uaugugcggc acgugcuguc uugucucggc 240
ggaggacuug cccuuuggag agcuggacaa uggcuguggg cucagagacu gggccacugu 300
cacacauacu gggccguguc ugaggaacug cugcccaauu cuggccacgg accugauggc 360
gaggugccca aagacaaaga aggcggcguu uucgaucugg gccccuucau cguggaccug 420
aucaccuuua ccgaaggcag cggcagaagc cccagauacg cccugugguu uugugugggc 480
gagagcuggc cucaggauca gccuuggacc aagagacugg ucauggucaa gguggugccu 540
accugccuga gagcccuggu ggaaauggcu agaguuggcg gagccagcag ccuggaaaac 600
accguggauc ugcacaucag caacucucac ccucugucuc ugaccagcga ccaguacaag 660
gccuaucugc aggaccuggu cgaaggcaug gacuuucaag gcccuggcga guccuga 717
<210> 16
<211> 266
<212> PRT
<213> Homo sapiens
<400> 16
Trp Ala Val Glu Thr Thr Pro Ser Pro Gly Pro Gln Pro Ala Ala Leu
1 5 10 15
Thr Thr Gly Glu Ala Ala Ala Pro Glu Ser Pro His Gln Ala Glu Pro
20 25 30
Tyr Leu Ser Pro Ser Pro Ser Ala Cys Thr Ala Val Gln Glu Pro Ser
35 40 45
Pro Gly Ala Leu Asp Val Thr Ile Met Tyr Lys Gly Arg Thr Val Leu
50 55 60
Gln Lys Val Val Gly His Pro Ser Cys Thr Phe Leu Tyr Gly Pro Pro
65 70 75 80
Asp Pro Ala Val Arg Ala Thr Asp Pro Gln Gln Val Ala Phe Pro Ser
85 90 95
Pro Ala Glu Leu Pro Asp Gln Lys Gln Leu Arg Tyr Thr Glu Glu Leu
100 105 110
Leu Arg His Val Ala Pro Gly Leu His Leu Glu Leu Arg Gly Pro Gln
115 120 125
Leu Trp Ala Arg Arg Met Gly Lys Cys Lys Val Tyr Trp Glu Val Gly
130 135 140
Gly Pro Pro Gly Ser Ala Ser Pro Ser Thr Pro Ala Cys Leu Leu Pro
145 150 155 160
Arg Asn Cys Asp Thr Pro Ile Phe Asp Phe Arg Val Phe Phe Gln Glu
165 170 175
Leu Val Glu Phe Arg Ala Arg Gln Arg Arg Gly Ser Pro Arg Tyr Thr
180 185 190
Ile Tyr Leu Gly Phe Gly Gln Asp Leu Ser Ala Gly Arg Pro Lys Glu
195 200 205
Lys Ser Leu Val Leu Val Lys Leu Glu Pro Trp Leu Cys Arg Val His
210 215 220
Leu Glu Gly Thr Gln Arg Glu Gly Val Ser Ser Leu Asp Ser Ser Ser
225 230 235 240
Leu Ser Leu Cys Leu Ser Ser Ala Asn Ser Leu Tyr Asp Asp Ile Glu
245 250 255
Cys Phe Leu Met Glu Leu Glu Gln Pro Ala
260 265
<210> 17
<211> 798
<212> DNA
<213> Homo sapiens
<400> 17
tgggcagtag agacgacccc cagccccggg ccccagcccg cggcactaac gacaggcgag 60
gccgcggccc cagagtcccc gcaccaggca gagccgtacc tgtcaccctc cccaagcgcc 120
tgcaccgcgg tgcaagagcc cagcccaggg gcgctggacg tgaccatcat gtacaagggc 180
cgcacggtgc tgcagaaggt ggtgggacac ccgagctgca cgttcctata cggcccccca 240
gacccagctg tccgggccac agacccccag caggtagcat tccccagccc tgccgagctc 300
ccggaccaga agcagctgcg ctacacggag gaactgctgc ggcacgtggc ccctgggttg 360
cacctggagc ttcgggggcc acagctgtgg gcccggcgca tgggcaagtg caaggtgtac 420
tgggaggtgg gcggaccccc aggctccgcc agcccctcca ccccagcctg cctgctgcct 480
cggaactgtg acacccccat cttcgacttc agagtcttct tccaagagct ggtggaattc 540
cgggcacggc agcgccgtgg ctccccacgc tataccatct acctgggctt cgggcaggac 600
ctgtcagctg ggaggcccaa ggagaagagc ctggtcctgg tgaagctgga accctggctg 660
tgccgagtgc acctagaggg cacgcagcgt gagggtgtgt cttccctgga tagcagcagc 720
ctcagcctct gcctgtccag cgccaacagc ctctatgacg acatcgagtg cttccttatg 780
gagctggagc agcccgcc 798
<210> 18
<211> 798
<212> RNA
<213> Homo sapiens
<400> 18
ugggcaguag agacgacccc cagccccggg ccccagcccg cggcacuaac gacaggcgag 60
gccgcggccc cagagucccc gcaccaggca gagccguacc ugucacccuc cccaagcgcc 120
ugcaccgcgg ugcaagagcc cagcccaggg gcgcuggacg ugaccaucau guacaagggc 180
cgcacggugc ugcagaaggu ggugggacac ccgagcugca cguuccuaua cggcccccca 240
gacccagcug uccgggccac agacccccag cagguagcau uccccagccc ugccgagcuc 300
ccggaccaga agcagcugcg cuacacggag gaacugcugc ggcacguggc cccuggguug 360
caccuggagc uucgggggcc acagcugugg gcccggcgca ugggcaagug caagguguac 420
ugggaggugg gcggaccccc aggcuccgcc agccccucca ccccagccug ccugcugccu 480
cggaacugug acacccccau cuucgacuuc agagucuucu uccaagagcu gguggaauuc 540
cgggcacggc agcgccgugg cuccccacgc uauaccaucu accugggcuu cgggcaggac 600
cugucagcug ggaggcccaa ggagaagagc cugguccugg ugaagcugga acccuggcug 660
ugccgagugc accuagaggg cacgcagcgu gagggugugu cuucccugga uagcagcagc 720
cucagccucu gccuguccag cgccaacagc cucuaugacg acaucgagug cuuccuuaug 780
gagcuggagc agcccgcc 798
<210> 19
<211> 804
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF7
<400> 19
atgtgggccg tcgagacaac accttctcca ggacctcaac ctgccgctct gacaacaggc 60
gaagctgctg ctcctgagtc tccacatcag gccgagcctt acctgtctcc atctcctagc 120
gcctgtaccg ccgtgcaaga accttctcct ggtgctctgg acgtgaccat catgtacaag 180
ggcagaaccg tgctgcagaa agtcgtggga caccccagct gcacctttct gtatggccct 240
ccagatcctg ccgtgcgggc tacagatcct cagcaggttg cattcccatc tccagccgag 300
ctgcccgatc agaagcagct gagatacacc gaggaactgc tgagacacgt ggcccctgga 360
ctgcacctgg aactgagagg accacaactg tgggccagac ggatgggcaa gtgcaaggtg 420
tactgggaag ttggcggccc tcctggatct gcctctccat ctacaccagc ctgcctgctg 480
cctcggaatt gcgacacccc tatcttcgac ttccgggtgt tcttccaaga gctggtggaa 540
ttccgggcca gacagagaag aggcagcccc agatacacca tctacctcgg ctttggccag 600
gacctgtctg ccggacggcc taaagaaaag tccctggtgc tggtcaagct ggaaccctgg 660
ctgtgtagag tgcatctgga aggcacccag agagagggcg tcagcagcct ggatagcagc 720
tctctgagcc tgtgtctgag cagcgccaac agcctgtacg acgatatcga gtgcttcctg 780
atggaactgg aacagcccgc ctga 804
<210> 20
<211> 804
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF7
<400> 20
augugggccg ucgagacaac accuucucca ggaccucaac cugccgcucu gacaacaggc 60
gaagcugcug cuccugaguc uccacaucag gccgagccuu accugucucc aucuccuagc 120
gccuguaccg ccgugcaaga accuucuccu ggugcucugg acgugaccau cauguacaag 180
ggcagaaccg ugcugcagaa agucguggga caccccagcu gcaccuuucu guauggcccu 240
ccagauccug ccgugcgggc uacagauccu cagcagguug cauucccauc uccagccgag 300
cugcccgauc agaagcagcu gagauacacc gaggaacugc ugagacacgu ggccccugga 360
cugcaccugg aacugagagg accacaacug ugggccagac ggaugggcaa gugcaaggug 420
uacugggaag uuggcggccc uccuggaucu gccucuccau cuacaccagc cugccugcug 480
ccucggaauu gcgacacccc uaucuucgac uuccgggugu ucuuccaaga gcugguggaa 540
uuccgggcca gacagagaag aggcagcccc agauacacca ucuaccucgg cuuuggccag 600
gaccugucug ccggacggcc uaaagaaaag ucccuggugc uggucaagcu ggaacccugg 660
cuguguagag ugcaucugga aggcacccag agagagggcg ucagcagccu ggauagcagc 720
ucucugagcc ugugucugag cagcgccaac agccuguacg acgauaucga gugcuuccug 780
auggaacugg aacagcccgc cuga 804
<210> 21
<211> 251
<212> PRT
<213> Homo sapiens
<400> 21
Lys Glu Glu Glu Asp Ala Met Gln Asn Cys Thr Leu Ser Pro Ser Val
1 5 10 15
Leu Gln Asp Ser Leu Asn Asn Glu Glu Glu Gly Ala Ser Gly Gly Ala
20 25 30
Val His Ser Asp Ile Gly Ser Ser Ser Ser Ser Ser Ser Pro Glu Pro
35 40 45
Gln Glu Val Thr Asp Thr Thr Glu Ala Pro Phe Gln Gly Asp Gln Arg
50 55 60
Ser Leu Glu Phe Leu Leu Pro Pro Glu Pro Asp Tyr Ser Leu Leu Leu
65 70 75 80
Thr Phe Ile Tyr Asn Gly Arg Val Val Gly Glu Ala Gln Val Gln Ser
85 90 95
Leu Asp Cys Arg Leu Val Ala Glu Pro Ser Gly Ser Glu Ser Ser Met
100 105 110
Glu Gln Val Leu Phe Pro Lys Pro Gly Pro Leu Glu Pro Thr Gln Arg
115 120 125
Leu Leu Ser Gln Leu Glu Arg Gly Ile Leu Val Ala Ser Asn Pro Arg
130 135 140
Gly Leu Phe Val Gln Arg Leu Cys Pro Ile Pro Ile Ser Trp Asn Ala
145 150 155 160
Pro Gln Ala Pro Pro Gly Pro Gly Pro His Leu Leu Pro Ser Asn Glu
165 170 175
Cys Val Glu Leu Phe Arg Thr Ala Tyr Phe Cys Arg Asp Leu Val Arg
180 185 190
Tyr Phe Gln Gly Leu Gly Pro Pro Pro Lys Phe Gln Val Thr Leu Asn
195 200 205
Phe Trp Glu Glu Ser His Gly Ser Ser His Thr Pro Gln Asn Leu Ile
210 215 220
Thr Val Lys Met Glu Gln Ala Phe Ala Arg Tyr Leu Leu Glu Gln Thr
225 230 235 240
Pro Glu Gln Gln Ala Ala Ile Leu Ser Leu Val
245 250
<210> 22
<211> 753
<212> DNA
<213> Homo sapiens
<400> 22
aaggaggaag aggatgccat gcagaactgc acactcagtc cctctgtgct ccaggactcc 60
ctcaataatg aggaggaggg ggccagtggg ggagcagtcc attcagacat tgggagcagc 120
agcagcagca gcagccctga gccacaggaa gttacagaca caactgaggc cccctttcaa 180
ggggatcaga ggtccctgga gtttctgctt cctccagagc cagactactc actgctgctc 240
accttcatct acaacgggcg cgtggtgggc gaggcccagg tgcaaagcct ggattgccgc 300
cttgtggctg agccctcagg ctctgagagc agcatggagc aggtgctgtt ccccaagcct 360
ggcccactgg agcccacgca gcgcctgctg agccagcttg agaggggcat cctagtggcc 420
agcaaccccc gaggcctctt cgtgcagcgc ctttgcccca tccccatctc ctggaatgca 480
ccccaggctc cacctgggcc aggcccgcat ctgctgccca gcaacgagtg cgtggagctc 540
ttcagaaccg cctacttctg cagagacttg gtcaggtact ttcagggcct gggcccccca 600
ccgaagttcc aggtaacact gaatttctgg gaagagagcc atggctccag ccatactcca 660
cagaatctta tcacagtgaa gatggagcag gcctttgccc gatacttgct ggagcagact 720
ccagagcagc aggcagccat tctgtccctg gtg 753
<210> 23
<211> 753
<212> RNA
<213> Homo sapiens
<400> 23
aaggaggaag aggaugccau gcagaacugc acacucaguc ccucugugcu ccaggacucc 60
cucaauaaug aggaggaggg ggccaguggg ggagcagucc auucagacau ugggagcagc 120
agcagcagca gcagcccuga gccacaggaa guuacagaca caacugaggc ccccuuucaa 180
ggggaucaga ggucccugga guuucugcuu ccuccagagc cagacuacuc acugcugcuc 240
accuucaucu acaacgggcg cguggugggc gaggcccagg ugcaaagccu ggauugccgc 300
cuuguggcug agcccucagg cucugagagc agcauggagc aggugcuguu ccccaagccu 360
ggcccacugg agcccacgca gcgccugcug agccagcuug agaggggcau ccuaguggcc 420
agcaaccccc gaggccucuu cgugcagcgc cuuugcccca uccccaucuc cuggaaugca 480
ccccaggcuc caccugggcc aggcccgcau cugcugccca gcaacgagug cguggagcuc 540
uucagaaccg ccuacuucug cagagacuug gucagguacu uucagggccu gggcccccca 600
ccgaaguucc agguaacacu gaauuucugg gaagagagcc auggcuccag ccauacucca 660
cagaaucuua ucacagugaa gauggagcag gccuuugccc gauacuugcu ggagcagacu 720
ccagagcagc aggcagccau ucugucccug gug 753
<210> 24
<211> 759
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF9
<400> 24
atgaaggaag aagaggacgc catgcagaac tgcacactga gcccaagcgt gctgcaggac 60
agcctgaaca atgaggaaga aggcgcctct ggcggagccg tgcactctga tattggcagc 120
agcagctcta gcagcagccc cgagcctcaa gaagtgaccg atacaacaga ggccccattc 180
cagggcgacc agcggagtct ggaatttctg ctgcctcctg agcctgacta cagcctgctg 240
ctgaccttca tctacaacgg cagagtcgtg ggcgaagccc aggtgcagtc tctggattgc 300
agactggtgg ccgagcctag cggaagcgag tctagtatgg aacaggtgct gttccccaag 360
cctggacctc tggaacccac acagaggctg ctgtctcaac tggaaagggg catcctggtg 420
gccagcaatc ctagaggcct gttcgtgcag agactgtgcc ctattcctat cagctggaac 480
gcccctcagg ctcctcctgg acctggacca catctgctgc ccagcaatga gtgcgtggaa 540
ctgttccgga ccgcctactt ctgcagagat ctcgtgcggt acttccaagg cctgggacct 600
cctccaaagt tccaagtgac cctgaacttc tgggaagaga gccacggcag cagccacaca 660
cctcagaatc tgatcaccgt gaagatggaa caagccttcg ccagatacct gctggaacag 720
acccctgaac agcaggccgc catcctgtct ctggtgtga 759
<210> 25
<211> 759
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF9
<400> 25
augaaggaag aagaggacgc caugcagaac ugcacacuga gcccaagcgu gcugcaggac 60
agccugaaca augaggaaga aggcgccucu ggcggagccg ugcacucuga uauuggcagc 120
agcagcucua gcagcagccc cgagccucaa gaagugaccg auacaacaga ggccccauuc 180
cagggcgacc agcggagucu ggaauuucug cugccuccug agccugacua cagccugcug 240
cugaccuuca ucuacaacgg cagagucgug ggcgaagccc aggugcaguc ucuggauugc 300
agacuggugg ccgagccuag cggaagcgag ucuaguaugg aacaggugcu guuccccaag 360
ccuggaccuc uggaacccac acagaggcug cugucucaac uggaaagggg cauccuggug 420
gccagcaauc cuagaggccu guucgugcag agacugugcc cuauuccuau cagcuggaac 480
gccccucagg cuccuccugg accuggacca caucugcugc ccagcaauga gugcguggaa 540
cuguuccgga ccgccuacuu cugcagagau cucgugcggu acuuccaagg ccugggaccu 600
ccuccaaagu uccaagugac ccugaacuuc ugggaagaga gccacggcag cagccacaca 660
ccucagaauc ugaucaccgu gaagauggaa caagccuucg ccagauaccu gcuggaacag 720
accccugaac agcaggccgc cauccugucu cugguguga 759
<210> 26
<211> 204
<212> PRT
<213> Homo sapiens
<400> 26
Ser Ser Ser Ser Ser Ser Pro Glu Pro Gln Glu Val Thr Asp Thr Thr
1 5 10 15
Glu Ala Pro Phe Gln Gly Asp Gln Arg Ser Leu Glu Phe Leu Leu Pro
20 25 30
Pro Glu Pro Asp Tyr Ser Leu Leu Leu Thr Phe Ile Tyr Asn Gly Arg
35 40 45
Val Val Gly Glu Ala Gln Val Gln Ser Leu Asp Cys Arg Leu Val Ala
50 55 60
Glu Pro Ser Gly Ser Glu Ser Ser Met Glu Gln Val Leu Phe Pro Lys
65 70 75 80
Pro Gly Pro Leu Glu Pro Thr Gln Arg Leu Leu Ser Gln Leu Glu Arg
85 90 95
Gly Ile Leu Val Ala Ser Asn Pro Arg Gly Leu Phe Val Gln Arg Leu
100 105 110
Cys Pro Ile Pro Ile Ser Trp Asn Ala Pro Gln Ala Pro Pro Gly Pro
115 120 125
Gly Pro His Leu Leu Pro Ser Asn Glu Cys Val Glu Leu Phe Arg Thr
130 135 140
Ala Tyr Phe Cys Arg Asp Leu Val Arg Tyr Phe Gln Gly Leu Gly Pro
145 150 155 160
Pro Pro Lys Phe Gln Val Thr Leu Asn Phe Trp Glu Glu Ser His Gly
165 170 175
Ser Ser His Thr Pro Gln Asn Leu Ile Thr Val Lys Met Glu Gln Ala
180 185 190
Phe Ala Arg Tyr Leu Leu Glu Gln Thr Pro Glu Gln
195 200
<210> 27
<211> 612
<212> DNA
<213> Homo sapiens
<400> 27
agcagcagca gcagcagccc tgagccacag gaagttacag acacaactga ggcccccttt 60
caaggggatc agaggtccct ggagtttctg cttcctccag agccagacta ctcactgctg 120
ctcaccttca tctacaacgg gcgcgtggtg ggcgaggccc aggtgcaaag cctggattgc 180
cgccttgtgg ctgagccctc aggctctgag agcagcatgg agcaggtgct gttccccaag 240
cctggcccac tggagcccac gcagcgcctg ctgagccagc ttgagagggg catcctagtg 300
gccagcaacc cccgaggcct cttcgtgcag cgcctttgcc ccatccccat ctcctggaat 360
gcaccccagg ctccacctgg gccaggcccg catctgctgc ccagcaacga gtgcgtggag 420
ctcttcagaa ccgcctactt ctgcagagac ttggtcaggt actttcaggg cctgggcccc 480
ccaccgaagt tccaggtaac actgaatttc tgggaagaga gccatggctc cagccatact 540
ccacagaatc ttatcacagt gaagatggag caggcctttg cccgatactt gctggagcag 600
actccagagc ag 612
<210> 28
<211> 612
<212> RNA
<213> Homo sapiens
<400> 28
agcagcagca gcagcagccc ugagccacag gaaguuacag acacaacuga ggcccccuuu 60
caaggggauc agaggucccu ggaguuucug cuuccuccag agccagacua cucacugcug 120
cucaccuuca ucuacaacgg gcgcguggug ggcgaggccc aggugcaaag ccuggauugc 180
cgccuugugg cugagcccuc aggcucugag agcagcaugg agcaggugcu guuccccaag 240
ccuggcccac uggagcccac gcagcgccug cugagccagc uugagagggg cauccuagug 300
gccagcaacc cccgaggccu cuucgugcag cgccuuugcc ccauccccau cuccuggaau 360
gcaccccagg cuccaccugg gccaggcccg caucugcugc ccagcaacga gugcguggag 420
cucuucagaa ccgccuacuu cugcagagac uuggucaggu acuuucaggg ccugggcccc 480
ccaccgaagu uccagguaac acugaauuuc ugggaagaga gccauggcuc cagccauacu 540
ccacagaauc uuaucacagu gaagauggag caggccuuug cccgauacuu gcuggagcag 600
acuccagagc ag 612
<210> 29
<211> 618
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF9
<400> 29
atgagcagct ctagcagcag ccccgagcct caagaagtga ccgatacaac agaggcccca 60
ttccagggcg accagcggag tctggaattt ctgctgcctc ctgagcctga ctacagcctg 120
ctgctgacct tcatctacaa cggcagagtc gtgggcgaag cccaggtgca gtctctggat 180
tgcagactgg tggccgagcc tagcggaagc gagtctagta tggaacaggt gctgttcccc 240
aagcctggac ctctggaacc cacacagagg ctgctgtctc aactggaaag gggcatcctg 300
gtggccagca atcctagagg cctgttcgtg cagagactgt gccctattcc tatcagctgg 360
aacgcccctc aggctcctcc tggacctgga ccacatctgc tgcccagcaa tgagtgcgtg 420
gaactgttcc ggaccgccta cttctgcaga gatctcgtgc ggtacttcca aggcctggga 480
cctcctccaa agttccaagt gaccctgaac ttctgggaag agagccacgg cagcagccac 540
acacctcaga atctgatcac cgtgaagatg gaacaagcct tcgccagata cctgctggaa 600
cagacccctg aacagtga 618
<210> 30
<211> 618
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF9
<400> 30
augagcagcu cuagcagcag ccccgagccu caagaaguga ccgauacaac agaggcccca 60
uuccagggcg accagcggag ucuggaauuu cugcugccuc cugagccuga cuacagccug 120
cugcugaccu ucaucuacaa cggcagaguc gugggcgaag cccaggugca gucucuggau 180
ugcagacugg uggccgagcc uagcggaagc gagucuagua uggaacaggu gcuguucccc 240
aagccuggac cucuggaacc cacacagagg cugcugucuc aacuggaaag gggcauccug 300
guggccagca auccuagagg ccuguucgug cagagacugu gcccuauucc uaucagcugg 360
aacgccccuc aggcuccucc uggaccugga ccacaucugc ugcccagcaa ugagugcgug 420
gaacuguucc ggaccgccua cuucugcaga gaucucgugc gguacuucca aggccuggga 480
ccuccuccaa aguuccaagu gacccugaac uucugggaag agagccacgg cagcagccac 540
acaccucaga aucugaucac cgugaagaug gaacaagccu ucgccagaua ccugcuggaa 600
cagaccccug aacaguga 618
<210> 31
<211> 109
<212> PRT
<213> Homo sapiens
<400> 31
Pro Phe Gln Gly Asp Gln Arg Ser Leu Glu Phe Leu Leu Pro Pro Glu
1 5 10 15
Pro Asp Tyr Ser Leu Leu Leu Thr Phe Ile Tyr Asn Gly Arg Val Val
20 25 30
Gly Glu Ala Gln Val Gln Ser Leu Asp Cys Arg Leu Val Ala Glu Pro
35 40 45
Ser Gly Ser Glu Ser Ser Met Glu Gln Val Leu Phe Pro Lys Pro Gly
50 55 60
Pro Leu Glu Pro Thr Gln Arg Leu Leu Ser Gln Leu Glu Arg Gly Ile
65 70 75 80
Leu Val Ala Ser Asn Pro Arg Gly Leu Phe Val Gln Arg Leu Cys Pro
85 90 95
Ile Pro Ile Ser Trp Asn Ala Pro Gln Ala Pro Pro Gly
100 105
<210> 32
<211> 327
<212> DNA
<213> Homo sapiens
<400> 32
ccctttcaag gggatcagag gtccctggag tttctgcttc ctccagagcc agactactca 60
ctgctgctca ccttcatcta caacgggcgc gtggtgggcg aggcccaggt gcaaagcctg 120
gattgccgcc ttgtggctga gccctcaggc tctgagagca gcatggagca ggtgctgttc 180
cccaagcctg gcccactgga gcccacgcag cgcctgctga gccagcttga gaggggcatc 240
ctagtggcca gcaacccccg aggcctcttc gtgcagcgcc tttgccccat ccccatctcc 300
tggaatgcac cccaggctcc acctggg 327
<210> 33
<211> 327
<212> RNA
<213> Homo sapiens
<400> 33
cccuuucaag gggaucagag gucccuggag uuucugcuuc cuccagagcc agacuacuca 60
cugcugcuca ccuucaucua caacgggcgc guggugggcg aggcccaggu gcaaagccug 120
gauugccgcc uuguggcuga gcccucaggc ucugagagca gcauggagca ggugcuguuc 180
cccaagccug gcccacugga gcccacgcag cgccugcuga gccagcuuga gaggggcauc 240
cuaguggcca gcaacccccg aggccucuuc gugcagcgcc uuugccccau ccccaucucc 300
uggaaugcac cccaggcucc accuggg 327
<210> 34
<211> 333
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF9
<400> 34
atgccattcc agggcgacca gcggagtctg gaatttctgc tgcctcctga gcctgactac 60
agcctgctgc tgaccttcat ctacaacggc agagtcgtgg gcgaagccca ggtgcagtct 120
ctggattgca gactggtggc cgagcctagc ggaagcgagt ctagtatgga acaggtgctg 180
ttccccaagc ctggacctct ggaacccaca cagaggctgc tgtctcaact ggaaaggggc 240
atcctggtgg ccagcaatcc tagaggcctg ttcgtgcaga gactgtgccc tattcctatc 300
agctggaacg cccctcaggc tcctcctgga tga 333
<210> 35
<211> 333
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF9
<400> 35
augccauucc agggcgacca gcggagucug gaauuucugc ugccuccuga gccugacuac 60
agccugcugc ugaccuucau cuacaacggc agagucgugg gcgaagccca ggugcagucu 120
cuggauugca gacugguggc cgagccuagc ggaagcgagu cuaguaugga acaggugcug 180
uuccccaagc cuggaccucu ggaacccaca cagaggcugc ugucucaacu ggaaaggggc 240
auccuggugg ccagcaaucc uagaggccug uucgugcaga gacugugccc uauuccuauc 300
agcuggaacg ccccucaggc uccuccugga uga 333
<210> 36
<211> 164
<212> PRT
<213> Homo sapiens
<400> 36
Met Pro Ile Thr Arg Met Arg Met Arg Pro Trp Leu Glu Met Gln Ile
1 5 10 15
Asn Ser Asn Gln Ile Pro Gly Leu Ile Trp Ile Asn Lys Glu Glu Met
20 25 30
Ile Phe Gln Ile Pro Trp Lys His Ala Ala Lys His Gly Trp Asp Ile
35 40 45
Asn Lys Asp Ala Cys Leu Phe Arg Ser Trp Ala Ile His Thr Gly Arg
50 55 60
Tyr Lys Ala Gly Glu Lys Glu Pro Asp Pro Lys Thr Trp Lys Ala Asn
65 70 75 80
Phe Arg Cys Ala Met Asn Ser Leu Pro Asp Ile Glu Glu Val Lys Asp
85 90 95
Gln Ser Arg Asn Lys Gly Ser Ser Ala Val Arg Val Tyr Arg Met Leu
100 105 110
Pro Pro Leu Thr Lys Asn Gln Arg Lys Glu Arg Lys Ser Lys Ser Ser
115 120 125
Arg Asp Ala Lys Ser Lys Ala Lys Arg Lys Ser Cys Gly Asp Ser Ser
130 135 140
Pro Asp Thr Phe Ser Asp Gly Leu Ser Ser Ser Thr Leu Pro Asp Asp
145 150 155 160
His Ser Ser Tyr
<210> 37
<211> 492
<212> DNA
<213> Homo sapiens
<400> 37
atgcccatca ctcggatgcg catgagaccc tggctagaga tgcagattaa ttccaaccaa 60
atcccggggc tcatctggat taataaagag gagatgatct tccagatccc atggaagcat 120
gctgccaagc atggctggga catcaacaag gatgcctgtt tgttccggag ctgggccatt 180
cacacaggcc gatacaaagc aggggaaaag gagccagatc ccaagacgtg gaaggccaac 240
tttcgctgtg ccatgaactc cctgccagat atcgaggagg tgaaagacca gagcaggaac 300
aagggcagct cagctgtgcg agtgtaccgg atgcttccac ctctcaccaa gaaccagaga 360
aaagaaagaa agtcgaagtc cagccgagat gctaagagca aggccaagag gaagtcatgt 420
ggggattcca gccctgatac cttctctgat ggactcagca gctccactct gcctgatgac 480
cacagcagct ac 492
<210> 38
<211> 492
<212> RNA
<213> Homo sapiens
<400> 38
augcccauca cucggaugcg caugagaccc uggcuagaga ugcagauuaa uuccaaccaa 60
aucccggggc ucaucuggau uaauaaagag gagaugaucu uccagauccc auggaagcau 120
gcugccaagc auggcuggga caucaacaag gaugccuguu uguuccggag cugggccauu 180
cacacaggcc gauacaaagc aggggaaaag gagccagauc ccaagacgug gaaggccaac 240
uuucgcugug ccaugaacuc ccugccagau aucgaggagg ugaaagacca gagcaggaac 300
aagggcagcu cagcugugcg aguguaccgg augcuuccac cucucaccaa gaaccagaga 360
aaagaaagaa agucgaaguc cagccgagau gcuaagagca aggccaagag gaagucaugu 420
ggggauucca gcccugauac cuucucugau ggacucagca gcuccacucu gccugaugac 480
cacagcagcu ac 492
<210> 39
<211> 495
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF1
<400> 39
atgcccatca ccagaatgag aatgcggccc tggctggaaa tgcagatcaa cagcaatcag 60
atccccggcc tgatctggat caacaaagaa gagatgatct ttcagatccc gtggaagcac 120
gccgccaagc acggatggga catcaacaag gacgcctgcc tgttcagaag ctgggccatc 180
cacaccggca gatacaaggc cggcgagaaa gagcccgatc ctaagacctg gaaggccaac 240
ttcagatgcg ccatgaacag cctgcctgac atcgaggaag tgaaggacca gagccggaac 300
aagggatctt ctgccgtgcg ggtgtaccgg atgttgcctc ctctgaccaa gaaccagcgc 360
aaagagcgga agtccaagag cagcagagat gccaagagca aggccaagag aaagtcctgc 420
ggcgacagca gccctgacac cttttctgat ggcctgagca gcagcaccct gccagatgat 480
cacagcagct actga 495
<210> 40
<211> 495
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF1
<400> 40
augcccauca ccagaaugag aaugcggccc uggcuggaaa ugcagaucaa cagcaaucag 60
auccccggcc ugaucuggau caacaaagaa gagaugaucu uucagauccc guggaagcac 120
gccgccaagc acggauggga caucaacaag gacgccugcc uguucagaag cugggccauc 180
cacaccggca gauacaaggc cggcgagaaa gagcccgauc cuaagaccug gaaggccaac 240
uucagaugcg ccaugaacag ccugccugac aucgaggaag ugaaggacca gagccggaac 300
aagggaucuu cugccgugcg gguguaccgg auguugccuc cucugaccaa gaaccagcgc 360
aaagagcgga aguccaagag cagcagagau gccaagagca aggccaagag aaaguccugc 420
ggcgacagca gcccugacac cuuuucugau ggccugagca gcagcacccu gccagaugau 480
cacagcagcu acuga 495
<210> 41
<211> 109
<212> PRT
<213> Homo sapiens
<400> 41
Asn Gly Lys Leu Arg Gln Trp Leu Ile Asp Gln Ile Asp Ser Gly Lys
1 5 10 15
Tyr Pro Gly Leu Val Trp Glu Asn Glu Glu Lys Ser Ile Phe Arg Ile
20 25 30
Pro Trp Lys His Ala Gly Lys Gln Asp Tyr Asn Arg Glu Glu Asp Ala
35 40 45
Ala Leu Phe Lys Ala Trp Ala Leu Phe Lys Gly Lys Phe Arg Glu Gly
50 55 60
Ile Asp Lys Pro Asp Pro Pro Thr Trp Lys Thr Arg Leu Arg Cys Ala
65 70 75 80
Leu Asn Lys Ser Asn Asp Phe Glu Glu Leu Val Glu Arg Ser Gln Leu
85 90 95
Asp Ile Ser Asp Pro Tyr Lys Val Tyr Arg Ile Val Pro
100 105
<210> 42
<211> 327
<212> DNA
<213> Homo sapiens
<400> 42
aacgggaagc tccgccagtg gctgatcgac cagatcgaca gcggcaagta ccccgggctg 60
gtgtgggaga acgaggagaa gagcatcttc cgcatcccct ggaagcacgc gggcaagcag 120
gactacaacc gcgaggagga cgccgcgctc ttcaaggctt gggcactgtt taaaggaaag 180
ttccgagaag gcatcgacaa gccggaccct cccacctgga agacgcgcct gcggtgcgct 240
ttgaacaaga gcaatgactt tgaggaactg gttgagcgga gccagctgga catctcagac 300
ccgtacaaag tgtacaggat tgttcct 327
<210> 43
<211> 327
<212> RNA
<213> Homo sapiens
<400> 43
aacgggaagc uccgccagug gcugaucgac cagaucgaca gcggcaagua ccccgggcug 60
gugugggaga acgaggagaa gagcaucuuc cgcauccccu ggaagcacgc gggcaagcag 120
gacuacaacc gcgaggagga cgccgcgcuc uucaaggcuu gggcacuguu uaaaggaaag 180
uuccgagaag gcaucgacaa gccggacccu cccaccugga agacgcgccu gcggugcgcu 240
uugaacaaga gcaaugacuu ugaggaacug guugagcgga gccagcugga caucucagac 300
ccguacaaag uguacaggau uguuccu 327
<210> 44
<211> 333
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF4
<400> 44
atgaacggca agctgcggca gtggctgatc gaccagatcg acagcggcaa gtatcctggc 60
ctcgtgtggg agaacgagga aaagtctatc ttcagaatcc cctggaagca cgccggcaag 120
caggactaca acagagaaga ggacgccgct ctgttcaagg cctgggctct gtttaagggc 180
aagttcagag agggcatcga caagcccgat cctccaacct ggaaaaccag actgagatgc 240
gccctgaaca agagcaacga cttcgaggaa ctggtggaaa gaagccagct ggacatcagc 300
gacccctaca aggtgtaccg gatcgtgcct tga 333
<210> 45
<211> 333
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF4
<400> 45
augaacggca agcugcggca guggcugauc gaccagaucg acagcggcaa guauccuggc 60
cucguguggg agaacgagga aaagucuauc uucagaaucc ccuggaagca cgccggcaag 120
caggacuaca acagagaaga ggacgccgcu cuguucaagg ccugggcucu guuuaagggc 180
aaguucagag agggcaucga caagcccgau ccuccaaccu ggaaaaccag acugagaugc 240
gcccugaaca agagcaacga cuucgaggaa cugguggaaa gaagccagcu ggacaucagc 300
gaccccuaca agguguaccg gaucgugccu uga 333
<210> 46
<211> 140
<212> PRT
<213> Homo sapiens
<400> 46
Met Asn Gln Ser Ile Pro Val Ala Pro Thr Pro Pro Arg Arg Val Arg
1 5 10 15
Leu Lys Pro Trp Leu Val Ala Gln Val Asn Ser Cys Gln Tyr Pro Gly
20 25 30
Leu Gln Trp Val Asn Gly Glu Lys Lys Leu Phe Cys Ile Pro Trp Arg
35 40 45
His Ala Thr Arg His Gly Pro Ser Gln Asp Gly Asp Asn Thr Ile Phe
50 55 60
Lys Ala Trp Ala Lys Glu Thr Gly Lys Tyr Thr Glu Gly Val Asp Glu
65 70 75 80
Ala Asp Pro Ala Lys Trp Lys Ala Asn Leu Arg Cys Ala Leu Asn Lys
85 90 95
Ser Arg Asp Phe Arg Leu Ile Tyr Asp Gly Pro Arg Asp Met Pro Pro
100 105 110
Gln Pro Tyr Lys Ile Tyr Glu Val Cys Ser Asn Gly Pro Ala Pro Thr
115 120 125
Asp Ser Gln Pro Pro Glu Asp Tyr Ser Phe Gly Ala
130 135 140
<210> 47
<211> 420
<212> DNA
<213> Homo sapiens
<400> 47
atgaaccagt ccatcccagt ggctcccacc ccaccccgcc gcgtgcggct gaagccctgg 60
ctggtggccc aggtgaacag ctgccagtac ccagggcttc aatgggtcaa cggggaaaag 120
aaattattct gcatcccctg gaggcatgcc acaaggcatg gtcccagcca ggacggagat 180
aacaccatct tcaaggcctg ggccaaggag acagggaaat acaccgaagg cgtggatgaa 240
gccgatccgg ccaagtggaa ggccaacctg cgctgtgccc ttaacaagag ccgggacttc 300
cgcctcatct acgacgggcc ccgggacatg ccacctcagc cctacaagat ctacgaggtc 360
tgctccaatg gccctgctcc cacagactcc cagccccctg aggattactc ttttggtgca 420
420
<210> 48
<211> 420
<212> RNA
<213> Homo sapiens
<400> 48
augaaccagu ccaucccagu ggcucccacc ccaccccgcc gcgugcggcu gaagcccugg 60
cugguggccc aggugaacag cugccaguac ccagggcuuc aaugggucaa cggggaaaag 120
aaauuauucu gcauccccug gaggcaugcc acaaggcaug gucccagcca ggacggagau 180
aacaccaucu ucaaggccug ggccaaggag acagggaaau acaccgaagg cguggaugaa 240
gccgauccgg ccaaguggaa ggccaaccug cgcugugccc uuaacaagag ccgggacuuc 300
cgccucaucu acgacgggcc ccgggacaug ccaccucagc ccuacaagau cuacgagguc 360
ugcuccaaug gcccugcucc cacagacucc cagcccccug aggauuacuc uuuuggugca 420
420
<210> 49
<211> 423
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF5
<400> 49
atgaaccaga gcatccccgt ggctcccaca cctcctagaa gagtgcgact gaagccttgg 60
ctggtggccc aagtgaacag ctgtcagtat cctggcctgc agtgggtcaa cggcgagaag 120
aagctgttct gcatcccttg gagacacgcc accagacacg gcccttctca ggacggcgac 180
aacaccatct ttaaggcctg ggccaaagag acaggcaagt acaccgaagg cgtggacgaa 240
gccgatcctg ccaagtggaa ggccaatctg agatgcgccc tgaacaagag ccgggacttc 300
cggctgatct acgacggccc tagagacatg cctcctcagc cttacaagat ctacgaagtg 360
tgcagcaacg gccctgctcc taccgattct cagcctcctg aggactacag cttcggcgct 420
tga 423
<210> 50
<211> 423
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF5
<400> 50
augaaccaga gcauccccgu ggcucccaca ccuccuagaa gagugcgacu gaagccuugg 60
cugguggccc aagugaacag cugucaguau ccuggccugc agugggucaa cggcgagaag 120
aagcuguucu gcaucccuug gagacacgcc accagacacg gcccuucuca ggacggcgac 180
aacaccaucu uuaaggccug ggccaaagag acaggcaagu acaccgaagg cguggacgaa 240
gccgauccug ccaaguggaa ggccaaucug agaugcgccc ugaacaagag ccgggacuuc 300
cggcugaucu acgacggccc uagagacaug ccuccucagc cuuacaagau cuacgaagug 360
ugcagcaacg gcccugcucc uaccgauucu cagccuccug aggacuacag cuucggcgcu 420
uga 423
<210> 51
<211> 140
<212> PRT
<213> Homo sapiens
<400> 51
Met Asn Gln Ser Ile Pro Val Ala Pro Thr Pro Pro Arg Arg Val Arg
1 5 10 15
Leu Lys Pro Trp Leu Val Ala Gln Val Asn Ser Cys Gln Tyr Pro Gly
20 25 30
Leu Gln Trp Val Asn Gly Glu Lys Lys Leu Phe Cys Ile Pro Trp Arg
35 40 45
His Ala Thr Arg His Gly Pro Ser Gln Asp Gly Asp Asn Thr Ile Phe
50 55 60
Lys Ala Trp Pro Lys Glu Thr Gly Lys Tyr Thr Glu Gly Val Asp Glu
65 70 75 80
Ala Asp Pro Ala Lys Trp Lys Ala Asn Leu Arg Cys Ala Leu Asn Lys
85 90 95
Ser Arg Asp Phe Arg Leu Ile Tyr Asp Gly Pro Arg Asp Met Pro Pro
100 105 110
Gln Pro Tyr Lys Ile Tyr Glu Val Cys Ser Asn Gly Pro Ala Pro Thr
115 120 125
Asp Ser Gln Pro Pro Glu Asp Tyr Ser Phe Gly Ala
130 135 140
<210> 52
<211> 420
<212> DNA
<213> Homo sapiens
<400> 52
atgaaccagt ccatcccagt ggctcccacc ccaccccgcc gcgtgcggct gaagccctgg 60
ctggtggccc aggtgaacag ctgccagtac ccagggcttc aatgggtcaa cggggaaaag 120
aaattattct gcatcccctg gaggcatgcc acaaggcatg gtcccagcca ggacggagat 180
aacaccatct tcaaggcctg gcccaaggag acagggaaat acaccgaagg cgtggatgaa 240
gccgatccgg ccaagtggaa ggccaacctg cgctgtgccc ttaacaagag ccgggacttc 300
cgcctcatct acgacgggcc ccgggacatg ccacctcagc cctacaagat ctacgaggtc 360
tgctccaatg gccctgctcc cacagactcc cagccccctg aggattactc ttttggtgca 420
420
<210> 53
<211> 420
<212> RNA
<213> Homo sapiens
<400> 53
augaaccagu ccaucccagu ggcucccacc ccaccccgcc gcgugcggcu gaagcccugg 60
cugguggccc aggugaacag cugccaguac ccagggcuuc aaugggucaa cggggaaaag 120
aaauuauucu gcauccccug gaggcaugcc acaaggcaug gucccagcca ggacggagau 180
aacaccaucu ucaaggccug gcccaaggag acagggaaau acaccgaagg cguggaugaa 240
gccgauccgg ccaaguggaa ggccaaccug cgcugugccc uuaacaagag ccgggacuuc 300
cgccucaucu acgacgggcc ccgggacaug ccaccucagc ccuacaagau cuacgagguc 360
ugcuccaaug gcccugcucc cacagacucc cagcccccug aggauuacuc uuuuggugca 420
420
<210> 54
<211> 423
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA enoding IRF5
<400> 54
atgaaccaga gcatccccgt ggctcccaca cctcctagaa gagtgcgact gaagccttgg 60
ctggtggccc aagtgaacag ctgtcagtat cctggcctgc agtgggtcaa cggcgagaag 120
aagctgttct gcatcccttg gagacacgcc accagacacg gcccttctca ggacggcgac 180
aacaccatct ttaaggcctg gcccaaagag acaggcaagt acaccgaagg cgtggacgaa 240
gccgatcctg ccaagtggaa ggccaatctg agatgcgccc tgaacaagag ccgggacttc 300
cggctgatct acgacggccc tagagacatg cctcctcagc cttacaagat ctacgaagtg 360
tgcagcaacg gccctgctcc taccgattct cagcctcctg aggactacag cttcggcgct 420
tga 423
<210> 55
<211> 423
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF5
<400> 55
augaaccaga gcauccccgu ggcucccaca ccuccuagaa gagugcgacu gaagccuugg 60
cugguggccc aagugaacag cugucaguau ccuggccugc agugggucaa cggcgagaag 120
aagcuguucu gcaucccuug gagacacgcc accagacacg gcccuucuca ggacggcgac 180
aacaccaucu uuaaggccug gcccaaagag acaggcaagu acaccgaagg cguggacgaa 240
gccgauccug ccaaguggaa ggccaaucug agaugcgccc ugaacaagag ccgggacuuc 300
cggcugaucu acgacggccc uagagacaug ccuccucagc cuuacaagau cuacgaagug 360
ugcagcaacg gcccugcucc uaccgauucu cagccuccug aggacuacag cuucggcgcu 420
uga 423
<210> 56
<211> 140
<212> PRT
<213> Homo sapiens
<400> 56
Met Cys Asp Arg Asn Gly Gly Arg Arg Leu Arg Gln Trp Leu Ile Glu
1 5 10 15
Gln Ile Asp Ser Ser Met Tyr Pro Gly Leu Ile Trp Glu Asn Glu Glu
20 25 30
Lys Ser Met Phe Arg Ile Pro Trp Lys His Ala Gly Lys Gln Asp Tyr
35 40 45
Asn Gln Glu Val Asp Ala Ser Ile Phe Lys Ala Trp Ala Val Phe Lys
50 55 60
Gly Lys Phe Lys Glu Gly Asp Lys Ala Glu Pro Ala Thr Trp Lys Thr
65 70 75 80
Arg Leu Arg Cys Ala Leu Asn Lys Ser Pro Asp Phe Glu Glu Val Thr
85 90 95
Asp Arg Ser Gln Leu Asp Ile Ser Glu Pro Tyr Lys Val Tyr Arg Ile
100 105 110
Val Pro Glu Glu Glu Gln Lys Cys Lys Leu Gly Val Ala Thr Ala Gly
115 120 125
Cys Val Asn Glu Val Thr Glu Met Glu Cys Gly Arg
130 135 140
<210> 57
<211> 420
<212> DNA
<213> Homo sapiens
<400> 57
atgtgtgacc ggaatggtgg tcggcggctt cgacagtggc tgatcgagca gattgacagt 60
agcatgtatc caggactgat ttgggagaat gaggagaaga gcatgttccg gatcccttgg 120
aaacacgctg gcaagcaaga ttataatcag gaagtggatg cctccatttt taaggcctgg 180
gcagttttta aagggaagtt taaagaaggg gacaaagctg aaccagccac ttggaagacg 240
aggttacgct gtgctttgaa taagagccca gattttgagg aagtgacgga ccggtcccaa 300
ctggacattt ccgagccata caaagtttac cgaattgttc ctgaggaaga gcaaaaatgc 360
aaactaggcg tggcaactgc tggctgcgtg aatgaagtta cagagatgga gtgcggtcgc 420
420
<210> 58
<211> 420
<212> RNA
<213> Homo sapiens
<400> 58
augugugacc ggaauggugg ucggcggcuu cgacaguggc ugaucgagca gauugacagu 60
agcauguauc caggacugau uugggagaau gaggagaaga gcauguuccg gaucccuugg 120
aaacacgcug gcaagcaaga uuauaaucag gaaguggaug ccuccauuuu uaaggccugg 180
gcaguuuuua aagggaaguu uaaagaaggg gacaaagcug aaccagccac uuggaagacg 240
agguuacgcu gugcuuugaa uaagagccca gauuuugagg aagugacgga ccggucccaa 300
cuggacauuu ccgagccaua caaaguuuac cgaauuguuc cugaggaaga gcaaaaaugc 360
aaacuaggcg uggcaacugc uggcugcgug aaugaaguua cagagaugga gugcggucgc 420
420
<210> 59
<211> 423
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF8
<400> 59
atgtgcgaca gaaatggcgg cagacggctg agacagtggc tgatcgagca gatcgacagc 60
agcatgtacc ccggcctgat ctgggagaac gaagagaagt ctatgttcag gatcccctgg 120
aagcacgccg gcaagcagga ctacaatcaa gaggtggacg ccagcatctt caaggcctgg 180
gccgtgttca agggcaagtt caaagagggc gacaaggccg agcctgccac ctggaaaacc 240
agactgagat gcgccctgaa caagagcccc gacttcgagg aagtgaccga cagaagccag 300
ctggacatca gcgagcccta caaggtgtac cggatcgtgc ccgaagagga acagaaatgc 360
aagctgggcg ttgccaccgc cggctgtgtg aatgaagtga cagagatgga atgcggccgg 420
tga 423
<210> 60
<211> 423
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF8
<400> 60
augugcgaca gaaauggcgg cagacggcug agacaguggc ugaucgagca gaucgacagc 60
agcauguacc ccggccugau cugggagaac gaagagaagu cuauguucag gauccccugg 120
aagcacgccg gcaagcagga cuacaaucaa gagguggacg ccagcaucuu caaggccugg 180
gccguguuca agggcaaguu caaagagggc gacaaggccg agccugccac cuggaaaacc 240
agacugagau gcgcccugaa caagagcccc gacuucgagg aagugaccga cagaagccag 300
cuggacauca gcgagcccua caagguguac cggaucgugc ccgaagagga acagaaaugc 360
aagcugggcg uugccaccgc cggcugugug aaugaaguga cagagaugga augcggccgg 420
uga 423
<210> 61
<211> 120
<212> PRT
<213> Homo sapiens
<400> 61
Met Ala Ser Gly Arg Ala Arg Cys Thr Arg Lys Leu Arg Asn Trp Val
1 5 10 15
Val Glu Gln Val Glu Ser Gly Gln Phe Pro Gly Val Cys Trp Asp Asp
20 25 30
Thr Ala Lys Thr Met Phe Arg Ile Pro Trp Lys His Ala Gly Lys Gln
35 40 45
Asp Phe Arg Glu Asp Gln Asp Ala Ala Phe Phe Lys Ala Trp Ala Ile
50 55 60
Phe Lys Gly Lys Tyr Lys Glu Gly Asp Thr Gly Gly Pro Ala Val Trp
65 70 75 80
Lys Thr Arg Leu Arg Cys Ala Leu Asn Lys Ser Ser Glu Phe Lys Glu
85 90 95
Val Pro Glu Arg Gly Arg Met Asp Val Ala Glu Pro Tyr Lys Val Tyr
100 105 110
Gln Leu Leu Pro Pro Gly Ile Val
115 120
<210> 62
<211> 360
<212> DNA
<213> Homo sapiens
<400> 62
atggcatcag gcagggcacg ctgcacccga aaactccgga actgggtggt ggagcaagtg 60
gagagtgggc agtttcccgg agtgtgctgg gatgatacag ctaagaccat gttccggatt 120
ccctggaaac atgcaggcaa gcaggacttc cgggaggacc aggatgctgc cttcttcaag 180
gcctgggcaa tatttaaggg aaagtataag gagggggaca caggaggtcc agctgtctgg 240
aagactcgcc tgcgctgtgc actcaacaag agttctgaat ttaaggaggt tcctgagagg 300
ggccgcatgg atgttgctga gccctacaag gtgtatcagt tgctgccacc aggaatcgtc 360
360
<210> 63
<211> 360
<212> RNA
<213> Homo sapiens
<400> 63
auggcaucag gcagggcacg cugcacccga aaacuccgga acuggguggu ggagcaagug 60
gagagugggc aguuucccgg agugugcugg gaugauacag cuaagaccau guuccggauu 120
cccuggaaac augcaggcaa gcaggacuuc cgggaggacc aggaugcugc cuucuucaag 180
gccugggcaa uauuuaaggg aaaguauaag gagggggaca caggaggucc agcugucugg 240
aagacucgcc ugcgcugugc acucaacaag aguucugaau uuaaggaggu uccugagagg 300
ggccgcaugg auguugcuga gcccuacaag guguaucagu ugcugccacc aggaaucguc 360
360
<210> 64
<211> 363
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF9
<400> 64
atggcttctg gcagagccag atgcacccgg aagctgagaa actgggtcgt cgaacaggtg 60
gaaagcggac agttccctgg cgtgtgctgg gatgataccg ccaagacaat gttcagaatc 120
ccctggaagc acgccggcaa gcaggacttc agagaagatc aggacgccgc cttcttcaag 180
gcctgggcca tcttcaaggg caagtacaaa gagggcgaca caggcggacc tgccgtgtgg 240
aaaaccagac tgagatgcgc cctgaacaag agcagcgagt tcaaagaggt gcccgagcgg 300
ggcagaatgg atgtggccga accttacaag gtgtaccagc tgctgcctcc tggcatcgtg 360
tga 363
<210> 65
<211> 363
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF9
<400> 65
auggcuucug gcagagccag augcacccgg aagcugagaa acugggucgu cgaacaggug 60
gaaagcggac aguucccugg cgugugcugg gaugauaccg ccaagacaau guucagaauc 120
cccuggaagc acgccggcaa gcaggacuuc agagaagauc aggacgccgc cuucuucaag 180
gccugggcca ucuucaaggg caaguacaaa gagggcgaca caggcggacc ugccgugugg 240
aaaaccagac ugagaugcgc ccugaacaag agcagcgagu ucaaagaggu gcccgagcgg 300
ggcagaaugg auguggccga accuuacaag guguaccagc ugcugccucc uggcaucgug 360
uga 363
<210> 66
<211> 750
<212> PRT
<213> Homo sapiens
<400> 66
Met Ser Gln Trp Tyr Glu Leu Gln Gln Leu Asp Ser Lys Phe Leu Glu
1 5 10 15
Gln Val His Gln Leu Tyr Asp Asp Ser Phe Pro Met Glu Ile Arg Gln
20 25 30
Tyr Leu Ala Gln Trp Leu Glu Lys Gln Asp Trp Glu His Ala Ala Asn
35 40 45
Asp Val Ser Phe Ala Thr Ile Arg Phe His Asp Leu Leu Ser Gln Leu
50 55 60
Asp Asp Gln Tyr Ser Arg Phe Ser Leu Glu Asn Asn Phe Leu Leu Gln
65 70 75 80
His Asn Ile Arg Lys Ser Lys Arg Asn Leu Gln Asp Asn Phe Gln Glu
85 90 95
Asp Pro Ile Gln Met Ser Met Ile Ile Tyr Ser Cys Leu Lys Glu Glu
100 105 110
Arg Lys Ile Leu Glu Asn Ala Gln Arg Phe Asn Gln Ala Gln Ser Gly
115 120 125
Asn Ile Gln Ser Thr Val Met Leu Asp Lys Gln Lys Glu Leu Asp Ser
130 135 140
Lys Val Arg Asn Val Lys Asp Lys Val Met Cys Ile Glu His Glu Ile
145 150 155 160
Lys Ser Leu Glu Asp Leu Gln Asp Glu Tyr Asp Phe Lys Cys Lys Thr
165 170 175
Leu Gln Asn Arg Glu His Glu Thr Asn Gly Val Ala Lys Ser Asp Gln
180 185 190
Lys Gln Glu Gln Leu Leu Leu Lys Lys Met Tyr Leu Met Leu Asp Asn
195 200 205
Lys Arg Lys Glu Val Val His Lys Ile Ile Glu Leu Leu Asn Val Thr
210 215 220
Glu Leu Thr Gln Asn Ala Leu Ile Asn Asp Glu Leu Val Glu Trp Lys
225 230 235 240
Arg Arg Gln Gln Ser Ala Cys Ile Gly Gly Pro Pro Asn Ala Cys Leu
245 250 255
Asp Gln Leu Gln Asn Trp Phe Thr Ile Val Ala Glu Ser Leu Gln Gln
260 265 270
Val Arg Gln Gln Leu Lys Lys Leu Glu Glu Leu Glu Gln Lys Tyr Thr
275 280 285
Tyr Glu His Asp Pro Ile Thr Lys Asn Lys Gln Val Leu Trp Asp Arg
290 295 300
Thr Phe Ser Leu Phe Gln Gln Leu Ile Gln Ser Ser Phe Val Val Glu
305 310 315 320
Arg Gln Pro Cys Met Pro Thr His Pro Gln Arg Pro Leu Val Leu Lys
325 330 335
Thr Gly Val Gln Phe Thr Val Lys Leu Arg Leu Leu Val Lys Leu Gln
340 345 350
Glu Leu Asn Tyr Asn Leu Lys Val Lys Val Leu Phe Asp Lys Asp Val
355 360 365
Asn Glu Arg Asn Thr Val Lys Gly Phe Arg Lys Phe Asn Ile Leu Gly
370 375 380
Thr His Thr Lys Val Met Asn Met Glu Glu Ser Thr Asn Gly Ser Leu
385 390 395 400
Ala Ala Glu Phe Arg His Leu Gln Leu Lys Glu Gln Lys Asn Ala Gly
405 410 415
Thr Arg Thr Asn Glu Gly Pro Leu Ile Val Thr Glu Glu Leu His Ser
420 425 430
Leu Ser Phe Glu Thr Gln Leu Cys Gln Pro Gly Leu Val Ile Asp Leu
435 440 445
Glu Thr Thr Ser Leu Pro Val Val Val Ile Ser Asn Val Ser Gln Leu
450 455 460
Pro Ser Gly Trp Ala Ser Ile Leu Trp Tyr Asn Met Leu Val Ala Glu
465 470 475 480
Pro Arg Asn Leu Ser Phe Phe Leu Thr Pro Pro Cys Ala Arg Trp Ala
485 490 495
Gln Leu Ser Glu Val Leu Ser Trp Gln Phe Ser Ser Val Thr Lys Arg
500 505 510
Gly Leu Asn Val Asp Gln Leu Asn Met Leu Gly Glu Lys Leu Leu Gly
515 520 525
Pro Asn Ala Ser Pro Asp Gly Leu Ile Pro Trp Thr Arg Phe Cys Lys
530 535 540
Glu Asn Ile Asn Asp Lys Asn Phe Pro Phe Trp Leu Trp Ile Glu Ser
545 550 555 560
Ile Leu Glu Leu Ile Lys Lys His Leu Leu Pro Leu Trp Asn Asp Gly
565 570 575
Cys Ile Met Gly Phe Ile Ser Lys Glu Arg Glu Arg Ala Leu Leu Lys
580 585 590
Asp Gln Gln Pro Gly Thr Phe Leu Leu Arg Phe Ser Glu Ser Ser Arg
595 600 605
Glu Gly Ala Ile Thr Phe Thr Trp Val Glu Arg Ser Gln Asn Gly Gly
610 615 620
Glu Pro Asp Phe His Ala Val Glu Pro Tyr Thr Lys Lys Glu Leu Ser
625 630 635 640
Ala Val Thr Phe Pro Asp Ile Ile Arg Asn Tyr Lys Val Met Ala Ala
645 650 655
Glu Asn Ile Pro Glu Asn Pro Leu Lys Tyr Leu Tyr Pro Asn Ile Asp
660 665 670
Lys Asp His Ala Phe Gly Lys Tyr Tyr Ser Arg Pro Lys Glu Ala Pro
675 680 685
Glu Pro Met Glu Leu Asp Gly Pro Lys Gly Thr Gly Phe Ile Lys Thr
690 695 700
Glu Leu Ile Ser Val Ser Glu Val His Pro Ser Arg Leu Gln Thr Thr
705 710 715 720
Asp Asn Leu Leu Pro Met Ser Pro Glu Glu Phe Asp Glu Val Ser Arg
725 730 735
Ile Val Gly Ser Val Glu Phe Asp Ser Met Met Asn Thr Val
740 745 750
<210> 67
<211> 2250
<212> DNA
<213> Homo sapiens
<400> 67
atgtctcagt ggtacgaact tcagcagctt gactcaaaat tcctggagca ggttcaccag 60
ctttatgatg acagttttcc catggaaatc agacagtacc tggcacagtg gttagaaaag 120
caagactggg agcacgctgc caatgatgtt tcatttgcca ccatccgttt tcatgacctc 180
ctgtcacagc tggatgatca atatagtcgc ttttctttgg agaataactt cttgctacag 240
cataacataa ggaaaagcaa gcgtaatctt caggataatt ttcaggaaga cccaatccag 300
atgtctatga tcatttacag ctgtctgaag gaagaaagga aaattctgga aaacgcccag 360
agatttaatc aggctcagtc ggggaatatt cagagcacag tgatgttaga caaacagaaa 420
gagcttgaca gtaaagtcag aaatgtgaag gacaaggtta tgtgtataga gcatgaaatc 480
aagagcctgg aagatttaca agatgaatat gacttcaaat gcaaaacctt gcagaacaga 540
gaacacgaga ccaatggtgt ggcaaagagt gatcagaaac aagaacagct gttactcaag 600
aagatgtatt taatgcttga caataagaga aaggaagtag ttcacaaaat aatagagttg 660
ctgaatgtca ctgaacttac ccagaatgcc ctgattaatg atgaactagt ggagtggaag 720
cggagacagc agagcgcctg tattgggggg ccgcccaatg cttgcttgga tcagctgcag 780
aactggttca ctatagttgc ggagagtctg cagcaagttc ggcagcagct taaaaagttg 840
gaggaattgg aacagaaata cacctacgaa catgacccta tcacaaaaaa caaacaagtg 900
ttatgggacc gcaccttcag tcttttccag cagctcattc agagctcgtt tgtggtggaa 960
agacagccct gcatgccaac gcaccctcag aggccgctgg tcttgaagac aggggtccag 1020
ttcactgtga agttgagact gttggtgaaa ttgcaagagc tgaattataa tttgaaagtc 1080
aaagtcttat ttgataaaga tgtgaatgag agaaatacag taaaaggatt taggaagttc 1140
aacattttgg gcacgcacac aaaagtgatg aacatggagg agtccaccaa tggcagtctg 1200
gcggctgaat ttcggcacct gcaattgaaa gaacagaaaa atgctggcac cagaacgaat 1260
gagggtcctc tcatcgttac tgaagagctt cactccctta gttttgaaac ccaattgtgc 1320
cagcctggtt tggtaattga cctcgagacg acctctctgc ccgttgtggt gatctccaac 1380
gtcagccagc tcccgagcgg ttgggcctcc atcctttggt acaacatgct ggtggcggaa 1440
cccaggaatc tgtccttctt cctgactcca ccatgtgcac gatgggctca gctttcagaa 1500
gtgctgagtt ggcagttttc ttctgtcacc aaaagaggtc tcaatgtgga ccagctgaac 1560
atgttgggag agaagcttct tggtcctaac gccagccccg atggtctcat tccgtggacg 1620
aggttttgta aggaaaatat aaatgataaa aattttccct tctggctttg gattgaaagc 1680
atcctagaac tcattaaaaa acacctgctc cctctctgga atgatgggtg catcatgggc 1740
ttcatcagca aggagcgaga gcgtgccctg ttgaaggacc agcagccggg gaccttcctg 1800
ctgcggttca gtgagagctc ccgggaaggg gccatcacat tcacatgggt ggagcggtcc 1860
cagaacggag gcgaacctga cttccatgcg gttgaaccct acacgaagaa agaactttct 1920
gctgttactt tccctgacat cattcgcaat tacaaagtca tggctgctga gaatattcct 1980
gagaatcccc tgaagtatct gtatccaaat attgacaaag accatgcctt tggaaagtat 2040
tactccaggc caaaggaagc accagagcca atggaacttg atggccctaa aggaactgga 2100
tttatcaaga ctgagttgat ttctgtgtct gaagttcacc cttctagact tcagaccaca 2160
gacaacctgc tccccatgtc tcctgaggag tttgacgagg tgtctcggat agtgggctct 2220
gtagaattcg acagtatgat gaacacagta 2250
<210> 68
<211> 2250
<212> RNA
<213> Homo sapiens
<400> 68
augucucagu gguacgaacu ucagcagcuu gacucaaaau uccuggagca gguucaccag 60
cuuuaugaug acaguuuucc cauggaaauc agacaguacc uggcacagug guuagaaaag 120
caagacuggg agcacgcugc caaugauguu ucauuugcca ccauccguuu ucaugaccuc 180
cugucacagc uggaugauca auauagucgc uuuucuuugg agaauaacuu cuugcuacag 240
cauaacauaa ggaaaagcaa gcguaaucuu caggauaauu uucaggaaga cccaauccag 300
augucuauga ucauuuacag cugucugaag gaagaaagga aaauucugga aaacgcccag 360
agauuuaauc aggcucaguc ggggaauauu cagagcacag ugauguuaga caaacagaaa 420
gagcuugaca guaaagucag aaaugugaag gacaagguua uguguauaga gcaugaaauc 480
aagagccugg aagauuuaca agaugaauau gacuucaaau gcaaaaccuu gcagaacaga 540
gaacacgaga ccaauggugu ggcaaagagu gaucagaaac aagaacagcu guuacucaag 600
aagauguauu uaaugcuuga caauaagaga aaggaaguag uucacaaaau aauagaguug 660
cugaauguca cugaacuuac ccagaaugcc cugauuaaug augaacuagu ggaguggaag 720
cggagacagc agagcgccug uauugggggg ccgcccaaug cuugcuugga ucagcugcag 780
aacugguuca cuauaguugc ggagagucug cagcaaguuc ggcagcagcu uaaaaaguug 840
gaggaauugg aacagaaaua caccuacgaa caugacccua ucacaaaaaa caaacaagug 900
uuaugggacc gcaccuucag ucuuuuccag cagcucauuc agagcucguu ugugguggaa 960
agacagcccu gcaugccaac gcacccucag aggccgcugg ucuugaagac agggguccag 1020
uucacuguga aguugagacu guuggugaaa uugcaagagc ugaauuauaa uuugaaaguc 1080
aaagucuuau uugauaaaga ugugaaugag agaaauacag uaaaaggauu uaggaaguuc 1140
aacauuuugg gcacgcacac aaaagugaug aacauggagg aguccaccaa uggcagucug 1200
gcggcugaau uucggcaccu gcaauugaaa gaacagaaaa augcuggcac cagaacgaau 1260
gaggguccuc ucaucguuac ugaagagcuu cacucccuua guuuugaaac ccaauugugc 1320
cagccugguu ugguaauuga ccucgagacg accucucugc ccguuguggu gaucuccaac 1380
gucagccagc ucccgagcgg uugggccucc auccuuuggu acaacaugcu gguggcggaa 1440
cccaggaauc uguccuucuu ccugacucca ccaugugcac gaugggcuca gcuuucagaa 1500
gugcugaguu ggcaguuuuc uucugucacc aaaagagguc ucaaugugga ccagcugaac 1560
auguugggag agaagcuucu ugguccuaac gccagccccg auggucucau uccguggacg 1620
agguuuugua aggaaaauau aaaugauaaa aauuuucccu ucuggcuuug gauugaaagc 1680
auccuagaac ucauuaaaaa acaccugcuc ccucucugga augaugggug caucaugggc 1740
uucaucagca aggagcgaga gcgugcccug uugaaggacc agcagccggg gaccuuccug 1800
cugcgguuca gugagagcuc ccgggaaggg gccaucacau ucacaugggu ggagcggucc 1860
cagaacggag gcgaaccuga cuuccaugcg guugaacccu acacgaagaa agaacuuucu 1920
gcuguuacuu ucccugacau cauucgcaau uacaaaguca uggcugcuga gaauauuccu 1980
gagaaucccc ugaaguaucu guauccaaau auugacaaag accaugccuu uggaaaguau 2040
uacuccaggc caaaggaagc accagagcca auggaacuug auggcccuaa aggaacugga 2100
uuuaucaaga cugaguugau uucugugucu gaaguucacc cuucuagacu ucagaccaca 2160
gacaaccugc uccccauguc uccugaggag uuugacgagg ugucucggau agugggcucu 2220
guagaauucg acaguaugau gaacacagua 2250
<210> 69
<211> 2253
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding STAT1
<400> 69
atgagccagt ggtacgagct gcagcagctg gacagcaagt tcctggaaca ggtgcaccag 60
ctgtacgacg acagcttccc catggaaatc cggcagtatc tggcccagtg gctggaaaag 120
caggattggg aacacgccgc caacgacgtg tccttcgcca ccatcagatt ccacgacctg 180
ctgagccagc tggacgacca gtacagcaga ttcagcctgg aaaacaactt cctgctccag 240
cacaacatcc ggaagtccaa gcggaacctg caggacaact tccaagagga ccccatccag 300
atgtccatga tcatctacag ctgcctgaaa gaggaacgga agatcctgga aaatgcccag 360
cggttcaatc aggcccagag cggcaatatc cagagcaccg tgatgctgga caagcagaaa 420
gaactggact ccaaagtgcg gaacgtcaag gacaaagtga tgtgcatcga gcacgagatc 480
aagagcctgg aagatctgca ggacgagtac gacttcaagt gcaagaccct gcagaaccgg 540
gaacacgaga caaacggcgt ggccaagagc gaccagaagc aagaacagct gctcctgaag 600
aaaatgtacc tgatgctcga caacaaacgg aaagaggtgg tccacaagat catcgagctg 660
ctgaacgtga ccgagctgac ccagaacgcc ctgatcaacg acgagctggt ggaatggaag 720
cggagacagc agtctgcctg tatcggcgga cctcctaatg cctgcctgga ccagctgcag 780
aactggttca caatcgtggc cgagagcctg cagcaagtgc gccagcagct gaagaagctg 840
gaagaactcg agcagaagta cacctacgag cacgacccca tcaccaagaa caaacaggtg 900
ctgtgggaca gaaccttcag cctgttccaa cagctgatcc agtccagctt cgtggtggaa 960
agacagccct gcatgcctac acaccctcag aggccactgg tgctgaaaac cggcgtgcag 1020
ttcaccgtga agctgcggct gctggtcaag ctgcaagagc tgaactacaa cctgaaagtg 1080
aaggtgctgt tcgacaagga cgtgaacgag cggaacaccg tgaaaggctt ccgcaagttc 1140
aacatcctgg gcacccacac aaaagtgatg aacatggaag agagcaccaa cggcagcctg 1200
gccgccgagt ttagacacct ccagctgaaa gagcagaaga acgccggcac caggaccaat 1260
gagggacctc tgatcgtgac agaggaactg cacagcctga gcttcgaaac ccagctgtgt 1320
cagccaggcc tcgtgatcga tctggaaacc acaagcctgc ctgtggtggt catcagcaat 1380
gtgtcccagc tgccttctgg ctgggccagc atcctgtggt acaacatgct ggtggccgag 1440
cctcggaacc tgtccttctt tctgacccct ccatgtgcca gatgggccca gctgtctgaa 1500
gtgctgagct ggcagtttag cagcgtgacc aagaggggcc tgaatgtcga ccagctgaat 1560
atgctgggcg agaagctgct gggccccaac gcttctcctg atggactgat cccttggacc 1620
agattctgca aagagaatat caacgacaag aacttcccgt tctggctgtg gatcgagagc 1680
atcctggaac tgatcaagaa acatctgctg cccctgtgga acgacggctg catcatgggc 1740
ttcatctcca aagagagaga gcgggccctg ctgaaggatc agcagccagg cacattcctg 1800
ctgcggttta gcgagtctag cagagagggc gccatcacct ttacctgggt cgagagatct 1860
cagaacggcg gcgagcctga ttttcacgcc gtggaaccct acaccaaaaa agaactgagc 1920
gccgtgacat tccccgacat catccggaac tacaaagtca tggccgctga gaatatcccc 1980
gagaatcccc tgaagtatct gtaccccaac atcgataagg accacgcctt cggcaagtac 2040
tacagcagac ccaaagaggc ccctgagcct atggaactgg atggccctaa aggcaccggc 2100
ttcatcaaga cagagctgat ctccgtgtcc gaggtgcacc ctagcagact gcagaccacc 2160
gataacctgc tgcctatgag ccccgaggaa ttcgacgagg tgtccagaat cgtgggcagc 2220
gtggaattcg atagcatgat gaataccgtg tga 2253
<210> 70
<211> 2253
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding STAT1
<400> 70
augagccagu gguacgagcu gcagcagcug gacagcaagu uccuggaaca ggugcaccag 60
cuguacgacg acagcuuccc cauggaaauc cggcaguauc uggcccagug gcuggaaaag 120
caggauuggg aacacgccgc caacgacgug uccuucgcca ccaucagauu ccacgaccug 180
cugagccagc uggacgacca guacagcaga uucagccugg aaaacaacuu ccugcuccag 240
cacaacaucc ggaaguccaa gcggaaccug caggacaacu uccaagagga ccccauccag 300
auguccauga ucaucuacag cugccugaaa gaggaacgga agauccugga aaaugcccag 360
cgguucaauc aggcccagag cggcaauauc cagagcaccg ugaugcugga caagcagaaa 420
gaacuggacu ccaaagugcg gaacgucaag gacaaaguga ugugcaucga gcacgagauc 480
aagagccugg aagaucugca ggacgaguac gacuucaagu gcaagacccu gcagaaccgg 540
gaacacgaga caaacggcgu ggccaagagc gaccagaagc aagaacagcu gcuccugaag 600
aaaauguacc ugaugcucga caacaaacgg aaagaggugg uccacaagau caucgagcug 660
cugaacguga ccgagcugac ccagaacgcc cugaucaacg acgagcuggu ggaauggaag 720
cggagacagc agucugccug uaucggcgga ccuccuaaug ccugccugga ccagcugcag 780
aacugguuca caaucguggc cgagagccug cagcaagugc gccagcagcu gaagaagcug 840
gaagaacucg agcagaagua caccuacgag cacgacccca ucaccaagaa caaacaggug 900
cugugggaca gaaccuucag ccuguuccaa cagcugaucc aguccagcuu cgugguggaa 960
agacagcccu gcaugccuac acacccucag aggccacugg ugcugaaaac cggcgugcag 1020
uucaccguga agcugcggcu gcuggucaag cugcaagagc ugaacuacaa ccugaaagug 1080
aaggugcugu ucgacaagga cgugaacgag cggaacaccg ugaaaggcuu ccgcaaguuc 1140
aacauccugg gcacccacac aaaagugaug aacauggaag agagcaccaa cggcagccug 1200
gccgccgagu uuagacaccu ccagcugaaa gagcagaaga acgccggcac caggaccaau 1260
gagggaccuc ugaucgugac agaggaacug cacagccuga gcuucgaaac ccagcugugu 1320
cagccaggcc ucgugaucga ucuggaaacc acaagccugc cugugguggu caucagcaau 1380
gugucccagc ugccuucugg cugggccagc auccuguggu acaacaugcu gguggccgag 1440
ccucggaacc uguccuucuu ucugaccccu ccaugugcca gaugggccca gcugucugaa 1500
gugcugagcu ggcaguuuag cagcgugacc aagaggggcc ugaaugucga ccagcugaau 1560
augcugggcg agaagcugcu gggccccaac gcuucuccug auggacugau cccuuggacc 1620
agauucugca aagagaauau caacgacaag aacuucccgu ucuggcugug gaucgagagc 1680
auccuggaac ugaucaagaa acaucugcug ccccugugga acgacggcug caucaugggc 1740
uucaucucca aagagagaga gcgggcccug cugaaggauc agcagccagg cacauuccug 1800
cugcgguuua gcgagucuag cagagagggc gccaucaccu uuaccugggu cgagagaucu 1860
cagaacggcg gcgagccuga uuuucacgcc guggaacccu acaccaaaaa agaacugagc 1920
gccgugacau uccccgacau cauccggaac uacaaaguca uggccgcuga gaauaucccc 1980
gagaaucccc ugaaguaucu guaccccaac aucgauaagg accacgccuu cggcaaguac 2040
uacagcagac ccaaagaggc cccugagccu auggaacugg auggcccuaa aggcaccggc 2100
uucaucaaga cagagcugau cuccgugucc gaggugcacc cuagcagacu gcagaccacc 2160
gauaaccugc ugccuaugag ccccgaggaa uucgacgagg uguccagaau cgugggcagc 2220
guggaauucg auagcaugau gaauaccgug uga 2253
<210> 71
<211> 183
<212> PRT
<213> Homo sapiens
<400> 71
Val Leu Glu Thr Pro Val Glu Ser Gln Gln His Glu Ile Glu Ser Arg
1 5 10 15
Ile Leu Asp Leu Arg Ala Met Met Glu Lys Leu Val Lys Ser Ile Ser
20 25 30
Gln Leu Lys Asp Gln Gln Asp Val Phe Cys Phe Arg Tyr Lys Ile Gln
35 40 45
Ala Lys Gly Lys Thr Pro Ser Leu Asp Pro His Gln Thr Lys Glu Gln
50 55 60
Lys Ile Leu Gln Glu Thr Leu Asn Glu Leu Asp Lys Arg Arg Lys Glu
65 70 75 80
Val Leu Asp Ala Ser Lys Ala Leu Leu Gly Arg Leu Thr Thr Leu Ile
85 90 95
Glu Leu Leu Leu Pro Lys Leu Glu Glu Trp Lys Ala Gln Gln Gln Lys
100 105 110
Ala Cys Ile Arg Ala Pro Ile Asp His Gly Leu Glu Gln Leu Glu Thr
115 120 125
Trp Phe Thr Ala Gly Ala Lys Leu Leu Phe His Leu Arg Gln Leu Leu
130 135 140
Lys Glu Leu Lys Gly Leu Ser Cys Leu Val Ser Tyr Gln Asp Asp Pro
145 150 155 160
Leu Thr Lys Gly Val Asp Leu Arg Asn Ala Gln Val Thr Glu Leu Leu
165 170 175
Gln Arg Leu Leu His Arg Ala
180
<210> 72
<211> 549
<212> DNA
<213> Homo sapiens
<400> 72
gttctcgaaa cacctgtgga gagccagcaa catgagattg aatcccggat cctggattta 60
agggctatga tggagaagct ggtaaaatcc atcagccaac tgaaagacca gcaggatgtc 120
ttctgcttcc gatataagat ccaggccaaa gggaagacac cctctctgga cccccatcag 180
accaaagagc agaagattct gcaggaaact ctcaatgaac tggacaaaag gagaaaggag 240
gtgctggatg cctccaaagc actgctaggc cgattaacta ccctaatcga gctactgctg 300
ccaaagttgg aggagtggaa ggcccagcag caaaaagcct gcatcagagc tcccattgac 360
cacgggttgg aacagctgga gacatggttc acagctggag caaagctgtt gtttcacctg 420
aggcagctgc tgaaggagct gaagggactg agttgcctgg ttagctatca ggatgaccct 480
ctgaccaaag gggtggacct acgcaacgcc caggtcacag agttgctaca gcgtctgctc 540
cacagagcc 549
<210> 73
<211> 549
<212> RNA
<213> Homo sapiens
<400> 73
guucucgaaa caccugugga gagccagcaa caugagauug aaucccggau ccuggauuua 60
agggcuauga uggagaagcu gguaaaaucc aucagccaac ugaaagacca gcaggauguc 120
uucugcuucc gauauaagau ccaggccaaa gggaagacac ccucucugga cccccaucag 180
accaaagagc agaagauucu gcaggaaacu cucaaugaac uggacaaaag gagaaaggag 240
gugcuggaug ccuccaaagc acugcuaggc cgauuaacua cccuaaucga gcuacugcug 300
ccaaaguugg aggaguggaa ggcccagcag caaaaagccu gcaucagagc ucccauugac 360
cacggguugg aacagcugga gacaugguuc acagcuggag caaagcuguu guuucaccug 420
aggcagcugc ugaaggagcu gaagggacug aguugccugg uuagcuauca ggaugacccu 480
cugaccaaag ggguggaccu acgcaacgcc caggucacag aguugcuaca gcgucugcuc 540
cacagagcc 549
<210> 74
<211> 555
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding STAT2
<400> 74
atggtgctgg aaacccctgt ggaaagccag cagcacgaga tcgagagcag aatcctggac 60
ctgcgggcca tgatggaaaa gctggtcaag agcatcagcc agctgaagga ccagcaggac 120
gtgttctgct tccggtacaa gatccaggcc aagggcaaga cccctagcct ggatcctcac 180
cagaccaaag agcagaagat cctgcaagag acactgaacg agctggacaa gcggcggaaa 240
gaagtgctgg acgcctctaa agctctgctg ggcagactga ccactctgat cgaactgctg 300
ctgcccaagc tggaagagtg gaaggcccag caacagaagg cctgcatcag agcccctatc 360
gaccacggac tggaacagct ggaaacatgg tttaccgctg gcgccaagct gctgttccac 420
ctgagacagc tgctgaaaga gctgaagggc ctgagctgcc tggtgtccta ccaggatgac 480
cctctgacca aaggcgtgga cctgagaaac gcccaagtga ccgaactgct ccagcggctg 540
ctgcatagag cttga 555
<210> 75
<211> 555
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding STAT2
<400> 75
auggugcugg aaaccccugu ggaaagccag cagcacgaga ucgagagcag aauccuggac 60
cugcgggcca ugauggaaaa gcuggucaag agcaucagcc agcugaagga ccagcaggac 120
guguucugcu uccgguacaa gauccaggcc aagggcaaga ccccuagccu ggauccucac 180
cagaccaaag agcagaagau ccugcaagag acacugaacg agcuggacaa gcggcggaaa 240
gaagugcugg acgccucuaa agcucugcug ggcagacuga ccacucugau cgaacugcug 300
cugcccaagc uggaagagug gaaggcccag caacagaagg ccugcaucag agccccuauc 360
gaccacggac uggaacagcu ggaaacaugg uuuaccgcug gcgccaagcu gcuguuccac 420
cugagacagc ugcugaaaga gcugaagggc cugagcugcc ugguguccua ccaggaugac 480
ccucugacca aaggcgugga ccugagaaac gcccaaguga ccgaacugcu ccagcggcug 540
cugcauagag cuuga 555
<210> 76
<211> 851
<212> PRT
<213> Homo sapiens
<400> 76
Met Ala Gln Trp Glu Met Leu Gln Asn Leu Asp Ser Pro Phe Gln Asp
1 5 10 15
Gln Leu His Gln Leu Tyr Ser His Ser Leu Leu Pro Val Asp Ile Arg
20 25 30
Gln Tyr Leu Ala Val Trp Ile Glu Asp Gln Asn Trp Gln Glu Ala Ala
35 40 45
Leu Gly Ser Asp Asp Ser Lys Ala Thr Met Leu Phe Phe His Phe Leu
50 55 60
Asp Gln Leu Asn Tyr Glu Cys Gly Arg Cys Ser Gln Asp Pro Glu Ser
65 70 75 80
Leu Leu Leu Gln His Asn Leu Arg Lys Phe Cys Arg Asp Ile Gln Pro
85 90 95
Phe Ser Gln Asp Pro Thr Gln Leu Ala Glu Met Ile Phe Asn Leu Leu
100 105 110
Leu Glu Glu Lys Arg Ile Leu Ile Gln Ala Gln Arg Ala Gln Leu Glu
115 120 125
Gln Gly Glu Pro Val Leu Glu Thr Pro Val Glu Ser Gln Gln His Glu
130 135 140
Ile Glu Ser Arg Ile Leu Asp Leu Arg Ala Met Met Glu Lys Leu Val
145 150 155 160
Lys Ser Ile Ser Gln Leu Lys Asp Gln Gln Asp Val Phe Cys Asp Arg
165 170 175
Tyr Lys Ile Gln Ala Lys Gly Lys Thr Pro Ser Leu Asp Pro His Gln
180 185 190
Thr Lys Glu Gln Lys Ile Leu Gln Glu Thr Leu Asn Glu Leu Asp Lys
195 200 205
Arg Arg Lys Glu Val Leu Asp Ala Ser Lys Ala Leu Leu Gly Arg Leu
210 215 220
Thr Thr Leu Ile Glu Leu Leu Leu Pro Lys Leu Glu Glu Trp Lys Ala
225 230 235 240
Gln Gln Gln Lys Ala Cys Ile Arg Ala Pro Ile Asp His Gly Leu Glu
245 250 255
Gln Leu Glu Thr Trp Phe Thr Ala Gly Ala Lys Leu Leu Phe His Leu
260 265 270
Arg Gln Leu Leu Lys Glu Leu Lys Gly Leu Ser Cys Leu Val Ser Tyr
275 280 285
Gln Asp Asp Pro Leu Thr Lys Gly Val Asp Leu Arg Asn Ala Gln Val
290 295 300
Thr Glu Leu Leu Gln Arg Leu Leu His Arg Ala Phe Val Val Glu Thr
305 310 315 320
Gln Pro Cys Met Pro Gln Thr Pro His Arg Pro Leu Ile Leu Lys Thr
325 330 335
Gly Ser Lys Phe Thr Val Arg Thr Arg Leu Leu Val Arg Leu Gln Glu
340 345 350
Gly Asn Glu Ser Leu Thr Val Glu Val Ser Ile Asp Arg Asn Pro Pro
355 360 365
Gln Leu Gln Gly Phe Arg Lys Phe Asn Ile Leu Thr Ser Asn Gln Lys
370 375 380
Thr Leu Thr Pro Glu Lys Gly Gln Ser Gln Gly Leu Ile Trp Asp Phe
385 390 395 400
Gly Tyr Leu Thr Leu Val Glu Gln Arg Ser Gly Gly Ser Gly Lys Gly
405 410 415
Ser Asn Lys Gly Pro Leu Gly Val Thr Glu Glu Leu His Ile Ile Ser
420 425 430
Phe Thr Val Lys Tyr Thr Tyr Gln Gly Leu Lys Gln Glu Leu Lys Thr
435 440 445
Asp Thr Leu Pro Val Val Ile Ile Ser Asn Met Asn Gln Leu Ser Ile
450 455 460
Ala Trp Ala Ser Val Leu Trp Phe Asn Leu Leu Ser Pro Asn Leu Gln
465 470 475 480
Asn Gln Gln Phe Phe Ser Asn Pro Pro Lys Ala Pro Trp Ser Leu Leu
485 490 495
Gly Pro Ala Leu Ser Trp Gln Phe Ser Ser Tyr Val Gly Arg Gly Leu
500 505 510
Asn Ser Asp Gln Leu Ser Met Leu Arg Asn Lys Leu Phe Gly Gln Asn
515 520 525
Cys Arg Thr Glu Asp Pro Leu Leu Ser Trp Ala Asp Phe Thr Lys Arg
530 535 540
Glu Ser Pro Pro Gly Lys Leu Pro Phe Trp Thr Trp Leu Asp Lys Ile
545 550 555 560
Leu Glu Leu Val His Asp His Leu Lys Asp Leu Trp Asn Asp Gly Arg
565 570 575
Ile Met Gly Phe Val Ser Arg Ser Gln Glu Arg Arg Leu Leu Lys Lys
580 585 590
Thr Met Ser Gly Thr Phe Leu Leu Arg Phe Ser Glu Ser Ser Glu Gly
595 600 605
Gly Ile Thr Cys Ser Trp Val Glu His Gln Asp Asp Asp Lys Val Leu
610 615 620
Ile Tyr Ser Val Gln Pro Tyr Thr Lys Glu Val Leu Gln Ser Leu Pro
625 630 635 640
Leu Thr Glu Ile Ile Arg His Tyr Gln Leu Leu Thr Glu Glu Asn Ile
645 650 655
Pro Glu Asn Pro Leu Arg Phe Leu Tyr Pro Arg Ile Pro Arg Asp Glu
660 665 670
Ala Phe Gly Cys Tyr Tyr Gln Glu Lys Val Asn Leu Gln Glu Arg Arg
675 680 685
Lys Tyr Leu Lys His Arg Leu Ile Val Val Ser Asn Arg Gln Val Asp
690 695 700
Glu Leu Gln Gln Pro Leu Glu Leu Lys Pro Glu Pro Glu Leu Glu Ser
705 710 715 720
Leu Glu Leu Glu Leu Gly Leu Val Pro Glu Pro Glu Leu Ser Leu Asp
725 730 735
Leu Glu Pro Leu Leu Lys Ala Gly Leu Asp Leu Gly Pro Glu Leu Glu
740 745 750
Ser Val Leu Glu Ser Thr Leu Glu Pro Val Ile Glu Pro Thr Leu Cys
755 760 765
Met Val Ser Gln Thr Val Pro Glu Pro Asp Gln Gly Pro Val Ser Gln
770 775 780
Pro Val Pro Glu Pro Asp Leu Pro Cys Asp Leu Arg His Leu Asn Thr
785 790 795 800
Glu Pro Met Glu Ile Phe Arg Asn Cys Val Lys Ile Glu Glu Ile Met
805 810 815
Pro Asn Gly Asp Pro Leu Leu Ala Gly Gln Asn Thr Val Asp Glu Val
820 825 830
Tyr Val Ser Arg Pro Ser His Phe Tyr Thr Asp Gly Pro Leu Met Pro
835 840 845
Ser Asp Phe
850
<210> 77
<211> 2553
<212> DNA
<213> Homo sapiens
<400> 77
atggcgcagt gggaaatgct gcagaatctt gacagcccct ttcaggatca gctgcaccag 60
ctttactcgc acagcctcct gcctgtggac attcgacagt acttggctgt ctggattgaa 120
gaccagaact ggcaggaagc tgcacttggg agtgatgatt ccaaggctac catgctattc 180
ttccacttct tggatcagct gaactatgag tgtggccgtt gcagccagga cccagagtcc 240
ttgttgctgc agcacaattt gcggaaattc tgccgggaca ttcagccctt ttcccaggat 300
cctacccagt tggctgagat gatctttaac ctccttctgg aagaaaaaag aattttgatc 360
caggctcaga gggcccaatt ggaacaagga gagccagttc tcgaaacacc tgtggagagc 420
cagcaacatg agattgaatc ccggatcctg gatttaaggg ctatgatgga gaagctggta 480
aaatccatca gccaactgaa agaccagcag gatgtcttct gcgaccgata taagatccag 540
gccaaaggga agacaccctc tctggacccc catcagacca aagagcagaa gattctgcag 600
gaaactctca atgaactgga caaaaggaga aaggaggtgc tggatgcctc caaagcactg 660
ctaggccgat taactaccct aatcgagcta ctgctgccaa agttggagga gtggaaggcc 720
cagcagcaaa aagcctgcat cagagctccc attgaccacg ggttggaaca gctggagaca 780
tggttcacag ctggagcaaa gctgttgttt cacctgaggc agctgctgaa ggagctgaag 840
ggactgagtt gcctggttag ctatcaggat gaccctctga ccaaaggggt ggacctacgc 900
aacgcccagg tcacagagtt gctacagcgt ctgctccaca gagcctttgt ggtagaaacc 960
cagccctgca tgccccaaac tccccatcga cccctcatcc tcaagactgg cagcaagttc 1020
accgtccgaa caaggctgct ggtgagactc caggaaggca atgagtcact gactgtggaa 1080
gtctccattg acaggaatcc tcctcaatta caaggcttcc ggaagttcaa cattctgact 1140
tcaaaccaga aaactttgac ccccgagaag gggcagagtc agggtttgat ttgggacttt 1200
ggttacctga ctctggtgga gcaacgttca ggtggttcag gaaagggcag caataagggg 1260
ccactaggtg tgacagagga actgcacatc atcagcttca cggtcaaata tacctaccag 1320
ggtctgaagc aggagctgaa aacggacacc ctccctgtgg tgattatttc caacatgaac 1380
cagctctcaa ttgcctgggc ttcagttctc tggttcaatt tgctcagccc aaaccttcag 1440
aaccagcagt tcttctccaa cccccccaag gccccctgga gcttgctggg ccctgctctc 1500
agttggcagt tctcctccta tgttggccga ggcctcaact cagaccagct gagcatgctg 1560
agaaacaagc tgttcgggca gaactgtagg actgaggatc cattattgtc ctgggctgac 1620
ttcactaagc gagagagccc tcctggcaag ttaccattct ggacatggct ggacaaaatt 1680
ctggagttgg tacatgacca cctgaaggat ctctggaatg atggacgcat catgggcttt 1740
gtgagtcgga gccaggagcg ccggctgctg aagaagacca tgtctggcac ctttctactg 1800
cgcttcagtg aatcgtcaga agggggcatt acctgctcct gggtggagca ccaggatgat 1860
gacaaggtgc tcatctactc tgtgcaaccg tacacgaagg aggtgctgca gtcactcccg 1920
ctgactgaaa tcatccgcca ttaccagttg ctcactgagg agaatatacc tgaaaaccca 1980
ctgcgcttcc tctatccccg aatcccccgg gatgaagctt ttgggtgcta ctaccaggag 2040
aaagttaatc tccaggaacg gaggaaatac ctgaaacaca ggctcattgt ggtctctaat 2100
agacaggtgg atgaactgca acaaccgctg gagcttaagc cagagccaga gctggagtca 2160
ttagagctgg aactagggct ggtgccagag ccagagctca gcctggactt agagccactg 2220
ctgaaggcag ggctggatct ggggccagag ctagagtctg tgctggagtc cactctggag 2280
cctgtgatag agcccacact atgcatggta tcacaaacag tgccagagcc agaccaagga 2340
cctgtatcac agccagtgcc agagccagat ttgccctgtg atctgagaca tttgaacact 2400
gagccaatgg aaatcttcag aaactgtgta aagattgaag aaatcatgcc gaatggtgac 2460
ccactgttgg ctggccagaa caccgtggat gaggtttacg tctcccgccc cagccacttc 2520
tacactgatg gacccttgat gccttctgac ttc 2553
<210> 78
<211> 2553
<212> RNA
<213> Homo sapiens
<400> 78
auggcgcagu gggaaaugcu gcagaaucuu gacagccccu uucaggauca gcugcaccag 60
cuuuacucgc acagccuccu gccuguggac auucgacagu acuuggcugu cuggauugaa 120
gaccagaacu ggcaggaagc ugcacuuggg agugaugauu ccaaggcuac caugcuauuc 180
uuccacuucu uggaucagcu gaacuaugag uguggccguu gcagccagga cccagagucc 240
uuguugcugc agcacaauuu gcggaaauuc ugccgggaca uucagcccuu uucccaggau 300
ccuacccagu uggcugagau gaucuuuaac cuccuucugg aagaaaaaag aauuuugauc 360
caggcucaga gggcccaauu ggaacaagga gagccaguuc ucgaaacacc uguggagagc 420
cagcaacaug agauugaauc ccggauccug gauuuaaggg cuaugaugga gaagcuggua 480
aaauccauca gccaacugaa agaccagcag gaugucuucu gcgaccgaua uaagauccag 540
gccaaaggga agacacccuc ucuggacccc caucagacca aagagcagaa gauucugcag 600
gaaacucuca augaacugga caaaaggaga aaggaggugc uggaugccuc caaagcacug 660
cuaggccgau uaacuacccu aaucgagcua cugcugccaa aguuggagga guggaaggcc 720
cagcagcaaa aagccugcau cagagcuccc auugaccacg gguuggaaca gcuggagaca 780
ugguucacag cuggagcaaa gcuguuguuu caccugaggc agcugcugaa ggagcugaag 840
ggacugaguu gccugguuag cuaucaggau gacccucuga ccaaaggggu ggaccuacgc 900
aacgcccagg ucacagaguu gcuacagcgu cugcuccaca gagccuuugu gguagaaacc 960
cagcccugca ugccccaaac uccccaucga ccccucaucc ucaagacugg cagcaaguuc 1020
accguccgaa caaggcugcu ggugagacuc caggaaggca augagucacu gacuguggaa 1080
gucuccauug acaggaaucc uccucaauua caaggcuucc ggaaguucaa cauucugacu 1140
ucaaaccaga aaacuuugac ccccgagaag gggcagaguc aggguuugau uugggacuuu 1200
gguuaccuga cucuggugga gcaacguuca ggugguucag gaaagggcag caauaagggg 1260
ccacuaggug ugacagagga acugcacauc aucagcuuca cggucaaaua uaccuaccag 1320
ggucugaagc aggagcugaa aacggacacc cucccugugg ugauuauuuc caacaugaac 1380
cagcucucaa uugccugggc uucaguucuc ugguucaauu ugcucagccc aaaccuucag 1440
aaccagcagu ucuucuccaa cccccccaag gcccccugga gcuugcuggg cccugcucuc 1500
aguuggcagu ucuccuccua uguuggccga ggccucaacu cagaccagcu gagcaugcug 1560
agaaacaagc uguucgggca gaacuguagg acugaggauc cauuauuguc cugggcugac 1620
uucacuaagc gagagagccc uccuggcaag uuaccauucu ggacauggcu ggacaaaauu 1680
cuggaguugg uacaugacca ccugaaggau cucuggaaug auggacgcau caugggcuuu 1740
gugagucgga gccaggagcg ccggcugcug aagaagacca ugucuggcac cuuucuacug 1800
cgcuucagug aaucgucaga agggggcauu accugcuccu ggguggagca ccaggaugau 1860
gacaaggugc ucaucuacuc ugugcaaccg uacacgaagg aggugcugca gucacucccg 1920
cugacugaaa ucauccgcca uuaccaguug cucacugagg agaauauacc ugaaaaccca 1980
cugcgcuucc ucuauccccg aaucccccgg gaugaagcuu uugggugcua cuaccaggag 2040
aaaguuaauc uccaggaacg gaggaaauac cugaaacaca ggcucauugu ggucucuaau 2100
agacaggugg augaacugca acaaccgcug gagcuuaagc cagagccaga gcuggaguca 2160
uuagagcugg aacuagggcu ggugccagag ccagagcuca gccuggacuu agagccacug 2220
cugaaggcag ggcuggaucu ggggccagag cuagagucug ugcuggaguc cacucuggag 2280
ccugugauag agcccacacu augcauggua ucacaaacag ugccagagcc agaccaagga 2340
ccuguaucac agccagugcc agagccagau uugcccugug aucugagaca uuugaacacu 2400
gagccaaugg aaaucuucag aaacugugua aagauugaag aaaucaugcc gaauggugac 2460
ccacuguugg cuggccagaa caccguggau gagguuuacg ucucccgccc cagccacuuc 2520
uacacugaug gacccuugau gccuucugac uuc 2553
<210> 79
<211> 2556
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding STAT2
<400> 79
atggcccagt gggagatgct gcagaacctg gacagcccct tccaggatca gctgcaccag 60
ctgtactccc actctctgct gcccgtggac atcagacagt atctggccgt gtggatcgag 120
gaccagaact ggcaagaagc cgctctgggc agcgacgata gcaaggccac aatgctgttc 180
ttccacttcc tggaccagct gaactacgag tgcggcagat gcagccagga tccagaaagt 240
ctgctgctcc agcacaacct gcggaagttc tgcagagaca tccagccatt ctctcaggac 300
cccacacagc tggccgagat gatcttcaac ctgctgctgg aagagaagcg gatcctgatt 360
caggcccaga gagcccagct ggaacagggc gaacctgtcc tggaaacccc tgtggaatct 420
cagcagcacg agatcgagag ccggatcctg gatctgcggg ccatgatgga aaagctggtc 480
aagagcatca gccagctgaa ggaccagcag gacgtgttct gcgaccggta caagatccag 540
gccaagggca agacccctag cctggatcct caccagacca aagagcagaa gatcctgcaa 600
gagacactga acgagctgga caagcggcgg aaagaagtgc tggacgcctc taaagctctg 660
ctgggcagac tgaccactct gatcgaactg ctgctgccca agctggaaga atggaaggca 720
cagcagcaga aggcctgcat cagagcccct atcgatcacg gcctggaaca gctggaaacc 780
tggtttacag ccggcgctaa gctgctgttc cacctgagac agctgctgaa agagctgaag 840
ggcctgagct gcctggtgtc ctaccaggat gaccctctga ccaaaggcgt ggacctgaga 900
aacgcccaag tgaccgaact gctccagaga ctgctgcaca gagccttcgt ggtggaaacc 960
cagccttgca tgccccagac acctcacaga cccctgatcc tgaaaaccgg cagcaagttc 1020
accgtgcgga ccagactgct cgtgcgactg caagagggca atgagagcct gaccgtggaa 1080
gtgtccatcg acagaaaccc tccacagctg cagggcttca gaaagttcaa catcctgacc 1140
agcaaccaga aaaccctgac acctgagaag ggccagagcc agggactgat ctgggacttc 1200
ggctacctga cactggtcga gcagagatct ggcggctctg gcaagggctc taacaaggga 1260
cctctgggcg tgaccgagga actgcacatc atcagcttca ccgtgaagta cacctaccag 1320
ggcctgaagc aagaactcaa gaccgacaca ctgcccgtcg tgatcatcag caacatgaac 1380
cagctgtcta tcgcctgggc cagcgtgctg tggttcaatc tgctgagccc caacctgcag 1440
aatcagcagt tcttcagcaa ccctcctaag gctccttgga gcctgctggg acctgctctg 1500
agctggcagt ttagcagcta tgtcggcaga ggcctgaaca gcgatcagct gagcatgctg 1560
cggaacaagc tgttcggcca gaactgcagg accgaggatc cactgctgag ctgggccgac 1620
ttcaccaaga gagagagccc tccaggcaag ctgcccttct ggacttggct ggacaaaatc 1680
ctggaactgg tgcacgacca cctgaaggat ctgtggaacg acggccggat catgggcttc 1740
gtgtccagat ctcaagagcg cagactgctg aaaaagacaa tgagcggcac cttcctgctg 1800
cggttcagcg aatctagcga aggcggcatc acctgtagct gggtcgaaca ccaggacgac 1860
gacaaggtgc tgatctacag cgtgcagccc tacaccaaag aggtgctgca aagcctgcct 1920
ctgaccgaga tcatccggca ctaccagctg ctcaccgagg aaaacatccc cgagaatcct 1980
ctgcggtttc tgtaccctcg gatccccaga gatgaggcct ttggctgcta ctaccaagag 2040
aaagtgaatc tgcaagagcg gcgcaagtac ctgaagcaca gactgatcgt ggtgtccaac 2100
agacaggtgg acgagctgca gcagccactg gaactgaagc ctgagccaga gctggaaagc 2160
ctcgagctgg aacttggact ggtgcccgag cctgaactgt ctctggatct ggaacctctg 2220
ctgaaggccg gactggacct cggacctgaa ctggaaagcg tgctggaatc cacactggaa 2280
cctgtgatcg agcccacact gtgcatggtg tctcagaccg tgcctgaacc agatcagggc 2340
ccagtgtctc agcctgttcc tgagcctgat ctgccctgcg atctgaggca cctgaacacc 2400
gagcctatgg aaatcttccg gaactgcgtg aagatcgagg aaatcatgcc caacggcgac 2460
cctctgctgg ccggacagaa taccgtggat gaagtgtacg tgtcccggcc tagccacttc 2520
tacacagacg gacctctgat gcccagcgac ttctga 2556
<210> 80
<211> 2556
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding STAT2
<400> 80
auggcccagu gggagaugcu gcagaaccug gacagccccu uccaggauca gcugcaccag 60
cuguacuccc acucucugcu gcccguggac aucagacagu aucuggccgu guggaucgag 120
gaccagaacu ggcaagaagc cgcucugggc agcgacgaua gcaaggccac aaugcuguuc 180
uuccacuucc uggaccagcu gaacuacgag ugcggcagau gcagccagga uccagaaagu 240
cugcugcucc agcacaaccu gcggaaguuc ugcagagaca uccagccauu cucucaggac 300
cccacacagc uggccgagau gaucuucaac cugcugcugg aagagaagcg gauccugauu 360
caggcccaga gagcccagcu ggaacagggc gaaccugucc uggaaacccc uguggaaucu 420
cagcagcacg agaucgagag ccggauccug gaucugcggg ccaugaugga aaagcugguc 480
aagagcauca gccagcugaa ggaccagcag gacguguucu gcgaccggua caagauccag 540
gccaagggca agaccccuag ccuggauccu caccagacca aagagcagaa gauccugcaa 600
gagacacuga acgagcugga caagcggcgg aaagaagugc uggacgccuc uaaagcucug 660
cugggcagac ugaccacucu gaucgaacug cugcugccca agcuggaaga auggaaggca 720
cagcagcaga aggccugcau cagagccccu aucgaucacg gccuggaaca gcuggaaacc 780
ugguuuacag ccggcgcuaa gcugcuguuc caccugagac agcugcugaa agagcugaag 840
ggccugagcu gccugguguc cuaccaggau gacccucuga ccaaaggcgu ggaccugaga 900
aacgcccaag ugaccgaacu gcuccagaga cugcugcaca gagccuucgu gguggaaacc 960
cagccuugca ugccccagac accucacaga ccccugaucc ugaaaaccgg cagcaaguuc 1020
accgugcgga ccagacugcu cgugcgacug caagagggca augagagccu gaccguggaa 1080
guguccaucg acagaaaccc uccacagcug cagggcuuca gaaaguucaa cauccugacc 1140
agcaaccaga aaacccugac accugagaag ggccagagcc agggacugau cugggacuuc 1200
ggcuaccuga cacuggucga gcagagaucu ggcggcucug gcaagggcuc uaacaaggga 1260
ccucugggcg ugaccgagga acugcacauc aucagcuuca ccgugaagua caccuaccag 1320
ggccugaagc aagaacucaa gaccgacaca cugcccgucg ugaucaucag caacaugaac 1380
cagcugucua ucgccugggc cagcgugcug ugguucaauc ugcugagccc caaccugcag 1440
aaucagcagu ucuucagcaa cccuccuaag gcuccuugga gccugcuggg accugcucug 1500
agcuggcagu uuagcagcua ugucggcaga ggccugaaca gcgaucagcu gagcaugcug 1560
cggaacaagc uguucggcca gaacugcagg accgaggauc cacugcugag cugggccgac 1620
uucaccaaga gagagagccc uccaggcaag cugcccuucu ggacuuggcu ggacaaaauc 1680
cuggaacugg ugcacgacca ccugaaggau cuguggaacg acggccggau caugggcuuc 1740
guguccagau cucaagagcg cagacugcug aaaaagacaa ugagcggcac cuuccugcug 1800
cgguucagcg aaucuagcga aggcggcauc accuguagcu gggucgaaca ccaggacgac 1860
gacaaggugc ugaucuacag cgugcagccc uacaccaaag aggugcugca aagccugccu 1920
cugaccgaga ucauccggca cuaccagcug cucaccgagg aaaacauccc cgagaauccu 1980
cugcgguuuc uguacccucg gauccccaga gaugaggccu uuggcugcua cuaccaagag 2040
aaagugaauc ugcaagagcg gcgcaaguac cugaagcaca gacugaucgu gguguccaac 2100
agacaggugg acgagcugca gcagccacug gaacugaagc cugagccaga gcuggaaagc 2160
cucgagcugg aacuuggacu ggugcccgag ccugaacugu cucuggaucu ggaaccucug 2220
cugaaggccg gacuggaccu cggaccugaa cuggaaagcg ugcuggaauc cacacuggaa 2280
ccugugaucg agcccacacu gugcauggug ucucagaccg ugccugaacc agaucagggc 2340
ccagugucuc agccuguucc ugagccugau cugcccugcg aucugaggca ccugaacacc 2400
gagccuaugg aaaucuuccg gaacugcgug aagaucgagg aaaucaugcc caacggcgac 2460
ccucugcugg ccggacagaa uaccguggau gaaguguacg ugucccggcc uagccacuuc 2520
uacacagacg gaccucugau gcccagcgac uucuga 2556
<210> 81
<211> 232
<212> PRT
<213> Homo sapiens
<400> 81
Met Val Asp Tyr Ser Val Trp Asp His Ile Glu Val Ser Asp Asp Glu
1 5 10 15
Asp Glu Thr His Pro Asn Ile Asp Thr Ala Ser Leu Phe Arg Trp Arg
20 25 30
His Gln Ala Arg Val Glu Arg Met Glu Gln Phe Gln Lys Glu Lys Glu
35 40 45
Glu Leu Asp Arg Gly Cys Arg Glu Cys Lys Arg Lys Val Ala Glu Cys
50 55 60
Gln Arg Lys Leu Lys Glu Leu Glu Val Ala Glu Gly Gly Lys Ala Glu
65 70 75 80
Leu Glu Arg Leu Gln Ala Glu Ala Gln Gln Leu Arg Lys Glu Glu Arg
85 90 95
Ser Trp Glu Gln Lys Leu Glu Glu Met Arg Lys Lys Glu Lys Ser Met
100 105 110
Pro Trp Asn Val Asp Thr Leu Ser Lys Asp Gly Phe Ser Lys Ser Met
115 120 125
Val Asn Thr Lys Pro Glu Lys Thr Glu Glu Asp Ser Glu Glu Val Arg
130 135 140
Glu Gln Lys His Lys Thr Phe Val Glu Lys Tyr Glu Lys Gln Ile Lys
145 150 155 160
His Phe Gly Met Leu Arg Arg Trp Asp Asp Ser Gln Lys Tyr Leu Ser
165 170 175
Asp Asn Val His Leu Val Cys Glu Glu Thr Ala Asn Tyr Leu Val Ile
180 185 190
Trp Cys Ile Asp Leu Glu Val Glu Glu Lys Cys Ala Leu Met Glu Gln
195 200 205
Val Ala His Gln Thr Ile Val Met Gln Phe Ile Leu Glu Leu Ala Lys
210 215 220
Ser Leu Lys Val Asp Pro Arg Ala
225 230
<210> 82
<211> 696
<212> DNA
<213> Homo sapiens
<400> 82
atggtggact acagcgtgtg ggaccacatt gaggtgtctg atgatgaaga cgagacgcac 60
cccaacatcg acacggccag tctcttccgc tggcggcatc aggcccgggt ggaacgcatg 120
gagcagttcc agaaggagaa ggaggaactg gacaggggct gccgcgagtg caagcgcaag 180
gtggccgagt gccagaggaa actgaaggag ctggaggtgg ccgagggcgg caaggcagag 240
ctggagcgcc tgcaggccga ggcacagcag ctgcgcaagg aggagcggag ctgggagcag 300
aagctggagg agatgcgcaa gaaggagaag agcatgccct ggaacgtgga cacgctcagc 360
aaagacggct tcagcaagag catggtaaat accaagcccg agaagacgga ggaggactca 420
gaggaggtga gggagcagaa acacaagacc ttcgtggaaa aatacgagaa acagatcaag 480
cactttggca tgcttcgccg ctgggatgac agccaaaagt acctgtcaga caacgtccac 540
ctggtgtgcg aggagacagc caattacctg gtcatttggt gcattgacct agaggtggag 600
gagaaatgtg cactcatgga gcaggtggcc caccagacaa tcgtcatgca atttatcctg 660
gagctggcca agagcctaaa ggtggacccc cgggcc 696
<210> 83
<211> 696
<212> RNA
<213> Homo sapiens
<400> 83
augguggacu acagcgugug ggaccacauu gaggugucug augaugaaga cgagacgcac 60
cccaacaucg acacggccag ucucuuccgc uggcggcauc aggcccgggu ggaacgcaug 120
gagcaguucc agaaggagaa ggaggaacug gacaggggcu gccgcgagug caagcgcaag 180
guggccgagu gccagaggaa acugaaggag cuggaggugg ccgagggcgg caaggcagag 240
cuggagcgcc ugcaggccga ggcacagcag cugcgcaagg aggagcggag cugggagcag 300
aagcuggagg agaugcgcaa gaaggagaag agcaugcccu ggaacgugga cacgcucagc 360
aaagacggcu ucagcaagag caugguaaau accaagcccg agaagacgga ggaggacuca 420
gaggagguga gggagcagaa acacaagacc uucguggaaa aauacgagaa acagaucaag 480
cacuuuggca ugcuucgccg cugggaugac agccaaaagu accugucaga caacguccac 540
cuggugugcg aggagacagc caauuaccug gucauuuggu gcauugaccu agagguggag 600
gagaaaugug cacucaugga gcagguggcc caccagacaa ucgucaugca auuuauccug 660
gagcuggcca agagccuaaa gguggacccc cgggcc 696
<210> 84
<211> 699
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding HSP90
<400> 84
atggtggact acagcgtgtg ggaccacatc gaggtgtccg acgacgagga tgagacacac 60
cccaacatcg ataccgccag cctgttcaga tggcggcacc aggctagagt ggaacggatg 120
gaacagttcc agaaagagaa agaggaactg gaccggggct gccgcgagtg caaaagaaaa 180
gtggccgagt gccagcggaa gctgaaagaa ctggaagtgg ctgaaggcgg caaggccgag 240
ctggaaagac tgcaggctga agcccagcag ctgcgcaaag aggaaagaag ctgggagcag 300
aaactggaag agatgcgcaa gaaagaaaaa tccatgccgt ggaacgtgga caccctgagc 360
aaggacggct tcagcaagag catggtcaac accaagcctg agaaaaccga agaggacagc 420
gaggaagtgc gggaacagaa acacaagacc ttcgtcgaga agtacgagaa gcagatcaag 480
cacttcggca tgctgcggag atgggacgac agccagaagt acctgagcga caacgtgcac 540
ctcgtgtgcg aggaaaccgc caactacctg gtcatctggt gcatcgatct cgaggtggaa 600
gagaagtgcg ccctcatgga acaggtggcc caccagacaa tcgtgatgca gttcatcctg 660
gaactggcca agagcctgaa ggtggaccct agagcttga 699
<210> 85
<211> 699
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding HSP90
<400> 85
augguggacu acagcgugug ggaccacauc gagguguccg acgacgagga ugagacacac 60
cccaacaucg auaccgccag ccuguucaga uggcggcacc aggcuagagu ggaacggaug 120
gaacaguucc agaaagagaa agaggaacug gaccggggcu gccgcgagug caaaagaaaa 180
guggccgagu gccagcggaa gcugaaagaa cuggaagugg cugaaggcgg caaggccgag 240
cuggaaagac ugcaggcuga agcccagcag cugcgcaaag aggaaagaag cugggagcag 300
aaacuggaag agaugcgcaa gaaagaaaaa uccaugccgu ggaacgugga cacccugagc 360
aaggacggcu ucagcaagag cauggucaac accaagccug agaaaaccga agaggacagc 420
gaggaagugc gggaacagaa acacaagacc uucgucgaga aguacgagaa gcagaucaag 480
cacuucggca ugcugcggag augggacgac agccagaagu accugagcga caacgugcac 540
cucgugugcg aggaaaccgc caacuaccug gucaucuggu gcaucgaucu cgagguggaa 600
gagaagugcg cccucaugga acagguggcc caccagacaa ucgugaugca guucauccug 660
gaacuggcca agagccugaa gguggacccu agagcuuga 699
<210> 86
<211> 231
<212> PRT
<213> Homo sapiens
<400> 86
Met Thr Trp Val Ser Leu Leu Asn Gln Val Gly Asp Arg Val Ser Arg
1 5 10 15
Asn Asn Phe Leu Gly Phe Pro Ala Ser Glu Leu Gln Ala Arg Ile Arg
20 25 30
Thr Tyr Asn Gln His Tyr Asn Asn Leu Leu Arg Gly Ala Val Ser Gln
35 40 45
Arg Leu Tyr Ile Leu Leu Pro Leu Asp Cys Gly Val Pro Asp Asn Leu
50 55 60
Ser Met Ala Asp Pro Asn Ile Arg Phe Leu Asp Lys Leu Pro Gln Gln
65 70 75 80
Thr Gly Asp His Ala Gly Ile Lys Asp Arg Val Tyr Ser Asn Ser Ile
85 90 95
Tyr Glu Leu Leu Glu Asn Gly Gln Arg Ala Gly Thr Cys Val Leu Glu
100 105 110
Tyr Ala Thr Pro Leu Gln Thr Leu Phe Ala Met Ser Gln Tyr Ser Gln
115 120 125
Ala Gly Phe Ser Arg Glu Asp Arg Leu Glu Gln Ala Lys Leu Phe Cys
130 135 140
Arg Thr Leu Glu Asp Ile Leu Ala Asp Ala Pro Glu Ser Gln Asn Asn
145 150 155 160
Cys Arg Leu Ile Ala Tyr Gln Glu Pro Ala Asp Asp Ser Ser Phe Ser
165 170 175
Leu Ser Gln Glu Val Leu Arg His Leu Arg Gln Glu Glu Lys Glu Glu
180 185 190
Val Thr Val Gly Ser Leu Lys Thr Ser Ala Val Pro Ser Thr Ser Thr
195 200 205
Met Ser Gln Glu Pro Glu Leu Leu Ile Ser Gly Met Glu Lys Pro Leu
210 215 220
Pro Leu Arg Thr Asp Phe Ser
225 230
<210> 87
<211> 693
<212> DNA
<213> Homo sapiens
<400> 87
atgacctggg tctcactcct gaatcaggtg ggagataggg ttagcaggaa taacttcttg 60
ggcttccctg cctcagagct ccaggcccgg attcgaactt acaatcagca ttacaacaac 120
ctgctacggg gtgcagtgag ccagcggctg tatattctcc tcccattgga ctgtggggtg 180
cctgataacc tgagtatggc tgaccccaac attcgcttcc tggataaact gccccagcag 240
accggtgacc atgctggcat caaggatcgg gtttacagca acagcatcta tgagcttctg 300
gagaacgggc agcgggcggg cacctgtgtc ctggagtacg ccaccccctt gcagactttg 360
tttgccatgt cacaatacag tcaagctggc tttagccggg aggataggct tgagcaggcc 420
aaactcttct gccggacact tgaggacatc ctggcagatg cccctgagtc tcagaacaac 480
tgccgcctca ttgcctacca ggaacctgca gatgacagca gcttctcgct gtcccaggag 540
gttctccggc acctgcggca ggaggaaaag gaagaggtta ctgtgggcag cttgaagacc 600
tcagcggtgc ccagtacctc cacgatgtcc caagagcctg agctcctcat cagtggaatg 660
gaaaagcccc tccctctccg cacggatttc tct 693
<210> 88
<211> 693
<212> RNA
<213> Homo sapiens
<400> 88
augaccuggg ucucacuccu gaaucaggug ggagauaggg uuagcaggaa uaacuucuug 60
ggcuucccug ccucagagcu ccaggcccgg auucgaacuu acaaucagca uuacaacaac 120
cugcuacggg gugcagugag ccagcggcug uauauucucc ucccauugga cuguggggug 180
ccugauaacc ugaguauggc ugaccccaac auucgcuucc uggauaaacu gccccagcag 240
accggugacc augcuggcau caaggaucgg guuuacagca acagcaucua ugagcuucug 300
gagaacgggc agcgggcggg caccuguguc cuggaguacg ccacccccuu gcagacuuug 360
uuugccaugu cacaauacag ucaagcuggc uuuagccggg aggauaggcu ugagcaggcc 420
aaacucuucu gccggacacu ugaggacauc cuggcagaug ccccugaguc ucagaacaac 480
ugccgccuca uugccuacca ggaaccugca gaugacagca gcuucucgcu gucccaggag 540
guucuccggc accugcggca ggaggaaaag gaagagguua cugugggcag cuugaagacc 600
ucagcggugc ccaguaccuc cacgaugucc caagagccug agcuccucau caguggaaug 660
gaaaagcccc ucccucuccg cacggauuuc ucu 693
<210> 89
<211> 696
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding STING-beta
<400> 89
atgacatggg tgtccctgct gaatcaagtg ggcgacagag tgtcccggaa caacttcctg 60
ggattccctg ccagcgaact gcaggccaga atccggacct acaaccagca ctacaacaac 120
ctgctgagag gcgccgtgtc tcagcggctg tatattctgc tgcctctgga ttgcggcgtg 180
cccgacaatc tgtctatggc cgatcctaat atccggttcc tggacaagct gccccagcag 240
acaggcgatc acgccggcat taaggaccgg gtgtacagca acagcatcta cgagctgctg 300
gaaaacggcc agcgagccgg aacatgcgtg ctggaatatg ccacacctct gcagaccctg 360
ttcgccatga gccagtatag ccaggccggc ttcagcagag aggacagact ggaacaggcc 420
aagctgttct gccggacact ggaagatatc ctggccgacg ctcctgagag ccagaacaac 480
tgtagactga tcgcctacca agagcctgcc gacgacagca gctttagcct gtctcaagag 540
gtgctgcggc acctgagaca agaggaaaaa gaggaagtca ccgtcggcag cctgaaaacc 600
tctgccgtgc ctagcaccag caccatgagt caagaacctg agctgctgat ctccggcatg 660
gaaaagcccc tgcctctgag aaccgacttc agctga 696
<210> 90
<211> 696
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding STING-beta
<400> 90
augacauggg ugucccugcu gaaucaagug ggcgacagag ugucccggaa caacuuccug 60
ggauucccug ccagcgaacu gcaggccaga auccggaccu acaaccagca cuacaacaac 120
cugcugagag gcgccguguc ucagcggcug uauauucugc ugccucugga uugcggcgug 180
cccgacaauc ugucuauggc cgauccuaau auccgguucc uggacaagcu gccccagcag 240
acaggcgauc acgccggcau uaaggaccgg guguacagca acagcaucua cgagcugcug 300
gaaaacggcc agcgagccgg aacaugcgug cuggaauaug ccacaccucu gcagacccug 360
uucgccauga gccaguauag ccaggccggc uucagcagag aggacagacu ggaacaggcc 420
aagcuguucu gccggacacu ggaagauauc cuggccgacg cuccugagag ccagaacaac 480
uguagacuga ucgccuacca agagccugcc gacgacagca gcuuuagccu gucucaagag 540
gugcugcggc accugagaca agaggaaaaa gaggaaguca ccgucggcag ccugaaaacc 600
ucugccgugc cuagcaccag caccaugagu caagaaccug agcugcugau cuccggcaug 660
gaaaagcccc ugccucugag aaccgacuuc agcuga 696
<210> 91
<211> 407
<212> PRT
<213> Homo sapiens
<400> 91
Ala Gln Asn Pro Met Glu Pro Ser Val Pro Gln Leu Ser Leu Met Asp
1 5 10 15
Val Lys Cys Glu Thr Pro Asn Cys Pro Phe Phe Met Ser Val Asn Thr
20 25 30
Gln Pro Leu Cys His Glu Cys Ser Glu Arg Arg Gln Lys Asn Gln Asn
35 40 45
Lys Leu Pro Lys Leu Asn Ser Lys Pro Gly Pro Glu Gly Leu Pro Gly
50 55 60
Met Ala Leu Gly Ala Ser Arg Gly Glu Ala Tyr Glu Pro Leu Ala Trp
65 70 75 80
Asn Pro Glu Glu Ser Thr Gly Gly Pro His Ser Ala Pro Pro Thr Ala
85 90 95
Pro Ser Pro Phe Leu Phe Ser Glu Thr Thr Ala Met Lys Cys Arg Ser
100 105 110
Pro Gly Cys Pro Phe Thr Leu Asn Val Gln His Asn Gly Phe Cys Glu
115 120 125
Arg Cys His Asn Ala Arg Gln Leu His Ala Ser His Ala Pro Asp His
130 135 140
Thr Arg His Leu Asp Pro Gly Lys Cys Gln Ala Cys Leu Gln Asp Val
145 150 155 160
Thr Arg Thr Phe Asn Gly Ile Cys Ser Thr Cys Phe Lys Arg Thr Thr
165 170 175
Ala Glu Ala Ser Ser Ser Leu Ser Thr Ser Leu Pro Pro Ser Cys His
180 185 190
Gln Arg Ser Lys Ser Asp Pro Ser Arg Leu Val Arg Ser Pro Ser Pro
195 200 205
His Ser Cys His Arg Ala Gly Asn Asp Ala Pro Ala Gly Cys Leu Ser
210 215 220
Gln Ala Ala Arg Thr Pro Gly Asp Arg Thr Gly Thr Ser Lys Cys Arg
225 230 235 240
Lys Ala Gly Cys Val Tyr Phe Gly Thr Pro Glu Asn Lys Gly Phe Cys
245 250 255
Thr Leu Cys Phe Ile Glu Tyr Arg Glu Asn Lys His Phe Ala Ala Ala
260 265 270
Ser Gly Lys Val Ser Pro Thr Ala Ser Arg Phe Gln Asn Thr Ile Pro
275 280 285
Cys Leu Gly Arg Glu Cys Gly Thr Leu Gly Ser Thr Met Phe Glu Gly
290 295 300
Tyr Cys Gln Lys Cys Phe Ile Glu Ala Gln Asn Gln Arg Phe His Glu
305 310 315 320
Ala Lys Arg Thr Glu Glu Gln Leu Arg Ser Ser Gln Arg Arg Asp Val
325 330 335
Pro Arg Thr Thr Gln Ser Thr Ser Arg Pro Lys Cys Ala Arg Ala Ser
340 345 350
Cys Lys Asn Ile Leu Ala Cys Arg Ser Glu Glu Leu Cys Met Glu Cys
355 360 365
Gln His Pro Asn Gln Arg Met Gly Pro Gly Ala His Arg Gly Glu Pro
370 375 380
Ala Pro Glu Asp Pro Pro Lys Gln Arg Cys Arg Ala Pro Ala Cys Asp
385 390 395 400
His Phe Gly Asn Ala Lys Cys
405
<210> 92
<211> 1221
<212> DNA
<213> Homo sapiens
<400> 92
gcccagaatc ccatggaacc ttccgtgccc cagctttctc tcatggatgt aaaatgtgaa 60
acgcccaact gccccttctt catgtctgtg aacacccagc ctttatgcca tgagtgctca 120
gagaggcggc aaaagaatca aaacaaactc ccaaagctga actccaagcc gggccctgag 180
gggctccctg gcatggcgct cggggcctct cggggagaag cctatgagcc cttggcgtgg 240
aaccctgagg agtccactgg ggggcctcat tcggccccac cgacagcacc cagccctttt 300
ctgttcagtg agaccactgc catgaagtgc aggagccccg gctgcccctt cacactgaat 360
gtgcagcaca acggattttg tgaacgttgc cacaacgccc ggcaacttca cgccagccac 420
gccccagacc acacaaggca cttggatccc gggaagtgcc aagcctgcct ccaggatgtt 480
accaggacat ttaatgggat ctgcagtact tgcttcaaaa ggactacagc agaggcctcc 540
tccagcctca gcaccagcct ccctccttcc tgtcaccagc gttccaagtc agatccctcg 600
cggctcgtcc ggagcccctc cccgcattct tgccacagag ctggaaacga cgcccctgct 660
ggctgcctgt ctcaagctgc acggactcct ggggacagga cggggacgag caagtgcaga 720
aaagccggct gcgtgtattt tgggactcca gaaaacaagg gcttttgcac actgtgtttc 780
atcgagtaca gagaaaacaa acattttgct gctgcctcag ggaaagtcag tcccacagcg 840
tccaggttcc agaacaccat tccgtgcctg gggagggaat gcggcaccct tggaagcacc 900
atgtttgaag gatactgcca gaagtgtttc attgaagctc agaatcagag atttcatgag 960
gccaaaagga cagaagagca actgagatcg agccagcgca gagatgtgcc tcgaaccaca 1020
caaagcacct caaggcccaa gtgcgcccgg gcctcctgca agaacatcct ggcctgccgc 1080
agcgaggagc tctgcatgga gtgtcagcat cccaaccaga ggatgggccc tggggcccac 1140
cggggtgagc ctgcccccga agaccccccc aagcagcgtt gccgggcccc cgcctgtgat 1200
cattttggca atgccaagtg c 1221
<210> 93
<211> 1221
<212> RNA
<213> Homo sapiens
<400> 93
gcccagaauc ccauggaacc uuccgugccc cagcuuucuc ucauggaugu aaaaugugaa 60
acgcccaacu gccccuucuu caugucugug aacacccagc cuuuaugcca ugagugcuca 120
gagaggcggc aaaagaauca aaacaaacuc ccaaagcuga acuccaagcc gggcccugag 180
gggcucccug gcauggcgcu cggggccucu cggggagaag ccuaugagcc cuuggcgugg 240
aacccugagg aguccacugg ggggccucau ucggccccac cgacagcacc cagcccuuuu 300
cuguucagug agaccacugc caugaagugc aggagccccg gcugccccuu cacacugaau 360
gugcagcaca acggauuuug ugaacguugc cacaacgccc ggcaacuuca cgccagccac 420
gccccagacc acacaaggca cuuggauccc gggaagugcc aagccugccu ccaggauguu 480
accaggacau uuaaugggau cugcaguacu ugcuucaaaa ggacuacagc agaggccucc 540
uccagccuca gcaccagccu cccuccuucc ugucaccagc guuccaaguc agaucccucg 600
cggcucgucc ggagccccuc cccgcauucu ugccacagag cuggaaacga cgccccugcu 660
ggcugccugu cucaagcugc acggacuccu ggggacagga cggggacgag caagugcaga 720
aaagccggcu gcguguauuu ugggacucca gaaaacaagg gcuuuugcac acuguguuuc 780
aucgaguaca gagaaaacaa acauuuugcu gcugccucag ggaaagucag ucccacagcg 840
uccagguucc agaacaccau uccgugccug gggagggaau gcggcacccu uggaagcacc 900
auguuugaag gauacugcca gaaguguuuc auugaagcuc agaaucagag auuucaugag 960
gccaaaagga cagaagagca acugagaucg agccagcgca gagaugugcc ucgaaccaca 1020
caaagcaccu caaggcccaa gugcgcccgg gccuccugca agaacauccu ggccugccgc 1080
agcgaggagc ucugcaugga gugucagcau cccaaccaga ggaugggccc uggggcccac 1140
cggggugagc cugcccccga agaccccccc aagcagcguu gccgggcccc cgccugugau 1200
cauuuuggca augccaagug c 1221
<210> 94
<211> 1227
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding A20 or TNFAIP3
<400> 94
atggcccaga atcctatgga acctagcgtg ccccagctga gcctgatgga cgtgaagtgc 60
gaaaccccta actgcccctt cttcatgtcc gtgaacaccc agcctctgtg ccacgagtgt 120
agcgagcgga gacagaagaa ccagaacaag ctgcccaagc tgaacagcaa gcccggacct 180
gaaggactgc ctggaatggc tctgggagct tctagaggcg aggcctatga acccctggcc 240
tggaatcctg aggaaagcac aggcggacct cacagcgctc ctccaacagc accttctcca 300
tttctgttca gcgagacaac cgccatgaag tgcagaagcc ctggctgccc tttcacactg 360
aacgtgcagc acaacggctt ttgcgagaga tgccacaacg ccagacagct gcacgcttct 420
cacgcccctg atcacaccag acacctggat cctggaaagt gccaggcctg cctgcaggat 480
gtgaccagaa ccttcaacgg catctgcagc acctgtttca agcggacaac agccgaggcc 540
agcagcagcc tgtctacatc tctgcctcca agctgccacc agcggagcaa gagcgatcct 600
tctagacttg tgcggagccc ctctcctcac tcctgtcaca gagccggaaa tgatgcccct 660
gccggatgtc tgtctcaggc cgctagaaca cctggcgata gaaccggcac cagcaagtgt 720
agaaaggccg gctgcgtgta cttcggcacc cctgagaaca agggattctg caccctgtgc 780
ttcatcgagt acagagagaa caagcacttc gccgctgcct ccggaaaggt gtcacctacc 840
gctagccggt tccagaacac aatcccttgc ctgggcagag agtgtggcac actgggcagc 900
acaatgttcg agggctactg ccagaagtgc tttatcgagg cccagaacca gcggttccac 960
gaggccaaga gaaccgagga acagctgaga agcagccaga gaagggacgt gcccagaaca 1020
acccagagca ccagcagacc taagtgcgcc agagccagct gcaagaacat cctggcctgt 1080
cggagcgagg aactgtgcat ggaatgccag catcctaacc agagaatggg ccctggcgct 1140
cacagaggcg aacctgctcc agaagatcct cctaagcagc ggtgtagagc ccctgcctgt 1200
gaccactttg gcaacgccaa gtgctga 1227
<210> 95
<211> 1227
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding A20 or TNFAIP3
<400> 95
auggcccaga auccuaugga accuagcgug ccccagcuga gccugaugga cgugaagugc 60
gaaaccccua acugccccuu cuucaugucc gugaacaccc agccucugug ccacgagugu 120
agcgagcgga gacagaagaa ccagaacaag cugcccaagc ugaacagcaa gcccggaccu 180
gaaggacugc cuggaauggc ucugggagcu ucuagaggcg aggccuauga accccuggcc 240
uggaauccug aggaaagcac aggcggaccu cacagcgcuc cuccaacagc accuucucca 300
uuucuguuca gcgagacaac cgccaugaag ugcagaagcc cuggcugccc uuucacacug 360
aacgugcagc acaacggcuu uugcgagaga ugccacaacg ccagacagcu gcacgcuucu 420
cacgccccug aucacaccag acaccuggau ccuggaaagu gccaggccug ccugcaggau 480
gugaccagaa ccuucaacgg caucugcagc accuguuuca agcggacaac agccgaggcc 540
agcagcagcc ugucuacauc ucugccucca agcugccacc agcggagcaa gagcgauccu 600
ucuagacuug ugcggagccc cucuccucac uccugucaca gagccggaaa ugaugccccu 660
gccggauguc ugucucaggc cgcuagaaca ccuggcgaua gaaccggcac cagcaagugu 720
agaaaggccg gcugcgugua cuucggcacc ccugagaaca agggauucug cacccugugc 780
uucaucgagu acagagagaa caagcacuuc gccgcugccu ccggaaaggu gucaccuacc 840
gcuagccggu uccagaacac aaucccuugc cugggcagag aguguggcac acugggcagc 900
acaauguucg agggcuacug ccagaagugc uuuaucgagg cccagaacca gcgguuccac 960
gaggccaaga gaaccgagga acagcugaga agcagccaga gaagggacgu gcccagaaca 1020
acccagagca ccagcagacc uaagugcgcc agagccagcu gcaagaacau ccuggccugu 1080
cggagcgagg aacugugcau ggaaugccag cauccuaacc agagaauggg cccuggcgcu 1140
cacagaggcg aaccugcucc agaagauccu ccuaagcagc gguguagagc cccugccugu 1200
gaccacuuug gcaacgccaa gugcuga 1227
<210> 96
<211> 185
<212> PRT
<213> Homo sapiens
<400> 96
Lys Cys Arg Lys Ala Gly Cys Val Tyr Phe Gly Thr Pro Glu Asn Lys
1 5 10 15
Gly Phe Cys Thr Leu Cys Phe Ile Glu Tyr Arg Glu Asn Lys His Phe
20 25 30
Ala Ala Ala Ser Gly Lys Val Ser Pro Thr Ala Ser Arg Phe Gln Asn
35 40 45
Thr Ile Pro Cys Leu Gly Arg Glu Cys Gly Thr Leu Gly Ser Thr Met
50 55 60
Phe Glu Gly Tyr Cys Gln Lys Cys Phe Ile Glu Ala Gln Asn Gln Arg
65 70 75 80
Phe His Glu Ala Lys Arg Thr Glu Glu Gln Leu Arg Ser Ser Gln Arg
85 90 95
Arg Asp Val Pro Arg Thr Thr Gln Ser Thr Ser Arg Pro Lys Cys Ala
100 105 110
Arg Ala Ser Cys Lys Asn Ile Leu Ala Cys Arg Ser Glu Glu Leu Cys
115 120 125
Met Glu Cys Gln His Pro Asn Gln Arg Met Gly Pro Gly Ala His Arg
130 135 140
Gly Glu Pro Ala Pro Glu Asp Pro Pro Lys Gln Arg Cys Arg Ala Pro
145 150 155 160
Ala Cys Asp His Phe Gly Asn Ala Lys Cys Asn Gly Tyr Cys Asn Glu
165 170 175
Cys Phe Gln Phe Lys Gln Met Tyr Gly
180 185
<210> 97
<211> 555
<212> DNA
<213> Homo sapiens
<400> 97
aagtgcagaa aagccggctg cgtgtatttt gggactccag aaaacaaggg cttttgcaca 60
ctgtgtttca tcgagtacag agaaaacaaa cattttgctg ctgcctcagg gaaagtcagt 120
cccacagcgt ccaggttcca gaacaccatt ccgtgcctgg ggagggaatg cggcaccctt 180
ggaagcacca tgtttgaagg atactgccag aagtgtttca ttgaagctca gaatcagaga 240
tttcatgagg ccaaaaggac agaagagcaa ctgagatcga gccagcgcag agatgtgcct 300
cgaaccacac aaagcacctc aaggcccaag tgcgcccggg cctcctgcaa gaacatcctg 360
gcctgccgca gcgaggagct ctgcatggag tgtcagcatc ccaaccagag gatgggccct 420
ggggcccacc ggggtgagcc tgcccccgaa gaccccccca agcagcgttg ccgggccccc 480
gcctgtgatc attttggcaa tgccaagtgc aacggctact gcaacgaatg ctttcagttc 540
aagcagatgt atggc 555
<210> 98
<211> 555
<212> RNA
<213> Homo sapiens
<400> 98
aagugcagaa aagccggcug cguguauuuu gggacuccag aaaacaaggg cuuuugcaca 60
cuguguuuca ucgaguacag agaaaacaaa cauuuugcug cugccucagg gaaagucagu 120
cccacagcgu ccagguucca gaacaccauu ccgugccugg ggagggaaug cggcacccuu 180
ggaagcacca uguuugaagg auacugccag aaguguuuca uugaagcuca gaaucagaga 240
uuucaugagg ccaaaaggac agaagagcaa cugagaucga gccagcgcag agaugugccu 300
cgaaccacac aaagcaccuc aaggcccaag ugcgcccggg ccuccugcaa gaacauccug 360
gccugccgca gcgaggagcu cugcauggag ugucagcauc ccaaccagag gaugggcccu 420
ggggcccacc ggggugagcc ugcccccgaa gaccccccca agcagcguug ccgggccccc 480
gccugugauc auuuuggcaa ugccaagugc aacggcuacu gcaacgaaug cuuucaguuc 540
aagcagaugu auggc 555
<210> 99
<211> 561
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding A20
<400> 99
atgaagtgca gaaaggccgg ctgcgtgtac ttcggcaccc ctgagaacaa gggcttctgc 60
accctgtgct tcatcgagta cagagagaac aagcacttcg ctgccgccag cggaaaggtg 120
tcacctaccg ccagcagatt ccagaacaca atcccctgcc tgggcagaga gtgtggcaca 180
ctgggcagca caatgttcga gggctactgc cagaagtgct ttatcgaggc ccagaaccag 240
cggttccacg aggccaagag aaccgaggaa cagctgagaa gcagccagag aagggacgtg 300
cccagaacaa cccagagcac cagcagacct aagtgcgcca gagccagctg caagaacatc 360
ctggcctgca gatccgagga actgtgcatg gaatgccagc atcctaacca gagaatgggc 420
cctggcgctc acagaggcga acctgctcca gaagatcctc ctaagcagcg gtgtagagcc 480
ccagcctgtg accactttgg caacgccaag tgcaacggct actgcaacga gtgcttccag 540
ttcaagcaga tgtacggctg a 561
<210> 100
<211> 561
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding A20
<400> 100
augaagugca gaaaggccgg cugcguguac uucggcaccc cugagaacaa gggcuucugc 60
acccugugcu ucaucgagua cagagagaac aagcacuucg cugccgccag cggaaaggug 120
ucaccuaccg ccagcagauu ccagaacaca auccccugcc ugggcagaga guguggcaca 180
cugggcagca caauguucga gggcuacugc cagaagugcu uuaucgaggc ccagaaccag 240
cgguuccacg aggccaagag aaccgaggaa cagcugagaa gcagccagag aagggacgug 300
cccagaacaa cccagagcac cagcagaccu aagugcgcca gagccagcug caagaacauc 360
cuggccugca gauccgagga acugugcaug gaaugccagc auccuaacca gagaaugggc 420
ccuggcgcuc acagaggcga accugcucca gaagauccuc cuaagcagcg guguagagcc 480
ccagccugug accacuuugg caacgccaag ugcaacggcu acugcaacga gugcuuccag 540
uucaagcaga uguacggcug a 561
<210> 101
<211> 131
<212> PRT
<213> Homo sapiens
<400> 101
Glu Ala Val Arg Leu Ile Met Asp Ser Leu His Met Ala Ala Arg Glu
1 5 10 15
Gln Gln Val Tyr Cys Glu Glu Met Arg Glu Glu Arg Gln Asp Arg Leu
20 25 30
Lys Phe Ile Asp Lys Gln Leu Glu Leu Leu Ala Gln Asp Tyr Lys Leu
35 40 45
Arg Ile Lys Gln Ile Thr Glu Glu Val Glu Arg Gln Val Ser Thr Ala
50 55 60
Met Ala Glu Glu Ile Arg Arg Leu Ser Val Leu Val Asp Asp Tyr Gln
65 70 75 80
Met Asp Phe His Pro Ser Pro Val Val Leu Lys Val Tyr Lys Asn Glu
85 90 95
Leu His Arg His Ile Glu Glu Gly Leu Gly Arg Asn Met Ser Asp Arg
100 105 110
Cys Ser Thr Ala Ile Thr Asn Ser Leu Gln Thr Met Gln Gln Asp Met
115 120 125
Ile Asp Gly
130
<210> 102
<211> 393
<212> DNA
<213> Homo sapiens
<400> 102
gaggcggttc gactcatcat ggactccctg cacatggcgg ctcgggagca gcaggtttac 60
tgcgaggaaa tgcgtgaaga gcggcaagac cgactgaaat ttattgacaa acagctggag 120
ctcttggctc aagactataa gctgcgaatt aagcagatta cggaggaagt ggagaggcag 180
gtgtcgactg caatggccga ggagatcagg cgcctctctg tactggtgga cgattaccag 240
atggacttcc acccttctcc agtagtcctc aaggtttata agaatgagct gcaccgccac 300
atagaggaag gactgggtcg aaacatgtct gaccgctgct ccacggccat caccaactcc 360
ctgcagacca tgcagcagga catgatagat ggc 393
<210> 103
<211> 393
<212> RNA
<213> Homo sapiens
<400> 103
gaggcgguuc gacucaucau ggacucccug cacauggcgg cucgggagca gcagguuuac 60
ugcgaggaaa ugcgugaaga gcggcaagac cgacugaaau uuauugacaa acagcuggag 120
cucuuggcuc aagacuauaa gcugcgaauu aagcagauua cggaggaagu ggagaggcag 180
gugucgacug caauggccga ggagaucagg cgccucucug uacuggugga cgauuaccag 240
auggacuucc acccuucucc aguaguccuc aagguuuaua agaaugagcu gcaccgccac 300
auagaggaag gacugggucg aaacaugucu gaccgcugcu ccacggccau caccaacucc 360
cugcagacca ugcagcagga caugauagau ggc 393
<210> 104
<211> 399
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding MFN2 protein
<400> 104
atggaagccg tgcggctgat catggacagc ctgcatatgg ccgccagaga gcagcaggtc 60
tactgcgagg aaatgcggga agagagacag gaccggctga agttcatcga caagcagctg 120
gaactgctgg cccaggacta caagctgcgg atcaagcaga tcaccgaaga ggtggaaaga 180
caggtgtcca ccgccatggc cgaggaaatc agacgactga gcgtgctggt ggacgactac 240
cagatggact ttcacccctc tccagtggtg ctgaaggtgt acaagaacga gctgcaccgg 300
cacatcgagg aaggcctggg cagaaacatg agcgacagat gcagcaccgc catcaccaat 360
agcctgcaga ccatgcagca ggacatgatc gacggctga 399
<210> 105
<211> 399
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding MFN2
<400> 105
auggaagccg ugcggcugau cauggacagc cugcauaugg ccgccagaga gcagcagguc 60
uacugcgagg aaaugcggga agagagacag gaccggcuga aguucaucga caagcagcug 120
gaacugcugg cccaggacua caagcugcgg aucaagcaga ucaccgaaga gguggaaaga 180
caggugucca ccgccauggc cgaggaaauc agacgacuga gcgugcuggu ggacgacuac 240
cagauggacu uucaccccuc uccaguggug cugaaggugu acaagaacga gcugcaccgg 300
cacaucgagg aaggccuggg cagaaacaug agcgacagau gcagcaccgc caucaccaau 360
agccugcaga ccaugcagca ggacaugauc gacggcuga 399
<210> 106
<211> 81
<212> PRT
<213> Homo sapiens
<400> 106
Arg Leu Lys Phe Ile Asp Lys Gln Leu Glu Leu Leu Ala Gln Asp Tyr
1 5 10 15
Lys Leu Arg Ile Lys Gln Ile Thr Glu Glu Val Glu Arg Gln Val Ser
20 25 30
Thr Ala Met Ala Glu Glu Ile Arg Arg Leu Ser Val Leu Val Asp Asp
35 40 45
Tyr Gln Met Asp Phe His Pro Ser Pro Val Val Leu Lys Val Tyr Lys
50 55 60
Asn Glu Leu His Arg His Ile Glu Glu Gly Leu Gly Arg Asn Met Ser
65 70 75 80
Asp
<210> 107
<211> 243
<212> DNA
<213> Homo sapiens
<400> 107
cgactgaaat ttattgacaa acagctggag ctcttggctc aagactataa gctgcgaatt 60
aagcagatta cggaggaagt ggagaggcag gtgtcgactg caatggccga ggagatcagg 120
cgcctctctg tactggtgga cgattaccag atggacttcc acccttctcc agtagtcctc 180
aaggtttata agaatgagct gcaccgccac atagaggaag gactgggtcg aaacatgtct 240
gac 243
<210> 108
<211> 243
<212> RNA
<213> Homo sapiens
<400> 108
cgacugaaau uuauugacaa acagcuggag cucuuggcuc aagacuauaa gcugcgaauu 60
aagcagauua cggaggaagu ggagaggcag gugucgacug caauggccga ggagaucagg 120
cgccucucug uacuggugga cgauuaccag auggacuucc acccuucucc aguaguccuc 180
aagguuuaua agaaugagcu gcaccgccac auagaggaag gacugggucg aaacaugucu 240
gac 243
<210> 109
<211> 249
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding MFN2
<400> 109
atgcggctga agttcatcga caagcagctg gaactgctgg cccaggacta caagctgcgg 60
atcaagcaga tcaccgaaga ggtggaaaga caggtgtcca ccgccatggc cgaggaaatc 120
agacgactga gcgtgctggt ggacgactac cagatggact ttcacccctc tccagtggtg 180
ctgaaggtgt acaagaacga gctgcaccgg cacatcgagg aaggcctggg cagaaacatg 240
agcgactga 249
<210> 110
<211> 249
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding MFN2
<400> 110
augcggcuga aguucaucga caagcagcug gaacugcugg cccaggacua caagcugcgg 60
aucaagcaga ucaccgaaga gguggaaaga caggugucca ccgccauggc cgaggaaauc 120
agacgacuga gcgugcuggu ggacgacuac cagauggacu uucaccccuc uccaguggug 180
cugaaggugu acaagaacga gcugcaccgg cacaucgagg aaggccuggg cagaaacaug 240
agcgacuga 249
<210> 111
<211> 234
<212> PRT
<213> Homo sapiens
<400> 111
Met Ser Glu Glu Glu Gln Gly Ser Gly Thr Thr Thr Gly Cys Gly Leu
1 5 10 15
Pro Ser Ile Glu Gln Met Leu Ala Ala Asn Pro Gly Lys Thr Pro Ile
20 25 30
Ser Leu Leu Gln Glu Tyr Gly Thr Arg Ile Gly Lys Thr Pro Val Tyr
35 40 45
Asp Leu Leu Lys Ala Glu Gly Gln Ala His Gln Pro Asn Phe Thr Phe
50 55 60
Arg Val Thr Val Gly Asp Thr Ser Cys Thr Gly Gln Gly Pro Ser Lys
65 70 75 80
Lys Ala Ala Lys His Lys Ala Ala Glu Val Ala Leu Lys His Leu Lys
85 90 95
Gly Gly Ser Met Leu Glu Pro Ala Leu Glu Asp Ser Ser Ser Phe Ser
100 105 110
Pro Leu Asp Ser Ser Leu Pro Glu Asp Ile Pro Val Phe Thr Ala Ala
115 120 125
Ala Ala Ala Thr Pro Val Pro Ser Val Val Leu Thr Arg Ser Pro Pro
130 135 140
Met Glu Leu Gln Pro Pro Val Ser Pro Gln Gln Ser Glu Cys Asn Pro
145 150 155 160
Val Gly Ala Leu Gln Glu Leu Val Val Gln Lys Gly Trp Arg Leu Pro
165 170 175
Glu Tyr Thr Val Thr Gln Glu Ser Gly Pro Ala His Arg Lys Glu Phe
180 185 190
Thr Met Thr Cys Arg Val Glu Arg Phe Ile Glu Ile Gly Ser Gly Thr
195 200 205
Ser Lys Lys Leu Ala Lys Arg Asn Ala Ala Ala Lys Met Leu Leu Arg
210 215 220
Val His Thr Val Pro Leu Asp Ala Arg Asp
225 230
<210> 112
<211> 702
<212> DNA
<213> Homo sapiens
<400> 112
atgagtgaag aggagcaagg ctccggcact accacgggct gcgggctgcc tagtatagag 60
caaatgctgg ccgccaaccc aggcaagacc ccgatcagcc ttctgcagga gtatgggacc 120
agaataggga agacgcctgt gtacgacctt ctcaaagccg agggccaagc ccaccagcct 180
aatttcacct tccgggtcac cgttggcgac accagctgca ctggtcaggg ccccagcaag 240
aaggcagcca agcacaaggc agctgaggtg gccctcaaac acctcaaagg ggggagcatg 300
ctggagccgg ccctggagga cagcagttct ttttctcccc tagactcttc actgcctgag 360
gacattccgg tttttactgc tgcagcagct gctaccccag ttccatctgt agtcctaacc 420
aggagccccc ccatggaact gcagccccct gtctcccctc agcagtctga gtgcaacccc 480
gttggtgctc tgcaggagct ggtggtgcag aaaggctggc ggttgccgga gtacacagtg 540
acccaggagt ctgggccagc ccaccgcaaa gaattcacca tgacctgtcg agtggagcgt 600
ttcattgaga ttgggagtgg cacttccaaa aaattggcaa agcggaatgc ggcggccaaa 660
atgctgcttc gagtgcacac ggtgcctctg gatgcccggg at 702
<210> 113
<211> 702
<212> RNA
<213> Homo sapiens
<400> 113
augagugaag aggagcaagg cuccggcacu accacgggcu gcgggcugcc uaguauagag 60
caaaugcugg ccgccaaccc aggcaagacc ccgaucagcc uucugcagga guaugggacc 120
agaauaggga agacgccugu guacgaccuu cucaaagccg agggccaagc ccaccagccu 180
aauuucaccu uccgggucac cguuggcgac accagcugca cuggucaggg ccccagcaag 240
aaggcagcca agcacaaggc agcugaggug gcccucaaac accucaaagg ggggagcaug 300
cuggagccgg cccuggagga cagcaguucu uuuucucccc uagacucuuc acugccugag 360
gacauuccgg uuuuuacugc ugcagcagcu gcuaccccag uuccaucugu aguccuaacc 420
aggagccccc ccauggaacu gcagcccccu gucuccccuc agcagucuga gugcaacccc 480
guuggugcuc ugcaggagcu gguggugcag aaaggcuggc gguugccgga guacacagug 540
acccaggagu cugggccagc ccaccgcaaa gaauucacca ugaccugucg aguggagcgu 600
uucauugaga uugggagugg cacuuccaaa aaauuggcaa agcggaaugc ggcggccaaa 660
augcugcuuc gagugcacac ggugccucug gaugcccggg au 702
<210> 114
<211> 705
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding TRBP
<400> 114
atgagcgagg aagaacaagg cagcggcacc accacaggat gtggcctgcc ttctatcgag 60
cagatgctgg ccgccaatcc tggcaagaca cctatcagcc tgctgcaaga gtacggcacc 120
cggatcggaa agacccctgt gtacgatctg ctgaaggccg aaggccaggc tcaccagcct 180
aacttcacct tcagagtgac cgtgggcgac accagctgta caggacaggg cccttctaag 240
aaggccgcca aacacaaagc cgccgaggtg gccctgaaac acctgaaagg cggctccatg 300
ctggaacccg ctctggaaga tagcagcagc ttcagccctc tggacagcag cctgcctgag 360
gacatccctg tgtttacagc cgctgccgct gctacacctg tgccatctgt ggtgctgacc 420
agatctcctc caatggaact gcagcctcct gtgtctcctc agcagagcga gtgtaatcct 480
gtgggcgccc tgcaagaact ggtggtgcaa aaaggatggc ggctgcccga gtacaccgtg 540
acacaagaat ctggccccgc tcaccggaaa gaattcacca tgacctgcag agtggaacgg 600
ttcatcgaga tcggctccgg cacctctaag aagctggcca agagaaacgc cgctgccaag 660
atgctgctgc gggtgcacac agttcctctg gacgccagag attga 705
<210> 115
<211> 705
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding TRBP
<400> 115
augagcgagg aagaacaagg cagcggcacc accacaggau guggccugcc uucuaucgag 60
cagaugcugg ccgccaaucc uggcaagaca ccuaucagcc ugcugcaaga guacggcacc 120
cggaucggaa agaccccugu guacgaucug cugaaggccg aaggccaggc ucaccagccu 180
aacuucaccu ucagagugac cgugggcgac accagcugua caggacaggg cccuucuaag 240
aaggccgcca aacacaaagc cgccgaggug gcccugaaac accugaaagg cggcuccaug 300
cuggaacccg cucuggaaga uagcagcagc uucagcccuc uggacagcag ccugccugag 360
gacaucccug uguuuacagc cgcugccgcu gcuacaccug ugccaucugu ggugcugacc 420
agaucuccuc caauggaacu gcagccuccu gugucuccuc agcagagcga guguaauccu 480
gugggcgccc ugcaagaacu gguggugcaa aaaggauggc ggcugcccga guacaccgug 540
acacaagaau cuggccccgc ucaccggaaa gaauucacca ugaccugcag aguggaacgg 600
uucaucgaga ucggcuccgg caccucuaag aagcuggcca agagaaacgc cgcugccaag 660
augcugcugc gggugcacac aguuccucug gacgccagag auuga 705
<210> 116
<211> 200
<212> PRT
<213> Homo sapiens
<400> 116
Met Ala Asp Pro Glu Val Cys Cys Phe Ile Thr Lys Ile Leu Cys Ala
1 5 10 15
His Gly Gly Arg Met Ala Leu Asp Ala Leu Leu Gln Glu Ile Ala Leu
20 25 30
Ser Glu Pro Gln Leu Cys Glu Val Leu Gln Val Ala Gly Pro Asp Arg
35 40 45
Phe Val Val Leu Glu Thr Gly Gly Glu Ala Gly Ile Thr Arg Ser Val
50 55 60
Val Ala Thr Thr Arg Ala Arg Val Cys Arg Arg Lys Tyr Cys Gln Arg
65 70 75 80
Pro Cys Asp Asn Leu His Leu Cys Lys Leu Asn Leu Leu Gly Arg Cys
85 90 95
Asn Tyr Ser Gln Ser Glu Arg Asn Leu Cys Lys Tyr Ser His Glu Val
100 105 110
Leu Ser Glu Glu Asn Phe Lys Val Leu Lys Asn His Glu Leu Ser Gly
115 120 125
Leu Asn Lys Glu Glu Leu Ala Val Leu Leu Leu Gln Ser Asp Pro Phe
130 135 140
Phe Met Pro Glu Ile Cys Lys Ser Tyr Lys Gly Glu Gly Arg Gln Gln
145 150 155 160
Ile Cys Asn Gln Gln Pro Pro Cys Ser Arg Leu His Ile Cys Asp His
165 170 175
Phe Thr Arg Gly Asn Cys Arg Phe Pro Asn Cys Leu Arg Ser His Asn
180 185 190
Leu Met Asp Arg Lys Val Leu Ala
195 200
<210> 117
<211> 600
<212> DNA
<213> Homo sapiens
<400> 117
atggcggacc cggaggtgtg ctgcttcatc accaaaatcc tgtgcgccca cgggggccgc 60
atggccctgg acgcgctgct ccaggagatc gcgctgtctg agccgcagct ctgtgaggtg 120
ctgcaggtgg ccgggcccga ccgctttgtg gtgttggaga ccggcggcga ggccgggatc 180
acccgatcgg tggtggccac cactcgagcc cgggtctgcc gtcgcaagta ctgccagaga 240
ccctgcgata acctgcatct ctgcaaactc aacttgctgg gccggtgcaa ctattcgcag 300
tccgagcgga atttatgcaa atattctcat gaggttctct cagaagagaa cttcaaagtc 360
ctgaaaaatc acgaactctc tggactgaac aaagaggaat tagcagtgct cctcctccaa 420
agtgatcctt tttttatgcc cgagatatgc aaaagttata agggagaggg tcggcagcag 480
atttgtaacc agcagccacc gtgttcaaga ctccacatct gtgaccactt cacccgaggg 540
aactgtcgtt ttcccaactg cctccggtcc cataacctga tggacagaaa ggtgctggcc 600
600
<210> 118
<211> 600
<212> RNA
<213> Homo sapiens
<400> 118
auggcggacc cggaggugug cugcuucauc accaaaaucc ugugcgccca cgggggccgc 60
auggcccugg acgcgcugcu ccaggagauc gcgcugucug agccgcagcu cugugaggug 120
cugcaggugg ccgggcccga ccgcuuugug guguuggaga ccggcggcga ggccgggauc 180
acccgaucgg ugguggccac cacucgagcc cgggucugcc gucgcaagua cugccagaga 240
cccugcgaua accugcaucu cugcaaacuc aacuugcugg gccggugcaa cuauucgcag 300
uccgagcgga auuuaugcaa auauucucau gagguucucu cagaagagaa cuucaaaguc 360
cugaaaaauc acgaacucuc uggacugaac aaagaggaau uagcagugcu ccuccuccaa 420
agugauccuu uuuuuaugcc cgagauaugc aaaaguuaua agggagaggg ucggcagcag 480
auuuguaacc agcagccacc guguucaaga cuccacaucu gugaccacuu cacccgaggg 540
aacugucguu uucccaacug ccuccggucc cauaaccuga uggacagaaa ggugcuggcc 600
600
<210> 119
<211> 603
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding Zinc finger anti-viral protein
<400> 119
atggccgatc ctgaagtgtg ctgcttcatc accaagatcc tgtgcgccca cggcggaaga 60
atggctctgg atgctctgct gcaagagatc gccctgtctg agcctcagct gtgcgaagtg 120
ctgcaagtgg ccggacctga cagattcgtg gtgctggaaa caggcggaga ggccggcatt 180
accagatccg tggtggctac cacaagagcc agagtgtgcc ggcggaagta ctgccagagg 240
ccttgcgata atctgcacct gtgcaagctg aacctgctgg gcagatgcaa ctacagccag 300
agcgagcgga atctgtgcaa gtactcccac gaggtgctga gcgaagagaa cttcaaggtg 360
ctgaagaacc acgagctgag cggcctgaac aaagaggaac tggccgttct gctgctgcag 420
agcgacccat tcttcatgcc cgagatctgc aagagctaca aaggcgaggg cagacagcag 480
atctgtaacc agcagcctcc atgcagcaga ctgcacatct gcgaccactt cacccggggc 540
aactgcagat tccccaactg cctgagaagc cacaacctga tggaccggaa ggtgctggct 600
tga 603
<210> 120
<211> 603
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding Zinc finger anti-viral protein
<400> 120
auggccgauc cugaagugug cugcuucauc accaagaucc ugugcgccca cggcggaaga 60
auggcucugg augcucugcu gcaagagauc gcccugucug agccucagcu gugcgaagug 120
cugcaagugg ccggaccuga cagauucgug gugcuggaaa caggcggaga ggccggcauu 180
accagauccg ugguggcuac cacaagagcc agagugugcc ggcggaagua cugccagagg 240
ccuugcgaua aucugcaccu gugcaagcug aaccugcugg gcagaugcaa cuacagccag 300
agcgagcgga aucugugcaa guacucccac gaggugcuga gcgaagagaa cuucaaggug 360
cugaagaacc acgagcugag cggccugaac aaagaggaac uggccguucu gcugcugcag 420
agcgacccau ucuucaugcc cgagaucugc aagagcuaca aaggcgaggg cagacagcag 480
aucuguaacc agcagccucc augcagcaga cugcacaucu gcgaccacuu cacccggggc 540
aacugcagau uccccaacug ccugagaagc cacaaccuga uggaccggaa ggugcuggcu 600
uga 603
<210> 121
<211> 170
<212> PRT
<213> Homo sapiens
<400> 121
Met Ala Gly Asp Leu Ser Ala Gly Phe Phe Met Glu Glu Leu Asn Thr
1 5 10 15
Tyr Arg Gln Lys Gln Gly Val Val Leu Lys Tyr Gln Glu Leu Pro Asn
20 25 30
Ser Gly Pro Pro His Asp Arg Arg Phe Thr Phe Gln Val Ile Ile Asp
35 40 45
Gly Arg Glu Phe Pro Glu Gly Glu Gly Arg Ser Lys Lys Glu Ala Lys
50 55 60
Asn Ala Ala Ala Lys Leu Ala Val Glu Ile Leu Asn Lys Glu Lys Lys
65 70 75 80
Ala Val Ser Pro Leu Leu Leu Thr Thr Thr Asn Ser Ser Glu Gly Leu
85 90 95
Ser Met Gly Asn Tyr Ile Gly Leu Ile Asn Arg Ile Ala Gln Lys Lys
100 105 110
Arg Leu Thr Val Asn Tyr Glu Gln Cys Ala Ser Gly Val His Gly Pro
115 120 125
Glu Gly Phe His Tyr Lys Cys Lys Met Gly Gln Lys Glu Tyr Ser Ile
130 135 140
Gly Thr Gly Ser Thr Lys Gln Glu Ala Lys Gln Leu Ala Ala Lys Leu
145 150 155 160
Ala Tyr Leu Gln Ile Leu Ser Glu Glu Thr
165 170
<210> 122
<211> 510
<212> DNA
<213> Homo sapiens
<400> 122
atggctggtg atctttcagc aggtttcttc atggaggaac ttaatacata ccgtcagaag 60
cagggagtag tacttaaata tcaagaactg cctaattcag gacctccaca tgataggagg 120
tttacatttc aagttataat agatggaaga gaatttccag aaggtgaagg tagatcaaag 180
aaggaagcaa aaaatgccgc agccaaatta gctgttgaga tacttaataa ggaaaagaag 240
gcagttagtc ctttattatt gacaacaacg aattcttcag aaggattatc catggggaat 300
tacataggcc ttatcaatag aattgcccag aagaaaagac taactgtaaa ttatgaacag 360
tgtgcatcgg gggtgcatgg gccagaagga tttcattata aatgcaaaat gggacagaaa 420
gaatatagta ttggtacagg ttctactaaa caggaagcaa aacaattggc cgctaaactt 480
gcatatcttc agatattatc agaagaaacc 510
<210> 123
<211> 510
<212> RNA
<213> Homo sapiens
<400> 123
auggcuggug aucuuucagc agguuucuuc auggaggaac uuaauacaua ccgucagaag 60
cagggaguag uacuuaaaua ucaagaacug ccuaauucag gaccuccaca ugauaggagg 120
uuuacauuuc aaguuauaau agauggaaga gaauuuccag aaggugaagg uagaucaaag 180
aaggaagcaa aaaaugccgc agccaaauua gcuguugaga uacuuaauaa ggaaaagaag 240
gcaguuaguc cuuuauuauu gacaacaacg aauucuucag aaggauuauc cauggggaau 300
uacauaggcc uuaucaauag aauugcccag aagaaaagac uaacuguaaa uuaugaacag 360
ugugcaucgg gggugcaugg gccagaagga uuucauuaua aaugcaaaau gggacagaaa 420
gaauauagua uugguacagg uucuacuaaa caggaagcaa aacaauuggc cgcuaaacuu 480
gcauaucuuc agauauuauc agaagaaacc 510
<210> 124
<211> 513
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding PKR
<400> 124
atggctggcg atctgagcgc cggcttcttc atggaagaac tgaacaccta ccggcagaaa 60
cagggcgtcg tgctgaagta ccaagagctg cctaatagcg gccctcctca cgaccggcgg 120
ttcacctttc aagtgatcat cgacggcaga gagttccccg aaggcgaggg cagatctaag 180
aaagaggcca agaacgccgc tgccaagctg gccgtggaaa tcctgaacaa agagaagaag 240
gccgtttctc ccctgctgct gaccaccacc aatagctctg agggcctgag catgggcaac 300
tacatcggcc tgatcaaccg gatcgcccag aaaaagcggc tgaccgtgaa ctacgagcag 360
tgtgccagcg gagtgcacgg ccctgagggc tttcactaca agtgcaagat gggccagaaa 420
gagtacagca tcggcaccgg cagcaccaag caagaagcca aacagctggc cgccaaactg 480
gcctacctgc agatcctgag cgaggaaacc tga 513
<210> 125
<211> 513
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding PKR
<400> 125
auggcuggcg aucugagcgc cggcuucuuc auggaagaac ugaacaccua ccggcagaaa 60
cagggcgucg ugcugaagua ccaagagcug ccuaauagcg gcccuccuca cgaccggcgg 120
uucaccuuuc aagugaucau cgacggcaga gaguuccccg aaggcgaggg cagaucuaag 180
aaagaggcca agaacgccgc ugccaagcug gccguggaaa uccugaacaa agagaagaag 240
gccguuucuc cccugcugcu gaccaccacc aauagcucug agggccugag caugggcaac 300
uacaucggcc ugaucaaccg gaucgcccag aaaaagcggc ugaccgugaa cuacgagcag 360
ugugccagcg gagugcacgg cccugagggc uuucacuaca agugcaagau gggccagaaa 420
gaguacagca ucggcaccgg cagcaccaag caagaagcca aacagcuggc cgccaaacug 480
gccuaccugc agauccugag cgaggaaacc uga 513
<210> 126
<211> 194
<212> PRT
<213> Homo sapiens
<400> 126
Met Ser Gln Ser Arg His Arg Ala Glu Ala Pro Pro Leu Glu Arg Glu
1 5 10 15
Asp Ser Gly Thr Phe Ser Leu Gly Lys Met Ile Thr Ala Lys Pro Gly
20 25 30
Lys Thr Pro Ile Gln Val Leu His Glu Tyr Gly Met Lys Thr Lys Asn
35 40 45
Ile Pro Val Tyr Glu Cys Glu Arg Ser Asp Val Gln Ile His Val Pro
50 55 60
Thr Phe Thr Phe Arg Val Thr Val Gly Asp Ile Thr Cys Thr Gly Glu
65 70 75 80
Gly Thr Ser Lys Lys Leu Ala Lys His Arg Ala Ala Glu Ala Ala Ile
85 90 95
Asn Ile Leu Lys Ala Asn Ala Ser Ile Cys Phe Ala Val Pro Asp Pro
100 105 110
Leu Met Pro Asp Pro Ser Lys Gln Pro Lys Asn Gln Leu Asn Pro Ile
115 120 125
Gly Ser Leu Gln Glu Leu Ala Ile His His Gly Trp Arg Leu Pro Glu
130 135 140
Tyr Thr Leu Ser Gln Glu Gly Gly Pro Ala His Lys Arg Glu Tyr Thr
145 150 155 160
Thr Ile Cys Arg Leu Glu Ser Phe Met Glu Thr Gly Lys Gly Ala Ser
165 170 175
Lys Lys Gln Ala Lys Arg Asn Ala Ala Glu Lys Phe Leu Ala Lys Phe
180 185 190
Ser Asn
<210> 127
<211> 582
<212> DNA
<213> Homo sapiens
<400> 127
atgtcccaga gcaggcaccg cgccgaggcc ccgccgctgg agcgcgagga cagtgggacc 60
ttcagtttgg ggaagatgat aacagctaag ccagggaaaa caccgattca ggtattacac 120
gaatacggca tgaagaccaa gaacatccca gtttatgaat gtgaaagatc tgatgtgcaa 180
atacacgtgc ccactttcac cttcagagta accgttggtg acataacctg cacaggtgaa 240
ggtacaagta agaagctggc gaaacataga gctgcagagg ctgccataaa cattttgaaa 300
gccaatgcaa gtatttgctt tgcagttcct gaccccttaa tgcctgaccc ttccaagcaa 360
ccaaagaacc agcttaatcc tattggttca ttacaggaat tggctattca tcatggctgg 420
agacttcctg aatataccct ttcccaggag ggaggacctg ctcataagag agaatatact 480
acaatttgca ggctagagtc atttatggaa actggaaagg gggcatcaaa aaagcaagcc 540
aaaaggaatg ctgctgagaa atttcttgcc aaatttagta at 582
<210> 128
<211> 582
<212> RNA
<213> Homo sapiens
<400> 128
augucccaga gcaggcaccg cgccgaggcc ccgccgcugg agcgcgagga cagugggacc 60
uucaguuugg ggaagaugau aacagcuaag ccagggaaaa caccgauuca gguauuacac 120
gaauacggca ugaagaccaa gaacauccca guuuaugaau gugaaagauc ugaugugcaa 180
auacacgugc ccacuuucac cuucagagua accguuggug acauaaccug cacaggugaa 240
gguacaagua agaagcuggc gaaacauaga gcugcagagg cugccauaaa cauuuugaaa 300
gccaaugcaa guauuugcuu ugcaguuccu gaccccuuaa ugccugaccc uuccaagcaa 360
ccaaagaacc agcuuaaucc uauugguuca uuacaggaau uggcuauuca ucauggcugg 420
agacuuccug aauauacccu uucccaggag ggaggaccug cucauaagag agaauauacu 480
acaauuugca ggcuagaguc auuuauggaa acuggaaagg gggcaucaaa aaagcaagcc 540
aaaaggaaug cugcugagaa auuucuugcc aaauuuagua au 582
<210> 129
<211> 513
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding PACT
<400> 129
atggctggcg atctgagcgc cggcttcttc atggaagaac tgaacaccta ccggcagaaa 60
cagggcgtcg tgctgaagta ccaagagctg cctaatagcg gccctcctca cgaccggcgg 120
ttcacctttc aagtgatcat cgacggcaga gagttccccg aaggcgaggg cagatctaag 180
aaagaggcca agaacgccgc tgccaagctg gccgtggaaa tcctgaacaa agagaagaag 240
gccgtttctc ccctgctgct gaccaccacc aatagctctg agggcctgag catgggcaac 300
tacatcggcc tgatcaaccg gatcgcccag aaaaagcggc tgaccgtgaa ctacgagcag 360
tgtgccagcg gagtgcacgg ccctgagggc tttcactaca agtgcaagat gggccagaaa 420
gagtacagca tcggcaccgg cagcaccaag caagaagcca aacagctggc cgccaaactg 480
gcctacctgc agatcctgag cgaggaaacc tga 513
<210> 130
<211> 585
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding PACT
<400> 130
augagccaga gcagacacag agccgaagcu ccuccacugg aaagagagga cagcggcacc 60
uuuagccugg gcaagaugau cacagccaag ccuggcaaga ccccuaucca ggugcugcac 120
gaguacggca ugaagaccaa gaacaucccc guguacgagu gcgagagaag cgacgugcag 180
auccacgugc caaccuucac cuucagagug accgugggcg acaucaccug uaccggcgag 240
ggcacaucua agaagcuggc caaacauaga gccgccgagg ccgccaucaa uauccugaag 300
gccaaugcca gcaucugcuu cgccgugccu gauccucuga ugcccgaucc uagcaagcag 360
cccaagaacc agcugaaccc uaucggcagc cugcaagagc uggccauuca ucauggaugg 420
cggcugccug aguacacccu gucucaagaa ggcggcccug cucacaagag agaguacacc 480
accaucugcc ggcuggaaag cuucauggaa acaggcaagg gcgccagcaa gaaacaggcc 540
aagagaaacg ccgccgagaa guuccuggcc aaguucagca acuga 585
<210> 131
<211> 330
<212> PRT
<213> Homo sapiens
<400> 131
Met Glu Ser Arg Asp His Asn Asn Pro Gln Glu Gly Pro Thr Ser Ser
1 5 10 15
Ser Gly Arg Arg Ala Ala Val Glu Asp Asn His Leu Leu Ile Lys Ala
20 25 30
Val Gln Asn Glu Asp Val Asp Leu Val Gln Gln Leu Leu Glu Gly Gly
35 40 45
Ala Asn Val Asn Phe Gln Glu Glu Glu Gly Gly Trp Thr Pro Leu His
50 55 60
Asn Ala Val Gln Met Ser Arg Glu Asp Ile Val Glu Leu Leu Leu Arg
65 70 75 80
His Gly Ala Asp Pro Val Leu Arg Lys Lys Asn Gly Ala Thr Pro Phe
85 90 95
Ile Leu Ala Ala Ile Ala Gly Ser Val Lys Leu Leu Lys Leu Phe Leu
100 105 110
Ser Lys Gly Ala Asp Val Asn Glu Cys Asp Phe Tyr Gly Phe Thr Ala
115 120 125
Phe Met Glu Ala Ala Val Tyr Gly Lys Val Lys Ala Leu Lys Phe Leu
130 135 140
Tyr Lys Arg Gly Ala Asn Val Asn Leu Arg Arg Lys Thr Lys Glu Asp
145 150 155 160
Gln Glu Arg Leu Arg Lys Gly Gly Ala Thr Ala Leu Met Asp Ala Ala
165 170 175
Glu Lys Gly His Val Glu Val Leu Lys Ile Leu Leu Asp Glu Met Gly
180 185 190
Ala Asp Val Asn Ala Cys Asp Asn Met Gly Arg Asn Ala Leu Ile His
195 200 205
Ala Leu Leu Ser Ser Asp Asp Ser Asp Val Glu Ala Ile Thr His Leu
210 215 220
Leu Leu Asp His Gly Ala Asp Val Asn Val Arg Gly Glu Arg Gly Lys
225 230 235 240
Thr Pro Leu Ile Leu Ala Val Glu Lys Lys His Leu Gly Leu Val Gln
245 250 255
Arg Leu Leu Glu Gln Glu His Ile Glu Ile Asn Asp Thr Asp Ser Asp
260 265 270
Gly Lys Thr Ala Leu Leu Leu Ala Val Glu Leu Lys Leu Lys Lys Ile
275 280 285
Ala Glu Leu Leu Cys Lys Arg Gly Ala Ser Thr Asp Cys Gly Asp Leu
290 295 300
Val Met Thr Ala Arg Arg Asn Tyr Asp His Ser Leu Val Lys Val Leu
305 310 315 320
Leu Ser His Gly Ala Lys Glu Asp Phe His
325 330
<210> 132
<211> 990
<212> DNA
<213> Homo sapiens
<400> 132
atggagagca gggatcataa caacccccag gagggaccca cgtcctccag cggtagaagg 60
gctgcagtgg aagacaatca cttgctgatt aaagctgttc aaaacgaaga tgttgacctg 120
gtccagcaat tgctggaagg tggagccaat gttaatttcc aggaagagga agggggctgg 180
acacctctgc ataacgcagt acaaatgagc agggaggaca ttgtggaact tctgcttcgt 240
catggtgctg accctgttct gaggaagaag aatggggcca cgccttttat cctcgcagcg 300
attgcgggga gcgtgaagct gctgaaactt ttcctttcta aaggagcaga tgtcaatgag 360
tgtgattttt atggcttcac agccttcatg gaagccgctg tgtatggtaa ggtcaaagcc 420
ctaaaattcc tttataagag aggagcaaat gtgaatttga ggcgaaagac aaaggaggat 480
caagagcggc tgaggaaagg aggggccaca gctctcatgg acgctgctga aaaaggacac 540
gtagaggtct tgaagattct ccttgatgag atgggggcag atgtaaacgc ctgtgacaat 600
atgggcagaa atgccttgat ccatgctctc ctgagctctg acgatagtga tgtggaggct 660
attacgcatc tgctgctgga ccatggggct gatgtcaatg tgaggggaga aagagggaag 720
actcccctga tcctggcagt ggagaagaag cacttgggtt tggtgcagag gcttctggag 780
caagagcaca tagagattaa tgacacagac agtgatggca aaacagcact gctgcttgct 840
gttgaactca aactgaagaa aatcgccgag ttgctgtgca aacgtggagc cagtacagat 900
tgtggggatc ttgttatgac agcgaggcgg aattatgacc attcccttgt gaaggttctt 960
ctctctcatg gagccaaaga agattttcac 990
<210> 133
<211> 990
<212> RNA
<213> Homo sapiens
<400> 133
auggagagca gggaucauaa caacccccag gagggaccca cguccuccag cgguagaagg 60
gcugcagugg aagacaauca cuugcugauu aaagcuguuc aaaacgaaga uguugaccug 120
guccagcaau ugcuggaagg uggagccaau guuaauuucc aggaagagga agggggcugg 180
acaccucugc auaacgcagu acaaaugagc agggaggaca uuguggaacu ucugcuucgu 240
cauggugcug acccuguucu gaggaagaag aauggggcca cgccuuuuau ccucgcagcg 300
auugcgggga gcgugaagcu gcugaaacuu uuccuuucua aaggagcaga ugucaaugag 360
ugugauuuuu auggcuucac agccuucaug gaagccgcug uguaugguaa ggucaaagcc 420
cuaaaauucc uuuauaagag aggagcaaau gugaauuuga ggcgaaagac aaaggaggau 480
caagagcggc ugaggaaagg aggggccaca gcucucaugg acgcugcuga aaaaggacac 540
guagaggucu ugaagauucu ccuugaugag augggggcag auguaaacgc cugugacaau 600
augggcagaa augccuugau ccaugcucuc cugagcucug acgauaguga uguggaggcu 660
auuacgcauc ugcugcugga ccauggggcu gaugucaaug ugaggggaga aagagggaag 720
acuccccuga uccuggcagu ggagaagaag cacuuggguu uggugcagag gcuucuggag 780
caagagcaca uagagauuaa ugacacagac agugauggca aaacagcacu gcugcuugcu 840
guugaacuca aacugaagaa aaucgccgag uugcugugca aacguggagc caguacagau 900
uguggggauc uuguuaugac agcgaggcgg aauuaugacc auucccuugu gaagguucuu 960
cucucucaug gagccaaaga agauuuucac 990
<210> 134
<211> 993
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding RNAse L
<400> 134
atggaaagcc gggaccacaa caaccctcaa gagggcccta caagcagctc tggtagaagg 60
gccgctgtgg aagataacca tctgctgatc aaggccgtgc agaacgagga cgtggacctg 120
gtgcaacaac tgctggaagg cggagccaac gtgaacttcc aagaggaaga aggcggctgg 180
acccctctgc ataacgctgt gcagatgagc agagaggaca tcgtcgagct gctgctgaga 240
catggcgctg accctgtgct gagaaagaag aacggcgcca cacctttcat cctggccgcc 300
attgccggaa gcgtgaagct gctgaagctg ttcctgagca agggcgccga tgtgaacgag 360
tgcgacttct acggcttcac cgccttcatg gaagccgccg tgtacggcaa agtgaaggcc 420
ctgaagttcc tgtacaagag gggcgctaac gtgaacctgc ggagaaagac caaagaggac 480
caagagcggc tgcggaaagg tggcgctaca gctcttatgg atgccgccga gaagggacac 540
gtggaagtgc tgaagatcct gctggatgag atgggcgcag acgtgaacgc ctgcgacaac 600
atgggaagaa acgccctgat tcacgccctg ctgagcagcg acgatagcga cgtggaagcc 660
atcacacatc tgctgctgga tcacggggct gatgtgaatg tgcggggcga gagaggaaag 720
accccactga ttctggccgt ggaaaagaaa cacctgggcc tcgtgcagag gctgctggaa 780
caagagcaca tcgagatcaa cgacaccgac agcgacggca agacagccct gctgcttgcc 840
gtggaactga agctgaagaa gatcgccgaa ctgctgtgca agagaggcgc cagcacagat 900
tgtggcgacc tcgtgatgac cgccagacgg aactacgatc acagcctggt caaggtgctg 960
ctgtcccatg gcgctaaaga ggacttccac tga 993
<210> 135
<211> 990
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding RNAse L
<400> 135
auggagagca gggaucauaa caacccccag gagggaccca cguccuccag cgguagaagg 60
gcugcagugg aagacaauca cuugcugauu aaagcuguuc aaaacgaaga uguugaccug 120
guccagcaau ugcuggaagg uggagccaau guuaauuucc aggaagagga agggggcugg 180
acaccucugc auaacgcagu acaaaugagc agggaggaca uuguggaacu ucugcuucgu 240
cauggugcug acccuguucu gaggaagaag aauggggcca cgccuuuuau ccucgcagcg 300
auugcgggga gcgugaagcu gcugaaacuu uuccuuucua aaggagcaga ugucaaugag 360
ugugauuuuu auggcuucac agccuucaug gaagccgcug uguaugguaa ggucaaagcc 420
cuaaaauucc uuuauaagag aggagcaaau gugaauuuga ggcgaaagac aaaggaggau 480
caagagcggc ugaggaaagg aggggccaca gcucucaugg acgcugcuga aaaaggacac 540
guagaggucu ugaagauucu ccuugaugag augggggcag auguaaacgc cugugacaau 600
augggcagaa augccuugau ccaugcucuc cugagcucug acgauaguga uguggaggcu 660
auuacgcauc ugcugcugga ccauggggcu gaugucaaug ugaggggaga aagagggaag 720
acuccccuga uccuggcagu ggagaagaag cacuuggguu uggugcagag gcuucuggag 780
caagagcaca uagagauuaa ugacacagac agugauggca aaacagcacu gcugcuugcu 840
guugaacuca aacugaagaa aaucgccgag uugcugugca aacguggagc caguacagau 900
uguggggauc uuguuaugac agcgaggcgg aauuaugacc auucccuugu gaagguucuu 960
cucucucaug gagccaaaga agauuuucac 990
<210> 136
<211> 343
<212> PRT
<213> Homo sapiens
<400> 136
Met Asp Leu Tyr Ser Thr Pro Ala Ala Ala Leu Asp Arg Phe Val Ala
1 5 10 15
Arg Arg Leu Gln Pro Arg Lys Glu Phe Val Glu Lys Ala Arg Arg Ala
20 25 30
Leu Gly Ala Leu Ala Ala Ala Leu Arg Glu Arg Gly Gly Arg Leu Gly
35 40 45
Ala Ala Ala Pro Arg Val Leu Lys Thr Val Lys Gly Gly Ser Ser Gly
50 55 60
Arg Gly Thr Ala Leu Lys Gly Gly Cys Asp Ser Glu Leu Val Ile Phe
65 70 75 80
Leu Asp Cys Phe Lys Ser Tyr Val Asp Gln Arg Ala Arg Arg Ala Glu
85 90 95
Ile Leu Ser Glu Met Arg Ala Ser Leu Glu Ser Trp Trp Gln Asn Pro
100 105 110
Val Pro Gly Leu Arg Leu Thr Phe Pro Glu Gln Ser Val Pro Gly Ala
115 120 125
Leu Gln Phe Arg Leu Thr Ser Val Asp Leu Glu Asp Trp Met Asp Val
130 135 140
Ser Leu Val Pro Ala Phe Asn Val Leu Gly Gln Ala Gly Ser Gly Val
145 150 155 160
Lys Pro Lys Pro Gln Val Tyr Ser Thr Leu Leu Asn Ser Gly Cys Gln
165 170 175
Gly Gly Glu His Ala Ala Cys Phe Thr Glu Leu Arg Arg Asn Phe Val
180 185 190
Asn Ile Arg Pro Ala Lys Leu Lys Asn Leu Ile Leu Leu Val Lys His
195 200 205
Trp Tyr His Gln Val Cys Leu Gln Gly Leu Trp Lys Glu Thr Leu Pro
210 215 220
Pro Val Tyr Ala Leu Glu Leu Leu Thr Ile Phe Ala Trp Glu Gln Gly
225 230 235 240
Cys Lys Lys Asp Ala Phe Ser Leu Ala Glu Gly Leu Arg Thr Val Leu
245 250 255
Gly Leu Ile Gln Gln His Gln His Leu Cys Val Phe Trp Thr Val Asn
260 265 270
Tyr Gly Phe Glu Asp Pro Ala Val Gly Gln Phe Leu Gln Arg Gln Leu
275 280 285
Lys Arg Pro Arg Pro Val Ile Leu Asp Pro Ala Asp Pro Thr Trp Asp
290 295 300
Leu Gly Asn Gly Ala Ala Trp His Trp Asp Leu Leu Ala Gln Glu Ala
305 310 315 320
Ala Ser Cys Tyr Asp His Pro Cys Phe Leu Arg Gly Met Gly Asp Pro
325 330 335
Val Gln Ser Trp Lys Gly Pro
340
<210> 137
<211> 1029
<212> DNA
<213> Homo sapiens
<400> 137
atggacttgt acagcacccc ggccgctgcg ctggacaggt tcgtggccag aaggctgcag 60
ccgcggaagg agttcgtaga gaaggcgcgg cgcgctctgg gcgccctggc cgctgccctg 120
agggagcgcg ggggccgcct cggtgctgct gccccgcggg tgctgaaaac tgtcaaggga 180
ggctcctcgg gccggggcac agctctcaag ggtggctgtg attctgaact tgtcatcttc 240
ctcgactgct tcaagagcta tgtggaccag agggcccgcc gtgcagagat cctcagtgag 300
atgcgggcat cgctggaatc ctggtggcag aacccagtcc ctggtctgag actcacgttt 360
cctgagcaga gcgtgcctgg ggccctgcag ttccgcctga catccgtaga tcttgaggac 420
tggatggatg ttagcctggt gcctgccttc aatgtcctgg gtcaggccgg ctccggcgtc 480
aaacccaagc cacaagtcta ctctaccctc ctcaacagtg gctgccaagg gggcgagcat 540
gcggcctgct tcacagagct gcggaggaac tttgtgaaca ttcgcccagc caagttgaag 600
aacctaatct tgctggtgaa gcactggtac caccaggtgt gcctacaggg gttgtggaag 660
gagacgctgc ccccggtcta tgccctggaa ttgctgacca tcttcgcctg ggagcagggc 720
tgtaagaagg atgctttcag cctagccgaa ggcctccgaa ctgtcctggg cctgatccaa 780
cagcatcagc acctgtgtgt tttctggact gtcaactatg gcttcgagga ccctgcagtt 840
gggcagttct tgcagcggca gcttaagaga cccaggcctg tgatcctgga cccagctgac 900
cccacatggg acctggggaa tggggcagcc tggcactggg atttgctagc ccaggaggca 960
gcatcctgct atgaccaccc atgctttctg agggggatgg gggacccagt gcagtcttgg 1020
aaggggccg 1029
<210> 138
<211> 1029
<212> RNA
<213> Homo sapiens
<400> 138
auggacuugu acagcacccc ggccgcugcg cuggacaggu ucguggccag aaggcugcag 60
ccgcggaagg aguucguaga gaaggcgcgg cgcgcucugg gcgcccuggc cgcugcccug 120
agggagcgcg ggggccgccu cggugcugcu gccccgcggg ugcugaaaac ugucaaggga 180
ggcuccucgg gccggggcac agcucucaag gguggcugug auucugaacu ugucaucuuc 240
cucgacugcu ucaagagcua uguggaccag agggcccgcc gugcagagau ccucagugag 300
augcgggcau cgcuggaauc cugguggcag aacccagucc cuggucugag acucacguuu 360
ccugagcaga gcgugccugg ggcccugcag uuccgccuga cauccguaga ucuugaggac 420
uggauggaug uuagccuggu gccugccuuc aauguccugg gucaggccgg cuccggcguc 480
aaacccaagc cacaagucua cucuacccuc cucaacagug gcugccaagg gggcgagcau 540
gcggccugcu ucacagagcu gcggaggaac uuugugaaca uucgcccagc caaguugaag 600
aaccuaaucu ugcuggugaa gcacugguac caccaggugu gccuacaggg guuguggaag 660
gagacgcugc ccccggucua ugcccuggaa uugcugacca ucuucgccug ggagcagggc 720
uguaagaagg augcuuucag ccuagccgaa ggccuccgaa cuguccuggg ccugauccaa 780
cagcaucagc accugugugu uuucuggacu gucaacuaug gcuucgagga cccugcaguu 840
gggcaguucu ugcagcggca gcuuaagaga cccaggccug ugauccugga cccagcugac 900
cccacauggg accuggggaa uggggcagcc uggcacuggg auuugcuagc ccaggaggca 960
gcauccugcu augaccaccc augcuuucug agggggaugg gggacccagu gcagucuugg 1020
aaggggccg 1029
<210> 139
<211> 1032
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding OAS3
<400> 139
atggacctgt acagcacacc agccgccgct ctggatagat tcgtggctag acgactgcag 60
ccccggaaag aattcgtgga aaaggctcgg agagccctgg gagcacttgc tgctgctctg 120
agagaaagag gcggcagact tggagccgct gctcccagag tgctgaaaac agtgaaaggc 180
ggcagcagcg gcagaggcac agctcttaaa ggcggctgcg atagcgagct ggtcatcttc 240
ctggactgct tcaagagcta cgtggaccag agagccagac gggccgagat cctgtctgag 300
atgagagcca gcctggaaag ctggtggcag aatcctgtgc ctggcctgag actgacattc 360
cccgaacagt ctgttcccgg cgctctgcag tttagactga cctccgtgga cctggaagat 420
tggatggatg tgtccctggt gcctgccttc aatgtgctgg gacaagctgg ctctggcgtg 480
aagcctaagc ctcaggtgta ctctaccctg ctgaactccg gctgtcaagg cggagaacac 540
gccgcctgtt ttaccgagct gcggcggaac ttcgtgaaca tcagacccgc caagctgaag 600
aacctgatcc tgctggtcaa gcactggtat caccaagtgt gcctgcaagg cctgtggaaa 660
gaaaccctgc ctcctgtgta cgccctggaa ctgctgacca tcttcgcctg ggaacagggc 720
tgcaagaagg acgcctttag cctggccgag ggcctgagaa cagttctggg actgattcag 780
cagcaccagc acctgtgcgt gttctggacc gtgaactacg gcttcgagga tcctgccgtg 840
ggccagtttc tgcagagaca gctgaagagg cccagacctg tgatcctgga tcctgcagac 900
cctacatggg acctcggaaa tggcgctgcc tggcattggg atctgctggc ccaagaagcc 960
gccagctgtt acgatcaccc ctgctttctg agaggcatgg gcgatcctgt gcagagctgg 1020
aagggacctt ga 1032
<210> 140
<211> 1032
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding OAS3
<400> 140
auggaccugu acagcacacc agccgccgcu cuggauagau ucguggcuag acgacugcag 60
ccccggaaag aauucgugga aaaggcucgg agagcccugg gagcacuugc ugcugcucug 120
agagaaagag gcggcagacu uggagccgcu gcucccagag ugcugaaaac agugaaaggc 180
ggcagcagcg gcagaggcac agcucuuaaa ggcggcugcg auagcgagcu ggucaucuuc 240
cuggacugcu ucaagagcua cguggaccag agagccagac gggccgagau ccugucugag 300
augagagcca gccuggaaag cugguggcag aauccugugc cuggccugag acugacauuc 360
cccgaacagu cuguucccgg cgcucugcag uuuagacuga ccuccgugga ccuggaagau 420
uggauggaug ugucccuggu gccugccuuc aaugugcugg gacaagcugg cucuggcgug 480
aagccuaagc cucaggugua cucuacccug cugaacuccg gcugucaagg cggagaacac 540
gccgccuguu uuaccgagcu gcggcggaac uucgugaaca ucagacccgc caagcugaag 600
aaccugaucc ugcuggucaa gcacugguau caccaagugu gccugcaagg ccuguggaaa 660
gaaacccugc cuccugugua cgcccuggaa cugcugacca ucuucgccug ggaacagggc 720
ugcaagaagg acgccuuuag ccuggccgag ggccugagaa caguucuggg acugauucag 780
cagcaccagc accugugcgu guucuggacc gugaacuacg gcuucgagga uccugccgug 840
ggccaguuuc ugcagagaca gcugaagagg cccagaccug ugauccugga uccugcagac 900
ccuacauggg accucggaaa uggcgcugcc uggcauuggg aucugcuggc ccaagaagcc 960
gccagcuguu acgaucaccc cugcuuucug agaggcaugg gcgauccugu gcagagcugg 1020
aagggaccuu ga 1032
<210> 141
<211> 133
<212> PRT
<213> Homo sapiens
<400> 141
Met Gln Glu Lys Pro Lys Pro Val Pro Asp Lys Glu Asn Lys Lys Leu
1 5 10 15
Leu Cys Arg Lys Cys Lys Ala Leu Ala Cys Tyr Thr Ala Asp Val Arg
20 25 30
Val Ile Glu Glu Cys His Tyr Thr Val Leu Gly Asp Ala Phe Lys Glu
35 40 45
Cys Phe Val Ser Arg Pro His Pro Lys Pro Lys Gln Phe Ser Ser Phe
50 55 60
Glu Lys Arg Ala Lys Ile Phe Cys Ala Arg Gln Asn Cys Ser His Asp
65 70 75 80
Trp Gly Ile His Val Lys Tyr Lys Thr Phe Glu Ile Pro Val Ile Lys
85 90 95
Ile Glu Ser Phe Val Val Glu Asp Ile Ala Thr Gly Val Gln Thr Leu
100 105 110
Tyr Ser Lys Trp Lys Asp Phe His Phe Glu Lys Ile Pro Phe Asp Pro
115 120 125
Ala Glu Met Ser Lys
130
<210> 142
<211> 399
<212> DNA
<213> Homo sapiens
<400> 142
atgcaagaaa aaccaaaacc tgtacctgat aaggaaaata aaaaactgct ctgcagaaag 60
tgcaaagcct tggcatgtta cacagctgac gtaagagtga tagaggaatg ccattacact 120
gtgcttggag atgcttttaa ggaatgcttt gtgagtagac cacatcccaa gccaaagcag 180
ttttcaagtt ttgaaaaaag agcaaagata ttctgtgccc gacagaactg cagccatgac 240
tggggaatcc atgtgaagta caagacattt gagattccag ttataaaaat tgaaagtttt 300
gtggtggagg atattgcaac tggagttcag acactgtact cgaagtggaa ggactttcat 360
tttgagaaga taccatttga tccagcagaa atgtccaaa 399
<210> 143
<211> 399
<212> RNA
<213> Homo sapiens
<400> 143
augcaagaaa aaccaaaacc uguaccugau aaggaaaaua aaaaacugcu cugcagaaag 60
ugcaaagccu uggcauguua cacagcugac guaagaguga uagaggaaug ccauuacacu 120
gugcuuggag augcuuuuaa ggaaugcuuu gugaguagac cacaucccaa gccaaagcag 180
uuuucaaguu uugaaaaaag agcaaagaua uucugugccc gacagaacug cagccaugac 240
uggggaaucc augugaagua caagacauuu gagauuccag uuauaaaaau ugaaaguuuu 300
gugguggagg auauugcaac uggaguucag acacuguacu cgaaguggaa ggacuuucau 360
uuugagaaga uaccauuuga uccagcagaa auguccaaa 399
<210> 144
<211> 402
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding RIG-1
<400> 144
atgcaagaga agcccaagcc tgtgcctgac aaagagaaca agaaactgct gtgccggaag 60
tgcaaggccc tggcctgtta tacagccgac gtgcgcgtga tcgaggaatg ccactataca 120
gtgctgggcg acgccttcaa agaatgcttc gtgtcccggc ctcatcctaa gcctaagcag 180
ttcagcagct tcgagaagcg ggccaagatc ttctgcgcca gacagaactg cagccacgac 240
tggggaatcc acgtgaagta caagaccttc gagatccccg tgatcaagat cgagagcttc 300
gtggtggaag atatcgccac cggcgtgcag accctgtaca gcaagtggaa ggatttccac 360
tttgagaaga tccctttcga ccccgccgag atgagcaagt ga 402
<210> 145
<211> 402
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding RIG-1
<400> 145
augcaagaga agcccaagcc ugugccugac aaagagaaca agaaacugcu gugccggaag 60
ugcaaggccc uggccuguua uacagccgac gugcgcguga ucgaggaaug ccacuauaca 120
gugcugggcg acgccuucaa agaaugcuuc gugucccggc cucauccuaa gccuaagcag 180
uucagcagcu ucgagaagcg ggccaagauc uucugcgcca gacagaacug cagccacgac 240
uggggaaucc acgugaagua caagaccuuc gagauccccg ugaucaagau cgagagcuuc 300
gugguggaag auaucgccac cggcgugcag acccuguaca gcaaguggaa ggauuuccac 360
uuugagaaga ucccuuucga ccccgccgag augagcaagu ga 402
<210> 146
<211> 650
<212> PRT
<213> Homo sapiens
<400> 146
Met Ala Ser Asn Met Asp Arg Glu Met Ile Leu Ala Asp Phe Gln Ala
1 5 10 15
Cys Thr Gly Ile Glu Asn Ile Asp Glu Ala Ile Thr Leu Leu Glu Gln
20 25 30
Asn Asn Trp Asp Leu Val Ala Ala Ile Asn Gly Val Ile Pro Gln Glu
35 40 45
Asn Gly Ile Leu Gln Ser Glu Tyr Gly Gly Glu Thr Ile Pro Gly Pro
50 55 60
Ala Phe Asn Pro Ala Ser His Pro Ala Ser Ala Pro Thr Ser Ser Ser
65 70 75 80
Ser Ser Ala Phe Arg Pro Val Met Pro Ser Arg Gln Ile Val Glu Arg
85 90 95
Gln Pro Arg Met Leu Asp Phe Arg Val Glu Tyr Arg Asp Arg Asn Val
100 105 110
Asp Val Val Leu Glu Asp Thr Cys Thr Val Gly Glu Ile Lys Gln Ile
115 120 125
Leu Glu Asn Glu Leu Gln Ile Pro Val Ser Lys Met Leu Leu Lys Gly
130 135 140
Trp Lys Thr Gly Asp Val Glu Asp Ser Thr Val Leu Lys Ser Leu His
145 150 155 160
Leu Pro Lys Asn Asn Ser Leu Tyr Val Leu Thr Pro Asp Leu Pro Pro
165 170 175
Pro Ser Ser Ser Ser His Ala Gly Ala Leu Gln Glu Ser Leu Asn Gln
180 185 190
Asn Phe Met Leu Ile Ile Thr His Arg Glu Val Gln Arg Glu Tyr Asn
195 200 205
Leu Asn Phe Ser Gly Ser Ser Thr Ile Gln Glu Val Lys Arg Asn Val
210 215 220
Tyr Asp Leu Thr Ser Ile Pro Val Arg His Gln Leu Trp Glu Gly Trp
225 230 235 240
Pro Thr Ser Ala Thr Asp Asp Ser Met Cys Leu Ala Glu Ser Gly Leu
245 250 255
Ser Tyr Pro Cys His Arg Leu Thr Val Gly Arg Arg Ser Ser Pro Ala
260 265 270
Gln Thr Arg Glu Gln Ser Glu Glu Gln Ile Thr Asp Val His Met Val
275 280 285
Ser Asp Ser Asp Gly Asp Asp Phe Glu Asp Ala Thr Glu Phe Gly Val
290 295 300
Asp Asp Gly Glu Val Phe Gly Met Ala Ser Ser Ala Leu Arg Lys Ser
305 310 315 320
Pro Met Met Pro Glu Asn Ala Glu Asn Glu Gly Asp Ala Leu Leu Gln
325 330 335
Phe Thr Ala Glu Phe Ser Ser Arg Tyr Gly Asp Cys His Pro Val Phe
340 345 350
Phe Ile Gly Ser Leu Glu Ala Ala Phe Gln Glu Ala Phe Tyr Val Lys
355 360 365
Ala Arg Asp Arg Lys Leu Leu Ala Ile Tyr Leu His His Asp Glu Ser
370 375 380
Val Leu Thr Asn Val Phe Cys Ser Gln Met Leu Cys Ala Glu Ser Ile
385 390 395 400
Val Ser Tyr Leu Ser Gln Asn Phe Ile Thr Trp Ala Trp Asp Leu Thr
405 410 415
Lys Asp Ser Asn Arg Ala Arg Phe Leu Thr Met Cys Asn Arg His Phe
420 425 430
Gly Ser Val Val Ala Gln Thr Ile Arg Thr Gln Lys Thr Asp Gln Phe
435 440 445
Pro Leu Phe Leu Ile Ile Met Gly Lys Arg Ser Ser Asn Glu Val Leu
450 455 460
Asn Val Ile Gln Gly Asn Thr Thr Val Asp Glu Leu Met Met Arg Leu
465 470 475 480
Met Ala Ala Met Glu Ile Phe Thr Ala Gln Gln Gln Glu Asp Ile Lys
485 490 495
Asp Glu Asp Glu Arg Glu Ala Arg Glu Asn Val Lys Arg Glu Gln Asp
500 505 510
Glu Ala Tyr Arg Leu Ser Leu Glu Ala Asp Arg Ala Lys Arg Glu Ala
515 520 525
His Glu Arg Glu Met Ala Glu Gln Phe Arg Leu Glu Gln Ile Arg Lys
530 535 540
Glu Gln Glu Glu Glu Arg Glu Ala Ile Arg Leu Ser Leu Glu Gln Ala
545 550 555 560
Leu Pro Pro Glu Pro Lys Glu Glu Asn Ala Glu Pro Val Ser Lys Leu
565 570 575
Arg Ile Arg Thr Pro Ser Gly Glu Phe Leu Glu Arg Arg Phe Leu Ala
580 585 590
Ser Asn Lys Leu Gln Ile Val Phe Asp Phe Val Ala Ser Lys Gly Phe
595 600 605
Pro Trp Asp Glu Tyr Lys Leu Leu Ser Thr Phe Pro Arg Arg Asp Val
610 615 620
Thr Gln Leu Asp Pro Asn Lys Ser Leu Leu Glu Val Lys Leu Phe Pro
625 630 635 640
Gln Glu Thr Leu Phe Leu Glu Ala Lys Glu
645 650
<210> 147
<211> 1950
<212> DNA
<213> Homo sapiens
<400> 147
atggcgtcca acatggaccg ggagatgatc ctggcggatt ttcaggcatg tactggcatt 60
gaaaacattg acgaagctat tacattgctt gaacaaaata attgggactt agtggcagct 120
atcaatggtg taataccaca ggaaaatggc attctacaaa gtgaatatgg aggtgagacc 180
ataccaggac ctgcatttaa tccagcaagt catccagctt cagctcctac ttcctcttct 240
tcttcagcgt ttcgacctgt aatgccatcc aggcagattg tagaaaggca acctcggatg 300
ctggacttca gggttgaata cagagacaga aatgttgatg tggtacttga agacacctgt 360
actgttggag agattaaaca gattctagaa aatgaacttc agatacctgt gtccaaaatg 420
ctgttaaaag gctggaagac gggagatgtg gaagacagta cggtcctaaa atctctacac 480
ttgccaaaaa acaacagtct ttatgtcctt acaccagatt tgccaccacc ttcatcatct 540
agtcatgctg gtgccctgca ggagtcatta aatcaaaact tcatgctgat catcacccac 600
cgagaagtcc agcgggagta caacctgaac ttctcaggaa gcagtactat tcaagaggta 660
aagagaaatg tgtatgacct tacaagtatc cccgttcgcc accaattatg ggagggctgg 720
ccaacttctg ctacagacga ctcaatgtgt cttgctgaat cagggctctc ttatccctgc 780
catcgactta cagtgggaag aagatcttca cctgcacaga cccgggaaca gtcggaagaa 840
caaatcaccg atgttcatat ggttagtgat agcgatggag atgactttga agatgctaca 900
gaatttgggg tggatgatgg agaagtattt ggcatggcgt catctgcctt gagaaaatct 960
ccaatgatgc cagaaaacgc agaaaatgaa ggagatgcct tattacaatt tacagcagag 1020
ttttcttcaa gatatggtga ttgccatcct gtatttttta ttggctcatt agaagctgct 1080
tttcaagagg ccttctatgt gaaagcccga gatagaaagc ttcttgctat ctacctccac 1140
catgatgaaa gtgtgttaac caacgtgttc tgctcacaaa tgctttgtgc tgaatccatt 1200
gtttcttatc tgagtcaaaa ttttataacc tgggcttggg atctgacaaa ggactccaac 1260
agagcaagat ttctcactat gtgcaataga cactttggca gtgttgtggc acaaaccatt 1320
cggactcaaa aaacggatca gtttccgctt ttcctgatta ttatgggaaa gcgatcatct 1380
aatgaagtgt tgaatgtgat acaagggaac acaacagtag atgagttaat gatgagactc 1440
atggctgcaa tggagatctt cacagcccaa caacaggaag atataaagga cgaggatgaa 1500
cgtgaagcca gagaaaatgt gaagagagag caagatgagg cctatcgcct ttcacttgag 1560
gctgacagag caaagaggga agctcacgag agagagatgg cagaacagtt tcgtttggag 1620
cagattcgca aagaacaaga agaggaacgt gaggccatcc ggctgtcctt agagcaagcc 1680
ctgcctcctg agccaaagga agaaaatgct gagcctgtga gcaaactgcg gatccggacc 1740
cccagtggcg agttcttgga gcggcgtttc ctggccagca acaagctcca gattgtcttt 1800
gattttgtag cttccaaagg atttccatgg gatgagtaca agttactgag cacctttcct 1860
aggagagacg taactcaact ggacccaaat aaatcattat tggaggtaaa gttgttccct 1920
caagaaaccc ttttccttga agcaaaagag 1950
<210> 148
<211> 1950
<212> RNA
<213> Homo sapiens
<400> 148
auggcgucca acauggaccg ggagaugauc cuggcggauu uucaggcaug uacuggcauu 60
gaaaacauug acgaagcuau uacauugcuu gaacaaaaua auugggacuu aguggcagcu 120
aucaauggug uaauaccaca ggaaaauggc auucuacaaa gugaauaugg aggugagacc 180
auaccaggac cugcauuuaa uccagcaagu cauccagcuu cagcuccuac uuccucuucu 240
ucuucagcgu uucgaccugu aaugccaucc aggcagauug uagaaaggca accucggaug 300
cuggacuuca ggguugaaua cagagacaga aauguugaug ugguacuuga agacaccugu 360
acuguuggag agauuaaaca gauucuagaa aaugaacuuc agauaccugu guccaaaaug 420
cuguuaaaag gcuggaagac gggagaugug gaagacagua cgguccuaaa aucucuacac 480
uugccaaaaa acaacagucu uuauguccuu acaccagauu ugccaccacc uucaucaucu 540
agucaugcug gugcccugca ggagucauua aaucaaaacu ucaugcugau caucacccac 600
cgagaagucc agcgggagua caaccugaac uucucaggaa gcaguacuau ucaagaggua 660
aagagaaaug uguaugaccu uacaaguauc cccguucgcc accaauuaug ggagggcugg 720
ccaacuucug cuacagacga cucaaugugu cuugcugaau cagggcucuc uuaucccugc 780
caucgacuua cagugggaag aagaucuuca ccugcacaga cccgggaaca gucggaagaa 840
caaaucaccg auguucauau gguuagugau agcgauggag augacuuuga agaugcuaca 900
gaauuugggg uggaugaugg agaaguauuu ggcauggcgu caucugccuu gagaaaaucu 960
ccaaugaugc cagaaaacgc agaaaaugaa ggagaugccu uauuacaauu uacagcagag 1020
uuuucuucaa gauaugguga uugccauccu guauuuuuua uuggcucauu agaagcugcu 1080
uuucaagagg ccuucuaugu gaaagcccga gauagaaagc uucuugcuau cuaccuccac 1140
caugaugaaa guguguuaac caacguguuc ugcucacaaa ugcuuugugc ugaauccauu 1200
guuucuuauc ugagucaaaa uuuuauaacc ugggcuuggg aucugacaaa ggacuccaac 1260
agagcaagau uucucacuau gugcaauaga cacuuuggca guguuguggc acaaaccauu 1320
cggacucaaa aaacggauca guuuccgcuu uuccugauua uuaugggaaa gcgaucaucu 1380
aaugaagugu ugaaugugau acaagggaac acaacaguag augaguuaau gaugagacuc 1440
auggcugcaa uggagaucuu cacagcccaa caacaggaag auauaaagga cgaggaugaa 1500
cgugaagcca gagaaaaugu gaagagagag caagaugagg ccuaucgccu uucacuugag 1560
gcugacagag caaagaggga agcucacgag agagagaugg cagaacaguu ucguuuggag 1620
cagauucgca aagaacaaga agaggaacgu gaggccaucc ggcuguccuu agagcaagcc 1680
cugccuccug agccaaagga agaaaaugcu gagccuguga gcaaacugcg gauccggacc 1740
cccaguggcg aguucuugga gcggcguuuc cuggccagca acaagcucca gauugucuuu 1800
gauuuuguag cuuccaaagg auuuccaugg gaugaguaca aguuacugag caccuuuccu 1860
aggagagacg uaacucaacu ggacccaaau aaaucauuau uggagguaaa guuguucccu 1920
caagaaaccc uuuuccuuga agcaaaagag 1950
<210> 149
<211> 1953
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding FAF1
<400> 149
atggccagca acatggacag agagatgatc ctggccgact tccaggcctg taccggcatc 60
gagaacatcg acgaggccat cacactgctg gaacagaaca actgggatct cgtggccgcc 120
atcaacggcg tgatccctca agagaatggc atcctgcaga gcgagtacgg cggcgagaca 180
attcctggac ctgccttcaa tcccgccagc catcctgcat ctgcccctac atctagcagc 240
agcagcgcct tcagacccgt gatgcctagc agacagatcg tggaacggca gcccagaatg 300
ctggacttca gagtcgagta ccgggacaga aacgtggacg tggtgctgga agatacctgc 360
accgtgggcg agatcaagca gatcctggaa aacgagctgc agatccccgt gtccaagatg 420
ctgctgaaag gctggaaaac cggcgacgtg gaagatagca ccgtgctgaa gtccctgcat 480
ctccctaaga acaacagcct gtacgtgctg acccctgacc tgcctcctcc aagctctagt 540
tctcatgctg gcgccctgca agagagcctg aaccagaact tcatgctgat catcacccac 600
cgcgaggtgc agagagagta taacctgaac ttcagcggca gcagcaccat ccaagaagtg 660
aagcggaacg tctacgacct gaccagcatt cctgtgcggc accagctttg ggaaggctgg 720
cctacaagcg ccaccgacga ttctatgtgt ctggccgaga gcggcctgag ctacccttgt 780
cacagactga ccgtgggcag aagaagcagc cctgctcaga caagagagca gtccgaggaa 840
cagatcaccg acgtgcacat ggtgtccgat agcgacggcg acgatttcga ggatgccacc 900
gagtttggag tggacgacgg cgaggttttc ggcatggcta gcagcgccct gagaaagtcc 960
cctatgatgc ccgagaacgc cgagaatgaa ggcgacgccc tgctgcagtt taccgccgag 1020
tttagcagca gatacggcga ctgtcacccc gtgttcttca tcggatctct ggaagccgcc 1080
ttccaagagg ccttttacgt gaaggccaga gacagaaagc tgctggctat ctatctgcac 1140
cacgacgaga gcgtgctgac aaacgtgttc tgcagccaga tgctgtgcgc cgagagcatc 1200
gtgtcttacc tgtctcagaa tttcatcacc tgggcctggg atctgaccaa ggacagcaac 1260
agagcccggt tcctgaccat gtgtaaccgg cactttggca gcgtggtggc ccagaccatc 1320
agaacccaga aaaccgatca gttccctctg ttcctgatca ttatgggcaa gcgcagcagc 1380
aacgaggtgc tgaatgtgat ccagggcaac accaccgtgg acgagctgat gatgagactg 1440
atggccgcta tggaaatctt cacagcccag cagcaagaag atatcaagga cgaggacgag 1500
cgcgaggccc gcgagaatgt gaaaagagaa caggacgaag cctaccggct gagcctggaa 1560
gctgacagag ccaagagaga ggcccacgag agagagatgg ccgagcagtt cagactcgag 1620
cagatccgca aagagcaaga ggaagagaga gaagccatcc ggctgtccct ggaacaagcc 1680
ttgcctcctg agcctaaaga agagaacgct gagccagtgt ccaagctgcg gatcagaact 1740
cctagcggcg agttcctgga aagacggttc ctggcctcca acaaactgca gatcgtgttc 1800
gacttcgtgg cctctaaggg cttcccctgg gacgagtaca agctgctgag cacattcccc 1860
agacgggacg tgacacagct ggaccctaac aagagcctgc tggaagtgaa actgtttccc 1920
caagagacac tgtttctcga ggccaaagag tga 1953
<210> 150
<211> 1953
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding FAF1
<400> 150
auggccagca acauggacag agagaugauc cuggccgacu uccaggccug uaccggcauc 60
gagaacaucg acgaggccau cacacugcug gaacagaaca acugggaucu cguggccgcc 120
aucaacggcg ugaucccuca agagaauggc auccugcaga gcgaguacgg cggcgagaca 180
auuccuggac cugccuucaa ucccgccagc cauccugcau cugccccuac aucuagcagc 240
agcagcgccu ucagacccgu gaugccuagc agacagaucg uggaacggca gcccagaaug 300
cuggacuuca gagucgagua ccgggacaga aacguggacg uggugcugga agauaccugc 360
accgugggcg agaucaagca gauccuggaa aacgagcugc agauccccgu guccaagaug 420
cugcugaaag gcuggaaaac cggcgacgug gaagauagca ccgugcugaa gucccugcau 480
cucccuaaga acaacagccu guacgugcug accccugacc ugccuccucc aagcucuagu 540
ucucaugcug gcgcccugca agagagccug aaccagaacu ucaugcugau caucacccac 600
cgcgaggugc agagagagua uaaccugaac uucagcggca gcagcaccau ccaagaagug 660
aagcggaacg ucuacgaccu gaccagcauu ccugugcggc accagcuuug ggaaggcugg 720
ccuacaagcg ccaccgacga uucuaugugu cuggccgaga gcggccugag cuacccuugu 780
cacagacuga ccgugggcag aagaagcagc ccugcucaga caagagagca guccgaggaa 840
cagaucaccg acgugcacau gguguccgau agcgacggcg acgauuucga ggaugccacc 900
gaguuuggag uggacgacgg cgagguuuuc ggcauggcua gcagcgcccu gagaaagucc 960
ccuaugaugc ccgagaacgc cgagaaugaa ggcgacgccc ugcugcaguu uaccgccgag 1020
uuuagcagca gauacggcga cugucacccc guguucuuca ucggaucucu ggaagccgcc 1080
uuccaagagg ccuuuuacgu gaaggccaga gacagaaagc ugcuggcuau cuaucugcac 1140
cacgacgaga gcgugcugac aaacguguuc ugcagccaga ugcugugcgc cgagagcauc 1200
gugucuuacc ugucucagaa uuucaucacc ugggccuggg aucugaccaa ggacagcaac 1260
agagcccggu uccugaccau guguaaccgg cacuuuggca gcgugguggc ccagaccauc 1320
agaacccaga aaaccgauca guucccucug uuccugauca uuaugggcaa gcgcagcagc 1380
aacgaggugc ugaaugugau ccagggcaac accaccgugg acgagcugau gaugagacug 1440
auggccgcua uggaaaucuu cacagcccag cagcaagaag auaucaagga cgaggacgag 1500
cgcgaggccc gcgagaaugu gaaaagagaa caggacgaag ccuaccggcu gagccuggaa 1560
gcugacagag ccaagagaga ggcccacgag agagagaugg ccgagcaguu cagacucgag 1620
cagauccgca aagagcaaga ggaagagaga gaagccaucc ggcugucccu ggaacaagcc 1680
uugccuccug agccuaaaga agagaacgcu gagccagugu ccaagcugcg gaucagaacu 1740
ccuagcggcg aguuccugga aagacgguuc cuggccucca acaaacugca gaucguguuc 1800
gacuucgugg ccucuaaggg cuuccccugg gacgaguaca agcugcugag cacauucccc 1860
agacgggacg ugacacagcu ggacccuaac aagagccugc uggaagugaa acuguuuccc 1920
caagagacac uguuucucga ggccaaagag uga 1953
<210> 151
<211> 211
<212> PRT
<213> Homo sapiens
<400> 151
Met Val Ala His Asn Gln Val Ala Ala Asp Asn Ala Val Ser Thr Ala
1 5 10 15
Ala Glu Pro Arg Arg Arg Pro Glu Pro Ser Ser Ser Ser Ser Ser Ser
20 25 30
Pro Ala Ala Pro Ala Arg Pro Arg Pro Cys Pro Ala Val Pro Ala Pro
35 40 45
Ala Pro Gly Asp Thr His Phe Arg Thr Phe Arg Ser His Ala Asp Tyr
50 55 60
Arg Arg Ile Thr Arg Ala Ser Ala Leu Leu Asp Ala Cys Gly Phe Tyr
65 70 75 80
Trp Gly Pro Leu Ser Val His Gly Ala His Glu Arg Leu Arg Ala Glu
85 90 95
Pro Val Gly Thr Phe Leu Val Arg Asp Ser Arg Gln Arg Asn Cys Phe
100 105 110
Phe Ala Leu Ser Val Lys Met Ala Ser Gly Pro Thr Ser Ile Arg Val
115 120 125
His Phe Gln Ala Gly Arg Phe His Leu Asp Gly Ser Arg Glu Ser Phe
130 135 140
Asp Cys Leu Phe Glu Leu Leu Glu His Tyr Val Ala Ala Pro Arg Arg
145 150 155 160
Met Leu Gly Ala Pro Leu Arg Gln Arg Arg Val Arg Pro Leu Gln Glu
165 170 175
Leu Cys Arg Gln Arg Ile Val Ala Thr Val Gly Arg Glu Asn Leu Ala
180 185 190
Arg Ile Pro Leu Asn Pro Val Leu Arg Asp Tyr Leu Ser Ser Phe Pro
195 200 205
Phe Gln Ile
210
<210> 152
<211> 633
<212> DNA
<213> Homo sapiens
<400> 152
atggtagcac acaaccaggt ggcagccgac aatgcagtct ccacagcagc agagccccga 60
cggcggccag aaccttcctc ctcttcctcc tcctcgcccg cggcccccgc gcgcccgcgg 120
ccgtgccccg cggtcccggc cccggccccc ggcgacacgc acttccgcac attccgttcg 180
cacgccgatt accggcgcat cacgcgcgcc agcgcgctcc tggacgcctg cggattctac 240
tgggggcccc tgagcgtgca cggggcgcac gagcggctgc gcgccgagcc cgtgggcacc 300
ttcctggtgc gcgacagccg ccagcggaac tgctttttcg cccttagcgt gaagatggcc 360
tcgggaccca cgagcatccg cgtgcacttt caggccggcc gctttcacct ggatggcagc 420
cgcgagagct tcgactgcct cttcgagctg ctggagcact acgtggcggc gccgcgccgc 480
atgctggggg ccccgctgcg ccagcgccgc gtgcggccgc tgcaggagct gtgccgccag 540
cgcatcgtgg ccaccgtggg ccgcgagaac ctggctcgca tccccctcaa ccccgtcctc 600
cgcgactacc tgagctcctt ccccttccag att 633
<210> 153
<211> 633
<212> RNA
<213> Homo sapiens
<400> 153
augguagcac acaaccaggu ggcagccgac aaugcagucu ccacagcagc agagccccga 60
cggcggccag aaccuuccuc cucuuccucc uccucgcccg cggcccccgc gcgcccgcgg 120
ccgugccccg cggucccggc cccggccccc ggcgacacgc acuuccgcac auuccguucg 180
cacgccgauu accggcgcau cacgcgcgcc agcgcgcucc uggacgccug cggauucuac 240
ugggggcccc ugagcgugca cggggcgcac gagcggcugc gcgccgagcc cgugggcacc 300
uuccuggugc gcgacagccg ccagcggaac ugcuuuuucg cccuuagcgu gaagauggcc 360
ucgggaccca cgagcauccg cgugcacuuu caggccggcc gcuuucaccu ggauggcagc 420
cgcgagagcu ucgacugccu cuucgagcug cuggagcacu acguggcggc gccgcgccgc 480
augcuggggg ccccgcugcg ccagcgccgc gugcggccgc ugcaggagcu gugccgccag 540
cgcaucgugg ccaccguggg ccgcgagaac cuggcucgca ucccccucaa ccccguccuc 600
cgcgacuacc ugagcuccuu ccccuuccag auu 633
<210> 154
<211> 636
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding SOCS1
<400> 154
atggtggccc ataatcaggt ggccgccgat aacgccgtgt ctacagctgc cgaacctaga 60
agaaggcccg agcctagcag cagcagctct agttctcctg ccgctcctgc cagacctaga 120
ccttgtcctg ctgttcctgc tccagctcct ggcgacaccc acttcagaac ctttagaagc 180
cacgccgact accggcggat cacaagagca tctgctctgc tggatgcctg cggcttttat 240
tggggccctc tgtctgtgca cggcgcccac gaaagactga gagctgaacc tgtgggcacc 300
ttcctcgtgc gggatagcag acagcggaac tgcttctttg ccctgagcgt gaagatggcc 360
agcggaccca catccatcag agtgcacttt caggccggca gattccacct ggatggcagc 420
agagagagct tcgactgcct gttcgagctg ctggaacact acgtggccgc tcctagaagg 480
atgctgggag cacccctgag acagagaaga gtgcggcctc tgcaagagct gtgccggcag 540
agaatcgtgg ccacagtggg cagagagaac ctggccagaa ttcctctgaa ccccgtgctg 600
agagactacc tgagcagctt ccccttccaa atctga 636
<210> 155
<211> 636
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding SOCS1
<400> 155
augguggccc auaaucaggu ggccgccgau aacgccgugu cuacagcugc cgaaccuaga 60
agaaggcccg agccuagcag cagcagcucu aguucuccug ccgcuccugc cagaccuaga 120
ccuuguccug cuguuccugc uccagcuccu ggcgacaccc acuucagaac cuuuagaagc 180
cacgccgacu accggcggau cacaagagca ucugcucugc uggaugccug cggcuuuuau 240
uggggcccuc ugucugugca cggcgcccac gaaagacuga gagcugaacc ugugggcacc 300
uuccucgugc gggauagcag acagcggaac ugcuucuuug cccugagcgu gaagauggcc 360
agcggaccca cauccaucag agugcacuuu caggccggca gauuccaccu ggauggcagc 420
agagagagcu ucgacugccu guucgagcug cuggaacacu acguggccgc uccuagaagg 480
augcugggag caccccugag acagagaaga gugcggccuc ugcaagagcu gugccggcag 540
agaaucgugg ccacaguggg cagagagaac cuggccagaa uuccucugaa ccccgugcug 600
agagacuacc ugagcagcuu ccccuuccaa aucuga 636
<210> 156
<211> 225
<212> PRT
<213> Homo sapiens
<400> 156
Met Val Thr His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu
1 5 10 15
Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gln
20 25 30
Leu Val Val Asn Ala Val Arg Lys Leu Gln Glu Ser Gly Phe Tyr Trp
35 40 45
Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro
50 55 60
Ala Gly Thr Phe Leu Ile Arg Asp Ser Ser Asp Gln Arg His Phe Phe
65 70 75 80
Thr Leu Ser Val Lys Thr Gln Ser Gly Thr Lys Asn Leu Arg Ile Gln
85 90 95
Cys Glu Gly Gly Ser Phe Ser Leu Gln Ser Asp Pro Arg Ser Thr Gln
100 105 110
Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu Val His His Tyr Met
115 120 125
Pro Pro Pro Gly Ala Pro Ser Phe Pro Ser Pro Pro Thr Glu Pro Ser
130 135 140
Ser Glu Val Pro Glu Gln Pro Ser Ala Gln Pro Leu Pro Gly Ser Pro
145 150 155 160
Pro Arg Arg Ala Tyr Tyr Ile Tyr Ser Gly Gly Glu Lys Ile Pro Leu
165 170 175
Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala Thr Leu Gln His Leu
180 185 190
Cys Arg Lys Thr Val Asn Gly His Leu Asp Ser Tyr Glu Lys Val Thr
195 200 205
Gln Leu Pro Gly Pro Ile Arg Glu Phe Leu Asp Gln Tyr Asp Ala Pro
210 215 220
Leu
225
<210> 157
<211> 675
<212> DNA
<213> Homo sapiens
<400> 157
atggtcaccc acagcaagtt tcccgccgcc gggatgagcc gccccctgga caccagcctg 60
cgcctcaaga ccttcagctc caagagcgag taccagctgg tggtgaacgc agtgcgcaag 120
ctgcaggaga gcggcttcta ctggagcgca gtgaccggcg gcgaggcgaa cctgctgctc 180
agtgccgagc ccgccggcac ctttctgatc cgcgacagct cggaccagcg ccacttcttc 240
acgctcagcg tcaagaccca gtctgggacc aagaacctgc gcatccagtg tgaggggggc 300
agcttctctc tgcagagcga tccccggagc acgcagcccg tgccccgctt cgactgcgtg 360
ctcaagctgg tgcaccacta catgccgccc cctggagccc cctccttccc ctcgccacct 420
actgaaccct cctccgaggt gcccgagcag ccgtctgccc agccactccc tgggagtccc 480
cccagaagag cctattacat ctactccggg ggcgagaaga tccccctggt gttgagccgg 540
cccctctcct ccaacgtggc cactcttcag catctctgtc ggaagaccgt caacggccac 600
ctggactcct atgagaaagt cacccagctg ccggggccca ttcgggagtt cctggaccag 660
tacgatgccc cgctt 675
<210> 158
<211> 675
<212> RNA
<213> Homo sapiens
<400> 158
auggucaccc acagcaaguu ucccgccgcc gggaugagcc gcccccugga caccagccug 60
cgccucaaga ccuucagcuc caagagcgag uaccagcugg uggugaacgc agugcgcaag 120
cugcaggaga gcggcuucua cuggagcgca gugaccggcg gcgaggcgaa ccugcugcuc 180
agugccgagc ccgccggcac cuuucugauc cgcgacagcu cggaccagcg ccacuucuuc 240
acgcucagcg ucaagaccca gucugggacc aagaaccugc gcauccagug ugaggggggc 300
agcuucucuc ugcagagcga uccccggagc acgcagcccg ugccccgcuu cgacugcgug 360
cucaagcugg ugcaccacua caugccgccc ccuggagccc ccuccuuccc cucgccaccu 420
acugaacccu ccuccgaggu gcccgagcag ccgucugccc agccacuccc ugggaguccc 480
cccagaagag ccuauuacau cuacuccggg ggcgagaaga ucccccuggu guugagccgg 540
ccccucuccu ccaacguggc cacucuucag caucucuguc ggaagaccgu caacggccac 600
cuggacuccu augagaaagu cacccagcug ccggggccca uucgggaguu ccuggaccag 660
uacgaugccc cgcuu 675
<210> 159
<211> 678
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding SOCS3
<400> 159
atggtcaccc acagcaagtt tccagccgcc ggaatgagca gacccctgga tacaagcctg 60
cggctgaaaa ccttcagcag caagagcgag tatcagctgg tggtcaacgc cgtgcggaag 120
ctgcaagaga gcggctttta ttggagcgcc gtgacaggcg gagaggccaa tcttctgctg 180
tctgccgaac ctgccggcac cttcctgatc agagatagca gcgaccagcg gcacttcttc 240
accctgagcg tgaaaaccca gagcggcacc aagaacctgc ggatccaatg tgaaggcggc 300
agcttcagcc tgcagagcga ccctagatct acccagcctg tgcctagatt cgactgcgtg 360
ctgaagctcg tgcaccacta catgcctcca cctggcgctc ctagcttccc atctcctcca 420
acagagccta gcagcgaggt gccagaacag ccttctgctc aacctctgcc tggcagccct 480
cctagaaggg cctactacat ctattctggc ggcgagaaga tccctctggt gctgtctaga 540
cccctgagca gcaatgtggc cactctgcag cacctgtgca gaaagaccgt gaacggccac 600
ctggacagct acgagaaagt gacccaactg cctggaccta tcagagagtt cctggaccag 660
tacgacgccc ctctttga 678
<210> 160
<211> 678
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding SOCS3
<400> 160
auggucaccc acagcaaguu uccagccgcc ggaaugagca gaccccugga uacaagccug 60
cggcugaaaa ccuucagcag caagagcgag uaucagcugg uggucaacgc cgugcggaag 120
cugcaagaga gcggcuuuua uuggagcgcc gugacaggcg gagaggccaa ucuucugcug 180
ucugccgaac cugccggcac cuuccugauc agagauagca gcgaccagcg gcacuucuuc 240
acccugagcg ugaaaaccca gagcggcacc aagaaccugc ggauccaaug ugaaggcggc 300
agcuucagcc ugcagagcga cccuagaucu acccagccug ugccuagauu cgacugcgug 360
cugaagcucg ugcaccacua caugccucca ccuggcgcuc cuagcuuccc aucuccucca 420
acagagccua gcagcgaggu gccagaacag ccuucugcuc aaccucugcc uggcagcccu 480
ccuagaaggg ccuacuacau cuauucuggc ggcgagaaga ucccucuggu gcugucuaga 540
ccccugagca gcaauguggc cacucugcag caccugugca gaaagaccgu gaacggccac 600
cuggacagcu acgagaaagu gacccaacug ccuggaccua ucagagaguu ccuggaccag 660
uacgacgccc cucuuuga 678
<210> 161
<211> 371
<212> PRT
<213> Homo sapiens
<400> 161
Met Ser Lys Ala Phe Gly Leu Leu Arg Gln Ile Cys Gln Ser Ile Leu
1 5 10 15
Ala Glu Ser Ser Gln Ser Pro Ala Asp Leu Glu Glu Lys Lys Glu Glu
20 25 30
Asp Ser Asn Met Lys Arg Glu Gln Pro Arg Glu Arg Pro Arg Ala Trp
35 40 45
Asp Tyr Pro His Gly Leu Val Gly Leu His Asn Ile Gly Gln Thr Cys
50 55 60
Cys Leu Asn Ser Leu Ile Gln Val Phe Val Met Asn Val Asp Phe Thr
65 70 75 80
Arg Ile Leu Lys Arg Ile Thr Val Pro Gly Ala Asp Glu Gln Arg Arg
85 90 95
Ser Val Pro Phe Gln Met Leu Leu Leu Leu Glu Lys Met Gln Asp Ser
100 105 110
Arg Gln Lys Ala Val Arg Pro Leu Glu Leu Ala Tyr Cys Leu Gln Lys
115 120 125
Cys Asn Val Pro Leu Phe Val Gln His Asp Ala Ala Gln Leu Tyr Leu
130 135 140
Lys Leu Trp Asn Leu Ile Lys Asp Gln Ile Thr Asp Val His Leu Val
145 150 155 160
Glu Arg Leu Gln Ala Leu Tyr Thr Ile Arg Val Lys Asp Ser Leu Ile
165 170 175
Cys Val Asp Cys Ala Met Glu Ser Ser Arg Asn Ser Ser Met Leu Thr
180 185 190
Leu Pro Leu Ser Leu Phe Asp Val Asp Ser Lys Pro Leu Lys Thr Leu
195 200 205
Glu Asp Ala Leu His Cys Phe Phe Gln Pro Arg Glu Leu Ser Ser Lys
210 215 220
Ser Lys Cys Phe Cys Glu Asn Cys Gly Lys Lys Thr Arg Gly Lys Gln
225 230 235 240
Val Leu Lys Leu Thr His Leu Pro Gln Thr Leu Thr Ile His Leu Met
245 250 255
Arg Phe Ser Ile Arg Asn Ser Gln Thr Arg Lys Ile Cys His Ser Leu
260 265 270
Tyr Phe Pro Gln Ser Leu Asp Phe Ser Gln Ile Leu Pro Met Lys Arg
275 280 285
Glu Ser Cys Asp Ala Glu Glu Gln Ser Gly Gly Gln Tyr Glu Leu Phe
290 295 300
Ala Val Ile Ala His Val Gly Met Ala Asp Ser Gly His Tyr Cys Val
305 310 315 320
Tyr Ile Arg Asn Ala Val Asp Gly Lys Trp Phe Cys Phe Asn Asp Ser
325 330 335
Asn Ile Cys Leu Val Ser Trp Glu Asp Ile Gln Cys Thr Tyr Gly Asn
340 345 350
Pro Asn Tyr His Trp Gln Glu Thr Ala Tyr Leu Leu Val Tyr Met Lys
355 360 365
Met Glu Cys
370
<210> 162
<211> 1116
<212> DNA
<213> Homo sapiens
<400> 162
atgagcaagg cgtttgggct cctgaggcaa atctgtcagt ccatcctggc tgagtcctcg 60
cagtccccgg cagatcttga agaaaagaag gaagaagaca gcaacatgaa gagagagcag 120
cccagagagc gtcccagggc ctgggactac cctcatggcc tggttggttt acacaacatt 180
ggacagacct gctgccttaa ctccttgatt caggtgttcg taatgaatgt ggacttcacc 240
aggatattga agaggatcac ggtgcccagg ggagctgacg agcagaggag aagcgtccct 300
ttccagatgc ttctgctgct ggagaagatg caggacagcc ggcagaaagc agtgcggccc 360
ctggagctgg cctactgcct gcagaagtgc aacgtgccct tgtttgtcca acatgatgct 420
gcccaactgt acctcaaact ctggaacctg attaaggacc agatcactga tgtgcacttg 480
gtggagagac tgcaggccct gtatacgatc cgggtgaagg actccttgat ttgcgttgac 540
tgtgccatgg agagtagcag aaacagcagc atgctcaccc tcccactttc tctttttgat 600
gtggactcaa agcccctgaa gacactggag gacgccctgc actgcttctt ccagcccagg 660
gagttatcaa gcaaaagcaa gtgcttctgt gagaactgtg ggaagaagac ccgtgggaaa 720
caggtcttga agctgaccca tttgccccag accctgacaa tccacctcat gcgattctcc 780
atcaggaatt cacagacgag aaagatctgc cactccctgt acttccccca gagcttggat 840
ttcagccaga tccttccaat gaagcgagag tcttgtgatg ctgaggagca gtctggaggg 900
cagtatgagc tttttgctgt gattgcgcac gtgggaatgg cagactccgg tcattactgt 960
gtctacatcc ggaatgctgt ggatggaaaa tggttctgct tcaatgactc caatatttgc 1020
ttggtgtcct gggaagacat ccagtgtacc tacggaaatc ctaactacca ctggcaggaa 1080
actgcatatc ttctggttta catgaagatg gagtgc 1116
<210> 163
<211> 1116
<212> RNA
<213> Homo sapiens
<400> 163
augagcaagg cguuugggcu ccugaggcaa aucugucagu ccauccuggc ugaguccucg 60
caguccccgg cagaucuuga agaaaagaag gaagaagaca gcaacaugaa gagagagcag 120
cccagagagc gucccagggc cugggacuac ccucauggcc ugguugguuu acacaacauu 180
ggacagaccu gcugccuuaa cuccuugauu cagguguucg uaaugaaugu ggacuucacc 240
aggauauuga agaggaucac ggugcccagg ggagcugacg agcagaggag aagcgucccu 300
uuccagaugc uucugcugcu ggagaagaug caggacagcc ggcagaaagc agugcggccc 360
cuggagcugg ccuacugccu gcagaagugc aacgugcccu uguuugucca acaugaugcu 420
gcccaacugu accucaaacu cuggaaccug auuaaggacc agaucacuga ugugcacuug 480
guggagagac ugcaggcccu guauacgauc cgggugaagg acuccuugau uugcguugac 540
ugugccaugg agaguagcag aaacagcagc augcucaccc ucccacuuuc ucuuuuugau 600
guggacucaa agccccugaa gacacuggag gacgcccugc acugcuucuu ccagcccagg 660
gaguuaucaa gcaaaagcaa gugcuucugu gagaacugug ggaagaagac ccgugggaaa 720
caggucuuga agcugaccca uuugccccag acccugacaa uccaccucau gcgauucucc 780
aucaggaauu cacagacgag aaagaucugc cacucccugu acuuccccca gagcuuggau 840
uucagccaga uccuuccaau gaagcgagag ucuugugaug cugaggagca gucuggaggg 900
caguaugagc uuuuugcugu gauugcgcac gugggaaugg cagacuccgg ucauuacugu 960
gucuacaucc ggaaugcugu ggauggaaaa ugguucugcu ucaaugacuc caauauuugc 1020
uugguguccu gggaagacau ccaguguacc uacggaaauc cuaacuacca cuggcaggaa 1080
acugcauauc uucugguuua caugaagaug gagugc 1116
<210> 164
<211> 1119
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding USP18
<400> 164
atgagcaagg ccttcggcct gctgagacag atctgccagt ctatcctggc cgagagcagc 60
cagtctcctg ccgatctgga agagaagaaa gaagaggact ccaacatgaa gcgcgagcag 120
cccagagaaa gacccagagc ctgggattat cctcacggcc tcgtgggcct gcacaatatc 180
ggccagacct gctgcctgaa cagcctgatc caggtgttcg tgatgaacgt ggacttcacc 240
cggatcctga agcggatcac agtgcctaga ggcgccgacg agcagagaag atccgtgcct 300
tttcagatgc tgctgctcct ggaaaagatg caggacagcc ggcagaaggc cgtcagacct 360
ctggaactgg cctactgcct gcagaaatgc aacgtgcccc tgttcgtgca gcacgatgcc 420
gctcagctgt acctgaagct gtggaacctg atcaaggacc agatcaccga cgtgcacctg 480
gtggaaagac tgcaggccct gtacaccatc agagtgaagg actccctgat ctgcgtggac 540
tgcgccatgg aaagcagccg gaatagctcc atgctgaccc tgcctctgag cctgttcgac 600
gtggacagca agcccctgaa aaccctggaa gatgccctgc actgcttctt ccagcctaga 660
gagctgagca gcaagagcaa gtgcttctgc gagaactgcg gcaagaaaac ccggggcaaa 720
caggtgctga agctgaccca tctgcctcag acactgacca tccacctgat gcggttcagc 780
atccggaaca gccagaccag aaagatctgt cactccctgt acttccctca gtctctggac 840
ttcagccaga ttctgcccat gaagagagag agctgcgacg ccgaagaaca gtctggcgga 900
cagtacgagc tgttcgccgt gattgcccac gttggcatgg ccgatagcgg ccactactgc 960
gtgtacatca gaaacgccgt ggacggcaag tggttctgtt tcaacgacag caatatctgc 1020
ctggtgtcct gggaagatat ccagtgcacc tacggcaacc ccaactacca ctggcaagag 1080
acagcctacc tgctggtgta catgaagatg gaatgctga 1119
<210> 165
<211> 1119
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding USP18
<400> 165
augagcaagg ccuucggccu gcugagacag aucugccagu cuauccuggc cgagagcagc 60
cagucuccug ccgaucugga agagaagaaa gaagaggacu ccaacaugaa gcgcgagcag 120
cccagagaaa gacccagagc cugggauuau ccucacggcc ucgugggccu gcacaauauc 180
ggccagaccu gcugccugaa cagccugauc cagguguucg ugaugaacgu ggacuucacc 240
cggauccuga agcggaucac agugccuaga ggcgccgacg agcagagaag auccgugccu 300
uuucagaugc ugcugcuccu ggaaaagaug caggacagcc ggcagaaggc cgucagaccu 360
cuggaacugg ccuacugccu gcagaaaugc aacgugcccc uguucgugca gcacgaugcc 420
gcucagcugu accugaagcu guggaaccug aucaaggacc agaucaccga cgugcaccug 480
guggaaagac ugcaggcccu guacaccauc agagugaagg acucccugau cugcguggac 540
ugcgccaugg aaagcagccg gaauagcucc augcugaccc ugccucugag ccuguucgac 600
guggacagca agccccugaa aacccuggaa gaugcccugc acugcuucuu ccagccuaga 660
gagcugagca gcaagagcaa gugcuucugc gagaacugcg gcaagaaaac ccggggcaaa 720
caggugcuga agcugaccca ucugccucag acacugacca uccaccugau gcgguucagc 780
auccggaaca gccagaccag aaagaucugu cacucccugu acuucccuca gucucuggac 840
uucagccaga uucugcccau gaagagagag agcugcgacg ccgaagaaca gucuggcgga 900
caguacgagc uguucgccgu gauugcccac guuggcaugg ccgauagcgg ccacuacugc 960
guguacauca gaaacgccgu ggacggcaag ugguucuguu ucaacgacag caauaucugc 1020
cugguguccu gggaagauau ccagugcacc uacggcaacc ccaacuacca cuggcaagag 1080
acagccuacc ugcuggugua caugaagaug gaaugcuga 1119
<210> 166
<211> 565
<212> PRT
<213> Homo sapiens
<400> 166
Met Pro Gln Ala Ser Glu His Arg Leu Gly Arg Thr Arg Glu Pro Pro
1 5 10 15
Val Asn Ile Gln Pro Arg Val Gly Ser Lys Leu Pro Phe Ala Pro Arg
20 25 30
Ala Arg Ser Lys Glu Arg Arg Asn Pro Ala Ser Gly Pro Asn Pro Met
35 40 45
Leu Arg Pro Leu Pro Pro Arg Pro Gly Leu Pro Asp Glu Arg Leu Lys
50 55 60
Lys Leu Glu Leu Gly Arg Gly Arg Thr Ser Gly Pro Arg Pro Arg Gly
65 70 75 80
Pro Leu Arg Ala Asp His Gly Val Pro Leu Pro Gly Ser Pro Pro Pro
85 90 95
Thr Val Ala Leu Pro Leu Pro Ser Arg Thr Asn Leu Ala Arg Ser Lys
100 105 110
Ser Val Ser Ser Gly Asp Leu Arg Pro Met Gly Ile Ala Leu Gly Gly
115 120 125
His Arg Gly Thr Gly Glu Leu Gly Ala Ala Leu Ser Arg Leu Ala Leu
130 135 140
Arg Pro Glu Pro Pro Thr Leu Arg Arg Ser Thr Ser Leu Arg Arg Leu
145 150 155 160
Gly Gly Phe Pro Gly Pro Pro Thr Leu Phe Ser Ile Arg Thr Glu Pro
165 170 175
Pro Ala Ser His Gly Ser Phe His Met Ile Ser Ala Arg Ser Ser Glu
180 185 190
Pro Phe Tyr Ser Asp Asp Lys Met Ala His His Thr Leu Leu Leu Gly
195 200 205
Ser Gly His Val Gly Leu Arg Asn Leu Gly Asn Thr Cys Phe Leu Asn
210 215 220
Ala Val Leu Gln Cys Leu Ser Ser Thr Arg Pro Leu Arg Asp Phe Cys
225 230 235 240
Leu Arg Arg Asp Phe Arg Gln Glu Val Pro Gly Gly Gly Arg Ala Gln
245 250 255
Glu Leu Thr Glu Ala Phe Ala Asp Val Ile Gly Ala Leu Trp His Pro
260 265 270
Asp Ser Cys Glu Ala Val Asn Pro Thr Arg Phe Arg Ala Val Phe Gln
275 280 285
Lys Tyr Val Pro Ser Phe Ser Gly Tyr Ser Gln Gln Asp Ala Gln Glu
290 295 300
Phe Leu Lys Leu Leu Met Glu Arg Leu His Leu Glu Ile Asn Arg Arg
305 310 315 320
Gly Arg Arg Ala Pro Pro Ile Leu Ala Asn Gly Pro Val Pro Ser Pro
325 330 335
Pro Arg Arg Gly Gly Ala Leu Leu Glu Glu Pro Glu Leu Ser Asp Asp
340 345 350
Asp Arg Ala Asn Leu Met Trp Lys Arg Tyr Leu Glu Arg Glu Asp Ser
355 360 365
Lys Ile Val Asp Leu Phe Val Gly Gln Leu Lys Ser Cys Leu Lys Cys
370 375 380
Gln Ala Cys Gly Tyr Arg Ser Thr Thr Phe Glu Val Phe Cys Asp Leu
385 390 395 400
Ser Leu Pro Ile Pro Lys Lys Gly Phe Ala Gly Gly Lys Val Ser Leu
405 410 415
Arg Asp Cys Phe Asn Leu Phe Thr Lys Glu Glu Glu Leu Glu Ser Glu
420 425 430
Asn Ala Pro Val Cys Asp Arg Cys Arg Gln Lys Thr Arg Ser Thr Lys
435 440 445
Lys Leu Thr Val Gln Arg Phe Pro Arg Ile Leu Val Leu His Leu Asn
450 455 460
Arg Phe Ser Ala Ser Arg Gly Ser Ile Lys Lys Ser Ser Val Gly Val
465 470 475 480
Asp Phe Pro Leu Gln Arg Leu Ser Leu Gly Asp Phe Ala Ser Asp Lys
485 490 495
Ala Gly Ser Pro Val Tyr Gln Leu Tyr Ala Leu Cys Asn His Ser Gly
500 505 510
Ser Val His Tyr Gly His Tyr Thr Ala Leu Cys Arg Cys Gln Thr Gly
515 520 525
Trp His Val Tyr Asn Asp Ser Arg Val Ser Pro Val Ser Glu Asn Gln
530 535 540
Val Ala Ser Ser Glu Gly Tyr Val Leu Phe Tyr Gln Leu Met Gln Glu
545 550 555 560
Pro Pro Arg Cys Leu
565
<210> 167
<211> 1695
<212> DNA
<213> Homo sapiens
<400> 167
atgccccagg cctctgagca ccgcctgggc cgtacccgag agccacctgt taatatccag 60
ccccgagtgg gatccaagct accatttgcc cccagggccc gcagcaagga gcgcagaaac 120
ccagcctctg ggccaaaccc catgttacga cctctgcctc cccggccagg tctgcctgat 180
gaacggctca agaaactgga gctgggacgg ggacggacct caggccctcg tcccagaggc 240
ccccttcgag cagatcatgg ggttcccctg cctggctcac cacccccaac agtggctttg 300
cctctcccat ctcggaccaa cttagcccgt tccaagtctg tgagcagtgg ggacttgcgt 360
ccaatgggga ttgccttggg agggcaccgt ggcaccggag agcttggggc tgcactgagc 420
cgcttggccc tccggcctga gccacccact ttgagacgta gcacttctct ccgccgccta 480
gggggctttc ctggaccccc taccctgttc agcatacgga cagagccccc tgcttcccat 540
ggctccttcc acatgatatc cgcccggtcc tctgagcctt tctactctga tgacaagatg 600
gctcatcaca cactccttct gggctctggt catgttggcc ttcgaaacct gggaaacacg 660
tgcttcctga atgctgtgct gcagtgtctg agcagcactc gacctcttcg ggacttctgt 720
ctgagaaggg acttccggca agaggtgcct ggaggaggcc gagcccaaga gctcactgaa 780
gcctttgcag atgtgattgg tgccctctgg caccctgact cctgcgaagc tgtgaatcct 840
actcgattcc gagctgtctt ccagaaatat gttccctcct tctctggata cagccagcag 900
gatgcccaag agttcctgaa gctcctcatg gagcggctac accttgaaat caaccgccga 960
ggccgccggg ctccaccgat acttgccaat ggtccagttc cctctccacc ccgccgagga 1020
ggggctctgc tagaagaacc tgagttaagt gatgatgacc gagccaacct aatgtggaaa 1080
cgttacctgg agcgagagga cagcaagatt gtggacctgt ttgtgggcca gttgaaaagt 1140
tgtctcaagt gccaggcctg tgggtatcgc tccacgacct tcgaggtttt ttgtgacctg 1200
tccctgccca tccccaagaa aggatttgct gggggcaagg tgtctctgcg ggattgtttc 1260
aaccttttca ctaaggaaga agagctagag tcggagaatg ccccagtgtg tgaccgatgt 1320
cggcagaaaa ctcgaagtac caaaaagttg acagtacaaa gattccctcg aatcctcgtg 1380
ctccatctga atcgattttc tgcctcccga ggctccatca aaaaaagttc agtaggtgta 1440
gactttccac tgcagcgact gagcctaggg gactttgcca gtgacaaagc cggaagtcct 1500
gtataccagc tgtatgccct ttgcaaccac tcaggcagcg tccactatgg ccactacaca 1560
gccctgtgcc ggtgccagac tggttggcat gtctacaatg actctcgtgt ctcccctgtc 1620
agtgaaaacc aggtggcatc cagcgagggc tacgtgctgt tctaccaact gatgcaggag 1680
ccaccccggt gcctg 1695
<210> 168
<211> 1695
<212> RNA
<213> Homo sapiens
<400> 168
augccccagg ccucugagca ccgccugggc cguacccgag agccaccugu uaauauccag 60
ccccgagugg gauccaagcu accauuugcc cccagggccc gcagcaagga gcgcagaaac 120
ccagccucug ggccaaaccc cauguuacga ccucugccuc cccggccagg ucugccugau 180
gaacggcuca agaaacugga gcugggacgg ggacggaccu caggcccucg ucccagaggc 240
ccccuucgag cagaucaugg gguuccccug ccuggcucac cacccccaac aguggcuuug 300
ccucucccau cucggaccaa cuuagcccgu uccaagucug ugagcagugg ggacuugcgu 360
ccaaugggga uugccuuggg agggcaccgu ggcaccggag agcuuggggc ugcacugagc 420
cgcuuggccc uccggccuga gccacccacu uugagacgua gcacuucucu ccgccgccua 480
gggggcuuuc cuggaccccc uacccuguuc agcauacgga cagagccccc ugcuucccau 540
ggcuccuucc acaugauauc cgcccggucc ucugagccuu ucuacucuga ugacaagaug 600
gcucaucaca cacuccuucu gggcucuggu cauguuggcc uucgaaaccu gggaaacacg 660
ugcuuccuga augcugugcu gcagugucug agcagcacuc gaccucuucg ggacuucugu 720
cugagaaggg acuuccggca agaggugccu ggaggaggcc gagcccaaga gcucacugaa 780
gccuuugcag augugauugg ugcccucugg cacccugacu ccugcgaagc ugugaauccu 840
acucgauucc gagcugucuu ccagaaauau guucccuccu ucucuggaua cagccagcag 900
gaugcccaag aguuccugaa gcuccucaug gagcggcuac accuugaaau caaccgccga 960
ggccgccggg cuccaccgau acuugccaau gguccaguuc ccucuccacc ccgccgagga 1020
ggggcucugc uagaagaacc ugaguuaagu gaugaugacc gagccaaccu aauguggaaa 1080
cguuaccugg agcgagagga cagcaagauu guggaccugu uugugggcca guugaaaagu 1140
ugucucaagu gccaggccug uggguaucgc uccacgaccu ucgagguuuu uugugaccug 1200
ucccugccca uccccaagaa aggauuugcu gggggcaagg ugucucugcg ggauuguuuc 1260
aaccuuuuca cuaaggaaga agagcuagag ucggagaaug ccccagugug ugaccgaugu 1320
cggcagaaaa cucgaaguac caaaaaguug acaguacaaa gauucccucg aauccucgug 1380
cuccaucuga aucgauuuuc ugccucccga ggcuccauca aaaaaaguuc aguaggugua 1440
gacuuuccac ugcagcgacu gagccuaggg gacuuugcca gugacaaagc cggaaguccu 1500
guauaccagc uguaugcccu uugcaaccac ucaggcagcg uccacuaugg ccacuacaca 1560
gcccugugcc ggugccagac ugguuggcau gucuacaaug acucucgugu cuccccuguc 1620
agugaaaacc agguggcauc cagcgagggc uacgugcugu ucuaccaacu gaugcaggag 1680
ccaccccggu gccug 1695
<210> 169
<211> 1698
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding USP21
<400> 169
atgcctcagg cctctgagca cagactgggc agaaccagag aacctcctgt gaacatccag 60
cctagagtgg gcagcaagct gcccttcgct cctagagcca gaagcaaaga gcggagaaac 120
cctgccagcg gacccaatcc tatgctgagg cctttgcctc ctagacctgg cctgcctgac 180
gagagactga agaagctgga actcggcaga ggcagaacaa gcggccctag acctagagga 240
cctctgagag ctgatcacgg cgttccactg cctggaagcc ctccacctac agttgctctg 300
ccactgccta gcaggaccaa cctggccaga tctaagagcg tgtccagcgg cgatctgcgg 360
cctatgggaa ttgccctcgg aggccataga ggaacaggcg aacttggagc cgctctgagc 420
agactggccc tcagacctga acctcctaca ctgagaagaa gcaccagcct gagaaggctc 480
ggcggctttc ctggaccacc aacactgttc agcatccgga cagagcctcc agccagccac 540
ggcagctttc acatgatcag cgccagatcc agcgagccct tctacagcga cgacaagatg 600
gcccaccaca cactgctgct cggctctgga catgtgggcc tgagaaacct gggcaatacc 660
tgcttcctga atgccgtgct gcagtgcctg agcagcacaa gacccctgag agacttctgc 720
ctgcggcggg actttagaca agaagtgcct ggcggaggca gagcccaaga actgacagag 780
gctttcgccg atgtgatcgg agccctgtgg caccctgatt cttgcgaggc cgtgaatccc 840
accagattcc gggccgtgtt ccagaaatac gtgcccagct ttagcggcta cagccagcag 900
gatgcccaag agttcctgaa gctgctgatg gaacggctgc acctggaaat caacagaaga 960
ggcagacggg cccctcctat cctggctaat ggacctgttc ctagtcctcc tagaagaggc 1020
ggcgctctgc tggaagaacc tgagctgagc gacgacgaca gagccaacct gatgtggaag 1080
agatacctgg aacgcgagga cagcaagatc gtggatctgt tcgtgggcca gctgaagtcc 1140
tgcctgaagt gtcaggcctg tggctacagg tccaccacct tcgaggtgtt ctgcgacctg 1200
tctctgccca ttcctaagaa gggctttgcc ggcggaaagg tgtccctgag ggactgcttc 1260
aacctgttca ccaaagagga agaactcgag agcgagaacg cccctgtgtg cgacagatgc 1320
cggcagaaaa cccggtccac caagaaactg accgtgcagc ggttccccag aatcctggtg 1380
ctgcatctga acagattctc cgccagccgg ggcagcatca agaaaagctc tgtgggcgtc 1440
gacttcccac tgcagcgact gagcctgggc gatttcgcct ctgataaggc cggctctcct 1500
gtgtaccagc tgtacgccct gtgtaaccac agcggctctg tgcactacgg ccactacacc 1560
gctctgtgta gatgccagac aggctggcac gtgtacaacg acagcagagt gtcccctgtg 1620
tccgagaatc aggtggccag ctctgagggc tacgtgctgt tctaccagct gatgcaagag 1680
cctcctcggt gcctgtga 1698
<210> 170
<211> 1698
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding USP21
<400> 170
augccucagg ccucugagca cagacugggc agaaccagag aaccuccugu gaacauccag 60
ccuagagugg gcagcaagcu gcccuucgcu ccuagagcca gaagcaaaga gcggagaaac 120
ccugccagcg gacccaaucc uaugcugagg ccuuugccuc cuagaccugg ccugccugac 180
gagagacuga agaagcugga acucggcaga ggcagaacaa gcggcccuag accuagagga 240
ccucugagag cugaucacgg cguuccacug ccuggaagcc cuccaccuac aguugcucug 300
ccacugccua gcaggaccaa ccuggccaga ucuaagagcg uguccagcgg cgaucugcgg 360
ccuaugggaa uugcccucgg aggccauaga ggaacaggcg aacuuggagc cgcucugagc 420
agacuggccc ucagaccuga accuccuaca cugagaagaa gcaccagccu gagaaggcuc 480
ggcggcuuuc cuggaccacc aacacuguuc agcauccgga cagagccucc agccagccac 540
ggcagcuuuc acaugaucag cgccagaucc agcgagcccu ucuacagcga cgacaagaug 600
gcccaccaca cacugcugcu cggcucugga caugugggcc ugagaaaccu gggcaauacc 660
ugcuuccuga augccgugcu gcagugccug agcagcacaa gaccccugag agacuucugc 720
cugcggcggg acuuuagaca agaagugccu ggcggaggca gagcccaaga acugacagag 780
gcuuucgccg augugaucgg agcccugugg cacccugauu cuugcgaggc cgugaauccc 840
accagauucc gggccguguu ccagaaauac gugcccagcu uuagcggcua cagccagcag 900
gaugcccaag aguuccugaa gcugcugaug gaacggcugc accuggaaau caacagaaga 960
ggcagacggg ccccuccuau ccuggcuaau ggaccuguuc cuaguccucc uagaagaggc 1020
ggcgcucugc uggaagaacc ugagcugagc gacgacgaca gagccaaccu gauguggaag 1080
agauaccugg aacgcgagga cagcaagauc guggaucugu ucgugggcca gcugaagucc 1140
ugccugaagu gucaggccug uggcuacagg uccaccaccu ucgagguguu cugcgaccug 1200
ucucugccca uuccuaagaa gggcuuugcc ggcggaaagg ugucccugag ggacugcuuc 1260
aaccuguuca ccaaagagga agaacucgag agcgagaacg ccccugugug cgacagaugc 1320
cggcagaaaa cccgguccac caagaaacug accgugcagc gguuccccag aauccuggug 1380
cugcaucuga acagauucuc cgccagccgg ggcagcauca agaaaagcuc ugugggcguc 1440
gacuucccac ugcagcgacu gagccugggc gauuucgccu cugauaaggc cggcucuccu 1500
guguaccagc uguacgcccu guguaaccac agcggcucug ugcacuacgg ccacuacacc 1560
gcucugugua gaugccagac aggcuggcac guguacaacg acagcagagu guccccugug 1620
uccgagaauc agguggccag cucugagggc uacgugcugu ucuaccagcu gaugcaagag 1680
ccuccucggu gccuguga 1698
<210> 171
<211> 438
<212> PRT
<213> Homo sapiens
<400> 171
Met Cys Lys Asp Tyr Val Tyr Asp Lys Asp Ile Glu Gln Ile Ala Lys
1 5 10 15
Glu Glu Gln Gly Glu Ala Leu Lys Leu Gln Ala Ser Thr Ser Thr Glu
20 25 30
Val Ser His Gln Gln Cys Ser Val Pro Gly Leu Gly Glu Lys Phe Pro
35 40 45
Thr Trp Glu Thr Thr Lys Pro Glu Leu Glu Leu Leu Gly His Asn Pro
50 55 60
Arg Arg Arg Arg Ile Thr Ser Ser Phe Thr Ile Gly Leu Arg Gly Leu
65 70 75 80
Ile Asn Leu Gly Asn Thr Cys Phe Met Asn Cys Ile Val Gln Ala Leu
85 90 95
Thr His Thr Pro Ile Leu Arg Asp Phe Phe Leu Ser Asp Arg His Arg
100 105 110
Cys Glu Met Pro Ser Pro Glu Leu Cys Leu Val Cys Glu Met Ser Ser
115 120 125
Leu Phe Arg Glu Leu Tyr Ser Gly Asn Pro Ser Pro His Val Pro Tyr
130 135 140
Lys Leu Leu His Leu Val Trp Ile His Ala Arg His Leu Ala Gly Tyr
145 150 155 160
Arg Gln Gln Asp Ala His Glu Phe Leu Ile Ala Ala Leu Asp Val Leu
165 170 175
His Arg His Cys Lys Gly Asp Asp Val Gly Lys Ala Ala Asn Asn Pro
180 185 190
Asn His Cys Asn Cys Ile Ile Asp Gln Ile Phe Thr Gly Gly Leu Gln
195 200 205
Ser Asp Val Thr Cys Gln Ala Cys His Gly Val Ser Thr Thr Ile Asp
210 215 220
Pro Cys Trp Asp Ile Ser Leu Asp Leu Pro Gly Ser Cys Thr Ser Phe
225 230 235 240
Trp Pro Met Ser Pro Gly Arg Glu Ser Ser Val Asn Gly Glu Ser His
245 250 255
Ile Pro Gly Ile Thr Thr Leu Thr Asp Cys Leu Arg Arg Phe Thr Arg
260 265 270
Pro Glu His Leu Gly Ser Ser Ala Lys Ile Lys Cys Gly Ser Cys Gln
275 280 285
Ser Tyr Gln Glu Ser Thr Lys Gln Leu Thr Met Asn Lys Leu Pro Val
290 295 300
Val Ala Cys Phe His Phe Lys Arg Phe Glu His Ser Ala Lys Gln Arg
305 310 315 320
Arg Lys Ile Thr Thr Tyr Ile Ser Phe Pro Leu Glu Leu Asp Met Thr
325 330 335
Pro Phe Met Ala Ser Ser Lys Glu Ser Arg Met Asn Gly Gln Leu Gln
340 345 350
Leu Pro Thr Asn Ser Gly Asn Asn Glu Asn Lys Tyr Ser Leu Phe Ala
355 360 365
Val Val Asn His Gln Gly Thr Leu Glu Ser Gly His Tyr Thr Ser Phe
370 375 380
Ile Arg His His Lys Asp Gln Trp Phe Lys Cys Asp Asp Ala Val Ile
385 390 395 400
Thr Lys Ala Ser Ile Lys Asp Val Leu Asp Ser Glu Gly Tyr Leu Leu
405 410 415
Phe Tyr His Lys Gln Val Leu Glu His Glu Ser Glu Lys Val Lys Glu
420 425 430
Met Asn Thr Gln Ala Tyr
435
<210> 172
<211> 1314
<212> DNA
<213> Homo sapiens
<400> 172
atgtgtaagg actatgtata tgacaaagac attgagcaaa ttgccaaaga agagcaagga 60
gaagctttga aattacaagc ctccacctca acagaggttt ctcaccagca gtgttcagtg 120
ccaggccttg gtgagaaatt cccaacctgg gaaacaacca aaccagaatt agaactgctg 180
gggcacaacc cgaggagaag aagaatcacc tccagcttta cgatcggttt aagaggactc 240
atcaatcttg gcaacacgtg ctttatgaac tgcattgtcc aggccctcac ccacacgccg 300
atactgagag atttctttct ctctgacagg caccgatgtg agatgccgag tcccgagttg 360
tgtctggtct gtgagatgtc gtcgctgttt cgggagttgt attctggaaa cccgtctcct 420
catgtgccct ataagttact gcacctggtg tggatacatg cccgccattt agcagggtac 480
aggcaacagg atgcccacga gttcctcatt gcagcgttag atgtcctgca caggcactgc 540
aaaggtgatg atgtcgggaa ggcggccaac aatcccaacc actgtaactg catcatagac 600
caaatcttca caggtggcct gcagtctgat gtcacctgtc aagcctgcca tggcgtctcc 660
accacgatag acccatgctg ggacattagt ttggacttgc ctggctcttg cacctccttc 720
tggcccatga gcccagggag ggagagcagt gtgaacgggg aaagccacat accaggaatc 780
accaccctca cggactgctt gcggaggttt acgaggccag agcacttagg aagcagtgcc 840
aaaatcaaat gtggtagttg ccaaagctac caggaatcta ccaaacagct cacaatgaat 900
aaattacctg tcgttgcctg ttttcatttc aaacggtttg aacattcagc gaaacagagg 960
cgcaagatca ctacatacat ttcctttcct ctggagctgg atatgacgcc gtttatggcc 1020
tcaagtaaag agagcagaat gaatggacaa ttgcagctgc caaccaatag tggaaacaac 1080
gaaaataagt attccttgtt tgctgtggtt aatcaccaag gaaccttgga gagtggccac 1140
tataccagct tcatccggca ccacaaggac cagtggttca agtgtgatga tgccgtcatc 1200
actaaggcca gtattaagga cgtactggac agtgaagggt atttactgtt ctatcacaaa 1260
caggtgctag aacatgagtc agaaaaagtg aaagaaatga acacacaagc ctac 1314
<210> 173
<211> 1314
<212> RNA
<213> Homo sapiens
<400> 173
auguguaagg acuauguaua ugacaaagac auugagcaaa uugccaaaga agagcaagga 60
gaagcuuuga aauuacaagc cuccaccuca acagagguuu cucaccagca guguucagug 120
ccaggccuug gugagaaauu cccaaccugg gaaacaacca aaccagaauu agaacugcug 180
gggcacaacc cgaggagaag aagaaucacc uccagcuuua cgaucgguuu aagaggacuc 240
aucaaucuug gcaacacgug cuuuaugaac ugcauugucc aggcccucac ccacacgccg 300
auacugagag auuucuuucu cucugacagg caccgaugug agaugccgag ucccgaguug 360
ugucuggucu gugagauguc gucgcuguuu cgggaguugu auucuggaaa cccgucuccu 420
caugugcccu auaaguuacu gcaccuggug uggauacaug cccgccauuu agcaggguac 480
aggcaacagg augcccacga guuccucauu gcagcguuag auguccugca caggcacugc 540
aaaggugaug augucgggaa ggcggccaac aaucccaacc acuguaacug caucauagac 600
caaaucuuca cagguggccu gcagucugau gucaccuguc aagccugcca uggcgucucc 660
accacgauag acccaugcug ggacauuagu uuggacuugc cuggcucuug caccuccuuc 720
uggcccauga gcccagggag ggagagcagu gugaacgggg aaagccacau accaggaauc 780
accacccuca cggacugcuu gcggagguuu acgaggccag agcacuuagg aagcagugcc 840
aaaaucaaau gugguaguug ccaaagcuac caggaaucua ccaaacagcu cacaaugaau 900
aaauuaccug ucguugccug uuuucauuuc aaacgguuug aacauucagc gaaacagagg 960
cgcaagauca cuacauacau uuccuuuccu cuggagcugg auaugacgcc guuuauggcc 1020
ucaaguaaag agagcagaau gaauggacaa uugcagcugc caaccaauag uggaaacaac 1080
gaaaauaagu auuccuuguu ugcugugguu aaucaccaag gaaccuugga gaguggccac 1140
uauaccagcu ucauccggca ccacaaggac cagugguuca agugugauga ugccgucauc 1200
acuaaggcca guauuaagga cguacuggac agugaagggu auuuacuguu cuaucacaaa 1260
caggugcuag aacaugaguc agaaaaagug aaagaaauga acacacaagc cuac 1314
<210> 174
<211> 1317
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding USP27
<400> 174
atgtgcaagg actacgtgta cgacaaggac atcgagcaga tcgccaaaga ggaacagggc 60
gaagccctga agctgcaggc cagcacatct acagaggtgt cccaccagca gtgtagcgtg 120
ccaggactgg gcgagaagtt ccctacctgg gaaaccacca agcctgagct ggaactgctg 180
ggccacaatc ctcggcggag aagaatcacc agcagcttca ccatcggcct gcggggcctg 240
atcaatctgg gcaatacctg cttcatgaac tgcatcgtgc aggccctgac acacacccct 300
atcctgagag acttcttcct gtccgaccgg cacagatgcg agatgccttc tccagagctg 360
tgcctcgtgt gcgagatgag cagcctgttc cgggaactgt acagcggcaa cccttctcct 420
cacgtgccct acaaactgct gcacctcgtg tggattcacg ccagacacct ggccggctac 480
agacagcagg atgcccacga gtttctgatc gccgctctgg acgtgctgca cagacactgc 540
aaaggcgacg atgtgggcaa agccgccaac aatcccaacc actgcaactg catcatcgac 600
cagatcttca caggcggcct gcagagcgac gttacctgtc aagcttgtca cggcgtgtcc 660
accaccatcg atccctgctg ggatatcagc ctggatctgc ctggcagctg caccagcttt 720
tggcctatga gccctggcag agaaagcagc gtgaacggcg agtctcacat ccccggcatc 780
accacactga ccgactgcct gcggagattc accagacctg agcacctggg aagcagcgcc 840
aagatcaagt gtggctcctg ccagagctac caagagagca ccaagcagct gaccatgaac 900
aagctgcctg tggtggcctg cttccacttc aagagattcg agcactccgc caagcagcgg 960
cggaagatca caacctacat cagcttccct ctggaactgg acatgacccc tttcatggcc 1020
agcagcaaag aaagccggat gaacggccag ctccagctgc ctacaaatag cggcaacaac 1080
gagaacaagt actccctgtt cgccgtggtc aaccaccagg gcacactgga aagcggccac 1140
tacaccagct tcatcagaca ccacaaggac cagtggttca agtgcgacga cgccgtgatc 1200
accaaggcca gcatcaagga tgtcctggac agcgagggct acctgctgtt ctaccacaaa 1260
caggtgctgg aacacgagag cgagaaagtg aaagagatga acacccaggc ctactga 1317
<210> 175
<211> 1317
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding USP27
<400> 175
augugcaagg acuacgugua cgacaaggac aucgagcaga ucgccaaaga ggaacagggc 60
gaagcccuga agcugcaggc cagcacaucu acagaggugu cccaccagca guguagcgug 120
ccaggacugg gcgagaaguu cccuaccugg gaaaccacca agccugagcu ggaacugcug 180
ggccacaauc cucggcggag aagaaucacc agcagcuuca ccaucggccu gcggggccug 240
aucaaucugg gcaauaccug cuucaugaac ugcaucgugc aggcccugac acacaccccu 300
auccugagag acuucuuccu guccgaccgg cacagaugcg agaugccuuc uccagagcug 360
ugccucgugu gcgagaugag cagccuguuc cgggaacugu acagcggcaa cccuucuccu 420
cacgugcccu acaaacugcu gcaccucgug uggauucacg ccagacaccu ggccggcuac 480
agacagcagg augcccacga guuucugauc gccgcucugg acgugcugca cagacacugc 540
aaaggcgacg augugggcaa agccgccaac aaucccaacc acugcaacug caucaucgac 600
cagaucuuca caggcggccu gcagagcgac guuaccuguc aagcuuguca cggcgugucc 660
accaccaucg aucccugcug ggauaucagc cuggaucugc cuggcagcug caccagcuuu 720
uggccuauga gcccuggcag agaaagcagc gugaacggcg agucucacau ccccggcauc 780
accacacuga ccgacugccu gcggagauuc accagaccug agcaccuggg aagcagcgcc 840
aagaucaagu guggcuccug ccagagcuac caagagagca ccaagcagcu gaccaugaac 900
aagcugccug ugguggccug cuuccacuuc aagagauucg agcacuccgc caagcagcgg 960
cggaagauca caaccuacau cagcuucccu cuggaacugg acaugacccc uuucauggcc 1020
agcagcaaag aaagccggau gaacggccag cuccagcugc cuacaaauag cggcaacaac 1080
gagaacaagu acucccuguu cgccgugguc aaccaccagg gcacacugga aagcggccac 1140
uacaccagcu ucaucagaca ccacaaggac cagugguuca agugcgacga cgccgugauc 1200
accaaggcca gcaucaagga uguccuggac agcgagggcu accugcuguu cuaccacaaa 1260
caggugcugg aacacgagag cgagaaagug aaagagauga acacccaggc cuacuga 1317
<210> 176
<211> 956
<212> PRT
<213> Homo sapiens
<400> 176
Met Ser Ser Gly Leu Trp Ser Gln Glu Lys Val Thr Ser Pro Tyr Trp
1 5 10 15
Glu Glu Arg Ile Phe Tyr Leu Leu Leu Gln Glu Cys Ser Val Thr Asp
20 25 30
Lys Gln Thr Gln Lys Leu Leu Lys Val Pro Lys Gly Ser Ile Gly Gln
35 40 45
Tyr Ile Gln Asp Arg Ser Val Gly His Ser Arg Ile Pro Ser Ala Lys
50 55 60
Gly Lys Lys Asn Gln Ile Gly Leu Lys Ile Leu Glu Gln Pro His Ala
65 70 75 80
Val Leu Phe Val Asp Glu Lys Asp Val Val Glu Ile Asn Glu Lys Phe
85 90 95
Thr Glu Leu Leu Leu Ala Ile Thr Asn Cys Glu Glu Arg Phe Ser Leu
100 105 110
Phe Lys Asn Arg Asn Arg Leu Ser Lys Gly Leu Gln Ile Asp Val Gly
115 120 125
Cys Pro Val Lys Val Gln Leu Arg Ser Gly Glu Glu Lys Phe Pro Gly
130 135 140
Val Val Arg Phe Arg Gly Pro Leu Leu Ala Glu Arg Thr Val Ser Gly
145 150 155 160
Ile Phe Phe Gly Val Glu Leu Leu Glu Glu Gly Arg Gly Gln Gly Phe
165 170 175
Thr Asp Gly Val Tyr Gln Gly Lys Gln Leu Phe Gln Cys Asp Glu Asp
180 185 190
Cys Gly Val Phe Val Ala Leu Asp Lys Leu Glu Leu Ile Glu Asp Asp
195 200 205
Asp Thr Ala Leu Glu Ser Asp Tyr Ala Gly Pro Gly Asp Thr Met Gln
210 215 220
Val Glu Leu Pro Pro Leu Glu Ile Asn Ser Arg Val Ser Leu Lys Val
225 230 235 240
Gly Glu Thr Ile Glu Ser Gly Thr Val Ile Phe Cys Asp Val Leu Pro
245 250 255
Gly Lys Glu Ser Leu Gly Tyr Phe Val Gly Val Asp Met Asp Asn Pro
260 265 270
Ile Gly Asn Trp Asp Gly Arg Phe Asp Gly Val Gln Leu Cys Ser Phe
275 280 285
Ala Cys Val Glu Ser Thr Ile Leu Leu His Ile Asn Asp Ile Ile Pro
290 295 300
Ala Leu Ser Glu Ser Val Thr Gln Glu Arg Arg Pro Pro Lys Leu Ala
305 310 315 320
Phe Met Ser Arg Gly Val Gly Asp Lys Gly Ser Ser Ser His Asn Lys
325 330 335
Pro Lys Ala Thr Gly Ser Thr Ser Asp Pro Gly Asn Arg Asn Arg Ser
340 345 350
Glu Leu Phe Tyr Thr Leu Asn Gly Ser Ser Val Asp Ser Gln Pro Gln
355 360 365
Ser Lys Ser Lys Asn Thr Trp Tyr Ile Asp Glu Val Ala Glu Asp Pro
370 375 380
Ala Lys Ser Leu Thr Glu Ile Ser Thr Asp Phe Asp Arg Ser Ser Pro
385 390 395 400
Pro Leu Gln Pro Pro Pro Val Asn Ser Leu Thr Thr Glu Asn Arg Phe
405 410 415
His Ser Leu Pro Phe Ser Leu Thr Lys Met Pro Asn Thr Asn Gly Ser
420 425 430
Ile Gly His Ser Pro Leu Ser Leu Ser Ala Gln Ser Val Met Glu Glu
435 440 445
Leu Asn Thr Ala Pro Val Gln Glu Ser Pro Pro Leu Ala Met Pro Pro
450 455 460
Gly Asn Ser His Gly Leu Glu Val Gly Ser Leu Ala Glu Val Lys Glu
465 470 475 480
Asn Pro Pro Phe Tyr Gly Val Ile Arg Trp Ile Gly Gln Pro Pro Gly
485 490 495
Leu Asn Glu Val Leu Ala Gly Leu Glu Leu Glu Asp Glu Cys Ala Gly
500 505 510
Cys Thr Asp Gly Thr Phe Arg Gly Thr Arg Tyr Phe Thr Cys Ala Leu
515 520 525
Lys Lys Ala Leu Phe Val Lys Leu Lys Ser Cys Arg Pro Asp Ser Arg
530 535 540
Phe Ala Ser Leu Gln Pro Val Ser Asn Gln Ile Glu Arg Cys Asn Ser
545 550 555 560
Leu Ala Phe Gly Gly Tyr Leu Ser Glu Val Val Glu Glu Asn Thr Pro
565 570 575
Pro Lys Met Glu Lys Glu Gly Leu Glu Ile Met Ile Gly Lys Lys Lys
580 585 590
Gly Ile Gln Gly His Tyr Asn Ser Cys Tyr Leu Asp Ser Thr Leu Phe
595 600 605
Cys Leu Phe Ala Phe Ser Ser Val Leu Asp Thr Val Leu Leu Arg Pro
610 615 620
Lys Glu Lys Asn Asp Val Glu Tyr Tyr Ser Glu Thr Gln Glu Leu Leu
625 630 635 640
Arg Thr Glu Ile Val Asn Pro Leu Arg Ile Tyr Gly Tyr Val Cys Ala
645 650 655
Thr Lys Ile Met Lys Leu Arg Lys Ile Leu Glu Lys Val Glu Ala Ala
660 665 670
Ser Gly Phe Thr Ser Glu Glu Lys Asp Pro Glu Glu Phe Leu Asn Ile
675 680 685
Leu Phe His His Ile Leu Arg Val Glu Pro Leu Leu Lys Ile Arg Ser
690 695 700
Ala Gly Gln Lys Val Gln Asp Cys Tyr Phe Tyr Gln Ile Phe Met Glu
705 710 715 720
Lys Asn Glu Lys Val Gly Val Pro Thr Ile Gln Gln Leu Leu Glu Trp
725 730 735
Ser Phe Ile Asn Ser Asn Leu Lys Phe Ala Glu Ala Pro Ser Cys Leu
740 745 750
Ile Ile Gln Met Pro Arg Phe Gly Lys Asp Phe Lys Leu Phe Lys Lys
755 760 765
Ile Phe Pro Ser Leu Glu Leu Asn Ile Thr Asp Leu Leu Glu Asp Thr
770 775 780
Pro Arg Gln Cys Arg Ile Cys Gly Gly Leu Ala Met Tyr Glu Cys Arg
785 790 795 800
Glu Cys Tyr Asp Asp Pro Asp Ile Ser Ala Gly Lys Ile Lys Gln Phe
805 810 815
Cys Lys Thr Cys Asn Thr Gln Val His Leu His Pro Lys Arg Leu Asn
820 825 830
His Lys Tyr Asn Pro Val Ser Leu Pro Lys Asp Leu Pro Asp Trp Asp
835 840 845
Trp Arg His Gly Cys Ile Pro Cys Gln Asn Met Glu Leu Phe Ala Val
850 855 860
Leu Cys Ile Glu Thr Ser His Tyr Val Ala Phe Val Lys Tyr Gly Lys
865 870 875 880
Asp Asp Ser Ala Trp Leu Phe Phe Asp Ser Met Ala Asp Arg Asp Gly
885 890 895
Gly Gln Asn Gly Phe Asn Ile Pro Gln Val Thr Pro Cys Pro Glu Val
900 905 910
Gly Glu Tyr Leu Lys Met Ser Leu Glu Asp Leu His Ser Leu Asp Ser
915 920 925
Arg Arg Ile Gln Gly Cys Ala Arg Arg Leu Leu Cys Asp Ala Tyr Met
930 935 940
Cys Met Tyr Gln Ser Pro Thr Met Ser Leu Tyr Lys
945 950 955
<210> 177
<211> 2868
<212> DNA
<213> Homo sapiens
<400> 177
atgagttcag gcttatggag ccaagaaaaa gtcacttcac cctactggga agagcggatt 60
ttttacttgc ttcttcaaga atgcagcgtt acagacaaac aaacacaaaa gctccttaaa 120
gtaccgaagg gaagtatagg acagtatatt caagatcgtt ctgtggggca ttcaaggatt 180
ccttctgcaa aaggcaagaa aaatcagatt ggattaaaaa ttctagagca acctcatgca 240
gttctctttg ttgatgaaaa ggatgttgta gagataaatg aaaagttcac agagttactt 300
ttggcaatta ccaattgtga ggagaggttc agcctgttta aaaacagaaa cagactaagt 360
aaaggcctcc aaatagacgt gggctgtcct gtgaaagtac agctgagatc tggggaagaa 420
aaatttcctg gagttgtacg cttcagagga cccctgttag cagagaggac agtctccgga 480
atattctttg gagttgaatt gctggaagaa ggtcgtggtc aaggtttcac tgacggggtg 540
taccaaggga aacagctttt tcagtgtgat gaagattgtg gcgtgtttgt tgcattggac 600
aagctagaac tcatagaaga tgatgacact gcattggaaa gtgattacgc aggtcctggg 660
gacacaatgc aggtcgaact tcctcctttg gaaataaact ccagagtttc tttgaaggtt 720
ggagaaacaa tagaatctgg aacagttata ttctgtgatg ttttgccagg aaaagaaagc 780
ttaggatatt ttgttggtgt ggacatggat aaccctattg gcaactggga tggaagattt 840
gatggagtgc agctttgtag ttttgcgtgt gttgaaagta caattctatt gcacatcaat 900
gatatcatcc cagctttatc agagagtgtg acgcaggaaa ggaggcctcc caaacttgcc 960
tttatgtcaa gaggtgttgg ggacaaaggt tcatccagtc ataataaacc aaaggctaca 1020
ggatctacct cagaccctgg aaatagaaac agatctgaat tattttatac cttaaatggg 1080
tcttctgttg actcacaacc acaatccaaa tcaaaaaata catggtacat tgatgaagtt 1140
gcagaagacc ctgcaaaatc tcttacagag atatctacag actttgaccg ttcttcacca 1200
ccactccagc ctcctcctgt gaactcactg accaccgaga acagattcca ctctttacca 1260
ttcagtctca ccaagatgcc caataccaat ggaagtattg gccacagtcc actttctctg 1320
tcagcccagt ctgtaatgga agagctaaac actgcacccg tccaagagag tccacccttg 1380
gccatgcctc ctgggaactc acatggtcta gaagtgggct cattggctga agttaaggag 1440
aaccctcctt tctatggggt aatccgttgg atcggtcagc caccaggact gaatgaagtg 1500
ctcgctggac tggaactgga agatgagtgt gcaggctgta cggatggaac cttcagaggc 1560
actcggtatt tcacctgtgc cctgaagaag gcgctgtttg tgaaactgaa gagctgcagg 1620
cctgactcta ggtttgcatc attgcagccg gtttccaatc agattgagcg ctgtaactct 1680
ttagcatttg gaggctactt aagtgaagta gtagaagaaa atactccacc aaaaatggaa 1740
aaagaaggct tggagataat gattgggaag aagaaaggca tccagggtca ttacaattct 1800
tgttacttag actcaacctt attctgctta tttgctttta gttctgttct ggacactgtg 1860
ttacttagac ccaaagaaaa gaacgatgta gaatattata gtgaaaccca agagctactg 1920
aggacagaaa ttgttaatcc tctgagaata tatggatatg tgtgtgccac aaaaattatg 1980
aaactgagga aaatacttga aaaggtggag gctgcatcag gatttacctc tgaagaaaaa 2040
gatcctgagg aattcttgaa tattctgttt catcatattt taagggtaga acctttgcta 2100
aaaataagat cagcaggtca aaaggtacaa gattgttact tctatcaaat ttttatggaa 2160
aaaaatgaga aagttggcgt tcccacaatt cagcagttgt tagaatggtc ttttatcaac 2220
agtaacctga aatttgcaga ggcaccatca tgtctgatta ttcagatgcc tcgatttgga 2280
aaagacttta aactatttaa aaaaattttt ccttctctgg aattaaatat aacagattta 2340
cttgaagaca ctcccagaca gtgccggata tgtggagggc ttgcaatgta tgagtgtaga 2400
gaatgctacg acgatccgga catctcagct ggaaaaatca agcagttttg taaaacctgc 2460
aacactcaag tccaccttca tccgaagagg ctgaatcata aatataaccc agtgtcactt 2520
cccaaagact tacccgactg ggactggaga cacggctgca tcccttgcca gaatatggag 2580
ttatttgctg ttctctgcat agaaacaagc cactatgttg cttttgtgaa gtatgggaag 2640
gacgattctg cctggctctt ctttgacagc atggccgatc gggatggtgg tcagaatggc 2700
ttcaacattc ctcaagtcac cccatgccca gaagtaggag agtacttgaa gatgtctctg 2760
gaagacctgc attccttgga ctccaggaga atccaaggct gtgcacgaag actgctttgt 2820
gatgcatata tgtgcatgta ccagagtcca acaatgagtt tgtacaaa 2868
<210> 178
<211> 2868
<212> RNA
<213> Homo sapiens
<400> 178
augaguucag gcuuauggag ccaagaaaaa gucacuucac ccuacuggga agagcggauu 60
uuuuacuugc uucuucaaga augcagcguu acagacaaac aaacacaaaa gcuccuuaaa 120
guaccgaagg gaaguauagg acaguauauu caagaucguu cuguggggca uucaaggauu 180
ccuucugcaa aaggcaagaa aaaucagauu ggauuaaaaa uucuagagca accucaugca 240
guucucuuug uugaugaaaa ggauguugua gagauaaaug aaaaguucac agaguuacuu 300
uuggcaauua ccaauuguga ggagagguuc agccuguuua aaaacagaaa cagacuaagu 360
aaaggccucc aaauagacgu gggcuguccu gugaaaguac agcugagauc uggggaagaa 420
aaauuuccug gaguuguacg cuucagagga ccccuguuag cagagaggac agucuccgga 480
auauucuuug gaguugaauu gcuggaagaa ggucgugguc aagguuucac ugacggggug 540
uaccaaggga aacagcuuuu ucagugugau gaagauugug gcguguuugu ugcauuggac 600
aagcuagaac ucauagaaga ugaugacacu gcauuggaaa gugauuacgc agguccuggg 660
gacacaaugc aggucgaacu uccuccuuug gaaauaaacu ccagaguuuc uuugaagguu 720
ggagaaacaa uagaaucugg aacaguuaua uucugugaug uuuugccagg aaaagaaagc 780
uuaggauauu uuguuggugu ggacauggau aacccuauug gcaacuggga uggaagauuu 840
gauggagugc agcuuuguag uuuugcgugu guugaaagua caauucuauu gcacaucaau 900
gauaucaucc cagcuuuauc agagagugug acgcaggaaa ggaggccucc caaacuugcc 960
uuuaugucaa gagguguugg ggacaaaggu ucauccaguc auaauaaacc aaaggcuaca 1020
ggaucuaccu cagacccugg aaauagaaac agaucugaau uauuuuauac cuuaaauggg 1080
ucuucuguug acucacaacc acaauccaaa ucaaaaaaua caugguacau ugaugaaguu 1140
gcagaagacc cugcaaaauc ucuuacagag auaucuacag acuuugaccg uucuucacca 1200
ccacuccagc cuccuccugu gaacucacug accaccgaga acagauucca cucuuuacca 1260
uucagucuca ccaagaugcc caauaccaau ggaaguauug gccacagucc acuuucucug 1320
ucagcccagu cuguaaugga agagcuaaac acugcacccg uccaagagag uccacccuug 1380
gccaugccuc cugggaacuc acauggucua gaagugggcu cauuggcuga aguuaaggag 1440
aacccuccuu ucuauggggu aauccguugg aucggucagc caccaggacu gaaugaagug 1500
cucgcuggac uggaacugga agaugagugu gcaggcugua cggauggaac cuucagaggc 1560
acucgguauu ucaccugugc ccugaagaag gcgcuguuug ugaaacugaa gagcugcagg 1620
ccugacucua gguuugcauc auugcagccg guuuccaauc agauugagcg cuguaacucu 1680
uuagcauuug gaggcuacuu aagugaagua guagaagaaa auacuccacc aaaaauggaa 1740
aaagaaggcu uggagauaau gauugggaag aagaaaggca uccaggguca uuacaauucu 1800
uguuacuuag acucaaccuu auucugcuua uuugcuuuua guucuguucu ggacacugug 1860
uuacuuagac ccaaagaaaa gaacgaugua gaauauuaua gugaaaccca agagcuacug 1920
aggacagaaa uuguuaaucc ucugagaaua uauggauaug ugugugccac aaaaauuaug 1980
aaacugagga aaauacuuga aaagguggag gcugcaucag gauuuaccuc ugaagaaaaa 2040
gauccugagg aauucuugaa uauucuguuu caucauauuu uaaggguaga accuuugcua 2100
aaaauaagau cagcagguca aaagguacaa gauuguuacu ucuaucaaau uuuuauggaa 2160
aaaaaugaga aaguuggcgu ucccacaauu cagcaguugu uagaaugguc uuuuaucaac 2220
aguaaccuga aauuugcaga ggcaccauca ugucugauua uucagaugcc ucgauuugga 2280
aaagacuuua aacuauuuaa aaaaauuuuu ccuucucugg aauuaaauau aacagauuua 2340
cuugaagaca cucccagaca gugccggaua uguggagggc uugcaaugua ugaguguaga 2400
gaaugcuacg acgauccgga caucucagcu ggaaaaauca agcaguuuug uaaaaccugc 2460
aacacucaag uccaccuuca uccgaagagg cugaaucaua aauauaaccc agugucacuu 2520
cccaaagacu uacccgacug ggacuggaga cacggcugca ucccuugcca gaauauggag 2580
uuauuugcug uucucugcau agaaacaagc cacuauguug cuuuugugaa guaugggaag 2640
gacgauucug ccuggcucuu cuuugacagc auggccgauc gggauggugg ucagaauggc 2700
uucaacauuc cucaagucac cccaugccca gaaguaggag aguacuugaa gaugucucug 2760
gaagaccugc auuccuugga cuccaggaga auccaaggcu gugcacgaag acugcuuugu 2820
gaugcauaua ugugcaugua ccagagucca acaaugaguu uguacaaa 2868
<210> 179
<211> 2871
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding CYLD
<400> 179
atgtctagcg gcctgtggtc ccaagagaaa gtgacaagcc cctactggga agagaggatc 60
ttctacctgc tgctgcaaga gtgcagcgtg accgacaagc agacccagaa actgctgaag 120
gtgcccaagg gcagcatcgg ccagtacatc caggatagaa gcgtgggcca cagcagaatc 180
cctagcgcca agggcaagaa gaaccagatc ggcctgaaga tcctggaaca gcctcacgcc 240
gtgctgttcg tggacgagaa ggacgtggtg gaaatcaacg agaagttcac cgagctgctg 300
ctggccatca ccaactgcga ggaacggttc agcctgttca agaaccggaa ccggctgagc 360
aagggcctgc agatcgatgt gggatgccct gtgaaggtgc agctgagaag cggcgaagag 420
aagttccctg gcgtcgtgcg gtttagagga cctctgctgg ccgagagaac cgtgtccggc 480
atcttctttg gcgtggaact gctggaagaa ggcagaggcc agggctttac cgatggcgtg 540
taccagggca agcagctgtt tcagtgcgac gaggattgcg gcgtgttcgt ggccctggat 600
aagctggaac tgatcgagga cgacgacaca gccctggaaa gcgattatgc cggacctggc 660
gataccatgc aggtcgaact gcctccactc gagatcaaca gccgggtgtc cctgaaagtg 720
ggcgagacaa tcgagagcgg caccgtgatc ttttgcgacg tgctgcctgg caaagagtcc 780
ctgggctatt ttgtgggcgt cgacatggac aaccccatcg gcaattggga cggcagattt 840
gacggcgtgc agctgtgcag cttcgcctgt gtggaaagca ccatcctgct gcacatcaac 900
gacatcatcc ccgctctgag cgagagcgtg acccaagaaa gacggcctcc taagctggcc 960
ttcatgtcta gaggcgtggg cgataagggc agctccagcc acaacaagcc taaggccaca 1020
ggctccacaa gcgaccccgg caacagaaac agaagcgagc tgttctacac cctgaacggc 1080
agcagcgtgg acagccagcc tcagagcaag agcaagaaca cctggtacat cgacgaggtg 1140
gccgaggatc ctgccaagag cctgacagag atcagcaccg acttcgacag aagcagccct 1200
ccactgcagc ctccacctgt gaatagcctg accaccgaga acagattcca cagcctgcct 1260
ttcagcctga ctaagatgcc caacaccaac ggctccatcg ggcactctcc actgtctctg 1320
tctgcccaga gcgtgatgga agaactgaac acagcccctg tgcaagagtc ccctcctctg 1380
gctatgcctc ctggcaattc tcacggcctg gaagtgggat ctctggccga agtgaaagag 1440
aaccctcctt tctacggcgt gatccggtgg atcggacaac ctcctggact gaatgaagtg 1500
ctggccggac tggaactgga agatgagtgt gccggctgca ccgacggcac ctttagaggc 1560
accagatact tcacatgcgc cctgaagaaa gccctgttcg tgaagctgaa gtcctgcaga 1620
cccgacagca gattcgctag cctgcagcct gtgtccaatc agatcgagcg gtgcaactcc 1680
ctggcctttg gcggctatct gtccgaggtg gtggaagaga acacccctcc taagatggaa 1740
aaagagggcc tcgagattat gatcgggaag aagaagggca tccaggggca ctacaatagc 1800
tgctacctgg acagcaccct gttctgcctg ttcgccttta gcagcgtgct ggacactgtg 1860
ctgctgcggc ccaaagagaa gaacgacgtc gagtactaca gcgagacaca agagctgctg 1920
agaaccgaga tcgtgaaccc tctgcggatc tacggctacg tgtgcgccac caagatcatg 1980
aagctgcgga agattctgga aaaggtggaa gccgcctccg gcttcaccag cgaggaaaag 2040
gatcccgaag agttcctgaa catcctgttt caccacatcc tgagagtgga acccctgctg 2100
aagatcagat ccgccggaca gaaagtgcag gactgctact tctaccagat cttcatggaa 2160
aagaacgaga aagtcggcgt gcccaccatc cagcaactgc tcgagtggtc cttcatcaac 2220
agcaacctga agttcgccga ggctcccagc tgcctgatca tccagatgcc tagattcggc 2280
aaggacttca agctgttcaa aaagatcttc cccagcctcg agctgaacat caccgacctg 2340
ctcgaggaca cccctcggca gtgtagaatt tgtggcggcc tggctatgta cgagtgcaga 2400
gagtgctacg acgaccccga tatcagcgcc ggcaagatca agcagttctg caagacctgc 2460
aacacccaag tgcatctgca ccccaagcgg ctgaaccaca agtacaaccc cgtgtctctg 2520
cccaaggacc tgcctgactg ggattggaga cacggctgta tcccttgcca gaacatggaa 2580
ctgttcgctg tgctgtgcat cgagacaagc cactacgtgg ccttcgtgaa gtacggcaag 2640
gatgacagcg cctggctgtt cttcgacagc atggccgata gagatggcgg ccagaacggc 2700
ttcaacatcc ctcaagtgac cccttgtcct gaagtgggag agtacctgaa gatgagcctg 2760
gaagatctgc acagcctgga ctccagacgg atccagggat gtgctagaag gctgctgtgc 2820
gacgcctaca tgtgcatgta tcagagcccc accatgagcc tgtacaagtg a 2871
<210> 180
<211> 2871
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding CYLD
<400> 180
augucuagcg gccugugguc ccaagagaaa gugacaagcc ccuacuggga agagaggauc 60
uucuaccugc ugcugcaaga gugcagcgug accgacaagc agacccagaa acugcugaag 120
gugcccaagg gcagcaucgg ccaguacauc caggauagaa gcgugggcca cagcagaauc 180
ccuagcgcca agggcaagaa gaaccagauc ggccugaaga uccuggaaca gccucacgcc 240
gugcuguucg uggacgagaa ggacguggug gaaaucaacg agaaguucac cgagcugcug 300
cuggccauca ccaacugcga ggaacgguuc agccuguuca agaaccggaa ccggcugagc 360
aagggccugc agaucgaugu gggaugcccu gugaaggugc agcugagaag cggcgaagag 420
aaguucccug gcgucgugcg guuuagagga ccucugcugg ccgagagaac cguguccggc 480
aucuucuuug gcguggaacu gcuggaagaa ggcagaggcc agggcuuuac cgauggcgug 540
uaccagggca agcagcuguu ucagugcgac gaggauugcg gcguguucgu ggcccuggau 600
aagcuggaac ugaucgagga cgacgacaca gcccuggaaa gcgauuaugc cggaccuggc 660
gauaccaugc aggucgaacu gccuccacuc gagaucaaca gccggguguc ccugaaagug 720
ggcgagacaa ucgagagcgg caccgugauc uuuugcgacg ugcugccugg caaagagucc 780
cugggcuauu uugugggcgu cgacauggac aaccccaucg gcaauuggga cggcagauuu 840
gacggcgugc agcugugcag cuucgccugu guggaaagca ccauccugcu gcacaucaac 900
gacaucaucc ccgcucugag cgagagcgug acccaagaaa gacggccucc uaagcuggcc 960
uucaugucua gaggcguggg cgauaagggc agcuccagcc acaacaagcc uaaggccaca 1020
ggcuccacaa gcgaccccgg caacagaaac agaagcgagc uguucuacac ccugaacggc 1080
agcagcgugg acagccagcc ucagagcaag agcaagaaca ccugguacau cgacgaggug 1140
gccgaggauc cugccaagag ccugacagag aucagcaccg acuucgacag aagcagcccu 1200
ccacugcagc cuccaccugu gaauagccug accaccgaga acagauucca cagccugccu 1260
uucagccuga cuaagaugcc caacaccaac ggcuccaucg ggcacucucc acugucucug 1320
ucugcccaga gcgugaugga agaacugaac acagccccug ugcaagaguc cccuccucug 1380
gcuaugccuc cuggcaauuc ucacggccug gaagugggau cucuggccga agugaaagag 1440
aacccuccuu ucuacggcgu gauccggugg aucggacaac cuccuggacu gaaugaagug 1500
cuggccggac uggaacugga agaugagugu gccggcugca ccgacggcac cuuuagaggc 1560
accagauacu ucacaugcgc ccugaagaaa gcccuguucg ugaagcugaa guccugcaga 1620
cccgacagca gauucgcuag ccugcagccu guguccaauc agaucgagcg gugcaacucc 1680
cuggccuuug gcggcuaucu guccgaggug guggaagaga acaccccucc uaagauggaa 1740
aaagagggcc ucgagauuau gaucgggaag aagaagggca uccaggggca cuacaauagc 1800
ugcuaccugg acagcacccu guucugccug uucgccuuua gcagcgugcu ggacacugug 1860
cugcugcggc ccaaagagaa gaacgacguc gaguacuaca gcgagacaca agagcugcug 1920
agaaccgaga ucgugaaccc ucugcggauc uacggcuacg ugugcgccac caagaucaug 1980
aagcugcgga agauucugga aaagguggaa gccgccuccg gcuucaccag cgaggaaaag 2040
gaucccgaag aguuccugaa cauccuguuu caccacaucc ugagagugga accccugcug 2100
aagaucagau ccgccggaca gaaagugcag gacugcuacu ucuaccagau cuucauggaa 2160
aagaacgaga aagucggcgu gcccaccauc cagcaacugc ucgagugguc cuucaucaac 2220
agcaaccuga aguucgccga ggcucccagc ugccugauca uccagaugcc uagauucggc 2280
aaggacuuca agcuguucaa aaagaucuuc cccagccucg agcugaacau caccgaccug 2340
cucgaggaca ccccucggca guguagaauu uguggcggcc uggcuaugua cgagugcaga 2400
gagugcuacg acgaccccga uaucagcgcc ggcaagauca agcaguucug caagaccugc 2460
aacacccaag ugcaucugca ccccaagcgg cugaaccaca aguacaaccc cgugucucug 2520
cccaaggacc ugccugacug ggauuggaga cacggcugua ucccuugcca gaacauggaa 2580
cuguucgcug ugcugugcau cgagacaagc cacuacgugg ccuucgugaa guacggcaag 2640
gaugacagcg ccuggcuguu cuucgacagc auggccgaua gagauggcgg ccagaacggc 2700
uucaacaucc cucaagugac cccuuguccu gaagugggag aguaccugaa gaugagccug 2760
gaagaucugc acagccugga cuccagacgg auccagggau gugcuagaag gcugcugugc 2820
gacgccuaca ugugcaugua ucagagcccc accaugagcc uguacaagug a 2871
<210> 181
<211> 678
<212> PRT
<213> Homo sapiens
<400> 181
Met Glu Leu Arg Ser Tyr Gln Trp Glu Val Ile Met Pro Ala Leu Glu
1 5 10 15
Gly Lys Asn Ile Ile Ile Trp Leu Pro Thr Gly Ala Gly Lys Thr Arg
20 25 30
Ala Ala Ala Tyr Val Ala Lys Arg His Leu Glu Thr Val Asp Gly Ala
35 40 45
Lys Val Val Val Leu Val Asn Arg Val His Leu Val Thr Gln His Gly
50 55 60
Glu Glu Phe Arg Arg Met Leu Asp Gly Arg Trp Thr Val Thr Thr Leu
65 70 75 80
Ser Gly Asp Met Gly Pro Arg Ala Gly Phe Gly His Leu Ala Arg Cys
85 90 95
His Asp Leu Leu Ile Cys Thr Ala Glu Leu Leu Gln Met Ala Leu Thr
100 105 110
Ser Pro Glu Glu Glu Glu His Val Glu Leu Thr Val Phe Ser Leu Ile
115 120 125
Val Val Asp Glu Cys His His Thr His Lys Asp Thr Val Tyr Asn Val
130 135 140
Ile Met Ser Gln Tyr Leu Glu Leu Lys Leu Gln Arg Ala Gln Pro Leu
145 150 155 160
Pro Gln Val Leu Gly Leu Thr Ala Ser Pro Gly Thr Gly Gly Ala Ser
165 170 175
Lys Leu Asp Gly Ala Ile Asn His Val Leu Gln Leu Cys Ala Asn Leu
180 185 190
Asp Thr Trp Cys Ile Met Ser Pro Gln Asn Cys Cys Pro Gln Leu Gln
195 200 205
Glu His Ser Gln Gln Pro Cys Lys Gln Tyr Asn Leu Cys His Arg Arg
210 215 220
Ser Gln Asp Pro Phe Gly Asp Leu Leu Lys Lys Leu Met Asp Gln Ile
225 230 235 240
His Asp His Leu Glu Met Pro Glu Leu Ser Arg Lys Phe Gly Thr Gln
245 250 255
Met Tyr Glu Gln Gln Val Val Lys Leu Ser Glu Ala Ala Ala Leu Ala
260 265 270
Gly Leu Gln Glu Gln Arg Val Tyr Ala Leu His Leu Arg Arg Tyr Asn
275 280 285
Asp Ala Leu Leu Ile His Asp Thr Val Arg Ala Val Asp Ala Leu Ala
290 295 300
Ala Leu Gln Asp Phe Tyr His Arg Glu His Val Thr Lys Thr Gln Ile
305 310 315 320
Leu Cys Ala Glu Arg Arg Leu Leu Ala Leu Phe Asp Asp Arg Lys Asn
325 330 335
Glu Leu Ala His Leu Ala Thr His Gly Pro Glu Asn Pro Lys Leu Glu
340 345 350
Met Leu Glu Lys Ile Leu Gln Arg Gln Phe Ser Ser Ser Asn Ser Pro
355 360 365
Arg Gly Ile Ile Phe Thr Arg Thr Arg Gln Ser Ala His Ser Leu Leu
370 375 380
Leu Trp Leu Gln Gln Gln Gln Gly Leu Gln Thr Val Asp Ile Arg Ala
385 390 395 400
Gln Leu Leu Ile Gly Ala Gly Asn Ser Ser Gln Ser Thr His Met Thr
405 410 415
Gln Arg Asp Gln Gln Glu Val Ile Gln Lys Phe Gln Asp Gly Thr Leu
420 425 430
Asn Leu Leu Val Ala Thr Ser Val Ala Glu Glu Gly Leu Asp Ile Pro
435 440 445
His Cys Asn Val Val Val Arg Tyr Gly Leu Leu Thr Asn Glu Ile Ser
450 455 460
Met Val Gln Ala Arg Gly Arg Ala Arg Ala Asp Gln Ser Val Tyr Ala
465 470 475 480
Phe Val Ala Thr Glu Gly Ser Arg Glu Leu Lys Arg Glu Leu Ile Asn
485 490 495
Glu Ala Leu Glu Thr Leu Met Glu Gln Ala Val Ala Ala Val Gln Lys
500 505 510
Met Asp Gln Ala Glu Tyr Gln Ala Lys Ile Arg Asp Leu Gln Gln Ala
515 520 525
Ala Leu Thr Lys Arg Ala Ala Gln Ala Ala Gln Arg Glu Asn Gln Arg
530 535 540
Gln Gln Phe Pro Val Glu His Val Gln Leu Leu Cys Ile Asn Cys Met
545 550 555 560
Val Ala Val Gly His Gly Ser Asp Leu Arg Lys Val Glu Gly Thr His
565 570 575
His Val Asn Val Asn Pro Asn Phe Ser Asn Tyr Tyr Asn Val Ser Arg
580 585 590
Asp Pro Val Val Ile Asn Lys Val Phe Lys Asp Trp Lys Pro Gly Gly
595 600 605
Val Ile Ser Cys Arg Asn Cys Gly Glu Val Trp Gly Leu Gln Met Ile
610 615 620
Tyr Lys Ser Val Lys Leu Pro Val Leu Lys Val Arg Ser Met Leu Leu
625 630 635 640
Glu Thr Pro Gln Gly Arg Ile Gln Ala Lys Lys Trp Ser Arg Val Pro
645 650 655
Phe Ser Val Pro Asp Phe Asp Phe Leu Gln His Cys Ala Glu Asn Leu
660 665 670
Ser Asp Leu Ser Leu Asp
675
<210> 182
<211> 2034
<212> DNA
<213> Homo sapiens
<400> 182
atggagcttc ggtcctacca atgggaggtg atcatgcctg ccctggaggg caagaatatc 60
atcatctggc tgcccacggg tgccgggaag acccgggcgg ctgcttatgt ggccaagcgg 120
cacctagaga ctgtggatgg agccaaggtg gttgtattgg tcaacagggt gcacctggtg 180
acccagcatg gtgaagagtt caggcgcatg ctggatggac gctggaccgt gacaaccctg 240
agtggggaca tgggaccacg tgctggcttt ggccacctgg cccggtgcca tgacctgctc 300
atctgcacag cagagcttct gcagatggca ctgaccagcc ccgaggagga ggagcacgtg 360
gagctcactg tcttctccct gatcgtggtg gatgagtgcc accacacgca caaggacacc 420
gtctacaacg tcatcatgag ccagtaccta gaacttaaac tccagagggc acagccgcta 480
ccccaggtgc tgggtctcac agcctcccca ggcactggcg gggcctccaa actcgatggg 540
gccatcaacc acgtcctgca gctctgtgcc aacttggaca cgtggtgcat catgtcaccc 600
cagaactgct gcccccagct gcaggagcac agccaacagc cttgcaaaca gtacaacctc 660
tgccacaggc gcagccagga tccgtttggg gacttgctga agaagctcat ggaccaaatc 720
catgaccacc tggagatgcc tgagttgagc cggaaatttg ggacgcaaat gtatgagcag 780
caggtggtga agctgagtga ggctgcggct ttggctgggc ttcaggagca acgggtgtat 840
gcgcttcacc tgaggcgcta caatgacgcg ctgctcatcc atgacaccgt ccgcgccgtg 900
gatgccttgg ctgcgctgca ggatttctat cacagggagc acgtcactaa aacccagatc 960
ctgtgtgccg agcgccggct gctggccctg ttcgatgacc gcaagaatga gctggcccac 1020
ttggcaactc atggcccaga gaatccaaaa ctggagatgc tggaaaagat cctgcaaagg 1080
cagttcagta gctctaacag ccctcggggt atcatcttca cccgcacccg ccaaagcgca 1140
cactccctcc tgctctggct ccagcagcag cagggcctgc agactgtgga catccgggcc 1200
cagctactga ttggggctgg gaacagcagc cagagcaccc acatgaccca gagggaccag 1260
caagaagtga tccagaagtt ccaagatgga accctgaacc ttctggtggc cacgagtgtg 1320
gcggaggagg ggctggacat cccacattgc aatgtggtgg tgcgttatgg gctcttgacc 1380
aatgaaatct ccatggtcca ggccaggggc cgtgcccggg ccgatcagag tgtatacgcg 1440
tttgtagcaa ctgaaggtag ccgggagctg aagcgggagc tgatcaacga ggcgctggag 1500
acgctgatgg agcaggcagt ggctgctgtg cagaaaatgg accaggccga gtaccaggcc 1560
aagatccggg atctgcagca ggcagccttg accaagcggg cggcccaggc agcccagcgg 1620
gagaaccagc ggcagcagtt cccagtggag cacgtgcagc tactctgcat caactgcatg 1680
gtggctgtgg gccatggcag cgacctgcgg aaggtggagg gcacccacca tgtcaatgtg 1740
aaccccaact tctcgaacta ctataatgtc tccagggatc ctgtggtcat caacaaagtc 1800
ttcaaggact ggaagcctgg gggtgtcatc agctgcagga actgtgggga ggtctggggt 1860
ctgcagatga tctacaagtc agtgaagctg ccagtgctca aagtccgcag catgctgctg 1920
gagacccctc aggggcggat ccaggccaaa aagtggtccc gcgtgccctt ctccgtgcct 1980
gactttgact tcctgcagca ttgtgccgag aacttgtcgg acctctccct ggac 2034
<210> 183
<211> 2034
<212> RNA
<213> Homo sapiens
<400> 183
auggagcuuc gguccuacca augggaggug aucaugccug cccuggaggg caagaauauc 60
aucaucuggc ugcccacggg ugccgggaag acccgggcgg cugcuuaugu ggccaagcgg 120
caccuagaga cuguggaugg agccaaggug guuguauugg ucaacagggu gcaccuggug 180
acccagcaug gugaagaguu caggcgcaug cuggauggac gcuggaccgu gacaacccug 240
aguggggaca ugggaccacg ugcuggcuuu ggccaccugg cccggugcca ugaccugcuc 300
aucugcacag cagagcuucu gcagauggca cugaccagcc ccgaggagga ggagcacgug 360
gagcucacug ucuucucccu gaucguggug gaugagugcc accacacgca caaggacacc 420
gucuacaacg ucaucaugag ccaguaccua gaacuuaaac uccagagggc acagccgcua 480
ccccaggugc ugggucucac agccucccca ggcacuggcg gggccuccaa acucgauggg 540
gccaucaacc acguccugca gcucugugcc aacuuggaca cguggugcau caugucaccc 600
cagaacugcu gcccccagcu gcaggagcac agccaacagc cuugcaaaca guacaaccuc 660
ugccacaggc gcagccagga uccguuuggg gacuugcuga agaagcucau ggaccaaauc 720
caugaccacc uggagaugcc ugaguugagc cggaaauuug ggacgcaaau guaugagcag 780
caggugguga agcugaguga ggcugcggcu uuggcugggc uucaggagca acggguguau 840
gcgcuucacc ugaggcgcua caaugacgcg cugcucaucc augacaccgu ccgcgccgug 900
gaugccuugg cugcgcugca ggauuucuau cacagggagc acgucacuaa aacccagauc 960
cugugugccg agcgccggcu gcuggcccug uucgaugacc gcaagaauga gcuggcccac 1020
uuggcaacuc auggcccaga gaauccaaaa cuggagaugc uggaaaagau ccugcaaagg 1080
caguucagua gcucuaacag cccucggggu aucaucuuca cccgcacccg ccaaagcgca 1140
cacucccucc ugcucuggcu ccagcagcag cagggccugc agacugugga cauccgggcc 1200
cagcuacuga uuggggcugg gaacagcagc cagagcaccc acaugaccca gagggaccag 1260
caagaaguga uccagaaguu ccaagaugga acccugaacc uucugguggc cacgagugug 1320
gcggaggagg ggcuggacau cccacauugc aauguggugg ugcguuaugg gcucuugacc 1380
aaugaaaucu ccauggucca ggccaggggc cgugcccggg ccgaucagag uguauacgcg 1440
uuuguagcaa cugaagguag ccgggagcug aagcgggagc ugaucaacga ggcgcuggag 1500
acgcugaugg agcaggcagu ggcugcugug cagaaaaugg accaggccga guaccaggcc 1560
aagauccggg aucugcagca ggcagccuug accaagcggg cggcccaggc agcccagcgg 1620
gagaaccagc ggcagcaguu cccaguggag cacgugcagc uacucugcau caacugcaug 1680
guggcugugg gccauggcag cgaccugcgg aagguggagg gcacccacca ugucaaugug 1740
aaccccaacu ucucgaacua cuauaauguc uccagggauc cuguggucau caacaaaguc 1800
uucaaggacu ggaagccugg gggugucauc agcugcagga acugugggga ggucuggggu 1860
cugcagauga ucuacaaguc agugaagcug ccagugcuca aaguccgcag caugcugcug 1920
gagaccccuc aggggcggau ccaggccaaa aagugguccc gcgugcccuu cuccgugccu 1980
gacuuugacu uccugcagca uugugccgag aacuugucgg accucucccu ggac 2034
<210> 184
<211> 2037
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding LGP2
<400> 184
atggaactgc ggagctacca gtgggaagtg atcatgcctg ctctggaagg caagaacatc 60
atcatctggc tgcccaccgg cgctggcaaa acaagagctg ctgcctacgt ggccaagcgg 120
cacctggaaa cagtggatgg cgctaaggtg gtggtgctgg tcaacagagt gcacctggtt 180
acccagcacg gcgaggaatt cagaagaatg ctggacggcc ggtggaccgt gacaacactg 240
tctggcgata tgggccctag agccggcttt ggacacctgg ccagatgcca cgatctgctg 300
atctgtacag ccgaactgct gcagatggcc ctgacaagcc ctgaggaaga ggaacacgtc 360
gagctgaccg tgttcagcct gatcgtggtg gacgagtgcc accacacaca caaggacacc 420
gtgtacaacg tgatcatgag ccagtacctg gaactgaagc tgcagagagc ccagcctctg 480
cctcaagtgc tgggactgac agcctctcct ggaacaggcg gagcctctaa actggacggc 540
gccatcaatc acgtgctgca gctgtgcgcc aacctggata cctggtgcat catgtcccca 600
cagaactgct gtccccagct gcaagagcac tctcagcagc cctgcaagca gtacaacctg 660
tgccacagaa gatctcagga ccccttcggc gacctgctga agaaactgat ggaccagatc 720
cacgaccacc tcgagatgcc cgagctgagc agaaagttcg gcacccagat gtacgagcag 780
caggttgtga agctgagcga agccgctgct ctggccggac tgcaagaaca gagagtgtac 840
gccctgcacc tgaggcggta caatgatgcc ctgctgatcc acgataccgt gcgcgctgtt 900
gatgctctgg ctgctctgca ggatttctac caccgcgagc acgtgaccaa gacacagatc 960
ctgtgtgccg agagaaggct gctggccctg ttcgacgaca gaaagaatga gctggcccac 1020
ctggctacac acggccccga aaatcccaag ctggaaatgc tggaaaagat cctgcagcgg 1080
cagttcagca gcagcaacag ccctagaggc atcatcttca cccggaccag acagagcgcc 1140
cactctctgc tgctgtggct gcagcaacaa cagggactgc agaccgtgga cattagggcc 1200
cagctgctga tcggagccgg caatagctct cagagcaccc acatgaccca gcgggaccag 1260
caagaagtga tccagaagtt ccaggacggc accctgaatc tgctggtggc cacatctgtg 1320
gctgaggaag gcctggatat ccctcactgc aacgtggtcg tcagatacgg cctgctgacc 1380
aacgagatca gcatggtgca ggccagaggc agagccagag ccgatcagtc tgtgtacgcc 1440
ttcgtggcta cagagggctc cagagagctg aagcgcgagc tgatcaatga ggccctggaa 1500
accctgatgg aacaagccgt ggccgccgtg cagaaaatgg atcaggccga gtaccaggcc 1560
aagatcaggg atctgcaaca ggccgctctg accaagagag ctgctcaggc tgcccagaga 1620
gagaaccaga gacagcaatt ccccgtggaa cacgtgcagc tgctgtgtat caactgcatg 1680
gtggccgtcg gacacggcag cgatctgaga aaagtggaag gcacccacca cgtgaacgtg 1740
aaccccaact tcagcaacta ctacaacgtg tccagagatc ccgtggtcat caacaaggtg 1800
ttcaaggact ggaagcctgg cggcgtgatc agctgcagaa attgcggaga agtgtggggc 1860
ctgcagatga tctacaagag cgtgaagctg cccgtgctga aagtgcggag catgctgctg 1920
gaaacacccc agggaagaat ccaggccaaa aagtggtcca gagtgccctt cagcgtgccc 1980
gacttcgatt tcctgcagca ctgcgccgag aacctgagcg atctgtccct ggattga 2037
<210> 185
<211> 2037
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding LGP2
<400> 185
auggaacugc ggagcuacca gugggaagug aucaugccug cucuggaagg caagaacauc 60
aucaucuggc ugcccaccgg cgcuggcaaa acaagagcug cugccuacgu ggccaagcgg 120
caccuggaaa caguggaugg cgcuaaggug guggugcugg ucaacagagu gcaccugguu 180
acccagcacg gcgaggaauu cagaagaaug cuggacggcc gguggaccgu gacaacacug 240
ucuggcgaua ugggcccuag agccggcuuu ggacaccugg ccagaugcca cgaucugcug 300
aucuguacag ccgaacugcu gcagauggcc cugacaagcc cugaggaaga ggaacacguc 360
gagcugaccg uguucagccu gaucguggug gacgagugcc accacacaca caaggacacc 420
guguacaacg ugaucaugag ccaguaccug gaacugaagc ugcagagagc ccagccucug 480
ccucaagugc ugggacugac agccucuccu ggaacaggcg gagccucuaa acuggacggc 540
gccaucaauc acgugcugca gcugugcgcc aaccuggaua ccuggugcau caugucccca 600
cagaacugcu guccccagcu gcaagagcac ucucagcagc ccugcaagca guacaaccug 660
ugccacagaa gaucucagga ccccuucggc gaccugcuga agaaacugau ggaccagauc 720
cacgaccacc ucgagaugcc cgagcugagc agaaaguucg gcacccagau guacgagcag 780
cagguuguga agcugagcga agccgcugcu cuggccggac ugcaagaaca gagaguguac 840
gcccugcacc ugaggcggua caaugaugcc cugcugaucc acgauaccgu gcgcgcuguu 900
gaugcucugg cugcucugca ggauuucuac caccgcgagc acgugaccaa gacacagauc 960
cugugugccg agagaaggcu gcuggcccug uucgacgaca gaaagaauga gcuggcccac 1020
cuggcuacac acggccccga aaaucccaag cuggaaaugc uggaaaagau ccugcagcgg 1080
caguucagca gcagcaacag cccuagaggc aucaucuuca cccggaccag acagagcgcc 1140
cacucucugc ugcuguggcu gcagcaacaa cagggacugc agaccgugga cauuagggcc 1200
cagcugcuga ucggagccgg caauagcucu cagagcaccc acaugaccca gcgggaccag 1260
caagaaguga uccagaaguu ccaggacggc acccugaauc ugcugguggc cacaucugug 1320
gcugaggaag gccuggauau cccucacugc aacguggucg ucagauacgg ccugcugacc 1380
aacgagauca gcauggugca ggccagaggc agagccagag ccgaucaguc uguguacgcc 1440
uucguggcua cagagggcuc cagagagcug aagcgcgagc ugaucaauga ggcccuggaa 1500
acccugaugg aacaagccgu ggccgccgug cagaaaaugg aucaggccga guaccaggcc 1560
aagaucaggg aucugcaaca ggccgcucug accaagagag cugcucaggc ugcccagaga 1620
gagaaccaga gacagcaauu ccccguggaa cacgugcagc ugcuguguau caacugcaug 1680
guggccgucg gacacggcag cgaucugaga aaaguggaag gcacccacca cgugaacgug 1740
aaccccaacu ucagcaacua cuacaacgug uccagagauc ccguggucau caacaaggug 1800
uucaaggacu ggaagccugg cggcgugauc agcugcagaa auugcggaga aguguggggc 1860
cugcagauga ucuacaagag cgugaagcug cccgugcuga aagugcggag caugcugcug 1920
gaaacacccc agggaagaau ccaggccaaa aaguggucca gagugcccuu cagcgugccc 1980
gacuucgauu uccugcagca cugcgccgag aaccugagcg aucugucccu ggauuga 2037
<210> 186
<211> 880
<212> PRT
<213> Homo sapiens
<400> 186
Met Thr Thr Glu Gln Arg Arg Ser Leu Gln Ala Phe Gln Asp Tyr Ile
1 5 10 15
Arg Lys Thr Leu Asp Pro Thr Tyr Ile Leu Ser Tyr Met Ala Pro Trp
20 25 30
Phe Arg Glu Gly Tyr Ser Gly Leu Tyr Glu Ala Ile Glu Ser Trp Asp
35 40 45
Phe Lys Lys Ile Glu Lys Leu Glu Glu Tyr Arg Leu Leu Leu Lys Arg
50 55 60
Leu Gln Pro Glu Phe Lys Thr Arg Ile Ile Pro Thr Asp Ile Ile Ser
65 70 75 80
Asp Leu Ser Glu Cys Leu Ile Asn Gln Glu Cys Glu Glu Ile Leu Gln
85 90 95
Ile Cys Ser Thr Lys Gly Met Met Ala Gly Ala Glu Lys Leu Val Glu
100 105 110
Cys Leu Leu Arg Ser Asp Lys Glu Asn Trp Pro Lys Thr Leu Lys Leu
115 120 125
Ala Leu Glu Lys Glu Arg Asn Lys Phe Ser Glu Leu Trp Ile Val Glu
130 135 140
Lys Gly Ile Lys Asp Val Glu Thr Glu Asp Leu Glu Asp Lys Met Glu
145 150 155 160
Thr Ser Asp Ile Gln Ile Phe Tyr Gln Glu Asp Pro Glu Cys Gln Asn
165 170 175
Leu Ser Glu Asn Ser Cys Pro Pro Ser Glu Val Ser Asp Thr Asn Leu
180 185 190
Tyr Ser Pro Phe Lys Pro Arg Asn Tyr Gln Leu Glu Leu Ala Leu Pro
195 200 205
Ala Met Lys Gly Lys Asn Thr Ile Ile Cys Ala Pro Thr Gly Cys Gly
210 215 220
Lys Thr Phe Val Ser Leu Leu Ile Cys Glu His His Leu Lys Lys Phe
225 230 235 240
Pro Gln Gly Gln Lys Gly Lys Val Val Phe Phe Ala Asn Gln Ile Pro
245 250 255
Val Tyr Glu Gln Gln Lys Ser Val Phe Ser Lys Tyr Phe Glu Arg His
260 265 270
Gly Tyr Arg Val Thr Gly Ile Ser Gly Ala Thr Ala Glu Asn Val Pro
275 280 285
Val Glu Gln Ile Val Glu Asn Asn Asp Ile Ile Ile Leu Thr Pro Gln
290 295 300
Ile Leu Val Asn Asn Leu Lys Lys Gly Thr Ile Pro Ser Leu Ser Ile
305 310 315 320
Phe Thr Leu Met Ile Phe Asp Glu Cys His Asn Thr Ser Lys Gln His
325 330 335
Pro Tyr Asn Met Ile Met Phe Asn Tyr Leu Asp Gln Lys Leu Gly Gly
340 345 350
Ser Ser Gly Pro Leu Pro Gln Val Ile Gly Leu Thr Ala Ser Val Gly
355 360 365
Val Gly Asp Ala Lys Asn Thr Asp Glu Ala Leu Asp Tyr Ile Cys Lys
370 375 380
Leu Cys Ala Ser Leu Asp Ala Ser Val Ile Ala Thr Val Lys His Asn
385 390 395 400
Leu Glu Glu Leu Glu Gln Val Val Tyr Lys Pro Gln Lys Phe Phe Arg
405 410 415
Lys Val Glu Ser Arg Ile Ser Asp Lys Phe Lys Tyr Ile Ile Ala Gln
420 425 430
Leu Met Arg Asp Thr Glu Ser Leu Ala Lys Arg Ile Cys Lys Asp Leu
435 440 445
Glu Asn Leu Ser Gln Ile Gln Asn Arg Glu Phe Gly Thr Gln Lys Tyr
450 455 460
Glu Gln Trp Ile Val Thr Val Gln Lys Ala Cys Met Val Phe Gln Met
465 470 475 480
Pro Asp Lys Asp Glu Glu Ser Arg Ile Cys Lys Ala Leu Phe Leu Tyr
485 490 495
Thr Ser His Leu Arg Lys Tyr Asn Asp Ala Leu Ile Ile Ser Glu His
500 505 510
Ala Arg Met Lys Asp Ala Leu Asp Tyr Leu Lys Asp Phe Phe Ser Asn
515 520 525
Val Arg Ala Ala Gly Phe Asp Glu Ile Glu Gln Asp Leu Thr Gln Arg
530 535 540
Phe Glu Glu Lys Leu Gln Glu Leu Glu Ser Val Ser Arg Asp Pro Ser
545 550 555 560
Asn Glu Asn Pro Lys Leu Glu Asp Leu Cys Phe Ile Leu Gln Glu Glu
565 570 575
Tyr His Leu Asn Pro Glu Thr Ile Thr Ile Leu Phe Val Lys Thr Arg
580 585 590
Ala Leu Val Asp Ala Leu Lys Asn Trp Ile Glu Gly Asn Pro Lys Leu
595 600 605
Ser Phe Leu Lys Pro Gly Ile Leu Thr Gly Arg Gly Lys Thr Asn Gln
610 615 620
Asn Thr Gly Met Thr Leu Pro Ala Gln Lys Cys Ile Leu Asp Ala Phe
625 630 635 640
Lys Ala Ser Gly Asp His Asn Ile Leu Ile Ala Thr Ser Val Ala Asp
645 650 655
Glu Gly Ile Asp Ile Ala Gln Cys Asn Leu Val Ile Leu Tyr Glu Tyr
660 665 670
Val Gly Asn Val Ile Lys Met Ile Gln Thr Arg Gly Arg Gly Arg Ala
675 680 685
Arg Gly Ser Lys Cys Phe Leu Leu Thr Ser Asn Ala Gly Val Ile Glu
690 695 700
Lys Glu Gln Ile Asn Met Tyr Lys Glu Lys Met Met Asn Asp Ser Ile
705 710 715 720
Leu Arg Leu Gln Thr Trp Asp Glu Ala Val Phe Arg Glu Lys Ile Leu
725 730 735
His Ile Gln Thr His Glu Lys Phe Ile Arg Asp Ser Gln Glu Lys Pro
740 745 750
Lys Pro Val Pro Asp Lys Glu Asn Lys Lys Leu Leu Cys Arg Lys Cys
755 760 765
Lys Ala Leu Ala Cys Tyr Thr Ala Asp Val Arg Val Ile Glu Glu Cys
770 775 780
His Tyr Thr Val Leu Gly Asp Ala Phe Lys Glu Cys Phe Val Ser Arg
785 790 795 800
Pro His Pro Lys Pro Lys Gln Phe Ser Ser Phe Glu Lys Arg Ala Lys
805 810 815
Ile Phe Cys Ala Arg Gln Asn Cys Ser His Asp Trp Gly Ile His Val
820 825 830
Lys Tyr Lys Thr Phe Glu Ile Pro Val Ile Lys Ile Glu Ser Phe Val
835 840 845
Val Glu Asp Ile Ala Thr Gly Val Gln Thr Leu Tyr Ser Lys Trp Lys
850 855 860
Asp Phe His Phe Glu Lys Ile Pro Phe Asp Pro Ala Glu Met Ser Lys
865 870 875 880
<210> 187
<211> 2640
<212> DNA
<213> Homo sapiens
<400> 187
atgaccaccg agcagcgacg cagcctgcaa gccttccagg attatatccg gaagaccctg 60
gaccctacct acatcctgag ctacatggcc ccctggttta gggagggtta ttctggactt 120
tatgaagcca ttgaaagttg ggatttcaaa aaaattgaaa agttggagga gtatagatta 180
cttttaaaac gtttacaacc agaatttaaa accagaatta tcccaaccga tatcatttct 240
gatctgtctg aatgtttaat taatcaggaa tgtgaagaaa ttctacagat ttgctctact 300
aaggggatga tggcaggtgc agagaaattg gtggaatgcc ttctcagatc agacaaggaa 360
aactggccca aaactttgaa acttgctttg gagaaagaaa ggaacaagtt cagtgaactg 420
tggattgtag agaaaggtat aaaagatgtt gaaacagaag atcttgagga taagatggaa 480
acttctgaca tacagatttt ctaccaagaa gatccagaat gccagaatct tagtgagaat 540
tcatgtccac cttcagaagt gtctgataca aacttgtaca gcccatttaa accaagaaat 600
taccaattag agcttgcttt gcctgctatg aaaggaaaaa acacaataat atgtgctcct 660
acaggttgtg gaaaaacctt tgtttcactg cttatatgtg aacatcatct taaaaaattc 720
ccacaaggac aaaaggggaa agttgtcttt tttgcgaatc agatcccagt gtatgaacag 780
cagaaatctg tattctcaaa atactttgaa agacatgggt atagagttac aggcatttct 840
ggagcaacag ctgagaatgt cccagtggaa cagattgttg agaacaatga catcatcatt 900
ttaactccac agattcttgt gaacaacctt aaaaagggaa cgattccatc actatccatc 960
tttactttga tgatatttga tgaatgccac aacactagta aacaacaccc gtacaatatg 1020
atcatgttta attatctaga tcagaaactt ggaggatctt caggcccact gccccaggtc 1080
attgggctga ctgcctcggt tggtgttggg gatgccaaaa acacagatga agccttggat 1140
tatatctgca agctgtgtgc ttctcttgat gcgtcagtga tagcaacagt caaacacaat 1200
ctggaggaac tggagcaagt tgtttataag ccccagaagt ttttcaggaa agtggaatca 1260
cggattagcg acaaatttaa atacatcata gctcagctga tgagggacac agagagtctg 1320
gcaaagagaa tctgcaaaga cctcgaaaac ttatctcaaa ttcaaaatag ggaatttgga 1380
acacagaaat atgaacaatg gattgttaca gttcagaaag catgcatggt gttccagatg 1440
ccagacaaag atgaagagag caggatttgt aaagccctgt ttttatacac ttcacatttg 1500
cggaaatata atgatgccct cattatcagt gagcatgcac gaatgaaaga tgctctggat 1560
tacttgaaag acttcttcag caatgtccga gcagcaggat tcgatgagat tgagcaagat 1620
cttactcaga gatttgaaga aaagctgcag gaactagaaa gtgtttccag ggatcccagc 1680
aatgagaatc ctaaacttga agacctctgc ttcatcttac aagaagagta ccacttaaac 1740
ccagagacaa taacaattct ctttgtgaaa accagagcac ttgtggacgc tttaaaaaat 1800
tggattgaag gaaatcctaa actcagtttt ctaaaacctg gcatattgac tggacgtggc 1860
aaaacaaatc agaacacagg aatgaccctc ccggcacaga agtgtatatt ggatgcattc 1920
aaagccagtg gagatcacaa tattctgatt gccacctcag ttgctgatga aggcattgac 1980
attgcacagt gcaatcttgt catcctttat gagtatgtgg gcaatgtcat caaaatgatc 2040
caaaccagag gcagaggaag agcaagaggt agcaagtgct tccttctgac tagtaatgct 2100
ggtgtaattg aaaaagaaca aataaacatg tacaaagaaa aaatgatgaa tgactctatt 2160
ttacgccttc agacatggga cgaagcagta tttagggaaa agattctgca tatacagact 2220
catgaaaaat tcatcagaga tagtcaagaa aaaccaaaac ctgtacctga taaggaaaat 2280
aaaaaactgc tctgcagaaa gtgcaaagcc ttggcatgtt acacagctga cgtaagagtg 2340
atagaggaat gccattacac tgtgcttgga gatgctttta aggaatgctt tgtgagtaga 2400
ccacatccca agccaaagca gttttcaagt tttgaaaaaa gagcaaagat attctgtgcc 2460
cgacagaact gcagccatga ctggggaatc catgtgaagt acaagacatt tgagattcca 2520
gttataaaaa ttgaaagttt tgtggtggag gatattgcaa ctggagttca gacactgtac 2580
tcgaagtgga aggactttca ttttgagaag ataccatttg atccagcaga aatgtccaaa 2640
2640
<210> 188
<211> 2640
<212> RNA
<213> Homo sapiens
<400> 188
augaccaccg agcagcgacg cagccugcaa gccuuccagg auuauauccg gaagacccug 60
gacccuaccu acauccugag cuacauggcc cccugguuua gggaggguua uucuggacuu 120
uaugaagcca uugaaaguug ggauuucaaa aaaauugaaa aguuggagga guauagauua 180
cuuuuaaaac guuuacaacc agaauuuaaa accagaauua ucccaaccga uaucauuucu 240
gaucugucug aauguuuaau uaaucaggaa ugugaagaaa uucuacagau uugcucuacu 300
aaggggauga uggcaggugc agagaaauug guggaaugcc uucucagauc agacaaggaa 360
aacuggccca aaacuuugaa acuugcuuug gagaaagaaa ggaacaaguu cagugaacug 420
uggauuguag agaaagguau aaaagauguu gaaacagaag aucuugagga uaagauggaa 480
acuucugaca uacagauuuu cuaccaagaa gauccagaau gccagaaucu uagugagaau 540
ucauguccac cuucagaagu gucugauaca aacuuguaca gcccauuuaa accaagaaau 600
uaccaauuag agcuugcuuu gccugcuaug aaaggaaaaa acacaauaau augugcuccu 660
acagguugug gaaaaaccuu uguuucacug cuuauaugug aacaucaucu uaaaaaauuc 720
ccacaaggac aaaaggggaa aguugucuuu uuugcgaauc agaucccagu guaugaacag 780
cagaaaucug uauucucaaa auacuuugaa agacaugggu auagaguuac aggcauuucu 840
ggagcaacag cugagaaugu cccaguggaa cagauuguug agaacaauga caucaucauu 900
uuaacuccac agauucuugu gaacaaccuu aaaaagggaa cgauuccauc acuauccauc 960
uuuacuuuga ugauauuuga ugaaugccac aacacuagua aacaacaccc guacaauaug 1020
aucauguuua auuaucuaga ucagaaacuu ggaggaucuu caggcccacu gccccagguc 1080
auugggcuga cugccucggu ugguguuggg gaugccaaaa acacagauga agccuuggau 1140
uauaucugca agcugugugc uucucuugau gcgucaguga uagcaacagu caaacacaau 1200
cuggaggaac uggagcaagu uguuuauaag ccccagaagu uuuucaggaa aguggaauca 1260
cggauuagcg acaaauuuaa auacaucaua gcucagcuga ugagggacac agagagucug 1320
gcaaagagaa ucugcaaaga ccucgaaaac uuaucucaaa uucaaaauag ggaauuugga 1380
acacagaaau augaacaaug gauuguuaca guucagaaag caugcauggu guuccagaug 1440
ccagacaaag augaagagag caggauuugu aaagcccugu uuuuauacac uucacauuug 1500
cggaaauaua augaugcccu cauuaucagu gagcaugcac gaaugaaaga ugcucuggau 1560
uacuugaaag acuucuucag caauguccga gcagcaggau ucgaugagau ugagcaagau 1620
cuuacucaga gauuugaaga aaagcugcag gaacuagaaa guguuuccag ggaucccagc 1680
aaugagaauc cuaaacuuga agaccucugc uucaucuuac aagaagagua ccacuuaaac 1740
ccagagacaa uaacaauucu cuuugugaaa accagagcac uuguggacgc uuuaaaaaau 1800
uggauugaag gaaauccuaa acucaguuuu cuaaaaccug gcauauugac uggacguggc 1860
aaaacaaauc agaacacagg aaugacccuc ccggcacaga aguguauauu ggaugcauuc 1920
aaagccagug gagaucacaa uauucugauu gccaccucag uugcugauga aggcauugac 1980
auugcacagu gcaaucuugu cauccuuuau gaguaugugg gcaaugucau caaaaugauc 2040
caaaccagag gcagaggaag agcaagaggu agcaagugcu uccuucugac uaguaaugcu 2100
gguguaauug aaaaagaaca aauaaacaug uacaaagaaa aaaugaugaa ugacucuauu 2160
uuacgccuuc agacauggga cgaagcagua uuuagggaaa agauucugca uauacagacu 2220
caugaaaaau ucaucagaga uagucaagaa aaaccaaaac cuguaccuga uaaggaaaau 2280
aaaaaacugc ucugcagaaa gugcaaagcc uuggcauguu acacagcuga cguaagagug 2340
auagaggaau gccauuacac ugugcuugga gaugcuuuua aggaaugcuu ugugaguaga 2400
ccacauccca agccaaagca guuuucaagu uuugaaaaaa gagcaaagau auucugugcc 2460
cgacagaacu gcagccauga cuggggaauc caugugaagu acaagacauu ugagauucca 2520
guuauaaaaa uugaaaguuu ugugguggag gauauugcaa cuggaguuca gacacuguac 2580
ucgaagugga aggacuuuca uuuugagaag auaccauuug auccagcaga aauguccaaa 2640
2640
<210> 189
<211> 2643
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding RIG splice variant
<400> 189
atgaccaccg agcagagaag atccctgcag gccttccagg actacatcag aaagacactg 60
gaccccacct acatcctgag ctacatggcc ccatggttca gagagggcta cagcggactg 120
tacgaggcca tcgagagctg ggacttcaag aagatcgaga agctggaaga gtaccggctg 180
ctgctgaaga gactgcagcc cgagttcaag acccggatca tccccaccga catcatcagc 240
gatctgagcg agtgcctgat caatcaagag tgcgaggaaa tcctgcagat ctgtagcacc 300
aagggcatga tggctggcgc cgagaaactg gtggaatgcc tgctgagaag cgacaaagag 360
aactggccca agacactgaa gctggccctg gaaaaagagc ggaacaagtt cagcgagctg 420
tggatcgtgg aaaagggcat caaggacgtg gaaaccgagg acctggaaga taagatggaa 480
accagcgaca tccagatctt ctaccaagag gaccccgagt gccagaacct gagcgagaat 540
agctgccctc ctagcgaggt gtccgacacc aatctgtaca gccccttcaa gccccggaac 600
taccagctgg aacttgccct gcctgccatg aagggcaaga acaccatcat ctgtgcccca 660
accggctgcg gcaagacctt tgtgtctctg ctgatctgcg agcaccacct gaagaagttc 720
cctcagggcc agaaaggcaa ggtggtgttt ttcgccaatc agatccccgt gtacgagcag 780
cagaaaagcg tgttcagcaa gtacttcgag cggcacggct acagagtgac aggcatttct 840
ggcgccaccg ccgagaatgt gcctgtggaa cagattgtgg aaaacaacga tatcatcatc 900
ctgacgcctc agatcctggt caacaatctg aagaagggca caatccccag cctgagcatc 960
ttcaccctga tgatcttcga cgagtgccac aacaccagca agcagcaccc ctacaatatg 1020
atcatgttca actacctgga ccagaagctc ggcggcagct ctggacctct gcctcaagtg 1080
attggcctga cagcctctgt cggagtgggc gacgccaaga atactgacga ggccctggat 1140
tacatctgca agctgtgcgc cagcctggac gcctctgtga ttgccaccgt gaagcacaac 1200
ctcgaggaac tggaacaggt ggtgtacaag ccccagaaat tctttcggaa ggtggaaagc 1260
cggatcagcg acaagttcaa gtacatcatt gcccagctga tgcgggacac cgagagcctg 1320
gctaagagaa tctgcaagga tctggaaaac ctgagccaga tccagaacag agagttcggc 1380
acccagaaat acgagcagtg gattgtgacc gtgcagaaag cctgcatggt gttccagatg 1440
cctgacaagg acgaagagag ccggatctgc aaagccctgt tcctgtacac cagccacctg 1500
agaaagtaca acgacgccct gatcatctcc gagcacgcca gaatgaagga cgccctggac 1560
tacctgaagg acttcttctc caatgtgcgc gctgccggct tcgatgagat cgagcaagat 1620
ctgacccagc gcttcgagga aaagctgcaa gagctggaaa gcgtgtccag agatcccagc 1680
aacgagaacc ccaaactgga agatctgtgc ttcatcctgc aagaggaata ccatctgaac 1740
cccgagacaa tcaccatcct gttcgtgaaa acaagagccc tggtggatgc cctgaagaac 1800
tggatcgagg gcaaccccaa gctgagcttc ctgaagcctg gcatcctgac cggcagaggc 1860
aagacaaacc agaacaccgg catgaccctg ccagctcaga agtgcatcct ggacgctttt 1920
aaggccagcg gcgaccacaa catcctgatc gccacatctg tggccgacga gggcatcgat 1980
atcgcccagt gcaatctggt catcctgtac gagtacgtgg gcaacgtgat caagatgatc 2040
cagacaagag gcaggggcag agccagaggc agcaagtgct ttctgctgac ctctaatgcc 2100
ggcgtgatcg agaaagaaca gatcaacatg tacaaagaaa agatgatgaa cgacagcatc 2160
ctgcggctgc agacctggga tgaagccgtg ttccgggaaa agatcctgca catccagaca 2220
cacgagaagt tcatccggga cagccaagag aagcccaagc ctgtgcctga caaagaaaac 2280
aagaaactgc tgtgccggaa gtgcaaggcc ctggcctgtt atacagccga cgtgcgagtg 2340
atcgaggaat gccactatac cgtgctcggc gacgccttca aagaatgctt cgtgtcccgg 2400
cctcatccta agcctaagca gttcagcagc ttcgagaagc gggccaagat cttctgcgcc 2460
agacagaact gcagccacga ctggggaatc cacgtgaagt acaagacctt cgagatcccg 2520
gtcatcaaga tcgagtcctt cgtggtggaa gatatcgcca ccggcgtgca gaccctgtac 2580
agcaagtgga aggatttcca cttcgagaaa atccctttcg accccgccga gatgagcaag 2640
tga 2643
<210> 190
<211> 2643
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding RIG splice variant
<400> 190
augaccaccg agcagagaag aucccugcag gccuuccagg acuacaucag aaagacacug 60
gaccccaccu acauccugag cuacauggcc ccaugguuca gagagggcua cagcggacug 120
uacgaggcca ucgagagcug ggacuucaag aagaucgaga agcuggaaga guaccggcug 180
cugcugaaga gacugcagcc cgaguucaag acccggauca uccccaccga caucaucagc 240
gaucugagcg agugccugau caaucaagag ugcgaggaaa uccugcagau cuguagcacc 300
aagggcauga uggcuggcgc cgagaaacug guggaaugcc ugcugagaag cgacaaagag 360
aacuggccca agacacugaa gcuggcccug gaaaaagagc ggaacaaguu cagcgagcug 420
uggaucgugg aaaagggcau caaggacgug gaaaccgagg accuggaaga uaagauggaa 480
accagcgaca uccagaucuu cuaccaagag gaccccgagu gccagaaccu gagcgagaau 540
agcugcccuc cuagcgaggu guccgacacc aaucuguaca gccccuucaa gccccggaac 600
uaccagcugg aacuugcccu gccugccaug aagggcaaga acaccaucau cugugcccca 660
accggcugcg gcaagaccuu ugugucucug cugaucugcg agcaccaccu gaagaaguuc 720
ccucagggcc agaaaggcaa ggugguguuu uucgccaauc agauccccgu guacgagcag 780
cagaaaagcg uguucagcaa guacuucgag cggcacggcu acagagugac aggcauuucu 840
ggcgccaccg ccgagaaugu gccuguggaa cagauugugg aaaacaacga uaucaucauc 900
cugacgccuc agauccuggu caacaaucug aagaagggca caauccccag ccugagcauc 960
uucacccuga ugaucuucga cgagugccac aacaccagca agcagcaccc cuacaauaug 1020
aucauguuca acuaccugga ccagaagcuc ggcggcagcu cuggaccucu gccucaagug 1080
auuggccuga cagccucugu cggagugggc gacgccaaga auacugacga ggcccuggau 1140
uacaucugca agcugugcgc cagccuggac gccucuguga uugccaccgu gaagcacaac 1200
cucgaggaac uggaacaggu gguguacaag ccccagaaau ucuuucggaa gguggaaagc 1260
cggaucagcg acaaguucaa guacaucauu gcccagcuga ugcgggacac cgagagccug 1320
gcuaagagaa ucugcaagga ucuggaaaac cugagccaga uccagaacag agaguucggc 1380
acccagaaau acgagcagug gauugugacc gugcagaaag ccugcauggu guuccagaug 1440
ccugacaagg acgaagagag ccggaucugc aaagcccugu uccuguacac cagccaccug 1500
agaaaguaca acgacgcccu gaucaucucc gagcacgcca gaaugaagga cgcccuggac 1560
uaccugaagg acuucuucuc caaugugcgc gcugccggcu ucgaugagau cgagcaagau 1620
cugacccagc gcuucgagga aaagcugcaa gagcuggaaa gcguguccag agaucccagc 1680
aacgagaacc ccaaacugga agaucugugc uucauccugc aagaggaaua ccaucugaac 1740
cccgagacaa ucaccauccu guucgugaaa acaagagccc ugguggaugc ccugaagaac 1800
uggaucgagg gcaaccccaa gcugagcuuc cugaagccug gcauccugac cggcagaggc 1860
aagacaaacc agaacaccgg caugacccug ccagcucaga agugcauccu ggacgcuuuu 1920
aaggccagcg gcgaccacaa cauccugauc gccacaucug uggccgacga gggcaucgau 1980
aucgcccagu gcaaucuggu cauccuguac gaguacgugg gcaacgugau caagaugauc 2040
cagacaagag gcaggggcag agccagaggc agcaagugcu uucugcugac cucuaaugcc 2100
ggcgugaucg agaaagaaca gaucaacaug uacaaagaaa agaugaugaa cgacagcauc 2160
cugcggcugc agaccuggga ugaagccgug uuccgggaaa agauccugca cauccagaca 2220
cacgagaagu ucauccggga cagccaagag aagcccaagc cugugccuga caaagaaaac 2280
aagaaacugc ugugccggaa gugcaaggcc cuggccuguu auacagccga cgugcgagug 2340
aucgaggaau gccacuauac cgugcucggc gacgccuuca aagaaugcuu cgugucccgg 2400
ccucauccua agccuaagca guucagcagc uucgagaagc gggccaagau cuucugcgcc 2460
agacagaacu gcagccacga cuggggaauc cacgugaagu acaagaccuu cgagaucccg 2520
gucaucaaga ucgaguccuu cgugguggaa gauaucgcca ccggcgugca gacccuguac 2580
agcaagugga aggauuucca cuucgagaaa aucccuuucg accccgccga gaugagcaag 2640
uga 2643
<210> 191
<211> 547
<212> PRT
<213> Homo sapiens
<400> 191
Met Glu Asp Ser Glu Ala Leu Gly Phe Glu His Met Gly Leu Asp Pro
1 5 10 15
Arg Leu Leu Gln Ala Val Thr Asp Leu Gly Trp Ser Arg Pro Thr Leu
20 25 30
Ile Gln Glu Lys Ala Ile Pro Leu Ala Leu Glu Gly Lys Asp Leu Leu
35 40 45
Ala Arg Ala Arg Thr Gly Ser Gly Lys Thr Ala Ala Tyr Ala Ile Pro
50 55 60
Met Leu Gln Leu Leu Leu His Arg Lys Ala Thr Gly Pro Val Val Glu
65 70 75 80
Gln Ala Val Arg Gly Leu Val Leu Val Pro Thr Lys Glu Leu Ala Arg
85 90 95
Gln Ala Gln Ser Met Ile Gln Gln Leu Ala Thr Tyr Cys Ala Arg Asp
100 105 110
Val Arg Val Ala Asn Val Ser Ala Ala Glu Asp Ser Val Ser Gln Arg
115 120 125
Ala Val Leu Met Glu Lys Pro Asp Val Val Val Gly Thr Pro Ser Arg
130 135 140
Ile Leu Ser His Leu Gln Gln Asp Ser Leu Lys Leu Arg Asp Ser Leu
145 150 155 160
Glu Leu Leu Val Val Asp Glu Ala Asp Leu Leu Phe Ser Phe Gly Phe
165 170 175
Glu Glu Glu Leu Lys Ser Leu Leu Cys His Leu Pro Arg Ile Tyr Gln
180 185 190
Ala Phe Leu Met Ser Ala Thr Phe Asn Glu Asp Val Gln Ala Leu Lys
195 200 205
Glu Leu Ile Leu His Asn Pro Val Thr Leu Lys Leu Gln Glu Ser Gln
210 215 220
Leu Pro Gly Pro Asp Gln Leu Gln Gln Phe Gln Val Val Cys Glu Thr
225 230 235 240
Glu Glu Asp Lys Phe Leu Leu Leu Tyr Ala Leu Leu Lys Leu Ser Leu
245 250 255
Ile Arg Gly Lys Ser Leu Leu Phe Val Asn Thr Leu Glu Arg Ser Tyr
260 265 270
Arg Leu Arg Leu Phe Leu Glu Gln Phe Ser Ile Pro Thr Cys Val Leu
275 280 285
Asn Gly Glu Leu Pro Leu Arg Ser Arg Cys His Ile Ile Ser Gln Phe
290 295 300
Asn Gln Gly Phe Tyr Asp Cys Val Ile Ala Thr Asp Ala Glu Val Leu
305 310 315 320
Gly Ala Pro Val Lys Gly Lys Arg Arg Gly Arg Gly Pro Lys Gly Asp
325 330 335
Lys Ala Ser Asp Pro Glu Ala Gly Val Ala Arg Gly Ile Asp Phe His
340 345 350
His Val Ser Ala Val Leu Asn Phe Asp Leu Pro Pro Thr Pro Glu Ala
355 360 365
Tyr Ile His Arg Ala Gly Arg Thr Ala Arg Ala Asn Asn Pro Gly Ile
370 375 380
Val Leu Thr Phe Val Leu Pro Thr Glu Gln Phe His Leu Gly Lys Ile
385 390 395 400
Glu Glu Leu Leu Ser Gly Glu Asn Arg Gly Pro Ile Leu Leu Pro Tyr
405 410 415
Gln Phe Arg Met Glu Glu Ile Glu Gly Phe Arg Tyr Arg Cys Arg Asp
420 425 430
Ala Met Arg Ser Val Thr Lys Gln Ala Ile Arg Glu Ala Arg Leu Lys
435 440 445
Glu Ile Lys Glu Glu Leu Leu His Ser Glu Lys Leu Lys Thr Tyr Phe
450 455 460
Glu Asp Asn Pro Arg Asp Leu Gln Leu Leu Arg His Asp Leu Pro Leu
465 470 475 480
His Pro Ala Val Val Lys Pro His Leu Gly His Val Pro Asp Tyr Leu
485 490 495
Val Pro Pro Ala Leu Arg Gly Leu Val Arg Pro His Lys Lys Arg Lys
500 505 510
Lys Leu Ser Ser Ser Cys Arg Lys Ala Lys Arg Ala Lys Ser Gln Asn
515 520 525
Pro Leu Arg Ser Phe Lys His Lys Gly Lys Lys Phe Arg Pro Thr Ala
530 535 540
Lys Pro Ser
545
<210> 192
<211> 1641
<212> DNA
<213> Homo sapiens
<400> 192
atggaggact ctgaagcact gggcttcgaa cacatgggcc tcgatccccg gctccttcag 60
gctgtcaccg atctgggctg gtcgcgacct acgctgatcc aggagaaggc catcccactg 120
gccctagaag ggaaggacct cctggctcgg gcccgcacgg gctccgggaa gacggccgct 180
tatgctattc cgatgctgca gctgttgctc cataggaagg cgacaggtcc ggtggtagaa 240
caggcagtga gaggccttgt tcttgttcct accaaggagc tggcacggca agcacagtcc 300
atgattcagc agctggctac ctactgtgct cgggatgtcc gagtggccaa tgtctcagct 360
gctgaagact cagtctctca gagagctgtg ctgatggaga agccagatgt ggtagtaggg 420
accccatctc gcatattaag ccacttgcag caagacagcc tgaaacttcg tgactccctg 480
gagcttttgg tggtggacga agctgacctt cttttttcct ttggctttga agaagagctc 540
aagagtctcc tctgtcactt gccccggatt taccaggctt ttctcatgtc agctactttt 600
aacgaggacg tacaagcact caaggagctg atattacata acccggttac ccttaagtta 660
caggagtccc agctgcctgg gccagaccag ttacagcagt ttcaggtggt ctgtgagact 720
gaggaagaca aattcctcct gctgtatgcc ctgctcaagc tgtcattgat tcggggcaag 780
tctctgctct ttgtcaacac tctagaacgg agttaccggc tacgcctgtt cttggaacag 840
ttcagcatcc ccacctgtgt gctcaatgga gagcttccac tgcgctccag gtgccacatc 900
atctcacagt tcaaccaagg cttctacgac tgtgtcatag caactgatgc tgaagtcctg 960
ggggccccag tcaagggcaa gcgtcggggc cgagggccca aaggggacaa ggcctctgat 1020
ccggaagcag gtgtggcccg gggcatagac ttccaccatg tgtctgctgt gctcaacttt 1080
gatcttcccc caacccctga ggcctacatc catcgagctg gcaggacagc acgcgctaac 1140
aacccaggca tagtcttaac ctttgtgctt cccacggagc agttccactt aggcaagatt 1200
gaggagcttc tcagtggaga gaacaggggc cccattctgc tcccctacca gttccggatg 1260
gaggagatcg agggcttccg ctatcgctgc agggatgcca tgcgctcagt gactaagcag 1320
gccattcggg aggcaagatt gaaggagatc aaggaagagc ttctgcattc tgagaagctt 1380
aagacatact ttgaagacaa ccctagggac ctccagctgc tgcggcatga cctacctttg 1440
caccccgcag tggtgaagcc ccacctgggc catgttcctg actacctggt tcctcctgct 1500
ctccgtggcc tggtgcgccc tcacaagaag cggaagaagc tgtcttcctc ttgtaggaag 1560
gccaagagag caaagtccca gaacccactg cgcagcttca agcacaaagg aaagaaattc 1620
agacccacag ccaagccctc c 1641
<210> 193
<211> 1641
<212> RNA
<213> Homo sapiens
<400> 193
auggaggacu cugaagcacu gggcuucgaa cacaugggcc ucgauccccg gcuccuucag 60
gcugucaccg aucugggcug gucgcgaccu acgcugaucc aggagaaggc caucccacug 120
gcccuagaag ggaaggaccu ccuggcucgg gcccgcacgg gcuccgggaa gacggccgcu 180
uaugcuauuc cgaugcugca gcuguugcuc cauaggaagg cgacaggucc ggugguagaa 240
caggcaguga gaggccuugu ucuuguuccu accaaggagc uggcacggca agcacagucc 300
augauucagc agcuggcuac cuacugugcu cgggaugucc gaguggccaa ugucucagcu 360
gcugaagacu cagucucuca gagagcugug cugauggaga agccagaugu gguaguaggg 420
accccaucuc gcauauuaag ccacuugcag caagacagcc ugaaacuucg ugacucccug 480
gagcuuuugg ugguggacga agcugaccuu cuuuuuuccu uuggcuuuga agaagagcuc 540
aagagucucc ucugucacuu gccccggauu uaccaggcuu uucucauguc agcuacuuuu 600
aacgaggacg uacaagcacu caaggagcug auauuacaua acccgguuac ccuuaaguua 660
caggaguccc agcugccugg gccagaccag uuacagcagu uucagguggu cugugagacu 720
gaggaagaca aauuccuccu gcuguaugcc cugcucaagc ugucauugau ucggggcaag 780
ucucugcucu uugucaacac ucuagaacgg aguuaccggc uacgccuguu cuuggaacag 840
uucagcaucc ccaccugugu gcucaaugga gagcuuccac ugcgcuccag gugccacauc 900
aucucacagu ucaaccaagg cuucuacgac ugugucauag caacugaugc ugaaguccug 960
ggggccccag ucaagggcaa gcgucggggc cgagggccca aaggggacaa ggccucugau 1020
ccggaagcag guguggcccg gggcauagac uuccaccaug ugucugcugu gcucaacuuu 1080
gaucuucccc caaccccuga ggccuacauc caucgagcug gcaggacagc acgcgcuaac 1140
aacccaggca uagucuuaac cuuugugcuu cccacggagc aguuccacuu aggcaagauu 1200
gaggagcuuc ucaguggaga gaacaggggc cccauucugc uccccuacca guuccggaug 1260
gaggagaucg agggcuuccg cuaucgcugc agggaugcca ugcgcucagu gacuaagcag 1320
gccauucggg aggcaagauu gaaggagauc aaggaagagc uucugcauuc ugagaagcuu 1380
aagacauacu uugaagacaa cccuagggac cuccagcugc ugcggcauga ccuaccuuug 1440
caccccgcag uggugaagcc ccaccugggc cauguuccug acuaccuggu uccuccugcu 1500
cuccguggcc uggugcgccc ucacaagaag cggaagaagc ugucuuccuc uuguaggaag 1560
gccaagagag caaaguccca gaacccacug cgcagcuuca agcacaaagg aaagaaauuc 1620
agacccacag ccaagcccuc c 1641
<210> 194
<211> 1644
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding DDX-56
<400> 194
atggaagatt ctgaggccct gggcttcgag cacatgggcc ttgatcctag actgctgcag 60
gccgtgacag atctcggatg gtccagacct acactgatcc aagagaaggc cattcctctg 120
gctctggaag gcaaggacct gctggccaga gctagaacag gctctggcaa gacagccgcc 180
tacgctatcc ctatgctgca gctgctgctg cacagaaagg ccacaggacc agtggtggaa 240
caggccgtta gaggactggt gctggtgccc acaaaagagc tggctagaca ggcccagagc 300
atgatccagc agctggccac atactgcgcc agagatgtgc gagtggccaa tgtgtctgcc 360
gccgaggatt ctgtgtctca gagggccgtg ctgatggaaa agcccgatgt ggtcgtgggc 420
acccctagca gaatcctgtc tcatctgcag caggacagcc tgaagctgag agacagcctg 480
gaactgctgg tggtggatga ggccgatctg ctgttcagct tcggcttcga ggaagaactg 540
aagtccctgc tgtgccatct gcctcggatc taccaggcct tcctgatgag cgccaccttc 600
aacgaagatg tgcaggccct gaaagagctg atcctgcaca accccgtgac actgaagctg 660
caagagagcc agctgccagg acctgatcag ctccagcagt ttcaagtcgt gtgcgagaca 720
gaagaggaca agttcctgct gctgtacgcc ctgctgaagc tgtccctgat cagaggcaag 780
agcctgctgt tcgtgaacac cctggaaaga agctaccggc tgcggctgtt tctggaacag 840
ttcagcatcc ctacctgcgt gctgaacggc gagctgcctc tgagaagcag atgccacatc 900
atcagccagt tcaaccaggg cttctacgac tgcgtgatcg ccacagatgc cgaagtgctg 960
ggagcacccg tgaagggcaa aagaagaggc agaggcccca agggcgataa ggccagtgat 1020
cctgaagcag gcgtggccag aggcatcgat tttcaccatg tgtccgctgt gctgaacttc 1080
gacctgccac ctacacctga ggcctacatc cacagagccg gcagaacagc cagagccaac 1140
aatcctggca tcgtgctgac cttcgtgctg cctaccgaac agttccacct gggcaagatc 1200
gaagaactgc tgtccggcga gaacaggggc cctatcctgc tgccttacca gttccggatg 1260
gaagagatcg agggcttcag atacagatgc agggacgcca tgcggagcgt gacaaagcag 1320
gccattagag aggcccggct gaaagagatc aaagaggaac tgctccacag cgagaagctc 1380
aagacctact tcgaggacaa ccccagggac ctgcagctcc tgagacatga tctgcctctg 1440
caccctgccg tggtcaaacc tcatctggga cacgtgcccg actacctggt tcctcctgct 1500
ctgagaggcc ttgtgcgccc tcacaagaag cggaagaagc tgagcagctc ttgtcggaag 1560
gccaagcggg ccaagagcca gaatccactg agaagcttca agcacaaggg caagaagttc 1620
agacccaccg ccaagcctag ctga 1644
<210> 195
<211> 1644
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding DDX-56
<400> 195
auggaagauu cugaggcccu gggcuucgag cacaugggcc uugauccuag acugcugcag 60
gccgugacag aucucggaug guccagaccu acacugaucc aagagaaggc cauuccucug 120
gcucuggaag gcaaggaccu gcuggccaga gcuagaacag gcucuggcaa gacagccgcc 180
uacgcuaucc cuaugcugca gcugcugcug cacagaaagg ccacaggacc agugguggaa 240
caggccguua gaggacuggu gcuggugccc acaaaagagc uggcuagaca ggcccagagc 300
augauccagc agcuggccac auacugcgcc agagaugugc gaguggccaa ugugucugcc 360
gccgaggauu cugugucuca gagggccgug cugauggaaa agcccgaugu ggucgugggc 420
accccuagca gaauccuguc ucaucugcag caggacagcc ugaagcugag agacagccug 480
gaacugcugg ugguggauga ggccgaucug cuguucagcu ucggcuucga ggaagaacug 540
aagucccugc ugugccaucu gccucggauc uaccaggccu uccugaugag cgccaccuuc 600
aacgaagaug ugcaggcccu gaaagagcug auccugcaca accccgugac acugaagcug 660
caagagagcc agcugccagg accugaucag cuccagcagu uucaagucgu gugcgagaca 720
gaagaggaca aguuccugcu gcuguacgcc cugcugaagc ugucccugau cagaggcaag 780
agccugcugu ucgugaacac ccuggaaaga agcuaccggc ugcggcuguu ucuggaacag 840
uucagcaucc cuaccugcgu gcugaacggc gagcugccuc ugagaagcag augccacauc 900
aucagccagu ucaaccaggg cuucuacgac ugcgugaucg ccacagaugc cgaagugcug 960
ggagcacccg ugaagggcaa aagaagaggc agaggcccca agggcgauaa ggccagugau 1020
ccugaagcag gcguggccag aggcaucgau uuucaccaug uguccgcugu gcugaacuuc 1080
gaccugccac cuacaccuga ggccuacauc cacagagccg gcagaacagc cagagccaac 1140
aauccuggca ucgugcugac cuucgugcug ccuaccgaac aguuccaccu gggcaagauc 1200
gaagaacugc uguccggcga gaacaggggc ccuauccugc ugccuuacca guuccggaug 1260
gaagagaucg agggcuucag auacagaugc agggacgcca ugcggagcgu gacaaagcag 1320
gccauuagag aggcccggcu gaaagagauc aaagaggaac ugcuccacag cgagaagcuc 1380
aagaccuacu ucgaggacaa ccccagggac cugcagcucc ugagacauga ucugccucug 1440
cacccugccg uggucaaacc ucaucuggga cacgugcccg acuaccuggu uccuccugcu 1500
cugagaggcc uugugcgccc ucacaagaag cggaagaagc ugagcagcuc uugucggaag 1560
gccaagcggg ccaagagcca gaauccacug agaagcuuca agcacaaggg caagaaguuc 1620
agacccaccg ccaagccuag cuga 1644
<210> 196
<211> 173
<212> PRT
<213> Homo sapiens
<400> 196
Met Cys Leu Leu Leu Gly Ala Thr Gly Val Gly Lys Thr Leu Leu Val
1 5 10 15
Lys Arg Leu Gln Glu Val Ser Ser Arg Asp Gly Lys Gly Asp Leu Gly
20 25 30
Glu Pro Pro Pro Thr Arg Pro Thr Val Gly Thr Asn Leu Thr Asp Ile
35 40 45
Val Ala Gln Arg Lys Ile Thr Ile Arg Glu Leu Gly Gly Cys Met Gly
50 55 60
Pro Ile Trp Ser Ser Tyr Tyr Gly Asn Cys Arg Ser Leu Leu Phe Val
65 70 75 80
Met Asp Ala Ser Asp Pro Thr Gln Leu Ser Ala Ser Cys Val Gln Leu
85 90 95
Leu Gly Leu Leu Ser Ala Glu Gln Leu Ala Glu Ala Ser Val Leu Ile
100 105 110
Leu Phe Asn Lys Ile Asp Leu Pro Cys Tyr Met Ser Thr Glu Glu Met
115 120 125
Lys Ser Leu Ile Arg Leu Pro Asp Ile Ile Ala Cys Ala Lys Gln Asn
130 135 140
Ile Thr Thr Ala Glu Ile Ser Ala Arg Glu Gly Thr Gly Leu Ala Gly
145 150 155 160
Val Leu Ala Trp Leu Gln Ala Thr His Arg Ala Asn Asp
165 170
<210> 197
<211> 519
<212> DNA
<213> Homo sapiens
<400> 197
atgtgtctcc tgctgggggc cacgggcgtc gggaagacgc tgctggtgaa acggctgcag 60
gaggtgagct cccgggatgg gaaaggcgac ctgggggagc cgcccccgac acggcccacg 120
gtgggcacca atcttactga catcgtggca cagagaaaga tcaccatccg ggagcttggg 180
gggtgcatgg gccccatctg gtccagttac tatggaaact gccgttctct cctgtttgtg 240
atggacgcct ctgaccccac ccagctctct gcatcctgtg tgcagctctt aggtctcctt 300
tctgcagaac aacttgcaga agcatcggtg ctgatactct tcaataaaat cgacctaccc 360
tgttacatgt ccacggagga gatgaagtca ttaatcaggc ttccagacat cattgcttgt 420
gccaagcaga acatcaccac ggcagaaatc agcgcccgtg aaggcactgg cttagcaggg 480
gtgctggcct ggctccaggc cacccacaga gccaacgat 519
<210> 198
<211> 519
<212> RNA
<213> Homo sapiens
<400> 198
augugucucc ugcugggggc cacgggcguc gggaagacgc ugcuggugaa acggcugcag 60
gaggugagcu cccgggaugg gaaaggcgac cugggggagc cgcccccgac acggcccacg 120
gugggcacca aucuuacuga caucguggca cagagaaaga ucaccauccg ggagcuuggg 180
gggugcaugg gccccaucug guccaguuac uauggaaacu gccguucucu ccuguuugug 240
auggacgccu cugaccccac ccagcucucu gcauccugug ugcagcucuu aggucuccuu 300
ucugcagaac aacuugcaga agcaucggug cugauacucu ucaauaaaau cgaccuaccc 360
uguuacaugu ccacggagga gaugaaguca uuaaucaggc uuccagacau cauugcuugu 420
gccaagcaga acaucaccac ggcagaaauc agcgcccgug aaggcacugg cuuagcaggg 480
gugcuggccu ggcuccaggc cacccacaga gccaacgau 519
<210> 199
<211> 178
<212> PRT
<213> Homo sapiens
<400> 199
Met Gly Leu Ile Phe Ala Lys Leu Trp Ser Leu Phe Cys Asn Gln Glu
1 5 10 15
His Lys Val Ile Ile Val Gly Leu Asp Asn Ala Gly Lys Thr Thr Ile
20 25 30
Tyr Gln Phe Leu Met Asn Glu Val Val His Thr Ser Pro Thr Ile Gly
35 40 45
Ser Asn Val Glu Glu Ile Val Val Lys Asn Thr His Phe Leu Met Trp
50 55 60
Asp Ile Gly Gly Gln Glu Ser Leu Arg Ser Ser Trp Asn Thr Tyr Tyr
65 70 75 80
Ser Asn Thr Glu Phe Ile Ile Leu Val Val Asp Ser Ile Asp Arg Glu
85 90 95
Arg Leu Ala Ile Thr Lys Glu Glu Leu Tyr Arg Met Leu Ala His Glu
100 105 110
Asp Leu Arg Lys Ala Ala Val Leu Ile Phe Ala Asn Lys Gln Asp Met
115 120 125
Lys Gly Cys Met Thr Ala Ala Glu Ile Ser Lys Tyr Leu Thr Leu Ser
130 135 140
Ser Ile Lys Asp His Pro Trp His Ile Gln Ser Cys Cys Ala Leu Thr
145 150 155 160
Gly Glu Gly Leu Cys Gln Gly Leu Glu Trp Met Thr Ser Arg Ile Gly
165 170 175
Val Arg
<210> 200
<211> 537
<212> DNA
<213> Homo sapiens
<400> 200
atggggctga tcttcgccaa actgtggagc ctcttctgta accaagaaca caaagtaatt 60
atagtgggac tggataatgc agggaaaacc accattcttt accaattctt aatgaatgaa 120
gtggttcata cttctccaac cataggaagc aatgttgaag aaatagttgt gaagaacact 180
cattttctta tgtgggatat tggtggtcag gagtctctgc gatcatcctg gaacacatat 240
tactcaaata cagagttcat cattcttgtt gttgatagca ttgacaggga acgactagct 300
attacaaaag aagaattata cagaatgttg gctcatgagg atttacggaa ggctgcagtc 360
cttatctttg caaataaaca ggatatgaaa gggtgtatga cagcagctga aatctcgaaa 420
tacctcaccc ttagttcaat taaggatcat ccatggcaca ttcaatcctg ctgtgctctc 480
acaggagaag ggttatgcca aggtctagag tggatgacct cccggattgg tgtgaga 537
<210> 201
<211> 537
<212> RNA
<213> Homo sapiens
<400> 201
auggggcuga ucuucgccaa acuguggagc cucuucugua accaagaaca caaaguaauu 60
auagugggac uggauaaugc agggaaaacc accauucuuu accaauucuu aaugaaugaa 120
gugguucaua cuucuccaac cauaggaagc aauguugaag aaauaguugu gaagaacacu 180
cauuuucuua ugugggauau ugguggucag gagucucugc gaucauccug gaacacauau 240
uacucaaaua cagaguucau cauucuuguu guugauagca uugacaggga acgacuagcu 300
auuacaaaag aagaauuaua cagaauguug gcucaugagg auuuacggaa ggcugcaguc 360
cuuaucuuug caaauaaaca ggauaugaaa ggguguauga cagcagcuga aaucucgaaa 420
uaccucaccc uuaguucaau uaaggaucau ccauggcaca uucaauccug cugugcucuc 480
acaggagaag gguuaugcca aggucuagag uggaugaccu cccggauugg ugugaga 537
<210> 202
<211> 540
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding ARL5B
<400> 202
atgggcctga tcttcgccaa actgtggtcc ctgttctgca atcaagagca caaagtgatc 60
atcgtcggcc tggacaacgc cggcaagaca acaatcctgt accagttcct gatgaacgag 120
gtggtgcaca caagccccac catcggcagc aacgtggaag agatcgtggt caagaatacc 180
cacttcctga tgtgggacat cggcggccaa gagagcctga gaagcagctg gaacacctac 240
tacagcaaca ccgagttcat catcctggtg gtggacagca tcgacagaga gagactggcc 300
atcaccaaag aggaactgta ccggatgctg gcccacgagg atctgagaaa agccgccgtg 360
ctgatttttg ccaacaagca ggacatgaag ggctgcatga cagccgccga gatcagcaag 420
tacctgacac tgagcagcat caaggatcac ccctggcaca tccagagctg ctgtgcattg 480
acaggcgagg gcctgtgtca gggactcgag tggatgacaa gcagaatcgg cgtgcggtga 540
540
<210> 203
<211> 540
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding ARL5B
<400> 203
augggccuga ucuucgccaa acuguggucc cuguucugca aucaagagca caaagugauc 60
aucgucggcc uggacaacgc cggcaagaca acaauccugu accaguuccu gaugaacgag 120
guggugcaca caagccccac caucggcagc aacguggaag agaucguggu caagaauacc 180
cacuuccuga ugugggacau cggcggccaa gagagccuga gaagcagcug gaacaccuac 240
uacagcaaca ccgaguucau cauccuggug guggacagca ucgacagaga gagacuggcc 300
aucaccaaag aggaacugua ccggaugcug gcccacgagg aucugagaaa agccgccgug 360
cugauuuuug ccaacaagca ggacaugaag ggcugcauga cagccgccga gaucagcaag 420
uaccugacac ugagcagcau caaggaucac cccuggcaca uccagagcug cugugcauug 480
acaggcgagg gccuguguca gggacucgag uggaugacaa gcagaaucgg cgugcgguga 540
540
<210> 204
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> 26S promoter
<400> 204
gggcccctat aactctctac ggctaacctg aatggactac gacat 45
<210> 205
<211> 19
<212> PRT
<213> Artificial Sequence
<220>
<223> P2A spacer
<400> 205
Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val Glu Glu Asn
1 5 10 15
Pro Gly Pro
<210> 206
<211> 20
<212> PRT
<213> Artificial Sequence
<220>
<223> T2A spacer
<400> 206
Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp Val Glu Ser
1 5 10 15
Asn Pro Gly Pro
20
<210> 207
<211> 22
<212> PRT
<213> Artificial Sequence
<220>
<223> F2A spacer
<400> 207
Val Lys Gln Thr Leu Asn Phe Asp Leu Leu Lys Leu Ala Gly Asp Val
1 5 10 15
Glu Ser Asn Pro Gly Pro
20
<210> 208
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> Furin sequence
<220>
<221> MISC_FEATURE
<222> (2)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> MISC_FEATURE
<222> (5)
<223> Xaa can be any naturally occurring amino acid
<400> 208
Arg Xaa Arg Lys Xaa Arg
1 5
<210> 209
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> Furin sequence
<400> 209
Arg Arg Arg Arg Arg Arg
1 5
<210> 210
<211> 27
<212> PRT
<213> Artificial Sequence
<220>
<223> Furin/T2A sequence
<400> 210
Arg Arg Arg Arg Arg Arg Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu
1 5 10 15
Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro
20 25
<210> 211
<211> 461
<212> DNA
<213> Foot-and-mouth disease virus
<400> 211
agcaggtttc cccaactgac acaaaacgtg caacttgaaa ctccgcctgg tctttccagg 60
tctagagggg taacactttg tactgcgttt ggctccacgc tcgatccact ggcgagtgtt 120
agtaacagca ctgttgcttc gtagcggagc atgacggccg tgggaactcc tccttggtaa 180
caaggaccca cggggccaaa agccacgccc acacgggccc gtcatgtgtg caaccccagc 240
acggcgactt tactgcgaaa cccactttaa agtgacattg aaactggtac ccacacactg 300
gtgacaggct aaggatgccc ttcaggtacc ccgaggtaac acgcgacact cgggatctga 360
gaaggggact ggggcttcta taaaagcgct cggtttaaaa agcttctatg cctgaatagg 420
tgaccggagg tcggcacctt tcctttgcaa ttactgacca c 461
<210> 212
<211> 551
<212> DNA
<213> Encephalomyocarditis virus
<400> 212
cgttactggc cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc 60
caccatattg ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac 120
gagcattcct aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt 180
gaaggaagca gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg 240
caggcagcgg aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata 300
agatacacct gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga 360
aagagtcaaa tggctcccct caagcgtatt caacaagggg ctgaaggatg cccagaaggt 420
accccattgt atgggatctg atctggggcc tcggtgcaca tgcttttcat gtgtttagtc 480
gaggttaaaa aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca 540
cgatgataat a 551
<210> 213
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Flexible linker
<400> 213
ggaggtggcg ggtccggggg cgggggtagc ggtggcgggg gctcc 45
<210> 214
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Flexible linker
<400> 214
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15
<210> 215
<211> 535
<212> PRT
<213> Unknown
<220>
<223> Non-structural protein 1
<400> 215
Met Glu Lys Val His Val Asp Ile Glu Glu Asp Ser Pro Phe Leu Arg
1 5 10 15
Ala Leu Gln Arg Ser Phe Pro Gln Phe Glu Val Glu Ala Lys Gln Val
20 25 30
Thr Asp Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala Ser
35 40 45
Lys Leu Ile Glu Thr Glu Val Asp Pro Ser Asp Thr Ile Leu Asp Ile
50 55 60
Gly Ser Ala Pro Ala Arg Arg Met Tyr Ser Lys His Lys Tyr His Cys
65 70 75 80
Ile Cys Pro Met Arg Cys Ala Glu Asp Pro Asp Arg Leu Tyr Lys Tyr
85 90 95
Ala Thr Lys Leu Lys Lys Asn Cys Lys Glu Ile Thr Asp Lys Glu Leu
100 105 110
Asp Lys Lys Met Lys Glu Leu Ala Ala Val Met Ser Asp Pro Asp Leu
115 120 125
Glu Thr Glu Thr Met Cys Leu His Asp Asp Glu Ser Cys Arg Tyr Glu
130 135 140
Gly Gln Val Ala Val Tyr Gln Asp Val Tyr Ala Val Asp Gly Pro Thr
145 150 155 160
Ser Leu Tyr His Gln Ala Asn Lys Gly Val Arg Val Ala Tyr Trp Ile
165 170 175
Gly Phe Asp Thr Thr Pro Phe Met Phe Lys Asn Leu Ala Gly Ala Tyr
180 185 190
Pro Ser Tyr Ser Thr Asn Trp Ala Asp Glu Thr Val Leu Thr Ala Arg
195 200 205
Asn Ile Gly Leu Cys Ser Ser Asp Val Met Glu Arg Ser Arg Arg Gly
210 215 220
Met Ser Ile Leu Arg Lys Lys Tyr Leu Lys Pro Ser Asn Asn Val Leu
225 230 235 240
Phe Ser Val Gly Ser Thr Ile Tyr His Glu Lys Arg Asp Leu Leu Arg
245 250 255
Ser Trp His Leu Pro Ser Val Phe His Leu Arg Gly Lys Gln Asn Tyr
260 265 270
Thr Cys Arg Cys Glu Thr Ile Val Ser Cys Asp Gly Tyr Val Val Lys
275 280 285
Arg Ile Ala Ile Ser Pro Gly Leu Tyr Gly Lys Pro Ser Gly Tyr Ala
290 295 300
Ala Thr Met His Arg Glu Gly Phe Leu Cys Cys Lys Val Thr Asp Thr
305 310 315 320
Leu Asn Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Val Pro Ala
325 330 335
Thr Leu Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Asp Val Ser Ala
340 345 350
Asp Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val Val
355 360 365
Asn Gly Arg Thr Gln Arg Asn Thr Asn Thr Met Lys Asn Tyr Leu Leu
370 375 380
Pro Val Val Ala Gln Ala Phe Ala Arg Trp Ala Lys Glu Tyr Lys Glu
385 390 395 400
Asp Gln Glu Asp Glu Arg Pro Leu Gly Leu Arg Asp Arg Gln Leu Val
405 410 415
Met Gly Cys Cys Trp Ala Phe Arg Arg His Lys Ile Thr Ser Ile Tyr
420 425 430
Lys Arg Pro Asp Thr Gln Thr Ile Ile Lys Val Asn Ser Asp Phe His
435 440 445
Ser Phe Val Leu Pro Arg Ile Gly Ser Asn Thr Leu Glu Ile Gly Leu
450 455 460
Arg Thr Arg Ile Arg Lys Met Leu Glu Glu His Lys Glu Pro Ser Pro
465 470 475 480
Leu Ile Thr Ala Glu Asp Val Gln Glu Ala Lys Cys Ala Ala Asp Glu
485 490 495
Ala Lys Glu Val Arg Glu Ala Glu Glu Leu Arg Ala Ala Leu Pro Pro
500 505 510
Leu Ala Ala Asp Val Glu Glu Pro Thr Leu Glu Ala Asp Val Asp Leu
515 520 525
Met Leu Gln Glu Ala Gly Ala
530 535
<210> 216
<211> 1605
<212> DNA
<213> Unknown
<220>
<223> Non-structural protein 1
<400> 216
atggagaaag ttcacgttga catcgaggaa gacagcccat tcctcagagc tttgcagcgg 60
agcttcccgc agtttgaggt agaagccaag caggtcactg ataatgacca tgctaatgcc 120
agagcgtttt cgcatctggc ttcaaaactg atcgaaacgg aggtggaccc atccgacacg 180
atccttgaca ttggaagtgc gcccgcccgc agaatgtatt ctaagcacaa gtatcattgt 240
atctgtccga tgagatgtgc ggaagatccg gacagattgt ataagtatgc aactaagctg 300
aagaaaaact gtaaggaaat aactgataag gaattggaca agaaaatgaa ggagctggcc 360
gccgtcatga gcgaccctga cctggaaact gagactatgt gcctccacga cgacgagtcg 420
tgtcgctacg aagggcaagt cgctgtttac caggatgtat acgcggttga cggaccgaca 480
agtctctatc accaagccaa taagggagtt agagtcgcct actggatagg ctttgacacc 540
acccctttta tgtttaagaa cttggctgga gcatatccat catactctac caactgggcc 600
gacgaaaccg tgttaacggc tcgtaacata ggcctatgca gctctgacgt tatggagcgg 660
tcacgtagag ggatgtccat tcttagaaag aagtatttga aaccatccaa caatgttcta 720
ttctctgttg gctcgaccat ctaccacgag aagagggact tactgaggag ctggcacctg 780
ccgtctgtat ttcacttacg tggcaagcaa aattacacat gtcggtgtga gactatagtt 840
agttgcgacg ggtacgtcgt taaaagaata gctatcagtc caggcctgta tgggaagcct 900
tcaggctatg ctgctacgat gcaccgcgag ggattcttgt gctgcaaagt gacagacaca 960
ttgaacgggg agagggtctc ttttcccgtg tgcacgtatg tgccagctac attgtgtgac 1020
caaatgactg gcatactggc aacagatgtc agtgcggacg acgcgcaaaa actgctggtt 1080
gggctcaacc agcgtatagt cgtcaacggt cgcacccaga gaaacaccaa taccatgaaa 1140
aattaccttt tgcccgtagt ggcccaggca tttgctaggt gggcaaagga atataaggaa 1200
gatcaagaag atgaaaggcc actaggacta cgagatagac agttagtcat ggggtgttgt 1260
tgggctttta gaaggcacaa gataacatct atttataagc gcccggatac ccaaaccatc 1320
atcaaagtga acagcgattt ccactcattc gtgctgccca ggataggcag taacacattg 1380
gagatcgggc tgagaacaag aatcaggaaa atgttagagg agcacaagga gccgtcacct 1440
ctcattaccg ccgaggacgt acaagaagct aagtgcgcag ccgatgaggc taaggaggtg 1500
cgtgaagccg aggagttgcg cgcagctcta ccacctttgg cagctgatgt tgaggagccc 1560
actctggaag ccgatgtcga cttgatgtta caagaggctg gggcc 1605
<210> 217
<211> 1605
<212> RNA
<213> Unknown
<220>
<223> Non-structural protein 1
<400> 217
auggagaaag uucacguuga caucgaggaa gacagcccau uccucagagc uuugcagcgg 60
agcuucccgc aguuugaggu agaagccaag caggucacug auaaugacca ugcuaaugcc 120
agagcguuuu cgcaucuggc uucaaaacug aucgaaacgg agguggaccc auccgacacg 180
auccuugaca uuggaagugc gcccgcccgc agaauguauu cuaagcacaa guaucauugu 240
aucuguccga ugagaugugc ggaagauccg gacagauugu auaaguaugc aacuaagcug 300
aagaaaaacu guaaggaaau aacugauaag gaauuggaca agaaaaugaa ggagcuggcc 360
gccgucauga gcgacccuga ccuggaaacu gagacuaugu gccuccacga cgacgagucg 420
ugucgcuacg aagggcaagu cgcuguuuac caggauguau acgcgguuga cggaccgaca 480
agucucuauc accaagccaa uaagggaguu agagucgccu acuggauagg cuuugacacc 540
accccuuuua uguuuaagaa cuuggcugga gcauauccau cauacucuac caacugggcc 600
gacgaaaccg uguuaacggc ucguaacaua ggccuaugca gcucugacgu uauggagcgg 660
ucacguagag ggauguccau ucuuagaaag aaguauuuga aaccauccaa caauguucua 720
uucucuguug gcucgaccau cuaccacgag aagagggacu uacugaggag cuggcaccug 780
ccgucuguau uucacuuacg uggcaagcaa aauuacacau gucgguguga gacuauaguu 840
aguugcgacg gguacgucgu uaaaagaaua gcuaucaguc caggccugua ugggaagccu 900
ucaggcuaug cugcuacgau gcaccgcgag ggauucuugu gcugcaaagu gacagacaca 960
uugaacgggg agagggucuc uuuucccgug ugcacguaug ugccagcuac auugugugac 1020
caaaugacug gcauacuggc aacagauguc agugcggacg acgcgcaaaa acugcugguu 1080
gggcucaacc agcguauagu cgucaacggu cgcacccaga gaaacaccaa uaccaugaaa 1140
aauuaccuuu ugcccguagu ggcccaggca uuugcuaggu gggcaaagga auauaaggaa 1200
gaucaagaag augaaaggcc acuaggacua cgagauagac aguuagucau gggguguugu 1260
ugggcuuuua gaaggcacaa gauaacaucu auuuauaagc gcccggauac ccaaaccauc 1320
aucaaaguga acagcgauuu ccacucauuc gugcugccca ggauaggcag uaacacauug 1380
gagaucgggc ugagaacaag aaucaggaaa auguuagagg agcacaagga gccgucaccu 1440
cucauuaccg ccgaggacgu acaagaagcu aagugcgcag ccgaugaggc uaaggaggug 1500
cgugaagccg aggaguugcg cgcagcucua ccaccuuugg cagcugaugu ugaggagccc 1560
acucuggaag ccgaugucga cuugauguua caagaggcug gggcc 1605
<210> 218
<211> 794
<212> PRT
<213> Unknown
<220>
<223> Non-structural protein 2
<400> 218
Gly Ser Val Glu Thr Pro Arg Gly Leu Ile Lys Val Thr Ser Tyr Asp
1 5 10 15
Gly Glu Asp Lys Ile Gly Ser Tyr Ala Val Leu Ser Pro Gln Ala Val
20 25 30
Leu Lys Ser Glu Lys Leu Ser Cys Ile His Pro Leu Ala Glu Gln Val
35 40 45
Ile Val Ile Thr His Ser Gly Arg Lys Gly Arg Tyr Ala Val Glu Pro
50 55 60
Tyr His Gly Lys Val Val Val Pro Glu Gly His Ala Ile Pro Val Gln
65 70 75 80
Asp Phe Gln Ala Leu Ser Glu Ser Ala Thr Ile Val Tyr Asn Glu Arg
85 90 95
Glu Phe Val Asn Arg Tyr Leu His His Ile Ala Thr His Gly Gly Ala
100 105 110
Leu Asn Thr Asp Glu Glu Tyr Tyr Lys Thr Val Lys Pro Ser Glu His
115 120 125
Asp Gly Glu Tyr Leu Tyr Asp Ile Asp Arg Lys Gln Cys Val Lys Lys
130 135 140
Glu Leu Val Thr Gly Leu Gly Leu Thr Gly Glu Leu Val Asp Pro Pro
145 150 155 160
Phe His Glu Phe Ala Tyr Glu Ser Leu Arg Thr Arg Pro Ala Ala Pro
165 170 175
Tyr Gln Val Pro Thr Ile Gly Val Tyr Gly Val Pro Gly Ser Gly Lys
180 185 190
Ser Gly Ile Ile Lys Ser Ala Val Thr Lys Lys Asp Leu Val Val Ser
195 200 205
Ala Lys Lys Glu Asn Cys Ala Glu Ile Ile Arg Asp Val Lys Lys Met
210 215 220
Lys Gly Leu Asp Val Asn Ala Arg Thr Val Asp Ser Val Leu Leu Asn
225 230 235 240
Gly Cys Lys His Pro Val Glu Thr Leu Tyr Ile Asp Glu Ala Phe Ala
245 250 255
Cys His Ala Gly Thr Leu Arg Ala Leu Ile Ala Ile Ile Arg Pro Lys
260 265 270
Lys Ala Val Leu Cys Gly Asp Pro Lys Gln Cys Gly Phe Phe Asn Met
275 280 285
Met Cys Leu Lys Val His Phe Asn His Glu Ile Cys Thr Gln Val Phe
290 295 300
His Lys Ser Ile Ser Arg Arg Cys Thr Lys Ser Val Thr Ser Val Val
305 310 315 320
Ser Thr Leu Phe Tyr Asp Lys Lys Met Arg Thr Thr Asn Pro Lys Glu
325 330 335
Thr Lys Ile Val Ile Asp Thr Thr Gly Ser Thr Lys Pro Lys Gln Asp
340 345 350
Asp Leu Ile Leu Thr Cys Phe Arg Gly Trp Val Lys Gln Leu Gln Ile
355 360 365
Asp Tyr Lys Gly Asn Glu Ile Met Thr Ala Ala Ala Ser Gln Gly Leu
370 375 380
Thr Arg Lys Gly Val Tyr Ala Val Arg Tyr Lys Val Asn Glu Asn Pro
385 390 395 400
Leu Tyr Ala Pro Thr Ser Glu His Val Asn Val Leu Leu Thr Arg Thr
405 410 415
Glu Asp Arg Ile Val Trp Lys Thr Leu Ala Gly Asp Pro Trp Ile Lys
420 425 430
Thr Leu Thr Ala Lys Tyr Pro Gly Asn Phe Thr Ala Thr Ile Glu Glu
435 440 445
Trp Gln Ala Glu His Asp Ala Ile Met Arg His Ile Leu Glu Arg Pro
450 455 460
Asp Pro Thr Asp Val Phe Gln Asn Lys Ala Asn Val Cys Trp Ala Lys
465 470 475 480
Ala Leu Val Pro Val Leu Lys Thr Ala Gly Ile Asp Met Thr Thr Glu
485 490 495
Gln Trp Asn Thr Val Asp Tyr Phe Glu Thr Asp Lys Ala His Ser Ala
500 505 510
Glu Ile Val Leu Asn Gln Leu Cys Val Arg Phe Phe Gly Leu Asp Leu
515 520 525
Asp Ser Gly Leu Phe Ser Ala Pro Thr Val Pro Leu Ser Ile Arg Asn
530 535 540
Asn His Trp Asp Asn Ser Pro Ser Pro Asn Met Tyr Gly Leu Asn Lys
545 550 555 560
Glu Val Val Arg Gln Leu Ser Arg Arg Tyr Pro Gln Leu Pro Arg Ala
565 570 575
Val Ala Thr Gly Arg Val Tyr Asp Met Asn Thr Gly Thr Leu Arg Asn
580 585 590
Tyr Asp Pro Arg Ile Asn Leu Val Pro Val Asn Arg Arg Leu Pro His
595 600 605
Ala Leu Val Leu His His Asn Glu His Pro Gln Ser Asp Phe Ser Ser
610 615 620
Phe Val Ser Lys Leu Lys Gly Arg Thr Val Leu Val Val Gly Glu Lys
625 630 635 640
Leu Ser Val Pro Gly Lys Met Val Asp Trp Leu Ser Asp Arg Pro Glu
645 650 655
Ala Thr Phe Arg Ala Arg Leu Asp Leu Gly Ile Pro Gly Asp Val Pro
660 665 670
Lys Tyr Asp Ile Ile Phe Val Asn Val Arg Thr Pro Tyr Lys Tyr His
675 680 685
His Tyr Gln Gln Cys Glu Asp His Ala Ile Lys Leu Ser Met Leu Thr
690 695 700
Lys Lys Ala Cys Leu His Leu Asn Pro Gly Gly Thr Cys Val Ser Ile
705 710 715 720
Gly Tyr Gly Tyr Ala Asp Arg Ala Ser Glu Ser Ile Ile Gly Ala Ile
725 730 735
Ala Arg Gln Phe Lys Phe Ser Arg Val Cys Lys Pro Lys Ser Ser Leu
740 745 750
Glu Glu Thr Glu Val Leu Phe Val Phe Ile Gly Tyr Asp Arg Lys Ala
755 760 765
Arg Thr His Asn Ser Tyr Lys Leu Ser Ser Thr Leu Thr Asn Ile Tyr
770 775 780
Thr Gly Ser Arg Leu His Glu Ala Gly Cys
785 790
<210> 219
<211> 2382
<212> DNA
<213> Unknown
<220>
<223> Non-structural protein 2
<400> 219
ggctcagtgg agacacctcg tggcttgata aaggttacca gctacgatgg cgaggacaag 60
atcggctctt acgctgtgct ttctccgcag gctgtactca agagtgaaaa attatcttgc 120
atccaccctc tcgctgaaca agtcatagtg ataacacact ctggccgaaa agggcgttat 180
gccgtggaac cataccatgg taaagtagtg gtgccagagg gacatgcaat acccgtccag 240
gactttcaag ctctgagtga aagtgccacc attgtgtaca acgaacgtga gttcgtaaac 300
aggtacctgc accatattgc cacacatgga ggagcgctga acactgatga agaatattac 360
aaaactgtca agcccagcga gcacgacggc gaatacctgt acgacatcga caggaaacag 420
tgcgtcaaga aagaactagt cactgggcta gggctcacag gcgagctggt ggatcctccc 480
ttccatgaat tcgcctacga gagtctgaga acacgaccag ccgctcctta ccaagtacca 540
accatagggg tgtatggcgt gccaggatca ggcaagtctg gcatcattaa aagcgcagtc 600
accaaaaaag atctagtggt gagcgccaag aaagaaaact gtgcagaaat tataagggac 660
gtcaagaaaa tgaaagggct ggacgtcaat gccagaactg tggactcagt gctcttgaat 720
ggatgcaaac accccgtaga gaccctgtat attgacgaag cttttgcttg tcatgcaggt 780
actctcagag cgctcatagc cattataaga cctaaaaagg cagtgctctg cggggatccc 840
aaacagtgcg gtttttttaa catgatgtgc ctgaaagtgc attttaacca cgagatttgc 900
acacaagtct tccacaaaag catctctcgc cgttgcacta aatctgtgac ttcggtcgtc 960
tcaaccttgt tttacgacaa aaaaatgaga acgacgaatc cgaaagagac taagattgtg 1020
attgacacta ccggcagtac caaacctaag caggacgatc tcattctcac ttgtttcaga 1080
gggtgggtga agcagttgca aatagattac aaaggcaacg aaataatgac ggcagctgcc 1140
tctcaagggc tgacccgtaa aggtgtgtat gccgttcggt acaaggtgaa tgaaaatcct 1200
ctgtacgcac ccacctcaga acatgtgaac gtcctactga cccgcacgga ggaccgcatc 1260
gtgtggaaaa cactagccgg cgacccatgg ataaaaacac tgactgccaa gtaccctggg 1320
aatttcactg ccacgataga ggagtggcaa gcagagcatg atgccatcat gaggcacatc 1380
ttggagagac cggaccctac cgacgtcttc cagaataagg caaacgtgtg ttgggccaag 1440
gctttagtgc cggtgctgaa gaccgctggc atagacatga ccactgaaca atggaacact 1500
gtggattatt ttgaaacgga caaagctcac tcagcagaga tagtattgaa ccaactatgc 1560
gtgaggttct ttggactcga tctggactcc ggtctatttt ctgcacccac tgttccgtta 1620
tccattagga ataatcactg ggataactcc ccgtcgccta acatgtacgg gctgaataaa 1680
gaagtggtcc gtcagctctc tcgcaggtac ccacaactgc ctcgggcagt tgccactgga 1740
agagtctatg acatgaacac tggtacactg cgcaattatg atccgcgcat aaacctagta 1800
cctgtaaaca gaagactgcc tcatgcttta gtcctccacc ataatgaaca cccacagagt 1860
gacttttctt cattcgtcag caaattgaag ggcagaactg tcctggtggt cggggaaaag 1920
ttgtccgtcc caggcaaaat ggttgactgg ttgtcagacc ggcctgaggc taccttcaga 1980
gctcggctgg atttaggcat cccaggtgat gtgcccaaat atgacataat atttgttaat 2040
gtgaggaccc catataaata ccatcactat cagcagtgtg aagaccatgc cattaagctt 2100
agcatgttga ccaagaaagc ttgtctgcat ctgaatcccg gcggaacctg tgtcagcata 2160
ggttatggtt acgctgacag ggccagcgaa agcatcattg gtgctatagc gcggcagttc 2220
aagttttccc gggtatgcaa accgaaatcc tcacttgaag agacggaagt tctgtttgta 2280
ttcattgggt acgatcgcaa ggcccgtacg cacaattctt acaagctttc atcaaccttg 2340
accaacattt atacaggttc cagactccac gaagccggat gt 2382
<210> 220
<211> 2382
<212> RNA
<213> Unknown
<220>
<223> Non-structural protein 2
<400> 220
ggcucagugg agacaccucg uggcuugaua aagguuacca gcuacgaugg cgaggacaag 60
aucggcucuu acgcugugcu uucuccgcag gcuguacuca agagugaaaa auuaucuugc 120
auccacccuc ucgcugaaca agucauagug auaacacacu cuggccgaaa agggcguuau 180
gccguggaac cauaccaugg uaaaguagug gugccagagg gacaugcaau acccguccag 240
gacuuucaag cucugaguga aagugccacc auuguguaca acgaacguga guucguaaac 300
agguaccugc accauauugc cacacaugga ggagcgcuga acacugauga agaauauuac 360
aaaacuguca agcccagcga gcacgacggc gaauaccugu acgacaucga caggaaacag 420
ugcgucaaga aagaacuagu cacugggcua gggcucacag gcgagcuggu ggauccuccc 480
uuccaugaau ucgccuacga gagucugaga acacgaccag ccgcuccuua ccaaguacca 540
accauagggg uguauggcgu gccaggauca ggcaagucug gcaucauuaa aagcgcaguc 600
accaaaaaag aucuaguggu gagcgccaag aaagaaaacu gugcagaaau uauaagggac 660
gucaagaaaa ugaaagggcu ggacgucaau gccagaacug uggacucagu gcucuugaau 720
ggaugcaaac accccguaga gacccuguau auugacgaag cuuuugcuug ucaugcaggu 780
acucucagag cgcucauagc cauuauaaga ccuaaaaagg cagugcucug cggggauccc 840
aaacagugcg guuuuuuuaa caugaugugc cugaaagugc auuuuaacca cgagauuugc 900
acacaagucu uccacaaaag caucucucgc cguugcacua aaucugugac uucggucguc 960
ucaaccuugu uuuacgacaa aaaaaugaga acgacgaauc cgaaagagac uaagauugug 1020
auugacacua ccggcaguac caaaccuaag caggacgauc ucauucucac uuguuucaga 1080
ggguggguga agcaguugca aauagauuac aaaggcaacg aaauaaugac ggcagcugcc 1140
ucucaagggc ugacccguaa agguguguau gccguucggu acaaggugaa ugaaaauccu 1200
cuguacgcac ccaccucaga acaugugaac guccuacuga cccgcacgga ggaccgcauc 1260
guguggaaaa cacuagccgg cgacccaugg auaaaaacac ugacugccaa guacccuggg 1320
aauuucacug ccacgauaga ggaguggcaa gcagagcaug augccaucau gaggcacauc 1380
uuggagagac cggacccuac cgacgucuuc cagaauaagg caaacgugug uugggccaag 1440
gcuuuagugc cggugcugaa gaccgcuggc auagacauga ccacugaaca auggaacacu 1500
guggauuauu uugaaacgga caaagcucac ucagcagaga uaguauugaa ccaacuaugc 1560
gugagguucu uuggacucga ucuggacucc ggucuauuuu cugcacccac uguuccguua 1620
uccauuagga auaaucacug ggauaacucc ccgucgccua acauguacgg gcugaauaaa 1680
gaaguggucc gucagcucuc ucgcagguac ccacaacugc cucgggcagu ugccacugga 1740
agagucuaug acaugaacac ugguacacug cgcaauuaug auccgcgcau aaaccuagua 1800
ccuguaaaca gaagacugcc ucaugcuuua guccuccacc auaaugaaca cccacagagu 1860
gacuuuucuu cauucgucag caaauugaag ggcagaacug uccugguggu cggggaaaag 1920
uuguccgucc caggcaaaau gguugacugg uugucagacc ggccugaggc uaccuucaga 1980
gcucggcugg auuuaggcau cccaggugau gugcccaaau augacauaau auuuguuaau 2040
gugaggaccc cauauaaaua ccaucacuau cagcagugug aagaccaugc cauuaagcuu 2100
agcauguuga ccaagaaagc uugucugcau cugaaucccg gcggaaccug ugucagcaua 2160
gguuaugguu acgcugacag ggccagcgaa agcaucauug gugcuauagc gcggcaguuc 2220
aaguuuuccc ggguaugcaa accgaaaucc ucacuugaag agacggaagu ucuguuugua 2280
uucauugggu acgaucgcaa ggcccguacg cacaauucuu acaagcuuuc aucaaccuug 2340
accaacauuu auacagguuc cagacuccac gaagccggau gu 2382
<210> 221
<211> 556
<212> PRT
<213> Unknown
<220>
<223> Non-structural protein 3
<400> 221
Ala Pro Ser Tyr His Val Val Arg Gly Asp Ile Ala Thr Ala Thr Glu
1 5 10 15
Gly Val Ile Ile Asn Ala Ala Asn Ser Lys Gly Gln Pro Gly Gly Gly
20 25 30
Val Cys Gly Ala Leu Tyr Lys Lys Phe Pro Glu Ser Phe Asp Leu Gln
35 40 45
Pro Ile Glu Val Gly Lys Ala Arg Leu Val Lys Gly Ala Ala Lys His
50 55 60
Ile Ile His Ala Val Gly Pro Asn Phe Asn Lys Val Ser Glu Val Glu
65 70 75 80
Gly Asp Lys Gln Leu Ala Glu Ala Tyr Glu Ser Ile Ala Lys Ile Val
85 90 95
Asn Asp Asn Asn Tyr Lys Ser Val Ala Ile Pro Leu Leu Ser Thr Gly
100 105 110
Ile Phe Ser Gly Asn Lys Asp Arg Leu Thr Gln Ser Leu Asn His Leu
115 120 125
Leu Thr Ala Leu Asp Thr Thr Asp Ala Asp Val Ala Ile Tyr Cys Arg
130 135 140
Asp Lys Lys Trp Glu Met Thr Leu Lys Glu Ala Val Ala Arg Arg Glu
145 150 155 160
Ala Val Glu Glu Ile Cys Ile Ser Asp Asp Ser Ser Val Thr Glu Pro
165 170 175
Asp Ala Glu Leu Val Arg Val His Pro Lys Ser Ser Leu Ala Gly Arg
180 185 190
Lys Gly Tyr Ser Thr Ser Asp Gly Lys Thr Phe Ser Tyr Leu Glu Gly
195 200 205
Thr Lys Phe His Gln Ala Ala Lys Asp Ile Ala Glu Ile Asn Ala Met
210 215 220
Trp Pro Val Ala Thr Glu Ala Asn Glu Gln Val Cys Met Tyr Ile Leu
225 230 235 240
Gly Glu Ser Met Ser Ser Ile Arg Ser Lys Cys Pro Val Glu Glu Ser
245 250 255
Glu Ala Ser Thr Pro Pro Ser Thr Leu Pro Cys Leu Cys Ile His Ala
260 265 270
Met Thr Pro Glu Arg Val Gln Arg Leu Lys Ala Ser Arg Pro Glu Gln
275 280 285
Ile Thr Val Cys Ser Ser Phe Pro Leu Pro Lys Tyr Arg Ile Thr Gly
290 295 300
Val Gln Lys Ile Gln Cys Ser Gln Pro Ile Leu Phe Ser Pro Lys Val
305 310 315 320
Pro Ala Tyr Ile His Pro Arg Lys Tyr Leu Val Glu Thr Pro Pro Val
325 330 335
Asp Glu Thr Pro Glu Pro Ser Ala Glu Asn Gln Ser Thr Glu Gly Thr
340 345 350
Pro Glu Gln Pro Pro Leu Ile Thr Glu Asp Glu Thr Arg Thr Arg Thr
355 360 365
Pro Glu Pro Ile Ile Ile Glu Glu Glu Glu Glu Asp Ser Ile Ser Leu
370 375 380
Leu Ser Asp Gly Pro Thr His Gln Val Leu Gln Val Glu Ala Asp Ile
385 390 395 400
His Gly Pro Pro Ser Val Ser Ser Ser Ser Trp Ser Ile Pro His Ala
405 410 415
Ser Asp Phe Asp Val Asp Ser Leu Ser Ile Leu Asp Thr Leu Glu Gly
420 425 430
Ala Ser Val Thr Ser Gly Ala Thr Ser Ala Glu Thr Asn Ser Tyr Phe
435 440 445
Ala Lys Ser Met Glu Phe Leu Ala Arg Pro Val Pro Ala Pro Arg Thr
450 455 460
Val Phe Arg Asn Pro Pro His Pro Ala Pro Arg Thr Arg Thr Pro Ser
465 470 475 480
Leu Ala Pro Ser Arg Ala Cys Ser Arg Thr Ser Leu Val Ser Thr Pro
485 490 495
Pro Gly Val Asn Arg Val Ile Thr Arg Glu Glu Leu Glu Ala Leu Thr
500 505 510
Pro Ser Arg Thr Pro Ser Arg Ser Val Ser Arg Thr Ser Leu Val Ser
515 520 525
Asn Pro Pro Gly Val Asn Arg Val Ile Thr Arg Glu Glu Phe Glu Ala
530 535 540
Phe Val Ala Gln Gln Gln Arg Phe Asp Ala Gly Ala
545 550 555
<210> 222
<211> 1671
<212> DNA
<213> Unknown
<220>
<223> Non-structural protein 3
<400> 222
gcaccctcat atcatgtggt gcgaggggat attgccacgg ccaccgaagg agtgattata 60
aatgctgcta acagcaaagg acaacctggc ggaggggtgt gcggagcgct gtataagaaa 120
ttcccggaaa gcttcgattt acagccgatc gaagtaggaa aagcgcgact ggtcaaaggt 180
gcagctaaac atatcattca tgccgtagga ccaaacttca acaaagtttc ggaggttgaa 240
ggtgacaaac agttggcaga ggcttatgag tccatcgcta agattgtcaa cgataacaat 300
tacaagtcag tagcgattcc actgttgtcc accggcatct tttccgggaa caaagatcga 360
ctaacccaat cattgaacca tttgctgaca gctttagaca ccactgatgc agatgtagcc 420
atatactgca gggacaagaa atgggaaatg actctcaagg aagcagtggc taggagagaa 480
gcagtggagg agatatgcat atccgacgac tcttcagtga cagaacctga tgcagagctg 540
gtgagggtgc atccgaagag ttctttggct ggaaggaagg gctacagcac aagcgatggc 600
aaaactttct catatttgga agggaccaag tttcaccagg cggccaagga tatagcagaa 660
attaatgcca tgtggcccgt tgcaacggag gccaatgagc aggtatgcat gtatatcctc 720
ggagaaagca tgagcagtat taggtcgaaa tgccccgtcg aagagtcgga agcctccaca 780
ccacctagca cgctgccttg cttgtgcatc catgccatga ctccagaaag agtacagcgc 840
ctaaaagcct cacgtccaga acaaattact gtgtgctcat cctttccatt gccgaagtat 900
agaatcactg gtgtgcagaa gatccaatgc tcccagccta tattgttctc accgaaagtg 960
cctgcgtata ttcatccaag gaagtatctc gtggaaacac caccggtaga cgagactccg 1020
gagccatcgg cagagaacca atccacagag gggacacctg aacaaccacc acttataacc 1080
gaggatgaga ccaggactag aacgcctgag ccgatcatca tcgaagagga agaagaggat 1140
agcataagtt tgctgtcaga tggcccgacc caccaggtgc tgcaagtcga ggcagacatt 1200
cacgggccgc cctctgtatc tagctcatcc tggtccattc ctcatgcatc cgactttgat 1260
gtggacagtt tatccatact tgacaccctg gagggagcta gcgtgaccag cggggcaacg 1320
tcagccgaga ctaactctta cttcgcaaag agtatggagt ttctggcgcg accggtgcct 1380
gcgcctcgaa cagtattcag gaaccctcca catcccgctc cgcgcacaag aacaccgtca 1440
cttgcaccca gcagggcctg ctcgagaacc agcctagttt ccaccccgcc aggcgtgaat 1500
agggtgatca ctagagagga gctcgaggcg cttaccccgt cacgcactcc tagcaggtcg 1560
gtctcgagaa ccagcctggt ctccaacccg ccaggcgtaa atagggtgat tacaagagag 1620
gagtttgagg cgttcgtagc acaacaacaa tgacggtttg atgcgggtgc a 1671
<210> 223
<211> 1671
<212> RNA
<213> Unknown
<220>
<223> Non-structural protein 3
<400> 223
gcacccucau aucauguggu gcgaggggau auugccacgg ccaccgaagg agugauuaua 60
aaugcugcua acagcaaagg acaaccuggc ggaggggugu gcggagcgcu guauaagaaa 120
uucccggaaa gcuucgauuu acagccgauc gaaguaggaa aagcgcgacu ggucaaaggu 180
gcagcuaaac auaucauuca ugccguagga ccaaacuuca acaaaguuuc ggagguugaa 240
ggugacaaac aguuggcaga ggcuuaugag uccaucgcua agauugucaa cgauaacaau 300
uacaagucag uagcgauucc acuguugucc accggcaucu uuuccgggaa caaagaucga 360
cuaacccaau cauugaacca uuugcugaca gcuuuagaca ccacugaugc agauguagcc 420
auauacugca gggacaagaa augggaaaug acucucaagg aagcaguggc uaggagagaa 480
gcaguggagg agauaugcau auccgacgac ucuucaguga cagaaccuga ugcagagcug 540
gugagggugc auccgaagag uucuuuggcu ggaaggaagg gcuacagcac aagcgauggc 600
aaaacuuucu cauauuugga agggaccaag uuucaccagg cggccaagga uauagcagaa 660
auuaaugcca uguggcccgu ugcaacggag gccaaugagc agguaugcau guauauccuc 720
ggagaaagca ugagcaguau uaggucgaaa ugccccgucg aagagucgga agccuccaca 780
ccaccuagca cgcugccuug cuugugcauc caugccauga cuccagaaag aguacagcgc 840
cuaaaagccu cacguccaga acaaauuacu gugugcucau ccuuuccauu gccgaaguau 900
agaaucacug gugugcagaa gauccaaugc ucccagccua uauuguucuc accgaaagug 960
ccugcguaua uucauccaag gaaguaucuc guggaaacac caccgguaga cgagacuccg 1020
gagccaucgg cagagaacca auccacagag gggacaccug aacaaccacc acuuauaacc 1080
gaggaugaga ccaggacuag aacgccugag ccgaucauca ucgaagagga agaagaggau 1140
agcauaaguu ugcugucaga uggcccgacc caccaggugc ugcaagucga ggcagacauu 1200
cacgggccgc ccucuguauc uagcucaucc ugguccauuc cucaugcauc cgacuuugau 1260
guggacaguu uauccauacu ugacacccug gagggagcua gcgugaccag cggggcaacg 1320
ucagccgaga cuaacucuua cuucgcaaag aguauggagu uucuggcgcg accggugccu 1380
gcgccucgaa caguauucag gaacccucca caucccgcuc cgcgcacaag aacaccguca 1440
cuugcaccca gcagggccug cucgagaacc agccuaguuu ccaccccgcc aggcgugaau 1500
agggugauca cuagagagga gcucgaggcg cuuaccccgu cacgcacucc uagcaggucg 1560
gucucgagaa ccagccuggu cuccaacccg ccaggcguaa auagggugau uacaagagag 1620
gaguuugagg cguucguagc acaacaacaa ugacgguuug augcgggugc a 1671
<210> 224
<211> 607
<212> PRT
<213> Unknown
<220>
<223> Non-structural protein 4
<400> 224
Tyr Ile Phe Ser Ser Asp Thr Gly Gln Gly His Leu Gln Gln Lys Ser
1 5 10 15
Val Arg Gln Thr Val Leu Ser Glu Val Val Leu Glu Arg Thr Glu Leu
20 25 30
Glu Ile Ser Tyr Ala Pro Arg Leu Asp Gln Glu Lys Glu Glu Leu Leu
35 40 45
Arg Lys Lys Leu Gln Leu Asn Pro Thr Pro Ala Asn Arg Ser Arg Tyr
50 55 60
Gln Ser Arg Lys Val Glu Asn Met Lys Ala Ile Thr Ala Arg Arg Ile
65 70 75 80
Leu Gln Gly Leu Gly His Tyr Leu Lys Ala Glu Gly Lys Val Glu Cys
85 90 95
Tyr Arg Thr Leu His Pro Val Pro Leu Tyr Ser Ser Ser Val Asn Arg
100 105 110
Ala Phe Ser Ser Pro Lys Val Ala Val Glu Ala Cys Asn Ala Met Leu
115 120 125
Lys Glu Asn Phe Pro Thr Val Ala Ser Tyr Cys Ile Ile Pro Glu Tyr
130 135 140
Asp Ala Tyr Leu Asp Met Val Asp Gly Ala Ser Cys Cys Leu Asp Thr
145 150 155 160
Ala Ser Phe Cys Pro Ala Lys Leu Arg Ser Phe Pro Lys Lys His Ser
165 170 175
Tyr Leu Glu Pro Thr Ile Arg Ser Ala Val Pro Ser Ala Ile Gln Asn
180 185 190
Thr Leu Gln Asn Val Leu Ala Ala Ala Thr Lys Arg Asn Cys Asn Val
195 200 205
Thr Gln Met Arg Glu Leu Pro Val Leu Asp Ser Ala Ala Phe Asn Val
210 215 220
Glu Cys Phe Lys Lys Tyr Ala Cys Asn Asn Glu Tyr Trp Glu Thr Phe
225 230 235 240
Lys Glu Asn Pro Ile Arg Leu Thr Glu Glu Asn Val Val Asn Tyr Ile
245 250 255
Thr Lys Leu Lys Gly Pro Lys Ala Ala Ala Leu Phe Ala Lys Thr His
260 265 270
Asn Leu Asn Met Leu Gln Asp Ile Pro Met Asp Arg Phe Val Met Asp
275 280 285
Leu Lys Arg Asp Val Lys Val Thr Pro Gly Thr Lys His Thr Glu Glu
290 295 300
Arg Pro Lys Val Gln Val Ile Gln Ala Ala Asp Pro Leu Ala Thr Ala
305 310 315 320
Tyr Leu Cys Gly Ile His Arg Glu Leu Val Arg Arg Leu Asn Ala Val
325 330 335
Leu Leu Pro Asn Ile His Thr Leu Phe Asp Met Ser Ala Glu Asp Phe
340 345 350
Asp Ala Ile Ile Ala Glu His Phe Gln Pro Gly Asp Cys Val Leu Glu
355 360 365
Thr Asp Ile Ala Ser Phe Asp Lys Ser Glu Asp Asp Ala Met Ala Leu
370 375 380
Thr Ala Leu Met Ile Leu Glu Asp Leu Gly Val Asp Ala Glu Leu Leu
385 390 395 400
Thr Leu Ile Glu Ala Ala Phe Gly Glu Ile Ser Ser Ile His Leu Pro
405 410 415
Thr Lys Thr Lys Phe Lys Phe Gly Ala Met Met Lys Ser Gly Met Phe
420 425 430
Leu Thr Leu Phe Val Asn Thr Val Ile Asn Ile Val Ile Ala Ser Arg
435 440 445
Val Leu Arg Glu Arg Leu Thr Gly Ser Pro Cys Ala Ala Phe Ile Gly
450 455 460
Asp Asp Asn Ile Val Lys Gly Val Lys Ser Asp Lys Leu Met Ala Asp
465 470 475 480
Arg Cys Ala Thr Trp Leu Asn Met Glu Val Lys Ile Ile Asp Ala Val
485 490 495
Val Gly Glu Lys Ala Pro Tyr Phe Cys Gly Gly Phe Ile Leu Cys Asp
500 505 510
Ser Val Thr Gly Thr Ala Cys Arg Val Ala Asp Pro Leu Lys Arg Leu
515 520 525
Phe Lys Leu Gly Lys Pro Leu Ala Ala Asp Asp Glu His Asp Asp Asp
530 535 540
Arg Arg Arg Ala Leu His Glu Glu Ser Thr Arg Trp Asn Arg Val Gly
545 550 555 560
Ile Leu Ser Glu Leu Cys Lys Ala Val Glu Ser Arg Tyr Glu Thr Val
565 570 575
Gly Thr Ser Ile Ile Val Met Ala Met Thr Thr Leu Ala Ser Ser Val
580 585 590
Lys Ser Phe Ser Tyr Leu Arg Gly Ala Pro Ile Thr Leu Tyr Gly
595 600 605
<210> 225
<211> 1821
<212> DNA
<213> Unknown
<220>
<223> Non-structural protein 4
<400> 225
tacatctttt cctccgacac cggtcaaggg catttacaac aaaaatcagt aaggcaaacg 60
gtgctatccg aagtggtgtt ggagaggacc gaattggaga tttcgtatgc cccgcgcctc 120
gaccaagaaa aagaagaatt actacgcaag aaattacagt taaatcccac acctgctaac 180
agaagcagat accagtccag gaaggtggag aacatgaaag ccataacagc tagacgtatt 240
ctgcaaggcc tagggcatta tttgaaggca gaaggaaaag tggagtgcta ccgaaccctg 300
catcctgttc ctttgtattc atctagtgtg aaccgtgcct tttcaagccc caaggtcgca 360
gtggaagcct gtaacgccat gttgaaagag aactttccga ctgtggcttc ttactgtatt 420
attccagagt acgatgccta tttggacatg gttgacggag cttcatgctg cttagacact 480
gccagttttt gccctgcaaa gctgcgcagc tttccaaaga aacactccta tttggaaccc 540
acaatacgat cggcagtgcc ttcagcgatc cagaacacgc tccagaacgt cctggcagct 600
gccacaaaaa gaaattgcaa tgtcacgcaa atgagagaat tgcccgtatt ggattcggcg 660
gcctttaatg tggaatgctt caagaaatat gcgtgtaata atgaatattg ggaaacgttt 720
aaagaaaacc ccatcaggct tactgaagaa aacgtggtaa attacattac caaattaaaa 780
ggaccaaaag ctgctgctct ttttgcgaag acacataatt tgaatatgtt gcaggacata 840
ccaatggaca ggtttgtaat ggacttaaag agagacgtga aagtgactcc aggaacaaaa 900
catactgaag aacggcccaa ggtacaggtg atccaggctg ccgatccgct agcaacagcg 960
tatctgtgcg gaatccaccg agagctggtt aggagattaa atgcggtcct gcttccgaac 1020
attcatacac tgtttgatat gtcggctgaa gactttgacg ctattatagc cgagcacttc 1080
cagcctgggg attgtgttct ggaaactgac atcgcgtcgt ttgataaaag tgaggacgac 1140
gccatggctc tgaccgcgtt aatgattctg gaagacttag gtgtggacgc agagctgttg 1200
acgctgattg aggcggcttt cggcgaaatt tcatcaatac atttgcccac taaaactaaa 1260
tttaaattcg gagccatgat gaaatctgga atgttcctca cactgtttgt gaacacagtc 1320
attaacattg taatcgcaag cagagtgttg agagaacggc taaccggatc accatgtgca 1380
gcattcattg gagatgacaa tatcgtgaaa ggagtcaaat cggacaaatt aatggcagac 1440
aggtgcgcca cctggttgaa tatggaagtc aagattatag atgctgtggt gggcgagaaa 1500
gcgccttatt tctgtggagg gtttattttg tgtgactccg tgaccggcac agcgtgccgt 1560
gtggcagacc ccctaaaaag gctgtttaag cttggcaaac ctctggcagc agacgatgaa 1620
catgatgatg acaggagaag ggcattgcat gaagagtcaa cacgctggaa ccgagtgggt 1680
attctttcag agctgtgcaa ggcagtagaa tcaaggtatg aaaccgtagg aacttccatc 1740
atagttatgg ccatgactac tctagctagc agtgttaaat cattcagcta cctgagaggg 1800
gcccctataa ctctctacgg c 1821
<210> 226
<211> 1821
<212> RNA
<213> Unknown
<220>
<223> Non-structural protein 4
<400> 226
uacaucuuuu ccuccgacac cggucaaggg cauuuacaac aaaaaucagu aaggcaaacg 60
gugcuauccg aagugguguu ggagaggacc gaauuggaga uuucguaugc cccgcgccuc 120
gaccaagaaa aagaagaauu acuacgcaag aaauuacagu uaaaucccac accugcuaac 180
agaagcagau accaguccag gaagguggag aacaugaaag ccauaacagc uagacguauu 240
cugcaaggcc uagggcauua uuugaaggca gaaggaaaag uggagugcua ccgaacccug 300
cauccuguuc cuuuguauuc aucuagugug aaccgugccu uuucaagccc caaggucgca 360
guggaagccu guaacgccau guugaaagag aacuuuccga cuguggcuuc uuacuguauu 420
auuccagagu acgaugccua uuuggacaug guugacggag cuucaugcug cuuagacacu 480
gccaguuuuu gcccugcaaa gcugcgcagc uuuccaaaga aacacuccua uuuggaaccc 540
acaauacgau cggcagugcc uucagcgauc cagaacacgc uccagaacgu ccuggcagcu 600
gccacaaaaa gaaauugcaa ugucacgcaa augagagaau ugcccguauu ggauucggcg 660
gccuuuaaug uggaaugcuu caagaaauau gcguguaaua augaauauug ggaaacguuu 720
aaagaaaacc ccaucaggcu uacugaagaa aacgugguaa auuacauuac caaauuaaaa 780
ggaccaaaag cugcugcucu uuuugcgaag acacauaauu ugaauauguu gcaggacaua 840
ccaauggaca gguuuguaau ggacuuaaag agagacguga aagugacucc aggaacaaaa 900
cauacugaag aacggcccaa gguacaggug auccaggcug ccgauccgcu agcaacagcg 960
uaucugugcg gaauccaccg agagcugguu aggagauuaa augcgguccu gcuuccgaac 1020
auucauacac uguuugauau gucggcugaa gacuuugacg cuauuauagc cgagcacuuc 1080
cagccugggg auuguguucu ggaaacugac aucgcgucgu uugauaaaag ugaggacgac 1140
gccauggcuc ugaccgcguu aaugauucug gaagacuuag guguggacgc agagcuguug 1200
acgcugauug aggcggcuuu cggcgaaauu ucaucaauac auuugcccac uaaaacuaaa 1260
uuuaaauucg gagccaugau gaaaucugga auguuccuca cacuguuugu gaacacaguc 1320
auuaacauug uaaucgcaag cagaguguug agagaacggc uaaccggauc accaugugca 1380
gcauucauug gagaugacaa uaucgugaaa ggagucaaau cggacaaauu aauggcagac 1440
aggugcgcca ccugguugaa uauggaaguc aagauuauag augcuguggu gggcgagaaa 1500
gcgccuuauu ucuguggagg guuuauuuug ugugacuccg ugaccggcac agcgugccgu 1560
guggcagacc cccuaaaaag gcuguuuaag cuuggcaaac cucuggcagc agacgaugaa 1620
caugaugaug acaggagaag ggcauugcau gaagagucaa cacgcuggaa ccgagugggu 1680
auucuuucag agcugugcaa ggcaguagaa ucaagguaug aaaccguagg aacuuccauc 1740
auaguuaugg ccaugacuac ucuagcuagc aguguuaaau cauucagcua ccugagaggg 1800
gccccuauaa cucucuacgg c 1821
<210> 227
<211> 44
<212> RNA
<213> Artificial Sequence
<220>
<223> 5'UTR
<400> 227
augggcggcg caugagagaa gcccagacca auuaccuacc caaa 44
<210> 228
<211> 142
<212> RNA
<213> Artificial Sequence
<220>
<223> 3'UTR
<400> 228
aauuggcaag cugcuuacau agaacucgcg gcgauuggca ugccgccuua aaauuuuuau 60
uuuauuuuuc uuuucuuuuc cgaaucggau uuuguuuuua auauuucaaa aaaaaaaaaa 120
aaaaaaaaaa aaaaaaaaaa aa 142
<210> 229
<211> 7587
<212> RNA
<213> Artificial Sequence
<220>
<223> RNA construct
<400> 229
uaauacgacu cacuauagau gggcggcgca ugagagaagc ccagaccaau uaccuaccca 60
aaauggagaa aguucacguu gacaucgagg aagacagccc auuccucaga gcuuugcagc 120
ggagcuuccc gcaguuugag guagaagcca agcaggucac ugauaaugac caugcuaaug 180
ccagagcguu uucgcaucug gcuucaaaac ugaucgaaac ggagguggac ccauccgaca 240
cgauccuuga cauuggaagu gcgcccgccc gcagaaugua uucuaagcac aaguaucauu 300
guaucugucc gaugagaugu gcggaagauc cggacagauu guauaaguau gcaacuaagc 360
ugaagaaaaa cuguaaggaa auaacugaua aggaauugga caagaaaaug aaggagcugg 420
ccgccgucau gagcgacccu gaccuggaaa cugagacuau gugccuccac gacgacgagu 480
cgugucgcua cgaagggcaa gucgcuguuu accaggaugu auacgcgguu gacggaccga 540
caagucucua ucaccaagcc aauaagggag uuagagucgc cuacuggaua ggcuuugaca 600
ccaccccuuu uauguuuaag aacuuggcug gagcauaucc aucauacucu accaacuggg 660
ccgacgaaac cguguuaacg gcucguaaca uaggccuaug cagcucugac guuauggagc 720
ggucacguag agggaugucc auucuuagaa agaaguauuu gaaaccaucc aacaauguuc 780
uauucucugu uggcucgacc aucuaccacg agaagaggga cuuacugagg agcuggcacc 840
ugccgucugu auuucacuua cguggcaagc aaaauuacac augucggugu gagacuauag 900
uuaguugcga cggguacguc guuaaaagaa uagcuaucag uccaggccug uaugggaagc 960
cuucaggcua ugcugcuacg augcaccgcg agggauucuu gugcugcaaa gugacagaca 1020
cauugaacgg ggagaggguc ucuuuucccg ugugcacgua ugugccagcu acauugugug 1080
accaaaugac uggcauacug gcaacagaug ucagugcgga cgacgcgcaa aaacugcugg 1140
uugggcucaa ccagcguaua gucgucaacg gucgcaccca gagaaacacc aauaccauga 1200
aaaauuaccu uuugcccgua guggcccagg cauuugcuag gugggcaaag gaauauaagg 1260
aagaucaaga agaugaaagg ccacuaggac uacgagauag acaguuaguc augggguguu 1320
guugggcuuu uagaaggcac aagauaacau cuauuuauaa gcgcccggau acccaaacca 1380
ucaucaaagu gaacagcgau uuccacucau ucgugcugcc caggauaggc aguaacacau 1440
uggagaucgg gcugagaaca agaaucagga aaauguuaga ggagcacaag gagccgucac 1500
cucucauuac cgccgaggac guacaagaag cuaagugcgc agccgaugag gcuaaggagg 1560
ugcgugaagc cgaggaguug cgcgcagcuc uaccaccuuu ggcagcugau guugaggagc 1620
ccacucugga ggcagacguc gacuugaugu uacaagaggc uggggccggc ucaguggaga 1680
caccucgugg cuugauaaag guuaccagcu acgauggcga ggacaagauc ggcucuuacg 1740
cugugcuuuc uccgcaggcu guacucaaga gugaaaaauu aucuugcauc cacccucucg 1800
cugaacaagu cauagugaua acacacucug gccgaaaagg gcguuaugcc guggaaccau 1860
accaugguaa aguaguggug ccagagggac augcaauacc cguccaggac uuucaagcuc 1920
ugagugaaag ugccaccauu guguacaacg aacgugaguu cguaaacagg uaccugcacc 1980
auauugccac acauggagga gcgcugaaca cugaugaaga auauuacaaa acugucaagc 2040
ccagcgagca cgacggcgaa uaccuguacg acaucgacag gaaacagugc gucaagaaag 2100
aacuagucac ugggcuaggg cucacaggcg agcuggugga uccucccuuc caugaauucg 2160
ccuacgagag ucugagaaca cgaccagccg cuccuuacca aguaccaacc auaggggugu 2220
auggcgugcc aggaucaggc aagucuggca ucauuaaaag cgcagucacc aaaaaagauc 2280
uaguggugag cgccaagaaa gaaaacugug cagaaauuau aagggacguc aagaaaauga 2340
aagggcugga cgucaaugcc agaacugugg acucagugcu cuugaaugga ugcaaacacc 2400
ccguagagac ccuguauauu gacgaagcuu uugcuuguca ugcagguacu cucagagcgc 2460
ucauagccau uauaagaccu aaaaaggcag ugcucugcgg ggaucccaaa cagugcgguu 2520
uuuuuaacau gaugugccug aaagugcauu uuaaccacga gauuugcaca caagucuucc 2580
acaaaagcau cucucgccgu ugcacuaaau cugugacuuc ggucgucuca accuuguuuu 2640
acgacaaaaa aaugagaacg acgaauccga aagagacuaa gauugugauu gacacuaccg 2700
gcaguaccaa accuaagcag gacgaucuca uucucacuug uuucagaggg ugggugaagc 2760
aguugcaaau agauuacaaa ggcaacgaaa uaaugacggc agcugccucu caagggcuga 2820
cccguaaagg uguguaugcc guucgguaca aggugaauga aaauccucug uacgcaccca 2880
ccucagaaca ugugaacguc cuacugaccc gcacggagga ccgcaucgug uggaaaacac 2940
uagccggcga cccauggaua aaaacacuga cugccaagua cccugggaau uucacugcca 3000
cgauagagga guggcaagca gagcaugaug ccaucaugag gcacaucuug gagagaccgg 3060
acccuaccga cgucuuccag aauaaggcaa acguguguug ggccaaggcu uuagugccgg 3120
ugcugaagac cgcuggcaua gacaugacca cugaacaaug gaacacugug gauuauuuug 3180
aaacggacaa agcucacuca gcagagauag uauugaacca acuaugcgug agguucuuug 3240
gacucgaucu ggacuccggu cuauuuucug cacccacugu uccguuaucc auuaggaaua 3300
aucacuggga uaacuccccg ucgccuaaca uguacgggcu gaauaaagaa gugguccguc 3360
agcucucucg cagguaccca caacugccuc gggcaguugc cacuggaaga gucuaugaca 3420
ugaacacugg uacacugcgc aauuaugauc cgcgcauaaa ccuaguaccu guaaacagaa 3480
gacugccuca ugcuuuaguc cuccaccaua augaacaccc acagagugac uuuucuucau 3540
ucgucagcaa auugaagggc agaacugucc ugguggucgg ggaaaaguug uccgucccag 3600
gcaaaauggu ugacugguug ucagaccggc cugaggcuac cuucagagcu cggcuggauu 3660
uaggcauccc aggugaugug cccaaauaug acauaauauu uguuaaugug aggaccccau 3720
auaaauacca ucacuaucag cagugugaag accaugccau uaagcuuagc auguugacca 3780
agaaagcuug ucugcaucug aaucccggcg gaaccugugu cagcauaggu uaugguuacg 3840
cugacagggc cagcgaaagc aucauuggug cuauagcgcg gcaguucaag uuuucccggg 3900
uaugcaaacc gaaauccuca cuugaagaga cggaaguucu guuuguauuc auuggguacg 3960
aucgcaaggc ccguacgcac aauucuuaca agcuuucauc aaccuugacc aacauuuaua 4020
cagguuccag acuccacgaa gccggaugug cacccucaua ucauguggug cgaggggaua 4080
uugccacggc caccgaagga gugauuauaa augcugcuaa cagcaaagga caaccuggcg 4140
gaggggugug cggagcgcug uauaagaaau ucccggaaag cuucgauuua cagccgaucg 4200
aaguaggaaa agcgcgacug gucaaaggug cagcuaaaca uaucauucau gccguaggac 4260
caaacuucaa caaaguuucg gagguugaag gugacaaaca guuggcagag gcuuaugagu 4320
ccaucgcuaa gauugucaac gauaacaauu acaagucagu agcgauucca cuguugucca 4380
ccggcaucuu uuccgggaac aaagaucgac uaacccaauc auugaaccau uugcugacag 4440
cuuuagacac cacugaugca gauguagcca uauacugcag ggacaagaaa ugggaaauga 4500
cucucaagga agcaguggcu aggagagaag caguggagga gauaugcaua uccgacgacu 4560
cuucagugac agaaccugau gcagagcugg ugagggugca uccgaagagu ucuuuggcug 4620
gaaggaaggg cuacagcaca agcgauggca aaacuuucuc auauuuggaa gggaccaagu 4680
uucaccaggc ggccaaggau auagcagaaa uuaaugccau guggcccguu gcaacggagg 4740
ccaaugagca gguaugcaug uauauccucg gagaaagcau gagcaguauu aggucgaaau 4800
gccccgucga agagucggaa gccuccacac caccuagcac gcugccuugc uugugcaucc 4860
augccaugac uccagaaaga guacagcgcc uaaaagccuc acguccagaa caaauuacug 4920
ugugcucauc cuuuccauug ccgaaguaua gaaucacugg ugugcagaag auccaaugcu 4980
cccagccuau auuguucuca ccgaaagugc cugcguauau ucauccaagg aaguaucucg 5040
uggaaacacc accgguagac gagacuccgg agccaucggc agagaaccaa uccacagagg 5100
ggacaccuga acaaccacca cuuauaaccg aggaugagac caggacuaga acgccugagc 5160
cgaucaucau cgaagaggaa gaagaggaua gcauaaguuu gcugucagau ggcccgaccc 5220
accaggugcu gcaagucgag gcagacauuc acgggccgcc cucuguaucu agcucauccu 5280
gguccauucc ucaugcaucc gacuuugaug uggacaguuu auccauacuu gacacccugg 5340
agggagcuag cgugaccagc ggggcaacgu cagccgagac uaacucuuac uucgcaaaga 5400
guauggaguu ucuggcgcga ccggugccug cgccucgaac aguauucagg aacccuccac 5460
aucccgcucc gcgcacaaga acaccgucac uugcacccag cagggccugc ucgagaacca 5520
gccuaguuuc caccccgcca ggcgugaaua gggugaucac uagagaggag cucgaggcgc 5580
uuaccccguc acgcacuccu agcaggucgg ucucgagaac cagccugguc uccaacccgc 5640
caggcguaaa uagggugauu acaagagagg aguuugaggc guucguagca caacaacaau 5700
gacgguuuga ugcgggugca uacaucuuuu ccuccgacac cggucaaggg cauuuacaac 5760
aaaaaucagu aaggcaaacg gugcuauccg aagugguguu ggagaggacc gaauuggaga 5820
uuucguaugc cccgcgccuc gaccaagaaa aagaagaauu acuacgcaag aaauuacagu 5880
uaaaucccac accugcuaac agaagcagau accaguccag gaagguggag aacaugaaag 5940
ccauaacagc uagacguauu cugcaaggcc uagggcauua uuugaaggca gaaggaaaag 6000
uggagugcua ccgaacccug cauccuguuc cuuuguauuc aucuagugug aaccgugccu 6060
uuucaagccc caaggucgca guggaagccu guaacgccau guugaaagag aacuuuccga 6120
cuguggcuuc uuacuguauu auuccagagu acgaugccua uuuggacaug guugacggag 6180
cuucaugcug cuuagacacu gccaguuuuu gcccugcaaa gcugcgcagc uuuccaaaga 6240
aacacuccua uuuggaaccc acaauacgau cggcagugcc uucagcgauc cagaacacgc 6300
uccagaacgu ccuggcagcu gccacaaaaa gaaauugcaa ugucacgcaa augagagaau 6360
ugcccguauu ggauucggcg gccuuuaaug uggaaugcuu caagaaauau gcguguaaua 6420
augaauauug ggaaacguuu aaagaaaacc ccaucaggcu uacugaagaa aacgugguaa 6480
auuacauuac caaauuaaaa ggaccaaaag cugcugcucu uuuugcgaag acacauaauu 6540
ugaauauguu gcaggacaua ccaauggaca gguuuguaau ggacuuaaag agagacguga 6600
aagugacucc aggaacaaaa cauacugaag aacggcccaa gguacaggug auccaggcug 6660
ccgauccgcu agcaacagcg uaucugugcg gaauccaccg agagcugguu aggagauuaa 6720
augcgguccu gcuuccgaac auucauacac uguuugauau gucggcugaa gacuuugacg 6780
cuauuauagc cgagcacuuc cagccugggg auuguguucu ggaaacugac aucgcgucgu 6840
uugauaaaag ugaggacgac gccauggcuc ugaccgcguu aaugauucug gaagacuuag 6900
guguggacgc agagcuguug acgcugauug aggcggcuuu cggcgaaauu ucaucaauac 6960
auuugcccac uaaaacuaaa uuuaaauucg gagccaugau gaaaucugga auguuccuca 7020
cacuguuugu gaacacaguc auuaacauug uaaucgcaag cagaguguug agagaacggc 7080
uaaccggauc accaugugca gcauucauug gagaugacaa uaucgugaaa ggagucaaau 7140
cggacaaauu aauggcagac aggugcgcca ccugguugaa uauggaaguc aagauuauag 7200
augcuguggu gggcgagaaa gcgccuuauu ucuguggagg guuuauuuug ugugacuccg 7260
ugaccggcac agcgugccgu guggcagacc cccuaaaaag gcuguuuaag cuuggcaaac 7320
cucuggcagc agacgaugaa caugaugaug acaggagaag ggcauugcau gaagagucaa 7380
cacgcuggaa ccgagugggu auucuuucag agcugugcaa ggcaguagaa ucaagguaug 7440
aaaccguagg aacuuccauc auaguuaugg ccaugacuac ucuagcuagc aguguuaaau 7500
cauucagcua ccugagaggg gccccuauaa cucucuacgg cuaaccugaa uggacuacga 7560
cauagucuag uccgccaagu cuagcau 7587
<210> 230
<211> 7587
<212> DNA
<213> Artificial Sequence
<220>
<223> Nucleic acid sequence encoding RNA construct
<400> 230
taatacgact cactatagat gggcggcgca tgagagaagc ccagaccaat tacctaccca 60
aaatggagaa agttcacgtt gacatcgagg aagacagccc attcctcaga gctttgcagc 120
ggagcttccc gcagtttgag gtagaagcca agcaggtcac tgataatgac catgctaatg 180
ccagagcgtt ttcgcatctg gcttcaaaac tgatcgaaac ggaggtggac ccatccgaca 240
cgatccttga cattggaagt gcgcccgccc gcagaatgta ttctaagcac aagtatcatt 300
gtatctgtcc gatgagatgt gcggaagatc cggacagatt gtataagtat gcaactaagc 360
tgaagaaaaa ctgtaaggaa ataactgata aggaattgga caagaaaatg aaggagctgg 420
ccgccgtcat gagcgaccct gacctggaaa ctgagactat gtgcctccac gacgacgagt 480
cgtgtcgcta cgaagggcaa gtcgctgttt accaggatgt atacgcggtt gacggaccga 540
caagtctcta tcaccaagcc aataagggag ttagagtcgc ctactggata ggctttgaca 600
ccaccccttt tatgtttaag aacttggctg gagcatatcc atcatactct accaactggg 660
ccgacgaaac cgtgttaacg gctcgtaaca taggcctatg cagctctgac gttatggagc 720
ggtcacgtag agggatgtcc attcttagaa agaagtattt gaaaccatcc aacaatgttc 780
tattctctgt tggctcgacc atctaccacg agaagaggga cttactgagg agctggcacc 840
tgccgtctgt atttcactta cgtggcaagc aaaattacac atgtcggtgt gagactatag 900
ttagttgcga cgggtacgtc gttaaaagaa tagctatcag tccaggcctg tatgggaagc 960
cttcaggcta tgctgctacg atgcaccgcg agggattctt gtgctgcaaa gtgacagaca 1020
cattgaacgg ggagagggtc tcttttcccg tgtgcacgta tgtgccagct acattgtgtg 1080
accaaatgac tggcatactg gcaacagatg tcagtgcgga cgacgcgcaa aaactgctgg 1140
ttgggctcaa ccagcgtata gtcgtcaacg gtcgcaccca gagaaacacc aataccatga 1200
aaaattacct tttgcccgta gtggcccagg catttgctag gtgggcaaag gaatataagg 1260
aagatcaaga agatgaaagg ccactaggac tacgagatag acagttagtc atggggtgtt 1320
gttgggcttt tagaaggcac aagataacat ctatttataa gcgcccggat acccaaacca 1380
tcatcaaagt gaacagcgat ttccactcat tcgtgctgcc caggataggc agtaacacat 1440
tggagatcgg gctgagaaca agaatcagga aaatgttaga ggagcacaag gagccgtcac 1500
ctctcattac cgccgaggac gtacaagaag ctaagtgcgc agccgatgag gctaaggagg 1560
tgcgtgaagc cgaggagttg cgcgcagctc taccaccttt ggcagctgat gttgaggagc 1620
ccactctgga ggcagacgtc gacttgatgt tacaagaggc tggggccggc tcagtggaga 1680
cacctcgtgg cttgataaag gttaccagct acgatggcga ggacaagatc ggctcttacg 1740
ctgtgctttc tccgcaggct gtactcaaga gtgaaaaatt atcttgcatc caccctctcg 1800
ctgaacaagt catagtgata acacactctg gccgaaaagg gcgttatgcc gtggaaccat 1860
accatggtaa agtagtggtg ccagagggac atgcaatacc cgtccaggac tttcaagctc 1920
tgagtgaaag tgccaccatt gtgtacaacg aacgtgagtt cgtaaacagg tacctgcacc 1980
atattgccac acatggagga gcgctgaaca ctgatgaaga atattacaaa actgtcaagc 2040
ccagcgagca cgacggcgaa tacctgtacg acatcgacag gaaacagtgc gtcaagaaag 2100
aactagtcac tgggctaggg ctcacaggcg agctggtgga tcctcccttc catgaattcg 2160
cctacgagag tctgagaaca cgaccagccg ctccttacca agtaccaacc ataggggtgt 2220
atggcgtgcc aggatcaggc aagtctggca tcattaaaag cgcagtcacc aaaaaagatc 2280
tagtggtgag cgccaagaaa gaaaactgtg cagaaattat aagggacgtc aagaaaatga 2340
aagggctgga cgtcaatgcc agaactgtgg actcagtgct cttgaatgga tgcaaacacc 2400
ccgtagagac cctgtatatt gacgaagctt ttgcttgtca tgcaggtact ctcagagcgc 2460
tcatagccat tataagacct aaaaaggcag tgctctgcgg ggatcccaaa cagtgcggtt 2520
tttttaacat gatgtgcctg aaagtgcatt ttaaccacga gatttgcaca caagtcttcc 2580
acaaaagcat ctctcgccgt tgcactaaat ctgtgacttc ggtcgtctca accttgtttt 2640
acgacaaaaa aatgagaacg acgaatccga aagagactaa gattgtgatt gacactaccg 2700
gcagtaccaa acctaagcag gacgatctca ttctcacttg tttcagaggg tgggtgaagc 2760
agttgcaaat agattacaaa ggcaacgaaa taatgacggc agctgcctct caagggctga 2820
cccgtaaagg tgtgtatgcc gttcggtaca aggtgaatga aaatcctctg tacgcaccca 2880
cctcagaaca tgtgaacgtc ctactgaccc gcacggagga ccgcatcgtg tggaaaacac 2940
tagccggcga cccatggata aaaacactga ctgccaagta ccctgggaat ttcactgcca 3000
cgatagagga gtggcaagca gagcatgatg ccatcatgag gcacatcttg gagagaccgg 3060
accctaccga cgtcttccag aataaggcaa acgtgtgttg ggccaaggct ttagtgccgg 3120
tgctgaagac cgctggcata gacatgacca ctgaacaatg gaacactgtg gattattttg 3180
aaacggacaa agctcactca gcagagatag tattgaacca actatgcgtg aggttctttg 3240
gactcgatct ggactccggt ctattttctg cacccactgt tccgttatcc attaggaata 3300
atcactggga taactccccg tcgcctaaca tgtacgggct gaataaagaa gtggtccgtc 3360
agctctctcg caggtaccca caactgcctc gggcagttgc cactggaaga gtctatgaca 3420
tgaacactgg tacactgcgc aattatgatc cgcgcataaa cctagtacct gtaaacagaa 3480
gactgcctca tgctttagtc ctccaccata atgaacaccc acagagtgac ttttcttcat 3540
tcgtcagcaa attgaagggc agaactgtcc tggtggtcgg ggaaaagttg tccgtcccag 3600
gcaaaatggt tgactggttg tcagaccggc ctgaggctac cttcagagct cggctggatt 3660
taggcatccc aggtgatgtg cccaaatatg acataatatt tgttaatgtg aggaccccat 3720
ataaatacca tcactatcag cagtgtgaag accatgccat taagcttagc atgttgacca 3780
agaaagcttg tctgcatctg aatcccggcg gaacctgtgt cagcataggt tatggttacg 3840
ctgacagggc cagcgaaagc atcattggtg ctatagcgcg gcagttcaag ttttcccggg 3900
tatgcaaacc gaaatcctca cttgaagaga cggaagttct gtttgtattc attgggtacg 3960
atcgcaaggc ccgtacgcac aattcttaca agctttcatc aaccttgacc aacatttata 4020
caggttccag actccacgaa gccggatgtg caccctcata tcatgtggtg cgaggggata 4080
ttgccacggc caccgaagga gtgattataa atgctgctaa cagcaaagga caacctggcg 4140
gaggggtgtg cggagcgctg tataagaaat tcccggaaag cttcgattta cagccgatcg 4200
aagtaggaaa agcgcgactg gtcaaaggtg cagctaaaca tatcattcat gccgtaggac 4260
caaacttcaa caaagtttcg gaggttgaag gtgacaaaca gttggcagag gcttatgagt 4320
ccatcgctaa gattgtcaac gataacaatt acaagtcagt agcgattcca ctgttgtcca 4380
ccggcatctt ttccgggaac aaagatcgac taacccaatc attgaaccat ttgctgacag 4440
ctttagacac cactgatgca gatgtagcca tatactgcag ggacaagaaa tgggaaatga 4500
ctctcaagga agcagtggct aggagagaag cagtggagga gatatgcata tccgacgact 4560
cttcagtgac agaacctgat gcagagctgg tgagggtgca tccgaagagt tctttggctg 4620
gaaggaaggg ctacagcaca agcgatggca aaactttctc atatttggaa gggaccaagt 4680
ttcaccaggc ggccaaggat atagcagaaa ttaatgccat gtggcccgtt gcaacggagg 4740
ccaatgagca ggtatgcatg tatatcctcg gagaaagcat gagcagtatt aggtcgaaat 4800
gccccgtcga agagtcggaa gcctccacac cacctagcac gctgccttgc ttgtgcatcc 4860
atgccatgac tccagaaaga gtacagcgcc taaaagcctc acgtccagaa caaattactg 4920
tgtgctcatc ctttccattg ccgaagtata gaatcactgg tgtgcagaag atccaatgct 4980
cccagcctat attgttctca ccgaaagtgc ctgcgtatat tcatccaagg aagtatctcg 5040
tggaaacacc accggtagac gagactccgg agccatcggc agagaaccaa tccacagagg 5100
ggacacctga acaaccacca cttataaccg aggatgagac caggactaga acgcctgagc 5160
cgatcatcat cgaagaggaa gaagaggata gcataagttt gctgtcagat ggcccgaccc 5220
accaggtgct gcaagtcgag gcagacattc acgggccgcc ctctgtatct agctcatcct 5280
ggtccattcc tcatgcatcc gactttgatg tggacagttt atccatactt gacaccctgg 5340
agggagctag cgtgaccagc ggggcaacgt cagccgagac taactcttac ttcgcaaaga 5400
gtatggagtt tctggcgcga ccggtgcctg cgcctcgaac agtattcagg aaccctccac 5460
atcccgctcc gcgcacaaga acaccgtcac ttgcacccag cagggcctgc tcgagaacca 5520
gcctagtttc caccccgcca ggcgtgaata gggtgatcac tagagaggag ctcgaggcgc 5580
ttaccccgtc acgcactcct agcaggtcgg tctcgagaac cagcctggtc tccaacccgc 5640
caggcgtaaa tagggtgatt acaagagagg agtttgaggc gttcgtagca caacaacaat 5700
gacggtttga tgcgggtgca tacatctttt cctccgacac cggtcaaggg catttacaac 5760
aaaaatcagt aaggcaaacg gtgctatccg aagtggtgtt ggagaggacc gaattggaga 5820
tttcgtatgc cccgcgcctc gaccaagaaa aagaagaatt actacgcaag aaattacagt 5880
taaatcccac acctgctaac agaagcagat accagtccag gaaggtggag aacatgaaag 5940
ccataacagc tagacgtatt ctgcaaggcc tagggcatta tttgaaggca gaaggaaaag 6000
tggagtgcta ccgaaccctg catcctgttc ctttgtattc atctagtgtg aaccgtgcct 6060
tttcaagccc caaggtcgca gtggaagcct gtaacgccat gttgaaagag aactttccga 6120
ctgtggcttc ttactgtatt attccagagt acgatgccta tttggacatg gttgacggag 6180
cttcatgctg cttagacact gccagttttt gccctgcaaa gctgcgcagc tttccaaaga 6240
aacactccta tttggaaccc acaatacgat cggcagtgcc ttcagcgatc cagaacacgc 6300
tccagaacgt cctggcagct gccacaaaaa gaaattgcaa tgtcacgcaa atgagagaat 6360
tgcccgtatt ggattcggcg gcctttaatg tggaatgctt caagaaatat gcgtgtaata 6420
atgaatattg ggaaacgttt aaagaaaacc ccatcaggct tactgaagaa aacgtggtaa 6480
attacattac caaattaaaa ggaccaaaag ctgctgctct ttttgcgaag acacataatt 6540
tgaatatgtt gcaggacata ccaatggaca ggtttgtaat ggacttaaag agagacgtga 6600
aagtgactcc aggaacaaaa catactgaag aacggcccaa ggtacaggtg atccaggctg 6660
ccgatccgct agcaacagcg tatctgtgcg gaatccaccg agagctggtt aggagattaa 6720
atgcggtcct gcttccgaac attcatacac tgtttgatat gtcggctgaa gactttgacg 6780
ctattatagc cgagcacttc cagcctgggg attgtgttct ggaaactgac atcgcgtcgt 6840
ttgataaaag tgaggacgac gccatggctc tgaccgcgtt aatgattctg gaagacttag 6900
gtgtggacgc agagctgttg acgctgattg aggcggcttt cggcgaaatt tcatcaatac 6960
atttgcccac taaaactaaa tttaaattcg gagccatgat gaaatctgga atgttcctca 7020
cactgtttgt gaacacagtc attaacattg taatcgcaag cagagtgttg agagaacggc 7080
taaccggatc accatgtgca gcattcattg gagatgacaa tatcgtgaaa ggagtcaaat 7140
cggacaaatt aatggcagac aggtgcgcca cctggttgaa tatggaagtc aagattatag 7200
atgctgtggt gggcgagaaa gcgccttatt tctgtggagg gtttattttg tgtgactccg 7260
tgaccggcac agcgtgccgt gtggcagacc ccctaaaaag gctgtttaag cttggcaaac 7320
ctctggcagc agacgatgaa catgatgatg acaggagaag ggcattgcat gaagagtcaa 7380
cacgctggaa ccgagtgggt attctttcag agctgtgcaa ggcagtagaa tcaaggtatg 7440
aaaccgtagg aacttccatc atagttatgg ccatgactac tctagctagc agtgttaaat 7500
cattcagcta cctgagaggg gcccctataa ctctctacgg ctaacctgaa tggactacga 7560
catagtctag tccgccaagt ctagcat 7587
<210> 231
<211> 7587
<212> DNA
<213> Artificial Sequence
<220>
<223> Recombinant vector
<400> 231
taatacgact cactatagat gggcggcgca tgagagaagc ccagaccaat tacctaccca 60
aaatggagaa agttcacgtt gacatcgagg aagacagccc attcctcaga gctttgcagc 120
ggagcttccc gcagtttgag gtagaagcca agcaggtcac tgataatgac catgctaatg 180
ccagagcgtt ttcgcatctg gcttcaaaac tgatcgaaac ggaggtggac ccatccgaca 240
cgatccttga cattggaagt gcgcccgccc gcagaatgta ttctaagcac aagtatcatt 300
gtatctgtcc gatgagatgt gcggaagatc cggacagatt gtataagtat gcaactaagc 360
tgaagaaaaa ctgtaaggaa ataactgata aggaattgga caagaaaatg aaggagctgg 420
ccgccgtcat gagcgaccct gacctggaaa ctgagactat gtgcctccac gacgacgagt 480
cgtgtcgcta cgaagggcaa gtcgctgttt accaggatgt atacgcggtt gacggaccga 540
caagtctcta tcaccaagcc aataagggag ttagagtcgc ctactggata ggctttgaca 600
ccaccccttt tatgtttaag aacttggctg gagcatatcc atcatactct accaactggg 660
ccgacgaaac cgtgttaacg gctcgtaaca taggcctatg cagctctgac gttatggagc 720
ggtcacgtag agggatgtcc attcttagaa agaagtattt gaaaccatcc aacaatgttc 780
tattctctgt tggctcgacc atctaccacg agaagaggga cttactgagg agctggcacc 840
tgccgtctgt atttcactta cgtggcaagc aaaattacac atgtcggtgt gagactatag 900
ttagttgcga cgggtacgtc gttaaaagaa tagctatcag tccaggcctg tatgggaagc 960
cttcaggcta tgctgctacg atgcaccgcg agggattctt gtgctgcaaa gtgacagaca 1020
cattgaacgg ggagagggtc tcttttcccg tgtgcacgta tgtgccagct acattgtgtg 1080
accaaatgac tggcatactg gcaacagatg tcagtgcgga cgacgcgcaa aaactgctgg 1140
ttgggctcaa ccagcgtata gtcgtcaacg gtcgcaccca gagaaacacc aataccatga 1200
aaaattacct tttgcccgta gtggcccagg catttgctag gtgggcaaag gaatataagg 1260
aagatcaaga agatgaaagg ccactaggac tacgagatag acagttagtc atggggtgtt 1320
gttgggcttt tagaaggcac aagataacat ctatttataa gcgcccggat acccaaacca 1380
tcatcaaagt gaacagcgat ttccactcat tcgtgctgcc caggataggc agtaacacat 1440
tggagatcgg gctgagaaca agaatcagga aaatgttaga ggagcacaag gagccgtcac 1500
ctctcattac cgccgaggac gtacaagaag ctaagtgcgc agccgatgag gctaaggagg 1560
tgcgtgaagc cgaggagttg cgcgcagctc taccaccttt ggcagctgat gttgaggagc 1620
ccactctgga ggcagacgtc gacttgatgt tacaagaggc tggggccggc tcagtggaga 1680
cacctcgtgg cttgataaag gttaccagct acgatggcga ggacaagatc ggctcttacg 1740
ctgtgctttc tccgcaggct gtactcaaga gtgaaaaatt atcttgcatc caccctctcg 1800
ctgaacaagt catagtgata acacactctg gccgaaaagg gcgttatgcc gtggaaccat 1860
accatggtaa agtagtggtg ccagagggac atgcaatacc cgtccaggac tttcaagctc 1920
tgagtgaaag tgccaccatt gtgtacaacg aacgtgagtt cgtaaacagg tacctgcacc 1980
atattgccac acatggagga gcgctgaaca ctgatgaaga atattacaaa actgtcaagc 2040
ccagcgagca cgacggcgaa tacctgtacg acatcgacag gaaacagtgc gtcaagaaag 2100
aactagtcac tgggctaggg ctcacaggcg agctggtgga tcctcccttc catgaattcg 2160
cctacgagag tctgagaaca cgaccagccg ctccttacca agtaccaacc ataggggtgt 2220
atggcgtgcc aggatcaggc aagtctggca tcattaaaag cgcagtcacc aaaaaagatc 2280
tagtggtgag cgccaagaaa gaaaactgtg cagaaattat aagggacgtc aagaaaatga 2340
aagggctgga cgtcaatgcc agaactgtgg actcagtgct cttgaatgga tgcaaacacc 2400
ccgtagagac cctgtatatt gacgaagctt ttgcttgtca tgcaggtact ctcagagcgc 2460
tcatagccat tataagacct aaaaaggcag tgctctgcgg ggatcccaaa cagtgcggtt 2520
tttttaacat gatgtgcctg aaagtgcatt ttaaccacga gatttgcaca caagtcttcc 2580
acaaaagcat ctctcgccgt tgcactaaat ctgtgacttc ggtcgtctca accttgtttt 2640
acgacaaaaa aatgagaacg acgaatccga aagagactaa gattgtgatt gacactaccg 2700
gcagtaccaa acctaagcag gacgatctca ttctcacttg tttcagaggg tgggtgaagc 2760
agttgcaaat agattacaaa ggcaacgaaa taatgacggc agctgcctct caagggctga 2820
cccgtaaagg tgtgtatgcc gttcggtaca aggtgaatga aaatcctctg tacgcaccca 2880
cctcagaaca tgtgaacgtc ctactgaccc gcacggagga ccgcatcgtg tggaaaacac 2940
tagccggcga cccatggata aaaacactga ctgccaagta ccctgggaat ttcactgcca 3000
cgatagagga gtggcaagca gagcatgatg ccatcatgag gcacatcttg gagagaccgg 3060
accctaccga cgtcttccag aataaggcaa acgtgtgttg ggccaaggct ttagtgccgg 3120
tgctgaagac cgctggcata gacatgacca ctgaacaatg gaacactgtg gattattttg 3180
aaacggacaa agctcactca gcagagatag tattgaacca actatgcgtg aggttctttg 3240
gactcgatct ggactccggt ctattttctg cacccactgt tccgttatcc attaggaata 3300
atcactggga taactccccg tcgcctaaca tgtacgggct gaataaagaa gtggtccgtc 3360
agctctctcg caggtaccca caactgcctc gggcagttgc cactggaaga gtctatgaca 3420
tgaacactgg tacactgcgc aattatgatc cgcgcataaa cctagtacct gtaaacagaa 3480
gactgcctca tgctttagtc ctccaccata atgaacaccc acagagtgac ttttcttcat 3540
tcgtcagcaa attgaagggc agaactgtcc tggtggtcgg ggaaaagttg tccgtcccag 3600
gcaaaatggt tgactggttg tcagaccggc ctgaggctac cttcagagct cggctggatt 3660
taggcatccc aggtgatgtg cccaaatatg acataatatt tgttaatgtg aggaccccat 3720
ataaatacca tcactatcag cagtgtgaag accatgccat taagcttagc atgttgacca 3780
agaaagcttg tctgcatctg aatcccggcg gaacctgtgt cagcataggt tatggttacg 3840
ctgacagggc cagcgaaagc atcattggtg ctatagcgcg gcagttcaag ttttcccggg 3900
tatgcaaacc gaaatcctca cttgaagaga cggaagttct gtttgtattc attgggtacg 3960
atcgcaaggc ccgtacgcac aattcttaca agctttcatc aaccttgacc aacatttata 4020
caggttccag actccacgaa gccggatgtg caccctcata tcatgtggtg cgaggggata 4080
ttgccacggc caccgaagga gtgattataa atgctgctaa cagcaaagga caacctggcg 4140
gaggggtgtg cggagcgctg tataagaaat tcccggaaag cttcgattta cagccgatcg 4200
aagtaggaaa agcgcgactg gtcaaaggtg cagctaaaca tatcattcat gccgtaggac 4260
caaacttcaa caaagtttcg gaggttgaag gtgacaaaca gttggcagag gcttatgagt 4320
ccatcgctaa gattgtcaac gataacaatt acaagtcagt agcgattcca ctgttgtcca 4380
ccggcatctt ttccgggaac aaagatcgac taacccaatc attgaaccat ttgctgacag 4440
ctttagacac cactgatgca gatgtagcca tatactgcag ggacaagaaa tgggaaatga 4500
ctctcaagga agcagtggct aggagagaag cagtggagga gatatgcata tccgacgact 4560
cttcagtgac agaacctgat gcagagctgg tgagggtgca tccgaagagt tctttggctg 4620
gaaggaaggg ctacagcaca agcgatggca aaactttctc atatttggaa gggaccaagt 4680
ttcaccaggc ggccaaggat atagcagaaa ttaatgccat gtggcccgtt gcaacggagg 4740
ccaatgagca ggtatgcatg tatatcctcg gagaaagcat gagcagtatt aggtcgaaat 4800
gccccgtcga agagtcggaa gcctccacac cacctagcac gctgccttgc ttgtgcatcc 4860
atgccatgac tccagaaaga gtacagcgcc taaaagcctc acgtccagaa caaattactg 4920
tgtgctcatc ctttccattg ccgaagtata gaatcactgg tgtgcagaag atccaatgct 4980
cccagcctat attgttctca ccgaaagtgc ctgcgtatat tcatccaagg aagtatctcg 5040
tggaaacacc accggtagac gagactccgg agccatcggc agagaaccaa tccacagagg 5100
ggacacctga acaaccacca cttataaccg aggatgagac caggactaga acgcctgagc 5160
cgatcatcat cgaagaggaa gaagaggata gcataagttt gctgtcagat ggcccgaccc 5220
accaggtgct gcaagtcgag gcagacattc acgggccgcc ctctgtatct agctcatcct 5280
ggtccattcc tcatgcatcc gactttgatg tggacagttt atccatactt gacaccctgg 5340
agggagctag cgtgaccagc ggggcaacgt cagccgagac taactcttac ttcgcaaaga 5400
gtatggagtt tctggcgcga ccggtgcctg cgcctcgaac agtattcagg aaccctccac 5460
atcccgctcc gcgcacaaga acaccgtcac ttgcacccag cagggcctgc tcgagaacca 5520
gcctagtttc caccccgcca ggcgtgaata gggtgatcac tagagaggag ctcgaggcgc 5580
ttaccccgtc acgcactcct agcaggtcgg tctcgagaac cagcctggtc tccaacccgc 5640
caggcgtaaa tagggtgatt acaagagagg agtttgaggc gttcgtagca caacaacaat 5700
gacggtttga tgcgggtgca tacatctttt cctccgacac cggtcaaggg catttacaac 5760
aaaaatcagt aaggcaaacg gtgctatccg aagtggtgtt ggagaggacc gaattggaga 5820
tttcgtatgc cccgcgcctc gaccaagaaa aagaagaatt actacgcaag aaattacagt 5880
taaatcccac acctgctaac agaagcagat accagtccag gaaggtggag aacatgaaag 5940
ccataacagc tagacgtatt ctgcaaggcc tagggcatta tttgaaggca gaaggaaaag 6000
tggagtgcta ccgaaccctg catcctgttc ctttgtattc atctagtgtg aaccgtgcct 6060
tttcaagccc caaggtcgca gtggaagcct gtaacgccat gttgaaagag aactttccga 6120
ctgtggcttc ttactgtatt attccagagt acgatgccta tttggacatg gttgacggag 6180
cttcatgctg cttagacact gccagttttt gccctgcaaa gctgcgcagc tttccaaaga 6240
aacactccta tttggaaccc acaatacgat cggcagtgcc ttcagcgatc cagaacacgc 6300
tccagaacgt cctggcagct gccacaaaaa gaaattgcaa tgtcacgcaa atgagagaat 6360
tgcccgtatt ggattcggcg gcctttaatg tggaatgctt caagaaatat gcgtgtaata 6420
atgaatattg ggaaacgttt aaagaaaacc ccatcaggct tactgaagaa aacgtggtaa 6480
attacattac caaattaaaa ggaccaaaag ctgctgctct ttttgcgaag acacataatt 6540
tgaatatgtt gcaggacata ccaatggaca ggtttgtaat ggacttaaag agagacgtga 6600
aagtgactcc aggaacaaaa catactgaag aacggcccaa ggtacaggtg atccaggctg 6660
ccgatccgct agcaacagcg tatctgtgcg gaatccaccg agagctggtt aggagattaa 6720
atgcggtcct gcttccgaac attcatacac tgtttgatat gtcggctgaa gactttgacg 6780
ctattatagc cgagcacttc cagcctgggg attgtgttct ggaaactgac atcgcgtcgt 6840
ttgataaaag tgaggacgac gccatggctc tgaccgcgtt aatgattctg gaagacttag 6900
gtgtggacgc agagctgttg acgctgattg aggcggcttt cggcgaaatt tcatcaatac 6960
atttgcccac taaaactaaa tttaaattcg gagccatgat gaaatctgga atgttcctca 7020
cactgtttgt gaacacagtc attaacattg taatcgcaag cagagtgttg agagaacggc 7080
taaccggatc accatgtgca gcattcattg gagatgacaa tatcgtgaaa ggagtcaaat 7140
cggacaaatt aatggcagac aggtgcgcca cctggttgaa tatggaagtc aagattatag 7200
atgctgtggt gggcgagaaa gcgccttatt tctgtggagg gtttattttg tgtgactccg 7260
tgaccggcac agcgtgccgt gtggcagacc ccctaaaaag gctgtttaag cttggcaaac 7320
ctctggcagc agacgatgaa catgatgatg acaggagaag ggcattgcat gaagagtcaa 7380
cacgctggaa ccgagtgggt attctttcag agctgtgcaa ggcagtagaa tcaaggtatg 7440
aaaccgtagg aacttccatc atagttatgg ccatgactac tctagctagc agtgttaaat 7500
cattcagcta cctgagaggg gcccctataa ctctctacgg ctaacctgaa tggactacga 7560
catagtctag tccgccaagt ctagcat 7587
<210> 232
<211> 113
<212> PRT
<213> Homo sapiens
<400> 232
Met Pro Val Glu Arg Met Arg Met Arg Pro Trp Leu Glu Glu Gln Ile
1 5 10 15
Asn Ser Asn Thr Ile Pro Gly Leu Lys Trp Leu Asn Lys Glu Lys Lys
20 25 30
Ile Phe Gln Ile Pro Trp Met His Ala Ala Arg His Gly Trp Asp Val
35 40 45
Glu Lys Asp Ala Pro Leu Phe Arg Asn Trp Ala Ile His Thr Gly Lys
50 55 60
His Gln Pro Gly Val Asp Lys Pro Asp Pro Lys Thr Trp Lys Ala Asn
65 70 75 80
Phe Arg Cys Ala Met Asn Ser Leu Pro Asp Ile Glu Glu Val Lys Asp
85 90 95
Lys Ser Ile Lys Lys Gly Asn Asn Ala Phe Arg Val Tyr Arg Met Leu
100 105 110
Pro
<210> 233
<211> 339
<212> DNA
<213> Homo sapiens
<400> 233
atgccggtgg aaaggatgcg catgcgcccg tggctggagg agcagataaa ctccaacacg 60
atcccggggc tcaagtggct taacaaggaa aagaagattt ttcagatccc ctggatgcat 120
gcggctagac atgggtggga tgtggaaaaa gatgcaccac tctttagaaa ctgggcaatc 180
catacaggaa agcatcaacc aggagtagat aaacctgatc ccaaaacatg gaaggcgaat 240
ttcagatgcg ccatgaattc cttgcctgat attgaagaag tcaaggataa aagcataaag 300
aaaggaaata atgccttcag ggtctaccga atgctgccc 339
<210> 234
<211> 339
<212> RNA
<213> Homo sapiens
<400> 234
augccggugg aaaggaugcg caugcgcccg uggcuggagg agcagauaaa cuccaacacg 60
aucccggggc ucaaguggcu uaacaaggaa aagaagauuu uucagauccc cuggaugcau 120
gcggcuagac auggguggga uguggaaaaa gaugcaccac ucuuuagaaa cugggcaauc 180
cauacaggaa agcaucaacc aggaguagau aaaccugauc ccaaaacaug gaaggcgaau 240
uucagaugcg ccaugaauuc cuugccugau auugaagaag ucaaggauaa aagcauaaag 300
aaaggaaaua augccuucag ggucuaccga augcugccc 339
<210> 235
<211> 342
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF2
<400> 235
atgcccgtgg aacggatgag aatgaggccc tggctggaag aacagatcaa cagcaacaca 60
atccccggcc tgaagtggct gaacaaagag aagaagatct ttcagatccc ctggatgcac 120
gccgccagac acggatggga tgtcgagaaa gatgcccctc tgttcagaaa ctgggccatc 180
cacaccggca aacaccagcc tggcgtggac aagcctgatc ctaagacctg gaaggccaac 240
ttcagatgcg ccatgaacag cctgcctgac atcgaggaag tgaaggacaa gagcatcaag 300
aagggcaaca acgccttccg ggtgtacaga atgctgccct ga 342
<210> 236
<211> 342
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF2
<400> 236
augcccgugg aacggaugag aaugaggccc uggcuggaag aacagaucaa cagcaacaca 60
auccccggcc ugaaguggcu gaacaaagag aagaagaucu uucagauccc cuggaugcac 120
gccgccagac acggauggga ugucgagaaa gaugccccuc uguucagaaa cugggccauc 180
cacaccggca aacaccagcc uggcguggac aagccugauc cuaagaccug gaaggccaac 240
uucagaugcg ccaugaacag ccugccugac aucgaggaag ugaaggacaa gagcaucaag 300
aagggcaaca acgccuuccg gguguacaga augcugcccu ga 342
<210> 237
<211> 115
<212> PRT
<213> Homo sapiens
<400> 237
Met Ala Leu His Pro Arg Arg Val Arg Leu Lys Pro Trp Leu Val Ala
1 5 10 15
Gln Val Asp Ser Gly Leu Tyr Pro Gly Leu Ile Trp Leu His Arg Asp
20 25 30
Ser Lys Arg Phe Gln Ile Pro Trp Lys His Ala Thr Arg His Ser Pro
35 40 45
Gln Gln Glu Glu Glu Asn Thr Ile Phe Lys Ala Trp Ala Val Glu Thr
50 55 60
Gly Lys Tyr Gln Glu Gly Val Asp Asp Pro Asp Pro Ala Lys Trp Lys
65 70 75 80
Ala Gln Leu Arg Cys Ala Leu Asn Lys Ser Arg Glu Phe Asn Leu Met
85 90 95
Tyr Asp Gly Thr Lys Glu Val Pro Met Asn Pro Val Lys Ile Tyr Gln
100 105 110
Val Cys Asp
115
<210> 238
<211> 345
<212> DNA
<213> Homo sapiens
<400> 238
atggccctcc acccccgcag agtccggcta aagccctggc tggtggccca ggtggatagt 60
ggcctctacc ctgggctcat ctggctacac agggactcta aacgcttcca gattccctgg 120
aaacatgcca cccggcatag ccctcaacaa gaagaggaaa ataccatttt taaggcctgg 180
gctgtagaga cagggaagta ccaggaaggg gtggatgacc ctgacccagc taaatggaag 240
gcccagctgc gctgtgctct caataagagc agagaattca acctgatgta tgatggcacc 300
aaggaggtgc ccatgaaccc agtgaagata tatcaagtgt gtgac 345
<210> 239
<211> 345
<212> RNA
<213> Homo sapiens
<400> 239
auggcccucc acccccgcag aguccggcua aagcccuggc ugguggccca gguggauagu 60
ggccucuacc cugggcucau cuggcuacac agggacucua aacgcuucca gauucccugg 120
aaacaugcca cccggcauag cccucaacaa gaagaggaaa auaccauuuu uaaggccugg 180
gcuguagaga cagggaagua ccaggaaggg guggaugacc cugacccagc uaaauggaag 240
gcccagcugc gcugugcucu caauaagagc agagaauuca accugaugua ugauggcacc 300
aaggaggugc ccaugaaccc agugaagaua uaucaagugu gugac 345
<210> 240
<211> 348
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF6
<400> 240
atggccctgc atcctagaag agtgcggctg aagccttggc tggtggctca agtggatagc 60
ggcctgtatc ctggcctgat ctggctgcac agagacagca agcggtttca gatcccctgg 120
aagcacgcca ccagacacag ccctcagcaa gaggaagaga acaccatctt caaggcctgg 180
gccgtcgaga caggcaagta ccaagaaggc gtggacgacc ccgatcctgc caaatggaaa 240
gcccagctga gatgcgccct gaacaagagc cgcgagttca acctgatgta cgacggcacc 300
aaagaggtgc ccatgaatcc cgtgaagatc taccaagtgt gcgactga 348
<210> 241
<211> 348
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF6
<400> 241
auggcccugc auccuagaag agugcggcug aagccuuggc ugguggcuca aguggauagc 60
ggccuguauc cuggccugau cuggcugcac agagacagca agcgguuuca gauccccugg 120
aagcacgcca ccagacacag cccucagcaa gaggaagaga acaccaucuu caaggccugg 180
gccgucgaga caggcaagua ccaagaaggc guggacgacc ccgauccugc caaauggaaa 240
gcccagcuga gaugcgcccu gaacaagagc cgcgaguuca accugaugua cgacggcacc 300
aaagaggugc ccaugaaucc cgugaagauc uaccaagugu gcgacuga 348
<210> 242
<211> 229
<212> PRT
<213> Homo sapiens
<400> 242
Glu Ala Val Arg Leu Ile Met Asp Ser Leu His Met Ala Ala Arg Glu
1 5 10 15
Gln Gln Val Tyr Cys Glu Glu Met Arg Glu Glu Arg Gln Asp Arg Leu
20 25 30
Lys Phe Ile Asp Lys Gln Leu Glu Leu Leu Ala Gln Asp Tyr Lys Leu
35 40 45
Arg Ile Lys Gln Ile Thr Glu Glu Val Glu Arg Gln Val Ser Thr Ala
50 55 60
Met Ala Glu Glu Ile Arg Arg Leu Ser Val Leu Val Asp Asp Tyr Gln
65 70 75 80
Met Asp Phe His Pro Ser Pro Val Val Leu Lys Val Tyr Lys Asn Glu
85 90 95
Leu His Arg His Ile Glu Glu Gly Leu Gly Arg Asn Met Ser Asp Arg
100 105 110
Cys Ser Thr Ala Ile Thr Asn Ser Leu Gln Thr Met Gln Gln Asp Met
115 120 125
Ile Asp Gly Leu Lys Pro Leu Leu Pro Val Ser Val Arg Ser Gln Ile
130 135 140
Asp Met Leu Val Pro Arg Gln Cys Phe Ser Leu Asn Tyr Asp Leu Asn
145 150 155 160
Cys Asp Lys Leu Cys Ala Asp Phe Gln Glu Asp Ile Glu Phe His Phe
165 170 175
Ser Leu Gly Trp Thr Met Leu Val Asn Arg Phe Leu Gly Pro Lys Asn
180 185 190
Ser Arg Arg Ala Leu Met Gly Tyr Asn Asp Gln Val Gln Arg Pro Ile
195 200 205
Pro Leu Thr Pro Ala Asn Pro Ser Met Pro Pro Leu Pro Gln Gly Ser
210 215 220
Leu Thr Gln Glu Glu
225
<210> 243
<211> 687
<212> DNA
<213> Homo sapiens
<400> 243
gaggcggttc gactcatcat ggactccctg cacatggcgg ctcgggagca gcaggtttac 60
tgcgaggaaa tgcgtgaaga gcggcaagac cgactgaaat ttattgacaa acagctggag 120
ctcttggctc aagactataa gctgcgaatt aagcagatta cggaggaagt ggagaggcag 180
gtgtcgactg caatggccga ggagatcagg cgcctctctg tactggtgga cgattaccag 240
atggacttcc acccttctcc agtagtcctc aaggtttata agaatgagct gcaccgccac 300
atagaggaag gactgggtcg aaacatgtct gaccgctgct ccacggccat caccaactcc 360
ctgcagacca tgcagcagga catgatagat ggcttgaaac ccctccttcc tgtgtctgtg 420
cggagtcaga tagacatgct ggtcccacgc cagtgcttct ccctcaacta tgacctaaac 480
tgtgacaagc tgtgtgctga cttccaggaa gacattgagt tccatttctc tctcggatgg 540
accatgctgg tgaataggtt cctgggcccc aagaacagcc gtcgggcctt gatgggctac 600
aatgaccagg tccagcgtcc catccctctg acgccagcca accccagcat gcccccactg 660
ccacagggct cgctcaccca ggaggag 687
<210> 244
<211> 687
<212> RNA
<213> Homo sapiens
<400> 244
gaggcgguuc gacucaucau ggacucccug cacauggcgg cucgggagca gcagguuuac 60
ugcgaggaaa ugcgugaaga gcggcaagac cgacugaaau uuauugacaa acagcuggag 120
cucuuggcuc aagacuauaa gcugcgaauu aagcagauua cggaggaagu ggagaggcag 180
gugucgacug caauggccga ggagaucagg cgccucucug uacuggugga cgauuaccag 240
auggacuucc acccuucucc aguaguccuc aagguuuaua agaaugagcu gcaccgccac 300
auagaggaag gacugggucg aaacaugucu gaccgcugcu ccacggccau caccaacucc 360
cugcagacca ugcagcagga caugauagau ggcuugaaac cccuccuucc ugugucugug 420
cggagucaga uagacaugcu ggucccacgc cagugcuucu cccucaacua ugaccuaaac 480
ugugacaagc ugugugcuga cuuccaggaa gacauugagu uccauuucuc ucucggaugg 540
accaugcugg ugaauagguu ccugggcccc aagaacagcc gucgggccuu gaugggcuac 600
aaugaccagg uccagcgucc caucccucug acgccagcca accccagcau gcccccacug 660
ccacagggcu cgcucaccca ggaggag 687
<210> 245
<211> 693
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding MFN2
<400> 245
atggaggccg tcagactgat catggacagc ctgcatatgg ccgccagaga gcagcaggtc 60
tactgcgagg aaatgcggga agagagacag gaccggctga agttcatcga caagcagctg 120
gaactgctgg cccaggacta caagctgcgg atcaagcaga tcaccgaaga ggtggaaaga 180
caggtgtcca ccgccatggc cgaggaaatc agacgactga gcgtgctggt ggacgactac 240
cagatggact ttcacccctc tccagtggtg ctgaaggtgt acaagaacga gctgcaccgg 300
cacatcgagg aaggcctggg cagaaacatg agcgacagat gcagcaccgc catcaccaat 360
agcctgcaga ccatgcagca ggacatgatc gacggcctga aacctctgct gcctgtgtcc 420
gtcagatccc agatcgacat gctggtgccc agacagtgct tcagcctgaa ctacgacctg 480
aactgcgaca agctgtgcgc cgacttccaa gaggacatcg agttccactt cagcctcggc 540
tggacaatgc tggtcaacag atttctgggc cccaagaaca gcagacgggc cctgatgggc 600
tacaacgatc aggtgcagag gcccattcct ctgacacccg ccaatcctag catgcctcca 660
ctgcctcagg gcagcctgac acaagaagaa tga 693
<210> 246
<211> 693
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding MFN2
<400> 246
auggaggccg ucagacugau cauggacagc cugcauaugg ccgccagaga gcagcagguc 60
uacugcgagg aaaugcggga agagagacag gaccggcuga aguucaucga caagcagcug 120
gaacugcugg cccaggacua caagcugcgg aucaagcaga ucaccgaaga gguggaaaga 180
caggugucca ccgccauggc cgaggaaauc agacgacuga gcgugcuggu ggacgacuac 240
cagauggacu uucaccccuc uccaguggug cugaaggugu acaagaacga gcugcaccgg 300
cacaucgagg aaggccuggg cagaaacaug agcgacagau gcagcaccgc caucaccaau 360
agccugcaga ccaugcagca ggacaugauc gacggccuga aaccucugcu gccugugucc 420
gucagauccc agaucgacau gcuggugccc agacagugcu ucagccugaa cuacgaccug 480
aacugcgaca agcugugcgc cgacuuccaa gaggacaucg aguuccacuu cagccucggc 540
uggacaaugc uggucaacag auuucugggc cccaagaaca gcagacgggc ccugaugggc 600
uacaacgauc aggugcagag gcccauuccu cugacacccg ccaauccuag caugccucca 660
cugccucagg gcagccugac acaagaagaa uga 693
<210> 247
<211> 463
<212> PRT
<213> Homo sapiens
<400> 247
Gly Cys Glu Leu Val Asp Leu Ala Asp Glu Val Ala Ser Val Tyr Gln
1 5 10 15
Ser Tyr Gln Pro Arg Thr Ser Asp Arg Pro Pro Asp Pro Leu Glu Pro
20 25 30
Pro Ser Leu Pro Ala Glu Arg Pro Gly Pro Pro Thr Pro Ala Ala Ala
35 40 45
His Ser Ile Pro Tyr Asn Ser Cys Arg Glu Lys Glu Pro Ser Tyr Pro
50 55 60
Met Pro Val Gln Glu Thr Gln Ala Pro Glu Ser Pro Gly Glu Asn Ser
65 70 75 80
Glu Gln Ala Leu Gln Thr Leu Ser Pro Arg Ala Ile Pro Arg Asn Pro
85 90 95
Asp Gly Gly Pro Leu Glu Ser Ser Ser Asp Leu Ala Ala Leu Ser Pro
100 105 110
Leu Thr Ser Ser Gly His Gln Glu Gln Asp Thr Glu Leu Gly Ser Thr
115 120 125
His Thr Ala Gly Ala Thr Ser Ser Leu Thr Pro Ser Arg Gly Pro Val
130 135 140
Ser Pro Ser Val Ser Phe Gln Pro Leu Ala Arg Ser Thr Pro Arg Ala
145 150 155 160
Ser Arg Leu Pro Gly Pro Thr Gly Ser Val Val Ser Thr Gly Thr Ser
165 170 175
Phe Ser Ser Ser Ser Pro Gly Leu Ala Ser Ala Gly Ala Ala Glu Gly
180 185 190
Lys Gln Gly Ala Glu Ser Asp Gln Ala Glu Pro Ile Ile Cys Ser Ser
195 200 205
Gly Ala Glu Ala Pro Ala Asn Ser Leu Pro Ser Lys Val Pro Thr Thr
210 215 220
Leu Met Pro Val Asn Thr Val Ala Leu Lys Val Pro Ala Asn Pro Ala
225 230 235 240
Ser Val Ser Thr Val Pro Ser Lys Leu Pro Thr Ser Ser Lys Pro Pro
245 250 255
Gly Ala Val Pro Ser Asn Ala Leu Thr Asn Pro Ala Pro Ser Lys Leu
260 265 270
Pro Ile Asn Ser Thr Arg Ala Gly Met Val Pro Ser Lys Val Pro Thr
275 280 285
Ser Met Val Leu Thr Lys Val Ser Ala Ser Thr Val Pro Thr Asp Gly
290 295 300
Ser Ser Arg Asn Glu Glu Thr Pro Ala Ala Pro Thr Pro Ala Gly Ala
305 310 315 320
Thr Gly Gly Ser Ser Ala Trp Leu Asp Ser Ser Ser Glu Asn Arg Gly
325 330 335
Leu Gly Ser Glu Leu Ser Lys Pro Gly Val Leu Ala Ser Gln Val Asp
340 345 350
Ser Pro Phe Ser Gly Cys Phe Glu Asp Leu Ala Ile Ser Ala Ser Thr
355 360 365
Ser Leu Gly Met Gly Pro Cys His Gly Pro Glu Glu Asn Glu Tyr Lys
370 375 380
Ser Glu Gly Thr Phe Gly Ile His Val Ala Glu Asn Pro Ser Ile Gln
385 390 395 400
Leu Leu Glu Gly Asn Pro Gly Pro Pro Ala Asp Pro Asp Gly Gly Pro
405 410 415
Arg Pro Gln Ala Asp Arg Lys Phe Gln Glu Arg Glu Val Pro Cys His
420 425 430
Arg Pro Ser Pro Gly Ala Leu Trp Leu Gln Val Ala Val Thr Gly Val
435 440 445
Leu Val Val Thr Leu Leu Val Val Leu Tyr Arg Arg Arg Leu His
450 455 460
<210> 248
<211> 1389
<212> DNA
<213> Homo sapiens
<400> 248
ggctgtgagc tagttgatct cgcggacgaa gtggcctctg tctaccagag ctaccagcct 60
cggacctcgg accgtccccc agacccactg gagccaccgt cacttcctgc tgagaggcca 120
gggcccccca cacctgctgc ggcccacagc atcccctaca acagctgcag agagaaggag 180
ccaagttacc ccatgcctgt ccaggagacc caggcgccag agtccccagg agagaattca 240
gagcaagccc tgcagacgct cagccccaga gccatcccaa ggaatccaga tggtggcccc 300
ctggagtcct cctctgacct ggcagccctc agccctctga cctccagcgg gcatcaggag 360
caggacacag aactgggcag tacccacaca gcaggtgcga cctccagcct cacaccatcc 420
cgtgggcctg tgtctccatc tgtctccttc cagcccctgg cccgttccac ccccagggca 480
agccgcttgc ctggacccac agggtcagtt gtatctactg gcacctcctt ctcctcctca 540
tcccctggct tggcctctgc aggggctgca gagggtaaac agggtgcaga gagtgaccag 600
gccgagccta tcatctgctc cagtggggca gaggcacctg ccaactctct gccctccaaa 660
gtgcctacca ccttgatgcc tgtgaacaca gtggccctga aagtgcctgc caacccagca 720
tctgtcagca cagtgccctc caagttgcca actagctcaa agccccctgg tgcagtgcct 780
tctaatgcgc tcaccaatcc agcaccatcc aaattgccca tcaactcaac ccgtgctggc 840
atggtgccat ccaaagtgcc tactagcatg gtgctcacca aggtgtctgc cagcacagtc 900
cccactgacg ggagcagcag aaatgaggag accccagcag ctccaacacc cgccggcgcc 960
actggaggca gctcagcctg gctagacagc agctctgaga ataggggcct tgggtcggag 1020
ctgagtaagc ctggcgtgct ggcatcccag gtagacagcc cgttctcggg ctgcttcgag 1080
gatcttgcca tcagtgccag cacctccttg ggcatggggc cctgccatgg cccagaggag 1140
aatgagtata agtccgaggg cacctttggg atccacgtgg ctgagaaccc cagcatccag 1200
ctcctggagg gcaaccctgg gccacctgcg gacccggatg gcggccccag gccacaagcc 1260
gaccggaagt tccaggagag ggaggtgcca tgccacaggc cctcacctgg ggctctgtgg 1320
ctccaggtgg ctgtgacagg ggtgctggta gtcacactcc tggtggtgct gtaccggcgg 1380
cgtctgcac 1389
<210> 249
<211> 1389
<212> RNA
<213> Homo sapiens
<400> 249
ggcugugagc uaguugaucu cgcggacgaa guggccucug ucuaccagag cuaccagccu 60
cggaccucgg accguccccc agacccacug gagccaccgu cacuuccugc ugagaggcca 120
gggcccccca caccugcugc ggcccacagc auccccuaca acagcugcag agagaaggag 180
ccaaguuacc ccaugccugu ccaggagacc caggcgccag aguccccagg agagaauuca 240
gagcaagccc ugcagacgcu cagccccaga gccaucccaa ggaauccaga ugguggcccc 300
cuggaguccu ccucugaccu ggcagcccuc agcccucuga ccuccagcgg gcaucaggag 360
caggacacag aacugggcag uacccacaca gcaggugcga ccuccagccu cacaccaucc 420
cgugggccug ugucuccauc ugucuccuuc cagccccugg cccguuccac ccccagggca 480
agccgcuugc cuggacccac agggucaguu guaucuacug gcaccuccuu cuccuccuca 540
uccccuggcu uggccucugc aggggcugca gaggguaaac agggugcaga gagugaccag 600
gccgagccua ucaucugcuc caguggggca gaggcaccug ccaacucucu gcccuccaaa 660
gugccuacca ccuugaugcc ugugaacaca guggcccuga aagugccugc caacccagca 720
ucugucagca cagugcccuc caaguugcca acuagcucaa agcccccugg ugcagugccu 780
ucuaaugcgc ucaccaaucc agcaccaucc aaauugccca ucaacucaac ccgugcuggc 840
auggugccau ccaaagugcc uacuagcaug gugcucacca aggugucugc cagcacaguc 900
cccacugacg ggagcagcag aaaugaggag accccagcag cuccaacacc cgccggcgcc 960
acuggaggca gcucagccug gcuagacagc agcucugaga auaggggccu ugggucggag 1020
cugaguaagc cuggcgugcu ggcaucccag guagacagcc cguucucggg cugcuucgag 1080
gaucuugcca ucagugccag caccuccuug ggcauggggc ccugccaugg cccagaggag 1140
aaugaguaua aguccgaggg caccuuuggg auccacgugg cugagaaccc cagcauccag 1200
cuccuggagg gcaacccugg gccaccugcg gacccggaug gcggccccag gccacaagcc 1260
gaccggaagu uccaggagag ggaggugcca ugccacaggc ccucaccugg ggcucugugg 1320
cuccaggugg cugugacagg ggugcuggua gucacacucc ugguggugcu guaccggcgg 1380
cgucugcac 1389
<210> 250
<211> 1395
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding MAVS
<400> 250
atgggctgtg aactggtgga tctggccgat gaagtggcca gcgtgtacca gagctaccag 60
cctagaacca gcgaccggcc tcctgatcct ctggaacctc catctctgcc cgccgaaaga 120
cctggacctc ctacaccagc tgccgctcac agcatccctt acaacagctg cagagagaaa 180
gaacctagct accccatgcc tgtgcaagag acacaggccc cagaaagccc tggcgagaat 240
tctgaacagg ccctgcagac actgagcccc agagccattc ctagaaaccc tgatggcggc 300
cctctggaaa gcagcagtga tctggctgct ctgagccctc tgacaagctc tggacaccaa 360
gagcaggata ccgagctggg cagcacacat acagccggcg ctacaagcag cctgacacct 420
tctagaggcc ccgtgtctcc cagcgtgtca tttcagcctc tggccaggtc tacccctaga 480
gcctctagac tgcctggacc tacaggcagc gtggtgtcta ccggcacaag cttcagctct 540
agctctcctg gactggcctc tgctggtgcc gctgagggaa aacaaggcgc cgaatctgat 600
caggccgagc ctatcatctg tagcagcgga gcagaagccc ctgccaatag cctgcctagc 660
aaggtgccaa ccacactgat gcccgtgaac acagtggccc tgaaggtgcc agctaatcct 720
gcctccgtgt ccaccgtgcc ttctaagctg ccaaccagct ctaagccacc tggcgccgtg 780
ccatctaacg ccctgacaaa tcctgctcca agcaagctgc ccatcaacag cacaagagcc 840
ggcatggtgc cctctaaggt gcccacatct atggtgctga ccaaggtgtc cgccagcacc 900
gtgccaacag atggcagcag cagaaacgag gaaacccctg ccgctcctac tcctgctggc 960
gctacaggcg gatcttctgc ctggctggat agcagctccg agaatagagg cctgggcagc 1020
gagctgtcta aacctggcgt tctggcaagc caggtggaca gccctttcag cggctgcttt 1080
gaggacctgg ctatcagcgc ctctacaagc ctcggcatgg gaccttgtca cggccccgag 1140
gaaaacgagt acaagagcga gggcaccttc ggcatccacg tggccgagaa tcctagcatc 1200
caactgctgg aaggcaaccc cggacctcct gctgatccag atggtggacc tagacctcag 1260
gccgaccgga agttccaaga aagagaggtg ccctgccacc ggccatctcc aggtgcactt 1320
tggctgcaag tggctgtgac aggcgtgctg gtggttacac tgctggtcgt gctgtacaga 1380
aggcggctgc attga 1395
<210> 251
<211> 1395
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding MAVS
<400> 251
augggcugug aacuggugga ucuggccgau gaaguggcca gcguguacca gagcuaccag 60
ccuagaacca gcgaccggcc uccugauccu cuggaaccuc caucucugcc cgccgaaaga 120
ccuggaccuc cuacaccagc ugccgcucac agcaucccuu acaacagcug cagagagaaa 180
gaaccuagcu accccaugcc ugugcaagag acacaggccc cagaaagccc uggcgagaau 240
ucugaacagg cccugcagac acugagcccc agagccauuc cuagaaaccc ugauggcggc 300
ccucuggaaa gcagcaguga ucuggcugcu cugagcccuc ugacaagcuc uggacaccaa 360
gagcaggaua ccgagcuggg cagcacacau acagccggcg cuacaagcag ccugacaccu 420
ucuagaggcc ccgugucucc cagcguguca uuucagccuc uggccagguc uaccccuaga 480
gccucuagac ugccuggacc uacaggcagc guggugucua ccggcacaag cuucagcucu 540
agcucuccug gacuggccuc ugcuggugcc gcugagggaa aacaaggcgc cgaaucugau 600
caggccgagc cuaucaucug uagcagcgga gcagaagccc cugccaauag ccugccuagc 660
aaggugccaa ccacacugau gcccgugaac acaguggccc ugaaggugcc agcuaauccu 720
gccuccgugu ccaccgugcc uucuaagcug ccaaccagcu cuaagccacc uggcgccgug 780
ccaucuaacg cccugacaaa uccugcucca agcaagcugc ccaucaacag cacaagagcc 840
ggcauggugc ccucuaaggu gcccacaucu auggugcuga ccaagguguc cgccagcacc 900
gugccaacag auggcagcag cagaaacgag gaaaccccug ccgcuccuac uccugcuggc 960
gcuacaggcg gaucuucugc cuggcuggau agcagcuccg agaauagagg ccugggcagc 1020
gagcugucua aaccuggcgu ucuggcaagc cagguggaca gcccuuucag cggcugcuuu 1080
gaggaccugg cuaucagcgc cucuacaagc cucggcaugg gaccuuguca cggccccgag 1140
gaaaacgagu acaagagcga gggcaccuuc ggcauccacg uggccgagaa uccuagcauc 1200
caacugcugg aaggcaaccc cggaccuccu gcugauccag augguggacc uagaccucag 1260
gccgaccgga aguuccaaga aagagaggug cccugccacc ggccaucucc aggugcacuu 1320
uggcugcaag uggcugugac aggcgugcug gugguuacac ugcuggucgu gcuguacaga 1380
aggcggcugc auuga 1395
<210> 252
<211> 493
<212> PRT
<213> Homo sapiens
<400> 252
Met Glu Arg Ser Pro Asp Val Ser Pro Gly Pro Ser Arg Ser Phe Lys
1 5 10 15
Glu Glu Leu Leu Cys Ala Val Cys Tyr Asp Pro Phe Arg Asp Ala Val
20 25 30
Thr Leu Arg Cys Gly His Asn Phe Cys Arg Gly Cys Val Ser Arg Cys
35 40 45
Trp Glu Val Gln Val Ser Pro Thr Cys Pro Val Cys Lys Asp Arg Ala
50 55 60
Ser Pro Ala Asp Leu Arg Thr Asn His Thr Leu Asn Asn Leu Val Glu
65 70 75 80
Lys Leu Leu Arg Glu Glu Ala Glu Gly Ala Arg Trp Thr Ser Tyr Arg
85 90 95
Phe Ser Arg Val Cys Arg Leu His Arg Gly Gln Leu Ser Leu Phe Cys
100 105 110
Leu Glu Asp Lys Glu Leu Leu Cys Cys Ser Cys Gln Ala Asp Pro Arg
115 120 125
His Gln Gly His Arg Val Gln Pro Val Lys Asp Thr Ala His Asp Phe
130 135 140
Arg Ala Lys Cys Arg Asn Met Glu His Ala Leu Arg Glu Lys Ala Lys
145 150 155 160
Ala Phe Trp Ala Met Arg Arg Ser Tyr Glu Ala Ile Ala Lys His Asn
165 170 175
Gln Val Glu Ala Ala Trp Leu Glu Gly Arg Ile Arg Gln Glu Phe Asp
180 185 190
Lys Leu Arg Glu Phe Leu Arg Val Glu Glu Gln Ala Ile Leu Asp Ala
195 200 205
Met Ala Glu Glu Thr Arg Gln Lys Gln Leu Leu Ala Asp Glu Lys Met
210 215 220
Lys Gln Leu Thr Glu Glu Thr Glu Val Leu Ala His Glu Ile Glu Arg
225 230 235 240
Leu Gln Met Glu Met Lys Glu Asp Asp Val Ser Phe Leu Met Lys His
245 250 255
Lys Ser Arg Lys Arg Arg Leu Phe Cys Thr Met Glu Pro Glu Pro Val
260 265 270
Gln Pro Gly Met Leu Ile Asp Val Cys Lys Tyr Leu Gly Ser Leu Gln
275 280 285
Tyr Arg Val Trp Lys Lys Met Leu Ala Ser Val Glu Ser Val Pro Phe
290 295 300
Ser Phe Asp Pro Asn Thr Ala Ala Gly Trp Leu Ser Val Ser Asp Asp
305 310 315 320
Leu Thr Ser Val Thr Asn His Gly Tyr Arg Val Gln Val Glu Asn Pro
325 330 335
Glu Arg Phe Ser Ser Ala Pro Cys Leu Leu Gly Ser Arg Val Phe Ser
340 345 350
Gln Gly Ser His Ala Trp Glu Val Ala Leu Gly Gly Leu Gln Ser Trp
355 360 365
Arg Val Gly Val Val Arg Val Arg Gln Asp Ser Gly Ala Glu Gly His
370 375 380
Ser His Ser Cys Tyr His Asp Thr Arg Ser Gly Phe Trp Tyr Val Cys
385 390 395 400
Arg Thr Gln Gly Val Glu Gly Asp His Cys Val Thr Ser Asp Pro Ala
405 410 415
Thr Ser Pro Leu Val Leu Ala Ile Pro Arg Arg Leu Arg Val Glu Leu
420 425 430
Glu Cys Glu Glu Gly Glu Leu Ser Phe Tyr Asp Ala Glu Arg His Cys
435 440 445
His Leu Tyr Thr Phe His Ala Arg Phe Gly Glu Val Arg Pro Tyr Phe
450 455 460
Tyr Leu Gly Gly Ala Arg Gly Ala Gly Pro Pro Glu Pro Leu Arg Ile
465 470 475 480
Cys Pro Leu His Ile Ser Val Lys Glu Glu Leu Asp Gly
485 490
<210> 253
<211> 1479
<212> DNA
<213> Homo sapiens
<400> 253
atggagcgga gtcccgacgt gtcccccggg ccttcccgct ccttcaagga ggagttgctc 60
tgcgccgtct gctacgaccc cttccgcgac gcagtcactc tgcgctgcgg ccacaacttc 120
tgccgcgggt gcgtgagccg ctgctgggag gtgcaggtgt cgcccacctg cccagtgtgc 180
aaagaccgcg cgtcacccgc cgacctgcgc accaaccaca ccctcaacaa cctggtggag 240
aagctgctgc gcgaggaggc cgagggcgcg cgctggacca gctaccgctt ctcgcgtgtc 300
tgccgcctgc accgcggaca gctcagcctc ttctgcctcg aggacaagga gctgctgtgc 360
tgctcctgcc aggccgaccc ccgacaccag gggcaccgcg tgcagccggt gaaggacact 420
gcccacgact ttcgggccaa gtgcaggaac atggagcatg cactgcggga gaaggccaag 480
gccttctggg ccatgcggcg ctcctatgag gccatcgcca agcacaatca ggtggaggct 540
gcatggctgg aaggccggat ccggcaggag tttgataagc ttcgcgagtt cttgagagtg 600
gaggagcagg ccattctgga tgccatggcc gaggagacaa ggcagaagca acttctggcc 660
gacgagaaga tgaagcagct cacagaggag acggaggtgc tggcacatga gatcgagcgg 720
ctgcagatgg agatgaagga ggacgacgtt tcttttctca tgaaacacaa gagccgaaaa 780
cgccgactct tctgcaccat ggagccagag ccagtccagc ccggcatgct tatcgatgtc 840
tgcaagtacc tgggctccct gcagtaccgc gtctggaaga agatgcttgc atctgtggaa 900
tctgtaccct tcagctttga ccccaacacc gcagctggct ggctctccgt gtctgacgac 960
ctcaccagcg tcaccaacca tggctaccgc gtgcaggtgg agaacccgga acgcttctcc 1020
tcggcgccct gcctgctggg ctcccgtgtc ttctcacagg gctcgcacgc ctgggaggtg 1080
gcccttgggg ggctgcagag ctggagggtg ggcgtggtac gtgtgcgcca ggactcgggc 1140
gctgagggcc actcacacag ctgctaccac gacacacgct cgggcttctg gtatgtctgc 1200
cgcacgcagg gcgtggaggg ggaccactgc gtgacctcgg acccagccac gtcgcccctg 1260
gtcctggcca tcccacgccg cctgcgtgtg gagctggagt gtgaggaggg cgagctgtct 1320
ttctatgacg cggagcgcca ctgccacctg tacaccttcc acgcccgctt tggggaggtt 1380
cgcccctact tctacctggg gggtgcacgg ggcgccgggc ctccagagcc tttgcgcatc 1440
tgccccttgc acatcagtgt caaggaagaa ctggatggc 1479
<210> 254
<211> 1479
<212> RNA
<213> Homo sapiens
<400> 254
auggagcgga gucccgacgu gucccccggg ccuucccgcu ccuucaagga ggaguugcuc 60
ugcgccgucu gcuacgaccc cuuccgcgac gcagucacuc ugcgcugcgg ccacaacuuc 120
ugccgcgggu gcgugagccg cugcugggag gugcaggugu cgcccaccug cccagugugc 180
aaagaccgcg cgucacccgc cgaccugcgc accaaccaca cccucaacaa ccugguggag 240
aagcugcugc gcgaggaggc cgagggcgcg cgcuggacca gcuaccgcuu cucgcguguc 300
ugccgccugc accgcggaca gcucagccuc uucugccucg aggacaagga gcugcugugc 360
ugcuccugcc aggccgaccc ccgacaccag gggcaccgcg ugcagccggu gaaggacacu 420
gcccacgacu uucgggccaa gugcaggaac auggagcaug cacugcggga gaaggccaag 480
gccuucuggg ccaugcggcg cuccuaugag gccaucgcca agcacaauca gguggaggcu 540
gcauggcugg aaggccggau ccggcaggag uuugauaagc uucgcgaguu cuugagagug 600
gaggagcagg ccauucugga ugccauggcc gaggagacaa ggcagaagca acuucuggcc 660
gacgagaaga ugaagcagcu cacagaggag acggaggugc uggcacauga gaucgagcgg 720
cugcagaugg agaugaagga ggacgacguu ucuuuucuca ugaaacacaa gagccgaaaa 780
cgccgacucu ucugcaccau ggagccagag ccaguccagc ccggcaugcu uaucgauguc 840
ugcaaguacc ugggcucccu gcaguaccgc gucuggaaga agaugcuugc aucuguggaa 900
ucuguacccu ucagcuuuga ccccaacacc gcagcuggcu ggcucuccgu gucugacgac 960
cucaccagcg ucaccaacca uggcuaccgc gugcaggugg agaacccgga acgcuucucc 1020
ucggcgcccu gccugcuggg cucccguguc uucucacagg gcucgcacgc cugggaggug 1080
gcccuugggg ggcugcagag cuggagggug ggcgugguac gugugcgcca ggacucgggc 1140
gcugagggcc acucacacag cugcuaccac gacacacgcu cgggcuucug guaugucugc 1200
cgcacgcagg gcguggaggg ggaccacugc gugaccucgg acccagccac gucgccccug 1260
guccuggcca ucccacgccg ccugcgugug gagcuggagu gugaggaggg cgagcugucu 1320
uucuaugacg cggagcgcca cugccaccug uacaccuucc acgcccgcuu uggggagguu 1380
cgccccuacu ucuaccuggg gggugcacgg ggcgccgggc cuccagagcc uuugcgcauc 1440
ugccccuugc acaucagugu caaggaagaa cuggauggc 1479
<210> 255
<211> 1482
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding TRIM35
<400> 255
atggaaagat cccctgacgt gtcccctgga cctagcagaa gcttcaaaga ggaactgctc 60
tgcgccgtgt gctacgaccc cttcagagat gccgtgacac tgagatgcgg ccacaacttc 120
tgcagaggct gcgtgtccag atgctgggaa gtgcaggttt cccctacatg ccccgtgtgc 180
aaggacagag cctctcctgc cgatctgcgg accaatcaca ccctgaacaa cctggtggaa 240
aagctgctga gagaagaggc cgaaggcgcc agatggacca gctacagatt cagcagagtg 300
tgccggctgc acagaggcca gctgagcctg ttctgtctcg aggacaaaga actgctgtgc 360
tgcagctgcc aggccgatcc tagacaccag ggacatagag tgcagcccgt gaaggacaca 420
gcccacgact tcagagccaa gtgccggaac atggaacacg ccctgagaga gaaggccaaa 480
gccttctggg ccatgcggag aagctatgag gccattgcca agcacaatca ggtggaagcc 540
gcctggctgg aaggccggat cagacaagag ttcgacaagc tgcgcgagtt cctgagagtg 600
gaagaacagg ccatcctgga cgccatggcc gaggaaacaa gacagaaaca gctgctggcc 660
gacgagaaga tgaagcagct gaccgaagag acagaggtgc tggcccacga aatcgagcgg 720
ctgcagatgg aaatgaagga agatgatgtg tcctttctga tgaagcacaa gagccggaag 780
cggcggctgt tctgcacaat ggaacctgag ccagtgcagc ctggcatgct gatcgatgtg 840
tgcaagtacc tgggcagcct gcagtacaga gtgtggaaga aaatgctggc ctccgtggaa 900
agcgtgccct tcagcttcga ccctaatact gccgctggct ggctgagcgt gtccgatgat 960
ctgaccagcg tgaccaacca cggctacaga gtgcaggtcg agaaccccga gagattcagc 1020
tctgcccctt gtctgctggg ctccagagtg ttttctcagg gctctcacgc ctgggaagtt 1080
gcccttggag gactccagtc ttggagagtg ggcgttgtca gagtgcggca ggattctggc 1140
gccgaaggac actctcacag ctgctaccac gatacccgca gcggcttttg gtacgtgtgt 1200
agaacacagg gcgtcgaggg cgaccactgt gtgacatctg accctgccac atctcctctg 1260
gtgctggcta tccctcggag actgagagtc gagctggaat gcgaggaagg cgagctgagc 1320
ttctacgacg ccgagagaca ctgccacctg tacaccttcc acgccagatt tggcgaagtg 1380
cggccctact tttatctcgg cggagctaga ggtgccggac ctcctgaacc tctgagaatc 1440
tgccctctgc acatcagcgt gaaagaggaa ttggacggct ga 1482
<210> 256
<211> 1482
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding TRIM35
<400> 256
auggaaagau ccccugacgu guccccugga ccuagcagaa gcuucaaaga ggaacugcuc 60
ugcgccgugu gcuacgaccc cuucagagau gccgugacac ugagaugcgg ccacaacuuc 120
ugcagaggcu gcguguccag augcugggaa gugcagguuu ccccuacaug ccccgugugc 180
aaggacagag ccucuccugc cgaucugcgg accaaucaca cccugaacaa ccugguggaa 240
aagcugcuga gagaagaggc cgaaggcgcc agauggacca gcuacagauu cagcagagug 300
ugccggcugc acagaggcca gcugagccug uucugucucg aggacaaaga acugcugugc 360
ugcagcugcc aggccgaucc uagacaccag ggacauagag ugcagcccgu gaaggacaca 420
gcccacgacu ucagagccaa gugccggaac auggaacacg cccugagaga gaaggccaaa 480
gccuucuggg ccaugcggag aagcuaugag gccauugcca agcacaauca gguggaagcc 540
gccuggcugg aaggccggau cagacaagag uucgacaagc ugcgcgaguu ccugagagug 600
gaagaacagg ccauccugga cgccauggcc gaggaaacaa gacagaaaca gcugcuggcc 660
gacgagaaga ugaagcagcu gaccgaagag acagaggugc uggcccacga aaucgagcgg 720
cugcagaugg aaaugaagga agaugaugug uccuuucuga ugaagcacaa gagccggaag 780
cggcggcugu ucugcacaau ggaaccugag ccagugcagc cuggcaugcu gaucgaugug 840
ugcaaguacc ugggcagccu gcaguacaga guguggaaga aaaugcuggc cuccguggaa 900
agcgugcccu ucagcuucga cccuaauacu gccgcuggcu ggcugagcgu guccgaugau 960
cugaccagcg ugaccaacca cggcuacaga gugcaggucg agaaccccga gagauucagc 1020
ucugccccuu gucugcuggg cuccagagug uuuucucagg gcucucacgc cugggaaguu 1080
gcccuuggag gacuccaguc uuggagagug ggcguuguca gagugcggca ggauucuggc 1140
gccgaaggac acucucacag cugcuaccac gauacccgca gcggcuuuug guacgugugu 1200
agaacacagg gcgucgaggg cgaccacugu gugacaucug acccugccac aucuccucug 1260
gugcuggcua ucccucggag acugagaguc gagcuggaau gcgaggaagg cgagcugagc 1320
uucuacgacg ccgagagaca cugccaccug uacaccuucc acgccagauu uggcgaagug 1380
cggcccuacu uuuaucucgg cggagcuaga ggugccggac cuccugaacc ucugagaauc 1440
ugcccucugc acaucagcgu gaaagaggaa uuggacggcu ga 1482
<210> 257
<211> 129
<212> PRT
<213> Homo sapiens
<400> 257
Met Asn Leu Glu Gly Gly Gly Arg Gly Gly Glu Phe Gly Met Ser Ala
1 5 10 15
Val Ser Cys Gly Asn Gly Lys Leu Arg Gln Trp Leu Ile Asp Gln Ile
20 25 30
Asp Ser Gly Lys Tyr Pro Gly Leu Val Trp Glu Asn Glu Glu Lys Ser
35 40 45
Ile Phe Arg Ile Pro Trp Lys His Ala Gly Lys Gln Asp Tyr Asn Arg
50 55 60
Glu Glu Asp Ala Ala Leu Phe Lys Ala Trp Ala Leu Phe Lys Gly Lys
65 70 75 80
Phe Arg Glu Gly Ile Asp Lys Pro Asp Pro Pro Thr Trp Lys Thr Arg
85 90 95
Leu Arg Cys Ala Leu Asn Lys Ser Asn Asp Phe Glu Glu Leu Val Glu
100 105 110
Arg Ser Gln Leu Asp Ile Ser Asp Pro Tyr Lys Val Tyr Arg Ile Val
115 120 125
Pro
<210> 258
<211> 387
<212> DNA
<213> Homo sapiens
<400> 258
atgaacctgg agggcggcgg ccgaggcgga gagttcggca tgagcgcggt gagctgcggc 60
aacgggaagc tccgccagtg gctgatcgac cagatcgaca gcggcaagta ccccgggctg 120
gtgtgggaga acgaggagaa gagcatcttc cgcatcccct ggaagcacgc gggcaagcag 180
gactacaacc gcgaggagga cgccgcgctc ttcaaggctt gggcactgtt taaaggaaag 240
ttccgagaag gcatcgacaa gccggaccct cccacctgga agacgcgcct gcggtgcgct 300
ttgaacaaga gcaatgactt tgaggaactg gttgagcgga gccagctgga catctcagac 360
ccgtacaaag tgtacaggat tgttcct 387
<210> 259
<211> 387
<212> RNA
<213> Homo sapiens
<400> 259
augaaccugg agggcggcgg ccgaggcgga gaguucggca ugagcgcggu gagcugcggc 60
aacgggaagc uccgccagug gcugaucgac cagaucgaca gcggcaagua ccccgggcug 120
gugugggaga acgaggagaa gagcaucuuc cgcauccccu ggaagcacgc gggcaagcag 180
gacuacaacc gcgaggagga cgccgcgcuc uucaaggcuu gggcacuguu uaaaggaaag 240
uuccgagaag gcaucgacaa gccggacccu cccaccugga agacgcgccu gcggugcgcu 300
uugaacaaga gcaaugacuu ugaggaacug guugagcgga gccagcugga caucucagac 360
ccguacaaag uguacaggau uguuccu 387
<210> 260
<211> 390
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding IRF4
<400> 260
atgaatctgg aaggcggcgg aagaggcggc gagtttggaa tgtctgccgt gtcctgtggc 60
aacggcaagc tgagacagtg gctgatcgac cagatcgaca gcggcaagta tcctggcctc 120
gtgtgggaga acgaggaaaa gtctatcttc agaatcccct ggaagcacgc cggcaagcag 180
gactacaaca gagaagagga cgccgctctg ttcaaggcct gggctctgtt taagggcaag 240
ttcagagagg gcatcgacaa gcccgatcct ccaacctgga aaaccagact gagatgcgcc 300
ctgaacaaga gcaacgactt cgaggaactg gtggaaagaa gccagctgga catcagcgac 360
ccctacaagg tgtaccggat cgtgccctga 390
<210> 261
<211> 390
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding IRF4
<400> 261
augaaucugg aaggcggcgg aagaggcggc gaguuuggaa ugucugccgu guccuguggc 60
aacggcaagc ugagacagug gcugaucgac cagaucgaca gcggcaagua uccuggccuc 120
gugugggaga acgaggaaaa gucuaucuuc agaauccccu ggaagcacgc cggcaagcag 180
gacuacaaca gagaagagga cgccgcucug uucaaggccu gggcucuguu uaagggcaag 240
uucagagagg gcaucgacaa gcccgauccu ccaaccugga aaaccagacu gagaugcgcc 300
cugaacaaga gcaacgacuu cgaggaacug guggaaagaa gccagcugga caucagcgac 360
cccuacaagg uguaccggau cgugcccuga 390
<210> 262
<211> 522
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon optimised DNA encoding ARL16
<400> 262
atgtgtctgc tgctgggagc tacaggcgtg ggcaagacac tgctggtcaa gcggctgcaa 60
gaggtgtcca gcagagatgg caaaggcgat ctgggagagc ctcctccaac cagacctacc 120
gtgggcacca acctgacaga tatcgtggcc cagcggaaga tcaccatcag agaactcggc 180
ggctgcatgg gccctatctg gtctagctac tacggcaact gccgcagcct gctgttcgtg 240
atggatgcca gcgatcccac acagctgagc gcctcttgtg tgcaactgct gggactgctg 300
tctgccgaac aactggccga agcctctgtg ctgatcctgt tcaacaagat cgacctgcct 360
tgctacatga gcaccgagga aatgaagtcc ctgatcagac tgcccgacat cattgcctgc 420
gccaagcaga atatcaccac agccgagatc agcgccagag aaggcacagg acttgctggc 480
gttctggcat ggctgcaggc cacacacaga gccaacgatt ga 522
<210> 263
<211> 522
<212> RNA
<213> Artificial Sequence
<220>
<223> Codon optimised RNA encoding ARL16
<400> 263
augugucugc ugcugggagc uacaggcgug ggcaagacac ugcuggucaa gcggcugcaa 60
gaggugucca gcagagaugg caaaggcgau cugggagagc cuccuccaac cagaccuacc 120
gugggcacca accugacaga uaucguggcc cagcggaaga ucaccaucag agaacucggc 180
ggcugcaugg gcccuaucug gucuagcuac uacggcaacu gccgcagccu gcuguucgug 240
auggaugcca gcgaucccac acagcugagc gccucuugug ugcaacugcu gggacugcug 300
ucugccgaac aacuggccga agccucugug cugauccugu ucaacaagau cgaccugccu 360
ugcuacauga gcaccgagga aaugaagucc cugaucagac ugcccgacau cauugccugc 420
gccaagcaga auaucaccac agccgagauc agcgccagag aaggcacagg acuugcuggc 480
guucuggcau ggcugcaggc cacacacaga gccaacgauu ga 522
<210> 264
<211> 796
<212> RNA
<213> Artificial Sequence
<220>
<223> RNA construct
<400> 264
cggagacggc gcagaagaag aggaucuggc gaaggcagag gcagccugcu uacauguggc 60
gacguggaag agaaccccgg accuaugggc gauagcagcc ccgauaccuu uuccgauggc 120
cugagcagca gcacccugcc ugaugaucac agcagcuaca ccgugccugg cuacaugcag 180
gaccuggaag uggaacaggc ccugacacca gcucugagcc cuugugcugu guccagcaca 240
cugcccgauu ggcacauccc uguggaagug gugccugaca gcaccagcga ccuguacaac 300
uuccaagugu ccccuaugcc uagcaccucc gaggccacca ccgaugagga ugaagaggga 360
aagcugcccg aggacaucau gaagcugcug gaacagagcg aguggcagcc caccaaugug 420
gauggcaagg gcuaccugcu gaacgagccu ggcguucagc cuacaagcgu guacggcgac 480
uucagcugca aagaggaacc cgagaucgau agcccuggcg gcgauaucgg acugagccug 540
cagagagugu ucaccgaccu gaagaacaug gacgccaccu ggcuggacag ccugcugaca 600
ccuguuagac ugcccucuau ccaggcuauc cccugcgcuc cuugagcggc cgcgaauugg 660
caagcugcuu acauagaacu cgcggcgauu ggcaugccgc cuuaaaauuu uuauuuuauu 720
uuucuuuucu uuuccgaauc ggauuuuguu uuuaauauuu caaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aaaaaa 796
<210> 265
<211> 796
<212> DNA
<213> Artificial Sequence
<220>
<223> Nucleic acid sequence encoding RNA construct
<400> 265
cggagacggc gcagaagaag aggatctggc gaaggcagag gcagcctgct tacatgtggc 60
gacgtggaag agaaccccgg acctatgggc gatagcagcc ccgatacctt ttccgatggc 120
ctgagcagca gcaccctgcc tgatgatcac agcagctaca ccgtgcctgg ctacatgcag 180
gacctggaag tggaacaggc cctgacacca gctctgagcc cttgtgctgt gtccagcaca 240
ctgcccgatt ggcacatccc tgtggaagtg gtgcctgaca gcaccagcga cctgtacaac 300
ttccaagtgt cccctatgcc tagcacctcc gaggccacca ccgatgagga tgaagaggga 360
aagctgcccg aggacatcat gaagctgctg gaacagagcg agtggcagcc caccaatgtg 420
gatggcaagg gctacctgct gaacgagcct ggcgttcagc ctacaagcgt gtacggcgac 480
ttcagctgca aagaggaacc cgagatcgat agccctggcg gcgatatcgg actgagcctg 540
cagagagtgt tcaccgacct gaagaacatg gacgccacct ggctggacag cctgctgaca 600
cctgttagac tgccctctat ccaggctatc ccctgcgctc cttgagcggc cgcgaattgg 660
caagctgctt acatagaact cgcggcgatt ggcatgccgc cttaaaattt ttattttatt 720
tttcttttct tttccgaatc ggattttgtt tttaatattt caaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aaaaaa 796
<210> 266
<211> 2661
<212> DNA
<213> Artificial Sequence
<220>
<223> Recombinant vector
<400> 266
cggagacggc gcagaagaag aggatctggc gaaggcagag gcagcctgct tacatgtggc 60
gacgtggaag agaaccccgg acctatgggc gatagcagcc ccgatacctt ttccgatggc 120
ctgagcagca gcaccctgcc tgatgatcac agcagctaca ccgtgcctgg ctacatgcag 180
gacctggaag tggaacaggc cctgacacca gctctgagcc cttgtgctgt gtccagcaca 240
ctgcccgatt ggcacatccc tgtggaagtg gtgcctgaca gcaccagcga cctgtacaac 300
ttccaagtgt cccctatgcc tagcacctcc gaggccacca ccgatgagga tgaagaggga 360
aagctgcccg aggacatcat gaagctgctg gaacagagcg agtggcagcc caccaatgtg 420
gatggcaagg gctacctgct gaacgagcct ggcgttcagc ctacaagcgt gtacggcgac 480
ttcagctgca aagaggaacc cgagatcgat agccctggcg gcgatatcgg actgagcctg 540
cagagagtgt tcaccgacct gaagaacatg gacgccacct ggctggacag cctgctgaca 600
cctgttagac tgccctctat ccaggctatc ccctgcgctc cttgagcggc cgcgaattgg 660
caagctgctt acatagaact cgcggcgatt ggcatgccgc cttaaaattt ttattttatt 720
tttcttttct tttccgaatc ggattttgtt tttaatattt caaaaaaaaa aaaaaaaaaa 780
aaaaaaaaaa aaaaaacgcg tcgaggggaa ttaattcttg aagacgaaag ggccaggtgg 840
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 900
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 960
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 1020
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 1080
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 1140
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 1200
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 1260
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 1320
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 1380
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 1440
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 1500
gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 1560
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 1620
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 1680
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 1740
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 1800
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 1860
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 1920
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 1980
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 2040
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 2100
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 2160
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 2220
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 2280
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 2340
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagcatt gagaaagcgc 2400
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 2460
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 2520
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 2580
gaaaaacgcc attctagaat ggcgcgccct taaggggaga ataggagccg caacacacaa 2640
gcaacgcgag gtcgtttaaa c 2661
Claims (24)
- (ⅰ) 적어도 하나의 치료 생체분자; 및 (ⅱ) 적어도 하나의 비-바이러스 선천적 조절 단백질(IMP);를 암호화하는 RNA 구조체.
- 청구항 1에 있어서,
상기 구조체는 mRNA를 포함하는 RNA 구조체. - 청구항 1에 있어서,
상기 구조체는 saRNA를 포함하는 RNA 구조체. - 청구항 1 내지 청구항 3 중 어느 한 항에 있어서,
상기 saRNA 구조체는 알파바이러스; 피코르나바이러스; 플라비바이러스; 루비바이러스; 페스티바이러스; 헤파시바이러스; 칼리시바이러스 및 코로나바이러스;로 이루어진 속의 군으로부터 선택되는 양성 가닥 RNA 바이러스, 바람직하게는 알파바이러스, 임의적으로는 VEEV를 포함하거나 이로부터 유래되는 RNA 구조체. - 청구항 1 내지 청구항 청구항 4 중 어느 한 항에 있어서,
상기 IMP는 포유동물 IMP이고, 바람직하게는 인간 IMP인 RNA 구조체. - 청구항 1 내지 청구항 5 중 어느 한 항에 있어서,
상기 IMP는 인터페론 조절 인자 활성을 억제하도록 구성되는 RNA 구조체. - 청구항 6에 있어서,
상기 IMP는 IRF1 DBD(1-164), IRF9(142-393), IRF4(1-129), IRF5 A68P, IRF3(191-427), IRF7(238-503), IRF2(1-113), IRF9(1-120), IRF4(21-129), IRF9(182-235), IRF9(200-308), IRF5(1-140), IRF6(1-115), IRF8(1-140), 및/또는 IRF1(141-325)로부터 선택되고; 바람직하게는 상기 IMP는 IRF1 DBD(1-164), IRF9(142-393), IRF4(1-129), IRF5 A68P, IRF3(191-427) 및/또는 IRF7(238-503)로부터 선택되는 RNA 구조체. - 청구항 1 내지 청구항 7 중 어느 한 항에 있어서,
상기 IMP는 인터페론 생산을 유도하고 인터페론-자극 유전자의 자극을 야기하는 경로를 억제하도록 구성되는 RNA 구조체. - 청구항 8에 있어서,
상기 IMP는 HSP90(CDC37)(1-232), STING 베타, MFN2(369-598), A20(606-790), A20(369-775), ARL5B, ARL16, FAF1, MFN2(1 - 757), USP21, USP27, CYLD, LGP2, DDX-56, MAVS(ΔCARD 도메인), TRIM35, MFN2(400-480), 및/또는 MFN2(369-490)로부터 선택되고; 바람직하게는 상기 IMP는 HSP90(CDC37)(1-232), STING 베타, MFN2(369-598), A20(606-790), A20(369-775), 및/또는 ARL5B, ARL16으로부터 선택되는 RNA 구조체. - 청구항 1 내지 청구항 9 중 어느 한 항에 있어서,
상기 IMP는 인터페론 신호전달을 억제하도록 구성되는 RNA 구조체. - 청구항 10에 있어서,
상기 IMP는 STAT2(133-315), IRF9(142-393), STAT1 DN, STAT2(1-851-F175DY701F), USP18, SOCS1, 및/또는 SOCS3으로부터 선택되고; 바람직하게는 상기 IMP는 STAT2(133-315), 및/또는 IRF9(142-393)로부터 선택되는 RNA 구조체. - 청구항 1 내지 청구항 11 중 어느 한 항에 있어서,
상기 IMP는 RNA 인식 시스템을 억제하도록 구성되는 RNA 구조체. - 청구항 12에 있어서,
상기 IMP는 Zinc AVP(1-200), TARBP2(1-234), PKR dsRNA BD(1-170), PACT PRKRA BD(1-194), OAS3 도메인 1, RNAse L 우성 음성, 및/또는 RIG-1 우성 음성 또는 스플라이스 변이체로부터 선택되고; 바람직하게는 상기 IMP는 Zinc AVP(1-200), TARBP2(1-234), PKR dsRNA BD(1-170) 및/또는 PACT PRKRA BD(1-194)로부터 선택되는 RNA 구조체. - 청구항 1 내지 청구항 13 중 어느 한 항에 있어서,
상기 치료 생체분자는 치료 단백질을 포함하며, 바람직하게는 상기 단백질 또는 펩티드는 항원이고, 보다 바람직하게는 바이러스 항원인 RNA 구조체. - 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체를 암호화하는 핵산 서열.
- 청구항 16에 따른 핵산 서열을 포함하는 발현 카세트.
- 청구항 16에 따른 발현 카세트를 포함하는 재조합 벡터.
- 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트 또는 청구항 17에 따른 벡터, 및 약학적으로 허용가능한 비히클을 포함하는 약학적 조성물.
- 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체의 제조 방법으로서,
상기 방법은
a) ⅰ) 숙주 세포 내로 청구항 18에 따른 벡터를 도입하는 단계; 및
ⅱ) 상기 숙주 세포를 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체의 생산을 야기하는 조건 하에 배양하는 단계; 또는
b) 청구항 17에 따른 벡터로부터 RNA 구조체를 전사하는 단계;를 포함하는 방법. - 약제로서 또는 치료법에 사용하기 위한 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트, 청구항 17에 따른 벡터, 또는 청구항 18에 따른 약학적 조성물.
- 원생동물, 진균, 박테리아 또는 바이러스 감염의 예방, 개선 또는 치료에 사용하기 위한 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트, 청구항 17에 따른 벡터, 또는 청구항 18에 따른 약학적 조성물.
- 암의 예방, 개선 또는 치료에 사용하기 위한 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트, 청구항 17에 따른 벡터, 또는 청구항 18에 따른 약학적 조성물.
- 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트, 청구항 17에 따른 벡터, 또는 청구항 18에 따른 약학적 조성물을 포함하는 백신.
- 대상체에서 면역 반응의 자극에 사용하기 위한 것이고, 임의적으로 상기 면역 반응은 원생동물, 박테리아, 바이러스, 진균 또는 암에 대해 자극되는 것인 청구항 1 내지 청구항 14 중 어느 한 항에 따른 RNA 구조체, 청구항 15에 따른 핵산 서열, 청구항 16에 따른 발현 카세트, 청구항 17에 따른 벡터, 또는 청구항 18에 따른 약학적 조성물.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB2020063.0A GB202020063D0 (en) | 2020-12-17 | 2020-12-17 | RNA construct |
GB2020063.0 | 2020-12-17 | ||
PCT/GB2021/053361 WO2022129944A1 (en) | 2020-12-17 | 2021-12-17 | Rna construct |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20230134488A true KR20230134488A (ko) | 2023-09-21 |
Family
ID=74221398
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020237024303A KR20230134488A (ko) | 2020-12-17 | 2021-12-17 | Rna 구조체 |
Country Status (11)
Country | Link |
---|---|
US (1) | US20240124530A1 (ko) |
EP (1) | EP4262854A1 (ko) |
JP (1) | JP2024501084A (ko) |
KR (1) | KR20230134488A (ko) |
CN (1) | CN116847878A (ko) |
AU (1) | AU2021399216A1 (ko) |
CA (1) | CA3205126A1 (ko) |
GB (1) | GB202020063D0 (ko) |
IL (1) | IL303723A (ko) |
MX (1) | MX2023007229A (ko) |
WO (1) | WO2022129944A1 (ko) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2439335A1 (en) * | 2001-02-26 | 2002-09-06 | Gesellschaft Fuer Biotechnologische Forschung Mbh (Gbf) | Interferon regulatory factor-1/human estrogen receptor fusion protein and its use for treating carcinomas |
WO2017162265A1 (en) | 2016-03-21 | 2017-09-28 | Biontech Rna Pharmaceuticals Gmbh | Trans-replicating rna |
BR112019008369A2 (pt) * | 2016-10-26 | 2019-10-01 | Modernatx Inc | ácidos ribonucleicos mensageiros para intensificar respostas imunes e métodos para uso dos mesmos |
-
2020
- 2020-12-17 GB GBGB2020063.0A patent/GB202020063D0/en not_active Ceased
-
2021
- 2021-12-17 WO PCT/GB2021/053361 patent/WO2022129944A1/en active Application Filing
- 2021-12-17 EP EP21836221.8A patent/EP4262854A1/en active Pending
- 2021-12-17 CA CA3205126A patent/CA3205126A1/en active Pending
- 2021-12-17 IL IL303723A patent/IL303723A/en unknown
- 2021-12-17 MX MX2023007229A patent/MX2023007229A/es unknown
- 2021-12-17 JP JP2023561927A patent/JP2024501084A/ja active Pending
- 2021-12-17 KR KR1020237024303A patent/KR20230134488A/ko unknown
- 2021-12-17 US US18/257,698 patent/US20240124530A1/en active Pending
- 2021-12-17 CN CN202180094102.2A patent/CN116847878A/zh active Pending
- 2021-12-17 AU AU2021399216A patent/AU2021399216A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2024501084A (ja) | 2024-01-10 |
MX2023007229A (es) | 2023-09-05 |
IL303723A (en) | 2023-08-01 |
CA3205126A1 (en) | 2022-06-23 |
EP4262854A1 (en) | 2023-10-25 |
GB202020063D0 (en) | 2021-02-03 |
US20240124530A1 (en) | 2024-04-18 |
AU2021399216A1 (en) | 2023-07-06 |
CN116847878A (zh) | 2023-10-03 |
WO2022129944A1 (en) | 2022-06-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2748892C2 (ru) | Рнк-репликон для универсальной и эффективной генной экспрессии | |
CN110352247B (zh) | 用于增强基因表达的组合物和方法 | |
DK2951309T3 (en) | INCREASED TRANSGEN EXPRESSION AND PROCESSING | |
CN113811607A (zh) | CRISPR-Cas效应子多肽和其使用方法 | |
CA3143117A1 (en) | Rna construct | |
KR20200058508A (ko) | 게놈 편집을 위한 폴리뉴클레오티드, 조성물 및 방법 | |
Ohimann et al. | Proteolytic cleavage of initiation factor elF-4γ in the reticulocyte lysate inhibits translation of capped mRNAs but enhances that of uncapped mRNAs | |
KR20170100660A (ko) | 신규 인공 핵산 분자 | |
WO2021191630A1 (en) | Coronavirus vaccine | |
KR20110119650A (ko) | 3중 나선구조를 갖는 단백질 및 그의 제조방법 | |
KR20180091099A (ko) | 단백질 제조를 위한 개선된 진핵 세포 및 이들을 제조하는 방법 | |
KR20230000471A (ko) | 비천연 5'-비번역 영역 및 3'-비번역 영역 및 그의 용도 | |
KR20230134488A (ko) | Rna 구조체 | |
CA3234214A1 (en) | Methods for determining mutations for increasing modified replicable rna function and related compositions and their use | |
JP2021533826A (ja) | 正確に改変した秘匿メッセンジャーrna、及び、その他のポリヌクレオチド | |
US20230364226A1 (en) | Rna construct | |
CN113105537B (zh) | 一种促进a型流感病毒复制的宿主蛋白及其应用 | |
KR20240118877A (ko) | Rna 구조체 | |
JP2023527910A (ja) | 多用途かつ効率的な遺伝子発現のためのrnaレプリコン | |
KR20220088729A (ko) | 폴리아민 콘쥬게이트를 생산하는 효모 | |
KR20230127069A (ko) | 세포질에서 자가 전사가 가능한 mRNA를 제공하는 RNA/DNA 시스템 | |
JP5126945B2 (ja) | デカケタイドを産生する非天然型植物ポリケタイド合成酵素 | |
KR101623526B1 (ko) | 인간 보체 인자 h의 생산 방법 |