CN117625652A - Novel coronavirus variant mRNA vaccine and application thereof - Google Patents
Novel coronavirus variant mRNA vaccine and application thereof Download PDFInfo
- Publication number
- CN117625652A CN117625652A CN202211064993.XA CN202211064993A CN117625652A CN 117625652 A CN117625652 A CN 117625652A CN 202211064993 A CN202211064993 A CN 202211064993A CN 117625652 A CN117625652 A CN 117625652A
- Authority
- CN
- China
- Prior art keywords
- seq
- mrna
- sequence
- nucleic acid
- glycoprotein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000711573 Coronaviridae Species 0.000 title claims description 53
- 108700021021 mRNA Vaccine Proteins 0.000 title description 7
- 229940126582 mRNA vaccine Drugs 0.000 title description 7
- 101710167605 Spike glycoprotein Proteins 0.000 claims abstract description 87
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 52
- 108020004999 messenger RNA Proteins 0.000 claims abstract description 22
- 230000028993 immune response Effects 0.000 claims abstract description 18
- 208000001528 Coronaviridae Infections Diseases 0.000 claims abstract description 14
- 108091026890 Coding region Proteins 0.000 claims abstract description 13
- 150000007523 nucleic acids Chemical class 0.000 claims description 91
- 102000039446 nucleic acids Human genes 0.000 claims description 83
- 108020004707 nucleic acids Proteins 0.000 claims description 83
- 108700026244 Open Reading Frames Proteins 0.000 claims description 57
- 150000002632 lipids Chemical class 0.000 claims description 45
- 125000003729 nucleotide group Chemical group 0.000 claims description 40
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 36
- 108090000623 proteins and genes Proteins 0.000 claims description 35
- 239000002773 nucleotide Substances 0.000 claims description 34
- 102000004169 proteins and genes Human genes 0.000 claims description 25
- 229960005486 vaccine Drugs 0.000 claims description 25
- 108091007433 antigens Proteins 0.000 claims description 24
- 102000036639 antigens Human genes 0.000 claims description 24
- -1 cationic lipid Chemical class 0.000 claims description 24
- 239000000427 antigen Substances 0.000 claims description 23
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 20
- 230000001939 inductive effect Effects 0.000 claims description 20
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 19
- 229930185560 Pseudouridine Natural products 0.000 claims description 19
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 claims description 19
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 claims description 19
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 claims description 19
- 239000003814 drug Substances 0.000 claims description 16
- 230000003472 neutralizing effect Effects 0.000 claims description 13
- 108020005345 3' Untranslated Regions Proteins 0.000 claims description 12
- 238000002360 preparation method Methods 0.000 claims description 11
- 230000004048 modification Effects 0.000 claims description 10
- 238000012986 modification Methods 0.000 claims description 10
- 125000002091 cationic group Chemical group 0.000 claims description 9
- 238000011282 treatment Methods 0.000 claims description 9
- UVBYMVOUBXYSFV-XUTVFYLZSA-N 1-methylpseudouridine Chemical compound O=C1NC(=O)N(C)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 UVBYMVOUBXYSFV-XUTVFYLZSA-N 0.000 claims description 8
- 230000005875 antibody response Effects 0.000 claims description 8
- 229930182558 Sterol Natural products 0.000 claims description 7
- AGWRKMKSPDCRHI-UHFFFAOYSA-K [[5-(2-amino-7-methyl-6-oxo-1H-purin-9-ium-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-oxidophosphoryl] [[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-oxidophosphoryl]oxy-5-(6-aminopurin-9-yl)-4-methoxyoxolan-2-yl]methoxy-oxidophosphoryl] phosphate Chemical compound COC1C(OP([O-])(=O)OCC2OC(C(O)C2O)N2C=NC3=C2N=C(N)NC3=O)C(COP([O-])(=O)OP([O-])(=O)OP([O-])(=O)OCC2OC(C(O)C2O)N2C=[N+](C)C3=C2N=C(N)NC3=O)OC1N1C=NC2=C1N=CN=C2N AGWRKMKSPDCRHI-UHFFFAOYSA-K 0.000 claims description 7
- 150000003432 sterols Chemical class 0.000 claims description 7
- 235000003702 sterols Nutrition 0.000 claims description 7
- UVBYMVOUBXYSFV-UHFFFAOYSA-N 1-methylpseudouridine Natural products O=C1NC(=O)N(C)C=C1C1C(O)C(O)C(CO)O1 UVBYMVOUBXYSFV-UHFFFAOYSA-N 0.000 claims description 6
- AMMRPAYSYYGRKP-BGZDPUMWSA-N 5-[(2s,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1-ethylpyrimidine-2,4-dione Chemical compound O=C1NC(=O)N(CC)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 AMMRPAYSYYGRKP-BGZDPUMWSA-N 0.000 claims description 6
- ZXIATBNUWJBBGT-JXOAFFINSA-N 5-methoxyuridine Chemical compound O=C1NC(=O)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXIATBNUWJBBGT-JXOAFFINSA-N 0.000 claims description 6
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 claims description 6
- FHHZHGZBHYYWTG-INFSMZHSSA-N [(2r,3s,4r,5r)-5-(2-amino-7-methyl-6-oxo-3h-purin-9-ium-9-yl)-3,4-dihydroxyoxolan-2-yl]methyl [[[(2r,3s,4r,5r)-5-(2-amino-6-oxo-3h-purin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl] phosphate Chemical compound N1C(N)=NC(=O)C2=C1[N+]([C@H]1[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=C(C(N=C(N)N4)=O)N=C3)O)O1)O)=CN2C FHHZHGZBHYYWTG-INFSMZHSSA-N 0.000 claims description 6
- 230000002265 prevention Effects 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims description 5
- 238000013518 transcription Methods 0.000 abstract description 24
- 230000035897 transcription Effects 0.000 abstract description 23
- 238000013519 translation Methods 0.000 abstract description 22
- 238000005457 optimization Methods 0.000 abstract description 10
- 241000124008 Mammalia Species 0.000 abstract description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 5
- 239000002502 liposome Substances 0.000 abstract description 4
- 238000001727 in vivo Methods 0.000 abstract description 3
- 108700001237 Nucleic Acid-Based Vaccines Proteins 0.000 abstract 1
- 229940023146 nucleic acid vaccine Drugs 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 39
- 230000014616 translation Effects 0.000 description 23
- 238000000034 method Methods 0.000 description 21
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 20
- 239000000203 mixture Substances 0.000 description 18
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 14
- 238000001890 transfection Methods 0.000 description 14
- 241000700605 Viruses Species 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 108020004705 Codon Proteins 0.000 description 10
- 235000012000 cholesterol Nutrition 0.000 description 10
- 229940079593 drug Drugs 0.000 description 10
- 239000002105 nanoparticle Substances 0.000 description 10
- 108090000765 processed proteins & peptides Proteins 0.000 description 10
- 238000009472 formulation Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 9
- 239000012096 transfection reagent Substances 0.000 description 9
- QGWBEETXHOVFQS-UHFFFAOYSA-N 6-[6-(2-hexyldecanoyloxy)hexyl-(4-hydroxybutyl)amino]hexyl 2-hexyldecanoate Chemical compound CCCCCCCCC(CCCCCC)C(=O)OCCCCCCN(CCCCO)CCCCCCOC(=O)C(CCCCCC)CCCCCCCC QGWBEETXHOVFQS-UHFFFAOYSA-N 0.000 description 8
- 230000000694 effects Effects 0.000 description 8
- 229920002477 rna polymer Polymers 0.000 description 8
- 229940035893 uracil Drugs 0.000 description 8
- 230000003612 virological effect Effects 0.000 description 8
- 241000699666 Mus <mouse, genus> Species 0.000 description 7
- 108091023045 Untranslated Region Proteins 0.000 description 7
- 230000000069 prophylactic effect Effects 0.000 description 7
- 208000035473 Communicable disease Diseases 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 230000001225 therapeutic effect Effects 0.000 description 6
- 108020003589 5' Untranslated Regions Proteins 0.000 description 5
- DJJCXFVJDGTHFX-UHFFFAOYSA-N Uridinemonophosphate Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-UHFFFAOYSA-N 0.000 description 5
- FOGRQMPFHUHIGU-UHFFFAOYSA-N Uridylic acid Natural products OC1C(OP(O)(O)=O)C(CO)OC1N1C(=O)NC(=O)C=C1 FOGRQMPFHUHIGU-UHFFFAOYSA-N 0.000 description 5
- 230000027455 binding Effects 0.000 description 5
- 230000015556 catabolic process Effects 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 238000006731 degradation reaction Methods 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000002649 immunization Methods 0.000 description 5
- 230000003053 immunization Effects 0.000 description 5
- 230000005847 immunogenicity Effects 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 230000014621 translational initiation Effects 0.000 description 5
- DJJCXFVJDGTHFX-XVFCMESISA-N uridine 5'-monophosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-XVFCMESISA-N 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 4
- 229910052799 carbon Inorganic materials 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000007912 intraperitoneal administration Methods 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 150000003904 phospholipids Chemical class 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 239000011550 stock solution Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 3
- 102000004961 Furin Human genes 0.000 description 3
- 108090001126 Furin Proteins 0.000 description 3
- 108091027974 Mature messenger RNA Proteins 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 108091034057 RNA (poly(A)) Proteins 0.000 description 3
- BGNVBNJYBVCBJH-UHFFFAOYSA-N SM-102 Chemical compound OCCN(CCCCCCCC(=O)OC(CCCCCCCC)CCCCCCCC)CCCCCC(OCCCCCCCCCCC)=O BGNVBNJYBVCBJH-UHFFFAOYSA-N 0.000 description 3
- 230000007910 cell fusion Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 230000002708 enhancing effect Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 210000000987 immune system Anatomy 0.000 description 3
- 238000007918 intramuscular administration Methods 0.000 description 3
- 125000005647 linker group Chemical group 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 238000007747 plating Methods 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 2
- 241001123946 Gaga Species 0.000 description 2
- 241000008920 Gammacoronavirus Species 0.000 description 2
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000638154 Homo sapiens Transmembrane protease serine 2 Proteins 0.000 description 2
- 239000000232 Lipid Bilayer Substances 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 101800004803 Papain-like protease Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000002067 Protein Subunits Human genes 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 101710137500 T7 RNA polymerase Proteins 0.000 description 2
- NRLNQCOGCKAESA-KWXKLSQISA-N [(6z,9z,28z,31z)-heptatriaconta-6,9,28,31-tetraen-19-yl] 4-(dimethylamino)butanoate Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCC(OC(=O)CCCN(C)C)CCCCCCCC\C=C/C\C=C/CCCCC NRLNQCOGCKAESA-KWXKLSQISA-N 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 229940076810 beta sitosterol Drugs 0.000 description 2
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 2
- LGJMUZUPVCAVPU-UHFFFAOYSA-N beta-Sitostanol Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(C)CCC(CC)C(C)C)C1(C)CC2 LGJMUZUPVCAVPU-UHFFFAOYSA-N 0.000 description 2
- NJKOMDUNNDKEAI-UHFFFAOYSA-N beta-sitosterol Natural products CCC(CCC(C)C1CCC2(C)C3CC=C4CC(O)CCC4C3CCC12C)C(C)C NJKOMDUNNDKEAI-UHFFFAOYSA-N 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 150000001783 ceramides Chemical class 0.000 description 2
- GGCLNOIGPMGLDB-GYKMGIIDSA-N cholest-5-en-3-one Chemical class C1C=C2CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 GGCLNOIGPMGLDB-GYKMGIIDSA-N 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 150000001982 diacylglycerols Chemical class 0.000 description 2
- 125000005265 dialkylamine group Chemical class 0.000 description 2
- 150000001985 dialkylglycerols Chemical class 0.000 description 2
- 238000002296 dynamic light scattering Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000010166 immunofluorescence Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 210000004492 nuclear pore Anatomy 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 150000008103 phosphatidic acids Chemical class 0.000 description 2
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- KZJWDPNRJALLNS-VJSFXXLFSA-N sitosterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CC[C@@H](CC)C(C)C)[C@@]1(C)CC2 KZJWDPNRJALLNS-VJSFXXLFSA-N 0.000 description 2
- 229950005143 sitosterol Drugs 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 2
- 229940045145 uridine Drugs 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- NRJAVPSFFCBXDT-HUESYALOSA-N 1,2-distearoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCCCC NRJAVPSFFCBXDT-HUESYALOSA-N 0.000 description 1
- 101800001779 2'-O-methyltransferase Proteins 0.000 description 1
- VUFNLQXQSDUXKB-DOFZRALJSA-N 2-[4-[4-[bis(2-chloroethyl)amino]phenyl]butanoyloxy]ethyl (5z,8z,11z,14z)-icosa-5,8,11,14-tetraenoate Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(=O)OCCOC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 VUFNLQXQSDUXKB-DOFZRALJSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 101710159080 Aconitate hydratase A Proteins 0.000 description 1
- 101710159078 Aconitate hydratase B Proteins 0.000 description 1
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 1
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 1
- 241000008904 Betacoronavirus Species 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 241001678559 COVID-19 virus Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 108090000227 Chymases Proteins 0.000 description 1
- 102000003858 Chymases Human genes 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241001461743 Deltacoronavirus Species 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 102100027685 Hemoglobin subunit alpha Human genes 0.000 description 1
- 101500024730 Homo sapiens Angiotensin-2 Proteins 0.000 description 1
- 101001009007 Homo sapiens Hemoglobin subunit alpha Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 101100185402 Mus musculus Mug1 gene Proteins 0.000 description 1
- 241001292005 Nidovirales Species 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 101150001779 ORF1a gene Proteins 0.000 description 1
- 101001028244 Onchocerca volvulus Fatty-acid and retinol-binding protein 1 Proteins 0.000 description 1
- 102000005877 Peptide Initiation Factors Human genes 0.000 description 1
- 108010044843 Peptide Initiation Factors Proteins 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- 102000029797 Prion Human genes 0.000 description 1
- 108091000054 Prion Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 101000933967 Pseudomonas phage KPP25 Major capsid protein Proteins 0.000 description 1
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 1
- 101710105008 RNA-binding protein Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 102100031989 Transmembrane protease serine 2 Human genes 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 230000026502 entry into host cell Effects 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical class C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- VRIVJOXICYMTAG-IYEMJOQQSA-L iron(ii) gluconate Chemical compound [Fe+2].OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C([O-])=O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C([O-])=O VRIVJOXICYMTAG-IYEMJOQQSA-L 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 210000000865 mononuclear phagocyte system Anatomy 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical group CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 1
- 239000013047 polymeric layer Substances 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 108010027510 vaccinia virus capping enzyme Proteins 0.000 description 1
- 230000008478 viral entry into host cell Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000000733 zeta-potential measurement Methods 0.000 description 1
- 244000059546 zoonotic virus Species 0.000 description 1
Landscapes
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
The invention relates to the field of nucleic acid vaccines, and the inventor obtains an optimized omicron (B.1.1.529) S glycoprotein mRNA sequence after transcription by optimizing a novel coronary variant omicron (B.1.1.529) S glycoprotein coding region. The mRNA after transcription optimization has more stable structure, higher translation efficiency of S glycoprotein delivered to mammals and human bodies and can induce immune response in vivo. The mRNA nucleic acid molecule of the invention can be prepared into liposome pharmaceutical compositions for controlling, preventing and treating coronavirus infection diseases.
Description
Technical Field
The invention relates to the technical field of vaccine development, in particular to mRNA nucleic acid molecules for controlling, preventing and treating coronavirus infection, a pharmaceutical composition containing the mRNA nucleic acid molecules and application thereof.
Background
Coronavirus (CoV) is a zoonotic virus that is widely found in nature, and was originally discovered and isolated in mice. CoV belongs to the order of the mantle viridae (Nidovirales), the subgenoles Coronaviridae (cornidovirrineae), the family Coronaviridae (cornoavidae), the subfamily orthocoronaviridae (orthoviroavirinae). The orthocoronaviridae are divided into 47 genera of coronaviridae a (alphacorenavirus, α -CoV), coronaviridae b (betacorenavirus, β -CoV), coronaviridae c (Gammacoronavirus, γ -CoV) and delta coronaviridae (deltacorenavirus, δ -CoV) 4, of which α -CoV is divided into 19 subgenera, according to the unique conserved 5 domains in all viruses of the order of the set of viruses, in combination with phylogenetic and genomic structures; beta-CoV is divided into 16 subgenera 5; delta-CoV split 3 subgenera 7; gamma-CoV is divided into 3 subgenera 5.
CoV is an enveloped single-stranded sense (+) non-segmented RNA virus, approximately 100nm in diameter, with a genome of about 26-32 kb. The genome has a cap structure at the 5 'end and a poly (A) structure at the 3' end; both the 5 'and 3' ends have untranslated regions (untranslated regions, UTR) which contain cis-regulatory elements that play an important role in replication and transcription of the viral genome. The 2/3 region of the 5' end of the genome encodes 2 large open reading frames (open reading frame, ORF): ORF1a and ORF1b, translated into 2 large polyproteins 1ab (PP 1 ab), PP1ab is cleaved by the viral chymotrypsin-like protease 3 chymmotrypsin-like protease 3,3CL-pro and papain-like protease (PLpro) into 16 nonstructural proteins (non-structural protein, nsp) that are primarily involved in replication and transcription of the virus; the genome of the other 1/3 region encodes 4 major structural proteins: spike protein (S), envelope protein (E), membrane protein (M), nucleocapsid protein (N); among the S and N genes are genes encoding accessory proteins designated ORF 3a, 6, 7a, 8 and 10, respectively, which are essential components of viral particles. Wherein, S glycoprotein is located on the surface of virus to form a rod-shaped structure, and is one of main antigen proteins of virus, and is a main gene for typing.
Like all coronaviruses, SARS-CoV-2 utilizes S glycoprotein to facilitate entry into host cells. The protein contains two important functional domains: one is the S1 Receptor Binding Domain (RBD) and the other is the S2 domain that mediates fusion of the viral and host cell membranes.
The S glycoprotein first binds to human angiotensin 2 (hACE 2) expressing surface receptors of host cells using the S1 receptor binding domain. Next, the S1 receptor binding domain is shed from the viral surface, facilitating fusion of the S2 domain with the host cell membrane. This process requires activation of the S glycoprotein, which is accomplished by cleavage at two sites (S1/S2 and S2') by proteases Furin and TMPRSS 2. Cleavage of Furin at the S1/S2 site can result in a conformational change in the viral S glycoprotein, thereby exposing the RBD domain and/or S2 domain. Researchers believe that TMPRSS2 cleavage of the SARS-CoV-2S glycoprotein can promote fusion of the viral capsid with the host cell, thereby allowing the virus to enter the host cell.
Exposure of the RBD domain in the S1 protein subunit can result in conformational instability of the subunit. Thus, during binding, conformational rearrangement of the subunit occurs in two states, referred to as the up-conformation and the down-conformation, respectively. Wherein the down state will transiently hide the RBD domain and the up state will expose the RBD domain, which will eventually temporarily place the protein subunit in an unstable state. In trimeric S glycoproteins, only one of the three RBD domains binds to a host cell surface receptor in an accessible conformation.
SARS-CoV-2S glycoprotein, ACE2 and Furin and TMPRSS2 enzymes play an important role in binding and mediating viral entry into host cells and thus can be a key target for covd drug development and viral inhibition, however, vaccines play a very critical role in preventing or blocking the transmission of novel coronaviruses.
The new coronavirus vaccines currently existing on the market have a number of drawbacks in their general design and optimization of the S glycoprotein mRNA sequence, in particular: 1) mRNA sequences are poorly stable and are easily degraded by intracellular enzymes in cells, resulting in lower levels of S glycoprotein effective antigen expression. 2) The mRNA sequence is translated with low efficiency, resulting in a low amount of effective antigen translated into S glycoprotein in the cell, which affects the effective activation of the immune system. Thus, it is currently desirable to provide an optimized design of S glycoprotein mRNA sequences for novel coronavirus vaccines.
In view of this, the present invention has been made.
Summary of The Invention
The present invention provides nucleic acid molecules useful for the prevention, control and treatment of novel coronavirus variants omicron (b.1.1.529), the optimized nucleic acid molecules comprising a coding region, wherein said coding region comprises one or more open reading frames (open reading frame, ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529), said S glycoprotein having the protein sequence shown in SEQ ID NO: 1. The optimized nucleotide sequence ensures that the transcribed mRNA structure is more stable, and the translation efficiency of target proteins in mammals and human bodies is higher.
In some embodiments, the invention provides an optimized nucleic acid molecule comprising an S glycoprotein encoding a coronavirus omicron (b.1.1.529), wherein the coding region comprises one or more open reading frames (open reading frame, ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529) having a protein sequence as shown in SEQ ID No. 1, but having a nucleotide sequence at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identity to the sequence shown in SEQ ID No. 2.
In another embodiment, the invention provides an optimized nucleic acid molecule comprising an S glycoprotein encoding coronavirus omicron (B.1.1.529), the sequence optimization comprising adjusting for codon preference when expressed in humans, increasing GC content of the sequence, etc., the sequence of which is shown in SEQ ID NO:3-SEQ ID NO: 5.
In another embodiment, the optimized DNA coding gene is used as a template for transcription to obtain an mRNA molecule for coding the S glycoprotein of the novel coronal variant omicron (B.1.1.529), and the sequence of the mRNA molecule is shown as SEQ ID NO. 6-SEQ ID NO. 8. The optimized nucleotide sequence enables the transcribed mRNA structure to be more stable, the translation efficiency of target proteins in mammals and human bodies to be higher, the technical problems that mRNA translation efficiency is poor, stability is to be improved and the like in the prior art can be solved, and an organism can be efficiently induced to generate immune response, so that the problems of low antibody titer, small effective antigen quantity and high production cost of a novel coronary variant strain omicron after immunization in the prior art are improved.
In another embodiment, the present disclosure provides an mRNA nucleic acid molecule comprising:
(i) A 5 'untranslated region (5' -UTR);
(ii) A CDS, wherein the CDS comprises an Open Reading Frame (ORF) encoding a coronavirus omicron (b.1.1.529) antigen S glycoprotein capable of inducing an immune response comprising a nucleotide sequence as set forth in SEQ ID No. 6-SEQ ID No. 8;
(iii) 3 '-untranslated region (3' -UTR).
In another embodiment, the mRNA nucleic acid molecule further comprises a 5 'untranslated region (5' utr). Optionally, wherein the 5' UTR comprises the sequence shown as SEQ ID NO 9-SEQ ID NO 11.
In another embodiment, the mRNA nucleic acid molecule further comprises a 3 'untranslated region (3' utr). Optionally, wherein the 3' UTR comprises the sequence of SEQ ID NO. 12-SEQ ID NO. 14.
In another embodiment, the mRNA nucleic acid molecules of the present invention further comprise a native 5' -cap structure or analog thereof produced by an endogenous process. Modification of the 5' -cap can increase the stability of the nucleic acid molecule, increase its half-life and can increase translation efficiency. Modifications to the native 5' -cap structure include 2' -O-methylation at the 5' -end of the polynucleotide and/or at the ribose 2' -hydroxy group of the 5' -end nucleic acid. The 5' -cap analogue may optionally be selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G; m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc.
In another embodiment, the mRNA nucleic acid molecule further comprises a poly-a tail or polyadenylation signal, optionally having a length of 80 to 180 nucleotides. In a preferred embodiment, the 3' -polyadenylation sequence (polyA) is preferably 80-180A with a linker sequence in between, as shown in SEQ ID NO. 15; more preferably 80-160A's, still more preferably 130A's, as shown in SEQ ID NO. 16.
In another preferred embodiment, the CDS comprises a signal peptide sequence as shown in SEQ ID NO. 17-SEQ ID NO. 19 and a mature peptide sequence as shown in SEQ ID NO. 1.
In another embodiment, the mRNA nucleic acid molecule further comprises one or more functional nucleotide analog modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m1ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C). In some embodiments, the presently disclosed mRNA includes pseudo-uridine substitutions at one or more or all uridine positions of the nucleic acid.
In another embodiment, the invention provides a ribonucleic acid (RNA) comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA), wherein CDS comprises an Open Reading Frame (ORF) encoding coronavirus omicron (b.1.1.529) antigen S glycoprotein capable of inducing an immune response. Wherein the 5' -UTR further comprises a Kozak sequence. The addition of 5 '-cap structure, 5' -UTR, kozak, 3 '-poly A sequence (polyA) and 3' -UTR sequence elements can further improve the stability of the sequence and avoid degradation. In another preferred embodiment, the CDS may further comprise a signal peptide sequence. The purpose of adding the signal peptide is to enhance the translation of the protein, improve the translation stability and promote better folding of the protein. The preferred signal peptide sequence is shown in SEQ ID NO. 17-SEQ ID NO. 19.
In another preferred embodiment, the invention provides an mRNA nucleic acid molecule comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA) as shown in SEQ ID NO. 20-SEQ ID NO. 22.
In a particularly preferred embodiment, the inventors have achieved a reduction in mRNA immunogenicity by reducing the U (uridylic acid) content of the mRNA molecule by substitution of all uracils in the nucleic acid sequences SEQ ID NO:20-SEQ ID NO:22 with pseudouridine (ψ), the capping substituted sequences shown in SEQ ID NO:23-SEQ ID NO: 25.
In another embodiment, provided herein are pharmaceutical compositions for inducing a neutralizing antibody response in a subject to a novel coronal variant omicron (b.1.1.529) S glycoprotein, the pharmaceutical compositions described herein comprising mRNA molecules having the sequences shown in SEQ ID No. 20-SEQ ID No. 25, or any combination thereof, encapsulated in a lipid shell formulated in a lipid nanoparticle form, wherein the lipid nanoparticle generally comprises an ionizable cationic lipid, a non-cationic lipid, a sterol, and a PEG lipid component.
In another embodiment, provided herein are mRNAs as shown in SEQ ID NO. 20-SEQ ID NO. 25 useful for inducing a neutralizing antibody response in a subject to the S glycoprotein of the novel coronavariant omicron (B.1.1.529), which mRNAs provided herein can control, prevent or treat infectious diseases caused by coronaviruses in a subject to be administered.
In another embodiment, the subject of the invention is a human or non-human mammal. In another embodiment, the subject is an adult, a human child, or a human infant. In another embodiment, the subject is at risk of or is susceptible to a coronavirus infection. In another embodiment, the subject is an elderly person. In another embodiment, the subject has been diagnosed as positive for a coronavirus infection. In another embodiment, the subject is asymptomatic.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are needed in the description of the embodiments or the prior art will be briefly described, and it is obvious that the drawings in the description below are some embodiments of the present invention, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
FIG. 1 shows the effect of flow cytometry in example 2 of the present invention on the expression of different omacron (B.1.1.529) S glycoprotein mRNA transfected HEK293 cells;
FIG. 2 shows the effect of intermediate immunofluorescence detection of different omicron (B.1.1.529) S glycoprotein mRNA in HEK293 cells transfected according to example 3 of the present invention;
FIG. 3 shows the effect of Western blot detection on the expression of different omicron (B.1.1.529) S glycoprotein mRNA transfected HEK293 cells in example 4 of the present invention;
FIG. 4 shows serum antibodies after immunization of animals with different omacron (B.1.1.529) S glycoprotein mRNA preparations detected by IgG neutralizing antibodies of example 5 of the present invention;
FIG. 5 is a schematic diagram of mRNA cap structure.
Detailed Description
The present invention provides therapeutic nucleic acid molecules useful for the prevention, management and treatment of infectious diseases or conditions caused by the novel coronal variant virus omacron (b.1.1.529). By optimizing the nucleotide sequence of the S glycoprotein of encoding coronavirus omicron (B.1.1.529), the transcribed mRNA structure is more stable, and the translation efficiency of the target protein in mammals and human bodies is higher. Provided herein are pharmaceutical compositions for inducing a neutralizing antibody response in a subject to a novel coronal variant omicron (b.1.1.529) S glycoprotein, comprising mRNA molecules encoding the novel coronal variant omicron (b.1.1.529) S glycoprotein and related therapeutic methods and uses for preventing, managing and treating infectious diseases caused by the novel coronal variant omicron (b.1.1.529) to improve the problems of low antibody titer, low effective antigen amount, and high production cost of the novel coronal variant omicron in the prior art.
To make the invention easier to understand, certain terms are first defined. Additional definitions will be set forth throughout the detailed description.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are within the skill of the art, which are fully explained in the technical literature and general textbooks of the art, such as Molecular Cloning: a Laboratory Manual (molecular cloning: laboratory Manual), etc.
Antigen (abbreviated Ag) refers to a substance that causes the production of antibodies, and any substance that induces an immune response. The foreign molecules can be passed through the recognition of immunoglobulins on B cells or through the treatment of antigen presenting cells and combined with the major histocompatibility complex to form a complex that reactivates T cells, eliciting a continuous immune response. In general, an antigen may be a protein capable of inducing an immune response (e.g., causing the immune system to produce antibodies to the antigen).
Nucleic acids, which are a class of biological macromolecules composed of nucleotides, are largely classified into two classes, deoxyribonucleic acid (Deoxyribonucleic acid, DNA) and Ribonucleic acid (RNA). In addition to prions, nucleic acids are biological macromolecules necessary for life, which act primarily as carriers of genetic information in the life system.
Open reading frame (open reading frame, ORF): it is meant that the normal nucleotide sequence of the structural gene, from the start codon to the stop codon, encodes the complete polypeptide chain without the stop codon interrupting translation.
CDS: is an abbreviation for Coding sequence. The CDS may be one ORF but may also include multiple ORFs.
Messenger ribonucleic acid (mRNA) is a template that directs the synthesis of proteins and is a messenger that transfers genetic information from DNA to proteins. The main sequence of mature mRNA is the coding region, with non-coding regions both 5' upstream and downstream. Eukaryotic mRNA molecules also have 5 'cap and 3' tail structures at both ends.
5' -cap structure: is a modified 5' end of messenger RNA (mRNA) in eukaryotes. It is linked to the 5' nucleotide of mRNA by pyrophosphorylation from methylated guanylic acid (m 7G) after mRNA transcription. The 5' end cap structure plays an important role in many biological functions of mRNA, including: shearing, stabilization, transport, and ribosome recruitment of mRNA. Due to the important biological functions of the 5 'end cap sub-structure, it is desirable to optimize the 5' end cap sub-structure during the development of mRNA vaccines and to "cap" the mRNA feed using a suitable process.
Untranslated region (UTR): refers to any fragment located at both ends of the mRNA strand coding sequence. If it is at the 5 'end, it is referred to as the 5' untranslated region (or "leader") and if it is at the 3 'end, it is referred to as the 3' untranslated region (or "trailer"). The untranslated region sequence is not a codon and cannot be translated into an amino acid, but can control the degradation and transcription efficiency of an mRNA product by an RNA-binding protein. Therefore, each mRNA vaccine needs to consider how to design the UTR module of the product, thereby improving the stability and transcription efficiency of the mRNA vaccine in the cells of interest.
Poly (A): the 3' -end of most eukaryotic mRNA has Poly (A) tail formed by polymerizing 100-200 adenylates, which can prevent exonuclease degradation and can be used as a marker of nuclear pore transport system, and is related to the transport of mature mRNA to cytoplasm through nuclear pores. The length of Poly (A) is also a very important optimization criterion in the design of the sequence of mRNA vaccine products.
Kozak sequence: eukaryotic translation initiation sequences are commonly referred to as "Kozak" consensus sequences. A Kozak sequence is a nucleic acid sequence located behind the 5' -UTR of eukaryotic mRNA and between the initiation codon AUG, which represents the translation initiation site of the methionine codon at position +1, typically ACCACCAUGG, GCCAUGAUGG, etc. The Kozak sequence on an mRNA molecule is recognized by the ribosome as a translation initiation site, which can bind to a translation initiation factor to mediate translation initiation of mRNA containing a 5' cap structure. The Kozak sequence may be used to enhance the translation efficiency of eukaryotic genes. The kozak sequence different from the present invention may be selected and replaced or deleted by conventional technical means, for example, the kozak sequence may be replaced with GCC, ACC, GCCACC or GCCANN etc., where N denotes A, T/U, C or G.
Transcription: is the process by which genetic information is copied from DNA to RNA (especially mRNA) under the catalysis of RNA polymerase. As a first step in protein biosynthesis, transcription is the pathway for the synthesis of mRNA as well as non-coding RNAs (tRNA, rRNA, etc.).
Polyadenylation: refers to the covalent linking of polyadenylation acids to messenger RNA (mRNA) molecules. During protein biosynthesis, this is part of the way in which mature mRNA is produced ready for translation.
Antibody titer: the minimum concentration (i.e., the maximum dilution) required for a certain antibody (anti) to recognize a particular epitope (epitope) is measured.
Neutralizing antibodies: is an antibody which is used for protecting cells from a certain antigen or infectious agent.
Isolated polypeptides, antibodies, polynucleotides, vectors, cells, or compositions include those that have been purified to the extent that they are no longer found in their form in nature.
The term "subject" refers to any animal, including but not limited to humans, non-human primates, dogs, cats, rodents, etc., that is the recipient of a particular treatment. Typically, the terms "subject" and "patient" are used interchangeably herein with reference to a human subject of the present invention.
The term "effective amount" or "therapeutically effective amount" or "therapeutic effect" refers to a therapeutically effective amount to treat a disease or disorder in a subject. The therapeutically effective amount of the medicament has therapeutic effects, so that the development of diseases or symptoms can be prevented, slowed down or lightened, the morbidity and the mortality are reduced, and the quality of life is improved.
Nucleic acid molecules
The present invention provides nucleic acid molecules useful for the prevention, control and treatment of novel coronavariant viruses omicron (b.1.1.529) which, when administered to a subject in need thereof, are expressed by cells in the subject to produce a coded peptide or polypeptide. An optimized nucleic acid molecule comprises a coding region, wherein the coding region comprises one or more Open Reading Frames (ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529), said S glycoprotein having a protein sequence as set forth in SEQ ID NO: 1:
NLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHVISGTNGTKRFDNPVLPFNDGVYFASIEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLDHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPIIVREPEDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFDEVFNATRFASVYAWNRKRISNCVADYSVLYNLAPFFTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGNIADYNYKLPDDFTGCVIAWNSNKLDSKVSGNYNYLYRLFRKSNLKPFERDISTEIYQAGNKPCNGVAGFNCYFPLRSYSFRPTYGVGHQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLKGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQGVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEYVNNSYECDIPIGAGICASYQTQTKSHRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLKRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKYFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFKGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNHNAQALNTLVKQLSSKFGAISSVLNDIFSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT*(SEQ ID NO:1)
in some embodiments, the nucleic acid molecule comprising a nucleic acid sequence encoding a coronavirus omicron (B.1.1.529) S glycoprotein has the nucleotide sequence shown in SEQ ID NO. 2.
AATCTTACAACCAGAACTCAATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGTTATCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGCTTCCATTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTATTATAGTGCGTGAGCCAGAAGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGATGAAGTTTTTAACGCCACCAGATTTGCATCTGTTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATCTCGCACCATTTTTCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAATATTGCTGATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAAGCTTGATTCTAAGGTTAGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAACTGAAATCTATCAGGCCGGTAACAAACCTTGTAATGGTGTTGCAGGTTTTAATTGTTACTTTCCTTTACGATCATATAGTTTCCGACCCACTTATGGTGTTGGTCACCAACCATACAGAGTAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAAAAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTGTTCTTTATCAGGGTGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTACTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGCTGAATATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACTCAGACTAAGTCTCATCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAAACGTGCTTTAACTGGAATAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACCAATTAAATATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAAAGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTGGAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCATAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAAATTTGGTGCAATTTCAAGTGTTTTAAATGATATCTTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTATGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGATTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTGCCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATGA(SEQ ID NO:2)
In another embodiment, the nucleic acid molecule is a DNA molecule. The invention optimizes the codon of S glycoprotein (SEQ ID NO: 1) of encoding coronavirus omacron (B.1.1.529), and the sequence optimization comprises: optimizing GC content in human body, optimizing the use frequency of common codons, optimizing codon preference and the like to obtain corresponding DNA coding genes. In some embodiments, the invention provides an optimized nucleic acid molecule comprising an S glycoprotein encoding a coronavirus omicron (b.1.1.529), wherein the coding region comprises one or more Open Reading Frames (ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529) having a sequence as set forth in SEQ ID No. 1, but having a nucleotide sequence that is at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to the sequence set forth in SEQ ID No. 2.
In another preferred embodiment, the invention provides an optimized nucleic acid molecule comprising an S glycoprotein encoding coronavirus omacron (B.1.1.529) having the sequence shown in SEQ ID NO:3-SEQ ID NO: 5. Increasing the GC content and optimizing the codons to obtain the nucleotide sequence enables the transcribed mRNA structure to be more stable and the translation efficiency of the target protein to be higher in mammals and human bodies. The DNA encoding gene is cloned into a vector and transformed into a host, which is a prokaryotic cell, eukaryotic cell or mammalian cell, preferably a prokaryotic cell, more preferably E.coli, etc.
AACCTAACCACCCGCACCCAACTACCGCCGGCGTACACCAACTCGTTCACCCGCGGGGTCTACTACCCGGACAAGGTCTTCCGCTCGTCGGTCCTACACTCGACCCAAGACCTATTCCTACCGTTCTTCTCGAACGTCACCTGGTTCCACGTCATATCGGGGACCAACGGGACCAAGCGCTTCGACAACCCGGTCCTACCGTTCAACGACGGGGTCTACTTCGCGTCGATAGAGAAGTCGAACATAATACGCGGCTGGATTTTCGGCACTACTCTGGACTCTAAGACTCAGTCTCTGCTGATTGTGAACAACGCTACTAACGTGGTGATTAAGGTGTGCGAGTTCCAGTTCTGCAACGACCCTTTCCTGGACCACAAGAACAACAAGTCTTGGATGGAGTCTGAGTTCCGTGTGTACTCTTCTGCTAACAACTGCACTTTCGAGTACGTGTCTCAGCCTTTCCTGATGGACCTGGAGGGCAAGCAGGGCAACTTCAAGAACCTGCGTGAGTTCGTGTTCAAGAACATTGACGGCTACTTCAAGATTTACTCTAAGCACACTCCTATTATTGTGCGTGAGCCTGAGGACCTGCCTCAGGGCTTCTCTGCTCTGGAGCCTCTGGTGGACCTGCCTATTGGCATTAACATTACTCGTTTCCAGACTCTGCTGGCTCTGCACCGTTCTTACCTGACTCCTGGCGACTCTTCTTCTGGCTGGACTGCTGGCGCTGCTGCTTACTACGTGGGCTACCTGCAGCCTCGTACTTTCCTGCTGAAGTACAACGAGAACGGCACTATTACTGACGCTGTGGACTGCGCTCTGGACCCTCTGTCTGAGACTAAGTGCACTCTGAAGTCTTTCACTGTGGAGAAGGGCATTTACCAGACTTCTAACTTCCGTGTGCAGCCTACTGAGTCTATTGTGCGTTTCCCTAACATTACTAACCTGTGCCCTTTCGACGAGGTGTTCAACGCTACTCGTTTCGCTTCTGTGTACGCTTGGAACCGTAAGCGTATTTCTAACTGCGTGGCTGACTACTCTGTGCTGTACAACCTGGCTCCTTTCTTCACTTTCAAGTGCTACGGCGTGTCTCCTACTAAGCTGAACGACCTGTGCTTCACTAACGTGTACGCTGACTCTTTCGTGATTCGTGGCGACGAGGTGCGTCAGATTGCTCCTGGCCAGACTGGCAACATTGCTGACTACAACTACAAGCTGCCTGACGACTTCACTGGCTGCGTGATTGCTTGGAACTCTAACAAGCTGGACTCTAAGGTGTCTGGCAACTACAACTACCTGTACCGTCTGTTCCGTAAGTCTAACCTGAAGCCTTTCGAGCGTGACATTTCTACTGAGATTTACCAGGCTGGCAACAAGCCTTGCAACGGCGTGGCTGGCTTCAACTGCTACTTCCCTCTGCGTTCTTACTCTTTCCGTCCTACTTACGGCGTGGGCCACCAGCCTTACCGTGTGGTGGTGCTGTCTTTCGAGCTGCTGCACGCTCCTGCTACTGTGTGCGGCCCTAAGAAGTCTACTAACCTGGTGAAGAACAAGTGCGTGAACTTCAACTTCAACGGCCTGAAGGGCACTGGCGTGCTGACTGAGTCTAACAAGAAGTTCCTGCCTTTCCAGCAGTTCGGCCGTGACATTGCTGACACTACTGACGCTGTGCGTGACCCTCAGACTCTGGAGATTCTGGACATTACTCCTTGCTCTTTCGGCGGCGTGTCTGTGATTACTCCTGGCACTAACACTTCTAACCAGGTGGCTGTGCTGTACCAGGGCGTGAACTGCACTGAGGTGCCTGTGGCTATTCACGCTGACCAGCTGACTCCTACTTGGCGTGTGTACTCTACTGGCTCTAACGTGTTCCAGACTCGTGCTGGCTGCCTGATTGGCGCTGAGTACGTGAACAACTCTTACGAGTGCGACATTCCTATTGGCGCTGGCATTTGCGCTTCTTACCAGACTCAGACTAAGTCTCACCGTCGTGCTCGTTCTGTGGCTTCTCAGTCTATTATTGCTTACACTATGTCTCTGGGCGCTGAGAACTCTGTGGCTTACTCTAACAACTCTATTGCTATTCCTACTAACTTCACTATTTCTGTGACTACTGAGATTCTGCCTGTGTCTATGACTAAGACTTCTGTGGACTGCACTATGTACATTTGCGGCGACTCTACTGAGTGCTCTAACCTGCTGCTGCAGTACGGCTCTTTCTGCACTCAGCTGAAGCGTGCTCTGACTGGCATTGCTGTGGAGCAGGACAAGAACACTCAGGAGGTGTTCGCTCAGGTGAAGCAGATTTACAAGACTCCTCCTATTAAGTACTTCGGCGGCTTCAACTTCTCTCAGATTCTGCCTGACCCTTCTAAGCCTTCTAAGCGTTCTTTCATTGAGGACCTGCTGTTCAACAAGGTGACTCTGGCTGACGCTGGCTTCATTAAGCAGTACGGCGACTGCCTGGGCGACATTGCTGCTCGTGACCTGATTTGCGCTCAGAAGTTCAAGGGCCTGACTGTGCTGCCTCCTCTGCTGACTGACGAGATGATTGCTCAGTACACTTCTGCTCTGCTGGCTGGCACTATTACTTCTGGCTGGACTTTCGGCGCTGGCGCTGCTCTGCAGATTCCTTTCGCTATGCAGATGGCTTACCGTTTCAACGGCATTGGCGTGACTCAGAACGTGCTGTACGAGAACCAGAAGCTGATTGCTAACCAGTTCAACTCTGCTATTGGCAAGATTCAGGACTCTCTGTCTTCTACTGCTTCTGCTCTGGGCAAGCTGCAGGACGTGGTGAACCACAACGCTCAGGCTCTGAACACTCTGGTGAAGCAGCTGTCTTCTAAGTTCGGCGCTATTTCTTCTGTGCTGAACGACATTTTCTCTCGTCTGGACAAGGTGGAGGCTGAGGTGCAGATTGACCGTCTGATTACTGGCCGTCTGCAGTCTCTGCAGACTTACGTGACTCAGCAGCTGATTCGTGCTGCTGAGATTCGTGCTTCTGCTAACCTGGCTGCTACTAAGATGTCTGAGTGCGTGCTGGGCCAGTCTAAGCGTGTGGACTTCTGCGGCAAGGGCTACCACCTGATGTCTTTCCCTCAGTCTGCTCCTCACGGCGTGGTGTTCCTGCACGTGACTTACGTGCCTGCTCAGGAGAAGAACTTCACTACTGCTCCTGCTATTTGCCACGACGGCAAGGCTCACTTCCCTCGTGAGGGCGTGTTCGTGTCTAACGGCACTCACTGGTTCGTGACTCAGCGTAACTTCTACGAGCCTCAGATTATTACTACTGACAACACTTTCGTGTCTGGCAACTGCGACGTGGTGATTGGCATTGTGAACAACACTGTGTACGACCCTCTGCAGCCTGAGCTGGACTCTTTCAAGGAGGAGCTGGACAAGTACTTCAAGAACCACACTTCTCCTGACGTGGACCTGGGCGACATTTCTGGCATTAACGCTTCTGTGGTGAACATTCAGAAGGAGATTGACCGTCTGAACGAGGTGGCTAAGAACCTGAACGAGTCTCTGATTGACCTGCAGGAGCTGGGCAAGTACGAGCAGTACATTAAGTGGCCTTGGTACATTTGGCTGGGCTTCATTGCTGGCCTGATTGCTATTGTGATGGTGACTATTATGCTGTGCTGCATGACTTCTTGCTGCTCTTGCCTGAAGGGCTGCTGCTCTTGCGGCTCTTGCTGCAAGTTCGACGAGGACGACTCTGAGCCTGTGCTGAAGGGCGTGAAGCTGCACTACACTTGA(SEQ ID NO:3)
AACCTGACTACTCGTACTCAGCTGCCTCCTGCTTACACTAACTCTTTCACTCGTGGCGTGTACTACCCTGACAAGGTGTTCCGTTCTTCTGTGCTGCACTCTACTCAGGACCTGTTCCTGCCTTTCTTCTCTAACGTGACTTGGTTCCACGTGATTTCTGGCACTAACGGCACTAAGCGTTTCGACAACCCTGTGCTGCCTTTCAACGACGGCGTGTACTTCGCTTCTATTGAGAAGTCTAACATTATTCGTGGCTGGATTTTCGGCACTACTCTGGACTCTAAGACTCAGTCTCTGCTGATTGTGAACAACGCTACTAACGTGGTGATTAAGGTGTGCGAGTTCCAGTTCTGCAACGACCCTTTCCTGGACCACAAGAACAACAAGTCTTGGATGGAGTCTGAGTTCCGTGTGTACTCTTCTGCTAACAACTGCACTTTCGAGTACGTGTCTCAGCCTTTCCTGATGGACCTGGAGGGCAAGCAGGGCAACTTCAAGAACCTGCGTGAGTTCGTGTTCAAGAACATTGACGGCTACTTCAAGATTTACTCTAAGCACACTCCTATTATTGTGCGTGAGCCTGAGGACCTGCCTCAGGGCTTCTCTGCTCTGGAGCCTCTGGTGGACCTGCCTATTGGCATTAACATTACTCGTTTCCAGACTCTGCTGGCTCTGCACCGTTCTTACCTGACTCCTGGCGACTCTTCTTCTGGCTGGACTGCTGGCGCTGCTGCTTACTACGTGGGCTACCTGCAGCCTCGTACTTTCCTGCTGAAGTACAACGAGAACGGCACTATTACTGACGCTGTGGACTGCGCTCTGGACCCTCTGTCTGAGACTAAGTGCACTCTGAAGTCTTTCACTGTGGAGAAGGGCATTTACCAGACTTCTAACTTCCGTGTGCAGCCTACTGAGTCTATTGTGCGTTTCCCTAACATTACTAACCTGTGCCCTTTCGACGAGGTGTTCAACGCTACTCGTTTCGCTTCTGTGTACGCTTGGAACCGTAAGCGTATTTCTAACTGCGTGGCTGACTACTCTGTGCTGTACAACCTGGCTCCTTTCTTCACTTTCAAGTGCTACGGCGTGTCTCCTACTAAGCTGAACGACCTGTGCTTCACTAACGTGTACGCTGACTCTTTCGTGATTCGTGGCGACGAGGTGCGTCAGATTGCTCCTGGCCAGACTGGCAACATTGCTGACTACAACTACAAGCTGCCTGACGACTTCACTGGCTGCGTGATTGCTTGGAACTCTAACAAGCTGGACTCTAAGGTGTCTGGCAACTACAACTACCTGTACCGTCTGTTCCGTAAGTCTAACCTGAAGCCTTTCGAGCGTGACATTTCTACTGAGATTTACCAGGCTGGCAACAAGCCTTGCAACGGCGTGGCTGGCTTCAACTGCTACTTCCCTCTGCGTTCTTACTCTTTCCGTCCTACTTACGGCGTGGGCCACCAGCCTTACCGTGTGGTGGTGCTGTCTTTCGAGCTGCTGCACGCTCCTGCTACTGTGTGCGGCCCTAAGAAGTCTACTAACCTGGTGAAGAACAAGTGCGTGAACTTCAACTTCAACGGCCTGAAGGGCACTGGCGTGCTGACTGAGTCTAACAAGAAGTTCCTGCCTTTCCAGCAGTTCGGCCGTGACATTGCTGACACTACTGACGCTGTGCGTGACCCTCAGACTCTGGAGATTCTGGACATTACTCCTTGCTCTTTCGGCGGCGTGTCTGTGATTACTCCTGGCACTAACACTTCTAACCAGGTGGCTGTGCTGTACCAGGGCGTGAACTGCACTGAGGTGCCTGTGGCTATTCACGCTGACCAGCTGACTCCTACTTGGCGTGTGTACTCTACTGGCTCTAACGTGTTCCAGACTCGTGCTGGCTGCCTGATTGGCGCTGAGTACGTGAACAACTCTTACGAGTGCGACATTCCTATTGGCGCTGGCATTTGCGCTTCTTACCAGACTCAGACTAAGTCTCACCGTCGTGCTCGTTCTGTGGCTTCTCAGTCTATTATTGCTTACACTATGTCTCTGGGCGCTGAGAACTCTGTGGCTTACTCTAACAACTCTATTGCTATTCCTACTAACTTCACTATTTCTGTGACTACTGAGATTCTGCCTGTGTCTATGACTAAGACTTCTGTGGACTGCACTATGTACATTTGCGGCGACTCTACTGAGTGCTCTAACCTGCTGCTGCAGTACGGCTCTTTCTGCACTCAGCTGAAGCGTGCTCTGACTGGCATTGCTGTGGAGCAGGACAAGAACACTCAGGAGGTGTTCGCTCAGGTGAAGCAGATTTACAAGACTCCTCCTATTAAGTACTTCGGCGGCTTCAACTTCTCTCAGATTCTGCCTGACCCTTCTAAGCCTTCTAAGCGTTCTTTCATTGAGGACCTGCTGTTCAACAAGGTGACTCTGGCTGACGCTGGCTTCATTAAGCAGTACGGCGACTGCCTGGGCGACATTGCTGCTCGTGACCTGATTTGCGCTCAGAAGTTCAAGGGCCTGACTGTGCTGCCTCCTCTGCTGACTGACGAGATGATTGCTCAGTACACTTCTGCTCTGCTGGCTGGCACTATTACTTCTGGCTGGACTTTCGGCGCTGGCGCTGCTCTGCAGATTCCTTTCGCTATGCAGATGGCTTACCGTTTCAACGGCATTGGCGTGACTCAGAACGTGCTGTACGAGAACCAGAAGCTGATTGCTAACCAGTTCAACTCTGCTATTGGCAAGATTCAGGACTCTCTGTCTTCTACTGCTTCTGCTCTGGGCAAGCTGCAGGACGTGGTGAACCACAACGCTCAGGCTCTGAACACTCTGGTGAAGCAGCTGTCTTCTAAGTTCGGCGCTATTTCTTCTGTGCTGAACGACATTTTCTCTCGTCTGGACAAGGTGGAGGCTGAGGTGCAGATTGACCGTCTGATTACTGGCCGTCTGCAGTCTCTGCAGACTTACGTGACTCAGCAGCTGATTCGTGCTGCTGAGATTCGTGCTTCTGCTAACCTGGCTGCTACTAAGATGTCTGAGTGCGTGCTGGGCCAGTCTAAGCGTGTGGACTTCTGCGGCAAGGGCTACCACCTGATGTCTTTCCCTCAGTCTGCTCCTCACGGCGTGGTGTTCCTGCACGTGACTTACGTGCCTGCTCAGGAGAAGAACTTCACTACTGCTCCTGCTATTTGCCACGACGGCAAGGCTCACTTCCCTCGTGAGGGCGTGTTCGTGTCTAACGGCACTCACTGGTTCGTGACTCAGCGTAACTTCTACGAGCCTCAGATTATTACTACTGACAACACTTTCGTGTCTGGCAACTGCGACGTGGTGATTGGCATTGTGAACAACACTGTGTACGACCCTCTGCAGCCTGAGCTGGACTCTTTCAAGGAGGAGCTGGACAAGTACTTCAAGAACCACACTTCTCCTGACGTGGACCTGGGCGACATTTCTGGCATTAACGCTTCTGTGGTGAACATTCAGAAGGAGATTGACCGTCTGAACGAGGTGGCTAAGAACCTGAACGAGTCTCTGATTGACCTGCAGGAGCTGGGCAAGTACGAGCAGTACATTAAGTGGCCTTGGTACATTTGGCTGGGCTTCATTGCTGGCCTGATTGCTATTGTGATGGTGACTATTATGCTGTGCTGCATGACTTCTTGCTGCTCTTGCCTGAAGGGCTGCTGCTCTTGCGGCTCTTGCTGCAAGTTCGACGAGGACGACTCTGAGCCTGTGCTGAAGGGCGTGAAGCTGCACTACACTTGA(SEQ ID NO:4)
AACCTGACGACGCGGACGCAGCTGCCCCCCGCCTACACGAACAGCTTCACGCGGGGCGTGTACTACCCCGACAAGGTGTTCCGGAGCAGCGTGCTGCACAGCACGCAGGACCTGTTCCTGCCCTTCTTCAGCAACGTGACGTGGTTCCACGTGATCAGCGGCACGAACGGCACGAAGCGGTTCGACAACCCCGTGCTGCCCTTCAACGACGGCGTGTACTTCGCCAGCATCGAGAAGAGCAACATCATCCGGGGCTGGATCTTCGGCACGACGCTGGACAGCAAGACGCAGAGCCTGCTGATCGTGAACAACGCCACGAACGTGGTGATCAAGGTGTGCGAGTTCCAGTTCTGCAACGACCCCTTCCTGGACCACAAGAACAACAAGAGCTGGATGGAGAGCGAGTTCCGGGTGTACAGCAGCGCCAACAACTGCACGTTCGAGTACGTGAGCCAGCCCTTCCTGATGGACCTGGAGGGCAAGCAGGGCAACTTCAAGAACCTGCGGGAGTTCGTGTTCAAGAACATCGACGGCTACTTCAAGATCTACAGCAAGCACACGCCCATCATCGTGCGGGAGCCCGAGGACCTGCCCCAGGGCTTCAGCGCCCTGGAGCCCCTGGTGGACCTGCCCATCGGCATCAACATCACGCGGTTCCAGACGCTGCTGGCCCTGCACCGGAGCTACCTGACGCCCGGCGACAGCAGCAGCGGCTGGACGGCCGGCGCCGCCGCCTACTACGTGGGCTACCTGCAGCCCCGGACGTTCCTGCTGAAGTACAACGAGAACGGCACGATCACGGACGCCGTGGACTGCGCCCTGGACCCCCTGAGCGAGACGAAGTGCACGCTGAAGAGCTTCACGGTGGAGAAGGGCATCTACCAGACGAGCAACTTCCGGGTGCAGCCCACGGAGAGCATCGTGCGGTTCCCCAACATCACGAACCTGTGCCCCTTCGACGAGGTGTTCAACGCCACGCGGTTCGCCAGCGTGTACGCCTGGAACCGGAAGCGGATCAGCAACTGCGTGGCCGACTACAGCGTGCTGTACAACCTGGCCCCCTTCTTCACGTTCAAGTGCTACGGCGTGAGCCCCACGAAGCTGAACGACCTGTGCTTCACGAACGTGTACGCCGACAGCTTCGTGATCCGGGGCGACGAGGTGCGGCAGATCGCCCCCGGCCAGACGGGCAACATCGCCGACTACAACTACAAGCTGCCCGACGACTTCACGGGCTGCGTGATCGCCTGGAACAGCAACAAGCTGGACAGCAAGGTGAGCGGCAACTACAACTACCTGTACCGGCTGTTCCGGAAGAGCAACCTGAAGCCCTTCGAGCGGGACATCAGCACGGAGATCTACCAGGCCGGCAACAAGCCCTGCAACGGCGTGGCCGGCTTCAACTGCTACTTCCCCCTGCGGAGCTACAGCTTCCGGCCCACGTACGGCGTGGGCCACCAGCCCTACCGGGTGGTGGTGCTGAGCTTCGAGCTGCTGCACGCCCCCGCCACGGTGTGCGGCCCCAAGAAGAGCACGAACCTGGTGAAGAACAAGTGCGTGAACTTCAACTTCAACGGCCTGAAGGGCACGGGCGTGCTGACGGAGAGCAACAAGAAGTTCCTGCCCTTCCAGCAGTTCGGCCGGGACATCGCCGACACGACGGACGCCGTGCGGGACCCCCAGACGCTGGAGATCCTGGACATCACGCCCTGCAGCTTCGGCGGCGTGAGCGTGATCACGCCCGGCACGAACACGAGCAACCAGGTGGCCGTGCTGTACCAGGGCGTGAACTGCACGGAGGTGCCCGTGGCCATCCACGCCGACCAGCTGACGCCCACGTGGCGGGTGTACAGCACGGGCAGCAACGTGTTCCAGACGCGGGCCGGCTGCCTGATCGGCGCCGAGTACGTGAACAACAGCTACGAGTGCGACATCCCCATCGGCGCCGGCATCTGCGCCAGCTACCAGACGCAGACGAAGAGCCACCGGCGGGCCCGGAGCGTGGCCAGCCAGAGCATCATCGCCTACACGATGAGCCTGGGCGCCGAGAACAGCGTGGCCTACAGCAACAACAGCATCGCCATCCCCACGAACTTCACGATCAGCGTGACGACGGAGATCCTGCCCGTGAGCATGACGAAGACGAGCGTGGACTGCACGATGTACATCTGCGGCGACAGCACGGAGTGCAGCAACCTGCTGCTGCAGTACGGCAGCTTCTGCACGCAGCTGAAGCGGGCCCTGACGGGCATCGCCGTGGAGCAGGACAAGAACACGCAGGAGGTGTTCGCCCAGGTGAAGCAGATCTACAAGACGCCCCCCATCAAGTACTTCGGCGGCTTCAACTTCAGCCAGATCCTGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCTTCATCGAGGACCTGCTGTTCAACAAGGTGACGCTGGCCGACGCCGGCTTCATCAAGCAGTACGGCGACTGCCTGGGCGACATCGCCGCCCGGGACCTGATCTGCGCCCAGAAGTTCAAGGGCCTGACGGTGCTGCCCCCCCTGCTGACGGACGAGATGATCGCCCAGTACACGAGCGCCCTGCTGGCCGGCACGATCACGAGCGGCTGGACGTTCGGCGCCGGCGCCGCCCTGCAGATCCCCTTCGCCATGCAGATGGCCTACCGGTTCAACGGCATCGGCGTGACGCAGAACGTGCTGTACGAGAACCAGAAGCTGATCGCCAACCAGTTCAACAGCGCCATCGGCAAGATCCAGGACAGCCTGAGCAGCACGGCCAGCGCCCTGGGCAAGCTGCAGGACGTGGTGAACCACAACGCCCAGGCCCTGAACACGCTGGTGAAGCAGCTGAGCAGCAAGTTCGGCGCCATCAGCAGCGTGCTGAACGACATCTTCAGCCGGCTGGACAAGGTGGAGGCCGAGGTGCAGATCGACCGGCTGATCACGGGCCGGCTGCAGAGCCTGCAGACGTACGTGACGCAGCAGCTGATCCGGGCCGCCGAGATCCGGGCCAGCGCCAACCTGGCCGCCACGAAGATGAGCGAGTGCGTGCTGGGCCAGAGCAAGCGGGTGGACTTCTGCGGCAAGGGCTACCACCTGATGAGCTTCCCCCAGAGCGCCCCCCACGGCGTGGTGTTCCTGCACGTGACGTACGTGCCCGCCCAGGAGAAGAACTTCACGACGGCCCCCGCCATCTGCCACGACGGCAAGGCCCACTTCCCCCGGGAGGGCGTGTTCGTGAGCAACGGCACGCACTGGTTCGTGACGCAGCGGAACTTCTACGAGCCCCAGATCATCACGACGGACAACACGTTCGTGAGCGGCAACTGCGACGTGGTGATCGGCATCGTGAACAACACGGTGTACGACCCCCTGCAGCCCGAGCTGGACAGCTTCAAGGAGGAGCTGGACAAGTACTTCAAGAACCACACGAGCCCCGACGTGGACCTGGGCGACATCAGCGGCATCAACGCCAGCGTGGTGAACATCCAGAAGGAGATCGACCGGCTGAACGAGGTGGCCAAGAACCTGAACGAGAGCCTGATCGACCTGCAGGAGCTGGGCAAGTACGAGCAGTACATCAAGTGGCCCTGGTACATCTGGCTGGGCTTCATCGCCGGCCTGATCGCCATCGTGATGGTGACGATCATGCTGTGCTGCATGACGAGCTGCTGCAGCTGCCTGAAGGGCTGCTGCAGCTGCGGCAGCTGCTGCAAGTTCGACGAGGACGACAGCGAGCCCGTGCTGAAGGGCGTGAAGCTGCACTACACGTGA(SEQ ID NO:5)
In another embodiment, the invention provides an mRNA molecule comprising an encoded coronavirus omacron (b.1.1.529) S glycoprotein. The DNA coding gene is transcribed by using a template to obtain an mRNA sequence for coding the novel coronary variant omacron (B.1.1.529) S glycoprotein, and the sequence is shown as SEQ ID NO. 6-SEQ ID NO. 8. The optimized DNA coding gene is used as a template for transcription, the nucleotide sequence enables the transcribed mRNA structure to be more stable, the translation efficiency of target proteins in mammals and human bodies to be higher, the technical problems that the translation efficiency of mRNA is poor, the stability is to be improved and the like in the prior art can be solved, and the immune response of organisms can be efficiently induced, so that the problems of low antibody titer, small effective antigen quantity and high production cost of novel crown variant omicron immune in the prior art can be improved. AACCUAACCACCCGCACCCAACUACCGCCGGCGUACACCAACUCGUUCACCCGCGGGGUCUACUACCCGGACAAGGUCUUCCGCUCGUCGGUCCUACACUCGACCCAAGACCUAUUCCUACCGUUCUUCUCGAACGUCACCUGGUUCCACGUCAUAUCGGGGACCAACGGGACCAAGCGCUUCGACAACCCGGUCCUACCGUUCAACGACGGGGUCUACUUCGCGUCGAUAGAGAAGUCGAACAUAAUACGCGGCUGGAUUUUCGGCACUACUCUGGACUCUAAGACUCAGUCUCUGCUGAUUGUGAACAACGCUACUAACGUGGUGAUUAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCUUUCCUGGACCACAAGAACAACAAGUCUUGGAUGGAGUCUGAGUUCCGUGUGUACUCUUCUGCUAACAACUGCACUUUCGAGUACGUGUCUCAGCCUUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGUGAGUUCGUGUUCAAGAACAUUGACGGCUACUUCAAGAUUUACUCUAAGCACACUCCUAUUAUUGUGCGUGAGCCUGAGGACCUGCCUCAGGGCUUCUCUGCUCUGGAGCCUCUGGUGGACCUGCCUAUUGGCAUUAACAUUACUCGUUUCCAGACUCUGCUGGCUCUGCACCGUUCUUACCUGACUCCUGGCGACUCUUCUUCUGGCUGGACUGCUGGCGCUGCUGCUUACUACGUGGGCUACCUGCAGCCUCGUACUUUCCUGCUGAAGUACAACGAGAACGGCACUAUUACUGACGCUGUGGACUGCGCUCUGGACCCUCUGUCUGAGACUAAGUGCACUCUGAAGUCUUUCACUGUGGAGAAGGGCAUUUACCAGACUUCUAACUUCCGUGUGCAGCCUACUGAGUCUAUUGUGCGUUUCCCUAACAUUACUAACCUGUGCCCUUUCGACGAGGUGUUCAACGCUACUCGUUUCGCUUCUGUGUACGCUUGGAACCGUAAGCGUAUUUCUAACUGCGUGGCUGACUACUCUGUGCUGUACAACCUGGCUCCUUUCUUCACUUUCAAGUGCUACGGCGUGUCUCCUACUAAGCUGAACGACCUGUGCUUCACUAACGUGUACGCUGACUCUUUCGUGAUUCGUGGCGACGAGGUGCGUCAGAUUGCUCCUGGCCAGACUGGCAACAUUGCUGACUACAACUACAAGCUGCCUGACGACUUCACUGGCUGCGUGAUUGCUUGGAACUCUAACAAGCUGGACUCUAAGGUGUCUGGCAACUACAACUACCUGUACCGUCUGUUCCGUAAGUCUAACCUGAAGCCUUUCGAGCGUGACAUUUCUACUGAGAUUUACCAGGCUGGCAACAAGCCUUGCAACGGCGUGGCUGGCUUCAACUGCUACUUCCCUCUGCGUUCUUACUCUUUCCGUCCUACUUACGGCGUGGGCCACCAGCCUUACCGUGUGGUGGUGCUGUCUUUCGAGCUGCUGCACGCUCCUGCUACUGUGUGCGGCCCUAAGAAGUCUACUAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACUGGCGUGCUGACUGAGUCUAACAAGAAGUUCCUGCCUUUCCAGCAGUUCGGCCGUGACAUUGCUGACACUACUGACGCUGUGCGUGACCCUCAGACUCUGGAGAUUCUGGACAUUACUCCUUGCUCUUUCGGCGGCGUGUCUGUGAUUACUCCUGGCACUAACACUUCUAACCAGGUGGCUGUGCUGUACCAGGGCGUGAACUGCACUGAGGUGCCUGUGGCUAUUCACGCUGACCAGCUGACUCCUACUUGGCGUGUGUACUCUACUGGCUCUAACGUGUUCCAGACUCGUGCUGGCUGCCUGAUUGGCGCUGAGUACGUGAACAACUCUUACGAGUGCGACAUUCCUAUUGGCGCUGGCAUUUGCGCUUCUUACCAGACUCAGACUAAGUCUCACCGUCGUGCUCGUUCUGUGGCUUCUCAGUCUAUUAUUGCUUACACUAUGUCUCUGGGCGCUGAGAACUCUGUGGCUUACUCUAACAACUCUAUUGCUAUUCCUACUAACUUCACUAUUUCUGUGACUACUGAGAUUCUGCCUGUGUCUAUGACUAAGACUUCUGUGGACUGCACUAUGUACAUUUGCGGCGACUCUACUGAGUGCUCUAACCUGCUGCUGCAGUACGGCUCUUUCUGCACUCAGCUGAAGCGUGCUCUGACUGGCAUUGCUGUGGAGCAGGACAAGAACACUCAGGAGGUGUUCGCUCAGGUGAAGCAGAUUUACAAGACUCCUCCUAUUAAGUACUUCGGCGGCUUCAACUUCUCUCAGAUUCUGCCUGACCCUUCUAAGCCUUCUAAGCGUUCUUUCAUUGAGGACCUGCUGUUCAACAAGGUGACUCUGGCUGACGCUGGCUUCAUUAAGCAGUACGGCGACUGCCUGGGCGACAUUGCUGCUCGUGACCUGAUUUGCGCUCAGAAGUUCAAGGGCCUGACUGUGCUGCCUCCUCUGCUGACUGACGAGAUGAUUGCUCAGUACACUUCUGCUCUGCUGGCUGGCACUAUUACUUCUGGCUGGACUUUCGGCGCUGGCGCUGCUCUGCAGAUUCCUUUCGCUAUGCAGAUGGCUUACCGUUUCAACGGCAUUGGCGUGACUCAGAACGUGCUGUACGAGAACCAGAAGCUGAUUGCUAACCAGUUCAACUCUGCUAUUGGCAAGAUUCAGGACUCUCUGUCUUCUACUGCUUCUGCUCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCUCAGGCUCUGAACACUCUGGUGAAGCAGCUGUCUUCUAAGUUCGGCGCUAUUUCUUCUGUGCUGAACGACAUUUUCUCUCGUCUGGACAAGGUGGAGGCUGAGGUGCAGAUUGACCGUCUGAUUACUGGCCGUCUGCAGUCUCUGCAGACUUACGUGACUCAGCAGCUGAUUCGUGCUGCUGAGAUUCGUGCUUCUGCUAACCUGGCUGCUACUAAGAUGUCUGAGUGCGUGCUGGGCCAGUCUAAGCGUGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGUCUUUCCCUCAGUCUGCUCCUCACGGCGUGGUGUUCCUGCACGUGACUUACGUGCCUGCUCAGGAGAAGAACUUCACUACUGCUCCUGCUAUUUGCCACGACGGCAAGGCUCACUUCCCUCGUGAGGGCGUGUUCGUGUCUAACGGCACUCACUGGUUCGUGACUCAGCGUAACUUCUACGAGCCUCAGAUUAUUACUACUGACAACACUUUCGUGUCUGGCAACUGCGACGUGGUGAUUGGCAUUGUGAACAACACUGUGUACGACCCUCUGCAGCCUGAGCUGGACUCUUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACUUCUCCUGACGUGGACCUGGGCGACAUUUCUGGCAUUAACGCUUCUGUGGUGAACAUUCAGAAGGAGAUUGACCGUCUGAACGAGGUGGCUAAGAACCUGAACGAGUCUCUGAUUGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUUAAGUGGCCUUGGUACAUUUGGCUGGGCUUCAUUGCUGGCCUGAUUGCUAUUGUGAUGGUGACUAUUAUGCUGUGCUGCAUGACUUCUUGCUGCUCUUGCCUGAAGGGCUGCUGCUCUUGCGGCUCUUGCUGCAAGUUCGACGAGGACGACUCUGAGCCUGUGCUGAAGGGCGUGAAGCUGCACUACACUUGA (SEQ ID NO: 6)
AACCUGACUACUCGUACUCAGCUGCCUCCUGCUUACACUAACUCUUUCACUCGUGGCGUGUACUACCCUGACAAGGUGUUCCGUUCUUCUGUGCUGCACUCUACUCAGGACCUGUUCCUGCCUUUCUUCUCUAACGUGACUUGGUUCCACGUGAUUUCUGGCACUAACGGCACUAAGCGUUUCGACAACCCUGUGCUGCCUUUCAACGACGGCGUGUACUUCGCUUCUAUUGAGAAGUCUAACAUUAUUCGUGGCUGGAUUUUCGGCACUACUCUGGACUCUAAGACUCAGUCUCUGCUGAUUGUGAACAACGCUACUAACGUGGUGAUUAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCUUUCCUGGACCACAAGAACAACAAGUCUUGGAUGGAGUCUGAGUUCCGUGUGUACUCUUCUGCUAACAACUGCACUUUCGAGUACGUGUCUCAGCCUUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGUGAGUUCGUGUUCAAGAACAUUGACGGCUACUUCAAGAUUUACUCUAAGCACACUCCUAUUAUUGUGCGUGAGCCUGAGGACCUGCCUCAGGGCUUCUCUGCUCUGGAGCCUCUGGUGGACCUGCCUAUUGGCAUUAACAUUACUCGUUUCCAGACUCUGCUGGCUCUGCACCGUUCUUACCUGACUCCUGGCGACUCUUCUUCUGGCUGGACUGCUGGCGCUGCUGCUUACUACGUGGGCUACCUGCAGCCUCGUACUUUCCUGCUGAAGUACAACGAGAACGGCACUAUUACUGACGCUGUGGACUGCGCUCUGGACCCUCUGUCUGAGACUAAGUGCACUCUGAAGUCUUUCACUGUGGAGAAGGGCAUUUACCAGACUUCUAACUUCCGUGUGCAGCCUACUGAGUCUAUUGUGCGUUUCCCUAACAUUACUAACCUGUGCCCUUUCGACGAGGUGUUCAACGCUACUCGUUUCGCUUCUGUGUACGCUUGGAACCGUAAGCGUAUUUCUAACUGCGUGGCUGACUACUCUGUGCUGUACAACCUGGCUCCUUUCUUCACUUUCAAGUGCUACGGCGUGUCUCCUACUAAGCUGAACGACCUGUGCUUCACUAACGUGUACGCUGACUCUUUCGUGAUUCGUGGCGACGAGGUGCGUCAGAUUGCUCCUGGCCAGACUGGCAACAUUGCUGACUACAACUACAAGCUGCCUGACGACUUCACUGGCUGCGUGAUUGCUUGGAACUCUAACAAGCUGGACUCUAAGGUGUCUGGCAACUACAACUACCUGUACCGUCUGUUCCGUAAGUCUAACCUGAAGCCUUUCGAGCGUGACAUUUCUACUGAGAUUUACCAGGCUGGCAACAAGCCUUGCAACGGCGUGGCUGGCUUCAACUGCUACUUCCCUCUGCGUUCUUACUCUUUCCGUCCUACUUACGGCGUGGGCCACCAGCCUUACCGUGUGGUGGUGCUGUCUUUCGAGCUGCUGCACGCUCCUGCUACUGUGUGCGGCCCUAAGAAGUCUACUAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACUGGCGUGCUGACUGAGUCUAACAAGAAGUUCCUGCCUUUCCAGCAGUUCGGCCGUGACAUUGCUGACACUACUGACGCUGUGCGUGACCCUCAGACUCUGGAGAUUCUGGACAUUACUCCUUGCUCUUUCGGCGGCGUGUCUGUGAUUACUCCUGGCACUAACACUUCUAACCAGGUGGCUGUGCUGUACCAGGGCGUGAACUGCACUGAGGUGCCUGUGGCUAUUCACGCUGACCAGCUGACUCCUACUUGGCGUGUGUACUCUACUGGCUCUAACGUGUUCCAGACUCGUGCUGGCUGCCUGAUUGGCGCUGAGUACGUGAACAACUCUUACGAGUGCGACAUUCCUAUUGGCGCUGGCAUUUGCGCUUCUUACCAGACUCAGACUAAGUCUCACCGUCGUGCUCGUUCUGUGGCUUCUCAGUCUAUUAUUGCUUACACUAUGUCUCUGGGCGCUGAGAACUCUGUGGCUUACUCUAACAACUCUAUUGCUAUUCCUACUAACUUCACUAUUUCUGUGACUACUGAGAUUCUGCCUGUGUCUAUGACUAAGACUUCUGUGGACUGCACUAUGUACAUUUGCGGCGACUCUACUGAGUGCUCUAACCUGCUGCUGCAGUACGGCUCUUUCUGCACUCAGCUGAAGCGUGCUCUGACUGGCAUUGCUGUGGAGCAGGACAAGAACACUCAGGAGGUGUUCGCUCAGGUGAAGCAGAUUUACAAGACUCCUCCUAUUAAGUACUUCGGCGGCUUCAACUUCUCUCAGAUUCUGCCUGACCCUUCUAAGCCUUCUAAGCGUUCUUUCAUUGAGGACCUGCUGUUCAACAAGGUGACUCUGGCUGACGCUGGCUUCAUUAAGCAGUACGGCGACUGCCUGGGCGACAUUGCUGCUCGUGACCUGAUUUGCGCUCAGAAGUUCAAGGGCCUGACUGUGCUGCCUCCUCUGCUGACUGACGAGAUGAUUGCUCAGUACACUUCUGCUCUGCUGGCUGGCACUAUUACUUCUGGCUGGACUUUCGGCGCUGGCGCUGCUCUGCAGAUUCCUUUCGCUAUGCAGAUGGCUUACCGUUUCAACGGCAUUGGCGUGACUCAGAACGUGCUGUACGAGAACCAGAAGCUGAUUGCUAACCAGUUCAACUCUGCUAUUGGCAAGAUUCAGGACUCUCUGUCUUCUACUGCUUCUGCUCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCUCAGGCUCUGAACACUCUGGUGAAGCAGCUGUCUUCUAAGUUCGGCGCUAUUUCUUCUGUGCUGAACGACAUUUUCUCUCGUCUGGACAAGGUGGAGGCUGAGGUGCAGAUUGACCGUCUGAUUACUGGCCGUCUGCAGUCUCUGCAGACUUACGUGACUCAGCAGCUGAUUCGUGCUGCUGAGAUUCGUGCUUCUGCUAACCUGGCUGCUACUAAGAUGUCUGAGUGCGUGCUGGGCCAGUCUAAGCGUGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGUCUUUCCCUCAGUCUGCUCCUCACGGCGUGGUGUUCCUGCACGUGACUUACGUGCCUGCUCAGGAGAAGAACUUCACUACUGCUCCUGCUAUUUGCCACGACGGCAAGGCUCACUUCCCUCGUGAGGGCGUGUUCGUGUCUAACGGCACUCACUGGUUCGUGACUCAGCGUAACUUCUACGAGCCUCAGAUUAUUACUACUGACAACACUUUCGUGUCUGGCAACUGCGACGUGGUGAUUGGCAUUGUGAACAACACUGUGUACGACCCUCUGCAGCCUGAGCUGGACUCUUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACUUCUCCUGACGUGGACCUGGGCGACAUUUCUGGCAUUAACGCUUCUGUGGUGAACAUUCAGAAGGAGAUUGACCGUCUGAACGAGGUGGCUAAGAACCUGAACGAGUCUCUGAUUGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUUAAGUGGCCUUGGUACAUUUGGCUGGGCUUCAUUGCUGGCCUGAUUGCUAUUGUGAUGGUGACUAUUAUGCUGUGCUGCAUGACUUCUUGCUGCUCUUGCCUGAAGGGCUGCUGCUCUUGCGGCUCUUGCUGCAAGUUCGACGAGGACGACUCUGAGCCUGUGCUGAAGGGCGUGAAGCUGCACUACACUUGA(SEQ ID NO:7)
AACCUGACGACGCGGACGCAGCUGCCCCCCGCCUACACGAACAGCUUCACGCGGGGCGUGUACUACCCCGACAAGGUGUUCCGGAGCAGCGUGCUGCACAGCACGCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACGUGGUUCCACGUGAUCAGCGGCACGAACGGCACGAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCAUCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACGACGCUGGACAGCAAGACGCAGAGCCUGCUGAUCGUGAACAACGCCACGAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACGUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACGCCCAUCAUCGUGCGGGAGCCCGAGGACCUGCCCCAGGGCUUCAGCGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACGCGGUUCCAGACGCUGCUGGCCCUGCACCGGAGCUACCUGACGCCCGGCGACAGCAGCAGCGGCUGGACGGCCGGCGCCGCCGCCUACUACGUGGGCUACCUGCAGCCCCGGACGUUCCUGCUGAAGUACAACGAGAACGGCACGAUCACGGACGCCGUGGACUGCGCCCUGGACCCCCUGAGCGAGACGAAGUGCACGCUGAAGAGCUUCACGGUGGAGAAGGGCAUCUACCAGACGAGCAACUUCCGGGUGCAGCCCACGGAGAGCAUCGUGCGGUUCCCCAACAUCACGAACCUGUGCCCCUUCGACGAGGUGUUCAACGCCACGCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACCUGGCCCCCUUCUUCACGUUCAAGUGCUACGGCGUGAGCCCCACGAAGCUGAACGACCUGUGCUUCACGAACGUGUACGCCGACAGCUUCGUGAUCCGGGGCGACGAGGUGCGGCAGAUCGCCCCCGGCCAGACGGGCAACAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACGGGCUGCGUGAUCGCCUGGAACAGCAACAAGCUGGACAGCAAGGUGAGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACGGAGAUCUACCAGGCCGGCAACAAGCCCUGCAACGGCGUGGCCGGCUUCAACUGCUACUUCCCCCUGCGGAGCUACAGCUUCCGGCCCACGUACGGCGUGGGCCACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCCGCCACGGUGUGCGGCCCCAAGAAGAGCACGAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACGGGCGUGCUGACGGAGAGCAACAAGAAGUUCCUGCCCUUCCAGCAGUUCGGCCGGGACAUCGCCGACACGACGGACGCCGUGCGGGACCCCCAGACGCUGGAGAUCCUGGACAUCACGCCCUGCAGCUUCGGCGGCGUGAGCGUGAUCACGCCCGGCACGAACACGAGCAACCAGGUGGCCGUGCUGUACCAGGGCGUGAACUGCACGGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACGCCCACGUGGCGGGUGUACAGCACGGGCAGCAACGUGUUCCAGACGCGGGCCGGCUGCCUGAUCGGCGCCGAGUACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGCGCCAGCUACCAGACGCAGACGAAGAGCCACCGGCGGGCCCGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACGAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACGAACUUCACGAUCAGCGUGACGACGGAGAUCCUGCCCGUGAGCAUGACGAAGACGAGCGUGGACUGCACGAUGUACAUCUGCGGCGACAGCACGGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACGCAGCUGAAGCGGGCCCUGACGGGCAUCGCCGUGGAGCAGGACAAGAACACGCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACGCCCCCCAUCAAGUACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACGCUGGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUGGGCGACAUCGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAAGGGCCUGACGGUGCUGCCCCCCCUGCUGACGGACGAGAUGAUCGCCCAGUACACGAGCGCCCUGCUGGCCGGCACGAUCACGAGCGGCUGGACGUUCGGCGCCGGCGCCGCCCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACGCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACGGCCAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCCCAGGCCCUGAACACGCUGGUGAAGCAGCUGAGCAGCAAGUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCUUCAGCCGGCUGGACAAGGUGGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACGGGCCGGCUGCAGAGCCUGCAGACGUACGUGACGCAGCAGCUGAUCCGGGCCGCCGAGAUCCGGGCCAGCGCCAACCUGGCCGCCACGAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUCCCCCAGAGCGCCCCCCACGGCGUGGUGUUCCUGCACGUGACGUACGUGCCCGCCCAGGAGAAGAACUUCACGACGGCCCCCGCCAUCUGCCACGACGGCAAGGCCCACUUCCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACGCACUGGUUCGUGACGCAGCGGAACUUCUACGAGCCCCAGAUCAUCACGACGGACAACACGUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACGGUGUACGACCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACGAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGACCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACGAUCAUGCUGUGCUGCAUGACGAGCUGCUGCAGCUGCCUGAAGGGCUGCUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACGUGA(SEQ ID NO:8)
In another embodiment, the present disclosure provides an mRNA nucleic acid molecule comprising:
(i) A 5 'untranslated region (5' -UTR);
(ii) A CDS, wherein the CDS comprises an Open Reading Frame (ORF) encoding a coronavirus omicron (b.1.1.529) antigen S glycoprotein capable of inducing an immune response comprising a nucleotide sequence as set forth in SEQ ID No. 6-SEQ ID No. 8;
(iii) 3 '-untranslated region (3' -UTR).
In another embodiment, the mRNA nucleic acid molecules of the invention comprise a 5 'untranslated region (5' -UTR). Optionally, wherein the 5' UTR comprises the sequence of SEQ ID NO. 9, SEQ ID NO. 10 or SEQ ID NO. 11. Specifically, the 5' -UTR sequence may or may not comprise a Kozak sequence. In another preferred embodiment, wherein the 5' UTR may comprise a kozak sequence, such as GCCACC or GCCANN, etc., N refers to A, T/U, C or G.
AGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC(SEQ ID NO:9)
AGGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC(SEQ ID NO:10)
AGGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGAGCCACC(SEQ ID NO:11)
In another embodiment, the nucleic acid molecules of the invention further comprise a 3 'untranslated region (3' -UTR). Optionally, wherein the 3' UTR comprises the sequence of SEQ ID NO. 12, SEQ ID NO. 13 or SEQ ID NO. 14.
UAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC(SEQ ID NO:12)
UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC(SEQ ID NO:13)
UGAUAAUAGGCUGGAGCCUCGGUGGCCAUGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC(SEQ ID NO:14)
In another embodiment, the mRNA nucleic acid molecules of the present invention further comprise a native 5' -cap structure or analog thereof produced by an endogenous process. Modification of the 5' -cap can increase the stability of the nucleic acid molecule, increase its half-life and can increase translation efficiency. Modifications to the native 5' -cap structure include 2' -O-methylation at the 5' -end of the polynucleotide and/or at the ribose 2' -hydroxy group of the 5' -end nucleic acid. The 5' -cap analogue may optionally be selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G; m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc. Cap analogs can be chemically (i.e., non-enzymatically) or enzymatically synthesized and/or attached to the 5' end of an RNA molecule. The mRNA molecules produced after transcription can be modified by vaccinia virus capping enzymes to produce a 7-methylguanosine (m 7G) Cap bridged by triphosphate (m 7G (5 ' ') PPP (5 ') G), a structure known as Cap 0. Cap 1 structures (m 7 GpppmN) and Cap 2 structures (m 7 GpppmNmN) can be produced using vaccinia virus capping enzyme and 2' -O methyltransferase. The 5' -cap structures that may be used in connection with the present disclosure may be referred to those described in international patent publication nos. WO2008127688, WO2008016473 and WO2011015347, the entire contents of which are incorporated herein by reference. In a preferred embodiment, the 5' -cap selected for mRNA co-transcription capping according to the invention is similarly selected from commercially available Reagent AG- (N-7113), which is m7G (5 ') ppp (5 ') (2 ' OMeA) pG, can generate Cap 1 structure when mRNA is capped by in vitro transcription. Clearcap has capping efficiency as high as 98%. Cap 1mRNA has higher in vivo activity than Cap 0mRNA produced by conventional capping methods such as mCap or anti-reverse Cap analog (ARCA).
In another embodiment, the mRNA nucleic acid molecules of the invention further comprise a poly-A tail or polyadenylation signal, optionally having a length of 80 to 180 nucleotides. In a preferred embodiment, the 3' -polyadenylation sequence (polyA) is preferably 80-180A with a linker sequence in between, as shown in SEQ ID NO. 15; more preferably 80-160A's, still more preferably 130A's, as shown in SEQ ID NO. 16: AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAUAUGACUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO: 15)
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAUAUGACUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA(SEQ ID NO:16)
In another preferred embodiment, the CDS may further comprise a signal peptide sequence. The purpose of adding the signal peptide is to enhance the translation of the protein, improve the translation stability and promote better folding of the protein. The preferred signal peptide sequence is shown in SEQ ID NO. 17-SEQ ID NO. 19. The signal peptide is positioned in front of the mature peptide, and the sequence of the mature peptide is shown as SEQ ID NO. 1.
AUGUUCGUCUUCCUAGUCCUACUACCGCUAGUCUCGUCGCAAUGCGUC(SEQ ID NO:17)
AUGUUCGUGUUCCUGGUGCUGCUGCCUCUGGUGUCUUCUCAGUGCGUG(SEQ ID NO:18)
AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUG(SEQ ID NO:19)
In another embodiment, the mRNA nucleic acid molecules of the present invention further comprise one or more functional nucleotide analog modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m 1 ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C). Human immune response to mRNA is mainly related to uridine (partially consisting of uracil), whereas replacement of uracil with pseudouracil reduces mRNA recognition by the immune system. Methods for replacing uracil with pseudo-uracil refer to U.S. patent nos. 8278036B2, US9750824B2, US8835108B2, US8748089B2, US8691966B2, etc.; reference is made to US9428535B2 using the 1-methyl pseudouracil modification method; the effect of reducing the immunogenicity of mRNA by reducing the amount of U (uridylic acid) in the mRNA molecule is described in WO2017036889A1; the relevant technical content is incorporated herein by reference in its entirety. In some embodiments, the presently disclosed mRNA includes pseudo-uridine (ψ) substitutions at one or more or all uracil positions of the nucleic acid. In a particularly preferred embodiment, the uracil (U) sequences shown in SEQ ID NO. 6-SEQ ID NO. 25 are each replaced by a pseudouridine (ψ).
In another embodiment, the invention provides an mRNA nucleic acid molecule comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA), wherein CDS comprises an Open Reading Frame (ORF) encoding coronavirus omicron (B.1.1.529) antigen S glycoprotein capable of inducing immune response, the nucleotide sequence is shown as SEQ ID NO:6-SEQ ID NO: 8. The inventor optimizes the novel coronal variant omacron (B.1.1.529) S glycoprotein coding region (CDS), and the sequence optimization comprises adjusting GC content when expressing in human body, improving the GC content of the sequence, so that the transcribed mRNA structure is more stable, and the translation efficiency of target proteins in mammals and human bodies is higher. Preferably, the 5' -UTR comprises a Kozak sequence which may be used to enhance translation efficiency of mRNA. The addition of 5 '-cap structure, 5' -UTR, kozak, 3 '-poly A sequence (polyA) and 3' -UTR sequence elements can further improve the stability of the sequence and avoid degradation.
In another preferred embodiment, the invention provides an mRNA molecule comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA);
Wherein said 5' -cap structure is optionally selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G; m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc.;
wherein the 5' -UTR comprises a sequence shown as SEQ ID NO. 9-SEQ ID NO. 11;
wherein the CDS comprises an Open Reading Frame (ORF) which encodes a signal peptide and a coronavirus omacron (B.1.1.529) antigen S glycoprotein capable of inducing an immune response, wherein the sequence of the signal peptide is shown as SEQ ID NO. 17-SEQ ID NO. 19, and the sequence of the coronavirus omacron (B.1.1.529) antigen S glycoprotein is shown as SEQ ID NO. 6-SEQ ID NO. 8;
wherein the 3' -UTR comprises a nucleotide sequence shown as SEQ ID NO. 12-SEQ ID NO. 14;
wherein the 3' -polyadenylation sequence (polyA) optionally has a length of 80 to 180 adenylates.
In another preferred embodiment, the invention provides an mRNA molecule comprising the following elements: 5' -cap structure, 5' -UTR, CDS, 3' -UTR and 3' -polyadenylation sequence (polyA), wherein 5' -UTR comprises Kozak sequence, CDS comprises signal peptide and mature peptide, and the sequence is shown as SEQ ID NO:20-SEQ ID NO: 22.
GAGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACCAUGUUCGUCUUCCUAGUCCUACUACCGCUAGUCUCGUCGCAAUGCGUCAACCUAACCACCCGCACCCAACUACCGCCGGCGUACACCAACUCGUUCACCCGCGGGGUCUACUACCCGGACAAGGUCUUCCGCUCGUCGGUCCUACACUCGACCCAAGACCUAUUCCUACCGUUCUUCUCGAACGUCACCUGGUUCCACGUCAUAUCGGGGACCAACGGGACCAAGCGCUUCGACAACCCGGUCCUACCGUUCAACGACGGGGUCUACUUCGCGUCGAUAGAGAAGUCGAACAUAAUACGCGGCUGGAUUUUCGGCACUACUCUGGACUCUAAGACUCAGUCUCUGCUGAUUGUGAACAACGCUACUAACGUGGUGAUUAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCUUUCCUGGACCACAAGAACAACAAGUCUUGGAUGGAGUCUGAGUUCCGUGUGUACUCUUCUGCUAACAACUGCACUUUCGAGUACGUGUCUCAGCCUUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGUGAGUUCGUGUUCAAGAACAUUGACGGCUACUUCAAGAUUUACUCUAAGCACACUCCUAUUAUUGUGCGUGAGCCUGAGGACCUGCCUCAGGGCUUCUCUGCUCUGGAGCCUCUGGUGGACCUGCCUAUUGGCAUUAACAUUACUCGUUUCCAGACUCUGCUGGCUCUGCACCGUUCUUACCUGACUCCUGGCGACUCUUCUUCUGGCUGGACUGCUGGCGCUGCUGCUUACUACGUGGGCUACCUGCAGCCUCGUACUUUCCUGCUGAAGUACAACGAGAACGGCACUAUUACUGACGCUGUGGACUGCGCUCUGGACCCUCUGUCUGAGACUAAGUGCACUCUGAAGUCUUUCACUGUGGAGAAGGGCAUUUACCAGACUUCUAACUUCCGUGUGCAGCCUACUGAGUCUAUUGUGCGUUUCCCUAACAUUACUAACCUGUGCCCUUUCGACGAGGUGUUCAACGCUACUCGUUUCGCUUCUGUGUACGCUUGGAACCGUAAGCGUAUUUCUAACUGCGUGGCUGACUACUCUGUGCUGUACAACCUGGCUCCUUUCUUCACUUUCAAGUGCUACGGCGUGUCUCCUACUAAGCUGAACGACCUGUGCUUCACUAACGUGUACGCUGACUCUUUCGUGAUUCGUGGCGACGAGGUGCGUCAGAUUGCUCCUGGCCAGACUGGCAACAUUGCUGACUACAACUACAAGCUGCCUGACGACUUCACUGGCUGCGUGAUUGCUUGGAACUCUAACAAGCUGGACUCUAAGGUGUCUGGCAACUACAACUACCUGUACCGUCUGUUCCGUAAGUCUAACCUGAAGCCUUUCGAGCGUGACAUUUCUACUGAGAUUUACCAGGCUGGCAACAAGCCUUGCAACGGCGUGGCUGGCUUCAACUGCUACUUCCCUCUGCGUUCUUACUCUUUCCGUCCUACUUACGGCGUGGGCCACCAGCCUUACCGUGUGGUGGUGCUGUCUUUCGAGCUGCUGCACGCUCCUGCUACUGUGUGCGGCCCUAAGAAGUCUACUAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACUGGCGUGCUGACUGAGUCUAACAAGAAGUUCCUGCCUUUCCAGCAGUUCGGCCGUGACAUUGCUGACACUACUGACGCUGUGCGUGACCCUCAGACUCUGGAGAUUCUGGACAUUACUCCUUGCUCUUUCGGCGGCGUGUCUGUGAUUACUCCUGGCACUAACACUUCUAACCAGGUGGCUGUGCUGUACCAGGGCGUGAACUGCACUGAGGUGCCUGUGGCUAUUCACGCUGACCAGCUGACUCCUACUUGGCGUGUGUACUCUACUGGCUCUAACGUGUUCCAGACUCGUGCUGGCUGCCUGAUUGGCGCUGAGUACGUGAACAACUCUUACGAGUGCGACAUUCCUAUUGGCGCUGGCAUUUGCGCUUCUUACCAGACUCAGACUAAGUCUCACCGUCGUGCUCGUUCUGUGGCUUCUCAGUCUAUUAUUGCUUACACUAUGUCUCUGGGCGCUGAGAACUCUGUGGCUUACUCUAACAACUCUAUUGCUAUUCCUACUAACUUCACUAUUUCUGUGACUACUGAGAUUCUGCCUGUGUCUAUGACUAAGACUUCUGUGGACUGCACUAUGUACAUUUGCGGCGACUCUACUGAGUGCUCUAACCUGCUGCUGCAGUACGGCUCUUUCUGCACUCAGCUGAAGCGUGCUCUGACUGGCAUUGCUGUGGAGCAGGACAAGAACACUCAGGAGGUGUUCGCUCAGGUGAAGCAGAUUUACAAGACUCCUCCUAUUAAGUACUUCGGCGGCUUCAACUUCUCUCAGAUUCUGCCUGACCCUUCUAAGCCUUCUAAGCGUUCUUUCAUUGAGGACCUGCUGUUCAACAAGGUGACUCUGGCUGACGCUGGCUUCAUUAAGCAGUACGGCGACUGCCUGGGCGACAUUGCUGCUCGUGACCUGAUUUGCGCUCAGAAGUUCAAGGGCCUGACUGUGCUGCCUCCUCUGCUGACUGACGAGAUGAUUGCUCAGUACACUUCUGCUCUGCUGGCUGGCACUAUUACUUCUGGCUGGACUUUCGGCGCUGGCGCUGCUCUGCAGAUUCCUUUCGCUAUGCAGAUGGCUUACCGUUUCAACGGCAUUGGCGUGACUCAGAACGUGCUGUACGAGAACCAGAAGCUGAUUGCUAACCAGUUCAACUCUGCUAUUGGCAAGAUUCAGGACUCUCUGUCUUCUACUGCUUCUGCUCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCUCAGGCUCUGAACACUCUGGUGAAGCAGCUGUCUUCUAAGUUCGGCGCUAUUUCUUCUGUGCUGAACGACAUUUUCUCUCGUCUGGACAAGGUGGAGGCUGAGGUGCAGAUUGACCGUCUGAUUACUGGCCGUCUGCAGUCUCUGCAGACUUACGUGACUCAGCAGCUGAUUCGUGCUGCUGAGAUUCGUGCUUCUGCUAACCUGGCUGCUACUAAGAUGUCUGAGUGCGUGCUGGGCCAGUCUAAGCGUGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGUCUUUCCCUCAGUCUGCUCCUCACGGCGUGGUGUUCCUGCACGUGACUUACGUGCCUGCUCAGGAGAAGAACUUCACUACUGCUCCUGCUAUUUGCCACGACGGCAAGGCUCACUUCCCUCGUGAGGGCGUGUUCGUGUCUAACGGCACUCACUGGUUCGUGACUCAGCGUAACUUCUACGAGCCUCAGAUUAUUACUACUGACAACACUUUCGUGUCUGGCAACUGCGACGUGGUGAUUGGCAUUGUGAACAACACUGUGUACGACCCUCUGCAGCCUGAGCUGGACUCUUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACUUCUCCUGACGUGGACCUGGGCGACAUUUCUGGCAUUAACGCUUCUGUGGUGAACAUUCAGAAGGAGAUUGACCGUCUGAACGAGGUGGCUAAGAACCUGAACGAGUCUCUGAUUGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUUAAGUGGCCUUGGUACAUUUGGCUGGGCUUCAUUGCUGGCCUGAUUGCUAUUGUGAUGGUGACUAUUAUGCUGUGCUGCAUGACUUCUUGCUGCUCUUGCCUGAAGGGCUGCUGCUCUUGCGGCUCUUGCUGCAAGUUCGACGAGGACGACUCUGAGCCUGUGCUGAAGGGCGUGAAGCUGCACUACACUUGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAUAUGACUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA(SEQ ID NO:20)
GAGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACCAUGUUCGUGUUCCUGGUGCUGCUGCCUCUGGUGUCUUCUCAGUGCGUGAACCUGACUACUCGUACUCAGCUGCCUCCUGCUUACACUAACUCUUUCACUCGUGGCGUGUACUACCCUGACAAGGUGUUCCGUUCUUCUGUGCUGCACUCUACUCAGGACCUGUUCCUGCCUUUCUUCUCUAACGUGACUUGGUUCCACGUGAUUUCUGGCACUAACGGCACUAAGCGUUUCGACAACCCUGUGCUGCCUUUCAACGACGGCGUGUACUUCGCUUCUAUUGAGAAGUCUAACAUUAUUCGUGGCUGGAUUUUCGGCACUACUCUGGACUCUAAGACUCAGUCUCUGCUGAUUGUGAACAACGCUACUAACGUGGUGAUUAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCUUUCCUGGACCACAAGAACAACAAGUCUUGGAUGGAGUCUGAGUUCCGUGUGUACUCUUCUGCUAACAACUGCACUUUCGAGUACGUGUCUCAGCCUUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGUGAGUUCGUGUUCAAGAACAUUGACGGCUACUUCAAGAUUUACUCUAAGCACACUCCUAUUAUUGUGCGUGAGCCUGAGGACCUGCCUCAGGGCUUCUCUGCUCUGGAGCCUCUGGUGGACCUGCCUAUUGGCAUUAACAUUACUCGUUUCCAGACUCUGCUGGCUCUGCACCGUUCUUACCUGACUCCUGGCGACUCUUCUUCUGGCUGGACUGCUGGCGCUGCUGCUUACUACGUGGGCUACCUGCAGCCUCGUACUUUCCUGCUGAAGUACAACGAGAACGGCACUAUUACUGACGCUGUGGACUGCGCUCUGGACCCUCUGUCUGAGACUAAGUGCACUCUGAAGUCUUUCACUGUGGAGAAGGGCAUUUACCAGACUUCUAACUUCCGUGUGCAGCCUACUGAGUCUAUUGUGCGUUUCCCUAACAUUACUAACCUGUGCCCUUUCGACGAGGUGUUCAACGCUACUCGUUUCGCUUCUGUGUACGCUUGGAACCGUAAGCGUAUUUCUAACUGCGUGGCUGACUACUCUGUGCUGUACAACCUGGCUCCUUUCUUCACUUUCAAGUGCUACGGCGUGUCUCCUACUAAGCUGAACGACCUGUGCUUCACUAACGUGUACGCUGACUCUUUCGUGAUUCGUGGCGACGAGGUGCGUCAGAUUGCUCCUGGCCAGACUGGCAACAUUGCUGACUACAACUACAAGCUGCCUGACGACUUCACUGGCUGCGUGAUUGCUUGGAACUCUAACAAGCUGGACUCUAAGGUGUCUGGCAACUACAACUACCUGUACCGUCUGUUCCGUAAGUCUAACCUGAAGCCUUUCGAGCGUGACAUUUCUACUGAGAUUUACCAGGCUGGCAACAAGCCUUGCAACGGCGUGGCUGGCUUCAACUGCUACUUCCCUCUGCGUUCUUACUCUUUCCGUCCUACUUACGGCGUGGGCCACCAGCCUUACCGUGUGGUGGUGCUGUCUUUCGAGCUGCUGCACGCUCCUGCUACUGUGUGCGGCCCUAAGAAGUCUACUAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACUGGCGUGCUGACUGAGUCUAACAAGAAGUUCCUGCCUUUCCAGCAGUUCGGCCGUGACAUUGCUGACACUACUGACGCUGUGCGUGACCCUCAGACUCUGGAGAUUCUGGACAUUACUCCUUGCUCUUUCGGCGGCGUGUCUGUGAUUACUCCUGGCACUAACACUUCUAACCAGGUGGCUGUGCUGUACCAGGGCGUGAACUGCACUGAGGUGCCUGUGGCUAUUCACGCUGACCAGCUGACUCCUACUUGGCGUGUGUACUCUACUGGCUCUAACGUGUUCCAGACUCGUGCUGGCUGCCUGAUUGGCGCUGAGUACGUGAACAACUCUUACGAGUGCGACAUUCCUAUUGGCGCUGGCAUUUGCGCUUCUUACCAGACUCAGACUAAGUCUCACCGUCGUGCUCGUUCUGUGGCUUCUCAGUCUAUUAUUGCUUACACUAUGUCUCUGGGCGCUGAGAACUCUGUGGCUUACUCUAACAACUCUAUUGCUAUUCCUACUAACUUCACUAUUUCUGUGACUACUGAGAUUCUGCCUGUGUCUAUGACUAAGACUUCUGUGGACUGCACUAUGUACAUUUGCGGCGACUCUACUGAGUGCUCUAACCUGCUGCUGCAGUACGGCUCUUUCUGCACUCAGCUGAAGCGUGCUCUGACUGGCAUUGCUGUGGAGCAGGACAAGAACACUCAGGAGGUGUUCGCUCAGGUGAAGCAGAUUUACAAGACUCCUCCUAUUAAGUACUUCGGCGGCUUCAACUUCUCUCAGAUUCUGCCUGACCCUUCUAAGCCUUCUAAGCGUUCUUUCAUUGAGGACCUGCUGUUCAACAAGGUGACUCUGGCUGACGCUGGCUUCAUUAAGCAGUACGGCGACUGCCUGGGCGACAUUGCUGCUCGUGACCUGAUUUGCGCUCAGAAGUUCAAGGGCCUGACUGUGCUGCCUCCUCUGCUGACUGACGAGAUGAUUGCUCAGUACACUUCUGCUCUGCUGGCUGGCACUAUUACUUCUGGCUGGACUUUCGGCGCUGGCGCUGCUCUGCAGAUUCCUUUCGCUAUGCAGAUGGCUUACCGUUUCAACGGCAUUGGCGUGACUCAGAACGUGCUGUACGAGAACCAGAAGCUGAUUGCUAACCAGUUCAACUCUGCUAUUGGCAAGAUUCAGGACUCUCUGUCUUCUACUGCUUCUGCUCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCUCAGGCUCUGAACACUCUGGUGAAGCAGCUGUCUUCUAAGUUCGGCGCUAUUUCUUCUGUGCUGAACGACAUUUUCUCUCGUCUGGACAAGGUGGAGGCUGAGGUGCAGAUUGACCGUCUGAUUACUGGCCGUCUGCAGUCUCUGCAGACUUACGUGACUCAGCAGCUGAUUCGUGCUGCUGAGAUUCGUGCUUCUGCUAACCUGGCUGCUACUAAGAUGUCUGAGUGCGUGCUGGGCCAGUCUAAGCGUGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGUCUUUCCCUCAGUCUGCUCCUCACGGCGUGGUGUUCCUGCACGUGACUUACGUGCCUGCUCAGGAGAAGAACUUCACUACUGCUCCUGCUAUUUGCCACGACGGCAAGGCUCACUUCCCUCGUGAGGGCGUGUUCGUGUCUAACGGCACUCACUGGUUCGUGACUCAGCGUAACUUCUACGAGCCUCAGAUUAUUACUACUGACAACACUUUCGUGUCUGGCAACUGCGACGUGGUGAUUGGCAUUGUGAACAACACUGUGUACGACCCUCUGCAGCCUGAGCUGGACUCUUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACUUCUCCUGACGUGGACCUGGGCGACAUUUCUGGCAUUAACGCUUCUGUGGUGAACAUUCAGAAGGAGAUUGACCGUCUGAACGAGGUGGCUAAGAACCUGAACGAGUCUCUGAUUGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUUAAGUGGCCUUGGUACAUUUGGCUGGGCUUCAUUGCUGGCCUGAUUGCUAUUGUGAUGGUGACUAUUAUGCUGUGCUGCAUGACUUCUUGCUGCUCUUGCCUGAAGGGCUGCUGCUCUUGCGGCUCUUGCUGCAAGUUCGACGAGGACGACUCUGAGCCUGUGCUGAAGGGCGUGAAGCUGCACUACACUUGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAUAUGACUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA(SEQ ID NO:21)
GAGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACCAUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACGACGCGGACGCAGCUGCCCCCCGCCUACACGAACAGCUUCACGCGGGGCGUGUACUACCCCGACAAGGUGUUCCGGAGCAGCGUGCUGCACAGCACGCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACGUGGUUCCACGUGAUCAGCGGCACGAACGGCACGAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCAUCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACGACGCUGGACAGCAAGACGCAGAGCCUGCUGAUCGUGAACAACGCCACGAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACGUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACGCCCAUCAUCGUGCGGGAGCCCGAGGACCUGCCCCAGGGCUUCAGCGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACGCGGUUCCAGACGCUGCUGGCCCUGCACCGGAGCUACCUGACGCCCGGCGACAGCAGCAGCGGCUGGACGGCCGGCGCCGCCGCCUACUACGUGGGCUACCUGCAGCCCCGGACGUUCCUGCUGAAGUACAACGAGAACGGCACGAUCACGGACGCCGUGGACUGCGCCCUGGACCCCCUGAGCGAGACGAAGUGCACGCUGAAGAGCUUCACGGUGGAGAAGGGCAUCUACCAGACGAGCAACUUCCGGGUGCAGCCCACGGAGAGCAUCGUGCGGUUCCCCAACAUCACGAACCUGUGCCCCUUCGACGAGGUGUUCAACGCCACGCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACCUGGCCCCCUUCUUCACGUUCAAGUGCUACGGCGUGAGCCCCACGAAGCUGAACGACCUGUGCUUCACGAACGUGUACGCCGACAGCUUCGUGAUCCGGGGCGACGAGGUGCGGCAGAUCGCCCCCGGCCAGACGGGCAACAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACGGGCUGCGUGAUCGCCUGGAACAGCAACAAGCUGGACAGCAAGGUGAGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACGGAGAUCUACCAGGCCGGCAACAAGCCCUGCAACGGCGUGGCCGGCUUCAACUGCUACUUCCCCCUGCGGAGCUACAGCUUCCGGCCCACGUACGGCGUGGGCCACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCCGCCACGGUGUGCGGCCCCAAGAAGAGCACGAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUGAAGGGCACGGGCGUGCUGACGGAGAGCAACAAGAAGUUCCUGCCCUUCCAGCAGUUCGGCCGGGACAUCGCCGACACGACGGACGCCGUGCGGGACCCCCAGACGCUGGAGAUCCUGGACAUCACGCCCUGCAGCUUCGGCGGCGUGAGCGUGAUCACGCCCGGCACGAACACGAGCAACCAGGUGGCCGUGCUGUACCAGGGCGUGAACUGCACGGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACGCCCACGUGGCGGGUGUACAGCACGGGCAGCAACGUGUUCCAGACGCGGGCCGGCUGCCUGAUCGGCGCCGAGUACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGCGCCAGCUACCAGACGCAGACGAAGAGCCACCGGCGGGCCCGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACGAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACGAACUUCACGAUCAGCGUGACGACGGAGAUCCUGCCCGUGAGCAUGACGAAGACGAGCGUGGACUGCACGAUGUACAUCUGCGGCGACAGCACGGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACGCAGCUGAAGCGGGCCCUGACGGGCAUCGCCGUGGAGCAGGACAAGAACACGCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACGCCCCCCAUCAAGUACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACGCUGGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUGGGCGACAUCGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAAGGGCCUGACGGUGCUGCCCCCCCUGCUGACGGACGAGAUGAUCGCCCAGUACACGAGCGCCCUGCUGGCCGGCACGAUCACGAGCGGCUGGACGUUCGGCGCCGGCGCCGCCCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACGCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACGGCCAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCACAACGCCCAGGCCCUGAACACGCUGGUGAAGCAGCUGAGCAGCAAGUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCUUCAGCCGGCUGGACAAGGUGGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACGGGCCGGCUGCAGAGCCUGCAGACGUACGUGACGCAGCAGCUGAUCCGGGCCGCCGAGAUCCGGGCCAGCGCCAACCUGGCCGCCACGAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUCCCCCAGAGCGCCCCCCACGGCGUGGUGUUCCUGCACGUGACGUACGUGCCCGCCCAGGAGAAGAACUUCACGACGGCCCCCGCCAUCUGCCACGACGGCAAGGCCCACUUCCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACGCACUGGUUCGUGACGCAGCGGAACUUCUACGAGCCCCAGAUCAUCACGACGGACAACACGUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACGGUGUACGACCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAACCACACGAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGACCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACGAUCAUGCUGUGCUGCAUGACGAGCUGCUGCAGCUGCCUGAAGGGCUGCUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACGUGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAUAUGACUAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA(SEQ ID NO:22)
In another preferred embodiment, the mRNA nucleic acid molecule further comprises one or more functional nucleotide analogue modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m1ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C), preferably one or more or all uracils of the mRNA nucleic acid molecule are substituted with pseudouridine (ψ). In a particularly preferred embodiment, the inventors have achieved a reduction in mRNA immunogenicity by reducing the U (uridylic acid) content of the mRNA molecule by substitution of all uracils in the nucleic acid sequences SEQ ID NO:20-SEQ ID NO:22 with pseudouridine (ψ), the capping substituted sequences shown in SEQ ID NO:23-SEQ ID NO: 25. All uracils in the nucleic acid sequences SEQ ID NO. 6-SEQ ID NO. 8 are replaced by pseudouridine (psi), and the sequence after capping replacement is shown as SEQ ID NO. 26-SEQ ID NO. 28.
/>
/>
/>
/>
/>
/>
/>
/>
/>
/>
/>
Pharmaceutical composition
In another embodiment, provided herein is a pharmaceutical composition for inducing a neutralizing antibody response in a subject to a novel coronal variant omicron (b.1.1.529) S glycoprotein comprising an mRNA nucleic acid molecule as described above and a pharmaceutically acceptable carrier. Preferably, the pharmaceutical compositions described herein comprise an mRNA molecule of the sequence shown in any one of SEQ ID NOs 20 to 25 or a combination thereof, and a pharmaceutically acceptable carrier.
In some embodiments, the mRNA of the present disclosure is formulated in lipid nanoparticles (Lipid nanoparticle, LNP) to form a lipid nanoparticle form. Accordingly, in another aspect, the present invention provides a pharmaceutical composition comprising mRNA comprising the following components:
(1) Any one of the mRNA molecules with the sequences shown in SEQ ID NO. 20-SEQ ID NO. 25 or the combination of the mRNA molecules;
(2) 20-60% by mole of ionizable cationic lipids (ionizable lipids);
(3) 5-25% by mole of a non-cationic lipid;
(4) 25-55% by mole of sterols; and
(5) 0.5-15% by mole of PEG modified lipid.
In another embodiment, the ionizable cationic lipid whole structure can be divided into three parts, a head part, a connecting fragment and a tail part. In LNP formulations, the ionizable lipids appear neutral at physiological pH, while being positively charged in the acidic environment of the endosome. The pH-dependent ionization capacity makes ionizable lipids suitable materials for nucleic acid delivery due to the substantial improvement in effectiveness and toxicity characteristics. Preferably, the ionizable lipid is present in the formulation in a molar ratio of typically 30% to 50% of the total lipid. Preferred ionizable cationic lipids that have been used clinically include DLin-MC3-DMA, SM-102, ALC-0315, and the like. The head groups such as ALC-0315 and SM-102 contain a terminal hydroxyl group, which can reduce hydration of the head groups and enhance hydrogen bonding interactions with nucleic acids, thereby possibly enhancing transfection ability.
In another embodiment, the non-cationic lipid is generally a neutral helper phospholipid. Phospholipids are often used as structural lipids for LNP formulations because they can spontaneously organize into lipid bilayers and higher phase transition temperatures enhance the membrane stability of LNP. Preferably, the phospholipid is present in the formulation in a molar ratio of 10% to 20% of the total lipid. Preferred phospholipids that have been used clinically may be DSPC, DOPE and DGTS. Wherein the 1, 2-distearoyl-sn-glycero-3-phosphorylcholine (DSPC) structure consists of a phosphatidylcholine head group and two saturated 18 carbon tails, both tails forming a tightly packed lipid bilayer. DSPC is a preferred structural lipid in the clinically validated mRNA vaccine LNP.
In another embodiment, the sterols may be selected from cholesterol, beta-sitosterol, oxidized cholesterol derivatives, and the like. Cholesterol is a naturally abundant component of cell membranes and is often used as a structural lipid in LNP formulations. The cholesterol ratio in the LNP formulation may preferably be 30-50 mole percent.
In another embodiment, PEG-modified lipids (PEG-modified lipids) are an important component in LNP that regulates half-life and cellular uptake. Preferably, the PEG-modified lipids are present in the formulation at a molar ratio of 0.5-2.5% of the total lipid. PEG provides an external polymeric layer for LNP to prevent serum protein adsorption and uptake by the mononuclear phagocyte system, prolonging circulation time in vivo. PEG can also prevent aggregation of nanoparticles during storage and in blood. In some embodiments, the PEG-modified lipids of the present disclosure include PEG-modified phosphatidylethanolamine, PEG-modified phosphatidic acid, PEG-modified ceramide, PEG-modified dialkylamine, PEG-modified diacylglycerol, PEG-modified dialkylglycerol, and mixtures thereof. In some examples, the PEG-modified lipid is DMG-PEG, PEG-DOMG, PEG-DSG, and/or PEG-DPG, or the like. More preferred PEG modified lipid ALC0159.
In another embodiment, the LNP of the present disclosure comprises a mass ratio of ionizable cationic lipid component to mRNA of from about 10:1 to about 100:1. In another embodiment, the LNP of the present disclosure comprises a mass ratio of ionizable cationic lipid component to mRNA of from about 20:1 to about 40:1.
In another embodiment, the disclosed pharmaceutical composition, the mRNA molecules are encapsulated in a lipid shell and formulated into a lipid nanoparticle form, wherein the lipid nanoparticle generally comprises four lipid components of ALC0315, DSPC, cholesterol, and ALC 0159.
In another preferred embodiment, the present invention provides a pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 20-60% by mole of ionizable cationic lipid ALC0315;
(3) 5-25% by mole of DSPC;
(4) Cholesterol in 25-55% molar ratio; and
(5) 0.5-15% by mole of PEG modified lipid ALC0159;
wherein the mass ratio of the ionizable cationic lipid component to the mRNA is from about 10:1 to about 100:1.
In another preferred embodiment, the present invention provides a pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 30-50% by mole of ionizable cationic lipid ALC0315;
(3) 10-20% by mole of DSPC;
(4) 23-50% cholesterol by mole; and
(5) 0.5-2.5% by mole of PEG modified lipid ALC0159;
wherein the mass ratio of the ionizable cationic lipid component to the mRNA is from about 20:1 to about 40:1.
The mean particle size and particle size distribution of the LNP are important initial determinants of LNP quality and suitability for various applications. These features are typically characterized by Dynamic Light Scattering (DLS). In some embodiments, the presently disclosed LNPs have an average diameter of about 20-200 nm. In other embodiments, the LNPs of the disclosure have an average diameter of about 50nm to 150 nm. In other embodiments, the LNPs of the disclosure have an average diameter of about 70nm to 120 nm.
The surface charge of LNP is responsible for interactions with cell membranes and biological environments, and is typically assessed by Zeta potential measurements. One common method of adjusting the total charge on the surface of the LNP is to adjust the N/P ratio, i.e., the ratio of ionizable lipids (N, representing cationic amines) to nucleic acids (P, representing anionic phosphates). In some embodiments, the LNPs disclosed herein comprise an N to P ratio of about 2:1 to about 30:1. In other embodiments, the LNPs of the present disclosure include an N to P ratio of about 6:1. In other embodiments, the LNPs of the present disclosure include an N to P ratio of about 3:1.
Use of the same
In another embodiment, provided herein is the use of an mRNA nucleic acid molecule or pharmaceutical composition thereof for inducing a neutralizing antibody response in a subject to the novel coronavirus variant omacron (b.1.1.529) S glycoprotein in the manufacture of a medicament or vaccine for treating or preventing a novel coronavirus infection. Preferably, provided herein are mRNAs as shown in SEQ ID NO:20-SEQ ID NO:25 useful for inducing a neutralizing antibody response in a subject against the S glycoprotein of the novel coronavariant strain omacron (B.1.1.529), which mRNAs provided herein can control, prevent or treat infectious diseases caused by coronaviruses in a subject to be administered.
In some embodiments, the invention discloses the use of mRNA as shown in SEQ ID NO. 20-SEQ ID NO. 25 for the preparation of a medicament or vaccine for the treatment or prevention of novel coronavirus infections. In some embodiments, compositions according to the present disclosure, including mRNA and polypeptides encoded thereby, are useful for treating or preventing coronavirus infections, particularly novel coronavariant omicron (b.1.1.529) virus infections. The pharmaceutical composition may be administered to a subject as part of an active immunization regimen or as a therapeutic pharmaceutical composition. In some embodiments, the amount of mRNA provided to the subject may be a dose effective for immunoprophylaxis.
In another embodiment, the subject of the invention is a human or non-human mammal. In another embodiment, the subject is an adult, a human child, or a human infant. In another embodiment, the subject is at risk of or is susceptible to a coronavirus infection. In another embodiment, the subject is an elderly person. In another embodiment, the subject has been diagnosed as positive for a coronavirus infection. In another embodiment, the subject is asymptomatic.
The pharmaceutical composition may be administered with other prophylactic or therapeutic treatments. As used herein, when referring to a prophylactic composition (e.g., vaccine), the term "enhancer" refers to the additional administration of the prophylactic (vaccine) composition. After administration of the prophylactic pharmaceutical composition, an enhancer (or an enhanced vaccine) may be administered. The time of intermittent administration of the prophylactic pharmaceutical composition and the enhancer may be, but is not limited to, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 3 months, 6 months, or 1 year. The mRNA described herein can be formulated into the dosage forms described herein, for example, intranasal, intraperitoneal, intravenous, intraocular, intravitreal, intramuscular, intradermal, intracardiac intraperitoneal, or subcutaneous administration, preferably subcutaneous or intramuscular administration.
The mRNA provided herein can control, prevent or treat infectious diseases caused by coronaviruses in a subject to be administered, and the effective dose of mRNA can range from 10 μg to 5mg, more preferably from 10 μg to 1mg, from 10 μg to 500 μg, from 20 μg to 300 μg or from 40 μg to 200 μg, etc. The amount of therapeutic nucleic acid or composition thereof effective in treating, preventing and/or treating an infectious disease will depend on the nature of the disease being treated, the route of administration, the general health of the subject being administered, etc., and will generally be at the discretion of the physician. Standard clinical techniques, such as in vitro assays, may optionally be employed to assist in determining the optimal dosage range.
The present invention provides mRNA nucleic acid molecules, pharmaceutical compositions comprising the mRNA nucleic acid molecules, and uses thereof for controlling, preventing, and treating coronavirus infections. The technical scheme of the invention is summarized as follows:
1. a nucleic acid molecule comprising an S glycoprotein encoding coronavirus omicron (b.1.1.529), wherein said coding region comprises one or more open reading frames (open reading frame, ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529) having a protein sequence as set forth in SEQ ID No. 1; and having a nucleotide sequence at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identity to the sequence shown in SEQ ID NO. 2.
2. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule has a sequence shown in SEQ ID NO. 3-SEQ ID NO. 5.
3. Comprises mRNA molecules encoding coronavirus omacron (B.1.1.529) S glycoprotein, and the sequence of the mRNA molecules is shown as SEQ ID NO. 6-SEQ ID NO. 8.
4. An mRNA nucleic acid molecule comprising:
(i) A 5 'untranslated region (5' -UTR);
(ii) CDS, wherein the CDS comprises an Open Reading Frame (ORF) encoding a coronavirus omicron (B.1.1.529) antigen S glycoprotein capable of inducing an immune response comprising a polypeptide as set forth in SEQ ID NO:6-SEQ ID NO:8
The nucleotide sequence shown;
(iii) 3 '-untranslated region (3' -UTR).
5. The mRNA nucleic acid molecule of claim 4, wherein: wherein the 5' UTR comprises the sequence of SEQ ID NO. 9, SEQ ID NO. 10 or SEQ ID NO. 11.
6. The mRNA nucleic acid molecule of claim 4, wherein: the 5' -UTR sequence may or may not comprise a Kozak sequence.
7. The mRNA nucleic acid molecule of claim 6, wherein: wherein the 5' UTR may comprise the kozak sequence GCCACC or GCCANN et al, wherein N refers to A, T/U, C or G.
8. The mRNA nucleic acid molecule of claim 4, wherein: wherein the 3' UTR comprises the sequence of SEQ ID NO. 12, SEQ ID NO. 13 or SEQ ID NO. 14.
9. The mRNA nucleic acid molecule of claim 4, wherein: the mRNA nucleic acid molecules also contain native 5' -cap structures or analogs thereof produced by endogenous processes.
10. The mRNA nucleic acid molecule of claim 9, wherein: the 5' -cap analogue may optionally be selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G; m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc.
11. The mRNA nucleic acid molecule of claim 4, wherein: the mRNA nucleic acid molecule also comprises a poly-a tail or polyadenylation signal, optionally having a length of 80 to 180 nucleotides.
12. The mRNA nucleic acid molecule of claim 11, wherein: the poly A sequence (polyA) is 80-180A, and a linker sequence is arranged in the middle.
13. The mRNA nucleic acid molecule of claim 12, wherein: the poly A sequence (polyA) is 80-160A.
14. The mRNA nucleic acid molecule of claim 11, wherein: the poly A sequence is shown as SEQ ID NO. 15 or SEQ ID NO. 16.
15. The mRNA nucleic acid molecule according to any one of claims 3-11, characterized in that: the mRNA nucleic acid molecule further comprises one or more functional nucleotide analog modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m 1 ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C).
16. The mRNA nucleic acid molecule of claim 15, wherein: the mRNA includes pseudouridine (ψ) substitutions at one or more or all uracil positions of the nucleic acid.
17. The mRNA nucleic acid molecule of claim 16, wherein: uracil (U) as shown in any one of SEQ ID NO. 6-SEQ ID NO. 25 is replaced by pseudouridine (ψ).
18. An mRNA nucleic acid molecule, said mRNA molecule comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA), wherein CDS comprises an Open Reading Frame (ORF) encoding coronavirus omicron (B.1.1.529) antigen S glycoprotein capable of inducing immune response, the nucleotide sequence is shown as SEQ ID NO:6-SEQ ID NO: 8.
19. The mRNA nucleic acid molecule of claim 18, wherein: wherein the 5' -UTR comprises a Kozak sequence.
20. The mRNA nucleic acid molecule of claim 18, wherein: the CDS may further comprise a signal peptide sequence.
21. The mRNA nucleic acid molecule of claim 20, wherein: the signal peptide sequence is shown as SEQ ID NO. 17-SEQ ID NO. 19.
22. An mRNA molecule, said mRNA comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA);
wherein said 5' -cap structure is optionally selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G; m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc.;
wherein the 5' -UTR comprises a sequence shown as SEQ ID NO. 9-SEQ ID NO. 11;
wherein the CDS comprises an Open Reading Frame (ORF) which encodes a signal peptide and a coronavirus omacron (B.1.1.529) antigen S glycoprotein capable of inducing an immune response, wherein the sequence of the signal peptide is shown as SEQ ID NO. 17-SEQ ID NO. 19, and the sequence of the coronavirus omacron (B.1.1.529) antigen S glycoprotein is shown as SEQ ID NO. 6-SEQ ID NO. 8;
wherein the 3' -UTR comprises a nucleotide sequence shown as SEQ ID NO. 12-SEQ ID NO. 14;
Wherein the 3' -polyadenylation sequence (polyA) optionally has a length of 80 to 180 adenylates.
23. The mRNA nucleic acid molecule of claim 22, wherein: the sequence is shown as SEQ ID NO. 20-SEQ ID NO. 22.
24. The mRNA nucleic acid molecule of any one of claims 18-23, wherein: the mRNA nucleic acid molecule further comprises one or more functional nucleotide analog modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m 1 ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C).
25. The mRNA nucleic acid molecule of claim 24, wherein: one or more or all uracils of the mRNA nucleic acid molecule are replaced with pseudouridine (ψ).
26. The mRNA nucleic acid molecule of claim 24, wherein: all uracils in the nucleic acid sequences SEQ ID NO. 20-SEQ ID NO. 22 are replaced by pseudouridine (psi), and the replaced sequences are shown as SEQ ID NO. 23-SEQ ID NO. 25.
27. A pharmaceutical composition for inducing a neutralizing antibody response in a subject to a novel coronal variant omacron (b.1.1.529) S glycoprotein comprising the mRNA nucleic acid molecule of any of claims 4-26 and a pharmaceutically acceptable carrier.
28. The pharmaceutical composition according to claim 27, wherein: the pharmaceutical composition comprises an mRNA molecule shown as any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof and a pharmaceutically acceptable carrier.
29. A pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 20-60% by mole of ionizable cationic lipids (ionizable lipids);
(3) 5-25% by mole of a non-cationic lipid;
(4) 25-55% by mole of sterols; and
(5) 0.5-15% by mole of PEG modified lipid.
30. The pharmaceutical composition according to claim 29, wherein: the mole ratio of the ionizable cationic lipid is 30% -50%.
31. The pharmaceutical composition according to claim 29, wherein: the ionizable cationic lipid comprises DLin-MC3-DMA, SM-102, ALC-0315, etc.
32. The pharmaceutical composition according to claim 29, wherein: the molar ratio of the non-cationic lipid is 10% -20%.
33. The pharmaceutical composition according to claim 29, wherein: the non-cationic lipid is DSPC, DOPE or DGTS.
34. The pharmaceutical composition according to claim 29, wherein: the sterol is selected from cholesterol, beta-sitosterol, oxidized cholesterol derivatives and the like.
35. The pharmaceutical composition according to claim 29, wherein: the mole ratio of the sterol is 30-50%.
35. The pharmaceutical composition according to claim 29, wherein: the molar ratio of the PEG modified lipid is 0.5-2.5%.
36. The pharmaceutical composition according to claim 29, wherein: the PEG modified lipid comprises PEG modified phosphatidylethanolamine, PEG modified phosphatidic acid, PEG modified ceramide, PEG modified dialkylamine, PEG modified diacylglycerol, PEG modified dialkylglycerol and mixture thereof.
37. A pharmaceutical composition according to claim 36, wherein: the PEG modified lipid is DMG-PEG, PEG-DOMG, PEG-DSG and/or PEG-DPG.
38. The pharmaceutical composition according to claim 37, wherein: the PEG modified lipid is ALC0159.
39. The pharmaceutical composition according to claim 29, wherein: the pharmaceutical composition has a mass ratio of ionizable cationic lipid component to mRNA of from about 10:1 to about 100:1.
40. The pharmaceutical composition according to claim 39, wherein: the pharmaceutical composition has a mass ratio of ionizable cationic lipid component to mRNA of from about 20:1 to about 40:1.
41. A pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 20-60% by mole of ionizable cationic lipid ALC0315;
(3) 5-25% by mole of DSPC;
(4) Cholesterol in 25-55% molar ratio; and
(5) 0.5-15% by mole of PEG modified lipid ALC0159;
wherein the mass ratio of the ionizable cationic lipid component to the mRNA is from about 10:1 to about 100:1.
42. A pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 30-50% by mole of ionizable cationic lipid ALC0315;
(3) 10-20% by mole of DSPC;
(4) 23-50% cholesterol by mole; and
(5) 0.5-2.5% by mole of PEG modified lipid ALC0159;
wherein the mass ratio of the ionizable cationic lipid component to the mRNA is from about 20:1 to about 40:1.
43. The pharmaceutical composition according to any one of claims 29-42, wherein: the pharmaceutical composition forms Lipid Nanoparticles (LNPs), the LNPs having an average diameter of about 20-200 nm.
44. The pharmaceutical composition according to claim 43, wherein: the LNP has an average diameter of about 50nm to 150 nm.
45. The pharmaceutical composition of claim 44, wherein: the LNP has an average diameter of about 70nm to 120 nm.
46. The pharmaceutical composition according to claim 43, wherein: the LNP comprises an N to P ratio of about 2:1 to about 30:1, wherein the N/P ratio is the ratio of ionizable lipids (N, representing cationic amines) to nucleic acids (P, representing anionic phosphates).
47. The pharmaceutical composition according to claim 46, wherein: the LNP includes an N to P ratio of about 6:1.
48. The pharmaceutical composition according to claim 46, wherein: the LNP includes an N to P ratio of about 3:1.
49. Use of an mRNA nucleic acid molecule according to any one of claims 4-26 or a pharmaceutical composition according to any one of claims 27-48 for the preparation of a medicament or vaccine for the treatment or prevention of a novel coronavirus infection.
50. The use according to claim 49, wherein: the mRNA has a sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof.
51. The use according to claim 49, wherein: the mRNA drug or vaccine is administered to a subject that is a human or non-human mammal.
52. The use according to claim 51, characterized in that: the mRNA drug or vaccine is administered to a subject who is an adult, a human child, or a human young child.
53. The use according to claim 52, characterized in that: the subject to which the mRNA drug or vaccine is administered is at risk of, or is susceptible to, a coronavirus infection.
54. The use according to claim 52, characterized in that: the administration subject of the mRNA drug or vaccine is elderly.
55. The use according to claim 52, characterized in that: the subject to whom the mRNA drug or vaccine is administered has been diagnosed as positive for coronavirus infection.
56. The use according to claim 52, characterized in that: the administration subject of the mRNA drug or vaccine is asymptomatic.
57. The use according to claim 49, wherein: the mRNA drug or vaccine may be administered with the enhancer (or enhanced vaccine) after administration of the prophylactic pharmaceutical composition, and the time of intermittent administration of the prophylactic pharmaceutical composition and enhancer may be, but is not limited to, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 3 months, 6 months, or 1 year.
58. The use according to claim 49, wherein: the mRNA drug or vaccine is formulated for intranasal, intraperitoneal, intravenous, intraocular, intravitreal, intramuscular, intradermal, intracardiac intraperitoneal, or subcutaneous administration.
59. The use according to claim 58, wherein: the mRNA drug or vaccine is administered subcutaneously or intramuscularly.
60. The use according to claim 49, wherein: the effective dosage range of the mRNA drug or vaccine is 10 mug-5 mg.
61. The use according to claim 60, characterized in that: the effective dosage range of the mRNA medicine or vaccine is 10 mug-1 mg,10 mug-500 mug, 20 mug-300 mug or 40 mug-200 mug, etc.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to examples, but it will be understood by those skilled in the art that the following examples are only for illustrating the present invention and should not be construed as limiting the scope of the present invention. The specific conditions are not noted in the examples and are carried out according to conventional conditions or conditions recommended by the manufacturer.
Unless otherwise defined, the technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. In addition, any method or material similar or equivalent to those described may be used in the present invention.
Example 1 preparation of mRNA
The inventors optimized the novel coronal variant omacron (B.1.1.529) S glycoprotein coding region (SEQ ID NO: 2), and sequence optimization included: the GC content is regulated aiming at the expression in human body, the GC content of the sequence is improved, and the transcription optimization omicron (B.1.1.529) S glycoprotein DNA sequence with the nucleotide sequence shown as SEQ ID NO. 3 is obtained through optimization; optimizing the codon to obtain a transcription optimized omicron (B.1.1.529) S glycoprotein DNA sequence with a nucleotide sequence shown as SEQ ID NO. 4; increasing GC content and optimizing codon to obtain transcription optimized omicron (B.1.1.529) S glycoprotein DNA sequence with nucleotide sequence shown as SEQ ID N0.5. The nucleotide sequence makes the transcribed mRNA structure more stable and the translation efficiency of target protein in mammal and human body higher. mRNA sequences obtained through transcription according to the sequences of SEQ ID NO. 3-SEQ ID NO. 5 are respectively shown as SEQ ID NO. 6-SEQ ID NO. 8. The inventor realizes reducing mRNA immunogenicity by reducing the content of U (uridylic acid) in mRNA molecules, and substitutes all uracils in nucleic acid sequences SEQ ID NO. 6-SEQ ID NO. 8 with pseudo-uridine (psi), and the substituted sequences are shown as SEQ ID NO. 26-SEQ ID NO. 28.
The nucleotide sequence of the optimized omicron (B.1.1.529) S glycoprotein mRNA is shown as SEQ ID NO. 26: 5' -AACC psi AACCACCCGCACCCAAC psi ACCGCCGGCG psi ACACACC psi CG psi CACCCGCGGGG psi C psi AC psi ACCCGGACAAGG psi C psi CCGC psi CG psi CGG psi CC psi ACC psi CGACCCAAGACC psi A psi CC psi ACG psi C psi CGA CAC psi C psi CAC psi GG psi CCACpsi A psi CGGGGACCAACGGGACCAAGCGC psi A kind of 13-CGACAACCCGG-13-3-13-9-13-3-13-9-13-3-13-to-13-to-13 psi CGACAACCCGG psi CC psi ACCG psi CAACGACGGGG psi C psi AC psi CGCG psi CGA psi AGAGAAG psi CGAAG psi AA psi ACGCGGC psi GGA psi A kind of psi CGGCAC psi AC psi C psi GGAC psi C psi AAGAC psi CAG psi C psi GC psi GA psi G psi GAACAACGC psi AC psi AAG psi GG A kind of 13C, 3G, C, 3G, C, G, C, G, 3, C, 3, C, of; GC-PSGGCGC-PSI-PSG-GAG-PSG-G-PSG-G-carrier GC ψGGCGC ψGC ψ GC ψ GC ψ AC ψ ACG ψ GGGC ψ ACC ψ GCAGCC ψ CG ψ AC ψ CC ψ GC ψ GAAG ψ ACAACGAGAACGGCAC A.psigargin AC psi GACGC psi G psi GGAC psi GCGC psi GGACCC psi C psi G psi C psi GAGAGAC psi AAG psi GCAC psi C psi GAAG GAGA psi-psi ACCAGGC-psi GGCAACAAGCC psi-psi GCAACGGCG psi GGC-psi CAAC-psi GC-psi CCC-psi GCG-psi C-psi AC ψ C ψ cci CCG ψ CC ψ AC ψ acggg G psi GGGCCACCAGCC ψ cpcg CG psi G dg GG GC ψ C ψ CGAGC GAGA psi ACCAGGC GGCAACAAGCC psi ACGGC psi GCAACGGCG psi GGC psi CAAC psi GC psi CCC psi GCG psi C psi AC psi CCC psi ACpsi psi GGGCCACCAGCC psi ACCG psi G psi GG GGC psi G psi CGpsi C psi CGpsi CGstep CGAGC CGstep A kind of 13-GC-13-GCACGC-13-CC-13-GC-13-C-13-G-C-13-G-13-C-13-G-13-G and G and a G G G and a G G and G and G the level of the ground plane is the same as the level of the ground plane, the level of the ground plane is the same as the level of the ground plane, and the level of the ground plane The method comprises the steps of determining a total of a total and a total of psi AAGCAG psi ACGGCGAC psi GCC psi GGGCGACA psi GC psi CG psi GACC psi GA psi GG psi GCGAAG GGCC A kind of PSAAGCG PSPSPSPSPSPSGAGGACACGAPSGAPSGAPSGAPSGAGAGGCGCAPSGAGGCCs of PSACPSGAPSGAPSGAPSGAGGCC of PSACGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAGGCGACPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSGAPSPSPSPSGAPSPSGAPSPSPSPSGAPSGAPSPSGAPSPSPSPSPSPSPSPSGAPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPSPScarrier to a-type-of three-stage dual-purpose antenna, common antenna, with high-core, common antenna, common mode, common mode,), common mode,), or method,) to), A method for preparing a composite material of a plurality of components, wherein the composite material comprises the steps of a plurality of components, a grade, a one to be one G the level of the ground plane is the same as the level of the ground plane, the level of the ground plane is the same as the level of the ground plane Psi CAGAAGGAGA psi GACCG psi C psi GAACGAGG psi GGC psi AAGAACGAACGAACGAG psi C psi GA psi GACC psi GCAGGAGC psi GGGCAAG psi ACGAGCAG either of the above-mentioned modes A kind of 13-C GACCG-3-C GAACGAGG-C-3-G-C-G-3-G-C-G GG-GAC-GAG-GAC-GAG-GAAGC-GAC-ACC-GAC-3' (SEQ ID NO: 26).
An optimized omicron (B.1.1.529) S glycoprotein mRNA (opt) nucleotide sequence is shown in SEQ ID NO. 27.
5’-AACCΨGACΨACΨCGΨACΨCAGCΨGCCΨCCΨGCΨΨACACΨAACΨCΨΨΨCACΨCGΨGGCGΨGΨACΨACCCΨGACAAGGΨGΨΨCCGΨΨCΨΨCΨGΨGCΨGCACΨCΨACΨCAGGACCΨGΨΨCCΨGCCΨΨΨCΨΨCΨCΨAACGΨGACΨΨGGΨΨCCACGΨGAΨΨΨCΨGGCACΨAACGGCACΨAAGCGΨΨΨCGACAACCCΨGΨGCΨGCCΨΨΨCAACGACGGCGΨGΨACΨΨCGCΨΨCΨAΨΨGAGAAGΨCΨAACAΨΨAΨΨCGΨGGCΨGGAΨΨΨΨCGGCACΨACΨCΨGGACΨCΨAAGACΨCAGΨCΨCΨGCΨGAΨΨGΨGAACAACGCΨACΨAACGΨGGΨGAΨΨAAGGΨGΨGCGAGΨΨCCAGΨΨCΨGCAACGACCCΨΨΨCCΨGGACCACAAGAACAACAAGΨCΨΨGGAΨGGAGΨCΨGAGΨΨCCGΨGΨGΨACΨCΨΨCΨGCΨAACAACΨGCACΨΨΨCGAGΨACGΨGΨCΨCAGCCΨΨΨCCΨGAΨGGACCΨGGAGGGCAAGCAGGGCAACΨΨCAAGAACCΨGCGΨGAGΨΨCGΨGΨΨCAAGAACAΨΨGACGGCΨACΨΨCAAGAΨΨΨACΨCΨAAGCACACΨCCΨAΨΨAΨΨGΨGCGΨGAGCCΨGAGGACCΨGCCΨCAGGGCΨΨCΨCΨGCΨCΨGGAGCCΨCΨGGΨGGACCΨGCCΨAΨΨGGCAΨΨAACAΨΨACΨCGΨΨΨCCAGACΨCΨGCΨGGCΨCΨGCACCGΨΨCΨΨACCΨGACΨCCΨGGCGACΨCΨΨCΨΨCΨGGCΨGGACΨGCΨGGCGCΨGCΨGCΨΨACΨACGΨGGGCΨACCΨGCAGCCΨCGΨACΨΨΨCCΨGCΨGAAGΨACAACGAGAACGGCACΨAΨΨACΨGACGCΨGΨGGACΨGCGCΨCΨGGACCCΨCΨGΨCΨGAGACΨAAGΨGCACΨCΨGAAGΨCΨΨΨCACΨGΨGGAGAAGGGCAΨΨΨACCAGACΨΨCΨAACΨΨCCGΨGΨGCAGCCΨACΨGAGΨCΨAΨΨGΨGCGΨΨΨCCCΨAACAΨΨACΨAACCΨGΨGCCCΨΨΨCGACGAGGΨGΨΨCAACGCΨACΨCGΨΨΨCGCΨΨCΨGΨGΨACGCΨΨGGAACCGΨAAGCGΨAΨΨΨCΨAACΨGCGΨGGCΨGACΨACΨCΨGΨGCΨGΨACAACCΨGGCΨCCΨΨΨCΨΨCACΨΨΨCAAGΨGCΨACGGCGΨGΨCΨCCΨACΨAAGCΨGAACGACCΨGΨGCΨΨCACΨAACGΨGΨACGCΨGACΨCΨΨΨCGΨGAΨΨCGΨGGCGACGAGGΨGCGΨCAGAΨΨGCΨCCΨGGCCAGACΨGGCAACAΨΨGCΨGACΨACAACΨACAAGCΨGCCΨGACGACΨΨCACΨGGCΨGCGΨGAΨΨGCΨΨGGAACΨCΨAACAAGCΨGGACΨCΨAAGGΨGΨCΨGGCAACΨACAACΨACCΨGΨACCGΨCΨGΨΨCCGΨAAGΨCΨAACCΨGAAGCCΨΨΨCGAGCGΨGACAΨΨΨCΨACΨGAGAΨΨΨACCAGGCΨGGCAACAAGCCΨΨGCAACGGCGΨGGCΨGGCΨΨCAACΨGCΨACΨΨCCCΨCΨGCGΨΨCΨΨACΨCΨΨΨCCGΨCCΨACΨΨACGGCGΨGGGCCACCAGCCΨΨACCGΨGΨGGΨGGΨGCΨGΨCΨΨΨCGAGCΨGCΨGCACGCΨCCΨGCΨACΨGΨGΨGCGGCCCΨAAGAAGΨCΨACΨAACCΨGGΨGAAGAACAAGΨGCGΨGAACΨΨCAACΨΨCAACGGCCΨGAAGGGCACΨGGCGΨGCΨGACΨGAGΨCΨAACAAGAAGΨΨCCΨGCCΨΨΨCCAGCAGΨΨCGGCCGΨGACAΨΨGCΨGACACΨACΨGACGCΨGΨGCGΨGACCCΨCAGACΨCΨGGAGAΨΨCΨGGACAΨΨACΨCCΨΨGCΨCΨΨΨCGGCGGCGΨGΨCΨGΨGAΨΨACΨCCΨGGCACΨAACACΨΨCΨAACCAGGΨGGCΨGΨGCΨGΨACCAGGGCGΨGAACΨGCACΨGAGGΨGCCΨGΨGGCΨAΨΨCACGCΨGACCAGCΨGACΨCCΨACΨΨGGCGΨGΨGΨACΨCΨACΨGGCΨCΨAACGΨGΨΨCCAGACΨCGΨGCΨGGCΨGCCΨGAΨΨGGCGCΨGAGΨACGΨGAACAACΨCΨΨACGAGΨGCGACAΨΨCCΨAΨΨGGCGCΨGGCAΨΨΨGCGCΨΨCΨΨACCAGACΨCAGACΨAAGΨCΨCACCGΨCGΨGCΨCGΨΨCΨGΨGGCΨΨCΨCAGΨCΨAΨΨAΨΨGCΨΨACACΨAΨGΨCΨCΨGGGCGCΨGAGAACΨCΨGΨGGCΨΨACΨCΨAACAACΨCΨAΨΨGCΨAΨΨCCΨACΨAACΨΨCACΨAΨΨΨCΨGΨGACΨACΨGAGAΨΨCΨGCCΨGΨGΨCΨAΨGACΨAAGACΨΨCΨGΨGGACΨGCACΨAΨGΨACAΨΨΨGCGGCGACΨCΨACΨGAGΨGCΨCΨAACCΨGCΨGCΨGCAGΨACGGCΨCΨΨΨCΨGCACΨCAGCΨGAAGCGΨGCΨCΨGACΨGGCAΨΨGCΨGΨGGAGCAGGACAAGAACACΨCAGGAGGΨGΨΨCGCΨCAGGΨGAAGCAGAΨΨΨACAAGACΨCCΨCCΨAΨΨAAGΨACΨΨCGGCGGCΨΨCAACΨΨCΨCΨCAGAΨΨCΨGCCΨGACCCΨΨCΨAAGCCΨΨCΨAAGCGΨΨCΨΨΨCAΨΨGAGGACCΨGCΨGΨΨCAACAAGGΨGACΨCΨGGCΨGACGCΨGGCΨΨCAΨΨAAGCAGΨACGGCGACΨGCCΨGGGCGACAΨΨGCΨGCΨCGΨGACCΨGAΨΨΨGCGCΨCAGAAGΨΨCAAGGGCCΨGACΨGΨGCΨGCCΨCCΨCΨGCΨGACΨGACGAGAΨGAΨΨGCΨCAGΨACACΨΨCΨGCΨCΨGCΨGGCΨGGCACΨAΨΨACΨΨCΨGGCΨGGACΨΨΨCGGCGCΨGGCGCΨGCΨCΨGCAGAΨΨCCΨΨΨCGCΨAΨGCAGAΨGGCΨΨACCGΨΨΨCAACGGCAΨΨGGCGΨGACΨCAGAACGΨGCΨGΨACGAGAACCAGAAGCΨGAΨΨGCΨAACCAGΨΨCAACΨCΨGCΨAΨΨGGCAAGAΨΨCAGGACΨCΨCΨGΨCΨΨCΨACΨGCΨΨCΨGCΨCΨGGGCAAGCΨGCAGGACGΨGGΨGAACCACAACGCΨCAGGCΨCΨGAACACΨCΨGGΨGAAGCAGCΨGΨCΨΨCΨAAGΨΨCGGCGCΨAΨΨΨCΨΨCΨGΨGCΨGAACGACAΨΨΨΨCΨCΨCGΨCΨGGACAAGGΨGGAGGCΨGAGGΨGCAGAΨΨGACCGΨCΨGAΨΨACΨGGCCGΨCΨGCAGΨCΨCΨGCAGACΨΨACGΨGACΨCAGCAGCΨGAΨΨCGΨGCΨGCΨGAGAΨΨCGΨGCΨΨCΨGCΨAACCΨGGCΨGCΨACΨAAGAΨGΨCΨGAGΨGCGΨGCΨGGGCCAGΨCΨAAGCGΨGΨGGACΨΨCΨGCGGCAAGGGCΨACCACCΨGAΨGΨCΨΨΨCCCΨCAGΨCΨGCΨCCΨCACGGCGΨGGΨGΨΨCCΨGCACGΨGACΨΨACGΨGCCΨGCΨCAGGAGAAGAACΨΨCACΨACΨGCΨCCΨGCΨAΨΨΨGCCACGACGGCAAGGCΨCACΨΨCCCΨCGΨGAGGGCGΨGΨΨCGΨGΨCΨAACGGCACΨCACΨGGΨΨCGΨGACΨCAGCGΨAACΨΨCΨACGAGCCΨCAGAΨΨAΨΨACΨACΨGACAACACΨΨΨCGΨGΨCΨGGCAACΨGCGACGΨGGΨGAΨΨGGCAΨΨGΨGAACAACACΨGΨGΨACGACCCΨCΨGCAGCCΨGAGCΨGGACΨCΨΨΨCAAGGAGGAGCΨGGACAAGΨACΨΨCAAGAACCACACΨΨCΨCCΨGACGΨGGACCΨGGGCGACAΨΨΨCΨGGCAΨΨAACGCΨΨCΨGΨGGΨGAACAΨΨCAGAAGGAGAΨΨGACCGΨCΨGAACGAGGΨGGCΨAAGAACCΨGAACGAGΨCΨCΨGAΨΨGACCΨGCAGGAGCΨGGGCAAGΨACGAGCAGΨACAΨΨAAGΨGGCCΨΨGGΨACAΨΨΨGGCΨGGGCΨΨCAΨΨGCΨGGCCΨGAΨΨGCΨAΨΨGΨGAΨGGΨGACΨAΨΨAΨGCΨGΨGCΨGCAΨGACΨΨCΨΨGCΨGCΨCΨΨGCCΨGAAGGGCΨGCΨGCΨCΨΨGCGGCΨCΨΨGCΨGCAAGΨΨCGACGAGGACGACΨCΨGAGCCΨGΨGCΨGAAGGGCGΨGAAGCΨGCACΨACACΨΨGA-3’(SEQ ID NO:27)。
An optimized omicron (B.1.1.529) S glycoprotein mRNA (gcH global opt) has the nucleotide sequence shown in SEQ ID NO. 28.
5’-AACCΨGACGACGCGGACGCAGCΨGCCCCCCGCCΨACACGAACAGCΨΨCACGCGGGGCGΨGΨACΨACCCCGACAAGGΨGΨΨCCGGAGCAGCGΨGCΨGCACAGCACGCAGGACCΨGΨΨCCΨGCCCΨΨCΨΨCAGCAACGΨGACGΨGGΨΨCCACGΨGAΨCAGCGGCACGAACGGCACGAAGCGGΨΨCGACAACCCCGΨGCΨGCCCΨΨCAACGACGGCGΨGΨACΨΨCGCCAGCAΨCGAGAAGAGCAACAΨCAΨCCGGGGCΨGGAΨCΨΨCGGCACGACGCΨGGACAGCAAGACGCAGAGCCΨGCΨGAΨCGΨGAACAACGCCACGAACGΨGGΨGAΨCAAGGΨGΨGCGAGΨΨCCAGΨΨCΨGCAACGACCCCΨΨCCΨGGACCACAAGAACAACAAGAGCΨGGAΨGGAGAGCGAGΨΨCCGGGΨGΨACAGCAGCGCCAACAACΨGCACGΨΨCGAGΨACGΨGAGCCAGCCCΨΨCCΨGAΨGGACCΨGGAGGGCAAGCAGGGCAACΨΨCAAGAACCΨGCGGGAGΨΨCGΨGΨΨCAAGAACAΨCGACGGCΨACΨΨCAAGAΨCΨACAGCAAGCACACGCCCAΨCAΨCGΨGCGGGAGCCCGAGGACCΨGCCCCAGGGCΨΨCAGCGCCCΨGGAGCCCCΨGGΨGGACCΨGCCCAΨCGGCAΨCAACAΨCACGCGGΨΨCCAGACGCΨGCΨGGCCCΨGCACCGGAGCΨACCΨGACGCCCGGCGACAGCAGCAGCGGCΨGGACGGCCGGCGCCGCCGCCΨACΨACGΨGGGCΨACCΨGCAGCCCCGGACGΨΨCCΨGCΨGAAGΨACAACGAGAACGGCACGAΨCACGGACGCCGΨGGACΨGCGCCCΨGGACCCCCΨGAGCGAGACGAAGΨGCACGCΨGAAGAGCΨΨCACGGΨGGAGAAGGGCAΨCΨACCAGACGAGCAACΨΨCCGGGΨGCAGCCCACGGAGAGCAΨCGΨGCGGΨΨCCCCAACAΨCACGAACCΨGΨGCCCCΨΨCGACGAGGΨGΨΨCAACGCCACGCGGΨΨCGCCAGCGΨGΨACGCCΨGGAACCGGAAGCGGAΨCAGCAACΨGCGΨGGCCGACΨACAGCGΨGCΨGΨACAACCΨGGCCCCCΨΨCΨΨCACGΨΨCAAGΨGCΨACGGCGΨGAGCCCCACGAAGCΨGAACGACCΨGΨGCΨΨCACGAACGΨGΨACGCCGACAGCΨΨCGΨGAΨCCGGGGCGACGAGGΨGCGGCAGAΨCGCCCCCGGCCAGACGGGCAACAΨCGCCGACΨACAACΨACAAGCΨGCCCGACGACΨΨCACGGGCΨGCGΨGAΨCGCCΨGGAACAGCAACAAGCΨGGACAGCAAGGΨGAGCGGCAACΨACAACΨACCΨGΨACCGGCΨGΨΨCCGGAAGAGCAACCΨGAAGCCCΨΨCGAGCGGGACAΨCAGCACGGAGAΨCΨACCAGGCCGGCAACAAGCCCΨGCAACGGCGΨGGCCGGCΨΨCAACΨGCΨACΨΨCCCCCΨGCGGAGCΨACAGCΨΨCCGGCCCACGΨACGGCGΨGGGCCACCAGCCCΨACCGGGΨGGΨGGΨGCΨGAGCΨΨCGAGCΨGCΨGCACGCCCCCGCCACGGΨGΨGCGGCCCCAAGAAGAGCACGAACCΨGGΨGAAGAACAAGΨGCGΨGAACΨΨCAACΨΨCAACGGCCΨGAAGGGCACGGGCGΨGCΨGACGGAGAGCAACAAGAAGΨΨCCΨGCCCΨΨCCAGCAGΨΨCGGCCGGGACAΨCGCCGACACGACGGACGCCGΨGCGGGACCCCCAGACGCΨGGAGAΨCCΨGGACAΨCACGCCCΨGCAGCΨΨCGGCGGCGΨGAGCGΨGAΨCACGCCCGGCACGAACACGAGCAACCAGGΨGGCCGΨGCΨGΨACCAGGGCGΨGAACΨGCACGGAGGΨGCCCGΨGGCCAΨCCACGCCGACCAGCΨGACGCCCACGΨGGCGGGΨGΨACAGCACGGGCAGCAACGΨGΨΨCCAGACGCGGGCCGGCΨGCCΨGAΨCGGCGCCGAGΨACGΨGAACAACAGCΨACGAGΨGCGACAΨCCCCAΨCGGCGCCGGCAΨCΨGCGCCAGCΨACCAGACGCAGACGAAGAGCCACCGGCGGGCCCGGAGCGΨGGCCAGCCAGAGCAΨCAΨCGCCΨACACGAΨGAGCCΨGGGCGCCGAGAACAGCGΨGGCCΨACAGCAACAACAGCAΨCGCCAΨCCCCACGAACΨΨCACGAΨCAGCGΨGACGACGGAGAΨCCΨGCCCGΨGAGCAΨGACGAAGACGAGCGΨGGACΨGCACGAΨGΨACAΨCΨGCGGCGACAGCACGGAGΨGCAGCAACCΨGCΨGCΨGCAGΨACGGCAGCΨΨCΨGCACGCAGCΨGAAGCGGGCCCΨGACGGGCAΨCGCCGΨGGAGCAGGACAAGAACACGCAGGAGGΨGΨΨCGCCCAGGΨGAAGCAGAΨCΨACAAGACGCCCCCCAΨCAAGΨACΨΨCGGCGGCΨΨCAACΨΨCAGCCAGAΨCCΨGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCΨΨCAΨCGAGGACCΨGCΨGΨΨCAACAAGGΨGACGCΨGGCCGACGCCGGCΨΨCAΨCAAGCAGΨACGGCGACΨGCCΨGGGCGACAΨCGCCGCCCGGGACCΨGAΨCΨGCGCCCAGAAGΨΨCAAGGGCCΨGACGGΨGCΨGCCCCCCCΨGCΨGACGGACGAGAΨGAΨCGCCCAGΨACACGAGCGCCCΨGCΨGGCCGGCACGAΨCACGAGCGGCΨGGACGΨΨCGGCGCCGGCGCCGCCCΨGCAGAΨCCCCΨΨCGCCAΨGCAGAΨGGCCΨACCGGΨΨCAACGGCAΨCGGCGΨGACGCAGAACGΨGCΨGΨACGAGAACCAGAAGCΨGAΨCGCCAACCAGΨΨCAACAGCGCCAΨCGGCAAGAΨCCAGGACAGCCΨGAGCAGCACGGCCAGCGCCCΨGGGCAAGCΨGCAGGACGΨGGΨGAACCACAACGCCCAGGCCCΨGAACACGCΨGGΨGAAGCAGCΨGAGCAGCAAGΨΨCGGCGCCAΨCAGCAGCGΨGCΨGAACGACAΨCΨΨCAGCCGGCΨGGACAAGGΨGGAGGCCGAGGΨGCAGAΨCGACCGGCΨGAΨCACGGGCCGGCΨGCAGAGCCΨGCAGACGΨACGΨGACGCAGCAGCΨGAΨCCGGGCCGCCGAGAΨCCGGGCCAGCGCCAACCΨGGCCGCCACGAAGAΨGAGCGAGΨGCGΨGCΨGGGCCAGAGCAAGCGGGΨGGACΨΨCΨGCGGCAAGGGCΨACCACCΨGAΨGAGCΨΨCCCCCAGAGCGCCCCCCACGGCGΨGGΨGΨΨCCΨGCACGΨGACGΨACGΨGCCCGCCCAGGAGAAGAACΨΨCACGACGGCCCCCGCCAΨCΨGCCACGACGGCAAGGCCCACΨΨCCCCCGGGAGGGCGΨGΨΨCGΨGAGCAACGGCACGCACΨGGΨΨCGΨGACGCAGCGGAACΨΨCΨACGAGCCCCAGAΨCAΨCACGACGGACAACACGΨΨCGΨGAGCGGCAACΨGCGACGΨGGΨGAΨCGGCAΨCGΨGAACAACACGGΨGΨACGACCCCCΨGCAGCCCGAGCΨGGACAGCΨΨCAAGGAGGAGCΨGGACAAGΨACΨΨCAAGAACCACACGAGCCCCGACGΨGGACCΨGGGCGACAΨCAGCGGCAΨCAACGCCAGCGΨGGΨGAACAΨCCAGAAGGAGAΨCGACCGGCΨGAACGAGGΨGGCCAAGAACCΨGAACGAGAGCCΨGAΨCGACCΨGCAGGAGCΨGGGCAAGΨACGAGCAGΨACAΨCAAGΨGGCCCΨGGΨACAΨCΨGGCΨGGGCΨΨCAΨCGCCGGCCΨGAΨCGCCAΨCGΨGAΨGGΨGACGAΨCAΨGCΨGΨGCΨGCAΨGACGAGCΨGCΨGCAGCΨGCCΨGAAGGGCΨGCΨGCAGCΨGCGGCAGCΨGCΨGCAAGΨΨCGACGAGGACGACAGCGAGCCCGΨGCΨGAAGGGCGΨGAAGCΨGCACΨACACGΨGA-3’(SEQ ID NO:28)。
The sequence of the transcription optimized omicron (b.1.1.529) S glycoprotein mRNA also includes the following elements: 5 '-cap structure, 5' -UTR (comprising Kozak sequence), CDS, 3'-UTR and 3' -polyadenylation sequence (polyA). The addition of 5 '-cap structure, 5' -UTR, kozak, 3 '-poly A sequence (polyA) and 3' -UTR sequence elements can further improve the stability of the sequence and avoid degradation. The CDS may further comprise a signal peptide sequence, a preferred signal peptide sequence being shown in SEQ ID NO 17-SEQ ID NO 19, for the purpose of enhancing translation of the protein, enhancing translation stability of the block and better folding of subsequent proteins. Key elements of omicron (b.1.1.529) S glycoprotein mRNA sequences are the coding region nucleotides, HBA1 'utr and polyA sequences using HBA 1' utr, kozak, novel coronal variants Omcron virus S glycoprotein. The sequence of the complete mRNA molecule is shown as SEQ ID NO. 20-SEQ ID NO. 22.
TABLE 1 mRNA sequence key elements
The sequence optimization scheme of omicron (b.1.1.529) S glycoprotein mRNA in this example is shown in table 1, and the specific preparation process is as follows:
pUC57-Kan plasmid containing Omicron mRNA (gcH global nonopt), omicron mRNA (opt) and Omicron mRNA (gcH global opt) DNA sequences in example 1 was transcribed in vitro to prepare Omicron mRNA (gcH global nonopt) mRNA, omicron mRNA (opt) mRNA and Omicron mRNA (gcH global opt) mRNA stock. The preparation of the original liquid by in vitro transcription of the plasmid is completed by entrusting the preparation of the Norflua company, and the preparation method comprises four steps:
in the first step, three different pUC57-Kan mRNA plasmids, omicron mRNA (gcH global nonopt), omicron mRNA (opt) and Omicron mRNA (gcH global opt), were digested with linearizing enzyme and purified to give linearized plasmids;
secondly, the obtained linearized plasmid was added to the corresponding T7 RNA Polymerase Mix, 10 Xreaction buffer, N1-Me-Pseudo UTP Solution (100 mM), ATP (100 mM), GTP (100 mM), CTP (100 mM) and N1-Me-pseudoUTP according to Kit instructions using a commercial mRNA IVT Kit T7 High Yield RNA Transcription Kit (N1-Me-pseudoUTP) -DD4202, the Reaction system was prepared, reacted at 37℃for 16 hours using RNase-free H2O, DNase I was added to the Reaction system, and the transcribed DNA template was digested at 37℃for 15 minutes (in a PCR apparatus). The reaction system is as follows:
T7 RNA Polymerase Mix may be added last when the system is configured.
Thirdly, purifying the mRNA stock solution synthesized in the second step by using magnetic beads according to an Oligo (dT) magnetic bead purification kit to obtain a purified product mRNA.
And fourthly, adding 10 multiplied by the buffer, GTP (10 mM), SAM (thioadenosylmethionine), vaccinia Capping Enzyme and mRNA Cap 2' -O-methyl-transfer ferrose into the obtained mRNA stock solution, supplementing the mRNA stock solution into a corresponding reaction system by using RNase-free H2O, reacting for 60-100min at 37 ℃, and purifying the mRNA stock solution according to the operation of the third step.
Reaction conditions: reacting at 37 ℃ for 60min
The inventor realizes reducing mRNA immunogenicity by reducing the content of U (uridylic acid) in mRNA molecules, and substitutes all uracils in the nucleic acid sequences SEQ ID NO. 20-SEQ ID NO. 22 by pseudo uridine (psi), and the sequence after capping substitution is finally shown as SEQ ID NO. 23-SEQ ID NO. 25.
Example 2 flow cytometry detection of expression effects after transfection of HEK293 cells with different omacron (B.1.1.529) S glycoprotein mRNA
HEK293 cells were transfected with the above-mentioned Omacron mRNA (gcH global nonopt), omacron mRNA (opt) and Omacron mRNA (gcH global opt) prepared by the in vitro transcription process, respectively. First, 4×10 is used 5 Plating of HEK293 cells at individual cell/mlCell transfection was performed at about 80% of cell fusion after 24 hours. Transfection System in 1 well HEK293 cells added to 6 well plates included 1. Mu.g mRNA and transfection reagent Lipofectamine TM 3000 Transfection Reagent, the specific transfection procedure was performed according to the transfection reagent product instructions. HEK293 cells 10, 24 and 72 hours post-transfection were immunolabeled with omacron virus S glycoprotein antibody (fig. 1A) and then the S glycoprotein expression positive cell proportion was detected using a flow cytometer (fig. 1B). Empty liposome wells served as negative controls. The results are shown in FIG. 1, and it can be seen from the results in the figure that there was no significant difference in the proportion of S glycoprotein expression-positive cells compared to the Omicron mRNA (gcH global nonopt) group after cells were transfected with Omicron mRNA (gcH global nonopt) mRNA, omicron mRNA (opt) mRNA and Omicron mRNA (gcH global opt).
Example 3 Indirect immunofluorescence detection of expression Effect of different omacron (B.1.1.529) S glycoprotein mRNA transfected HEK293 cells
HEK293 cells were transfected with the above-mentioned Omacron mRNA (gcH global nonopt), omacron mRNA (opt) and Omacron mRNA (gcH global opt) prepared by the in vitro transcription process, respectively. First, 4×10 is used 5 Cell transfection was performed at a density of individual cells/ml by plating HEK293 cells at a cell fusion state of approximately 80% after 24 hours. Transfection System in 1 well HEK293 cells added to 6 well plates included 1. Mu.g mRNA and transfection reagent Lipofectamine TM 3000 Transfection Reagent, the specific transfection procedure was performed according to the transfection reagent product instructions. HEK293 cells 24 hours after transfection were immunolabeled with omicron virus S glycoprotein antibody and then the proportion of S glycoprotein expressing positive cells was detected using fluorescence microscopy. Empty liposome wells served as negative controls. The results are shown in FIG. 2, from which it can be seen that there was no significant difference in the proportion of S glycoprotein expression-positive cells compared to the Omicron mRNA (gcH global nonopt) group after cells were transfected with Omicron mRNA (gcH global nonopt) mRNA, omicron mRNA (opt) mRNA and Omicron mRNA (gcH global opt).
Example 4Western blot detection of expression Effect of different omacron (B.1.1.529) S glycoprotein mRNA transfected HEK293 cells
HEK293 cells were transfected with the above-mentioned Omacron mRNA (opt), and Omacron mRNA (gcH global opt) prepared by the in vitro transcription process, respectively. First, 4×10 is used 5 Cell transfection was performed at a density of individual cells/ml by plating HEK293 cells at a cell fusion state of approximately 80% after 24 hours. Transfection System in 1 well HEK293 cells added to 6 well plates included 1. Mu.g mRNA and transfection reagent Lipofectamine TM 3000 Transfection Reagent, the specific transfection procedure was performed according to the transfection reagent product instructions. HEK293 cells 24, 48 and 72 hours after transfection were lysed with lysate, and then the amount of S glycoprotein expression was detected using Western blot. Empty liposome wells served as negative controls.
As shown in FIG. 3, it was found from the results of the figures that the expression level of S glycoprotein was higher in the Omicron mRNA (opt) and Omicron mRNA (gcH global opt) transfected groups than in the Omicron mRNA (gcH global nonopt) transfected groups, and that optimizing the ORF codons and increasing the GC content could increase the protein expression level. Omicron mRNA (opt) and Omicron mRNA (gcH global opt) are the mRNA with the sequences of SEQ ID NO. 23 and SEQ ID NO. 24 provided by the invention.
Example 5IgG neutralizing antibodies detection of serum antibodies after immunization of animals with different preparations of omacron (B.1.1.529) S glycoprotein mRNA
mRNA the mice immunization experiments were performed using Omacron mRNA (opt) and Omacron mRNA (gcH global opt) from example 4, and the vaccine delivery system formulation was: mRNA molecule 1mg in quantity, ionizable cationic lipid ALC0315 14.00mg,DSPC 2.93mg, cholesterol 6.52mg and PEG modified lipid ALC0159 1.55mg. The prepared mRNA-LNP (Lipid nanoparticles) was used in a mouse administration experiment in which each mouse of the Omacron mRNA (opt) mRNA test group was injected at an administration amount of 10. Mu.g, the injection volume was 100. Mu.l/mouse, and the Omacron mRNA (gcH global opt) mRNA test group was divided into 3 dose groups of 5. Mu.g, 10. Mu.g and 20. Mu.g, respectively, and each mouse was injected at an injection volume of 100. Mu.l/mouse, respectively. Spike positive control test groups were prepared using omacron recombinant S glycoprotein according to the formulation method described in Thermo Scientific Imject alum adjuvant instructions (2. Mu.g dose per mouse, 100. Mu.l/mouse injection volume) and negative control as an equal volume saline injection. The results of measuring the neutralizing antibody titer of the serum Omicron virus of the mice 12 days after one administration of the above various vaccine preparations are shown in FIG. 4, from which it can be seen that Omicron S specific IgG neutralizing antibody titers were significantly higher in the group consisting of 10. Mu.g of Omicron mRNA (opt) mRNA, 5. Mu.g of Omicron mRNA (gcH global opt), 10. Mu.g of Omicron mRNA (gcH global opt) and 20. Mu.g of Omicron mRNA (gcH global opt) than in the Spike positive control test group, and the group consisting of 10. Mu.g of Omicron mRNA (gcH global opt) and 20. Mu.g of Omicron mRNA (gcH global opt) from the results.
Claims (10)
1. A nucleic acid molecule comprising an S glycoprotein encoding coronavirus omicron (b.1.1.529), wherein said coding region comprises one or more open reading frames (open reading frame, ORFs), and wherein at least one ORF encodes an S glycoprotein of coronavirus omicron (b.1.1.529) having a protein sequence as set forth in SEQ ID No. 1; and having a nucleotide sequence at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identity to the sequence shown in SEQ ID NO. 2.
2. The nucleic acid molecule of claim 1, wherein the nucleic acid molecule has a sequence as set forth in SEQ ID NO. 3-SEQ ID NO. 5.
3. Comprises mRNA molecules encoding coronavirus omacron (B.1.1.529) S glycoprotein, and the sequence of the mRNA molecules is shown as SEQ ID NO. 6-SEQ ID NO. 8.
An mrna nucleic acid molecule comprising:
(i) A 5 'untranslated region (5' -UTR);
(ii) A CDS, wherein the CDS comprises an Open Reading Frame (ORF) encoding a coronavirus omicron (b.1.1.529) antigen S glycoprotein capable of inducing an immune response comprising a nucleotide sequence as set forth in SEQ ID No. 6-SEQ ID No. 8;
(iii) 3 '-untranslated region (3' -UTR).
5. An mRNA nucleic acid molecule, said mRNA molecule comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA), wherein CDS comprises an Open Reading Frame (ORF) encoding coronavirus omicron (B.1.1.529) antigen S glycoprotein capable of inducing immune response, the nucleotide sequence is shown as SEQ ID NO:6-SEQ ID NO: 8.
6. An mRNA molecule, said mRNA comprising the following elements: 5 '-cap structure, 5' -UTR, CDS, 3'-UTR and 3' -polyadenylation sequence (polyA);
wherein said 5' -cap structure is optionally selected from m7G (5 ') ppp (5 ') (2 ' OMeA) pG, 3' -O-Me-m7G (5 ') ppp (5 ') G; g (5 ') ppp (5') A; g (5 ') ppp (5') G;
m7G (5 ') ppp (5') A; m7G (5 ') ppp (5') G, etc.;
wherein the 5' -UTR comprises a sequence shown as SEQ ID NO. 9-SEQ ID NO. 11;
wherein the CDS comprises an Open Reading Frame (ORF) which encodes a signal peptide and a coronavirus omacron (B.1.1.529) antigen S glycoprotein capable of inducing an immune response, wherein the sequence of the signal peptide is shown as SEQ ID NO. 17-SEQ ID NO. 19, and the sequence of the coronavirus omacron (B.1.1.529) antigen S glycoprotein is shown as SEQ ID NO. 6-SEQ ID NO. 8;
Wherein the 3' -UTR comprises a nucleotide sequence shown as SEQ ID NO. 12-SEQ ID NO. 14;
wherein the 3' -polyadenylation sequence (polyA) optionally has a length of 80 to 180 adenylates.
7. The mRNA nucleic acid molecule of any one of claims 3-6, wherein: the mRNA nucleic acid molecule further comprises one or more functional nucleotide analog modifications selected from the group consisting of pseudouridine (ψ), 1-methyl-pseudouridine (m 1 ψ), 1-ethyl-pseudouridine (e 1 ψ), 5-methoxy-uridine (mo 5U) and 5-methylcytosine (m 5C).
8. A pharmaceutical composition for inducing a neutralizing antibody response in a subject to a novel coronal variant omacron (b.1.1.529) S glycoprotein comprising the mRNA nucleic acid molecule of any of claims 3-7 and a pharmaceutically acceptable carrier.
9. A pharmaceutical composition comprising mRNA comprising the following components:
(1) mRNA molecules with the sequence shown in any one of SEQ ID NO. 20-SEQ ID NO. 25 or a combination thereof;
(2) 20-60% by mole of ionizable cationic lipids (ionizable lipids);
(3) 5-25% by mole of a non-cationic lipid;
(4) 25-55% by mole of sterols; and
(5) 0.5-15% by mole of PEG modified lipid.
10. Use of an mRNA nucleic acid molecule according to any one of claims 3-7 or a pharmaceutical composition according to any one of claims 8-9 for the preparation of a medicament or vaccine for the treatment or prevention of a novel coronavirus infection.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211064993.XA CN117625652A (en) | 2022-09-01 | 2022-09-01 | Novel coronavirus variant mRNA vaccine and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211064993.XA CN117625652A (en) | 2022-09-01 | 2022-09-01 | Novel coronavirus variant mRNA vaccine and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117625652A true CN117625652A (en) | 2024-03-01 |
Family
ID=90032748
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211064993.XA Pending CN117625652A (en) | 2022-09-01 | 2022-09-01 | Novel coronavirus variant mRNA vaccine and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117625652A (en) |
-
2022
- 2022-09-01 CN CN202211064993.XA patent/CN117625652A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111218458B (en) | mRNAs encoding SARS-CoV-2 virus antigen and vaccine and preparation method of vaccine | |
JP2023511633A (en) | coronavirus RNA vaccine | |
US20240100145A1 (en) | Vlp enteroviral vaccines | |
JP2023513073A (en) | Respiratory virus immunization composition | |
JP2024514183A (en) | Epstein-Barr virus mRNA vaccine | |
CN113736801B (en) | mRNA and novel coronavirus mRNA vaccine comprising same | |
JP7445657B2 (en) | Compositions and methods for treating ornithine transcarbamylase deficiency | |
US20210284974A1 (en) | Compositions and methods for the treatment of ornithine transcarbamylase deficiency | |
WO2023051701A1 (en) | Mrna, protein and vaccine against sars-cov-2 infection | |
CN114507675A (en) | Novel coronavirus mRNA vaccine and preparation method thereof | |
US20240181038A1 (en) | Immunogenic compositions | |
CN116617382B (en) | Novel coronavirus vaccine, preparation method and application thereof | |
WO2023098679A1 (en) | Novel coronavirus mrna vaccine against mutant strains | |
CN117625652A (en) | Novel coronavirus variant mRNA vaccine and application thereof | |
KR20230000471A (en) | non naturally occurring 5 prime untranslated region and 3 prime untranslated region and use thereof | |
CN114126668A (en) | Compositions and methods for the treatment of hemochromatosis | |
CN117625651A (en) | Rabies virus mRNA vaccine and application thereof | |
EP4349990A1 (en) | Artificial polynucleotides for expressing proteins | |
WO2024021817A1 (en) | Vaccine against sars-cov-2, method for preparing same, and use thereof | |
EP4342993A1 (en) | Hpv infectious disease vaccine | |
CN117903263A (en) | Protein or mRNA vaccine for resisting new coronavirus and preparation method and application thereof | |
CN117886903A (en) | Protein or mRNA vaccine for resisting new coronavirus and preparation method and application thereof | |
WO2024015890A1 (en) | Norovirus mrna vaccines | |
CN117886904A (en) | Protein or mRNA vaccine for resisting new coronavirus and preparation method and application thereof | |
KR20240010693A (en) | Modified RNA for preparing mRNA vaccines and therapeutics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication |