EP4366767A2 - Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins
- Publication number
- EP4366767A2 EP22838408.7A
- Authority
- EP
- European Patent Office
- Prior art keywords
- utr
- nucleic acid
- sars
- peptide
- cov
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 298
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 243
- 108091023045 Untranslated Region Proteins 0.000 title claims abstract description 77
- 108091034117 Oligonucleotide Proteins 0.000 title claims abstract description 74
- 230000003612 virological effect Effects 0.000 title claims abstract description 48
- 230000014509 gene expression Effects 0.000 title claims description 168
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 title claims description 19
- 230000001965 increasing effect Effects 0.000 title description 40
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 277
- 210000004027 cell Anatomy 0.000 claims abstract description 199
- 238000004519 manufacturing process Methods 0.000 claims abstract description 96
- 241000700605 Viruses Species 0.000 claims abstract description 42
- 229960005486 vaccine Drugs 0.000 claims abstract description 36
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims abstract description 22
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims abstract description 22
- 150000007523 nucleic acids Chemical class 0.000 claims description 186
- 235000018102 proteins Nutrition 0.000 claims description 175
- 241001678559 COVID-19 virus Species 0.000 claims description 118
- 239000012634 fragment Substances 0.000 claims description 117
- 108020004999 messenger RNA Proteins 0.000 claims description 114
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 107
- 150000001413 amino acids Chemical group 0.000 claims description 94
- 108020003589 5' Untranslated Regions Proteins 0.000 claims description 91
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 90
- 229940024606 amino acid Drugs 0.000 claims description 75
- 108091036066 Three prime untranslated region Proteins 0.000 claims description 73
- 235000001014 amino acid Nutrition 0.000 claims description 73
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 71
- 102000039446 nucleic acids Human genes 0.000 claims description 61
- 108020004707 nucleic acids Proteins 0.000 claims description 61
- 230000002708 enhancing effect Effects 0.000 claims description 55
- 238000000034 method Methods 0.000 claims description 52
- 238000010367 cloning Methods 0.000 claims description 46
- 230000001413 cellular effect Effects 0.000 claims description 40
- 108020005345 3' Untranslated Regions Proteins 0.000 claims description 36
- 241000711573 Coronaviridae Species 0.000 claims description 36
- 229920001184 polypeptide Polymers 0.000 claims description 31
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 30
- 102000040430 polynucleotide Human genes 0.000 claims description 30
- 108091033319 polynucleotide Proteins 0.000 claims description 30
- 239000002157 polynucleotide Substances 0.000 claims description 29
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 26
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 26
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 26
- 235000009582 asparagine Nutrition 0.000 claims description 26
- 229960001230 asparagine Drugs 0.000 claims description 26
- 229940009098 aspartate Drugs 0.000 claims description 26
- 108700021021 mRNA Vaccine Proteins 0.000 claims description 25
- 229940126582 mRNA vaccine Drugs 0.000 claims description 24
- -1 virions Proteins 0.000 claims description 24
- 239000000427 antigen Substances 0.000 claims description 22
- 108091007433 antigens Proteins 0.000 claims description 22
- 102000036639 antigens Human genes 0.000 claims description 22
- 239000000203 mixture Substances 0.000 claims description 22
- 239000013604 expression vector Substances 0.000 claims description 21
- 230000001105 regulatory effect Effects 0.000 claims description 21
- 230000004927 fusion Effects 0.000 claims description 18
- 108010041986 DNA Vaccines Proteins 0.000 claims description 16
- 229940021995 DNA vaccine Drugs 0.000 claims description 16
- 102000037865 fusion proteins Human genes 0.000 claims description 16
- 108020001507 fusion proteins Proteins 0.000 claims description 16
- 239000004471 Glycine Substances 0.000 claims description 15
- 230000003592 biomimetic effect Effects 0.000 claims description 15
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 claims description 13
- 239000004475 Arginine Substances 0.000 claims description 13
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 claims description 13
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 13
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 13
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 13
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 13
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 13
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 13
- ZFOMKMMPBOQKMC-KXUCPTDWSA-N L-pyrrolysine Chemical compound C[C@@H]1CC=N[C@H]1C(=O)NCCCC[C@H]([NH3+])C([O-])=O ZFOMKMMPBOQKMC-KXUCPTDWSA-N 0.000 claims description 13
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 13
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 13
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 13
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 13
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 13
- 239000004472 Lysine Substances 0.000 claims description 13
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 13
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 13
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 13
- 239000004473 Threonine Substances 0.000 claims description 13
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 13
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 13
- 235000004279 alanine Nutrition 0.000 claims description 13
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 13
- 235000018417 cysteine Nutrition 0.000 claims description 13
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 13
- 229930195712 glutamate Natural products 0.000 claims description 13
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 13
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 13
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 13
- 229960000310 isoleucine Drugs 0.000 claims description 13
- 229930182817 methionine Natural products 0.000 claims description 13
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 13
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 claims description 13
- 229940055619 selenocysteine Drugs 0.000 claims description 13
- 235000016491 selenocysteine Nutrition 0.000 claims description 13
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 13
- 239000004474 valine Substances 0.000 claims description 13
- 210000002845 virion Anatomy 0.000 claims description 13
- 229940023041 peptide vaccine Drugs 0.000 claims description 12
- 241001430294 unidentified retrovirus Species 0.000 claims description 11
- 238000003306 harvesting Methods 0.000 claims description 5
- 125000001433 C-terminal amino-acid group Chemical group 0.000 claims description 4
- 241000709664 Picornaviridae Species 0.000 claims description 4
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 claims description 3
- 239000003085 diluting agent Substances 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 230000021615 conjugation Effects 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 30
- 238000000338 in vitro Methods 0.000 abstract description 15
- 210000004962 mammalian cell Anatomy 0.000 abstract description 11
- 238000001727 in vivo Methods 0.000 abstract description 8
- 239000013598 vector Substances 0.000 description 191
- 108010067390 Viral Proteins Proteins 0.000 description 75
- 238000004806 packaging method and process Methods 0.000 description 74
- 108020004414 DNA Proteins 0.000 description 69
- 102000053602 DNA Human genes 0.000 description 67
- 238000001890 transfection Methods 0.000 description 56
- 230000000694 effects Effects 0.000 description 50
- 239000005090 green fluorescent protein Substances 0.000 description 48
- 239000006228 supernatant Substances 0.000 description 48
- 230000014616 translation Effects 0.000 description 43
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 41
- 230000028327 secretion Effects 0.000 description 40
- 238000003556 assay Methods 0.000 description 39
- 238000012546 transfer Methods 0.000 description 39
- 230000009977 dual effect Effects 0.000 description 38
- 238000007792 addition Methods 0.000 description 36
- 125000003729 nucleotide group Chemical group 0.000 description 36
- 238000001262 western blot Methods 0.000 description 34
- 239000002773 nucleotide Substances 0.000 description 33
- 239000000047 product Substances 0.000 description 32
- 102100031673 Corneodesmosin Human genes 0.000 description 31
- 101710139375 Corneodesmosin Proteins 0.000 description 31
- 239000013612 plasmid Substances 0.000 description 31
- 239000002777 nucleoside Substances 0.000 description 29
- 230000000875 corresponding effect Effects 0.000 description 28
- 238000012986 modification Methods 0.000 description 28
- 238000013519 translation Methods 0.000 description 28
- 230000004048 modification Effects 0.000 description 27
- 239000013592 cell lysate Substances 0.000 description 23
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 21
- 239000012528 membrane Substances 0.000 description 21
- 235000000346 sugar Nutrition 0.000 description 21
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 20
- 239000000499 gel Substances 0.000 description 20
- 230000003248 secreting effect Effects 0.000 description 20
- 238000002474 experimental method Methods 0.000 description 19
- 238000013518 transcription Methods 0.000 description 19
- 230000035897 transcription Effects 0.000 description 19
- 230000006870 function Effects 0.000 description 18
- 102000048657 human ACE2 Human genes 0.000 description 18
- 239000005089 Luciferase Substances 0.000 description 17
- YHIPILPTUVMWQT-UHFFFAOYSA-N Oplophorus luciferin Chemical compound C1=CC(O)=CC=C1CC(C(N1C=C(N2)C=3C=CC(O)=CC=3)=O)=NC1=C2CC1=CC=CC=C1 YHIPILPTUVMWQT-UHFFFAOYSA-N 0.000 description 16
- 125000003835 nucleoside group Chemical group 0.000 description 16
- 108060001084 Luciferase Proteins 0.000 description 15
- 238000011160 research Methods 0.000 description 15
- 241000701022 Cytomegalovirus Species 0.000 description 14
- 238000002965 ELISA Methods 0.000 description 14
- 108010002350 Interleukin-2 Proteins 0.000 description 14
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 14
- 230000002068 genetic effect Effects 0.000 description 14
- 230000016784 immunoglobulin production Effects 0.000 description 14
- 230000035772 mutation Effects 0.000 description 14
- 239000002953 phosphate buffered saline Substances 0.000 description 14
- 230000008569 process Effects 0.000 description 14
- 101710204837 Envelope small membrane protein Proteins 0.000 description 13
- 101710145006 Lysis protein Proteins 0.000 description 13
- 238000005457 optimization Methods 0.000 description 13
- 238000000746 purification Methods 0.000 description 13
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 12
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 12
- 241000713666 Lentivirus Species 0.000 description 12
- 239000000020 Nitrocellulose Substances 0.000 description 12
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 12
- 101150063416 add gene Proteins 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 12
- KQNZDYYTLMIZCT-KQPMLPITSA-N brefeldin A Chemical compound O[C@@H]1\C=C\C(=O)O[C@@H](C)CCC\C=C\[C@@H]2C[C@H](O)C[C@H]21 KQNZDYYTLMIZCT-KQPMLPITSA-N 0.000 description 12
- JUMGSHROWPPKFX-UHFFFAOYSA-N brefeldin-A Natural products CC1CCCC=CC2(C)CC(O)CC2(C)C(O)C=CC(=O)O1 JUMGSHROWPPKFX-UHFFFAOYSA-N 0.000 description 12
- 239000012091 fetal bovine serum Substances 0.000 description 12
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 12
- 229920001220 nitrocellulos Polymers 0.000 description 12
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 11
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 11
- 241000196324 Embryophyta Species 0.000 description 11
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 11
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 11
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 11
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 11
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 11
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 11
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 11
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 11
- 238000010586 diagram Methods 0.000 description 11
- 239000003112 inhibitor Substances 0.000 description 11
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 10
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 10
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 10
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 10
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 10
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 10
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 10
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 10
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 10
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 10
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 10
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 10
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 10
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 10
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 10
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 10
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 10
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 10
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 10
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 10
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 10
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 10
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 10
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 10
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 10
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 10
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 10
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 10
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 10
- 108700026244 Open Reading Frames Proteins 0.000 description 10
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 10
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 10
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 10
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 10
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 10
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 10
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 10
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 10
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 10
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 10
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 10
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 10
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 10
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 10
- 230000033228 biological regulation Effects 0.000 description 10
- 230000008859 change Effects 0.000 description 10
- 238000001514 detection method Methods 0.000 description 10
- 238000010361 transduction Methods 0.000 description 10
- 230000026683 transduction Effects 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 108700019146 Transgenes Proteins 0.000 description 9
- 239000000872 buffer Substances 0.000 description 9
- 230000001419 dependent effect Effects 0.000 description 9
- 239000003623 enhancer Substances 0.000 description 9
- 238000000684 flow cytometry Methods 0.000 description 9
- 238000006386 neutralization reaction Methods 0.000 description 9
- 238000010606 normalization Methods 0.000 description 9
- 239000002245 particle Substances 0.000 description 9
- 230000008488 polyadenylation Effects 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 239000012096 transfection reagent Substances 0.000 description 9
- 230000032258 transport Effects 0.000 description 9
- 108091026890 Coding region Proteins 0.000 description 8
- 238000003365 immunocytochemistry Methods 0.000 description 8
- 238000011534 incubation Methods 0.000 description 8
- 230000001939 inductive effect Effects 0.000 description 8
- 238000011068 loading method Methods 0.000 description 8
- 238000000386 microscopy Methods 0.000 description 8
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 8
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 8
- 230000008685 targeting Effects 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 230000002103 transcriptional effect Effects 0.000 description 8
- 238000011144 upstream manufacturing Methods 0.000 description 8
- 101800001779 2'-O-methyltransferase Proteins 0.000 description 7
- 101800003073 2'-O-methyltransferase nsp16 Proteins 0.000 description 7
- 241000494545 Cordyline virus 2 Species 0.000 description 7
- 238000007702 DNA assembly Methods 0.000 description 7
- 108090000331 Firefly luciferases Proteins 0.000 description 7
- 241000963438 Gaussia <copepod> Species 0.000 description 7
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 7
- 108010076504 Protein Sorting Signals Proteins 0.000 description 7
- 241001112090 Pseudovirus Species 0.000 description 7
- 101800001255 Putative 2'-O-methyl transferase Proteins 0.000 description 7
- 210000004899 c-terminal region Anatomy 0.000 description 7
- 239000002299 complementary DNA Substances 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 231100000673 dose–response relationship Toxicity 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 239000001963 growth medium Substances 0.000 description 7
- 208000015181 infectious disease Diseases 0.000 description 7
- 150000003833 nucleoside derivatives Chemical class 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 108010092160 Dactinomycin Proteins 0.000 description 6
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 241000725303 Human immunodeficiency virus Species 0.000 description 6
- 108060003951 Immunoglobulin Proteins 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 229930040373 Paraformaldehyde Natural products 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- 239000006180 TBST buffer Substances 0.000 description 6
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 6
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 238000003776 cleavage reaction Methods 0.000 description 6
- 229960000640 dactinomycin Drugs 0.000 description 6
- 230000018109 developmental process Effects 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 239000003102 growth factor Substances 0.000 description 6
- 102000018358 immunoglobulin Human genes 0.000 description 6
- 230000005764 inhibitory process Effects 0.000 description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 6
- 239000006166 lysate Substances 0.000 description 6
- 229920002866 paraformaldehyde Polymers 0.000 description 6
- 150000004713 phosphodiesters Chemical class 0.000 description 6
- 230000001124 posttranscriptional effect Effects 0.000 description 6
- 238000007480 sanger sequencing Methods 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 229910052717 sulfur Inorganic materials 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 239000012224 working solution Substances 0.000 description 6
- 108020004705 Codon Proteins 0.000 description 5
- 102000004127 Cytokines Human genes 0.000 description 5
- 108090000695 Cytokines Proteins 0.000 description 5
- 241000206602 Eukaryota Species 0.000 description 5
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 5
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 5
- 241000714474 Rous sarcoma virus Species 0.000 description 5
- 101000667982 Severe acute respiratory syndrome coronavirus 2 Envelope small membrane protein Proteins 0.000 description 5
- 238000004113 cell culture Methods 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000007423 decrease Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 238000011161 development Methods 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 239000000975 dye Substances 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 238000002372 labelling Methods 0.000 description 5
- 238000005259 measurement Methods 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000001254 nonsecretory effect Effects 0.000 description 5
- 229920002401 polyacrylamide Polymers 0.000 description 5
- 238000001742 protein purification Methods 0.000 description 5
- 238000011002 quantification Methods 0.000 description 5
- 238000003753 real-time PCR Methods 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 238000004448 titration Methods 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- UAIUNKRWKOVEES-UHFFFAOYSA-N 3,3',5,5'-tetramethylbenzidine Chemical compound CC1=C(N)C(C)=CC(C=2C=C(C)C(N)=C(C)C=2)=C1 UAIUNKRWKOVEES-UHFFFAOYSA-N 0.000 description 4
- 241000180579 Arca Species 0.000 description 4
- 241000283074 Equus asinus Species 0.000 description 4
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 4
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 4
- 108010052285 Membrane Proteins Proteins 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- 101710144128 Non-structural protein 2 Proteins 0.000 description 4
- 229930182555 Penicillin Natural products 0.000 description 4
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 4
- 102100022648 Reticulon-2 Human genes 0.000 description 4
- 108091058545 Secretory proteins Proteins 0.000 description 4
- 102000040739 Secretory proteins Human genes 0.000 description 4
- 238000000692 Student's t-test Methods 0.000 description 4
- 108700005077 Viral Genes Proteins 0.000 description 4
- 239000007864 aqueous solution Substances 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 4
- 229960000074 biopharmaceutical Drugs 0.000 description 4
- 239000006143 cell culture medium Substances 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000010217 densitometric analysis Methods 0.000 description 4
- 239000012153 distilled water Substances 0.000 description 4
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 4
- 238000001976 enzyme digestion Methods 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 230000028993 immune response Effects 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 238000004020 luminiscence type Methods 0.000 description 4
- 229910052757 nitrogen Inorganic materials 0.000 description 4
- 229940049954 penicillin Drugs 0.000 description 4
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 4
- ZJAOAACCNHFJAH-UHFFFAOYSA-N phosphonoformic acid Chemical class OC(=O)P(O)(O)=O ZJAOAACCNHFJAH-UHFFFAOYSA-N 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 238000007619 statistical method Methods 0.000 description 4
- 239000011550 stock solution Substances 0.000 description 4
- 229960005322 streptomycin Drugs 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000036962 time dependent Effects 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000014621 translational initiation Effects 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 3
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical compound CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 3
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 3
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 3
- 102000007469 Actins Human genes 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 3
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 3
- 101100272788 Arabidopsis thaliana BSL3 gene Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 240000007124 Brassica oleracea Species 0.000 description 3
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 3
- 229940022962 COVID-19 vaccine Drugs 0.000 description 3
- SRBFZHDQGSBBOR-SOOFDHNKSA-N D-ribopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@@H]1O SRBFZHDQGSBBOR-SOOFDHNKSA-N 0.000 description 3
- 241001269524 Dura Species 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 108020005004 Guide RNA Proteins 0.000 description 3
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 3
- 108090001061 Insulin Proteins 0.000 description 3
- 102000004877 Insulin Human genes 0.000 description 3
- 102000014150 Interferons Human genes 0.000 description 3
- 108010050904 Interferons Proteins 0.000 description 3
- 108010063738 Interleukins Proteins 0.000 description 3
- 102000015696 Interleukins Human genes 0.000 description 3
- 241001024099 Olla Species 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 108091093037 Peptide nucleic acid Proteins 0.000 description 3
- 108091036407 Polyadenylation Proteins 0.000 description 3
- 229940096437 Protein S Drugs 0.000 description 3
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 3
- 241000315672 SARS coronavirus Species 0.000 description 3
- 108091007576 SARS-CoV-2 structural proteins Proteins 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- 108091027544 Subgenomic mRNA Proteins 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 3
- 229960005305 adenosine Drugs 0.000 description 3
- 238000001261 affinity purification Methods 0.000 description 3
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 230000000975 bioactive effect Effects 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 150000001720 carbohydrates Chemical class 0.000 description 3
- 235000014633 carbohydrates Nutrition 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000012761 co-transfection Methods 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 238000000799 fluorescence microscopy Methods 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 238000010362 genome editing Methods 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 229940088597 hormone Drugs 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 230000001771 impaired effect Effects 0.000 description 3
- 229940125396 insulin Drugs 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 229940079322 interferon Drugs 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 231100000219 mutagenic Toxicity 0.000 description 3
- 230000003505 mutagenic effect Effects 0.000 description 3
- 231100000150 mutagenicity / genotoxicity testing Toxicity 0.000 description 3
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 3
- 239000010452 phosphate Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 230000003389 potentiating effect Effects 0.000 description 3
- 125000002924 primary amino group Chemical class [H]N([H])* 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 239000004017 serum-free culture medium Substances 0.000 description 3
- 125000001424 substituent group Chemical group 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 230000002195 synergetic effect Effects 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 102000003390 tumor necrosis factor Human genes 0.000 description 3
- 238000002255 vaccination Methods 0.000 description 3
- WMBWREPUVVBILR-WIYYLYMNSA-N (-)-Epigallocatechin-3-o-gallate Chemical compound O([C@@H]1CC2=C(O)C=C(C=C2O[C@@H]1C=1C=C(O)C(O)=C(O)C=1)O)C(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-WIYYLYMNSA-N 0.000 description 2
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 2
- GJTBSTBJLVYKAU-XVFCMESISA-N 2-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=S)NC(=O)C=C1 GJTBSTBJLVYKAU-XVFCMESISA-N 0.000 description 2
- PECYZEOJVXMISF-UHFFFAOYSA-N 3-aminoalanine Chemical compound [NH3+]CC(N)C([O-])=O PECYZEOJVXMISF-UHFFFAOYSA-N 0.000 description 2
- QXDXBKZJFLRLCM-UAKXSSHOSA-N 5-hydroxyuridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(O)=C1 QXDXBKZJFLRLCM-UAKXSSHOSA-N 0.000 description 2
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 2
- 108020005176 AU Rich Elements Proteins 0.000 description 2
- 239000012099 Alexa Fluor family Substances 0.000 description 2
- 244000099147 Ananas comosus Species 0.000 description 2
- 235000007119 Ananas comosus Nutrition 0.000 description 2
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 2
- 108010039627 Aprotinin Proteins 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 2
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 2
- 241000555281 Brevibacillus Species 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 102100023804 Coagulation factor VII Human genes 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 2
- 150000008574 D-amino acids Chemical class 0.000 description 2
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 2
- 238000013382 DNA quantification Methods 0.000 description 2
- 241000721047 Danaus plexippus Species 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 238000008157 ELISA kit Methods 0.000 description 2
- 102000003951 Erythropoietin Human genes 0.000 description 2
- 108090000394 Erythropoietin Proteins 0.000 description 2
- 108010023321 Factor VII Proteins 0.000 description 2
- 108010014173 Factor X Proteins 0.000 description 2
- WMBWREPUVVBILR-UHFFFAOYSA-N GCG Natural products C=1C(O)=C(O)C(O)=CC=1C1OC2=CC(O)=CC(O)=C2CC1OC(=O)C1=CC(O)=C(O)C(O)=C1 WMBWREPUVVBILR-UHFFFAOYSA-N 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Natural products C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- 102000001554 Hemoglobins Human genes 0.000 description 2
- 108010054147 Hemoglobins Proteins 0.000 description 2
- 108090000100 Hepatocyte Growth Factor Proteins 0.000 description 2
- 102100021866 Hepatocyte growth factor Human genes 0.000 description 2
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 2
- 108020005350 Initiator Codon Proteins 0.000 description 2
- 229930010555 Inosine Natural products 0.000 description 2
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 2
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 2
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 2
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- 102100025584 Leukocyte immunoglobulin-like receptor subfamily B member 1 Human genes 0.000 description 2
- GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 2
- 241000713333 Mouse mammary tumor virus Species 0.000 description 2
- 102000003505 Myosin Human genes 0.000 description 2
- 108060008487 Myosin Proteins 0.000 description 2
- SLEHROROQDYRAW-KQYNXXCUSA-N N(2)-methylguanosine Chemical compound C1=NC=2C(=O)NC(NC)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O SLEHROROQDYRAW-KQYNXXCUSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108010025020 Nerve Growth Factor Proteins 0.000 description 2
- 102000015336 Nerve Growth Factor Human genes 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 2
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 229940026233 Pfizer-BioNTech COVID-19 vaccine Drugs 0.000 description 2
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 2
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102000029301 Protein S Human genes 0.000 description 2
- 108010066124 Protein S Proteins 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 101150010882 S gene Proteins 0.000 description 2
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 description 2
- 101000953880 Severe acute respiratory syndrome coronavirus 2 Membrane protein Proteins 0.000 description 2
- 102000013275 Somatomedins Human genes 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- 102000009618 Transforming Growth Factors Human genes 0.000 description 2
- 108010009583 Transforming Growth Factors Proteins 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 108020000999 Viral RNA Proteins 0.000 description 2
- 108091093126 WHP Posttranscriptional Response Element Proteins 0.000 description 2
- 108010076089 accutase Proteins 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 238000012867 alanine scanning Methods 0.000 description 2
- 230000000840 anti-viral effect Effects 0.000 description 2
- 229960004405 aprotinin Drugs 0.000 description 2
- 239000012131 assay buffer Substances 0.000 description 2
- 239000012298 atmosphere Substances 0.000 description 2
- 239000005441 aurora Substances 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 125000002619 bicyclic group Chemical group 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 238000005415 bioluminescence Methods 0.000 description 2
- 230000029918 bioluminescence Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 229940077737 brain-derived neurotrophic factor Drugs 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 210000004037 cervical canal epithelial cell Anatomy 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000001086 cytosolic effect Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000007876 drug discovery Methods 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 229940105423 erythropoietin Drugs 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 229940012413 factor vii Drugs 0.000 description 2
- 229940012426 factor x Drugs 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 238000002825 functional assay Methods 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 229940029575 guanosine Drugs 0.000 description 2
- 238000010842 high-capacity cDNA reverse transcription kit Methods 0.000 description 2
- 238000010808 iTaq universal SYBR Green Supermix Kit Methods 0.000 description 2
- 238000010191 image analysis Methods 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000012744 immunostaining Methods 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 108010052968 leupeptin Proteins 0.000 description 2
- GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 2
- 238000012417 linear regression Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 230000012976 mRNA stabilization Effects 0.000 description 2
- 235000013336 milk Nutrition 0.000 description 2
- 239000008267 milk Substances 0.000 description 2
- 210000004080 milk Anatomy 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 229940053128 nerve growth factor Drugs 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 108091027963 non-coding RNA Proteins 0.000 description 2
- 102000042567 non-coding RNA Human genes 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 238000001543 one-way ANOVA Methods 0.000 description 2
- 229960003104 ornithine Drugs 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- XUYJLQHKOGNDPB-UHFFFAOYSA-N phosphonoacetic acid Chemical compound OC(=O)CP(O)(O)=O XUYJLQHKOGNDPB-UHFFFAOYSA-N 0.000 description 2
- 230000004962 physiological condition Effects 0.000 description 2
- INAAIJLSXJJHOZ-UHFFFAOYSA-N pibenzimol Chemical compound C1CN(C)CCN1C1=CC=C(N=C(N2)C=3C=C4NC(=NC4=CC=3)C=3C=CC(O)=CC=3)C2=C1 INAAIJLSXJJHOZ-UHFFFAOYSA-N 0.000 description 2
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000029983 protein stabilization Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000013207 serial dilution Methods 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000011593 sulfur Substances 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- PASOFFRBGIVJET-YRKGHMEHSA-N (2r,3r,4r,5r)-2-(6-aminopurin-9-yl)-5-(hydroxymethyl)-3-methyloxolane-3,4-diol Chemical compound C[C@@]1(O)[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(N)=C2N=C1 PASOFFRBGIVJET-YRKGHMEHSA-N 0.000 description 1
- JPSHPWJJSVEEAX-OWPBQMJCSA-N (2s)-2-amino-4-fluoranylpentanedioic acid Chemical compound OC(=O)[C@@H](N)CC([18F])C(O)=O JPSHPWJJSVEEAX-OWPBQMJCSA-N 0.000 description 1
- BDJISGBETBWCTR-IBZYUGMLSA-N (2s,3r)-2-amino-n-[[9-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-methylsulfanylpurin-6-yl]-methylcarbamoyl]-3-hydroxybutanamide Chemical compound C12=NC(SC)=NC(N(C)C(=O)NC(=O)[C@@H](N)[C@@H](C)O)=C2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O BDJISGBETBWCTR-IBZYUGMLSA-N 0.000 description 1
- GPTUGCGYEMEAOC-IBZYUGMLSA-N (2s,3r)-2-amino-n-[[9-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]purin-6-yl]-methylcarbamoyl]-3-hydroxybutanamide Chemical compound C1=NC=2C(N(C)C(=O)NC(=O)[C@@H](N)[C@H](O)C)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O GPTUGCGYEMEAOC-IBZYUGMLSA-N 0.000 description 1
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- XIJAZGMFHRTBFY-FDDDBJFASA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-λ¹-selanyl-5-(methylaminomethyl)pyrimidin-4-one Chemical compound [Se]C1=NC(=O)C(CNC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 XIJAZGMFHRTBFY-FDDDBJFASA-N 0.000 description 1
- HXVKEKIORVUWDR-FDDDBJFASA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-(methylaminomethyl)-2-sulfanylidenepyrimidin-4-one Chemical compound S=C1NC(=O)C(CNC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 HXVKEKIORVUWDR-FDDDBJFASA-N 0.000 description 1
- UTAIYTHAJQNQDW-KQYNXXCUSA-N 1-methylguanosine Chemical compound C1=NC=2C(=O)N(C)C(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O UTAIYTHAJQNQDW-KQYNXXCUSA-N 0.000 description 1
- WJNGQIYEQLPJMN-IOSLPCCCSA-N 1-methylinosine Chemical compound C1=NC=2C(=O)N(C)C=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O WJNGQIYEQLPJMN-IOSLPCCCSA-N 0.000 description 1
- FPUGCISOLXNPPC-IOSLPCCCSA-N 2'-methoxyadenosine Natural products CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(N)=C2N=C1 FPUGCISOLXNPPC-IOSLPCCCSA-N 0.000 description 1
- TYIRBZOAKBEYEJ-UHFFFAOYSA-N 2-(1,3-dimethyl-2,6-dioxopurin-7-yl)ethyl 2-[1-methyl-5-(4-methylbenzoyl)pyrrol-2-yl]acetate Chemical compound C1=CC(C)=CC=C1C(=O)C(N1C)=CC=C1CC(=O)OCCN1C(C(=O)N(C)C(=O)N2C)=C2N=C1 TYIRBZOAKBEYEJ-UHFFFAOYSA-N 0.000 description 1
- IQZWKGWOBPJWMX-UHFFFAOYSA-N 2-Methyladenosine Natural products C12=NC(C)=NC(N)=C2N=CN1C1OC(CO)C(O)C1O IQZWKGWOBPJWMX-UHFFFAOYSA-N 0.000 description 1
- SOEYIPCQNRSIAV-IOSLPCCCSA-N 2-amino-5-(aminomethyl)-7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound C1=2NC(N)=NC(=O)C=2C(CN)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O SOEYIPCQNRSIAV-IOSLPCCCSA-N 0.000 description 1
- BIRQNXWAXWLATA-IOSLPCCCSA-N 2-amino-7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-4-oxo-1h-pyrrolo[2,3-d]pyrimidine-5-carbonitrile Chemical compound C1=C(C#N)C=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O BIRQNXWAXWLATA-IOSLPCCCSA-N 0.000 description 1
- VWSLLSXLURJCDF-UHFFFAOYSA-N 2-methyl-4,5-dihydro-1h-imidazole Chemical compound CC1=NCCN1 VWSLLSXLURJCDF-UHFFFAOYSA-N 0.000 description 1
- IQZWKGWOBPJWMX-IOSLPCCCSA-N 2-methyladenosine Chemical compound C12=NC(C)=NC(N)=C2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O IQZWKGWOBPJWMX-IOSLPCCCSA-N 0.000 description 1
- QEWSGVMSLPHELX-UHFFFAOYSA-N 2-methylthio-N6-(cis-hydroxyisopentenyl) adenosine Chemical compound C12=NC(SC)=NC(NCC=C(C)CO)=C2N=CN1C1OC(CO)C(O)C1O QEWSGVMSLPHELX-UHFFFAOYSA-N 0.000 description 1
- RHFUOMFWUGWKKO-XVFCMESISA-N 2-thiocytidine Chemical compound S=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 RHFUOMFWUGWKKO-XVFCMESISA-N 0.000 description 1
- YXNIEZJFCGTDKV-JANFQQFMSA-N 3-(3-amino-3-carboxypropyl)uridine Chemical compound O=C1N(CCC(N)C(O)=O)C(=O)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 YXNIEZJFCGTDKV-JANFQQFMSA-N 0.000 description 1
- RDPUKVRQKWBSPK-UHFFFAOYSA-N 3-Methylcytidine Natural products O=C1N(C)C(=N)C=CN1C1C(O)C(O)C(CO)O1 RDPUKVRQKWBSPK-UHFFFAOYSA-N 0.000 description 1
- HOEIPINIBKBXTJ-IDTAVKCVSA-N 3-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-4,6,7-trimethylimidazo[1,2-a]purin-9-one Chemical compound C1=NC=2C(=O)N3C(C)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O HOEIPINIBKBXTJ-IDTAVKCVSA-N 0.000 description 1
- RDPUKVRQKWBSPK-ZOQUXTDFSA-N 3-methylcytidine Chemical compound O=C1N(C)C(=N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 RDPUKVRQKWBSPK-ZOQUXTDFSA-N 0.000 description 1
- FBTSQILOGYXGMD-LURJTMIESA-N 3-nitro-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C([N+]([O-])=O)=C1 FBTSQILOGYXGMD-LURJTMIESA-N 0.000 description 1
- ZLOIGESWDJYCTF-UHFFFAOYSA-N 4-Thiouridine Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- YUDSCJBUWTYENI-VPCXQMTMSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)-2-methyloxolan-2-yl]pyrimidin-2-one Chemical compound C1=CC(N)=NC(=O)N1[C@]1(C)O[C@H](CO)[C@@H](O)[C@H]1O YUDSCJBUWTYENI-VPCXQMTMSA-N 0.000 description 1
- OCMSXKMNYAHJMU-JXOAFFINSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2-oxopyrimidine-5-carbaldehyde Chemical compound C1=C(C=O)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 OCMSXKMNYAHJMU-JXOAFFINSA-N 0.000 description 1
- ZLOIGESWDJYCTF-XVFCMESISA-N 4-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-XVFCMESISA-N 0.000 description 1
- UVGCZRPOXXYZKH-QADQDURISA-N 5-(carboxyhydroxymethyl)uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(C(O)C(O)=O)=C1 UVGCZRPOXXYZKH-QADQDURISA-N 0.000 description 1
- FAWQJBLSWXIJLA-VPCXQMTMSA-N 5-(carboxymethyl)uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(CC(O)=O)=C1 FAWQJBLSWXIJLA-VPCXQMTMSA-N 0.000 description 1
- VSCNRXVDHRNJOA-PNHWDRBUSA-N 5-(carboxymethylaminomethyl)uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(CNCC(O)=O)=C1 VSCNRXVDHRNJOA-PNHWDRBUSA-N 0.000 description 1
- NFEXJLMYXXIWPI-JXOAFFINSA-N 5-Hydroxymethylcytidine Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NFEXJLMYXXIWPI-JXOAFFINSA-N 0.000 description 1
- ZYEWPVTXYBLWRT-UHFFFAOYSA-N 5-Uridinacetamid Natural products O=C1NC(=O)C(CC(=O)N)=CN1C1C(O)C(O)C(CO)O1 ZYEWPVTXYBLWRT-UHFFFAOYSA-N 0.000 description 1
- LOEDKMLIGFMQKR-JXOAFFINSA-N 5-aminomethyl-2-thiouridine Chemical compound S=C1NC(=O)C(CN)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 LOEDKMLIGFMQKR-JXOAFFINSA-N 0.000 description 1
- ZYEWPVTXYBLWRT-VPCXQMTMSA-N 5-carbamoylmethyluridine Chemical compound O=C1NC(=O)C(CC(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZYEWPVTXYBLWRT-VPCXQMTMSA-N 0.000 description 1
- VKLFQTYNHLDMDP-PNHWDRBUSA-N 5-carboxymethylaminomethyl-2-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=S)NC(=O)C(CNCC(O)=O)=C1 VKLFQTYNHLDMDP-PNHWDRBUSA-N 0.000 description 1
- YIZYCHKPHCPKHZ-PNHWDRBUSA-N 5-methoxycarbonylmethyluridine Chemical compound O=C1NC(=O)C(CC(=O)OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 YIZYCHKPHCPKHZ-PNHWDRBUSA-N 0.000 description 1
- ZXIATBNUWJBBGT-JXOAFFINSA-N 5-methoxyuridine Chemical compound O=C1NC(=O)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXIATBNUWJBBGT-JXOAFFINSA-N 0.000 description 1
- SNNBPMAXGYBMHM-JXOAFFINSA-N 5-methyl-2-thiouridine Chemical compound S=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 SNNBPMAXGYBMHM-JXOAFFINSA-N 0.000 description 1
- HXVKEKIORVUWDR-UHFFFAOYSA-N 5-methylaminomethyl-2-thiouridine Natural products S=C1NC(=O)C(CNC)=CN1C1C(O)C(O)C(CO)O1 HXVKEKIORVUWDR-UHFFFAOYSA-N 0.000 description 1
- ZXQHKBUIXRFZBV-FDDDBJFASA-N 5-methylaminomethyluridine Chemical compound O=C1NC(=O)C(CNC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXQHKBUIXRFZBV-FDDDBJFASA-N 0.000 description 1
- ODHCTXKNWHHXJC-VKHMYHEASA-N 5-oxo-L-proline Chemical compound OC(=O)[C@@H]1CCC(=O)N1 ODHCTXKNWHHXJC-VKHMYHEASA-N 0.000 description 1
- 102100023990 60S ribosomal protein L17 Human genes 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 206010066224 Acute post asthmatic amyotrophy Diseases 0.000 description 1
- 101150051188 Adora2a gene Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 101710134784 Agnoprotein Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 1
- 241000219317 Amaranthaceae Species 0.000 description 1
- 208000009575 Angelman syndrome Diseases 0.000 description 1
- 102000006306 Antigen Receptors Human genes 0.000 description 1
- 108010083359 Antigen Receptors Proteins 0.000 description 1
- 108700042778 Antimicrobial Peptides Proteins 0.000 description 1
- 102000044503 Antimicrobial Peptides Human genes 0.000 description 1
- 108010078554 Aromatase Proteins 0.000 description 1
- 102000014654 Aromatase Human genes 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000714230 Avian leukemia virus Species 0.000 description 1
- 241000713826 Avian leukosis virus Species 0.000 description 1
- 102100029822 B- and T-lymphocyte attenuator Human genes 0.000 description 1
- 108010074708 B7-H1 Antigen Proteins 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 241001416152 Bos frontalis Species 0.000 description 1
- 241000711443 Bovine coronavirus Species 0.000 description 1
- 241000219193 Brassicaceae Species 0.000 description 1
- 102100038078 CD276 antigen Human genes 0.000 description 1
- 102100036008 CD48 antigen Human genes 0.000 description 1
- 101150085381 CDC19 gene Proteins 0.000 description 1
- 229940125579 COVID-19 vaccine candidate Drugs 0.000 description 1
- 108091033409 CRISPR Proteins 0.000 description 1
- 108010021064 CTLA-4 Antigen Proteins 0.000 description 1
- 229940045513 CTLA4 antagonist Drugs 0.000 description 1
- 102400000113 Calcitonin Human genes 0.000 description 1
- 108060001064 Calcitonin Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 108010022172 Chitinases Proteins 0.000 description 1
- 102000012286 Chitinases Human genes 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 244000180278 Copernicia prunifera Species 0.000 description 1
- 235000010919 Copernicia prunifera Nutrition 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 102000004420 Creatine Kinase Human genes 0.000 description 1
- 108010042126 Creatine kinase Proteins 0.000 description 1
- 241000235646 Cyberlindnera jadinii Species 0.000 description 1
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108700003861 Dominant Genes Proteins 0.000 description 1
- 101100388059 Drosophila melanogaster PolQ gene Proteins 0.000 description 1
- 101710121765 Endo-1,4-beta-xylanase Proteins 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 108091006020 Fc-tagged proteins Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 1
- 102400000321 Glucagon Human genes 0.000 description 1
- 108060003199 Glucagon Proteins 0.000 description 1
- 101710114810 Glycoprotein Proteins 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 108010051696 Growth Hormone Proteins 0.000 description 1
- 108010007712 Hepatitis A Virus Cellular Receptor 1 Proteins 0.000 description 1
- 102100034459 Hepatitis A virus cellular receptor 1 Human genes 0.000 description 1
- 102100034458 Hepatitis A virus cellular receptor 2 Human genes 0.000 description 1
- 101710083479 Hepatitis A virus cellular receptor 2 homolog Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 1
- 101000864344 Homo sapiens B- and T-lymphocyte attenuator Proteins 0.000 description 1
- 101000716130 Homo sapiens CD48 antigen Proteins 0.000 description 1
- 101000599940 Homo sapiens Interferon gamma Proteins 0.000 description 1
- 101001002657 Homo sapiens Interleukin-2 Proteins 0.000 description 1
- 101000984190 Homo sapiens Leukocyte immunoglobulin-like receptor subfamily B member 1 Proteins 0.000 description 1
- 101000984189 Homo sapiens Leukocyte immunoglobulin-like receptor subfamily B member 2 Proteins 0.000 description 1
- 101000868279 Homo sapiens Leukocyte surface antigen CD47 Proteins 0.000 description 1
- 101000804764 Homo sapiens Lymphotactin Proteins 0.000 description 1
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 1
- 101000605639 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 101000831007 Homo sapiens T-cell immunoreceptor with Ig and ITIM domains Proteins 0.000 description 1
- 101000801742 Homo sapiens Triosephosphate isomerase Proteins 0.000 description 1
- 101000863873 Homo sapiens Tyrosine-protein phosphatase non-receptor type substrate 1 Proteins 0.000 description 1
- 101000666896 Homo sapiens V-type immunoglobulin domain-containing suppressor of T-cell activation Proteins 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 102100034980 ICOS ligand Human genes 0.000 description 1
- 102000037982 Immune checkpoint proteins Human genes 0.000 description 1
- 108091008036 Immune checkpoint proteins Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 241000701460 JC polyomavirus Species 0.000 description 1
- 102000002698 KIR Receptors Human genes 0.000 description 1
- 108010043610 KIR Receptors Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical compound OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- MRAUNPAHJZDYCK-BYPYZUCNSA-N L-nitroarginine Chemical compound OC(=O)[C@@H](N)CCCNC(=N)N[N+]([O-])=O MRAUNPAHJZDYCK-BYPYZUCNSA-N 0.000 description 1
- 102000017578 LAG3 Human genes 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000208822 Lactuca Species 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 101150030213 Lag3 gene Proteins 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- 102100025583 Leukocyte immunoglobulin-like receptor subfamily B member 2 Human genes 0.000 description 1
- 101710145805 Leukocyte immunoglobulin-like receptor subfamily B member 3 Proteins 0.000 description 1
- 102100032913 Leukocyte surface antigen CD47 Human genes 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 102100035304 Lymphotactin Human genes 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 101710085938 Matrix protein Proteins 0.000 description 1
- 238000003820 Medium-pressure liquid chromatography Methods 0.000 description 1
- 108010061593 Member 14 Tumor Necrosis Factor Receptors Proteins 0.000 description 1
- 101710127721 Membrane protein Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 1
- 101100407308 Mus musculus Pdcd1lg2 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- RSPURTUNRHNVGF-IOSLPCCCSA-N N(2),N(2)-dimethylguanosine Chemical compound C1=NC=2C(=O)NC(N(C)C)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O RSPURTUNRHNVGF-IOSLPCCCSA-N 0.000 description 1
- NIDVTARKFBZMOT-PEBGCTIMSA-N N(4)-acetylcytidine Chemical compound O=C1N=C(NC(=O)C)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NIDVTARKFBZMOT-PEBGCTIMSA-N 0.000 description 1
- WVGPGNPCZPYCLK-WOUKDFQISA-N N(6),N(6)-dimethyladenosine Chemical compound C1=NC=2C(N(C)C)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O WVGPGNPCZPYCLK-WOUKDFQISA-N 0.000 description 1
- XKOPXXUOWZQFQE-SDBHATRESA-N N(6)-isopentenyladenosine Chemical compound C1=NC=2C(NCCC(=C)C)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O XKOPXXUOWZQFQE-SDBHATRESA-N 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- UNUYMBPXEFMLNW-DWVDDHQFSA-N N-[(9-beta-D-ribofuranosylpurin-6-yl)carbamoyl]threonine Chemical compound C1=NC=2C(NC(=O)N[C@@H]([C@H](O)C)C(O)=O)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O UNUYMBPXEFMLNW-DWVDDHQFSA-N 0.000 description 1
- LZCNWAXLJWBRJE-ZOQUXTDFSA-N N4-Methylcytidine Chemical compound O=C1N=C(NC)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 LZCNWAXLJWBRJE-ZOQUXTDFSA-N 0.000 description 1
- GOSWTRUMMSCNCW-UHFFFAOYSA-N N6-(cis-hydroxyisopentenyl)adenosine Chemical compound C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1OC(CO)C(O)C1O GOSWTRUMMSCNCW-UHFFFAOYSA-N 0.000 description 1
- 102100029527 Natural cytotoxicity triggering receptor 3 ligand 1 Human genes 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 101100234604 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) ace-8 gene Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- LYNKVJADAPZJIK-UHFFFAOYSA-H P([O-])([O-])=O.[B+3].P([O-])([O-])=O.P([O-])([O-])=O.[B+3] Chemical compound P([O-])([O-])=O.[B+3].P([O-])([O-])=O.P([O-])([O-])=O.[B+3] LYNKVJADAPZJIK-UHFFFAOYSA-H 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 1
- 101150093629 PYK1 gene Proteins 0.000 description 1
- 102000003982 Parathyroid hormone Human genes 0.000 description 1
- 108090000445 Parathyroid hormone Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 102100038332 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Human genes 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241001135989 Porcine reproductive and respiratory syndrome virus Species 0.000 description 1
- 241001672814 Porcine teschovirus 1 Species 0.000 description 1
- 241001533393 Potyviridae Species 0.000 description 1
- 101710124584 Probable DNA-binding protein Proteins 0.000 description 1
- 108700030875 Programmed Cell Death 1 Ligand 2 Proteins 0.000 description 1
- 102100024216 Programmed cell death 1 ligand 1 Human genes 0.000 description 1
- 102100024213 Programmed cell death 1 ligand 2 Human genes 0.000 description 1
- 108010057464 Prolactin Proteins 0.000 description 1
- 102100024819 Prolactin Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- ODHCTXKNWHHXJC-GSVOUGTGSA-N Pyroglutamic acid Natural products OC(=O)[C@H]1CCC(=O)N1 ODHCTXKNWHHXJC-GSVOUGTGSA-N 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 238000001069 Raman spectroscopy Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 235000004789 Rosa xanthina Nutrition 0.000 description 1
- 241000220222 Rosaceae Species 0.000 description 1
- 108091005774 SARS-CoV-2 proteins Proteins 0.000 description 1
- 101100010928 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) tuf gene Proteins 0.000 description 1
- 108010077895 Sarcosine Proteins 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 244000000231 Sesamum indicum Species 0.000 description 1
- 235000003434 Sesamum indicum Nutrition 0.000 description 1
- 101001024637 Severe acute respiratory syndrome coronavirus 2 Nucleoprotein Proteins 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 241000208292 Solanaceae Species 0.000 description 1
- 102000005157 Somatostatin Human genes 0.000 description 1
- 108010056088 Somatostatin Proteins 0.000 description 1
- 102100038803 Somatotropin Human genes 0.000 description 1
- 101710167605 Spike glycoprotein Proteins 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 102100039367 T-cell immunoglobulin and mucin domain-containing protein 4 Human genes 0.000 description 1
- 101710174757 T-cell immunoglobulin and mucin domain-containing protein 4 Proteins 0.000 description 1
- 229940126547 T-cell immunoglobulin mucin-3 Drugs 0.000 description 1
- 102100024834 T-cell immunoreceptor with Ig and ITIM domains Human genes 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101150001810 TEAD1 gene Proteins 0.000 description 1
- 101150074253 TEF1 gene Proteins 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical group OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102000036693 Thrombopoietin Human genes 0.000 description 1
- 108010041111 Thrombopoietin Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108090000373 Tissue Plasminogen Activator Proteins 0.000 description 1
- 102000003978 Tissue Plasminogen Activator Human genes 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 102100029898 Transcriptional enhancer factor TEF-1 Human genes 0.000 description 1
- 108060008539 Transglutaminase Proteins 0.000 description 1
- 102100033598 Triosephosphate isomerase Human genes 0.000 description 1
- 102100028785 Tumor necrosis factor receptor superfamily member 14 Human genes 0.000 description 1
- 102100029948 Tyrosine-protein phosphatase non-receptor type substrate 1 Human genes 0.000 description 1
- 102100038929 V-set domain-containing T-cell activation inhibitor 1 Human genes 0.000 description 1
- 102100038282 V-type immunoglobulin domain-containing suppressor of T-cell activation Human genes 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 241000711975 Vesicular stomatitis virus Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 229940118555 Viral entry inhibitor Drugs 0.000 description 1
- JCZSFCLRSONYLH-UHFFFAOYSA-N Wyosine Natural products N=1C(C)=CN(C(C=2N=C3)=O)C=1N(C)C=2N3C1OC(CO)C(O)C1O JCZSFCLRSONYLH-UHFFFAOYSA-N 0.000 description 1
- YXNIEZJFCGTDKV-UHFFFAOYSA-N X-Nucleosid Natural products O=C1N(CCC(N)C(O)=O)C(=O)C=CN1C1C(O)C(O)C(CO)O1 YXNIEZJFCGTDKV-UHFFFAOYSA-N 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- ODHCTXKNWHHXJC-UHFFFAOYSA-N acide pyroglutamique Natural products OC(=O)C1CCC(=O)N1 ODHCTXKNWHHXJC-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N adenyl group Chemical group N1=CN=C2N=CNC2=C1N GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- PPQRONHOSHZGFQ-LMVFSUKVSA-N aldehydo-D-ribose 5-phosphate Chemical group OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PPQRONHOSHZGFQ-LMVFSUKVSA-N 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- 230000001772 anti-angiogenic effect Effects 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000003466 anticipated effect Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 230000000845 anti-microbial effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000005975 antitumor immune response Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- MVCRZALXJBDOKF-JPZHCBQBSA-N beta-hydroxywybutosine 5'-monophosphate Chemical compound C1=NC=2C(=O)N3C(CC(O)[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O MVCRZALXJBDOKF-JPZHCBQBSA-N 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 229930189065 blasticidin Natural products 0.000 description 1
- 238000011094 buffer selection Methods 0.000 description 1
- 229960004015 calcitonin Drugs 0.000 description 1
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 210000002230 centromere Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000010382 chemical cross-linking Methods 0.000 description 1
- 108091006116 chimeric peptides Proteins 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 230000008045 co-localization Effects 0.000 description 1
- 230000003920 cognitive function Effects 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010205 computational analysis Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 150000004696 coordination complex Chemical class 0.000 description 1
- 229960003624 creatine Drugs 0.000 description 1
- 239000006046 creatine Substances 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 101150110403 cspA gene Proteins 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical group O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000000326 densitometry Methods 0.000 description 1
- ANCLJVISBRWUTR-UHFFFAOYSA-N diaminophosphinic acid Chemical compound NP(N)(O)=O ANCLJVISBRWUTR-UHFFFAOYSA-N 0.000 description 1
- ZPTBLXKRQACLCR-XVFCMESISA-N dihydrouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)CC1 ZPTBLXKRQACLCR-XVFCMESISA-N 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 229940030275 epigallocatechin gallate Drugs 0.000 description 1
- 230000008472 epithelial growth Effects 0.000 description 1
- RRCFLRBBBFZLSB-XIFYLAFSSA-N epoxyqueuosine Chemical compound C1=C(CN[C@@H]2[C@H]([C@@H](O)[C@@H]3O[C@@H]32)O)C=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O RRCFLRBBBFZLSB-XIFYLAFSSA-N 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000002073 fluorescence micrograph Methods 0.000 description 1
- IJJVMEJXYNJXOJ-UHFFFAOYSA-N fluquinconazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1N1C(=O)C2=CC(F)=CC=C2N=C1N1C=NC=N1 IJJVMEJXYNJXOJ-UHFFFAOYSA-N 0.000 description 1
- 229960005102 foscarnet Drugs 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 1
- 229960004666 glucagon Drugs 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000000122 growth hormone Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 125000000625 hexosyl group Chemical group 0.000 description 1
- 102000043557 human IFNG Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 150000002431 hydrogen Chemical class 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000005660 hydrophilic surface Effects 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 230000017730 intein-mediated protein splicing Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000031852 maintenance of location in cell Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000006993 memory improvement Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 229910044991 metal oxide Inorganic materials 0.000 description 1
- 150000004706 metal oxides Chemical class 0.000 description 1
- GWKIZNPISGBQGY-GNLDREGESA-N methyl (2S)-4-[4,6-dimethyl-9-oxo-3-[(2R,3R,4S,5R)-2,3,4-trihydroxy-5-(hydroxymethyl)oxolan-2-yl]imidazo[1,2-a]purin-7-yl]-2-(methoxycarbonylamino)butanoate Chemical class O[C@@]1([C@H](O)[C@H](O)[C@@H](CO)O1)N1C=NC=2C(=O)N3C(CC[C@@H](C(=O)OC)NC(=O)OC)=C(C)N=C3N(C)C21 GWKIZNPISGBQGY-GNLDREGESA-N 0.000 description 1
- KTKIKSMBDRMPBG-PNHWDRBUSA-N methyl 2-[1-[(2r,3r,4r,5r)-4-hydroxy-5-(hydroxymethyl)-3-sulfanyloxolan-2-yl]-2,4-dioxopyrimidin-5-yl]acetate Chemical compound O=C1NC(=O)C(CC(=O)OC)=CN1[C@H]1[C@H](S)[C@H](O)[C@@H](CO)O1 KTKIKSMBDRMPBG-PNHWDRBUSA-N 0.000 description 1
- JNVLKTZUCGRYNN-LQGIRWEJSA-N methyl 2-[1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-2,4-dioxopyrimidin-5-yl]-2-hydroxyacetate Chemical compound O=C1NC(=O)C(C(O)C(=O)OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 JNVLKTZUCGRYNN-LQGIRWEJSA-N 0.000 description 1
- WCNMEQDMUYVWMJ-UHFFFAOYSA-N methyl 4-[3-[3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-4,6-dimethyl-9-oxoimidazo[1,2-a]purin-7-yl]-3-hydroperoxy-2-(methoxycarbonylamino)butanoate Chemical compound C1=NC=2C(=O)N3C(CC(C(NC(=O)OC)C(=O)OC)OO)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O WCNMEQDMUYVWMJ-UHFFFAOYSA-N 0.000 description 1
- WZRYXYRWFAPPBJ-PNHWDRBUSA-N methyl uridin-5-yloxyacetate Chemical compound O=C1NC(=O)C(OCC(=O)OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 WZRYXYRWFAPPBJ-PNHWDRBUSA-N 0.000 description 1
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-L methylphosphonate(2-) Chemical compound CP([O-])([O-])=O YACKEPLHDIMKIO-UHFFFAOYSA-L 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229960005030 other vaccine in atc Drugs 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229960001319 parathyroid hormone Drugs 0.000 description 1
- 239000000199 parathyroid hormone Substances 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000003950 pathogenic mechanism Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 230000003285 pharmacodynamic effect Effects 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-L phosphoramidate Chemical compound NP([O-])([O-])=O PTMHPRAIXMAOOB-UHFFFAOYSA-L 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 125000005642 phosphothioate group Chemical group 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- 229940097325 prolactin Drugs 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 238000001711 protein immunostaining Methods 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- QQXQGKSPIMGUIZ-AEZJAUAXSA-N queuosine Chemical compound C1=2C(=O)NC(N)=NC=2N([C@H]2[C@@H]([C@H](O)[C@@H](CO)O2)O)C=C1CN[C@H]1C=C[C@H](O)[C@@H]1O QQXQGKSPIMGUIZ-AEZJAUAXSA-N 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008960 regulation of mRNA stability Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 208000023504 respiratory system disease Diseases 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- DWRXFEITVBNRMK-JXOAFFINSA-N ribothymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 DWRXFEITVBNRMK-JXOAFFINSA-N 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- RHFUOMFWUGWKKO-UHFFFAOYSA-N s2C Natural products S=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 RHFUOMFWUGWKKO-UHFFFAOYSA-N 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 238000003118 sandwich ELISA Methods 0.000 description 1
- 229940043230 sarcosine Drugs 0.000 description 1
- 238000010206 sensitivity analysis Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 210000001812 small ribosome subunit Anatomy 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 1
- 229960000553 somatostatin Drugs 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 108700004027 tat Genes Proteins 0.000 description 1
- 101150098170 tat gene Proteins 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 229960000187 tissue plasminogen activator Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 102000003601 transglutaminase Human genes 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 230000010415 tropism Effects 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- RVCNQQGZJWVLIP-VPCXQMTMSA-N uridin-5-yloxyacetic acid Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(OCC(O)=O)=C1 RVCNQQGZJWVLIP-VPCXQMTMSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- YIZYCHKPHCPKHZ-UHFFFAOYSA-N uridine-5-acetic acid methyl ester Natural products COC(=O)Cc1cn(C2OC(CO)C(O)C2O)c(=O)[nH]c1=O YIZYCHKPHCPKHZ-UHFFFAOYSA-N 0.000 description 1
- 108700001624 vesicular stomatitis virus G Proteins 0.000 description 1
- 230000004095 viral genome expression Effects 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- QAOHCFGKCWTBGC-QHOAOGIMSA-N wybutosine Chemical compound C1=NC=2C(=O)N3C(CC[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O QAOHCFGKCWTBGC-QHOAOGIMSA-N 0.000 description 1
- QAOHCFGKCWTBGC-UHFFFAOYSA-N wybutosine Natural products C1=NC=2C(=O)N3C(CCC(NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O QAOHCFGKCWTBGC-UHFFFAOYSA-N 0.000 description 1
- JCZSFCLRSONYLH-QYVSTXNMSA-N wyosin Chemical compound N=1C(C)=CN(C(C=2N=C3)=O)C=1N(C)C=2N3[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O JCZSFCLRSONYLH-QYVSTXNMSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6897—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids involving reporter genes operably linked to promoters
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1002—Coronaviridae
- C07K16/1003—Severe acute respiratory syndrome coronavirus 2 [SARS‐CoV‐2 or Covid-19]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/10—Immunoglobulins specific features characterized by their source of isolation or production
- C07K2317/14—Specific host cells or culture conditions, e.g. components, pH or temperature
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/35—Fusion polypeptide containing a fusion for enhanced stability/folding during expression, e.g. fusions with chaperones or thioredoxin
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16041—Use of virus, viral particle or viral elements as a vector
- C12N2740/16043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/15—Vector systems having a special element relevant for transcription chimeric enhancer/promoter combination
Definitions
- This disclosure relates to novel oligonucleotides, peptide tag(s) having specified short nucleotide sequences or derivatives thereof as well as the native untranslated region (UTR) of SARS-CoV-2 (snUTR).
- Methods utilizing these novel molecules include enhancing production of the targeted proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies, and engineered recombinant proteins, in a cell in vitro, ex vivo and in vivo.
- peptide (epitope) tags such as Flag, Myc, HA, Ollas, V5, His, C7, and T7 have demonstrated functions in protein labeling, affinity purification, and immune detection (DeCaprio and Kohl, 2019; Katayama et al., 2021; Lee et al., 2020; Mishra, 2020; Peighambardoust et al., 2021; Pina et al., 2021; Traenkle et al., 2020).
- no tagging peptides have been identified that enhance the expression/production of the targeted proteins in mammalian cells.
- the 5’-UTR within the SARS-CoV-2 genome is critical for initiating the generation of the entire set of genomic and subgenomic transcripts (Baldassarre et al., 2020; Yang and Leibowitz, 2015).
- the 3’-UTR also regulates the viral genome expression and replication (Chan et al., 2020; Zhao et al.,
- the 5’-UTR and 3’-UTR are highly conserved among SARS-CoV genomes and their variants (Baldassarre et al., 2020; Bottaro et al., 2021; Rangan et al., 2020; Rouchka et al., 2020; Ryder et al., 2021; Yang and Leibowitz, 2015). Recent computational studies have identified a very stable four-way junction in the 5’-UTR close to the AUG start codon (Miao et al., 2020).
- Embodiments are directed to novel chimeric molecules comprising an oligonucleotide comprising a cis-regulatory coding motif, a peptide tag, a 5’-untranslated region (5’-UTR), a 3’-untranslated region (3’-UTR) and combinations thereof for use in the enhanced production and expression of a desired biomolecule.
- the synergistic boosting effect observed has extensive applications and broad research interest. For industrial applications, the strategy will reduce the cost of many widely used products and facilitate their availability, such as vaccines, antibodies, recombinant proteins, and therapeutic gene products. An immediate and highly important usage of this system would be to boost mRNA vaccines against COVID-19 variants.
- a composition comprises an expression-enhancing oligonucleotide that has between 15 and 30 nucleic acid bases and includes a cis-regulatory coding motif located in the coding region and kept in the open reading frame (ORF) of the targeted gene.
- the expression-enhancing oligonucleotide comprises twenty-one nucleic acid bases.
- the expression-enhancing oligonucleotide comprises a nucleic acid sequence having at least a 75% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the expression-enhancing oligonucleotide comprises a nucleic acid sequence having at least a 95% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the expression-enhancing oligonucleotide comprises a nucleic acid sequence comprising cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7).
- a synthetic oligonucleotide comprises a nucleic acid sequence having at least a 75% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the synthetic oligonucleotide comprises a nucleic acid sequence having at least a 95% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the oligonucleotide comprises a nucleic acid sequence comprising cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7).
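As a rough illustration of how the identity thresholds above could be checked, the following sketch computes an ungapped, position-by-position percent identity against SEQ ID NO: 7. The comparison method is an assumption (the passage above does not name an alignment algorithm), and `percent_identity` and `min_matches` are hypothetical helper names.

```python
SEQ_ID_NO_7 = "cAACCGCGGTTCGCGGCCGCT"  # 21 bases, as written in the text


def percent_identity(candidate: str, reference: str = SEQ_ID_NO_7) -> float:
    """Ungapped percent identity between two equal-length sequences.

    Assumption: sequences are compared position by position, ignoring case.
    A real analysis would normally use a pairwise alignment tool instead.
    """
    a, b = candidate.upper(), reference.upper()
    if len(a) != len(b):
        raise ValueError("this sketch only compares equal-length sequences")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(b)


def min_matches(threshold_pct: int, length: int = len(SEQ_ID_NO_7)) -> int:
    """Smallest number of matching positions that reaches threshold_pct."""
    # Integer ceiling of threshold_pct * length / 100.
    return (threshold_pct * length + 99) // 100
```

Under this ungapped assumption, a 21-base candidate needs at least 16 matching positions to reach 75% identity and at least 20 to reach 95%.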
- the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the oligonucleotide encodes a peptide comprising the amino acid sequence QPRFAAA (SEQ ID NO: 1).
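The relationship between the 21-base oligonucleotide (SEQ ID NO: 7) and the seven-residue peptide (SEQ ID NO: 1) can be checked by translating the oligonucleotide with the standard genetic code. The sketch below is a minimal translator written for this purpose; the function names are hypothetical, and reading from the first base in frame is an assumption consistent with the statement that the motif stays in the open reading frame.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, reading from the first base."""
    seq = dna.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def keeps_reading_frame(dna: str) -> bool:
    """True if the insert is a whole number of codons with no stop codon."""
    return len(dna) % 3 == 0 and "*" not in translate(dna)


peptide = translate("cAACCGCGGTTCGCGGCCGCT")  # SEQ ID NO: 7
```

Here `peptide` evaluates to `QPRFAAA`, and `keeps_reading_frame` returns `True` for SEQ ID NO: 7 because it is exactly seven codons with no stop codon.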
- a construct comprises the synthetic oligonucleotide embodied herein.
- a chimeric molecule comprises one or more peptide domains and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof.
- the one or more peptide domains comprise from about five amino acids to about twenty amino acids. In certain embodiments, the one or more peptide domains comprise about seven amino acids.
- the one or more peptide domains comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the peptide comprises an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
- the peptide comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1).
- the peptide comprises Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
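The flanked form of the tag (any number of residues on either side of the QPRFAAA core) can be sketched as a pattern match. This is only an illustration: the one-letter alphabet below, including U for selenocysteine and O for pyrrolysine, is an assumption, modified amino acids are not represented, and `split_flanks` is a hypothetical helper name.

```python
import re

CORE = "QPRFAAA"  # SEQ ID NO: 1
# 20 standard residues plus U (selenocysteine) and O (pyrrolysine).
RESIDUES = "ACDEFGHIKLMNPQRSTVWYUO"
_PATTERN = re.compile(rf"([{RESIDUES}]*?){CORE}([{RESIDUES}]*)")


def split_flanks(peptide: str):
    """Return (N-terminal flank, C-terminal flank) around the first core
    occurrence, or None if the peptide does not fit the flanked-core form."""
    match = _PATTERN.fullmatch(peptide.upper())
    return (match.group(1), match.group(2)) if match else None
```

For example, `split_flanks("GSQPRFAAAK")` returns `("GS", "K")`, while a peptide lacking the full core returns `None`.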
- the one or more 5’-untranslated region (UTR) sequences or fragments thereof are derived from one or more viruses.
- the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- the 5’-UTR and/or 3’-UTR are from a coronavirus.
- the coronavirus is SARS-CoV-2.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
- the chimeric molecule further comprises one or more biomolecules operably linked to the one or more peptide domains and/or the one or more 5’-UTR and/or 3’-UTR sequences.
- the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
- a host cell comprises an oligonucleotide embodied herein, or a chimeric molecule embodied herein.
- a construct encodes an oligonucleotide embodied herein, or a chimeric molecule embodied herein.
- a method of enhancing production of biomolecules comprises tagging a desired peptide or a nucleic acid sequence with the chimeric molecule of any one of claims 1-34, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein.
- the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- a nucleic acid comprises a promoter, a 5’-untranslated region (5’-UTR) sequence, a biomolecule of interest, an oligonucleotide comprising a cis-regulatory coding motif, a 3’-untranslated region (3’-UTR) sequence and combinations thereof.
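One way to picture the listed elements is as an ordered cassette. The sketch below assumes the elements are joined in the order they are named (promoter, 5’-UTR, coding sequence of the biomolecule, tag oligonucleotide, 3’-UTR) and that a stop codon follows the tag; the source does not confirm this layout. The `Cassette` class, its field names, and the `TAA` default are hypothetical, and callers must supply their own sequences.

```python
from dataclasses import dataclass

TAG_OLIGO = "CAACCGCGGTTCGCGGCCGCT"  # SEQ ID NO: 7, upper-cased


@dataclass(frozen=True)
class Cassette:
    """Hypothetical container for the elements named in the text."""
    promoter: str
    utr5: str
    coding: str          # coding sequence of the biomolecule, no stop codon
    utr3: str
    tag: str = TAG_OLIGO
    stop: str = "TAA"    # assumed stop codon placed after the tag

    def orf(self) -> str:
        """Coding sequence followed by the tag and the stop codon."""
        return self.coding + self.tag + self.stop

    def tag_in_frame(self) -> bool:
        """True if both the coding sequence and the tag are whole codons."""
        return len(self.coding) % 3 == 0 and len(self.tag) % 3 == 0

    def assemble(self) -> str:
        """Concatenate all elements in the assumed 5' to 3' order."""
        return self.promoter + self.utr5 + self.orf() + self.utr3
```

Checking `tag_in_frame()` before assembly guards against a frameshift between the coding sequence and the tag, which would change the encoded peptide.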
- the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof are derived from one or more viruses.
- the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- the 5’-UTR and/or 3’-UTR are derived from a coronavirus.
- the coronavirus is SARS-CoV-2.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
- a chimeric molecule comprises one or more oligonucleotides comprising a nucleic acid sequence of cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7) and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof.
- the one or more oligonucleotides encode a peptide comprising from about five amino acids to about twenty amino acids.
- the one or more peptides comprise about seven amino acids.
- the one or more peptides comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
- the one or more peptides comprise an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptides comprise the amino acid sequence QPRFAAA (SEQ ID NO: 1).
- the one or more peptides comprise a sequence comprising Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
- the one or more 5’-untranslated region (UTR) sequences or fragments thereof are derived from one or more viruses.
- the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- the 5’-UTR and/or 3’-UTR are from a coronavirus.
- the coronavirus is SARS-CoV-2.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
- the chimeric molecule further comprises one or more biomolecules operably linked to the one or more oligonucleotides and/or the one or more 5’-UTR and/or 3’-UTR sequences.
- the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
- an expression vector comprises the nucleic acids embodied herein.
- a novel peptide tag comprises a specified short amino acid sequence or its derivative. In certain embodiments the peptide tag is about 5 to about 10 amino acids in length. In certain embodiments, the peptide tag is about 7 amino acids in length. In certain embodiments, the peptide tag comprises two or more tandem repeats of peptides.
- a synthetic peptide tag comprises an amino acid sequence unit of about five to about fifteen amino acids wherein the N-terminal and/or C-terminal amino acids are linked or fused to a target molecule.
- the amino acid sequence unit comprises seven amino acids.
- the amino acid sequence comprises at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
- the amino acid sequence comprises at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
- the amino acid sequence comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1).
- the amino acid sequence comprises a peptide domain comprising Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
- the synthetic peptide tag further comprises a plurality of repeating amino acid sequence units.
- the repeating amino acid sequence units are in tandem.
- the amino acid sequence units are separated by linker molecules or one or more amino acids.
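The identity thresholds and tandem arrangements recited above can be illustrated with a minimal Python sketch. It assumes an ungapped, position-by-position comparison between peptides of equal length, which is only one of several ways sequence identity may be computed; the function names and the `GGS` linker are hypothetical and are not taken from the text.

```python
REFERENCE_TAG = "QPRFAAA"  # SEQ ID NO: 1 as given in the text


def percent_identity(candidate: str, reference: str = REFERENCE_TAG) -> float:
    """Ungapped, position-by-position identity between equal-length peptides."""
    if len(candidate) != len(reference):
        raise ValueError("this sketch only compares peptides of equal length")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)


def tandem_tag(unit: str = REFERENCE_TAG, repeats: int = 2, linker: str = "") -> str:
    """Join `repeats` copies of `unit`, optionally separated by `linker`."""
    if repeats < 1:
        raise ValueError("repeats must be at least 1")
    return linker.join([unit] * repeats)


# Hypothetical usage; "GGS" is an illustrative linker, not one named in the text.
two_units_with_linker = tandem_tag(repeats=2, linker="GGS")
```

Under this counting, a seven-residue unit with one substitution scores 6/7 (about 85.7%) and one with two substitutions scores 5/7 (about 71.4%).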
- a synthetic peptide comprises the structure: (AA-AA-AA-AA-AA-AAz-AAz)x, wherein x is greater than or equal to 1, z is 0 or 1 and each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
- a synthetic peptide comprises the structure: AA1-AA2-AA3-AA4-AA5-AA6-AA7, wherein each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
- a synthetic peptide comprises an amino acid sequence comprising the structure: Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
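As a sketch of how the Xn-QPRFAAA-Xn structure might be checked in practice, the following Python example matches a peptide written in one-letter codes against the motif and returns its two flanks. It assumes the 20 standard residues plus U (selenocysteine) and O (pyrrolysine); modified amino acids have no one-letter code and are not covered. The function name is hypothetical.

```python
import re

# One-letter codes: the 20 standard amino acids plus U (selenocysteine)
# and O (pyrrolysine). Modified amino acids are outside this sketch.
RESIDUES = "ACDEFGHIKLMNPQRSTVWY" + "UO"
MOTIF = re.compile(rf"^([{RESIDUES}]*)QPRFAAA([{RESIDUES}]*)$")


def split_flanks(peptide: str):
    """Return (N-terminal flank, C-terminal flank) if the peptide fits
    Xn-QPRFAAA-Xn, otherwise None. Either flank may be empty (n = 0)."""
    match = MOTIF.match(peptide.upper())
    return (match.group(1), match.group(2)) if match else None
```

Because each flank may be empty, the bare heptapeptide also satisfies the pattern, which corresponds to n being 0 on both sides.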
- a fusion protein comprises a synthetic peptide embodied herein fused to one or more target peptides.
- two or more synthetic peptides embodied herein are fused to a target peptide.
- a fusion molecule comprises a synthetic peptide embodied herein fused to one or more biomolecules.
- the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- a method of enhancing production of proteins comprises tagging a desired peptide or a nucleic acid sequence with the peptide tag embodied herein, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein.
- the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- a composition comprises a peptide-tagged biomolecule embodied herein and a pharmaceutically acceptable excipient, diluent or carrier.
- nucleic acid encodes the peptide tags embodied herein.
- an expression vector comprises a nucleic acid encoding the peptide tags embodied herein.
- a host cell comprises the expression vector encoding the peptide tags embodied herein.
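A nucleic acid encoding the QPRFAAA tag can be sketched by assigning one codon to each residue. The codons below are valid in the standard genetic code, but the particular choices are an assumption made for illustration; the text does not specify codon usage, and `encode_tag` is a hypothetical helper.

```python
# One codon per residue, taken from the standard genetic code.
# The specific codon choices are illustrative assumptions only.
CODON_FOR = {"Q": "CAG", "P": "CCC", "R": "CGG", "F": "TTC", "A": "GCC"}


def encode_tag(peptide: str = "QPRFAAA") -> str:
    """Return a DNA coding sequence (no start or stop codon) for the tag."""
    return "".join(CODON_FOR[residue] for residue in peptide)


tag_dna = encode_tag()
```

A seven-residue tag corresponds to 21 nucleotides, so fusing it in frame to a coding sequence preserves the reading frame of the tagged protein.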
- a method of utilizing the peptide tag(s) comprises enhancing production of the tagged proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies and engineered recombinant proteins, in a cell in vitro, ex vivo and in vivo.
- tandem peptide repeats further boost production of a targeted molecule.
- a method of increasing protein production in a cell comprises tagging a target molecule in the cell.
- a chimeric molecule comprises one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof associated with one or more biomolecules.
- the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof are derived from one or more viruses.
- the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- the 5’-UTR and/or 3’-UTR are from a coronavirus.
- the coronavirus is SARS-CoV-2.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
- the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
- a host cell comprises the chimeric molecule embodied herein.
- a construct encodes the chimeric molecules embodied herein.
- a method of enhancing production of biomolecules comprises tagging a desired peptide or a nucleic acid sequence with the chimeric molecules embodied herein, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein.
- the proteins comprise: oligonucleotides, polynucleotides, mRNA vaccines, DNA vaccines, viral transcripts/proteins, antibodies, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
- a nucleic acid comprises a promoter, a 5’-untranslated region (5’-UTR) sequence, a biomolecule of interest, a peptide domain, a 3’-untranslated region (3’-UTR) sequence and combinations thereof.
- the one or more 5’-untranslated region (UTR) sequences or fragments thereof are derived from one or more viruses.
- the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- the 5’-UTR and/or 3’-UTR are derived from a coronavirus.
- the coronavirus is SARS-CoV-2.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
- the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
- the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
- an expression vector comprises the nucleic acids embodied herein.
- a host cell comprises the nucleic acids or expression vectors embodied herein.
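The element order recited for the nucleic acid (promoter, 5’-UTR, biomolecule of interest, peptide domain, 3’-UTR) can be modeled with a small Python data structure. This is a sketch under stated assumptions: the tag is placed at the C-terminal end of the coding sequence (the text also allows N-terminal placement), the stop codon `TAA` is an arbitrary choice, and all sequences in the usage example are placeholders rather than real promoter or UTR sequences.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Construct:
    promoter: str
    utr5: str
    coding_sequence: str  # biomolecule of interest, start codon included, no stop codon
    tag: str              # sequence encoding the peptide domain
    utr3: str
    stop_codon: str = "TAA"  # arbitrary choice for this sketch

    def open_reading_frame(self) -> str:
        # Assumption: the tag is fused in frame at the C-terminal end.
        return self.coding_sequence + self.tag + self.stop_codon

    def in_frame(self) -> bool:
        """True when the open reading frame is a whole number of codons."""
        return len(self.open_reading_frame()) % 3 == 0

    def assemble(self) -> str:
        """Concatenate all elements 5' to 3'."""
        return self.promoter + self.utr5 + self.open_reading_frame() + self.utr3


# Placeholder sequences only ("N" stands for any nucleotide).
example = Construct(
    promoter="N" * 8,
    utr5="N" * 5,
    coding_sequence="ATGGCC",
    tag="CAGCCCCGGTTCGCCGCCGCC",
    utr3="N" * 5,
)
```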
- the term “about” or “approximately” means within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” can mean within 1 or more than 1 standard deviation, per the practice in the art. Alternatively, “about” can mean a range of up to 20%, up to 10%, up to 5%, or up to 1% of a given value or range. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude within 5-fold, and also within 2-fold, of a value.
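The percentage and fold readings of “about” given above can be written as simple range checks. The sketch below is illustrative only; the function names are hypothetical and the default tolerance of 20% is just the widest of the percentages listed.

```python
def within_about(value: float, target: float, tolerance: float = 0.20) -> bool:
    """True if `value` lies within `tolerance` (a fraction) of `target`."""
    return abs(value - target) <= tolerance * abs(target)


def within_fold(value: float, target: float, fold: float = 2.0) -> bool:
    """True if `value` lies within `fold`-fold of a positive `target`."""
    return target / fold <= value <= target * fold
```

For example, with the 20% reading, “about 7 amino acids” spans 5.6 to 8.4, so lengths of 6, 7 and 8 qualify while 5 does not.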
- phrases such as “at least one of” or “one or more of” may occur followed by a conjunctive list of elements or features.
- the term “and/or” may also occur in a list of two or more elements or features. Unless otherwise implicitly or explicitly contradicted by the context in which it is used, such a phrase is intended to mean any of the listed elements or features individually or any of the recited elements or features in combination with any of the other recited elements or features.
- the phrases “at least one of A and B;” “one or more of A and B;” and “A and/or B” are each intended to mean “A alone, B alone, or A and B together.”
- a similar interpretation is also intended for lists including three or more items.
- phrases “at least one of A, B, and C;” “one or more of A, B, and C;” and “A, B, and/or C” are each intended to mean “A alone, B alone, C alone, A and B together, A and C together, B and C together, or A and B and C together.”
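The interpretation given for these phrases amounts to listing every non-empty combination of the recited items. A minimal Python sketch (the function name is hypothetical) makes the enumeration explicit:

```python
from itertools import combinations


def at_least_one_of(*items):
    """List every non-empty combination of the given items, smallest first."""
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


pairs = at_least_one_of("A", "B")
triples = at_least_one_of("A", "B", "C")
```

For three items this yields seven combinations, in the same order as the list recited above.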
- use of the term “based on,” is intended to mean, “based at least in part on,” such that an unrecited feature or element is also permissible.
- amino acid encompasses both naturally occurring amino acids and non-naturally occurring amino acids.
- non-naturally occurring amino acids include, but are not limited to, D-amino acids (i.e., an amino acid of an opposite chirality to the naturally occurring form), N-α-methyl amino acids, C-α-methyl amino acids, β-methyl amino acids and D- or L-β-amino acids.
- Non-naturally occurring amino acids include, for example, β-alanine (β-Ala), norleucine (Nle), norvaline (Nva), homoarginine (Har), 4-aminobutyric acid (γ-Abu), 2-aminoisobutyric acid (Aib), 6-aminohexanoic acid (ε-Ahx), ornithine (Orn), sarcosine, α-amino isobutyric acid, 3-aminopropionic acid, 2,3-diaminopropionic acid (2,3-diaP), D- or L-phenylglycine, D-(trifluoromethyl)-phenylalanine, and D-p-fluorophenylalanine.
- biomolecule refers to any of the numerous substances that are produced by cells and living organisms. Biomolecules have a wide range of sizes and structures and perform a vast array of functions. The four major types of biomolecules are carbohydrates, lipids, nucleic acids, and proteins or characteristic associated with the peptide and/or protein of interest.
- the biomolecules may be used in a variety of applications including, but not limited to, curative agents for diseases (e.g., insulin, interferon, interleukins, anti-angiogenic peptides, tumor necrosis factor); molecules that bind to defined cellular targets such as receptors, channels, lipids, cytosolic proteins, and membrane proteins, to name a few; biomolecules having antimicrobial activity, antiviral activity, anti-cancer, anti-inflammatory activity, and the like.
- As used herein, “cleavable linker elements”, “peptide linkers”, and “cleavable peptide linkers” will be used interchangeably and refer to cleavable peptide segments found, in certain embodiments, between peptide tags and the biomolecule, e.g., peptide, of interest. After the peptide tags are separated and/or partially purified or purified from the cell lysate, the cleavable linker elements can be cleaved chemically and/or enzymatically to separate the peptide tag from the biomolecule, e.g., peptide, of interest.
- the fusion peptide may also include a plurality of regions encoding one or more peptides of interest separated by one or more cleavable peptide linkers.
- the peptide of interest can then be isolated from the peptide tag, if necessary.
- the peptide tag(s) and the peptide of interest exhibit different solubilities in a defined medium (typically an aqueous medium), facilitating separation of the peptide tag from the biomolecule, e.g., polypeptide of interest.
- the peptide tag is insoluble in an aqueous solution while the protein/polypeptide of interest is appreciably soluble in an aqueous solution.
- the pH, temperature, and/or ionic strength of the aqueous solution can be adjusted to facilitate recovery of the peptide of interest.
- the differential solubility between the inclusion body tag and the peptide of interest occurs in an aqueous solution having a pH of 4 to 11 and a temperature range of 15 to 50° C.
- the cleavable peptide linker may be from 1 to about 50 amino acids, or from 1 to about 20 amino acids, in length.
- the cleavable peptide linkers may be incorporated into the fusion proteins using any number of techniques well known in the art.
- Means to prepare the present peptides are well known in the art and in preferred embodiments the entire peptide reagent may be prepared using the recombinant DNA and molecular cloning techniques.
- checkpoint proteins means a group of molecules on the cell surface of CD4+ and/or CD8+ T cells that fine-tune immune responses by down-modulating or inhibiting an anti-tumor immune response.
- the terms “comprising,” “comprise” or “comprised,” and variations thereof, in reference to defined or described elements of an item, composition, apparatus, method, process, system, etc. are meant to be inclusive or open ended, permitting additional elements, thereby indicating that the defined or described item, composition, apparatus, method, process, system, etc. includes those specified elements (or, as appropriate, equivalents thereof) and that other elements can be included and still fall within the scope/definition of the defined item, composition, apparatus, method, process, system, etc.
- the terms “conjugated,” “linked,” “attached,” “fused” and “tethered,” when used with respect to two or more moieties, means that the moieties or domains are physically associated or connected with one another, either directly or via one or more additional moieties that serve as a linking agent, to form a structure that is sufficiently stable so that the moieties remain physically associated under the conditions in which the structure is used, e.g., physiological conditions.
- the linkage can be based on genetic fusion according to the methods known in the art or can be performed by, e.g., chemical cross-linking.
- the compounds and targeting agents may be linked by a flexible linker, such as a polypeptide linker.
- the polypeptide linker can comprise plural, hydrophilic or peptide-bonded amino acids of varying lengths.
- associated will be used for the sake of brevity and is meant to include all possible methods of physically and chemically associating each domain.
- As used herein, the terms “fusion protein”, “fusion peptide”, “chimeric protein”, and “chimeric peptide” will be used interchangeably and will refer to a polymer of amino acids (peptide, oligopeptide, polypeptide, or protein) comprising at least two portions, each portion comprising a distinct function. At least one first portion of the fusion peptide comprises at least one of the present peptide tags. At least one second portion of the fusion peptide comprises at least one peptide of interest. In certain embodiments, the fusion protein additionally includes at least one cleavable peptide linker that facilitates cleavage (chemical and/or enzymatic) and separation of the peptide tag(s) and the peptide(s) of interest.
- Nucleic acid refers to nucleotides (e.g., deoxyribonucleotides, ribonucleotides, and 2'-modified nucleotides) and polymers thereof in either single-, double- or multiple-stranded form, or complements thereof.
- the terms “polynucleotide,” “oligonucleotide,” and “oligo” refer, in the usual and customary sense, to a linear sequence of nucleotides.
- nucleotide refers, in the usual and customary sense, to a single unit of a polynucleotide, i.e., a monomer.
- Nucleotides can be ribonucleotides, deoxyribonucleotides, or modified versions thereof.
- Examples of polynucleotides contemplated herein include single and double stranded DNA, single and double stranded RNA, and hybrid molecules having mixtures of single and double stranded DNA and RNA.
- Examples of nucleic acids, e.g., polynucleotides, contemplated herein include any type of RNA, e.g., mRNA, siRNA, miRNA, and guide RNA, and any type of DNA, e.g., genomic DNA, plasmid DNA, and minicircle DNA, and any fragments thereof.
- the term “duplex” in the context of polynucleotides refers, in the usual and customary sense, to double strandedness.
- Nucleic acids can include one or more reactive moieties.
- the term reactive moiety includes any group capable of reacting with another molecule, e.g., a nucleic acid or polypeptide, through covalent, non-covalent or other interactions.
- the nucleic acid can include an amino acid reactive moiety that reacts with an amino acid on a protein or polypeptide through a covalent, non-covalent, or other interaction.
- nucleic acids containing known nucleotide analogs or modified backbone residues or linkages which are synthetic, naturally occurring, and non- naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides.
- Examples of such analogs include, without limitation, phosphodiester derivatives including, e.g., phosphoramidate, phosphorodiamidate, phosphorothioate (also known as phosphothioate having double bonded sulfur replacing oxygen in the phosphate), phosphorodithioate, phosphonocarboxylic acids, phosphonocarboxylates, phosphonoacetic acid, phosphonoformic acid, methyl phosphonate, boron phosphonate, or O-methylphosphoroamidite linkages (see Eckstein, OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL APPROACH, Oxford University Press), as well as modifications to the nucleotide bases such as in 5-methyl cytidine or pseudouridine; and peptide nucleic acid backbones and linkages.
- nucleic acids include those with positive backbones, non-ionic backbones, modified sugars, and non-ribose backbones (e.g., phosphorodiamidate morpholino oligos or locked nucleic acids (LNA) as known in the art), including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, CARBOHYDRATE MODIFICATIONS IN ANTISENSE RESEARCH, Sanghvi & Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within one definition of nucleic acids.
- Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip.
- Mixtures of naturally occurring nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.
- the internucleotide linkages in DNA are phosphodiester, phosphodiester derivatives, or a combination of both.
- operably linked refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other.
- a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., that the coding sequence is under the transcriptional control of the promoter).
- the definition of “operably linked” may also be extended to describe the products of chimeric genes, such as fusion proteins.
- “operably linked” will also refer to the linking of peptide tag to a biomolecule, e.g., peptide of interest to be produced and recovered.
- the peptide tag is “operably linked” to the peptide of interest if upon expression the fusion protein is insoluble and accumulates in inclusion bodies in the expressing host cell.
- the fusion peptide will include at least one cleavable peptide linker useful in separating the peptide tag from the peptide of interest.
- the cleavable peptide linkers may be incorporated into the fusion proteins using any number of techniques well known in the art.
- polypeptide and “peptide” will be used interchangeably to refer to a polymer of two or more amino acids joined together by a peptide bond, wherein the peptide is of unspecified length, thus, peptides, oligopeptides, polypeptides, and proteins are included within the present definition.
- this term also includes post expression modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like. Included within the definition are, for example, peptides containing one or more analogues of an amino acid or labeled amino acids and peptidomimetics.
- the terms “protein of interest”, “polypeptide of interest”, “peptide of interest”, “targeted protein”, “targeted polypeptide”, “targeted peptide”, and “expressible polypeptide” will be used interchangeably and refer to a protein, polypeptide, or peptide which may be expressed by the genetic machinery of a host cell.
- plasmid refers to an extrachromosomal element often carrying genes which are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA molecules.
- Such elements may be autonomously replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing a promoter fragment and DNA sequence for a selected gene product along with appropriate 3' untranslated sequence into a cell.
- Transformation cassette refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that facilitates transformation of a particular host cell.
- Expression cassette refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that allow for enhanced expression of that gene in a foreign host.
- promoter/regulatory sequence means a nucleic acid sequence which is required for expression of a gene product operably linked to the promoter/regulatory sequence. In some instances, this sequence may be the core promoter sequence and in other instances, this sequence may also include an enhancer sequence and other regulatory elements which are required for expression of the gene product. The promoter/regulatory sequence may, for example, be one which expresses the gene product in a tissue specific manner.
- promoter as used herein is defined as a DNA sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, required to initiate the specific transcription of a polynucleotide sequence.
- a “constitutive” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or specifies a gene product, causes the gene product to be produced in a cell under most or all physiological conditions of the cell.
- an “inducible” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or specifies a gene product, causes the gene product to be produced in a cell substantially only when an inducer which corresponds to the promoter is present in the cell.
- a “tissue-specific” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or is specified by a gene, causes the gene product to be produced in a cell substantially only if the cell is a cell of the tissue type corresponding to the promoter.
- target molecule includes any macromolecule, including protein, peptide, polypeptide, gene, polynucleotide, oligonucleotide, carbohydrate, enzyme, polysaccharide, glycoprotein, receptor, antigen, tumor antigen, markers, molecules associated with a disease, an antibody, growth factor; or it may be any small organic molecule including a hormone, substrate, metabolite, cofactor, inhibitor, drug, dye, nutrient, pesticide, peptide; or it may be an inorganic molecule including a metal, metal ion, metal oxide, and metal complex; it may also be an entire organism including a bacterium, virus, and single-cell eukaryote such as a protozoon.
- the term “translatable” may be used interchangeably with the term “expressible.” These terms can refer to the ability of polynucleotide, or a portion thereof, to provide a polypeptide, by transcription and/or translation events in a process using biological molecules, or in a cell, or in a natural biological setting. In some settings, translation is a process that can occur when a ribosome creates a polypeptide in a cell. In translation, a messenger RNA (mRNA) can be decoded by a ribosome to produce a specific amino acid chain, or polypeptide.
- a translatable polynucleotide can provide a coding sequence region (usually, CDS), or portion thereof, that can be processed to provide a polypeptide, protein, or fragment thereof.
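The decoding of a coding sequence into a polypeptide described above can be sketched in Python using the standard genetic code. The compact table below lists the amino acid for each codon with bases ordered T, C, A, G; the function name and the example transcript are hypothetical, and the sketch ignores initiation context, alternative genetic codes and any regulation of translation.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(coding_sequence: str) -> str:
    """Translate an RNA or DNA coding sequence, stopping at the first stop codon."""
    seq = coding_sequence.upper().replace("U", "T")
    residues = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the polypeptide
            break
        residues.append(residue)
    return "".join(residues)


# Hypothetical coding sequence: start codon, QPRFAAA tag, stop codon.
tagged = translate("AUGCAGCCCCGGUUCGCCGCCGCCUAA")
```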
- 3'-untranslated region relates to the section of messenger RNA (mRNA) that immediately follows the translation termination codon.
- the 3'-UTR may comprise regulatory regions within the 3'-untranslated region which are known to influence polyadenylation and stability of the mRNA.
- Many 3'-UTRs also contain AU-rich elements (AREs).
- the 3'-UTR may preferably contain the sequence that directs addition of several hundred adenine residues called the poly(A) tail to the end of the mRNA transcript.
- 5'-untranslated region refers to a polynucleotide sequence that, when linked to a transcript, is capable of recruiting ribosome complexes and initiating translation of the transcript.
- a 5’-UTR is positioned directly upstream of the initiation codon of a transcript; specifically, between the cap site and the initiation codon.
- the 5' UTR begins at the transcription start site and ends one nucleotide (nt) before the start codon (usually AUG in the mRNA) of the coding region.
- the 5' UTR is generally from 100 to several thousand nucleotides long, although shorter UTRs also occur in eukaryotes.
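The positional definitions above (the 5' UTR ends one nucleotide before the start codon; the 3' UTR begins immediately after the termination codon) can be sketched as a function that splits a transcript into three regions. The sketch assumes the position of the start codon is already known and does not model the cap or the poly(A) tail; the function name and the example transcript are hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def split_mrna(mrna: str, start_index: int):
    """Split an mRNA into (5' UTR, coding sequence, 3' UTR).

    `start_index` is the 0-based position of the first base of the start
    codon, assumed to be known in advance. The coding sequence returned
    includes both the start codon and the stop codon.
    """
    seq = mrna.upper().replace("T", "U")
    for i in range(start_index, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return seq[:start_index], seq[start_index:i + 3], seq[i + 3:]
    raise ValueError("no in-frame stop codon found")


# Placeholder transcript: 6-nt 5' UTR, a 3-codon coding sequence, 6-nt 3' UTR.
utr5, cds, utr3 = split_mrna("GGGAAA" + "AUGGCCUAA" + "CCCUUU", start_index=6)
```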
- FIGS. 1A-1F are a series of graphs, schematic representations and fluorescent microscopic images demonstrating that Qa tagging in SARS-CoV-2 viral proteins robustly boosts the production of dual reporter-fused viral proteins in HEK293T cells.
- FIG. 1A Diagram for 2A-mediated dual reporter gdLuc/dsGFP fused with viral protein and potential multiple measures of viral protein expression/production.
- FIGS. 1B-1D Representative experiments for Qa boosting of SARS-CoV-2 envelope (E) protein dynamic production (FIG. 1B) and average fold induction of 10 experiments determined by gdLuc assay of cultured media at 24-48 h after transfection with indicated pcDNA6B vector (100 ng/well) in quadruplicates for each experiment (FIG. 1C), as well as representative images of Qa-boosted dsGFP expression detected by fluorescent microscopy (FIG. 1D).
- FIGS. 1E-1F Representative gdLuc assay showing various degrees of Qa boosting in the other SARS-CoV-2 structural proteins spike (S) and nucleocapsid (N), as well as accessory proteins NSP2, NSP16 and ORF3.
- Cells were transfected in quadruplicates with indicated pcDNA6B vectors at 100 ng/well.
- FIGS. 2A-2G are a series of graphs and fluorescent microscopic images demonstrating that Qa boosting is versatile in dosages, non-viral proteins, cell types and tagging location.
- FIG. 2A Dose-dependent Qa boosting of SARS-CoV-2 S, M, N and ORF3 to various degrees. Cells were transfected in quadruplicates with indicated pcDNA6B vectors at indicated amounts of vectors. Data represent mean ± SE of gdLuc activity in the supernatant at 48 h after transfection. The fold numbers indicate relative changes in Qa groups compared with the corresponding control group.
- FIGS. 2B-2D Dose-independent Qa boosting in host cellular gene NIBP and hACE2 determined by gdLuc assay (FIGS. 2B, 2C) and representative fluorescent microscopic images (FIG. 2D) at 48 h after transfection with pcDNA6B regular vectors.
- FIGS. 2E, 2F Dose-independent Qa boosting in secretory IFNγ and IL-2 by gdLuc assay at 48 h after transfection with pRRL LV vectors.
- FIG. 2G Qa boosting on viral proteins E and S as well as non-viral protein hACE2 exhibits similar efficiency in different cell types.
- FIGS. 3A-3I are a series of graphs, fluorescent microscopic images, a schematic representation and a blot demonstrating that Qa boosting is accelerated by stronger promoter and SARS-CoV-2 native untranslated region (UTR).
- FIGS. 3A-3C Stronger promoter CAG further increases Qa boosting efficiency in viral protein E, S and NSP16.
- FIGS. 3D, 3E The 5’-UTR inclusion robustly increases promoter-dependent expression of E protein as determined by Western blot and immunocytochemistry with anti-Flag antibody, and addition of 3’-UTR further increases E protein expression. The different size of E-Flag-Q results from the addition of 37 amino acids within the open reading frame after stop codon removal during cloning.
- FIGS. 3F, 3G The 5’-UTR inclusion dramatically increases the CAG-driven expression of Qa tagged S-fused dual reporter as determined by representative fluorescent microscopic images and gdLuc assay.
- FIG. 3H The 5’-UTR inclusion further accelerates Qa boosting efficiency of CMV-driven S dual reporter protein production as compared with LG group at 48 h after transfection with regular vector.
- FIG. 3I The 5’-UTR inclusion accelerates Qa boosting of dual reporter without viral proteins as determined by gdLuc assay at 48 h after transfection with pRRL LV vectors.
- FIGS. 4A-4H are a series of graphs, fluorescent microscopic images, a schematic representation and a blot demonstrating that Qa tagging and 5’-UTR inclusion boost the packaging and transduction efficiency of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP).
- FIG. 4A Diagram for different vectors expressing human codon-optimized Sd18 and the process of S-LVLP packaging.
- FIG. 4B Qa and 5’-UTR increase Sd18 protein expression in the transfected cells as determined by Western blot with serum from SARS-CoV-2 patient.
- FIGS. 4C-4F Qa tagging increases S-LVLP packaging titer for the standard pRRL-GFP LV vector determined by GFP positivity, which is further increased by 5’-UTR inclusion, polybrene treatment and purification.
- FIGS. 4G-4H Qa tagging and 5’-UTR inclusion increase S-LVLP packaging titer for dual reporter LV vectors pRRL-LG or pRRL-E-LG determined by GFP positivity and gdLuc activity.
- FIGS. 5A-5I are a series of graphs demonstrating that Qa tagging and 5’-UTR inclusion boost mRNA-dependent production of SARS-CoV-2 viral proteins S, N, E and ORF3 as well as non-viral hACE2 via increasing mRNA stability and translational efficiency.
- FIGS. 5A-5C Qa tagging robustly boosts mRNA-dependent production of dual reporter in a time- and dose-dependent manner, to varying extents with different targeted proteins.
- FIGS. 5D, 5E The 5’-UTR inclusion further accelerates Qa boosting on mRNA-derived production of S, E and hACE2 proteins.
- FIGS. 5F-5I Qa tagging increases posttranscriptional mRNA stability and translational efficiency in the presence of transcriptional inhibitor actinomycin D.
- FIGS. 6A-6M are a series of diagrams, blots, graphs, fluorescent microscopic images and a photograph demonstrating that Qa tagging boosts the production yield of anti-SARS monoclonal antibody and lentiviruses.
- FIG. 6A Diagram showing the Qa tagging on the C-terminus of constant regions for heavy and light chains of anti-SARS monoclonal antibody.
- FIGS. 6B-6D Representative ELISA for robust boosting of mAb production at 48 h after co-transfection of H/L or HQ/LQ (50 ng/well) with or without normalization of GFP (FIG. 6G) or firefly-luciferase (FIG. 6L).
- The mAb amount is quantified by sigmoidal four-parameter logistic (4PL) curve determination. Relative fold changes are presented as compared with the corresponding LG.
- FIG. 6E Average fold changes of 16 experiments based on ELISA results.
- FIG. 6F Western blot analysis confirmed the boost of Qa on mAb production in the supernatant.
- FIG. 6G Qa tagging in the LV transfer vector pRRL-E-LG increased gdLuc activity in the supernatant after LV infection of HEK293T cells.
- FIGS. 6H, 6I Qa tagging in the LV transfer vector pLV-EF1a-Flag-spCas9-Qa-T2A-RFP increases the transgene expression determined by Western blot analysis with anti-Flag antibody (FIG. 6H) but does not increase the packaging efficiency measured with FACS for RFP positivity (FIG. 6I).
- FIG. 6J Representative fluorescent images showing Qa tagging on Pol and RRE boosts LV packaging efficiency for pRRL-GFP transfer LV vector but Qa tagging on Gag impairs LV packaging.
- FIGS. 6K-6M Qa tagging on Pol and RRE boosts LV packaging efficiency for pRRL-UTR-QLG and pLV-EF1a-MS2-spCas9-F2A-GFP determined with LV qPCR titer kit (FIG. 6K), flow cytometry (FIG. 6L) and gdLuc assay (FIG. 6M).
- FIGS. 7A-7H are a series of blots and graphs identifying the secretion boost of Qa tagging on various targeted proteins in HEK293T cells.
- FIGS. 7A-7B Qa tagging remarkably decreases the expression level of E-Flag-gdLuc protein in the cell lysate at 48 h after transfection with indicated vectors.
- FIG. 7C Qa tagging decreases the expression levels of secretory IFNγ/IL-2 or non-secretory viral protein N and non-viral protein hACE2 in the cell lysates at 48 h after transfection with indicated vectors. T2A auto-cleaving efficiency varies with targeted proteins, showing different ratios of the cleaved band (c) and non-cleaved band (n).
- FIG. 7D Qa tagging decreases S protein level in the cell lysates while 5’-UTR inclusion does increase the protein level despite the continuous secretion. The cleaved S-Flag-gdLuc fragment was detected with anti-Flag antibody and the cleaved dsGFP fragment was detected with anti-GFP antibody.
- FIGS. 7E, 7F Qa tagging robustly increases the protein level of secretory E-QLG in the supernatant detected by Western blot analysis and gdLuc assay of the supernatant.
- Cells were transfected with indicated vectors in quadruplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h.
- FIG. 7G ER-Golgi trafficking inhibitor brefeldin A completely blocks the secretion of Qa tagged viral proteins and host cellular proteins.
- FIG. 7H Qa tagging does increase the protein expression of non-secretory firefly-luciferase (fLuc) in the cell lysate and has no effect on the background of fLuc activity in the supernatant.
- FIGS. 8A-8I are a series of schematics, photographs of stained cells and graphs demonstrating that Exen21/Qa addition in SARS-CoV-2 viral proteins robustly boosts production of dual reporter-fused viral proteins in HEK293T cells.
- FIG. 8A Diagram of 2A-mediated dual reporter gdLuc/dsGFP (LG) and Qa tagged LG (QLG) fused with viral protein and potential multiple measures of viral protein expression/production.
- Exen21/Qa stands for the 21-mer nucleotide motif and its corresponding heptapeptide.
- FIGS. 8B-8D Representative experiments showing Exen21 boosting of SARS-CoV-2 envelope (E) protein dynamic production (FIG. 8B) and average fold induction with results of 20 experiments (FIG. 8C) determined by gdLuc assays in supernatants, 24-72 h after transfection with indicated pcDNA6B vector (100 ng/well, quadruplicate), and representative images of Exen21-boosted dsGFP expression detected by fluorescence microscopy (FIG. 8D). Data represent mean ± SE of gdLuc activity with the relative fold changes (in red) in QLG over corresponding LG groups (the same below).
- FIGS. 8E-8F Representative gdLuc assay showing various degrees of Exen21 boosting in other SARS-CoV-2 structural proteins: spike (S), nucleocapsid (N), and accessory proteins: NSP2, NSP16, and ORF3.
- Cells were transfected with indicated pcDNA6B vectors (100 ng/well, quadruplicates). Data represent mean ± SE of gdLuc activity in supernatants 48 h post-transfection.
- FIGS. 8G-8I Alanine scanning and deletion mutation (FIG. 8G) as well as degenerate (FIG. 8H) and missense (FIG. 8I) mutation assays showing the critical role of the unique and specific Exen21 in boosting E-LG production.
- Cells were transfected with indicated pcDNA6B-E vectors (100 ng/well, quadruplicates). Data represent mean ± SE of gdLuc activity in supernatants 48 h post-transfection, with the relative percentage changes compared with the parent E-QLG group.
- Inset in FIG. 8G shows the heptapeptide structure with the residue position.
- Insets in FIGS. 8H and 8I show the mutated nucleotides and corresponding residues. dQ denotes degenerate QLG mutants and mQ denotes missense QLG mutants.
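- The figure legends above report gdLuc activity as mean ± SE of replicate wells together with the fold change of a QLG group over its corresponding LG group. The following is a minimal sketch of that arithmetic, assuming the fold change is the ratio of group means; the well readings are hypothetical values chosen for illustration and are not data from the disclosure.

```python
from math import sqrt
from statistics import mean, stdev


def mean_se(values):
    """Return (mean, standard error of the mean) for replicate wells."""
    m = mean(values)
    se = stdev(values) / sqrt(len(values)) if len(values) > 1 else 0.0
    return m, se


def fold_change(tagged, control):
    """Ratio of the mean tagged (QLG) signal to the mean control (LG) signal."""
    return mean(tagged) / mean(control)


# Hypothetical gdLuc readings (relative light units) from quadruplicate wells.
lg_wells = [100.0, 110.0, 90.0, 100.0]
qlg_wells = [2000.0, 2200.0, 1800.0, 2000.0]

lg_mean, lg_se = mean_se(lg_wells)
qlg_mean, qlg_se = mean_se(qlg_wells)
print(f"LG  mean ± SE: {lg_mean:.1f} ± {lg_se:.1f}")
print(f"QLG mean ± SE: {qlg_mean:.1f} ± {qlg_se:.1f}")
print(f"Fold change (QLG/LG): {fold_change(qlg_wells, lg_wells):.1f}")
```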
- FIGS. 9A-9G are a series of graphs demonstrating that Exen21 boosting is versatile in dosages, non-viral proteins and cell types.
- FIG. 9A Dose-dependent and varying extents of Exen21-boosted expression of SARS-CoV-2 S, M, N and ORF3 protein levels.
- Cells were transfected in quadruplicates with indicated pcDNA6B (6B) vectors in indicated amounts. Data represent mean ± SE of gdLuc activity in supernatants 48 h post-transfection. Fold values indicate changes in Exen21 groups relative to those of corresponding control groups.
- FIGS. 9B, 9C Dose-independent boosting by Exen21 of host cellular gene NIBP and hACE2 levels, determined by gdLuc assay (FIGS. 9B, 9C) 48 h after transfection with pcDNA6B regular vectors.
- FIGS. 9D, 9E Dose-independent boosting by Exen21 of secretory IFNγ and IL-2 by gdLuc assay 48 h after transfection with pRRL LV vectors.
- FIG. 9F Stronger promoter CAG further increases boosting efficiency in LG system by Qa (QLG) in viral protein E.
- FIG. 9G Exen21-induced boosting of viral E and S proteins and non-viral protein hACE2 exhibits similar efficiencies across different cell types.
- FIGS. 10A-10F are a series of schematics, a photograph, a blot and graphs demonstrating that Exen21 addition boosts production yields of anti-SARS monoclonal antibody (mAb).
- FIG. 10A Diagram showing human anti-SARS mAb and Exen21/Qa tags introduced (right panel) on its C-termini of constant regions of heavy and light chains.
- FIG. 10B Representative ELISA showing robust boosting by Exen21/Qa (HQ/LQ) of mAb production 48 h after co-transfection of mAb H/L or HQ/LQ expression vectors (50 ng/well, in triplicates), with normalization vectors empty control (C), GFP (G) or firefly-luciferase (L).
- FIG. 10C Sigmoidal 4-parameter logistic curve (4PL) determination of mAb concentrations.
- FIG. 10D Normalized quantitative data from the experiment/assay shown in FIG. 10B. Relative fold changes are presented as compared with corresponding mAb H/L.
- FIG. 10E Average Exen21/Qa-induced fold changes of ELISA-based mAb production for 16 experiments at p < 0.0001 with Student’s t test.
- FIG. 10F Western blot analysis confirming the boost of Exen21/Qa on mAb production in the supernatant.
- Membrane staining as a loading control is for densitometric analysis of relative fold changes in light chain (LC) between HQ/LQ and H/L groups.
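- The legend for FIG. 10C refers to a sigmoidal four-parameter logistic (4PL) curve for determining mAb concentrations from ELISA readings. The sketch below shows one common parameterization of the 4PL model and its algebraic inverse, which back-calculates a concentration from a measured signal. The parameter values are hypothetical; in practice they would be fitted to a standard curve, and the disclosure does not specify the fitting software or parameterization used.

```python
def four_pl(x, a, b, c, d):
    """4PL response at concentration x.

    a: response at zero concentration, d: response at saturating concentration,
    c: concentration at the curve midpoint, b: slope factor.
    """
    return d + (a - d) / (1.0 + (x / c) ** b)


def inverse_four_pl(y, a, b, c, d):
    """Concentration that yields response y (y must lie strictly between a and d)."""
    return c * ((a - d) / (y - d) - 1.0) ** (1.0 / b)


# Hypothetical fitted parameters for an ELISA standard curve (OD vs. ng/ml).
params = dict(a=0.05, b=1.2, c=50.0, d=2.5)

od = four_pl(25.0, **params)          # signal expected for a 25 ng/ml standard
conc = inverse_four_pl(od, **params)  # back-calculated concentration
print(f"OD = {od:.3f}, back-calculated concentration = {conc:.1f} ng/ml")
```

The inverse is only defined for signals between the two asymptotes, so readings outside that range would need dilution or exclusion before quantification.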
- FIGS. 11A-11K are a series of schematics, blots, graphs and photographs demonstrating that Exen21 addition boosts packaging and transduction efficiencies of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP) and standard lentiviral packaging.
- FIG. 11A Diagrams of different vectors expressing human codon-optimized Sd18, and the process of S-LVLP packaging in HEK293T cells.
- FIG. 11B Exen21 increases Sd18 protein expression in transfected cells, shown by Western blot with serum from SARS-CoV-2 patient, which contains specific anti-S antibody. Representative fold change for S2 fragment is quantified by densitometric analysis with GAPDH normalization.
- FIG. 11C Exen21 addition increases S-LVLP packaging titer of the standard pRRL-GFP LV vector determined by GFP positivity.
- FIGS. 11D-11E Exen21 addition increases S-LVLP packaging titer for dual reporter LV vectors pRRL-E-QLG determined by GFP positivity (FIG. 11D) and gdLuc activity (FIG. 11E).
- FIG. 11F Exen21/Qa in the LV transfer vector pRRL-E-QLG induces LV dose-related increases in gdLuc activity in supernatants of HEK293T cells 48-72 h after infection with indicated amount of crude LV preparation (µl per well, triplicates). Shown are fold changes in gdLuc activity from E-QLG vs. control E-LG group.
- FIGS. 11G, 11H Exen21/Qa in the LV transfer vector pLV-EF1a-Flag-spCas9-Qa-T2A-RFP (Qa) increases transgene expression vs. untagged vector (Con), seen by Western blot analysis with anti-Flag antibody (FIG. 11G), but does not increase packaging efficiency as measured by FACS for RFP positivity 48 h after infection with crude LV preparation (FIG. 11H).
- FIG. 11I Representative fluorescence images show that Exen21/Qa addition to Pol and RRE enhances pRRL-GFP LV packaging efficiency vs. control (psPAX2) levels, but Exen21/Qa on Gag impairs LV packaging.
- FIGS. 11J, 11K Exen21/Qa tagging on Pol and RRE (PolQ/RREQ) boosts LV packaging efficiency for pRRL-GFP transfer vector, determined by cell counting (FIG. 11J) and flow cytometry (FIG. 11K).
- FIG. 11L The gdLuc assay showing the boosting of Exen21/Qa tagging.
- FIGS. 12A-12G are a series of graphs demonstrating that Exen21 addition boosts mRNA-dependent production of SARS-CoV-2 viral proteins S, N, E and ORF3 as well as non-viral hACE2 by increasing mRNA stability and translational efficiency.
- FIGS. 12A-12C Exen21 addition robustly boosts mRNA-dependent production of dual reporter in a time- and dose-dependent manner, to varying extents with different targeted proteins.
- FIG. 12A Time course of responses to different concentrations of capped mRNAs for S-LG vs. S-QLG (ng/well, quadruplicate).
- FIG. 12B Time course of response to indicated mRNAs (100 ng/well, quadruplicate).
- FIG. 12C Dose response to indicated mRNAs at 24 h post transfection.
- FIGS. 12D-12G Exen21 addition (QLG; right panels in D, E) increases posttranscriptional mRNA stability and translational efficiency in the presence of transcriptional inhibitor actinomycin D, shown in time-course plots of reporter activity (FIGS. 12D, 12E) and mRNA decay (FIGS. 12F, 12G). The mRNA levels were determined by RT-qPCR analysis.
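- The mRNA decay time courses described for FIGS. 12F and 12G (RT-qPCR after actinomycin D treatment) are often summarized as an mRNA half-life. The following is a minimal sketch of one way to do that, assuming first-order (exponential) decay and a least-squares fit on log-transformed levels; the disclosure does not state which decay model or analysis was used, and the time points and levels below are hypothetical.

```python
from math import log


def decay_half_life(times_h, rel_levels):
    """Fit ln(level) = ln(N0) - k * t by least squares; return half-life ln(2)/k in hours."""
    if len(times_h) != len(rel_levels) or len(times_h) < 2:
        raise ValueError("need at least two paired time points")
    ys = [log(v) for v in rel_levels]  # levels must be positive
    n = len(times_h)
    t_mean = sum(times_h) / n
    y_mean = sum(ys) / n
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_h, ys))
    sxx = sum((t - t_mean) ** 2 for t in times_h)
    slope = sxy / sxx  # equals -k for a decaying transcript
    if slope >= 0:
        raise ValueError("levels do not decay over time")
    return log(2) / -slope


# Hypothetical relative mRNA levels (fraction of time 0) after actinomycin D addition.
hours = [0.0, 2.0, 4.0, 8.0]
levels = [1.0, 0.5, 0.25, 0.0625]
half_life = decay_half_life(hours, levels)
print(f"Estimated half-life: {half_life:.2f} h")
```

Comparing the half-life fitted for a QLG transcript with that of its LG counterpart would give a single-number view of the stability difference.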
- FIGS. 13A-13G are a series of blots and graphs demonstrating that Exen21 addition enhances secretion of various targeted proteins in HEK293T cells, shown by Western blot analyses.
- FIGS. 13A-13C Exen21 addition remarkably decreases protein expression levels of viral proteins (E, S, N), non-viral protein hACE2 and secretory IFNγ/IL-2 in cell lysates 48 h after transfection with indicated pcDNA6B (6B) vectors. Fold numbers are relative densitometric changes after normalization by the loading control GAPDH or non-specific bands (NS). P2A auto-cleaving efficiency varies with targeted proteins, showing different ratios of the cleaved band (c) and non-cleaved band (n).
- FIGS. 13D, 13E Exen21 addition robustly increases secretory E-QLG protein levels in the supernatants, seen both by Western blot analyses (FIG. 13D) and gdLuc assay (FIG. 13E).
- Cells were transfected with indicated vectors in quadruplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h.
- Membrane staining as a loading control is for densitometric analysis of relative fold changes in E-QLG over E-LG.
- FIG. 13F ER-Golgi trafficking inhibitor brefeldin A blocks the secretion of Qa-tagged viral E protein (E-QLG) and host cell protein (IFNγ), seen both by Western blot analyses (left) and gdLuc assay (right) of the supernatants 48 h after brefeldin A treatment.
- FIG. 13G Exen21 addition elevates non-secretory firefly-luciferase (fLuc) protein levels in cell lysates. Relative fold change is quantified by densitometric analysis with GAPDH normalization.
- FIGS. 14A-14C are a series of photographs showing a representative fluorescent microscopy detection of dual reporter. Related to FIGS. 8A-8I and 9A-9G.
- FIG. 14A Three indicated antibodies detected dual reporter of E-Flag-gdLuc-T2A-GFP with 2A and Flag complete colocalization while some cleaved GFP stayed alone without the corresponding E-Flag-gdLuc-T2A, which may have been secreted.
- FIGS. 15A-15B are a series of graphs and photographs demonstrating the dose-dependent Exen21/Qa boosting of SARS-CoV-2 viral proteins and saturation of boosting activity at a higher amount of transfected reporter DNA in all the tested viral dual reporters.
- FIG. 15A Exen21/Qa boosting in different dosage determined by NanoLight Gaussia luciferase assay.
- HEK293T cells in a 96-well plate were transfected with indicated gdLuc-P2A-dsGFP reporter at indicated amount.
- EGFP images were taken, and the supernatants were collected for luciferase assay. Data represent relative fold changes compared to corresponding LG group with mean ± SE of 4 wells.
- FIGS. 16A-16E are a series of photographs, a blot, a graph and a schematic demonstrating Exen21 boosting of mRNA vaccine production and efficacy. Related to FIGS. 12A-12G.
- FIG. 16A Diagram for in vitro transcription and 5’-Cap modification.
- FIG. 16B Gel electrophoresis (1% agarose) images for transcript length, integrity and quantity of both C0 and C1 5’-capped mRNAs.
- FIG. 16C 10- to 30-fold increases in the expression of dual reporter at equal levels of functional mRNA for viral genes N, E and ORF3.
- The capped (Cap-C0) and tailed mRNAs of indicated targets were synthesized using the HiScribe T7 ARCA mRNA Kit (NEB, E2065) and cDNA template from the corresponding linearized plasmid.
- Half of the Cap-C0 mRNAs were further methylated at the 2'-O position of the first nucleotide adjacent to the Cap-C0 structure using mRNA Cap 2'-O-Methyltransferase (NEB, M0366).
- Both Cap-C0 and Cap-C1 mRNAs were purified with Monarch RNA Cleanup Kit (NEB, T2040).
- HEK293T cells in a 96-well plate were transfected with indicated mRNA (100 ng/well). At 24 h after transfection, the supernatants were collected for NanoLight Gaussia luciferase assay. Data represent relative fold changes compared to corresponding LG group with mean ± SE of 4 independent experiments.
- FIGS. 17A-17E are a series of blots and a graph demonstrating that Qa tagging robustly increases the protein level of secretory E-QLG in the supernatant detected by Western blot analysis and gdLuc assay of the supernatant.
- FIGS. 17A, 17B Western blot with anti-gdLuc monoclonal antibody (Proteintech, Cat# 60158-1-Ig).
- FIGS. 17C, 17D Western blot with anti-GFP polyclonal antibody (Proteintech, Cat# 50430-2-AP).
- FIG. 17E Relative fold changes of boosting efficiency by gdLuc assay.
- HEK293T cells were transfected with indicated vectors (100 ng/well) in triplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h before analysis.
- FIGS. 18A-18D are a series of graphs and blots demonstrating that ER-Golgi trafficking inhibitor brefeldin A blocks the secretion of Qa tagged viral proteins and host cellular proteins. Related to FIGS. 13A-13G.
- FIGS. 18A, 18B Relative gdLuc activity changes in the supernatant (FIG. 18A) and cell lysate (FIG. 18B) after brefeldin A treatment.
- FIGS. 18C, 18D Western blot with anti-gdLuc monoclonal antibody (Proteintech, Cat# 60158-1-Ig) and anti-GFP polyclonal antibody (Proteintech, Cat# 50430-2-AP).
- HEK293T cells were transfected with indicated vectors (50 ng/well) in quadruplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h before analysis.
- FIGS. 19A-19C are a series of schematics and a table showing the SARS-CoV-2 UTR-E-Flag-Qa-UTR synthesis and cloning.
- FIG. 19A Diagram for the synthetic 5’-UTR-E-Flag-Qa-3’-UTR.
- FIG. 19B NEBuilder HiFi DNA assembly cloning of the synthetic nucleotides (946 bp) into pCAG-Flag expression vector.
- FIG. 19C List of cloning strategy to obtain indicated vector for E protein and S protein fused with QLG dual reporter.
- FIGS. 20A-20C are a series of photographs of stained cells showing that both 5’-UTR and 3’-UTR enhanced the promoter-driven expression of Qa-tagged E protein in HEK293T cells.
- HEK293T cells in a 96-well plate were transfected with indicated vectors in triplicate (100 ng/well). At 48 h after transfection, cells were fixed with 4% paraformaldehyde (PFA) for 10 minutes and immunocytochemistry with anti-Flag antibody was performed.
- FIG. 20A Representative confocal images.
- FIG. 20B Mean fluorescent intensity determined by ImageJ analysis of 6 fields from 3 wells.
- FIG. 20C Western blot analysis with anti-Flag antibody and anti-GAPDH for loading control.
- FIGS. 21A-21D are a series of schematics, photographs of stained cells and graphs showing that addition of 5’-UTR between CAG promoter and S-Flag-QLG dual reporter enhanced S protein expression.
- FIG. 21A Diagram of dual reporter design with the secretable Gaussia dura luciferase (gdLuc) plus P2A autocleavable destabilized GFP (dsGFP) and various measures to assess the expression of targeted proteins (here SARS-CoV-2 viral proteins). The novel Q tag is located between the targeted protein and gdLuc.
- FIGS. 21B-21D HEK293T cells in a 96-well plate were transfected with indicated vectors in quadruplicate at indicated amount of DNA (12.5-100 ng/well).
- EGFP images were taken (FIG. 21B), and the supernatants were collected for NanoLight Gaussia luciferase assay (FIGS. 21B, 21C, 21D).
- Data represent relative light units of bioluminescence (FIG. 21C) or fold changes (FIG. 21D) compared to corresponding non-UTR group with mean ± SE of 4 wells.
- FIGS. 22A-22C are a series of blots and graphs showing that addition of 5’-UTR to the pCAG, pcDNA6B and pRRL vectors dramatically increased the protein expression of the transgenes.
- HEK293T cells in a 24-well plate (FIG. 22A) or 96-well plate (FIGS. 22B, 22C) were transfected with indicated vectors (500 ng/well in FIG. 22A or 100 ng/well in FIGS. 22B, 22C).
- EGFP expression was determined with Western blot (FIG. 22A), and the supernatants were collected for NanoLight Gaussia luciferase assay (FIGS. 22B, 22C).
- Data represent fold changes compared to the corresponding LG group (FIG. 22B) or relative light units of bioluminescence compared to the non-UTR group (FIG. 22C), with mean ± SE of 4 wells.
- FIGS. 23A, 23B are a series of graphs showing that addition of 5’-UTR upstream of in vitro transcribed mRNA significantly enhances the protein expression in HEK293T cells.
- HEK293T cells in a 96-well plate were transfected using Lipofectamine® MessengerMAX mRNA Transfection Reagent with indicated mRNAs (50 ng/well) generated from in vitro transcription with a 5’ cap and a 3’ poly(A) tail.
- The mRNAs encode indicated viral protein (E or S protein) or endogenous hACE2 protein fused with dual reporter LG or QLG.
- FIGS. 24A-24F are a series of schematics, photographs of stained cells, blots and graphs showing that Qa tagging and 5’-UTR inclusion boost the packaging and transduction efficiency of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP).
- FIG. 24A Diagram for different vectors expressing human codon-optimized Sd18 and the process of S-LVLP packaging.
- FIG. 24B Qa and 5’-UTR increase Sd18 protein expression in the transfected cells as determined by Western blot with serum from SARS-CoV-2 patient.
- FIGS. 24C-24D Qa tagging increases S-LVLP packaging titer for the standard pRRL-GFP LV vector determined by GFP positivity, which is further increased by 5’-UTR inclusion.
- FIGS. 24E-24F Qa tagging and 5’-UTR inclusion increase S-LVLP packaging titer for dual reporter LV vectors pRRL-LG or pRRL-E-LG determined by GFP positivity and gdLuc activity.
- The disclosure is based, in part, on the unexpected finding that an oligonucleotide, cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7), which encodes a short peptide (termed herein “Qa”), significantly boosted the expression/production of a fusion protein.
- Further expanded studies identified the versatile property of Exen21/Qa tagging in boosting the production (by up to a thousand-fold) of various proteins including viral proteins, endogenous gene products, vaccines, antibodies, engineered recombinant proteins and virus packaging proteins. Also discovered was the potent boosting of protein production by the SARS-CoV-2 native 5’-UTR, and its synergistic role with Qa tagging.
- Qa increased mRNA/protein stability and/or enhanced protein translation, and also facilitated protein secretion.
- These versatile protein boost strategies will be broadly beneficial to biomedical science and the protein engineering industry. This is the first evidence for protein regulation/boosting by short peptide tagging and the SARS-CoV-2 native 5’-UTR.
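- The relationship between the 21-mer oligonucleotide of SEQ ID NO: 7 and the short peptide it encodes can be checked by in silico translation. The sketch below uses the standard genetic code and assumes the reading frame starts at the first nucleotide of the sequence as printed (including the lowercase c); the frame is an assumption made here for illustration, since this passage does not state it.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA or RNA sequence in frame 1; '*' marks a stop codon."""
    dna = seq.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


EXEN21 = "cAACCGCGGTTCGCGGCCGCT"  # SEQ ID NO: 7 as printed above
peptide = translate(EXEN21)
print(f"{len(EXEN21)} nt -> {len(peptide)} residues: {peptide}")
```

Under that frame assumption the 21 nucleotides translate to the seven residues QPRFAAA with no stop codon, which is consistent with the description of Exen21/Qa as a 21-mer motif and its corresponding heptapeptide.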
- Embodiments are directed to novel chimeric molecules comprising a peptide tag and 5’-untranslated region (5’-UTR) for use in the enhanced production and expression of a desired biomolecule.
- An untranslated region refers to either of two sections, one on each side of a coding sequence on a strand of mRNA. If it is found on the 5' side, it is called the 5’-UTR (or leader sequence), or if it is found on the 3' side, it is called the 3' UTR (or trailer sequence).
- The mRNA is initially transcribed from the corresponding DNA sequence and then translated into protein. However, several regions of the mRNA are usually not translated into protein, including the 5' and 3' UTRs.
- The 5’-UTR is a sequence that is recognized by the ribosome which allows the ribosome to bind and initiate translation.
- The mechanism of translation initiation differs in prokaryotes and eukaryotes.
- The 3' UTR is found immediately following the translation stop codon. The 3' UTR plays a critical role in translation termination as well as post-transcriptional modification.
- A chimeric molecule for use in enhancing the expression and production of a desired biomolecule comprises one or more short peptide domains and one or more UTRs.
- The UTR is a 5’-UTR. In certain embodiments, the UTR is a 3’-UTR.
- The one or more 5’-untranslated region (UTR) domains or fragments thereof are derived from one or more viruses.
- The one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
- The 5’-UTR and/or 3’-UTR are from a coronavirus.
- The coronavirus is SARS-CoV-2.
- The one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 5’-UTR.
- The one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof comprise a SARS-CoV-2 3’-UTR.
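- The 70% and 90% sequence identity thresholds above can be illustrated with a simple calculation over two sequences that have already been aligned. This is a sketch only: the disclosure does not specify the alignment method or how gaps are counted, so the choice here (identical positions divided by the number of alignment columns) is an assumption, and the two short sequences are hypothetical rather than actual SARS-CoV-2 UTR sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences have a gap; they carry no information.
    columns = [
        (x.upper(), y.upper())
        for x, y in zip(aligned_a, aligned_b)
        if not (x == gap and y == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


# Hypothetical pre-aligned reference and candidate fragments.
reference = "ATTAAAGGTT"
candidate = "ATTAAAGCTA"
identity = percent_identity(reference, candidate)
print(f"Identity: {identity:.1f}%")
print("meets 70% threshold:", identity >= 70.0)
print("meets 90% threshold:", identity >= 90.0)
```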
- The one or more UTR sequences are engineered to include a Shine-Dalgarno sequence (5'-AGGAGGU-3'). This sequence is found 3-10 base pairs upstream from the initiation codon.
- The one or more UTR sequences are engineered to contain a Kozak consensus sequence (ACCAUGG).
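- Whether an engineered UTR carries the motifs named above can be checked with a plain string search. The sketch below looks for the Kozak consensus (ACCAUGG) and reports the spacing between each Shine-Dalgarno motif (AGGAGGU) and the next AUG. The spacing definition (nucleotides between the end of the motif and the first base of the AUG) is an assumption made for this example, and the test sequences are hypothetical.

```python
KOZAK = "ACCAUGG"
SHINE_DALGARNO = "AGGAGGU"


def to_rna(seq):
    """Normalize a DNA or RNA string to uppercase RNA letters."""
    return seq.upper().replace("T", "U")


def has_kozak(seq):
    """True if the Kozak consensus occurs anywhere in the sequence."""
    return KOZAK in to_rna(seq)


def sd_spacings(seq):
    """Spacing (nt) from the end of each Shine-Dalgarno motif to the next AUG."""
    rna = to_rna(seq)
    spacings = []
    start = rna.find(SHINE_DALGARNO)
    while start != -1:
        end = start + len(SHINE_DALGARNO)
        aug = rna.find("AUG", end)
        if aug != -1:
            spacings.append(aug - end)
        start = rna.find(SHINE_DALGARNO, start + 1)
    return spacings


# Hypothetical leader sequences used only to exercise the functions.
prokaryotic_leader = "GGAGGAGGUAACAAUAUGGCU"
eukaryotic_leader = "gccaccatggcg"

print("SD-to-AUG spacings:", sd_spacings(prokaryotic_leader))
print("within 3-10 nt:", all(3 <= s <= 10 for s in sd_spacings(prokaryotic_leader)))
print("Kozak present:", has_kozak(eukaryotic_leader))
```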
- the one or more of the 5’-UTR sequences (or nucleic acid molecules each comprising a 5’-UTR sequence) may comprise a synthetic sequence (i.e., a sequence that is not found in nature).
- one or more of the 5’-UTR sequences may comprise an endogenous 5’-UTR sequence (i.e., a 5’- UTR sequence that is used in nature to recruit ribosome complexes and initiate translation of a transcript).
- an endogenous 5’-UTR sequence may be part of a mRNA expressed in a cell or population of cells.
- the cells in the population of cells may be the same type of cell (e.g., HEK-293 cells, PC3 cells, or muscle cells).
- the population of cells may comprise different cell types (e.g., HEK-293 cells, PC3 cells, and muscle cells).
- the length of the 5’-UTR sequences may vary. For example, in some embodiments, at least two of the 5’-UTR sequences have different lengths. In some embodiments, at least two of the 5’-UTR sequences have the same length. In some embodiments, each of the 5’-UTR sequences have the same length. In some embodiments, the length of at least one of the 5’-UTR sequences in the initial chimeric molecule is 3, 4, 5, 6, 7, 8, 9, or 10 base pairs in length.
- the length of at least one of the 5’-UTR sequences is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 950, at least 1000, at least 1500, at least 2000, or at least 3000 base pairs in length.
- the length of each of the 5’-UTR sequences is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 950, at least 1000, at least 1500, at least 2000, or at least 3000 base pairs in length.
- Where the chimeric molecule comprises one or more coronavirus 5’-UTR and/or 3’-UTR sequences, the length of at least one UTR sequence is increased to a length of interest by adding nucleotides to one or both ends (e.g., by adding repeats of a motif that does not have known secondary structure). Nucleotides may be added to the 5' end, the 3' end, or both the 5' and 3' ends of a 5’-UTR and/or 3’-UTR sequence. In some embodiments, the length of one or more 5’- or 3’-UTR sequences is decreased to a length of interest by removing nucleotides from one or both ends. Nucleotides may be removed from the 5' end, the 3' end, or both the 5' and 3' ends of a 5’-UTR sequence.
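- The length adjustment described above, padding a UTR with repeats of a motif or trimming it to a length of interest, can be sketched as a single helper. The filler motif "CAA" and the function name are hypothetical choices made for this example; the disclosure does not name a specific motif, and this sketch adjusts only one end per call.

```python
def adjust_length(utr, target, filler="CAA", end="5"):
    """Pad (with repeats of `filler`) or trim `utr` to `target` nt at the chosen end.

    end="5" modifies the 5' end (left side); end="3" modifies the 3' end.
    """
    if target < 0:
        raise ValueError("target length must be non-negative")
    if end not in ("5", "3"):
        raise ValueError("end must be '5' or '3'")
    if not filler:
        raise ValueError("filler motif must not be empty")
    if len(utr) >= target:
        # Trim: drop nucleotides from the chosen end.
        return utr[len(utr) - target:] if end == "5" else utr[:target]
    missing = target - len(utr)
    pad = (filler * (missing // len(filler) + 1))[:missing]
    return pad + utr if end == "5" else utr + pad


print(adjust_length("ACGU", 10, end="5"))      # padded at the 5' end
print(adjust_length("ACGUACGU", 5, end="3"))   # trimmed at the 3' end
```

Calling the helper twice, once per end, would cover the case in which nucleotides are added to or removed from both ends.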
- the UTR sequences comprise one or more mutations.
- the mutations may be introduced using a genetic algorithm. Examples of genetic algorithms are known to those having skill in the art. See e.g., Scrucca, L. GA: A Package for Genetic Algorithms in R. J. Stat. Softw. (2015). doi:10.18637/jss.v053.i04.
- The number of mutations introduced into each of the UTR sequences may vary. In some embodiments, at least one UTR sequence is mutated at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotide positions.
- a mutation may comprise a base pair substitution, a deletion, or an insertion.
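- The three mutation types named above (substitution, deletion and insertion) can be expressed as simple string operations, and a seeded random substitution step shows how a chosen number of positions might be mutated. This is an illustrative sketch with hypothetical function names; it implements only mutation operators, not the genetic algorithm referenced above, which would additionally need selection and scoring steps.

```python
import random


def substitute(seq, pos, base):
    """Replace the nucleotide at 0-based position `pos` with `base`."""
    return seq[:pos] + base + seq[pos + 1:]


def delete(seq, pos, length=1):
    """Remove `length` nucleotides starting at 0-based position `pos`."""
    return seq[:pos] + seq[pos + length:]


def insert(seq, pos, bases):
    """Insert `bases` before 0-based position `pos`."""
    return seq[:pos] + bases + seq[pos:]


def random_substitutions(seq, n, rng, alphabet="ACGU"):
    """Substitute exactly `n` distinct positions, each with a different base."""
    out = list(seq)
    for pos in rng.sample(range(len(seq)), n):
        out[pos] = rng.choice([b for b in alphabet if b != out[pos]])
    return "".join(out)


utr = "ACGUACGUAC"  # hypothetical UTR fragment
mutant = random_substitutions(utr, 3, random.Random(0))
changed = sum(1 for a, b in zip(utr, mutant) if a != b)
print(utr, "->", mutant, f"({changed} positions changed)")
```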
- The UTRs comprise one or more chemically modified nucleotides.
- Chemically modified nucleotides of RNA molecules can be found, for example, in Genes VI, Chapter 9 (“Interpreting the Genetic Code”), Lewis, ed. (1997, Oxford University Press, New York), and Modification and Editing of RNA, Grosjean and Benne, eds. (1998, ASM Press, Washington D.C.).
- Modified RNA components include the following: 2'-O-methylcytidine; N4-methylcytidine; N4-2'-O-dimethylcytidine; N4-acetylcytidine; 5-methylcytidine; 5,2'-O-dimethylcytidine; 5-hydroxymethylcytidine; 5-formylcytidine; 2'-O-methyl-5-formylcytidine; 3-methylcytidine; 2-thiocytidine; lysidine; 2'-O-methyluridine; 2-thiouridine; 2-thio-2'-O-methyluridine; 3,2'-O-dimethyluridine; 3-(3-amino-3-carboxypropyl)uridine; 4-thiouridine; ribosylthymine; 5,2'-O-dimethyluridine; 5-methyl-2-thiouridine; 5-hydroxyuridine; 5-methoxyuridine; uridine 5-oxyacetic acid; uridine 5-oxyacetic acid ur
- the UTR is a synthetic oligonucleotide.
- the synthetic oligonucleotide comprises a modified nucleotide.
- Modification of the inter-nucleoside linker i.e. backbone
- inter-nucleoside linker modifications prevent or reduce degradation by cellular nucleases, thus improving the pharmacokinetics and bioavailability of the UTR.
- a modified inter-nucleoside linker includes any linker, other than phosphodiester (PO) linkers, that covalently couples two nucleosides together.
- the modified inter-nucleoside linker increases the nuclease resistance of the UTR compared to a phosphodiester linker.
- the inter-nucleoside linker includes phosphate groups creating a phosphodiester bond between adjacent nucleosides.
- the UTR comprises one or more inter-nucleoside linkers modified from the natural phosphodiester.
- inter-nucleoside linkers of the UTR are modified.
- the inter-nucleoside linkage comprises sulfur (S), such as a phosphorothioate inter-nucleoside linkage.
- a modified nucleoside includes the introduction of one or more modifications of the sugar moiety or the nucleobase moiety.
- the UTRs, as described, comprise one or more nucleosides comprising a modified sugar moiety, wherein the modified sugar moiety is a modification of the sugar moiety when compared to the ribose sugar moiety found in deoxyribonucleic acid (DNA) and RNA.
- DNA deoxyribonucleic acid
- RNA ribonucleic acid
- Numerous nucleosides with modification of the ribose sugar moiety can be utilized, primarily with the aim of improving certain properties of oligonucleotides, such as affinity and/or stability.
- Such modifications include those where the ribose ring structure is modified. These modifications include replacement with a hexose ring (HNA), a bicyclic ring having a biradical bridge between the C2 and C4 carbons on the ribose ring (e.g. locked nucleic acids (LNA)), or an unlinked ribose ring which typically lacks a bond between the C2 and C3 carbons (e.g. UNA).
- HNA hexose ring
- LNA locked nucleic acids
- UNA unlinked ribose ring which typically lacks a bond between the C2 and C3 carbons
- Other sugar modified nucleosides include, for example, bicyclohexose nucleic acids or tricyclic nucleic acids. Modified nucleosides also include nucleosides where the sugar moiety is replaced with a non-sugar moiety, for example in the case of peptide nucleic acids (PNA), or
- Sugar modifications also include modifications made by altering the substituent groups on the ribose ring to groups other than hydrogen, or the 2'-OH group naturally found in DNA and RNA nucleosides. Substituents may, for example be introduced at the 2', 3', 4' or 5' positions.
- Nucleosides with modified sugar moieties also include 2' modified nucleosides, such as 2' substituted nucleosides. Indeed, much effort has been devoted to developing 2' substituted nucleosides, and numerous 2' substituted nucleosides have been found to have beneficial properties when incorporated into oligonucleotides, such as enhanced nuclease resistance and enhanced affinity.
- a 2' sugar modified nucleoside is a nucleoside that has a substituent other than H or -OH at the 2' position (2' substituted nucleoside) or comprises a 2' linked biradical, and includes 2' substituted nucleosides and LNA (2'-4' biradical bridged) nucleosides.
- 2' substituted modified nucleosides include 2'-O-alkyl-RNA, 2'-O-methyl-RNA, 2'-alkoxy-RNA, 2'-O-methoxyethyl-RNA (MOE), 2'-amino-DNA, 2'-Fluoro-RNA, and 2'-F-ANA nucleosides.
- the modification in the ribose group comprises a modification at the 2' position of the ribose group.
- the modification at the 2' position of the ribose group is selected from the group consisting of 2'-O-methyl, 2'-fluoro, 2'-deoxy, and 2'-O-(2-methoxyethyl).
- the UTRs comprise one or more modified sugars.
- the gRNA comprises only modified sugars. In certain embodiments, the gRNA comprises greater than 10%, 25%, 50%, 75%, or 90% modified sugars.
- the modified sugar is a bicyclic sugar. In some embodiments, the modified sugar comprises a 2'-O-methoxyethyl group. In some embodiments, the UTR comprises both inter-nucleoside linker modifications and nucleoside modifications.
- the chimeric molecule comprises an internal ribosome entry site (IRES).
- IRES is an RNA element that allows for translation initiation in an end-independent manner.
- the IRES is in the 5' UTR. In other embodiments, the IRES may be outside the 5' UTR.
- the chimeric molecule for use in enhancing the expression and production of a desired biomolecule comprises one or more short peptide domains and one or more UTRs.
- the chimeric molecule comprises one or more peptide domains.
- the one or more peptide domains comprise from about five amino acids to about twenty amino acids.
- the one or more peptide domains comprise about seven amino acids.
- the synthetic peptide tag comprises an amino acid sequence having at least about 70% (such as at least about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or greater) sequence identity to the sequence: QPRFAAA (SEQ ID NO: 1).
- the one or more peptide domains comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptide domains comprise an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptide domains comprise the amino acid sequence QPRFAAA (SEQ ID NO:
- the chimeric molecule comprises one or more peptide domains comprising an amino acid sequence of Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
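The sequence-identity thresholds and the Xn-QPRFAAA-Xn layout can be checked with a short sketch. It assumes an ungapped, position-by-position comparison of equal-length peptides, which is a simplification; alignment-based tools may report identity differently. The one-letter codes U (selenocysteine) and O (pyrrolysine) are standard, modified amino acids are not represented, and the helper names are hypothetical.

```python
import re

QA_TAG = "QPRFAAA"  # SEQ ID NO: 1 as given in the text

# 20 standard residues plus U (selenocysteine) and O (pyrrolysine).
RESIDUES = "ACDEFGHIKLMNPQRSTVWYUO"
FLANKED_TAG = re.compile(rf"^[{RESIDUES}]*{QA_TAG}[{RESIDUES}]*$")


def percent_identity(candidate: str, reference: str = QA_TAG) -> float:
    """Ungapped identity between two equal-length peptides, in percent."""
    if len(candidate) != len(reference):
        raise ValueError("this sketch only compares equal-length peptides")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)


def meets_identity(candidate: str, threshold: float = 70.0) -> bool:
    """True if the candidate reaches the given identity threshold to the Qa tag."""
    return percent_identity(candidate) >= threshold


def has_flanked_tag(peptide: str) -> bool:
    """True if the peptide has the form Xn-QPRFAAA-Xn with n >= 0 on each side."""
    return FLANKED_TAG.match(peptide) is not None


if __name__ == "__main__":
    for pep in ("QPRFAAA", "QPRFAGG", "QPRFGGG"):
        print(pep, round(percent_identity(pep), 1), meets_identity(pep))
```

Under this simplification a seven-residue peptide needs at least five matching positions to reach 70% identity, since four of seven is about 57%.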
- the one or more peptide domains comprise one or more non-natural amino acids or modified amino acids.
- modified amino acids include amino acids that have been phosphorylated, acetylated, glycosylated, carboxylated, hydroxylated, sulfated, and the like.
- non-natural amino acids include D-amino acids, homo amino acids, N-methyl amino acids, alpha-methyl amino acids, beta (homo) amino acids, gamma amino acids, helix/turn stabilizing motifs, backbone modifications (e.g. peptoids).
- amino acids that are contemplated include hydroxyproline (Hyp), beta-alanine, citrulline (Cit), ornithine (Orn), norleucine (Nle), 3-nitrotyrosine, nitroarginine, pyroglutamic acid (Pyr).
- a fusion protein or chimeric molecule of the present disclosure (e.g., a peptide domain and/or UTR sequence associated with a biomolecule such as a protein) is obtained by associating a peptide tag with a target protein (also referred to as a fusion protein of a tag and a target protein).
- One or more chimeric molecules may be bound to the N-terminus of the target protein, to the C-terminus of the target protein, or to both the N-terminus and the C-terminus of the target protein, or one or more chimeric molecules may be inserted into an internal region of the tagged protein.
- the one or more chimeric molecules may be directly bound to the N-terminus and / or the C-terminus of the target protein or may be bound through a sequence of 1 to several amino acids (for example, 1 to 10 amino acids).
- the sequence of 1 to several amino acids may be any sequence as long as the sequence does not adversely affect the function or the expression level of the chimeric molecule-target protein.
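The tag placement options above (N-terminus, C-terminus, or both, joined directly or through a short linker) can be sketched as simple string assembly. The function name `build_fusion`, the example target `MKT`, and the `GS` linker are hypothetical illustrations; the 10-residue linker cap mirrors the "1 to 10 amino acids" example in the text, and internal insertion is not covered.

```python
def build_fusion(target: str, tag: str = "QPRFAAA", n_term: bool = False,
                 c_term: bool = True, linker: str = "") -> str:
    """Assemble a tagged protein sequence from one-letter amino acid codes.

    The tag is attached at the N-terminus, the C-terminus, or both, either
    directly (empty linker) or through a linker of up to 10 residues.
    """
    if not (n_term or c_term):
        raise ValueError("select at least one terminus for the tag")
    if len(linker) > 10:
        raise ValueError("linker longer than the 10-residue example range")

    parts = []
    if n_term:
        parts.extend([tag, linker])
    parts.append(target)
    if c_term:
        parts.extend([linker, tag])
    return "".join(parts)


if __name__ == "__main__":
    # Hypothetical three-residue target used only for illustration.
    print(build_fusion("MKT", linker="GS"))
    print(build_fusion("MKT", n_term=True, c_term=True))
```

The sketch treats sequences as plain strings and does not consider initiator methionine handling or whether a given linker affects function or expression, which the text leaves to be determined case by case.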
- the chimeric molecules may be isolated from the target protein after expression and purification by using a protease recognition sequence.
- At least one or more chimeric molecules are associated with one or more biomolecules of interest.
- biomolecules include cytokines, growth factors, viral antigens, tumor antigens, other antigens, polynucleotides, oligonucleotides, hormones, enzymes, checkpoint proteins, antibodies, transcription factors, receptors, ligands, immunoglobulins, immunoglobulin fragments, fluorescent proteins, etc.
- the length of the biomolecule, e.g. peptide of interest may vary as long as the amount of the targeted biomolecule, e.g. a peptide produced is significantly increased when expressed in the form of a fusion peptide/chimeric molecule.
- examples of enzymes include lipase, protease, steroid synthesizing enzyme, kinase, phosphatase, xylanase, esterase, methylase, demethylase, oxidase, reductase, cellulase, aromatase, Carnauba, transglutaminase, glycosidase, and chitinase.
- Growth factors include, for example, epidermal growth factor (EGF), insulin-like growth factor (IGF), transforming growth factor (TGF), nerve growth factor (NGF), brain derived neurotrophic factor (BDNF), vascular endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), granulocyte macrophage colony stimulating factor (GM-CSF), platelet derived growth factor (PDGF), erythropoietin (EPO), thrombopoietin, fibroblast growth factor (FGF), hepatocyte growth factor (HGF).
- hormones include insulin, glucagon, somatostatin, growth hormone, parathyroid hormone, prolactin, leptin and calcitonin.
- cytokines include interleukin, interferon (IFN alpha, IFN beta, IFN gamma), tumor necrosis factor (TNF).
- Blood proteins include, for example, thrombin, serum albumin, Factor VII, Factor X, tissue plasminogen activator.
- Antibody proteins include, for example, F(ab')2, Fc, Fc fusion protein, heavy chain (H chain), light chain (L chain), single-chain Fv (scFv), sc(Fv)2, disulfide-linked Fv (sdFv), Diabodies.
- Immune checkpoint proteins are well known in the art and include, without limitation, CTLA-4, PD-1, VISTA, B7-H2, B7-H3, PD-L1, B7-H4, B7-H6, 2B4, ICOS, HVEM, PD-L2, CD160, gp49B, PIR-B, KIR family receptors, TIM-1, TIM-3, TIM-4, LAG-3, BTLA, SIRPalpha (CD47), CD48, 2B4 (CD244), B7.1, B7.2, ILT-2, ILT-4, TIGIT, and A2aR.
- Antigens may be appropriately selected depending on the subject of the immunological response, for example, a protein derived from a pathogenic bacterium, or a protein derived from a pathogenic virus.
- the chimeric molecules may be combined with a secretory signal peptide functioning in the host cell for secretory production.
- the secretory signal peptide can be exemplified by an invertase secretion signal.
- the secretory signal is obtained from two or more different sources.
- Various sources include, for example, Bacillus species, Lactococcus lactis , Streptomyces, or Corynebacterium .
- Other signal sequences include, for example, human IL-2, human chymotrypsin, human interferon gamma, etc.
- the chimeric molecules may be fused to a transport signal peptide such as an endoplasmic reticulum retention signal peptide or a liquid phase transition signal peptide for expression in a specific cell compartment.
- the chimeric biomolecules can be chemically synthesized or can be genetically produced.
- the DNA of the present disclosure is characterized by including nucleic acids encoding the chimeric molecule of the present disclosure.
- the DNA of the present disclosure may contain an enhancer sequence or the like functioning in the host cell so as to improve the expression in the host cell.
- examples of the enhancer include the Kozak sequence and the 5'-untranslated region of the plant-derived alcohol dehydrogenase gene.
- Genetic constructs or vectors comprise a nucleotide sequence that encodes a desired protein operably linked to regulatory elements needed for gene expression. Accordingly, incorporation of the DNA or RNA molecule into a living cell results in the expression of the DNA or RNA encoding the desired protein and thus, production of the desired protein.
- the chimeric molecules of the present disclosure can be produced by a general genetic engineering technique, for example, by using a recombinant vector encoding the chimeric molecule.
- the recombinant vector of the present disclosure is not particularly limited as long as the nucleic acid sequence encoding the chimeric molecule is inserted into the vector so that it can be expressed in a host cell into which the vector is introduced.
- the vector is not particularly limited as long as it is replicable in the host cell, and examples thereof include plasmid DNA and viral DNA.
- the regulatory elements necessary for gene expression of a DNA molecule include: a promoter, an initiation codon, a stop codon, and a polyadenylation signal.
- enhancers are often required for gene expression. It is necessary that these elements be operably linked to the sequence that encodes the desired proteins and that the regulatory elements are operable in the individual to whom they are administered.
- Initiation codons and stop codons are generally considered to be part of a nucleotide sequence that encodes the desired protein. However, it is necessary that these elements are functional in the individual to whom the gene construct is administered. The initiation and termination codons must be in frame with the coding sequence.
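The in-frame requirement can be illustrated with a small check on a DNA coding sequence. This sketch assumes the standard ATG start codon and the three standard stop codons; the function name `is_in_frame` and the example sequence (one possible codon choice encoding M-QPRFAAA) are hypothetical and not taken from the disclosure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons (DNA alphabet)


def is_in_frame(cds: str, start_codon: str = "ATG") -> bool:
    """Check that a coding sequence starts with the start codon, ends with a
    stop codon in the same reading frame, and has no earlier in-frame stop."""
    cds = cds.upper()
    if len(cds) < 6 or len(cds) % 3 != 0:
        return False
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[0] != start_codon or codons[-1] not in STOP_CODONS:
        return False
    # A premature in-frame stop codon would truncate the desired protein.
    return not any(codon in STOP_CODONS for codon in codons[1:-1])


if __name__ == "__main__":
    # ATG + one possible encoding of QPRFAAA + TAA (illustrative codon choice).
    example = "ATG" + "CAGCCGCGTTTTGCGGCGGCG" + "TAA"
    print(is_in_frame(example))
```

A check like this is typically run after a tag or linker is inserted, because an insertion whose length is not a multiple of three shifts the downstream reading frame.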
- the molecule that encodes a desired protein may be DNA or RNA which comprises a nucleotide sequence that encodes the desired protein. These molecules may be cDNA, genomic DNA, synthesized DNA or a hybrid thereof or an RNA molecule such as mRNA. Accordingly, as used herein, the terms “DNA construct”, “genetic construct”, “nucleotide sequence”, and “nucleic acid” are meant to refer to both DNA and RNA molecules.
- the genetic construct which includes the nucleotide sequence encoding the desired protein operably linked to the regulatory elements may remain present in the cell as a functioning extrachromosomal molecule or it may integrate into the cell's chromosomal DNA.
- DNA may be introduced into cells where it remains as separate genetic material in the form of a plasmid.
- linear DNA which can integrate into the chromosome may be introduced into the cell.
- reagents which promote DNA integration into chromosomes may be added.
- DNA sequences which are useful to promote integration may also be included in the DNA molecule.
- RNA may be administered to the cell. It is also contemplated to provide the genetic construct as a linear minichromosome including a centromere, telomeres and an origin of replication.
- the present disclosure includes a vector comprising one or more cassettes comprising: a UTR, biomolecule, peptide tag domain, e.g. Qa tag (SEQ ID NO: 1).
- the vector can be any vector that is known in the art and is suitable for expressing the desired expression cassette.
- a number of vectors are known or can be designed to be capable of mediating transfer of gene products to mammalian cells, as is known in the art and described herein.
- a vector refers to a nucleic acid polynucleotide to be delivered to a host cell, either in vitro or in vivo.
- one or more cassettes are provided on a single vector.
- cassettes are provided on two or more vectors.
- cassettes are provided by one or more vectors comprising an isolated nucleic acid encoding one or more elements of a gene editing system.
- the cassettes are provided by one or more vectors comprising an isolated nucleic acid encoding one or more components comprising: a UTR(s), biomolecule(s), peptide tag(s).
- the expression of natural or synthetic nucleic acids encoding a RNA and/or peptide is typically achieved by operably linking a nucleic acid encoding the RNA and/or peptide or portions thereof to a promoter and incorporating the construct into an expression vector.
- the vectors to be used are suitable for replication and, optionally, integration in eukaryotic cells. Typical vectors contain transcription and translation terminators, initiation sequences, and promoters useful for regulation of the expression of the desired nucleic acid sequence.
- the isolated nucleic acids of the disclosure can be cloned into a number of types of vectors.
- the nucleic acid can be cloned into a vector including, but not limited to a plasmid, a phagemid, a phage derivative, an animal virus, and a cosmid.
- Vectors of particular interest include expression vectors, replication vectors, probe generation vectors, and sequencing vectors.
- the vector also includes conventional control elements which are operably linked to the transgene in a manner which permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus comprising a nucleic acid comprising the described cassettes or compositions.
- operably linked sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest.
- Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product.
- polyA polyadenylation
- a great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and can be utilized.
- promoters typically contain functional elements downstream of the start site as well.
- the spacing between promoter elements frequently is flexible, so that promoter function is preserved when elements are inverted or moved relative to one another.
- tk thymidine kinase
- in the thymidine kinase (tk) promoter, the spacing between promoter elements can be increased to 50 bp apart before activity begins to decline.
- individual elements can function either cooperatively or independently to activate transcription.
- selection of promoters can readily be accomplished. In certain aspects, one would use a high expression promoter. Promoters and polyadenylation signals used must be functional within the cells of the individual.
- the promoter used in the vector may be appropriately selected depending on the host cell into which the vector is introduced. For example, when expressed in yeast, the GAL1 promoter, the PGK1 promoter, the TEF1 promoter, the ADH1 promoter, the TPI1 promoter, the PYK1 promoter and the like can be used. When expressed in plants, Cauliflower Mosaic Virus 35S promoter, rice actin promoter, corn ubiquitin promoter, lettuce ubiquitin promoter, and the like can be used.
- When expressed in Escherichia coli, T7 promoter and the like can be used. In the case of expression in Brevibacillus, P2 promoter and P22 promoter and the like can be mentioned.
- Inducible promoters include, for example, lac, tac and trc, which are inducible by IPTG; trp, which can be induced by IAA; ara, which can be induced by L-arabinose; Pzt-1, which can be induced by using tetracycline; the λ PL promoter, which is inducible at high temperature (42 °C); and the promoter of the cspA gene, which is one of the cold shock genes.
- promoters useful in the production of a genetic vaccine for humans include but are not limited to promoters from Simian Virus 40 (SV40), Mouse Mammary Tumor Virus (MMTV) promoter, Human Immunodeficiency Virus (HIV) such as the HIV Long Terminal Repeat (LTR) promoter, Moloney virus, ALV, Cytomegalovirus (CMV) such as the CMV immediate early promoter, Epstein Barr Virus (EBV), Rous Sarcoma Virus (RSV), as well as promoters from human genes such as human Actin, human Myosin, human Hemoglobin, human muscle creatine and human metallothionein.
- Simian Virus 40 SV40
- MMTV Mouse Mammary Tumor Virus
- HIV Human Immunodeficiency Virus
- LTR HIV Long Terminal Repeat
- CMV Cytomegalovirus
- EBV Epstein Barr Virus
- RSV Rous Sarcoma Virus
- polyadenylation signals useful to practice the present disclosure, especially in the production of a genetic vaccine for humans include but are not limited to SV40 polyadenylation signals and LTR polyadenylation signals.
- the SV40 polyadenylation signal which is in the pCEP4 plasmid (Invitrogen, San Diego, Calif.), referred to as the SV40 polyadenylation signal, is used.
- a suitable promoter is the CAG promoter or the immediate early cytomegalovirus (CMV) promoter sequence.
- CMV immediate early cytomegalovirus
- This promoter sequence is a strong constitutive promoter sequence capable of driving high levels of expression of any polynucleotide sequence operatively linked thereto.
- the Rous sarcoma virus (RSV) and MMT promoters may also be used.
- Certain proteins can be expressed using their native promoter.
- Other elements that can enhance expression can also be included such as an enhancer or a system that results in high levels of expression such as a tat gene and tar element.
- This cassette can then be inserted into a vector, e.g., a plasmid vector such as pUC19, pUC118, pBR322, or other known plasmid vectors, that includes, for example, an E. coli origin of replication.
- another example of a suitable promoter is Elongation Growth Factor-1α (EF-1α).
- EF-1α Elongation Growth Factor-1α
- other constitutive promoter sequences may also be used, including, but not limited to, the simian virus 40 (SV40) early promoter, mouse mammary tumor virus (MMTV), human immunodeficiency virus (HIV) long terminal repeat (LTR) promoter, MoMuLV promoter, an avian leukemia virus promoter, an Epstein-Barr virus immediate early promoter, a Rous sarcoma virus promoter, as well as human gene promoters such as, but not limited to, the actin promoter, the myosin promoter, the hemoglobin promoter, and the creatine kinase promoter.
- SV40 simian virus 40
- MMTV mouse mammary tumor virus
- HIV human immunodeficiency virus
- LTR long terminal repeat
- inducible promoters are also contemplated as part of the disclosure.
- the use of an inducible promoter provides a molecular switch capable of turning on expression of the polynucleotide sequence to which it is operatively linked when such expression is desired or turning off the expression when expression is not desired.
- inducible promoters include, but are not limited to, a metallothionein promoter, a glucocorticoid promoter, a progesterone promoter, and a tetracycline promoter.
- Enhancer sequences found on a vector also regulate expression of the gene contained therein.
- enhancers are bound by protein factors to enhance the transcription of a gene.
- enhancers are located upstream or downstream of the gene they regulate.
- enhancers may also be tissue-specific, enhancing transcription in a specific cell or tissue type.
- the vector of the present disclosure comprises one or more enhancers to boost transcription of the gene present within the vector.
- to assess the expression of the nucleic acid and/or protein, the expression vector to be introduced into a cell can also contain either a selectable marker gene or a reporter gene or both to facilitate identification and selection of expressing cells from the population of cells sought to be transfected or infected through viral vectors.
- the selectable marker is carried on a separate piece of DNA and used in a co-transfection procedure.
- Both selectable markers and reporter genes can be flanked with appropriate regulatory sequences to enable expression in the host cells.
- Useful selectable markers include, for example, antibiotic-resistance genes, such as neo and the like.
- a terminator sequence may also be included depending on the host cell.
- the recombinant vector of the present disclosure can be produced, for example, by digesting a DNA construct with a suitable restriction enzyme, or adding a restriction enzyme site by PCR, and inserting the construct into a restriction enzyme site or a multicloning site of the vector.
- the host cell used for transformation may be eukaryotic cells or prokaryotic cells, preferably eukaryotic cells.
- as eukaryotic cells, yeast cells, mammalian cells, plant cells, insect cells and the like are used.
- yeasts include Saccharomyces cerevisiae, Candida utilis, Schizosaccharomyces pombe, Pichia pastoris, and the like.
- microorganisms such as Aspergillus may be used.
- prokaryotic cells include Escherichia coli, Lactobacillus, Bacillus, Brevibacillus, Agrobacterium tumefaciens, actinomycetes and the like.
- Plant cells include plant cells belonging to Asteraceae, Solanaceae, Brassicaceae, Rosaceae, Chenopodiaceae, etc., such as Lactuca.
- the transformant used in the present disclosure can be produced by introducing the recombinant vector of the present disclosure into a host cell using a general genetic engineering technique. For example, an electroporation method (Tada, et al., 1990, Theor. Appl. Genet, 80: 475), a protoplast method (Gene, 39, 281-286 (1985)), a polyethylene glycol method, 1993, Transgenic, Res. 2: 218, Hiei, et al., 1994), Agrobacterium-mediated transformation (Hood et al., 1991, Theor.
- gene expression may be transient expression or stable expression in which the gene is inserted into a chromosome.
- the transformants can be selected according to the phenotype of the selection marker.
- the tagged protein can be produced by culturing the selected transformant.
- the culture medium and conditions used for the culture can be appropriately selected depending on the species of the transformant.
- When the host cell is a plant cell, the plant cell can be regenerated by culturing the selected plant cell by a conventional method, and the tag-added protein can be accumulated in the plant cell or outside the cell membrane of the plant cell.
- Tagged biomolecules which have accumulated in the cells can be separated and purified according to methods well known to those skilled in the art, for example, salting out, ethanol precipitation, ultrafiltration, gel filtration chromatography, ion exchange column chromatography, affinity chromatography, medium pressure liquid chromatography, reversed phase chromatography, or hydrophobic chromatography.
- Example 1 Tagging for enhancing protein expression/secretion
- an LV pCDH-nCoV-E-Flag vector (Zhang et al., 2020), which expresses the SARS-CoV-2 structural envelope (E) protein, was selected as the backbone for dual reporter cloning.
- NEBuilder HiFi cloning was performed via NotI/ApaI sites using two fragments derived by PCR using the primers Flag-Not-gdLuc-F and gdLuc-P2A-R for the gdLuc fragment and P2A-dsGFP-F and dsGFP-PCR-Apa-R for the dsGFP fragment.
- CAG promoter exhibited higher activity than CMV promoter as previously demonstrated (Dou et al., 2021; Zhang et al., 2020).
- Qa tagging further enhanced CAG-driven gene expression of viral proteins.
- Qa induced stronger enhancement of viral E and S protein expression in the presence of the stronger CAG promoter (5- to 6-fold) (FIGS. 3A-3C).
- the enhancement by Qa in CAG-driven NSP16 expression reached up to 212-fold (FIG. 3C).
- S protein is very important for vaccine development, pseudovirion production, and drug discovery; however, its expression is the most difficult among the viral proteins of SARS-CoV-2 (Boson et al., 2020; Hu et al., 2020; Ou et al., 2020; Walls et al., 2020; Wang et al., 2020; Zhang et al., 2020).
- the native 5’-UTR was added upstream of Qa-tagged S protein in the pCAG expression vector, which shows 6-fold higher expression than pcDNA6B vector (FIG. 3C).
- Pseudotyped virus has been widely used for not only gene delivery but also vaccine production, antibody neutralization, cellular entry, and pathogenic mechanisms.
- Pseudovirion is an excellent alternative for high-risk viruses that require BSL3 facilities for working with live viruses, such as SARS-CoV-2 and its variants (Korber et al., 2020; Muik et al., 2021; Nie et al., 2020; Walls et al., 2020; Weissman et al., 2021; Wibmer et al., 2021a).
- A pseudovirion is a virus-like particle coated with viral surface or membrane proteins that harbor specific cellular tropism (Kuzmina et al., 2021; Walls et al., 2020; Wibmer et al., 2021a). Virus-like particles pseudotyped with S protein will have better immune responses than individual viral proteins due to similarity of three-dimensional structure to live virus (Kuzmina et al., 2021; Walls et al.,
- SARS-CoV-2 S protein has been widely used to generate S pseudovirion but the packaging efficiency for lentivirus-like (LVLP) or VSV-like particles (VSVLP) has been very low in most reports, even when using the codon-optimized C-terminal deletion S protein (Korber et al., 2020; Muik et al., 2021; Ou et al., 2020; Walls et al., 2020). Given that Qa tagging enhances S protein production in mammalian cells, it was speculated that Qa could enhance the packaging efficiency of S pseudotyped LVLP (S-LVLP).
- S-LVLP S pseudotyped LVLP
- the presence of Qa tag significantly increased the production of viral protein S from the transfected functional mRNAs in a time-dependent and dose-dependent manner.
- Such enhancement is universally applicable to the mRNAs of other viral proteins N, E, and ORF3 as well as the host cellular gene ACE2 (FIGS. 5B, 5C).
- Addition of 5’-UTR significantly increased the mRNA-dependent translation of Qa-tagged viral S protein (FIG. 5D) as well as viral E protein and cellular ACE2 (FIG. 5E), consistent with the cDNA expression vector (FIGS. 3D-3I).
- Qa tagging and native 5’-UTR inclusion on a target mRNA significantly increased mRNA stability and translational efficiency and thus enhanced the protein expression/production of the targeted mRNA (e.g., S protein mRNA vaccine).
- the therapeutics based on effective monoclonal antibody requires optimization of antibody production in a suitable cell culture platform, which relies on high performance expression vectors.
- Various genetic elements in monoclonal antibody production vectors have been widely modified.
- the human anti-SARS-CoV monoclonal antibody (Bei, CR3022) was used as a test platform.
- the Qa tag was cloned into the C-terminus of the immunoglobulin heavy and light chain (H/L) of CR3022, which contains variable regions of heavy and light chains derived from human anti-SARS-CoV mAb (GenBank: DQ168569 and DQ168570, respectively), to generate Qa-tagged HQ and LQ (FIG. 6A).
- the HQ and LQ were co-transfected into HEK293T cells to generate Qa-tagged monoclonal antibody, using the original H and L vectors (NR52399 and NR52400) as a control.
- the supernatants containing the monoclonal antibody were collected at 2-3 days after transfection and their levels were measured using sandwich ELISA with SARS-CoV-2 S protein as the coating antigen (FIGS. 6B, 6C). It was found that Qa tagging enhanced the antibody production by up to 37-fold with or without the normalization for transfection efficiency (FIG. 6D). The enhancing efficiency varied with the experimental conditions (cell density, transfection efficiency, and ELISA variations), averaging 13-fold (FIG. 6E). Western blot analysis of the supernatant validated the Qa enhancement of antibody production (FIG. 6F). These data provide evidence that Qa tagging induces a robust enhancement of antibody production (secretion).
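A fold-enhancement figure of the kind reported above can be computed as the ratio of tagged to untagged signal, optionally dividing each signal by its transfection efficiency first. The sketch below assumes that simple ratio; the exact normalization used in the experiments is not stated in this section, and the function name and all numbers in the usage lines are made-up placeholders rather than measured data.

```python
from statistics import mean


def fold_enhancement(tagged_signal: float, control_signal: float,
                     tagged_eff: float = 1.0, control_eff: float = 1.0) -> float:
    """Ratio of tagged to control signal, each divided by its transfection
    efficiency (leave both efficiencies at 1.0 for the unnormalized ratio)."""
    if control_signal <= 0 or tagged_eff <= 0 or control_eff <= 0:
        raise ValueError("control signal and efficiencies must be positive")
    return (tagged_signal / tagged_eff) / (control_signal / control_eff)


if __name__ == "__main__":
    # Placeholder ELISA readings and efficiencies, for illustration only.
    experiments = [
        {"tagged": 2.4, "control": 0.3, "tagged_eff": 0.8, "control_eff": 0.8},
        {"tagged": 1.5, "control": 0.1, "tagged_eff": 0.6, "control_eff": 0.5},
    ]
    folds = [
        fold_enhancement(e["tagged"], e["control"], e["tagged_eff"], e["control_eff"])
        for e in experiments
    ]
    print("per-experiment fold:", folds)
    print("average fold:", mean(folds))
```

Averaging per-experiment ratios, as done here, weights each experiment equally regardless of its absolute signal level; other summary choices are possible.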
- Viral gene therapy has been extensively studied and actively applied to clinical diseases.
- AAV and LV are the most promising strategies for viral gene therapy.
- viral packaging efficiency is also a rate-limiting factor in developing genome editing and therapeutics.
- the level of mRNA from LV transfer vector could affect the LV packaging efficiency. It was hypothesized that Qa tagging in the LV transfer vector would enhance the efficiency of LV packaging and gene delivery if Qa tagging increases the mRNA level of the transgene during the packaging.
- the LV transfer vectors pRRL-E-LG and pRRL-E-QLG were compared for standard LV packaging (psPax2 and VSV-G).
- Qa tagging increased the production of the transgene reporter gdLuc from the transfer vector (FIG. 6G), similar to the enhancing efficiency in the transfected cells without LV packaging (FIG. 2H).
- Qa tagging on the transfer vector only had a marginal effect on the packaging efficiency i.e., the titer of packaged LV (data not shown).
- Qa tagging enhances secretion of targeted proteins.
- Qa tagging increased the expression of various types of targeted proteins.
- When Western blot analysis was performed using the cell lysates to confirm the enhancing effect of Qa tagging on E dual reporter protein expression, it was unexpectedly found that the E-Flag-gdLuc protein level in the cell lysates was remarkably reduced in the Qa tagging group (FIG. 7A), even though the gdLuc activity in the supernatant was robustly increased by Qa tagging (FIGS. 1A-1F). In the presence of the 5’-UTR, the reduction in CAG-driven E-Flag-gdLuc expression level was more robust in the cell lysate (FIG. 7B).
- Protein and peptide tags have been extensively employed for protein labeling/detection and affinity purification (DeCaprio and Kohl, 2019; Katayama et al., 2021; Lee et al., 2020; Mishra, 2020; Peighambardoust et al., 2021; Pina et al., 2021; Traenkle et al., 2020).
- the fusion of peptide tags with targeted proteins allows detection by immunostaining and immunoblotting with corresponding highly specific antibodies both in vitro and in vivo.
- Novel “spaghetti monster” fluorescent protein (smFP) technology with tandem tags dramatically enhances the sensitivity of tagged protein detection (Viswanathan et al., 2015).
- tags can be also used for protein purification by immunoprecipitation and/or affinity chromatography. Some tags may enhance the yield of protein purification by extending protein half-life or rendering protein soluble (Bhagawati et al., 2019; Han et al., 2020; Li, 2011; Saribas et al, 2018). For some cases, tagging may influence the activity or function of the targeted proteins (Majorek et al., 2014). For example, N-terminal tagging on PI3KCA increases kinase activity while C-terminal tagging affects membrane binding activity (Vasan et al., 2019).
- the N-terminal secretory signal peptide of the gdLuc not only determines its inherent secretory property but also regulates the protein folding and functional activity (Gaur et al., 2017).
- the C-terminal or N-terminal amino acid composition could regulate the protein expression (Cambray et al., 2018; Weber et al., 2020). Modification of the C-terminal endoplasmic reticulum targeting peptide on the gdLuc significantly improves its intracellular retention (Gaur et al., 2017).
- Some peptides such as PEST (Shumway et al., 1999) or KFERQ (Dong et al., 2020; Park et al., 2016) fused or endogenously contained in the target proteins mark the proteins for proteolysis or degradation.
- RNA sequence encoding the Qa peptide has secondary structure that may directly regulate the mRNA stability of the targeted protein (Boo and Kim, 2020). It would be interesting to determine whether synonymous substitution of the Qa peptide influences the expression/production of tagged proteins. Whether the amino acid sequence of the Qa tag directly binds to poly-A or the 3’-UTR, and which residues contribute to mRNA stabilization and translation enhancement, needs to be determined. For protein secretion, the Qa tag does not function like a secretion peptide, because Qa tagging on a non-secretory protein, i.e., firefly luciferase, does not change the background luciferase activity in the culture media, which likely exists due to partial cell death.
- Qa tagging on secretory proteins such as S protein, antibody, IFNγ and IL-2 robustly enhanced their production yields. This is very important for the industrial application of these secretory proteins.
- S protein for mRNA vaccine would be released more from the vaccinated cells in the presence of Qa tag, which not only reduces the mRNA amount for each vaccination but also promotes the immune response due to higher level of secretory S protein.
- the UTRs at both ends of a viral genome or host cellular mRNA are important in regulating the transcription and translation efficiency (Berkhout et al., 2011; Hinnebusch et al., 2016; Raman and Brian, 2005; Senanayake and Brian, 1999; Williams et al., 1999).
- the 5’-UTR of coronaviruses regulates translational rate via ribosomal scanning (Berkhout et al., 2011; Hinnebusch et al., 2016; Shirokikh et al., 2019; Zhang et al., 2015).
- a synthetic (non-viral) 5’-UTR has been used to enhance the translation of SARS-CoV-2 S mRNA in both Pfizer and Moderna vaccines.
- the native UTRs of SARS-CoV-2 are highly conserved and play key roles in viral RNA replication and transcription of the genomic and subgenomic viral transcripts (Baldassarre et al., 2020; Yang and Leibowitz, 2015).
- native 5’-UTR is assumed to enhance accumulation of viral protein.
- solid evidence is provided that the native (natural) 5’- and 3’-UTRs of SARS-CoV-2 enhanced the production of viral E-LG fusion protein.
- the native 5’-UTR served as a universal regulator in enhancing not only viral proteins but also many non-viral cellular proteins. It was hypothesized that this potent UTR could be used in enhancing any protein, particularly for virus packaging systems. For example, it was observed that UTR-Sdl8Q increased the packaging efficiency of S-pseudotyped LVLP or VSVLP. UTR in the viral transfer vector enhanced the lentivirus production. The native UTR would also enhance the AAV packaging and transduction efficiency.
- This study identified the combination of Qa tagging and SARS-CoV-2 native UTR as a novel strategy to enhance the production of any targeted gene/protein of interest. For industry applications, this strategy will reduce the cost of many widely used products and facilitate their availability. Since it enhanced the production of all tested viral proteins of SARS-CoV-2, an immediate use of this method would be the enhancement of vaccine production for the urgent need to fight COVID-19.
- the studies herein demonstrated at least a 200-fold enhancement in the efficiency of the S mRNA vaccine. This is extremely important to expedite mRNA vaccine availability when producing new mRNA vaccines against SARS-CoV-2 variants or any other emerging viruses. This strategy can be easily incorporated into the DNA vaccine vector.
- Another immediate industry value of the methods herein is to enhance antibody production.
- Qa tagging at the C-terminus of the immunoglobulin heavy and light chain variable regions robustly enhanced the antibody secretion by up to 37-fold (average 13-fold).
- Qa tagging in the middle of the targeted protein shows much stronger enhancing efficiency
- optimization of Qa tagging in different regions of the targeted antibody heavy and light chains is expected to achieve higher levels of antibody production enhancement.
- Enhancing the production yield of viruses or pseudotyped viruses is also invaluable in the fields of gene therapy and biomedical research.
- Pseudotyped viruses have facilitated the research on high-risk viruses that require BSL3 facilities.
- Pseudoviruses of SARS-CoV-2 S protein or its variants have been extensively utilized for evaluation of neutralizing antibodies and vaccination as well as mechanistic and functional studies (Donofrio et al., 2021; Korber et al., 2020; Muik et al., 2021; Ou et al., 2020; Wibmer et al., 2021b).
- the bottleneck for generation of S pseudovirions is the limited packaging efficiency for LVLP or VSV-like particles (Korber et al.
- the Qa tagging would facilitate the yield of protein expression, such as insulin, interferon, interleukin, cytokines, and growth factors. Even an enhancement of only a few fold would reduce production expenses and expedite clinical applications.
- Qa tagging via a novel CRISPR/Cas gene knockin strategy could be used to facilitate the expression of loss-of-function genes, particularly in haploinsufficient mutagenic diseases such as Angelman syndrome, Pitt-Hopkins syndrome, and others.
- Qa tagging enhancement of dominant genes may improve phenotype of organisms, particularly in agriculture applications.
- this novel Qa tag can be used as a general tag in a similar way to other peptide tags such as Flag, Myc, HA, Ollas, C7, and T7 for protein tracing, protein purification, immunostaining, and Western blotting.
- the Qa tagging can enhance the labeling intensity of the endogenous proteins due to its enhancing property. This is very important for neural network tracing.
- this study reports a novel peptide tag consisting of a specified short amino acid sequence (7 amino acids) that can be utilized for enhancing production of the tagged proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies, and engineered recombinant proteins in a cell in vitro, ex vivo, and in vivo.
- This novel and universal peptide tag would facilitate protein expression and secretion. It would be invaluable to perform library screening for this master Qa tag to discover optimal peptides that maximize the protein expression/production/secretion.
- This study also reports the exceptionally potent efficiency of SARS-CoV-2 native 5’-UTR in boosting the protein expression/production. Combining Qa tagging with the native 5’-UTR offers a synergistic boosting on the production of viral and non-viral proteins. All these strategies are invaluable in biopharmaceutical development, immunological/vaccine industry, and biological therapeutics.
- HEK293T, HeLa and BHK cell lines were cultured according to standard protocols.
- Dual reporter vectors The dual reporter LG fragment, encoding Gaussia-Dura luciferase (gdLuc) and destabilized GFP (dsGFP), was generated by overlay PCR: 1) Standard PCR was performed to generate fragment 1 (gdLuc) from template plasmid pMCS-Gaussia-Dura-Luciferase (Thermo Fisher Scientific, Cat#16190) with primer pair T1290/T1291, while fragment 2 (dsGFP) was generated from plasmid pLenti-EFS-EGFPd2PEST-2A-MCS-Hygro (TP1380, a gift from Neville Sanjana, Addgene Cat# 138152) with T1292/T1293; 2) The two purified fragments (100 ng each), sharing 19 overlaid nucleotides, were mixed for 8 cycles of PCR; 3) the PCR product at 1:100 dilution was used as template for 28 cycles of standard PCR with primer pair T1292/T1293 to generate the LG fragment.
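- The fusion step of the overlay PCR described above can be pictured in silico. The following Python sketch is offered only as an illustration: it joins two fragments through a shared terminal overlap. The function names and the toy sequences are hypothetical and are not the actual gdLuc or dsGFP sequences or primers.

```python
def overlap_length(left: str, right: str, min_len: int = 1) -> int:
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    if min_len < 1:
        raise ValueError("min_len must be at least 1")
    for size in range(min(len(left), len(right)), min_len - 1, -1):
        if left[-size:] == right[:size]:
            return size
    return 0  # no shared end of at least min_len nucleotides


def fuse_fragments(left: str, right: str, min_len: int = 1) -> str:
    """Join two fragments, writing the shared overlap only once."""
    size = overlap_length(left, right, min_len)
    if size == 0:
        raise ValueError("fragments do not share a terminal overlap")
    return left + right[size:]


# Toy sequences (hypothetical); the shared 19-nt stretch mimics the overlap
# between fragment 1 and fragment 2.
OVERLAP = "GGTGGCGGAGGATCTGGTA"
fragment_1 = "ATGGGAGTCAAAGTTCTG" + OVERLAP
fragment_2 = OVERLAP + "GTGAGCAAGGGCGAGGAG"

fused = fuse_fragments(fragment_1, fragment_2, min_len=19)
print(len(OVERLAP), len(fused))
```

Requiring `min_len=19` in the call makes the sketch fail loudly if the two fragments do not share the full overlap, which is a convenient sanity check before ordering primers.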
- this LG fragment (1485 bp) was cloned into pcDNA6B-nCoV-x-Flag vector encoding various viral proteins of SARS-CoV-2 or cellular gene hACE2 as listed in Key Resource Table (Zhang et al., 2020) via SacII cloning site using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520S) to generate pcDNA6B-SARS-CoV-2-x-Flag-LG vectors as listed in Table.
- the “x” indicates the gene of interest.
- the insert fragment encoding SARS-CoV2 N protein from pcDNA6B-nCoV-N-Flag vector (TP1431) or hACE2 from pcDNA6B-hACE2-Flag vector (TP1470) was cloned into TP1479 via KpnI/XbaI sites to generate pcDNA6B-SARS-CoV2-N-Flag-QLG (TP1490) or pcDNA6B-hACE2-Flag-QLG (TP1491).
- the pcDNA6B-NIBP-Flag-LG (TP1560) vector was generated by NEB-HiFi cloning of NIBP PCR product from pYX-Asc-mNIBP (Genbank # BC070463) into pcDNA6B-hACE2-Flag-LG (TP1540) via NotI/XbaI, while the pcDNA6B-NIBP-Flag-QLG (TP1558) was generated by NEB-HiFi cloning of NIBP PCR product into pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479) via XhoI/XbaI.
- the pCAG vectors encoding E, S and NSP16 were generated by replacing the CMV promoter in corresponding pcDNA6B-SARS-CoV2-x-Flag-LG or -QLG vectors with CAG promoter via SnaBI/KpnI sites.
- UTR containing vectors The DNA fragment containing 5’-UTR-E-Flag-Qa-3’-UTR, designed according to the public SARS-CoV-2 sequence, was synthesized by Synbio Technologies and cloned into the pCAG-Flag vector via EcoRV/AgeI sites using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520).
- This vector pCAG-UTR-E-Flag-Qa-UTR (TP1583) was digested with SnaBI/EcoRV (both blunt end) to remove the CAG promoter, and re-ligation generated the pUTR-E-Flag-Qa-UTR vector (TP1585).
- the 3’-UTR of pCAG-UTR-E-Flag-Qa-UTR was removed by NotI digestion and ligation to generate pCAG-UTR-E-Flag-Qa (TP1584) with an additional 37 amino acids in the open reading frame.
- the pCAG-UTR-S-Flag-QLG vector (TP1586) was generated by replacing the E-Flag-Qa-UTR fragment with the S-Flag-QLG fragment from pCAG-S-Flag-QLG vector (TP1518) via XhoI/AgeI sites.
- the pCAG-UTR-Sdl8-Q (TP1595) was generated by NEB HiFi cloning via EcoRI sites of pCAG-Sdl8-Q vector (TP1506) with PCR product 5’-UTR from pUTR-E-Flag-UTR vector (TP1585).
- UTR-containing pcDNA6B vectors were generated by restriction cloning via KpnI/XhoI to transfer the 5’-UTR from pCAG-UTR-E-Flag-Qa-UTR into corresponding pcDNA6B vectors such as pcDNA6B-S-QLG (TP1487), pcDNA6B-E-Flag-QLG (TP1479), pcDNA6B-ORF3-Flag-QLG (TP1483).
- Antibody vectors The plasmid set CR3022 for pFUSEss-CHIg-hG1-SARS-CoV2-mAb (NR-52399, TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV2-mAb (NR-52400, TP1566) expressing the heavy (H) and light (L) chains of human anti-SARS-CoV mAb respectively (GenBank: DQ168569 and DQ168570) were produced under HHSN272201400008C and obtained from BEI Resources, NIAID, NIH (Cat# NR-53260).
- the Qa-tagged HQ (TP1574) and LQ (TP1571) vectors were generated from H or L plasmids at the NheI site using NEBuilder® HiFi DNA Assembly cloning kit with the synthesized oligonucleotides that contain the Qa-encoding sequence and the C-terminus of the immunoglobulin heavy and light chain (see Table S1 for sequences).
- Lentiviral vector The pRRLSIN.cPPT.PGK-GFP.WPRE (TP792), a gift from Didier Trono (Addgene #12252), was used to generate pRRL-E-Flag-LG-GFP (TP1577) by transferring E-Flag-LG insert from TP1478 to TP792 via BamHI/AgeI.
- the pRRL-E-Flag-LG (TP1578) was generated from TP1577 by AgeI/KpnI blunt ligation.
- the pRRL-E-Flag-QLG (TP1579) was generated by transferring E-Flag-QLG from TP1479 to TP1578 via BamHI/BstBI.
- TP1578 and TP1579 were used as the backbone vectors for NEB-HiFi cloning of human IFNγ and IL2 PCR products via XbaI site to generate pRRL-IFNy-LG (TP1604) or QLG (TP1605) and pRRL-IL2-LG (TP1606) or QLG (TP1607).
- the PCR fragments of IFNγ and IL2 were derived, respectively, from pUC8-IFNy (a gift from Howard Young, Addgene #17600) and pAIP-hIL2-co (a gift from Jeremy Luban, Addgene #90513) using primer pairs T1407/T1408 and T1409/T1410 as listed in Table S1.
- the pRRL-UTR-Flag-LG (TP1621) and pRRL-UTR-Flag-QLG (TP1622) were generated respectively by NEB-HiFi cloning of 5’-UTR PCR products from TP1583 into TP1578 and TP1579 via XbaI.
- the pRRL-Flag-LG (TP1685) and pRRL-Flag-QLG (TP1686) were generated respectively from TP1621 and TP1622 via BsmBI/XbaI digestion to remove UTR and NEB HiFi cloning with oligonucleotide insert (T1469) to correct the ATG site in ORF.
- the LV packaging vector psPAX2-Gag-Q (TP1618) was generated from psPAX2 (TP592, a gift from Didier Trono, Addgene #12260) via SphI/EcoRV sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template.
- the psPAX2-Pol-Q-RRE-Q (TP1619) was generated from psPAX2 via SwaI/NheI sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1400/T1401 and T1402/T1403 using psPAX2 as PCR template.
- the psPAX2-Gag-Q-Pol-Q-RRE-Q (TP1620) was generated from TP1619 via SphI/EcoRV sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template.
- S-pseudoviral vectors The vector pCAG-SARS-CoV2-Sdl8Q (TP1506) encoding human codon-optimized S gene of SARS-CoV2 with C-terminal 18 amino acids deletion (Sdl8) and Qa tag fusion was constructed using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520S).
- the Sdl8 expression cassette in the CMV-driven vector pcDNA3.1-SARS2-S was transferred to a CAG-driven vector pCAG-Flag-SARS-CoV2-S (gift from Peihui Wang) via EcoRV/NotI sites and PCR with primer pair T1323/T1324.
- the vector pCAG-SARS-CoV2-Sdl8 (TP1567) encoding Sdl8 without Qa tag was constructed via NEB HiFi cloning with synthesized oligonucleotide insert T1367 at the SacII/NotI site of pCAG-SARS-CoV2-Sdl8Q vector.
- the pCAG-UTR-Sdl8Q (TP1595) vector was generated as described above.
- Plasmid DNA purification and DNA quantification Plasmid DNAs were purified using commercial kits for endotoxin-free minipreps (Cat# REF 740490) or midipreps (Cat# REF 740420) from Macherey-Nagel (Germany).
- the E. coli bacterial cultures (5 ml for miniprep, 200 ml for midiprep) harboring relevant plasmids were grown overnight in LB or 2YT media supplemented with 100 µg/ml Carbenicillin at 30°C for NEB-stable or 37°C for DH5alpha E. coli cells.
- the bacterial cultures were harvested by centrifugation, and the resulting pellets were processed to purify plasmid DNA according to the manufacturer’s guidelines.
- the final DNA was dissolved in ultra-pure distilled water and DNA concentrations were determined either using Nanodrop 1 UV-Vis Spectrophotometer (Thermo-Fisher) or in a Take3 plate using Bio-Tek multiplate reader.
- HEK293T human fetal kidney and HeLa human cervix epithelial cells were obtained from ATCC (http://www.atcc.org). Both cell lines were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM, Gibco) supplemented with Fetal Bovine Serum (FBS) and 1% Penicillin/Streptomycin (Corning). BHK-21/WI-2 cells (EH1011, Kerafast, Boston, MA, USA) were grown in DMEM supplemented with 5% FBS and 1% Penicillin/Streptomycin. All cells were incubated at 37°C under a 5% CO2 atmosphere.
- 96-well plate was used.
- 24-well plate was used.
- Cells resuspended in DMEM plus 10% FBS were seeded (3-4 × 10⁴ cells/well for 96-well plate or 1-2 × 10⁵ cells/well for 24-well plate) the night before the transfections.
- Transporter 5 transfection reagent (TP5)
- 50-100 ng plasmid DNA per well for 96-well plate was mixed with 0.2-0.4 µl TP5 in 0.9% NaCl solution and incubated at room temperature for 20 min.
- the transfection reagent and DNA solution were mixed again and added to each well dropwise.
- the transfections were incubated at 37°C in 5% CO2 overnight (16-18 h), after which the media was replaced with DMEM plus 10% FBS.
- RNA Extraction and Reverse Transcription Quantitative PCR for mRNA stability assay
- HEK293T cells were transfected with indicated vectors (500 ng/well for 24-well plate) for 24 h before treatment with the transcriptional inhibitor actinomycin D (10 µM) for various periods.
- Total RNA was extracted using Monarch Total RNA Miniprep Kit (NEB, Cat# T2010) that includes two steps of DNA removal. Equal amounts of RNA (0.5 µg) were used to synthesize cDNA using High Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific, Cat# 4368814) with random hexanucleotide primers. Real time PCR analysis was carried out on QuantStudio™ 3 System.
- the mRNA expression levels of reporter gdLuc luciferase and human β-actin were determined using iTaq Universal SYBR Green Supermix kit (BioRad, Cat# 1725121).
- the sequences for gdLuc primers are (forward) 5’-GATTACAAGGATGACGACGATAAG-3’ (SEQ ID NO: 2) (T1364 targeting Flag) and (reverse) 5’-AAGTCTTCGTTGTTCTCGGTGGG-3’ (SEQ ID NO: 3) (T432 targeting gdLuc).
- Human β-actin primers are (forward) 5’-AAGAGCTATGAGCTGCCTGA-3’ (SEQ ID NO: 4) and (reverse) 5’-TACGGATGTCAACGTCACAC-3’ (SEQ ID NO: 5). Each sample was tested in triplicate. Cycle threshold (Ct) values were obtained graphically for reporter and β-actin. The difference in Ct values between the reporter and β-actin was represented as the ΔCt value. The ΔΔCt values were obtained by subtracting the ΔCt values of the control samples from those of the samples at different time points. Relative percentage change in gene expression was calculated as 2^(-ΔΔCt). The mRNA decay rate was calculated by non-linear regression curve fitting (one phase decay) using GraphPad Prism 9.1. Three independent experiments were performed.
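- The ΔΔCt arithmetic and the decay estimate can be illustrated with a short Python sketch. All Ct values below are invented for illustration, the function names are hypothetical, and the decay constant is estimated by a simple log-linear least-squares fit that assumes decay to a zero plateau; this is a simplification and not the non-linear one-phase-decay fit performed in GraphPad Prism.

```python
import math


def delta_ct(ct_reporter: float, ct_reference: float) -> float:
    """ΔCt: reporter Ct minus reference (β-actin) Ct."""
    return ct_reporter - ct_reference


def relative_level(dct_sample: float, dct_control: float) -> float:
    """Relative expression 2^(-ΔΔCt), with ΔΔCt = ΔCt(sample) - ΔCt(control)."""
    return 2.0 ** (-(dct_sample - dct_control))


def fit_decay(times_h, remaining):
    """Fit ln(remaining) = ln(N0) - k*t by least squares; return (k, half_life)."""
    logs = [math.log(v) for v in remaining]
    n = len(times_h)
    t_mean = sum(times_h) / n
    y_mean = sum(logs) / n
    sxx = sum((t - t_mean) ** 2 for t in times_h)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_h, logs))
    k = -sxy / sxx
    return k, math.log(2) / k


# Hypothetical Ct values after actinomycin D addition (not measured data).
times_h = [0, 2, 4, 8]
ct_reporter = [20.0, 21.0, 22.0, 24.0]
ct_actin = [18.0, 18.0, 18.0, 18.0]

dcts = [delta_ct(r, a) for r, a in zip(ct_reporter, ct_actin)]
levels = [relative_level(d, dcts[0]) for d in dcts]  # time 0 is the control
k, half_life = fit_decay(times_h, levels)
print(levels, k, half_life)
```

In this toy series each 1-cycle rise in ΔCt halves the remaining mRNA, so the fitted half-life reflects how quickly the reporter Ct drifts away from the reference Ct.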
- the Coelenterazine (CTZ) substrate (Cat # 3032, Nanolight Technology) was dissolved in 10 ml ultra-sterile distilled water to make the stock solutions and kept at -20°C until use.
- the CTZ stock solution was diluted 10-30 times to make working solutions.
- Equal amounts of CTZ working solution and cell culture media (25-50 µl) after transfection were mixed in a Corning (CLS3922) white opaque 96-well optiplate, and the luminescence was measured in a BioTek Synergy LX multiplate reader.
- the ONE-Glo Luciferase assay kit (Promega Corp, Cat # E6110) was used.
- For pcDNA6B vectors containing the T7 promoter, the DNA was linearized by AgeI digestion followed by gel purification.
- the primers included the T7 promoter (TTAATACGACTCACTATAGGGTGGAATTCTGCAGATATCCAG (SEQ ID NO: 6), T1427), generating a DNA fragment containing 5’-UTR, target gene, LG or QLG dual reporter and a poly(A) tail.
- PCR was performed using Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher Scientific, F531). The DNA was purified using a gel extraction kit, and the concentration was determined using a Take3 plate in a Bio-Tek multiplate reader.
- RNA was synthesized from the purified DNA template using HiScribe™ T7 ARCA mRNA Kit (New England Biolabs, Cat#E2060), cotranscriptionally capped with m7G anti-reverse cap analog (ARCA, Cat#1411), and poly(A)-tailed.
- the synthesized RNA was purified using Monarch RNA cleanup kit (New England Biolabs, Cat#E2040) and quantified with a Take3 plate. Equal amounts of RNA between LG and QLG groups at different dosages were used for transfection into HEK293T cells in quadruplicate with Lipofectamine® MessengerMAX mRNA Transfection Reagent (Thermo Fisher Scientific, Cat#LMRNA015) following the manufacturer’s manual.
- the culture media containing gdLuc were collected, and gdLuc assay was performed as above.
- the recombinant lentivirus carrying the indicated lentiviral vector was produced on a small scale using the second-generation LV packaging system according to standard protocols. Briefly, HEK293T cells in one well of a 6-well plate were cotransfected by TP5 kit with the indicated transfer LV vector (1.4 µg), the packaging vector psPAX2 or its mutants (1 µg) and VSV-G or Sdl8 vector (0.4 µg). At 2-3 days post-transfection, the supernatants containing LV were concentrated and purified with simplified 10% sucrose purification as described previously.
- the functional titers of the crude and purified lentivirus were determined by counting GFP-expressing HEK293T cells at 48 h after infection with serial dilutions of lentiviruses under fluorescent microscopy. For some cases, flow cytometry or RT-qPCR analysis were used for LV titration.
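- A functional titer is commonly derived from such counts as transducing units (TU) per ml. The Python sketch below is one conventional way to do the arithmetic, not necessarily the exact calculation used in this study; the function name, the example numbers, and the assumption that each GFP-positive cell arose from a single transducing unit are all illustrative.

```python
def functional_titer_tu_per_ml(gfp_positive_cells: int,
                               dilution_factor: float,
                               inoculum_ul: float) -> float:
    """TU/ml = GFP-positive cells x dilution factor / inoculum volume (ml).

    Assumes every GFP-positive cell results from one transducing unit,
    which is only reasonable at dilutions where few cells are infected.
    """
    if inoculum_ul <= 0:
        raise ValueError("inoculum volume must be positive")
    return gfp_positive_cells * dilution_factor * 1000.0 / inoculum_ul


# Hypothetical example: 50 GFP-positive cells counted in a well that
# received 100 µl of a 1:1000 dilution of the virus stock.
titer = functional_titer_tu_per_ml(50, 1000, 100)
print(f"{titer:.2e} TU/ml")
```

Counts from several serial dilutions can be converted in the same way and compared; dilutions that give widely different titers usually indicate that the well was outside the countable range.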
- For RT-qPCR analysis, cell culture medium was collected from infected cells and centrifuged at 2,000 g for 5 min. The supernatant was subjected to viral lysis to extract viral RNA.
- One step RT-qPCR was performed using the qPCR Lentivirus Complete Titration Kit (Applied Biological Materials Inc., Cat No. LV900-S) and the QuantStudio 3 Real-Time PCR System (Applied Biosystems, Cat No. A28567) according to manufacturer protocols. The resulting data was analyzed using QuantStudio Design and Analysis Desktop Software (Applied Biosystems).
- SDS-polyacrylamide gels (10-12%) were home-made, or Mini-PROTEAN TGX gels (Cat# 4561093, 4561096) were purchased from BioRad.
- the cell lysates were prepared using the lysis buffer composed of 50 mM Tris-HCl pH 7.0, 150 mM NaCl, 5 mM EDTA and 1% Triton X-100 supplemented with PMSF (100x), Aprotinin and Leupeptin (200x).
- the 50 µl lysates were prepared from each well after collecting the supernatant.
- the lysates were incubated at 4°C for 20-30 minutes and centrifuged at maximum speed in an Eppendorf Centrifuge.
- the clear lysates were either denatured for 5 minutes at 98°C immediately in 1x SDS-PAGE loading dye or stored at -80°C until use. Supernatants were stored at 4°C until they were treated with 1x SDS-PAGE loading dye. The denatured 10-20 µl aliquots of cell lysates or 20-30 µl supernatants were loaded onto SDS-polyacrylamide gels. The SDS-PAGE was performed in Tris-Glycine/SDS buffers under denaturing and reducing conditions.
- the polyacrylamide gels were transferred to 0.2-µm nitrocellulose membranes (BioRad supported nitrocellulose (NC) membrane, Cat # 162-0097) using either wet transfer or an iBlot®2 device with iBlot®2 NC mini (IB23002) or regular stacks (IB23001).
- 1x transfer buffer: 25 mM Tris-HCl pH 7.6, 192 mM glycine, 20% methanol.
- the gels were sandwiched together with NC membranes and transfers were performed in 1x transfer buffer at 250 mA at 4°C for 1-2 hours.
- Dry Western blot transfers were performed in an iBlot®2 gel transfer device (Invitrogen, Thermo-Fisher, Ref# IB21001) using mini or regular iBlot®2 stacks for 7 min according to the manufacturer’s guidelines.
- the membranes were blocked in 1x TBST buffer containing 5% milk.
- the membranes then were treated with primary antibodies overnight at 4°C or 2 hours at RT.
- the membranes were washed three times with 1x TBST buffer minute each followed by incubation with secondary antibodies.
- the secondary antibodies with infrared tag were diluted 1/10000-1/20000 and incubated with the NC membranes for 45 minutes to an hour.
- the membranes were washed with 1x TBST buffer three times, 5 minutes each, and scanned on a Li-COR Odyssey image analyzer.
- HEK293T cells were cotransfected with the Qa-tagged HQ (TP1574) and LQ (TP1571) at 50 ng/well of 96-well plate in quadruplicates with or without normalization vector pGL4.16-CMV (TP329) or pRRL-E-Flag-LG (TP1578) at 20 ng/well.
- the original antibody plasmids for pFUSEss-CHIg-hG1-SARS-CoV2-mAb (TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV2-mAb (TP1566) were used as the control.
- ELISA was performed using a Human IgG (Total) Uncoated ELISA Kit (Invitrogen, Thermo-Fisher, Cat # 88-50550-88).
- a 96-well Costar ELISA plate (Corning) was first coated with SARS-Cov2-Spike (S) protein from BEI (Cat # NR52724) at 100 pg/well overnight at 4°C. The washing and blocking steps were performed using the buffers and solutions provided in the kit.
- Supernatants containing secreted antibodies were collected from the transfections at 24 and 48 h and kept at 4°C until use. The aliquots of 0.5, 2.5 and 5.0 antibody supernatants were added to each SARS-Cov2-S coated wells.
- HRP horse radish peroxidase
- assay buffer 1/250
- RT room temperature
- the wells were then washed 3 times (400 µl each) using a buffer provided in the kit at RT and treated with 300 µL substrate TMB (3,3’,5,5’-tetramethylbenzidine) for 15 min to develop blue color, and the reactions were terminated with 2 N HCl.
- the yellow color formation was measured at 450 nm using a BioTek microplate reader.
- the level of anti-SARS-CoV monoclonal antibody was quantified by sigmoidal four-parameter logistic (4PL) curve fit using GraphPad Prism 9.1.
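- For readers unfamiliar with the 4PL model, the Python sketch below shows the curve and its inverse, which is what converts a measured absorbance back to a concentration once the four parameters have been fitted. The parameter values and concentration units are hypothetical; in this study the fit itself was performed in GraphPad Prism, and no fitting routine is reproduced here.

```python
def four_pl(x: float, bottom: float, top: float, ec50: float, hill: float) -> float:
    """4PL response: `bottom` at x = 0, approaching `top` at high x."""
    return top + (bottom - top) / (1.0 + (x / ec50) ** hill)


def inverse_four_pl(y: float, bottom: float, top: float, ec50: float, hill: float) -> float:
    """Concentration that gives response `y` on the fitted 4PL curve."""
    low, high = sorted((bottom, top))
    if not low < y < high:
        raise ValueError("response lies outside the range of the fitted curve")
    return ec50 * ((bottom - top) / (y - top) - 1.0) ** (1.0 / hill)


# Hypothetical fitted parameters (absorbance at 450 nm vs. ng/ml standard).
params = dict(bottom=0.05, top=2.0, ec50=10.0, hill=1.2)

od_at_ec50 = four_pl(10.0, **params)      # midpoint between bottom and top
sample_od = four_pl(25.0, **params)       # simulate a reading at 25 ng/ml
recovered = inverse_four_pl(sample_od, **params)
print(od_at_ec50, sample_od, recovered)
```

Readings at or beyond the fitted plateaus cannot be inverted, which is why the sketch raises an error there; in practice such samples are re-assayed at a different dilution.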
- ER-Golgi transport inhibition with Brefeldin A Brefeldin A was dissolved in DMSO to make 1 mg/mL working solution.
- HEK293T cells were transfected with indicated vectors using TP5 transfection reagent in DMEM plus 10% FBS as described above. The transfected cells were incubated overnight, and 10 µg/ml BA was added prior to media change and incubated for 3 hours at 37°C in 5% CO2.
- the culture media was replaced with 293 FreeStyle serum free media (Gibco, Thermo-Fisher, Cat# 12-338-018) with 10 µg/ml BA and incubation was continued for 24 h at 37°C in 5% CO2.
- the supernatants were withdrawn right after media replacement and collected after 24 h.
- the cell lysates were also prepared at 24 h time point. The supernatants and cell lysates were tested for gdLuc activity and Western blot analysis.
- SARS-CoV-2 envelope and membrane proteins modulate maturation and retention of the spike protein, allowing assembly of virus-like particles. J Biol Chem 296, 100111. 10.1074/jbc.RA120.016175.
- SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature 586, 567-571.
- SARS-CoV-2 spike variants exhibit differential infectivity and neutralization resistance to convalescent or post-vaccination sera.
- RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA 26, 937-959. 10.1261/rna.076141.120.
- Wibmer C.K., Ayres, F., Hermanus, T., Madzivhandila, M., Kgagudi, P., Oosthuysen,
- SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med. 10.1038/s41591-021-01285-x.
- Wibmer C.K., Ayres, F., Hermanus, T., Madzivhandila, M., Kgagudi, P., Oosthuysen,
- SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med 27, 622-625. 10.1038/s41591-021-01285-x.
- Example 3 Protein Expression/Secretion Boost By An Expression-Enhancing 21-mer Cis-Regulatory Motif (Exen21)
- The insertion of Exen21/Qa was extended to various types of proteins, and it was found that it could enhance the production of other proteins of SARS-CoV-2, cellular gene products, mRNA vaccines, antibodies, engineered recombinant proteins, and virus-packaging proteins.
- Dual reporter vectors The dual reporter LG fragment, encoding Gaussia-Dura luciferase (gdLuc) and destabilized GFP (dsGFP), was generated by overlay PCR: 1) Standard PCR was performed to generate fragment 1 (gdLuc) from template plasmid pMCS-Gaussia-Dura-Luciferase (Thermo Fisher Scientific, Cat#16190) with primer pair T1290/T1291, while fragment 2 (dsGFP) was generated from plasmid pLenti-EFS-EGFPd2PEST-2A-MCS-Hygro (TP1380, a gift from Neville Sanjana (Addgene Cat# 138152)) with T1292/T1293; 2) The two purified fragments (100 ng each), sharing 19 overlaid nucleotides, were mixed for 5 cycles of PCR with primer pair T1304/T1305 at 98°C 15 sec, 58°C 30 sec and 72°C 1 min followed by 30 cycles of
- this LG fragment (1485 bp) was cloned into pcDNA6B-nCoV-X-Flag vectors encoding various viral proteins of SARS-CoV-2 or cellular gene hACE2 via SacII cloning site using NEBuilder® HiFi DNA Assembly cloning kit (NEB, Cat# E5520S, assigned as NEB-HiFi) to generate pcDNA6B-SARS-CoV-2-X-Flag-LG vectors.
- the “X” indicates the gene of interest.
- The insert fragment encoding SARS-CoV-2 S protein from pcDNA6B-nCoV-S-Flag vector (TP1456) was cloned into TP1479 via XhoI/XbaI sites to generate pcDNA6B-SARS-CoV-2-S-Flag-QLG (TP1487).
- the insert fragment encoding SARS-CoV-2 N protein from pcDNA6B-nCoV-N-Flag vector (TP1431) or hACE2 from pcDNA6B-hACE2-Flag vector (TP1470) was cloned into TP1479 via KpnI/XbaI sites to generate pcDNA6B-SARS-CoV-2-N-Flag-QLG (TP1490) or pcDNA6B-hACE2-Flag-QLG (TP1491).
- the pcDNA6B-NIBP-Flag-LG (TP1560) vector was generated by NEB-HiFi cloning of NIBP PCR product from pYX-Asc-mNIBP (TP546, Genbank # BC070463) with the primers T1375/T1376 into pcDNA6B-hACE2-Flag-LG (TP1538) via NotI/XbaI, while the pcDNA6B-NIBP-Flag-QLG (TP1558) was generated by NEB-HiFi cloning of the NIBP PCR product into pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479) via XhoI/XbaI.
- the pCAG vectors encoding E were generated by replacing the CMV promoter in corresponding pcDNA6B-SARS-CoV-2-E-Flag-LG or -QLG vectors with CAG promoter via SnaBI/KpnI sites.
- Mutation vectors Site-directed or deletion mutagenesis of Exen21/Qa were performed using pcDNA6B-SARS-CoV-2-E-Flag-QLG(TP1479) as atemplate. Mutagenic primers were designed to change or delete specific nucleotides in Exen21 sequence. For each mutation a Phusion High-Fidelity PCR reaction was performed using a universal primer (T1640) matching a region upstream of SARS-CoV-2 E and a mutagenic primer matching Exen21 sequence except for the region a desired mutation introduced. The PCR product which carries the Exen21 mutation was gel purified and cloned into AcoAV/Ao/I-digested 6B-E-QLG DNA using NEBuilder ®HiFi DNA assembly kit.
- Antibody vectors The plasmid set CR3022 for pFUSEss-CHIg-hG1-SARS-CoV-2-mAb (NR-52399, TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV-2-mAb (NR-52400, TP1566), expressing the heavy (H) and light (L) chains of human anti-SARS-CoV mAb respectively (GenBank: DQ168569 and DQ168570), was produced under HHSN272201400008C and obtained from BEI Resources, NIAID, NIH (Cat# NR-53260).
- the Q-tagged HQ (TP1574) and LQ (TP1571) vectors were generated from the H or L plasmids at the NheI site using NEB-HiFi with synthesized oligonucleotides that contain the Q-encoding sequence and the C-terminus of the immunoglobulin heavy and light chains (T1378, T1380-T1383).
- Lentiviral vectors The vector pRRL-SIN.cPPT.PGK-GFP.WPRE (TP792; Addgene #12252) was used to generate pRRL-E-Flag-LG-GFP (TP1577) by transferring the E-Flag-LG insert from TP1478 to TP792 via BamHI/AgeI.
- the pRRL-E-Flag-LG (TP1578) vector was generated from TP1577 by AgeI/KpnI blunt ligation.
- the pRRL-E-Flag-QLG (TP1579) vector was generated by transferring E-Flag-QLG from TP1479 to TP1578 via BamHI/BstBI.
- TP1578 and TP1579 vectors were used as the backbone for NEB-HiFi cloning of human IFNγ and IL2 PCR products via the XbaI site to generate pRRL-IFNγ-LG (TP1604) or QLG (TP1605) and pRRL-IL2-LG (TP1606) or QLG (TP1607).
- the PCR fragments of IFNγ and IL2 were derived, respectively, from pUC8-IFNγ (Addgene #17600) and pAIP-hIL2-co (Addgene #90513) using primer pairs T1407/T1408 and T1409/T1410.
- the pRRL-Flag-LG (TP1685) and pRRL-Flag-QLG (TP1686) vectors were generated respectively from TP1621 and TP1622 via BsmBI/XbaI digestion and NEB-HiFi cloning with an oligonucleotide insert (T1469).
- the pLV-EF1a-spCas9-Q-T2A-RFP (TP1562) was generated from pLV-EF1a-spCas9-T2A-RFP (TP855) at Ariel site using NEB-HiFi cloning with the synthesized oligonucleotide that contains the Q-encoding sequence (T1361).
- the pLV-EF1a-MS2-spCas9-Q-F2A-GFP (TP1552) vector was generated from pLV-EF1a-MS2-spCas9-F2A-GFP (TP1081) at Ariel site using NEB-HiFi cloning with oligonucleotide (T1361).
- the LV packaging vector psPAX2-Gag-Q (TP1618) was generated from psPAX2 (TP592, Addgene #12260) via SphI/EcoRV sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template.
- the psPAX2-Pol-Q-RRE-Q (TP1619) was generated from psPAX2 via SwaI/NheI sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1400/T1401 and T1402/T1403 using psPAX2 as PCR template.
- the psPAX2-Gag-Q-Pol-Q-RRE-Q (TP1620) was generated from TP1619 via SphI/EcoRV sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template.
- S-pseudoviral vectors The vector pCAG-SARS-CoV-2-Sd18Q (TP1506) encoding the human codon-optimized S gene of SARS-CoV-2 with a C-terminal 18 aa deletion (Sd18) and Qa tag fusion was constructed using NEB-HiFi. Briefly, the Sd18 expression cassette in the CMV-driven vector pcDNA3.1-SARS2-S (Addgene Cat# 145032) was transferred to a CAG-driven vector pCAG-Flag-SARS-CoV-2-S (gift from Peihui Wang) via EcoRVNotl sites and PCR with primer pairs T1323/T1324.
- the vector pCAG-SARS-CoV-2-Sd18 (TP1567) encoding Sd18 without the Qa tag was constructed via NEB-HiFi cloning with the synthesized oligonucleotide insert T1367 at the SacII/NotI site of the pCAG-SARS-CoV-2-Sd18Q vector.
- Plasmid DNAs were purified using commercial kits for endotoxin-free minipreps (Cat# REF 740490) or midipreps (Cat# REF 740420) from Macherey-Nagel.
- the E. coli bacterial cultures (4 ml for miniprep, 200 ml for midiprep) harboring relevant plasmids were grown overnight in LB or 2YT media supplemented with 100 µg/ml Carbenicillin, 50 µg/ml Kanamycin, 50 µg/ml blasticidin, or 50 µg/ml Zeocin at 30°C for NEB-stable or 37°C for DH5α E. coli cells.
- the bacterial cultures were harvested by centrifugation, and the resulting pellets were processed to purify plasmid DNA according to the manufacturer’s guidelines.
- the final DNA was dissolved in ultra-pure DNase/RNase-free distilled water (Thermo Fisher, Cat#10977023) and DNA concentrations were determined either using a Nanodrop 1 UV-Vis Spectrophotometer (Thermo-Fisher) or in a Take3 plate using a Bio-Tek multiplate reader.
- HEK293T human fetal kidney and HeLa human cervix epithelial cells were obtained from ATCC (Cat# CRL-3216 and CCL-2). Both cell lines were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM, Gibco) supplemented with Fetal Bovine Serum (FBS) and 1% Penicillin/Streptomycin (Corning). BHK-21/WI-2 cells (Kerafast, EH1011) were grown in DMEM supplemented with 5% FBS and 1% Penicillin/Streptomycin. All cells were incubated in a 37°C incubator under a 5% CO2 atmosphere.
- 96-well plate was used.
- 24-well plate was used.
- Cells resuspended (in DMEM plus 10% FBS) were seeded (3-4x10 ⁇ cells/well for 96-well plate or 1-2x10 ⁇ cells/well for 24-well plate) the night before the transfections.
- Transporter 5 transfection reagent TP5
- 50-100 ng plasmid DNA per well for 96-well plate was mixed with 0.2-0.4 µl TP5 in 0.9% NaCl solution and incubated at room temperature for 20 min.
- the transfection reagent and DNA solution were mixed again and added to each well dropwise.
- the transfections were incubated at 37°C in 5% CO2 overnight (16-18 h), after which the medium was replaced with DMEM plus 10% FBS.
- RNA Extraction and Reverse Transcription Quantitative PCR for mRNA stability assay
- HEK293T cells were transfected with the indicated vectors (500 ng/well for 24-well plate) for 24 h before treatment with the transcriptional inhibitor actinomycin D (10 µM) for various periods.
- Total RNA was extracted using the Monarch Total RNA miniprep Kit (NEB, Cat# T2010), which includes two steps of DNA removal. An equal amount of RNA (0.5 µg) was used to synthesize cDNA using the High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific, Cat# 4368814) with random hexanucleotide primers. Real-time PCR analysis was carried out on a QuantStudio™ 3 System.
- mRNA expression levels of the reporter gdLuc luciferase and human β-actin were determined using the iTaq Universal SYBR Green Supermix kit (BioRad, Cat# 1725121).
- sequences for gdLuc primers are (forward) 5’-
- Human β-actin primers are (forward) 5’-AAGAGCTATGAGCTGCCTGA-3’ and (reverse) 5’-
- Cycle threshold (Ct) values were obtained graphically for the reporter and β-actin. The difference in Ct values between the reporter and β-actin was represented as the ΔCt value.
- the ΔΔCt values were obtained by subtracting the ΔCt values of the control samples from those of the samples at different time points. Relative percentage change in gene expression was calculated as 2^(-ΔΔCt).
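The ΔCt and ΔΔCt arithmetic described above can be sketched in a few lines. This is an illustrative sketch only: the function names and the Ct values are hypothetical and are not taken from the disclosure, and perfect amplification efficiency (a factor of 2 per cycle) is assumed.

```python
def delta_ct(ct_reporter: float, ct_reference: float) -> float:
    """ΔCt = Ct(reporter) - Ct(reference gene, here β-actin)."""
    return ct_reporter - ct_reference


def relative_expression(ct_reporter: float, ct_reference: float,
                        ct_reporter_ctrl: float, ct_reference_ctrl: float) -> float:
    """Return 2^(-ΔΔCt), where ΔΔCt = ΔCt(sample) - ΔCt(control)."""
    ddct = (delta_ct(ct_reporter, ct_reference)
            - delta_ct(ct_reporter_ctrl, ct_reference_ctrl))
    return 2.0 ** (-ddct)


# Hypothetical Ct values: the sample's reporter Ct is 2 cycles later than
# the control's at an unchanged β-actin Ct, so ΔΔCt = 2.
remaining = relative_expression(24.0, 18.0, 22.0, 18.0)
print(f"relative reporter mRNA: {remaining:.2f}")
```

A result of 1.0 means no change relative to the control sample; values below 1.0 indicate loss of reporter mRNA over the actinomycin D time course.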
- the mRNA decay rate was calculated by non-linear regression curve fitting (one phase decay) using GraphPad Prism 9.1. Three independent experiments were performed.
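For readers without GraphPad Prism, a one-phase decay can be approximated as follows. This sketch is not the non-linear regression used in the source: it assumes the plateau is zero and fits ln(y) against time by ordinary least squares, which is a simplification. The function name and the synthetic data are hypothetical.

```python
import math


def fit_one_phase_decay(times_h, fractions):
    """Fit y = y0 * exp(-k * t) by least squares on ln(y).

    Assumes a plateau of zero and strictly positive y values.
    Returns (y0, k, half_life), with half_life = ln(2) / k.
    """
    logs = [math.log(y) for y in fractions]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_l = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(times_h, logs))
    slope = sxy / sxx                      # slope of ln(y) versus t equals -k
    k = -slope
    y0 = math.exp(mean_l - slope * mean_t)
    return y0, k, math.log(2) / k


# Synthetic, noise-free data generated with k = 0.5 per hour (illustration only).
times = [0.0, 1.0, 2.0, 4.0]
values = [math.exp(-0.5 * t) for t in times]
y0, k, half_life = fit_one_phase_decay(times, values)
print(f"k = {k:.3f} per h, half-life = {half_life:.2f} h")
```

With real data that level off above zero, a non-linear fit that includes a plateau term, as Prism's one-phase decay model does, would be the more faithful choice.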
- the Coelenterazine (CTZ) substrate (Nanolight Technology, Cat # 3032) was dissolved in 10 ml ultra-sterile distilled water to make the stock solution and kept at -20°C until use.
- the CTZ stock solution was diluted 10-30 times to make working solutions.
- For the pcDNA6B vector containing the T7 promoter, the DNA was linearized by AgeI digestion followed by gel purification.
- the primers included the T7 promoter (TTAATACGACTCACTATAGGGTGGAATTCTGCAGATATCCAG, T1427), generating a DNA fragment containing the target gene, the LG or QLG dual reporter and a poly(A) tail.
- PCR was performed using the Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher Scientific, F531). The DNA was purified using a gel extraction kit and the concentration determined using a Take3 plate in a BioTek multiplate reader.
- RNA was synthesized from the purified DNA template using the HiScribe™ T7 ARCA mRNA Kit (NEB, Cat#E2060), co-transcriptionally capped with m7G anti-reverse cap analog (ARCA, Cat#1411), and poly(A) tailed.
- the synthesized RNA was purified using the Monarch RNA cleanup kit (NEB, Cat#E2040) and quantified with a Take3 plate. Equal amounts of RNA between the LG and QLG groups at different dosages were used for transfection into HEK293T cells in quadruplicate with Lipofectamine® MessengerMAX mRNA Transfection Reagent (Thermo Fisher Scientific, Cat#LMRNA015) following the manufacturer’s manual.
- the culture media containing gdLuc were collected, and gdLuc assay was performed as above.
- VSV-G or S protein-pseudotyped lentivirus packaging and titration The recombinant lentivirus carrying the indicated lentiviral vector was produced on a small scale using the second-generation LV packaging system according to standard protocols. Briefly, HEK293T cells in one well of a 6-well plate were cotransfected using the TP5 kit with the indicated transfer LV vector (1.4 µg), the packaging vector psPAX2 or its mutants (1 µg) and the VSV-G or Sd18 vector (0.4 µg). At 2-3 days post-transfection, the supernatants containing LV were concentrated and
- the functional titers of the crude and purified lentivirus were determined by counting GFP-expressing HEK293T cells at 48 h after infection with serial dilutions of lentiviruses under fluorescent microscopy. For some cases, flow cytometry analysis was used for LV titration.
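The disclosure does not give the titer formula, so the sketch below uses the conventional calculation for functional titer from reporter-positive cells: transducing units per ml = (cells at infection × GFP-positive fraction × dilution factor) / inoculum volume. The function name, parameter names and example numbers are all hypothetical.

```python
def functional_titer_tu_per_ml(cells_at_infection: float,
                               gfp_positive_fraction: float,
                               dilution_factor: float,
                               inoculum_volume_ml: float) -> float:
    """Conventional functional titer estimate in transducing units (TU) per ml."""
    if not 0.0 <= gfp_positive_fraction <= 1.0:
        raise ValueError("gfp_positive_fraction must be between 0 and 1")
    if dilution_factor <= 0 or inoculum_volume_ml <= 0:
        raise ValueError("dilution_factor and inoculum_volume_ml must be positive")
    return (cells_at_infection * gfp_positive_fraction * dilution_factor
            / inoculum_volume_ml)


# Hypothetical well: 1e5 cells at infection, 10% GFP-positive at 48 h,
# virus diluted 100-fold, 0.5 ml of diluted virus applied.
titer = functional_titer_tu_per_ml(1e5, 0.10, 100, 0.5)
print(f"{titer:.2e} TU/ml")
```

The estimate is usually taken from dilutions where only a modest fraction of cells is GFP-positive, since at high fractions many cells carry more than one integration and the titer is underestimated.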
- SDS-polyacrylamide gels (10-12%) were home-made or mini-PROTEAN TGX gels (Cat# 4561093, 4561096) were purchased from BioRad.
- the cell lysates were prepared using a lysis buffer composed of 50 mM Tris-HCl pH 7.0, 150 mM NaCl, 5 mM EDTA and 1% Triton X-100 supplemented with PMSF (100x), Aprotinin and Leupeptin (200x).
- the 50 µl lysates were prepared from each well after collecting the supernatant. The lysates were incubated at 4°C for 20-30 min and centrifuged at maximum speed in an Eppendorf Centrifuge.
- the clear lysates were either denatured for 5 min at 98°C immediately in 1x SDS-PAGE loading dye or stored at -80°C until use. Supernatants were stored at 4°C until they were treated with 1x SDS-PAGE loading dye. The denatured 10-20 µl aliquots of cell lysates or 20-30 µl supernatants were loaded onto SDS-polyacrylamide gels. The SDS-PAGE was performed in Tris-Glycine/SDS buffers under denaturing and reducing conditions.
- the polyacrylamide gels were transferred to 0.2-µm nitrocellulose membranes (BioRad supported nitrocellulose (NC) membrane, Cat # 162-0097) using either wet transfer or an iBlot®2 device with iBlot®2 NC mini (IB23002) or regular Stacks (IB23001).
- 1x transfer buffer: 25 mM Tris-HCl pH 7.6, 192 mM glycine, 20% methanol.
- the gels were sandwiched together with NC membranes and transfers were performed in 1x transfer buffer at 250 mA at 4°C for 1-2 h.
- Dry western blot transfers were performed in an iBlot®2 gel transfer device (Invitrogen, Thermo-Fisher, Ref# IB21001) using mini or regular iBlot®2 stacks for 7 min according to the manufacturer’s guidelines. After the transfer, the membranes were blocked in 1x TBST buffer containing 5% milk. The membranes were then treated with primary antibodies overnight at 4°C or for 2 h at RT. The membranes were washed three times with 1x TBST buffer minute each followed by incubation with secondary antibodies. The secondary antibodies with infrared tag were diluted 1/10000-1/20000 and incubated with the NC membranes for 45 min to 1 h.
- the membranes were washed with 1x TBST buffer three times, 5 min each, and scanned on a LI-COR Odyssey image analyzer.
- the images were analyzed with NIH ImageJ (1.53 version) densitometric measurements. The data were expressed as integrated density times area and presented as relative fold in comparison with corresponding control.
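Expressing densitometry readings as fold change over a control can be sketched as below. The helper name and the signal values are hypothetical, and the sketch assumes the integrated-density values have already been exported from ImageJ; it simply divides each sample reading by the mean of the control readings.

```python
from statistics import mean


def relative_fold(sample_signals, control_signals):
    """Express each sample signal as a fold change over the mean control signal."""
    baseline = mean(control_signals)
    if baseline == 0:
        raise ValueError("control signal mean is zero; fold change is undefined")
    return [signal / baseline for signal in sample_signals]


# Hypothetical integrated-density readings (arbitrary units).
control = [100.0, 120.0, 80.0]
samples = [250.0, 50.0]
print(relative_fold(samples, control))
```

A value above 1 indicates a stronger band than the control lane average, and a value below 1 indicates a weaker band.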
- HEK293T cells were cotransfected with the Q-tagged HQ (TP1574) and LQ (TP1571) at 50 ng/well of a 96-well plate in quadruplicate with or without the normalization vector pGL4.16-CMV (TP329), which was derived from the promoterless vector pGL4.16 (Promega, Cat#E6711), or pRRL-E-Flag-LG (TP1578) at 20 ng/well.
- the original antibody plasmids pFUSEss-CHIg-hG1-SARS-CoV-2-mAb (TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV-2-mAb (TP1566) were used as the control.
- ELISA was performed using a Human IgG (Total) Uncoated ELISA Kit (Invitrogen, Thermo-Fisher, Cat# 88-50550-88).
- a 96-well Costar ELISA plate (Corning) was first coated with SARS-CoV-2-Spike (S) protein from BEI (Cat # NR52724) at 100 pg/well overnight at 4°C. The washing and blocking steps were performed using the buffers and solutions provided in the kit.
- Supernatants containing secreted antibodies were collected from the transfections at 24 and 48 h and kept at 4°C until use. Aliquots of 0.5, 2.5 and 5.0 µl antibody supernatants were added to each SARS-CoV-2-S coated well.
- HEK293T cells were transfected with the indicated vectors using TP5 transfection reagent in DMEM plus 10% FBS as described above. The transfected cells were incubated overnight, and 10 µg/ml BA was added prior to media change and incubated for 3 h at 37°C in 5% CO2. The culture media was replaced with 293 FreeStyle serum-free media (Gibco, Thermo-Fisher, Cat# 12-338-018) with 10 µg/ml BA and incubation was continued for 24 h at 37°C in 5% CO2. The supernatants were withdrawn right after media replacement and collected after 24 h. The cell lysates were also prepared at the 24 h time point. The supernatants and cell lysates were tested for gdLuc activity and by Western blot analysis.
- a dual reporter system was generated to measure the viral protein expression quantitatively and dynamically.
- Gaussia-Dura luciferase (gdLuc) and destabilized green fluorescent protein (dsGFP) were fused, abbreviated LG, onto the C-terminus of SARS-CoV-2 E protein (FIGS. 8A and 14A-14C).
- E7 exhibited >20-fold higher luciferase activity than E1.
- the E7 DNA sequence was confirmed by Sanger sequencing. Unexpectedly, it was discovered that E7 had an additional 21-nucleotide sequence that encodes 7 amino acids (aa) in frame between the upstream end of LG and the downstream end of the Flag tag. This heptapeptide was designated as Qa based on its aa sequence, and its linked LG was named QLG.
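The correspondence between a 21-nucleotide insert and a 7-residue peptide can be checked by translating the sequence codon by codon. The sketch below uses the 21-mer given as SEQ ID NO: 7 in the Summary and the standard genetic code; it assumes the first nucleotide of that sequence is the first base of a codon, which the text here does not state explicitly. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence read in frame from its first base."""
    seq = dna.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


EXEN21 = "cAACCGCGGTTCGCGGCCGCT"  # SEQ ID NO: 7 as written in the Summary
peptide = translate(EXEN21)
print(len(EXEN21), "nt ->", len(peptide), "aa:", peptide)  # 21 nt -> 7 aa: QPRFAAA
```

Under that frame assumption the translation begins with glutamine (Q) and ends with alanine (A); whether this is the exact Qa sequence of the disclosure depends on the reading frame actually used in the constructs.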
- Exen21/Qa induced stronger boosting of SARS-CoV-2 E protein in the presence of the stronger CAG promoter (FIG. 9F). It was further found that similar boosting of protein expression and production occurred in other cell types including Hela, BHK, and others (FIG. 9G). In addition to being functional in regular plasmids, the Exen21/Qa also exhibited boosting activity in viral transfer vectors such as lentiviral (LV) vectors (FIGS. 9D and 9E).
- Exen21/Qa addition has a broad capability of boosting protein expression/production across various gene products, vectors, mammalian cell types, and species.
- Monoclonal antibody (mAb)-based therapeutics require the optimization of antibody production in suitable cell culture platforms, which relies on high-performance expression vectors. To achieve this, genetic elements in mAb production vectors have been widely modified.
- a human anti-SARS-CoV mAb (BEI, CR3022) was used, which contains the constant regions of heavy and light chains (GenBank: DQ168569, DQ168570, respectively) as a test platform.
- Exen21 was inserted into the C-termini of the immunoglobulin heavy and light chains (H/L) of CR3022 to generate Qa-tagged HQ and LQ (FIG. 10A).
- Pseudotyped virus has been widely used in studies not only for gene delivery, but also for vaccine production, antibody neutralization, cellular entry, and pathogenic exploration.
- Pseudovirions are an excellent alternative to high-risk viruses such as SARS-CoV-2 and its variants, and working with them does not require BSL3 facilities.
- Pseudovirions are virus-like particles (VLPs) coated with viral surface or membrane proteins that harbor specific cellular tropisms. VLPs pseudotyped with SARS-CoV-2 S protein evoke stronger immune responses than any individual viral protein due to their 3-dimensional structures like those of live virus 8 ’ ia u .
- SARS-CoV-2 S protein has been widely used to generate S pseudovirion, but the packaging efficiency for lentivirus-like (LVLP) or vesicular stomatitis virus-like (VSVLP) particles has been low in most reports, even with the codon-optimized C-terminal deletion S protein 5, 6 ’ X 12 . Given the fact that Exen21/Qa addition boosts S protein production in mammalian cells, it was speculated that it might boost the packaging efficiency of S pseudotyped LVLP (S-LVLP). By applying the widely used C-terminal 18 aa-deleted codon-optimized SARS-CoV-2 S protein (Sd18) as a test platform (FIG.
- Viral gene therapy has been extensively studied and actively applied to clinical diseases. Both AAV and LV are the most promising strategies for viral gene therapy, but viral packaging efficiency (production yield) has been a bottleneck. In genome editing by CRISPR/Cas, viral packaging efficiency is also a rate-limiting factor for development of novel therapeutics.
- the level of mRNA supplied by LV transfer vector can affect LV packaging efficiency. It was hypothesized that Exen21 addition in the LV transfer vector can elevate the transgene mRNA levels during packaging and thereby boost the efficiency of LV packaging and gene delivery. This idea was tested by comparing the LV transfer vectors pRRL-E-LG and pRRL-E-QLG for standard LV packaging (psPAX2 and VSV-G).
- After LV infection of HEK293T cells, Exen21 increased production of the transgene reporter gdLuc from the transfer vector (FIG. 11F), like its boosting efficiency in transfected cells without LV packaging (FIGS. 9D, 9E). However, Exen21 addition in the transfer vector only marginally affected packaging efficiency (i.e., the titer of packaged LV; data not shown). Similar changes were observed with LV-spCas9-Q-RFP and LV-MS2-spCas9-Q-GFP (FIGS. 11G and 11H), for which packaging efficiency is usually ⁇ 1% that of standard LV-RFP or LV-GFP.
- One application of Exen21/Qa addition may be the elevation of vaccine yields for the urgent fight against the COVID-19 pandemic.
- the most promising vaccines against SARS-CoV-2 and its variants are derived from mRNA or DNA encoding S protein 13.
- the Exen21/Qa addition increased S protein expression by ~3-24 fold in a CMV-driven cDNA expression vector (FIG. 9A). If such an enhancement of vaccine production were applied at large scale, it would reduce costs and expedite the availability of COVID-19 vaccines.
- Exen21/Qa addition regulates mRNA-dependent translation
- the dynamic changes of translational products were measured after inhibiting transcription with actinomycin D.
- actinomycin D completely blocked the production of viral protein E (FIG. 12D) and ORF3 (FIG. 12E), measured by gdLuc activity.
- the Exen21/Qa addition showed a time-dependent increase of the protein expression and production/accumulation even with the transcriptional inhibition (FIGS. 12D and 12E), providing evidence that the Exen21/Qa addition in the targeted genes facilitates protein expression and production via the posttranscriptional regulation (increased translation efficiency and/or mRNA stability).
- the data indicate that the Exen21 addition in a given target mRNA significantly increases mRNA stability and translational efficiency, thereby boosting protein expression and production from the targeted mRNA (e.g., S protein mRNA vaccine).
- Exen21/Qa addition elevated expression of various types of targeted proteins. In testing whether Exen21/Qa addition boosted E protein dual reporter expression within cells (by Western blot analyses on cell lysates), it was unexpectedly found that E-QL protein levels in the lysates were remarkably reduced rather than increased in the Exen21/Qa group, as detected by Western blot analysis with anti-Flag antibody (FIG. 13A), even though Exen21/Qa addition robustly increased gdLuc activity in culture supernatants (FIG. 8C). Similar reductions by Exen21/Qa addition were found in the corresponding intracellular levels of other viral proteins (S and N) and the host cellular proteins (IFNγ, IL-2, and hACE2) (FIGS.
- the protein secretion was blocked by treatment with the endoplasmic reticulum (ER)-Golgi protein trafficking inhibitor brefeldin A (FIGS. 13F and 18A-18D).
- fLuc non-secretory firefly-luciferase
- Exen21/Qa addition appears to boost expression of the targeted proteins and facilitate their secretion. It was noted that auto-cleavage by the 2A system of most of the targeted proteins was incomplete, varying among different proteins (FIGS. 11C, 11D and 11F), which has been reported by others 14, 15 .
- Exen21 has many features different from SECReTE: (1) No triplet repeats such as NNY or NYN; (2) Unique and exclusive composition/order of the 21 nucleotides; (3) Smaller size (21-mer) than SECReTE (> 30-mer from >10 triplet repeats); and (4) Absence in any cellular or viral genes.
- Exen21/Qa is also quite different from the activity-enhancing motifs that involve promoter enhancer 17-19 or anti-sense activity 20 . The data herein indicated that adding the Exen21 motif to a given mRNA could remarkably enhance the corresponding protein expression and secretion. This was also demonstrated in different types of proteins including viral, nonviral, intracellular, structural, and secretory proteins.
- The protein production-enhancing actions of Exen21/Qa were largely independent of the specific promoter used, among those tested, but it did elicit stronger enhancement of protein production in combination with the stronger CAG promoter (FIGS. 9A-9G).
- the Exen21/Qa addition enhanced mRNA-dependent production of targeted viral and non-viral protein fusion reporters, determined by in vitro RNA transcription and mRNA transfection, followed by dual reporter assays (FIGS. 12A-12G).
- Exen21/Qa enhanced the yield of S-containing pseudoviruses and lentivirus packaging (Fig. 4).
- Exen21/Qa addition increased the release of secreted host proteins, including a robust enhancement of antibody production when Exen21/Qa was placed in antibody heavy and light chains, and augmented the secretion of IFNγ and IL-2.
- Exen21/Qa actions were blocked by the Golgi-trafficking inhibitor brefeldin A.
- the Exen21/Qa addition robustly boosted the regulated secretion of secretory proteins such as S protein, antibody, IFNγ and IL-2, but not via any signal peptide-like intracellular targeting mechanism, because it did not induce release of non-secretory proteins such as firefly-luciferase and spCas9. This property could potentially prove invaluable for industrial application of such secreted proteins.
- the Exen21/Qa addition could presumably enhance the production/secretion of S protein in mRNA-based vaccines against SARS-CoV-2 variants, and therefore reduce the amount of mRNA needed per vaccination due to the higher levels of S protein released 13 while still providing the same host immune responses.
- the Exen21/Qa addition at the C-termini of Pol and RRE increased LV packaging efficiency, but at the Gag C-terminus it impaired LV packaging.
- optimizing Exen21/Qa locations within LV packaging vector will be helpful in applications to maximize Exen21/Qa boosting efficiency.
- Because Exen21/Qa addition boosted both Sd18 expression and the packaging efficiency of S-LVLP, the Exen21/Qa addition in VSV-G protein may boost regular LV packaging efficiency.
- the Exen21/Qa addition at different locations of VSV-G 25, 26 may thus maximize its production-boosting efficiency.
- optimizing Exen21/Qa boosting activity on AAV, or other viral packaging system may prove valuable in biopharmaceutical applications.
- epitope tags including Flag, Myc, HA, Ollas, V5, His, C7, and T7 developed earlier enable specific research and biotechnological applications such as protein labeling, tracing, immunoaffinity purification, immunostaining, immunodetection enhancing 27-34 , protein degradation slowing, and solubility conferring 35-38 .
- Other tags modulate activity or function of targeted proteins 39 , such as N- or C-terminal tagging of PI3KCA, which increase its kinase and membrane binding activity, respectively 40 .
- The mechanisms by which Exen21/Qa exerts its actions on the enhancement of protein expression/secretion remain largely to be delineated.
- the initial findings indicated that the presence of Exen21/Qa slowed mRNA decay as the boosting effects persisted during global transcription inhibition by actinomycin D, providing evidence that Exen21/Qa plays a key role in posttranscriptional regulation, which may include increased mRNA stability and perhaps translation efficiency.
- This Exen21/Qa supports previous proof of concept that the coding sequence harbors numerous regulatory sites that may regulate mRNA location, stability and translation efficiency 41 .
- Exen21/Qa cis-regulatory motif has a special secondary RNA structure that can recruit RNA-binding proteins 41 , directly regulates mRNA stability of targeted proteins 42 , or binds directly to poly-A or untranslated region (UTR) to exert its stabilizing effects upon mRNA and boosting of translation.
- secretion inhibitors might be used to identify additional pathways involved in the Exen21/Qa-modulated protein secretion, particularly the non-conventionally secreted proteins (e.g., that of cytokines such as IL-1) 47, 48 .
- A novel, small (21-mer) and unique cis-regulatory motif, Exen21/Qa, was discovered that can greatly enhance the production of a variety of different types of proteins, ranging from viral transcripts/proteins, endogenous gene products, vaccines and antibodies to engineered recombinant proteins in mammalian cells.
- This Exen21/Qa has a universal protein production-boosting capacity that should facilitate versatile applications in biomedical research and biotechnological industry. Library screening related to this master Exen21/Qa is underway for optimizing the motif that would maximize the protein expression/secretion.
Abstract
A novel, small (21-mer oligonucleotide) and unique cis-regulatory coding motif can greatly enhance the production of a variety of different types of proteins, ranging from viral transcripts/proteins, endogenous gene products, vaccines and antibodies to engineered recombinant proteins in mammalian cells. The combination of novel peptide tag(s) having specified short amino acid sequences or derivatives thereof and the untranslated region (UTR) of viruses (snUTR) enhanced production of tagged proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies and engineered recombinant proteins in a cell in vitro, ex vivo and in vivo.
Description
OLIGONUCLEOTIDES AND VIRAL UNTRANSLATED REGION (UTR) FOR INCREASING EXPRESSION OF TARGET GENES AND PROTEINS
CROSS REFERENCE TO RELATED APPLICATIONS
This Application claims the benefit of U.S. Provisional Application 63/332,378 filed on April 19, 2022, U.S. Provisional Application 63/219,596 filed on July 8, 2021, U.S. Provisional Application 63/219,599 filed on July 8, 2021, and U.S. Provisional Application 63/219,587 filed on July 8, 2021. The entire contents of these applications are incorporated herein by reference in their entirety.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
This disclosure was made with government support under Grant Number 1R01AI145034 awarded by the National Institutes of Health. The government has certain rights in the disclosure.
FIELD
This disclosure relates to novel oligonucleotides, peptide tag(s) having specified short nucleotide sequences or derivatives thereof as well as the native untranslated region (UTR) of SARS-CoV-2 (snUTR). Methods utilizing these novel molecules include enhancing production of the targeted proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies and engineered recombinant proteins in a cell in vitro, ex vivo and in vivo.
BACKGROUND
Many technologies have been developed to enhance protein expression/production, such as promoter optimization, mRNA stabilization, codon optimization, coding regulation, and protein stabilization, as well as modification of host cellular expression machinery including humanized yeast systems. Although these optimization strategies have been extensively utilized in the biopharmaceutical industry and biomedical research, additional enhancing technology is still vital to help reduce the cost and speed up production. Computational analysis recently identified the secretion-enhancing cis-regulatory targeting element (SECReTE) that facilitates ER-localized mRNA translation and protein secretion (Cohen-Zontag, Baez et al. 2019). This SECReTE motif is enriched in nearly all mRNAs encoding secreted/membrane proteins in eukaryotes and its addition results in enhanced protein secretion (Cohen-Zontag, Baez et al. 2019). It also boosts protein expression and secretion when added to an mRNA for an exogenously expressed protein such as GFP (Cohen-Zontag, Baez et al. 2019). Various types of peptide (epitope) tags such as Flag, Myc, HA, Ollas, V5, His, C7, and T7 have demonstrated functions in protein labeling, affinity purification, and immune detection (DeCaprio and Kohl, 2019; Katayama et al., 2021; Lee et al., 2020; Mishra, 2020; Peighambardoust et al., 2021; Pina et al., 2021; Traenkle et al., 2020). However, no tagging peptides have been identified that enhance the expression/production of the targeted proteins in mammalian cells.
The 5’-UTR within the SARS-CoV-2 genome is critical to initiate the generation of the entire genomic and subgenomic transcripts (Baldassarre et al., 2020; Yang and Leibowitz, 2015). The 3’-UTR also regulates the viral genome expression and replication (Chan et al., 2020; Zhao et al., 2020). Both the 5’-UTR and 3’-UTR are highly conserved among SARS-CoV genomes and their variants (Baldassarre et al., 2020; Bottaro et al., 2021; Rangan et al., 2020; Rouchka et al., 2020; Ryder et al., 2021; Yang and Leibowitz, 2015). Recent computational studies have identified a very stable four-way junction of the 5’-UTR close to the AUG start codon (Miao et al., 2020).
SUMMARY
Embodiments are directed to novel chimeric molecules comprising an oligonucleotide comprising a cis-regulatory coding motif, a peptide tag, a 5’-untranslated region (5’-UTR), a 3’-untranslated region (3’-UTR) and combinations thereof for use in the enhanced production and expression of a desired biomolecule. The synergistic boosting effect observed has extensive applications and broad research interest. For industrial applications, the strategy will reduce the cost of many widely used products and facilitate their availability, such as vaccines, antibodies, recombinant proteins, and therapeutic gene products. An immediate and highly important usage of this system would be to boost mRNA vaccines against COVID-19 variants. For biomedical research, novel chimeric molecules will stimulate interest in exploring novel oligonucleotides and peptides that regulate protein expression and secretion as well as screening additional viral native UTRs for protein production boost.
In certain aspects, a composition comprises an expression-enhancing oligonucleotide having between 15 and 30 nucleic acid bases and includes a cis-regulatory coding motif that is located in the coding regions and retains the open reading frame (ORF) with targeted genes. In certain embodiments, the expression-enhancing oligonucleotide comprises twenty-one nucleic acid bases. In certain embodiments, the expression-enhancing oligonucleotide comprises a nucleic acid sequence having at least a 75% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the expression-enhancing oligonucleotide comprises a nucleic acid sequence having at least a 95% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the expression-enhancing oligonucleotide comprises a nucleic acid sequence comprising cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7).
In another aspect, a synthetic oligonucleotide comprises a nucleic acid sequence having at least a 75% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the synthetic oligonucleotide comprises a nucleic acid sequence having at least a 95% sequence identity to cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the oligonucleotide comprises a nucleic acid sequence comprising cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7). In certain embodiments, the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the oligonucleotide encodes a peptide comprising the amino acid sequence QPRFAAA (SEQ ID NO: 1).
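By way of illustration only, the relationship between SEQ ID NO: 7 and SEQ ID NO: 1 recited above can be checked with a short script. The following is a minimal sketch and not part of the disclosed methods: it translates the 21-base oligonucleotide with the standard genetic code and computes a simple ungapped percent identity. The function names (translate, percent_identity, min_matches), the codon table restricted to the codons present in SEQ ID NO: 7, and the ungapped, equal-length identity definition are all assumptions made for this example; the disclosure does not specify an alignment method.

```python
# Illustrative sketch only; names and the identity definition are assumptions.
SEQ_ID_NO_7 = "cAACCGCGGTTCGCGGCCGCT"  # 21-base expression-enhancing oligonucleotide
SEQ_ID_NO_1 = "QPRFAAA"                # 7-residue peptide tag

# Subset of the standard genetic code covering the codons in SEQ ID NO: 7.
CODON_TABLE = {
    "CAA": "Q", "CCG": "P", "CGG": "R", "TTC": "F",
    "GCG": "A", "GCC": "A", "GCT": "A",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence using CODON_TABLE."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("this simple measure requires equal lengths")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def min_matches(length: int, percent: int) -> int:
    """Smallest number of identical positions meeting an integer percent threshold."""
    return (length * percent + 99) // 100  # integer ceiling of length * percent / 100


peptide = translate(SEQ_ID_NO_7)
```

Under this simplified definition, translate(SEQ_ID_NO_7) returns QPRFAAA, a 75% threshold on a 21-base sequence corresponds to at least 16 identical positions, and a 95% threshold corresponds to at least 20 identical positions. A gapped alignment or a comparison of sequences of unequal length could give different values.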
In another aspect, a construct comprises the synthetic oligonucleotide embodied herein.
In another aspect, a chimeric molecule comprises one or more peptide domains and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof. In certain embodiments, the one or more peptide domains comprise from about five amino acids to about twenty amino acids. In certain embodiments, the one or more peptide domains comprise about seven amino acids. In certain embodiments, the one or more peptide domains comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the peptide comprises an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the peptide comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1). In certain embodiments, the peptide comprises Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof. In certain embodiments, the one or more 5’-untranslated region (UTR) sequences or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR. In certain embodiments, the chimeric molecule further comprises one or more biomolecules operably linked to the one or more peptide domains and/or the one or more 5’-UTR and/or 3’-UTR sequences. In certain embodiments, the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics. In certain embodiments, the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
In another aspect, a host cell comprises an oligonucleotide embodied herein, or a chimeric molecule embodied herein.
In another aspect, a construct encodes an oligonucleotide embodied herein, or a chimeric molecule embodied herein.
In another aspect, a method of enhancing production of biomolecules comprises tagging a desired peptide or a nucleic acid sequence with the chimeric molecule of any one of claims 1-34, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein. In certain embodiments, the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
In another aspect, a nucleic acid comprises a promoter, a 5’-untranslated region (5’-UTR) sequence, a biomolecule of interest, an oligonucleotide comprising a cis-regulatory coding motif, a 3’-untranslated region (3’-UTR) sequence and combinations thereof. In certain embodiments, the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are derived from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
In another aspect, a chimeric molecule comprises one or more oligonucleotides comprising a nucleic acid sequence of cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7) and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof. In certain embodiments, the one or more oligonucleotides encode a peptide comprising from about five amino acids to about twenty amino acids. In certain embodiments, the one or more peptides comprise about seven amino acids. In certain embodiments, the one or more peptides comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptides comprise an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptides comprise the amino acid sequence QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptides comprise a sequence comprising Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof. In certain embodiments, the one or more 5’-untranslated region (UTR) sequences or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR. In certain embodiments, the chimeric molecule further comprises one or more biomolecules operably linked to the one or more oligonucleotides and/or the one or more 5’-UTR and/or 3’-UTR sequences. In certain embodiments, the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics. In certain embodiments, the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
In another aspect, an expression vector comprises the nucleic acids embodied herein.
In another aspect, a novel peptide tag comprises a specified short amino acid sequence or its derivative. In certain embodiments, the peptide tag is about 5 to about 10 amino acids in length. In certain embodiments, the peptide tag is about 7 amino acids in length. In certain embodiments, the peptide tag comprises two or more tandem repeats of peptides.
In certain aspects, a synthetic peptide tag comprises an amino acid sequence unit of about five to about fifteen amino acids wherein the N-terminal and/or C-terminal amino acids are linked or fused to a target molecule. In certain embodiments, the amino acid sequence unit comprises seven amino acids. In certain embodiments, the amino acid sequence comprises at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the amino acid sequence comprises at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the amino acid sequence comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1). In certain embodiments, the amino acid sequence comprises Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof. In certain embodiments, the synthetic peptide tag further comprises a plurality of repeating amino acid sequence units. In certain embodiments, the repeating amino acid sequence units are in tandem. In certain embodiments, the amino acid sequence units are separated by linker molecules or one or more amino acids.
In another aspect, a synthetic peptide comprises the structure: (AA-AA-AA-AA-AA-AAz-AAz)x, wherein x is greater than or equal to 1, z is 0 or 1 and each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
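As a non-limiting illustration of the repeat structure above, the following minimal sketch builds a peptide of x tandem copies of a unit of five to seven residues. It assumes, for simplicity, that every repeat is an identical copy of the same unit and that residues are written in one-letter code (with U for selenocysteine and O for pyrrolysine); modified amino acids are outside the scope of the sketch. The function name build_repeat and the constant ALLOWED_RESIDUES are hypothetical names introduced for this example.

```python
# Illustrative sketch only; identical repeats and one-letter codes are assumptions.
ALLOWED_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY") | {"U", "O"}  # U = Sec, O = Pyl


def build_repeat(unit: str, x: int) -> str:
    """Return x tandem copies of a 5-7 residue unit, i.e. (AA...AA)x."""
    unit = unit.upper()
    if x < 1:
        raise ValueError("x must be greater than or equal to 1")
    if not 5 <= len(unit) <= 7:
        raise ValueError("unit must have five to seven residues (z is 0 or 1)")
    if not set(unit) <= ALLOWED_RESIDUES:
        raise ValueError("unit contains a residue outside the one-letter alphabet")
    return unit * x


tandem_tag = build_repeat("QPRFAAA", 2)
```

Here build_repeat("QPRFAAA", 2) gives QPRFAAAQPRFAAA, a two-unit tandem repeat of the seven-residue sequence of SEQ ID NO: 1.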
In another aspect, a synthetic peptide comprises the structure: AA1-AA2-AA3-AA4-AA5-AA6-AA7, wherein each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
In another aspect, a synthetic peptide comprises an amino acid sequence comprising the structure: Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
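For illustration, a peptide in one-letter code can be tested against the Xn-QPRFAAA-Xn structure by locating the core sequence and reading off the flanking residues. The sketch below is an assumption-laden example rather than a disclosed method: the function name split_flanks is hypothetical, the first occurrence of the core is used when the core appears more than once, and modified amino acids are not handled.

```python
# Illustrative sketch only; the helper name and first-occurrence rule are assumptions.
from typing import Optional, Tuple

CORE = "QPRFAAA"  # SEQ ID NO: 1


def split_flanks(peptide: str) -> Optional[Tuple[str, str]]:
    """Return (N-terminal flank, C-terminal flank) around the first CORE, or None."""
    before, core, after = peptide.upper().partition(CORE)
    if not core:          # CORE not present, so the peptide does not fit Xn-CORE-Xn
        return None
    return before, after  # either flank may be empty (n = 0)


flanks = split_flanks("MGQPRFAAAKL")
```

With the hypothetical input MGQPRFAAAKL, the flanks are MG and KL; the bare core QPRFAAA gives two empty flanks, corresponding to n = 0 on both sides.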
In another aspect, a fusion protein comprises a synthetic peptide embodied herein fused to one or more target peptides. In certain embodiments, two or more synthetic peptides embodied herein are fused to a target peptide.
In another aspect, a fusion molecule comprises a synthetic peptide embodied herein fused to one or more biomolecules. In certain embodiments, the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
In another aspect, a method of enhancing production of proteins comprises tagging a desired peptide or a nucleic acid sequence with the peptide tag embodied herein, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein. In certain embodiments, the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
In certain aspects, a composition comprises a peptide-tagged biomolecule embodied herein and a pharmaceutically acceptable excipient, diluent or carrier.
In another aspect, a nucleic acid encodes the peptide tags embodied herein.
In another aspect, an expression vector comprises a nucleic acid encoding the peptide tags embodied herein.
In another aspect, a host cell comprises the expression vector encoding the peptide tags embodied herein.
In certain aspects, a method of utilizing the peptide tag(s) comprises enhancing production of the tagged proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies, and engineered recombinant proteins, in a cell in vitro, ex vivo and in vivo. In certain embodiments, tandem peptide repeats further boost production of a targeted molecule. In certain embodiments, a method of increasing protein production in a cell comprises tagging a target molecule in the cell.
In another aspect, a chimeric molecule comprises one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof associated with one or more biomolecules. In certain embodiments, the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR. In certain embodiments, the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics. In certain embodiments, the chimeric molecule further comprises one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
In another aspect, a host cell comprises the chimeric molecule embodied herein.
In another aspect, a construct encodes the chimeric molecules embodied herein.
In another aspect, a method of enhancing production of biomolecules, comprises tagging a desired peptide or a nucleic acid sequence with the chimeric molecules embodied herein, by fusion or cloning, expressing the peptide or nucleic acid sequence, and harvesting the protein. In certain embodiments, the proteins comprise: oligonucleotides, polynucleotides, mRNA vaccines, DNA vaccines, viral transcripts/proteins, antibodies, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
In another aspect, a nucleic acid comprises a promoter, a 5’-untranslated region (5’-UTR) sequence, a biomolecule of interest, a peptide domain, a 3’-untranslated region (3’-UTR) sequence and combinations thereof. In certain embodiments, the one or more 5’-untranslated region (UTR) sequences or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are derived from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
In another aspect, an expression vector comprises the nucleic acids embodied herein.
In another aspect, a host cell comprises the nucleic acids or expression vectors embodied herein.
Definitions
Unless specifically defined otherwise, all technical and scientific terms used herein shall be taken to have the same meaning as commonly understood by one of ordinary skill in the art (e.g., in cell culture, molecular genetics, and biochemistry).
The term “about” or “approximately” means within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, “about” can mean within 1 or more than 1 standard deviation, per the practice in the art. Alternatively, “about” can mean a range of up to 20%, up to 10%, up to 5%, or up to 1% of a given value or range. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude, within 5-fold, and also within 2-fold, of a value. Where particular values are described in the application and claims, unless otherwise stated, the term “about” meaning within an acceptable error range for the particular value should be assumed. It is understood that where a parameter range is provided, all integers within that range, and tenths thereof, are also provided by the disclosure. For example, “0.2-5 mg” is a disclosure of 0.2 mg, 0.3 mg, 0.4 mg, 0.5 mg, 0.6 mg etc. up to and including 5.0 mg.
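By way of example only, the percentage reading of “about” and the enumeration of tenths within a range can be written out as follows. This is a minimal sketch under stated assumptions: the helper names within_about and tenths_in_range are hypothetical, the tolerance is interpreted as a symmetric percentage of the nominal value, and range endpoints are passed in tenths to avoid floating-point drift.

```python
# Illustrative sketch only; helper names and the symmetric tolerance are assumptions.
from typing import List


def within_about(value: float, nominal: float, percent: float) -> bool:
    """True if value lies within +/- percent of nominal."""
    return abs(value - nominal) <= abs(nominal) * percent / 100


def tenths_in_range(low_tenths: int, high_tenths: int) -> List[float]:
    """Every tenth from low to high inclusive; endpoints are given in tenths."""
    return [t / 10 for t in range(low_tenths, high_tenths + 1)]


doses_mg = tenths_in_range(2, 50)  # the "0.2-5 mg" example: 0.2, 0.3, ... 5.0
```

For the “0.2-5 mg” example, the list begins at 0.2 mg, ends at 5.0 mg, and contains 49 values.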
In the descriptions in the disclosure and in the claims, phrases such as “at least one of” or “one or more of” may occur followed by a conjunctive list of elements or features. The term “and/or” may also occur in a list of two or more elements or features. Unless otherwise implicitly or explicitly contradicted by the context in which it is used, such a phrase is intended to mean any of the listed elements or features individually or any of the recited elements or features in combination with any of the other recited elements or features. For example, the phrases “at least one of A and B;” “one or more of A and B;” and “A and/or B” are each intended to mean “A alone, B alone, or A and B together.” A similar interpretation is also intended for lists including three or more items. For example, the phrases “at least one of A, B, and C;” “one or more of A, B, and C;” and “A, B, and/or C” are each intended to mean “A alone, B alone, C alone, A and B together, A and C together, B and C together, or A and B and C together.” In addition, use of the term “based on,” is intended to mean, “based at least in part on,” such that an unrecited feature or element is also permissible.
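The interpretation of “at least one of” given above amounts to listing every non-empty combination of the recited elements. A minimal sketch, offered for illustration only and using a hypothetical helper name at_least_one_of, is:

```python
# Illustrative sketch only; the helper name is an assumption.
from itertools import combinations
from typing import List, Sequence, Tuple


def at_least_one_of(items: Sequence[str]) -> List[Tuple[str, ...]]:
    """All non-empty combinations of items, smallest combinations first."""
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


abc = at_least_one_of(["A", "B", "C"])
```

For three elements this yields seven combinations, in the same order as the list recited above: A alone, B alone, C alone, A and B, A and C, B and C, and A and B and C.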
The term “amino acid,” as used herein, encompasses both naturally occurring amino acids and non-naturally occurring amino acids. Examples of non-naturally occurring amino acids include, but are not limited to, D-amino acids (i.e., an amino acid of an opposite chirality to the naturally occurring form), N-α-methyl amino acids, C-α-methyl amino acids, β-methyl amino acids and D- or L-β-amino acids. Other non-naturally occurring amino acids include, for example, β-alanine (β-Ala), norleucine (Nle), norvaline (Nva), homoarginine (Har), 4-aminobutyric acid (γ-Abu), 2-aminoisobutyric acid (Aib), 6-aminohexanoic acid (ε-Ahx), ornithine (orn), sarcosine, α-amino isobutyric acid, 3-aminopropionic acid, 2,3-diaminopropionic acid (2,3-diaP), D- or L-phenylglycine, D-(trifluoromethyl)-phenylalanine, and D-p-fluorophenylalanine.
As used herein, the term “biomolecule” refers to any of the numerous substances that are produced by cells and living organisms. Biomolecules have a wide range of sizes and structures and perform a vast array of functions. The four major types of biomolecules are carbohydrates, lipids, nucleic acids, and proteins or characteristic associated with the peptide and/or protein of interest. The biomolecules may be used in a variety of applications including, but not limited to, curative agents for diseases (e.g., insulin, interferon, interleukins, anti-angiogenic peptides, tumor necrosis factor); molecules that bind to defined cellular targets such as receptors, channels, lipids, cytosolic proteins, and membrane proteins, to name a few; and biomolecules having antimicrobial activity, antiviral activity, anti-cancer activity, anti-inflammatory activity, and the like.
As used herein, “cleavable linker elements”, “peptide linkers”, and “cleavable peptide linkers” will be used interchangeably and refer to cleavable peptide segments found, in certain embodiments, between peptide tags and the biomolecule, e.g., peptide, of interest. After the peptide tags are separated and/or partially purified or purified from the cell lysate, the cleavable linker elements can be cleaved chemically and/or enzymatically to separate the peptide tag from the biomolecule, e.g., peptide, of interest. The fusion peptide may also include a plurality of regions encoding one or more peptides of interest separated by one or more cleavable peptide linkers. The peptide of interest can then be isolated from the peptide tag, if necessary. In one embodiment, the peptide tag(s) and the peptide of interest exhibit different solubilities in a defined medium (typically an aqueous medium), facilitating separation of the peptide tag from the biomolecule, e.g., polypeptide of interest. In an embodiment, the peptide tag is insoluble in an aqueous solution while the protein/polypeptide of interest is appreciably soluble in an aqueous solution. The pH, temperature, and/or ionic strength of the aqueous solution can be adjusted to facilitate recovery of the peptide of interest. In an embodiment, the differential solubility between the inclusion body tag and the peptide of interest occurs in an aqueous solution having a pH of 4 to 11 and a temperature range of 15 to 50 °C. The cleavable peptide linker may be from 1 to about 50 amino acids, or from 1 to about 20 amino acids, in length. The cleavable peptide linkers may be incorporated into the fusion proteins using any number of techniques well known in the art. Means to prepare the present peptides (peptide tags, cleavable peptide linkers, peptides of interest, and fusion peptides) are well known in the art and in preferred embodiments the entire peptide reagent may be prepared using recombinant DNA and molecular cloning techniques.
The term “checkpoint proteins” means a group of molecules on the cell surface of CD4+ and/or CD8+ T cells that fine-tune immune responses by down-modulating or inhibiting an anti-tumor immune response.
As used herein, the terms “comprising,” “comprise” or “comprised,” and variations thereof, in reference to defined or described elements of an item, composition, apparatus, method, process, system, etc. are meant to be inclusive or open ended, permitting additional elements, thereby indicating that the defined or described item, composition, apparatus, method, process, system, etc. includes those specified elements (or, as appropriate, equivalents thereof) and that other elements can be included and still fall within the scope/definition of the defined item, composition, apparatus, method, process, system, etc.
As used herein, the terms “conjugated,” “linked,” “attached,” “fused” and “tethered,” when used with respect to two or more moieties, mean that the moieties or domains are physically associated or connected with one another, either directly or via one or more additional moieties that serve as a linking agent, to form a structure that is sufficiently stable so that the moieties remain physically associated under the conditions in which the structure is used, e.g., physiological conditions. The linkage can be based on genetic fusion according to the methods known in the art or can be performed by, e.g., chemical cross-linking. The compounds and targeting agents may be linked by a flexible linker, such as a polypeptide linker. The polypeptide linker can comprise plural, hydrophilic or peptide-bonded amino acids of varying lengths. The term “associated” will be used for the sake of brevity and is meant to include all possible methods of physically and chemically associating each domain.
As used herein, the terms “enhancement,” “enhance,” “enhanced,” “enhances,” and “enhancing” are used interchangeably and refer to an increase in the specified parameter (e.g., at least about a 1.1-fold, 1.25-fold, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 8-fold, 10-fold, 12-fold, or even 15-fold or more increase) and/or an increase in the specified activity of at least about 5%, 10%, 25%, 35%, 40%, 50%, 60%, 75%, 80%, 90%, 95%, 97%, 98%, 99% or 100% over baseline values.
As used herein, the terms “fusion protein”, “fusion peptide”, “chimeric protein”, and “chimeric peptide” will be used interchangeably and will refer to a polymer of amino acids (peptide, oligopeptide, polypeptide, or protein) comprising at least two portions, each portion comprising a distinct function. At least one first portion of the fusion peptide comprises at least one of the present peptide tags. At least one second portion of the fusion peptide comprises at least one peptide of interest. In certain embodiments, the fusion protein additionally includes at least one cleavable peptide linker that facilitates cleavage (chemical and/or enzymatic) and separation of the peptide tag(s) and the peptide(s) of interest.
“Nucleic acid” refers to nucleotides (e.g., deoxyribonucleotides, ribonucleotides, and 2'-modified nucleotides) and polymers thereof in either single-, double- or multiple-stranded form, or complements thereof. The terms “polynucleotide,” “oligonucleotide,” “oligo” or the like refer, in the usual and customary sense, to a linear sequence of nucleotides. The term “nucleotide” refers, in the usual and customary sense, to a single unit of a polynucleotide, i.e., a monomer. Nucleotides can be ribonucleotides, deoxyribonucleotides, or modified versions thereof. Examples of polynucleotides contemplated herein include single and double stranded DNA, single and double stranded RNA, and hybrid molecules having mixtures of single and double stranded DNA and RNA. Examples of nucleic acids, e.g., polynucleotides, contemplated herein include any types of RNA, e.g., mRNA, siRNA, miRNA, and guide RNA, and any types of DNA, e.g., genomic DNA, plasmid DNA, and minicircle DNA, and any fragments thereof. The term “duplex” in the context of polynucleotides refers, in the usual and customary sense, to double strandedness.
Nucleic acids, including, e.g., nucleic acids with a phosphorothioate backbone, can include one or more reactive moieties. As used herein, the term reactive moiety includes any group capable of reacting with another molecule, e.g., a nucleic acid or polypeptide, through covalent, non-covalent or other interactions. By way of example, the nucleic acid can include an amino acid reactive moiety that reacts with an amino acid on a protein or polypeptide through a covalent, non-covalent, or other interaction.
The terms also encompass nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, without limitation, phosphodiester derivatives including, e.g., phosphoramidate, phosphorodiamidate, phosphorothioate (also known as phosphothioate, having double-bonded sulfur replacing oxygen in the phosphate), phosphorodithioate, phosphonocarboxylic acids, phosphonocarboxylates, phosphonoacetic acid, phosphonoformic acid, methyl phosphonate, boron phosphonate, or O-methylphosphoroamidite linkages (see Eckstein, OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL APPROACH, Oxford University Press), as well as modifications to the nucleotide bases such as in 5-methyl cytidine or pseudouridine; and peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with positive backbones, non-ionic backbones, modified sugars, and non-ribose backbones (e.g., phosphorodiamidate morpholino oligos or locked nucleic acids (LNA) as known in the art), including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, CARBOHYDRATE MODIFICATIONS IN ANTISENSE RESEARCH, Sanghvi & Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid analogs may be made.
In embodiments, the internucleotide linkages in DNA are phosphodiester, phosphodiester derivatives, or a combination of both.
As used herein, the term “operably linked” refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., that the coding sequence is under the transcriptional control of the promoter). In a further embodiment, the definition of “operably linked” may also be extended to describe the products of chimeric genes, such as fusion proteins.
As such, “operably linked” will also refer to the linking of a peptide tag to a biomolecule, e.g., a peptide of interest to be produced and recovered. The peptide tag is “operably linked” to the peptide of interest if upon expression the fusion protein is insoluble and accumulates in inclusion bodies in the expressing host cell. In a preferred embodiment, the fusion peptide will include at least one cleavable peptide linker useful in separating the peptide tag from the peptide of interest. The cleavable peptide linkers may be incorporated into the fusion proteins using any number of techniques well known in the art.
As used herein, the terms “polypeptide” and “peptide” will be used interchangeably to refer to a polymer of two or more amino acids joined together by a peptide bond, wherein the peptide is of unspecified length; thus, peptides, oligopeptides, polypeptides, and proteins are included within the present definition. In one aspect, this term also includes post-expression modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like. Included within the definition are, for example, peptides containing one or more analogues of an amino acid or labeled amino acids and peptidomimetics.
As used herein, the terms “protein of interest”, “polypeptide of interest”, “peptide of interest”, “targeted protein”, “targeted polypeptide”, “targeted peptide”, “expressible protein”, and “expressible polypeptide” will be used interchangeably and refer to a protein, polypeptide, or peptide which may be expressed by the genetic machinery of a host cell.
As used herein, the terms “plasmid”, “vector” and “cassette” refer to an extrachromosomal element often carrying genes which are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA molecules. Such elements may be autonomously replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing a promoter fragment and DNA sequence for a selected gene product along with appropriate 3' untranslated sequence into a cell. “Transformation cassette” refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that facilitates transformation of a particular host cell. “Expression cassette” refers to a specific vector containing a foreign gene and having elements in addition to the foreign gene that allow for enhanced expression of that gene in a foreign host.
As used herein, the term “promoter/regulatory sequence” means a nucleic acid sequence which is required for expression of a gene product operably linked to the promoter/regulatory sequence. In some instances, this sequence may be the core promoter sequence and in other instances, this sequence may also include an enhancer sequence and other regulatory elements which are required for expression of the gene product. The promoter/regulatory sequence may, for example, be one which expresses the gene product in a tissue specific manner.
As used herein, the term “promoter” is defined as a DNA sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, required to initiate the specific transcription of a polynucleotide sequence. A “constitutive” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or specifies a gene product, causes the gene product to be produced in a cell under most or all physiological conditions of the cell. An “inducible” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or specifies a gene product, causes the gene product to be produced in a cell substantially only when an inducer which corresponds to the promoter is present in the cell. A “tissue-specific” promoter is a nucleotide sequence which, when operably linked with a polynucleotide which encodes or specifies a gene product, causes the gene product to be produced in a cell substantially only if the cell is a cell of the tissue type corresponding to the promoter.
As used herein, the terms “target molecule”, “biomolecule” or “target biomolecule” includes any macromolecule, including protein, peptide, polypeptide, gene, polynucleotide, oligonucleotide, carbohydrate, enzyme, polysaccharide, glycoprotein, receptor, antigen, tumor antigen, markers, molecules associated with a disease, an antibody, growth factor; or it may be any small organic molecule including a hormone, substrate, metabolite, cofactor, inhibitor, drug, dye, nutrient, pesticide, peptide; or it may be an inorganic molecule including a metal, metal ion, metal oxide, and metal complex; it may also be an entire organism including a bacterium, virus, and single-cell eukaryote such as a protozoon.
As used herein, the term “translatable” may be used interchangeably with the term “expressible.” These terms can refer to the ability of a polynucleotide, or a portion thereof, to provide a polypeptide, by transcription and/or translation events in a process using biological molecules, or in a cell, or in a natural biological setting. In some settings, translation is a process that can occur when a ribosome creates a polypeptide in a cell. In translation, a messenger RNA (mRNA) can be decoded by a ribosome to produce a specific amino acid chain, or polypeptide.
A translatable polynucleotide can provide a coding sequence region (usually, CDS), or portion thereof, that can be processed to provide a polypeptide, protein, or fragment thereof.
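By way of non-limiting illustration only, the decoding of a coding sequence region into a polypeptide can be sketched in Python as follows. The sketch assumes the standard genetic code, treats the first AUG as the initiation codon, and uses a short hypothetical transcript that does not correspond to any sequence of the present disclosure; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, built in U/C/A/G order ("*" marks a stop codon).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna):
    """Translate from the first AUG to the first in-frame stop codon.

    Simplifying assumption: the first AUG is used as the start codon.
    Returns an empty string when no AUG is present.
    """
    mrna = mrna.upper().replace("T", "U")
    start = mrna.find("AUG")
    if start == -1:
        return ""
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        residue = CODON_TABLE[mrna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


# Hypothetical transcript: leader + AUG GCU UGG UAA + trailer.
example_mrna = "GGGAAAAUGGCUUGGUAACCC"
example_peptide = translate(example_mrna)
```

In this sketch the leader and trailer nucleotides are not decoded, which mirrors the distinction between the coding sequence region and the untranslated regions.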
As used herein, the term “3'-untranslated region” (3'-UTR) relates to the section of messenger RNA (mRNA) that immediately follows the translation termination codon. The 3'-UTR may comprise regulatory regions within the 3'-untranslated region which are known to influence polyadenylation and stability of the mRNA. Many 3'-UTRs also contain AU-rich elements (AREs). Furthermore, the 3'-UTR may preferably contain the sequence that directs addition of several hundred adenine residues called the poly(A) tail to the end of the mRNA transcript.
As used herein, the term “5'-untranslated region” (5’-UTR) refers to a polynucleotide sequence that, when linked to a transcript, is capable of recruiting ribosome complexes and initiating translation of the transcript. Typically, a 5’-UTR is positioned directly upstream of the initiation codon of a transcript; specifically, between the cap site and the initiation codon. The 5'-UTR begins at the transcription start site and ends one nucleotide (nt) before the start codon (usually AUG in the mRNA) of the coding region. In eukaryotes, the 5'-UTR is generally from 100 to several thousand nucleotides long, although shorter UTRs also occur.
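For illustration only, the positional definitions of the 5'-UTR, coding region and 3'-UTR given above can be expressed as a short Python sketch. It assumes that position 0 of the string is the transcription start site, that the first AUG is the start codon, and that the coding region ends at the first in-frame stop codon; the function name `split_transcript` and the example sequence are hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def split_transcript(mrna):
    """Return (five_prime_utr, cds, three_prime_utr) for an mRNA string.

    The 5'-UTR ends one nucleotide before the start codon; the 3'-UTR
    begins immediately after the translation termination codon.
    """
    start = mrna.find("AUG")
    if start == -1:
        raise ValueError("no start codon found")
    for i in range(start, len(mrna) - 2, 3):
        if mrna[i:i + 3] in STOP_CODONS:
            end = i + 3  # the stop codon is kept as part of the CDS
            return mrna[:start], mrna[start:end], mrna[end:]
    raise ValueError("no in-frame stop codon found")


# Hypothetical transcript, not a sequence from the disclosure.
utr5, cds, utr3 = split_transcript("GGGAAAAUGGCUUGGUAACCC")
```

A real transcript would additionally carry a cap and a poly(A) tail, which this string-based sketch does not model.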
Throughout this disclosure, various aspects of the disclosure can be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the disclosure. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 2.7, 3, 4, 5, 5.3, and 6. This applies regardless of the breadth of the range.
Any compositions or methods provided herein can be combined with one or more of any of the other compositions and methods provided herein.
BRIEF DESCRIPTION OF THE DRAWINGS
FIGS. 1A-1F are a series of graphs, a schematic representation and fluorescent microscopic images demonstrating that Qa tagging in SARS-CoV-2 viral proteins robustly boosts the production of dual reporter-fused viral proteins in HEK293T cells.
FIG. 1A: Diagram for 2A-mediated dual reporter gdLuc/dsGFP fused with viral protein and potential multiple measures of viral protein expression/production.
FIGS. 1B-1D: Representative experiments for Qa boosting of SARS-CoV-2 envelope (E) protein dynamic production (FIG. 1B) and average fold induction of 10 experiments determined by gdLuc assay of cultured media at 24-48 h after transfection with indicated pcDNA6B vector (100 ng/well) in quadruplicates for each experiment (FIG. 1C), as well as representative images of Qa-boosted dsGFP expression detected by fluorescent microscopy (FIG. 1D).
FIGS. 1E-1F: Representative gdLuc assay showing various degrees of Qa boosting in other SARS-CoV-2 structural proteins spike (S) and nucleocapsid (N), as well as accessory proteins NSP2, NSP16 and ORF3. Cells were transfected in quadruplicates with indicated pcDNA6B vectors at 100 ng/well. Data represent mean ± SE of gdLuc activity in the supernatant at 48 h after transfection. The fold number indicates relative changes in Qa groups compared with corresponding control group.
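As a non-limiting sketch of how a “mean ± SE” value and a fold number of the kind referred to in the figure descriptions might be computed from replicate wells, the following Python example may be considered. The readings are hypothetical placeholder values rather than data from the disclosure, and the use of the sample standard deviation divided by the square root of the number of wells as the SE is an assumption.

```python
import math
import statistics


def mean_se(values):
    """Return (mean, standard error of the mean) for replicate readings."""
    mean = statistics.fmean(values)
    se = statistics.stdev(values) / math.sqrt(len(values))
    return mean, se


def fold_change(tagged, control):
    """Ratio of the tagged-group mean to the control-group mean."""
    return statistics.fmean(tagged) / statistics.fmean(control)


# Hypothetical quadruplicate luciferase readings (relative light units).
control_rlu = [100.0, 110.0, 90.0, 100.0]
tagged_rlu = [1000.0, 1100.0, 900.0, 1000.0]

control_mean, control_se = mean_se(control_rlu)
tagged_mean, tagged_se = mean_se(tagged_rlu)
fold = fold_change(tagged_rlu, control_rlu)
```

With these placeholder readings the tagged group is ten-fold above its control; the actual fold values are those shown in the figures.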
FIGS. 2A-2G are a series of graphs and fluorescent microscopic images demonstrating that Qa boosting is versatile in dosages, non-viral proteins, cell types and tagging location.
FIG. 2A: Dose-dependent Qa boosting of SARS-CoV-2 S, M, N and ORF3 to various degrees. Cells were transfected in quadruplicates with indicated pcDNA6B vectors at indicated amounts of vectors. Data represent mean ± SE of gdLuc activity in the supernatant at 48 h after transfection. The fold number indicates relative changes in Qa groups compared with corresponding control group.
FIGS. 2B-2D: Dose-independent Qa boosting in host cellular genes NIBP and hACE2 determined by gdLuc assay (FIGS. 2B, 2C) and representative fluorescent microscopic images (FIG. 2D) at 48 h after transfection with pcDNA6B regular vectors.
FIGS. 2E, 2F: Dose-independent Qa boosting in secretory IFNγ and IL-2 by gdLuc assay at 48 h after transfection with pRRL LV vectors.
FIG. 2G: Qa boosting on viral proteins E and S as well as non-viral protein hACE2 exhibits similar efficiency in different cell types.
FIGS. 3A-3I are a series of graphs, fluorescent microscopic images, a schematic representation and a blot demonstrating that Qa boosting is accelerated by a stronger promoter and the SARS-CoV-2 native untranslated region (UTR).
FIGS. 3A-3C: Stronger promoter CAG further increases Qa boosting efficiency in viral proteins E, S and NSP16.
FIGS. 3D, 3E: The 5’-UTR inclusion robustly increases promoter-dependent expression of E protein as determined by Western blot and immunocytochemistry with anti-Flag antibody, and addition of 3’-UTR further increases E protein expression. The different size of E-Flag-Q results from the addition of 37 amino acids within the open reading frame after stop removal during the cloning.
FIGS. 3F, 3G: The 5’-UTR inclusion dramatically increases the CAG-driven expression of Qa tagged S-fused dual reporter as determined by representative fluorescent microscopic images and gdLuc assay.
FIG. 3H: The 5’-UTR inclusion further accelerates Qa boosting efficiency of CMV-driven S dual reporter protein production as compared with LG group at 48 h after transfection with regular vector.
FIG. 3I: The 5’-UTR inclusion accelerates Qa boosting of dual reporter without viral proteins as determined by gdLuc assay at 48 h after transfection with pRRL LV vectors.
FIGS. 4A-4H are a series of graphs, fluorescent microscopic images, a schematic representation and a blot demonstrating that Qa tagging and 5’-UTR inclusion boost the packaging and transduction efficiency of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP).
FIG. 4A: Diagram for different vectors expressing human codon-optimized Sd18 and the process of S-LVLP packaging.
FIG. 4B: Qa and 5’-UTR increase Sd18 protein expression in the transfected cells as determined by Western blot with serum from a SARS-CoV-2 patient.
FIGS. 4C-4F: Qa tagging increases S-LVLP packaging titer for the standard pRRL-GFP LV vector determined by GFP positivity, which is further increased by 5’-UTR inclusion, polybrene treatment and purification.
FIGS. 4G-4H: Qa tagging and 5’-UTR inclusion increase S-LVLP packaging titer for dual reporter LV vectors pRRL-LG or pRRL-E-LG determined by GFP positivity and gdLuc activity.
FIGS. 5A-5I are a series of graphs demonstrating that Qa tagging and 5’-UTR inclusion boost mRNA-dependent production of SARS-CoV-2 viral proteins S, N, E and ORF3 as well as non-viral hACE2 via increasing mRNA stability and translational efficiency.
FIGS. 5A-5C: Qa tagging robustly boosts mRNA-dependent production of dual reporter in a time- and dose-dependent manner to varying extents with different targeted proteins.
FIGS. 5D, 5E: The 5’-UTR inclusion further accelerates Qa boosting on mRNA-derived production of S, E and hACE2 proteins.
FIGS. 5F-5I: Qa tagging increases posttranscriptional mRNA stability and translational efficiency in the presence of transcriptional inhibitor actinomycin D.
FIGS. 6A-6M are a series of diagrams, blots, graphs, fluorescent microscopic images and a photograph demonstrating that Qa tagging boosts the production yield of anti-SARS monoclonal antibody and lentiviruses.
FIG. 6A: Diagram showing the Qa tagging on the C-terminus of constant regions for heavy and light chains of anti-SARS monoclonal antibody.
FIGS. 6B-6D: Representative ELISA for robust boosting of mAb production at 48 h after co-transfection of H/L or HQ/LQ (50 ng/well) with or without normalization of GFP (FIG. 6G) or firefly-luciferase (FIG. 6L). The mAb amount is quantified by sigmoidal four-parameter logistic (4PL) curve determination. Relative fold changes are presented as compared with corresponding LG.
FIG. 6E: Average fold changes of 16 experiments based on ELISA results.
FIG. 6F: Western blot analysis confirmed the boost of Qa on mAb production in the supernatant.
FIG. 6G: Qa tagging in the LV transfer vector pRRL-E-LG increased gdLuc activity in the supernatant after LV infection of HEK293T cells.
FIGS. 6H, 6I: Qa tagging in the LV transfer vector pLV-EF1a-Flag-spCas9-Qa-T2A-RFP increases the transgene expression determined by Western blot analysis with anti-Flag antibody (FIG. 6H) but does not increase the packaging efficiency measured with FACS for RFP positivity (FIG. 6I).
FIG. 6J: Representative fluorescent images showing Qa tagging on Pol and RRE boosts LV packaging efficiency for pRRL-GFP transfer LV vector but Qa tagging on Gag impairs LV packaging.
FIGS. 6K-6M: Qa tagging on Pol and RRE boosts LV packaging efficiency for pRRL-UTR-QLG and pLV-EF1a-MS2-spCas9-F2A-GFP determined with LV qPCR titer kit (FIG. 6K), flow cytometry (FIG. 6L) and gdLuc assay (FIG. 6M).
FIGS. 7A-7H are a series of blots and graphs identifying the secretion boost of Qa tagging on various targeted proteins in HEK293T cells.
FIGS. 7A-7B: Qa tagging remarkably decreases the expression level of E-Flag-gdLuc protein in the cell lysate at 48 h after transfection with indicated vectors.
FIG. 7C: Qa tagging decreases the expression levels of secretory IFNγ/IL-2 or non-secretory viral protein N and non-viral protein hACE2 in the cell lysates at 48 h after transfection with indicated vectors. T2A auto-cleaving efficiency varies with targeted proteins, showing different ratios of the cleaved band (c) and non-cleaved band (n).
FIG. 7D: Qa tagging decreases S protein level in the cell lysates while 5’-UTR inclusion does increase the protein level despite the continuous secretion. The cleaved S-Flag-gdLuc fragment was detected with anti-Flag antibody and the cleaved dsGFP fragment was detected with anti-GFP antibody.
FIGS. 7E, 7F: Qa tagging robustly increases the protein level of secretory E-QLG in the supernatant detected by Western blot analysis and gdLuc assay of the supernatant. Cells were transfected with indicated vectors in quadruplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h.
FIG. 7G: ER-Golgi trafficking inhibitor brefeldin A completely blocks the secretion of Qa tagged viral proteins and host cellular proteins.
FIG. 7H: Qa tagging does increase the protein expression of non-secretory firefly-luciferase (fLuc) in the cell lysate and has no effect on the background of fLuc activity in the supernatant.
FIGS. 8A-8I are a series of schematics, photographs of stained cells and graphs demonstrating that Exen21/Qa addition in SARS-CoV-2 viral proteins robustly boosts production of dual reporter-fused viral proteins in HEK293T cells.
FIG. 8A: Diagram of 2A-mediated dual reporter gdLuc/dsGFP (LG) and Qa tagged LG (QLG) fused with viral protein and potential multiple measures of viral protein expression/production. The Exen21/Qa stands for the 21-mer nucleotide motif and its corresponding heptapeptide.
FIGS. 8B-8D: Representative experiments showing Exen21 boosting of SARS-CoV-2 envelope (E) protein dynamic production (FIG. 8B) and average fold induction with results of 20 experiments (FIG. 8C) determined by gdLuc assays in supernatants, 24-72 h after transfection with indicated pcDNA6B vector (100 ng/well, quadruplicate), and representative images of Exen21-boosted dsGFP expression detected by fluorescence microscopy (FIG. 8D). Data represent mean ± SE of gdLuc activity with the relative fold changes (in red) in QLG over corresponding LG groups (the same below).
FIGS. 8E-8F: Representative gdLuc assay showing various degrees of Exen21 boosting in other SARS-CoV-2 structural proteins: spike (S), nucleocapsid (N), and accessory proteins: NSP2, NSP16, and ORF3. Cells were transfected with indicated pcDNA6B vectors (100 ng/well, quadruplicates). Data represent mean ± SE of gdLuc activity in supernatants 48 h post transfection.
FIGS. 8G-8I: Alanine scanning and deletion mutation (FIG. 8G) as well as degenerate (FIG. 8H) and missense (FIG. 8I) mutation assays showing the critical role of the unique and specific Exen21 in boosting E-LG production. Cells were transfected with indicated pcDNA6B-E vectors (100 ng/well, quadruplicates). Data represent mean ± SE of gdLuc activity in supernatants 48 h post-transfection, with the relative percentage changes compared with the parent E-QLG group. Inset in FIG. 8G shows the heptapeptide structure with the residue position. Insets in FIGS. 8H and 8I show the mutated nucleotide and corresponding residues. dQ denotes degenerate QLG mutants and mQ denotes missense QLG mutants.
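Purely by way of illustration, the general form of an alanine scan over a heptapeptide, in which each residue is replaced by alanine one position at a time, can be sketched in Python as below. The peptide `MKTWLQE` is a hypothetical placeholder and is not the heptapeptide of the present disclosure; the function name `alanine_scan` and the mutation labels are likewise assumptions made for the example.

```python
def alanine_scan(peptide):
    """Return (label, variant) pairs with one residue at a time set to Ala.

    Labels use the conventional form <original residue><position>A,
    with 1-based positions. Residues that are already alanine are skipped.
    """
    variants = []
    for pos, residue in enumerate(peptide, start=1):
        if residue == "A":
            continue
        variant = peptide[:pos - 1] + "A" + peptide[pos:]
        variants.append((f"{residue}{pos}A", variant))
    return variants


# Hypothetical heptapeptide used only to show the shape of the output.
placeholder_peptide = "MKTWLQE"
scan = alanine_scan(placeholder_peptide)
```

Each variant differs from the parent at exactly one position, so a change in reporter activity for a given variant can be attributed to that residue.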
FIGS. 9A-9G are a series of graphs demonstrating that Exen21 boosting is versatile in dosages, non-viral proteins and cell types.
FIG. 9A: Dose-dependent and varying extents of Exen21-boosted expression of SARS-CoV-2 S, M, N and ORF3 protein levels. Cells were transfected in quadruplicates with indicated pcDNA6B (6B) vectors in indicated amounts. Data represent mean ± SE of gdLuc activity in supernatants 48 h post-transfection. Fold values indicate changes in Exen21 groups relative to those of corresponding control groups.
FIGS. 9B, 9C: Dose-independent boosting by Exen21 of host cellular gene NIBP and hACE2 levels, determined by gdLuc assay (FIGS. 9B, 9C) 48 h after transfection with pcDNA6B regular vectors.
FIGS. 9D, 9E: Dose-independent boosting by Exen21 of secretory IFNγ and IL-2 by gdLuc assay 48 h after transfection with pRRL LV vectors.
FIG. 9F: Stronger promoter CAG further increases boosting efficiency in LG system by Qa (QLG) in viral protein E.
FIG. 9G: Exen21-induced boosting of viral E and S proteins and non-viral protein hACE2 exhibits similar efficiencies across different cell types.
FIGS. 10A-10F are a series of schematics, a photograph, a blot and graphs demonstrating that Exen21 addition boosts production yields of anti-SARS monoclonal antibody (mAb).
FIG. 10A: Diagram showing human anti-SARS mAb and Exen21/Qa tags introduced (right panel) on its C-termini of constant regions of heavy and light chains.
FIG. 10B: Representative ELISA showing robust boosting by Exen21/Qa (HQ/LQ) of mAb production 48 h after co-transfection of mAb H/L or HQ/LQ expression vectors (50 ng/well, in triplicates), with normalization vectors empty control (C), GFP (G) or firefly-luciferase (L).
FIG. 10C: Sigmoidal 4-parameter logistic curve (4PL) determination of mAb concentrations.
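For the reader's convenience, a minimal Python sketch of a 4PL standard curve and its inverse, as commonly used to back-calculate an analyte concentration from an ELISA signal, is provided below. The parameterization shown (a for the response at zero concentration, d for the response at saturating concentration, c for the inflection point, b for the slope factor) is one common convention and is assumed here; the parameter values are hypothetical, and estimation of the parameters from standards (e.g., by nonlinear least squares) is not shown.

```python
def four_pl(x, a, b, c, d):
    """4PL response at concentration x: d + (a - d) / (1 + (x / c) ** b)."""
    return d + (a - d) / (1.0 + (x / c) ** b)


def inverse_four_pl(y, a, b, c, d):
    """Back-calculate the concentration that gives response y.

    Only defined for responses strictly between the two asymptotes.
    """
    if not min(a, d) < y < max(a, d):
        raise ValueError("response lies outside the range of the curve")
    return c * ((a - d) / (y - d) - 1.0) ** (1.0 / b)


# Hypothetical fitted parameters (optical density vs. ng/ml).
params = dict(a=0.05, b=1.2, c=50.0, d=2.5)

signal = four_pl(20.0, **params)
recovered = inverse_four_pl(signal, **params)
```

Signals at or beyond either asymptote cannot be converted to a concentration, which is why such samples are typically re-assayed at a different dilution.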
FIG. 10D: Normalized quantitative data from experiment/assay shown in B. Relative fold changes are presented as compared with corresponding mAb H/L.
FIG. 10E: Average Exen21/Qa-induced fold changes of ELISA-based mAb production for 16 experiments at p<0.0001 with Student’s t test.
FIG. 10F: Western blot analysis confirming the boost of Exen21/Qa on mAb production in the supernatant. Membrane staining as a loading control is for densitometric analysis of relative fold changes in light chain (LC) between HQ/LQ and H/L groups.
FIGS. 11A-11K are a series of schematics, blots, graphs and photographs demonstrating that Exen21 addition boosts packaging and transduction efficiencies of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP) and standard lentiviral packaging.
FIG. 11A: Diagrams of different vectors expressing human codon-optimized Sd18, and the process of S-LVLP packaging in HEK293T cells.
FIG. 11B: Exen21 increases Sd18 protein expression in transfected cells, shown by Western blot with serum from a SARS-CoV-2 patient, which contains specific anti-S antibody. Representative fold change for S2 fragment is quantified by densitometric analysis with GAPDH normalization.
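As an illustrative sketch only, a densitometric fold change with loading-control normalization of the kind mentioned above can be computed as the ratio of two normalized band intensities. The function name and the band intensities below are hypothetical and are not measurements from the disclosure.

```python
def normalized_fold_change(target_tagged, loading_tagged,
                           target_control, loading_control):
    """Fold change of a target band after loading-control normalization.

    Each target band intensity is divided by the loading-control band
    (e.g., GAPDH) from the same lane before the two lanes are compared.
    """
    if min(loading_tagged, loading_control, target_control) <= 0:
        raise ValueError("band intensities used as divisors must be positive")
    tagged = target_tagged / loading_tagged
    control = target_control / loading_control
    return tagged / control


# Hypothetical densitometry readings in arbitrary units.
fold = normalized_fold_change(
    target_tagged=8400.0, loading_tagged=2100.0,
    target_control=2000.0, loading_control=2000.0,
)
```

Normalizing within each lane first corrects for unequal loading before the tagged and control lanes are compared.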
FIG. 11C: Exen21 addition increases S-LVLP packaging titer of the standard pRRL-GFP LV vector determined by GFP positivity.
FIGS. 11D-11E: Exen21 addition increases S-LVLP packaging titer for dual reporter LV vectors pRRL-E-QLG determined by GFP positivity (FIG. 11D) and gdLuc activity (FIG. 11E).
FIG. 11F: Exen21/Qa in the LV transfer vector pRRL-E-QLG induces LV dose-related increases in gdLuc activity in supernatants of HEK293T cells 48-72 h after infection with indicated amount of crude LV preparation (μl per well, triplicates). Shown are fold changes in gdLuc activity from E-QLG vs. control E-LG group.
FIGS. 11G, 11H: Exen21/Qa in the LV transfer vector pLV-EF1a-Flag-spCas9-Qa-T2A-RFP (Qa) increases transgene expression vs. untagged vector (Con), seen by Western blot analysis with anti-Flag antibody (FIG. 11G), but does not increase packaging efficiency as measured by FACS for RFP positivity 48 h after infection with crude LV preparation (FIG. 11H), with no significance (ns) by Student’s t test.
FIG. 11I: Representative fluorescence images show that Exen21/Qa addition to Pol and RRE enhances pRRL-GFP LV packaging efficiency vs. control (psPAX2) levels, but Exen21/Qa on Gag impairs LV packaging.
FIGS. 11J, 11K: Exen21/Qa tagging on Pol and RRE (PolQ/RREQ) boosts LV packaging efficiency for pRRL-GFP transfer vector, determined by cell counting (FIG. 11J) and flow cytometry (FIG. 11K).
FIG. 11L: The gdLuc assay showing the boosting of Exen21/Qa tagging.
FIGS. 12A-12G are a series of graphs demonstrating that Exen21 addition boosts mRNA-dependent production of SARS-CoV-2 viral proteins S, N, E and ORF3 as well as non-viral hACE2 by increasing mRNA stability and translational efficiency.
FIGS. 12A-12C: Exen21 addition robustly boosts mRNA-dependent production of dual reporter in a time- and dose-dependent manner to varying extents with different targeted proteins.
FIG. 12A: Time course of responses to different concentrations of capped mRNAs for S-LG vs. S-QLG (ng/well, quadruplicate).
FIG. 12B: Time course of response to indicated mRNAs (100 ng/well, quadruplicate).
FIG. 12C: Dose response to indicated mRNAs at 24 h post transfection.
FIGS. 12D-12G: Exen21 addition (QLG; right panels in D, E) increases posttranscriptional mRNA stability and translational efficiency in the presence of transcriptional inhibitor actinomycin D, shown in time-course plots of reporter activity (FIGS. 12D, 12E) and mRNA decay (FIGS. 12F, 12G). The mRNA levels were determined by RT-qPCR analysis.
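As a non-limiting sketch of how an mRNA half-life might be estimated from a decay time course measured after transcriptional inhibition, the Python example below fits a straight line to the natural logarithm of the relative mRNA level. First-order (single-exponential) decay is an assumption of this sketch, and the time points and relative levels are hypothetical values, not RT-qPCR results from the disclosure.

```python
import math


def mrna_half_life(times_h, relative_levels):
    """Estimate half-life (hours) by least-squares fit of ln(level) vs. time.

    Assumes first-order decay, level(t) = level(0) * exp(-k * t), so that
    the fitted slope equals -k and the half-life equals ln(2) / k.
    """
    logs = [math.log(v) for v in relative_levels]
    n = len(times_h)
    t_mean = sum(times_h) / n
    l_mean = sum(logs) / n
    slope = (
        sum((t - t_mean) * (l - l_mean) for t, l in zip(times_h, logs))
        / sum((t - t_mean) ** 2 for t in times_h)
    )
    if slope >= 0:
        raise ValueError("levels do not decay over the time course")
    return math.log(2) / -slope


# Hypothetical time course: relative mRNA remaining after inhibitor addition.
times = [0.0, 2.0, 4.0, 8.0]
levels = [1.0, 0.5, 0.25, 0.0625]
half_life = mrna_half_life(times, levels)
```

Comparing the half-lives fitted for tagged and untagged transcripts gives one simple numerical summary of a difference in mRNA stability.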
FIGS. 13A-13G are a series of blots and graphs demonstrating that Exen21 addition enhances secretion of various targeted proteins in HEK293T cells, shown by Western blot analyses.
FIGS. 13A-13C: Exen21 addition remarkably decreases protein expression levels of viral proteins (E, S, N), non-viral protein hACE2 and secretory IFNγ/IL-2 in cell lysates 48 h after transfection with indicated pcDNA6B (6B) vectors. Fold numbers are relative densitometric changes after normalization by the loading control GAPDH or non-specific bands (NS). P2A auto-cleaving efficiency varies with targeted proteins, showing different ratios of the cleaved band (c) and non-cleaved band (n).
FIGS. 13D, 13E: Exen21 addition robustly increases secretory E-QLG protein levels in the supernatants, seen both by Western blot analyses (FIG. 13D) and gdLuc assay (FIG. 13E). Cells were transfected with indicated vectors in quadruplicates for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h. Membrane staining as a loading control is for densitometric analysis of relative fold changes in E-QLG over E-LG.
FIG. 13F: ER-Golgi trafficking inhibitor brefeldin A blocks the secretion of Qa-tagged viral E protein (E-QLG) and host cell protein (IFNγ), seen both by Western blot analyses (left) and gdLuc assay (right) of the supernatants 48 h after brefeldin A treatment.
FIG. 13G: Exen21 addition elevates non-secretory firefly-luciferase (fLuc) protein levels in cell lysates. Relative fold change is quantified by densitometric analysis with GAPDH normalization.
FIGS. 14A-14C are a series of photographs showing a representative fluorescent microscopy detection of dual reporter. Related to FIGS. 8A-8I and 9A-9G.
FIG. 14A: Three indicated antibodies detected dual reporter of E-Flag-gdLuc-T2A-GFP with 2A and Flag complete colocalization while some cleaved GFP stayed alone without the corresponding E-Flag-gdLuc-T2A, which may have been secreted.
FIGS. 14B, 14C: Representative fluorescent micrographs showing the boosting of N and ORF3 viral proteins and human ACE2 cellular protein under CMV promoter (FIG. 14B) as well as time-dependent E protein under CAG promoter (FIG. 14C) by Exen21/Qa addition. Images were taken at the same exposure settings. Scale bars = 100 μm.
FIGS. 15A-15B are a series of graphs and photographs demonstrating the dose-dependent Exen21/Qa boosting of SARS-CoV-2 viral proteins and saturation of boosting activity at a higher amount of transfected reporter DNA in all the tested viral dual reporters. Related to FIG. 9A.
FIG. 15A: Exen21/Qa boosting in different dosage determined by NanoLight Gaussia luciferase assay.
FIG. 15B: Representative confocal images from 3 fields of 4 wells per group. All images were taken at the same exposure settings. For lower dosage or lower expression groups, stronger GFP signal can be observed at a longer exposure. Scale bars = 100 pm. HEK293T cells in a 96- well plate was transfected with indicated gdLuc-P2A-dsGFP reporter at indicated amount. At 72 h after transfection, EGFP images were taken, and the supernatants were collected for luciferase assay. Data represents relative fold changes compared to corresponding LG group with mean ± SE of 4 wells.
FIGS. 16A-16E are a series of photographs, a blot, a graph and a schematic demonstrating Exen21 boosting of mRNA vaccine production and efficacy. Related to FIGS. 12A-12G.
FIG. 16A: Diagram for in vitro transcription and 5’-Cap modification.
FIG. 16B: Gel electrophoresis (1% agarose) images for transcript length, integrity and quantity of both C0 and C1 5’-capped mRNAs.
FIG. 16C: 10- to 30-fold increases in the expression of dual reporter at equal levels of functional mRNA for viral genes N, E and ORF3. The capped (Cap-C0) and tailed mRNAs of the indicated targets were synthesized using the HiScribe T7 ARCA mRNA Kit (NEB, E2065) and cDNA template from the corresponding linearized plasmid. Half of the Cap-C0 mRNAs were further methylated at the 2'-O position of the first nucleotide adjacent to the Cap-C0 structure using mRNA Cap 2'-O-Methyltransferase (NEB, M0366). Both Cap-C0 and Cap-C1 mRNAs were purified with the Monarch RNA Cleanup Kit (NEB, T2040). HEK293T cells in a 96-well plate were transfected with the indicated mRNA (100 ng/well). At 24 h after transfection, the supernatants were collected for NanoLight Gaussia luciferase assay. Data represent relative fold changes compared to the corresponding LG group with mean ± SE of 4 independent experiments.
FIGS. 16D, 16E: Representative fluorescent GFP images in live cells at 24 h after transfection with indicated vectors. Scale bars = 100 µm.
FIGS. 17A-17E are a series of blots and a graph demonstrating that Qa tagging robustly increases the protein level of secretory E-QLG in the supernatant detected by Western blot analysis and gdLuc assay of the supernatant. Related to Figures 6C, D.
FIGS. 17A, 17B: Western blot with anti-gdLuc monoclonal antibody (Proteintech, Cat# 60158-1-Ig).
FIGS. 17C, 17D: Western blot with anti-GFP polyclonal antibody (Proteintech, Cat# 50430-2-AP).
FIG. 17E: Relative fold changes of boosting efficiency by gdLuc assay. HEK293T cells were transfected with the indicated vectors (100 ng/well) in triplicate for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h before analysis.
FIGS. 18A-18D are a series of graphs and blots demonstrating that ER-Golgi trafficking inhibitor brefeldin A blocks the secretion of Qa tagged viral proteins and host cellular proteins. Related to FIGS. 13A-13G.
FIGS. 18A, 18B: Relative gdLuc activity changes in the supernatant (FIG. 18A) and cell lysate (FIG. 18B) after brefeldin A treatment.
FIGS. 18C, 18D: Western blot with anti-gdLuc monoclonal antibody (Proteintech, Cat# 60158-1-Ig) and anti-GFP polyclonal antibody (Proteintech, Cat# 50430-2-AP). HEK293T cells were transfected with the indicated vectors (50 ng/well) in quadruplicate for 24 h and cultured with FreeStyle™ 293 Expression Medium for 48 h before analysis.
FIGS. 19A-19C are a series of schematics and a table showing the SARS-CoV-2 UTR-E-Flag-Qa-UTR synthesis and cloning. FIG. 19A: Diagram for the synthetic 5’-UTR-E-Flag-Qa-3’-UTR. FIG. 19B: NEBuilder HiFi DNA assembly cloning of the synthetic nucleotides (946 bp) into pCAG-Flag expression vector. FIG. 19C: List of cloning strategies to obtain the indicated vectors for E protein and S protein fused with the QLG dual reporter.
FIGS. 20A-20C are a series of photographs of stained cells showing that both 5’-UTR and 3’-UTR apparently enhanced the promoter-driven expression of Qa-tagged E protein in HEK293T cells. HEK293T cells in a 96-well plate were transfected with the indicated vectors in triplicate (100 ng/well). At 48 h after transfection, cells were fixed with 4% PFA for 10 minutes and immunocytochemistry with anti-Flag antibody was performed. FIG. 20A: Representative confocal images. FIG. 20B: Mean fluorescent intensity determined by ImageJ analysis of 6 fields from 3 wells. FIG. 20C: Western blot analysis with anti-Flag antibody and anti-GAPDH for loading control.
FIGS. 21A-21D are a series of schematics, photographs of stained cells and graphs showing that addition of 5’-UTR between the CAG promoter and the S-Flag-QLG dual reporter enhanced S protein expression. FIG. 21A: Diagram of the dual reporter design with the secretable Gaussia Dura luciferase (gdLuc) plus P2A autocleavable destabilized GFP (dsGFP) and various measures to assess the expression of targeted proteins (here SARS-CoV-2 viral proteins). The novel Qa tag is located between the targeted protein and gdLuc. FIGS. 21B-21D: HEK293T cells in a 96-well plate were transfected with the indicated vectors in quadruplicate at the indicated amount of DNA (12.5-100 ng/well). At 24-72 h after transfection, EGFP images were taken (FIG. 21B), and the supernatants were collected for NanoLight Gaussia luciferase assay (FIGS. 21B, 21C, 21D). Data represent relative light units of bioluminescence (FIG. 21C) or fold changes (FIG. 21D) compared to the corresponding non-UTR group with mean ± SE of 4 wells.
FIGS. 22A-22C are a series of blots and graphs showing that addition of 5’-UTR to the pCAG, pcDNA6B and pRRL vectors dramatically increased the protein expression of the transgenes. HEK293T cells in a 24-well plate (FIG. 22A) or 96-well plate (FIGS. 22B, 22C) were transfected with the indicated vectors (500 ng/well in FIG. 22A or 100 ng/well in FIGS. 22B, 22C). At 48 h after transfection, EGFP expression was determined with Western blot (FIG. 22A), and the supernatants were collected for NanoLight Gaussia luciferase assay (FIGS. 22B, 22C). Data represents fold changes compared to corresponding LG group (FIG. 22B) or relative light unit of bioluminescence (FIG. 22C) non-UTR group with mean ± SE of 4 wells.
FIGS. 23A, 23B are a series of graphs showing that addition of 5’-UTR upstream of in vitro transcribed mRNA significantly enhances the protein expression in HEK293T cells. HEK293T cells in a 96-well plate were transfected using Lipofectamine® MessengerMAX mRNA Transfection Reagent with the indicated mRNAs (50 ng/well) generated from in vitro transcription with a 5’-cap and a 3’-poly(A) tail. The mRNAs encode the indicated viral protein (E or S protein) or endogenous hACE2 protein fused with dual reporter LG or QLG. At 6-24 h after transfection, EGFP images were taken (data not shown), and the supernatants were collected for NanoLight Gaussia luciferase assay. Data represent relative fold changes compared to the corresponding LG group with mean ± SE of 4 wells.
FIGS. 24A-24F are a series of schematics, photographs of stained cells, blots and graphs showing that Qa tagging and 5’-UTR inclusion boost the packaging and transduction efficiency of SARS-CoV-2 S protein-pseudotyped lentivirus-like particles (S-LVLP).
FIG. 24A: Diagram for different vectors expressing human codon-optimized Sd18 and the process of S-LVLP packaging.
FIG. 24B: Qa and 5’-UTR increase Sd18 protein expression in the transfected cells as determined by Western blot with serum from a SARS-CoV-2 patient.
FIGS. 24C-24D: Qa tagging increases S-LVLP packaging titer for the standard pRRL-GFP LV vector determined by GFP positivity, which is further increased by 5’-UTR inclusion.
FIGS. 24E-24F: Qa tagging and 5’-UTR inclusion increase S-LVLP packaging titer for dual reporter LV vectors pRRL-LG or pRRL-E-LG determined by GFP positivity and gdLuc activity.
DETAILED DESCRIPTION
The disclosure is based, in part, on the unexpected finding that an oligonucleotide, cAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7), which encodes a short peptide (termed herein “Qa”), significantly boosted the expression/production of fusion proteins. Further expanded studies identified the versatile property of Exen21/Qa tagging in boosting the production (by up to a thousand-fold) of various proteins including viral proteins, endogenous gene products, vaccines, antibodies, engineered recombinant proteins and virus packaging proteins. Also discovered was the potent boosting of protein production by the SARS-CoV-2 native 5’-UTR, and its synergistic role with Qa tagging. Mechanistically, Qa increased mRNA/protein stability and/or enhanced protein translation, and also facilitated protein secretion. These versatile protein boosting strategies will be broadly beneficial to biomedical science and the protein engineering industry. This is the first evidence for protein regulation/boosting by short peptide tagging and the SARS-CoV-2 native 5’-UTR.
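By way of illustration only, the relationship between the 21-nucleotide Exen21 oligonucleotide (SEQ ID NO: 7) and the Qa peptide can be checked by translating the oligonucleotide in its first reading frame with the standard genetic code. The following Python sketch is not part of the disclosed methods; the function name `translate` and the way the codon table is built are assumptions made for this example, and the expected peptide QPRFAAA is the sequence given elsewhere in this description as SEQ ID NO: 1.

```python
from itertools import product

# Standard genetic code laid out in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA (or RNA) sequence read in frame from its first base."""
    dna = dna.upper().replace("U", "T")
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Exen21 oligonucleotide as written in the description (SEQ ID NO: 7).
EXEN21 = "cAACCGCGGTTCGCGGCCGCT"
qa_peptide = translate(EXEN21)
print(len(EXEN21), qa_peptide)
```

Read in this frame, the seven codons CAA, CCG, CGG, TTC, GCG, GCC and GCT give the seven-residue peptide QPRFAAA.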
Accordingly, embodiments are directed to novel chimeric molecules comprising a peptide tag and 5’-untranslated region (5’-UTR) for use in the enhanced production and expression of a desired biomolecule.
5’- and 3’-Untranslated Regions (UTRs)
An untranslated region (or UTR) refers to either of two sections, one on each side of a coding sequence on a strand of mRNA. If it is found on the 5' side, it is called the 5’-UTR (or leader sequence), or if it is found on the 3' side, it is called the 3' UTR (or trailer sequence). The mRNA is initially transcribed from the corresponding DNA sequence and then translated into protein. However, several regions of the mRNA are usually not translated into protein, including the 5' and 3' UTRs.
Within the 5’-UTR is a sequence that is recognized by the ribosome which allows the ribosome to bind and initiate translation. The mechanism of translation initiation differs in prokaryotes and eukaryotes. The 3' UTR is found immediately following the translation stop codon. The 3' UTR plays a critical role in translation termination as well as post-transcriptional modification.
In this study, it was found that the presence of the native 5’-UTR and 3’-UTR in a standard protein expression system robustly enhances the expression of the viral subgenomic transcripts and further viral protein production. It was also found that the combination of the native 5’-UTR with a short peptide, termed herein the Qa peptide, further facilitated the production of viral and non-viral proteins.
Accordingly, in certain embodiments, a chimeric molecule for use in enhancing the expression and production of a desired biomolecule comprises one or more short peptide domains and one or more UTRs. In certain embodiments, the UTR is a 5’-UTR. In certain embodiments, the UTR is a 3’-UTR.
In certain embodiments, the one or more 5’-untranslated region (UTR) domains or fragments thereof, are derived from one or more viruses. In certain embodiments, the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof. In certain embodiments, the 5’-UTR and/or 3’-UTR are from a coronavirus. In certain embodiments, the coronavirus is SARS-CoV-2.
In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR. In certain embodiments, the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR.
In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR. In certain embodiments, the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
In certain embodiments, the one or more UTR sequences are engineered to include a Shine-Dalgarno sequence (5'-AGGAGGU-3'). This sequence is found 3-10 base pairs upstream from the initiation codon. In certain embodiments, the one or more UTR sequences are engineered to contain a Kozak consensus sequence (ACCAUGG).
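As a non-limiting illustration, the presence of these two motifs in a candidate UTR-plus-start-codon sequence can be screened with simple pattern matching. The Python sketch below is an assumption-laden example rather than a disclosed method: the function names are hypothetical, the 3-10 spacing is interpreted here as the number of nucleotides between the end of the Shine-Dalgarno motif and the AUG, only exact matches to the motifs as written above are considered, and the test sequences are invented toy strings.

```python
import re

SHINE_DALGARNO = "AGGAGGU"   # motif as given in the description
KOZAK = "ACCAUGG"            # consensus as given in the description


def to_rna(seq: str) -> str:
    """Normalize a DNA or RNA string to upper-case RNA letters."""
    return seq.upper().replace("T", "U")


def has_shine_dalgarno(seq: str, min_gap: int = 3, max_gap: int = 10) -> bool:
    """True if an exact Shine-Dalgarno motif is followed by AUG after
    min_gap..max_gap spacer nucleotides (an assumed reading of '3-10')."""
    pattern = re.compile(
        f"{SHINE_DALGARNO}[ACGU]{{{min_gap},{max_gap}}}AUG"
    )
    return pattern.search(to_rna(seq)) is not None


def kozak_start_positions(seq: str) -> list:
    """0-based positions of the AUG inside every exact Kozak match."""
    rna = to_rna(seq)
    # A lookahead is used so that overlapping matches are also reported.
    return [m.start() + 3 for m in re.finditer(f"(?={KOZAK})", rna)]


# Hypothetical toy sequences, not taken from the disclosure.
print(has_shine_dalgarno("GGAGGAGGUAAACAUGGCU"))
print(kozak_start_positions("GCCACCAUGGCG"))
```

A real screen would normally tolerate mismatches to the consensus; exact matching is used here only to keep the example short.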
In certain embodiments, one or more of the 5’-UTR sequences (or nucleic acid molecules each comprising a 5’-UTR sequence) may comprise a synthetic sequence (i.e., a sequence that is not found in nature).
In certain embodiments, one or more of the 5’-UTR sequences (or nucleic acid molecules each comprising a 5’-UTR sequence) may comprise an endogenous 5’-UTR sequence (i.e., a 5’-UTR sequence that is used in nature to recruit ribosome complexes and initiate translation of a transcript). For example, an endogenous 5’-UTR sequence may be part of an mRNA expressed in a cell or population of cells. The cells in the population of cells may be the same type of cell (e.g., HEK-293 cells, PC3 cells, or muscle cells). Alternatively, the population of cells may comprise different cell types (e.g., HEK-293 cells, PC3 cells, and muscle cells). Methods of identifying mRNAs expressed in a cell or population of cells, and of identifying the 5’-UTR sequences of the mRNAs, are known to those having skill in the art. Indeed, various public databases contain cellular mRNA expression and/or 5’-UTR sequence information.
The length of the 5’-UTR sequences (or the nucleic acid molecules each comprising a 5’-UTR sequence) may vary. For example, in some embodiments, at least two of the 5’-UTR sequences have different lengths. In some embodiments, at least two of the 5’-UTR sequences have the same length. In some embodiments, each of the 5’-UTR sequences has the same length. In some embodiments, the length of at least one of the 5’-UTR sequences in the initial chimeric molecule is 3, 4, 5, 6, 7, 8, 9, or 10 base pairs.
In some embodiments, the length of at least one of the 5’-UTR sequences (or the nucleic acid molecules each comprising a 5’-UTR sequence) is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 950, at least 1000, at least 1500, at least 2000, or at least 3000 base pairs in length. In some embodiments, the length of each of the 5’-UTR sequences (or the nucleic acid molecules each comprising a 5’-UTR sequence) is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 950, at least 1000, at least 1500, at least 2000, or at least 3000 base pairs in length.
In some embodiments, the chimeric molecule comprises one or more coronavirus 5’-UTR and/or 3’-UTR sequences, and the length of at least one UTR sequence is increased to a length of interest by adding nucleotides to one or both ends (e.g., by adding repeats of a motif that does not have known secondary structure). Nucleotides may be added to the 5' end, the 3' end, or both the 5' and 3' ends of a 5’-UTR and/or 3’-UTR sequence. In some embodiments, the length of one or more 5’- or 3’-UTR sequences is decreased to a length of interest by removing nucleotides from one or both ends. Nucleotides may be removed from the 5' end, the 3' end, or both the 5' and 3' ends of a 5’-UTR sequence.
In certain embodiments, the UTR sequences comprise one or more mutations. The mutations may be introduced using a genetic algorithm. Examples of genetic algorithms are known to those having skill in the art. See e.g., Scrucca, L. GA: A Package for Genetic Algorithms in R. J. Stat. Softw. (2015). doi:10.18637/jss.v053.i04. The number of mutations introduced into each of the UTR sequences may vary. In some embodiments, at least one UTR sequence is mutated at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotide positions. A mutation may comprise a base pair substitution, a deletion, or an insertion.
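For illustration, the three mutation types named above can be expressed as a single mutation step of the kind a genetic algorithm might apply to a UTR sequence. The Python sketch below is a minimal example under stated assumptions: it does not reproduce the cited R package, the function name `mutate`, the uniform random choice among mutation types, and the example sequence are all hypothetical, and repeated edits may fall on the same position, so `n_mutations` is an upper bound on the number of distinct positions changed.

```python
import random

RNA_BASES = "ACGU"


def mutate(seq, n_mutations, rng,
           kinds=("substitution", "deletion", "insertion")):
    """Apply n_mutations random single-nucleotide edits to an RNA sequence."""
    if not 0 <= n_mutations < len(seq):
        raise ValueError("n_mutations must be >= 0 and smaller than len(seq)")
    bases = list(seq.upper())
    for _ in range(n_mutations):
        kind = rng.choice(kinds)
        if kind == "insertion":
            # Insert a random base at any position, including either end.
            bases.insert(rng.randrange(len(bases) + 1), rng.choice(RNA_BASES))
        else:
            pos = rng.randrange(len(bases))
            if kind == "substitution":
                # Replace with a base that differs from the current one.
                bases[pos] = rng.choice(
                    [b for b in RNA_BASES if b != bases[pos]]
                )
            else:  # deletion
                del bases[pos]
    return "".join(bases)


# Hypothetical toy UTR fragment; a seeded generator keeps runs repeatable.
toy_utr = "ACCAUGGACCAUGGACCAUGG"
variant = mutate(toy_utr, 3, random.Random(0))
print(toy_utr)
print(variant)
```

In a full genetic algorithm, a step like this would be combined with a fitness measure (for example, measured reporter expression) and a selection scheme, neither of which is modeled here.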
Modifications: In certain embodiments, the UTRs comprise one or more chemically modified nucleotides (see, e.g., Current Opinion in Drug Discovery and Development, 2007, 10:523). Kormann et al. have shown that the replacement of only 25% of uridine and cytidine residues by 2-thiouridine and 5-methylcytidine suffices to increase mRNA stability as well as to reduce the activation of innate immunity triggered by externally administered mRNA in vitro (WO2012/0195936 A1; WO2007024708 A2). Known modifications of RNA molecules can be found, for example, in Genes VI, Chapter 9 (“Interpreting the Genetic Code”), Lewin, ed. (1997, Oxford University Press, New York), and Modification and Editing of RNA, Grosjean and Benne, eds. (1998, ASM Press, Washington D.C.). Modified RNA components include the following: 2'-O-methylcytidine; N4-methylcytidine; N4-2'-O-dimethylcytidine; N4-acetylcytidine; 5-methylcytidine; 5,2'-O-dimethylcytidine; 5-hydroxymethylcytidine; 5-formylcytidine; 2'-O-methyl-5-formylcytidine; 3-methylcytidine; 2-thiocytidine; lysidine; 2'-O-methyluridine; 2-thiouridine; 2-thio-2'-O-methyluridine; 3,2'-O-dimethyluridine; 3-(3-amino-3-carboxypropyl)uridine; 4-thiouridine; ribosylthymine; 5,2'-O-dimethyluridine; 5-methyl-2-thiouridine; 5-hydroxyuridine; 5-methoxyuridine; uridine 5-oxyacetic acid; uridine 5-oxyacetic acid methyl ester; 5-carboxymethyluridine; 5-methoxycarbonylmethyluridine; 5-methoxycarbonylmethyl-2'-O-methyluridine; 5-methoxycarbonylmethyl-2'-thiouridine; 5-carbamoylmethyluridine; 5-carbamoylmethyl-2'-O-methyluridine; 5-(carboxyhydroxymethyl)uridine; 5-(carboxyhydroxymethyl)uridine methyl ester; 5-aminomethyl-2-thiouridine; 5-methylaminomethyluridine; 5-methylaminomethyl-2-thiouridine; 5-methylaminomethyl-2-selenouridine; 5-carboxymethylaminomethyluridine; 5-carboxymethylaminomethyl-2'-O-methyluridine; 5-carboxymethylaminomethyl-2-thiouridine; dihydrouridine; dihydroribosylthymine; 2'-methyladenosine; 2-methyladenosine; N6-methyladenosine; N6,N6-dimethyladenosine; N6,2'-O-trimethyladenosine; 2-methylthio-N6-isopentenyladenosine; N6-(cis-hydroxyisopentenyl)-adenosine; 2-methylthio-N6-(cis-hydroxyisopentenyl)-adenosine; N6-glycinylcarbamoyladenosine; N6-threonylcarbamoyl adenosine; N6-methyl-N6-threonylcarbamoyl adenosine; 2-methylthio-N6-methyl-N6-threonylcarbamoyl adenosine; N6-hydroxynorvalylcarbamoyl adenosine; 2-methylthio-N6-hydroxynorvalylcarbamoyl adenosine; 2'-O-ribosyladenosine (phosphate); inosine; 2'-O-methyl inosine; 1-methyl inosine; 1,2'-O-dimethyl inosine; 2'-O-methyl guanosine; 1-methyl guanosine; N2-methyl guanosine; N2,N2-dimethyl guanosine; N2,2'-O-dimethyl guanosine; N2,N2,2'-O-trimethyl guanosine; 2'-O-ribosyl guanosine (phosphate); 7-methyl guanosine; N2,7-dimethyl guanosine; N2,N2,7-trimethyl guanosine; wyosine; methylwyosine; under-modified hydroxywybutosine; wybutosine; hydroxywybutosine; peroxywybutosine; queuosine; epoxyqueuosine; galactosyl-queuosine; mannosyl-queuosine; 7-cyano-7-deazaguanosine; archaeosine [also called 7-formamido-7-deazaguanosine]; and 7-aminomethyl-7-deazaguanosine.
In some embodiments, the UTR is a synthetic oligonucleotide. In some embodiments, the synthetic oligonucleotide comprises a modified nucleotide. Modification of the inter-nucleoside linker (i.e. backbone) can be utilized to increase stability or pharmacodynamic properties. For example, inter-nucleoside linker modifications prevent or reduce degradation by cellular nucleases, thus improving the pharmacokinetics and bioavailability of the UTR. Generally, a modified inter-nucleoside linker includes any linker, other than a phosphodiester (PO) linker, that covalently couples two nucleosides together. In some embodiments, the modified inter-nucleoside linker increases the nuclease resistance of the UTR compared to a phosphodiester linker. For naturally occurring oligonucleotides, the inter-nucleoside linker includes phosphate groups creating a phosphodiester bond between adjacent nucleosides. In some embodiments, the UTR comprises one or more inter-nucleoside linkers modified from the natural phosphodiester.
In some embodiments all of the inter-nucleoside linkers of the UTR, or contiguous nucleotide sequence thereof, are modified. For example, in some embodiments the inter-nucleoside linkage comprises sulfur (S), such as a phosphorothioate inter-nucleoside linkage.
Modifications to the ribose sugar or nucleobase can also be utilized herein. Generally, a modified nucleoside includes the introduction of one or more modifications of the sugar moiety or the nucleobase moiety. In some embodiments, the UTRs, as described, comprise one or more nucleosides comprising a modified sugar moiety, wherein the modified sugar moiety is a modification of the sugar moiety when compared to the ribose sugar moiety found in deoxyribonucleic acid (DNA) and RNA. Numerous nucleosides with modification of the ribose sugar moiety can be utilized, primarily with the aim of improving certain properties of oligonucleotides, such as affinity and/or stability. Such modifications include those where the ribose ring structure is modified. These modifications include replacement with a hexose ring (HNA), a bicyclic ring having a biradical bridge between the C2 and C4 carbons on the ribose ring (e.g. locked nucleic acids (LNA)), or an unlinked ribose ring which typically lacks a bond between the C2 and C3 carbons (e.g. UNA). Other sugar modified nucleosides include, for example, bicyclohexose nucleic acids or tricyclic nucleic acids. Modified nucleosides also include nucleosides where the sugar moiety is replaced with a non-sugar moiety, for example in the case of peptide nucleic acids (PNA), or morpholino nucleic acids.
Sugar modifications also include modifications made by altering the substituent groups on the ribose ring to groups other than hydrogen, or the 2'-OH group naturally found in DNA and RNA nucleosides. Substituents may, for example, be introduced at the 2', 3', 4' or 5' positions. Nucleosides with modified sugar moieties also include 2' modified nucleosides, such as 2' substituted nucleosides. Indeed, much focus has been spent on developing 2' substituted nucleosides, and numerous 2' substituted nucleosides have been found to have beneficial properties when incorporated into oligonucleotides, such as enhanced nuclease resistance and enhanced affinity. A 2' sugar modified nucleoside is a nucleoside that has a substituent other than H or -OH at the 2' position (2' substituted nucleoside) or comprises a 2' linked biradical, and includes 2' substituted nucleosides and LNA (2'-4' biradical bridged) nucleosides. Examples of 2' substituted modified nucleosides are 2'-O-alkyl-RNA, 2'-O-methyl-RNA, 2'-alkoxy-RNA, 2'-O-methoxyethyl-RNA (MOE), 2'-amino-DNA, 2'-Fluoro-RNA, and 2'-F-ANA nucleoside. By way of further example, in some embodiments, the modification in the ribose group comprises a modification at the 2' position of the ribose group. In some embodiments, the modification at the 2' position of the ribose group is selected from the group consisting of 2'-O-methyl, 2'-fluoro, 2'-deoxy, and 2'-O-(2-methoxyethyl).
In some embodiments, the UTRs comprise one or more modified sugars. In some embodiments, the UTR comprises only modified sugars. In certain embodiments, the UTR comprises greater than 10%, 25%, 50%, 75%, or 90% modified sugars. In some embodiments, the modified sugar is a bicyclic sugar. In some embodiments, the modified sugar comprises a 2'-O-methoxyethyl group. In some embodiments, the UTR comprises both inter-nucleoside linker modifications and nucleoside modifications.
In additional aspects, the chimeric molecule comprises an internal ribosome entry site (IRES). As is understood in the art, an IRES is an RNA element that allows for translation initiation in an end-independent manner. In exemplary embodiments, the IRES is in the 5' UTR. In other embodiments, the IRES may be outside the 5' UTR.
Peptide Domains
As discussed above, the chimeric molecule for use in enhancing the expression and production of a desired biomolecule comprises one or more short peptide domains and one or more UTRs.
In certain embodiments, the chimeric molecule comprises one or more peptide domains. In certain embodiments, the one or more peptide domains comprise from about five amino acids to about twenty amino acids. In certain embodiments, the one or more peptide domains comprise about seven amino acids. In certain embodiments, the synthetic peptide tag comprises an amino acid sequence having at least about 70% (such as at least about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or greater) sequence identity to the sequence: QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptide domains comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptide domains comprise an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1). In certain embodiments, the one or more peptide domains comprise the amino acid sequence QPRFAAA (SEQ ID NO: 1).
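To make the identity thresholds concrete for a seven-residue tag, the following Python sketch computes a simple percent identity between QPRFAAA and a candidate peptide of the same length, and the minimum number of matching residues a given threshold implies. It is offered only as an illustration: the description does not prescribe a method for calculating sequence identity, so the ungapped, position-by-position comparison used here is an assumption, as are the function names and the variant peptides.

```python
import math

QA = "QPRFAAA"  # SEQ ID NO: 1


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("this simple measure needs equal-length sequences")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def min_matches(length: int, threshold_percent: int) -> int:
    """Smallest number of identical positions that meets the threshold."""
    return math.ceil(length * threshold_percent / 100)


# Hypothetical variants used only to exercise the arithmetic.
one_change = "QPRFAAG"   # 6 of 7 positions identical
two_changes = "QPKFAAG"  # 5 of 7 positions identical

print(round(percent_identity(QA, one_change), 1))
print(round(percent_identity(QA, two_changes), 1))
print(min_matches(len(QA), 70), min_matches(len(QA), 90))
```

Under this simplified measure, a 70% threshold on a seven-residue peptide tolerates up to two substitutions (5/7 is about 71.4%), whereas a 90% threshold is met only by the exact sequence, since a single substitution already gives 6/7, about 85.7%. Alignment-based identity measures that permit gaps or differing lengths may give other values.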
In certain embodiments, the chimeric molecule comprises one or more peptide domains comprising an amino acid sequence of Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
In certain embodiments, the one or more peptide domains comprise one or more non-natural amino acids or modified amino acids. Examples of modified amino acids include amino acids that have been phosphorylated, acetylated, glycosylated, carboxylated, hydroxylated, sulfated, and the like. Examples of non-natural amino acids include D-amino acids, homo amino acids, N-methyl amino acids, alpha-methyl amino acids, beta (homo) amino acids, gamma amino acids, helix/turn stabilizing motifs, backbone modifications (e.g. peptoids). Other examples of amino acids that are contemplated include hydroxyproline (Hyp), beta-alanine, citrulline (Cit), ornithine (Orn), norleucine (Nle), 3-nitrotyrosine, nitroarginine, pyroglutamic acid (Pyr).
Biomolecules of Interest
A fusion protein or chimeric molecule, e.g. a peptide domain and/or UTR sequence associated with a biomolecule such as a protein, of the present disclosure is obtained by associating a peptide tag with a target protein (also referred to as a fusion protein of a tag and a target protein). One or more chimeric molecules may be bound to the N-terminus of the target protein, one or more chimeric molecules may be bound to the C-terminus of the target protein, one or more chimeric molecules may be bound to both the N-terminus and the C-terminus of the target protein, or one or more chimeric molecules may be inserted into an internal region of the target protein. The one or more chimeric molecules may be directly bound to the N-terminus and/or the C-terminus of the target protein or may be bound through a sequence of 1 to several amino acids (for example, 1 to 10 amino acids). The sequence of 1 to several amino acids may be any sequence as long as the sequence does not adversely affect the function or the expression level of the chimeric molecule-target protein. The chimeric molecules may also be separated from the target protein after expression and purification by using a protease recognition sequence.
In certain embodiments, at least one or more chimeric molecules are associated with one or more biomolecules of interest. Examples of biomolecules include cytokines, growth factors, viral antigens, tumor antigens, antigens, polynucleotides, oligonucleotides, hormones, enzymes, checkpoint proteins, an antigen, an antibody, a transcription factor, a receptor, a ligand, immunoglobulins, immunoglobulin fragments, a fluorescent protein, etc. The length of the biomolecule, e.g. peptide of interest may vary as long as the amount of the targeted biomolecule, e.g. a peptide produced is significantly increased when expressed in the form of a fusion peptide/chimeric molecule.
Examples of the enzyme include enzymes such as lipase, protease, steroid synthesizing enzyme, kinase, phosphatase, xylanase, esterase, methylase, demethylase, oxidase, reductase, cellulase, aromatase, Carnauba, transglutaminase, glycosidase, and chitinase. Growth factors include, for example, epidermal growth factor (EGF), insulin-like growth factor (IGF), transforming growth factor (TGF), nerve growth factor (NGF), brain derived neurotrophic factor (BDNF), vascular endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), granulocyte macrophage colony stimulating factor (GM-CSF), platelet derived growth factor (PDGF), erythropoietin (EPO), thrombopoietin, fibroblast growth factor (FGF), hepatocyte growth factor (HGF). Examples of the hormone include insulin, glucagon, somatostatin, growth hormone, parathyroid hormone, prolactin, leptin and calcitonin. Examples of cytokines include interleukin, interferon (IFN alpha, IFN beta, IFN gamma), tumor necrosis factor (TNF). Blood proteins include, for example, thrombin, serum albumin, Factor VII, Factor X, tissue plasminogen activator. Antibody proteins include, for example, F(ab')2, Fc, Fc fusion protein, heavy chain (H chain), light chain (L chain), single-chain Fv (scFv), sc(Fv)2, disulfide-linked Fv (sdFv), Diabodies.
Immune checkpoint proteins are well known in the art and include, without limitation, CTLA-4, PD-1, VISTA, B7-H2, B7-H3, PD-L1, B7-H4, B7-H6, 2B4, ICOS, HVEM, PD-L2, CD160, gp49B, PIR-B, KIR family receptors, TIM-1, TIM-3, TIM-4, LAG-3, BTLA, SIRPalpha (CD47), CD48, 2B4 (CD244), B7.1, B7.2, ILT-2, ILT-4, TIGIT, and A2aR.
Antigens may be appropriately selected depending on the subject of the immunological response, for example, a protein derived from a pathogenic bacterium, or a protein derived from a pathogenic virus.
The chimeric molecules may be combined with a secretory signal peptide functioning in the host cell for secretory production. When yeast is used as a host, the secretory signal peptide can be exemplified by an invertase secretion signal. In certain embodiments, the secretory signal is obtained from two or more different sources. Various sources include, for example, Bacillus species, Lactococcus lactis, Streptomyces, or Corynebacterium. Other signal sequences include, for example, human IL-2, human chymotrypsin, human interferon gamma, etc.
In certain embodiments, the chimeric molecules may be supplemented with a transport signal peptide such as an endoplasmic reticulum retention signal peptide or a liquid phase transition signal peptide for expression in a specific cell compartment.
The chimeric biomolecules can be chemically synthesized or can be genetically produced. The DNA of the present disclosure is characterized by including nucleic acids encoding the chimeric molecule of the present disclosure.
The DNA of the present disclosure may contain an enhancer sequence or the like functioning in the host cell so as to improve the expression in the host cell. Examples of the enhancer include the Kozak sequence and the 5'-untranslated region of a plant-derived alcohol dehydrogenase gene.
Constructs
Genetic constructs or vectors comprise a nucleotide sequence that encodes a desired protein operably linked to regulatory elements needed for gene expression. Accordingly, incorporation of the DNA or RNA molecule into a living cell results in the expression of the
DNA or RNA encoding the desired protein and thus, production of the desired protein. The chimeric molecules of the present disclosure can be produced by a general genetic engineering technique. For example, a recombinant vector encoding for the chimeric molecule. The recombinant vector of the present disclosure is not particularly limited as long as the nucleic acid sequences chimeric molecule is inserted into the vector so that it can be expressed in a host cell into which the vector is introduced. The vector is not particularly limited as long as it is replicable in the host cell, and examples thereof include plasmid DNA and viral DNA. The regulatory elements necessary for gene expression of a DNA molecule include: a promoter, an initiation codon, a stop codon, and a polyadenylation signal. In addition, enhancers are often required for gene expression. It is necessary that these elements be operable linked to the sequence that encodes the desired proteins and that the regulatory elements are operably in the individual to whom they are administered.
Initiation and stop codons are generally considered to be part of a nucleotide sequence that encodes the desired protein. However, it is necessary that these elements are functional in the individual to whom the gene construct is administered. The initiation and termination codons must be in frame with the coding sequence.
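The in-frame requirement above can be illustrated with a minimal sketch. The function name `check_open_reading_frame` and the short test sequences are hypothetical and are not taken from the present disclosure; the sketch assumes the standard stop codons (TAA, TAG, TGA) and an ATG initiation codon, and it only checks frame consistency, not whether the codons are functional in a given host.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def check_open_reading_frame(seq: str) -> list[str]:
    """Return a list of problems found; an empty list means all checks passed."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA notation
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, ignoring any trailing partial codon.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] != "ATG":
        problems.append("does not begin with an ATG initiation codon")
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("does not end with an in-frame stop codon")
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        problems.append("premature in-frame stop codon")
    return problems


if __name__ == "__main__":
    # Hypothetical toy sequences, for illustration only.
    print(check_open_reading_frame("ATGGCTTAA"))     # in frame
    print(check_open_reading_frame("ATGGCTTA"))      # frame broken
    print(check_open_reading_frame("ATGTAAGCTTAA"))  # premature stop
```

A tag inserted between a coding sequence and a downstream reporter would need to keep the total length a multiple of three and introduce no stop codon for the reporter to remain in frame.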
The molecule that encodes a desired protein may be DNA or RNA which comprises a nucleotide sequence that encodes the desired protein. These molecules may be cDNA, genomic DNA, synthesized DNA or a hybrid thereof or an RNA molecule such as mRNA. Accordingly, as used herein, the terms “DNA construct”, “genetic construct”, “nucleotide sequence”, and “nucleic acid” are meant to refer to both DNA and RNA molecules.
When taken up by a cell, the genetic construct which includes the nucleotide sequence encoding the desired protein operably linked to the regulatory elements may remain present in the cell as a functioning extrachromosomal molecule or it may integrate into the cell's chromosomal DNA. DNA may be introduced into cells where it remains as separate genetic material in the form of a plasmid. Alternatively, linear DNA which can integrate into the chromosome may be introduced into the cell. When introducing DNA into the cell, reagents which promote DNA integration into chromosomes may be added. DNA sequences which are useful to promote integration may also be included in the DNA molecule. Alternatively, RNA may be administered to the cell. It is also contemplated to provide the genetic construct as a linear minichromosome including a centromere, telomeres and an origin of replication.
Accordingly, in certain embodiments, the present disclosure includes a vector comprising one or more cassettes comprising: a UTR, biomolecule, peptide tag domain, e.g., Qa tag (SEQ ID NO: 1). The vector can be any vector that is known in the art and is suitable for expressing the desired expression cassette. A number of vectors are known or can be designed to be capable of mediating transfer of gene products to mammalian cells, as is known in the art and described herein. In certain aspects, a vector refers to a nucleic acid polynucleotide to be delivered to a host cell, either in vitro or in vivo. In some embodiments, one or more cassettes are provided on a single vector. In some embodiments, one or more cassettes are provided on two or more vectors. In some embodiments, cassettes are provided by one or more vectors comprising an isolated nucleic acid encoding one or more elements of a gene editing system. In some embodiments, the cassettes are provided by one or more vectors comprising an isolated nucleic acid encoding one or more components comprising: a UTR(s), biomolecule(s), peptide tag(s). In some instances, the expression of natural or synthetic nucleic acids encoding an RNA and/or peptide is typically achieved by operably linking a nucleic acid encoding the RNA and/or peptide or portions thereof to a promoter and incorporating the construct into an expression vector. The vectors to be used are suitable for replication and, optionally, integration in eukaryotic cells. Typical vectors contain transcription and translation terminators, initiation sequences, and promoters useful for regulation of the expression of the desired nucleic acid sequence.
The isolated nucleic acids of the disclosure can be cloned into a number of types of vectors. For example, the nucleic acid can be cloned into a vector including, but not limited to a plasmid, a phagemid, a phage derivative, an animal virus, and a cosmid. Vectors of particular interest include expression vectors, replication vectors, probe generation vectors, and sequencing vectors.
Additional promoter elements, e.g., enhancers, regulate the frequency of transcriptional initiation. In some embodiments, the vector also includes conventional control elements which are operably linked to the transgene in a manner which permits its transcription, translation and/or expression in a cell transfected with the plasmid vector or infected with the virus comprising a nucleic acid comprising the described cassettes or compositions. As used herein, “operably linked” sequences include both expression control sequences that are contiguous with the gene of interest and expression control sequences that act in trans or at a distance to control the gene of interest. Expression control sequences include appropriate transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus sequence); sequences that enhance protein stability; and when desired, sequences that enhance secretion of the encoded product. A great number of expression control sequences, including promoters which are native, constitutive, inducible and/or tissue-specific, are known in the art and can be utilized.
Typically, these are located in the region 30-110 bp upstream of the start site, although a number of promoters have recently been shown to contain functional elements downstream of the start site as well. The spacing between promoter elements frequently is flexible, so that promoter function is preserved when elements are inverted or moved relative to one another. In the thymidine kinase (tk) promoter, the spacing between promoter elements can be increased to 50 bp apart before activity begins to decline. Depending on the promoter, it appears that individual elements can function either cooperatively or independently to activate transcription.
The selection of appropriate promoters can readily be accomplished. In certain aspects, one would use a high expression promoter. Promoters and polyadenylation signals used must be functional within the cells of the individual. The promoter used in the vector may be appropriately selected depending on the host cell into which the vector is introduced. For example, when expressed in yeast, the GAL1 promoter, the PGK1 promoter, the TEF1 promoter, the ADH1 promoter, the TPI1 promoter, the PYK1 promoter and the like can be used. When expressed in plants, Cauliflower Mosaic Virus 35S promoter, rice actin promoter, corn ubiquitin promoter, lettuce ubiquitin promoter, and the like can be used. When expressed in Escherichia coli, T7 promoter and the like can be used. In the case of expression in Brevibacillus, P2 promoter and P22 promoter and the like can be mentioned. Inducible promoters can also be used. Examples include, in addition to lac, tac and trc which are inducible by IPTG, trp which can be induced by IAA, ara which can be induced by L-arabinose, Pzt-1 which can be induced by using tetracycline, the λ PL promoter inducible at high temperature (42 °C), and a promoter of the cspA gene, which is one of the cold shock genes. Other examples of promoters useful in the production of a genetic vaccine for humans include, but are not limited to, promoters from Simian Virus 40 (SV40), Mouse Mammary Tumor Virus (MMTV) promoter, Human Immunodeficiency Virus (HIV) such as the HIV Long Terminal Repeat (LTR) promoter, Moloney virus, ALV, Cytomegalovirus (CMV) such as the CMV immediate early promoter, Epstein Barr Virus (EBV), Rous Sarcoma Virus (RSV) as well as promoters from human genes such as human Actin, human Myosin, human Hemoglobin, human muscle creatine and human metallothionein. Examples of polyadenylation signals useful to practice the present disclosure, especially in the production of a genetic vaccine for humans, include but are not limited to SV40 polyadenylation signals and LTR polyadenylation signals. In particular, the SV40 polyadenylation signal which is in pCEP4 plasmid (Invitrogen, San Diego Calif.), referred to as the SV40 polyadenylation signal, is used.
One example of a suitable promoter is the CAG promoter or the immediate early cytomegalovirus (CMV) promoter sequence. This promoter sequence is a strong constitutive promoter sequence capable of driving high levels of expression of any polynucleotide sequence operatively linked thereto. In certain embodiments, the Rous sarcoma virus (RSV) and MMT promoters are also used. Certain proteins can be expressed using their native promoter. Other elements that can enhance expression can also be included such as an enhancer or a system that results in high levels of expression such as a tat gene and tar element. This cassette can then be inserted into a vector, e.g., a plasmid vector such as pUC19, pUC118, pBR322, or other known plasmid vectors, that includes, for example, an E. coli origin of replication.
Another example of a suitable promoter is Elongation Growth Factor-1α (EF-1α). However, in some embodiments, other constitutive promoter sequences are used, including, but not limited to the simian virus 40 (SV40) early promoter, mouse mammary tumor virus (MMTV), human immunodeficiency virus (HIV) long terminal repeat (LTR) promoter, MoMuLV promoter, an avian leukemia virus promoter, an Epstein-Barr virus immediate early promoter, a Rous sarcoma virus promoter, as well as human gene promoters such as, but not limited to, the actin promoter, the myosin promoter, the hemoglobin promoter, and the creatine kinase promoter. Further, the disclosure should not be limited to the use of constitutive promoters. Inducible promoters are also contemplated as part of the disclosure. The use of an inducible promoter provides a molecular switch capable of turning on expression of the polynucleotide sequence to which it is operatively linked when such expression is desired or turning off the expression when expression is not desired. Examples of inducible promoters include, but are not limited to a metallothionein promoter, a glucocorticoid promoter, a progesterone promoter, and a tetracycline promoter.
Enhancer sequences found on a vector also regulate expression of the gene contained therein. Typically, enhancers are bound by protein factors to enhance the transcription of a gene. In some instances, enhancers are located upstream or downstream of the gene they regulate. In some instances, enhancers are also tissue-specific to enhance transcription in a specific cell or tissue type. In some embodiments, the vector of the present disclosure comprises one or more enhancers to boost transcription of the gene present within the vector. In some instances, to assess the expression of the nucleic acid and/or protein, the expression vector to be introduced into a cell can also contain either a selectable marker gene or a reporter gene or both to facilitate identification and selection of expressing cells from the population of cells sought to be transfected or infected through viral vectors. In other embodiments, the selectable marker is carried on a separate piece of DNA and used in a co-transfection procedure. Both selectable markers and reporter genes can be flanked with appropriate regulatory sequences to enable expression in the host cells. Useful selectable markers include, for example, antibiotic-resistance genes, such as neo and the like.
If necessary, a terminator sequence may also be included depending on the host cell.
The recombinant vector of the present disclosure can be produced, for example, by digesting a DNA construct with a suitable restriction enzyme, or adding a restriction enzyme site by PCR, and inserting the construct into a restriction enzyme site or a multicloning site of the vector.
Host cells. The host cell used for transformation (“transformant”) may be eukaryotic cells or prokaryotic cells, preferably eukaryotic cells. In certain embodiments, eukaryotic cells such as yeast cells, mammalian cells, plant cells, insect cells and the like are used. Examples of the yeast include Saccharomyces cerevisiae, Candida utilis, Schizosaccharomyces pombe, Pichia pastoris, and the like. In addition, microorganisms such as Aspergillus may be used. Examples of prokaryotic cells include Escherichia coli, Lactobacillus, Bacillus, Brevibacillus, Agrobacterium tumefaciens, actinomycetes and the like. Plant cells include plant cells belonging to Asteraceae, Solanaceae, Brassicaceae, Rosaceae, Chenopodiaceae, etc., such as Lactuca.
The transformant used in the present disclosure can be produced by introducing the recombinant vector of the present disclosure into a host cell using a general genetic engineering technique. For example, an electroporation method (Tada, et al., 1990, Theor. Appl. Genet, 80: 475), a protoplast method (Gene, 39, 281-286 (1985)), a polyethylene glycol method, 1993, Transgenic, Res. 2: 218, Hiei, et al., 1994), Agrobacterium-mediated transformation (Hood et al., 1991, Theor. Appl. Genet. J. 6: 271), particle gun method (Sanford et al., 1987, J. Part. Sci. Tech. 5: 27), polycation method (Ohtsuki, et al., FEBS Lett. 3): 235-240) can be used. In addition, gene expression may be a transient expression or a stable expression inserted into a chromosome.
After introducing the recombinant vector of the present disclosure into a host cell, the transformants can be selected according to the phenotype of the selection marker. In addition, the tagged protein can be produced by culturing the selected transformant. The culture medium and conditions used for the culture can be appropriately selected depending on the species of the transformant.
When the host cell is a plant cell, the plant cell can be regenerated by culturing the selected plant cell by a conventional method, and the tag-added protein can be accumulated in the plant cell or outside the cell membrane of the plant cell.
Tagged biomolecules which have accumulated in or outside the cells can be separated and purified according to methods well known to those skilled in the art. For example, they can be separated and purified by a method known in the art, such as salting out, ethanol precipitation, ultrafiltration, gel filtration chromatography, ion exchange column chromatography, affinity chromatography, medium pressure liquid chromatography, reversed phase chromatography, or hydrophobic chromatography.
Hereinafter, embodiments of the present disclosure will be described, but the present disclosure is not limited to these embodiments.
Examples
Example 1: Tagging for enhancing protein expression/secretion
Proteins play a key role in various physiological processes and pathological conditions. Protein expression is a critical and integral process in biological and medical research, but it can be difficult and costly to increase for large-scale applications. Enhancing the production yield of protein expression/secretion is increasingly important for biopharmaceutical development, immunological/vaccine industry, and biological therapeutics.
Results
Unexpected discovery of short peptide Qa tag to enhance gene expression. During the preliminary study to express SARS-CoV-2 viral proteins in mammalian cells, the aim was to set up a dual reporter system by fusing Gaussia-Dura luciferase (gdLuc) and destabilized GFP (dsGFP), abbreviated as LG, into the C-terminus of SARS-CoV-2 viral proteins (FIG. 1A). The advantage of this dual reporter is the dynamic quantitative measurement of secretory gdLuc-fused target protein in the culture media with a sensitive gdLuc assay, and of dsGFP positivity and intensity by fluorescent microscopy and flow cytometry. Since a lentiviral (LV) system allows a wider range of target cells and easy establishment of stable cell lines, an LV pCDH-nCoV-E-Flag vector (Zhang et al., 2020), which expresses the SARS-CoV-2 structural envelope (E) protein, was selected as the backbone for dual reporter cloning. NEBbuilder-HiFi cloning was performed via NotI/ApaI sites using two fragments derived by PCR using the primers Flag-Not-gdLuc-F and gdLuc-P2A-R for the gdLuc fragment and P2A-dsGFP-F and dsGFP-PCR-Apa-R for the dsGFP fragment. Due to failure of such cloning, a new pair of primers was designed to generate one fragment by overlap PCR with the PCR products from the above two fragments, and the fragment was cloned into the pcDNA6B-nCoV-E-Flag vector (Zhang et al., 2020) via the SacII cloning site using the NEBbuilder-HiFi kit. After confirmation of the correct clones by restriction enzyme digestion, the positive clones, E1 and E7, were tested for protein expression detected by fluorescent microscopy and secretory gdLuc reporter assay (FIG. 1A). Surprisingly, it was found that E7 exhibited >20-fold higher luciferase activity than E1. After Sanger sequencing, it was discovered that the E7 clone had an additional 21 nucleotides before the LG and after the Flag tag, which encoded 7 amino acids in frame.
This 7-amino-acid peptide was assigned the term “Qa” based on the potential pronunciation of the sequence, and its linked LG is referred to as QLG in the following studies. Further validation studies confirmed that the pcDNA6B-E-QLG had up to 90-fold higher expression than pcDNA6B-E-LG (FIGS. 1A-1D). The effect of this Qa tag on the expression of other structural proteins of SARS-CoV-2, such as spike (S), nucleocapsid (N), and membrane (M), as well as accessory proteins NSP2, NSP16 and ORF3 was examined. It was found that Qa enhanced the expression of all the tested viral proteins by 3- to 4000-fold (FIGS. 1E-1F and FIG. 2A), with the extent depending on each protein. Such variation of enhancing efficiency may result from differences in cellular density/functionality, transfection efficiency, reporter dosage, and viral protein types. Similar enhancing effects are applicable to many non-viral proteins (FIGS. 2B-2E). Interestingly, transfection with a lower amount of plasmid DNA in HEK293T cells showed a higher enhancing efficiency for most viral proteins of SARS-CoV-2 (FIG. 2A), but not for host cellular gene products such as mouse NIBP (FIG. 2B) and human ACE2 (FIGS. 2C, 2D) or cytokines such as IFNγ (FIG. 2E) and IL-2 (FIG. 2F). Similar enhancing function is applicable to other cell types such as HeLa, BHK and others (FIG. 2G). In addition to regular plasmids, this Qa tag also enhances protein expression in viral transfer vectors such as LV vectors (FIGS. 2E, 2F). In summary, this Qa tagging is versatile in enhancing protein expression/production across various genes, cell types and species.
Further enhancement of viral protein expression by optimization of promoter and native UTR. During the initial studies herein, to express SARS-CoV-2 viral proteins for their cellular distribution and functional role in the pathogenesis of COVID-19, there was an absence or low expression of most viral proteins (data not shown). In the pcDNA6B expression vector system (CMV promoter), it was found that most viral proteins showed undetectable expression by Western blot and immunocytochemistry, while the host cellular gene hACE2 showed very strong expression. In the pCAG vector system (CAG promoter), it was found that most viral proteins were detectable but to various degrees. However, the expression of E and S proteins remained very weak or undetectable by immunocytochemistry and Western blot with anti-Flag antibody, which is in concordance with several reports (Boson et al., 2020; Hu et al., 2020; Ou et al., 2020; Zhang et al., 2020). To solve this problem, the dual LG reporter system was established with high sensitivity and quantitative analysis. The data suggested that the LG system can detect the expression of both E and S protein in either the pcDNA6B or pCAG vector although the levels remained very low; however, Qa tagging robustly increased their expression (FIG. 3A). Again, the CAG promoter exhibited higher activity than the CMV promoter as previously demonstrated (Dou et al., 2021; Zhang et al., 2020). To address whether the Qa tagging further enhanced CAG-driven gene expression of viral proteins, parallel experiments were performed between CMV and CAG promoters for viral proteins E and S. As shown in FIGS. 3A-3C, Qa induced stronger enhancement of viral E and S proteins in the presence of the stronger CAG promoter (5~6-fold). The enhancement by Qa of CAG-driven NSP16 expression reached up to 212-fold (FIG. 3C).
To test whether SARS-CoV-2 native UTR regulates Qa-accelerated expression of the viral proteins, a DNA fragment containing 5’-UTR-E-Flag-Qa-3’-UTR was synthesized based on the public SARS-CoV-2 strain (Wu et al., 2020) and cloned into the pCAG vector. The E protein was selected because of its relatively small size for cheap and fast synthesis as compared with S protein. Surprisingly, it was found that the addition of the native 5’-UTR robustly enhanced the expression of E protein as determined by Western blot and immunocytochemistry with anti-Flag antibody (FIGS. 3D, 3E). Inclusion of the native 3’-UTR further increased E protein expression (FIGS. 3D, 3E). S protein is very important for vaccine development, pseudovirion production, and drug discovery; however, its expression is the most difficult among the viral proteins of SARS-CoV-2 (Boson et al., 2020; Hu et al., 2020; Ou et al., 2020; Walls et al., 2020; Wang et al., 2020; Zhang et al., 2020). To maximize the production of S protein, the native 5’-UTR was added upstream of Qa-tagged S protein in the pCAG expression vector, which shows 6-fold higher expression than the pcDNA6B vector (FIG. 3C). Addition of 5’-UTR further enhanced S protein production by 20~70-fold as compared with CAG-driven Qa-tagged S protein as determined by fluorescent immunocytochemistry (FIG. 3F) and gdLuc assay (FIG. 3G). A similar enhancement occurs with the CMV-driven Qa-tagged S protein expression system (FIG. 3H). In the LV vector, addition of 5’-UTR enhances the dual reporter Flag-tagged LG and QLG in the absence of viral proteins (FIG. 3I). These data provide evidence that optimization of gene expression components (promoter, UTR) further increases Qa-tagged viral protein expression. Importantly, the native 5’-UTR of SARS-CoV-2 alone dramatically enhances the expression of the targeted gene protein.
Enhancing of SARS-CoV-2 S pseudovirion production. Pseudotyped virus has been widely used for not only gene delivery but also vaccine production, antibody neutralization, cellular entry, and pathogenic mechanisms. Pseudovirion is an excellent alternative for high-risk viruses that require BSL3 facilities for working with live viruses, such as SARS-CoV-2 and its variants (Korber et al., 2020; Muik et al., 2021; Nie et al., 2020; Walls et al., 2020; Weissman et al., 2021; Wibmer et al., 2021a). For example, limited access to BSL3 and ABSL3 facilities has slowed down basic research and vaccine/therapy development for COVID-19. Pseudovirion is the virus-like particle coated with viral surface or membrane proteins that harbor specific cellular tropism (Kuzmina et al., 2021; Walls et al., 2020; Wibmer et al., 2021a). Virus-like particles pseudotyped with S protein will have better immune responses than individual viral proteins due to similarity of three-dimensional structure to live virus (Kuzmina et al., 2021; Walls et al., 2020; Wibmer et al., 2021a). SARS-CoV-2 S protein has been widely used to generate S pseudovirion but the packaging efficiency for lentivirus-like (LVLP) or VSV-like particles (VSVLP) has been very low in most reports, even when using the codon-optimized C-terminal deletion S protein (Korber et al., 2020; Muik et al., 2021; Ou et al., 2020; Walls et al., 2020). Given that Qa tagging enhances S protein production in mammalian cells, it was speculated that Qa could enhance the packaging efficiency of S pseudotyped LVLP (S-LVLP). Using the C-terminal 18-amino-acid-deleted codon-optimized SARS-CoV-2 S protein (Sd18) as the test platform (FIG. 4A), which is widely used for S pseudovirus studies, it was validated that Qa addition on the C-terminus of Sd18 (Sd18Q) enhanced Sd18 expression, which was further increased by inclusion of 5’-UTR, as determined by Western blot analysis (FIG. 4B). It was found that Qa tagging increased the S-LVLP packaging efficiency by ~2-4-fold using a standard pRRL-GFP LV reporter transfer vector, as determined by microscopy (FIG. 4C) and flow cytometry (FIG. 4D) in HEK-hACE2 cells. Addition of 5’-UTR further increased the packaging efficiency by ~4-10-fold (FIGS. 4C, 4D). Like regular VSV-G-pseudotyped LV packaging, polybrene treatment increased the titer of S-LVLP (FIG. 4E) and simplified high-speed sucrose concentration/purification retained the transduction capability (FIG. 4F). To provide dynamic measurement of S-pseudovirion transduction, the packaging efficiency was tested for the dual reporter LV vectors pRRL-LG and pRRL-E-LG, which can harbor larger inserts than the GFP insert alone (FIGS. 4G, 4H). As expected, the original Sd18 had very low packaging efficiency for pRRL-LG and pRRL-E-LG. However, Qa addition significantly enhanced their transduction efficiency, and 5’-UTR additions further enhanced the transduction efficiency as determined by fluorescent microscopy and gdLuc assay (FIGS. 4G, 4H).
These data provide evidence that both Qa and 5’-UTR additions on the Sd18 expression system significantly enhanced the packaging and transduction efficiency of SARS-CoV-2 S-LVLP.
Enhancing of DNA and mRNA vaccine production. One significant immediate usage of Qa tagging could be for enhancing vaccine production for the urgent need to fight COVID-19. The most promising vaccines against SARS-CoV-2 or its variants are derived from the mRNA or DNA encoding S protein or other SARS-CoV-2 viral proteins. Taking S protein as an example, Qa tagging increased the S protein expression by 4~27-fold in the CMV promoter-driven cDNA expression vector (FIG. 1H) and an additional 6-fold increase occurs when the CAG promoter is utilized (FIG. 3C). Inclusion of 5’-UTR in the CAG-driven cDNA expression vector further increased S protein expression by ~20-70-fold (FIG. 3G). Therefore, the Qa tagging plus 5’-UTR regulation for S protein-encoding DNA vaccine enhanced the vaccine production by at least 200-fold (FIG. 3H). Such an enhancement of DNA vaccine production on a large scale will robustly reduce the cost and expedite the availability of COVID-19 vaccines. Since mRNA vaccine exhibits numerous advantages over other vaccines and COVID-19 S protein mRNA vaccine has been well established for extensive human application during the pandemic, it was hypothesized that the Qa tagging plus 5’-UTR would enhance the mRNA-dependent translation, leading to increased expression of viral proteins such as S protein for vaccine production. To test this, in vitro transcription was performed to generate capped mRNA with Qa tag and it was examined whether Qa tagging can affect the translation of viral proteins after mRNA transfection in HEK293T cells. As shown in FIG. 5A, the presence of Qa tag significantly increased the production of viral protein S from the transfected functional mRNAs in a time-dependent and dose-dependent manner. Such enhancing is universally applicable to the mRNAs of other viral proteins N, E, and ORF3 as well as the host cellular gene ACE2 (FIGS. 5B, 5C). Addition of 5’-UTR significantly increased the mRNA-dependent translation of Qa-tagged viral S protein (FIG. 5D) as well as viral E protein and cellular ACE2 (FIG. 5E), consistent with the cDNA expression vector (FIGS. 3D-3I). These data provide evidence that the Qa tagging and 5’-UTR inclusion enhance the translational efficiency leading to an increase in the production of the reporter protein in all the tested targets, and the extent relies on different targeted proteins.
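As a rough consistency check on the fold values quoted for S protein, the individual ranges can be combined arithmetically. The sketch below assumes, purely for illustration, that the three effects (Qa tagging, CAG versus CMV promoter, and 5’-UTR inclusion) act independently and multiplicatively; the text does not state this, and the function name `combine_fold_ranges` is hypothetical.

```python
def combine_fold_ranges(*ranges):
    """Multiply (low, high) fold-change ranges, assuming independent effects."""
    low, high = 1.0, 1.0
    for range_low, range_high in ranges:
        low *= range_low
        high *= range_high
    return low, high


# Fold ranges as quoted in the text for S protein.
qa_tag_cmv = (4, 27)      # Qa tagging in the CMV-driven vector
cag_vs_cmv = (6, 6)       # additional increase with the CAG promoter
five_prime_utr = (20, 70) # inclusion of the 5'-UTR in the CAG-driven vector

low, high = combine_fold_ranges(qa_tag_cmv, cag_vs_cmv, five_prime_utr)
print(f"combined range under the multiplicative assumption: {low:.0f}- to {high:.0f}-fold")
```

Under this assumption the lower bound of the combined range exceeds the "at least 200-fold" figure stated in the text, so the stated figure is conservative relative to a simple product of the individual ranges.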
To further determine if Qa tagging regulated mRNA-dependent translation, the dynamic changes of translational products were measured after transcriptional inhibition with actinomycin D. It was found that treatment with actinomycin D completely blocked the production of viral protein S (FIG. 5F) and ORF3 (FIG. 5G) without Qa tagging as determined by the gdLuc activity. However, Qa tagging increased the protein expression/production during transcriptional inhibition, which accumulated in a time-dependent manner (FIGS. 5F, 5G). These data provide evidence that Qa tagging on the targeted protein facilitates the protein expression through posttranscriptional regulation (increased translation efficiency and/or mRNA stability). To further determine if Qa tagging influenced mRNA stability of the targeted genes, mRNA decay assays were performed using S and E viral proteins as examples. Although the time courses of S and E viral mRNAs exhibited different patterns, Qa tagging on both S (FIG. 5H) and E (FIG. 5I) viral proteins increased the mRNA half-life by around 6-7 h.
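One common way to obtain a half-life from an mRNA decay time course is to fit a first-order decay model. The sketch below is an assumption about how such a value could be computed and is not the analysis method of the present disclosure, which is not specified; the function name `estimate_half_life` and the synthetic time course are hypothetical.

```python
import math


def estimate_half_life(times_h, remaining_fraction):
    """Estimate half-life (hours) assuming first-order decay.

    Fits ln(fraction) = intercept - k * t by ordinary least squares
    and returns ln(2) / k.
    """
    logs = [math.log(f) for f in remaining_fraction]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, logs))
    slope = sxy / sxx
    if slope >= 0:
        raise ValueError("no decay detected in the time course")
    return math.log(2) / -slope


if __name__ == "__main__":
    # Synthetic data generated with a 4 h half-life, for illustration only.
    times = [0, 2, 4, 8]
    fractions = [0.5 ** (t / 4) for t in times]
    print(f"estimated half-life: {estimate_half_life(times, fractions):.2f} h")
```

Comparing the estimate for a tagged transcript with that for the untagged control would give the change in half-life; real decay curves that deviate from first-order kinetics would need a different model.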
Taken together, Qa tagging and native 5’-UTR inclusion on a target mRNA significantly increased mRNA stability and translational efficiency and thus enhanced the protein expression/production of the targeted mRNA (e.g., S protein mRNA vaccine). Such enhancing not only reduces the cost but also stimulates vaccine response due to a higher level of S protein expression/release from vaccinated target cells.
Enhancing of antibody production. Therapeutics based on effective monoclonal antibodies (mAb) require optimization of antibody production in a suitable cell culture platform, which relies on high performance expression vectors. Various genetic elements in monoclonal antibody production vectors have been widely modified. To determine if the novel Qa tagging would enhance antibody production, the human anti-SARS-CoV monoclonal antibody (Bei, CR3022) was used as a test platform. The Qa tag was cloned into the C-terminus of the immunoglobulin heavy and light chain (H/L) of CR3022, which contains variable regions of heavy and light chains derived from human anti-SARS-CoV mAb (GenBank: DQ168569 and DQ168570, respectively), to generate Qa-tagged HQ and LQ (FIG. 6A). The HQ and LQ were co-transfected into HEK293T cells to generate Qa-tagged monoclonal antibody, using the original H and L vectors (NR52399 and NR52400) as a control. The supernatants containing the monoclonal antibody were collected at 2-3 days after transfection and their levels were measured using sandwich ELISA with SARS-CoV-2 S protein as the coating antigen (FIGS. 6B, 6C). It was found that Qa tagging enhanced the antibody production by up to 37-fold with or without the normalization for transfection efficiency (FIG. 6D). The enhancing efficiency varied with the experimental conditions (cell density, transfection efficiency, and ELISA variations), with an average of 13-fold (FIG. 6E). Western blot analysis of the supernatant validated Qa enhancing of the antibody production (FIG. 6F). These data provide evidence that Qa tagging induces a robust enhancing of antibody production (secretion).
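The fold enhancement with or without normalization for transfection efficiency can be expressed as a simple ratio. The sketch below is illustrative only: the present disclosure does not specify how normalization was performed, so dividing each background-subtracted signal by a transfection efficiency is an assumption, and the function name `fold_enhancement` and all numerical values are hypothetical.

```python
def fold_enhancement(tagged_signal, control_signal,
                     tagged_efficiency=1.0, control_efficiency=1.0,
                     background=0.0):
    """Ratio of tagged to control signal after background subtraction.

    Each signal is divided by its transfection efficiency (a fraction in
    (0, 1]); leaving both efficiencies at 1.0 gives the unnormalized ratio.
    """
    tagged = (tagged_signal - background) / tagged_efficiency
    control = (control_signal - background) / control_efficiency
    if control <= 0:
        raise ValueError("control signal must exceed background")
    return tagged / control


if __name__ == "__main__":
    # Hypothetical ELISA absorbance readings, not data from the disclosure.
    raw = fold_enhancement(3.8, 0.3, background=0.1)
    normalized = fold_enhancement(3.8, 0.3, tagged_efficiency=0.8,
                                  control_efficiency=0.9, background=0.1)
    print(f"unnormalized: {raw:.1f}-fold, normalized: {normalized:.1f}-fold")
```

In practice, absorbance readings would first be converted to concentrations using a standard curve within the linear range of the assay before ratios are taken.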
Enhancing of lentivirus production. Viral gene therapy has been extensively studied and actively applied to clinical diseases. AAV and LV are the most promising strategies for viral gene therapy. However, viral packaging efficiency (production yield) has been a bottleneck for both AAV and LV gene therapy. In the field of CRISPR/Cas genome editing, viral packaging efficiency is also a rate-limiting factor in developing genome editing and therapeutics. Generally, the level of mRNA from the LV transfer vector could affect the LV packaging efficiency. It was hypothesized that Qa tagging in the LV transfer vector would enhance the efficiency of LV packaging and gene delivery if Qa tagging increases the mRNA level of the transgene during the packaging. To test this, the LV transfer vectors pRRL-E-LG and pRRL-E-QLG were compared for standard LV packaging (psPAX2 and VSV-G). After LV infection of HEK293T cells, Qa tagging increased the production of the transgene reporter gdLuc from the transfer vector (FIG. 6G), similar to the enhancing efficiency in the transfected cells without LV packaging (FIG. 2H). However, Qa tagging on the transfer vector had only a marginal effect on the packaging efficiency, i.e., the titer of packaged LV (data not shown). Similar changes were seen with LV-spCas9-Q-RFP and LV-MS2-spCas9-Q-GFP (FIGS. 6H, 6I), where the packaging efficiency is usually >100-fold less than the standard LV-RFP or LV-GFP. These data provide evidence that the marginal change in the mRNA of the transgene in the transfer LV vector by the Qa tagging did not increase the packaging efficiency, although the production of transgene protein in the transduced cells is enhanced by the Qa tagging (FIG. 6G). This is consistent with the finding that Qa tagging influences translation instead of transcription (FIGS. 5A-5I). It was then tested if Qa tagging on the LV packaging proteins, such as Gag, Pol, and RRE, in the packaging vector psPAX2 could enhance the packaging efficiency.
Interestingly, Qa tagging on Gag significantly impaired LV packaging but Qa tagging on Pol and RRE enhanced the LV packaging for pRRL- GFP, pRRL-QLG and LV-MS2-Cas9-GFP; however, the enhance efficiency wass only l~3-fold (FIGS. 6J-6L). Further optimization of LV packaging enhance by Qa tagging is warranted.
Qa tagging enhances secretion of targeted proteins. As demonstrated above, Qa tagging increased the expression of various types of targeted proteins. When Western blot analysis was performed on cell lysates to confirm the enhancing effect of Qa tagging on E dual reporter protein expression, it was unexpectedly found that the E-Flag-gdLuc protein level in the cell lysates was remarkably reduced in the Qa tagging group (FIG. 7A), even though the gdLuc activity in the supernatant was robustly increased by Qa tagging (FIGS. 1A-1F). In the presence of the 5’-UTR, the reduction in CAG-driven E-Flag-gdLuc expression level was more robust in the cell lysate (FIG. 7B). This reduction also occurred with other viral proteins, N and S, as well as host cellular genes such as IFNγ, IL-2, and hACE2 (FIGS. 7C, 7D). Inclusion of the UTR increased the protein expression in the cell lysates (FIGS. 7B, 7D and data not shown).
These unexpected observations prompted the hypothesis that the robust increase in the supernatant gdLuc activity by Qa tagging must involve the protein secretion process. This was supported by the enhanced antibody secretion (FIGS. 6A-6E) as well as the enhanced secretory IFNγ and IL-2 (FIGS. 2D, 2E). To corroborate the enhanced secretion, the protein level of the secretory E-Flag-gdLuc in the supernatant was analyzed using serum-free culture media. As shown in FIG. 7E, the cleaved E-Flag-gdLuc and GFP as well as the uncleaved E-Flag-gdLuc-GFP were detected by Western blot analysis with anti-gdLuc and anti-GFP antibodies in the unconcentrated supernatant (40%) of the Qa tagging E-QLG group. Densitometric quantification detected a 17-fold increase in the secretory protein, which is consistent with the enhancement detected by the gdLuc assay (FIG. 7F). The secretion can be completely blocked by treatment with the ER-Golgi protein trafficking inhibitor Brefeldin A (FIG. 7G). To further confirm the enhancement of secretion by Qa tagging, the non-secretory firefly-luciferase (fLuc) assay was utilized. There was no fLuc activity in the supernatant even in the presence of Qa tagging, but the levels of protein expression and enzyme activity were still significantly increased (FIG. 7H). Altogether, Qa tagging apparently enhanced the expression and secretion of the targeted proteins. We also noted incomplete auto-cleavage by the 2A system in most targeted proteins, with variable cleavage efficiency for different proteins (FIG. 7C).
Discussion
In both published literature and patents, different types of bioactive peptides have been developed that regulate or enhance the production of targeted proteins (Daliri et al., 2017; Katayama et al., 2021; Peighambardoust et al., 2021). This study presents a novel short peptide (epitope) tag (only 7 amino acids) that enhances targeted protein expression and secretion. Various types of peptide (epitope) tags have been identified previously for protein labeling, tracing, purification, and immunostaining (DeCaprio and Kohl, 2019; Katayama et al., 2021; Lee et al., 2020; Mishra, 2020; Peighambardoust et al., 2021; Pina et al., 2021; Traenkle et al., 2020). However, no peptide tag has been identified for the regulation or enhancement of targeted protein production, including expression and secretion. This Qa tag would serve as a universal enhancer for protein production. So far, all the target proteins tested in this study have shown enhanced protein production, with some proteins having up to thousand-fold increases. Extensive testing on many other target proteins is warranted in the context of research interests and potential applications. To the inventor’s knowledge, this is the first evidence for protein regulation/enhancement by a short peptide (epitope) tag of the kind that traditionally serves for protein labeling/detection and affinity purification. This finding offers a paradigm shift in the context of epitope tagging and protein functional regulation.
Protein and peptide tags have been extensively employed for protein labeling/detection and affinity purification (DeCaprio and Kohl, 2019; Katayama et al., 2021; Lee et al., 2020; Mishra, 2020; Peighambardoust et al., 2021; Pina et al., 2021; Traenkle et al., 2020). The fusion of peptide tags with targeted proteins allows detection by immunostaining and immunoblotting with corresponding highly specific antibodies both in vitro and in vivo. Novel “spaghetti monster” fluorescent protein (smFP) technology with tandem tags dramatically enhances the sensitivity of the tagged protein detection (Viswanathan et al., 2015). Most tags can also be used for protein purification by immunoprecipitation and/or affinity chromatography. Some tags may enhance the yield of protein purification by extending protein half-life or rendering the protein soluble (Bhagawati et al., 2019; Han et al., 2020; Li, 2011; Saribas et al., 2018). In some cases, tagging may influence the activity or function of the targeted proteins (Majorek et al., 2014). For example, N-terminal tagging on PI3KCA increases kinase activity while C-terminal tagging affects membrane binding activity (Vasan et al., 2019). The N-terminal secretory signal peptide of gdLuc not only determines its inherent secretory property but also regulates the protein folding and functional activity (Gaur et al., 2017). For a specific protein, the C-terminal or N-terminal amino acid composition could regulate the protein expression (Cambray et al., 2018; Weber et al., 2020). Modification of the C-terminal endoplasmic reticulum targeting peptide on gdLuc significantly improves its intracellular retention (Gaur et al., 2017). Some peptides such as PEST (Shumway et al., 1999) or KFERQ (Dong et al., 2020; Park et al., 2016), fused to or endogenously contained in the target proteins, mark the proteins for proteolysis or degradation. However, there is no evidence that these epitope tags could directly boost protein expression and secretion, particularly in mammalian cells.
In this study, a novel epitope tag, Qa, was discovered that enhanced the expression and secretion of the tagged SARS-CoV-2 viral proteins and many non-viral proteins. This conclusion is supported by the findings that 1) the Qa tag robustly enhances the production of secretory gdLuc fusion protein for several viral proteins and host cellular gene products, as determined by dual reporter assay and fluorescent microscopy (FIGS. 1A-1F, 2A-2G); 2) Qa tag enhancement is promoter-independent (FIGS. 3A-3I, 5A-5I); 3) Qa significantly enhances mRNA-dependent production of targeted viral and non-viral protein fusion reporters, as determined by in vitro RNA transcription, mRNA transfection, and dual reporter assay (FIGS. 5A-5I); 4) Qa tagging retained its time-dependent enhancement in the presence of transcriptional inhibition, indicating a posttranscriptional mechanism (increased translation efficiency and/or mRNA stability); 5) Qa tagging on S protein enhanced the production yield of S pseudoviruses (FIGS. 4A-4H); 6) Qa tagging on antibody heavy and light chains robustly increased the antibody production; 7) Qa tagging on the LV packaging vector enhanced the packaging efficiency; and 8) Qa tagging enhanced the protein secretion process, as shown by Western blot and gdLuc analysis on gdLuc fusion proteins with or without brefeldin A treatment, by antibody production, and by the secreted forms of IFNγ and IL-2.
Although the mechanisms underlying the enhancement of protein expression/secretion remain to be delineated, the studies herein, using the classical measure of mRNA stability via global transcription inhibition, identified posttranscriptional mechanisms such as increased mRNA stability and translation efficiency. Novel measurements of mRNA decay (Chan et al., 2018) are needed to expand these preliminary observations. The regulation of mRNA involves a dynamic balance between the synthesis and degradation processes. The synthesis process is well understood; however, less is known about mRNA decay (Chan et al., 2018). Qa tagging may serve as a novel mechanism to regulate mRNA stability. How the Qa tag regulates mRNA stability and translation initiation/elongation would be an interesting and important direction to explore. For example, it is important to know whether the RNA sequence encoding the Qa peptide has secondary structure that may directly regulate the mRNA stability of the targeted protein (Boo and Kim, 2020). It is also of interest to determine whether synonymous substitution within the Qa-encoding sequence influences the expression/production of tagged proteins. Whether the amino acid sequence of the Qa tag directly binds to poly-A or the 3’-UTR, and which residues contribute to mRNA stabilization and translation enhancement, needs to be determined. With respect to protein secretion, the Qa tag does not function like a secretion peptide, because Qa tagging on a non-secretory protein, i.e. firefly-luciferase, does not change the background luciferase activity in the culture media, which likely exists due to partial cell death. However, Qa tagging on secretory proteins such as S protein, antibody, IFNγ and IL-2 robustly enhanced their production yields. This is very important for the industrial application of these secretory proteins. In particular, S protein for mRNA vaccine would be released more from the vaccinated cells in the presence of the Qa tag, which not only reduces the mRNA amount needed for each vaccination but also promotes the immune response due to a higher level of secretory S protein. Given that brefeldin A is well known to inhibit ER-Golgi trafficking and completely blocks Qa-stimulated protein secretion, we speculate that the Qa tag could regulate protein retrograde or anterograde trafficking. Other secretion inhibitors could be used to identify additional pathways for protein secretion. Whether the Qa tag influences unconventionally secreted proteins remains to be determined (Cohen et al., 2020).
The UTRs at both ends of a viral genome or host cellular mRNA are important in regulating the transcription and translation efficiency (Berkhout et al., 2011; Hinnebusch et al., 2016; Raman and Brian, 2005; Senanayake and Brian, 1999; Williams et al., 1999). In particular, the 5’-UTR of coronaviruses regulates translational rate via ribosomal scanning (Berkhout et al., 2011; Hinnebusch et al., 2016; Shirokikh et al., 2019; Zhang et al., 2015). A synthetic (non-viral) 5’-UTR has been used to enhance the translation of SARS-CoV-2 S mRNA in both the Pfizer and Moderna vaccines. The native UTRs of SARS-CoV-2 are highly conserved and play a key role in viral RNA replication and transcription of the genomic and subgenomic viral transcripts (Baldassarre et al., 2020; Yang and Leibowitz, 2015). Thus, the native 5’-UTR is assumed to enhance accumulation of viral protein. In this study, solid evidence is provided that the native (natural) 5’- and 3’-UTRs of SARS-CoV-2 enhanced the production of viral E-LG fusion protein. Importantly, the native 5’-UTR served as a universal regulator in enhancing not only viral proteins but also many non-viral cellular proteins. It was hypothesized that this potent UTR could be used in enhancing any protein, particularly for virus packaging systems. For example, it was observed that UTR-Sdl8Q increased the packaging efficiency of S-pseudotyped LVLP or VSVLP. The UTR in the viral transfer vector enhanced the lentivirus production. The native UTR would also enhance the AAV packaging and transduction efficiency.
This study identified the combination of Qa tagging and the SARS-CoV-2 native UTR as a novel strategy to enhance the production of any targeted gene/protein of interest. For industry applications, this strategy will reduce the cost of many widely used products and facilitate their availability. Since it enhanced the production of all tested viral proteins of SARS-CoV-2, an immediate use of this method would be the enhancement of vaccine production for the urgent need to fight COVID-19. The studies herein demonstrated at least a 200-fold enhancement in the efficiency of the S mRNA vaccine. This is extremely important for expediting mRNA vaccine availability when producing new mRNA vaccines against SARS-CoV-2 variants or any other emerging viruses. This strategy can be easily incorporated into the DNA vaccine vector. Thus, enhancing vaccine production yield on a large scale will robustly reduce the cost and expedite the availability of COVID-19 vaccine. Another immediate industry value of the methods herein is to enhance antibody production. Taking human anti-SARS-CoV monoclonal antibody as an example, it was found that Qa tagging at the C-terminus of the immunoglobulin heavy and light chain variable regions robustly enhanced the antibody secretion by up to 37-fold (average 13-fold). Given that Qa tagging in the middle of a targeted protein shows much stronger enhancing efficiency, optimization of Qa tagging in different regions of the targeted antibody heavy and light chains is expected to achieve higher levels of antibody production enhancement.
Enhancing the production yield of viruses or pseudotyped viruses is also invaluable in the fields of gene therapy and biomedical research. Pseudotyped viruses have facilitated research on high-risk viruses that require BSL3 facilities. Pseudoviruses of SARS-CoV-2 S protein or its variants have been extensively utilized for evaluation of neutralizing antibodies and vaccination as well as for mechanistic and functional studies (Donofrio et al., 2021; Korber et al., 2020; Muik et al., 2021; Ou et al., 2020; Wibmer et al., 2021b). The bottleneck for generation of S pseudovirions is the limited packaging efficiency for LVLP or VSV-like particles (Korber et al., 2020; Muik et al., 2021; Ou et al., 2020; Walls et al., 2020). The method used herein to combine Qa tagging and the native 5’-UTR on the Sdl8 expression system robustly enhanced the packaging and transduction efficiency of SARS-CoV-2 S-LVLP. This strategy has facilitated current research on the antiviral effect of EGCG and the protective efficiency of vaccinated serum from patients against the emerging SARS-CoV-2 variants (Liu et al., 2021a; Liu et al., 2021b). One of the challenges for viral gene therapy is the limited viral packaging efficiency (production yield). Using an LV system as a test platform, it was found that Qa tagging in the LV transfer vector had only a marginal effect on the packaging efficiency, although the production of transgene protein in the transduced cells or the transfected packaging cells was enhanced. This is expected because Qa tagging influences translation instead of transcription of targeted genes, while LV packaging needs the presence of intermediate RNA from the transfer vector. Qa tagging at the C-terminus of Pol and RRE in the packaging vector psPAX2 increased the LV packaging efficiency, but Qa tagging at the Gag C-terminus impaired LV packaging. Thus, optimization of the Qa tagging location in the LV packaging proteins is essential for maximization of the Qa enhancing efficiency. Given that Sdl8 Qa tagging enhanced Sdl8 expression and the packaging efficiency of S-LVLP, Qa tagging on VSV-G protein could enhance regular LV packaging efficiency. Qa insertion at different locations of VSV-G (Lorenz et al., 2014; Schlehuber and Rose, 2004) may maximize the enhancing efficiency. Like LV packaging, Qa tagging on the AAV packaging system warrants further optimization.
In recombinant protein production systems, Qa tagging would increase the yield of expressed proteins such as insulin, interferon, interleukin, cytokines, and growth factors. Even an increase of only a few fold would reduce production expenses and expedite clinical applications. For in vivo gene enhancement, Qa tagging via a novel CRISPR/Cas gene knockin strategy could be used to facilitate the expression of loss-of-function genes, particularly in haplo-insufficient mutagenic diseases such as Angelman syndrome, Pitt-Hopkins syndrome, and others. For genetic engineering, Qa tagging enhancement of dominant genes may improve the phenotype of organisms, particularly in agricultural applications. Finally, this novel Qa tag can be used as a general tag in a similar way to other peptide tags such as Flag, Myc, HA, Ollas, C7, and T7 for protein tracing, protein purification, immunostaining, and Western blotting. Importantly, Qa tagging can enhance the labeling intensity of endogenous proteins due to its enhancing property. This is very important for neural network tracing.
When performing Western blot analysis, incomplete cleavage of several targeted proteins via the auto-cleaving 2A system was observed. This is in accord with previous reports on 2A cleavage insufficiency, which varies with different types of 2A peptide and different targeted proteins (Chng et al., 2015; Kim et al., 2011). Addition of a peptide (APVKQLL) to F2A increases the cleavage efficiency (Groot Bramel-Verheije et al., 2000). Whether Qa tagging affects the function of the 2A system remains to be determined. The 2A system functions mainly inside the cells but may have relatively low activity outside the cells, because the ratio of cleaved over non-cleaved bands in the cell lysates is apparently higher than that in the cell culture media (FIGS. 7A-7H).
Based on numerous previous studies on epitope tags such as Flag, HA, Myc, Ollas, C7 and T7 both in vitro and in vivo, it is anticipated that the smaller Qa tag (7 amino acids) should not have any toxicity.
In summary, this study reports a novel peptide tag consisting of a specified short amino acid sequence (7 amino acids) that can be utilized for enhancing production of the tagged proteins, including viral transcripts/proteins, endogenous gene products, vaccines, antibodies, and engineered recombinant proteins, in a cell in vitro, ex vivo, and in vivo. This novel and universal peptide tag would facilitate protein expression and secretion. It would be invaluable to perform library screening based on this master Qa tag to discover optimal peptides that maximize protein expression/production/secretion. This study also reports the exceptionally potent efficiency of the SARS-CoV-2 native 5’-UTR in boosting protein expression/production. Combining Qa tagging with the native 5’-UTR offers a synergistic boost to the production of viral and non-viral proteins. All these strategies are invaluable in biopharmaceutical development, the immunological/vaccine industry, and biological therapeutics.
Example 2: Experimental model and subject details
Cell lines:
HEK293T, HeLa and BHK cell lines were cultured according to standard protocols.
Method Details
Vector cloning
All the PCR reactions for cloning in this study were performed using Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher, F531) and purified using the Monarch PCR & DNA Cleanup Kit (NEB, T1030S). The correct clones were verified by restriction enzyme digestion and Sanger sequencing as well as functional measures.
Dual reporter vectors: The dual reporter LG fragment, encoding Gaussia-Dura luciferase (gdLuc) and destabilized GFP (dsGFP), was generated by overlay PCR: 1) standard PCR was performed to generate fragment 1 (gdLuc) from template plasmid pMCS-Gaussia-Dura-Luciferase (Thermo Fisher Scientific, Cat#16190) with primer pair T1290/T1291, and fragment 2 (dsGFP) from plasmid pLenti-EFS-EGFPd2PEST-2A-MCS-Hygro (TP1380), a gift from Neville Sanjana (Addgene Cat# 138152), with T1292/T1293; 2) the two purified fragments (100 ng each), which overlap by 19 nucleotides, were mixed for 8 cycles of PCR; 3) the PCR product at 1:100 dilution was used as template for 28 cycles of standard PCR with primer pairs T1292/T1293 to generate the LG fragment. After purification with NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel, Cat# REF740609), this LG fragment (1485 bp) was cloned into the pcDNA6B-nCoV-x-Flag vector encoding various viral proteins of SARS-CoV-2 or cellular gene hACE2 as listed in the Key Resource Table (Zhang et al., 2020) via the SacW cloning site using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520S) to generate the pcDNA6B-SARS-CoV-2-x-Flag-LG vectors listed in the Table. The “x” indicates the gene of interest.
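The joining step of the overlay (overlap-extension) PCR described above can be illustrated with a minimal sketch. It assumes only that the 3' end of fragment 1 and the 5' end of fragment 2 share an identical 19-nucleotide stretch; the sequences below are hypothetical toy sequences, not the actual gdLuc or dsGFP sequences, and the function name is introduced here for illustration only.

```python
def join_by_overlap(frag1: str, frag2: str, overlap_len: int = 19) -> str:
    """Join two fragments whose ends share an identical overlap.

    Models the product of overlap-extension PCR: the shared region
    appears once in the joined product.
    """
    tail = frag1[-overlap_len:]
    head = frag2[:overlap_len]
    if len(tail) < overlap_len or tail.upper() != head.upper():
        raise ValueError("fragments do not share the expected overlap")
    return frag1 + frag2[overlap_len:]


# Hypothetical toy sequences (NOT the real gdLuc/dsGFP sequences).
overlap = "GGTGGCGGATCCGGAGGTA"                    # 19 nt shared region
fragment_1 = "ATGGGAGTCAAAGTTCTGTTT" + overlap     # stands in for gdLuc
fragment_2 = overlap + "GTGAGCAAGGGCGAGGAGTAA"     # stands in for dsGFP

product = join_by_overlap(fragment_1, fragment_2)
print(len(fragment_1), len(fragment_2), len(product))
```

The expected product length is the sum of the two fragment lengths minus the overlap, which is a quick sanity check on the band size of a joined fragment.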
Unexpectedly, functional assay and Sanger sequencing identified a novel clone, designated pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479), which has a Qa peptide in the open reading frame before LG, designated QLG. The insert fragment encoding SARS-CoV-2 S protein from the pcDNA6B-nCoV-S-Flag vector (TP1456) was cloned into TP1479 via XhoI/XbaI sites to generate pcDNA6B-SARS-CoV-2-S-Flag-QLG (TP1487). The insert fragment encoding SARS-CoV2 N protein from the pcDNA6B-nCoV-N-Flag vector (TP1431) or hACE2 from the pcDNA6B-hACE2-Flag vector (TP1470) was cloned into TP1479 via KpnI/XbaI sites to generate pcDNA6B-SARS-CoV2-N-Flag-QLG (TP1490) or pcDNA6B-hACE2-Flag-QLG (TP1491).
The pcDNA6B-NIBP-Flag-LG (TP1560) vector was generated by NEB-HiFi cloning of the NIBP PCR product from pYX-Asc-mNIBP (Genbank # BC070463) into pcDNA6B-hACE2-Flag-LG (TP1540) via NotI/XbaI, while the pcDNA6B-NIBP-Flag-QLG (TP1558) was generated by NEB-HiFi cloning of the NIBP PCR product into pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479) via XhoI/XbaI.
The pCAG vectors encoding E, S and NSP16 were generated by replacing the CMV promoter in the corresponding pcDNA6B-SARS-CoV2-x-Flag-LG or -QLG vectors with the CAG promoter via SnaBI/KpnI sites.
UTR-containing vectors: The DNA fragment containing 5’-UTR-E-Flag-Qa-3’-UTR, designed according to the public SARS-CoV-2 sequence, was synthesized by Synbio Technologies and cloned into the pCAG-Flag vector via EcoRV/Age sites using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520). This vector, pCAG-UTR-E-Flag-Qa-UTR (TP1583), was digested with SnaBI/EcoRV (both blunt end) to remove the CAG promoter, and re-ligation generated the pUTR-E-Flag-Qa-UTR vector (TP1585). The 3’-UTR of pCAG-UTR-E-Flag-Qa-UTR was removed by NotI digestion and ligation to generate pCAG-UTR-E-Flag-Qa (TP1584), with an additional 37 amino acids in the open reading frame. The pCAG-UTR-S-Flag-QLG vector (TP1586) was generated by replacing the E-Flag-Qa-UTR fragment with the S-Flag-QLG fragment from the pCAG-S-Flag-QLG vector (TP1518) via XhoI/AgeI sites. The pCAG-UTR-Sdl8-Q (TP1595) was generated by NEB HiFi cloning via EcoRI sites of the pCAG-Sdl8-Q vector (TP1506) with the 5’-UTR PCR product from the pUTR-E-Flag-UTR vector (TP1585). UTR-containing pcDNA6B vectors were generated by restriction cloning via KpnI/XhoI to transfer the 5’-UTR from pCAG-UTR-E-Flag-Qa-UTR into the corresponding pcDNA6B vectors such as pcDNA6B-S-QLG (TP1487), pcDNA6B-E-Flag-QLG (TP1479), and pcDNA6B-ORF3-Flag-QLG (TP1483).
Antibody vectors: The CR3022 plasmid set, pFUSEss-CHIg-hG1-SARS-CoV2-mAb (NR-52399, TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV2-mAb (NR-52400, TP1566), expressing the heavy (H) and light (L) chains of human anti-SARS-CoV mAb respectively (GenBank: DQ168569 and DQ168570), was produced under HHSN272201400008C and obtained from BEI Resources, NIAID, NIH (Cat# NR-53260). The Qa-tagged HQ (TP1574) and LQ (TP1571) vectors were generated from the H or L plasmids at the Nhe site using NEBuilder® HiFi DNA Assembly cloning kit with synthesized oligonucleotides that contain the Qa-encoding sequence and the C-terminus of the immunoglobulin heavy and light chain (see Table S1 for sequences).
Lentiviral vector: The pRRLSIN.cPPT.PGK-GFP.WPRE (TP792), a gift from Didier Trono (Addgene #12252), was used to generate pRRL-E-Flag-LG-GFP (TP1577) by transferring the E-Flag-LG insert from TP1478 to TP792 via BamHI/AgeI. The pRRL-E-Flag-LG (TP1578) was generated from TP1577 by AgeI/KpnI blunt ligation. The pRRL-E-Flag-QLG was generated by transferring E-Flag-QLG from TP1479 to TP1578 via BamHI/BstBI. TP1578 and TP1579 were used as the backbone vectors for NEB-HiFi cloning of human IFNγ and IL2 PCR products via the XbaI site to generate pRRL-IFNγ-LG (TP1604) or QLG (TP1605) and pRRL-IL2-LG (TP1606) or QLG (TP1607). The PCR fragments of IFNγ and IL2 were derived, respectively, from pUC8-IFNγ (a gift from Howard Young, Addgene #17600) and pAIP-hIL2-co (a gift from Jeremy Luban, Addgene #90513) using primer pairs T1407/T1408 and T1409/T1410 as listed in Table S1. The pRRL-UTR-Flag-LG (TP1621) and pRRL-UTR-Flag-QLG (TP1622) were generated respectively by NEB-HiFi cloning of 5’-UTR PCR products from TP1583 into TP1578 and TP1579 via XbaI. The pRRL-Flag-LG (TP1685) and pRRL-Flag-QLG (TP1686) were generated respectively from TP1621 and TP1622 via BsmBI/XbaI digestion to remove the UTR and NEB HiFi cloning with an oligonucleotide insert (T1469) to correct the ATG site in the ORF.
The LV packaging vector psPAX2-Gag-Q (TP1618) was generated from psPAX2 (TP592, a gift from Didier Trono, Addgene #12260) via Sph/EcoRV sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template. The psPAX2-Pol-Q-RRE-Q (TP1619) was generated from psPAX2 via SwaI/NheI sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1400/T1401 and T1402/T1403 using psPAX2 as PCR template. The psPAX2-Gag-Q-Pol-Q-RRE-Q (TP1620) was generated from TP1619 via Sph/EcoRV sites by NEB HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as PCR template.
S-pseudoviral vectors: The vector pCAG-SARS-CoV2-Sdl8Q (TP1506), encoding the human codon-optimized S gene of SARS-CoV2 with a C-terminal 18 amino acid deletion (Sdl8) and Qa tag fusion, was constructed using NEBuilder® HiFi DNA Assembly cloning kit (NEB, E5520S). Briefly, the Sdl8 expression cassette in the CMV-driven vector pcDNA3.1-SARS2-S, a gift from Fang Li (Addgene Cat # 145032), was transferred to a CAG-driven vector pCAG-Flag-SARS-CoV2-S (gift from Peihui Wang) via EcoRV/NotI sites and PCR with primer pairs T1323/T1324. The vector pCAG-SARS-CoV2-Sdl8 (TP1567), encoding Sdl8 without Qa tag, was constructed via NEB HiFi cloning with the synthesized oligonucleotide insert T1367 at the SacII/Not site of the pCAG-SARS-CoV2-Sdl8Q vector. The pCAG-UTR-Sdl8Q (TP1595) vector was generated as described above.
Plasmid DNA purification and DNA quantification
Plasmid DNAs were purified using commercial kits for endotoxin-free miniprep (Cat# REF 740490) or midiprep (Cat# REF 740420) from Macherey-Nagel (Germany). The E. coli bacterial cultures (5 ml for miniprep, 200 ml for midiprep) harboring the relevant plasmids were grown overnight in LB or 2YT media supplemented with 100 µg/ml Carbenicillin, at 30°C for NEB-stable or 37°C for DH5alpha E. coli cells. The bacterial cultures were harvested by centrifugation, and the pellets were processed to purify plasmid DNA according to the manufacturer’s guidelines. The final DNA was dissolved in ultra-pure distilled water, and DNA concentrations were determined using either a Nanodrop 1 UV-Vis Spectrophotometer (Thermo-Fisher) or a Take3 plate in a Bio-Tek multiplate reader.
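For orientation, UV-based DNA quantification of the kind reported by such spectrophotometers rests on the standard conversion that an A260 of 1.0 (1 cm path length) corresponds to roughly 50 ng/µl of double-stranded DNA. The sketch below shows that arithmetic only; the function names, example readings and elution volume are hypothetical and are not taken from this study.

```python
def dsdna_conc_ng_per_ul(a260: float, dilution_factor: float = 1.0,
                         path_cm: float = 1.0) -> float:
    """Estimate dsDNA concentration from absorbance at 260 nm.

    Uses the standard factor of 50 ng/ul per A260 unit at a 1 cm path.
    """
    return a260 / path_cm * 50.0 * dilution_factor


def total_yield_ug(conc_ng_per_ul: float, volume_ul: float) -> float:
    """Total DNA mass (ug) in a given elution volume."""
    return conc_ng_per_ul * volume_ul / 1000.0


# Hypothetical reading: A260 = 0.5 on an undiluted sample, 50 ul elution.
conc = dsdna_conc_ng_per_ul(0.5)
print(conc, total_yield_ug(conc, 50.0))
```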
Cell culture and Transfections
HEK293T human fetal kidney and HeLa human cervix epithelial cells were obtained from ATCC (http://www.atcc.org). Both cell lines were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM, Gibco) supplemented with Fetal Bovine Serum (FBS) and 1% Penicillin/Streptomycin antibiotic (Corning). BHK-21/WI-2 cells (EH1011, Kerafast, Boston, MA, USA) were grown in DMEM supplemented with 5% FBS and 1% Penicillin/Streptomycin. All cells were incubated in a 37°C incubator under a 5% CO2 atmosphere.
For most experiments, a 96-well plate was used. For mRNA stability, a 24-well plate was used. Cells resuspended in DMEM plus 10% FBS were seeded (3-4x10^4 cells/well for a 96-well plate or 1-2x10^5 cells/well for a 24-well plate) the night before transfection. For transfections, Transporter 5 transfection reagent (TP5) (Polysciences Cat# 26008) was used at a 1 to 4 ratio of DNA/reagent. Typically, 50-100 ng plasmid DNA per well for a 96-well plate was mixed with 0.2-0.4 µl TP5 in 0.9% NaCl solution and incubated at room temperature for 20 min. The transfection reagent and DNA solution were mixed again and added to each well dropwise. The transfections were incubated at 37°C in 5% CO2 overnight (16-18 h), and the media was then replaced with DMEM plus 10% FBS.
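The reagent amounts above follow from simple proportionality, sketched below. The sketch reads the 1 to 4 DNA/reagent ratio as 1 µg of DNA per 4 µl of TP5, an interpretation that reproduces the stated 0.2-0.4 µl for 50-100 ng of DNA; the function names and the 10% pipetting overage are assumptions added for illustration.

```python
def tp5_volume_ul(dna_ng: float, ul_reagent_per_ug_dna: float = 4.0) -> float:
    """TP5 volume (ul) for a DNA mass (ng), assuming 1 ug DNA : 4 ul reagent."""
    if dna_ng < 0:
        raise ValueError("DNA amount must be non-negative")
    return dna_ng / 1000.0 * ul_reagent_per_ug_dna


def plate_mix(dna_ng_per_well: float, n_wells: int, overage: float = 0.10) -> dict:
    """Scale DNA and reagent for several wells.

    The overage (extra fraction to cover pipetting loss) is a hypothetical
    convenience, not a value taken from the protocol.
    """
    total_dna_ng = dna_ng_per_well * n_wells * (1.0 + overage)
    return {"dna_ng": total_dna_ng, "tp5_ul": tp5_volume_ul(total_dna_ng)}


print(tp5_volume_ul(50), tp5_volume_ul(100))
print(plate_mix(100, 4))  # quadruplicate wells at 100 ng each
```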
Multilabeled fluorescent immunocytochemistry and confocal image analysis
Cells were fixed for 30 min with 4% paraformaldehyde (PFA), washed with 1x PBS, permeabilized with 0.5% TritonX-100/1x phosphate buffered saline (PBS) for 30 minutes, blocked with 10% donkey serum for 1 hour and incubated with mouse anti-Flag monoclonal or anti-2A primary antibodies (Table 1) in 0.1% TritonX-100/1x PBS overnight at 4°C. The next day, cells were washed with 1x PBS and incubated with the corresponding Alexa Fluor secondary antibodies (Jackson ImmunoResearch Labs; donkey anti-rabbit, anti-mouse, IgG (H+L) 488, 594, or 680) at a 1:400 dilution for 1 hour at room temperature, using Hoechst 33258 (1:5000) as a nuclear counterstain. Fluorescent confocal images were acquired and analyzed using the Leica SP8 confocal system.
Table 1
Flow Cytometry
Cells expressing the dsGFP reporter were dissociated with Accutase (Corning), passed through a 70 µm nylon cell strainer (Corning) to remove large clumps, and washed with 1x PBS. Dissociated cells were fixed with 4% PFA in PBS, and GFP-positive cells were analyzed using a Cytek Aurora flow cytometer.
RNA Extraction and Reverse Transcription Quantitative PCR (RT-qPCR) for mRNA stability assay
HEK293T cells were transfected with the indicated vectors (500 ng/well for a 24-well plate) for 24 h before treatment with the transcriptional inhibitor actinomycin D (10 mM) for various periods. Total RNA was extracted using Monarch Total RNA Miniprep Kit (NEB, Cat# T2010), which includes two steps of DNA removal. An equal amount of RNA (0.5 µg) was used to synthesize cDNA using High Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific, Cat# 4368814) with random hexanucleotide primers. Real-time PCR analysis was carried out on a QuantStudio™ 3 System. The mRNA expression levels of the reporter gdLuc luciferase and human β-actin were determined using iTaq Universal SYBR Green Supermix kit (BioRad, Cat# 1725121). The sequences for the gdLuc primers are (forward) 5’-GATTACAAGGATGACGACGATAAG-3’ (SEQ ID NO: 2) (T1364 targeting Flag) and (reverse) 5’-AAGTCTTCGTTGTTCTCGGTGGG-3’ (SEQ ID NO: 3) (T432 targeting gdLuc). The human β-actin primers are (forward) 5’-AAGAGCTATGAGCTGCCTGA-3’ (SEQ ID NO: 4) and (reverse) 5’-TACGGATGTCAACGTCACAC-3’ (SEQ ID NO: 5). Each sample was tested in triplicate. Cycle threshold (Ct) values were obtained graphically for the reporter and β-actin. The difference in Ct values between the reporter and β-actin was expressed as the ΔCt value. The ΔΔCt values were obtained by subtracting the ΔCt values of the control samples from those of the samples at different time points. Relative percentage change in gene expression was calculated as 2^-ΔΔCt. The mRNA decay rate was calculated by non-linear regression curve fitting (one phase decay) using GraphPad Prism 9.1. Three independent experiments were performed.
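The ΔΔCt arithmetic and the decay estimate described above can be sketched as follows. This is not the GraphPad Prism procedure used in the study: in place of the non-linear one-phase decay fit, the sketch substitutes a simpler log-linear least-squares fit that assumes the signal decays to zero (no plateau). All Ct values and time points are hypothetical.

```python
import math


def relative_expression(ct_reporter, ct_actin, ct_reporter_ctrl, ct_actin_ctrl):
    """Relative expression by the 2^-ddCt method."""
    d_ct = ct_reporter - ct_actin                  # dCt of the sample
    d_ct_ctrl = ct_reporter_ctrl - ct_actin_ctrl   # dCt of the control
    dd_ct = d_ct - d_ct_ctrl                       # ddCt
    return 2.0 ** (-dd_ct)


def half_life_log_linear(times_h, remaining):
    """Fit ln(remaining) = ln(Y0) - k*t by least squares.

    Returns (k, half_life). Assumes decay to zero, unlike a one-phase
    decay model with a fitted plateau.
    """
    ys = [math.log(r) for r in remaining]
    n = len(times_h)
    mean_t = sum(times_h) / n
    mean_y = sum(ys) / n
    sxx = sum((t - mean_t) ** 2 for t in times_h)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_h, ys))
    k = -sxy / sxx
    return k, math.log(2) / k


# Hypothetical Ct values: the sample dCt is 2 cycles above the control dCt.
print(relative_expression(22.0, 16.0, 20.0, 16.0))

# Hypothetical time course (hours after actinomycin D, fraction remaining).
k, t_half = half_life_log_linear([0, 2, 4, 8], [1.0, 0.5, 0.25, 0.0625])
print(k, t_half)
```

Comparing the fitted half-lives of the LG and QLG reporter mRNAs is one way such a time course could be summarized.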
Luciferase assays:
For the gdLuc assay, the Coelenterazine (CTZ) substrate (Cat # 3032, Nanolight Technology) was dissolved in 10 ml ultra-sterile distilled water to make the stock solution and kept at -20°C until use. The CTZ stock solution was diluted 10-30 times to make working solutions. Equal amounts of CTZ working solution and cell culture media (25-50 µl) after transfection were mixed in a Corning (CLS3922) white opaque 96-well optiplate, and the luminescence was measured in a BioTek Synergy LX multiplate reader. For the firefly luciferase assay in some experiments, the ONE-Glo Luciferase assay kit (Promega Corp, Cat # E6110) was used. Aliquots of 100 µl substrate solution were mixed with 3-5 µl of cell lysates and the luminescence was measured in a BioTek Synergy LX multiplate reader. Data were presented as relative luciferase activity or fold changes compared with the corresponding group. Experiments were performed at least 3 times, each in quadruplicate.
In vitro transcription and mRNA transfection
For the pcDNA6B vector containing the T7 promoter, the DNA was linearized by AgeI digestion followed by gel purification. For the PCR product, the primers included the T7 promoter (TTAATACGACTCACTATAGGGTGGAATTCTGCAGATATCCAG (SEQ ID NO: 6), T1427), generating a DNA fragment containing the 5’-UTR, the target gene, the LG or QLG dual reporter and a poly(A) tail. PCR was performed using the Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher Scientific, F531). The DNA was purified using a gel extraction kit, and the concentration was determined using a Take3 plate in a Bio-Tek multiplate reader. RNA was synthesized from the purified DNA template using the HiScribe™ T7 ARCA mRNA Kit (New England Biolabs, Cat#E2060), with cotranscriptional capping by the m7G anti-reverse cap analog (ARCA, Cat#1411) and poly(A) tailing. The synthesized RNA was purified using the Monarch RNA cleanup kit (New England Biolabs, Cat#E2040) and quantified with a Take3 plate. Equal amounts of RNA from the LG and QLG groups at different dosages were transfected into HEK293T cells in quadruplicate with Lipofectamine® MessengerMAX mRNA Transfection Reagent (Thermo Fisher Scientific, Cat#LMRNA015) following the manufacturer’s manual. At 4-72 h post-transfection, the culture media containing gdLuc were collected, and the gdLuc assay was performed as above.
VSV-G or S protein-pseudotyped lentivirus packaging and titration
The recombinant lentivirus carrying the indicated lentiviral vector was produced on a small scale using the second-generation LV packaging system according to standard protocols. Briefly, HEK293T cells in one well of a 6-well plate were cotransfected using the TP5 kit with the indicated transfer LV vector (1.4 µg), the packaging vector psPAX2 or its mutants (1 µg) and the VSV-G or Sd18 vector (0.4 µg). At 2-3 days post-transfection, the supernatants containing LV were concentrated and purified with a simplified 10% sucrose purification as described previously. The functional titers of the crude and purified lentivirus were determined by counting GFP-expressing HEK293T cells under fluorescence microscopy at 48 h after infection with serial dilutions of lentivirus. In some cases, flow cytometry or RT-qPCR analysis was used for LV titration. For PCR analysis, cell culture medium was collected from infected cells and centrifuged at 2,000 g for 5 min. The supernatant was subjected to viral lysis to extract viral RNA. One-step RT-qPCR was performed using the qPCR Lentivirus Complete Titration Kit (Applied Biological Materials Inc., Cat No. LV900-S) and the QuantStudio 3 Real-Time PCR System (Applied Biosystems, Cat No. A28567) according to the manufacturers’ protocols. The resulting data were analyzed using QuantStudio Design and Analysis Desktop Software (Applied Biosystems).
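The functional titer obtained from counting GFP-expressing cells can be illustrated with a short Python sketch. It assumes the commonly used relation titer (TU/ml) = GFP-positive cells × dilution factor / inoculum volume; the specification does not state the exact formula, and the function names, counts and volumes below are hypothetical.

```python
def functional_titer(gfp_positive_cells, dilution_factor, inoculum_ml):
    """Transducing units per ml from one well of a serial dilution."""
    if gfp_positive_cells < 0 or dilution_factor <= 0 or inoculum_ml <= 0:
        raise ValueError("counts must be >= 0; dilution and volume must be > 0")
    return gfp_positive_cells * dilution_factor / inoculum_ml


def mean_titer(counts_by_dilution, inoculum_ml):
    """Average the titers estimated from several countable dilutions."""
    titers = [
        functional_titer(count, dilution, inoculum_ml)
        for dilution, count in counts_by_dilution.items()
    ]
    return sum(titers) / len(titers)


# Hypothetical counts: dilution factor -> GFP-positive cells per well.
example_counts = {1e3: 480, 1e4: 52}
titer = mean_titer(example_counts, inoculum_ml=0.1)  # about 5e6 TU/ml
```

Only dilutions in which individual GFP-positive cells can be counted reliably should be included in the average.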
Western Blot Analysis
SDS-polyacrylamide gels (10-12%) were home-made, or Mini-PROTEAN TGX gels (Cat# 4561093, 4561096) were purchased from BioRad. The cell lysates were prepared using a lysis buffer composed of 50 mM Tris-HCl pH 7.0, 150 mM NaCl, 5 mM EDTA and 1% Triton X-100, supplemented with PMSF (100x), aprotinin and leupeptin (200x). Lysates (50 µl) were prepared from each well after collecting the supernatant. The lysates were incubated at 4°C for 20-30 minutes and centrifuged at maximum speed in an Eppendorf centrifuge. The cleared lysates were either denatured immediately for 5 minutes at 98°C in 1x SDS-PAGE loading dye or stored at -80°C until use. Supernatants were stored at 4°C until they were treated with 1x SDS-PAGE loading dye. Denatured aliquots of 10-20 µl cell lysate or 20-30 µl supernatant were loaded onto SDS-polyacrylamide gels. SDS-PAGE was performed in Tris-Glycine/SDS buffers under denaturing and reducing conditions.
The proteins in the polyacrylamide gels were transferred to 0.2-µm nitrocellulose membranes (BioRad supported nitrocellulose (NC) membrane, Cat # 162-0097) using either wet transfer or an iBlot®2 device with iBlot®2 NC mini (IB23002) or regular stacks (IB23001). For wet transfer, the following 1x transfer buffer was used: 25 mM Tris-HCl pH 7.6, 192 mM glycine, 20% methanol. The gels were sandwiched together with NC membranes, and transfers were performed in 1x transfer buffer at 250 mA at 4°C for 1-2 hours.
Dry Western blot transfers were performed in an iBlot®2 gel transfer device (Invitrogen, Thermo-Fisher, Ref# IB21001) using mini or regular iBlot®2 stacks for 7 min according to the manufacturer’s guidelines.
After the transfer, the membranes were blocked in 1x TBST buffer containing 5% milk. The membranes were then treated with primary antibodies overnight at 4°C or for 2 hours at RT. The membranes were washed three times with 1x TBST buffer and then incubated with secondary antibodies. The infrared-tagged secondary antibodies were diluted 1/10,000 to 1/20,000 and incubated with the NC membranes for 45 minutes to an hour. At the end of the incubation, the membranes were washed three times with 1x TBST buffer, 5 minutes each, and scanned on a LI-COR Odyssey image analyzer.
Antibody Detection with Enzyme-Linked Immunosorbent Assay (ELISA)
HEK293T cells were cotransfected with the Qa-tagged HQ (TP1574) and LQ (TP1571) vectors at 50 ng/well of a 96-well plate in quadruplicate, with or without the normalization vector pGL4.16-CMV (TP329) or pRRL-E-Flag-LG (TP1578) at 20 ng/well. The original antibody plasmids pFUSEss-CHIg-hG1-SARS-CoV2-mAb (TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV2-mAb (TP1566) were used as the control. ELISA was performed using a Human IgG (Total) Uncoated ELISA Kit (Invitrogen, Thermo-Fisher, Cat # 88-50550-88). A 96-well Costar ELISA plate (Corning) was first coated with SARS-CoV-2 spike (S) protein from BEI (Cat # NR52724) at 100 pg/well overnight at 4°C. The washing and blocking steps were performed using the buffers and solutions provided in the kit. Supernatants containing secreted antibodies were collected from the transfections at 24 and 48 h and kept at 4°C until use. Aliquots (0.5, 2.5 and 5.0) of the antibody supernatants were added to the SARS-CoV-2 S-coated wells. After overnight incubation, the wells were washed (400 µl per well) with the solutions provided in the kit. The horseradish peroxidase (HRP)-conjugated anti-monoclonal detection antibody was diluted 1/250 in assay buffer, added to each well and incubated at room temperature (RT) for 2-3 h. The wells were then washed 3 times (400 µl each) at RT using a buffer provided in the kit and treated with 300 µL of the substrate TMB (3,3’,5,5’-tetramethylbenzidine) for 15 min to develop a blue color, and the reactions were terminated with 2 N HCl. The resulting yellow color was measured at 450 nm using a BioTek microplate reader. The level of anti-SARS-CoV monoclonal antibody was quantified by a sigmoidal four-parameter logistic (4PL) curve fit using GraphPad Prism 9.1.
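How a fitted 4PL standard curve converts an absorbance reading into an antibody concentration can be sketched as follows. The parametrization below is one common form of the 4PL model, not necessarily the one GraphPad Prism uses internally, and the parameter values are hypothetical placeholders rather than fitted results.

```python
def four_pl(x, a, b, c, d):
    """4PL response: a = response at zero dose, d = response at infinite dose,
    c = inflection point (mid-range concentration), b = slope factor."""
    return d + (a - d) / (1.0 + (x / c) ** b)


def inverse_four_pl(y, a, b, c, d):
    """Interpolate the concentration that gives response y on the fitted curve."""
    low, high = sorted((a, d))
    if not low < y < high:
        raise ValueError("response lies outside the range of the standard curve")
    ratio = (a - d) / (y - d) - 1.0
    return c * ratio ** (1.0 / b)


# Hypothetical standard-curve parameters (absorbance at 450 nm vs. ng/ml IgG).
params = dict(a=0.05, b=1.2, c=10.0, d=2.0)
a450 = four_pl(10.0, **params)             # response at the inflection point
concentration = inverse_four_pl(a450, **params)
```

Readings outside the asymptotes cannot be interpolated, which is why the inverse function rejects them; such samples would need to be re-assayed at a different dilution.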
ER-Golgi transport inhibition with Brefeldin A
Brefeldin A (BA, AdipoGen Life Sciences, Cat # AG-CN2-0018) was dissolved in DMSO to make a 1 mg/mL working solution. HEK293T cells were transfected with the indicated vectors using TP5 transfection reagent in DMEM plus 10% FBS as described above. The transfected cells were incubated overnight, and 10 µg/ml BA was added prior to the media change and incubated for 3 hours at 37°C in 5% CO2. The culture medium was then replaced with 293 FreeStyle serum-free medium (Gibco, Thermo-Fisher, Cat# 12-338-018) containing 10 µg/ml BA, and incubation was continued for 24 h at 37°C in 5% CO2. Supernatants were withdrawn right after the media replacement and collected again after 24 h. Cell lysates were also prepared at the 24 h time point. The supernatants and cell lysates were tested for gdLuc activity and by Western blot analysis.
Quantification and statistical analysis
Fold changes in the Qa or UTR groups relative to the corresponding non-Qa or non-UTR groups were quantified using Excel. Statistical analysis was performed using GraphPad Prism 9.1. Significance at *P < 0.05, **P < 0.01 and ***P < 0.001 was determined using a two-tailed Student’s t-test between two groups or one-way ANOVA for multiple comparisons. Data are presented as mean ± SE. The size and type of the individual samples are specified in the figure legends.
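The summary statistics named above (fold change, mean ± SE and the Student’s t statistic) can be sketched in Python as follows. The function names and the luminescence readings are hypothetical; the sketch computes only the t statistic and its degrees of freedom, and leaves the P value to a statistics package, since the standard library does not provide the t distribution.

```python
import math
import statistics


def mean_se(values):
    """Mean and standard error of the mean (sample SD / sqrt(n))."""
    return statistics.mean(values), statistics.stdev(values) / math.sqrt(len(values))


def fold_change(test_values, reference_values):
    """Ratio of group means, e.g. a Qa group over its non-Qa counterpart."""
    return statistics.mean(test_values) / statistics.mean(reference_values)


def student_t(group_a, group_b):
    """Two-sample Student's t statistic with pooled variance; returns (t, df)."""
    na, nb = len(group_a), len(group_b)
    va, vb = statistics.variance(group_a), statistics.variance(group_b)
    df = na + nb - 2
    pooled = ((na - 1) * va + (nb - 1) * vb) / df
    diff = statistics.mean(group_a) - statistics.mean(group_b)
    return diff / math.sqrt(pooled * (1.0 / na + 1.0 / nb)), df


# Hypothetical relative luminescence readings.
qlg = [2.0, 4.0, 6.0]
lg = [1.0, 2.0, 3.0]
fc = fold_change(qlg, lg)
t_stat, dof = student_t(qlg, lg)
```

The pooled-variance form shown here assumes equal variances in the two groups, which is the assumption behind the classical Student’s t-test.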
All of the features disclosed in this specification may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose.
References
Baldassarre, A., Paolini, A., Bruno, S.P., Felli, C., Tozzi, A.E., and Masotti, A. (2020). Potential use of noncoding RNAs and innovative therapeutic strategies to target the 5'UTR of SARS-CoV-2. Epigenomics 12, 1349-1361. 10.2217/epi-2020-0162.
Berkhout, B., Arts, K., and Abbink, T.E. (2011). Ribosomal scanning on the 5'- untranslated region of the human immunodeficiency virus RNA genome. Nucleic Acids Res 39, 5232-5244. 10.1093/nar/gkrl 13.
Bhagawati, M., Terhorst, T.M.E., Fusser, F., Hoffmann, S., Pasch, T., Pietrokovski, S., and Mootz, H.D. (2019). A mesophilic cysteine-less split intein for protein trans-splicing
applications under oxidizing conditions. Proc Natl Acad Sci U S A 116 , 22164-22172.
10.1073/pnas.1909825116.
Boo, S.H., and Kim, Y.K. (2020). The emerging role of RNA modifications in the regulation of mRNA stability. Exp Mol Med 52, 400-408. 10.1038/sl2276-020-0407-z.
Boson, B., Legros, V., Zhou, B., Siret, E., Mathieu, C., Cosset, F.L., Lavillette, D., and Denolly, S. (2020). The SARS-CoV-2 envelope and membrane proteins modulate maturation and retention of the spike protein, allowing assembly of virus-like particles. J Biol Chem 296 , 100111. 10.1074/jbc.RA120.016175.
Bottaro, S., Bussi, G., and Lindorff-Larsen, K. (2021). Conformational Ensembles of Noncoding Elements in the SARS-CoV-2 Genome from Molecular Dynamics Simulations. J Am Chem Soc. 10.1021/jacs.lc01094.
Cambray, G., Guimaraes, J.C., and Arkin, A.P. (2018). Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli. Nat Biotechnol 36, 1005-1015. 10.1038/nbt.4238.
Chan, A.P., Choi, Y., and Schork, N. J. (2020). Conserved Genomic Terminals of SARS- CoV-2 as Coevolving Functional Elements and Potential Therapeutic Targets. mSphere 5.
10.1128/mSphere.00754-20.
Chan, L.Y., Mugler, C.F., Heinrich, S., Vallotton, P., and Weis, K. (2018). Non-invasive measurement of mRNA decay reveals translation initiation as the major determinant of mRNA stability. Elife 7. 10.7554/eLife.32536.
Chng, J., Wang, T., Nian, R., Lau, A., Hoi, K.M., Ho, S.C., Gagnon, P., Bi, X., and Yang, Y. (2015). Cleavage efficient 2A peptides for high level monoclonal antibody expression in CHO cells. MAbs 7, 403-412. 10.1080/19420862.2015.1008351.
Corbett, K.S., Edwards, D.K., Leist, S.R., Abiona, O.M., Boyoglu-Barnum, S., Gillespie, R.A., Himansu, S., Schafer, A., Ziwawo, C.T., DiPiazza, A.T., et al. (2020). SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature 586 , 567-571.
10.1038/s41586-020-2622-0.
Daliri, E.B., Oh, D.H., and Lee, B.H. (2017). Bioactive Peptides. Foods 6. 10.3390/foods6050032.
DeCaprio, J., and Kohl, T.O. (2019). Tandem Immunoaffmity Purification Using Anti- FLAG and Anti-HA Antibodies. Cold Spring Harb Protoc 2019. 10.1101/pdb.prot098657.
Donofrio, G., Franceschi, V., Macchi, F., Russo, L., Rocci, A., Marchica, V., Costa, F., Giuliani, N., Ferrari, C., and Missale, G. (2021). A Simplified SARS-CoV-2 Pseudovirus Neutralization Assay. Vaccines (Basel) 9. 10.3390/vaccines9040389.
Dou, Y., Lin, Y., Wang, T.Y., Wang, X.Y., Jia, Y.L., and Zhao, C.P. (2021). The CAG promoter maintains high-level transgene expression in HEK293 cells. FEBS Open Bio 11, 95- 104. 10.1002/2211-5463.13029.
Gaur, S., Bhargava-Shah, A., Hori, S., Afjei, R., Sekar, T.V., Gambhir, S.S., Massoud, T.F., and Paulmurugan, R. (2017). Engineering Intracellularly Retained Gaussia Luciferase Reporters for Improved Biosensing and Molecular Imaging Applications. ACS Chem Biol 12, 2345-2353. 10.1021/acschembio.7b00454.
GrootBramel-Verheije, M.H., Rottier, P.J., and Meulenberg, J.J. (2000). Expression of a foreign epitope by porcine reproductive and respiratory syndrome virus. Virology 278, 380-389. 10.1006/viro.2000.0525.
Han, X., Ning, W., Ma, X., Wang, X., and Zhou, K. (2020). Improving protein solubility and activity by introducing small peptide tags designed with machine learning models. Metab Eng Commun 11, e00138. 10.1016/j.mec.2020.e00138.
Hinnebusch, A.G., Ivanov, I.P., and Sonenberg, N. (2016). Translational control by 5'- untranslated regions of eukaryotic mRNAs. Science 352, 1413-1416. 10.1126/science. amino acidsd9868.
Hu, J., Gao, Q., He, C., Huang, A., Tang, N., and Wang, K. (2020). Development of cell- based pseudovirus entry assay to identify potential viral entry inhibitors and neutralizing antibodies against SARS-CoV-2. Genes Dis 7, 551-557. 10.1016/j.gendis.2020.07.006.
Katayama, S., Corpuz, H.M., and Nakamura, S. (2021). Potential of plant-derived peptides for the improvement of memory and cognitive function. Peptides 142, 170571.
10.1016/j.peptides.2021.170571.
Kim, J.H., Lee, S.R., Li, L.H., Park, H.J., Park, J.H., Lee, K.Y., Kim, M.K., Shin, B.A., and Choi, S.Y. (2011). High cleavage efficiency of a 2 A peptide derived from porcine teschovirus-1 in human cell lines, zebrafish and mice. PLoS One 6 , el 8556.
10.1371/journal. pone.0018556.
Kolahchi, Z., De Domenico, M., Uddin, L.Q., Cauda, V., Grossmann, I., Lacasa, L., Grancini, G., Mahmoudi, M., and Rezaei, N. (2021). COVID-19 and Its Global Economic Impact. Adv Exp Med Biol 1318, 825-837. 10.1007/978-3-030-63761-3_46.
Korber, B., Fischer, W.M., Gnanakaran, S., Yoon, H., Theiler, L, Abfalterer, W., Hengartner, N., Giorgi, E.E., Bhattacharya, T., Foley, B., et al. (2020). Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell 182, 812-827 e819. 10.1016/j.cell.2020.06.043.
Kuzmina, A., Khalaila, Y., Voloshin, O., Keren-Naus, A., Boehm-Cohen, L., Raviv, Y., Shemer-Avni, Y., Rosenberg, E., and Taube, R. (2021). SARS-CoV-2 spike variants exhibit differential infectivity and neutralization resistance to convalescent or post-vaccination sera. Cell Host Microbe. 10.1016/j . chom.2021.03.008.
Lee, T.H., Kim, K.S., Kim, J.H., Jeong, J.H., Woo, H R., Park, S.R., Sohn, M.H., Lee, H.J., Rhee, J.H., Cha, S.S., et al. (2020). Novel short peptide tag from a bacterial toxin for versatile applications. J Immunol Methods 479, 112750. 10.1016/j.jim.2020.112750.
Li, Y. (2011). Recombinant production of antimicrobial peptides in Escherichia coli: a review. Protein Expr Purif 80, 260-267. 10.1016/j . pep.2011.08.001.
Liu, J., Bodnar, B.H., Meng, F., Khan, A., Wang, X., Luo, G., Saribas, S., Wang, T., Lohani, S.C., Wang, P., et al. (2021a). Epigallocatechin Gallate from Green Tea Effectively Blocks Infection of SARS-CoV-2 and New Variants by Inhibiting Spike Binding to ACE2 Receptor. bioRxiv, 2021.2003.2017.435637. 10.1101/2021.03.17.435637.
Liu, J., Bodnar, B.H., Wang, X., Wang, P., Meng, F., Khan, A.I., Saribas, A.S., Padhiar, N.H., McCluskey, E., Shah, S., et al. (2021b). Correlation of vaccine-elicited antibody levels and neutralizing activities against SARS-CoV-2 and its variants. bioRxiv, 2021.2005.2031.445871. 10.1101/2021.05.31.445871.
Lorenz, I.C., Nguyen, H.T., Kemelman, M., Lindsay, R.W., Yuan, M., Wright, K.J., Arendt, H., Back, J.W., DeStefano, J., Hoffenberg, S., et al. (2014). The stem of vesicular stomatitis virus G can be replaced with the HIV-1 Env membrane-proximal external region without loss of G function or membrane-proximal external region antigenic properties. AIDS Res Hum Retroviruses 30, 1130-1144. 10.1089/AID.2013.0206.
Majorek, K.A., Kuhn, M.L., Chruszcz, M., Anderson, W.F., and Minor, W. (2014). Double trouble-Buffer selection and His-tag presence may be responsible for nonreproducibility of biomedical experiments. Protein Sci 23, 1359-1368. 10.1002/pro.2520.
Miao, Z., Tidu, A., Eriani, G., and Martin, F. (2020). Secondary structure of the SARS- CoV-2 5'-UTR. RNA Biol, 1-10. 10.1080/15476286.2020.1814556.
Mishra, V. (2020). Affinity Tags for Protein Purification. Curr Protein Pept Sci 21, 821 - 830. 10.2174/1389203721666200606220109.
Muik, A., Wallisch, A.K., Sanger, B., Swanson, K.A., Muhl, J., Chen, W., Cai, H., Maurus, D., Sarkar, R., Tureci, O., et al. (2021). Neutralization of SARS-CoV-2 lineage B.1.1.7 pseudovirus by BNT162b2 vaccine-elicited human sera. Science 371, 1152-1153.
10.1126/science. abg6105.
Nie, L, Li, Q., Wu, L, Zhao, C., Hao, H., Liu, H., Zhang, L., Nie, L., Qin, H., Wang, M., et al. (2020). Establishment and validation of a pseudovirus neutralization assay for SARS-CoV- 2. Emerg Microbes Infect 9, 680-686. 10.1080/22221751.2020.1743767.
Ou, X., Liu, Y., Lei, X., Li, P., Mi, D., Ren, L., Guo, L., Guo, R., Chen, T., Hu, L, et al. (2020). Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat Commun 11, 1620. 10.1038/s41467-020-15562-9.
Peighambardoust, S.H., Karami, Z., Pateiro, M., and Lorenzo, J.M. (2021). A Review on Health-Promoting, Biological, and Functional Aspects of Bioactive Peptides in Food Applications. Biomolecules 11. 10.3390/bioml 1050631.
Pina, A.S., Batalha, I.L., Dias, A., and Roque, A.C.A. (2021). Affinity Tags in Protein Purification and Peptide Enrichment: An Overview. Methods Mol Biol 2178 , 107-132.
10.1007/978-l-0716-0775-6_10.
Polack, F.P., Thomas, S.J., Kitchin, N., Absalon, L, Gurtman, A., Lockhart, S., Perez, J.L., Perez Marc, G., Moreira, E.D., Zerbini, C., et al. (2020). Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine. N Engl J Med 383 , 2603-2615.
10.1056/NEJMoa2034577.
Raman, S., and Brian, D.A. (2005). Stem-loop IV in the 5' untranslated region is a cis- acting element in bovine coronavirus defective interfering RNA replication. J Virol 79, 12434- 12446. 10.1128/JVI.79.19.12434-12446.2005.
Rangan, R., Zheludev, I.N., Hagey, R.J., Pham, E.A., Wayment- Steele, H.K., Glenn, J.S., and Das, R. (2020). RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA 26, 937-959. 10.1261/ma.076141.120.
Rezaei, N., Ashkevarian, S., Fathi, M.K., Hanaei, S., Kolahchi, Z., Ladi Seyedian, S.S., Rayzan, E., Sarzaeim, M., Vahed, A., Mohamed, K., et al. (2021). Introduction on Coronavirus Disease (COVID-19) Pandemic: The Global Challenge. Adv Exp Med Biol 1318, 1-22.
10.1007/978-3-030-63761-3_1.
Rouchka, E.C., Chariker, J.H., and Chung, D. (2020). Variant analysis of 1,040 SARS- CoV-2 genomes. PLoS One 15, e0241535. 10.1371/journal.pone.0241535.
Ryder, S.P., Morgan, B.R., Coskun, P., Antkowiak, K., and Massi, F. (2021). Analysis of Emerging Variants in Structured Regions of the SARS-CoV-2 Genome. Evol Bioinform Online 17, 11769343211014167. 10.1177/11769343211014167.
Saribas, A.S., White, M.K., and Safak, M. (2018). Structure-based release analysis of the JC virus agnoprotein regions: A role for the hydrophilic surface of the major alpha helix domain in release. J Cell Physiol 233, 2343-2359. 10.1002/jcp.26106.
Schlehuber, L.D., and Rose, J.K. (2004). Prediction and identification of a permissive epitope insertion site in the vesicular stomatitis virus glycoprotein. J Virol 78, 5079-5087.
10.1128/jvi.78.10.5079-5087.2004.
Senanayake, S.D., and Brian, D.A. (1999). Translation from the 5' untranslated region (UTR) of mRNA 1 is repressed, but that from the 5' UTR of mRNA 7 is stimulated in coronavirus-infected cells. J Virol 73, 8003-8009. 10.1128/JVI.73.10.8003-8009.1999.
Shirokikh, N.E., Dutikova, Y.S., Staroverova, M.A., Hannan, R.D., and Preiss, T. (2019). Migration of Small Ribosomal Subunits on the 5' Untranslated Regions of Capped Messenger RNA. Int J Mol Sci 20. 10.3390/ijms20184464.
Traenkle, B., Segan, S., Fagbadebo, F.O., Kaiser, P.D., and Rothbauer, U. (2020). A novel epitope tagging system to visualize and monitor antigens in live cells with chromobodies. Sci Rep 10, 14267. 10.1038/s41598-020-71091-x.
Vasan, N., Razavi, P., Johnson, J.L., Shao, H., Shah, H., Antoine, A., Ladewig, E., Gorelick, A., Lin, T.Y., Toska, E., et al. (2019). Double PIK3CA mutations in cis increase oncogenicity and sensitivity to PBKalpha inhibitors. Science 366, 714-723.
10.1126/science. amino acidsw9032.
Viswanathan, S., Williams, M.E., Bloss, E.B., Stasevich, T.J., Speer, C.M., Nern, A., Pfeiffer, B.D., Hooks, B.M., Li, W.P., English, B.P., et al. (2015). High-performance probes for light and electron microscopy. Nat Methods 12, 568-576. 10.1038/nmeth.3365.
Walls, A.C., Park, Y.J., Tortorici, M.A., Wall, A., McGuire, A.T., and Veesler, D.
(2020). Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181, 281-292 e286. 10.1016/j.cell.2020.02.058.
Walsh, E.E., Frenck, R.W., Jr., Falsey, A.R., Kitchin, N., Absalon, J., Gurtman, A., Lockhart, S., Neuzil, K., Mulligan, M.J., Bailey, R., et al. (2020). Safety and Immunogenicity of Two RNA-Based Covid-19 Vaccine Candidates. N Engl J Med 383, 2439-2450.
10.1056/NEJMoa2027906.
Wang, Q., Zhang, Y., Wu, L., Niu, S., Song, C., Zhang, Z., Lu, G., Qiao, C., Hu, Y., Yuen, K.Y., et al. (2020). Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2. Cell 181, 894-904 e899. 10.1016/j.cell.2020.03.045.
Weber, M., Burgos, R., Yus, E., Yang, J.S., Lluch-Senar, M., and Serrano, L. (2020). Impact of C-terminal amino acid composition on protein expression in bacteria. Mol Syst Biol 16, e9208. 10.15252/msb.20199208.
Weissman, D., Alameh, M.G., de Silva, T., Collini, P., Hornsby, H., Brown, R., LaBranche, C.C., Edwards, R.J., Sutherland, L., Santra, S., et al. (2021). D614G Spike Mutation Increases SARS CoV-2 Susceptibility to Neutralization. Cell Host Microbe 29, 23-31 e24.
10.1016/j.chom.2020.11.012.
Wibmer, C.K., Ayres, F., Hermanus, T., Madzivhandila, M., Kgagudi, P., Oosthuysen,
B., Lambson, B.E., de Oliveira, T., Vermeulen, M., van der Berg, K., et al. (2021a). SARS-CoV- 2 501 Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med. 10.1038/s41591-021-01285-x.
Wibmer, C.K., Ayres, F., Hermanus, T., Madzivhandila, M., Kgagudi, P., Oosthuysen,
B., Lambson, B.E., de Oliveira, T., Vermeulen, M., van der Berg, K., et al. (2021b). SARS-CoV- 2 501 Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med 27, 622- 625. 10.1038/s41591-021-01285-x.
Williams, G.D., Chang, R.Y., and Brian, D.A. (1999). A phylogenetically conserved hairpin-type 3' untranslated region pseudoknot functions in coronavirus RNA replication. J Virol 73, 8349-8355. 10.1128/JVI.73.10.8349-8355.1999.
Wu, F., Zhao, S., Yu, B., Chen, Y.M., Wang, W., Song, Z.G., Hu, Y., Tao, Z.W., Tian, J.H., Pei, Y.Y., et al. (2020). A new coronavirus associated with human respiratory disease in China. Nature 579, 265-269. 10.1038/s41586-020-2008-3.
Yang, D., and Leibowitz, J.L. (2015). The structure and functions of coronavirus genomic 3' and 5' ends. Virus Res 206, 120-133. 10.1016/j.virusres.2015.02.025.
Zhang, J., Cruz-Cosme, R., Zhuang, M.W., Liu, D., Liu, Y., Teng, S., Wang, P.H., and Tang, Q. (2020). A systemic and molecular study of subcellular localization of SARS-CoV-2 proteins. Signal Transduct Target Ther 5, 269. 10.1038/s41392-020-00372-8.
Zhang, J., Roberts, R., and Rakotondrafara, A.M. (2015). The role of the 5' untranslated regions of Potyviridae in translation. Virus Res 206, 74-81. 10.1016/j.virusres.2015.02.005.
Zhao, J., Qiu, J., Aryal, S., Hackett, J.L., and Wang, J. (2020). The RNA Architecture of the SARS-CoV-2 3 '-Untranslated Region. Viruses 12. 10.3390/vl2121473.
Example 3: Protein Expression/Secretion Boost By An Expression-Enhancing 21-mer Cis-Regulatory Motif (Exen21)
Many technologies have been developed to boost protein production, such as promoter optimization, mRNA regulation, codon optimization, and protein stabilization, as well as modification of the host cellular expression machinery, including humanized yeast systems. While these strategies have been used successfully in various research fields and by biopharmaceutical companies, developing a simple, universal method that increases protein production at lower cost remains a research focus. The study of SARS-CoV-2 has been hampered by the low-level expression of many viral proteins, including the spike (S) protein, in mammalian cells, which has limited the quick response to the COVID-19 pandemic. To optimize SARS-CoV-2 viral protein expression, various expression vectors were developed herein using different promoters and a luciferase/GFP-based dual reporter system. During the vector optimization process, it was discovered that the addition of a novel 21-mer oligonucleotide motif (termed herein “Exen21”, Expression-Enhancing 21) to the vector dramatically increased the expression and secretion of the SARS-CoV-2 envelope (E) protein. This unique Exen21 encodes a specific heptapeptide designated as Qa. The insertion of Exen21/Qa was extended to various types of proteins, and it was found to enhance the production of other SARS-CoV-2 proteins, cellular gene products, mRNA vaccines, antibodies, engineered recombinant proteins, and virus-packaging proteins.
Materials and Methods
Vector cloning
All the PCR reactions for cloning in this study were performed using the Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher, F531) and purified using the Monarch PCR & DNA Cleanup Kit (NEB, T1030S). The correct clones were verified by restriction enzyme digestion and Sanger sequencing as well as by functional measures.
Dual reporter vectors: The dual reporter LG fragment, encoding Gaussia-Dura luciferase (gdLuc) and destabilized GFP (dsGFP), was generated by overlay PCR: 1) Standard PCR was performed to generate fragment 1 (gdLuc) from the template plasmid pMCS-Gaussia-Dura-Luciferase (Thermo Fisher Scientific, Cat#16190) with primer pair T1290/T1291, while fragment 2 (dsGFP) was generated from the plasmid pLenti-EFS-EGFPd2PEST-2A-MCS-Hygro (TP1380, a gift from Neville Sanjana (Addgene Cat# 138152)) with T1292/T1293; 2) The two purified fragments (100 ng each), which overlap by 19 nucleotides, were mixed for 5 cycles of PCR with primer pair T1304/T1305 at 98°C for 15 sec, 58°C for 30 sec and 72°C for 1 min, followed by 30 cycles of PCR at 98°C for 30 sec, 55°C for 30 sec and 72°C for 1 min to generate the LG fragment. After purification with the NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel, Cat# REF740609), this LG fragment (1485 bp) was cloned into pcDNA6B-nCoV-X-Flag vectors encoding various viral proteins of SARS-CoV-2 or the cellular gene hACE2 via the SacII cloning site using the NEBuilder® HiFi DNA Assembly cloning kit (NEB, Cat# E5520S, assigned as NEB-HiFi) to generate pcDNA6B-SARS-CoV-2-X-Flag-LG vectors. The “X” indicates the gene of interest.
Unexpectedly, functional assays and Sanger sequencing identified a novel clone, assigned as pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479), which carries a Qa peptide in the ORF upstream of LG (assigned as QLG). The insert fragment encoding the SARS-CoV-2 S protein from the pcDNA6B-nCoV-S-Flag vector (TP1456) was cloned into TP1479 via XhoI/XbaI sites to generate pcDNA6B-SARS-CoV-2-S-Flag-QLG (TP1487). The insert fragment encoding the SARS-CoV-2 N protein from the pcDNA6B-nCoV-N-Flag vector (TP1431) or hACE2 from the pcDNA6B-hACE2-Flag vector (TP1470) was cloned into TP1479 via KpnI/XbaI sites to generate pcDNA6B-SARS-CoV-2-N-Flag-QLG (TP1490) or pcDNA6B-hACE2-Flag-QLG (TP1491).
The pcDNA6B-NIBP-Flag-LG (TP1560) vector was generated by NEB-HiFi cloning of the NIBP PCR product from pYX-Asc-mNIBP (TP546, Genbank # BC070463) with the primers T1375/T1376 into pcDNA6B-hACE2-Flag-LG (TP1538) via NotI/XbaI, while the pcDNA6B-NIBP-Flag-QLG (TP1558) vector was generated by NEB-HiFi cloning of the NIBP PCR product into pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479) via XhoI/XbaI.
The pCAG vectors encoding E were generated by replacing the CMV promoter in the corresponding pcDNA6B-SARS-CoV-2-E-Flag-LG or -QLG vectors with the CAG promoter via SnaBI/KpnI sites.
Mutation vectors: Site-directed or deletion mutagenesis of Exen21/Qa was performed using pcDNA6B-SARS-CoV-2-E-Flag-QLG (TP1479) as a template. Mutagenic primers were designed to change or delete specific nucleotides in the Exen21 sequence. For each mutation, a Phusion High-Fidelity PCR reaction was performed using a universal primer (T1640) matching a region upstream of SARS-CoV-2 E and a mutagenic primer matching the Exen21 sequence except at the position where the desired mutation was introduced. The PCR product carrying the Exen21 mutation was gel purified and cloned into AcoAV/Ao/I-digested 6B-E-QLG DNA using the NEBuilder® HiFi DNA assembly kit.
Antibody vectors: The CR3022 plasmid set, pFUSEss-CHIg-hG1-SARS-CoV-2-mAb (NR-52399, TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV-2-mAb (NR-52400, TP1566), expressing the heavy (H) and light (L) chains of human anti-SARS-CoV mAb, respectively (GenBank: DQ168569 and DQ168570), was produced under HHSN272201400008C and obtained from BEI Resources, NIAID, NIH (Cat# NR-53260). The Q-tagged HQ (TP1574) and LQ (TP1571) vectors were generated from the H or L plasmids at the NheI site using NEB-HiFi with synthesized oligonucleotides that contain the Q-encoding sequence and the C-terminus of the immunoglobulin heavy and light chains (T1378, T1380-T1383).
Lentiviral vectors: The vector pRRL-SIN.cPPT.PGK-GFP.WPRE (TP792; Addgene #12252) was used to generate pRRL-E-Flag-LG-GFP (TP1577) by transferring the E-Flag-LG insert from TP1478 to TP792 via BamHI/AgeI. The pRRL-E-Flag-LG (TP1578) vector was generated from TP1577 by AgeI/KpnI blunt ligation. The pRRL-E-Flag-QLG (TP1579) vector was generated by transferring E-Flag-QLG from TP1479 to TP1578 via BamHI/BstBI. The TP1578 and TP1579 vectors were used as backbones for NEB-HiFi cloning of human IFNγ and IL2 PCR products via the XbaI site to generate pRRL-IFNγ-LG (TP1604) or QLG (TP1605) and pRRL-IL2-LG (TP1606) or QLG (TP1607). The PCR fragments of IFNγ and IL2 were derived, respectively, from pUC8-IFNγ (Addgene #17600) and pAIP-hIL2-co (Addgene #90513) using primer pairs T1407/T1408 and T1409/T1410. The pRRL-Flag-LG (TP1685) and pRRL-Flag-QLG (TP1686) vectors were generated from TP1621 and TP1622, respectively, via BsmBI/XbaI digestion and NEB-HiFi cloning with an oligonucleotide insert (T1469). The pLV-EF1a-spCas9-Q-T2A-RFP (TP1562) vector was generated from pLV-EF1a-spCas9-T2A-RFP (TP855) at Ariel site using NEB-HiFi cloning with a synthesized oligonucleotide that contains the Q-encoding sequence (T1361). The pLV-EF1a-MS2-spCas9-Q-F2A-GFP (TP1552) vector was generated from pLV-EF1a-MS2-spCas9-F2A-GFP (TP1081) at Ariel site using NEB-HiFi cloning with the oligonucleotide (T1361).
The LV packaging vector psPAX2-Gag-Q (TP1618) was generated from psPAX2 (TP592, Addgene #12260) via SphI/EcoRV sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as the PCR template. The psPAX2-Pol-Q-RRE-Q (TP1619) vector was generated from psPAX2 via SwaI/NheI sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1400/T1401 and T1402/T1403 using psPAX2 as the PCR template. The psPAX2-Gag-Q-Pol-Q-RRE-Q (TP1620) vector was generated from TP1619 via SphI/EcoRV sites by NEB-HiFi cloning of two overlay PCR fragments with primer pairs T1396/T1397 and T1398/T1399 using TP592 as the PCR template.
S-pseudoviral vectors: The vector pCAG-SARS-CoV-2-Sd18Q (TP1506), encoding the human codon-optimized S gene of SARS-CoV-2 with a C-terminal 18 aa deletion (Sd18) and a Qa tag fusion, was constructed using NEB-HiFi. Briefly, the Sd18 expression cassette in the CMV-driven vector pcDNA3.1-SARS2-S (Addgene Cat# 145032) was transferred to the CAG-driven vector pCAG-Flag-SARS-CoV-2-S (gift from Peihui Wang) via EcoRI/NotI sites and PCR with primer pair T1323/T1324. The vector pCAG-SARS-CoV-2-Sd18 (TP1567), encoding Sd18 without the Qa tag, was constructed via NEB-HiFi cloning with the synthesized oligonucleotide insert T1367 at the SacII/NotI site of the pCAG-SARS-CoV-2-Sd18Q vector.
Plasmid DNA purification and DNA quantification
Plasmid DNAs were purified using commercial kits for endotoxin-free minipreps (Cat# REF 740490) or midipreps (Cat# REF 740420) from Macherey-Nagel. The E. coli bacterial cultures (4 ml for miniprep, 200 ml for midiprep) harboring the relevant plasmids were grown overnight in LB or 2YT media supplemented with 100 µg/ml carbenicillin, 50 µg/ml kanamycin, 50 µg/ml blasticidin, or 50 µg/ml Zeocin, at 30°C for NEB-stable or 37°C for DH5α E. coli cells. The bacterial cultures were harvested by centrifugation, and the pellets were processed to purify plasmid DNA according to the manufacturer’s guidelines. The final DNA was dissolved in ultra-pure DNase/RNase-free distilled water (Thermo Fisher, Cat#10977023), and DNA concentrations were determined using either a Nanodrop 1 UV-Vis Spectrophotometer (Thermo-Fisher) or a Take3 plate in a Bio-Tek multiplate reader.
Cell culture and Transfections
HEK293T human fetal kidney and HeLa human cervix epithelial cells were obtained from ATCC (Cat# CRL-3216 and CCL-2). Both cell lines were cultured in Dulbecco's Modified Eagle's Medium (DMEM, Gibco) supplemented with Fetal Bovine Serum (FBS) and 1% Penicillin/Streptomycin (Corning). BHK-21-/WI-2 cells (Kerafast, EH1011) were grown in DMEM supplemented with 5% FBS and 1% Penicillin/Streptomycin. All cells were incubated at 37°C under a 5% CO2 atmosphere.
For most experiments, 96-well plates were used. For mRNA stability assays, 24-well plates were used. Cells resuspended in DMEM plus 10% FBS were seeded (3-4x10^ cells/well for 96-well plates or 1-2x10^ cells/well for 24-well plates) the night before transfection. For transfections, Transporter 5 transfection reagent (TP5) (Polysciences Cat# 26008) was used at a 1:4 ratio of DNA to reagent. Typically, 50-100 ng plasmid DNA per well of a 96-well plate was mixed with 0.2-0.4 µl TP5 in 0.9% NaCl solution and incubated at room temperature for 20 min. The transfection reagent and DNA solution were mixed again and added to each well dropwise. The transfections were incubated at 37°C in 5% CO2 overnight (16-18 h), after which the media was replaced with DMEM plus 10% FBS.
Multilabeled fluorescent immunocytochemistry and confocal image analysis
Cells were fixed for 30 min with 4% paraformaldehyde (PFA), washed with 1x PBS, permeabilized with 0.5% Triton X-100 in 1x phosphate buffered saline (PBS) for 30 min, blocked with 10% donkey serum for 1 h, and incubated with mouse anti-Flag monoclonal or anti-2A primary antibodies in 0.1% Triton X-100/1x PBS overnight at 4°C. The next day, cells were washed with 1x PBS and incubated with the corresponding Alexa Fluor secondary antibodies (Jackson Immuno Research Labs; donkey anti-rabbit, anti-mouse, IgG (H+L) 488, 594, or 680) at a 1:400 dilution for 1 h at room temperature, using Hoechst 33258 (1:5000) as a nuclear counterstain. Fluorescent confocal images were acquired and analyzed using the Leica SP8 confocal system.
Flow Cytometry
Cells expressing the dsGFP reporter were dissociated with Accutase (Corning), passed through a 70 µm nylon cell strainer (Corning) to remove large clumps, and washed with 1x PBS. Dissociated cells were fixed with 4% PFA in PBS, and GFP-positive cells were analyzed using a Cytek Aurora flow cytometer.
RNA Extraction and Reverse Transcription Quantitative PCR (RT-qPCR) for mRNA stability assay
HEK293T cells were transfected with the indicated vectors (500 ng/well for 24-well plates) for 24 h before treatment with the transcriptional inhibitor actinomycin D (10 µM) for various periods. Total RNA was extracted using the Monarch Total RNA Miniprep Kit (NEB, Cat# T2010), which includes two steps of DNA removal. An equal amount of RNA (0.5 µg) was used to synthesize cDNA using the High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific, Cat# 4368814) with random hexanucleotide primers. Real-time PCR analysis was carried out on a QuantStudio™ 3 System. The mRNA expression levels of the reporter gdLuc luciferase and human β-actin were determined using the iTaq Universal SYBR Green Supermix kit (BioRad, Cat# 1725121). The gdLuc primers were (forward) 5'-GATTACAAGGATGACGACGATAAG-3' (T1364, targeting Flag) and (reverse) 5'-AAGTCTTCGTTGTTCTCGGTGGG-3' (T432, targeting gdLuc). The human β-actin primers were (forward) 5'-AAGAGCTATGAGCTGCCTGA-3' and (reverse) 5'-TACGGATGTCAACGTCACAC-3'. Each sample was tested in triplicate. Cycle threshold (Ct) values were obtained graphically for the reporter and β-actin. The difference in Ct values between the reporter and β-actin was expressed as the ΔCt value. The ΔΔCt values were obtained by subtracting the ΔCt values of the control samples from those of the samples at different time points. Relative percentage change in gene expression was calculated as 2^(-ΔΔCt). The mRNA decay rate was calculated by non-linear regression curve fitting (one phase decay) using GraphPad Prism 9.1. Three independent experiments were performed.
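The ΔΔCt arithmetic and the half-life estimate described above can be illustrated with a minimal sketch. The Ct values and time points below are hypothetical, the function names are illustrative only, and the decay fit is a simplification: it assumes a plateau of zero and a starting fraction of 1, and fits ln(y) by least squares through the origin, whereas the one-phase decay in GraphPad Prism is a non-linear regression that may include a plateau.

```python
import math

def relative_expression(ct_reporter, ct_actin, ct_reporter_ctrl, ct_actin_ctrl):
    """Return 2^(-ddCt) for one sample relative to a control sample."""
    dct_sample = ct_reporter - ct_actin              # dCt of the sample
    dct_control = ct_reporter_ctrl - ct_actin_ctrl   # dCt of the control
    ddct = dct_sample - dct_control                  # ddCt = dCt(sample) - dCt(control)
    return 2.0 ** (-ddct)

def decay_constant(times_h, fraction_remaining):
    """Estimate k in y = exp(-k * t), assuming y(0) = 1 and a plateau of 0.

    Least squares on ln(y) = -k * t with the line forced through the origin.
    """
    num = sum(t * math.log(y) for t, y in zip(times_h, fraction_remaining))
    den = sum(t * t for t in times_h)
    return -num / den

def half_life(k):
    """Half-life of a single-exponential decay with rate constant k."""
    return math.log(2.0) / k

# Hypothetical example values (not taken from the experiments described here).
rel = relative_expression(22.0, 18.0, 20.0, 18.0)
k = decay_constant([0, 2, 4, 8], [1.0, 0.5, 0.25, 0.0625])
t_half = half_life(k)
```

In this sketch the half-life is in the same time unit as the input time points (hours here).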
Luciferase assays
For the gdLuc assay, the coelenterazine (CTZ) substrate (Nanolight Technology, Cat# 3032) was dissolved in 10 ml ultra-sterile distilled water to make the stock solution and kept at -20°C until use. The CTZ stock solution was diluted 10-30 times to make working solutions. Equal amounts of CTZ working solution and cell culture media collected after transfection (25-50 µl) were mixed in a white opaque 96-well optiplate (Corning, Cat# CLS3922), and the luminescence was measured in a BioTek Synergy LX multiplate reader. For the firefly luciferase assay in some experiments, the ONE-Glo Luciferase assay kit (Promega Corp, Cat# E6110) was used. Aliquots of 100 µl substrate solution were mixed with 3-5 µl of cell lysates, and the luminescence was measured in a BioTek Synergy LX multiplate reader. Data were presented as relative luciferase activity or fold changes compared with the corresponding group. Experiments were performed at least 3 times, each in quadruplicate.
In vitro transcription and mRNA transfection
For the pcDNA6B vector containing the T7 promoter, the DNA was linearized by AgeI digestion followed by gel purification. For PCR products, the primers included the T7 promoter (TTAATACGACTCACTATAGGGTGGAATTCTGCAGATATCCAG, T1427), generating a DNA fragment containing the target gene, the LG or QLG dual reporter, and a poly(A) tail. PCR was performed using the Phusion High-Fidelity PCR Master Mix kit (Thermo Fisher Scientific, F531). The DNA was purified using a gel extraction kit, and the concentration was determined using a Take3 plate in a BioTek multiplate reader. RNA was synthesized from the purified DNA template using the HiScribe™ T7 ARCA mRNA Kit (NEB, Cat#E2060), co-transcriptionally capped with m7G anti-reverse cap analog (ARCA, Cat#1411), and poly(A) tailed. The synthesized RNA was purified using the Monarch RNA cleanup kit (NEB, Cat#E2040) and quantified with a Take3 plate. Equal amounts of RNA from the LG and QLG groups at different dosages were transfected into HEK293T cells in quadruplicate with Lipofectamine® MessengerMAX mRNA Transfection Reagent (Thermo Fisher Scientific, Cat#LMRNA015) following the manufacturer's manual. At 4-72 h post-transfection, the culture media containing gdLuc were collected, and the gdLuc assay was performed as above.
VSV-G or S protein-pseudotyped lentivirus packaging and titration
The recombinant lentivirus carrying the indicated lentiviral vector was produced at small scale using the second-generation LV packaging system according to standard protocols. Briefly, HEK293T cells in one well of a 6-well plate were cotransfected using the TP5 kit with the indicated transfer LV vector (1.4 µg), the packaging vector psPAX2 or its mutants (1 µg), and the VSV-G or Sdl8 vector (0.4 µg). At 2-3 days post-transfection, the supernatants containing LV were concentrated and
49 purified with simplified 10% sucrose purification as described previously . The functional titers of the crude and purified lentivirus were determined by counting GFP-expressing HEK293T cells at 48 h after infection with serial dilutions of lentiviruses under fluorescent microscopy. For some cases, flow cytometry analysis was used for LV titration.
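The text does not give the formula used to convert GFP-positive cell counts into a functional titer. A commonly used calculation is sketched below as an assumption, not as the method of this work: transducing units per ml are estimated from the number of cells present at transduction, the fraction of GFP-positive cells, the dilution factor, and the inoculum volume. All names and numbers are hypothetical, and the estimate is generally only meaningful at dilutions where a small fraction of cells is GFP-positive.

```python
def functional_titer_tu_per_ml(cells_at_transduction, gfp_fraction,
                               dilution_factor, inoculum_volume_ml):
    """Estimate functional titer in transducing units (TU) per ml.

    Assumed formula (not stated in the source):
    TU/ml = cells x fraction GFP+ x dilution factor / inoculum volume (ml)
    """
    if not 0.0 <= gfp_fraction <= 1.0:
        raise ValueError("gfp_fraction must be between 0 and 1")
    if inoculum_volume_ml <= 0:
        raise ValueError("inoculum_volume_ml must be positive")
    return cells_at_transduction * gfp_fraction * dilution_factor / inoculum_volume_ml

# Hypothetical example: 40,000 cells, 10% GFP-positive, 1:100 dilution, 0.1 ml inoculum.
titer = functional_titer_tu_per_ml(40_000, 0.10, 100, 0.1)
```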
Western Blot Analysis
SDS-polyacrylamide gels (10-12%) were home-made, or mini-PROTEAN TGX gels (Cat# 4561093, 4561096) were purchased from BioRad. The cell lysates were prepared using a lysis buffer composed of 50 mM Tris-HCl pH 7.0, 150 mM NaCl, 5 mM EDTA and 1% Triton X-100 supplemented with PMSF (100x), Aprotinin and Leupeptin (200x). Lysates of 50 µl were prepared from each well after collecting the supernatant. The lysates were incubated at 4°C for 20-30 min and centrifuged at maximum speed in an Eppendorf centrifuge.
The clear lysates were either denatured immediately for 5 min at 98°C in 1x SDS-PAGE loading dye or stored at -80°C until use. Supernatants were stored at 4°C until they were treated with 1x SDS-PAGE loading dye. Denatured aliquots of 10-20 µl cell lysates or 20-30 µl supernatants were loaded onto SDS-polyacrylamide gels. SDS-PAGE was performed in Tris-Glycine/SDS buffers under denaturing and reducing conditions.
Proteins in the polyacrylamide gels were transferred to 0.2-µm nitrocellulose membranes (BioRad supported nitrocellulose (NC) membrane, Cat# 162-0097) using either wet transfer or an iBlot®2 device with iBlot®2 NC mini (IB23002) or regular stacks (IB23001). For wet transfer, the following 1x transfer buffer was used: 25 mM Tris-HCl pH 7.6, 192 mM glycine, 20% methanol. The gels were sandwiched together with NC membranes, and transfers were performed in 1x transfer buffer at 250 mA at 4°C for 1-2 h.
Dry western blot transfers were performed in an iBlot®2 gel transfer device (Invitrogen, Thermo-Fisher, Ref# IB21001) using mini or regular iBlot®2 stacks for 7 min according to the manufacturer's guidelines. After the transfer, the membranes were blocked in 1x TBST buffer containing 5% milk. The membranes were then treated with primary antibodies overnight at 4°C or for 2 h at RT. The membranes were washed three times with 1x TBST buffer minute each followed by incubation with secondary antibodies. The secondary antibodies with infrared tag were diluted 1/10000-120000 and incubated with the NC membranes for 45 min to 1 h. At the end of incubation, the membranes were washed with 1x TBST buffer three times, 5 min each, and scanned on a Li-COR Odyssey image analyzer. The images were analyzed by densitometric measurement with NIH ImageJ (version 1.53). The data were expressed as integrated density times area and presented as relative fold in comparison with the corresponding control.
Antibody Detection with Enzyme-Linked Immunosorbent Assay (ELISA)
HEK293T cells were cotransfected with the Q-tagged HQ (TP1574) and LQ (TP1571) at 50 ng/well of a 96-well plate in quadruplicate, with or without the normalization vector pGL4.16-CMV (TP329), which was derived from the promoterless vector pGL4.16 (Promega, Cat#E6711), or pRRL-E-Flag-LG (TP1578) at 20 ng/well. The original antibody plasmids pFUSEss-CHIg-hG1-SARS-CoV-2-mAb (TP1565) and pFUSE2ss-CLIg-hk-SARS-CoV-2-mAb (TP1566) were used as the control. ELISA was performed using a Human IgG (Total) Uncoated ELISA Kit (Invitrogen, Thermo-Fisher, Cat# 88-50550-88). A 96-well Costar ELISA plate (Corning) was first coated with SARS-CoV-2 Spike (S) protein from BEI (Cat# NR52724) at 100 pg/well overnight at 4°C. The washing and blocking steps were performed using the buffers and solutions provided in the kit. Supernatants containing secreted antibodies were collected from the transfections at 24 and 48 h and kept at 4°C until use. Aliquots of 0.5, 2.5 and 5.0 µl antibody supernatants were added to the SARS-CoV-2-S coated wells. After overnight incubation, the wells were washed 4 times (400 µl per well). The horseradish peroxidase (HRP)-conjugated anti-human IgG detection monoclonal antibody in assay buffer (1/250) was added to each well and incubated at room temperature for 2-3 h. The wells were then washed 3 times (400 µl each) and treated with 300 µL of the substrate TMB (3,3',5,5'-tetramethylbenzidine) for 15 min to develop blue color, and the reactions were terminated with 2 N HCl. The yellow color formation was measured at 450 nm using a BioTek microplate reader. The level of anti-SARS-CoV monoclonal antibody was quantified by sigmoidal four-parameter logistic curve (4PL) fit using Prism GraphPad 9.1.
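For reference, a four-parameter logistic (4PL) standard curve and its inverse can be written as below. This is a minimal sketch under stated assumptions: the parametrization shown (bottom, top, EC50, Hill slope) is one common form and may differ from the exact equation used in Prism, the parameter values are hypothetical rather than fitted, and the curve fitting itself is not shown.

```python
def four_pl(x, bottom, top, ec50, hill):
    """4PL response: y = bottom + (top - bottom) / (1 + (ec50 / x) ** hill)."""
    return bottom + (top - bottom) / (1.0 + (ec50 / x) ** hill)

def inverse_four_pl(y, bottom, top, ec50, hill):
    """Concentration x that gives response y on the 4PL curve.

    Only defined for bottom < y < top.
    """
    if not bottom < y < top:
        raise ValueError("y must lie strictly between bottom and top")
    ratio = (top - bottom) / (y - bottom) - 1.0   # equals (ec50 / x) ** hill
    return ec50 / ratio ** (1.0 / hill)

# Hypothetical curve parameters (absorbance at 450 nm vs. IgG concentration).
params = dict(bottom=0.05, top=2.05, ec50=10.0, hill=1.5)
od_at_ec50 = four_pl(10.0, **params)                            # midpoint of the curve
recovered = inverse_four_pl(four_pl(3.0, **params), **params)   # round trip
```

When a sample was diluted before the assay, the interpolated concentration would be multiplied by the dilution factor to obtain the concentration in the original supernatant.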
ER-Golgi transport inhibition with brefeldin A
Brefeldin A (BA, AdipoGen Life Sciences, Cat# AG-CN2-0018) was dissolved in DMSO to make a 1 mg/mL working solution. HEK293T cells were transfected with the indicated vectors using TP5 transfection reagent in DMEM plus 10% FBS as described above. The transfected cells were incubated overnight, and 10 µg/ml BA was added prior to the media change and incubated for 3 h at 37°C in 5% CO2. The culture media was replaced with 293 FreeStyle serum-free media (Gibco, Thermo-Fisher, Cat# 12-338-018) containing 10 µg/ml BA, and incubation was continued for 24 h at 37°C in 5% CO2. The supernatants were withdrawn right after media replacement and collected after 24 h. The cell lysates were also prepared at the 24 h time point. The supernatants and cell lysates were tested for gdLuc activity and by Western blot analysis.
Quantification and statistical analysis
Quantification of fold changes in Q groups compared with the corresponding non-Q groups was performed using Excel software. Statistical analysis was performed using Prism GraphPad 9.1. Significance at *P < 0.05, **P < 0.01 and ***P < 0.001 was determined using a two-tailed Student's t-test between two groups or one-way ANOVA for multiple comparisons. Data were presented as mean ± SE. The size and type of individual samples are indicated and specified in the figure legends.
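The quantities named above (fold change, mean ± SE, and the Student's t statistic for two groups) can be sketched as follows. This is an illustration using only the Python standard library, with hypothetical replicate values and illustrative function names; it assumes the classical equal-variance (pooled) form of the t-test and returns the t statistic and degrees of freedom, leaving the P value to a t-distribution table or statistics package.

```python
import math
import statistics

def mean_and_se(values):
    """Mean and standard error of the mean (sample SD / sqrt(n))."""
    m = statistics.mean(values)
    se = statistics.stdev(values) / math.sqrt(len(values))
    return m, se

def fold_change(q_values, control_values):
    """Ratio of the Q-group mean to the non-Q control mean."""
    return statistics.mean(q_values) / statistics.mean(control_values)

def student_t(a, b):
    """Two-sample Student's t statistic with pooled variance; returns (t, df)."""
    n1, n2 = len(a), len(b)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * statistics.variance(a)
                  + (n2 - 1) * statistics.variance(b)) / df
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(
        pooled_var * (1.0 / n1 + 1.0 / n2))
    return t, df

# Hypothetical replicate readings (arbitrary luminescence units).
q_group = [2.0, 4.0, 6.0]
control = [1.0, 2.0, 3.0]
fc = fold_change(q_group, control)
ctrl_mean, ctrl_se = mean_and_se(control)
t_stat, dof = student_t(q_group, control)
```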
Results
Discovery of a novel heptapeptide Qa in boosting protein expression/production.
To study SARS-CoV-2 viral protein expression in mammalian cells, a dual reporter system was generated to measure viral protein expression quantitatively and dynamically. Gaussia-Dura luciferase (gdLuc) and destabilized green fluorescent protein (dsGFP) were fused, abbreviated LG, onto the C-terminus of SARS-CoV-2 E protein (FIGS. 8A and 14A-14C). This design allows dual measurement of the secretory gdLuc-fused target protein in culture media by the sensitive gdLuc assay and of dsGFP positivity and intensity by fluorescence microscopy and flow cytometry. During the cloning of the E protein-expressing vector, the correct clones were initially screened by restriction enzyme digestion, and the positive clones E1 and E7 were tested for protein expression by gdLuc assay (FIGS. 8B-8C) and fluorescence microscopy (FIGS. 8D and 14A). Surprisingly, E7 exhibited >20-fold higher luciferase activity than E1. The E7 DNA sequence was confirmed by Sanger sequencing. Unexpectedly, it was discovered that E7 had an additional 21-nucleotide sequence that encodes 7 amino acids (aa) in frame, downstream of the Flag tag and upstream of LG. This heptapeptide was designated Qa based on its aa sequence, and its linked LG was named QLG. It was confirmed that transfection of pcDNA6B-E-QLG (E7) exhibited up to 90-fold higher expression than pcDNA6B-E-LG (E1) in HEK293T cells (FIGS. 8B-8D). The effect of this Qa addition on the expression of other SARS-CoV-2 structural proteins, including S, nucleocapsid (N), and membrane (M), and of the accessory proteins NSP2, NSP16 and ORF3 was examined. It was found that Qa boosts the production of all the tested viral proteins (FIGS. 8E, 8F, 9A, 14A-14C, and 15A-15B), with efficiency ranging from 3- to 3848-fold, depending upon the respective protein. Such variation in Qa boosting efficiency may result from differences in cellular density/function, transfection efficiency, reporter dosage, and viral protein types.
Novel and Unique 21-mer Oligonucleotide Cis-Regulatory Motif Contributing to Qa Boosting.
Given that the Qa insertion needs an open reading frame (ORF) with the targeted genes for protein expression and functional detection, it was initially speculated that the in-frame heptapeptide Qa plays a critical role in boosting protein production. Thus, alanine scanning and deletion mutation assays were performed (FIG. 8G) to determine the role of amino acid residues in regulating Qa function at the peptide level. All the tested mutations impaired the boosting activity to various extents, from >57% loss of boosting activity to almost complete loss in the 4A mutation, indicating that each residue of this unique Qa heptapeptide appears to be important for the boosting activity and that residue 4 is the most critical one. To explore the contribution of the underlying oligonucleotides at the RNA level, synonymous (silent, degenerate) mutations were created that change only the nucleotides but not the amino acids. Unexpectedly, it was found that all the degenerate mutants tested showed significant loss (>90%) of Qa boosting activity (FIG. 8H), indicating that Qa boosting activity derived predominantly from the action of the 21-mer oligonucleotide motif rather than the unique heptapeptide. Nonsynonymous (missense) mutation assays were then performed while retaining the ORF required for reporter expression. All the tested mutants lost the boosting activity to various degrees as compared to the parent Qa group (FIG. 8I). These data provide evidence that the sequence (composition) and number of nucleotides of this 21-mer motif are critical for Qa boosting activity. The name Exen21 was assigned to this unique and novel expression-enhancing 21-mer cis-regulatory motif, which encodes an epitope tag (Qa).
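The distinction between synonymous and nonsynonymous mutants of a 21-mer can be checked programmatically by translating both sequences with the standard genetic code. The sketch below is illustrative only: the 21-nucleotide sequences are hypothetical placeholders, not the Exen21 sequence, and the function names are assumptions introduced for this example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq):
    """Translate an in-frame DNA or RNA sequence into one-letter amino acids."""
    seq = seq.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))

def classify_mutant(parent, mutant):
    """Return 'identical', 'synonymous', or 'nonsynonymous' for a same-length mutant."""
    if len(parent) != len(mutant):
        raise ValueError("parent and mutant must have the same length")
    if parent.upper() == mutant.upper():
        return "identical"
    same_peptide = translate(parent) == translate(mutant)
    return "synonymous" if same_peptide else "nonsynonymous"

# Hypothetical 21-mers (placeholders, NOT the Exen21 sequence).
parent = "GCTGAAGATCTGCGTAAACAG"   # encodes AEDLRKQ
silent = "GCCGAGGACCTCCGCAAGCAA"   # different nucleotides, same heptapeptide
missense = "GCTGCAGATCTGCGTAAACAG" # second codon GAA -> GCA (E -> A)
```

A synonymous mutant in this sense keeps the encoded heptapeptide while altering the nucleotide sequence, which is the situation used above to separate peptide-level from RNA-level effects.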
Broad capability of Exen21/Qa addition to boost protein expression/production.
To expand the potential application of Exen21/Qa in boosting protein expression and production, similar assays were performed in different types of proteins, mammalian cells, and species. Similar boosting effects for many non-viral proteins were observed (FIGS. 9B-9E). Interestingly, transfection with a lower amount of plasmid DNA in HEK293T cells yielded higher boosting efficiency for most SARS-CoV-2 viral proteins (FIGS. 9A, 15A and 15B), but not for host cellular gene products such as mouse NIBP4 (FIG. 9B) and human ACE2 (hACE2) (FIG. 9C), or cytokines such as IFNy (FIG. 9D) and IL-2 (FIG. 9E). Exen21/Qa induced stronger boosting of SARS-CoV-2 E protein in the presence of the stronger CAG promoter (FIG. 9F). It was further found that similar boosting of protein expression and production occurred in other cell types including Hela, BHK, and others (FIG. 9G). In addition to being functional in regular plasmids, the Exen21/Qa also exhibited boosting activity in viral transfer vectors such as lentiviral (LV) vectors (FIGS. 9D and 9E).
In summary, the Exen21/Qa addition has a broad capability of boosting protein expression/production across various gene products, vectors, mammalian cell types, and species.
Exen21/Qa enhancement of antibody production.
Monoclonal antibody (mAb)-based therapeutics require the optimization of antibody production in suitable cell culture platforms, which relies on high-performance expression vectors. To achieve this, genetic elements in mAb production vectors have been widely modified. To determine if Exen21/Qa addition plays a role in boosting antibody production, a human anti-SARS-CoV mAb (BEI, CR3022), which contains the constant regions of heavy and light chains (GenBank: DQ168569, DQ168570, respectively), was used as a test platform. Exen21 was inserted into the C-termini of the immunoglobulin heavy and light chains (H/L) of CR3022 to generate Qa-tagged HQ and LQ (FIG. 10A). HQ and LQ were cotransfected into HEK293T cells to generate Qa-tagged mAb, using the original H and L vectors (NR52399 and NR52400) as the controls. mAb-containing supernatants were collected 2-3 days after transfection, and their mAb levels were measured by ELISA using SARS-CoV-2 S protein as the coating antigen (FIGS. 10B and 10C). It was found that Exen21/Qa boosted mAb production by up to 37-fold, with or without normalization of transfection efficiency (FIG. 10D). A boosting efficiency of at least 13-fold on average was obtained from 16 independent experiments, even with varied experimental conditions (cell density, transfection efficiency, and ELISA variations) (FIG. 10E). It was further confirmed by Western blot analyses of cell culture supernatants that Exen21/Qa boosted mAb production (FIG. 10F). The data indicate that Exen21/Qa addition robustly boosts mAb production/secretion.
Exen21/Qa enhancement of SARS-CoV-2 S pseudovirion production.
Pseudotyped virus has been widely used in studies not only for gene delivery, but also for vaccine production, antibody neutralization, cellular entry, and pathogenic exploration. Pseudovirion is an excellent alternative to high-risk viruses such as SARS-CoV-2 and its variants and does not require BSL3 facilities. Pseudovirions are virus-like particles (VLPs) coated with viral surface or membrane proteins that harbor specific cellular tropisms. VLPs pseudotyped with SARS-CoV-2 S protein evoke stronger immune responses than any individual viral protein due to their 3-dimensional structures like those of live virus 8’ ia u. SARS-CoV-2 S protein has been widely used to generate S pseudovirion, but the packaging efficiency for lentivirus-like (LVLP) or vesicular stomatitis virus-like (VSVLP) particles has been low in most reports, even with the codon-optimized C-terminal deletion S protein5, 6’ X 12. Given that Exen21/Qa addition boosts S protein production in mammalian cells, it was speculated that it might boost the packaging efficiency of S pseudotyped LVLP (S-LVLP). Using the widely used C-terminal 18 aa-deleted codon-optimized SARS-CoV-2 S protein (Sdl8) as a test platform (FIG. 11A), it was validated by Western blot analysis that Exen21/Qa addition on the C-terminus of Sdl8 (Sdl8Q) boosted Sdl8 expression (FIG. 11B). It was also found that Exen21/Qa addition increased S-LVLP packaging efficiency by ~2-4-fold in HEK-hACE2 cells (FIG. 11C). To provide dynamic measurement of S-pseudovirion transduction, the packaging efficiency of the dual-reporter LV vector pRRL-E-QLG, which harbors inserts larger than the GFP insert alone, was tested. As expected, the original Sdl8 with transfer vector pRRL-E-QLG showed significantly lower packaging efficiency than Sdl8 with the Exen21/Qa addition (FIGS. 11D and 11E). These data demonstrate that Exen21/Qa addition in the Sdl8 expression system significantly boosts packaging and transduction efficiencies of SARS-CoV-2 S-LVLP.
Exen21/Qa enhancement of lentivirus production.
Viral gene therapy has been extensively studied and actively applied to clinical diseases. Both AAV and LV are the most promising strategies for viral gene therapy, but viral packaging efficiency (production yield) has been a bottleneck. In genome editing by CRISPR/Cas, viral packaging efficiency is also a rate-limiting factor for the development of novel therapeutics. Generally, the level of mRNA supplied by the LV transfer vector can affect LV packaging efficiency. It was hypothesized that Exen21 addition in the LV transfer vector can elevate the transgene mRNA levels during packaging and thereby boost the efficiency of LV packaging and gene delivery. This idea was tested by comparing the LV transfer vectors pRRL-E-LG and pRRL-E-QLG in standard LV packaging (psPAX2 and VSV-G). After LV infection of HEK293T cells, Exen21 increased production of the transgene reporter gdLuc from the transfer vector (FIG. 11F), similar to its boosting efficiency in transfected cells without LV packaging (FIGS. 9D, 9E). However, Exen21 addition in the transfer vector only marginally affected packaging efficiency (i.e., the titer of packaged LV; data not shown). Similar changes were observed with LV-spCas9-Q-RFP and LV-MS2-spCas9-Q-GFP (FIGS. 11G and 11H), for which packaging efficiency is usually < 1% that of standard LV-RFP or LV-GFP. These data provide evidence that the Exen21-induced marginal change in the mRNA level of the transfer gene in the transfer LV vector does not increase packaging efficiency, although Exen21 addition does enhance production of transgene protein in the transduced cells (FIG. 11F). This is consistent with the finding that Exen21 addition augments translation, rather than transcription (FIGS. 12A-12G). It was also tested whether Exen21/Qa addition on the LV packaging proteins such as Gag, Pol, and RRE, via the packaging vector psPAX2, could boost packaging efficiency. Interestingly, Exen21 addition to Gag significantly impaired, rather than augmented, LV packaging, whereas addition to Pol and RRE significantly boosted LV packaging of pRRL-GFP (FIG. 11I). These data provide evidence that proper insertion of Exen21/Qa in the LV packaging vectors could boost the packaging and transduction efficiency.
Exen21/Qa enhancement of vaccine production via increasing mRNA stability and translational efficiency.
Another immediate application of Exen21/Qa addition may be in the elevation of vaccine yields for the urgent fight against the COVID-19 pandemic. Currently, the most promising vaccines against SARS-CoV-2 and its variants are derived from mRNA or DNA encoding S protein13. As shown in the above results, the Exen21/Qa addition increased S protein expression by ~3-24 fold in a CMV-driven cDNA expression vector (FIG. 9A). If such an enhancement of vaccine production is applied at large scale, it would reduce costs and expedite the availability of COVID-19 vaccines. Since mRNA vaccines exhibit numerous advantages over other vaccines and the application of SARS-CoV-2 S protein mRNA-based vaccines is now well-established in humans, it was hypothesized that the Exen21/Qa addition could also boost mRNA-dependent translation of SARS-CoV-2 proteins such as S protein for increasing vaccination efficiency. To test this idea, a capped mRNA with the Exen21 insertion was generated by in vitro transcription (promoter independent), and the effect of Exen21/Qa was examined after mRNA transfection in HEK293T cells (FIGS. 16A-16E). The data showed that the presence of Exen21/Qa significantly increased the production of SARS-CoV-2 S protein from the transfected functional mRNAs in a time- and dose-dependent manner (FIG. 12A). It was found that this protein production-boosting motif can be universally applicable to mRNAs of other SARS-CoV-2 proteins including N, E, and ORF3 and the host cellular gene hACE2 (FIGS. 12B, 12C, and 16A-16E). These data provide evidence that the Exen21/Qa addition could act in a transcription-independent manner (promoterless) by facilitating mRNA stability and/or translational efficiency. To further determine if the Exen21/Qa addition regulates mRNA-dependent translation, the dynamic changes of translational products were measured after inhibiting transcription with actinomycin D. In the absence of Exen21/Qa addition, actinomycin D completely blocked the production of viral protein E (FIG. 12D) and ORF3 (FIG. 12E), measured by gdLuc activity. In contrast, the Exen21/Qa addition showed a time-dependent increase of the protein expression and production/accumulation even with the transcriptional inhibition (FIGS. 12D and 12E), providing evidence that the Exen21/Qa addition in the targeted genes facilitates protein expression and production via posttranscriptional regulation (increased translation efficiency and/or mRNA stability). To further determine if the Exen21 addition influences mRNA stability of the targeted genes, an mRNA decay assay was used for E and S viral proteins. Although E and S viral mRNAs exhibited different patterns of changes during the time course, the Exen21/Qa addition on both viral E (FIG. 12F) and S (FIG. 12G) proteins increased the half-life of the encoding mRNAs by ~6-7 hours.
Taken together, the data indicate that the Exen21 addition in a given target mRNA significantly increases mRNA stability and translational efficiency, and thereby boosts protein expression and production from the targeted mRNA (e.g., S protein mRNA vaccine).
Exen21/Qa boosting of targeted protein secretion.
As was found above, Exen21/Qa addition elevated expression of various types of targeted proteins. When testing whether Exen21/Qa addition boosted expression of the E protein dual reporter within cells (by Western blot analyses on cell lysates), it was unexpectedly found that E-QL protein levels in the lysates of the Exen21/Qa group, detected by Western blot analysis with anti-Flag antibody, were remarkably reduced rather than increased (FIG. 13A), even though Exen21/Qa addition robustly increased gdLuc activity in culture supernatants (FIG. 8C). Similar reductions by Exen21/Qa addition were found in the corresponding intracellular levels of other viral proteins (S and N) and of the host cellular proteins (IFNy, IL-2, and hACE2) (FIGS. 13B and 13C).
Based on these unexpected observations, it was hypothesized that the robust Exen21-induced increases in supernatant gdLuc activity must involve the protein secretion process. This idea is supported by the Exen21-induced boosting that was observed in the antibody secretion (FIGS. 10A-10F) and secretory IFNy and IL-2 (FIGS. 9E and 9F) experiments. To corroborate this secretion-boosting activity, the protein levels of secretory E-Flag-gdLuc were analyzed in the cell culture supernatants using serum-free media. It was found that the cleaved E-Flag-gdLuc and GFP as well as the non-cleaved E-Flag-gdLuc-GFP were detectable by Western blot analyses using anti-gdLuc and anti-GFP antibodies in the unconcentrated supernatants (40 µl from 100 µl) of the Qa-tagged E-QLG group (FIGS. 13D and 17A-17E). Densitometric quantification analysis revealed a 17-fold increase in the level of secretory protein (FIG. 13D), consistent with the boosting also seen in the gdLuc assay (FIG. 13E). The protein secretion was blocked by treatment with the endoplasmic reticulum (ER)-Golgi protein trafficking inhibitor brefeldin A (FIGS. 13F and 18A-18D). To further confirm the secretion-enhancing feature of the Exen21/Qa addition, the non-secretory firefly-luciferase (fLuc) assay was used. Cellular levels of fLuc protein expression and enzyme activity were significantly increased in cell lysates, but no fLuc activity was detectable in the supernatants, even in the presence of Exen21/Qa addition (FIG. 13G), which is consistent with the results for the non-secretory protein spCas9 (FIG. 11G). Thus, the Exen21/Qa addition appears to boost expression of the targeted proteins and facilitate their secretion. It was noted that auto-cleavage by the 2A system of most of the targeted proteins was incomplete, varying among different proteins (FIGS. 11C, 11D and 11F), which has been reported by others14, 15.
Discussion
In this study, the discovery of a novel and unique Exen21/Qa cis-regulatory motif was reported that has versatile capabilities of boosting the expression and secretion of targeted proteins. This cis-regulatory Exen21 resembles the secretion-enhancing cis-regulatory targeting element (SECReTE) that was recently identified by computational analysis to facilitate ER-localized mRNA translation and protein secretion16. This SECReTE motif is enriched in nearly all mRNAs encoding secreted/membrane proteins in eukaryotes, and its addition results in enhanced protein secretion16. It also boosts protein expression and secretion when added to an mRNA for an exogenously expressed protein such as GFP16. However, Exen21 has many features different from SECReTE: (1) No triplet repeats such as NNY or NYN; (2) Unique and exclusive composition/order of the 21 nucleotides; (3) Smaller size (21-mer) than SECReTE (> 30-mer from >10 triplet repeats); and (4) Absence in any cellular or viral genes. In addition, Exen21/Qa is also quite different from activity-enhancing motifs that involve promoter enhancer17-19 or anti-sense activity20. The data herein indicated that adding the Exen21 motif to a given mRNA could remarkably enhance the corresponding protein expression and secretion. This was also demonstrated in different types of proteins including viral, nonviral, intracellular, structural, and secretory proteins. The extent of such enhancement varied, with proteins such as N and ORF3 exhibiting up to thousands-fold increase. It is believed that these findings may be translatable to a paradigm shift in applied protein production in research and industry.
The range, extent, and mechanisms of these Exen21/Qa actions were explored using a variety of tools, approaches, and target proteins. The Exen21/Qa addition robustly augmented production of secretory gdLuc fusion protein derivatives of multiple SARS-CoV-2 structural proteins (S, M, N, and E), the accessory proteins (NSP2, NSP16, and ORF3), and the host cellular gene products (FIGS. 8A-8I and 9A-9G). The protein production-enhancing actions of Exen21/Qa were largely independent of the specific promoter used, among those tested, but it did elicit stronger enhancement of protein production in combination with the stronger CAG promoter (FIGS. 9A-9G). The Exen21/Qa addition enhanced mRNA-dependent production of targeted viral and non-viral protein fusion reporters, as determined by in vitro RNA transcription and mRNA transfection followed by dual reporter assays (FIGS. 12A-12G). Exen21/Qa enhanced the yield of S-containing pseudoviruses and lentivirus packaging (Fig. 4). Exen21/Qa addition increased the release of secreted host proteins, including a robust enhancement of antibody production when Exen21/Qa was placed in antibody heavy and light chains, and augmented the secretion of IFNy and IL-2. Exen21/Qa actions were blocked by the Golgi-trafficking inhibitor brefeldin A. These findings point not only to a wide range of activities elicited by Exen21 addition, but also to potentially important and diverse applications in biotechnology areas such as production of vaccines, monoclonal antibodies, and other biopharmaceuticals where mammalian cell expression systems are needed.
It was found that the Exen21/Qa addition robustly boosted the regulated secretion of secretory proteins such as S protein, antibody, IFNγ and IL-2, but not via any signal peptide-like intracellular targeting mechanism, because it did not induce release of non-secretory proteins such as firefly luciferase and spCas9. This property could potentially prove invaluable for industrial application of such secreted proteins. For example, the Exen21/Qa addition could presumably enhance the production/secretion of S protein in mRNA-based vaccines against SARS-CoV-2 variants, thereby reducing the amount of mRNA needed per vaccination owing to the higher levels of S protein released13 while still providing the same host immune responses.
The ability to boost production yields of viruses or pseudotyped viruses can be invaluable to the fields of gene therapy and biomedical research. The use of pseudotyped viruses has facilitated research on high-risk viruses that require BSL3 facilities. Pseudoviruses of SARS-CoV-2 S protein and its variants have been used extensively in the evaluation of neutralizing antibodies and vaccination, as well as in mechanistic and functional studies5, 6, 12, 21, 22. The bottleneck for generation of S pseudovirions has been the limited packaging efficiencies for LVLP or VSVLP5, 6, 8-12. The new approach herein of adding Exen21/Qa to the Sd18 expression system boosted packaging and transduction efficiencies of SARS-CoV-2 S-LVLP. This strategy has facilitated the ongoing research on the antiviral effect of EGCG and the protective efficiency of serum from vaccinated patients against the emerging SARS-CoV-2 variants23, 24. A challenge in viral gene therapy is the limited efficiency of viral packaging. Using an LV system as the test platform, it was found that the Exen21/Qa addition to the LV transfer vector affected packaging efficiency only marginally, but it boosted the production of transgene protein in the transduced cells or the transfected packaging cells. This was expected, because it was found that Exen21/Qa influences posttranscriptional regulation rather than transcription of targeted genes, whereas LV packaging requires the presence of intermediate RNA from the transfer vector. In the packaging vector psPAX2, the Exen21/Qa addition at the C-termini of Pol and RRE increased LV packaging efficiency, but at the Gag C-terminus it impaired LV packaging. Thus, optimizing Exen21/Qa locations within the LV packaging vector will help maximize Exen21/Qa boosting efficiency in applications. Because the Exen21/Qa addition boosted both Sd18 expression and the packaging efficiency of S-LVLP, the Exen21/Qa addition in VSV-G protein may boost regular LV packaging efficiency. The Exen21/Qa addition at different locations of VSV-G25, 26 may thus maximize its production-boosting efficiency. Likewise, optimizing Exen21/Qa boosting activity on AAV or other viral packaging systems may prove valuable in biopharmaceutical applications.
Many varieties of epitope tags, including Flag, Myc, HA, Ollas, V5, His, C7, and T7, were developed earlier to enable specific research and biotechnological applications such as protein labeling, tracing, immunoaffinity purification, immunostaining, enhancing immunodetection27-34, slowing protein degradation, and conferring solubility35-38. Other tags modulate the activity or function of targeted proteins39, such as N- or C-terminal tagging of PIK3CA, which increases its kinase and membrane binding activity, respectively40. Until now, however, no epitope tag had been discovered that can stimulate protein expression and secretion. A series of mutation analyses, including alanine scanning, deletion, and synonymous and nonsynonymous mutation, demonstrated that the unique 21-mer motif Exen21 with a specific order/number of the nucleotide composition is essential for its boosting activity, which requires that the motif fit in-frame within the ORF of the targeted genes. Thus, the encoded unique heptapeptide Qa can serve as a novel epitope tag that shares features with well-established epitope tags for general applications. Importantly, the Exen21/Qa addition can enhance the intensity of endogenous protein labeling owing to its boosting capacity and thus improve detection sensitivity in applications such as neural network tracing27. A broad and important area yet to be explored is the potential of the Exen21/Qa addition to enhance the expression of targeted, highly specific bioengineered proteins in vivo, such as via novel CRISPR/Cas gene knock-in strategies that could facilitate expression of loss-of-function genes. Such applications would be valuable in treating disorders such as haplo-insufficient mutagenic diseases including Angelman syndrome, Pitt-Hopkins syndrome, and others. In genetic engineering, the Exen21/Qa boost of dominant genes may improve organism phenotypes, such as in agricultural applications. Of course, any potential toxicities or off-target effects of such in vivo expression of Qa-tagged proteins are as yet unknown and untested. Nevertheless, based on prior findings with the well-tested epitope tags both in vitro and in vivo, we do not anticipate any propensity for toxicity of the very small 7-aa Qa tag.
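The relationship between the 21-mer and the heptapeptide, and the in-frame requirement, can be checked with a short Python sketch. The Exen21 sequence (SEQ ID NO: 7) and the Qa peptide QPRFAAA (SEQ ID NO: 1) are taken from the claims. The standard genetic code is assumed, and the flanking ORF fragment used in the fusion example (ATGGCC ... TAA) is a hypothetical placeholder, not a sequence from this disclosure.

```python
from itertools import product

EXEN21 = "CAACCGCGGTTCGCGGCCGCT"  # SEQ ID NO: 7, as recited in the claims
QA = "QPRFAAA"                    # SEQ ID NO: 1, as recited in the claims

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, frame: int = 0) -> str:
    """Translate complete codons of a DNA string starting at the given frame."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]]
                   for i in range(frame, len(dna) - 2, 3))


if __name__ == "__main__":
    # Frame 0 of the 21-mer encodes the heptapeptide; the other frames do not.
    for frame in range(3):
        print(frame, translate(EXEN21, frame))

    # Hypothetical C-terminal fusion: placeholder ORF + Exen21 + stop codon.
    in_frame = "ATGGCC" + EXEN21 + "TAA"
    shifted = "ATGGCCA" + EXEN21 + "TAA"  # one extra base breaks the frame
    print(translate(in_frame), QA in translate(in_frame))
    print(translate(shifted), QA in translate(shifted))
```

Only the in-frame fusion carries the Qa heptapeptide at the protein level; shifting the insert by a single nucleotide yields an unrelated peptide.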
The mechanisms by which Exen21/Qa enhances protein expression/secretion remain largely to be delineated. However, the initial findings indicated that the presence of Exen21/Qa slowed mRNA decay, as the boosting effects persisted during global transcription inhibition by actinomycin D, providing evidence that Exen21/Qa plays a key role in posttranscriptional regulation, which may include increased mRNA stability and perhaps translation efficiency. This Exen21/Qa finding supports the previous proof of concept that the coding sequence harbors numerous regulatory sites that may regulate mRNA location, stability and translation efficiency41. It would be interesting to determine whether the Exen21/Qa cis-regulatory motif has a special secondary RNA structure that can recruit RNA-binding proteins41, directly regulates mRNA stability of targeted proteins42, or binds directly to poly-A or the untranslated region (UTR) to exert its stabilizing effects upon mRNA and boost translation. Because brefeldin A, an inhibitor of the conventional ER-Golgi secretion pathway, blocked Exen21/Qa-stimulated protein secretion, it was speculated that Exen21/Qa may regulate protein retrograde or anterograde trafficking within the ER-Golgi network43-46 and facilitate ER-targeted mRNA translation and protein secretion like SECReTE16. Other secretion inhibitors might be used to identify additional pathways involved in Exen21/Qa-modulated protein secretion, particularly for non-conventionally secreted proteins (e.g., cytokines such as IL-1)47, 48.
In summary, a novel, small (21-mer) and unique cis-regulatory motif, Exen21/Qa, was discovered that can greatly enhance the production of a variety of different types of proteins in mammalian cells, ranging from viral transcripts/proteins, endogenous gene products, vaccines, and antibodies to engineered recombinant proteins. Exen21/Qa has a universal protein production-boosting capacity that should facilitate versatile applications in biomedical research and the biotechnology industry. Library screening related to this master Exen21/Qa is underway to optimize the motif and maximize protein expression/secretion.
References
1. Zhang, J. et al. A systemic and molecular study of subcellular localization of SARS-CoV-2 proteins. Signal Transduct Target Ther 5, 269 (2020).
2. Rezaei, N. et al. Introduction on Coronavirus Disease (COVID-19) Pandemic: The Global Challenge. Adv Exp Med Biol 1318, 1-22 (2021).
3. Kolahchi, Z. et al. COVID-19 and Its Global Economic Impact. Adv Exp Med Biol 1318, 825-837 (2021).
4. Bodnar, B. et al. Emerging role of NIK/IKK2-binding protein (NIBP)/trafficking protein particle complex 9 (TRAPPC9) in nervous system diseases. Transl Res 224, 55-70 (2020).
5. Korber, B. et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell 182, 812-827 e819 (2020).
6. Muik, A. et al. Neutralization of SARS-CoV-2 lineage B.1.1.7 pseudovirus by BNT162b2 vaccine-elicited human sera. Science 371, 1152-1153 (2021).
7. Nie, J. et al. Establishment and validation of a pseudovirus neutralization assay for SARS-CoV-2. Emerg Microbes Infect 9, 680-686 (2020).
8. Walls, A.C. et al. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 181, 281-292 e286 (2020).
9. Weissman, D. et al. D614G Spike Mutation Increases SARS CoV-2 Susceptibility to Neutralization. Cell Host Microbe 29, 23-31 e24 (2021).
10. Wibmer, C.K. et al. SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med (2021).
11. Kuzmina, A. et al. SARS-CoV-2 spike variants exhibit differential infectivity and neutralization resistance to convalescent or post-vaccination sera. Cell Host Microbe (2021).
12. Ou, X. et al. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat Commun 11, 1620 (2020).
13. Rijkers, G.T. et al. Antigen Presentation of mRNA-Based and Virus-Vectored SARS-CoV-2 Vaccines. Vaccines (Basel) 9 (2021).
14. Chaung, J. et al. Cleavage efficient 2A peptides for high level monoclonal antibody expression in CHO cells. MAbs 7, 403-412 (2015).
15. Kim, J.H. et al. High cleavage efficiency of a 2A peptide derived from porcine teschovirus-1 in human cell lines, zebrafish and mice. PLoS One 6, e18556 (2011).
16. Cohen-Zontag, O. et al. A secretion-enhancing cis regulatory targeting element (SECReTE) involved in mRNA localization and protein synthesis. PLoS Genet 15, e1008248 (2019).
17. Erceg, J. et al. Subtle changes in motif positioning cause tissue-specific effects on robustness of an enhancer's activity. PLoS Genet 10, e1004060 (2014).
18. Kheradpour, P. et al. Systematic dissection of regulatory motifs in 2000 predicted human enhancers using a massively parallel reporter assay. Genome Res 23, 800-811 (2013).
19. Ma, S., Shah, S., Bohnert, H.J., Snyder, M. & Dinesh-Kumar, S.P. Incorporating motif analysis into gene co-expression networks reveals novel modular expression pattern and new signaling pathways. PLoS Genet 9, e1003840 (2013).
20. Matveeva, O.V. et al. Identification of sequence motifs in oligonucleotides whose presence is correlated with antisense activity. Nucleic Acids Res 28, 2862-2865 (2000).
21. Donofrio, G. et al. A Simplified SARS-CoV-2 Pseudovirus Neutralization Assay. Vaccines (Basel) 9 (2021).
22. Wibmer, C.K. et al. SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med 21, 622-625 (2021).
23. Liu, J. et al. Correlation of vaccine-elicited antibody levels and neutralizing activities against SARS-CoV-2 and its variants. Clin Transl Med 11, e644 (2021).
24. Liu, J. et al. Epigallocatechin gallate from green tea effectively blocks infection of SARS-CoV-2 and new variants by inhibiting spike binding to ACE2 receptor. Cell Biosci 11, 168 (2021).
25. Schlehuber, L.D. & Rose, J.K. Prediction and identification of a permissive epitope insertion site in the vesicular stomatitis virus glycoprotein. J Virol 78, 5079-5087 (2004).
26. Lorenz, I.C. et al. The stem of vesicular stomatitis virus G can be replaced with the HIV-1 Env membrane-proximal external region without loss of G function or membrane-proximal external region antigenic properties. AIDS Res Hum Retroviruses 30, 1130-1144 (2014).
27. Viswanathan, S. et al. High-performance probes for light and electron microscopy. Nat Methods 12, 568-576 (2015).
28. Pina, A.S., Batalha, I.L., Dias, A. & Roque, A.C.A. Affinity Tags in Protein Purification and Peptide Enrichment: An Overview. Methods Mol Biol 2178, 107-132 (2021).
29. Peighambardoust, S.H., Karami, Z., Pateiro, M. & Lorenzo, J.M. A Review on Health-Promoting, Biological, and Functional Aspects of Bioactive Peptides in Food Applications. Biomolecules 11 (2021).
30. Katayama, S., Corpuz, H.M. & Nakamura, S. Potential of plant-derived peptides for the improvement of memory and cognitive function. Peptides 142, 170571 (2021).
31. Lee, T.H. et al. Novel short peptide tag from a bacterial toxin for versatile applications. J Immunol Methods 479, 112750 (2020).
32. DeCaprio, J. & Kohl, T.O. Tandem Immunoaffinity Purification Using Anti-FLAG and Anti-HA Antibodies. Cold Spring Harb Protoc 2019 (2019).
33. Traenkle, B., Segan, S., Fagbadebo, F.O., Kaiser, P.D. & Rothbauer, U. A novel epitope tagging system to visualize and monitor antigens in live cells with chromobodies. Sci Rep 10, 14267 (2020).
34. Mishra, V. Affinity Tags for Protein Purification. Curr Protein Pept Sci 21, 821-830 (2020).
35. Li, Y. Recombinant production of antimicrobial peptides in Escherichia coli: a review. Protein Expr Purif 80, 260-267 (2011).
36. Bhagawati, M. et al. A mesophilic cysteine-less split intein for protein trans-splicing applications under oxidizing conditions. Proc Natl Acad Sci USA 116, 22164-22172 (2019).
37. Han, X., Ning, W., Ma, X., Wang, X. & Zhou, K. Improving protein solubility and activity by introducing small peptide tags designed with machine learning models. Metab Eng Commun 11, e00138 (2020).
38. Saribas, A.S., White, M.K. & Safak, M. Structure-based release analysis of the JC virus agnoprotein regions: A role for the hydrophilic surface of the major alpha helix domain in release. J Cell Physiol 233, 2343-2359 (2018).
39. Majorek, K.A., Kuhn, M.L., Chruszcz, M., Anderson, W.F. & Minor, W. Double trouble-Buffer selection and His-tag presence may be responsible for nonreproducibility of biomedical experiments. Protein Sci 23, 1359-1368 (2014).
40. Vasan, N. et al. Double PIK3CA mutations in cis increase oncogenicity and sensitivity to PI3Kalpha inhibitors. Science 366, 714-723 (2019).
41. Ding, Y., Lorenz, W.A. & Chuang, J.H. Coding Motif: exact determination of overrepresented nucleotide motifs in coding sequences. BMC Bioinformatics 13, 32 (2012).
42. Boo, S.H. & Kim, Y.K. The emerging role of RNA modifications in the regulation of mRNA stability. Exp Mol Med 52, 400-408 (2020).
43. Kim, J.J., Lipatova, Z. & Segev, N. TRAPP Complexes in Secretion and Autophagy. Front Cell Dev Biol 4, 20 (2016).
44. Pinar, M. et al. TRAPPII regulates exocytic Golgi exit by mediating nucleotide exchange on the Ypt31 ortholog RabERAB11. Proc Natl Acad Sci USA 112, 4346-4351 (2015).
45. Reitz, C. The role of the retromer complex in aging-related neurodegeneration: a molecular and genomic review. Mol Genet Genomics 290, 413-427 (2015).
46. Vardarajan, B.N. et al. Identification of Alzheimer disease-associated variants in genes that regulate retromer function. Neurobiol Aging 33, 2231 e2215-2231 e2230 (2012).
47. Cohen, M.J., Chirico, W.J. & Lipke, P.N. Through the back door: Unconventional protein secretion. Cell Surf 6, 100045 (2020).
48. Ni, D. et al. Canonical Secretomes, Innate Immune Caspase-1-, 4/11-Gasdermin D Non-Canonical Secretomes and Exosomes May Contribute to Maintain Treg-Ness for Treg Immunosuppression, Tissue Repair and Modulate Anti-Tumor Immunity via ROS Pathways. Front Immunol 12, 678201 (2021).
49. Boroujeni, M.E. & Gardaneh, M. The Superiority of Sucrose Cushion Centrifugation to Ultrafiltration and PEGylation in Generating High-Titer Lentivirus Particles and Transducing Stem Cells with Enhanced Efficiency. Mol Biotechnol 60, 185-193 (2018).
OTHER EMBODIMENTS
While the disclosure has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the disclosure, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
The patent and scientific literature referred to herein establishes the knowledge that is available to those with skill in the art. All United States patents and published or unpublished United States patent applications cited herein are incorporated by reference. All published foreign patents and patent applications cited herein are hereby incorporated by reference. All other published references, documents, manuscripts and scientific literature cited herein are hereby incorporated by reference.
Claims
1. A composition comprising an expression enhancing oligonucleotide having 21 nucleic acid bases and includes a cis-regulatory coding motif that retains in-frame of target gene.
2. The composition of claim 1, wherein the expression enhancing oligonucleotide comprises a nucleic acid sequence comprising CAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7).
3. A synthetic oligonucleotide comprising a nucleic acid sequence comprising CAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7).
4. The synthetic oligonucleotide of claim 3, wherein the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
5. The synthetic oligonucleotide of any one of claims 3-4 wherein the oligonucleotide encodes a peptide comprising an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
6. The synthetic oligonucleotide of any one of claims 3-5 wherein the oligonucleotide encodes a peptide comprising the amino acid sequence QPRFAAA (SEQ ID NO: 1).
7. A construct comprising the oligonucleotide of any one of claims 1-6.
8. A chimeric molecule comprising one or more peptide domains and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof.
9. The chimeric molecule of claim 8, wherein the one or more peptide domains comprise from about five amino acids to about twenty amino acids.
10. The chimeric molecule of claim 9, wherein the one or more peptide domains comprise about seven amino acids.
11. The chimeric molecule of claim 8, wherein the one or more peptide domains comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
12. The chimeric molecule of claim 8, wherein the peptide comprises an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
13. The chimeric molecule of claim 12, wherein the peptide comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1).
14. The chimeric molecule of claim 8, wherein the peptide domain comprises Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Aspartate (D), Asparagine (N), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
15. The chimeric molecule of claim 8, wherein the one or more 5’-untranslated region (UTR) sequences or fragments thereof, are derived from one or more viruses.
16. The chimeric molecule of claim 15, wherein the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
17. The chimeric molecule of claim 16, wherein the 5’-UTR and/or 3’-UTR are from a coronavirus.
18. The chimeric molecule of claim 17, wherein the coronavirus is SARS-CoV-2.
19. The chimeric molecule of claim 18, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
20. The chimeric molecule of claim 19, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
21. The chimeric molecule of claim 19, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR.
22. The chimeric molecule of claim 19, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
23. The chimeric molecule of claim 19, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
24. The chimeric molecule of claim 19, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
25. The chimeric molecule of claim 8, further comprising one or more biomolecules operably linked to the one or more peptide domains and/or the one or more 5’-UTR and/or 3’-UTR sequences.
26. The chimeric molecule of claim 25, wherein the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
27. The chimeric molecule of claims 8-26, further comprising one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
28. A host cell comprising an oligonucleotide of any one of claims 1-7 or the chimeric molecule of any one of claims 8-27.
29. A construct encoding the chimeric molecule of claims 8-27.
30. A method of enhancing production of biomolecules, comprising: tagging a desired peptide or a nucleic acid sequence with the chimeric molecule of any one of claims 1-27, by conjugation or cloning, expressing the peptide or nucleic acid sequence, and, harvesting the protein.
31. The method of claim 30, wherein the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
32. A nucleic acid comprising a promoter, a 5’-untranslated region (5’-UTR) sequence, a biomolecule of interest, an oligonucleotide comprising a cis-regulatory coding motif, a 3’-untranslated region (3’-UTR) sequence and combinations thereof.
33. The nucleic acid of claim 32, wherein the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof, are derived from one or more viruses.
34. The nucleic acid of claim 33, wherein the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
35. The nucleic acid of claim 33, wherein the 5’-UTR and/or 3’-UTR are derived from a coronavirus.
36. The nucleic acid of claim 35, wherein the coronavirus is SARS-CoV-2.
37. The nucleic acid of claim 36, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
38. The nucleic acid of claim 36, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
39. The nucleic acid of claim 36, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR.
40. The nucleic acid of claim 36, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
41. The nucleic acid of claim 36, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
42. The nucleic acid of claim 36, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
43. A chimeric molecule comprising one or more oligonucleotides comprising a nucleic acid sequence of CAACCGCGGTTCGCGGCCGCT (SEQ ID NO: 7) and one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof.
44. The chimeric molecule of claim 43, wherein the one or more oligonucleotides encode a peptide comprising from about five amino acids to about twenty amino acids.
45. The chimeric molecule of claim 44, wherein the one or more peptides comprise about seven amino acids.
46. The chimeric molecule of claim 44, wherein the one or more peptides comprise an amino acid sequence having at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
47. The chimeric molecule of claim 44, wherein the one or more peptides comprise an amino acid sequence having at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
48. The chimeric molecule of claim 44, wherein the one or more peptides comprise the amino acid sequence QPRFAAA (SEQ ID NO: 1).
49. The chimeric molecule of claim 43, wherein the one or more peptides comprises a sequence comprising Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Aspartate (D), Asparagine (N), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
50. The chimeric molecule of claim 43, wherein the one or more 5’-untranslated region (UTR) sequences or fragments thereof, are derived from one or more viruses.
51. The chimeric molecule of claim 50, wherein the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
52. The chimeric molecule of claim 51, wherein the 5’-UTR and/or 3’-UTR are from a coronavirus.
53. The chimeric molecule of claim 52, wherein the coronavirus is SARS-CoV-2.
54. The chimeric molecule of claim 53, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
55. The chimeric molecule of claim 53, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
56. The chimeric molecule of claim 53, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR.
57. The chimeric molecule of claim 53, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
58. The chimeric molecule of claim 53, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
59. The chimeric molecule of claim 53, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
60. The chimeric molecule of claim 43, further comprising one or more biomolecules operably linked to the one or more oligonucleotides and/or the one or more 5’-UTR and/or 3’-UTR sequences.
61. The chimeric molecule of claim 60, wherein the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
62. The chimeric molecule of claims 43-61, further comprising one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
63. An expression vector comprising the nucleic acids of any one of claims 43-62.
64. A host cell comprising the nucleic acid of any one of claims 1-63.
65. A chimeric molecule comprising one or more 5’- and/or 3’-untranslated region (UTR) sequences or fragments thereof associated with one or more biomolecules.
66. The chimeric molecule of claim 65, wherein the one or more 5’-untranslated region (UTR) and/or 3’-UTR sequences or fragments thereof, are derived from one or more viruses.
67. The chimeric molecule of claim 66, wherein the one or more viruses comprise coronaviruses, retroviruses, picornaviruses, togaviruses, orthomyxoviruses, rhabdoviruses or combinations thereof.
68. The chimeric molecule of claim 67, wherein the 5’-UTR and/or 3’-UTR are from a coronavirus.
69. The chimeric molecule of claim 68, wherein the coronavirus is SARS-CoV-2.
70. The chimeric molecule of claim 69, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 5’-UTR.
71. The chimeric molecule of claim 69, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 5’-UTR.
72. The chimeric molecule of claim 69, wherein the one or more 5’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 5’-UTR.
73. The chimeric molecule of claim 69, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 70% sequence identity to a SARS-CoV-2 3’-UTR.
74. The chimeric molecule of claim 69, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a nucleic acid sequence having at least a 90% sequence identity to a SARS-CoV-2 3’-UTR.
75. The chimeric molecule of claim 69, wherein the one or more 3’-UTR nucleic acid sequences or fragments thereof, comprise a SARS-CoV-2 3’-UTR.
76. The chimeric molecule of claim 65, wherein the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
77. The chimeric molecule of any one of claims 65-76, further comprising one or more promoters and/or regulatory sequences operably linked to the UTRs or biomolecule.
78. A synthetic peptide tag comprising an amino acid sequence unit of about five to about fifteen amino acids wherein the N-terminal and/or C-terminal amino acids are linked or fused to a target molecule.
79. The synthetic peptide tag of claim 78, wherein the amino acid sequence unit comprises seven amino acids.
80. The synthetic peptide tag of claim 79, wherein the amino acid sequence comprises at least a 70% sequence identity to QPRFAAA (SEQ ID NO: 1).
81. The synthetic peptide tag of claim 79, wherein the amino acid sequence comprises at least a 90% sequence identity to QPRFAAA (SEQ ID NO: 1).
82. The synthetic peptide tag of claim 79, wherein the amino acid sequence comprises the amino acid sequence QPRFAAA (SEQ ID NO: 1).
83. The synthetic peptide tag of claim 78, wherein the amino acid sequence comprises the amino acid sequence wherein the peptide domain comprises Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Aspartate (D), Asparagine (N), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
84. The synthetic peptide tag of claim 78, further comprising a plurality of repeating amino acid sequence units.
85. The synthetic peptide tag of claim 84, wherein the repeating amino acid sequence units are in tandem.
86. The synthetic peptide tag of claim 85, wherein the amino acid sequence units are separated by linker molecules or one or more amino acids.
87. A synthetic peptide comprising the structure: (AA-AA-AA-AA-AA-AAz-AAz)x, wherein x is greater than or equal to 1, z is 0 or 1 and each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
88. A synthetic peptide comprising the structure: AA1-AA2-AA3-AA4-AA5-AA6-AA7, wherein each AA is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
89. A synthetic peptide comprising an amino acid sequence comprising the structure: Xn-QPRFAAA-Xn, wherein n is independently 0 or greater than or equal to 1 and each X is independently: Alanine (A), Arginine (R), Asparagine (N), Aspartate (D), Cysteine (C), Glutamate (E), Glutamine (Q), Glycine (G), Histidine (H), Isoleucine (I), Leucine (L), Lysine (K), Methionine (M), Phenylalanine (F), Proline (P), Serine (S), Threonine (T), Tryptophan (W), Tyrosine (Y), Valine (V), Selenocysteine, Pyrrolysine, modified amino acids or combinations thereof.
90. A fusion protein comprising a synthetic peptide of any one of claims 78-89, fused to one or more target peptides.
91. The fusion protein of claim 90, wherein two or more synthetic peptides of any one of claims 78-89 are fused to a target peptide.
92. A fusion molecule comprising a synthetic peptide of any one of claims 78-91, fused to one or more biomolecules.
93. The fusion molecule of claim 92, wherein the biomolecule comprises: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
94. A method of enhancing production of proteins comprising: tagging a desired peptide or a nucleic acid sequence with the peptide tag of any one of claims 78-93, by fusion or cloning; expressing the peptide or nucleic acid sequence; and harvesting the protein.
95. The method of claim 94, wherein the proteins comprise: viral transcripts/proteins, vaccines, antibodies, an mRNA, an mRNA vaccine, a DNA vaccine, peptide vaccines, an oligonucleotide, a polynucleotide, a peptide, a polypeptide, engineered recombinant proteins, synthetic peptides, natural peptides, cellular proteins, virions, antigens or biomimetics.
96. A composition comprising a peptide-tagged biomolecule according to any one of claims 78-93 and a pharmaceutically acceptable excipient, diluent or carrier.
97. A nucleic acid encoding the peptide tag according to any one of claims 78-93.
98. An expression vector comprising the nucleic acid according to claim 97.
99. A host cell comprising the expression vector according to claim 98.
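The sequence relationships recited in claims 80-86 and 89 (percent identity to QPRFAAA, the Xn-QPRFAAA-Xn form, and tandem repeats with optional linkers) can be illustrated with a minimal sketch. It assumes ungapped, position-by-position comparison of equal-length sequences and one-letter residue codes (U for selenocysteine, O for pyrrolysine). The function names and the example linker are hypothetical, and the application may define sequence identity differently, for example through an alignment tool.

```python
import re

TAG = "QPRFAAA"  # SEQ ID NO: 1 as recited in claims 80-82

# One-letter codes for the residues listed in claims 83 and 87-89.
# U (selenocysteine) and O (pyrrolysine) are the usual IUPAC letters.
# "Modified amino acids" have no one-letter code and are not modeled here.
RESIDUES = "ACDEFGHIKLMNPQRSTVWYUO"


def percent_identity(candidate: str, reference: str = TAG) -> float:
    """Ungapped identity (assumption): matching positions / reference length."""
    if len(candidate) != len(reference):
        raise ValueError("ungapped comparison needs equal-length sequences")
    matches = sum(a == b for a, b in zip(candidate.upper(), reference.upper()))
    return 100.0 * matches / len(reference)


def has_flanked_tag(sequence: str, tag: str = TAG) -> bool:
    """True if the sequence reads Xn-TAG-Xn with n >= 0 on either side."""
    pattern = f"[{RESIDUES}]*{tag}[{RESIDUES}]*"
    return re.fullmatch(pattern, sequence.upper()) is not None


def tandem_tag(units: int, linker: str = "") -> str:
    """Repeat TAG in tandem, optionally separated by a linker sequence."""
    if units < 1:
        raise ValueError("units must be at least 1")
    return linker.join([TAG] * units)
```

Under this ungapped definition, a seven-residue unit with a single mismatch scores 6/7 (about 85.7%), so it would meet the 70% threshold of claim 80 but not the 90% threshold of claim 81. A call such as `tandem_tag(2, "GGS")` builds two units joined by a hypothetical GGS linker, corresponding to the arrangement described in claims 85 and 86.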
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163219596P | 2021-07-08 | 2021-07-08 | |
US202163219587P | 2021-07-08 | 2021-07-08 | |
US202163219599P | 2021-07-08 | 2021-07-08 | |
US202263332378P | 2022-04-19 | 2022-04-19 | |
PCT/US2022/036367 WO2023283342A2 (en) | 2021-07-08 | 2022-07-07 | Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins |
Publications (1)
Publication Number | Publication Date |
---|---|
EP4366767A2 (en) | 2024-05-15 |
Family
ID=84800934
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22838408.7A Pending EP4366767A2 (en) | 2021-07-08 | 2022-07-07 | Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP4366767A2 (en) |
CA (1) | CA3226284A1 (en) |
WO (1) | WO2023283342A2 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4366767A2 (en) * | 2021-07-08 | 2024-05-15 | Temple University - Of The Commonwealth System of Higher Education | Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8088889B2 (en) * | 2002-05-02 | 2012-01-03 | David Lovejoy | Teneurin c-terminal associated peptides (TCAP) and uses thereof |
JP2010535492A (en) * | 2007-08-03 | 2010-11-25 | センティネラ ファーマスティカル インコーポレイテッド | Genes and proteins for biosynthesis of lantibiotic 107891 |
US9283287B2 (en) * | 2012-04-02 | 2016-03-15 | Moderna Therapeutics, Inc. | Modified polynucleotides for the production of nuclear proteins |
EP3030663B1 (en) * | 2013-07-19 | 2019-09-04 | Monsanto Technology LLC | Compositions and methods for controlling leptinotarsa |
WO2017191274A2 (en) * | 2016-05-04 | 2017-11-09 | Curevac Ag | Rna encoding a therapeutic protein |
JP2022546699A (en) * | 2019-08-30 | 2022-11-07 | イェール ユニバーシティー | Compositions and methods for delivering nucleic acids to cells |
EP4366767A2 (en) * | 2021-07-08 | 2024-05-15 | Temple University - Of The Commonwealth System of Higher Education | Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins |
2022
- 2022-07-07: EP application EP22838408.7A (EP4366767A2), active, Pending
- 2022-07-07: WO application PCT/US2022/036367 (WO2023283342A2), active, Application Filing
- 2022-07-07: CA application CA3226284A (CA3226284A1), active, Pending
Also Published As
Publication number | Publication date |
---|---|
WO2023283342A2 (en) | 2023-01-12 |
WO2023283342A3 (en) | 2023-10-05 |
CA3226284A1 (en) | 2023-01-12 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
JP7350485B2 (en) | Compositions and methods for improving viral vector efficiency | |
KR20200083550A (en) | How to rescue a stop codon through gene redirection by ACE-tRNA | |
Fournier et al. | Recruitment of RED-SMU1 complex by Influenza A Virus RNA polymerase to control Viral mRNA splicing | |
JP5432415B2 (en) | Capping prone RNA polymerase enzyme and its application | |
AU2021230545A1 (en) | Improved methods and compositions for modulating a genome | |
KR20210044213A (en) | Vesicles for trace-free delivery of guide RNA molecules and/or guide RNA molecule/RNA-guided nuclease complex(s) and methods of production thereof | |
WO2016164797A1 (en) | Activatable crispr/cas9 for spatial and temporal control of genome editing | |
KR20240133774A (en) | Regulation of gene expression by aptamer-mediated modulation of alternative splicing | |
Bex et al. | Phosphorylation of the human T-cell leukemia virus type 1 transactivator tax on adjacent serine residues is critical for tax activation | |
Cressman et al. | Mechanisms of nuclear import and export that control the subcellular localization of class II transactivator | |
Zanker et al. | Influenza A virus infection induces viral and cellular defective ribosomal products encoded by alternative reading frames | |
JP2023537158A (en) | KRAB fusion suppressor and method and composition for suppressing gene expression | |
EP4366767A2 (en) | Oligonucleotides and viral untranslated region (utr) for increasing expression of target genes and proteins | |
CN115768487A (en) | CRISPR inhibition for facioscapulohumeral muscular dystrophy | |
JP5385796B2 (en) | Method for producing active scFv antibodies and libraries thereof | |
WO2020260899A1 (en) | Screen for inhibitors | |
Gould et al. | Cellular mRNAs access second ORFs using a novel amino acid sequence-dependent coupled translation termination–reinitiation mechanism | |
KR102208919B1 (en) | Cell transfection of nucleic acid using nano-assembly by fusion peptide and calcium ion and its application | |
WO2019117057A1 (en) | Cell-membrane-permeable peptide | |
CN117979994A (en) | Oligonucleotides and viral untranslated regions (UTRs) for increased expression of target genes and proteins | |
Zhang et al. | Cellular protein TTRAP interacts with HIV-1 integrase to facilitate viral integration | |
Lee et al. | Multicistronic IVT mRNA for simultaneous expression of multiple fluorescent proteins | |
WO2007093449A2 (en) | Method and means for high- throughput -screening of compounds that exhibit anti -arenavirus activity | |
CN113811317A (en) | Mutant VSV ectodomain polypeptides and uses thereof | |
KR20080012437A (en) | Transmembrane delivery peptide and bio-material comprising the same |
Legal Events
Code | Title | Description |
---|---|---|
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE |
PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE |
17P | Request for examination filed | Effective date: 20240208 |
AK | Designated contracting states | Kind code of ref document: A2 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
DAV | Request for validation of the European patent (deleted) | |
DAX | Request for extension of the European patent (deleted) | |