WO2024018188A1 - Virus-like particles, heterodimeric capsid proteins and methods of production thereof - Google Patents
Virus-like particles, heterodimeric capsid proteins and methods of production thereof Download PDFInfo
- Publication number
- WO2024018188A1 WO2024018188A1 PCT/GB2023/051873 GB2023051873W WO2024018188A1 WO 2024018188 A1 WO2024018188 A1 WO 2024018188A1 GB 2023051873 W GB2023051873 W GB 2023051873W WO 2024018188 A1 WO2024018188 A1 WO 2024018188A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- suitably
- amino acid
- acid sequence
- capsid protein
- monomer
- Prior art date
Links
- 108090000565 Capsid Proteins Proteins 0.000 title claims abstract description 234
- 102100023321 Ceruloplasmin Human genes 0.000 title claims abstract description 233
- 238000000034 method Methods 0.000 title claims abstract description 64
- 239000002245 particle Substances 0.000 title claims abstract description 36
- 238000004519 manufacturing process Methods 0.000 title description 24
- 239000000178 monomer Substances 0.000 claims abstract description 317
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 242
- 230000035772 mutation Effects 0.000 claims abstract description 221
- 230000027455 binding Effects 0.000 claims abstract description 182
- 230000003612 virological effect Effects 0.000 claims abstract description 155
- 239000000833 heterodimer Substances 0.000 claims abstract description 112
- 230000008569 process Effects 0.000 claims abstract description 32
- 230000009881 electrostatic interaction Effects 0.000 claims abstract description 31
- 108091008324 binding proteins Proteins 0.000 claims description 184
- 108090000623 proteins and genes Proteins 0.000 claims description 173
- 102000004169 proteins and genes Human genes 0.000 claims description 167
- 150000007523 nucleic acids Chemical class 0.000 claims description 153
- 102000039446 nucleic acids Human genes 0.000 claims description 127
- 108020004707 nucleic acids Proteins 0.000 claims description 127
- 230000002163 immunogen Effects 0.000 claims description 54
- 239000000203 mixture Substances 0.000 claims description 44
- 208000006454 hepatitis Diseases 0.000 claims description 41
- 231100000283 hepatitis Toxicity 0.000 claims description 41
- 241000283923 Marmota monax Species 0.000 claims description 36
- 102200042487 rs141568342 Human genes 0.000 claims description 30
- 238000012217 deletion Methods 0.000 claims description 29
- 230000037430 deletion Effects 0.000 claims description 29
- 102200101168 rs1554341499 Human genes 0.000 claims description 24
- 101710183938 Barstar Proteins 0.000 claims description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 20
- 238000012258 culturing Methods 0.000 claims description 18
- 231100000699 Bacterial toxin Toxicity 0.000 claims description 16
- 239000000688 bacterial toxin Substances 0.000 claims description 16
- 102200074797 rs111033564 Human genes 0.000 claims description 13
- 239000003112 inhibitor Substances 0.000 claims description 12
- 238000002156 mixing Methods 0.000 claims description 12
- 239000003814 drug Substances 0.000 claims description 11
- 206010019799 Hepatitis viral Diseases 0.000 claims description 8
- 201000001862 viral hepatitis Diseases 0.000 claims description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 7
- 102000023732 binding proteins Human genes 0.000 claims 4
- 102220479829 Voltage-dependent L-type calcium channel subunit beta-2_R56K_mutation Human genes 0.000 claims 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 59
- 201000010099 disease Diseases 0.000 abstract description 56
- 229960005486 vaccine Drugs 0.000 abstract description 17
- 230000002265 prevention Effects 0.000 abstract description 10
- 238000011160 research Methods 0.000 abstract description 8
- 238000003745 diagnosis Methods 0.000 abstract description 7
- 102000014914 Carrier Proteins Human genes 0.000 description 180
- 235000018102 proteins Nutrition 0.000 description 155
- 239000013598 vector Substances 0.000 description 104
- 210000004027 cell Anatomy 0.000 description 103
- 108091007433 antigens Proteins 0.000 description 60
- 102000036639 antigens Human genes 0.000 description 60
- 239000000427 antigen Substances 0.000 description 56
- 238000006467 substitution reaction Methods 0.000 description 51
- 229940024606 amino acid Drugs 0.000 description 50
- 150000001413 amino acids Chemical class 0.000 description 49
- 210000000234 capsid Anatomy 0.000 description 45
- 230000015572 biosynthetic process Effects 0.000 description 37
- 241000700605 Viruses Species 0.000 description 29
- 102200094610 rs397518416 Human genes 0.000 description 29
- 241000588724 Escherichia coli Species 0.000 description 28
- 230000014509 gene expression Effects 0.000 description 27
- 241000282414 Homo sapiens Species 0.000 description 23
- 239000000539 dimer Substances 0.000 description 23
- 238000007385 chemical modification Methods 0.000 description 22
- 239000003795 chemical substances by application Substances 0.000 description 22
- 208000002672 hepatitis B Diseases 0.000 description 21
- 230000004048 modification Effects 0.000 description 18
- 238000012986 modification Methods 0.000 description 18
- 239000000710 homodimer Substances 0.000 description 17
- 108010016529 Bacillus amyloliquefaciens ribonuclease Proteins 0.000 description 16
- 230000000295 complement effect Effects 0.000 description 16
- 241001492404 Woodchuck hepatitis virus Species 0.000 description 15
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 15
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 14
- 230000003993 interaction Effects 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- 102000005962 receptors Human genes 0.000 description 11
- 108020003175 receptors Proteins 0.000 description 11
- 241000894006 Bacteria Species 0.000 description 10
- 102000000844 Cell Surface Receptors Human genes 0.000 description 10
- 108010001857 Cell Surface Receptors Proteins 0.000 description 10
- 108020004414 DNA Proteins 0.000 description 10
- 125000005842 heteroatom Chemical group 0.000 description 10
- 230000001225 therapeutic effect Effects 0.000 description 10
- 241000700721 Hepatitis B virus Species 0.000 description 9
- 150000001335 aliphatic alkanes Chemical class 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 9
- 241001678559 COVID-19 virus Species 0.000 description 8
- 102000004127 Cytokines Human genes 0.000 description 8
- 108090000695 Cytokines Proteins 0.000 description 8
- 102000003816 Interleukin-13 Human genes 0.000 description 8
- 108090000176 Interleukin-13 Proteins 0.000 description 8
- 102100021596 Interleukin-31 Human genes 0.000 description 8
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 8
- 239000004472 Lysine Substances 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 108020001507 fusion proteins Proteins 0.000 description 8
- 229930195712 glutamate Natural products 0.000 description 8
- 230000002757 inflammatory effect Effects 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 108010067003 Interleukin-33 Proteins 0.000 description 7
- 102000017761 Interleukin-33 Human genes 0.000 description 7
- 102000025171 antigen binding proteins Human genes 0.000 description 7
- 108091000831 antigen binding proteins Proteins 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 238000004587 chromatography analysis Methods 0.000 description 7
- 102000037865 fusion proteins Human genes 0.000 description 7
- 208000015181 infectious disease Diseases 0.000 description 7
- 239000006166 lysate Substances 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 108010073254 Colicins Proteins 0.000 description 6
- 101001043821 Homo sapiens Interleukin-31 Proteins 0.000 description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 6
- 241001529936 Murinae Species 0.000 description 6
- 229940009098 aspartate Drugs 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 230000001086 cytosolic effect Effects 0.000 description 6
- 229940079593 drug Drugs 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 230000016784 immunoglobulin production Effects 0.000 description 6
- 239000000411 inducer Substances 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 239000012678 infectious agent Substances 0.000 description 6
- 230000036961 partial effect Effects 0.000 description 6
- 238000001556 precipitation Methods 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 5
- 101000998146 Homo sapiens Interleukin-17A Proteins 0.000 description 5
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 5
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 5
- 102100033461 Interleukin-17A Human genes 0.000 description 5
- 102100024407 Jouberin Human genes 0.000 description 5
- 206010028980 Neoplasm Diseases 0.000 description 5
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 5
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 5
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 5
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- -1 as shown Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 230000028993 immune response Effects 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 5
- 239000000546 pharmaceutical excipient Substances 0.000 description 5
- 239000002244 precipitate Substances 0.000 description 5
- 238000002560 therapeutic procedure Methods 0.000 description 5
- 239000004475 Arginine Substances 0.000 description 4
- 208000023275 Autoimmune disease Diseases 0.000 description 4
- 101710167800 Capsid assembly scaffolding protein Proteins 0.000 description 4
- 208000024172 Cardiovascular disease Diseases 0.000 description 4
- 208000035473 Communicable disease Diseases 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- 241000701806 Human papillomavirus Species 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 101710130420 Probable capsid assembly scaffolding protein Proteins 0.000 description 4
- 201000004681 Psoriasis Diseases 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 101710204410 Scaffold protein Proteins 0.000 description 4
- 238000001042 affinity chromatography Methods 0.000 description 4
- 238000005571 anion exchange chromatography Methods 0.000 description 4
- 241000617156 archaeon Species 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 229960003121 arginine Drugs 0.000 description 4
- 208000006673 asthma Diseases 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 231100000676 disease causative agent Toxicity 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 230000001433 inducive effect Effects 0.000 description 4
- 208000027866 inflammatory disease Diseases 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 3
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 3
- 241000193738 Bacillus anthracis Species 0.000 description 3
- 102000019034 Chemokines Human genes 0.000 description 3
- 108010012236 Chemokines Proteins 0.000 description 3
- 241000193163 Clostridioides difficile Species 0.000 description 3
- 241000711573 Coronaviridae Species 0.000 description 3
- 206010012335 Dependence Diseases 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 241000606768 Haemophilus influenzae Species 0.000 description 3
- 108090000144 Human Proteins Proteins 0.000 description 3
- 102000003839 Human Proteins Human genes 0.000 description 3
- 241000725303 Human immunodeficiency virus Species 0.000 description 3
- 101710181613 Interleukin-31 Proteins 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 208000012902 Nervous system disease Diseases 0.000 description 3
- 208000025966 Neurological disease Diseases 0.000 description 3
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 3
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 229940096437 Protein S Drugs 0.000 description 3
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 101710198474 Spike protein Proteins 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 206010003246 arthritis Diseases 0.000 description 3
- 239000002585 base Substances 0.000 description 3
- 238000002869 basic local alignment search tool Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000005034 decoration Methods 0.000 description 3
- 230000005786 degenerative changes Effects 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 229940023064 escherichia coli Drugs 0.000 description 3
- 238000011049 filling Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 229940047650 haemophilus influenzae Drugs 0.000 description 3
- 229960002885 histidine Drugs 0.000 description 3
- 230000008105 immune reaction Effects 0.000 description 3
- 230000000415 inactivating effect Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000003446 ligand Substances 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 206010025135 lupus erythematosus Diseases 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000013411 master cell bank Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000000869 mutational effect Effects 0.000 description 3
- 239000002105 nanoparticle Substances 0.000 description 3
- 230000001613 neoplastic effect Effects 0.000 description 3
- 208000015122 neurodegenerative disease Diseases 0.000 description 3
- IOQPZZOEVPZRBK-UHFFFAOYSA-N octan-1-amine Chemical compound CCCCCCCCN IOQPZZOEVPZRBK-UHFFFAOYSA-N 0.000 description 3
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 3
- 229920000053 polysorbate 80 Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 102200081526 rs121913583 Human genes 0.000 description 3
- 238000000527 sonication Methods 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 229930101283 tetracycline Natural products 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- SNICXCGAKADSCV-JTQLQIEISA-N (-)-Nicotine Chemical compound CN1CCC[C@H]1C1=CC=CN=C1 SNICXCGAKADSCV-JTQLQIEISA-N 0.000 description 2
- SNKAWJBJQDLSFF-NVKMUCNASA-N 1,2-dioleoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC SNKAWJBJQDLSFF-NVKMUCNASA-N 0.000 description 2
- 102000051403 ADAMTS4 Human genes 0.000 description 2
- 108091005664 ADAMTS4 Proteins 0.000 description 2
- 241000186041 Actinomyces israelii Species 0.000 description 2
- 206010002329 Aneurysm Diseases 0.000 description 2
- 108091023037 Aptamer Proteins 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 201000001320 Atherosclerosis Diseases 0.000 description 2
- 241000193755 Bacillus cereus Species 0.000 description 2
- 208000035143 Bacterial infection Diseases 0.000 description 2
- 241000606124 Bacteroides fragilis Species 0.000 description 2
- 241001518086 Bartonella henselae Species 0.000 description 2
- 241000606108 Bartonella quintana Species 0.000 description 2
- 241000588832 Bordetella pertussis Species 0.000 description 2
- 241000180135 Borrelia recurrentis Species 0.000 description 2
- 241001148604 Borreliella afzelii Species 0.000 description 2
- 241000589969 Borreliella burgdorferi Species 0.000 description 2
- 241001148605 Borreliella garinii Species 0.000 description 2
- 208000003174 Brain Neoplasms Diseases 0.000 description 2
- 241000589567 Brucella abortus Species 0.000 description 2
- 241001509299 Brucella canis Species 0.000 description 2
- 241001148106 Brucella melitensis Species 0.000 description 2
- 241001148111 Brucella suis Species 0.000 description 2
- 208000025721 COVID-19 Diseases 0.000 description 2
- 241000589875 Campylobacter jejuni Species 0.000 description 2
- 206010008342 Cervix carcinoma Diseases 0.000 description 2
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- 241001502567 Chikungunya virus Species 0.000 description 2
- 241001647372 Chlamydia pneumoniae Species 0.000 description 2
- 241001647378 Chlamydia psittaci Species 0.000 description 2
- 241000606153 Chlamydia trachomatis Species 0.000 description 2
- 102000009016 Cholera Toxin Human genes 0.000 description 2
- 108010049048 Cholera Toxin Proteins 0.000 description 2
- 241000193155 Clostridium botulinum Species 0.000 description 2
- 241000193468 Clostridium perfringens Species 0.000 description 2
- 241000193449 Clostridium tetani Species 0.000 description 2
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 2
- 241000186226 Corynebacterium glutamicum Species 0.000 description 2
- 241000150230 Crimean-Congo hemorrhagic fever orthonairovirus Species 0.000 description 2
- 208000011231 Crohn disease Diseases 0.000 description 2
- IELOKBJPULMYRW-NJQVLOCASA-N D-alpha-Tocopheryl Acid Succinate Chemical compound OC(=O)CCC(=O)OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C IELOKBJPULMYRW-NJQVLOCASA-N 0.000 description 2
- 206010012289 Dementia Diseases 0.000 description 2
- 241000725619 Dengue virus Species 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- 201000004624 Dermatitis Diseases 0.000 description 2
- 241001115402 Ebolavirus Species 0.000 description 2
- 241001466953 Echovirus Species 0.000 description 2
- 241000710188 Encephalomyocarditis virus Species 0.000 description 2
- 241000194032 Enterococcus faecalis Species 0.000 description 2
- 241000194031 Enterococcus faecium Species 0.000 description 2
- 241000991587 Enterovirus C Species 0.000 description 2
- 102220499737 Epiphycan_H569A_mutation Human genes 0.000 description 2
- 102220561793 Forkhead box protein C1_I87K_mutation Human genes 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- 241000589602 Francisella tularensis Species 0.000 description 2
- 102000053171 Glial Fibrillary Acidic Human genes 0.000 description 2
- 101710193519 Glial fibrillary acidic protein Proteins 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 208000035895 Guillain-Barré syndrome Diseases 0.000 description 2
- 241000150562 Hantaan orthohantavirus Species 0.000 description 2
- 241000590002 Helicobacter pylori Species 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 241000711549 Hepacivirus C Species 0.000 description 2
- 208000005176 Hepatitis C Diseases 0.000 description 2
- 208000005331 Hepatitis D Diseases 0.000 description 2
- 241000724675 Hepatitis E virus Species 0.000 description 2
- 208000037262 Hepatitis delta Diseases 0.000 description 2
- 241000724709 Hepatitis delta virus Species 0.000 description 2
- 241000709721 Hepatovirus A Species 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 244000309467 Human Coronavirus Species 0.000 description 2
- 241000598171 Human adenovirus sp. Species 0.000 description 2
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 2
- 241001479210 Human astrovirus Species 0.000 description 2
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 2
- 241001207270 Human enterovirus Species 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- 241000711920 Human orthopneumovirus Species 0.000 description 2
- 241000430519 Human rhinovirus sp. Species 0.000 description 2
- 241000947839 Human torovirus Species 0.000 description 2
- 241000712431 Influenza A virus Species 0.000 description 2
- 241000713196 Influenza B virus Species 0.000 description 2
- 241000713297 Influenza C virus Species 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 241000710842 Japanese encephalitis virus Species 0.000 description 2
- 241000588747 Klebsiella pneumoniae Species 0.000 description 2
- 241000710912 Kunjin virus Species 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 102220477854 Laforin_R538A_mutation Human genes 0.000 description 2
- 241000712902 Lassa mammarenavirus Species 0.000 description 2
- 241000589242 Legionella pneumophila Species 0.000 description 2
- 241000589929 Leptospira interrogans Species 0.000 description 2
- 241001135196 Leptospira noguchii Species 0.000 description 2
- 241001135198 Leptospira santarosai Species 0.000 description 2
- 241001135200 Leptospira weilii Species 0.000 description 2
- 241000186779 Listeria monocytogenes Species 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 241000712079 Measles morbillivirus Species 0.000 description 2
- 206010049567 Miller Fisher syndrome Diseases 0.000 description 2
- 241000700560 Molluscum contagiosum virus Species 0.000 description 2
- 241000711386 Mumps virus Species 0.000 description 2
- 241000186362 Mycobacterium leprae Species 0.000 description 2
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 2
- 241000187917 Mycobacterium ulcerans Species 0.000 description 2
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 2
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 2
- 241000588650 Neisseria meningitidis Species 0.000 description 2
- 241000526636 Nipah henipavirus Species 0.000 description 2
- 241000187678 Nocardia asteroides Species 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 201000005702 Pertussis Diseases 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 108010093965 Polymyxin B Proteins 0.000 description 2
- 241001505332 Polyomavirus sp. Species 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 241000711798 Rabies lyssavirus Species 0.000 description 2
- 241000606701 Rickettsia Species 0.000 description 2
- 241001137860 Rotavirus A Species 0.000 description 2
- 241001137861 Rotavirus B Species 0.000 description 2
- 241001506005 Rotavirus C Species 0.000 description 2
- 241000710799 Rubella virus Species 0.000 description 2
- 241000608282 Sagiyama virus Species 0.000 description 2
- 241000033084 Salivirus A Species 0.000 description 2
- 241000607361 Salmonella enterica subsp. enterica Species 0.000 description 2
- 241000293871 Salmonella enterica subsp. enterica serovar Typhi Species 0.000 description 2
- 241000369753 Sapporo virus Species 0.000 description 2
- 241000607764 Shigella dysenteriae Species 0.000 description 2
- 241000607760 Shigella sonnei Species 0.000 description 2
- 241000710960 Sindbis virus Species 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 206010041925 Staphylococcal infections Diseases 0.000 description 2
- 241000191967 Staphylococcus aureus Species 0.000 description 2
- 241000191963 Staphylococcus epidermidis Species 0.000 description 2
- 241001147691 Staphylococcus saprophyticus Species 0.000 description 2
- 241000193985 Streptococcus agalactiae Species 0.000 description 2
- 241000193998 Streptococcus pneumoniae Species 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- 241001312524 Streptococcus viridans Species 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- 241000713154 Toscana virus Species 0.000 description 2
- 241000589884 Treponema pallidum Species 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 241000202921 Ureaplasma urealyticum Species 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- 241000713152 Uukuniemi virus Species 0.000 description 2
- 241000710959 Venezuelan equine encephalitis virus Species 0.000 description 2
- 241000711975 Vesicular stomatitis virus Species 0.000 description 2
- 241000607626 Vibrio cholerae Species 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 241000710886 West Nile virus Species 0.000 description 2
- 241000710951 Western equine encephalitis virus Species 0.000 description 2
- 241000710772 Yellow fever virus Species 0.000 description 2
- 241000607447 Yersinia enterocolitica Species 0.000 description 2
- 241000607479 Yersinia pestis Species 0.000 description 2
- 241000607477 Yersinia pseudotuberculosis Species 0.000 description 2
- 241000907316 Zika virus Species 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 230000001147 anti-toxic effect Effects 0.000 description 2
- 230000005875 antibody response Effects 0.000 description 2
- 238000000149 argon plasma sintering Methods 0.000 description 2
- 206010003119 arrhythmia Diseases 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 229940065181 bacillus anthracis Drugs 0.000 description 2
- 208000022362 bacterial infectious disease Diseases 0.000 description 2
- 229940092524 bartonella henselae Drugs 0.000 description 2
- 229940092523 bartonella quintana Drugs 0.000 description 2
- 229940056450 brucella abortus Drugs 0.000 description 2
- 229940038698 brucella melitensis Drugs 0.000 description 2
- 102220363249 c.40C>A Human genes 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 125000004432 carbon atom Chemical group C* 0.000 description 2
- 210000004671 cell-free system Anatomy 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 238000010382 chemical cross-linking Methods 0.000 description 2
- 229940038705 chlamydia trachomatis Drugs 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 229960003920 cocaine Drugs 0.000 description 2
- 210000000172 cytosol Anatomy 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 206010012601 diabetes mellitus Diseases 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 229940032049 enterococcus faecalis Drugs 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 229940118764 francisella tularensis Drugs 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 210000005046 glial fibrillary acidic protein Anatomy 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 229940037467 helicobacter pylori Drugs 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 208000005252 hepatitis A Diseases 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- JYGXADMDTFJGBT-VWUMJDOOSA-N hydrocortisone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 JYGXADMDTFJGBT-VWUMJDOOSA-N 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 229910052500 inorganic mineral Inorganic materials 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 230000002427 irreversible effect Effects 0.000 description 2
- 229940045505 klebsiella pneumoniae Drugs 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 229960001375 lactose Drugs 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 229940115932 legionella pneumophila Drugs 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 208000030159 metabolic disease Diseases 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 235000010755 mineral Nutrition 0.000 description 2
- 239000011707 mineral Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 206010028417 myasthenia gravis Diseases 0.000 description 2
- 229960002715 nicotine Drugs 0.000 description 2
- SNICXCGAKADSCV-UHFFFAOYSA-N nicotine Natural products CN1CCCC1C1=CC=CN=C1 SNICXCGAKADSCV-UHFFFAOYSA-N 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 2
- 229940068968 polysorbate 80 Drugs 0.000 description 2
- 108010054624 red fluorescent protein Proteins 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 206010039073 rheumatoid arthritis Diseases 0.000 description 2
- 102220198236 rs1057519927 Human genes 0.000 description 2
- 102200082910 rs34948328 Human genes 0.000 description 2
- 102220329197 rs776688705 Human genes 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000001338 self-assembly Methods 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 229940007046 shigella dysenteriae Drugs 0.000 description 2
- 229940115939 shigella sonnei Drugs 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 230000006641 stabilisation Effects 0.000 description 2
- 238000011105 stabilization Methods 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 230000006918 subunit interaction Effects 0.000 description 2
- 208000011580 syndromic disease Diseases 0.000 description 2
- 238000010381 tandem affinity purification Methods 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 238000004627 transmission electron microscopy Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 201000008827 tuberculosis Diseases 0.000 description 2
- 241001529453 unidentified herpesvirus Species 0.000 description 2
- 229940118696 vibrio cholerae Drugs 0.000 description 2
- 229940051021 yellow-fever virus Drugs 0.000 description 2
- 229940098232 yersinia enterocolitica Drugs 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- RDEIXVOBVLKYNT-VQBXQJRRSA-N (2r,3r,4r,5r)-2-[(1s,2s,3r,4s,6r)-4,6-diamino-3-[(2r,3r,6s)-3-amino-6-(1-aminoethyl)oxan-2-yl]oxy-2-hydroxycyclohexyl]oxy-5-methyl-4-(methylamino)oxane-3,5-diol;(2r,3r,4r,5r)-2-[(1s,2s,3r,4s,6r)-4,6-diamino-3-[(2r,3r,6s)-3-amino-6-(aminomethyl)oxan-2-yl]o Chemical compound OS(O)(=O)=O.O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H](CC[C@@H](CN)O2)N)[C@@H](N)C[C@H]1N.O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H](CC[C@H](O2)C(C)N)N)[C@@H](N)C[C@H]1N.O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N RDEIXVOBVLKYNT-VQBXQJRRSA-N 0.000 description 1
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- OOSZCNKVJAVHJI-UHFFFAOYSA-N 1-[(4-fluorophenyl)methyl]piperazine Chemical compound C1=CC(F)=CC=C1CN1CCNCC1 OOSZCNKVJAVHJI-UHFFFAOYSA-N 0.000 description 1
- WKBPZYKAUNRMKP-UHFFFAOYSA-N 1-[2-(2,4-dichlorophenyl)pentyl]1,2,4-triazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1C(CCC)CN1C=NC=N1 WKBPZYKAUNRMKP-UHFFFAOYSA-N 0.000 description 1
- YMHOBZXQZVXHBM-UHFFFAOYSA-N 2,5-dimethoxy-4-bromophenethylamine Chemical compound COC1=CC(CCN)=C(OC)C=C1Br YMHOBZXQZVXHBM-UHFFFAOYSA-N 0.000 description 1
- WEEMDRWIKYCTQM-UHFFFAOYSA-N 2,6-dimethoxybenzenecarbothioamide Chemical compound COC1=CC=CC(OC)=C1C(N)=S WEEMDRWIKYCTQM-UHFFFAOYSA-N 0.000 description 1
- HNLXNOZHXNSSPN-UHFFFAOYSA-N 2-[2-[2-[2-[2-[2-[2-[4-(2,4,4-trimethylpentan-2-yl)phenoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethanol Chemical compound CC(C)(C)CC(C)(C)C1=CC=C(OCCOCCOCCOCCOCCOCCOCCO)C=C1 HNLXNOZHXNSSPN-UHFFFAOYSA-N 0.000 description 1
- QCDWFXQBSFUVSP-UHFFFAOYSA-N 2-phenoxyethanol Chemical compound OCCOC1=CC=CC=C1 QCDWFXQBSFUVSP-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 102000040125 5-hydroxytryptamine receptor family Human genes 0.000 description 1
- 108091032151 5-hydroxytryptamine receptor family Proteins 0.000 description 1
- GJCOSYZMQJWQCA-UHFFFAOYSA-N 9H-xanthene Chemical compound C1=CC=C2CC3=CC=CC=C3OC2=C1 GJCOSYZMQJWQCA-UHFFFAOYSA-N 0.000 description 1
- 102000051389 ADAMTS5 Human genes 0.000 description 1
- 108091005663 ADAMTS5 Proteins 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 102000003678 AMPA Receptors Human genes 0.000 description 1
- 108090000078 AMPA Receptors Proteins 0.000 description 1
- 208000026872 Addison Disease Diseases 0.000 description 1
- 208000010370 Adenoviridae Infections Diseases 0.000 description 1
- 206010060931 Adenovirus infection Diseases 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 208000007848 Alcoholism Diseases 0.000 description 1
- 201000004384 Alopecia Diseases 0.000 description 1
- 241000187390 Amycolatopsis lactamdurans Species 0.000 description 1
- 102000013455 Amyloid beta-Peptides Human genes 0.000 description 1
- 108010090849 Amyloid beta-Peptides Proteins 0.000 description 1
- 206010002383 Angina Pectoris Diseases 0.000 description 1
- 102220594594 Ankyrin repeat and SOCS box protein 8_H88W_mutation Human genes 0.000 description 1
- 244000105975 Antidesma platyphyllum Species 0.000 description 1
- 208000003343 Antiphospholipid Syndrome Diseases 0.000 description 1
- 208000025494 Aortic disease Diseases 0.000 description 1
- 206010003267 Arthritis reactive Diseases 0.000 description 1
- 102220496625 Aryl hydrocarbon receptor nuclear translocator-like protein 1_S78E_mutation Human genes 0.000 description 1
- 206010003591 Ataxia Diseases 0.000 description 1
- 108091005950 Azurite Proteins 0.000 description 1
- 108091008875 B cell receptors Proteins 0.000 description 1
- 208000004429 Bacillary Dysentery Diseases 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 241000194107 Bacillus megaterium Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 206010004022 Bacterial food poisoning Diseases 0.000 description 1
- 208000023328 Basedow disease Diseases 0.000 description 1
- 208000006373 Bell palsy Diseases 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 208000018084 Bone neoplasm Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000003508 Botulism Diseases 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 241000193764 Brevibacillus brevis Species 0.000 description 1
- CPELXLSAUQHCOX-UHFFFAOYSA-M Bromide Chemical compound [Br-] CPELXLSAUQHCOX-UHFFFAOYSA-M 0.000 description 1
- 229940022962 COVID-19 vaccine Drugs 0.000 description 1
- 108010071134 CRM197 (non-toxic variant of diphtheria toxin) Proteins 0.000 description 1
- 102000004657 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Human genes 0.000 description 1
- 108010003721 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Proteins 0.000 description 1
- 208000006339 Caliciviridae Infections Diseases 0.000 description 1
- 206010051226 Campylobacter infection Diseases 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 208000006029 Cardiomegaly Diseases 0.000 description 1
- 208000031229 Cardiomyopathies Diseases 0.000 description 1
- 241000010804 Caulobacter vibrioides Species 0.000 description 1
- 206010007882 Cellulitis Diseases 0.000 description 1
- 229920000623 Cellulose acetate phthalate Polymers 0.000 description 1
- 108091005944 Cerulean Proteins 0.000 description 1
- 229940124957 Cervarix Drugs 0.000 description 1
- 201000006082 Chickenpox Diseases 0.000 description 1
- 208000004293 Chikungunya Fever Diseases 0.000 description 1
- 206010067256 Chikungunya virus infection Diseases 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 241000579895 Chlorostilbon Species 0.000 description 1
- 239000004099 Chlortetracycline Substances 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 241000047960 Chromohalobacter salexigens Species 0.000 description 1
- 206010008874 Chronic Fatigue Syndrome Diseases 0.000 description 1
- 208000006545 Chronic Obstructive Pulmonary Disease Diseases 0.000 description 1
- YASYEJJMZJALEJ-UHFFFAOYSA-N Citric acid monohydrate Chemical compound O.OC(=O)CC(O)(C(O)=O)CC(O)=O YASYEJJMZJALEJ-UHFFFAOYSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 208000015943 Coeliac disease Diseases 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- 102000007644 Colony-Stimulating Factors Human genes 0.000 description 1
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 1
- 208000002330 Congenital Heart Defects Diseases 0.000 description 1
- 206010010904 Convulsion Diseases 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 201000006306 Cor pulmonale Diseases 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000186145 Corynebacterium ammoniagenes Species 0.000 description 1
- 208000000307 Crimean Hemorrhagic Fever Diseases 0.000 description 1
- 201000003075 Crimean-Congo hemorrhagic fever Diseases 0.000 description 1
- 208000008953 Cryptosporidiosis Diseases 0.000 description 1
- 206010011502 Cryptosporidiosis infection Diseases 0.000 description 1
- 108091005943 CyPet Proteins 0.000 description 1
- RFSUNEUAIZKAJO-VRPWFDPXSA-N D-Fructose Natural products OC[C@H]1OC(O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-VRPWFDPXSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 208000001490 Dengue Diseases 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 206010012438 Dermatitis atopic Diseases 0.000 description 1
- 108700022150 Designed Ankyrin Repeat Proteins Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 108091005941 EBFP Proteins 0.000 description 1
- 108091005947 EBFP2 Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 201000011001 Ebola Hemorrhagic Fever Diseases 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 206010014612 Encephalitis viral Diseases 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 241000701832 Enterobacteria phage T3 Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 201000000297 Erysipelas Diseases 0.000 description 1
- 208000007985 Erythema Infectiosum Diseases 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 102100029951 Estrogen receptor beta Human genes 0.000 description 1
- 208000006168 Ewing Sarcoma Diseases 0.000 description 1
- 201000005866 Exanthema Subitum Diseases 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 206010016936 Folliculitis Diseases 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 206010017533 Fungal infection Diseases 0.000 description 1
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 1
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 1
- 102000027484 GABAA receptors Human genes 0.000 description 1
- 108091008681 GABAA receptors Proteins 0.000 description 1
- 102000007563 Galectins Human genes 0.000 description 1
- 108010046569 Galectins Proteins 0.000 description 1
- 208000022072 Gallbladder Neoplasms Diseases 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 229940124897 Gardasil Drugs 0.000 description 1
- 208000005577 Gastroenteritis Diseases 0.000 description 1
- 201000003741 Gastrointestinal carcinoma Diseases 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 208000034826 Genetic Predisposition to Disease Diseases 0.000 description 1
- 102000004216 Glial cell line-derived neurotrophic factor receptors Human genes 0.000 description 1
- 108090000722 Glial cell line-derived neurotrophic factor receptors Proteins 0.000 description 1
- 206010018364 Glomerulonephritis Diseases 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 102000018899 Glutamate Receptors Human genes 0.000 description 1
- 108010027915 Glutamate Receptors Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 108010076533 Glycine Receptors Proteins 0.000 description 1
- 102000011714 Glycine Receptors Human genes 0.000 description 1
- 206010018612 Gonorrhoea Diseases 0.000 description 1
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 208000015023 Graves' disease Diseases 0.000 description 1
- 102220542942 Growth arrest-specific protein 6_F41L_mutation Human genes 0.000 description 1
- 241000206596 Halomonas Species 0.000 description 1
- 208000008913 Hantavirus Infections Diseases 0.000 description 1
- 206010019280 Heart failures Diseases 0.000 description 1
- 208000007514 Herpes zoster Diseases 0.000 description 1
- 101001023784 Heteractis crispa GFP-like non-fluorescent chromoprotein Proteins 0.000 description 1
- 101000756632 Homo sapiens Actin, cytoplasmic 1 Proteins 0.000 description 1
- 101000975496 Homo sapiens Keratin, type II cytoskeletal 8 Proteins 0.000 description 1
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 1
- 101000837639 Homo sapiens Thyroxine-binding globulin Proteins 0.000 description 1
- 101000818510 Homo sapiens Zinc-activated ligand-gated ion channel Proteins 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 208000037147 Hypercalcaemia Diseases 0.000 description 1
- 208000035150 Hypercholesterolemia Diseases 0.000 description 1
- 208000031226 Hyperlipidaemia Diseases 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- 206010021143 Hypoxia Diseases 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- 206010021531 Impetigo Diseases 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 102000003815 Interleukin-11 Human genes 0.000 description 1
- 108090000177 Interleukin-11 Proteins 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 108050003558 Interleukin-17 Proteins 0.000 description 1
- 102000013691 Interleukin-17 Human genes 0.000 description 1
- 102000004890 Interleukin-8 Human genes 0.000 description 1
- 102000000585 Interleukin-9 Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- LPHGQDQBBGAPDZ-UHFFFAOYSA-N Isocaffeine Natural products CN1C(=O)N(C)C(=O)C2=C1N(C)C=N2 LPHGQDQBBGAPDZ-UHFFFAOYSA-N 0.000 description 1
- 102000000079 Kainic Acid Receptors Human genes 0.000 description 1
- 108010069902 Kainic Acid Receptors Proteins 0.000 description 1
- 208000007766 Kaposi sarcoma Diseases 0.000 description 1
- 208000011200 Kawasaki disease Diseases 0.000 description 1
- 102100023972 Keratin, type II cytoskeletal 8 Human genes 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- LEVWYRKDKASIDU-IMJSIDKUSA-N L-cystine Chemical compound [O-]C(=O)[C@@H]([NH3+])CSSC[C@H]([NH3+])C([O-])=O LEVWYRKDKASIDU-IMJSIDKUSA-N 0.000 description 1
- 239000004158 L-cystine Substances 0.000 description 1
- 235000019393 L-cystine Nutrition 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 244000199866 Lactobacillus casei Species 0.000 description 1
- 235000013958 Lactobacillus casei Nutrition 0.000 description 1
- 241000186606 Lactobacillus gasseri Species 0.000 description 1
- 240000006024 Lactobacillus plantarum Species 0.000 description 1
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 1
- 241000186604 Lactobacillus reuteri Species 0.000 description 1
- 206010023825 Laryngeal cancer Diseases 0.000 description 1
- 206010023927 Lassa fever Diseases 0.000 description 1
- 208000004023 Legionellosis Diseases 0.000 description 1
- 208000035353 Legionnaires disease Diseases 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 206010024238 Leptospirosis Diseases 0.000 description 1
- 206010061523 Lip and/or oral cavity cancer Diseases 0.000 description 1
- 206010024641 Listeriosis Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 208000016604 Lyme disease Diseases 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 208000032271 Malignant tumor of penis Diseases 0.000 description 1
- 208000000932 Marburg Virus Disease Diseases 0.000 description 1
- 201000011013 Marburg hemorrhagic fever Diseases 0.000 description 1
- 201000005505 Measles Diseases 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 206010027202 Meningitis bacterial Diseases 0.000 description 1
- 206010027260 Meningitis viral Diseases 0.000 description 1
- 206010027406 Mesothelioma Diseases 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 208000025370 Middle East respiratory syndrome Diseases 0.000 description 1
- 208000019695 Migraine disease Diseases 0.000 description 1
- 208000026072 Motor neurone disease Diseases 0.000 description 1
- 208000003445 Mouth Neoplasms Diseases 0.000 description 1
- 208000005647 Mumps Diseases 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 241000187480 Mycobacterium smegmatis Species 0.000 description 1
- 241000204051 Mycoplasma genitalium Species 0.000 description 1
- 208000031888 Mycoses Diseases 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- 102000004868 N-Methyl-D-Aspartate Receptors Human genes 0.000 description 1
- 108090001041 N-Methyl-D-Aspartate Receptors Proteins 0.000 description 1
- SBKRTALNRRAOJP-BWSIXKJUSA-N N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methylheptanamide (6S)-N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methyloctanamide sulfuric acid Polymers OS(O)(=O)=O.CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O.CC[C@H](C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O SBKRTALNRRAOJP-BWSIXKJUSA-N 0.000 description 1
- 229910002651 NO3 Inorganic materials 0.000 description 1
- 208000001894 Nasopharyngeal Neoplasms Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 206010028885 Necrotising fasciitis Diseases 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 102000019315 Nicotinic acetylcholine receptors Human genes 0.000 description 1
- 108050006807 Nicotinic acetylcholine receptors Proteins 0.000 description 1
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 1
- 208000010505 Nose Neoplasms Diseases 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 1
- 101150037263 PIP2 gene Proteins 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 208000009608 Papillomavirus Infections Diseases 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- 208000029082 Pelvic Inflammatory Disease Diseases 0.000 description 1
- 208000002471 Penile Neoplasms Diseases 0.000 description 1
- 206010034299 Penile cancer Diseases 0.000 description 1
- 208000005764 Peripheral Arterial Disease Diseases 0.000 description 1
- 208000030831 Peripheral arterial occlusive disease Diseases 0.000 description 1
- 208000031845 Pernicious anaemia Diseases 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 206010035718 Pneumonia legionella Diseases 0.000 description 1
- 208000000474 Poliomyelitis Diseases 0.000 description 1
- 108010002885 Polygeline Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 102000007327 Protamines Human genes 0.000 description 1
- 108010007568 Protamines Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 208000010362 Protozoan Infections Diseases 0.000 description 1
- 208000003251 Pruritus Diseases 0.000 description 1
- 241000590028 Pseudoalteromonas haloplanktis Species 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 208000004186 Pulmonary Heart Disease Diseases 0.000 description 1
- 206010037742 Rabies Diseases 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000015634 Rectal Neoplasms Diseases 0.000 description 1
- 208000004531 Renal Artery Obstruction Diseases 0.000 description 1
- 206010038378 Renal artery stenosis Diseases 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 206010061603 Respiratory syncytial virus infection Diseases 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 208000036485 Roseola Diseases 0.000 description 1
- 206010067470 Rotavirus infection Diseases 0.000 description 1
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 description 1
- 101100262439 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) UBA2 gene Proteins 0.000 description 1
- 206010039438 Salmonella Infections Diseases 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 206010040047 Sepsis Diseases 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 108010034546 Serratia marcescens nuclease Proteins 0.000 description 1
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 1
- 241000490596 Shewanella sp. Species 0.000 description 1
- 206010040550 Shigella infections Diseases 0.000 description 1
- 208000021386 Sjogren Syndrome Diseases 0.000 description 1
- 208000000453 Skin Neoplasms Diseases 0.000 description 1
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 1
- UIIMBOGNXHQVGW-DEQYMQKBSA-M Sodium bicarbonate-14C Chemical compound [Na+].O[14C]([O-])=O UIIMBOGNXHQVGW-DEQYMQKBSA-M 0.000 description 1
- 239000004147 Sorbitan trioleate Substances 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 208000006045 Spondylarthropathies Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 206010061372 Streptococcal infection Diseases 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 241000187392 Streptomyces griseus Species 0.000 description 1
- 241000187398 Streptomyces lividans Species 0.000 description 1
- 208000006011 Stroke Diseases 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 108050009621 Synapsin Proteins 0.000 description 1
- 102000001435 Synapsin Human genes 0.000 description 1
- 201000009594 Systemic Scleroderma Diseases 0.000 description 1
- 206010042953 Systemic sclerosis Diseases 0.000 description 1
- 238000003917 TEM image Methods 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 102100028709 Thyroxine-binding globulin Human genes 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 1
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 1
- 208000032109 Transient ischaemic attack Diseases 0.000 description 1
- 206010052779 Transplant rejections Diseases 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- 206010054094 Tumour necrosis Diseases 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- 208000037386 Typhoid Diseases 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- COQLPRJCUIATTQ-UHFFFAOYSA-N Uranyl acetate Chemical compound O.O.O=[U]=O.CC(O)=O.CC(O)=O COQLPRJCUIATTQ-UHFFFAOYSA-N 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 208000002495 Uterine Neoplasms Diseases 0.000 description 1
- 208000036826 VIIth nerve paralysis Diseases 0.000 description 1
- 206010046980 Varicella Diseases 0.000 description 1
- 241000700647 Variola virus Species 0.000 description 1
- 206010047115 Vasculitis Diseases 0.000 description 1
- 241000545067 Venus Species 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 206010047642 Vitiligo Diseases 0.000 description 1
- 206010047741 Vulval cancer Diseases 0.000 description 1
- 208000000260 Warts Diseases 0.000 description 1
- 206010057293 West Nile viral infection Diseases 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 208000003152 Yellow Fever Diseases 0.000 description 1
- 208000001455 Zika Virus Infection Diseases 0.000 description 1
- 208000035332 Zika virus disease Diseases 0.000 description 1
- 208000020329 Zika virus infectious disease Diseases 0.000 description 1
- 102100021143 Zinc-activated ligand-gated ion channel Human genes 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 208000011589 adenoviridae infectious disease Diseases 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- GZCGUPFRVQAUEE-KVTDHHQDSA-N aldehydo-D-mannose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O GZCGUPFRVQAUEE-KVTDHHQDSA-N 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 108010004469 allophycocyanin Proteins 0.000 description 1
- 231100000360 alopecia Toxicity 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 239000004411 aluminium Substances 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 239000001132 aluminium potassium sulphate Substances 0.000 description 1
- 235000011126 aluminium potassium sulphate Nutrition 0.000 description 1
- 159000000013 aluminium salts Chemical class 0.000 description 1
- 235000005550 amino acid supplement Nutrition 0.000 description 1
- SOIFLUNRINLCBN-UHFFFAOYSA-N ammonium thiocyanate Chemical compound [NH4+].[S-]C#N SOIFLUNRINLCBN-UHFFFAOYSA-N 0.000 description 1
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 1
- 239000002870 angiogenesis inducing agent Substances 0.000 description 1
- 229960004977 anhydrous lactose Drugs 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 239000002249 anxiolytic agent Substances 0.000 description 1
- 230000000949 anxiolytic effect Effects 0.000 description 1
- 229940005530 anxiolytics Drugs 0.000 description 1
- 206010003230 arteritis Diseases 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 244000309743 astrovirus Species 0.000 description 1
- 201000008937 atopic dermatitis Diseases 0.000 description 1
- 201000009904 bacterial meningitis Diseases 0.000 description 1
- 229910052788 barium Inorganic materials 0.000 description 1
- DSAJWYNOEDNPEQ-UHFFFAOYSA-N barium atom Chemical compound [Ba] DSAJWYNOEDNPEQ-UHFFFAOYSA-N 0.000 description 1
- UREZNYTWGJKWBI-UHFFFAOYSA-M benzethonium chloride Chemical compound [Cl-].C1=CC(C(C)(C)CC(C)(C)C)=CC=C1OCCOCC[N+](C)(C)CC1=CC=CC=C1 UREZNYTWGJKWBI-UHFFFAOYSA-M 0.000 description 1
- 229960001950 benzethonium chloride Drugs 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- VEZXCJBBBCKRPI-UHFFFAOYSA-N beta-propiolactone Chemical compound O=C1CCO1 VEZXCJBBBCKRPI-UHFFFAOYSA-N 0.000 description 1
- 208000027119 bilirubin metabolic disease Diseases 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 230000036760 body temperature Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 229910021538 borax Inorganic materials 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 206010006451 bronchitis Diseases 0.000 description 1
- 210000004900 c-terminal fragment Anatomy 0.000 description 1
- 102220352322 c.232A>C Human genes 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- VJEONQKOZGKCAK-UHFFFAOYSA-N caffeine Natural products CN1C(=O)N(C)C(=O)C2=C1C=CN2C VJEONQKOZGKCAK-UHFFFAOYSA-N 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 201000004927 campylobacteriosis Diseases 0.000 description 1
- 229930003827 cannabinoid Natural products 0.000 description 1
- 239000003557 cannabinoid Substances 0.000 description 1
- 229940065144 cannabinoids Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 239000004359 castor oil Substances 0.000 description 1
- 235000019438 castor oil Nutrition 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 229940081734 cellulose acetate phthalate Drugs 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- CYDMQBQPVICBEU-UHFFFAOYSA-N chlorotetracycline Natural products C1=CC(Cl)=C2C(O)(C)C3CC4C(N(C)C)C(O)=C(C(N)=O)C(=O)C4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-UHFFFAOYSA-N 0.000 description 1
- 229960004475 chlortetracycline Drugs 0.000 description 1
- CYDMQBQPVICBEU-XRNKAMNCSA-N chlortetracycline Chemical compound C1=CC(Cl)=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O CYDMQBQPVICBEU-XRNKAMNCSA-N 0.000 description 1
- 235000019365 chlortetracycline Nutrition 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 208000037893 chronic inflammatory disorder Diseases 0.000 description 1
- 208000025302 chronic primary adrenal insufficiency Diseases 0.000 description 1
- 229960002303 citric acid monohydrate Drugs 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 229940047120 colony stimulating factors Drugs 0.000 description 1
- 238000004040 coloring Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 208000028831 congenital heart disease Diseases 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 208000029078 coronary artery disease Diseases 0.000 description 1
- 230000000139 costimulatory effect Effects 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 108010082025 cyan fluorescent protein Proteins 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- 201000003146 cystitis Diseases 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 206010061811 demyelinating polyneuropathy Diseases 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 229940009976 deoxycholate Drugs 0.000 description 1
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 229960002086 dextran Drugs 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 229940111685 dibasic potassium phosphate Drugs 0.000 description 1
- 229940061607 dibasic sodium phosphate Drugs 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- KDQPSPMLNJTZAL-UHFFFAOYSA-L disodium hydrogenphosphate dihydrate Chemical compound O.O.[Na+].[Na+].OP([O-])([O-])=O KDQPSPMLNJTZAL-UHFFFAOYSA-L 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 101150015101 dsbC gene Proteins 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000002296 dynamic light scattering Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000010976 emerald Substances 0.000 description 1
- 229910052876 emerald Inorganic materials 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 206010014599 encephalitis Diseases 0.000 description 1
- 206010014665 endocarditis Diseases 0.000 description 1
- 230000012202 endocytosis Effects 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 102000027412 enzyme-linked receptors Human genes 0.000 description 1
- 108091008592 enzyme-linked receptors Proteins 0.000 description 1
- 206010015037 epilepsy Diseases 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 208000024519 eye neoplasm Diseases 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 108010021843 fluorescent protein 583 Proteins 0.000 description 1
- 210000000285 follicular dendritic cell Anatomy 0.000 description 1
- 229960004279 formaldehyde Drugs 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- PGBHMTALBVVCIT-VCIWKGPPSA-N framycetin Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CN)O2)N)O[C@@H]1CO PGBHMTALBVVCIT-VCIWKGPPSA-N 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 201000010175 gallbladder cancer Diseases 0.000 description 1
- 229940102767 gardasil 9 Drugs 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 229940014259 gelatin Drugs 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229960000587 glutaral Drugs 0.000 description 1
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 235000009424 haa Nutrition 0.000 description 1
- 230000003400 hallucinatory effect Effects 0.000 description 1
- 208000029629 hantavirus infectious disease Diseases 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 208000018578 heart valve disease Diseases 0.000 description 1
- 201000010284 hepatitis E Diseases 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 102000045345 human IL31 Human genes 0.000 description 1
- 208000003906 hydrocephalus Diseases 0.000 description 1
- 229960000890 hydrocortisone Drugs 0.000 description 1
- 208000036796 hyperbilirubinemia Diseases 0.000 description 1
- 230000000148 hypercalcaemia Effects 0.000 description 1
- 208000030915 hypercalcemia disease Diseases 0.000 description 1
- 208000006575 hypertriglyceridemia Diseases 0.000 description 1
- 239000003326 hypnotic agent Substances 0.000 description 1
- 230000000147 hypnotic effect Effects 0.000 description 1
- 230000007954 hypoxia Effects 0.000 description 1
- 210000001822 immobilized cell Anatomy 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 201000002313 intestinal cancer Diseases 0.000 description 1
- 102000027415 ion channel-linked receptors Human genes 0.000 description 1
- 108091008593 ion channel-linked receptors Proteins 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 229940017800 lactobacillus casei Drugs 0.000 description 1
- 229940072205 lactobacillus plantarum Drugs 0.000 description 1
- 229940001882 lactobacillus reuteri Drugs 0.000 description 1
- 150000002605 large molecules Chemical class 0.000 description 1
- 206010023841 laryngeal neoplasm Diseases 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 108700021021 mRNA Vaccine Proteins 0.000 description 1
- 229940126582 mRNA vaccine Drugs 0.000 description 1
- 108091005958 mTurquoise2 Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 229940057948 magnesium stearate Drugs 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- 208000015688 methicillin-resistant staphylococcus aureus infectious disease Diseases 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000009149 molecular binding Effects 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 208000008588 molluscum contagiosum Diseases 0.000 description 1
- 229940111688 monobasic potassium phosphate Drugs 0.000 description 1
- 229940045641 monobasic sodium phosphate Drugs 0.000 description 1
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- LPUQAYUQRXPFSQ-DFWYDOINSA-M monosodium L-glutamate Chemical compound [Na+].[O-]C(=O)[C@@H](N)CCC(O)=O LPUQAYUQRXPFSQ-DFWYDOINSA-M 0.000 description 1
- 239000004223 monosodium glutamate Substances 0.000 description 1
- 235000013923 monosodium glutamate Nutrition 0.000 description 1
- 229910000403 monosodium phosphate Inorganic materials 0.000 description 1
- 235000019799 monosodium phosphate Nutrition 0.000 description 1
- 208000005264 motor neuron disease Diseases 0.000 description 1
- 208000001725 mucocutaneous lymph node syndrome Diseases 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 208000010805 mumps infectious disease Diseases 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 208000029766 myalgic encephalomeyelitis/chronic fatigue syndrome Diseases 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 201000003631 narcolepsy Diseases 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229940053050 neomycin sulfate Drugs 0.000 description 1
- 208000022145 neurocutaneous syndrome Diseases 0.000 description 1
- 201000001119 neuropathy Diseases 0.000 description 1
- 230000007823 neuropathy Effects 0.000 description 1
- LGQLOGILCSXPEA-UHFFFAOYSA-L nickel sulfate Chemical compound [Ni+2].[O-]S([O-])(=O)=O LGQLOGILCSXPEA-UHFFFAOYSA-L 0.000 description 1
- 229940053662 nickel sulfate Drugs 0.000 description 1
- 229910000363 nickel(II) sulfate Inorganic materials 0.000 description 1
- IJGRMHOSHXDMSA-UHFFFAOYSA-N nitrogen Substances N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920004905 octoxynol-10 Polymers 0.000 description 1
- 201000008106 ocular cancer Diseases 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 229940005483 opioid analgesics Drugs 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 208000021090 palsy Diseases 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 229960002566 papillomavirus vaccine Drugs 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 208000033808 peripheral neuropathy Diseases 0.000 description 1
- 229960005323 phenoxyethanol Drugs 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 229920003074 plasdone C polymer Polymers 0.000 description 1
- 229960000540 polacrilin potassium Drugs 0.000 description 1
- 229960004250 polygeline Drugs 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920000024 polymyxin B Polymers 0.000 description 1
- 229960005266 polymyxin b Drugs 0.000 description 1
- 229960003548 polymyxin b sulfate Drugs 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- GRLPQNLYRHEGIJ-UHFFFAOYSA-J potassium aluminium sulfate Chemical compound [Al+3].[K+].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O GRLPQNLYRHEGIJ-UHFFFAOYSA-J 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 229940093916 potassium phosphate Drugs 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- WVWZXTJUCNEUAE-UHFFFAOYSA-M potassium;1,2-bis(ethenyl)benzene;2-methylprop-2-enoate Chemical compound [K+].CC(=C)C([O-])=O.C=CC1=CC=CC=C1C=C WVWZXTJUCNEUAE-UHFFFAOYSA-M 0.000 description 1
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 229960000380 propiolactone Drugs 0.000 description 1
- 229950008679 protamine sulfate Drugs 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 108020001775 protein parts Proteins 0.000 description 1
- 238000000455 protein structure prediction Methods 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 208000002574 reactive arthritis Diseases 0.000 description 1
- 230000030541 receptor transactivation Effects 0.000 description 1
- 206010038038 rectal cancer Diseases 0.000 description 1
- 201000001275 rectum cancer Diseases 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 208000030925 respiratory syncytial virus infectious disease Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 208000004124 rheumatic heart disease Diseases 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 102220057650 rs730881913 Human genes 0.000 description 1
- 201000005404 rubella Diseases 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 206010039447 salmonellosis Diseases 0.000 description 1
- 239000013017 sartobind Substances 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 206010039722 scoliosis Diseases 0.000 description 1
- 239000000932 sedative agent Substances 0.000 description 1
- 229940125723 sedative agent Drugs 0.000 description 1
- 238000011896 sensitive detection Methods 0.000 description 1
- 201000005113 shigellosis Diseases 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 208000037968 sinus cancer Diseases 0.000 description 1
- 201000009890 sinusitis Diseases 0.000 description 1
- 201000000849 skin cancer Diseases 0.000 description 1
- 201000010153 skin papilloma Diseases 0.000 description 1
- 230000000391 smoking effect Effects 0.000 description 1
- 229960004599 sodium borate Drugs 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 229940001593 sodium carbonate Drugs 0.000 description 1
- 235000017550 sodium carbonate Nutrition 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 229960001790 sodium citrate Drugs 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- 229940074545 sodium dihydrogen phosphate dihydrate Drugs 0.000 description 1
- 239000004296 sodium metabisulphite Substances 0.000 description 1
- 235000010262 sodium metabisulphite Nutrition 0.000 description 1
- 229940045946 sodium taurodeoxycholate Drugs 0.000 description 1
- 235000010339 sodium tetraborate Nutrition 0.000 description 1
- YXHRQQJFKOHLAP-FVCKGWAHSA-M sodium;2-[[(4r)-4-[(3r,5r,8r,9s,10s,12s,13r,14s,17r)-3,12-dihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]pentanoyl]amino]ethanesulfonate Chemical compound [Na+].C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 YXHRQQJFKOHLAP-FVCKGWAHSA-M 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 229960000391 sorbitan trioleate Drugs 0.000 description 1
- 235000019337 sorbitan trioleate Nutrition 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 229960002920 sorbitol Drugs 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 208000002320 spinal muscular atrophy Diseases 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 229960002385 streptomycin sulfate Drugs 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 208000011117 substance-related disease Diseases 0.000 description 1
- 239000008362 succinate buffer Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 208000006379 syphilis Diseases 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- 101150061166 tetR gene Proteins 0.000 description 1
- OFVLGDICTFRJMM-WESIUVDSSA-N tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 206010043778 thyroiditis Diseases 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 239000011031 topaz Substances 0.000 description 1
- 229910052853 topaz Inorganic materials 0.000 description 1
- 230000024033 toxin binding Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 201000010875 transient cerebral ischemia Diseases 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- GWBUNZLLLLDXMD-UHFFFAOYSA-H tricopper;dicarbonate;dihydroxide Chemical compound [OH-].[OH-].[Cu+2].[Cu+2].[Cu+2].[O-]C([O-])=O.[O-]C([O-])=O GWBUNZLLLLDXMD-UHFFFAOYSA-H 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- BSVBQGMMJUBVOD-UHFFFAOYSA-N trisodium borate Chemical compound [Na+].[Na+].[Na+].[O-]B([O-])[O-] BSVBQGMMJUBVOD-UHFFFAOYSA-N 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 201000008297 typhoid fever Diseases 0.000 description 1
- 229960004441 tyrosine Drugs 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 208000019206 urinary tract infection Diseases 0.000 description 1
- 206010046766 uterine cancer Diseases 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 206010046885 vaginal cancer Diseases 0.000 description 1
- 208000013139 vaginal neoplasm Diseases 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 201000002498 viral encephalitis Diseases 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 201000010044 viral meningitis Diseases 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 201000005102 vulva cancer Diseases 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229920001285 xanthan gum Polymers 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/555—Medicinal preparations containing antigens or antibodies characterised by a specific combination antigen/adjuvant
- A61K2039/55511—Organic adjuvants
- A61K2039/55555—Liposomes; Vesicles, e.g. nanoparticles; Spheres, e.g. nanospheres; Polymers
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/57—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2
- A61K2039/575—Medicinal preparations containing antigens or antibodies characterised by the type of response, e.g. Th1, Th2 humoral response
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2730/00—Reverse transcribing DNA viruses
- C12N2730/00011—Details
- C12N2730/10011—Hepadnaviridae
- C12N2730/10111—Orthohepadnavirus, e.g. hepatitis B virus
- C12N2730/10123—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/10011—Arenaviridae
- C12N2760/10111—Deltavirus, e.g. hepatitis delta virus
- C12N2760/10122—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/10011—Arenaviridae
- C12N2760/10111—Deltavirus, e.g. hepatitis delta virus
- C12N2760/10123—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/10011—Arenaviridae
- C12N2760/10111—Deltavirus, e.g. hepatitis delta virus
- C12N2760/10134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Definitions
- VLPs virus-like particles
- one monomer comprises a single high affinity protein attachment system which allows interchangeable decoration with any functional molecule of choice, wherein the heterodimeric subunits are created by modification of the electrostatic interaction between the monomers forming each heterodimer.
- the present invention further relates to processes of producing the VLPs, including a rapid single cell process, and uses of the VLPs in research, diagnosis and as vaccines for use in prevention/treatment of diseases.
- Virus-like particles are molecules that closely resemble viruses, but contain no viral genetic material. They are formed from viral structural proteins, such as viral capsid proteins that, when individually expressed, self-assemble into a particle. Most Virus-like particles appear as hollow ‘nano-footballs’ where the entire surface of the football is made up by many copies of a single self-assembled protein. For production purposes this means that production of one single protein is sufficient to generate the entire nano-football type VLP structure. This has been exploited in medicine. The most common use of VLPs are as vaccines. Mammals have evolved immune sensing mechanisms to recognise highly repetitive patterns seen on viral capsids as intruders.
- VLPs which contain repetitive, high density displays of viral surface proteins. Therefore the VLP can generate an immune reaction but the viral genome is typically removed.
- HPV human papillomavirus
- HPV vaccines of this type such as Cervarix by GlaxoSmithKline along with Gardasil and Gardasil-9, produced by Merck & Co.
- Further developments of VLPs for use as vaccines involve tethering of other agents to the VLP shell. In this case, the VLP shell serves to present an additional agent as an ‘epitope’ to the immune system and thereby stimulate an immune reaction.
- the viral capsid proteins forming the VLP shell can be modified to directly incorporate the epitope for display through genetic fusion.
- Current COVID19 vaccines that are under development use this form of VLP, where the spike protein from the coronavirus is directly fused to a viral capsid protein forming a VLP shell from an unrelated virus.
- this approach commonly leads to impaired VLP assembly and large proteins routinely cause VLP instability.
- this approach cannot be used if the agent is not protein-based.
- a further alternative is to assemble the VLP and then use attachment means to secure the agent to the VLP shell.
- Such VLPs with additional attachment means may be termed ‘compound VLPs’.
- Compound VLPs may be manufactured by methods such as chemical crosslinking, reactive unnatural amino acids, or the use of binding proteins such as the SpyTag/SpyCatcher system, to covalently attach the desired agent or epitope to the viral capsid proteins forming the VLP.
- the latter method allows the attachment of other non-protein epitopes to the VLP, but requires a complicated production process and cannot yet be used commercially for any agent.
- Some desired proteins are simply too large to attach to the VLP using current attachment means, and some complex epitopes include multimers with numerous components that must be separately linked together, which must be achieved by additional chemical crosslinking.
- the current binding proteins which are used as attachment means have further issues in that the binding between the proteins whilst being strong, does not occur instantly but requires time for the reactants to fuse, and can result in VLP aggregation depending on which agent or epitope is attached to the VLP. Further complexities arise in the production of VLPs used in clinical human or veterinary applications, which regulators classify VLPs as “biological” active drug intermediates (ADI’s). “Biologic” drugs are produced in living cells, followed by purification according to a regulator – approved process.
- ADI biological active drug intermediates
- Each cell line (regardless whether bacterial, plant, yeast, insect or mammalian) used for the production process is carefully characterized so as to guarantee long-term stability of the ADI and stored under highly specified conditions as a “Master Cell Bank” (MCB).
- MCB Master Cell Bank
- E.coli The most efficient type of production cell is bacteria (specifically: E.coli).
- many proteins do not assume their native shape when produced in E.coli but must be re-folded into their proper form from a denatured state as part of the purification process, which results in huge drop of overall yield and significantly adds to the complexity of the production process
- the production process of compound VLP-type drugs which attempt to attach agents such as epitopes to the viral capsid proteins is complex and expensive. This has limited the wide-spread exploration of compound VLP applications to fields where inexpensive mass production could make them more competitive.
- a technology which simplifies the process of making compound VLPs has recently been described which uses the Hepatitis B virus capsid, HBc.
- this technology is based on a pair of binding proteins positioned on the surface tip of each monomer forming each homodimer of the VLP, as shown in Figure 2B herein. These binding proteins can in turn act as a docking point to allow other ‘epitope’ proteins to be positioned on the outside of the VLP.
- binding proteins can in turn act as a docking point to allow other ‘epitope’ proteins to be positioned on the outside of the VLP.
- the effectiveness of vaccines made with this technology may be impacted by “crowding” of too many ‘epitope’ proteins being presented on the surface of the VLP depending on the size and shape of the protein being presented.
- Another potential limitation, as with all HBc VLP vaccines may be that industrial scale manufacture, stability, and storage could be temperature-dependent, posing practical limitations.
- VLP for use as a vaccine that overcomes the problem of “crowding” of proteins presented on the surface by using heterodimeric capsid subunits which are still able to self-assemble but present a single ‘epitope’ protein for each of the dimeric subunits forming the VLP.
- the present invention aims to provide a VLP that has improved stability when manufactured at industrial scale. While other technologies, including mRNA, can generate “conventional” vaccines against infectious diseases, vaccines against self-proteins, such as, for example vaccines to treat asthma or dermatitis, cannot be delivered by mRNA platforms.
- mRNA translated to self-proteins is not loaded onto the surface of presenting cells (follicular dendritic cells) as a whole intact folded protein, and thus is not able to trigger activation of B-cells.
- mRNA vaccines do not generate an antibody response against self-proteins. Therefore, there is a general need for improved vaccine development in the areas of chronic inflammatory diseases, animal health, cancer, dementia, and animal health.
- One or more aspects of the present invention are aimed at solving one or more of the above- mentioned problems.
- a virus-like particle comprising: - One or more viral capsid protein heterodimer(s) each comprising a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence, and; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
- the first amino acid sequence and the second amino acid sequence both comprise at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
- the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence.
- the binding molecule is operable to be attached to a functional molecule.
- the binding molecule is a binding protein.
- the binding protein comprises a bacterial toxin inhibitor.
- the bacterial toxin inhibitor is selected from Im7, Im8, Im9, Im2, and Barstar.
- one of the first or the second amino acid sequences comprises the sequence as set out in SEQ ID NO: 2 or 3.
- one of the first or the second amino acid sequences comprises the sequence as set out in SEQ ID NO: 4 or 5.
- the first amino acid sequence comprises the sequence as set out in SEQ ID NO:2 and the second amino acid sequence comprises the sequence as set out in SEQ ID NO:4. In one embodiment, the first amino acid sequence comprises the sequence as set out in SEQ ID NO:3 and the second amino acid sequence comprises the sequence as set out in SEQ ID NO:5. In one embodiment, the first amino acid sequence comprises a mutation corresponding to E8K of SEQ ID NO: 1 or 16, and the second amino acid sequence comprises a mutation corresponding to R56D of SEQ ID NO: 1 or 16. In one embodiment, the first amino acid sequence comprises a mutation corresponding to E64K of SEQ ID NO: 1 or 16, and the second amino acid sequence comprises a mutation corresponding to K96D of SEQ ID NO: 1 or 16.
- the first amino acid or the second amino acid sequence further comprises a mutation corresponding to K67E; R82D and/or E97K of SEQ ID NO: 16. In one embodiment, the first amino acid sequence further comprises a mutation corresponding to K67E; R82D and/or E97K of SEQ ID NO: 16.
- the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer. In any embodiment herein E97K may be replaced with E79K.
- the first amino acid sequence comprises a further mutation corresponding to H88K of SEQ ID NO: 16
- the second amino acid sequence comprises a further mutation corresponding to W71D of SEQ ID NO: 16.
- the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer.
- the second amino acid sequence further comprises a deletion of a serine residue at a position corresponding to residue 78 of SEQ ID NO:16.
- the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer.
- the binding molecule is attached to the second amino acid sequence.
- the viral capsid protein is from any virus having a dimeric capsid protein.
- the viral capsid protein is a hepatitis capsid protein.
- the viral hepatitis capsid protein is a Woodchuck hepatitis capsid protein.
- a viral capsid protein heterodimer comprising: - a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
- the first amino acid sequence comprises a least one mutation and the second amino acid sequence comprises at least one mutation.
- the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence.
- a viral capsid protein monomer encoded by an amino acid sequence - wherein the amino acid sequence comprises at least one mutation and, - the at least one mutation modifies the electrostatic interaction between the viral capsid protein monomer and other viral capsid protein monomers to promote heterodimeric association thereof, - wherein optionally the viral capsid protein monomer comprises a binding molecule attached thereto.
- a first or second viral capsid protein monomer encoded by: - a first amino acid sequence comprising at least one mutation, or - a second amino acid sequence comprising at least one mutation respectively; - wherein the at least one mutation in the first and second amino acid sequences are different, and - wherein the mutations in the first or second amino acid sequences promote association of the first monomer comprising the first amino acid sequence with the second monomer comprising the second amino acid sequence, - wherein optionally one of the first or second monomers exclusively comprises a binding molecule attached thereto.
- nucleic acid encoding the viral capsid protein heterodimer according to the second aspect or a viral capsid protein monomer according to the third or fourth aspect of the invention.
- nucleic acid construct encoding the viral capsid protein heterodimer according to the second aspect, the construct comprising: - a first nucleic acid sequence encoding a first viral capsid protein monomer according to the third or fourth aspect; - a second nucleic acid sequence encoding a second viral capsid protein monomer attached to a binding molecule according to the third or fourth aspect.
- a vector comprising one or more of the nucleic acids according to the fifth aspect or the construct according to the sixth aspect of the invention.
- a host cell comprising one or more of the nucleic acids according to the fifth aspect, the construct according to the sixth aspect, or the vector according to the seventh aspect.
- a process of producing a virus-like particle (VLP) in a single host cell comprising: a) Providing a host cell comprising i. a first nucleic acid encoding a first viral capsid protein monomer according to the third or fourth aspects, ii.
- nucleic acids are comprised on one or more vectors.
- the first, second and third nucleic acids are comprised on one vector, suitably a vector according to the seventh aspect.
- a process of producing a virus-like particle comprising; (a) Providing a first host cell comprising: (i) a first nucleic acid encoding a first viral capsid protein monomer according to the third or fourth aspects, (ii) a second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule according to the third or fourth aspects, (b) Providing a second host cell comprising: (i) a third nucleic acid encoding a functional molecule operable to bind to the binding molecule; (c) Culturing the first and second host cells under conditions to express the proteins from the first, second, and third nucleic acids respectively; (d) Recovering the proteins; (e) Mixing the proteins to (i) form functionalised viral capsid protein heterodimers and (ii) form virus-like particles from the functionalised viral capsid protein heterodimers.
- the first nucleic acid encoding the first viral capsid protein monomer and the second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule are comprised on a first vector.
- the third nucleic acid is comprised on a second vector.
- the or each viral capsid protein monomer is from any virus having a dimeric capsid protein.
- the or each viral capsid protein monomer is a hepatitis capsid protein monomer.
- the or each viral hepatitis capsid protein monomer is a Woodchuck hepatitis capsid protein monomer.
- a cell culture comprising one or more host cells according to the eighth aspect of the invention.
- an immunogenic composition comprising the virus-like particle according to the first aspect.
- a virus-like particle (VLP) according to the first aspect, an immunogenic composition according to the twelfth aspect of the invention for use as a medicament.
- a virus-like particle (VLP) according to the first aspect, or an immunogenic composition according to the twelfth aspect for use in the prevention and/or treatment of infectious diseases, cardiovascular diseases, cancer, inflammatory diseases, autoimmune diseases, neurological disease, rheumatological degenerative disease, or addiction.
- a virus-like particle (VLP) according to the first aspect of the invention in research, or in the diagnosis of a disease.
- a method of diagnosing a disease in a subject comprising: (a) Providing a virus like particle according to the first aspect of the invention, wherein the binding molecule is attached to a functional molecule and, wherein the functional molecule is an antibody directed towards an antigen derived from a disease causing agent; (b) Mixing the virus like particle with a suitable sample from the subject; (c) Detecting whether the virus like particle precipitates; (d) Diagnosing the presence of a disease if the virus like particle precipitates.
- the present invention relates to VLPs comprising one or more viral capsid protein heterodimer(s).
- the Heterodimer(s) each comprise a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence.
- the first or the second monomer having a binding molecule attached, exclusively to the first or the second monomer.
- the first and/or second amino acid sequences comprise at least one mutation as compare to the wild-type sequence, such that, the mutation modifies the electrostatic interaction between the first and the second monomers as compared to wild-type sequences. This modification in the electrostatic interaction promotes the formation of a heterodimer comprising the first monomer and the second monomer, and a binding molecule attached exclusively to either the first or the second monomer of the capsid protein heterodimer.
- Many wild-type viral capsid proteins are made up of dimers formed from identical proteins, that is, the monomeric subunits of the homodimers are identical, such as hepatitis capsid proteins. These homodimers self-assemble to form VLPs. The inventors have found that even if one of the monomers of the dimer has a binding molecule attached, homodimers are still formed. This results in homodimers that have a binding molecule attached to each monomer, or a homodimer having no binding molecules attached. This does not solve the problems in the art discussed above in which crowding of functional molecules on the VLP surface inhibits VLP formation and activity.
- the inventors have now found that certain mutations in the amino acid sequences encoding the monomers can overcome the usual formation of homodimers and promote association of monomers wherein a binding molecule is attached to only one of the monomers, thus forming a heterodimer.
- the inventors have achieved this by making targeted mutations in each of the two monomers in order to influence the electrostatic attraction of the monomers, such that they instead preferentially form heterodimers.
- Each heterodimer having only one binding molecule attached exclusively to either the first monomer or the second monomer which may then be functionalised allows for larger functional molecules to be attached to the VLP surface without steric hindrance.
- the viral capsid protein may be a hepatitis B capsid protein (HBc).
- HBc hepatitis B capsid protein
- the viral capsid protein may be a human hepatitis B capsid protein.
- the viral capsid protein may be a Woodchuck hepatitis virus capsid protein (wHv).
- wHv capsid protein confers particular advantages over that of human hepatitis capsid protein.
- wHv can form capsids at cold temperatures (> 60% capsid formation occurs at temperatures as low as 4°C, Kukreja 2014). This can result in an increased yield of VLPs in low temperature and allows the use of bacterial fermentation at lower temperatures which is preferable for production of complex proteins.
- the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See, for example, Current Protocols in Molecular Biology (Ausubel, 2000, Wiley and son Inc, Library of Congress, USA); Molecular Cloning: A Laboratory Manual, Third Edition, (Sambrook et al, 2001, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press); Oligonucleotide Synthesis (M. J. Gait ed., 1984); U.S. Pat. No.4,683,195; Nucleic Acid Hybridization (Harries and Higgins eds.
- identity refers to the sequence similarity between two polymeric molecules, e.g., between two nucleic acid molecules, such as between two DNA molecules, or between two protein molecules. Sequence alignments and determination of sequence identity can be done, e.g., using the Basic Local Alignment Search Tool (BLAST) originally described by Altschul et al. 1990 (J Mol Biol 215: 403-10), such as the "Blast 2 sequences” algorithm described by Tatusova and Madden 1999 (FEMS Microbiol Lett 174: 247-250). Methods for aligning sequences for comparison are well-known in the art. Various programs and alignment algorithms are described in, for example: Smith and Waterman (1981) Adv. Appl.
- BLAST Basic Local Alignment Search Tool
- Nucleic acid sequences with even greater similarity to the reference sequences will show increasing percentage identity when assessed by this method.
- the percentage sequence identity is calculated over the entire length of the sequence.
- a global optimal alignment is suitably found by the Needleman-Wunsch algorithm with the following scoring parameters: Match score: +2, Mismatch score: -3; Gap penalties: gap open 5, gap extension 2.
- the percentage identity of the resulting optimal global alignment is suitably calculated by the ratio of the number of aligned bases to the total length of the alignment, where the alignment length includes both matches and mismatches, multiplied by 100.
- the term “vector” is well known in the art, and as used herein refers to a nucleic acid molecule, e.g.
- a vector is suitably used to transport an inserted nucleic acid molecule into a suitable host cell.
- a vector typically contains all of the necessary elements that permit transcribing the insert nucleic acid molecule, and, preferably, translating the transcript into a polypeptide.
- a vector typically contains all of the necessary elements such that, once the vector is in a host cell, the vector can replicate independently of, or coincidental with, the host chromosomal DNA; several copies of the vector and its inserted nucleic acid molecule may be generated.
- operably linked refers to the arrangement of various nucleic acid elements relative to each other such that the elements are functionally connected and are able to interact with each other in the manner intended.
- therapy refers to reducing, ameliorating or eliminating one or more signs, symptoms, or effects of a disease or condition.
- Treatment thus includes any treatment of a disease in a mammal, particularly in a human, and includes: (a) preventing the disease from occurring in a subject predisposed to the disease or at risk of acquiring the disease but has not yet been diagnosed as having it; (b) inhibiting the disease, i.e., arresting its development; and (c) relieving the disease, i.e., causing regression of the disease.
- the “administration” of an agent to a subject includes any route of introducing or delivering to a subject the agent to perform its intended function.
- Administration can be carried out by any suitable route, including orally, intranasally, intraocularly, ophthalmically, parenterally (intravascularly, intramuscularly, intraperitoneally, or subcutaneously), or topically. Administration includes self-administration and the administration by another.
- the terms “individual,” “subject,” and “patient” are used interchangeably, and refer to any individual subject with a disease or condition in need of therapy, suitably in need of therapy by treatment with the present invention.
- the subject may be a human or animal, for example primate, preferably a human, or another mammal, such as a dog, cat, horse, pig, goat, or bovine, and the like.
- Figure 1 shows: a cartoon diagram representing a VLP capsid and subunits.
- the overall structure of the Woodchuck hepatitis virus (wHv) capsid On the top right: close up (side view and ‘top’ view, respectively) of one so-called “asymmetric unit” composed of two homodimers which in turn are formed each by joining together of one single capsid protein.
- a ribbon diagram of one single “homodimer” composed of two identical monomer protein subunits is shown (side view and ‘top’ view, respectively). In this way, the entire capsid is formed from ordered assembly of a single protein where the central building block is a homodimer.
- FIG. 1 From top to bottom (A) shows a wild type homodimeric capsid subunit (as in Figure 1, but here shown as space-filling model), (B) shows a prior art homodimeric capsid subunit which has been modified to allow two functional molecules (dark grey, on top, facing the surface of the capsid) to be presented by each dimeric subunit on the VLP surface. (C) shows the heterodimeric subunit of the present invention, which allows only one single functional molecule (dark grey, on top, facing the surface of the capsid) to be presented by each heterodimeric subunit on the surface of the VLP. This allows the production of VLP vaccines where only one protein is presented for each of the dimeric capsid subunits.
- Figure 3 shows: a diagrammatic representation of an approach to creating a mutant wHv capsid where one dimer displays a binding protein such as an integrated Im7 protein toward the VLP surface.
- Panel A shows an expression vector that allows simultaneous expression of a wild type wHv protein (ORF2) and a wHv protein carrying the binding protein (an integrated Im7 protein, ORF1), driven by an independent second promoter or through an internal ribosome binding site (RBS).
- ORF2 + ORF2 as shown, or ORF1 + ORF1, which is not illustrated here
- heterodimers ORF1 + ORF2
- Panel B shows a close up of the central ‘tip’ of the wHv dimer (dashed boxed in Panel A). Highlighted are negatively charged (E64) and positively charged (K96) amino acids, respectively, which create electrostatic attraction, contributing to dimer formation of two subunits.
- Figure 4 shows: a preliminary approach to create asymmetrical interaction between two wHv monomers in order to favour formation of heterodimers over homodimer formation.
- Panel A shows a schematic of an exemplary expression vector. Complementary mutations are introduced into the two different monomers: K96E in wHv-Im7 and E64K into wHv, respectively.
- Panel B shows structural prediction using Alphafold2 database (Jumper, J et al. (2021) and Varadi, M et al. (2021), showing that formation of homodimers composed of wHv/wHv and wHv-Im7/wHv-Im7, respectively, is still favoured despite the mutations introduced (shown is the top-ranked of five models, all of which show homodimer formation).
- Figure 5 shows: mutational refinement in the wHv monomers.
- lysine96 (K96) in wHv is replaced by aspartate (D96), instead of glutamate (E96), in contrast to the model shown in figure 4.
- Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96D in wHv- Im7 and E64K into wHv, respectively.
- the mutation of K96 to D96 reduces side-chain bulkiness, shown on the left of panel B.
- the resulting structural prediction using Alphafold2 database (right) shows a predicted mixture between heterodimer and homodimer formation.
- Figure 6 shows: Additional intra-chain ionic stabilization by added mutation of Lysine 67 (K67) to glutamate (E67) in the wHv149 subunit.
- Panel A shows a schematic of an exemplary expression vector showing that, in contrast to figure 5, an additional mutation is introduced into the wHv monomer (ORF2).
- an additional mutation K67E is introduced to the wHv149 subunit.
- This added mutation results in increased heterodimer formation since now E67 interacts with K64 and no longer with D63 within the same protein chain. This, in turn stabilizes the interaction of K64 with D96 of the heterodimeric partner monomer (ORF1). Nonetheless, still a mix of hetero- and homodimer structures is predicted by Alphafold2 (shown in panel B).
- Figure 7 shows: a ribbon diagram of the top rated (by Alphafold2 database) heterodimeric capsid subunit structures obtained with the optimized engineered combination of mutations (wHv149: E8K, E64K, K67E, R82D, E79K, H88K; wHv149-Im7/Bs: R56D, K96D, W71D, summarized in table 2 of the examples), generated using Alphafold2 software, for heterodimers consisting of wHv149/wHv-Im7 (top) and wHv149/wHv-Bs (bottom).
- Figure 8 shows: an overlay of ribbon-diagram structures of wild type wHv with the combination of mutants listed in table 2, generated using the icn3d web-app (NCBI/structure; PDB: 6edj) and Alphafold2 database, respectively.
- the Im7-domain on top of the tip of wHv-Im7 is not shown, as it does not align with wild type wHv.
- the figure illustrates that the enacted mutations to not impact on the backbone structure of the mutated monomer proteins.
- Figure 9 shows: Space filling model generated using icn3d (NCBI/structure; PDB: 6edj), showing the wild type wHv (left) and the modified wHv149/wHv-Im7 (centre) and wHv149/wHv-Barstar (right) heterodimers, respectively, from the front (top) and from the surface (bottom) of the capsid.
- the dashed line (top) indicates the plane of subunit interaction leading to VLP assembly, which remains undisturbed by the integration of binding proteins (Im7 or Barstar).
- the bottom illustration shows that, compared to the wild type (left) the space occupancy of the heterodimers remains essentially unchanged.
- Figure 10 shows: (A) view from the side showing “spike” made from two helices from each monomer protruding to the outside (top arrow) and bottom part mediating interaction with other dimers in the capsid (bottom arrow).
- Panel (B) shows the wild-type amino acid sequences of both the Woodchuck (wild type wHv )(SEQ ID NO: 16) and Human hepatitis B capsid protein (SEQ ID NO: 1) monomers (subunit).
- the bold residues show differences in the amino acid sequences between Woodchuck and Human hepatitis capsid protein monomers and the highlighted amino acids relate to amino acids capable of forming inter-molecular electrostatic bonds.
- Figure 11 shows: the electrostatic interactions occurring between to wild type monomers forming a homodimer in the human hepatitis B virus capsid.
- Left a ribbon diagram of the homodimer where dark circles and light circles highlight the position of electrostatic interactions between subunits in the ‘tip’ (dark) and in the ‘base’ (light) of the homodimer, respectively.
- the diagram shows that in the wild type HBc dimer E64 of each monomer interacts with K96 of the other monomer (top) and that E8 of each monomer interacts with R56 of the other monomer (bottom).
- the figure shows that complementary mutations in the human HBc virus capsid can be enacted analogous to the mutations illustrated for the wHv capsid in order to drive assembly of heterodimers, when one of the monomers has been altered, for example by in-frame fusion of Im7 or Bs (or any other change only implemented in one of the monomers).
- Figure 12 shows: panel A shows a diagram of an exemplary plasmid to produce the monomeric subunits of the VLP capsid.
- wHv capsid heterodimer components are soluble and evenly expressed in E.coli.
- Panel B shows SDS-PAGE gel of showing that both wHv_Im7 and wHv proteins, when driven by individual T7 promoters, can be expressed as soluble proteins in E.coli and are synthesized at approximately even stoichiometric ratios ( black arrows).
- Figure 13 shows: purification of a human Hepatitis B capsid dimer linked to a binding protein via IMAC.
- FIG. 14 shows: SDS PAGE analysis of a discontinuous density gradient run on a heterodimeric VLP decorated with a surface epitope protein (functional molecule). After 6h of 120,000g in the ultracentrifuge, all of the epitope, as well as the two heterodimeric VLP scaffold proteins partition to the 50% and 60% cushion, indicating high – density nanoparticle formation.
- FIG. 15 shows: SDS PAGE analysis of heterodimeric WHcIm7/WHc VLPs decorated with an epitope protein.
- the middle lane shows a cytosolic fraction.
- the right lane shows the same preparation after purification via immobilized metal affinity chromatography (IMAC).
- IMAC immobilized metal affinity chromatography
- VLPs which comprise one or more viral capsid proteins, the viral capsid proteins self-assemble into the VLP, to which functional molecules can then be attached through a binding molecule which may be a binding protein and/or chemical modification as discussed elsewhere herein.
- the viral capsid protein is from any virus having a dimeric capsid protein, suitably from any virus having a dimeric capsid protein formed from identical monomers.
- the viral capsid protein is a hepatitis capsid protein, which may be selected from Hepatitis A, B, C or D.
- the viral capsid protein is a Hepatitis B viral capsid protein (HBc).
- the viral capsid protein is a mammalian Hepatitis viral capsid protein.
- the viral capsid protein is a human Hepatitis B viral capsid protein.
- the viral capsid protein is a Woodchuck (Marmota monax) Hepatitis viral capsid protein (wHv).
- the viral capsid protein is a heterodimer, suitably therefore a heterodimeric viral capsid protein.
- each heterodimeric viral capsid protein is made up of a first monomer and a second monomer.
- each heterodimeric viral capsid protein is attached to a binding molecule.
- each heterodimeric viral capsid protein displays a binding molecule.
- each heterodimeric viral capsid protein is attached to a single binding molecule, suitably therefore one of the monomers of each heterodimeric capsid protein is attached to a binding molecule.
- the first or second monomer that forms the heterodimeric viral capsid protein is modified to display a binding molecule.
- the first or second monomer of the heterodimeric viral capsid protein is fused to binding molecule.
- the first or second monomer of the heterodimeric viral capsid protein is modified to display a binding molecule by fusing the binding molecule to the monomer of the heterodimeric viral capsid protein.
- the first or second monomer of the heterodimeric viral capsid protein is modified to display a binding molecule by inserting the amino acid sequence of the binding molecule into the amino acid sequence of the relevant monomer of the heterodimeric viral capsid protein.
- the amino acid sequence of the binding molecule is inserted into the major immunodominant region of the monomer of the heterodimeric viral capsid protein.
- the binding molecule is fused to the major immunodominant region of the monomer of the heterodimeric viral capsid protein.
- the binding molecule is inserted between amino acids corresponding to residues 76 and 80 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein.
- the binding molecule is inserted between amino acid residues corresponding to 77 and 79 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein.
- the binding molecule is inserted between amino acid residues corresponding to 77 and 78 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein.
- the VLP of the invention comprises one or more linkers.
- the linkers join the amino acid sequence of the monomer in accordance with the third or fourth aspect with the binding molecule.
- the linkers are located between the protein coding sequences of the monomer and the binding molecule.
- the binding molecule is a binding protein and a linker is located at the N and C terminus of the binding protein, suitably to link to the heterodimeric viral capsid protein.
- a further linker may also be located between the functional molecule and the binding protein.
- the further linker may comprise an alpha helix.
- the further linker may comprise or consist of the sequence of LAEAAAKEAAAKEAAKAA (SEQ ID NO: 33).
- each linker may also be located between the binding molecule and a further binding molecule.
- each linker is between 5 to 50 amino acids in length.
- each linker is 5, 10, 15, 20, 21, 25, 30, 35, 40 amino acids in length.
- each linker is 9, 10 or 11 amino acids in length.
- each linker comprises the sequence: GGGGSGGGGS (SEQ ID NO:9) or GGGGGSGGGGS (SEQ ID NO: 10), SGGGSSGSG (SEQ ID NO: 11), KAAAEKAAAE (SEQ ID NO: 14), GGKAAAE (SEQ ID NO:15) or LAEAAAKEAAAKEAAKAA (SEQ ID NO: 33).
- the linkers used to link the amino acid sequence of the monomer in accordance with the third or fourth aspect with the binding molecule comprise either KAAAEKAAAE (SEQ ID NO: 14) or GGKAAAE (SEQ ID NO:15).
- the first linker is KAAAEKAAAE (SEQ ID NO: 14) which links the N terminus of the binding protein to the heterodimeric viral capsid protein.
- the second linker is GGKAAAE (SEQ ID NO:15) which links the C terminus of the binding protein to the heterodimeric viral capsid protein.
- the binding molecule is a binding protein.
- the viral capsid protein is a hepatitis B capsid protein and may comprise the amino acid sequence of SEQ ID NO: 2, suitably forming the first monomer.
- the viral capsid protein is a hepatitis B capsid protein and may comprise the amino acid sequence of SEQ ID NO: 4, suitably forming the second monomer.
- the heterodimeric viral capsid protein is a heterodimeric hepatitis B capsid protein and comprises the amino acid sequence of SEQ ID NO:2, and the amino acid sequence of SEQ ID NO:4.
- the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise a truncated viral capsid protein.
- one or both monomers of the woodchuck hepatitis capsid protein are truncated.
- both monomers of the woodchuck hepatitis capsid protein are truncated, suitably comprising a truncation of the C terminus.
- the C- terminus is removed from the or each monomer of the woodchuck hepatitis capsid protein.
- the or each woodchuck hepatitis capsid protein monomer comprises amino acids 1 to 149 of the wild type woodchuck hepatitis capsid protein monomer sequence.
- references herein to woodchuck hepatitis capsid protein, wHv capsid protein, or wHv149 capsid protein are to the truncated form.
- a truncated monomer sequence is shown in SEQ ID NO:16, and is additionally shown in SEQ ID NO:3 and 5 with the mutations described herein.
- the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise the amino acid sequence of SEQ ID NO: 3, suitably forming the first monomer.
- the sequence may comprise mutations at positions K67, R82, E97, and/or H88, which are described further herein.
- E97 may be replaced with E79.
- the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise the amino acid sequence of SEQ ID NO: 5, suitably forming the second monomer.
- the sequence may further comprise mutation at position W71 and/or a deletion at S78, which are described further herein.
- the heterodimeric viral capsid protein is a heterodimeric woodchuck hepatitis capsid protein and comprises the amino acid sequence of SEQ ID NO:3, and the amino acid sequence of SEQ ID NO:5.
- SEQ ID NO:3 further comprises the additional mutations at positions K67, R82, E97, and/or H88, which are described further herein
- SEQ ID NO:5 further comprises the additional mutation at position W71 and/or a deletion at S78, which are described further herein.
- E97 may be replaced with E79.
- Modified Monomers and Heterodimers Suitably the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises a mutation as compared to the wild- type sequence.
- a mutation may be an insertion, deletion or substitution.
- the first amino acid sequence encoding the first monomer and the second amino acid sequence encoding the second monomer each comprise at least one mutation as compared to the wild-type sequence.
- the or each mutation in the first amino acid sequence is different to the or each mutation in the second amino acid sequence.
- the amino acid sequence of the first monomer is different to the amino acid sequence of the second monomer.
- a mutation in the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer modifies the electrostatic interactions between the monomers as compared to the wild type sequences.
- the or each mutation modifies the electrostatic interaction between the monomers to promote association of the monomers to form heterodimers.
- the viral capsid protein may comprise further modifications. Suitable modifications may include: insertions, deletions, substituents, truncations, reversals, repeats, or the like in the amino acid sequence encoding the protein.
- the viral capsid protein may comprise further modifications in the major immunodominant region. Suitably such modifications aid the insertion of the binding molecule into the viral capsid protein.
- the viral capsid protein may comprise amino acid deletions.
- the viral capsid protein may comprise amino acid deletions in the major immunodominant region.
- the viral capsid protein may comprise amino acid deletions in the major immunodominant region which remove negatively charged amino acids.
- the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprise at least one mutation.
- the first amino acid sequence encoding the first monomer and the second amino acid sequence encoding the second monomer each comprise at least one mutation.
- the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence.
- a mutation to an amino acid sequence can include insertions, deletions or substitutions.
- the first and/or second amino acid sequences may include further modifications such as truncations, reversals, repeats, or the like.
- the first and/or second amino acid sequences include at least one mutation (for example, a substitution, addition, or deletion) as compared to the wild type sequence.
- the first and/or second amino acid sequences include at least 2 mutations, at least 3 mutations, at least 4 mutations, at least 5 mutations, at least 6 mutations, at least 7 mutations, at least 8 mutations, at least 9 mutations, at least 10 mutations, at least 11 mutations, at least 12 mutations, at least 13 mutations, at least 14 mutations, at least 15 mutations, at least 16 mutations, at least 17 mutations, at least 18 mutations, at least 19 mutations, at least 20 mutations, at least 21 mutations, at least 22 mutations, at least 23 mutations, at least 24 mutations, at least 25 mutations, at least 26 mutations, at least 27 mutations, at least 28 mutations, at least 29 mutations or at least 30 mutations as compared to the wild type sequence.
- the or each mutation is present in the alpha helices of the first or second monomers.
- the or each mutation may be an amino acid substitution.
- the or each mutation may be an amino acid deletion.
- the amino acid substitution(s) modify the electrostatic interactions between the first and second monomers to promote association of the first and second monomers.
- a substitution in the first and/or second amino acid sequence may be a substitution of a negatively charged amino acid with a positively charged amino acid.
- a substitution in the first and/or second amino acid sequence may be a substitution of a positively charged amino acid with a negatively charged amino acid.
- a mutation in the first amino acid sequence may be complementary to a mutation of the second amino acid sequence.
- the mutations in the first and second amino acid sequences may be regarded as pairs, suitably as complementary pairs.
- the first and second amino acid sequences may comprise at least one pair of complementary mutations.
- a pair of complementary mutations may comprise at least one positively charged amino acid and at least one negatively charged amino acid.
- the electrostatic interaction may be an electrostatic attraction or repulsion.
- an electrostatic attraction is an interaction that occurs between molecules that have opposite partial charges.
- these molecules do not form a covalent bond, but interact with each other if they are in close proximity to their opposite partial charge.
- an electrostatic attraction the molecules are pulled together by the attraction between the opposite partial charges.
- an electrostatic repulsion is an interaction that occurs between molecules that have the same partial charge. These molecules do not form a covalent bond but interact with each other if they are in close proximity to the same partial charge. In the case of electrostatic repulsion, the molecules are pushed away from each other by the repulsion of the same partial charge.
- the molecules described above relate to monomers of the present invention.
- a mutation of the first amino acid sequence may form an electrostatic interaction with a mutation in the second amino acid sequence.
- a mutation of the first amino acid sequence may form an electrostatic interaction with a non-mutated residue of the second amino acid sequence.
- a mutation of the second amino acid sequence may form an electrostatic interaction with a non-mutated residue of the first amino acid sequence.
- an electrostatic interaction may be formed between a positively charged amino acid and a negatively charged amino acid.
- the electrostatic interaction may be an attraction.
- the first and second amino acid sequences may form an electrostatic attraction between a positively charged amino acid and a negatively charged amino acid.
- the positively charged amino acid is comprised on the first amino acid sequence and the negatively charged amino acid is comprised on the second amino acid sequence.
- an electrostatic interaction may be a repulsion.
- an electrostatic repulsion may be formed between two or more amino acids having a similar charge.
- an electrostatic repulsion may be formed between two or more amino acids having a positive charge.
- an electrostatic repulsion may be formed between two or more amino acids having a negative charge.
- the heterodimer comprises a binding molecule attached to either the first or second monomer.
- the heterodimer comprises a mutation in the first amino acid sequence encoding the first monomer and a different mutation in the second amino acid sequence encoding the second monomer and a binding molecule attached to either the first or second monomer.
- the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 8 of SEQ ID NO: 1 or 16, or a position corresponding thereto.
- the mutation may be an amino acid substitution at residue 8 of SEQ ID NO: 1 or 16, or a position corresponding thereto.
- the amino acid substitution may be from a glutamate to a lysine at residue 8 (E8K) of SEQ ID NO:1 or 16, or a position corresponding thereto.
- the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 64 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the mutation may be an amino acid substitution at residue 64 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the amino acid substitution may be from a glutamate to a lysine at residue 64 (E64K) of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid comprise an amino acid substitution from a glutamate to a lysine at residue 8 (E8K) and an amino acid substitution from a glutamate to a lysine at residue 64 (E64K) of SEQ ID NO:1 or 16, or positions corresponding thereto.
- the first or second monomer may be encoded by an amino acid sequence comprising a sequence set out in SEQ ID NO: 2 or SEQ ID NO: 3, or a sequence having at least 60 %, at least 70%, at least 80%, at least 90%, at least 95 %, at least 96%, at least 97%, at least 98%, at least 99% sequence identity thereto and comprising the mutations defined above.
- the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 56 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the mutation may be an amino acid substitution at residue 56 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the amino acid substitution may be from an arginine to an aspartate at residue 56 (R56D) of SEQ ID NO:1 or 16 or a position corresponding thereto.
- the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 96 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the mutation may be an amino acid substitution at residue 96 of SEQ ID NO: 1 or 16 or a position corresponding thereto.
- the amino acid substitution may be from a lysine to an aspartate at residue 96(K96D) of SEQ ID NO:1 or 16 or a position corresponding thereto.
- the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid comprise an amino acid substitution from an arginine to an aspartate at residue 56 (R56D) and an amino acid substitution from a lysine to an aspartate at residue 96 (K96D) of SEQ ID NO:1 or 16 or positions corresponding thereto.
- the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprises amino acid substitutions R56D and K96D in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and further comprises a binding molecule attached to the monomer.
- the first or second monomer may be encoded by an amino acid sequence comprising a sequence set out in SEQ ID NO: 4 or SEQ ID NO: 5 or a sequence having at least at least 60 %, at least 70%, at least 80%, at least 90%, at least 95 %, at least 96%, at least 97%, at least 98%, at least 99% sequence identity thereto and comprising the mutations defined above.
- a capsid protein heterodimer comprises a first monomer comprising amino acid substitutions E8K and E64K in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and a second monomer comprising amino acid substitutions corresponding to R56D and K96D in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and further comprising a binding molecule attached to the second monomer.
- the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 2.
- the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 4.
- the capsid protein heterodimer comprises a first monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 2 and a second monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 4.
- the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 3.
- the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 5.
- the capsid protein heterodimer comprises a first monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 3 and a second monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 5.
- the capsid protein heterodimer comprises a first or second monomer encoded by an amino acid sequence comprising one or more mutations selected from the list consisting of: E64K, K67E, R56D, H88W, K96D, W71H, W71D, E8K, E8R, K96E, S78E, S78R, D64K, R82D, E67K, H88K, and E97K, and/or optionally a deletion at S78 of SEQ ID NO:1 or 16, or at a position corresponding thereto, or combinations thereof.
- E97K may be replaced with E79K.
- the one or more mutations may be selected from the list consisting of: E8K; E64K; K67E; R82D;E97K; and H88K of SEQ ID NO:1 or 16, or at a position corresponding thereto.
- the one or more mutations may be selected from the list: E8K and E64K of SEQ ID NO:1 or at a position corresponding thereto.
- the one or more mutations may be selected from the list: E8K; E64K; K67E; R82D;E97K; and H88K of SEQ ID NO:16, or at a position corresponding thereto.
- E97K may be replaced with E79K.
- the mutations may be selected from one or more of the following groups: (i) E8K and E64K and K67E; (ii) E8K and R82D and E97K; (iii) E8K and H88K; (iv) E64K and K67E; and R82D and E97K; (v) E64K and K67E; and H88K; (vi) R82D and E97K; and H88K; (vii) E8K; E64K and K67E; and R82D and E97K; (viii) E8K; E64K and K67E; and H88K; (ix) E8K; R82D and E97K; and H88K; (x) E64K and K67E; R82D and E97K; and H88K; or (xi) E8K; E64K and K67E; R82D and E97K; and H88K; Of SEQ ID NO:1 or 16, or at positions corresponding thereto.
- E97K may be replaced with E79K.
- mutations are present in the first monomer.
- the one or more mutations may be selected from the list consisting of: R56D; K96D; W71D and a deletion at S78 of SEQ ID NO:1 or 16, or at a position corresponding thereto.
- the one or more mutations may be selected from the list: R56D and K96D of SEQ ID NO:1 or at a position corresponding thereto.
- the one or more mutations may be selected from the list: R56D; K96D; W71D; and a deletion of S78 of SEQ ID NO:16 or at a position corresponding thereto.
- the mutations may be selected from: (i) R56D and K96D; (ii) R56D and W71D; (iii) R56D and a deletion at S78; (iv) K96D and W71D; (v) K96D and a deletion at S78; (vi) W71D and a deletion at S78; (vii) R56D and K96D and W71D; (viii) R56D and K96D and a deletion at S78; (ix) R56D and W71D and a deletion at S78; (x) K96D and W71D and a deletion at S78; (xi) R56D and K96D and W71D and a deletion at S78; Of SEQ ID NO:1 or 16, or at positions corresponding thereto.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at E8K of SEQ ID NO:1 or 16, or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a substitution at R56D of SEQ ID NO:1 or 16 or at a position corresponding thereto, the second monomer having a binding molecule attached.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at E64K of SEQ ID NO:1 or 16, or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a substitution at K96D of SEQ ID NO:1 or 16, or at a position corresponding thereto, the second monomer having a binding molecule attached.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at H88K of SEQ ID NO:16 or at a position corresponding thereto and a second monomer encoded by an amino acid sequence comprising a substitution at W71D of SEQ ID NO:16 or at a position corresponding thereto, the second monomer having a binding molecule attached.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution corresponding to K67E of SEQ ID NO:16 or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a wild type sequence (corresponding to SEQ ID NO:16), the second monomer having a binding molecule attached.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution corresponding to R82D and E97K of SEQ ID NO:16 or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a wild type sequence (corresponding to SEQ ID NO:16), the second monomer having a binding molecule attached.
- E97K may be replaced with E79K.
- the substitutions corresponding to R82D and E97K result in an Alpha-helix charge inversion.
- the heterodimer comprises a first monomer encoded by a wild type amino acid sequence (corresponding to SEQ ID NO:1 or 16) and a second monomer encoded by an amino acid sequence comprising a deletion of a serine residue at a position corresponding to residue 78 of SEQ ID NO:16 or at a position corresponding thereto, the second monomer having a binding molecule attached.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K; E64K; K67E; R82D; E97K; and H88K of SEQ ID NO:1 or 16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D; K96D; W71D; and a deletion at S78 of SEQ ID NO:1 or 16, or at positions corresponding thereto.
- the second monomer having a binding molecule attached thereto.
- E97K may be replaced with E79K.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K and E64K of SEQ ID NO:1 or 16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D and K96D of SEQ ID NO:1 or 16, or at positions corresponding thereto.
- the second monomer having a binding molecule attached thereto.
- the heterodimer may be a hepatitis B heterodimer or a woodchuck hepatitis heterodimer.
- the heterodimeric viral capsid protein when the heterodimeric viral capsid protein is from a woodchuck hepatitis virus, one or more further mutations may be present in the first and/or second monomers.
- the first monomer further comprises one or more further mutations selected from: K67E; R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto.
- E97K may be replaced with E79K.
- the first monomer comprises the further mutation K67E, and optionally the further mutations R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto.
- E97K may be replaced with E79K.
- the second monomer further comprises one or more further mutations selected from: W71D and a deletion of residue S78 of SEQ ID NO:16, or at positions corresponding thereto.
- the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K; E64K; K67E; R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D; K96D; W71D; and a deletion of S78 of SEQ ID NO:16, or at positions corresponding thereto.
- E97K may be replaced with E79K.
- the heterodimer is a woodchuck hepatitis heterodimer.
- the heterodimer is a hepatitis B capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:1 and comprising mutations E8K and E64K; and a second monomer according to SEQ ID NO:1 and comprising mutations R56D and K96D.
- the second monomer having a binding molecule attached thereto.
- the heterodimer is a woodchuck hepatitis capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:16 and comprising mutations E8K; E64K; and K67E, and a second monomer according to SEQ ID NO:16 and comprising mutations R56D; and K96D.
- the second monomer having a binding molecule attached thereto.
- the heterodimer is a woodchuck hepatitis capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:16 and comprising mutations E8K; E64K; K67E; R82D; E97K; and H88K, and a second monomer according to SEQ ID NO:16 and comprising mutations R56D; K96D; W71D and a deletion of S78.
- E97K may be replaced with E79K.
- the second monomer having a binding molecule attached thereto.
- the heterodimer may comprise any combination of mutations described above.
- a mutation to the first and/or second amino acid sequence may modify the electrostatic interactions between the first monomer an the second monomer.
- the electrostatic interactions between the first monomer and the second monomer may be modified to increase repulsions and/or attractions between the monomers.
- a mutation of the first and/or second amino acid sequence may modify electrostatic interactions to increase repulsion between the first and the second amino acid sequences.
- a mutation of the first and/or second amino acid sequence may modify electrostatic interactions to increase attraction between the first and the second amino acid sequences.
- the mutations described on any one of the embodiments of the present invention modify the electrostatic interactions between the first and the second monomers to promote association thereof.
- the heterodimer comprises a single binding molecule attached exclusively to either the first or the second monomer.
- the binding molecule is a binding protein.
- any of the embodiments described herein may comprise further mutations or modifications to the amino acid sequences encoding the first and/or the second monomers.
- Binding molecule The present invention is based on the use of VLPs to present a functional molecule on its surface, for example to the immune system.
- the invention relates to VLPs which make use of a binding molecule which can attach a functional molecule, typically an antigen, to the viral capsid proteins forming the VLP.
- an a first capsid protein monomer encoded by a first amino acid sequence and a second capsid protein monomer encoded by a second amino acid sequence comprising a binding molecule attached thereto.
- each capsid protein heterodimer comprises a binding molecule attached exclusively to either the first or the second monomer.
- each capsid protein heterodimer is able to attach a functional molecule via the binding molecule.
- the binding molecule is operable to bind to a functional molecule.
- the binding molecule may be any molecule that can be attached to the first or second monomer for the purposes of attaching to a functional molecule, such as an antigen.
- the binding molecule is a binding protein.
- the binding protein has low homology to proteins of the subjects which may be treated with the VLP.
- the binding protein has low homology to human proteins.
- the binding protein has low homology with the tertiary structure of any human proteins.
- low homology with human proteins means that the binding protein itself is less likely to stimulate an off-target immune reaction.
- the binding protein does not contain any disulphide bonds.
- the binding protein is not glycosylated.
- the binding protein is relatively small in size.
- the binding protein comprises a relatively short sequence length.
- the binding protein comprises a length of between 84 – 134 amino acids.
- the binding protein comprises a length of less than 135 amino acids.
- the lack of disulphide bonds, lack of glycosylation, and small size means that the binding protein is easier to produce in bacterial cells such as E.coli.
- the binding protein comprises a bacterial toxin or a bacterial toxin inhibitor or antitoxin.
- the binding protein of the VLP is a bacterial toxin inhibitor.
- the binding protein is a bacterial toxin inhibitor called Im7.
- the binding protein is a bacterial toxin inhibitor called Barstar.
- the binding protein is part of a pair of binding proteins.
- the binding protein attached to the first or second monomer is a first binding protein operable to bind to a second binding protein.
- the second binding protein is attached to the functional molecule.
- a linker as explained hereinabove, which may be an alpha helix linker.
- the first binding protein is a bacterial toxin or a bacterial toxin inhibitor or antitoxin.
- the first binding protein of the VLP is a bacterial toxin inhibitor which may be selected from Im7 or Barstar. Alternatively the first binding protein may be SpyCatcher.
- the first binding protein may be a nanobody.
- the second binding protein is a bacterial toxin which may be selected from ColE7, or Barnase.
- the second binding protein may be SpyTag.
- the second binding protein may be a cognate target protein.
- the first binding protein is Barstar and the second binding protein is Barnase.
- the first binding protein is Im7 and the second binding protein is ColE7.
- the first binding protein is Im7 and the second binding protein is ColE7.
- this pair of binding proteins allows purification of VLPs by anion chromatography.
- the first binding protein is a nanobody and the second binding protein is a cognate target protein.
- the first binding protein is attached exclusively to either the first or second monomer of the invention and the second binding protein is attached to a functional molecule.
- the first binding protein binds to the second binding protein.
- a binding molecule may comprise the SpyTag/SpyCatcher system.
- the binding protein may be the wild-type protein, or it may be modified.
- the binding protein may be modified to improve its function as a binding protein in the context of the VLP of the invention. Suitable modifications may include: insertions, deletions, substituents, truncations, reversals, repeats, or the like in the amino acid sequence encoding the protein.
- any property of the bacterial toxin binding protein detrimental to either the host cell and / or the recipient organism intended for VLP administration is neutralized by targeted modifications.
- the or each binding protein may comprise one or more amino acid substitutions.
- the amino acid substitutions may increase the binding affinity between the or each binding protein and the functional molecule .
- the amino acid substitutions may remove undesirable disulphide bonds from a given binding protein.
- the or each binding protein may comprise one or more amino acid substitutions.
- the amino acid sequence of Barstar comprises one or more of the following substitutions: C40A, C82A, and I87E.
- the amino acid sequence of Barstar may comprise all of the following substitutions: C40A, C82A, and I87E.
- the amino acid sequence of Barstar comprises: KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDALTGWVEYPLVLEWRQFEQSKQ LTENGAESVLQVFREAKAEGADITIELS (SEQ ID NO: 6)
- the amino acid sequence of Im7 comprises the following substitution: F41L.
- the amino acid sequence of Im7 comprises: ELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRDDS PEGIVKEIKEWRAANGKPGFKQ (SEQ ID NO: 7).
- the second binding protein may also comprise one or more amino acid substitutions.
- the amino acid substitutions in the amino acid sequence of the second binding protein may increase the negative charge of the second binding protein.
- the amino acid sequence of Barnase comprises the following substitution: E73W.
- the amino acid sequence of Barnase comprises: AQVINTFDGVADYLQTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREG KLPGKSGRTWRWADINYTSGFRNSDRILYSSDWLIYKTTDHYQTFTKIR (SEQ ID NO: 12).
- the amino acid sequence of ColE7 comprises one or more of the following substitutions: Arg538Ala, Glu542Ala, and His569Ala.
- the amino acid sequence of ColE7 may comprise all of the following substitutions: Arg538Ala, Glu542Ala, and His569Ala.
- the amino acid sequence of ColE7 comprises: ESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEV SKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKATSFALHHEKPISQNGGVYDMDNISV VTPKRAIDIHRGKS (SEQ ID NO: 13).
- the first or second binding proteins may be truncated.
- the second binding protein is truncated.
- the whole or a part of the ColE7 protein may be used as the second binding protein.
- Suitably only a part of the ColE7 protein is used as the second binding protein.
- the ColE7 protein is truncated, suitably so that it only comprises the catalytic domain of ColE7.
- the second binding protein comprises the catalytic domain of ColE7.
- the second binding protein is Barnase
- the whole or a part of the Barnase protein may be used as the second binding protein.
- the barnase protein is truncated, suitably so that it only comprises the catalytic domain of Barnase.
- the second binding protein comprises the catalytic domain of Barnase.
- the binding molecule may comprise additional modifications.
- the binding molecule may comprise chemical modification.
- Suitably Im7 may comprise chemical modification.
- the chemical modification is capable of binding to a functional molecule.
- the chemical modification is capable of covalently binding to a functional molecule.
- the functional molecule bound to the chemical modification may be a fluorescent molecule.
- Other suitable functional molecules are described elsewhere herein.
- the chemical is attached to the first binding protein by non-covalent binding.
- the chemical is attached to the first binding protein by electrostatic and/or hydrophobic bonding.
- chemical modifications include alkanes having an amine group.
- the alkane may have any chain length.
- the alkane is a lower alkane.
- the alkane may have a chain length of between 1 and 10 carbons.
- the alkane may have a chain length of between 4 and 8 carbons.
- the alkane may be branched.
- the length of the carbon chain and the length of branched substitutions on the amine group are chosen such as to allow either irreversible attachment to the protein or reversible attachment, dependent on the desired application.
- the chemical is attached irreversibly to the first binding protein.
- the alkane has eight carbon atoms and a terminal nitrogen (octylamine).
- the chemical is attached reversibly to the first binding protein.
- the alkane has 4 carbon atoms in a branched structure (diethylethanolamine).
- the first binding protein may be chemically modified at one or more sites, suitably at one or more amino acids.
- the first binding protein is chemically modified at one amino acid.
- the first binding protein is chemically modified with DEAE.
- the first binding protein is chemically modified with octylamine.
- the first binding protein may be Im7.
- modification with DEAE allows the first binding protein to be purified. Suitably purification by chromatography.
- modification with octylamine allows the first binding protein to directly bind to a functional molecule.
- the chemical modification of the binding protein occurs within the host cell.
- the chemical modification of the binding protein occurs outside of the host cell.
- the chemical modification of the binding protein occurs outside of the host cell.
- a chemical reaction Suitably by means of a non-enzymatically catalyzed non-covalent attachment.
- Functional Molecule The present invention relates to VLPs which are able to display various functional molecules on their surface by virtue of a binding molecule which may be a binding protein attached exclusively to the first or second monomer of the heterodimeric capsid proteins.
- each heterodimer has a single binding molecule attached.
- the binding molecule is a binding protein.
- the binding molecule is attached to at least one functional molecule.
- the binding molecule may be attached to more than one functional molecule.
- the functional molecule(s) may be of the same type or different types.
- the binding molecule may be attached to any combination of one or more antigens, antigen binding proteins, or fluorescent molecules.
- the binding protein is attached to one functional molecule.
- the binding protein may be attached to a functional molecule by a chemical modification, or alternatively via a second binding protein wherein in such an embodiment the binding protein attached to the capsid protein is the first binding protein.
- the binding protein may comprise a chemical modification.
- a chemical modification may be attached to a functional molecule.
- the functional molecule is attached to the binding protein via the chemical modification.
- the functional molecule is a non- protein antigen or epitope thereof, or a fluorescent molecule.
- each viral capsid protein is attached to a binding protein
- each chemical modification is attached to a functional molecule.
- the functional molecule may be attached to the second binding protein.
- the second binding protein is Barnase.
- the second binding protein is ColE7.
- the second binding protein may be attached to the first binding protein, the first binding protein being exclusively attached to either the first or the second monomer of the invention.
- the first binding protein is Barstar.
- the first binding protein is Im7.
- the functional molecule is attached to Barnase.
- Barnase may be bound to Barstar.
- the functional molecule is attached to ColE7.
- ColE7 may be bound to Im7.
- the binding proteins are directly or indirectly attached to the viral capsid protein and to the functional molecule.
- the binding proteins are directly attached to the viral capsid protein and in some cases directly attached to the functional molecule.
- the binding protein may be fused to the first or second monomer of the viral capsid protein heterodimer as described hereinabove.
- the functional molecule may be fused to the binding protein.
- the binding protein may be indirectly attached to the functional molecule via a second binding protein attached to the first binding protein.
- the binding protein may be indirectly attached to the functional molecule.
- Suitable functional molecules may include: protein or non-protein antigens; antigen binding proteins such as antibodies or binding fragments thereof, antibody mimetics, and aptamers; fluorescent molecules.
- the functional molecule may be modified, suitably by the introduction of one or more mutations to change its characteristics, properties or biological effect.
- the functional molecule many comprise one or more substitution mutations for example.
- the functional molecule is an antigen binding molecule such as an antibody.
- the second binding protein is a generic antibody binding protein.
- the antibody binding protein is selected from protein G, protein A, protein AG, and streptavidin.
- an antigen binding protein such as an antibody for use as a functional molecule is capable of binding an antigen of interest.
- an antigen binding protein such as an antibody as a functional molecule produces a VLP which is capable of binding to an antigen.
- Suitable antigens may include the whole or part of an antigen.
- the antigen may be a subunit or monomer of an antigen.
- the functional molecule may be an epitope of an antigen.
- an antigen as a functional molecule produces a VLP which is capable of stimulating an immune response to the antigen.
- the antigen may be a protein or non-protein antigen. Suitable non-protein antigens may include sugars, lipids or carbohydrates, or small molecule chemicals to which an immune response is desired, or who need to be detected, such as nicotine, cocaine, or other exogenous toxins.
- the antigen may be a self or non-self antigen relative to the subject intended to be treated with the VLP.
- the antigen may be a human or non-human antigen.
- the antigen may be derived from the causative agent in a disease or disorder.
- the causative agent may be self or non-self.
- a non-self causative agent may be an infectious agent.
- the antigen may be derived from an infectious agent such as a virus, bacterium, fungus, protozoan, archaeon.
- the antigen may be derived from a virus selected from: Adeno-associated virus, Chikungunya virus, Crimean-Congo hemorrhagic fever virus, Dengue virus, Ebolavirus, Echovirus, Encephalomyocarditis virus, Epstein-Barr virus, Hantaan virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E virus, Human adenovirus, Human astrovirus, Human coronavirus, Human cytomegalovirus, Human enterovirus, Human herpesvirus, Human immunodeficiency virus, Human papillomavirus, Human parainfluenza, Human respiratory syncytial virus, Human rhinovirus, Human torovirus, Influenza A virus, Influenza B virus, Influenza C virus, Japanese encephalitis virus, Polyomavirus, Kunjin virus, Lassa virus, Measles virus, Molluscum contagiosum virus, Mumps virus,
- the antigen may be derived from a bacterium selected from: Actinomyces israelii, Bacillus anthracis, Bacillus cereus, Bartonella henselae, Bartonella quintana, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Borrelia garinii, Borrelia afzelii, Borrelia recurrentis, Brucella abortus, Brucella canis, Brucella melitensis, Brucella suis, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Chlamydophila psittaci, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium tetani, Corynebacterium diphtheriae, Enterococcus faecalis, Enterococcus faecium
- enterica Salmonella typhi, Shigella sonnei, Shigella dysenteriae, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus saprophyticus, Streptococcus agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus viridans, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae, Yersinia pestis, Yersinia enterocolitica, Yersinia pseudotuberculosis .
- the antigen is derived from a coronavirus, suitably from SARS-CoV-2.
- the antigen is the whole or part of a spike protein derived from SARS-CoV-2, or the whole or part of a nucleocapsid protein derived from SARS-CoV-2.
- the functional molecule is part of a spike protein derived from SARS-CoV-2.
- the receptor binding domain In another embodiment, therefore, the functional molecule is part of a nucleocapsid protein derived from SARS-CoV-2.
- the C-terminus Suitably a self-causative agent may be a non-infectious agent.
- the antigen may be derived from a non-infectious agent such as an inflammatory molecule, or a molecule causing degenerative changes in nervous (such as beta-amyloid), cartilage or bone tissue, or a molecule causing worsening of a neoplastic disease.
- a non-infectious agent such as an inflammatory molecule, or a molecule causing degenerative changes in nervous (such as beta-amyloid), cartilage or bone tissue, or a molecule causing worsening of a neoplastic disease.
- the antigen may be an inflammatory molecule or a molecule causing degenerative changes or a molecule conducive to a neoplastic disease which is a causative agent in a disease or disorder.
- the molecule may operate in humans or in non-human mammals.
- the molecule may cause a disease or disorder in a specific species.
- Suitable inflammatory molecules may include chemokines or cytokines, or proteases.
- Suitable chemokines or cytokines may include: interleukins, tumour necrosis factors, interferons, and colony stimulating factors. Suitable chemokines or cytokines may include: IL1, IL2, Il3, Il4, IL5, Il6, Il7, IL8, IL9, IL10, IL11, IL12, IL13, IL17, IL33, TNF ⁇ , TNF ⁇ , IFN ⁇ , IFN ⁇ , G-CSF, GM-CSF, M-CSF, erythropoietin, and TGF ⁇ .
- Suitable proteases may include ADAMTS4, ADAMTS5.
- the antigen is an interleukin or a protease.
- the antigen is IL13, IL17 or IL33 or a fragment thereof.
- the functional molecule is IL13, IL17 or IL33.
- the IL13 is modified.
- the IL13 may comprise one or more substitution mutations.
- the or each modification may reduce, limit, or change the effect of IL3, suitably it may reduce the transactivation of receptors.
- Suitable molecules which case degenerative changes in nervous tissue or worsening of neoplastic diseases may include: ADAMTS4/5, angiogenesis factors, or factors allowing escape of tumours such as galectin proteins. References to any antigens herein may equally refer to an epitope of said antigen.
- an antigen of interest may be any of those listed above.
- an antigen of interest may be from a disease causing agent such as a virus, bacterium, fungus, protozoan, or archaeon.
- an antigen of interest may be from a non-infectious agent, for example, a cell surface receptor.
- the antibody may be capable of binding to an antigen from a virus, bacterium, fungus, protozoan, archaeon as listed above.
- Suitable viruses may be selected from, for example: Adeno-associated virus, Chikungunya virus, Crimean-Congo hemorrhagic fever virus, Dengue virus, Ebolavirus, Echovirus, Encephalomyocarditis virus, Epstein-Barr virus, Hantaan virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E virus, Human adenovirus, Human astrovirus, Human coronavirus, Human cytomegalovirus, Human enterovirus, Human herpesvirus, Human immunodeficiency virus, Human papillomavirus, Human parainfluenza, Human respiratory syncytial virus, Human rhinovirus, Human torovirus, Influenza A virus, Influenza B virus, Influenza C virus, Japanese encephalitis virus, Polyomavirus, Kunjin virus, Lassa virus, Measles virus, Molluscum contagiosum virus, Mumps virus, Nipah virus
- the functional molecule is an antibody capable of binding to an antigen from a coronavirus.
- the antibody is capable of binding to an antigen from SARS-CoV-2.
- Suitable bacteria may be selected from: Actinomyces israelii, Bacillus anthracis, Bacillus cereus, Bartonella henselae, Bartonella quintana, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Borrelia garinii, Borrelia afzelii, Borrelia recurrentis, Brucella abortus, Brucella canis, Brucella melitensis, Brucella suis, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Chlamydophila psittaci, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium tet
- the VLP may be targeted to a particular virus.
- the VLP may therefore be used for detecting the presence of a virus. Further details on this use are provided elsewhere.
- the antigen binding protein such as an antibody may be capable of binding to an antigen from a cell surface receptor.
- the cell surface receptor may be an ion-channel linked receptor, a G-protein coupled receptor, or an enzyme-linked receptor.
- the cell surface receptor is selected from: 5-HT receptor, nAch-receptor, Zinc-activated ion channel, GABAA receptor, Wnt-family member receptors, co-receptors contained in lipid rafts, T-cell and T-cell co-receptors, B-cell receptors and B-cell costimulatory molecules, Glycine receptor, AMPA receptor, Kainate receptor, NMDA receptor, Glutamate receptor, ATP-gated channel, PIP2 gated channel, Erb receptor, GDNF receptor, NP receptor, trk receptor, toll-like receptor, GABAB receptor, GBPCR class A, B, C, D, E, or F.
- the VLP may be targeted to a particular cell. Suitably targeted to bind to a particular cell. Suitably the VLP may be used to deliver cargo to a cell. Further details on this use are provided elsewhere.
- Suitable antibodies may include IgG, IgM, IgE, IgA, IgD antibodies. Suitably, the antibody is an IgG antibody.
- IgG subclasses include IgG1, IgG2, IgG3 and IgG4.
- Suitable further antigen binding proteins may include antibody binding fragments or antibody mimetics which perform the same function as an antibody. Suitably they are also capable of binding an antigen of interest.
- an antibody binding fragment or mimetic as a functional molecule also produces a VLP which is capable of binding to an antigen.
- Suitable antibody binding fragments may include: Fab, monospecific or bispecific F(ab)2, F(ab’)2, monospecific or bispecific diabody, nanobody, ScFv, ScFv-Fc, F(ab)3.
- Suitable antibody mimetics may include affibodies, affilins, affimers, affitins, alphabodies, anticalins, avimers, DARPins, fynomers, Kunitz domain peptides, monobodies, nanCLAMPs.
- a fluorescent molecule as a functional molecule produces a VLP which is visible.
- this is useful for labelling, especially when combined with a second functional molecule which can bind to an antigen, for example antibodies or binding fragments thereof, antibody mimetics, or aptamers.
- Suitable fluorescent molecules may include: GFP, EBFP, EBFP2, Azurite, GFPuv, T-saphhire, Cerulean, CFP, mCFP, mTurquoise2, CyPet, mKeima-red, tagCFP, AmCyan1, mTFP1, midoriishi cyan, turboGFP, tagGFP, emerald, azami green, ZsGreen1, YFP, tagYFP, EYFP, topaz, venus, mCtrine, YPet, turboYFP, ZsYellow1, Kusabira Orange, mOrange, allophycocyanin, mkO, RFP, turboRFP, tdTomato, tagRFP, dsRed, mStrawberry, turboFP602, asRed2, J-red, R-phycoerythrin, B-phycoerythrin, mCherry, HcRed, Katusha,
- the fluorescent molecule is GFP or any modified form of GFP.
- the or each functional molecule is IL13, IL17, IL33, the receptor binding domain of SARS Cov-2 spike protein, or the C-terminus of the SARS Cov-2 nucleocapsid protein.
- the or each functional molecule is an IgG antibody or binding fragment thereof.
- the antibody or binding fragment thereof is an antibody or binding fragment thereof directed towards SARS-CoV-2.
- the functional molecule is an epitope.
- the functional molecule is IL-31, suitably according to SEQ ID NO:17.
- the functional molecules may comprise one or more epitopes and/or a fluorescent molecule.
- the functional molecules may comprise two epitopes.
- the functional molecules may comprise an epitope and a fluorescent molecule.
- Virus-Like Particle The present invention relates to VLPs, their uses and methods of manufacture thereof.
- the VLP comprises one or more viral capsid protein heterodimers which suitably form a VLP.
- the one or more viral capsid proteins self-assemble into the VLP.
- the VLP comprises one or more binding molecules, each attached to a viral capsid protein heterodimer.
- the VLP comprises one or more functional molecules which are suitably each attached to the binding molecule, and/or chemical modifications present on the binding molecule of the capsid protein heterodimer.
- the VLP of the invention stably displays the functional molecules on its surface.
- the VLP may comprise a plurality of subunits.
- each subunit comprises a complete viral capsid protein heterodimer, one binding molecule and one functional molecule.
- the subunits self-assemble into a VLP.
- the VLP comprises a plurality of viral capsid protein heterodimers, a plurality of binding molecules (each attached to a capsid protein heterodimer and a plurality of functional molecules, each attached to a binding molecule .
- each viral capsid heterodimer comprise a first monomer and a second monomer. The first and/or the second monomer comprising at least one mutation in its amino acid sequences as compared to the wildtype amino acid sequence.
- each capsid protein heterodimer comprises a single binding molecule.
- the binding molecule may be attached to either the first or the second monomer.
- the binding molecule is attached to at least one functional molecule.
- the VLP comprises a plurality of capsid protein heterodimers, each heterodimer comprising a first and a second monomer, a binding molecule attached to either the first or the second monomer, and a functional molecule.
- the amino acid sequence of the first and/or second monomer comprises at least one mutation as compared to the wild-type sequence.
- the mutation in the amino acid sequence promotes association of the monomers to form heterodimers. It will be appreciated to those skilled in the art, that unmodified wild-type Hepatitis B viral capsid proteins favour the formation of homodimers.
- the inventors have found that the introduction of specific mutations in the amino acid sequence of the monomeric subunits of the Hepatitis B viral capsid proteins promotes association of the mutated monomers to form heterodimers.
- the first or second monomer of the heterodimer comprises a binding molecule. Suitable mutations to the amino acid sequences of the monomeric subunits that promote heterodimerisation are discussed elsewhere in the specification.
- the VLP comprises a negative surface charge, suitably a homogenous negative surface charge.
- the binding molecule may be a binding protein or a pair of binding proteins as discussed hereinabove.
- the VLP may have an average diameter of between 20nm to 100nm, suitably between 25nm and 75nm, suitably between 28nm and 50nm, suitably between 30nm and 40nm, suitably around 32nm.
- the size of the VLP may be determined by dynamic light scattering (DLS). For example using a Malvern Zetasizer Ultra.
- DLS dynamic light scattering
- the present invention relates to nucleic acids encoding component protein parts which form the VLP, and vectors comprising said nucleic acids which may be used in host cells to produce VLPs.
- the invention relates to, and makes use of, a first nucleic acid encoding a first viral capsid protein monomer and a second nucleic acid encoding a second viral capsid protein monomer.
- the first or second nucleic acid may encode a capsid protein monomer attached to a binding molecule.
- the first or second nucleic acid may encode a fusion protein comprising the viral capsid protein monomer fused to a binding molecule.
- the viral capsid protein may be a hepatitis B capsid protein.
- the viral capsid protein may be a Woodchuck hepatitis capsid protein.
- the binding molecule may be a binding protein, suitably a first binding protein.
- the first or second nucleic acids encoding a fusion protein comprising the viral capsid protein monomer fused to a binding molecule may be known as the ‘capsid fusion protein’.
- the first or second nucleic acid sequences may encode a viral capsid protein monomer and may suitably comprise a sequence according to SEQ ID NO: 20 or 23.
- the first or second nucleic acid sequences may encode a viral capsid protein monomer and may suitably comprise a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 20 or 23.
- the first or second nucleic acid sequences may encode a viral capsid protein monomer and may consist of a sequence according to SEQ ID NO: 20 or 23.
- any of these monomer sequences may further comprise a sequence encoding a binding molecule, suitably encoding the viral capsid protein monomer fused to a binding molecule, suitably to a first binding protein.
- the first and second nucleic acid sequences may encode a heterodimer comprising a binding molecule, suitably a first binding protein, and may suitably comprise a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 18, 19, 21, or 22.
- a nucleic acid encoding a heterodimer of the invention may comprise a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 18, 19, 21, or 22.
- the invention relates to, and makes use of, a third nucleic acid encoding a functional molecule.
- the third nucleic acid encoding a functional molecule may be optionally attached to a second binding protein as explained hereinabove.
- the third nucleic acid may encode a fusion protein comprising the functional molecule optionally fused to a second binding protein.
- this may be known as the ‘functional fusion protein’.
- the third nucleic acid encodes only a functional molecule.
- the third nucleic acid encodes a functional molecule attached to second binding protein.
- the third nucleic acid encodes a functional molecule fused to a second binding protein.
- the third nucleic acid encodes a functional fusion protein.
- a third nucleic acid sequence encoding a functional molecule is a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO:17.
- the third nucleic acid may comprise a sequence encoding a functional molecule attached to a second binding protein, suitably having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 24-25.
- the third nucleic acid encoding a functional molecule attached to a second binding protein may consist of a sequence according to SEQ ID NO: 24-25.
- the third nucleic acid encodes two epitopes, they may be any two epitopes fused to a second binding protein.
- the third nucleic acid may encode for example a SARS-Cov2 spike protein receptor binding domain and a C-terminal fragment of the nucleocapsid protein.
- the third nucleic acid encodes an epitope and a fluorescent molecule fused to a second binding protein, they may be any epitope and any fluorescent molecule.
- the third nucleic acid may encode for example a SARS-Cov2 spike protein receptor binding domain and eGFP.
- the invention may make use of the first and second nucleic acids.
- the invention may make use of the first, second and third nucleic acids.
- the first, second, and third nucleic acids described herein may be provided as one contiguous nucleic acid sequence, or may be provided as a plurality of separate nucleic acid sequences. References to the first, second, and third, nucleic acids include embodiments where plurality of nucleic acid sequences may be used to encode the same proteins as the first, second, third, nucleic acids.
- the first and second nucleic acids may comprise a sequence according to SEQ ID NO: 18, 19, 21 or 22.
- the third nucleic acid may comprise a sequence according to SEQ ID NO: 24 or 25.
- the first and second nucleic acids may comprise a contiguous sequence according to SEQ ID NO:18 or 21, and the third nucleic acid may comprise a sequence according to SEQ ID NO: 24.
- the first binding protein is lm7 and the second binding protein is Col-E7.
- the first and second nucleic acids may comprise a contiguous sequence according to SEQ ID NO:19 or 22, and the third nucleic acid may comprise a sequence according to SEQ ID NO: 25.
- the first binding protein is Barstar and the second binding protein is Barnase.
- the nucleic acids may comprise one or more expression elements to aid in expression of the proteins encoded thereon. Suitable expression elements include promoters, operators, enhancers, activators, repressors, 5’UTRs, 3’UTRs, introns, IRES, etc.
- each of the nucleic acids comprises one or more expression elements which ensure equal expression of the proteins encoded thereon.
- each of the nucleic acids comprises a promoter which ensures equal expression of the proteins encoded therein.
- the promoter may comprise one or more modifications which adapt the level of expression therefrom.
- the promoter may comprise one or more mutations.
- each nucleic acid described herein is operably linked to a promoter.
- Suitable promoters may be selected from: CMV-IE, EF1a, SV40, PGK1, CAG, human beta actin, T7, TetR/TetA, T7lac, SP6, LP1, TTR, CK8, Synapsin, Glial fibrillary acidic protein (GFAP), CaMKII, TBG, and albumin promoter.
- GFAP Glial fibrillary acidic protein
- each nucleic acid may be linked to the same promoter or a different promoter.
- each nucleic acid may be linked to the same promoter.
- each nucleic acid may be expressed at the same time.
- each nucleic acid may be linked to a T7 promoter, optionally with one or more modifications to ensure equal expression levels of the proteins encoded by the nucleic acids.
- each nucleic acid may be linked to a different promoter.
- each nucleic acid may be expressed at different times.
- the or each nucleic acid may be independently expressed.
- expression of each nucleic acid may be induced at different times.
- the or each promoter may be an inducible promoter. Suitably which may be induced by contacting the promoter with a suitable inducer, at a concentration effective to induce expression therefrom.
- the first or second nucleic acid sequence may be linked to a first promoter and the first or second nucleic acid may be linked to a second promoter.
- the first or second promoter may be a T7 promoter.
- the T7 promoter may comprise the sequence agcataat (SEQ ID NO:8).
- the first or second promoter may be a TetR/TetA promoter.
- the first or second nucleic acid expresses the viral capsid protein monomers described herein at the same or equal levels.
- the heterodimeric capsid protein is expressed at a 1:1 level compared to the functional fusion protein, or the functional molecule.
- nucleic acids may be comprised on one or more vectors.
- first, second and/or third nucleic acids may be comprised on one vector.
- first, second, and/or third nucleic acids may be comprised on multiple vectors.
- the first nucleic acid may be comprised on one vector and the second nucleic acid may be comprised on another vector.
- nucleic acids may be comprised on one or more vectors as constructs. Suitably as expression constructs.
- an expression construct in accordance with the invention may comprise a first nucleic acid sequence encoding a first viral capsid protein monomer; and a second nucleic acid sequence encoding a second viral capsid protein monomer attached to a binding molecule.
- the expression construct comprises a first promoter operably linked to the first and second nucleic acids.
- the expression construct comprises an IRES or ribosome binding site located between the first nucleic acid and the second nucleic acid.
- the first and second nucleic acids are expressed as separate proteins, suitably as separate monomers.
- the expression construct comprises a sequence encoding an internal cleavage peptide, suitably a self-cleaving peptide located between the first nucleic acid and the second nucleic acid.
- an internal cleavage peptide suitably a self-cleaving peptide located between the first nucleic acid and the second nucleic acid.
- the first nucleic acid is comprised on a first vector, suitably for example the first vector may comprise a first nucleic acid sequence encoding a woodchuck hepatitis monomer, in such an embodiment, the vector may has a sequence according to SEQ ID NO: 29.
- the first vector may comprise a first nucleic acid encoding a hepatitis B monomer, suitably comprising a sequence according to SEQ ID NO: 20.
- the second nucleic acid is comprised on a second vector, suitably for example the second vector may comprise a second nucleic acid sequence encoding a woodchuck hepatitis monomer attached to a binding molecule, suitably a first binding protein, in such an embodiment, the vector has a sequence according to SEQ ID NO:28.
- the second vector may comprise a second nucleic acid encoding a hepatitis B monomer attached to a binding molecule, suitably a first binding protein.
- the first and second nucleic acids may be comprised on the same vector.
- the vector has a sequence according to SEQ ID NO: 26.
- the third nucleic acid may be comprises on a second vector.
- the second vector may comprise a sequence according to SEQ ID NO: 24 or 25.
- the first, second and the third nucleic acids may be comprised on the same vector, suitably therefore one vector may encode the VLP of the invention.
- the vector may comprise a sequence encoding both woodchuck hepatitis monomers, i.e.
- the vector has a sequence according to SEQ ID NO: 27.
- the vector may comprise a sequence encoding a both hepatitis B monomers, i.e. a hepatitis B heterodimer, a first and second binding protein and a functional molecule, suitably comprising sequences according to SEQ ID NOs: 18 or 19, and 24 or 25.
- the first nucleic acid and the third nucleic acid are comprised on the same vector..
- the vector may comprise a nucleic acid sequence according to SEQ ID NO: 20 or 23, and a nucleic acid sequence according to SEQ ID NO: 24 or 25.
- the second nucleic acid and the third nucleic acid are comprised on the same vector.
- a single vector comprises SEQ ID NO:18 and SEQ ID NO: 24.
- a first vector comprises SEQ ID NO:18 and a second vector comprises SEQ ID NO: 24.
- a single vector copmrpises SEQ ID NO:19 and SEQ ID NO:25.
- a first vector comprises SEQ ID NO:19 and a second vector comprises SEQ ID NO: 25.
- a single vector comprises SEQ ID NO:21 and SEQ ID NO: 24.
- a first vector comprises SEQ ID NO:21 and a second vector comprises SEQ ID NO: 24.
- a single vector comprises SEQ ID NO:22 and SEQ ID NO: 25.
- a first vector comprises SEQ ID NO:22 and a second vector comprises SEQ ID NO: 25.
- a host cell may comprise any of the above vectors in any combination.
- the first, second and/or third nucleic acids may be comprised on one vector or on a first and second vector, or on a first, second and third vector respectively as described above.
- the first and second nucleic acids are comprised on one vector.
- the or each vector is present in the single host cell.
- the first and second nucleic acids are comprised on a single vector of SEQ ID NO: 26.
- the first and third nucleic acids are comprised on a single vector.
- the single host cell comprises a single vector of SEQ ID NO:26 or 27.
- the first, second and/or third nucleic acids are comprised on two different vectors.
- the first nucleic acid may be comprised on a first vector selected from SEQ ID NO:29.
- the second nucleic acid may be comprised on a second vector selected from SEQ ID NO:28.
- any workable combination of first and second vectors may be used in the single host cell.
- the first vector may comprise SEQ ID NO:28 and may be combined with the second vector of SEQ ID NO:29.
- the first and second nucleic acids may be comprised on a first vector, or may be comprised on a first and second vector respectively.
- the third nucleic acid may be comprised on a third vector.
- the first vector and optionally the second vector is present in the first host cell and the third vector is present in a second host cell.
- the first vector is of SEQ ID NO:29
- the second vector is of SEQ ID NO: 28.
- any workable combination of first and second vectors may be used in the host cells.
- the first host cell may comprise a first vector of SEQ ID NO:28 and may be combined with a second vector of SEQ ID NO: 29.
- the first host cell may comprise a first vector of SEQ ID NO: 29 and may be combined with a second vector of any of SEQ ID NO: 28.
- the one or more vectors may further comprise the third nucleic acid. Any suitable vector may be used for the chosen host cell/s. Suitable host cells are discussed below.
- the vector is selected from: a plasmid, a cosmid, a phage, a virus, an artificial chromosome.
- the or each vector is a plasmid.
- Suitable plasmid vectors for a host E.coli cell may include, for example: pALTER-Ex1, pALTER-Ex2, pBAD/His, pBAD/Myc-His, pBAD/gIII, pCal-n, pCal-n-EK, Cal-c, pCal-Kc, pcDNA 2.1, pDUAL, pET-3a-c, pET-9a-d, pET-11a-d, pET-12a-c, pET-14b, pET-15b, pET- 16b, pET-17b, pET-19b, pET-20b(+), pET-21a-d(+), pET-22b(+),pET-23a-d(+), pET-24a-d(+), pET-25b(+), pET-26b(+), pET-27b(+), pET-28a-c(+), pET-29a-
- the vector used is pET-Duet.
- Suitable plasmid vectors for a host mammalian cell may include: the pSV and the pCMV series of vectors.
- the vector used is pcDNA5D.
- host mammalian cells are HEK293 cells or CHO cells or derivatives thereof.
- the vector may comprise a variety of other functional nucleic acid sequences, such as one or more selectable markers, one or more origins of replication, multiple cloning sites and the like.
- Process of Producing a VLP The present invention further relates to processes for the production of VLPs.
- a single cell process of producing a VLP there is provided a single cell process of producing a VLP.
- a multiple cell process of producing a VLP Suitably a dual cell process.
- the processes may further comprise transfecting the one or more vectors comprising the nucleic acids into the or each host cell. Suitably prior to culturing the or each host cell.
- transfection may take place by any suitable method such as electroporation, microinjection, particle delivery, chemical mediated endocytosis, calcium phosphate co- precipitation, or liposome mediated delivery.
- culturing the host cells under conditions to express the proteins comprises culturing the host cells under optimum growth conditions.
- the optimum growth conditions will vary depending on the host cell being used.
- the host cell may be selected from any bacterium, yeast, insect cell or human cell.
- the host cell is a bacterial host cell.
- the host cell is selected from E.coli, B.subtilis, Caulobacter crescentus, Rodhobacter sphaeroides, Pseudoalteromonas haloplanktis, Shewanella sp.
- strain Ac10 Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Halomonas elongate, Chromohalobacter salexigens, Streptomyces lividans, Streptomyces griseus, Nocardia lactamdurans, Mycobacterium smegmatis, Corynebacterium glutamicum, Corynebacterium ammoniagenes, Brevibacterium lactofermentum, Bacillus brevis, Bacillus megaterium, Bacillus licheniformis, Bacillus amyloliquefacien, Lactococcus lactis, Lactobacillus plantarum, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus gasseri.
- the host cell is E.coli.
- the E.coli strain is selected from BL21, lemo21, NiCo21, NEB Express, SHuffle, T7 Express, BLR, HMS174, Tuner, Origami2, Rosetta2, m15.
- the E.coli strain is BL21(DE3) where the additional genes regulating disulfide formation, dsbC and erv1P, are integrated genomically.
- the genomic integration is within the recAX locus.
- the host cell is a human cell, such as a HEK293T cell.
- optimum growth conditions comprise culturing at a temperature of 15-25°C.
- Suitably optimum growth conditions comprise culturing in a medium compatible with bioprocess applications for medicines intended for use in humans, such as chemically defined medium.
- Suitably optimum growth conditions comprise culturing in an aerated culture medium.
- the host cells are cultured to a high density.
- Suitably culturing the host cells under conditions to express the proteins may also comprise inducing the host cells to express the proteins.
- Suitably inducing the host cells may comprise addition of an inducer into the culture medium, or the creation of certain inducive conditions within the culture medium such as acid/alkali pH, heat shock, hypoxia or the like.
- the inducer or inducive condition stimulates transcription of the nucleic acids.
- an inducer or inducive condition does so by stimulating an inducible expression control sequence within the nucleic acids.
- the inducible expression control sequence may be an inducible promoter.
- Suitable inducers include isopropyl- ⁇ -d-thiogalactoside (IPTG) for lactose driven promoters or tetracycline for tetracycline – regulated promoters.
- IPTG isopropyl- ⁇ -d-thiogalactoside
- the host cells are induced to express the proteins once the culture has reached the optimal density described above.
- the host cells are induced to express the proteins during logarithmic growth.
- the concentration of proteins may be varied by adjusting the concentration of an inducer or altering the inducive conditions to which the host cells are exposed.
- the culturing step takes between 4 – 24 hours.
- the host cells are induced to express the proteins after 2-6h of culturing or when an OD of 6-8 has been achieved.
- a cell culture comprising one or more host cells of the ninth or tenth aspects and a culture medium.
- the process may not be conducted within one or more cells, and may be conducted in a cell-free system.
- step (a) and/or (b) and/or (c) is conducted within a host cell, to ensure proper production of the VLP.
- steps relating to mixing and forming functionalised viral capsid protein heterodimers may occur outside of a host cell, in a cell free system.
- the processes may further comprise a step of recovering the VLPs.
- Suitably recovering the VLPs may comprise disrupting the host cells.
- the host cells may secrete the VLPs into the culture solution.
- Suitably disrupting the host cells may be carried out by any suitable method such as homogenisation, sonication, or freeze-thaw. Recovery of the VLPs may take place by any suitable method such as filtration, pull-down, centrifugation, or chromatography.
- the recovery and purification of VLPs takes place by chromatography.
- chromatography e.g. involving a sequence of steps including mixed mode (hydrophobic interaction and size exclusion) chromatography, anion exchange chromatography, and ultrafiltration.
- anion exchange chromatography e.g. when anion exchange chromatography is used to recover the VLPs, the VLP may comprise chemical modification, suitably in such an embodiment the first binding protein of the VLP is modified with DEAE.
- the DEAE molecules can bind to the chromatography column.
- the recovery and purification of VLPs takes place by affinity chromatography. For example, immobilized metal affinity chromatography (IMAC).
- IMAC immobilized metal affinity chromatography
- the VLP when affinity chromatography is used to recover the VLPs, the VLP may comprise an affinity tag which is capable of binding to affinity agents used for the affinity chromatography.
- the VLP when immobilized metal affinity chromatography (IMAC) is used the VLP may comprise a metal binding affinity tag, suitably in such an embodiment the first binding protein of the VLP, second binding protein of the VLP and/or a functional protein bound the first and/or second binding protein may comprise an affinity tag.
- the functional protein comprises the affinity tag.
- the functional protein is fused to the affinity tag. In such examples, binding of the functional protein to the first and/or second binding proteins can be confirmed by affinity chromatography as the first and/or second binding proteins will be recovered by virtue of binding to the functional protein.
- affinity tag refers to a short amino acid sequence or peptide enabling a specific interaction with a protein or a ligand, for example in the case of IMAC a metal ion.
- affinity tags include biotin, desthiobiotin, histidine, polyhistidine, myc, hemagglutinin (HA), FLAG, fluorescence tag, tandem affinity purification (TAP) tags, FLAG, glutathione S transferase (GST) or derivatives thereof.
- a suitably affinity tag may comprise or consist of a sequence according to SEQ ID NO: 37.
- step (d) comprises recovering the proteins.
- Suitably recovering the proteins may be performed by similar techniques.
- Suitably recovering the proteins may comprise disrupting the host cells as above.
- the host cells may secrete the proteins into the culture solution.
- the VLPs form by self-assembly, suitably automatic self-assembly.
- the step of culturing the host cell further comprises culturing under conditions such that the proteins expressed from the first and second nucleic acids, or from any further nucleic acids, bind to each other.
- the binding molecule may be chemically modified.
- the method may comprise a step of recovering the proteins, and subsequently chemically modifying the binding molecule.
- these steps take place after step (b) but prior to step (c).
- the host cell may be cultured under conditions so as to express proteins from the first, second, and third nucleic acids.
- the third nucleic acid encodes only a functional molecule.
- the binding molecule is chemically modified.
- step (c) of the ninth aspect comprises a binding molecule binding to a functional molecule, suitably via a chemical modification.
- the binding molecule may be chemically modified.
- the conditions for culturing the second host cell are such that the binding protein is chemically modified.
- such chemical modification of the binding protein may take place post-translationally.
- the method may comprise a step of chemically modifying the binding protein.
- this step takes place after step (d) but prior to step (e).
- the host cells may be cultured under conditions so as to express proteins from the first, second, and third nucleic acids.
- step (e) comprises each binding protein binding to each functional molecule. Suitably via a chemical modification.
- step (e) further comprises mixing under conditions such that the proteins bind to each other.
- step (e) comprises mixing host cell supernatants or host cell lysates.
- the mixing is such that the ratio of first host cell supernatant or lysate to further host cell(s) supernatant or lysate is about 1:1.
- the mixing step takes place at room temperature, suitably around 18-22°C.
- mixing takes place for between 15 minutes to 2 hours, suitably between 20 minutes and 1 hour, suitably between 25 minutes and 45 minutes, suitably for about 30 minutes.
- a functional molecule may be mixed with the VLPs once formed.
- Immunogenic Composition The present invention further relates to an immunogenic composition comprising the VLP of the invention.
- the immunogenic composition may be a vaccine.
- the immunogenic composition may further comprise one or more adjuvants. Suitable adjuvants include: mineral salts, emulsions, microorganism derived adjuvants, carbohydrates, cytokines, particulates or tensoactive compounds.
- Suitable mineral salts include: adjumer, alhydrogel, aluminium hydroxide, aluminum phosphate, aluminium potassium sulphate, amorphous aluminium hydroxyphosphate sulfate (AAHSA), aluminium salts in general, calcium phosphate, Rehydragel HPA, or Rehydragel LV.
- Suitable emulsions include: Freund’s complete, Freund’s incomplete, montanide ISA720, montanide ISA 51, montanide incomplete, Ribi, TiterMax, AF03, AS03, MF59, specol, SPT, or squalene.
- Suitable microorganism derived include: cholera toxin or mutants thereof, cholera toxin subunit B, CpG DNA, LTR 192G, MPL, Bordella pertussis components, E.coli heat labile toxin, CTA1-DD gene fusion protein, Etx B subunit, lipopolysaccharides, flagellin, Corynebacterium derived P40, LTK72, MPL-SE, or Ty particles.
- the immunogenic composition may further comprise one or more pharmaceutically acceptable excipients.
- Pharmaceutically acceptable excipients may include stabilizers, fillers, preservatives, diluents, nutrients, antioxidants, antimicrobial agents, buffers, solvents, inactivating agents, purifiers, emulsifiers, surfactants and the like.
- Suitable excipients may be selected from, for example: monosodium glutamate, sucrose, D- mannose, D-fructose, dextrose, human serum albumin, potassium phosphate, plasdone C, anhydrous lactose, microcrystalline cellulose, polacrilin potassium, magnesium stearate, cellulose acetate phthalate, alcohol, acetone, castor oil, sodium chloride, benzethonium chloride, formaldehyde, ascorbic acid, hydrolyzed casein, sodium bicarbonate, sodium carbonate, glutaraldehyde, 2-phenoxyethanol, polysorbate 80 (Tween 80), neomycin, polymyxin B sulfate, bovine serum albumin, neomycin sulfate, polymyxin B, yeast protein, streptomycin sulfate, ammonium thiocyanate, rice protein, lactose, formalin, amino acid supplement, phosphate-buffered s
- the excipients may be arginine, glutamine and trehalose.
- the immunogenic composition is formulated as a fluid, suitably as a liquid.
- the excipients and additives are selected such that the formulation is a liquid.
- an injectable liquid Suitably an injectable liquid.
- Immunogenicity means that a VLP or an immunogenic composition comprising the VLP of the invention is capable of eliciting an immune response in a subject.
- the VLP or an immunogenic composition comprising the VLP of the invention may be capable of generating an antibody response in a subject and/or a non-antibody based immune response in a subject.
- the immunogenic activity of the VLP or an immunogenic composition comprising the VLP of the invention may be determined by the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention i.e. antibody production.
- the amount of antibodies which bind to the antigen of the VLP Suitably the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention, i.e. antibody production, is sustained and consistent over a period of time.
- the immunogenic activity of the VLP or an immunogenic composition comprising the VLP of the invention may be determined by the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention over a given period of time, i.e. antibody production over a given period of time. Suitable periods of time are outlined below.
- amount of antibodies it is meant the titre or concentration thereof.
- concentration of antibodies in sera is outlined below.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 5 days, at least 10 days, at least 15 days, at least 20 days, at least 25 days, at least 30 days, at least 35 days, at least 40 days, at least 45 days, at least 50 days, at least 55 days, at least 60 days, at least 65 days, at least 70 days, at least 75 days, at least 80 days, at least 85 days, at least 90 days, at least 95 days, or at least 100 days or more in a subject.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 110 days, at least 120 days, at least 130 days, at least 140 days, at least 150 days, at least 160 days, at least 170 days, at least 180 days, at least 190 days, at least 200 days, at least 210 days, at least 220 days, at least 230 days, at least 240 days, at least 250 days, at least 260 days, at least 270 days, at least 280 days, at least 290 days, at least 300 days or more in subject.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 1 week, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, at least 8 weeks, at least 9 weeks, at least 10 weeks, at least 12 weeks, at least 14 weeks, at least 16 weeks, at least 18 weeks, at least 20 weeks days or more in a subject.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least at least 30 weeks, at least 40 weeks, at least 50 weeks, at least 60 weeks, at least 70 weeks, at least 80 weeks, at least 90 weeks, at least 100 weeks or more in a subject.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least for at least 1 year, at least 2 years at least 3 years, at least 4 years, at least 5 years, at least 6 years, at least 7 years, at least 8 years, at least 9 years or at least 10 years or more in a subject.
- a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 10 years, for at least 15 years, for at least 20 years, for at least 25 years, for at least 30 years, for at least 35 years, for at least 40 years, for at least 45 years, for at least 50 years or more in a subject.
- immunogenic activity may refer to immunogenic antibody production.
- antibody production at a concentration which is immunogenic Suitably antibody production at a concentration in sera which is immunogenic. Suitably at a concentration of between 1-20 ⁇ g/ml, 1-18 ⁇ g/ml, 1-16 ⁇ g/ml, 1-14 ⁇ g/ml, 1-12 ⁇ g/ml, 2-18 ⁇ g/ml , 2-16 ⁇ g/ml , 2- 14 ⁇ g/ml, 2-12 ⁇ g/ml, or 2 – 10 ⁇ g/ml in sera for example.
- the skilled reader, on considering the information set out in the Examples, will recognise that the VLPs or the immunogenic compositions of the invention exhibit immunogenic activity that makes them well suited to therapeutic use in the manner described in this specification.
- the present invention further relates to use of the VLP or the immunogenic composition comprising the VLP for use in therapy, or in the prevention and/or treatment of a disease.
- the present invention further provides a method of treating a subject having a disease, comprising administering an effective amount of a VLP according to the first aspect or an immunogenic composition according to the twelfth aspect, to the subject.
- the present invention further provides a method of manufacturing a medicament for the treatment of a disease, the medicament comprising an effective amount of a VLP according to the first aspect or an immunogenic composition according to the twelfth aspect.
- the disease may be selected from: an infectious disease, cancer, an autoimmune disease, a cardiovascular disease, a metabolic disease, an inflammatory disease, a neurological disease, or rheumatological degenerative disease, or an addiction.
- infectious diseases include: viral, bacterial, fungal, or protozoan infections.
- Suitable viral infections include: COVID-19, SARS, MERS, influenza, common cold, respiratory syncytial virus infection, adenovirus infection, parainfluenza virus infection, norovirus infection, rotavirus infection, astrovirus infection, measles, mumps, rubella, chickenpox, shingles, roseola, smallpox, fifth disease, chikungunya virus infection, HPV infection, Hepatitis A, B, C, D or E, warts, herpes, molluscum contagiosum, ebola, lassa fever, dengue fever, yellow fever, Marburg hemorrhagic fever, Crimean-Congo hemorrhagic fever, polio, viral meningitis, viral encephalitis, rabies, zika virus infection, west nile virus infection, HIV/AIDS, Hantavirus infection, HPS.
- Suitable bacterial infections include: urinary tract infections, cystitis, impetigo, bacterial food poisoning, campylobacteriosis, C.difficile infection, bacterial cellulitis, MRSA, CRPA, VRSA, sepsis, erysipelas, necrotising fasciitis, bacterial folliculitis, gonorrhoea, chlamydia, syphilis, mycoplasma genitalium, bacterila vaginosis, pelvic inflammatory disease, tuberculosis, whooping cough, Haemophilus influenzae disease, pneumonia, bacterial meningitis, lyme disease, cholera, botulism, tetanus, anthrax, Cryptosporidiosis, Diphtheria, E.
- Suitable cancers include: breast cancer, liver cancer, lung cancer, pancreatic cancer, brain cancer, prostate cancer, bowel cancer, rectal cancer, bone cancer, leukemia, bladder cancer, cervical cancer, endometrial cancer, eye cancer, retinoblastoma, ewing sarcoma, gallbladder cancer, head and neck cancer, kaposi’s sarcoma, kidney cancer, laryngeal cancer, mesothelioma, myeloma, lymphoma, ovarian cancer, oesophageal cancer, mouth cancer, nasopharyngeal cancer, nose and sinus cancer, skin cancer, sarcoma, stomach cancer, testicular cancer, thyroid cancer, uterine cancer, vaginal cancer, penile cancer, vulval cancer.
- Suitable autoimmune diseases include: asthma, psoriasis, MS, rheumatoid arthritis, reactive arthritis, lupus, inflammatory bowel syndrome/disease, type 1 diabetes, Guillain-Barre syndrome, demyelinating polyneuropathy, Graves’ disease, Hashimo’s thyroiditis, Myasthenia gravis, vasculitis, pernicious anemia, ulcerative colitis, antiphospholipid syndrome, Kawasaki disease, alopecia, vitiligo, scleroderma, Sjogren’s syndrome, crohn’s disease, coeliac disease, Addison’s disease, narcolepsy.
- Suitable cardiovascular diseases include: angina, heart attack, heart failure, coronary heart disease, stroke, transient ischemic attack, peripheral arterial disease, aortic disease, atherosclerosis, hypertension, cerebrovascular disease, renal artery stenosis, aneurysm, cardiomyopathy, pulmonary heart disease, arrythmia, dysrhythmia, endocarditis, cardiomegaly, myocarditis, valvular heart disease, congenital heart disease, rheumatic heart disease.
- Suitable metabolic diseases include: hypercholesterolemia, hypertriglyceridemia, diabetes, hyperlipidemia, hyperbilirubinemia, hypercalcemia.
- Suitable inflammatory diseases may include any of the above infections or autoimmune diseases.
- Suitable inflammatory diseases may include include: arthritis, asthma, tuberculosis, periodontis, chronic ulcers, sinusitis, hepatitis, glomerulonephritis, inflammatory bowel syndrome/disease, preperfusion injury, transplant rejection, sickle cell disease, allergies, cardiovascular disease, psoriasis, cytokine-mediated pruritus, COPD, diabetes, bronchitis, Crohn’s disease, atherosclerosis, dermatitis, arteritis, lupus.
- Suitable neurological diseases include: Alzheimer’s, ataxia, ALS, Bells palsy, brain tumours, aneurysms, epilepsy, Guillain-Barre syndrome, hydrocephalus, Meningitis, MS, muscular dystrophy, neurocutaneous syndromes, Parkinson’s, migraines, encephalitis, myasthenia gravis, dementia, seizures, spinal muscular atrophy, motor neuron disease, scoliosis, neuropathy, chronic fatigue syndrome, cerebal palsy.
- Suitable rheumatological degenerative diseases include: rheumatoid arthritis, psoriasis arthritis, spondylarthropathy, osteoarthritis, lupus, systemic sclerosis.
- Suitable addictions include: alcohol, nicotine, caffeine, amphetamines, opioids, sedatives, hypnotics, anxiolytics, cocaine, cannabinoids, hallucinogenics, phenycylcidine.
- the VLP or the immunogenic composition are for use in the prevention or treatment of COVID-19.
- the functional molecule may be a SARS-CoV-2 antigen, suitably a SARS-CoV-2 spike protein.
- the functional molecule may be an inflammatory cytokine, suitably IL-33.
- the VLP or the immunogenic composition are for use in the prevention or treatment of psoriasis or arthritis.
- the functional molecule may be an inflammatory cytokine, suitably IL17.
- the VLP or the immunogenic composition are for use in the prevention or treatment of asthma or atopic dermatitis.
- the functional molecule may be an inflammatory cytokine, suitably IL13 or IL33.
- an effective amount for administration to the subject is an effective amount to prevent or treat the disease. Suitable effective amounts can be readily determined by the skilled medical practitioner.
- a dose comprises an effective amount.
- a suitable dose of the VLP may comprise: 10- 100 micrograms, suitably 10-80 micrograms, suitably 20-60 micrograms, suitably 20-40 micrograms.
- the VLP or immunogenic composition may be administered by any route.
- the VLP or immunogenic composition may be administered enterally or parenterally.
- the VLP or immunogenic composition may be administered orally, rectally, vaginally, sublingually, by injection, transdermally, or by inhalation.
- the VLP or immunogenic composition may be administered by injection, suitably by subcutaneous injection.
- the VLP or immunogenic composition may be administered by inhalation, suitably by nasal inhalation.
- Subject The present invention relates to the prevention and/or treatment of a disease in a subject by using the VLP or immunogenic composition thereof.
- the subject may be human or animal.
- the prevention and/or treatment of disease may be in the veterinary field.
- the subject may be adult or child.
- the subject may be male or female.
- the subject is an adult human.
- the subject may have been diagnosed with a disease.
- the subject may be suspected of having a disease.
- the subject may display one or more symptoms of a disease.
- the subject may be at risk of contracting a disease.
- the subject may have one or more risk factors associated with a disease. Suitable risk factors may include: weight, smoking, alcohol or substance addiction, age, sex, race, inheritance for example. Suitable risk factors may further include a genetic predisposition to a disease, for example by expression of particular gene, or by the presence of a particular mutation in a gene.
- subjects that have been diagnosed with a disease or who have one or more symptoms of a disease are provided with the VLP or immunogenic composition for treatment of the disease.
- subjects that are at risk of developing a disease are provided with the VLP or immunogenic composition for prevention of the disease.
- Other Uses The present invention further relates to use of the VLP in research and in the diagnosis of diseases.
- the VLP of the first aspect may be used in research.
- the VLP may be used as a detection tool.
- the VLP may be used as a label.
- the binding molecule of the VLP is attached to a functional molecule which is a fluorescent molecule.
- the binding molecule may be attached to a functional molecule which is an antigen binding molecule such as an antibody.
- the functional molecule may further comprise a fluorescent molecule.
- the antigen binding molecule may specifically bind a cell surface receptor. Suitable cell surface receptors are discussed elsewhere herein, however suitably the cell surface receptor is specific to a cell type.
- the VLP is capable of binding to, and labelling, specific cell types.
- the VLP may be used as a carrier.
- the VLP may comprise a cargo.
- the cargo may be contained within the VLP, suitably within the VLP shell.
- the cargo may be a therapeutic molecule.
- the VLP may not in itself be a therapeutic, but may be a carrier of a therapeutic molecule.
- Suitable therapeutic molecules may include oligonucleotides, small molecules, peptides, for example.
- the therapeutic molecule may comprise an antisense oligonucleotide which may act to repress expression of a particular nucleic acid.
- the therapeutic molecule may comprise a cytotoxic chemical which may act to trigger cell death.
- the VLP is targeted to a particular site, for example to a particular cell or cell type where the therapeutic molecule is required. Suitably this is achieved by the binding molecule of the VLP being attached to a functional molecule which is an antigen binding molecule such as an antibody.
- the antigen binding molecule may specifically bind to a cell surface receptor.
- a cell surface receptor specific to the target cell Suitably binding to the cell surface receptor may stimulate uptake of the VLP into the cell.
- the VLP is capable of binding to specific cell types and delivering cargo thereto.
- a carrier VLP comprising the features of the first aspect, and in addition a cargo, wherein the cargo is contained within the VLP shell.
- the cargo is a therapeutic molecule.
- the VLP of the first aspect may also be used in diagnosis.
- the binding molecule of the VLP is attached to a functional molecule which is an antigen binding molecule, such as an antibody.
- the antibody specifically binds an antigen derived from a disease causing agent as discussed hereinabove.
- a disease causing agent as discussed hereinabove.
- an infectious agent such as a virus, bacterium, fungus, protozoan, or archaeon.
- the VLP is capable of binding to a disease causing agent and allowing detection thereof.
- the VLP of the invention may be used in a method of diagnosing a disease in accordance with the sixteenth aspect of the present invention.
- a method of diagnosing a disease in a subject comprising: (a) Providing a virus like particle according to the first aspect of the invention, wherein the binding molecule is attached to a functional molecule and wherein the functional molecule is an antibody directed towards an antigen derived from a disease causing agent; (b) Mixing the virus like particle with a suitable sample from the subject; (c) Detecting whether the virus like particle precipitates; (d) Diagnosing the presence of a disease if the VLP precipitates.
- the functional molecule is an antigen binding molecule
- the VLP further comprises a second binding protein.
- the second binding protein is described elsewhere herein.
- the antigen binding protein is attached to the second binding protein.
- the second binding protein binds to the first binding protein which is attached to a monomer of the heterodimeric capsid protein as described hereinabove.
- detection is via precipitation of the VLP bound to the disease causing agent.
- detecting precipitation may comprise visual confirmation, or testing with a spectrometer.
- the VLP may also comprise a fluorescent molecule.
- a fluorescent molecule may be attached to a chemical modification of the binding molecule.
- the detection step may comprise detecting the presence of fluorescence in the sample.
- the detection step may comprise detecting the presence of fluorescent precipitation in the sample.
- a suitable sample from a subject may be a blood sample, saliva sample, serum sample, sputum sample, sperm sample, mucus sample, CSF sample.
- the sample is a fluid sample.
- the method of diagnosis may further comprise a step of incubating the sample with the VLP.
- a period of time sufficient to allow the VLP to bind to any antigens in the sample and precipitate.
- at least 1 minute suitably up to 30 minutes, suitably up to 25 minutes, suitably up to 20 minutes, suitably up to 15minutes.
- Suitable diseases which may be detected by the method may be any of those listed herein above.
- the method of diagnosis may further comprise a step of treatment of the subject if a disease is diagnosed.
- treatment of the subject may comprise administering an effective amount of any known treatment for the relevant disease to the subject.
- Alphafold2 Advanced available at https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/beta/AlphaFold2_a dvanced.ipynb
- This algorithm allows the accurate modelling of structures of protein pairs where the quality of predicted folding can be the pTM score rather than the pLDTT score which returns modelled protein structures ranked by the protein-protein interaction.
- the stoichiometry of interacting subunits is specified as x:y:z where ‘x’ is the predefined number of molecules of protein 1, ‘y’ the number of protein 2 molecules, and so forth.
- the so-called asymmetric unit which gives rise to the building blocks automatically assembling into the entire capsid, contains two dimers of one single protein, which would be modelled by simply specifying a number of ‘4’ for the capsid protein. This returns a structure of two dimers.
- two different proteins are derived from the original wild type capsid protein: one protein which contains a binding protein (such as Im7 or Barstar), and a second protein which does not.
- a binding protein such as Im7 or Barstar
- Alphafold2 returned two dimers which were in turn positioned exactly as in the published Hepatitis virus asymmetrical unit structure (PDB: 6EDJ). Furthermore, the structures in each case as such were classified as ‘high predictive accuracy’ (Alphafold2 colouring index), with the predicted exception of unfolded domains (glycine serine linkers, and histidine-containing tags). This result confirms the operational accuracy of this modelling approach.
- Alphafold2 was then interrogated and returned models consisting of either (i) dimers that were either composed of 2x protein 1 and 2x protein 2 (homodimers only), (ii) dimers that exclusively consisted of protein 1 and protein 2 (heterodimers only), or (iii) a mixture of both.
- This in silico approach allowed the iterative testing and identification of mutational combinations triggering the exclusive assembly of heterodimer structures, as detailed in the examples below.
- both pDLL and pTM scores for the structures were high, confirming a high predictive capacity, as shown below in the Table.
- Alphafold2 structure prediction of dimer formation generated by the optimized sequences shown in table 2.
- wHv has distinct advantages: i) It forms capsids at cold temperatures (> 60% capsid formation occurs at temperatures as low as 4°C, Kukreja2014). This will increase the yield of VLPs in low temperature bacterial fermentation which is preferable for production of complex proteins. (ii) The temperature dependence on capsid formation is much weaker: Since the virus is adapted to groundhogs which hibernate and whose body temperature wildly fluctuates between 6.5°C and 37°C, VLPs from wHv are much more stable during temperature fluctuation. This could be a significant advantage both for manufacture and storage. (iii) Use of wHv abrogates any cross-reactivity to human Hepatitis B.
- both the wild type wHv protein and the wHv protein carrying an integrated Im7 protein are produced by bacteria, they can form dimers in two ways: either the two unmodified (called “wild type”) proteins together (called a “homodimer”) or one unmodified and one Im7-carrying protein (called “heterodimer”).
- wild type proteins together
- heterodimer one unmodified and one Im7-carrying protein
- the interaction itself is affected by the interaction of electrically charged amino acids within the protein. This is shown in Figure 3B (right).
- the negatively charged amino acid called E64 from one of the monomers interacts with a positively charged amino acid called K96 from the other monomer.
- Figure 3 shows the approach to creating a mutant wHv capsid where one dimer displays an integrated Im7 protein (marked as ‘Im7-insert’) toward the VLP surface.
- A an expression vector is established which allows simultaneous expression of a wild type wHv protein and a wHv protein carrying the integrated Im7 protein, driven by a single promoter, through an internal ribosome binding site (RBS). This set-up can result in either formation of homodimers or heterodimers.
- B The central ‘tip’ of a wHv homodimer, of amino acid making up the protein, as well as a close-up view of electrostatic forces between charged amino acids from both monomers (right).
- Example 2 modified wHv protein This electrostatic interaction between the monomers can be modified.
- An example of such a modification is; if a positively charged amino acid, for example K96, is replaced by a negatively charged amino acid, then the protein which is usually forms a dimer with itself (homodimer), experiences electrostatic repulsion (E64 now meets position 96, now mutated from K to E).
- the K96->E replacement on the protein is accompanied with a complementary replacement of E64->K placed on the corresponding monomer of wHv-Im7, then whenever one wHv monomer and one wHv-Im7 monomer form a dimer (“heterodimer”) this interaction is strengthened by an electrostatic attraction.
- Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96E in wHv-Im7 and E64K into wHv, respectively.
- Panel B shows structural prediction using Alphafold2 database, showing that formation of homodimers composed of wHv/wHv and wHv- Im7/wHv-Im7, respectively, is still favoured despite the mutations introduced (shown is the top-ranked of five models, all of which show homodimer formation). This suggests that the mutations introduced into the amino acid sequences of the monomers to promote formation of heterodimers are not trivial. In order to overcome this problem, the inventors derived and introduced additional mutations as further refinements.
- the resulting structural prediction using Alphafold2 database shows a predicted mixture between heterodimer and homodimer formation.
- further additional intra-chain ionic stabilization was achieved by added mutation of Lysine 67 to glutamate in the wHv149 subunit.
- Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96D in wHv-Im7 and E64K into wHv, respectively, in this example, an addition mutation K67E is introduced to the wHv149 subunit.
- this set of mutations resulted in increased heterodimer formation, as predicted by Alphafold2 (shown in panel B).
- Example 3 mutations in wHv protein promote heterodimer formation
- the inventors found that certain combinations of mutations in amino acid sequence of the monomers combined with a specifically required linkers to join the wHv and the binding molecule, in this example Im7, as well as Barstar proteins, achieved the desired result of heterodimeric capsid proteins.
- This final, optimized, combination of mutations is summarized in Table 2.
- the analysis using Alphafold2 yielded very high statistical scores, indicating that the structures have a high predictive probability of forming the desired heterodimers, as shown in Table 3.
- FIG 7 shows the Ribbon diagram of the top rated structures obtained with the optimized engineered combination of mutations summarized in table 2, generated using Alphafold2, for heterodimers consisting of wHv149/wHv-Im7 (top) and wHv149/wHv-Bs (bottom).
- table 2 shows the Ribbon diagram of the top rated structures obtained with the optimized engineered combination of mutations summarized in table 2, generated using Alphafold2, for heterodimers consisting of wHv149/wHv-Im7 (top) and wHv149/wHv-Bs (bottom).
- a structural overlay of the unmodified wHv protein with the resulting heterodimer mutant proteins shows that they are almost completely identical in structure, as shown in Figure 8. This suggests that they will be able to form intact VLPs.
- figure 9 shows a space filling model generated using icn3d (NCBI/structure; PDB: 6edj), showing the asymmetric units of wild type wHv capsid (left) and the designed wHv149/wHv-Im7 (centre) and wHv149/wHv-Barstar (right) heterodimers, respectively, from the front (top) and from the surface (bottom) of the capsid.
- the dashed line indicates the plane of subunit interaction leading to VLP assembly, which remains undisturbed by the integration of Im7 or Barstar.
- a comparison to the unmodified wHv protein structure suggests that the space occupancy of the mutated structures will not interfere with VLP formation.
- FIG 10 shows a view from the side showing “spike” made from two helices from each monomer protruding to the outside (arrow) and bottom part mediating interaction with other dimers in the capsid ( arrow).
- FIG 11 highlights the a total of four conserved electrostatic interactions in all hepatitis virus (including HBc / wHv) wild type capsid proteins. Protein alignment of capsids from human and woodchuck hepatitis virus showing conservation of E8, R56, E64, K96, mediating conserved electrostatic binding.
- Example 4 wHv capsid heterodimer components are soluble and evenly expressed in E.coli.
- Example 5 HBc capsid heterodimer and binding protein form a complex and co-purify on immobilized metal affinity chromatography.
- Method A pET – derived plasmid harbouring DNA encoding the HBc_Im7 and HBc_wt proteins, each under the control of a separate T7-inducible promoter, was transfected into standard BL21/DE3 E.coli. Recombinant proteins were induced by addition of IPTG to 0.3 mM and incubation at 16C for 16h. A separate plasmid was transfected harbouring the binding protein ColE7-IL31 under the control of a tetracycline-inducible promoter.
- Recombinant protein expression was induced by adding 40 ng/ml of anhydrotretracyline at 16C for 16h. Subsequently, cells were lyzed by sonication and insoluble proteins and inclusion bodies separated by centrifugation. The resulting cytosolic fractions of cells expressing HBc_Im7/ HBc_wt and ColE7-IL31 were mixed at room temperature for 30min, followed by purification on Ni-NTA agarose and subsequent SDS-PAGE analysis of the individual fractions (shown in figure 13).
- HBc_Im7 and HBc_wt proteins when driven by individual T7 promoters, can expressed as soluble proteins in E.coli and, furthermore, that they form a complex with the binding protein ColE7-Il31 which co-purifies on Ni- chromatography, confirming formation of stable complexes (white, dark, and light grey arrows, respectively in the eluate lane).
- Example 6 wHv heterodimeric VLPs can be expressed and autoassemble into large nanoparticles in E.coli, and co-purify on immobilized metal affinity chromatography.
- the epitope protein used was murine Interleukin 31 harbouring a single point mutation to inactivate receptor transactivation (SEQ ID NOs: 35 and 36).
- the epitope was N-terminally fused to Colicin E7 (SEQ ID NOs: 31 and 32) and connected via a rigid alpha helical linker (SEQ ID NOs: 33 and 34)), these proteins and VLP scaffold proteins were cloned onto plasmid DU75351 (SEQ ID NO:30) into BL21/DE3 E.coli.
- Transfected cells were inoculated into LB broth overnight at 30 °C, followed by expansion into 200ml medium the next morning at 37°C in shaker flasks.
- the scaffold protein was under the control of a T7 promoter and induced with 0.3 mM IPTG.
- the epitope protein was under the control of a tetA/tetR promoter and induced with 40 ng/ml anhydrotetracycline. Induction was maintained for 3h.
- Biomass was harvested and lysed in 25mM Tris, pH 7.4, 200mM NaCl using a high pressure homogenizer (Emulsiflex). Thereafter, lysates were digested with Benzonase for 1h and Polysorbate 80 was added to 0.005%.
- Lysates were spun down for 10’ at 25000 rpm and supernatants filtered through 0.45 and 0.22 micron filters, respectively, to yield cytosolic fractions. Cytosols were applied to a discontinuous sucrose gradient in the same buffer and spun for 6h at 255,000G with deactivated deceleration. Individual sucrose fractions were analyzed by SDS PAGE. Immobilized metal affinity chromatography (IMAC): Cytosolic fractions of heterodimeric VLPs decorated with epitope protein were prepared as detailed in the Methods for Figure 1. Cytosols were adjusted to 30mM imidazole, followed by IMAC chromatography on a Sartobind IDA 1ml membrane (Sartorius) charged with Nickel-Sulfate.
- IMAC Immobilized metal affinity chromatography
- Example 7 Determination of Size of VLPs.
- Methods Dynamic Light Scatter analysis: The eluate fraction shown in Figure 15 was subjected to a Multiangle Dynamic Light Scatter analysis (MADLS) using a Malvern Zetasizer Ultra in a quarz microcuvette with 80 microlitre volume. Transmission electron microscopy (TEM): the samples shown in Figure 15 were adsorbed to glow discharged carbon-formvar-coated copper grids and negatively stained with a 1% aqueous uranyl acetate.
- MADLS Multiangle Dynamic Light Scatter analysis
- TEM Transmission electron microscopy
- the grids were examined at 80 kV. Results: The light scattering profile shown in Figure 16 confirms a uniform size distribution of the VLP particles at the expected size range (approximately 32 nm diameter). The TEM images in Figure 17 confirm the size of VLPs determined by DLS analysis in Figure 16. The thickened rim structure and fuzzy outer rim appearance is consistent with decoration of an epitope localized to the surface.
- Table 3 shows the top-ranked model structures produced with the optimized combination of mutations summarized in Table 2 are shown in Figure 7.
- the analysis using Alphafold2 yielded very high statistical scores, indicating that the structures have a high predictive probability, as shown in Table 3.
- Sequences Complementary mutant pairing exemplified in the woodchuck wHv variant Sequence Codes: Underlined – mutated compared to wild type Bold font – amino acid negatively charged at physiological pH and forming inter-molecular electrostatic bond Highlighted font – amino acid positively charged at physiological pH and forming inter- molecular electrostatic bond Double underlined – the linker sequence designed to incorporate the functionalizing protein into the wild type HBc/wHv capsid Italic – the functionalized protein sequence SEQ ID NO: 6 – Barstar protein from Bacillus amyloliquefaciens KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDALTGWVEYPLVLEWRQFEQSKQ LTENGAESVLQVFREAKAEGADITIELS SEQ ID NO: 7 – Im7 protein from E.coli ELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRDDS PEGIV
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Virology (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Molecular Biology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Epidemiology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Mycology (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Communicable Diseases (AREA)
- Oncology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The present invention relates to virus-like particle (VLP) that include one or more viral capsid protein heterodimer(s) each comprising a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence, and; a binding molecule attached exclusively to either the first monomer or the second monomer, and; wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof. The present invention also provides processes of producing the VLPs, and uses of the VLPs in research, diagnosis and as vaccines for use in prevention/treatment of diseases.
Description
VIRUS-LIKE PARTICLES, HETERODIMERIC CAPSID PROTEINS AND METHODS OF PRODUCTION THEREOF Field of the Invention The present invention relates to virus-like particles (VLPs) comprising heterodimeric subunits in which one monomer comprises a single high affinity protein attachment system which allows interchangeable decoration with any functional molecule of choice, wherein the heterodimeric subunits are created by modification of the electrostatic interaction between the monomers forming each heterodimer. The present invention further relates to processes of producing the VLPs, including a rapid single cell process, and uses of the VLPs in research, diagnosis and as vaccines for use in prevention/treatment of diseases. Introduction Virus-like particles (VLPs) are molecules that closely resemble viruses, but contain no viral genetic material. They are formed from viral structural proteins, such as viral capsid proteins that, when individually expressed, self-assemble into a particle. Most Virus-like particles appear as hollow ‘nano-footballs’ where the entire surface of the football is made up by many copies of a single self-assembled protein. For production purposes this means that production of one single protein is sufficient to generate the entire nano-football type VLP structure. This has been exploited in medicine. The most common use of VLPs are as vaccines. Mammals have evolved immune sensing mechanisms to recognise highly repetitive patterns seen on viral capsids as intruders. These patterns are still present in VLPs, which contain repetitive, high density displays of viral surface proteins. Therefore the VLP can generate an immune reaction but the viral genome is typically removed. This is the form of the VLP used in the widespread vaccine against human papillomavirus (HPV) which causes cervical cancer. There are currently a selection of commercially available HPV vaccines of this type such as Cervarix by GlaxoSmithKline along with Gardasil and Gardasil-9, produced by Merck & Co. Further developments of VLPs for use as vaccines involve tethering of other agents to the VLP shell. In this case, the VLP shell serves to present an additional agent as an ‘epitope’ to the immune system and thereby stimulate an immune reaction. In some cases, the viral capsid proteins forming the VLP shell can be modified to directly incorporate the epitope for display through genetic fusion. Current COVID19 vaccines that are under development use this form of VLP, where the spike protein from the coronavirus is directly fused to a viral capsid protein forming a VLP shell from an unrelated virus. However, this approach commonly leads to
impaired VLP assembly and large proteins routinely cause VLP instability. Furthermore, this approach cannot be used if the agent is not protein-based. A further alternative is to assemble the VLP and then use attachment means to secure the agent to the VLP shell. Such VLPs with additional attachment means may be termed ‘compound VLPs’. Compound VLPs may be manufactured by methods such as chemical crosslinking, reactive unnatural amino acids, or the use of binding proteins such as the SpyTag/SpyCatcher system, to covalently attach the desired agent or epitope to the viral capsid proteins forming the VLP. The latter method allows the attachment of other non-protein epitopes to the VLP, but requires a complicated production process and cannot yet be used commercially for any agent. Some desired proteins are simply too large to attach to the VLP using current attachment means, and some complex epitopes include multimers with numerous components that must be separately linked together, which must be achieved by additional chemical crosslinking. The current binding proteins which are used as attachment means, such as in the SpyTag/SpyCatcher system, have further issues in that the binding between the proteins whilst being strong, does not occur instantly but requires time for the reactants to fuse, and can result in VLP aggregation depending on which agent or epitope is attached to the VLP. Further complexities arise in the production of VLPs used in clinical human or veterinary applications, which regulators classify VLPs as “biological” active drug intermediates (ADI’s). “Biologic” drugs are produced in living cells, followed by purification according to a regulator – approved process. Each cell line (regardless whether bacterial, plant, yeast, insect or mammalian) used for the production process is carefully characterized so as to guarantee long-term stability of the ADI and stored under highly specified conditions as a “Master Cell Bank” (MCB). If a VLP requires two (or even more) proteins to be assembled, for example where binding proteins are used to attach an epitope to the VLP shell, then currently one MCB is required for each protein component of the drug, and both require a separate purification process, each requiring separate characterisation procedures. In addition a separate quality- control release is required for each critical drug intermediate and the final ADI, multiplying manufacturing costs. Further to these regulatory complexities, the production of the agent/epitope must be established from scratch for each agent/epitope. The most efficient type of production cell is bacteria (specifically: E.coli). However, many proteins do not assume their native shape when produced in E.coli but must be re-folded into their proper form from a denatured state as part of the purification process, which results in huge drop of overall yield and significantly adds to the complexity of the production process
As a result of these difficulties, the production process of compound VLP-type drugs which attempt to attach agents such as epitopes to the viral capsid proteins is complex and expensive. This has limited the wide-spread exploration of compound VLP applications to fields where inexpensive mass production could make them more competitive. A technology which simplifies the process of making compound VLPs has recently been described which uses the Hepatitis B virus capsid, HBc. In essence, this technology is based on a pair of binding proteins positioned on the surface tip of each monomer forming each homodimer of the VLP, as shown in Figure 2B herein. These binding proteins can in turn act as a docking point to allow other ‘epitope’ proteins to be positioned on the outside of the VLP. However, it is possible that the effectiveness of vaccines made with this technology may be impacted by “crowding” of too many ‘epitope’ proteins being presented on the surface of the VLP depending on the size and shape of the protein being presented. Another potential limitation, as with all HBc VLP vaccines may be that industrial scale manufacture, stability, and storage could be temperature-dependent, posing practical limitations. It is the aim of the present invention to provide a VLP for use as a vaccine that overcomes the problem of “crowding” of proteins presented on the surface by using heterodimeric capsid subunits which are still able to self-assemble but present a single ‘epitope’ protein for each of the dimeric subunits forming the VLP. In addition, the present invention aims to provide a VLP that has improved stability when manufactured at industrial scale. While other technologies, including mRNA, can generate “conventional” vaccines against infectious diseases, vaccines against self-proteins, such as, for example vaccines to treat asthma or dermatitis, cannot be delivered by mRNA platforms. This is because mRNA translated to self-proteins is not loaded onto the surface of presenting cells (follicular dendritic cells) as a whole intact folded protein, and thus is not able to trigger activation of B-cells. As a result, mRNA vaccines do not generate an antibody response against self-proteins. Therefore, there is a general need for improved vaccine development in the areas of chronic inflammatory diseases, animal health, cancer, dementia, and animal health. One or more aspects of the present invention are aimed at solving one or more of the above- mentioned problems. Statements of Invention According to a first aspect of the present invention, there is provided a virus-like particle (VLP) comprising:
- One or more viral capsid protein heterodimer(s) each comprising a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence, and; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof. In one embodiment, the first amino acid sequence and the second amino acid sequence both comprise at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof. In one embodiment, the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence. In one embodiment, the binding molecule is operable to be attached to a functional molecule. In some embodiments, the binding molecule is a binding protein. In some embodiments, the binding protein comprises a bacterial toxin inhibitor. In some embodiments, the bacterial toxin inhibitor is selected from Im7, Im8, Im9, Im2, and Barstar. In one embodiment, one of the first or the second amino acid sequences comprises the sequence as set out in SEQ ID NO: 2 or 3. In one embodiment, one of the first or the second amino acid sequences comprises the sequence as set out in SEQ ID NO: 4 or 5. In one embodiment, the first amino acid sequence comprises the sequence as set out in SEQ ID NO:2 and the second amino acid sequence comprises the sequence as set out in SEQ ID NO:4. In one embodiment, the first amino acid sequence comprises the sequence as set out in SEQ ID NO:3 and the second amino acid sequence comprises the sequence as set out in SEQ ID NO:5. In one embodiment, the first amino acid sequence comprises a mutation corresponding to E8K of SEQ ID NO: 1 or 16, and the second amino acid sequence comprises a mutation corresponding to R56D of SEQ ID NO: 1 or 16. In one embodiment, the first amino acid sequence comprises a mutation corresponding to E64K of SEQ ID NO: 1 or 16, and the second amino acid sequence comprises a mutation corresponding to K96D of SEQ ID NO: 1 or 16.
In one embodiment, the first amino acid or the second amino acid sequence further comprises a mutation corresponding to K67E; R82D and/or E97K of SEQ ID NO: 16. In one embodiment, the first amino acid sequence further comprises a mutation corresponding to K67E; R82D and/or E97K of SEQ ID NO: 16. Suitably in such an embodiment, the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer. In any embodiment herein E97K may be replaced with E79K. In one embodiment, the first amino acid sequence comprises a further mutation corresponding to H88K of SEQ ID NO: 16, and the second amino acid sequence comprises a further mutation corresponding to W71D of SEQ ID NO: 16. Suitably in such an embodiment, the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer. In one embodiment, the second amino acid sequence further comprises a deletion of a serine residue at a position corresponding to residue 78 of SEQ ID NO:16. Suitably in such an embodiment, the viral capsid protein heterodimer is a woodchuck hepatitis viral capsid protein heterodimer. In one embodiment, the binding molecule is attached to the second amino acid sequence. In some embodiments, the viral capsid protein is from any virus having a dimeric capsid protein. In one embodiment, the viral capsid protein is a hepatitis capsid protein. In one embodiment, the viral hepatitis capsid protein is a Woodchuck hepatitis capsid protein. According to a second aspect of the present invention, there is provided a viral capsid protein heterodimer comprising: - a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof. In one embodiment of the second aspect, the first amino acid sequence comprises a least one mutation and the second amino acid sequence comprises at least one mutation. In one embodiment the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence.
According to a third aspect of the present invention, there is provided a viral capsid protein monomer encoded by an amino acid sequence - wherein the amino acid sequence comprises at least one mutation and, - the at least one mutation modifies the electrostatic interaction between the viral capsid protein monomer and other viral capsid protein monomers to promote heterodimeric association thereof, - wherein optionally the viral capsid protein monomer comprises a binding molecule attached thereto. According to a fourth aspect of the present invention, there is provided a first or second viral capsid protein monomer encoded by: - a first amino acid sequence comprising at least one mutation, or - a second amino acid sequence comprising at least one mutation respectively; - wherein the at least one mutation in the first and second amino acid sequences are different, and - wherein the mutations in the first or second amino acid sequences promote association of the first monomer comprising the first amino acid sequence with the second monomer comprising the second amino acid sequence, - wherein optionally one of the first or second monomers exclusively comprises a binding molecule attached thereto. According to a fifth aspect of the present invention, there is provided a nucleic acid encoding the viral capsid protein heterodimer according to the second aspect or a viral capsid protein monomer according to the third or fourth aspect of the invention. According to a sixth aspect of the present invention, there is provided a nucleic acid construct encoding the viral capsid protein heterodimer according to the second aspect, the construct comprising: - a first nucleic acid sequence encoding a first viral capsid protein monomer according to the third or fourth aspect; - a second nucleic acid sequence encoding a second viral capsid protein monomer attached to a binding molecule according to the third or fourth aspect. According to a seventh aspect of the present invention, there is provided a vector comprising one or more of the nucleic acids according to the fifth aspect or the construct according to the sixth aspect of the invention.
According to an eighth aspect of the present invention, there is provided a host cell comprising one or more of the nucleic acids according to the fifth aspect, the construct according to the sixth aspect, or the vector according to the seventh aspect. According to a ninth aspect of the present invention, there is provided a process of producing a virus-like particle (VLP) in a single host cell comprising: a) Providing a host cell comprising i. a first nucleic acid encoding a first viral capsid protein monomer according to the third or fourth aspects, ii. a second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule according to the third or fourth aspects, iii. a third nucleic acid encoding a functional molecule operable to bind to the binding molecule; b) Culturing the host cell under conditions to express the proteins from the nucleic acids; c) Forming functionalised viral capsid protein heterodimers; d) Forming virus-like particles from the functionalised viral capsid protein heterodimers. In some embodiments of the ninth aspect of the invention, the nucleic acids are comprised on one or more vectors. In one embodiment, the first, second and third nucleic acids are comprised on one vector, suitably a vector according to the seventh aspect. According to a tenth aspect of the present invention, there is provided a process of producing a virus-like particle (VLP), comprising; (a) Providing a first host cell comprising: (i) a first nucleic acid encoding a first viral capsid protein monomer according to the third or fourth aspects, (ii) a second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule according to the third or fourth aspects, (b) Providing a second host cell comprising: (i) a third nucleic acid encoding a functional molecule operable to bind to the binding molecule; (c) Culturing the first and second host cells under conditions to express the proteins from the first, second, and third nucleic acids respectively; (d) Recovering the proteins;
(e) Mixing the proteins to (i) form functionalised viral capsid protein heterodimers and (ii) form virus-like particles from the functionalised viral capsid protein heterodimers. In some embodiments of the tenth aspect of the invention, the first nucleic acid encoding the first viral capsid protein monomer and the second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule are comprised on a first vector. In one embodiment, the third nucleic acid is comprised on a second vector. In some embodiments of any of the above aspects, the or each viral capsid protein monomer is from any virus having a dimeric capsid protein. In one embodiment, the or each viral capsid protein monomer is a hepatitis capsid protein monomer. In one embodiment, the or each viral hepatitis capsid protein monomer is a Woodchuck hepatitis capsid protein monomer. According to an eleventh aspect of the present invention, there is provided a cell culture comprising one or more host cells according to the eighth aspect of the invention. According to a twelfth aspect of the present invention, there is provided an immunogenic composition comprising the virus-like particle according to the first aspect. According to a thirteenth aspect of the present invention, there is provided a virus-like particle (VLP) according to the first aspect, an immunogenic composition according to the twelfth aspect of the invention for use as a medicament. According to a fourteenth aspect of the present invention, there is provided a virus-like particle (VLP) according to the first aspect, or an immunogenic composition according to the twelfth aspect for use in the prevention and/or treatment of infectious diseases, cardiovascular diseases, cancer, inflammatory diseases, autoimmune diseases, neurological disease, rheumatological degenerative disease, or addiction. According to a fifteenth aspect of the present invention, there is provided use of a virus-like particle (VLP) according to the first aspect of the invention in research, or in the diagnosis of a disease. According to a sixteenth aspect of the present invention, there is provided a method of diagnosing a disease in a subject comprising: (a) Providing a virus like particle according to the first aspect of the invention, wherein the binding molecule is attached to a functional molecule and, wherein the functional
molecule is an antibody directed towards an antigen derived from a disease causing agent; (b) Mixing the virus like particle with a suitable sample from the subject; (c) Detecting whether the virus like particle precipitates; (d) Diagnosing the presence of a disease if the virus like particle precipitates. The present invention relates to VLPs comprising one or more viral capsid protein heterodimer(s). The Heterodimer(s) each comprise a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence. The first or the second monomer having a binding molecule attached, exclusively to the first or the second monomer. The first and/or second amino acid sequences comprise at least one mutation as compare to the wild-type sequence, such that, the mutation modifies the electrostatic interaction between the first and the second monomers as compared to wild-type sequences. This modification in the electrostatic interaction promotes the formation of a heterodimer comprising the first monomer and the second monomer, and a binding molecule attached exclusively to either the first or the second monomer of the capsid protein heterodimer. Many wild-type viral capsid proteins are made up of dimers formed from identical proteins, that is, the monomeric subunits of the homodimers are identical, such as hepatitis capsid proteins. These homodimers self-assemble to form VLPs. The inventors have found that even if one of the monomers of the dimer has a binding molecule attached, homodimers are still formed. This results in homodimers that have a binding molecule attached to each monomer, or a homodimer having no binding molecules attached. This does not solve the problems in the art discussed above in which crowding of functional molecules on the VLP surface inhibits VLP formation and activity. The inventors have now found that certain mutations in the amino acid sequences encoding the monomers can overcome the usual formation of homodimers and promote association of monomers wherein a binding molecule is attached to only one of the monomers, thus forming a heterodimer. The inventors have achieved this by making targeted mutations in each of the two monomers in order to influence the electrostatic attraction of the monomers, such that they instead preferentially form heterodimers. Each heterodimer having only one binding molecule attached exclusively to either the first monomer or the second monomer which may then be functionalised, allows for larger functional molecules to be attached to the VLP surface without steric hindrance. If the functional molecule is a relatively large molecule, more space per molecule is needed to avoid “crowding” of molecules presented on the surface of the VLP. According to any aspect of the present invention, the viral capsid protein may be a hepatitis B
capsid protein (HBc). Suitably the viral capsid protein may be a human hepatitis B capsid protein. Alternatively, the viral capsid protein may be a Woodchuck hepatitis virus capsid protein (wHv). Suitably, use of wHv capsid protein confers particular advantages over that of human hepatitis capsid protein. For example, wHv can form capsids at cold temperatures (> 60% capsid formation occurs at temperatures as low as 4°C, Kukreja 2014). This can result in an increased yield of VLPs in low temperature and allows the use of bacterial fermentation at lower temperatures which is preferable for production of complex proteins. Features and embodiments of the above aspects are described further under headed sections below. Any feature or embodiment may be combined with any aspect in any workable combination. While the making and using of various embodiments of the present invention are discussed in detail below, it should be appreciated that the present invention provides many applicable inventive concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed herein are merely illustrative of specific ways to make and use the invention and do not delimit the scope of the invention. To facilitate the understanding of this invention, a number of terms are defined below. Terms defined herein have meanings as commonly understood by a person of ordinary skill in the areas relevant to the present invention. Terms such as "a", "an" and "the" are not intended to refer to only a singular entity, but include the general class of which a specific example may be used for illustration. The terminology herein is used to describe specific embodiments of the invention, but their usage does not delimit the invention, except as outlined in the claims. The discussion of the background to the invention herein is included to explain the context of the invention. This is not to be taken as an admission that any of the material referred to was published, known, or part of the common general knowledge in any country as of the priority date of any of the claims. The present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See, for example, Current Protocols in Molecular Biology (Ausubel, 2000, Wiley and son Inc, Library of Congress, USA); Molecular Cloning: A Laboratory Manual, Third Edition, (Sambrook et al, 2001, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory
Press); Oligonucleotide Synthesis (M. J. Gait ed., 1984); U.S. Pat. No.4,683,195; Nucleic Acid Hybridization (Harries and Higgins eds. 1984); Transcription and Translation (Hames and Higgins eds.1984); Culture of Animal Cells (Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells and Enzymes (IRL Press, 1986); Perbal, A Practical Guide to Molecular Cloning (1984); the series, Methods in Enzymology (Abelson and Simon, eds. -in-chief, Academic Press, Inc., New York), specifically, Vols.154 and 155 (Wu et al. eds.) and Vol.185, "Gene Expression Technology" (Goeddel, ed.); Gene Transfer Vectors For Mammalian Cells (Miller and Calos eds., 1987, Cold Spring Harbor Laboratory); Immunochemical Methods in Cell and Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook of Experimental Immunology, Vols. I-IV (Weir and Blackwell, eds., 1986); and Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). The terms "identity" and "identical" and the like refer to the sequence similarity between two polymeric molecules, e.g., between two nucleic acid molecules, such as between two DNA molecules, or between two protein molecules. Sequence alignments and determination of sequence identity can be done, e.g., using the Basic Local Alignment Search Tool (BLAST) originally described by Altschul et al. 1990 (J Mol Biol 215: 403-10), such as the "Blast 2 sequences" algorithm described by Tatusova and Madden 1999 (FEMS Microbiol Lett 174: 247-250). Methods for aligning sequences for comparison are well-known in the art. Various programs and alignment algorithms are described in, for example: Smith and Waterman (1981) Adv. Appl. Math.2:482; Needleman and Wunsch (1970) J. Mol. Biol.48:443; Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85:2444; Higgins and Sharp (1988) Gene 73:237-44; Higgins and Sharp (1989) CABIOS 5:151-3; Corpet et al. (1988) Nucleic Acids Res.16:10881- 90; Huang et al. (1992) Comp. Appl. Biosci. 8:155-65; Pearson et al. (1994) Methods Mol. Biol. 24:307-31; Tatiana et al. (1999) FEMS Microbiol. Lett. 174:247-50. A detailed consideration of sequence alignment methods and homology calculations can be found in, e.g., Altschul et al. (1990) J. Mol. Biol.215:403-10. The National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST™; Altschul et al. (1990)) is available from several sources, including the National Center for Biotechnology Information (Bethesda, MD), and on the internet, for use in connection with several sequence analysis programs. A description of how to determine sequence identity using this program is available on the internet under the "help" section for BLAST™. For comparisons of nucleic acid sequences, the "Blast 2 sequences" function of the BLAST™ (Blastn) program may be employed using the default parameters. Nucleic acid
sequences with even greater similarity to the reference sequences will show increasing percentage identity when assessed by this method. Typically, the percentage sequence identity is calculated over the entire length of the sequence. For example, a global optimal alignment is suitably found by the Needleman-Wunsch algorithm with the following scoring parameters: Match score: +2, Mismatch score: -3; Gap penalties: gap open 5, gap extension 2. The percentage identity of the resulting optimal global alignment is suitably calculated by the ratio of the number of aligned bases to the total length of the alignment, where the alignment length includes both matches and mismatches, multiplied by 100. The term “vector” is well known in the art, and as used herein refers to a nucleic acid molecule, e.g. double-stranded DNA, which may have inserted into it a nucleic acid sequence according to the present invention. A vector is suitably used to transport an inserted nucleic acid molecule into a suitable host cell. A vector typically contains all of the necessary elements that permit transcribing the insert nucleic acid molecule, and, preferably, translating the transcript into a polypeptide. A vector typically contains all of the necessary elements such that, once the vector is in a host cell, the vector can replicate independently of, or coincidental with, the host chromosomal DNA; several copies of the vector and its inserted nucleic acid molecule may be generated. The term “operably linked”, “operably connected” or equivalent expressions as used herein refer to the arrangement of various nucleic acid elements relative to each other such that the elements are functionally connected and are able to interact with each other in the manner intended. The terms “therapy” “therapeutic” “treatment” or “treating” refer to reducing, ameliorating or eliminating one or more signs, symptoms, or effects of a disease or condition. "Treatment," or “therapy” as used herein thus includes any treatment of a disease in a mammal, particularly in a human, and includes: (a) preventing the disease from occurring in a subject predisposed to the disease or at risk of acquiring the disease but has not yet been diagnosed as having it; (b) inhibiting the disease, i.e., arresting its development; and (c) relieving the disease, i.e., causing regression of the disease. The “administration” of an agent to a subject includes any route of introducing or delivering to a subject the agent to perform its intended function. Administration can be carried out by any suitable route, including orally, intranasally, intraocularly, ophthalmically, parenterally (intravascularly, intramuscularly, intraperitoneally, or subcutaneously), or topically. Administration includes self-administration and the administration by another.
The terms “individual,” “subject,” and “patient” are used interchangeably, and refer to any individual subject with a disease or condition in need of therapy, suitably in need of therapy by treatment with the present invention. For the purposes of the present disclosure, the subject may be a human or animal, for example primate, preferably a human, or another mammal, such as a dog, cat, horse, pig, goat, or bovine, and the like. Figures The invention may be described with reference to the following figures in which: Figure 1 shows: a cartoon diagram representing a VLP capsid and subunits. On the top left, the overall structure of the Woodchuck hepatitis virus (wHv) capsid. On the top right: close up (side view and ‘top’ view, respectively) of one so-called “asymmetric unit” composed of two homodimers which in turn are formed each by joining together of one single capsid protein. At the bottom of the figure, a ribbon diagram of one single “homodimer” composed of two identical monomer protein subunits is shown (side view and ‘top’ view, respectively). In this way, the entire capsid is formed from ordered assembly of a single protein where the central building block is a homodimer. For clarity, the single wHv capsid protein is shown without the C- terminal domain which conveys binding to RNA in the wild type virus. Only the N-terminal portion, truncated after aa149, is required for the formation of the capsid. Therefore, “wHv” designation of the capsid protein in the following figure legends, and elsewhere herein, refers to this truncated protein, also designated herein as “wHv149”. Figure 2 shows: a cartoon representation of the present invention. From top to bottom (A) shows a wild type homodimeric capsid subunit (as in Figure 1, but here shown as space-filling model), (B) shows a prior art homodimeric capsid subunit which has been modified to allow two functional molecules (dark grey, on top, facing the surface of the capsid) to be presented by each dimeric subunit on the VLP surface. (C) shows the heterodimeric subunit of the present invention, which allows only one single functional molecule (dark grey, on top, facing the surface of the capsid) to be presented by each heterodimeric subunit on the surface of the VLP. This allows the production of VLP vaccines where only one protein is presented for each of the dimeric capsid subunits. Figure 3 shows: a diagrammatic representation of an approach to creating a mutant wHv capsid where one dimer displays a binding protein such as an integrated Im7 protein toward the VLP surface. Panel A shows an expression vector that allows simultaneous expression of a wild type wHv protein (ORF2) and a wHv protein carrying the binding protein (an integrated Im7 protein, ORF1), driven by an independent second promoter or through an internal ribosome binding site (RBS). This construct can result in either formation of homodimers
(ORF2 + ORF2, as shown, or ORF1 + ORF1, which is not illustrated here) or heterodimers (ORF1 + ORF2). Panel B shows a close up of the central ‘tip’ of the wHv dimer (dashed boxed in Panel A). Highlighted are negatively charged (E64) and positively charged (K96) amino acids, respectively, which create electrostatic attraction, contributing to dimer formation of two subunits. Figure 4 shows: a preliminary approach to create asymmetrical interaction between two wHv monomers in order to favour formation of heterodimers over homodimer formation. Panel A shows a schematic of an exemplary expression vector. Complementary mutations are introduced into the two different monomers: K96E in wHv-Im7 and E64K into wHv, respectively. Below shown is a schematic showing how exemplary complementary mutations generate local electrostatic repulsion whenever two wHv monomers interact, while creating electrostatic attraction whenever a wHv and a modified-wHv monomer interact. In this example, the modified-wHv is modified with an integrated Im7 protein. Panel B shows structural prediction using Alphafold2 database (Jumper, J et al. (2021) and Varadi, M et al. (2021), showing that formation of homodimers composed of wHv/wHv and wHv-Im7/wHv-Im7, respectively, is still favoured despite the mutations introduced (shown is the top-ranked of five models, all of which show homodimer formation). Figure 5 shows: mutational refinement in the wHv monomers. In this example, lysine96 (K96) in wHv is replaced by aspartate (D96), instead of glutamate (E96), in contrast to the model shown in figure 4. Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96D in wHv- Im7 and E64K into wHv, respectively. The mutation of K96 to D96 reduces side-chain bulkiness, shown on the left of panel B. The resulting structural prediction using Alphafold2 database (right) shows a predicted mixture between heterodimer and homodimer formation. Figure 6 shows: Additional intra-chain ionic stabilization by added mutation of Lysine 67 (K67) to glutamate (E67) in the wHv149 subunit. Panel A shows a schematic of an exemplary expression vector showing that, in contrast to figure 5, an additional mutation is introduced into the wHv monomer (ORF2). In this example, an additional mutation K67E is introduced to the wHv149 subunit. This added mutation results in increased heterodimer formation since now E67 interacts with K64 and no longer with D63 within the same protein chain. This, in turn stabilizes the interaction of K64 with D96 of the heterodimeric partner monomer (ORF1). Nonetheless, still a mix of hetero- and homodimer structures is predicted by Alphafold2 (shown in panel B). This figure shows that exclusive formation of heterodimers is difficult to achieve. Figure 7 shows: a ribbon diagram of the top rated (by Alphafold2 database) heterodimeric capsid subunit structures obtained with the optimized engineered combination of mutations
(wHv149: E8K, E64K, K67E, R82D, E79K, H88K; wHv149-Im7/Bs: R56D, K96D, W71D, summarized in table 2 of the examples), generated using Alphafold2 software, for heterodimers consisting of wHv149/wHv-Im7 (top) and wHv149/wHv-Bs (bottom). Figure 8 shows: an overlay of ribbon-diagram structures of wild type wHv with the combination of mutants listed in table 2, generated using the icn3d web-app (NCBI/structure; PDB: 6edj) and Alphafold2 database, respectively. Left: the wHv149 subunit, right: the wHv- Im7 subunit. The Im7-domain on top of the tip of wHv-Im7 is not shown, as it does not align with wild type wHv. The figure illustrates that the enacted mutations to not impact on the backbone structure of the mutated monomer proteins. Figure 9 shows: Space filling model generated using icn3d (NCBI/structure; PDB: 6edj), showing the wild type wHv (left) and the modified wHv149/wHv-Im7 (centre) and wHv149/wHv-Barstar (right) heterodimers, respectively, from the front (top) and from the surface (bottom) of the capsid. The dashed line (top) indicates the plane of subunit interaction leading to VLP assembly, which remains undisturbed by the integration of binding proteins (Im7 or Barstar). The bottom illustration shows that, compared to the wild type (left) the space occupancy of the heterodimers remains essentially unchanged. Figure 10 shows: (A) view from the side showing “spike” made from two helices from each monomer protruding to the outside (top arrow) and bottom part mediating interaction with other dimers in the capsid (bottom arrow). Panel (B) shows the wild-type amino acid sequences of both the Woodchuck (wild type wHv )(SEQ ID NO: 16) and Human hepatitis B capsid protein (SEQ ID NO: 1) monomers (subunit). The bold residues show differences in the amino acid sequences between Woodchuck and Human hepatitis capsid protein monomers and the highlighted amino acids relate to amino acids capable of forming inter-molecular electrostatic bonds. Figure 11 shows: the electrostatic interactions occurring between to wild type monomers forming a homodimer in the human hepatitis B virus capsid. Left: a ribbon diagram of the homodimer where dark circles and light circles highlight the position of electrostatic interactions between subunits in the ‘tip’ (dark) and in the ‘base’ (light) of the homodimer, respectively. Right: the encircled regions from the left side of the panel are shown at close up (‘tip’ region: top; ‘base’ region’: bottom) in a side-view, and 90 degrees rotated, respectively. The diagram shows that in the wild type HBc dimer E64 of each monomer interacts with K96 of the other monomer (top) and that E8 of each monomer interacts with R56 of the other monomer (bottom). The figure shows that complementary mutations in the human HBc virus capsid can be enacted analogous to the mutations illustrated for the wHv capsid in order to
drive assembly of heterodimers, when one of the monomers has been altered, for example by in-frame fusion of Im7 or Bs (or any other change only implemented in one of the monomers). Figure 12 shows: panel A shows a diagram of an exemplary plasmid to produce the monomeric subunits of the VLP capsid. wHv capsid heterodimer components are soluble and evenly expressed in E.coli. Panel B shows SDS-PAGE gel of showing that both wHv_Im7 and wHv proteins, when driven by individual T7 promoters, can be expressed as soluble proteins in E.coli and are synthesized at approximately even stoichiometric ratios ( black arrows). Figure 13 shows: purification of a human Hepatitis B capsid dimer linked to a binding protein via IMAC. Left lane is the soluble cytosolic fraction of E.coli lysates where expression of the three proteins HBc-Im7 (white arrow), HBc-wt (dark grey arrow), ColE7-IL31 (light grey arrow) had been induced. This was applied to a Ni-NTA column. Lanes 2 and 3 show the flow-through and wash fraction (washed with 40mM imidazole, respectively). Lane 4 shows the fraction eluted with 200 mM imidazole. The relative band density between ColE7-IL31 and HBc-Im7 is approximately even, in keeping with a stoichiometric ratio of 1:1. The band of HBc-wt (light grey) is slightly weaker, owing to reduced molecular weight, but consistent with a 1:1 stoichiometric ratio between HBc-Im7 and HBc-wt. These data illustrate that all three proteins exist in complex and co-purify. Figure 14 shows: SDS PAGE analysis of a discontinuous density gradient run on a heterodimeric VLP decorated with a surface epitope protein (functional molecule). After 6h of 120,000g in the ultracentrifuge, all of the epitope, as well as the two heterodimeric VLP scaffold proteins partition to the 50% and 60% cushion, indicating high – density nanoparticle formation. Furthermore, the staining density of the epitope protein (white arrow) and the Im7- incorporating WHc protein (grey arrow) are approximately even. This indicates that most or all of the WHcIm7 moieties are loaded with epitope protein. Figure 15 shows: SDS PAGE analysis of heterodimeric WHcIm7/WHc VLPs decorated with an epitope protein. The middle lane shows a cytosolic fraction. The right lane shows the same preparation after purification via immobilized metal affinity chromatography (IMAC). The metal-binding tag is located on the epitope protein (white arrow). The data show that both the WHcIm7 (grey arrow) and the WHc protein (black arrow) co-purify, confirming that they are both bound to the epitope. The density of epitope and WHcIm7 is approximately even, indicating that the VLPs are fully decorated with epitope protein. Figure 16 shows: Dynamic Light Scatter (DLS) Analysis of the VLP particles analysed in Figure 15. The light scattering profile confirms uniform size distribution of particles at the expected size range (approximately 32 nm diameter).
Figure 17 shows: Transmission Electron Microscopy (TEM) close-up of WHcIm7/WHc heterodimeric VLPs decorated with a surface epitope. Analysis done with 80 kV at 30,000x magnification. Scale bar 50 nm. The images confirm the size of VLPs determined by DLS analysis in Figure 16. The thickened rim structure and fuzzy outer rim appearance is consistent with decoration of an epitope localized to the surface. Description Viral Capsid Protein The present invention relates to VLPs which comprise one or more viral capsid proteins, the viral capsid proteins self-assemble into the VLP, to which functional molecules can then be attached through a binding molecule which may be a binding protein and/or chemical modification as discussed elsewhere herein. In accordance with any aspect, the viral capsid protein is from any virus having a dimeric capsid protein, suitably from any virus having a dimeric capsid protein formed from identical monomers. In one embodiment, the viral capsid protein is a hepatitis capsid protein, which may be selected from Hepatitis A, B, C or D. In one embodiment, the viral capsid protein is a Hepatitis B viral capsid protein (HBc). Suitably, the viral capsid protein is a mammalian Hepatitis viral capsid protein. Suitably, the viral capsid protein is a human Hepatitis B viral capsid protein. In a preferred embodiment, the viral capsid protein is a Woodchuck (Marmota monax) Hepatitis viral capsid protein (wHv). In one embodiment of any of the aspects, the viral capsid protein is a heterodimer, suitably therefore a heterodimeric viral capsid protein. Suitably, each heterodimeric viral capsid protein is made up of a first monomer and a second monomer. Suitably each heterodimeric viral capsid protein is attached to a binding molecule. Suitably therefore each heterodimeric viral capsid protein displays a binding molecule. Suitably each heterodimeric viral capsid protein is attached to a single binding molecule, suitably therefore one of the monomers of each heterodimeric capsid protein is attached to a binding molecule. Suitably the first or second monomer that forms the heterodimeric viral capsid protein is modified to display a binding molecule. Suitably the first or second monomer of the
heterodimeric viral capsid protein is fused to binding molecule. Suitably the first or second monomer of the heterodimeric viral capsid protein is modified to display a binding molecule by fusing the binding molecule to the monomer of the heterodimeric viral capsid protein. Suitably the first or second monomer of the heterodimeric viral capsid protein is modified to display a binding molecule by inserting the amino acid sequence of the binding molecule into the amino acid sequence of the relevant monomer of the heterodimeric viral capsid protein. Suitably the amino acid sequence of the binding molecule is inserted into the major immunodominant region of the monomer of the heterodimeric viral capsid protein. Suitably the binding molecule is fused to the major immunodominant region of the monomer of the heterodimeric viral capsid protein. In an embodiment of the invention, the binding molecule is inserted between amino acids corresponding to residues 76 and 80 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein. Suitably the binding molecule is inserted between amino acid residues corresponding to 77 and 79 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein. Suitably the binding molecule is inserted between amino acid residues corresponding to 77 and 78 of the major immunodominant region of the monomer of the heterodimeric viral capsid protein. Suitably, the VLP of the invention comprises one or more linkers. Suitably the linkers join the amino acid sequence of the monomer in accordance with the third or fourth aspect with the binding molecule. Suitably, the linkers are located between the protein coding sequences of the monomer and the binding molecule. In one embodiment, the binding molecule is a binding protein and a linker is located at the N and C terminus of the binding protein, suitably to link to the heterodimeric viral capsid protein. Suitably a further linker may also be located between the functional molecule and the binding protein. Suitably the further linker may comprise an alpha helix. Suitably the further linker may comprise or consist of the sequence of LAEAAAKEAAAKEAAKAA (SEQ ID NO: 33). Suitably a further linker may also be located between the binding molecule and a further binding molecule. Suitably each linker is between 5 to 50 amino acids in length. Suitably each linker is 5, 10, 15, 20, 21, 25, 30, 35, 40 amino acids in length. Suitably each linker is 9, 10 or 11 amino acids in length. Suitably each linker comprises the sequence: GGGGSGGGGS (SEQ ID NO:9) or GGGGGSGGGGS (SEQ ID NO: 10), SGGGSSGSG (SEQ ID NO: 11), KAAAEKAAAE (SEQ ID NO: 14), GGKAAAE (SEQ ID NO:15) or LAEAAAKEAAAKEAAKAA (SEQ ID NO: 33). In one embodiment, the linkers used to link the amino acid sequence of
the monomer in accordance with the third or fourth aspect with the binding molecule comprise either KAAAEKAAAE (SEQ ID NO: 14) or GGKAAAE (SEQ ID NO:15). Suitably, the first linker is KAAAEKAAAE (SEQ ID NO: 14) which links the N terminus of the binding protein to the heterodimeric viral capsid protein. Suitably the second linker is GGKAAAE (SEQ ID NO:15) which links the C terminus of the binding protein to the heterodimeric viral capsid protein. Suitably, as explained elsewhere herein, the binding molecule is a binding protein. Suitably, the viral capsid protein is a hepatitis B capsid protein and may comprise the amino acid sequence of SEQ ID NO: 2, suitably forming the first monomer. Suitably, the viral capsid protein is a hepatitis B capsid protein and may comprise the amino acid sequence of SEQ ID NO: 4, suitably forming the second monomer. In one embodiment, therefore, the heterodimeric viral capsid protein is a heterodimeric hepatitis B capsid protein and comprises the amino acid sequence of SEQ ID NO:2, and the amino acid sequence of SEQ ID NO:4. Suitably, the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise a truncated viral capsid protein. Suitably one or both monomers of the woodchuck hepatitis capsid protein are truncated. Suitably both monomers of the woodchuck hepatitis capsid protein are truncated, suitably comprising a truncation of the C terminus. Suitably the C- terminus is removed from the or each monomer of the woodchuck hepatitis capsid protein. Suitably therefore the or each woodchuck hepatitis capsid protein monomer comprises amino acids 1 to 149 of the wild type woodchuck hepatitis capsid protein monomer sequence. Suitably references herein to woodchuck hepatitis capsid protein, wHv capsid protein, or wHv149 capsid protein are to the truncated form. Suitably such a truncated monomer sequence is shown in SEQ ID NO:16, and is additionally shown in SEQ ID NO:3 and 5 with the mutations described herein. Suitably, the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise the amino acid sequence of SEQ ID NO: 3, suitably forming the first monomer. Optionally the sequence may comprise mutations at positions K67, R82, E97, and/or H88, which are described further herein. Suitably E97 may be replaced with E79. Suitably, the viral capsid protein is a woodchuck hepatitis capsid protein and may comprise the amino acid sequence of SEQ ID NO: 5, suitably forming the second monomer. Optionally the sequence may further comprise mutation at position W71 and/or a deletion at S78, which are described further herein.
In one embodiment, therefore, the heterodimeric viral capsid protein is a heterodimeric woodchuck hepatitis capsid protein and comprises the amino acid sequence of SEQ ID NO:3, and the amino acid sequence of SEQ ID NO:5. Optionally wherein the SEQ ID NO:3 further comprises the additional mutations at positions K67, R82, E97, and/or H88, which are described further herein, and SEQ ID NO:5 further comprises the additional mutation at position W71 and/or a deletion at S78, which are described further herein. Suitably E97 may be replaced with E79. Modified Monomers and Heterodimers Suitably the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises a mutation as compared to the wild- type sequence. Suitably, a mutation may be an insertion, deletion or substitution. Suitably the first amino acid sequence encoding the first monomer and the second amino acid sequence encoding the second monomer each comprise at least one mutation as compared to the wild-type sequence. Suitably, the or each mutation in the first amino acid sequence is different to the or each mutation in the second amino acid sequence. Suitably therefore the amino acid sequence of the first monomer is different to the amino acid sequence of the second monomer. Suitably, a mutation in the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer modifies the electrostatic interactions between the monomers as compared to the wild type sequences. Suitably the or each mutation modifies the electrostatic interaction between the monomers to promote association of the monomers to form heterodimers. Suitably the viral capsid protein may comprise further modifications. Suitable modifications may include: insertions, deletions, substituents, truncations, reversals, repeats, or the like in the amino acid sequence encoding the protein. Suitably the viral capsid protein may comprise further modifications in the major immunodominant region. Suitably such modifications aid the insertion of the binding molecule into the viral capsid protein. Suitably the viral capsid protein may comprise amino acid deletions. Suitably the viral capsid protein may comprise amino acid deletions in the major immunodominant region. Suitably the viral capsid protein may comprise amino acid deletions in the major immunodominant region which remove negatively charged amino acids. Suitably, the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprise at least one mutation.
Suitably the first amino acid sequence encoding the first monomer and the second amino acid sequence encoding the second monomer each comprise at least one mutation. Suitably the at least one mutation of the first amino acid sequence is different to the at least one mutation of the second amino acid sequence. The person skilled in the art would understand that a mutation to an amino acid sequence can include insertions, deletions or substitutions. Suitably, the first and/or second amino acid sequences may include further modifications such as truncations, reversals, repeats, or the like. In a suitable embodiment, the first and/or second amino acid sequences include at least one mutation (for example, a substitution, addition, or deletion) as compared to the wild type sequence. Suitably, the first and/or second amino acid sequences include at least 2 mutations, at least 3 mutations, at least 4 mutations, at least 5 mutations, at least 6 mutations, at least 7 mutations, at least 8 mutations, at least 9 mutations, at least 10 mutations, at least 11 mutations, at least 12 mutations, at least 13 mutations, at least 14 mutations, at least 15 mutations, at least 16 mutations, at least 17 mutations, at least 18 mutations, at least 19 mutations, at least 20 mutations, at least 21 mutations, at least 22 mutations, at least 23 mutations, at least 24 mutations, at least 25 mutations, at least 26 mutations, at least 27 mutations, at least 28 mutations, at least 29 mutations or at least 30 mutations as compared to the wild type sequence. Suitably the or each mutation is present in the alpha helices of the first or second monomers. Suitably the or each mutation may be an amino acid substitution. Suitably the or each mutation may be an amino acid deletion. Suitably the amino acid substitution(s) modify the electrostatic interactions between the first and second monomers to promote association of the first and second monomers. Suitably, a substitution in the first and/or second amino acid sequence may be a substitution of a negatively charged amino acid with a positively charged amino acid. Suitably, a substitution in the first and/or second amino acid sequence may be a substitution of a positively charged amino acid with a negatively charged amino acid. Suitably a mutation in the first amino acid sequence may be complementary to a mutation of the second amino acid sequence. Suitably therefore the mutations in the first and second amino acid sequences may be regarded as pairs, suitably as complementary pairs. Suitably therefore the first and second amino acid sequences may comprise at least one pair of
complementary mutations. Suitably, a pair of complementary mutations may comprise at least one positively charged amino acid and at least one negatively charged amino acid. Suitably the electrostatic interaction may be an electrostatic attraction or repulsion. In the context of the present invention, an electrostatic attraction is an interaction that occurs between molecules that have opposite partial charges. Suitably, these molecules do not form a covalent bond, but interact with each other if they are in close proximity to their opposite partial charge. In the case of an electrostatic attraction, the molecules are pulled together by the attraction between the opposite partial charges. Suitably, an electrostatic repulsion is an interaction that occurs between molecules that have the same partial charge. These molecules do not form a covalent bond but interact with each other if they are in close proximity to the same partial charge. In the case of electrostatic repulsion, the molecules are pushed away from each other by the repulsion of the same partial charge. Suitably, in the context of the present invention, the molecules described above relate to monomers of the present invention. Suitably a mutation of the first amino acid sequence may form an electrostatic interaction with a mutation in the second amino acid sequence. Alternatively, a mutation of the first amino acid sequence may form an electrostatic interaction with a non-mutated residue of the second amino acid sequence. Alternatively a mutation of the second amino acid sequence may form an electrostatic interaction with a non-mutated residue of the first amino acid sequence. Suitably an electrostatic interaction may be formed between a positively charged amino acid and a negatively charged amino acid. Suitably the electrostatic interaction may be an attraction. In a suitable embodiment, the first and second amino acid sequences may form an electrostatic attraction between a positively charged amino acid and a negatively charged amino acid. Suitably the positively charged amino acid is comprised on the first amino acid sequence and the negatively charged amino acid is comprised on the second amino acid sequence. Suitably, an electrostatic interaction may be a repulsion. Suitably, an electrostatic repulsion may be formed between two or more amino acids having a similar charge. Suitably, an electrostatic repulsion may be formed between two or more amino acids having a positive charge. Suitably, an electrostatic repulsion may be formed between two or more amino acids
having a negative charge. In one embodiment, the heterodimer comprises a binding molecule attached to either the first or second monomer. Suitably, the heterodimer comprises a mutation in the first amino acid sequence encoding the first monomer and a different mutation in the second amino acid sequence encoding the second monomer and a binding molecule attached to either the first or second monomer. In one embodiment, the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 8 of SEQ ID NO: 1 or 16, or a position corresponding thereto. Suitably, the mutation may be an amino acid substitution at residue 8 of SEQ ID NO: 1 or 16, or a position corresponding thereto. Suitably, the amino acid substitution may be from a glutamate to a lysine at residue 8 (E8K) of SEQ ID NO:1 or 16, or a position corresponding thereto. In another embodiment the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 64 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the mutation may be an amino acid substitution at residue 64 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the amino acid substitution may be from a glutamate to a lysine at residue 64 (E64K) of SEQ ID NO: 1 or 16 or a position corresponding thereto. In another embodiment, the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid comprise an amino acid substitution from a glutamate to a lysine at residue 8 (E8K) and an amino acid substitution from a glutamate to a lysine at residue 64 (E64K) of SEQ ID NO:1 or 16, or positions corresponding thereto. Suitably, therefore the first or second monomer may be encoded by an amino acid sequence comprising a sequence set out in SEQ ID NO: 2 or SEQ ID NO: 3, or a sequence having at least 60 %, at least 70%, at least 80%, at least 90%, at least 95 %, at least 96%, at least 97%, at least 98%, at least 99% sequence identity thereto and comprising the mutations defined above. In one embodiment, the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 56 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the mutation may be an amino acid substitution at residue 56 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the amino acid substitution may be from an arginine to an aspartate at residue 56 (R56D) of SEQ ID NO:1 or 16 or a position corresponding thereto.
In another embodiment the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid mutation at residue 96 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the mutation may be an amino acid substitution at residue 96 of SEQ ID NO: 1 or 16 or a position corresponding thereto. Suitably, the amino acid substitution may be from a lysine to an aspartate at residue 96(K96D) of SEQ ID NO:1 or 16 or a position corresponding thereto. In another embodiment, the first amino acid sequence encoding the first monomer and/or the second amino acid sequence encoding the second monomer comprises an amino acid comprise an amino acid substitution from an arginine to an aspartate at residue 56 (R56D) and an amino acid substitution from a lysine to an aspartate at residue 96 (K96D) of SEQ ID NO:1 or 16 or positions corresponding thereto. Suitably, the first amino acid sequence encoding the first monomer or the second amino acid sequence encoding the second monomer comprises amino acid substitutions R56D and K96D in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and further comprises a binding molecule attached to the monomer. Suitably, therefore the first or second monomer may be encoded by an amino acid sequence comprising a sequence set out in SEQ ID NO: 4 or SEQ ID NO: 5 or a sequence having at least at least 60 %, at least 70%, at least 80%, at least 90%, at least 95 %, at least 96%, at least 97%, at least 98%, at least 99% sequence identity thereto and comprising the mutations defined above. In one embodiment, a capsid protein heterodimer comprises a first monomer comprising amino acid substitutions E8K and E64K in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and a second monomer comprising amino acid substitutions corresponding to R56D and K96D in SEQ ID NO:1 or 16, or substitutions corresponding thereto, and further comprising a binding molecule attached to the second monomer. In one embodiment, the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 2. In one embodiment, the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 4. In one embodiment, the capsid protein heterodimer comprises a first monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 2 and a second monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 4.
In one embodiment, the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 3. In one embodiment, the capsid protein heterodimer comprises a monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 5. In one embodiment, the capsid protein heterodimer comprises a first monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 3 and a second monomer encoded by an amino acid sequence comprising the amino acid sequence set out in SEQ ID NO: 5. In one embodiment, the capsid protein heterodimer comprises a first or second monomer encoded by an amino acid sequence comprising one or more mutations selected from the list consisting of: E64K, K67E, R56D, H88W, K96D, W71H, W71D, E8K, E8R, K96E, S78E, S78R, D64K, R82D, E67K, H88K, and E97K, and/or optionally a deletion at S78 of SEQ ID NO:1 or 16, or at a position corresponding thereto, or combinations thereof. Suitably E97K may be replaced with E79K. Suitably the one or more mutations may be selected from the list consisting of: E8K; E64K; K67E; R82D;E97K; and H88K of SEQ ID NO:1 or 16, or at a position corresponding thereto. Suitably the one or more mutations may be selected from the list: E8K and E64K of SEQ ID NO:1 or at a position corresponding thereto. Suitably the one or more mutations may be selected from the list: E8K; E64K; K67E; R82D;E97K; and H88K of SEQ ID NO:16, or at a position corresponding thereto. Suitably E97K may be replaced with E79K. Suitably the mutations may be selected from one or more of the following groups: (i) E8K and E64K and K67E; (ii) E8K and R82D and E97K; (iii) E8K and H88K; (iv) E64K and K67E; and R82D and E97K; (v) E64K and K67E; and H88K; (vi) R82D and E97K; and H88K; (vii) E8K; E64K and K67E; and R82D and E97K; (viii) E8K; E64K and K67E; and H88K; (ix) E8K; R82D and E97K; and H88K; (x) E64K and K67E; R82D and E97K; and H88K; or (xi) E8K; E64K and K67E; R82D and E97K; and H88K;
Of SEQ ID NO:1 or 16, or at positions corresponding thereto. Suitably E97K may be replaced with E79K. Suitably such mutations are present in the first monomer. Suitably the one or more mutations may be selected from the list consisting of: R56D; K96D; W71D and a deletion at S78 of SEQ ID NO:1 or 16, or at a position corresponding thereto. Suitably the one or more mutations may be selected from the list: R56D and K96D of SEQ ID NO:1 or at a position corresponding thereto. Suitably the one or more mutations may be selected from the list: R56D; K96D; W71D; and a deletion of S78 of SEQ ID NO:16 or at a position corresponding thereto. Suitably the mutations may be selected from: (i) R56D and K96D; (ii) R56D and W71D; (iii) R56D and a deletion at S78; (iv) K96D and W71D; (v) K96D and a deletion at S78; (vi) W71D and a deletion at S78; (vii) R56D and K96D and W71D; (viii) R56D and K96D and a deletion at S78; (ix) R56D and W71D and a deletion at S78; (x) K96D and W71D and a deletion at S78; (xi) R56D and K96D and W71D and a deletion at S78; Of SEQ ID NO:1 or 16, or at positions corresponding thereto. Suitably such mutations are present in the second monomer. In one embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at E8K of SEQ ID NO:1 or 16, or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a substitution at R56D of SEQ ID NO:1 or 16 or at a position corresponding thereto, the second monomer having a binding molecule attached. In one embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at E64K of SEQ ID NO:1 or 16, or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a substitution at K96D of SEQ ID NO:1 or 16, or at a position corresponding thereto, the second monomer having a binding molecule attached.
In one embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution at H88K of SEQ ID NO:16 or at a position corresponding thereto and a second monomer encoded by an amino acid sequence comprising a substitution at W71D of SEQ ID NO:16 or at a position corresponding thereto, the second monomer having a binding molecule attached. In one embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution corresponding to K67E of SEQ ID NO:16 or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a wild type sequence (corresponding to SEQ ID NO:16), the second monomer having a binding molecule attached. In one embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising a substitution corresponding to R82D and E97K of SEQ ID NO:16 or at a position corresponding thereto, and a second monomer encoded by an amino acid sequence comprising a wild type sequence (corresponding to SEQ ID NO:16), the second monomer having a binding molecule attached. Suitably E97K may be replaced with E79K. Suitably, the substitutions corresponding to R82D and E97K result in an Alpha-helix charge inversion. In one embodiment, the heterodimer comprises a first monomer encoded by a wild type amino acid sequence (corresponding to SEQ ID NO:1 or 16) and a second monomer encoded by an amino acid sequence comprising a deletion of a serine residue at a position corresponding to residue 78 of SEQ ID NO:16 or at a position corresponding thereto, the second monomer having a binding molecule attached. In a suitable embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K; E64K; K67E; R82D; E97K; and H88K of SEQ ID NO:1 or 16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D; K96D; W71D; and a deletion at S78 of SEQ ID NO:1 or 16, or at positions corresponding thereto. Suitably, the second monomer having a binding molecule attached thereto. Suitably E97K may be replaced with E79K. In a suitable embodiment, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K and E64K of SEQ ID NO:1 or 16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D and K96D of SEQ ID NO:1 or 16, or at positions corresponding thereto. Suitably, the second monomer
having a binding molecule attached thereto. Suitably, in such cases, the heterodimer may be a hepatitis B heterodimer or a woodchuck hepatitis heterodimer. Suitably, when the heterodimeric viral capsid protein is from a woodchuck hepatitis virus, one or more further mutations may be present in the first and/or second monomers. Suitably, the first monomer further comprises one or more further mutations selected from: K67E; R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto. Suitably E97K may be replaced with E79K. Preferably the first monomer comprises the further mutation K67E, and optionally the further mutations R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto. Suitably E97K may be replaced with E79K. Suitably the second monomer further comprises one or more further mutations selected from: W71D and a deletion of residue S78 of SEQ ID NO:16, or at positions corresponding thereto. In a suitable embodiment, therefore, the heterodimer comprises a first monomer encoded by an amino acid sequence comprising one or more mutations selected from: E8K; E64K; K67E; R82D; E97K; and H88K of SEQ ID NO:16, or at positions corresponding thereto; and a second monomer encoded by an amino acid sequence comprising one or more mutations selected from: R56D; K96D; W71D; and a deletion of S78 of SEQ ID NO:16, or at positions corresponding thereto. Suitably E97K may be replaced with E79K. Suitably, the second monomer having a binding molecule attached thereto. Suitably in such cases, the heterodimer is a woodchuck hepatitis heterodimer. In one embodiment, the heterodimer is a hepatitis B capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:1 and comprising mutations E8K and E64K; and a second monomer according to SEQ ID NO:1 and comprising mutations R56D and K96D. Suitably, the second monomer having a binding molecule attached thereto. In one embodiment, the heterodimer is a woodchuck hepatitis capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:16 and comprising mutations E8K; E64K; and K67E, and a second monomer according to SEQ ID NO:16 and comprising mutations R56D; and K96D. Suitably, the second monomer having a binding molecule attached thereto. In another embodiment, the heterodimer is a woodchuck hepatitis capsid heterodimer and comprises a first monomer encoded by an amino acid sequence according to SEQ ID NO:16 and comprising mutations E8K; E64K; K67E; R82D; E97K; and H88K, and a second monomer according to SEQ ID NO:16 and comprising mutations R56D; K96D; W71D and a deletion of S78. Suitably E97K may be replaced with E79K. Suitably, the second monomer having a binding molecule attached thereto.
Suitably, the heterodimer may comprise any combination of mutations described above. Suitably, a mutation to the first and/or second amino acid sequence may modify the electrostatic interactions between the first monomer an the second monomer. Suitably, the electrostatic interactions between the first monomer and the second monomer may be modified to increase repulsions and/or attractions between the monomers. Suitably, a mutation of the first and/or second amino acid sequence may modify electrostatic interactions to increase repulsion between the first and the second amino acid sequences. Suitably, a mutation of the first and/or second amino acid sequence may modify electrostatic interactions to increase attraction between the first and the second amino acid sequences. Suitably, the mutations described on any one of the embodiments of the present invention modify the electrostatic interactions between the first and the second monomers to promote association thereof. Suitably, this association results in the formation of heterodimers. Suitably, the heterodimer comprises a single binding molecule attached exclusively to either the first or the second monomer. Suitably, the binding molecule is a binding protein. Suitably, any of the embodiments described herein may comprise further mutations or modifications to the amino acid sequences encoding the first and/or the second monomers. Binding molecule The present invention is based on the use of VLPs to present a functional molecule on its surface, for example to the immune system. Suitably, the invention relates to VLPs which make use of a binding molecule which can attach a functional molecule, typically an antigen, to the viral capsid proteins forming the VLP. In accordance with the invention, there is provided an a first capsid protein monomer encoded by a first amino acid sequence and a second capsid protein monomer encoded by a second amino acid sequence. Suitably, the first or second monomer comprises a binding molecule attached thereto. Suitably, each capsid protein heterodimer comprises a binding molecule attached exclusively to either the first or the second monomer. Suitably, each capsid protein heterodimer is able to attach a functional molecule via the binding molecule. Suitably therefore the binding molecule is operable to bind to a functional molecule.
Suitably, the binding molecule may be any molecule that can be attached to the first or second monomer for the purposes of attaching to a functional molecule, such as an antigen. In one embodiment the binding molecule is a binding protein. Suitably the binding protein has low homology to proteins of the subjects which may be treated with the VLP. Suitably the binding protein has low homology to human proteins. Suitably the binding protein has low homology with the tertiary structure of any human proteins. Advantageously, low homology with human proteins means that the binding protein itself is less likely to stimulate an off-target immune reaction. Suitably the binding protein does not contain any disulphide bonds. Suitably the binding protein is not glycosylated. Suitably the binding protein is relatively small in size. Suitably the binding protein comprises a relatively short sequence length. Suitably the binding protein comprises a length of between 84 – 134 amino acids. Suitably the binding protein comprises a length of less than 135 amino acids. Advantageously, the lack of disulphide bonds, lack of glycosylation, and small size means that the binding protein is easier to produce in bacterial cells such as E.coli. Suitably the binding protein comprises a bacterial toxin or a bacterial toxin inhibitor or antitoxin. Suitably the binding protein of the VLP is a bacterial toxin inhibitor. Suitably, the binding protein is a bacterial toxin inhibitor called Im7. Suitably, the binding protein is a bacterial toxin inhibitor called Barstar. Suitably the binding protein is part of a pair of binding proteins. Suitably therefore the binding protein attached to the first or second monomer is a first binding protein operable to bind to a second binding protein. Suitably the second binding protein is attached to the functional molecule. Optionally via a linker, as explained hereinabove, which may be an alpha helix linker. Suitably comprising or consisting of a sequence according to ID NO:33. Suitably the first binding protein is a bacterial toxin or a bacterial toxin inhibitor or antitoxin. Suitably the first binding protein of the VLP is a bacterial toxin inhibitor which may be selected from Im7 or Barstar. Alternatively the first binding protein may be SpyCatcher. Alternatively, the first binding protein may be a nanobody.
Suitably the second binding protein is a bacterial toxin which may be selected from ColE7, or Barnase. Alternatively the second binding protein may be SpyTag. Alternatively the second binding protein may be a cognate target protein. In one embodiment, the first binding protein is Barstar and the second binding protein is Barnase. In an alternative embodiment, the first binding protein is Im7 and the second binding protein is ColE7. Preferably the first binding protein is Im7 and the second binding protein is ColE7. Advantageously, this pair of binding proteins allows purification of VLPs by anion chromatography. In an alternative embodiment, the first binding protein is a nanobody and the second binding protein is a cognate target protein. In a suitable embodiment, the first binding protein is attached exclusively to either the first or second monomer of the invention and the second binding protein is attached to a functional molecule. Suitably, the first binding protein binds to the second binding protein. Suitably, a binding molecule may comprise the SpyTag/SpyCatcher system. Suitably the binding protein may be the wild-type protein, or it may be modified. Suitably the binding protein may be modified to improve its function as a binding protein in the context of the VLP of the invention. Suitable modifications may include: insertions, deletions, substituents, truncations, reversals, repeats, or the like in the amino acid sequence encoding the protein. Suitably, any property of the bacterial toxin binding protein detrimental to either the host cell and / or the recipient organism intended for VLP administration is neutralized by targeted modifications. Suitably the or each binding protein may comprise one or more amino acid substitutions. Suitably the amino acid substitutions may increase the binding affinity between the or each binding protein and the functional molecule . Suitably the amino acid substitutions may remove undesirable disulphide bonds from a given binding protein. Suitably the or each binding protein may comprise one or more amino acid substitutions. In an embodiment where the first binding protein is Barstar, suitably the amino acid sequence of Barstar comprises one or more of the following substitutions: C40A, C82A, and I87E. Suitably the amino acid sequence of Barstar may comprise all of the following substitutions: C40A, C82A, and I87E. Suitably the amino acid sequence of Barstar comprises:
KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDALTGWVEYPLVLEWRQFEQSKQ LTENGAESVLQVFREAKAEGADITIELS (SEQ ID NO: 6) In an embodiment where the first binding protein is Im7, suitably the amino acid sequence of Im7 comprises the following substitution: F41L. Suitably the amino acid sequence of Im7 comprises: ELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRDDS PEGIVKEIKEWRAANGKPGFKQ (SEQ ID NO: 7). Suitably the second binding protein may also comprise one or more amino acid substitutions. Suitably the amino acid substitutions in the amino acid sequence of the second binding protein may increase the negative charge of the second binding protein. In an embodiment where the second binding protein is Barnase, suitably the amino acid sequence of Barnase comprises the following substitution: E73W. Suitably the amino acid sequence of Barnase comprises: AQVINTFDGVADYLQTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREG KLPGKSGRTWRWADINYTSGFRNSDRILYSSDWLIYKTTDHYQTFTKIR (SEQ ID NO: 12). In an embodiment where the second binding protein is ColE7, suitably the amino acid sequence of ColE7 comprises one or more of the following substitutions: Arg538Ala, Glu542Ala, and His569Ala. Suitably the amino acid sequence of ColE7 may comprise all of the following substitutions: Arg538Ala, Glu542Ala, and His569Ala. Suitably the amino acid sequence of ColE7 comprises: ESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEV SKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKATSFALHHEKPISQNGGVYDMDNISV VTPKRAIDIHRGKS (SEQ ID NO: 13). Suitably the first or second binding proteins may be truncated. Suitably the second binding protein is truncated. In an embodiment where the second binding protein is ColE7, suitably the whole or a part of the ColE7 protein may be used as the second binding protein. Suitably only a part of the ColE7 protein is used as the second binding protein. Suitably the ColE7 protein is truncated, suitably so that it only comprises the catalytic domain of ColE7. Suitably the second binding protein comprises the catalytic domain of ColE7. In an embodiment where the second binding protein is Barnase, suitably the whole or a part of the Barnase protein may be used as the second binding protein. Suitably only a part of the
Barnase protein is used as the second binding protein. Suitably the Barnase protein is truncated, suitably so that it only comprises the catalytic domain of Barnase. Suitably the second binding protein comprises the catalytic domain of Barnase. Suitably the binding molecule may comprise additional modifications. Suitably the binding molecule may comprise chemical modification. Suitably Im7 may comprise chemical modification. Suitably the chemical modification is capable of binding to a functional molecule. Suitably the chemical modification is capable of covalently binding to a functional molecule. In one example, the functional molecule bound to the chemical modification may be a fluorescent molecule. Other suitable functional molecules are described elsewhere herein. Suitably the chemical is attached to the first binding protein by non-covalent binding. Suitably the chemical is attached to the first binding protein by electrostatic and/or hydrophobic bonding. Suitably chemical modifications include alkanes having an amine group. Suitably the alkane may have any chain length. Suitably the alkane is a lower alkane. Suitably the alkane may have a chain length of between 1 and 10 carbons. Suitably the alkane may have a chain length of between 4 and 8 carbons. Suitably the alkane may be branched. Suitably, the length of the carbon chain and the length of branched substitutions on the amine group are chosen such as to allow either irreversible attachment to the protein or reversible attachment, dependent on the desired application. In one embodiment, the chemical is attached irreversibly to the first binding protein. Suitably, in such an embodiment, conferring irreversible binding, the alkane has eight carbon atoms and a terminal nitrogen (octylamine). In another embodiment, the chemical is attached reversibly to the first binding protein. Suitably, in such an embodiment, allowing reversible binding the alkane has 4 carbon atoms in a branched structure (diethylethanolamine). Suitably the first binding protein may be chemically modified at one or more sites, suitably at one or more amino acids. Suitably the first binding protein is chemically modified at one amino acid. In one embodiment, the first binding protein is chemically modified with DEAE. In one embodiment, the first binding protein is chemically modified with octylamine. Suitably, in such embodiments, the first binding protein may be Im7. Suitably, modification with DEAE allows the first binding protein to be purified. Suitably purification by chromatography.
Suitably modification with octylamine allows the first binding protein to directly bind to a functional molecule. In one embodiment, the chemical modification of the binding protein occurs within the host cell. Suitably by post-translational modification. In another embodiment, the chemical modification of the binding protein occurs outside of the host cell. Suitably by means of a chemical reaction. Suitably by means of a non-enzymatically catalyzed non-covalent attachment. Functional Molecule The present invention relates to VLPs which are able to display various functional molecules on their surface by virtue of a binding molecule which may be a binding protein attached exclusively to the first or second monomer of the heterodimeric capsid proteins. Suitably, each heterodimer has a single binding molecule attached. Suitably the binding molecule is a binding protein. Suitably the binding molecule is attached to at least one functional molecule. Suitably the binding molecule may be attached to more than one functional molecule. Suitably the functional molecule(s) may be of the same type or different types. For example, the binding molecule may be attached to any combination of one or more antigens, antigen binding proteins, or fluorescent molecules. Suitably the binding protein is attached to one functional molecule. Suitably the binding protein may be attached to a functional molecule by a chemical modification, or alternatively via a second binding protein wherein in such an embodiment the binding protein attached to the capsid protein is the first binding protein. In one embodiment, the binding protein may comprise a chemical modification. Suitably, a chemical modification may be attached to a functional molecule. Suitably, in such an embodiment, the functional molecule is attached to the binding protein via the chemical modification. Suitably in such embodiments, the functional molecule is a non- protein antigen or epitope thereof, or a fluorescent molecule. Suitably, wherein each viral capsid protein is attached to a binding protein, and each chemical modification is attached to a functional molecule. In another embodiment, the functional molecule may be attached to the second binding protein. Suitably, the second binding protein is Barnase. Suitably, the second binding protein is ColE7.
Suitably, the second binding protein may be attached to the first binding protein, the first binding protein being exclusively attached to either the first or the second monomer of the invention. Suitably the first binding protein is Barstar. Suitably, the first binding protein is Im7. In one embodiment of the present invention, the functional molecule is attached to Barnase. Suitably, Barnase may be bound to Barstar. In another embodiment, the functional molecule is attached to ColE7. Suitably, ColE7 may be bound to Im7. Suitably the binding proteins are directly or indirectly attached to the viral capsid protein and to the functional molecule. Suitably the binding proteins are directly attached to the viral capsid protein and in some cases directly attached to the functional molecule. Suitably, the binding protein may be fused to the first or second monomer of the viral capsid protein heterodimer as described hereinabove. Suitably, the functional molecule may be fused to the binding protein. In some embodiments, the binding protein may be indirectly attached to the functional molecule via a second binding protein attached to the first binding protein. Alternatively , the binding protein may be indirectly attached to the functional molecule. Suitably via chemical modification. Suitable functional molecules may include: protein or non-protein antigens; antigen binding proteins such as antibodies or binding fragments thereof, antibody mimetics, and aptamers; fluorescent molecules. Suitably the functional molecule may be modified, suitably by the introduction of one or more mutations to change its characteristics, properties or biological effect. Sutiably the functional molecule many comprise one or more substitution mutations for example. Suitably the functional molecule is an antigen binding molecule such as an antibody. Suitably the second binding protein is a generic antibody binding protein. Suitably the antibody binding protein is selected from protein G, protein A, protein AG, and streptavidin. Suitably, an antigen binding protein such as an antibody for use as a functional molecule is capable of binding an antigen of interest. Suitably the use of an antigen binding protein such
as an antibody as a functional molecule produces a VLP which is capable of binding to an antigen. Suitably this is useful for detecting an antigen, or for targeting the VLP to an antigen. Suitable antigens may include the whole or part of an antigen. Suitably the antigen may be a subunit or monomer of an antigen. Suitably the functional molecule may be an epitope of an antigen. Suitably the use of an antigen as a functional molecule produces a VLP which is capable of stimulating an immune response to the antigen. Suitably this is useful as a vaccine. Suitably the antigen may be a protein or non-protein antigen. Suitable non-protein antigens may include sugars, lipids or carbohydrates, or small molecule chemicals to which an immune response is desired, or who need to be detected, such as nicotine, cocaine, or other exogenous toxins. Suitably the antigen may be a self or non-self antigen relative to the subject intended to be treated with the VLP. Suitably the antigen may be a human or non-human antigen. Suitably the antigen may be derived from the causative agent in a disease or disorder. Suitably the causative agent may be self or non-self. Suitably a non-self causative agent may be an infectious agent. Suitably therefore the antigen may be derived from an infectious agent such as a virus, bacterium, fungus, protozoan, archaeon. Suitably the antigen may be derived from a virus selected from: Adeno-associated virus, Chikungunya virus, Crimean-Congo hemorrhagic fever virus, Dengue virus, Ebolavirus, Echovirus, Encephalomyocarditis virus, Epstein-Barr virus, Hantaan virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E virus, Human adenovirus, Human astrovirus, Human coronavirus, Human cytomegalovirus, Human enterovirus, Human herpesvirus, Human immunodeficiency virus, Human papillomavirus, Human parainfluenza, Human respiratory syncytial virus, Human rhinovirus, Human torovirus, Influenza A virus, Influenza B virus, Influenza C virus, Japanese encephalitis virus, Polyomavirus, Kunjin virus, Lassa virus, Measles virus, Molluscum contagiosum virus, Mumps virus, Nipah virus, Poliovirus, Rabies virus, Rotavirus A, Rotavirus B, Rotavirus C, Rubella virus, Sagiyama virus, Salivirus A, Sapporo virus, Sindbis virus, Toscana virus, Uukuniemi virus, Varicella-zoster virus, Venezuelan equine encephalitis virus, Vesicular stomatitis virus, Western equine encephalitis virus, West Nile virus, Yellow fever virus, Zika virus. Suitably the antigen may be derived from a bacterium selected from: Actinomyces israelii, Bacillus anthracis, Bacillus cereus, Bartonella henselae, Bartonella quintana, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Borrelia garinii, Borrelia afzelii, Borrelia recurrentis, Brucella abortus, Brucella canis, Brucella melitensis, Brucella suis,
Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Chlamydophila psittaci, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium tetani, Corynebacterium diphtheriae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Francisella tularensis, Haemophilus influenzae , Helicobacter pylori, Klebsiella pneumoniae, Legionella pneumophila, Leptospira interrogans, Leptospira santarosai, Leptospira weilii, Leptospira noguchii, Listeria monocytogenes , Mycobacterium leprae, Mycobacterium tuberculosis, Mycobacterium ulcerans, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pseudomonas aeruginosa, Rickettsia rickettsia, Salmonella enterica subsp. enterica , Salmonella typhi, Shigella sonnei, Shigella dysenteriae, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus saprophyticus, Streptococcus agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus viridans, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae, Yersinia pestis, Yersinia enterocolitica, Yersinia pseudotuberculosis . In one embodiment the antigen is derived from a coronavirus, suitably from SARS-CoV-2. Suitably the antigen is the whole or part of a spike protein derived from SARS-CoV-2, or the whole or part of a nucleocapsid protein derived from SARS-CoV-2. In one embodiment, therefore, the functional molecule is part of a spike protein derived from SARS-CoV-2. Suitably the receptor binding domain. In another embodiment, therefore, the functional molecule is part of a nucleocapsid protein derived from SARS-CoV-2. Suitably the C-terminus. Suitably a self-causative agent may be a non-infectious agent. Suitably therefore the antigen may be derived from a non-infectious agent such as an inflammatory molecule, or a molecule causing degenerative changes in nervous (such as beta-amyloid), cartilage or bone tissue, or a molecule causing worsening of a neoplastic disease. Suitably the antigen may be an inflammatory molecule or a molecule causing degenerative changes or a molecule conducive to a neoplastic disease which is a causative agent in a disease or disorder. Suitably the molecule may operate in humans or in non-human mammals. Suitably the molecule may cause a disease or disorder in a specific species. Suitable inflammatory molecules may include chemokines or cytokines, or proteases. Suitable chemokines or cytokines may include: interleukins, tumour necrosis factors, interferons, and colony stimulating factors. Suitable chemokines or cytokines may include: IL1, IL2, Il3, Il4, IL5, Il6, Il7, IL8, IL9, IL10, IL11, IL12, IL13, IL17, IL33, TNFα, TNFβ, IFNα, IFNβ, IFNγ, G-CSF, GM-CSF, M-CSF, erythropoietin, and TGFβ. Suitable proteases may include ADAMTS4,
ADAMTS5. Suitably the antigen is an interleukin or a protease. Suitably the antigen is IL13, IL17 or IL33 or a fragment thereof. In one embodiment, therefore, the functional molecule is IL13, IL17 or IL33. In one embodiment the IL13 is modified. In one embodiment the IL13 may comprise one or more substitution mutations. Suitably the or each modification may reduce, limit, or change the effect of IL3, suitably it may reduce the transactivation of receptors. Suitable molecules which case degenerative changes in nervous tissue or worsening of neoplastic diseases may include: ADAMTS4/5, angiogenesis factors, or factors allowing escape of tumours such as galectin proteins. References to any antigens herein may equally refer to an epitope of said antigen. An antigen of interest may be any of those listed above. For example, an antigen of interest may be from a disease causing agent such as a virus, bacterium, fungus, protozoan, or archaeon. Alternatively, an antigen of interest may be from a non-infectious agent, for example, a cell surface receptor. Suitably the antibody may be capable of binding to an antigen from a virus, bacterium, fungus, protozoan, archaeon as listed above. Suitable viruses may be selected from, for example: Adeno-associated virus, Chikungunya virus, Crimean-Congo hemorrhagic fever virus, Dengue virus, Ebolavirus, Echovirus, Encephalomyocarditis virus, Epstein-Barr virus, Hantaan virus, Hepatitis A virus, Hepatitis B virus, Hepatitis C virus, Hepatitis D virus, Hepatitis E virus, Human adenovirus, Human astrovirus, Human coronavirus, Human cytomegalovirus, Human enterovirus, Human herpesvirus, Human immunodeficiency virus, Human papillomavirus, Human parainfluenza, Human respiratory syncytial virus, Human rhinovirus, Human torovirus, Influenza A virus, Influenza B virus, Influenza C virus, Japanese encephalitis virus, Polyomavirus, Kunjin virus, Lassa virus, Measles virus, Molluscum contagiosum virus, Mumps virus, Nipah virus, Poliovirus, Rabies virus, Rotavirus A, Rotavirus B, Rotavirus C, Rubella virus, Sagiyama virus, Salivirus A, Sapporo virus, Sindbis virus, Toscana virus, Uukuniemi virus, Varicella-zoster virus, Venezuelan equine encephalitis virus, Vesicular stomatitis virus, Western equine encephalitis virus, West Nile virus, Yellow fever virus, Zika virus. In one embodiment, the functional molecule is an antibody capable of binding to an antigen from a coronavirus. In one embodiment, the antibody is capable of binding to an antigen from SARS-CoV-2. Suitable bacteria may be selected from: Actinomyces israelii, Bacillus anthracis, Bacillus cereus, Bartonella henselae, Bartonella quintana, Bacteroides fragilis, Bordetella pertussis, Borrelia burgdorferi, Borrelia garinii, Borrelia afzelii, Borrelia recurrentis, Brucella abortus,
Brucella canis, Brucella melitensis, Brucella suis, Campylobacter jejuni, Chlamydia pneumoniae, Chlamydia trachomatis, Chlamydophila psittaci, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium tetani, Corynebacterium diphtheriae, Enterococcus faecalis, Enterococcus faecium, Escherichia coli, Francisella tularensis, Haemophilus influenzae , Helicobacter pylori, Klebsiella pneumoniae, Legionella pneumophila, Leptospira interrogans, Leptospira santarosai, Leptospira weilii, Leptospira noguchii, Listeria monocytogenes , Mycobacterium leprae, Mycobacterium tuberculosis, Mycobacterium ulcerans, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Nocardia asteroides, Pseudomonas aeruginosa, Rickettsia rickettsia, Salmonella enterica subsp. enterica , Salmonella typhi, Shigella sonnei, Shigella dysenteriae, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus saprophyticus, Streptococcus agalactiae, Streptococcus pneumoniae, Streptococcus pyogenes, Streptococcus viridans, Treponema pallidum, Ureaplasma urealyticum, Vibrio cholerae, Yersinia pestis, Yersinia enterocolitica, Yersinia pseudotuberculosis. Suitably in such an embodiment, the VLP may be targeted to a particular virus. Suitably targeted to bind to a particular virus. Suitably the VLP may therefore be used for detecting the presence of a virus. Further details on this use are provided elsewhere. Suitably the antigen binding protein such as an antibody may be capable of binding to an antigen from a cell surface receptor. Suitably the cell surface receptor may be an ion-channel linked receptor, a G-protein coupled receptor, or an enzyme-linked receptor. Suitably the cell surface receptor is selected from: 5-HT receptor, nAch-receptor, Zinc-activated ion channel, GABAA receptor, Wnt-family member receptors, co-receptors contained in lipid rafts, T-cell and T-cell co-receptors, B-cell receptors and B-cell costimulatory molecules, Glycine receptor, AMPA receptor, Kainate receptor, NMDA receptor, Glutamate receptor, ATP-gated channel, PIP2 gated channel, Erb receptor, GDNF receptor, NP receptor, trk receptor, toll-like receptor, GABAB receptor, GBPCR class A, B, C, D, E, or F. Suitably in such an embodiment, the VLP may be targeted to a particular cell. Suitably targeted to bind to a particular cell. Suitably the VLP may be used to deliver cargo to a cell. Further details on this use are provided elsewhere. Suitable antibodies may include IgG, IgM, IgE, IgA, IgD antibodies. Suitably, the antibody is an IgG antibody. Suitably IgG subclasses include IgG1, IgG2, IgG3 and IgG4. Suitable further antigen binding proteins may include antibody binding fragments or antibody mimetics which perform the same function as an antibody. Suitably they are also capable of binding an antigen of interest. Suitably the use of an antibody binding fragment or mimetic as a functional molecule also produces a VLP which is capable of binding to an antigen. Suitably
this is useful for detecting an antigen, or for targeting the VLP to an antigen as described above. Suitable antibody binding fragments may include: Fab, monospecific or bispecific F(ab)2, F(ab’)2, monospecific or bispecific diabody, nanobody, ScFv, ScFv-Fc, F(ab)3. Suitable antibody mimetics may include affibodies, affilins, affimers, affitins, alphabodies, anticalins, avimers, DARPins, fynomers, Kunitz domain peptides, monobodies, nanCLAMPs. Suitably the use of a fluorescent molecule as a functional molecule produces a VLP which is visible. Suitably this is useful for labelling, especially when combined with a second functional molecule which can bind to an antigen, for example antibodies or binding fragments thereof, antibody mimetics, or aptamers. Suitable fluorescent molecules may include: GFP, EBFP, EBFP2, Azurite, GFPuv, T-saphhire, Cerulean, CFP, mCFP, mTurquoise2, CyPet, mKeima-red, tagCFP, AmCyan1, mTFP1, midoriishi cyan, turboGFP, tagGFP, emerald, azami green, ZsGreen1, YFP, tagYFP, EYFP, topaz, venus, mCtrine, YPet, turboYFP, ZsYellow1, Kusabira Orange, mOrange, allophycocyanin, mkO, RFP, turboRFP, tdTomato, tagRFP, dsRed, mStrawberry, turboFP602, asRed2, J-red, R-phycoerythrin, B-phycoerythrin, mCherry, HcRed, Katusha, P3, peridin chlorophyll, mKate, turboFP635, mPlum, mRaspberry. Suitably the fluorescent molecule is GFP or any modified form of GFP. In one embodiment of the invention, the or each functional molecule is IL13, IL17, IL33, the receptor binding domain of SARS Cov-2 spike protein, or the C-terminus of the SARS Cov-2 nucleocapsid protein. In another embodiment of the invention, the or each functional molecule is an IgG antibody or binding fragment thereof. In one embodiment, the antibody or binding fragment thereof is an antibody or binding fragment thereof directed towards SARS-CoV-2. Suitably the functional molecule is an epitope. Suitably an epitope selected from IL-13, IL-33, IL-31, IL-17, or SARS-Cov2 spike protein receptor binding domain. In one embodiment, the functional molecule is IL-31, suitably according to SEQ ID NO:17. Suitably the functional molecules may comprise one or more epitopes and/or a fluorescent molecule. Suitably the functional molecules may comprise two epitopes. Suitably the functional molecules may comprise an epitope and a fluorescent molecule. Virus-Like Particle (VLP) The present invention relates to VLPs, their uses and methods of manufacture thereof.
Suitably the VLP comprises one or more viral capsid protein heterodimers which suitably form a VLP. Suitably the one or more viral capsid proteins self-assemble into the VLP. Suitably the VLP comprises one or more binding molecules, each attached to a viral capsid protein heterodimer. Suitably the VLP comprises one or more functional molecules which are suitably each attached to the binding molecule, and/or chemical modifications present on the binding molecule of the capsid protein heterodimer. Suitably, in such a way, the VLP of the invention stably displays the functional molecules on its surface. Suitably the VLP may comprise a plurality of subunits. Suitably each subunit comprises a complete viral capsid protein heterodimer, one binding molecule and one functional molecule. Suitably the subunits self-assemble into a VLP. Suitably therefore the VLP comprises a plurality of viral capsid protein heterodimers, a plurality of binding molecules (each attached to a capsid protein heterodimer and a plurality of functional molecules, each attached to a binding molecule . Suitably each viral capsid heterodimer comprise a first monomer and a second monomer. The first and/or the second monomer comprising at least one mutation in its amino acid sequences as compared to the wildtype amino acid sequence. Suitably, each capsid protein heterodimer comprises a single binding molecule. Suitably, the binding molecule may be attached to either the first or the second monomer. Suitably, the binding molecule is attached to at least one functional molecule. In an embodiment of the first aspect of the invention, the VLP comprises a plurality of capsid protein heterodimers, each heterodimer comprising a first and a second monomer, a binding molecule attached to either the first or the second monomer, and a functional molecule. In such an embodiment, the amino acid sequence of the first and/or second monomer comprises at least one mutation as compared to the wild-type sequence. Suitably, the mutation in the amino acid sequence promotes association of the monomers to form heterodimers. It will be appreciated to those skilled in the art, that unmodified wild-type Hepatitis B viral capsid proteins favour the formation of homodimers. Suitably, the inventors have found that the introduction of specific mutations in the amino acid sequence of the monomeric subunits of the Hepatitis B viral capsid proteins promotes
association of the mutated monomers to form heterodimers. Suitably, the first or second monomer of the heterodimer comprises a binding molecule. Suitable mutations to the amino acid sequences of the monomeric subunits that promote heterodimerisation are discussed elsewhere in the specification. Suitably the VLP comprises a negative surface charge, suitably a homogenous negative surface charge. Suitably the binding molecule may be a binding protein or a pair of binding proteins as discussed hereinabove. Suitably, the VLP may have an average diameter of between 20nm to 100nm, suitably between 25nm and 75nm, suitably between 28nm and 50nm, suitably between 30nm and 40nm, suitably around 32nm. Suitably the size of the VLP may be determined by dynamic light scattering (DLS). For example using a Malvern Zetasizer Ultra. Nucleic Acids and Vectors The present invention relates to nucleic acids encoding component protein parts which form the VLP, and vectors comprising said nucleic acids which may be used in host cells to produce VLPs. Suitably the invention relates to, and makes use of, a first nucleic acid encoding a first viral capsid protein monomer and a second nucleic acid encoding a second viral capsid protein monomer. Suitably, the first or second nucleic acid may encode a capsid protein monomer attached to a binding molecule. Suitably the first or second nucleic acid may encode a fusion protein comprising the viral capsid protein monomer fused to a binding molecule. Suitably the viral capsid protein may be a hepatitis B capsid protein. Suitably the viral capsid protein may be a Woodchuck hepatitis capsid protein. Suitably, the binding molecule may be a binding protein, suitably a first binding protein. Suitably the first or second nucleic acids encoding a fusion protein comprising the viral capsid protein monomer fused to a binding molecule may be known as the ‘capsid fusion protein’. Suitably the first or second nucleic acid sequences may encode a viral capsid protein monomer and may suitably comprise a sequence according to SEQ ID NO: 20 or 23. Suitably the first or second nucleic acid sequences may encode a viral capsid protein monomer and may suitably comprise a sequence having at least 60%, at least 65%, at least
70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 20 or 23. Suitably the first or second nucleic acid sequences may encode a viral capsid protein monomer and may consist of a sequence according to SEQ ID NO: 20 or 23. Suitably any of these monomer sequences may further comprise a sequence encoding a binding molecule, suitably encoding the viral capsid protein monomer fused to a binding molecule, suitably to a first binding protein. Suitably the first and second nucleic acid sequences may encode a heterodimer comprising a binding molecule, suitably a first binding protein, and may suitably comprise a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 18, 19, 21, or 22. Suitably therefore a nucleic acid encoding a heterodimer of the invention may comprise a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 18, 19, 21, or 22. Suitably the invention relates to, and makes use of, a third nucleic acid encoding a functional molecule. Suitably, the third nucleic acid encoding a functional molecule may be optionally attached to a second binding protein as explained hereinabove. Suitably the third nucleic acid may encode a fusion protein comprising the functional molecule optionally fused to a second binding protein. Suitably this may be known as the ‘functional fusion protein’. In one embodiment, the third nucleic acid encodes only a functional molecule. In one embodiment, the third nucleic acid encodes a functional molecule attached to second binding protein. In one embodiment, the third nucleic acid encodes a functional molecule fused to a second binding protein. In one embodiment the third nucleic acid encodes a functional fusion protein. One example of a third nucleic acid sequence encoding a functional molecule is a sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO:17. Suitably the third nucleic acid may comprise a sequence encoding a functional molecule attached to a second binding protein, suitably having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% at least 96%, at least 97%, at least 98%, at least 99% identity with SEQ ID NO: 24-25.
Suitably the third nucleic acid encoding a functional molecule attached to a second binding protein may consist of a sequence according to SEQ ID NO: 24-25. In embodiments where the third nucleic acid encodes two epitopes, they may be any two epitopes fused to a second binding protein. In one embodiment, the third nucleic acid may encode for example a SARS-Cov2 spike protein receptor binding domain and a C-terminal fragment of the nucleocapsid protein. In embodiments where the third nucleic acid encodes an epitope and a fluorescent molecule fused to a second binding protein, they may be any epitope and any fluorescent molecule. In one embodiment, the third nucleic acid may encode for example a SARS-Cov2 spike protein receptor binding domain and eGFP. In some embodiments, the invention may make use of the first and second nucleic acids. In some embodiments, the invention may make use of the first, second and third nucleic acids. Suitably the first, second, and third nucleic acids described herein may be provided as one contiguous nucleic acid sequence, or may be provided as a plurality of separate nucleic acid sequences. References to the first, second, and third, nucleic acids include embodiments where plurality of nucleic acid sequences may be used to encode the same proteins as the first, second, third, nucleic acids. Suitably, the first and second nucleic acids may comprise a sequence according to SEQ ID NO: 18, 19, 21 or 22. Suitably the third nucleic acid may comprise a sequence according to SEQ ID NO: 24 or 25. Suitably the first and second nucleic acids may comprise a contiguous sequence according to SEQ ID NO:18 or 21, and the third nucleic acid may comprise a sequence according to SEQ ID NO: 24. Suitably in such an embodiment, the first binding protein is lm7 and the second binding protein is Col-E7. Suitably the first and second nucleic acids may comprise a contiguous sequence according to SEQ ID NO:19 or 22, and the third nucleic acid may comprise a sequence according to SEQ ID NO: 25. Suitably in such an embodiment, the first binding protein is Barstar and the second binding protein is Barnase. Suitably the nucleic acids may comprise one or more expression elements to aid in expression of the proteins encoded thereon. Suitable expression elements include promoters, operators, enhancers, activators, repressors, 5’UTRs, 3’UTRs, introns, IRES, etc.
Suitably each of the nucleic acids comprises one or more expression elements which ensure equal expression of the proteins encoded thereon. Suitably each of the nucleic acids comprises a promoter which ensures equal expression of the proteins encoded therein. Suitably the promoter may comprise one or more modifications which adapt the level of expression therefrom. Suitably the promoter may comprise one or more mutations. Suitably the or each nucleic acid described herein is operably linked to a promoter. Suitable promoters may be selected from: CMV-IE, EF1a, SV40, PGK1, CAG, human beta actin, T7, TetR/TetA, T7lac, SP6, LP1, TTR, CK8, Synapsin, Glial fibrillary acidic protein (GFAP), CaMKII, TBG, and albumin promoter. Suitably each nucleic acid may be linked to the same promoter or a different promoter. Suitably each nucleic acid may be linked to the same promoter. Suitably therefore each nucleic acid may be expressed at the same time. Suitably each nucleic acid may be linked to a T7 promoter, optionally with one or more modifications to ensure equal expression levels of the proteins encoded by the nucleic acids. Suitably each nucleic acid may be linked to a different promoter. Suitably therefore each nucleic acid may be expressed at different times. Suitably the or each nucleic acid may be independently expressed. Suitably expression of each nucleic acid may be induced at different times. Suitably therefore the or each promoter may be an inducible promoter. Suitably which may be induced by contacting the promoter with a suitable inducer, at a concentration effective to induce expression therefrom. In one embodiment, the first or second nucleic acid sequence may be linked to a first promoter and the first or second nucleic acid may be linked to a second promoter. Suitably the first or second promoter may be a T7 promoter. Suitably, the T7 promoter may comprise the sequence agcataat (SEQ ID NO:8). Suitably the first or second promoter may be a TetR/TetA promoter. Suitably therefore the first or second nucleic acid expresses the viral capsid protein monomers described herein at the same or equal levels. Suitably therefore the heterodimeric capsid protein is expressed at a 1:1 level compared to the functional fusion protein, or the functional molecule. Suitably the nucleic acids may be comprised on one or more vectors. Suitably the first, second and/or third nucleic acids may be comprised on one vector. Alternatively, first, second, and/or third nucleic acids may be comprised on multiple vectors. In one embodiment, the first nucleic acid may be comprised on one vector and the second nucleic acid may be comprised on another vector.
Suitably the nucleic acids may be comprised on one or more vectors as constructs. Suitably as expression constructs. Suitably an expression construct in accordance with the invention may comprise a first nucleic acid sequence encoding a first viral capsid protein monomer; and a second nucleic acid sequence encoding a second viral capsid protein monomer attached to a binding molecule. Suitably the expression construct comprises a first promoter operably linked to the first and second nucleic acids. Suitably the expression construct comprises an IRES or ribosome binding site located between the first nucleic acid and the second nucleic acid. Suitably such that the first and second nucleic acids are expressed as separate proteins, suitably as separate monomers. Alternatively, the expression construct comprises a sequence encoding an internal cleavage peptide, suitably a self-cleaving peptide located between the first nucleic acid and the second nucleic acid. Suitably such that once the first and second nucleic acids are expressed as a protein, the protein cleaves into separate monomers. In one embodiment, the first nucleic acid is comprised on a first vector, suitably for example the first vector may comprise a first nucleic acid sequence encoding a woodchuck hepatitis monomer, in such an embodiment, the vector may has a sequence according to SEQ ID NO: 29. In other examples, the first vector may comprise a first nucleic acid encoding a hepatitis B monomer, suitably comprising a sequence according to SEQ ID NO: 20. In one embodiment, the second nucleic acid is comprised on a second vector, suitably for example the second vector may comprise a second nucleic acid sequence encoding a woodchuck hepatitis monomer attached to a binding molecule, suitably a first binding protein, in such an embodiment, the vector has a sequence according to SEQ ID NO:28. .In other examples, the second vector may comprise a second nucleic acid encoding a hepatitis B monomer attached to a binding molecule, suitably a first binding protein. Alternatively, in one embodiment, the first and second nucleic acids may be comprised on the same vector. In one embodiment, the vector has a sequence according to SEQ ID NO: 26. Additionally, the third nucleic acid may be comprises on a second vector. In one embodiment the second vector may comprise a sequence according to SEQ ID NO: 24 or 25. In another embodiment, the first, second and the third nucleic acids may be comprised on the same vector, suitably therefore one vector may encode the VLP of the invention. In one embodiment, the vector may comprise a sequence encoding both woodchuck hepatitis monomers, i.e. a woodchuck hepatitis heterodimer, a first and second binding protein and a functional molecule, suitably the vector has a sequence according to SEQ ID NO: 27. In other
examples, the vector may comprise a sequence encoding a both hepatitis B monomers, i.e. a hepatitis B heterodimer, a first and second binding protein and a functional molecule, suitably comprising sequences according to SEQ ID NOs: 18 or 19, and 24 or 25. In one embodiment, the first nucleic acid and the third nucleic acid are comprised on the same vector.. For example, the vector may comprise a nucleic acid sequence according to SEQ ID NO: 20 or 23, and a nucleic acid sequence according to SEQ ID NO: 24 or 25. In one embodiment, the second nucleic acid and the third nucleic acid are comprised on the same vector. In one embodiment, a single vector comprises SEQ ID NO:18 and SEQ ID NO: 24. In one embodiment, a first vector comprises SEQ ID NO:18 and a second vector comprises SEQ ID NO: 24. In one embodiment a single vector copmrpises SEQ ID NO:19 and SEQ ID NO:25. In one embodiment, a first vector comprises SEQ ID NO:19 and a second vector comprises SEQ ID NO: 25. In one embodiment, a single vector comprises SEQ ID NO:21 and SEQ ID NO: 24. In one embodiment, a first vector comprises SEQ ID NO:21 and a second vector comprises SEQ ID NO: 24. In one embodiment, a single vector comprises SEQ ID NO:22 and SEQ ID NO: 25. In one embodiment, a first vector comprises SEQ ID NO:22 and a second vector comprises SEQ ID NO: 25. Suitably the one or more vectors may be comprised in one or more host cells. Suitably the one or more vectors may be comprised in a single host cell. Suitably, for example in the ninth aspect. Alternatively the one or more vectors may be comprised in a two host cells in any combination. Suitably for example in the tenth aspect. Suitably a host cell may comprise any of the above vectors in any combination. Suitably in the process of the ninth aspect, the first, second and/or third nucleic acids may be comprised on one vector or on a first and second vector, or on a first, second and third vector respectively as described above. In one embodiment of the process of the ninth aspect, the first and second nucleic acids are comprised on one vector. Suitably the or each vector is present in the single host cell. In one embodiment of the process of the ninth aspect, the first and second nucleic acids are comprised on a single vector of SEQ ID NO: 26. In one embodiment of the process of the ninth aspect, the first and third nucleic acids are comprised on a single vector. Suitably the single host cell comprises a single vector of SEQ ID NO:26 or
27. In one embodiment of the process of the ninth aspect, the first, second and/or third nucleic acids are comprised on two different vectors. Suitably the first nucleic acid may be comprised on a first vector selected from SEQ ID NO:29. Suitably the second nucleic acid may be comprised on a second vector selected from SEQ ID NO:28. Suitably any workable combination of first and second vectors may be used in the single host cell. For example, the first vector may comprise SEQ ID NO:28 and may be combined with the second vector of SEQ ID NO:29. Suitably in the process of the tenth aspect, the first and second nucleic acids may be comprised on a first vector, or may be comprised on a first and second vector respectively. Suitably the third nucleic acid may be comprised on a third vector. Suitably the first vector and optionally the second vector is present in the first host cell and the third vector is present in a second host cell. In one embodiment of the process of the tenth aspect, suitably the first vector is of SEQ ID NO:29, and the second vector is of SEQ ID NO: 28. Suitably any workable combination of first and second vectors may be used in the host cells. For example, the first host cell may comprise a first vector of SEQ ID NO:28 and may be combined with a second vector of SEQ ID NO: 29. For example, the first host cell may comprise a first vector of SEQ ID NO: 29 and may be combined with a second vector of any of SEQ ID NO: 28. Suitably, the one or more vectors may further comprise the third nucleic acid. Any suitable vector may be used for the chosen host cell/s. Suitable host cells are discussed below. Suitably the vector is selected from: a plasmid, a cosmid, a phage, a virus, an artificial chromosome. Suitably the or each vector is a plasmid. Suitable plasmid vectors for a host E.coli cell may include, for example: pALTER-Ex1, pALTER-Ex2, pBAD/His, pBAD/Myc-His, pBAD/gIII, pCal-n, pCal-n-EK, Cal-c, pCal-Kc, pcDNA 2.1, pDUAL, pET-3a-c, pET-9a-d, pET-11a-d, pET-12a-c, pET-14b, pET-15b, pET- 16b, pET-17b, pET-19b, pET-20b(+), pET-21a-d(+), pET-22b(+),pET-23a-d(+), pET-24a-d(+), pET-25b(+), pET-26b(+), pET-27b(+), pET-28a-c(+), pET-29a-c(+), pET-30a-c(+), pET- 31b(+), pET-32a-c(+), pET-33b(+), pET-34b(+) , pET-35b(+), pET-36b(+), pET-37b(+), pET- 38b(+), pET-39b(+), pET-40b(+), pET-41a-c(+), pET-42a-c(+), pET-43a-c(+), pETBlue-1, pETBlue-2, pETBlue-3, pGEMEX-1, pGEMEX-2, pGEX-1lT, pGEX-2T, pGEX-2TK, pGEX-3X, pGEX-4T, pGEX-5X, pGEX-6P, pHAT10/11/12, pHAT20, pHAT-GFPuv, pKK223-3, pLEX, pMAL-c2X, pMAL-c2E, pMAL-c2G, pMAL-p2X, pMAL-p2E, pMAL-p2G, pProEX HT, pPROLar.A, pPROTet.E, pQE-9, pQE-16, pQE-30/31/32, pQE-40, pQE-60, pQE-70, pQE- 80/81/82L, pQE-100, pRSET, pSE280, pSE380, pSE420, pThioHis, pTrc99A, pTrcHis, pTrcHis2, pTriEx-1, pTriEx-2, pTrxFus. In one embodiment, the vector used is pET-Duet.
Suitable plasmid vectors for a host mammalian cell may include: the pSV and the pCMV series of vectors. In one embodiment, the vector used is pcDNA5D. In one embodiment, host mammalian cells are HEK293 cells or CHO cells or derivatives thereof. Suitably if more than one vector is used, it is the same type of vector. Suitably the vector may comprise a variety of other functional nucleic acid sequences, such as one or more selectable markers, one or more origins of replication, multiple cloning sites and the like. Process of Producing a VLP The present invention further relates to processes for the production of VLPs. Two different processes are described herein, one is a single cell process, the other is a process which takes place in two cells and requires mixing of component parts to form the VLP. In accordance with the ninth aspect of the invention, there is provided a single cell process of producing a VLP. In accordance with the tenth aspect of the invention, there is provided a multiple cell process of producing a VLP. Suitably a dual cell process. Suitably the processes may further comprise transfecting the one or more vectors comprising the nucleic acids into the or each host cell. Suitably prior to culturing the or each host cell. Suitably transfection may take place by any suitable method such as electroporation, microinjection, particle delivery, chemical mediated endocytosis, calcium phosphate co- precipitation, or liposome mediated delivery. Suitably culturing the host cells under conditions to express the proteins comprises culturing the host cells under optimum growth conditions. Suitably the optimum growth conditions will vary depending on the host cell being used. Suitably the host cell may be selected from any bacterium, yeast, insect cell or human cell. Suitably the host cell is a bacterial host cell. Suitably the host cell is selected from E.coli, B.subtilis, Caulobacter crescentus, Rodhobacter sphaeroides, Pseudoalteromonas haloplanktis, Shewanella sp. strain Ac10, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas aeruginosa, Halomonas elongate, Chromohalobacter salexigens, Streptomyces lividans, Streptomyces griseus, Nocardia lactamdurans, Mycobacterium smegmatis, Corynebacterium glutamicum, Corynebacterium ammoniagenes, Brevibacterium lactofermentum, Bacillus brevis, Bacillus megaterium, Bacillus licheniformis, Bacillus
amyloliquefacien, Lactococcus lactis, Lactobacillus plantarum, Lactobacillus casei, Lactobacillus reuteri, Lactobacillus gasseri. In one embodiment, the host cell is E.coli. Suitably the E.coli strain is selected from BL21, lemo21, NiCo21, NEB Express, SHuffle, T7 Express, BLR, HMS174, Tuner, Origami2, Rosetta2, m15. In one embodiment, the E.coli strain is BL21(DE3) where the additional genes regulating disulfide formation, dsbC and erv1P, are integrated genomically. Suitably, the genomic integration is within the recAX locus. In an alternative embodiment, the host cell is a human cell, such as a HEK293T cell. Suitably optimum growth conditions comprise culturing at a temperature of 15-25°C. Suitably optimum growth conditions comprise culturing in a medium compatible with bioprocess applications for medicines intended for use in humans, such as chemically defined medium. Suitably optimum growth conditions comprise culturing in an aerated culture medium. Suitably the host cells are cultured to a high density. Suitably to a density OD600 of 4-20. Suitably culturing the host cells under conditions to express the proteins may also comprise inducing the host cells to express the proteins. Suitably inducing the host cells may comprise addition of an inducer into the culture medium, or the creation of certain inducive conditions within the culture medium such as acid/alkali pH, heat shock, hypoxia or the like. Suitably the inducer or inducive condition stimulates transcription of the nucleic acids. Suitably an inducer or inducive condition does so by stimulating an inducible expression control sequence within the nucleic acids. Suitably the inducible expression control sequence may be an inducible promoter. Suitable inducers include isopropyl-β-d-thiogalactoside (IPTG) for lactose driven promoters or tetracycline for tetracycline – regulated promoters. Suitably the host cells are induced to express the proteins once the culture has reached the optimal density described above. Suitably the host cells are induced to express the proteins during logarithmic growth. Suitably the concentration of proteins may be varied by adjusting the concentration of an inducer or altering the inducive conditions to which the host cells are exposed. Suitably the culturing step takes between 4 – 24 hours. Suitably the host cells are induced to express the proteins after 2-6h of culturing or when an OD of 6-8 has been achieved.
In a further aspect of the invention, there is provided a cell culture comprising one or more host cells of the ninth or tenth aspects and a culture medium. Suitably a plurality of said cells. Alternatively, the process may not be conducted within one or more cells, and may be conducted in a cell-free system. Suitably in the process of the ninth or tenth aspect, step (a) and/or (b) and/or (c) is conducted within a host cell, to ensure proper production of the VLP. However, suitably steps relating to mixing and forming functionalised viral capsid protein heterodimers; may occur outside of a host cell, in a cell free system. Suitably the processes may further comprise a step of recovering the VLPs. Suitably recovering the VLPs from the host cells. Suitably after the VLPs have been formed. Suitably recovering the VLPs may comprise disrupting the host cells. Alternatively, the host cells may secrete the VLPs into the culture solution. Suitably disrupting the host cells may be carried out by any suitable method such as homogenisation, sonication, or freeze-thaw. Recovery of the VLPs may take place by any suitable method such as filtration, pull-down, centrifugation, or chromatography. Suitably, in an embodiment where the binding molecule comprises a chemical modification, suitably the recovery and purification of VLPs takes place by chromatography. Suitably involving a sequence of steps including mixed mode (hydrophobic interaction and size exclusion) chromatography, anion exchange chromatography, and ultrafiltration. Suitably by anion exchange chromatography. Suitably when anion exchange chromatography is used to recover the VLPs, the VLP may comprise chemical modification, suitably in such an embodiment the first binding protein of the VLP is modified with DEAE. Suitably the DEAE molecules can bind to the chromatography column. Suitably, the recovery and purification of VLPs takes place by affinity chromatography. For example, immobilized metal affinity chromatography (IMAC). Suitably, when affinity chromatography is used to recover the VLPs, the VLP may comprise an affinity tag which is capable of binding to affinity agents used for the affinity chromatography. For example, when immobilized metal affinity chromatography (IMAC) is used the VLP may comprise a metal binding affinity tag, suitably in such an embodiment the first binding protein of the VLP, second binding protein of the VLP and/or a functional protein bound the first and/or second binding protein may comprise an affinity tag. Suitably, the functional protein comprises the affinity tag. Suitably the functional protein is fused to the affinity tag. In such examples, binding of the functional protein to the first and/or second binding proteins can be confirmed by affinity chromatography as the first and/or second binding proteins will be recovered by virtue of binding to the functional protein. “Affinity tag", or “affinity ligand” refers to a short amino acid sequence or peptide enabling a specific interaction with a protein or a ligand, for example in the case of IMAC a metal ion. Examples
of affinity tags include biotin, desthiobiotin, histidine, polyhistidine, myc, hemagglutinin (HA), FLAG, fluorescence tag, tandem affinity purification (TAP) tags, FLAG, glutathione S transferase (GST) or derivatives thereof. A suitably affinity tag may comprise or consist of a sequence according to SEQ ID NO: 37. Suitably, in the process of the tenth aspect, step (d) comprises recovering the proteins. Suitably recovering the proteins from the host cells. Suitably recovering the proteins may be performed by similar techniques. Suitably recovering the proteins may comprise disrupting the host cells as above. Alternatively, the host cells may secrete the proteins into the culture solution. Suitably the VLPs form by self-assembly, suitably automatic self-assembly. Suitably once the component proteins are mixed, either within a single host cell as per the ninth aspect or outside of a cell as per the tenth aspect, they will assemble to form VLPs. In respect of the single cell process of the ninth aspect, suitably the step of culturing the host cell further comprises culturing under conditions such that the proteins expressed from the first and second nucleic acids, or from any further nucleic acids, bind to each other. In some embodiments, after the culturing step the binding molecule may be chemically modified. Suitably therefore the method may comprise a step of recovering the proteins, and subsequently chemically modifying the binding molecule. Suitably these steps take place after step (b) but prior to step (c). In some embodiments, the host cell may be cultured under conditions so as to express proteins from the first, second, and third nucleic acids. In one embodiment, the third nucleic acid encodes only a functional molecule. Suitably, in such an embodiment, the binding molecule is chemically modified. In one embodiment, step (c) of the ninth aspect comprises a binding molecule binding to a functional molecule, suitably via a chemical modification. In respect of the two cell process of the tenth aspect, suitably during the culturing step the binding molecule may be chemically modified. Suitably therefore the conditions for culturing the second host cell are such that the binding protein is chemically modified. Suitably such chemical modification of the binding protein may take place post-translationally. Alternatively, the method may comprise a step of chemically modifying the binding protein. Suitably this step takes place after step (d) but prior to step (e).
In some embodiments, the host cells may be cultured under conditions so as to express proteins from the first, second, and third nucleic acids. In one embodiment, step (e) comprises each binding protein binding to each functional molecule. Suitably via a chemical modification. In one embodiment, step (e) further comprises mixing under conditions such that the proteins bind to each other. Suitably step (e) comprises mixing host cell supernatants or host cell lysates. Suitably mixing the first host cell supernatant or lysate with the second host cell supernatant or lysate. Suitably the mixing is such that the ratio of first host cell supernatant or lysate to further host cell(s) supernatant or lysate is about 1:1. Suitably the mixing step takes place at room temperature, suitably around 18-22°C. Suitably mixing takes place for between 15 minutes to 2 hours, suitably between 20 minutes and 1 hour, suitably between 25 minutes and 45 minutes, suitably for about 30 minutes. Suitably a functional molecule may be mixed with the VLPs once formed. Immunogenic Composition The present invention further relates to an immunogenic composition comprising the VLP of the invention. Suitably the immunogenic composition may be a vaccine. Suitably the immunogenic composition may further comprise one or more adjuvants. Suitable adjuvants include: mineral salts, emulsions, microorganism derived adjuvants, carbohydrates, cytokines, particulates or tensoactive compounds. Suitable mineral salts include: adjumer, alhydrogel, aluminium hydroxide, aluminum phosphate, aluminium potassium sulphate, amorphous aluminium hydroxyphosphate sulfate (AAHSA), aluminium salts in general, calcium phosphate, Rehydragel HPA, or Rehydragel LV. Suitable emulsions include: Freund’s complete, Freund’s incomplete, montanide ISA720, montanide ISA 51, montanide incomplete, Ribi, TiterMax, AF03, AS03, MF59, specol, SPT, or squalene. Suitable microorganism derived include: cholera toxin or mutants thereof, cholera toxin subunit B, CpG DNA, LTR 192G, MPL, Bordella pertussis components, E.coli heat labile toxin, CTA1-DD gene fusion protein, Etx B subunit, lipopolysaccharides, flagellin, Corynebacterium derived P40, LTK72, MPL-SE, or Ty particles.
Suitably the immunogenic composition may further comprise one or more pharmaceutically acceptable excipients. Pharmaceutically acceptable excipients may include stabilizers, fillers, preservatives, diluents, nutrients, antioxidants, antimicrobial agents, buffers, solvents, inactivating agents, purifiers, emulsifiers, surfactants and the like. Suitable excipients may be selected from, for example: monosodium glutamate, sucrose, D- mannose, D-fructose, dextrose, human serum albumin, potassium phosphate, plasdone C, anhydrous lactose, microcrystalline cellulose, polacrilin potassium, magnesium stearate, cellulose acetate phthalate, alcohol, acetone, castor oil, sodium chloride, benzethonium chloride, formaldehyde, ascorbic acid, hydrolyzed casein, sodium bicarbonate, sodium carbonate, glutaraldehyde, 2-phenoxyethanol, polysorbate 80 (Tween 80), neomycin, polymyxin B sulfate, bovine serum albumin, neomycin sulfate, polymyxin B, yeast protein, streptomycin sulfate, ammonium thiocyanate, rice protein, lactose, formalin, amino acid supplement, phosphate-buffered saline solution, disodium phosphate dihydrate, sodium dihydrogen phosphate dihydrate, yeast DNA, deoxycholate, phosphorothioate linked oligodeoxynucleotide, dibasic dodecahydrate, monobasic dehydrate, L-histidine, sodium borate, sodium taurodeoxycholate, ovalbumin, sorbitan trioleate, sodium citrate dehydrate, citric acid monohydrate, kanamycin, barium, hydrocortisone, egg proteins, cetyltrimethylammonium bromide (CTAB), octoxynol-10 (TRITON X-100), α-tocopheryl hydrogen succinate, gentamicin sulfate, monobasic sodium phosphate, dibasic sodium phosphate, cetyltrimethlyammonium bromide, and β-propiolactone, Thimerosal, α-tocopheryl hydrogen succinate, hydrolyzed porcine gelatin, arginine, dibasic potassium phosphate, monobasic potassium phosphate, protamine sulfate , sodium metabisulphite, Vero cell protein, CRM197 protein, vitamins, bovine calf serum, urea, succinate buffer, isotonic saline solution, phenol, M-199 medium, chicken protein, polygeline, chlortetracycline, dextran, Dulbecco’s Modified Eagle Medium, magnesium sulfate, ferric (III) nitrate, L-cystine, L-tyrosine, sorbitol, xanthan, water, EDTA, dioleoyl phosphatidylcholine (DOPC), 3-O-desacl4’monophosphoryl lipid A (MPL), QS-21, and cholesterol. In one embodiment, the excipients may be arginine, glutamine and trehalose. Suitably the immunogenic composition is formulated as a fluid, suitably as a liquid. Suitably the excipients and additives are selected such that the formulation is a liquid. Suitably an injectable liquid. Immunogenicity The term “Immunogenic" means that a VLP or an immunogenic composition comprising the VLP of the invention is capable of eliciting an immune response in a subject. Suitably a potent and preferably a protective immune response in a subject. Thus, the VLP or an immunogenic
composition comprising the VLP of the invention may be capable of generating an antibody response in a subject and/or a non-antibody based immune response in a subject. Suitably this may be referred to as its immunogenic activity. Suitably the immunogenic activity of the VLP or an immunogenic composition comprising the VLP of the invention may be determined by the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention i.e. antibody production. Suitably the amount of antibodies which bind to the antigen of the VLP. Suitably the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention, i.e. antibody production, is sustained and consistent over a period of time. Suitably the immunogenic activity of the VLP or an immunogenic composition comprising the VLP of the invention may be determined by the amount of antibodies present in a subject after administration of the VLP or an immunogenic composition comprising the VLP of the invention over a given period of time, i.e. antibody production over a given period of time. Suitable periods of time are outlined below. By amount of antibodies it is meant the titre or concentration thereof. Suitably the concentration of antibodies in sera. Suitably a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 5 days, at least 10 days, at least 15 days, at least 20 days, at least 25 days, at least 30 days, at least 35 days, at least 40 days, at least 45 days, at least 50 days, at least 55 days, at least 60 days, at least 65 days, at least 70 days, at least 75 days, at least 80 days, at least 85 days, at least 90 days, at least 95 days, or at least 100 days or more in a subject. Suitably a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 110 days, at least 120 days, at least 130 days, at least 140 days, at least 150 days, at least 160 days, at least 170 days, at least 180 days, at least 190 days, at least 200 days, at least 210 days, at least 220 days, at least 230 days, at least 240 days, at least 250 days, at least 260 days, at least 270 days, at least 280 days, at least 290 days, at least 300 days or more in subject. Suitably a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 1 week, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 5 weeks, at least 6 weeks, at least 7 weeks, at least 8 weeks, at least 9 weeks, at least 10 weeks, at least 12 weeks, at least 14 weeks, at least 16 weeks, at least 18 weeks, at least 20 weeks days or more in a subject. Suitably a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least at least 30 weeks, at least 40 weeks, at least 50 weeks, at least 60 weeks, at least 70 weeks, at least 80 weeks, at least 90 weeks, at least 100 weeks or more in a subject.
Suitably a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least for at least 1 year, at least 2 years at least 3 years, at least 4 years, at least 5 years, at least 6 years, at least 7 years, at least 8 years, at least 9 years or at least 10 years or more in a subject. Suitably, a VLP or an immunogenic composition comprising the VLP of the invention may sustain immunogenic activity for at least 10 years, for at least 15 years, for at least 20 years, for at least 25 years, for at least 30 years, for at least 35 years, for at least 40 years, for at least 45 years, for at least 50 years or more in a subject.. Suitably wherein immunogenic activity may refer to immunogenic antibody production. Suitably antibody production at a concentration which is immunogenic. Suitably antibody production at a concentration in sera which is immunogenic. Suitably at a concentration of between 1-20 ^g/ml, 1-18 ^g/ml, 1-16 ^g/ml, 1-14 ^g/ml, 1-12 ^g/ml, 2-18 ^g/ml , 2-16 ^g/ml , 2- 14 ^g/ml, 2-12 ^g/ml, or 2 – 10 ^g/ml in sera for example. The skilled reader, on considering the information set out in the Examples, will recognise that the VLPs or the immunogenic compositions of the invention exhibit immunogenic activity that makes them well suited to therapeutic use in the manner described in this specification. Medical Uses The present invention further relates to use of the VLP or the immunogenic composition comprising the VLP for use in therapy, or in the prevention and/or treatment of a disease. In a further aspect, the present invention further provides a method of treating a subject having a disease, comprising administering an effective amount of a VLP according to the first aspect or an immunogenic composition according to the twelfth aspect, to the subject. In further aspect, the present invention further provides a method of manufacturing a medicament for the treatment of a disease, the medicament comprising an effective amount of a VLP according to the first aspect or an immunogenic composition according to the twelfth aspect. Suitably the disease may be selected from: an infectious disease, cancer, an autoimmune disease, a cardiovascular disease, a metabolic disease, an inflammatory disease, a neurological disease, or rheumatological degenerative disease, or an addiction. Suitable infectious diseases include: viral, bacterial, fungal, or protozoan infections. Suitable viral infections include: COVID-19, SARS, MERS, influenza, common cold, respiratory syncytial virus infection, adenovirus infection, parainfluenza virus infection, norovirus infection, rotavirus infection, astrovirus infection, measles, mumps, rubella,
chickenpox, shingles, roseola, smallpox, fifth disease, chikungunya virus infection, HPV infection, Hepatitis A, B, C, D or E, warts, herpes, molluscum contagiosum, ebola, lassa fever, dengue fever, yellow fever, Marburg hemorrhagic fever, Crimean-Congo hemorrhagic fever, polio, viral meningitis, viral encephalitis, rabies, zika virus infection, west nile virus infection, HIV/AIDS, Hantavirus infection, HPS. Suitable bacterial infections include: urinary tract infections, cystitis, impetigo, bacterial food poisoning, campylobacteriosis, C.difficile infection, bacterial cellulitis, MRSA, CRPA, VRSA, sepsis, erysipelas, necrotising fasciitis, bacterial folliculitis, gonorrhoea, chlamydia, syphilis, mycoplasma genitalium, bacterila vaginosis, pelvic inflammatory disease, tuberculosis, whooping cough, Haemophilus influenzae disease, pneumonia, bacterial meningitis, lyme disease, cholera, botulism, tetanus, anthrax, Cryptosporidiosis, Diphtheria, E. coli infection, Legionnaires Disease, Leptospirosis, Listeriosis, salmonella infections, Shigellosis gastroenteritis, Staphylococcal infections, Streptococcal infections, TSS, typhoid fever, Yersenia infection. Suitable cancers include: breast cancer, liver cancer, lung cancer, pancreatic cancer, brain cancer, prostate cancer, bowel cancer, rectal cancer, bone cancer, leukemia, bladder cancer, cervical cancer, endometrial cancer, eye cancer, retinoblastoma, ewing sarcoma, gallbladder cancer, head and neck cancer, kaposi’s sarcoma, kidney cancer, laryngeal cancer, mesothelioma, myeloma, lymphoma, ovarian cancer, oesophageal cancer, mouth cancer, nasopharyngeal cancer, nose and sinus cancer, skin cancer, sarcoma, stomach cancer, testicular cancer, thyroid cancer, uterine cancer, vaginal cancer, penile cancer, vulval cancer. Suitable autoimmune diseases include: asthma, psoriasis, MS, rheumatoid arthritis, reactive arthritis, lupus, inflammatory bowel syndrome/disease, type 1 diabetes, Guillain-Barre syndrome, demyelinating polyneuropathy, Graves’ disease, Hashimo’s thyroiditis, Myasthenia gravis, vasculitis, pernicious anemia, ulcerative colitis, antiphospholipid syndrome, Kawasaki disease, alopecia, vitiligo, scleroderma, Sjogren’s syndrome, crohn’s disease, coeliac disease, Addison’s disease, narcolepsy. Suitable cardiovascular diseases include: angina, heart attack, heart failure, coronary heart disease, stroke, transient ischemic attack, peripheral arterial disease, aortic disease, atherosclerosis, hypertension, cerebrovascular disease, renal artery stenosis, aneurysm, cardiomyopathy, pulmonary heart disease, arrythmia, dysrhythmia, endocarditis, cardiomegaly, myocarditis, valvular heart disease, congenital heart disease, rheumatic heart disease. Suitable metabolic diseases include: hypercholesterolemia, hypertriglyceridemia, diabetes, hyperlipidemia, hyperbilirubinemia, hypercalcemia.
Suitable inflammatory diseases may include any of the above infections or autoimmune diseases. Suitable inflammatory diseases may include include: arthritis, asthma, tuberculosis, periodontis, chronic ulcers, sinusitis, hepatitis, glomerulonephritis, inflammatory bowel syndrome/disease, preperfusion injury, transplant rejection, sickle cell disease, allergies, cardiovascular disease, psoriasis, cytokine-mediated pruritus, COPD, diabetes, bronchitis, Crohn’s disease, atherosclerosis, dermatitis, arteritis, lupus. Suitable neurological diseases include: Alzheimer’s, ataxia, ALS, Bells palsy, brain tumours, aneurysms, epilepsy, Guillain-Barre syndrome, hydrocephalus, Meningitis, MS, muscular dystrophy, neurocutaneous syndromes, Parkinson’s, migraines, encephalitis, myasthenia gravis, dementia, seizures, spinal muscular atrophy, motor neuron disease, scoliosis, neuropathy, chronic fatigue syndrome, cerebal palsy. Suitable rheumatological degenerative diseases include: rheumatoid arthritis, psoriasis arthritis, spondylarthropathy, osteoarthritis, lupus, systemic sclerosis. Suitable addictions include: alcohol, nicotine, caffeine, amphetamines, opioids, sedatives, hypnotics, anxiolytics, cocaine, cannabinoids, hallucinogenics, phenycylcidine. In one embodiment, the VLP or the immunogenic composition are for use in the prevention or treatment of COVID-19. Suitably in such an embodiment, the functional molecule may be a SARS-CoV-2 antigen, suitably a SARS-CoV-2 spike protein. Alternatively in such an embodiment, the functional molecule may be an inflammatory cytokine, suitably IL-33. In one embodiment, the VLP or the immunogenic composition are for use in the prevention or treatment of psoriasis or arthritis. Suitably in such an embodiment, the functional molecule may be an inflammatory cytokine, suitably IL17. In one embodiment the VLP or the immunogenic composition are for use in the prevention or treatment of asthma or atopic dermatitis. Suitably in such an embodiment, the functional molecule may be an inflammatory cytokine, suitably IL13 or IL33. Suitably, an effective amount for administration to the subject is an effective amount to prevent or treat the disease. Suitable effective amounts can be readily determined by the skilled medical practitioner. Suitably a dose comprises an effective amount. A suitable dose of the VLP may comprise: 10- 100 micrograms, suitably 10-80 micrograms, suitably 20-60 micrograms, suitably 20-40 micrograms. Suitably the VLP or immunogenic composition may be administered by any route. Suitably the VLP or immunogenic composition may be administered enterally or parenterally. Suitably the
VLP or immunogenic composition may be administered orally, rectally, vaginally, sublingually, by injection, transdermally, or by inhalation. In one embodiment, the VLP or immunogenic composition may be administered by injection, suitably by subcutaneous injection. In one embodiment, the VLP or immunogenic composition may be administered by inhalation, suitably by nasal inhalation. Subject The present invention relates to the prevention and/or treatment of a disease in a subject by using the VLP or immunogenic composition thereof. Suitably the subject may be human or animal. Suitably therefore the prevention and/or treatment of disease may be in the veterinary field. Suitably the subject may be adult or child. Suitably the subject may be male or female. In one embodiment, the subject is an adult human. Suitably the subject may have been diagnosed with a disease. Alternatively, the subject may be suspected of having a disease. Suitably the subject may display one or more symptoms of a disease. Alternatively, the subject may be at risk of contracting a disease. Suitably the subject may have one or more risk factors associated with a disease. Suitable risk factors may include: weight, smoking, alcohol or substance addiction, age, sex, race, inheritance for example. Suitable risk factors may further include a genetic predisposition to a disease, for example by expression of particular gene, or by the presence of a particular mutation in a gene. In one embodiment, subjects that have been diagnosed with a disease or who have one or more symptoms of a disease are provided with the VLP or immunogenic composition for treatment of the disease. In one embodiment, subjects that are at risk of developing a disease are provided with the VLP or immunogenic composition for prevention of the disease. Other Uses The present invention further relates to use of the VLP in research and in the diagnosis of diseases. Suitably the VLP of the first aspect may be used in research. Suitably the VLP may be used as a detection tool. Suitably the VLP may be used as a label. Suitably in such embodiments,
the binding molecule of the VLP is attached to a functional molecule which is a fluorescent molecule. Suitably the binding molecule may be attached to a functional molecule which is an antigen binding molecule such as an antibody. Suitably, the functional molecule may further comprise a fluorescent molecule. Suitably the antigen binding molecule may specifically bind a cell surface receptor. Suitable cell surface receptors are discussed elsewhere herein, however suitably the cell surface receptor is specific to a cell type. Suitably therefore the VLP is capable of binding to, and labelling, specific cell types. Suitably the VLP may be used as a carrier. Suitably in such embodiments, the VLP may comprise a cargo. Suitably the cargo may be contained within the VLP, suitably within the VLP shell. Suitably the cargo may be a therapeutic molecule. Suitably therefore the VLP may not in itself be a therapeutic, but may be a carrier of a therapeutic molecule. Suitable therapeutic molecules may include oligonucleotides, small molecules, peptides, for example. In one embodiment, the therapeutic molecule may comprise an antisense oligonucleotide which may act to repress expression of a particular nucleic acid. In another embodiment, the therapeutic molecule may comprise a cytotoxic chemical which may act to trigger cell death. Suitably, in such embodiments, the VLP is targeted to a particular site, for example to a particular cell or cell type where the therapeutic molecule is required. Suitably this is achieved by the binding molecule of the VLP being attached to a functional molecule which is an antigen binding molecule such as an antibody. Suitably the antigen binding molecule may specifically bind to a cell surface receptor. Suitably to a cell surface receptor specific to the target cell. Suitably binding to the cell surface receptor may stimulate uptake of the VLP into the cell. Suitably therefore, the VLP is capable of binding to specific cell types and delivering cargo thereto. In a further aspect of the invention, there is provided a carrier VLP comprising the features of the first aspect, and in addition a cargo, wherein the cargo is contained within the VLP shell. Suitably the cargo is a therapeutic molecule. Suitably the VLP of the first aspect may also be used in diagnosis. Suitably the binding molecule of the VLP is attached to a functional molecule which is an antigen binding molecule, such as an antibody. Suitably the antibody specifically binds an antigen derived from a disease causing agent as discussed hereinabove. Suitably from an infectious agent such as a virus, bacterium, fungus, protozoan, or archaeon. Suitably, therefore, the VLP is capable of binding to a disease causing agent and allowing detection thereof.
Suitably therefore the VLP of the invention may be used in a method of diagnosing a disease in accordance with the sixteenth aspect of the present invention. Suitably there is provided a method of diagnosing a disease in a subject comprising: (a) Providing a virus like particle according to the first aspect of the invention, wherein the binding molecule is attached to a functional molecule and wherein the functional molecule is an antibody directed towards an antigen derived from a disease causing agent; (b) Mixing the virus like particle with a suitable sample from the subject; (c) Detecting whether the virus like particle precipitates; (d) Diagnosing the presence of a disease if the VLP precipitates. Suitably, in an embodiment where the functional molecule is an antigen binding molecule, the VLP further comprises a second binding protein. The second binding protein is described elsewhere herein. Suitably the antigen binding protein is attached to the second binding protein. Suitably the second binding protein binds to the first binding protein which is attached to a monomer of the heterodimeric capsid protein as described hereinabove. Suitably detection is via precipitation of the VLP bound to the disease causing agent. Suitably detecting precipitation may comprise visual confirmation, or testing with a spectrometer. Suitably if no precipitation occurs, the disease is not present. Suitably, the VLP may also comprise a fluorescent molecule. Suitably such a fluorescent molecule may be attached to a chemical modification of the binding molecule. In such embodiments, suitably the detection step may comprise detecting the presence of fluorescence in the sample. Suitably the detection step may comprise detecting the presence of fluorescent precipitation in the sample. Suitably diagnosing the presence of a disease if fluorescent precipitation occurs. Advantageously, the use of fluorescence allows more sensitive detection of the precipitation in a sample. A suitable sample from a subject may be a blood sample, saliva sample, serum sample, sputum sample, sperm sample, mucus sample, CSF sample. Suitably the sample is a fluid sample. Suitably the method of diagnosis may further comprise a step of incubating the sample with the VLP. Suitably for a period of time sufficient to allow the VLP to bind to any antigens in the sample and precipitate. Suitably for at least 1 minute, suitably up to 30 minutes, suitably up to 25 minutes, suitably up to 20 minutes, suitably up to 15minutes.
Suitable diseases which may be detected by the method may be any of those listed herein above. Suitably the method of diagnosis may further comprise a step of treatment of the subject if a disease is diagnosed. Suitably treatment of the subject may comprise administering an effective amount of any known treatment for the relevant disease to the subject. Certain embodiments of the invention will now be described with reference to the following examples: Examples Materials and Methods The invention disclosed here enables the production of VLP vaccines in the hepatitis VLP shell where only one protein is presented for each of the VLP-dimers, as shown in Figure 2 (image on the right). This was achieved by introducing specific mutations into the amino acid sequence of the VLP shell protein. Since the monomers shown in Figure 2 are in fact the same protein bound together through self-interaction, this interaction can be weakened if amino acids are introduced which have a positive or negative charge, thereby creating electrostatic repulsion. The inventors have reasoned that this can be exploited to produce heterodimeric viral capsid proteins by introducing the gene for the VLP shell protein into a vector allowing protein synthesis for example in bacteria such as E.coli. This is summarized in Figure 3. Testing of mutational effects: The inventors exploited the recent publication of Alphafold2, more specifically, Alphafold2 Advanced (available at https://colab.research.google.com/github/sokrypton/ColabFold/blob/main/beta/AlphaFold2_a dvanced.ipynb) This algorithm allows the accurate modelling of structures of protein pairs where the quality of predicted folding can be the pTM score rather than the pLDTT score which returns modelled protein structures ranked by the protein-protein interaction. The stoichiometry of interacting subunits is specified as x:y:z where ‘x’ is the predefined number of molecules of protein 1, ‘y’ the number of protein 2 molecules, and so forth. In the case of Hepatitis capsid modelling, the so-called asymmetric unit, which gives rise to the building blocks automatically assembling into the entire capsid, contains two dimers of one single protein, which would be modelled by simply specifying a number of ‘4’ for the capsid protein. This returns a structure of two dimers. Once mutations are introduced, two different proteins are derived from the original wild type capsid protein: one protein which contains a binding protein (such as Im7 or Barstar), and a
second protein which does not. When the stoichiometry for these two proteins is defined as “2:2” it means that Alphafold2 is going to return a protein complex which contains 2 molecules of each protein. As expected, Alphafold2 returned two dimers which were in turn positioned exactly as in the published Hepatitis virus asymmetrical unit structure (PDB: 6EDJ). Furthermore, the structures in each case as such were classified as ‘high predictive accuracy’ (Alphafold2 colouring index), with the predicted exception of unfolded domains (glycine serine linkers, and histidine-containing tags). This result confirms the operational accuracy of this modelling approach. Upon introducing a series of mutations, Alphafold2 was then interrogated and returned models consisting of either (i) dimers that were either composed of 2x protein 1 and 2x protein 2 (homodimers only), (ii) dimers that exclusively consisted of protein 1 and protein 2 (heterodimers only), or (iii) a mixture of both. This in silico approach allowed the iterative testing and identification of mutational combinations triggering the exclusive assembly of heterodimer structures, as detailed in the examples below. In each case, both pDLL and pTM scores for the structures were high, confirming a high predictive capacity, as shown below in the Table. Alphafold2 structure prediction of dimer formation generated by the optimized sequences shown in table 2. Model rank Dimer pLDDT score pTM score arrangement wHv – Im7 1 hetero 81.5 0.60 2 hetero 75.8 0.59 3 hetero 78.6 0.52 4 hetero 72.2 0.51 5 hetero 68.4 0.47 wHv Bs 1 hetero 78.7 0.55 2 hetero 81.1 0.51 3 hetero 74.4 0.45 4 hetero 72.1 0.44 5 homo 75.4 0.42 Example 1: Dimerisation of Woodchuck Hepatitis virus capsid proteins
In this example, instead of using the VLP shell from Hepatitis B virus (HBc), the groundhog hepatitis (Woodchuck hepatitis virus, (wHv)) was used. Structurally, both virus capsids are extremely similar. However, compared to HBc, wHv has distinct advantages: i) It forms capsids at cold temperatures (> 60% capsid formation occurs at temperatures as low as 4°C, Kukreja2014). This will increase the yield of VLPs in low temperature bacterial fermentation which is preferable for production of complex proteins. (ii) The temperature dependence on capsid formation is much weaker: Since the virus is adapted to groundhogs which hibernate and whose body temperature wildly fluctuates between 6.5°C and 37°C, VLPs from wHv are much more stable during temperature fluctuation. This could be a significant advantage both for manufacture and storage. (iii) Use of wHv abrogates any cross-reactivity to human Hepatitis B. The precludes even the theoretical possibility that having had a past infection with Hepatitis in some way interferes with the response to vaccination with the VLP vaccine. When both the wild type wHv protein and the wHv protein carrying an integrated Im7 protein are produced by bacteria, they can form dimers in two ways: either the two unmodified (called “wild type”) proteins together (called a “homodimer”) or one unmodified and one Im7-carrying protein (called “heterodimer”). The interaction itself is affected by the interaction of electrically charged amino acids within the protein. This is shown in Figure 3B (right). The negatively charged amino acid called E64 from one of the monomers interacts with a positively charged amino acid called K96 from the other monomer. Figure 3 shows the approach to creating a mutant wHv capsid where one dimer displays an integrated Im7 protein (marked as ‘Im7-insert’) toward the VLP surface. A. an expression vector is established which allows simultaneous expression of a wild type wHv protein and a wHv protein carrying the integrated Im7 protein, driven by a single promoter, through an internal ribosome binding site (RBS). This set-up can result in either formation of homodimers or heterodimers. B. The central ‘tip’ of a wHv homodimer, of amino acid making up the protein, as well as a close-up view of electrostatic forces between charged amino acids from both monomers (right). Example 2: modified wHv protein This electrostatic interaction between the monomers can be modified. An example of such a modification is; if a positively charged amino acid, for example K96, is replaced by a negatively charged amino acid, then the protein which is usually forms a dimer with itself (homodimer),
experiences electrostatic repulsion (E64 now meets position 96, now mutated from K to E). However, if the K96->E replacement on the protein is accompanied with a complementary replacement of E64->K placed on the corresponding monomer of wHv-Im7, then whenever one wHv monomer and one wHv-Im7 monomer form a dimer (“heterodimer”) this interaction is strengthened by an electrostatic attraction. This is illustrated in Figure 4a. The Alphafold2 software (Jumper, J et al. Nature (2021) and Varadi, M et al. Nucleic Acids Research (2021)) allows accurate prediction of protein structures that will be formed from any given amino acid sequence. When the above mutated protein sequences were entered into Alphafold2, it turned out that- despite the added electrostatic forces, surprisingly the formation of homodimers is still highly favoured, as shown in Figure 4B. Figure 4 shows interaction of wHv capsid protein monomers with exemplary mutations in their amino acid sequences. Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96E in wHv-Im7 and E64K into wHv, respectively. Panel B shows structural prediction using Alphafold2 database, showing that formation of homodimers composed of wHv/wHv and wHv- Im7/wHv-Im7, respectively, is still favoured despite the mutations introduced (shown is the top-ranked of five models, all of which show homodimer formation). This suggests that the mutations introduced into the amino acid sequences of the monomers to promote formation of heterodimers are not trivial. In order to overcome this problem, the inventors derived and introduced additional mutations as further refinements. These are illustrated below in Figure 5 and Figure 6. In each case, despite the increased electrostatic bias toward formation of wHv- wHv-Im7 heterodimers, still the desired effect could not be achieved. In the example in shown in figure 5, lysine96 in wHv is replaced by aspartate (D96), instead of glutamate (E96) in contrast to model shown in figures 3 and 4. Panel A (of figure 5) shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96D in wHv-Im7 and E64K into wHv, respectively. The mutation of K96 to D96 reduces side-chain bulkiness, shown on the left of panel B. The resulting structural prediction using Alphafold2 database (right) shows a predicted mixture between heterodimer and homodimer formation. In the example shown in Figure 6, further additional intra-chain ionic stabilization was achieved by added mutation of Lysine 67 to glutamate in the wHv149 subunit. Panel A shows a schematic of an exemplary expression vector showing that complementary mutations are introduced into the two different monomers: K96D in wHv-Im7 and E64K into wHv, respectively, in this example, an addition mutation K67E is introduced to the wHv149 subunit.
Surprisingly, this set of mutations resulted in increased heterodimer formation, as predicted by Alphafold2 (shown in panel B). Overall, these experiments confirm that exclusive formation of heterodimers is difficult to achieve, cannot be easily predicted based on single electric charge changing mutations, and hence is non-trivial. Example 3: mutations in wHv protein promote heterodimer formation The inventors found that certain combinations of mutations in amino acid sequence of the monomers combined with a specifically required linkers to join the wHv and the binding molecule, in this example Im7, as well as Barstar proteins, achieved the desired result of heterodimeric capsid proteins. This final, optimized, combination of mutations is summarized in Table 2. The analysis using Alphafold2 yielded very high statistical scores, indicating that the structures have a high predictive probability of forming the desired heterodimers, as shown in Table 3. In figure 7, shows the Ribbon diagram of the top rated structures obtained with the optimized engineered combination of mutations summarized in table 2, generated using Alphafold2, for heterodimers consisting of wHv149/wHv-Im7 (top) and wHv149/wHv-Bs (bottom). Despite the mutations, a structural overlay of the unmodified wHv protein with the resulting heterodimer mutant proteins, shows that they are almost completely identical in structure, as shown in Figure 8. This suggests that they will be able to form intact VLPs. In addition, figure 9 shows a space filling model generated using icn3d (NCBI/structure; PDB: 6edj), showing the asymmetric units of wild type wHv capsid (left) and the designed wHv149/wHv-Im7 (centre) and wHv149/wHv-Barstar (right) heterodimers, respectively, from the front (top) and from the surface (bottom) of the capsid. The dashed line indicates the plane of subunit interaction leading to VLP assembly, which remains undisturbed by the integration of Im7 or Barstar. A comparison to the unmodified wHv protein structure suggests that the space occupancy of the mutated structures will not interfere with VLP formation. The examples shown in figure 10 shows a view from the side showing “spike” made from two helices from each monomer protruding to the outside (arrow) and bottom part mediating interaction with other dimers in the capsid ( arrow). Also shown is a top view of the HBc capsid protein: “homodimer” composed of two identical HBc monomer proteins. Figure 11 highlights the a total of four conserved electrostatic interactions in all hepatitis virus (including HBc / wHv) wild type capsid proteins. Protein alignment of capsids from human and woodchuck hepatitis virus showing conservation of E8, R56, E64, K96, mediating conserved
electrostatic binding. Interestingly, all four sites become electrostatic repulsed while, when pairing with the complementary mutations inserted into the functionalized monomer, regain electrostatic attraction. For instance, E64 of each monomer interacts with K96 of the other monomer and E8 of each monomer interacts with R56 of the other monomer. Example 4: wHv capsid heterodimer components are soluble and evenly expressed in E.coli. Method: A pET – derived plasmid harbouring DNA encoding the wHv_Im7 and wHv proteins, each under the control of a separate T7-inducible promoter, was transfected into standard BL21/DE3 E.coli. Recombinant proteins were induced by addition of IPTG to 0.3 mM and incubation at 16C for 16h. Subsequently, cells were lyzed by sonication and insoluble proteins and inclusion bodies separated by centrifugation. The resulting cytosolic fractions were subjected to denaturing SDS-PAGE (shown in figure 12). The data shown in the SDS-PAGE gel shows that both wHv_Im7 and wHv proteins, when driven by individual T7 promoters, can be expressed as soluble proteins in E.coli and are synthesized at approximately even stoichiometric ratios (black and grey arrows). Example 5: HBc capsid heterodimer and binding protein form a complex and co-purify on immobilized metal affinity chromatography. Method: A pET – derived plasmid harbouring DNA encoding the HBc_Im7 and HBc_wt proteins, each under the control of a separate T7-inducible promoter, was transfected into standard BL21/DE3 E.coli. Recombinant proteins were induced by addition of IPTG to 0.3 mM and incubation at 16C for 16h. A separate plasmid was transfected harbouring the binding protein ColE7-IL31 under the control of a tetracycline-inducible promoter. Recombinant protein expression was induced by adding 40 ng/ml of anhydrotretracyline at 16C for 16h. Subsequently, cells were lyzed by sonication and insoluble proteins and inclusion bodies separated by centrifugation. The resulting cytosolic fractions of cells expressing HBc_Im7/ HBc_wt and ColE7-IL31 were mixed at room temperature for 30min, followed by purification on Ni-NTA agarose and subsequent SDS-PAGE analysis of the individual fractions (shown in figure 13). The data shown in the SDS-PAGE gel HBc_Im7 and HBc_wt proteins, when driven by individual T7 promoters, can expressed as soluble proteins in E.coli and, furthermore, that they form a complex with the binding protein ColE7-Il31 which co-purifies on Ni- chromatography, confirming formation of stable complexes (white, dark, and light grey arrows, respectively in the eluate lane).
Example 6: wHv heterodimeric VLPs can be expressed and autoassemble into large nanoparticles in E.coli, and co-purify on immobilized metal affinity chromatography. Methods: Cloning, expression, density gradient: the epitope protein used was murine Interleukin 31 harbouring a single point mutation to inactivate receptor transactivation (SEQ ID NOs: 35 and 36). The epitope was N-terminally fused to Colicin E7 (SEQ ID NOs: 31 and 32) and connected via a rigid alpha helical linker (SEQ ID NOs: 33 and 34)), these proteins and VLP scaffold proteins were cloned onto plasmid DU75351 (SEQ ID NO:30) into BL21/DE3 E.coli. Transfected cells were inoculated into LB broth overnight at 30 °C, followed by expansion into 200ml medium the next morning at 37°C in shaker flasks. Upon reaching of OD595 of 0.8, temperature was reduced to 16°C. The scaffold protein was under the control of a T7 promoter and induced with 0.3 mM IPTG. The epitope protein was under the control of a tetA/tetR promoter and induced with 40 ng/ml anhydrotetracycline. Induction was maintained for 3h. Biomass was harvested and lysed in 25mM Tris, pH 7.4, 200mM NaCl using a high pressure homogenizer (Emulsiflex). Thereafter, lysates were digested with Benzonase for 1h and Polysorbate 80 was added to 0.005%. Lysates were spun down for 10’ at 25000 rpm and supernatants filtered through 0.45 and 0.22 micron filters, respectively, to yield cytosolic fractions. Cytosols were applied to a discontinuous sucrose gradient in the same buffer and spun for 6h at 255,000G with deactivated deceleration. Individual sucrose fractions were analyzed by SDS PAGE. Immobilized metal affinity chromatography (IMAC): Cytosolic fractions of heterodimeric VLPs decorated with epitope protein were prepared as detailed in the Methods for Figure 1. Cytosols were adjusted to 30mM imidazole, followed by IMAC chromatography on a Sartobind IDA 1ml membrane (Sartorius) charged with Nickel-Sulfate. After adsorption, membrane was washed with the same buffer containing 70mM imidazole. Bound protein was eluted with 250mM imidazole and fractions analyzed by SDS PAGE. Results: The data shown in Figure 14 confirms that heterodimeric VLPs can be expressed and autoassemble into large nanoparticles in E.coli. The data also confirms that these heterodimeric VLPs can be decorated fully with epitope protein on the surface. The data in Figure 15 shows that both the WHcIm7 (grey arrow) and the WHc protein (black arrow) co- purify on IMAC, confirming that the VLP scaffold proteins are indeed both bound to the epitope. The staining density of epitope protein (white arrow) and WHcIm7 moiety (grey arrow) is approximately even, indicating that the VLPs are fully decorated with epitope protein. Example 7: Determination of Size of VLPs.
Methods: Dynamic Light Scatter analysis: The eluate fraction shown in Figure 15 was subjected to a Multiangle Dynamic Light Scatter analysis (MADLS) using a Malvern Zetasizer Ultra in a quarz microcuvette with 80 microlitre volume. Transmission electron microscopy (TEM): the samples shown in Figure 15 were adsorbed to glow discharged carbon-formvar-coated copper grids and negatively stained with a 1% aqueous uranyl acetate. The grids were examined at 80 kV. Results: The light scattering profile shown in Figure 16 confirms a uniform size distribution of the VLP particles at the expected size range (approximately 32 nm diameter). The TEM images in Figure 17 confirm the size of VLPs determined by DLS analysis in Figure 16. The thickened rim structure and fuzzy outer rim appearance is consistent with decoration of an epitope localized to the surface.
Table 1. Genetic mutations introduced into wHv subunits failing to achieve tight heterodimer formation.
1 Alphafold2 structure prediction using pTM score ranking and 5 models for each analysis. ‘Hetero’ - the top three ranked models showed heterodimer formation; ‘mixed’ - the top ranked model showed heterodimer. * failure to form tight dimer of dimers.
Table 2
Table 3
Table 3 shows the top-ranked model structures produced with the optimized combination of mutations summarized in Table 2 are shown in Figure 7. The analysis using Alphafold2 yielded very high statistical scores, indicating that the structures have a high predictive probability, as shown in Table 3. These data confirm that achieving heterodimer formation in wHv VLPs is non-trivial, but the inventors have achieved resolution of the modifications which will achieve heterodimer formation reliably during large-scale VLP manufacture.
Sequences
Complementary mutant pairing exemplified in the woodchuck wHv variant:
Sequence Codes: Underlined – mutated compared to wild type Bold font – amino acid negatively charged at physiological pH and forming inter-molecular electrostatic bond Highlighted font – amino acid positively charged at physiological pH and forming inter- molecular electrostatic bond Double underlined – the linker sequence designed to incorporate the functionalizing protein into the wild type HBc/wHv capsid Italic – the functionalized protein sequence SEQ ID NO: 6 – Barstar protein from Bacillus amyloliquefaciens KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDALTGWVEYPLVLEWRQFEQSKQ LTENGAESVLQVFREAKAEGADITIELS SEQ ID NO: 7 – Im7 protein from E.coli ELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRDDS PEGIVKEIKEWRAANGKPGFKQ SEQ ID NO: 8 – T7 promotor parent sequence from Bacteriophage T7 agcataat SEQ ID NO: 9 – linker sequence (synthetic) GGGGSGGGGS SEQ ID NO: 10 – linker sequence (synthetic) GGGGGSGGGGS SEQ ID NO: 11 – linker sequence (synthetic) SGGGSSGSG SEQ ID NO: 12 – Barnase protein from Bacillus amyloliquefaciens
AQVINTFDGVADYLQTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREG KLPGKSGRTWRWADINYTSGFRNSDRILYSSDWLIYKTTDHYQTFTKIR SEQ ID NO: 13 – ColE7 protein from E.coli ESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEV SKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKATSFALHHEKPISQNGGVYDMDNISV VTPKRAIDIHRGKS SEQ ID NO:14 – linker (synthetic) KAAAEKAAAE SEQ ID NO:15 – linker (synthetic) GGKAAAE SEQ ID NO:16: wHv ‘149’ monomer (truncated from wild type woodchuck hepatitis virus capsid protein) MDIDPYKEFGSSYQLLNFLPLDFFPDLNALVDTATALYEEELTGREHCSPHHTAIRQALVCW DELTKLIAWMSSNITSEQVRTIIVNHVNDTWGLKVRQSLWFHLSCLTFGQHTVQEFLVSFGV WIRTPAPYRPPNAPILSTLPEHTVI* SEQ ID NO:17 – Human IL31 functional molecule nucleic acid sequence (optimized for E.coli expression and without the signal peptide): Atgcgtttactacgaccaagtgatgatgtacagaaaatagtcgaggaattacagtccctctcgaagatgcttttgaaagatgtgga ggaagagaagggcgtgctcgtgtcccagaattacacgctgccgtgtctcagccctgacgcccagccgccaaacaacatccac agcccagccatccgggcatatctcaagacaatcagacagctagacaacaaatctgttattgatgagatcatagagcacctcgac aaactcatatttcaagatgcaccagaaacaaacatttctgtgccaacagacacccatgaatgtaaacgcttcatcctgactatttct caacagttttcagagtgcatggacctcgcactaaaatcattgacctctggagcccaacaggccaccacttaa SEQ ID NO:18 – Human Hepatitis B virus heterodimer monomer with integrated Im7: Atggacattgacccgtataaagaatttggagcttctgtggagttactctcttttttgccttctgacttctttccttctattcgagatctcctcg acaccgcctcagctctgtatcgggaggccttagagtctccggaacattgttcacctcaccatacagcactcgaccaagctattctgt gttggggtgagttgatgaatttggccacctgggtgggaagtaatttgcaaaaagctgcagcagagaaagctgcagctgaaaata gtattagtgattacacagaggctgagtttgttcaacttcttaaggaaattgaaaaagagaatgttgctgcaactgatgatgtgttagat gtgttactcgaacactttgtaaaaattactgagcatccagatggaacggatctgatttattatcctagtgataatagagacgatagcc ccgaagggattgtcaaggaaattaaagaatggcgagctgctaacggtaagccaggaggtggagcatccagggaattagtagt cagctatgttaatgttaatatgggcctagatatcagacaactactgtggtttcacatttcctgtcttacttttggaagagaaactgttcttg aatatttggtgtcttttggagtgtggattcgcactcctcctgcttacagaccaccaaatgcccctatcttatcaacacttccggaaacta ctgttgtttaa
SEQ ID NO:19 – Human Hepatitis B virus heterodimer monomer with integrated Barstar binding protein: atggatattgatccgtataaagaatttggcgcgagcgtggaactgctgagctttctgccgagcgatttttttccgagcattcgcgatct gctggataccgcgagcgcgctgtatcgcgaagcgctggaaagcccggaacattgcagcccgcatcataccgcgctggatcag gcgattctgtgctggggcgaactgatgaacctggcgacctgggtgggcagcaacctgcagaaagcggcggcggaaaaagcg gcggcggaaaaagcggtgattaacggcgaacagattcgcagcattagcgatctgcatcagaccctgaaaaaagaactggcg ctgccggaatattatggcgaaaacctggatgcgctgtgggatgcgctgaccggctgggtggaatatccgctggtgctggaatggc gccagtttgaacagagcaaacagctgaccgaaaacggcgcggaaagcgtgctgcaggtgtttcgcgaagcgaaagcggaa ggcgcggatattaccattgaactgagcggcggcggcgcgagccgcgaactggtggtgagctatgtgaacgtgaacatgggcct ggatattcgccagctgctg tggtttcatattagctgcctgacctttggccgcgaaaccgtgctggaatatctggtgagctttggcgtgtggattcgcaccccgccgg cgtatcgcccgccgaacgcgccgattctgagcaccctgccggaaaccaccgtggtgtaa SEQ ID NO:20 – Complementary Human Hepatitis B virus heterodimer partner monomer without binding protein: Atggacattgacccgtataaaaaatttggagcttctgtggagttactctcttttttgccttctgacttctttccttctattcgagatctcctcg acaccgcctcagctctgtatcgggaggccttagagtctccggaacattgttcacctcaccatacagcactcaggcaagctattctgt gttggggtaaattgatgaatttggccacctgggtgggaagtaatttggaagacccagcatccagggaattagtagtcagctatgtt aatgttaatatgggcctaaaaatcagacaactactgtggtttcacatttcctgtcttacttttggaagagaaactgttcttgaatatttggt gtcttttggagtgtggattcgcactcctcctgcttacagaccaccaaatgcccctatcttatcaacacttccggaaactactgttgttta a SEQ ID NO:21 – Woodchuck hepatitis virus heterodimer monomer with integrated Im7 binding protein: ATGGACATCGATCCATATAAGGAGTTTGGCTCCTCATACCAATTACTTAATTTCCTTCCC TTGGACTTTTTCCCCGACCTTAACGCCCTGGTTGACACGGCTACGGCGCTTTACGAAGA GGAATTAACGGGGCGTGAACATTGTTCACCTCATCACACGGCCATTGATCAAGCGTTG GTATGCTGGGATGAACTTACGAAACTGATTGCCGATATGAGTTCAAATATTACGAGCAA AGCAGCAGCCGAGAAGGCAGCAGCCGAAGAACTGAAAAATAGCATTTCAGACTACACC GAAGCAGAATTTGTGCAGTTACTGAAAGAGATCGAGAAGGAGAACGTAGCCGCAACCG ATGACGTGCTTGATGTCCTGCTTGAACATTTAGTAAAGATTACGGAACATCCAGACGGT ACGGATTTAATCTATTATCCTAGTGACAATCGCGACGACAGTCCAGAAGGCATCGTAAA GGAGATTAAAGAATGGCGTGCTGCAAACGGAAAGCCTGGGTTTAAGCAGGGTGGAAAA GCTGCGGCAGAACAGGTGCGTACCATTATCGTAAATCACGTCAATGATACCTGGGGTC TTGATGTTCGTCAGTCCCTGTGGTTTCACCTTTCATGCTTGACGTTTGGTCAGCACACA GTACAGGAGTTCCTTGTTTCTTTCGGGGTATGGATTCGTACACCAGCTCCTTATCGCCC TCCTAACGCACCTATTTTATCCACGTTACCTGAACATACCGTTATTTAG SEQ ID NO:22 – Woodchuck hepatitis virus heterodimer monomer with integrated Barstar binding protein: ATGGACATCGACCCATACAAAGAATTTGGAAGTAGTTACCAGCTTCTGAATTTCTTACCT CTTGACTTCTTTCCAGACTTGAACGCGCTGGTAGACACAGCGACAGCTTTATATGAAGA GGAGCTGACAGGCCGCGAGCACTGCTCACCTCATCATACGGCCATCGATCAAGCACTG GTATGCTGGGATGAGTTGACTAAGTTGATTGCTGACATGTCATCTAACATCACCTCCAA
AGCAGCGGCCGAGAAGGCAGCAGCAGAAAAAAAGGCTGTAATTAACGGTGAGCAGAT CCGCAGTATTAGCGACTTACACCAAACATTGAAAAAGGAACTTGCCTTACCCGAGTACT ATGGCGAAAACCTGGATGCTCTGTGGGACGCTTTAACAGGATGGGTCGAGTACCCGTT GGTGTTAGAATGGCGCCAGTTCGAGCAGAGTAAGCAATTGACTGAGAATGGTGCCGAA TCCGTATTACAAGTATTCCGCGAAGCCAAGGCCGAGGGGGCAGATATCACTATTGAAC TGTCTAAAGCCGCGGCCGAACAGGTACGCACTATCATTGTGAACCATGTCAATGACAC CTGGGGGTTAGATGTCCGTCAATCCCTTTGGTTTCATTTATCGTGCTTAACTTTCGGTCA GCACACTGTTCAAGAGTTTCTGGTCTCGTTTGGAGTATGGATTCGCACGCCTGCGCCAT ATCGCCCGCCTAATGCGCCTATTCTGTCTACCTTGCCCGAACATACGGTTATTTAG SEQ ID NO:23 – Complementary Woodchuck Hepatitis virus partner monomer without binding protein: ATGGATATTGATCCGTATAAAAAATTTGGCAGCAGCTATCAGCTGCTGAACTTTCTGCC GCTGGATTTTTTTCCGGATCTGAACGCGCTGGTGGATACCGCGACCGCGCTGTATGAA GAAGAACTGACCGGCCGCGAACATTGCAGCCCGCATCATACCGCGATTCGCCAAGCG CTGGTGTGCTGGGATAAACTGACCGAACTGATTGCGTGGATGAGCAGCAACATTACGA GCAAACAAGTGGATACCATTATTGTGAACAAAGTGAACGATACCTGGGGCCTGAAAGTG CGTCAGAGCCTGTGGTTTCATCTGAGCTGCCTGACCTTTGGTCAGCATACCGTGCAAG AATTTCTGGTGAGCTTTGGCGTGTGGATTCGCACCCCGGCGCCGTATCGCCCGCCGAA CGCGCCGATTCTGAGCACCCTGCCGGAACATACCGTGATTTAA SEQ ID NO:24 – Functional molecule and second binding protein sequence: ColicinE7- fused-to-human-IL31 ATGGAGAGTAAACGGAATAAGCCAGGGAAGGCAACAGGTAAAGGAAAACCTGTCAATA ATAAGTGGTTAAATAATGCAGGTAAAGACTTAGGTTCTCCTGTTCCAGATCGTATAGCTA ATAAACTACGTGATAAGGAGTTTAAAAGTTTCGATGATTTTCGTAAGAAATTCTGGGAAG AAGTGTCAAAAGATCCTGAGTTAAGTAAACAATTTAGTCGAAACAATAATGATCGAATGA AGGTTGGAAAAGCGCCCAAGACTAGAACCCAGGATGTTTCAGGGAAGGCAACTTCATT CGCACTTCATCATGAGAAGCCGATCAGCCAAAATGGTGGTGTCTATGATATGGATAACA TCAGCGTGGTAACACCTAAACGTGCTATTGATATTCACCGAGGTAAAAGCGGAGGTGG CTCATCAGGATCtGGTGAAAACCTGTATTTTCAGGGatccGGAGGTGGCTCAggacgtttactac gaccaagtgatgatgtacagaaaatagtcgaggaattacagtccctctcgaagatgcttttgaaagatgtggaggaagagaag ggcgtgctcgtgtcccagaattacacgctgccgtgtctcagccctgacgcccagccgccaaacaacatccacagcccagccat ccgggcatatctcaagacaatcagacagctagacaacaaatctgttattgatgagatcatagagcacctcgacaaactcatattt caagatgcaccagaaacaaacatttctgtgccaacagacacccatgaatgtaaacgcttcatcctgactatttctcaacagttttca gagtgcatggacctcgcactaaaatcattgacctctggagcccaacaggccaccacttaa SEQ ID NO:25 – Functional molecule and second binding protein sequence: Barnase-fused- to-murine-IL31 ATGgcacaggttatcaacacgtttgacggggttgcggattatcttcagacatatcataagctacctgataattacattacaaaatc agaagcacaagccctcggctgggtggcatcaaaagggaaccttgcagacgtcgctccggggaaaagcatcggcggagaca tcttctcaaacagggaaggcaaactcccgggcaaaagcggacgaacatggcgtTGGgcggatattaactatacatcaggctt cagaaattcagaccggattctttactcaagcgactggctgatttacaaaacaacggaccattatcagacctttacaaaaatcaga GGAGGTTTAGCTGAGGCAGCTGCTAAGGAAGCTGCCGCAAAGGCTGCAAAAGAAGATC TGCGCACCACCATTGATCTGCTGAAACAGGAAAGCCAGGATCTGTATAACAACTATAGC
ATTAAACAGGCGAGCGGCATGAGCGCGGATGAAAGCATTCAGCTGCCGTGCTTTAGCC TGGATCGCGAAGCGCTGACCAACATTAGCGTGATTATTGCGCATCTGGAAAAAGTGAAA GTGCTGAGCGAAAACACCGTGGATACCAGCTGGGTGATTCGCTGGCTGACCAACATTA GCTGCTTTAACCCGCTGAACCTGAACATTAGCGTGCCGGGCAACACCGATGAAAGCTA TGATTGCGCGGTGTTTGTGCTGACCGTGCTGAAACAGTTTAGCAACTGCATGGCGGAA CTGCAGGCGAAGGACCACGACCACGACCACGACCACGACCACGAGtaataa SEQ ID NO:26 – Nucleic acid sequence for vector that expresses both human Hepatitis B virus heterodimer monomer units (internal number DU73928; derived from pET-duet vector) GGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAG GAGATATACCAtggacattgacccgtataaagaatttggagcttctgtggagttactctcttttttgccttctgacttctttccttcta ttcgagatctcctcgacaccgcctcagctctgtatcgggaggccttagagtctccggaacattgttcacctcaccatacagcactcg accaagctattctgtgttggggtgagttgatgaatttggccacctgggtgggaagtaatttgcaaaaagctgcagcagagaaagct gcagctgaaaatagtattagtgattacacagaggctgagtttgttcaacttcttaaggaaattgaaaaagagaatgttgctgcaact gatgatgtgttagatgtgttactcgaacactttgtaaaaattactgagcatccagatggaacggatctgatttattatcctagtgataat agagacgatagccccgaagggattgtcaaggaaattaaagaatggcgagctgctaacggtaagccaggaggtggagcatcc agggaattagtagtcagctatgttaatgttaatatgggcctagatatcagacaactactgtggtttcacatttcctgtcttacttttggaa gagaaactgttcttgaatatttggtgtcttttggagtgtggattcgcactcctcctgcttacagaccaccaaatgcccctatcttatcaac acttccggaaactactgttgtttaaTGAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAAT AACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACG GCCGCATAATAATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATTCCCCTCTA GAAATAATTTTGTTTAACTTTAAGAAGGAGATATACCatggacattgacccgtataaaaaatttggagctt ctgtggagttactctcttttttgccttctgacttctttccttctattcgagatctcctcgacaccgcctcagctctgtatcgggaggccttaga gtctccggaacattgttcacctcaccatacagcactcaggcaagctattctgtgttggggtaaattgatgaatttggccacctgggtg ggaagtaatttggaagacccagcatccagggaattagtagtcagctatgttaatgttaatatgggcctaaaaatcagacaactact gtggtttcacatttcctgtcttacttttggaagagaaactgttcttgaatatttggtgtcttttggagtgtggattcgcactcctcctgcttaca gaccaccaaatgcccctatcttatcaacacttccggaaactactgttgtttaataaTAAGCGGCCGCTTAACCTAGG CTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTT GAGGGGTTTTTTGCTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTT TGTTGTACACGGCCGCATAATCGgttgacactctatcattgatagagttattttaccactccctatcagtgatagaga aAagtgaaatgaatagttcgacaaaaatctagataacgagggcaaCATaaGGTACCgTCGAGTCTGGTAAAG AAACCGCTGCTGCGAAATTTGAACGCCAGCACATGGACTCGTCTACTAGCGCAGCTTA ATTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTA AACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATCCGGATTGGCGAATGG GACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGT GACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTC TCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTT CCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCAC GTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTT CTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGGTCTATTC TTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTA ACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTCTGGCGGCACGAT GGCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTA AATCAATCTAAAGTATATATGAGTAAACTTGGTCTTtaGGACCCACTTTCACATTTAAGTT GTTTTTCTAATCCGCAgATGATCAATTCAAGGCCGAATAAGAAGGCTGGCTCTGCACCT TGGTGATCAAATAATTCGATAGCTTGTCGTAATAATGGCGGCATACTATCAGTAGTAGG TGTTTCCCTTTCTTCTTTAGCGACTTGATGCTCTTGATCTTCCAATACGCAACCTAAAGT AAAATGCCCCACAGCGCTGAGTGCATATAATGCATTCTCTAGTGAAAAACCTTGTTGGC ATAAAAAGGCTAATTGATTTTCGAGAGTTTCATACTGTTTTTCTGTAGGCCGTGTACCTA
AATGTACTTTTGCTCCATCGCGATGACTTAGTAAAGCACATCTAAAACTTTTAGCGTTAT TACGTAAAAAATCTTGCCAGCTTTCCCCTTCTAAAGGGCAAAAGTGAGTATGGTGCCTA TCTAACATCTCAATGGCTAAGGCGTCGAGCAAAGCCCGCTTATTTTTTACATGCCAATA CAATGTAGGCTGCTCTACACCTAGCTTCTGGGCGAGTTTACGGGTTGTTAAACCTTCGA TTCCGACCTCATTAAGCAGCTCTAATGCGCTGTTAATCACTTTACTTTTATCTAATCTAGA CATcattaattccTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCG TTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTAC CATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTT ATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTA TCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGT TAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGT TTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCC ATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTT GGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGC CATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAG TGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCAC ATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCA AGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATC TTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATG CCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTT CAATCATGATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATG TATTTAGAAAAATAAACAAATAGGTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCC ACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTG CGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCC GGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATAC CAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCA CCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATA AGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTC GGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGA ACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAG GCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCT TCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTG AGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAA CGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTG CGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCT CGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCG CCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATATATGGTG CACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACACTCCGCTATC GCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTGACGCGCC CTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGG AGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGCGGT AAAGCTCATCAGCGTGGTCGTGAAGCGATTCACAGATGTCTGCCTGTTCATCCGCGTC CAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTGGCTTCTGATAAAGCGGGCCATGT TAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCTCCGTGTAAGGGGGATTTCTGTTC ATGGGGGTAATGATACCGATGAAACGAGAGAGGATGCTCACGATACGGGTTACTGATG ATGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAAACAACTGGCGGTATGGATGCG GCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCGCTTCGTTAATACAGATGTA GGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGATCCGGAACATAATGGTGC AGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAACACGGAAACCGAAGACCATTCA TGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTTCACGTTCGCTCGCGT ATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCGCCAGCCTAGCCGGGTCCTCA
ACGACAGGAGCACGATCATGCTAGTCATGCCCCGCGCCCACCGGAAGGAGCTGACTG GGTTGAAGGCTCTCAAGGGCATCGGTCGAGATCCCGGTGCCTAATGAGTGAGCTAACT TACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAG CTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCCAG GGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCTGG CCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTGGTTTGCCCCAGCAGGCGAAAATCCT GTTTGATGGTGGTTAACGGCGGGATATAACATGAGCTGTCTTCGGTATCGTCGTATCCC ACTACCGAGATGTCCGCACCAACGCGCAGCCCGGACTCGGTAATGGCGCGCATTGCG CCCAGCGCCATCTGATCGTTGGCAACCAGCATCGCAGTGGGAACGATGCCCTCATTCA GCATTTGCATGGTTTGTTGAAAACCGGACATGGCACTCCAGTCGCCTTCCCGTTCCGCT ATCGGCTGAATTTGATTGCGAGTGAGATATTTATGCCAGCCAGCCAGACGCAGACGCG CCGAGACAGAACTTAATGGGCCCGCTAACAGCGCGATTTGCTGGTGACCCAATGCGAC CAGATGCTCCACGCCCAGTCGCGTACCGTCTTCATGGGAGAAAATAATACTGTTGATG GGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGAACATTAGTGCAGGCAGCTTCCA CAGCAATGGCATCCTGGTCATCCAGCGGATAGTTAATGATCAGCCCACTGACGCGTTG CGCGAGAAGATTGTGCACCGCCGCTTTACAGGCTTCGACGCCGCTTCGTTCTACCATC GACACCACCACGCTGGCACCCAGTTGATCGGCGCGAGATTTAATCGCCGCGACAATTT GCGACGGCGCGTGCAGGGCCAGACTGGAGGTGGCAACGCCAATCAGCAACGACTGTT TGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTAATTCAGCTCCGCCATCGCCGC TTCCACTTTTTCCCGCGTTTTCGCAGAAACGTGGCTGGCCTGGTTCACCACGCGGGAA ACGGTCTGATAAGAGACACCGGCATACTCTGCGACATCGTATAACGTTACTGGTTTCAC ATTCACCACCCTGAATTGACTCTCTTCCGGGCGCTATCATGCCATACCGCGAAAGGTTT TGCGCCATTCGATGGTGTCCGGGATCTCGACGCTCTCCCTTATGCGACTCCTGCATTA GGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCGCCGCCGCAAGGAATGGTG CATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGGGGCCTGCCACCATACCCA CGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCCCATCGGTGA TGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACGA TGCGTCCGGCGTAGAGGATCGAGATCGATCTCGATCCCGCGAAATTAATACGACTCAC TATA SEQ ID NO:27 – Sequence of vector that expresses both woodchuck hepatitis virus heterodimer monomer subunits (as well as second binding protein-functional molecule Barnase-IL31, derived from pET duet plasmid) GGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAG GAGATATACCATGGACATCGACCCATACAAAGAATTTGGAAGTAGTTACCAGCTTCTGA ATTTCTTACCTCTTGACTTCTTTCCAGACTTGAACGCGCTGGTAGACACAGCGACAGCT TTATATGAAGAGGAGCTGACAGGCCGCGAGCACTGCTCACCTCATCATACGGCCATCG ATCAAGCACTGGTATGCTGGGATGAGTTGACTAAGTTGATTGCTGACATGTCATCTAAC ATCACCTCCAAAGCAGCGGCCGAGAAGGCAGCAGCAGAAAAAAAGGCTGTAATTAACG GTGAGCAGATCCGCAGTATTAGCGACTTACACCAAACATTGAAAAAGGAACTTGCCTTA CCCGAGTACTATGGCGAAAACCTGGATGCTCTGTGGGACGCTTTAACAGGATGGGTCG AGTACCCGTTGGTGTTAGAATGGCGCCAGTTCGAGCAGAGTAAGCAATTGACTGAGAA TGGTGCCGAATCCGTATTACAAGTATTCCGCGAAGCCAAGGCCGAGGGGGCAGATATC ACTATTGAACTGTCTAAAGCCGCGGCCGAACAGGTACGCACTATCATTGTGAACCATGT CAATGACACCTGGGGGTTAGATGTCCGTCAATCCCTTTGGTTTCATTTATCGTGCTTAA CTTTCGGTCAGCACACTGTTCAAGAGTTTCTGGTCTCGTTTGGAGTATGGATTCGCACG CCTGCGCCATATCGCCCGCCTAATGCGCCTATTCTGTCTACCTTGCCCGAACATACGGT TATTTAGTGAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGCATA ACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACGGCCGCATAAT
AATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTT GTTTAACTTTAAGAAGGAGATATACCATGGATATTGATCCGTATAAAAAATTTGGCAGCA GCTATCAGCTGCTGAACTTTCTGCCGCTGGATTTTTTTCCGGATCTGAACGCGCTGGTG GATACCGCGACCGCGCTGTATGAAGAAGAACTGACCGGCCGCGAACATTGCAGCCCG CATCATACCGCGATTCGCCAAGCGCTGGTGTGCTGGGATAAACTGACCGAACTGATTG CGTGGATGAGCAGCAACATTACGAGCAAACAAGTGGATACCATTATTGTGAACAAAGTG AACGATACCTGGGGCCTGAAAGTGCGTCAGAGCCTGTGGTTTCATCTGAGCTGCCTGA CCTTTGGTCAGCATACCGTGCAAGAATTTCTGGTGAGCTTTGGCGTGTGGATTCGCACC CCGGCGCCGTATCGCCCGCCGAACGCGCCGATTCTGAGCACCCTGCCGGAACATACC GTGATTTAATAAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGC ATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACGGCCGCAT AATCGgttgacactctatcattgatagagttattttaccactccctatcagtgatagagaaAagtgaaatgaatagttcgacaaa aatctagataacgagggcaaCATATGgcacaggttatcaacacgtttgacggggttgcggattatcttcagacatatcataag ctacctgataattacattacaaaatcagaagcacaagccctcggctgggtggcatcaaaagggaaccttgcagacgtcgctccg gggaaaagcatcggcggagacatcttctcaaacagggaaggcaaactcccgggcaaaagcggacgaacatggcgtTGG gcggatattaactatacatcaggcttcagaaattcagaccggattctttactcaagcgactggctgatttacaaaacaacggacca ttatcagacctttacaaaaatcagaGGAGGTTTAGCTGAGGCAGCTGCTAAGGAAGCTGCCGCAAA GGCTGCAAAAGAAGATCTGCGCACCACCATTGATCTGCTGAAACAGGAAAGCCAGGAT CTGTATAACAACTATAGCATTAAACAGGCGAGCGGCATGAGCGCGGATGAAAGCATTC AGCTGCCGTGCTTTAGCCTGGATCGCGAAGCGCTGACCAACATTAGCGTGATTATTGC GCATCTGGAAAAAGTGAAAGTGCTGAGCGAAAACACCGTGGATACCAGCTGGGTGATT CGCTGGCTGACCAACATTAGCTGCTTTAACCCGCTGAACCTGAACATTAGCGTGCCGG GCAACACCGATGAAAGCTATGATTGCGCGGTGTTTGTGCTGACCGTGCTGAAACAGTTT AGCAACTGCATGGCGGAACTGCAGGCGAAGGACCACGACCACGACCACGACCACGAC CACGAGtaataaGGTACCCTCGAGTCTGGTAAAGAAACCGCTGCTGCGAAATTTGAACGC CAGCACATGGACTCGTCTACTAGCGCAGCTTAATTAACCTAGGCTGCTGCCACCGCTG AGCAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTG AAAGGAGGAACTATATCCGGATTGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAG CGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGC GCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTC AAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGAC CCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGG TTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTG GAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTT CGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAA TATTAACGTTTACAATTTCTGGCGGCACGATGGCATGAGATTATCAAAAAGGATCTTCAC CTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACT TGGTCTTtaGGACCCACTTTCACATTTAAGTTGTTTTTCTAATCCGCAgATGATCAATTCA AGGCCGAATAAGAAGGCTGGCTCTGCACCTTGGTGATCAAATAATTCGATAGCTTGTCG TAATAATGGCGGCATACTATCAGTAGTAGGTGTTTCCCTTTCTTCTTTAGCGACTTGATG CTCTTGATCTTCCAATACGCAACCTAAAGTAAAATGCCCCACAGCGCTGAGTGCATATA ATGCATTCTCTAGTGAAAAACCTTGTTGGCATAAAAAGGCTAATTGATTTTCGAGAGTTT CATACTGTTTTTCTGTAGGCCGTGTACCTAAATGTACTTTTGCTCCATCGCGATGACTTA GTAAAGCACATCTAAAACTTTTAGCGTTATTACGTAAAAAATCTTGCCAGCTTTCCCCTT CTAAAGGGCAAAAGTGAGTATGGTGCCTATCTAACATCTCAATGGCTAAGGCGTCGAG CAAAGCCCGCTTATTTTTTACATGCCAATACAATGTAGGCTGCTCTACACCTAGCTTCTG GGCGAGTTTACGGGTTGTTAAACCTTCGATTCCGACCTCATTAAGCAGCTCTAATGCGC TGTTAATCACTTTACTTTTATCTAATCTAGACATcattaattccTACCAATGCTTAATCAGTGAG GCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGT GTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCG CGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGG
CCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGC CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGC TACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCC AACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTC GGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGC AGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGA GTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCG GCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGG AAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGA TGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTG GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGA AATGTTGAATACTCATACTCTTCCTTTTTCAATCATGATTGAAGCATTTATCAGGGTTATT GTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGTCATGACC AAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAA AGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACC ACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGG TAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTA GGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTT ACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGA TAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCC AGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAA GCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTC GGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGT CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGG GGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTG CTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTAT TACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGA GTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTG TGCGGTATTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCA TAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCG ACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGC TTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCA TCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGAT TCACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAA TGTCTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTG ATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGA GAGGATGCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGT GAGGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGT CAATGCCAGCGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATC CTGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACT TTACGAAACACGGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGC AGCAGCAGTCGCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGG CAACCCCGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCTAGTCATGC CCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGAG ATCCCGGTGCCTAATGAGTGAGCTAACTTACATTAATTGCGTTGCGCTCACTGCCCGCT TTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGA GAGGCGGTTTGCGTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCA ACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCT GGTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACAT GAGCTGTCTTCGGTATCGTCGTATCCCACTACCGAGATGTCCGCACCAACGCGCAGCC CGGACTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCAT
CGCAGTGGGAACGATGCCCTCATTCAGCATTTGCATGGTTTGTTGAAAACCGGACATG GCACTCCAGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATATTT ATGCCAGCCAGCCAGACGCAGACGCGCCGAGACAGAACTTAATGGGCCCGCTAACAG CGCGATTTGCTGGTGACCCAATGCGACCAGATGCTCCACGCCCAGTCGCGTACCGTCT TCATGGGAGAAAATAATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGC CGGAACATTAGTGCAGGCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGCGGATAG TTAATGATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTACAGG CTTCGACGCCGCTTCGTTCTACCATCGACACCACCACGCTGGCACCCAGTTGATCGGC GCGAGATTTAATCGCCGCGACAATTTGCGACGGCGCGTGCAGGGCCAGACTGGAGGT GGCAACGCCAATCAGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGA ATGTAATTCAGCTCCGCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCAGAAACGTG GCTGGCCTGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATACTCTGC GACATCGTATAACGTTACTGGTTTCACATTCACCACCCTGAATTGACTCTCTTCCGGGC GCTATCATGCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGAC GCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTT GAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCC GGCCACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTG GCGAGCCCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACC TGTGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGGATCGAGATCGATCT CGATCCCGCGAAATTAATACGACTCACTATA SEQ ID NO:28 – Sequence of vector that expresses one woodchuck hepatitis virus monomer subunit (containing Barstar binding protein) GGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAG GAGATATACCATGGACATCGACCCATACAAAGAATTTGGAAGTAGTTACCAGCTTCTGA ATTTCTTACCTCTTGACTTCTTTCCAGACTTGAACGCGCTGGTAGACACAGCGACAGCT TTATATGAAGAGGAGCTGACAGGCCGCGAGCACTGCTCACCTCATCATACGGCCATCG ATCAAGCACTGGTATGCTGGGATGAGTTGACTAAGTTGATTGCTGACATGTCATCTAAC ATCACCTCCAAAGCAGCGGCCGAGAAGGCAGCAGCAGAAAAAAAGGCTGTAATTAACG GTGAGCAGATCCGCAGTATTAGCGACTTACACCAAACATTGAAAAAGGAACTTGCCTTA CCCGAGTACTATGGCGAAAACCTGGATGCTCTGTGGGACGCTTTAACAGGATGGGTCG AGTACCCGTTGGTGTTAGAATGGCGCCAGTTCGAGCAGAGTAAGCAATTGACTGAGAA TGGTGCCGAATCCGTATTACAAGTATTCCGCGAAGCCAAGGCCGAGGGGGCAGATATC ACTATTGAACTGTCTAAAGCCGCGGCCGAACAGGTACGCACTATCATTGTGAACCATGT CAATGACACCTGGGGGTTAGATGTCCGTCAATCCCTTTGGTTTCATTTATCGTGCTTAA CTTTCGGTCAGCACACTGTTCAAGAGTTTCTGGTCTCGTTTGGAGTATGGATTCGCACG CCTGCGCCATATCGCCCGCCTAATGCGCCTATTCTGTCTACCTTGCCCGAACATACGGT TATTTAGTGAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGCATA ACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACGGCCGCATAAT AATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTT GTTTAACTTTAAGAAGGAGATATACCATGGATATTGATCCGTATAAAAAATTTGGCAGCA GCTATCAGCTGCTGAACTTTCTGCCGCTGGATTTTTTTCCGGATCTGAACGCGCTGGTG GATACCGCGACCGCGCTGTATGAAGAAGAACTGACCGGCCGCGAACATTGCAGCCCG CATCATACCGCGATTCGCCAAGCGCTGGTGTGCTGGGATAAACTGACCGAACTGATTG CGTGGATGAGCAGCAACATTACGAGCAAACAAGTGGATACCATTATTGTGAACAAAGTG AACGATACCTGGGGCCTGAAAGTGCGTCAGAGCCTGTGGTTTCATCTGAGCTGCCTGA CCTTTGGTCAGCATACCGTGCAAGAATTTCTGGTGAGCTTTGGCGTGTGGATTCGCACC CCGGCGCCGTATCGCCCGCCGAACGCGCCGATTCTGAGCACCCTGCCGGAACATACC GTGATTTAAGGTACCCTCGAGTCTGGTAAAGAAACCGCTGCTGCGAAATTTGAACGCCA
GCACATGGACTCGTCTACTAGCGCAGCTTAATTAACCTAGGCTGCTGCCACCGCTGAG CAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAA AGGAGGAACTATATCCGGATTGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGC GCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCG CCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCA AGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACC CCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGT TTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGG AACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTC GGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAAT ATTAACGTTTACAATTTCTGGCGGCACGATGGCATGAGATTATCAAAAAGGATCTTCAC CTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACT TGGTCTTtaGGACCCACTTTCACATTTAAGTTGTTTTTCTAATCCGCAgATGATCAATTCA AGGCCGAATAAGAAGGCTGGCTCTGCACCTTGGTGATCAAATAATTCGATAGCTTGTCG TAATAATGGCGGCATACTATCAGTAGTAGGTGTTTCCCTTTCTTCTTTAGCGACTTGATG CTCTTGATCTTCCAATACGCAACCTAAAGTAAAATGCCCCACAGCGCTGAGTGCATATA ATGCATTCTCTAGTGAAAAACCTTGTTGGCATAAAAAGGCTAATTGATTTTCGAGAGTTT CATACTGTTTTTCTGTAGGCCGTGTACCTAAATGTACTTTTGCTCCATCGCGATGACTTA GTAAAGCACATCTAAAACTTTTAGCGTTATTACGTAAAAAATCTTGCCAGCTTTCCCCTT CTAAAGGGCAAAAGTGAGTATGGTGCCTATCTAACATCTCAATGGCTAAGGCGTCGAG CAAAGCCCGCTTATTTTTTACATGCCAATACAATGTAGGCTGCTCTACACCTAGCTTCTG GGCGAGTTTACGGGTTGTTAAACCTTCGATTCCGACCTCATTAAGCAGCTCTAATGCGC TGTTAATCACTTTACTTTTATCTAATCTAGACATcattaattccTACCAATGCTTAATCAGTGAG GCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGT GTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCG CGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGG CCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGC CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGC TACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCC AACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTC GGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGC AGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGA GTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCG GCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGG AAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGA TGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTG GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGA AATGTTGAATACTCATACTCTTCCTTTTTCAATCATGATTGAAGCATTTATCAGGGTTATT GTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGTCATGACC AAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAA AGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACC ACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGG TAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTA GGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTT ACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGA TAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCC AGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAA GCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTC GGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGT CCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGG GGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTG
CTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTAT TACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGA GTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTG TGCGGTATTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCA TAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCG ACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGC TTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCA TCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGAT TCACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAA TGTCTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTG ATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGA GAGGATGCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGT GAGGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGT CAATGCCAGCGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATC CTGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACT TTACGAAACACGGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGC AGCAGCAGTCGCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGG CAACCCCGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCTAGTCATGC CCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGAG ATCCCGGTGCCTAATGAGTGAGCTAACTTACATTAATTGCGTTGCGCTCACTGCCCGCT TTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGA GAGGCGGTTTGCGTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCA ACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCT GGTTTGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACAT GAGCTGTCTTCGGTATCGTCGTATCCCACTACCGAGATGTCCGCACCAACGCGCAGCC CGGACTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCAT CGCAGTGGGAACGATGCCCTCATTCAGCATTTGCATGGTTTGTTGAAAACCGGACATG GCACTCCAGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATATTT ATGCCAGCCAGCCAGACGCAGACGCGCCGAGACAGAACTTAATGGGCCCGCTAACAG CGCGATTTGCTGGTGACCCAATGCGACCAGATGCTCCACGCCCAGTCGCGTACCGTCT TCATGGGAGAAAATAATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGC CGGAACATTAGTGCAGGCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGCGGATAG TTAATGATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTACAGG CTTCGACGCCGCTTCGTTCTACCATCGACACCACCACGCTGGCACCCAGTTGATCGGC GCGAGATTTAATCGCCGCGACAATTTGCGACGGCGCGTGCAGGGCCAGACTGGAGGT GGCAACGCCAATCAGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGA ATGTAATTCAGCTCCGCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCAGAAACGTG GCTGGCCTGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATACTCTGC GACATCGTATAACGTTACTGGTTTCACATTCACCACCCTGAATTGACTCTCTTCCGGGC GCTATCATGCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGAC GCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTT GAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCC GGCCACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTG GCGAGCCCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACC TGTGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGGATCGAGATCGATCT CGATCCCGCGAAATTAATACGACTCACTATA SEQ ID NO:29 – Sequence of vector that expresses one woodchuck hepatitis virus monomer subunit (without a binding protein)
GGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAG GAGATATACCATGgcacaggttatcaacacgtttgacggggttgcggattatcttcagacatatcataagctacctgataat tacattacaaaatcagaagcacaagccctcggctgggtggcatcaaaagggaaccttgcagacgtcgctccggggaaaagc atcggcggagacatcttctcaaacagggaaggcaaactcccgggcaaaagcggacgaacatggcgtTGGgcggatattaa ctatacatcaggcttcagaaattcagaccggattctttactcaagcgactggctgatttacaaaacaacggaccattatcagaccttt acaaaaatcagaGGAGGTTTAGCTGAGGCAGCTGCTAAGGAAGCTGCCGCAAAGGCTGCA AAAGAAGATCTGCGCACCACCATTGATCTGCTGAAACAGGAAAGCCAGGATCTGTATAA CAACTATAGCATTAAACAGGCGAGCGGCATGAGCGCGGATGAAAGCATTCAGCTGCCG TGCTTTAGCCTGGATCGCGAAGCGCTGACCAACATTAGCGTGATTATTGCGCATCTGGA AAAAGTGAAAGTGCTGAGCGAAAACACCGTGGATACCAGCTGGGTGATTCGCTGGCTG ACCAACATTAGCTGCTTTAACCCGCTGAACCTGAACATTAGCGTGCCGGGCAACACCG ATGAAAGCTATGATTGCGCGGTGTTTGTGCTGACCGTGCTGAAACAGTTTAGCAACTGC ATGGCGGAACTGCAGGCGAAGGACCACGACCACGACCACGACCACGACCACGAGtaata aGGTACCCTCGAGTCTGGTAAAGAAACCGCTGCTGCGAAATTTGAACGCCAGCACATG GACTCGTCTACTAGCGCAGCTTAATTAACCTAGGCTGCTGCCACCGCTGAGCAATAACT AGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGA ACTATATCCGGATTGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGG GTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTC CTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTA AATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAA ACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGC CCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAAC ACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTA TTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACG TTTACAATTTCTGGCGGCACGATGGCATGAGATTATCAAAAAGGATCTTCACCTAGATC CTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTTt aGGACCCACTTTCACATTTAAGTTGTTTTTCTAATCCGCAgATGATCAATTCAAGGCCGA ATAAGAAGGCTGGCTCTGCACCTTGGTGATCAAATAATTCGATAGCTTGTCGTAATAAT GGCGGCATACTATCAGTAGTAGGTGTTTCCCTTTCTTCTTTAGCGACTTGATGCTCTTGA TCTTCCAATACGCAACCTAAAGTAAAATGCCCCACAGCGCTGAGTGCATATAATGCATT CTCTAGTGAAAAACCTTGTTGGCATAAAAAGGCTAATTGATTTTCGAGAGTTTCATACTG TTTTTCTGTAGGCCGTGTACCTAAATGTACTTTTGCTCCATCGCGATGACTTAGTAAAGC ACATCTAAAACTTTTAGCGTTATTACGTAAAAAATCTTGCCAGCTTTCCCCTTCTAAAGG GCAAAAGTGAGTATGGTGCCTATCTAACATCTCAATGGCTAAGGCGTCGAGCAAAGCC CGCTTATTTTTTACATGCCAATACAATGTAGGCTGCTCTACACCTAGCTTCTGGGCGAG TTTACGGGTTGTTAAACCTTCGATTCCGACCTCATTAAGCAGCTCTAATGCGCTGTTAAT CACTTTACTTTTATCTAATCTAGACATcattaattccTACCAATGCTTAATCAGTGAGGCACCT ATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATA ACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACC CACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGC GCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAA GCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGG CATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGAT CAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCT CCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACT GCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTC AACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCA ATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACG TTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAAC CCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGA GCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTT
GAATACTCATACTCTTCCTTTTTCAATCATGATTGAAGCATTTATCAGGGTTATTGTCTCA TGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGTCATGACCAAAATC CCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATC TTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCT ACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTG GCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCAC CACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGT GGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTA CCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTG GAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCA CGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAG GAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCG GGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAG CCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTT TTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCT TTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGA GCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTAT TTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAG CCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGC CAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACA AGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAA CGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGATTCACAGATG TCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTGGCT TCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCTCCG TGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGATGCT CACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAAA CAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAG CGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGC AGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAAC ACGGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAG TCGCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCG CCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCTAGTCATGCCCCGCGC CCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGAGATCCCGG TGCCTAATGAGTGAGCTAACTTACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGT CGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCG GTTTGCGTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAGCT GATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTGGTTTG CCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACATGAGCTG TCTTCGGTATCGTCGTATCCCACTACCGAGATGTCCGCACCAACGCGCAGCCCGGACT CGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCATCGCAGT GGGAACGATGCCCTCATTCAGCATTTGCATGGTTTGTTGAAAACCGGACATGGCACTCC AGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATATTTATGCCAG CCAGCCAGACGCAGACGCGCCGAGACAGAACTTAATGGGCCCGCTAACAGCGCGATT TGCTGGTGACCCAATGCGACCAGATGCTCCACGCCCAGTCGCGTACCGTCTTCATGGG AGAAAATAATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGAACA TTAGTGCAGGCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGCGGATAGTTAATGAT CAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTACAGGCTTCGACG CCGCTTCGTTCTACCATCGACACCACCACGCTGGCACCCAGTTGATCGGCGCGAGATT TAATCGCCGCGACAATTTGCGACGGCGCGTGCAGGGCCAGACTGGAGGTGGCAACGC CAATCAGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTAATT CAGCTCCGCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCAGAAACGTGGCTGGCC
TGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATACTCTGCGACATCGT ATAACGTTACTGGTTTCACATTCACCACCCTGAATTGACTCTCTTCCGGGCGCTATCAT GCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGACGCTCTCCC TTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCG CCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGG GGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCC GATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGC CGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGGATCGAGATCGATCTCGATCCCG CGAAATTAATACGACTCACTATA SEQ ID NO:30 – Sequence of vector that expresses one woodchuck hepatitis virus capsid monomer subunit (without a binding protein), one woodchuck hepatitis virus capsid monomer subunit including a binding protein (lm7), and an epitope protein ( modified murine IL31) fused to both Colicin E7 (via an alpha helical linker) and an affinity ligand (internal ID DU73351).GGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTGTTTAACTT TAAGAAGGAGATATACCATGGACATCGATCCATATAAGGAGTTTGGCTCCTCATACCAA TTACTTAATTTCCTTCCCTTGGACTTTTTCCCCGACCTTAACGCCCTGGTTGACACGGCT ACGGCGCTTTACGAAGAGGAATTAACGGGGCGTGAACATTGTTCACCTCATCACACGG CCATTGATCAAGCGTTGGTATGCTGGGATGAACTTACGAAACTGATTGCCGATATGAGT TCAAATATTACGAGCAAAGCAGCAGCCGAGAAGGCAGCAGCCGAAGAACTGAAAAATA GCATTTCAGACTACACCGAAGCAGAATTTGTGCAGTTACTGAAAGAGATCGAGAAGGAG AACGTAGCCGCAACCGATGACGTGCTTGATGTCCTGCTTGAACATTTAGTAAAGATTAC GGAACATCCAGACGGTACGGATTTAATCTATTATCCTAGTGACAATCGCGACGACAGTC CAGAAGGCATCGTAAAGGAGATTAAAGAATGGCGTGCTGCAAACGGAAAGCCTGGGTT TAAGCAGGGTGGAAAAGCTGCGGCAGAACAGGTGCGTACCATTATCGTAAATCACGTC AATGATACCTGGGGTCTTGATGTTCGTCAGTCCCTGTGGTTTCACCTTTCATGCTTGAC GTTTGGTCAGCACACAGTACAGGAGTTCCTTGTTTCTTTCGGGGTATGGATTCGTACAC CAGCTCCTTATCGCCCTCCTAACGCACCTATTTTATCCACGTTACCTGAACATACCGTTA TTTAGTGAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGCATAA CCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACGGCCGCATAATA ATACGACTCACTATAGGGGAATTGTGAGCGGATAACAATTCCCCTCTAGAAATAATTTTG TTTAACTTTAAGAAGGAGATATACCATGGATATTGATCCGTATAAAAAATTTGGCAGCAG CTATCAGCTGCTGAACTTTCTGCCGCTGGATTTTTTTCCGGATCTGAACGCGCTGGTGG ATACCGCGACCGCGCTGTATGAAGAAGAACTGACCGGCCGCGAACATTGCAGCCCGC ATCATACCGCGATTCGCCAAGCGCTGGTGTGCTGGGATAAACTGACCGAACTGATTGC GTGGATGAGCAGCAACATTACGAGCAAACAAGTGGATACCATTATTGTGAACAAAGTGA ACGATACCTGGGGCCTGAAAGTGCGTCAGAGCCTGTGGTTTCATCTGAGCTGCCTGAC CTTTGGTCAGCATACCGTGCAAGAATTTCTGGTGAGCTTTGGCGTGTGGATTCGCACCC CGGCGCCGTATCGCCCGCCGAACGCGCCGATTCTGAGCACCCTGCCGGAACATACCG TGATTTAATAAGCGGCCGCTTAACCTAGGCTGCTGCCACCGCTGAGCAATAACTAGCAT AACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGTTGTACACGGCCGCATA ATCGgttgacactctatcattgatagagttattttaccactccctatcagtgatagagaaAagtgaaatgaatagttcgacaaaa atctagataacgagggcaaCATATGGAGAGTAAACGGAATAAGCCAGGGAAGGCAACAGGTAA AGGAAAACCTGTCAATAATAAGTGGTTAAATAATGCAGGTAAAGACTTAGGTTCTCCTGT TCCAGATCGTATAGCTAATAAACTACGTGATAAGGAGTTTAAAAGTTTCGATGATTTTCG TAAGAAATTCTGGGAAGAAGTGTCAAAAGATCCTGAGTTAAGTAAACAATTTAGTCGAAA CAATAATGATCGAATGAAGGTTGGAAAAGCGCCCAAGACTAGAACCCAGGATGTTTCAG GGAAGGCAACTTCATTCGCACTTCATCATGAGAAGCCGATCAGCCAAAATGGTGGTGT CTATGATATGGATAACATCAGCGTGGTAACACCTAAACGTGCTATTGATATTCACCGACT GGCGGAAGCGGCGGCGAAAGAAGCGGCGGCGAAAGAAGCGGCGAAAGCGGCGAAAG AAGATCTGCGCACCACCATTGATCTGCTGAAACAGGAAAGCCAGGATCTGTATAACAAC
TATAGCATTAAACAGGCGAGCGGCATGAGCGCGGATGAAAGCATTCAGCTGCCGTGCT TTAGCCTGGATCGCGAAGCGCTGACCAACATTAGCGTGATTATTGCGCATCTGGAAAAA GTGAAAGTGCTGAGCGAAAACACCGTGGATACCAGCTGGGTGATTCGCTGGCTGACCA ACATTAGCTGCTTTAACCCGCTGAACCTGAACATTAGCGTGCCGGGCAACACCGATGA AAGCTATGATTGCGCGGTGTTTGTGCTGACCGTGCTGAAACAGTTTAGCAACTGCATGG CGGAACTGCAGGCGGGCAGCGGCGGCAGCCATGATCATGATCATGATCATGATCATGA AtaataaGGTACCgTCGAGTCTGGTAAAGAAACCGCTGCTGCGAAATTTGAACGCCAGCA CATGGACTCGTCTACTAGCGCAGCTTAATTAACCTAGGCTGCTGCCACCGCTGAGCAAT AACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGA GGAACTATATCCGGATTGGCGAATGGGACGCGCCCTGTAGCGGCGCATTAAGCGCGG CGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCG CTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCT CTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAA AAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTC GCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACA ACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCC TATTGGTTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAA CGTTTACAATTTCTGGCGGCACGATGGCATGAGATTATCAAAAAGGATCTTCACCTAGA TCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTC TTtaGGACCCACTTTCACATTTAAGTTGTTTTTCTAATCCGCAgATGATCAATTCAAGGCC GAATAAGAAGGCTGGCTCTGCACCTTGGTGATCAAATAATTCGATAGCTTGTCGTAATA ATGGCGGCATACTATCAGTAGTAGGTGTTTCCCTTTCTTCTTTAGCGACTTGATGCTCTT GATCTTCCAATACGCAACCTAAAGTAAAATGCCCCACAGCGCTGAGTGCATATAATGCA TTCTCTAGTGAAAAACCTTGTTGGCATAAAAAGGCTAATTGATTTTCGAGAGTTTCATAC TGTTTTTCTGTAGGCCGTGTACCTAAATGTACTTTTGCTCCATCGCGATGACTTAGTAAA GCACATCTAAAACTTTTAGCGTTATTACGTAAAAAATCTTGCCAGCTTTCCCCTTCTAAA GGGCAAAAGTGAGTATGGTGCCTATCTAACATCTCAATGGCTAAGGCGTCGAGCAAAG CCCGCTTATTTTTTACATGCCAATACAATGTAGGCTGCTCTACACCTAGCTTCTGGGCG AGTTTACGGGTTGTTAAACCTTCGATTCCGACCTCATTAAGCAGCTCTAATGCGCTGTTA ATCACTTTACTTTTATCTAATCTAGACATcattaattccTACCAATGCTTAATCAGTGAGGCAC CTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAG ATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAG ACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGA GCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGG AAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACA GGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAAC GATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGT CCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGC ACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTA CTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCG TCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAA ACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGT AACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGG TGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT GTTGAATACTCATACTCTTCCTTTTTCAATCATGATTGAAGCATTTATCAGGGTTATTGTC TCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGTCATGACCAAA ATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGG ATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACC GCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAA CTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGC CACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACC
AGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAG TTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGC TTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCG CCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAA CAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGT CGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGG AGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGC CTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCG CCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAG TGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGG TATTTCACACCGCATATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTA AGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCC GCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAG ACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCG AAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGATTCACAG ATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTG GCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCT CCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGAT GCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGTGAGGGT AAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCC AGCGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGAT GCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAA ACACGGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGC AGTCGCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCC CGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCTAGTCATGCCCCGC GCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGAGATCCC GGTGCCTAATGAGTGAGCTAACTTACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCA GTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGG CGGTTTGCGTATTGGGCGCCAGGGTGGTTTTTCTTTTCACCAGTGAGACGGGCAACAG CTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTGGTT TGCCCCAGCAGGCGAAAATCCTGTTTGATGGTGGTTAACGGCGGGATATAACATGAGC TGTCTTCGGTATCGTCGTATCCCACTACCGAGATGTCCGCACCAACGCGCAGCCCGGA CTCGGTAATGGCGCGCATTGCGCCCAGCGCCATCTGATCGTTGGCAACCAGCATCGCA GTGGGAACGATGCCCTCATTCAGCATTTGCATGGTTTGTTGAAAACCGGACATGGCACT CCAGTCGCCTTCCCGTTCCGCTATCGGCTGAATTTGATTGCGAGTGAGATATTTATGCC AGCCAGCCAGACGCAGACGCGCCGAGACAGAACTTAATGGGCCCGCTAACAGCGCGA TTTGCTGGTGACCCAATGCGACCAGATGCTCCACGCCCAGTCGCGTACCGTCTTCATG GGAGAAAATAATACTGTTGATGGGTGTCTGGTCAGAGACATCAAGAAATAACGCCGGAA CATTAGTGCAGGCAGCTTCCACAGCAATGGCATCCTGGTCATCCAGCGGATAGTTAAT GATCAGCCCACTGACGCGTTGCGCGAGAAGATTGTGCACCGCCGCTTTACAGGCTTCG ACGCCGCTTCGTTCTACCATCGACACCACCACGCTGGCACCCAGTTGATCGGCGCGAG ATTTAATCGCCGCGACAATTTGCGACGGCGCGTGCAGGGCCAGACTGGAGGTGGCAA CGCCAATCAGCAACGACTGTTTGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTA ATTCAGCTCCGCCATCGCCGCTTCCACTTTTTCCCGCGTTTTCGCAGAAACGTGGCTGG CCTGGTTCACCACGCGGGAAACGGTCTGATAAGAGACACCGGCATACTCTGCGACATC GTATAACGTTACTGGTTTCACATTCACCACCCTGAATTGACTCTCTTCCGGGCGCTATC ATGCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGACGCTCTC CCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCAC CGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCAC GGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGC CCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGC
GCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGGATCGAGATCGATCTCGATCC CGCGAAATTAATACGACTCACTATA SEQ ID NO:31 – Colicin E7 protein encoded by SEQ ID NO: 30 nts 1687-2082: ESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRIANKLRDKEFKSFDDFRKKFWEEV SKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKATSFALHHEKPISQNGGVYDMDNISV VTPKRAIDIHR SEQ ID NO:32 – Colicin E7 nucleic acid sequence used in SEQ ID NO: 30 nts 1687-2082 - DNA ATGGAGAGTAAACGGAATAAGCCAGGGAAGGCAACAGGTAAAGGAAAACCTGTCAATA ATAAGTGGTTAAATAATGCAGGTAAAGACTTAGGTTCTCCTGTTCCAGATCGTATAGCTA ATAAACTACGTGATAAGGAGTTTAAAAGTTTCGATGATTTTCGTAAGAAATTCTGGGAAG AAGTGTCAAAAGATCCTGAGTTAAGTAAACAATTTAGTCGAAACAATAATGATCGAATGA AGGTTGGAAAAGCGCCCAAGACTAGAACCCAGGATGTTTCAGGGAAGGCAACTTCATT CGCACTTCATCATGAGAAGCCGATCAGCCAAAATGGTGGTGTCTATGATATGGATAACA TCAGCGTGGTAACACCTAAACGTGCTATTGATATTCACCGA SEQ ID NO: 33 - Helix-Linker protein used in SEQ ID NO: 30 nts 2083-2136 between Colicin E7 and murine IL31 LAEAAAKEAAAKEAAKAA SEQ ID NO: 34 - nucleic acid sequence of Helix-Linker used in SEQ ID NO: 30 nts 2083-2136 between Colicin E7 and murine IL31 - DNA CTGGCGGAAGCGGCGGCGAAAGAAGCGGCGGCGAAAGAAGCGGCGAAAGCGGCG SEQ ID NO: 35 ––modified murine IL31 protein epitope - mIL31K/A (with specified receptor inactivating mutation K116->A) – encoded in SEQ ID NO: 30 by nt 2137-2505 KEDLRTTIDLLKQESQDLYNNYSIKQASGMSADESIQLPCFSLDREALTNISVIIAHLEKVKVL SENTVDTSWVIRWLTNISCFNPLNLNISVPGNTDESYDCAVFVLTVLKQFSNCMAELQA SEQ ID NO: 36 –nucleic acid sequence of modified murine IL31 epitope - mIL31K/A (with specified receptor inactivating mutation K116->A) –used in SEQ ID NO: 30 nts 2137-2505 – DNA AAAGAAGATCTGCGCACCACCATTGATCTGCTGAAACAGGAAAGCCAGGATCTGTATAA CAACTATAGCATTAAACAGGCGAGCGGCATGAGCGCGGATGAAAGCATTCAGCTGCCG TGCTTTAGCCTGGATCGCGAAGCGCTGACCAACATTAGCGTGATTATTGCGCATCTGGA AAAAGTGAAAGTGCTGAGCGAAAACACCGTGGATACCAGCTGGGTGATTCGCTGGCTG ACCAACATTAGCTGCTTTAACCCGCTGAACCTGAACATTAGCGTGCCGGGCAACACCG
ATGAAAGCTATGATTGCGCGGTGTTTGTGCTGACCGTGCTGAAACAGTTTAGCAACTGC ATGGCGGAACTGCAGGCG SEQ ID NO: 37 - affinity tag protein used for purification – encoded in SEQ ID NO: 30 by nt 2506-2550 GSGGSHDHDHDHDHE SEQ ID NO: 38 – nucleic acid sequence of affinity tag used for purification – encoded in SEQ ID NO: 30 by nt 2506-2550 – DNA GGCAGCGGCGGCAGCCATGATCATGATCATGATCATGATCATGAA References: Jumper, J et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021). Varadi, M et al. AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Research (2021). Kukreja et al. J Virol.2014 Dec;88(24):14105-15. doi: 10.1128/JVI.01840-14. Structurally similar woodchuck and human hepadnavirus core proteins have distinctly different temperature dependences of assembly Konczal et al, PLoS One 2019 14(4): e0215892, https://doi.org/10.1371/journal.pone.0215892 Re-introducing non-optimal synonymous codons into codon-optimized constructs enhances soluble recovery of recombinant proteins from Escherichia coli.
Claims
CLAIMS: 1. A virus-like particle (VLP) comprising: - One or more viral capsid protein heterodimer(s) each comprising a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence, and; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
2. A VLP according to claim 1, wherein the first amino acid sequence and the second amino acid sequence comprise at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
3. A VLP according to claims 1 or 2 wherein the binding molecule is capable of binding to a functional molecule.
4. A VLP according to any preceding claim wherein the binding molecule is a binding protein.
5. A VLP according to any preceding claim, wherein the binding protein comprises a bacterial toxin inhibitor.
6. A VLP according to claim 5 wherein the wherein the bacterial toxin inhibitor is selected from Im7, Im8, Im9, Im2, and Barstar.
7. A VLP according to any preceding claim, wherein the first amino acid sequence comprises a mutation at position E8 of SEQ ID NO: 1 or 16, or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position R56 of SEQ ID NO: 1 or 16, or a position corresponding thereto, preferably wherein the mutation at position E8 or a position corresponding thereto is E8K, and preferably wherein the mutation at position R56 or a position corresponding thereto is R56D.
8. A VLP according to any preceding claim, wherein the first amino acid sequence comprises a mutation at position E64 of SEQ ID NO: 1 or 16, or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position K96 of SEQ ID NO: 1 or 16, or a position corresponding thereto, preferably wherein the mutation at position E64 or a position corresponding thereto is E64K, and preferably wherein the mutation at position K96 or a position corresponding thereto is K96D, optionally wherein the first amino acid sequence comprises a mutation at position K67 of SEQ ID NO:16, or a position corresponding thereto, preferably wherein the mutation at K67 is K67E.
9. A VLP according to any preceding claim, wherein the first amino acid sequence comprises mutations at positions R82 and E79 of SEQ ID NO: 16, or a position corresponding thereto, preferably wherein the mutation at position R82 or a position corresponding thereto is R82D, and preferably wherein the mutation at position E79 or a position corresponding thereto is E79K.
10. A VLP according to any preceding claim, wherein the first amino acid sequence comprises a mutation at position H88 of SEQ ID NO: 16 or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position W71, and optionally a deletion at position S78, of SEQ ID NO: 16 or a position corresponding thereto, preferably wherein the mutation at position H88 or a position corresponding thereto is H88K, and preferably wherein the mutation at position W71 or a position corresponding thereto is W71D.
11. A VLP according to any preceding claim wherein the first amino acid sequence comprises the sequence set out in SEQ ID NO: 2 or 3, or a sequence having at least 70% identity thereto, and wherein the second amino acid sequence comprises the sequence set out in SEQ ID NO:4 or 5 or a sequence having at least 70% identity thereto.
12. A VLP according to any preceding claim, wherein the binding molecule is attached to the second amino acid sequence.
13. A VLP according to any preceding claim wherein the viral capsid protein heterodimer is a hepatitis capsid protein heterodimer, preferably a Woodchuck hepatitis capsid protein heterodimer.
14. A viral capsid protein heterodimer comprising: - a first monomer encoded by a first amino acid sequence and a second monomer encoded by a second amino acid sequence; - a binding molecule attached exclusively to either the first monomer or the second monomer, and; - wherein the first amino acid sequence and/or the second amino acid sequence comprises at least one mutation which modifies the electrostatic interaction between the first and second monomers to promote association thereof.
15. The viral capsid protein heterodimer according to claim 14, wherein the binding molecule is capable of binding to a functional molecule.
16. The viral capsid protein heterodimer according to claims 14 or 15, wherein the binding molecule is a binding protein.
17. The viral capsid protein heterodimer according to any of claims 14 to 16, wherein the binding protein comprises a bacterial toxin inhibitor.
18. The viral capsid protein heterodimer according to any of claims 14 to 17, wherein the bacterial toxin inhibitor is selected from Im7, Im8, Im9, Im2, and Barstar.
19. The viral capsid protein heterodimer according to any of claims 14 to 18, wherein the first amino acid sequence comprises a mutation at position E8 of SEQ ID NO: 1 or 16 or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position R56 of SEQ ID NO: 1 or 16 or a position corresponding thereto, preferably wherein the mutation at position E8 or a position corresponding thereto is E8K, and preferably wherein the mutation at position R56 or a position corresponding thereto is R56D.
20. The viral capsid protein heterodimer according to any of claims 14 to 19, wherein the first amino acid sequence comprises a mutation at position E64 of SEQ ID NO: 1 or 16 or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position K96 of SEQ ID NO: 1 or 16, or a position corresponding thereto, preferably wherein the mutation at position E64 or a position corresponding thereto is E64K, and preferably wherein the mutation at position K96 or a position corresponding thereto is K96D, optionally wherein the first amino acid sequence comprises a mutation at position K67 of SEQ ID NO:16, or a position corresponding thereto, preferably wherein the mutation at K67 is K67E.
21. The viral capsid protein heterodimer according to any of claims 14 to 20, wherein the first amino acid sequence comprises a mutation at position R82 and/or E79 of SEQ ID NO: 16 or a position corresponding thereto, preferably wherein the mutation at position R82 or a position corresponding thereto is R82D, and preferably wherein the mutation at position E79 or a position corresponding thereto is E79K.
22. The viral capsid protein heterodimer according to any of claims 14 to 21, wherein the first amino acid sequence comprises a mutation at position H88 of SEQ ID NO: 16 or a position corresponding thereto, and the second amino acid sequence comprises a mutation at position W71, and optionally a deletion at position S78, of SEQ ID NO: 16 or a position corresponding thereto, preferably wherein the mutation at position H88 or a position corresponding thereto is H88K, and preferably wherein the mutation at position W71 or a position corresponding thereto is W71D.
23. The viral capsid protein heterodimer according to any of claims 14 to 22, wherein the first amino acid sequence comprises the sequence set out in SEQ ID NO: 2 or 3 or a sequence
having at least 70% identity thereto, and wherein the second amino acid sequence comprises the sequence set out in SEQ ID NO:4 or 5 or a sequence having at least 70% identity thereto.
24. A viral capsid protein heterodimer according to any of claims 14 to 23, wherein the viral capsid protein heterodimer is a hepatitis capsid protein heterodimer, preferably a Woodchuck hepatitis capsid protein heterodimer.
25. A viral capsid protein monomer encoded by an amino acid sequence - wherein the amino acid sequence comprises at least one mutation and, - the at least one mutation modifies the electrostatic interaction between the viral capsid protein monomer and other viral capsid protein monomers to promote heterodimeric association thereof, - wherein optionally the viral capsid protein monomer comprises a binding molecule attached thereto.
26. A viral capsid protein monomer according to claim 25, wherein the viral capsid protein monomer and other viral capsid protein monomer is a hepatitis viral capsid protein monomer, preferably a Woodchuck hepatitis capsid protein monomer.
27. A nucleic acid encoding the viral capsid protein heterodimer according to claim 14 or the viral capsid protein monomer according to claim 25.
28. A nucleic acid construct encoding the viral capsid protein heterodimer according to claim 14, the construct comprising; - a first nucleic acid sequence encoding a first viral capsid protein monomer according to claim 25; - a second nucleic acid sequence encoding a second viral capsid protein monomer according to claim 25; and -optionally a third nucleic acid sequence encoding a binding molecule attached exclusively to the first or second viral capsid protein monomer.
29. A process of producing a virus-like particle (VLP) in a single host cell comprising: a) Providing a host cell comprising i. a nucleic acid encoding a first viral capsid protein monomer according to claim 25, ii. a nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule according to claim 25,
iii. a nucleic acid encoding a functional molecule operable to bind to the binding molecule; b) Culturing the host cell under conditions to express the proteins from the nucleic acids; c) Forming functionalised viral capsid protein heterodimers; d) Forming virus-like particles from the functionalised viral capsid protein heterodimers .
30. A process of producing a virus-like particle (VLP), comprising; (a) Providing a first host cell comprising: i. a first nucleic acid encoding a first viral capsid protein monomer according to claim 25; and ii. a second nucleic acid encoding a second viral capsid protein monomer attached to a binding molecule according to claim 25, (b) Providing a second host cell comprising: i. a third nucleic acid encoding a functional molecule operable to bind to the binding molecule; (c) Culturing the host cells under conditions to express the proteins from the first, second, and third nucleic acids respectively; (d) Recovering the proteins; (e) Mixing the proteins to (i) form functionalised viral capsid protein heterodimers and (ii) to form virus-like particles from the functionalised viral capsid protein heterodimers.
31. A process according to claims 29 or 30, wherein the first and second viral capsid protein monomers are hepatitis capsid protein monomers, preferably Woodchuck hepatitis capsid protein monomers.
32. An immunogenic composition comprising the virus-like particle of any of claims 1 to 13.
33. A virus-like particle (VLP) of any of claims 1 to 13, or an immunogenic composition of claim 32 for use as a medicament.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2210507.6 | 2022-07-18 | ||
GBGB2210507.6A GB202210507D0 (en) | 2022-07-18 | 2022-07-18 | Virus-like particles, heterodimeric capsid proteins and methods of production thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024018188A1 true WO2024018188A1 (en) | 2024-01-25 |
Family
ID=84540295
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2023/051873 WO2024018188A1 (en) | 2022-07-18 | 2023-07-17 | Virus-like particles, heterodimeric capsid proteins and methods of production thereof |
Country Status (2)
Country | Link |
---|---|
GB (1) | GB202210507D0 (en) |
WO (1) | WO2024018188A1 (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683195A (en) | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
WO2013036973A2 (en) * | 2011-09-09 | 2013-03-14 | Biomed Realty, L.P. | Methods and compositions for controlling assembly of viral proteins |
WO2019222711A1 (en) * | 2018-05-17 | 2019-11-21 | Lumen Bioscience, Inc. | Arthrospira platensis oral vaccine delivery platform |
WO2021163538A1 (en) * | 2020-02-13 | 2021-08-19 | The Trustees Of Indiana University | Hepadnavirus capsid protein heterodimers and virus-like particles |
-
2022
- 2022-07-18 GB GBGB2210507.6A patent/GB202210507D0/en not_active Ceased
-
2023
- 2023-07-17 WO PCT/GB2023/051873 patent/WO2024018188A1/en unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683195A (en) | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4683195B1 (en) | 1986-01-30 | 1990-11-27 | Cetus Corp | |
WO2013036973A2 (en) * | 2011-09-09 | 2013-03-14 | Biomed Realty, L.P. | Methods and compositions for controlling assembly of viral proteins |
WO2019222711A1 (en) * | 2018-05-17 | 2019-11-21 | Lumen Bioscience, Inc. | Arthrospira platensis oral vaccine delivery platform |
WO2021163538A1 (en) * | 2020-02-13 | 2021-08-19 | The Trustees Of Indiana University | Hepadnavirus capsid protein heterodimers and virus-like particles |
Non-Patent Citations (25)
Title |
---|
"Current Protocols in Molecular Biology", 2000, WILEY AND SON INC |
"Handbook of Experimental Immunology", vol. I-IV, 1986, COLD SPRING HARBOR LABORATORY PRESS |
"Immunochemical Methods in Cell and Molecular Biology", 1987, COLD SPRING HARBOR LABORATORY |
ALTSCHUL ET AL., BLAST, 1990 |
ALTSCHUL ET AL., J MOL BIOL, vol. 154,156,185, 1990, pages 403 - 10 |
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10 |
CORPET ET AL., NUCLEIC ACIDS RES., vol. 16, 1988, pages 10881 - 90 |
HEIDI CECILIE LARSEN SPåNG ET AL: "Heterodimeric Barnase-Barstar Vaccine Molecules: Influence of One versus Two Targeting Units Specific for Antigen Presenting Cells", PLOS ONE, vol. 7, no. 9, 18 September 2012 (2012-09-18), pages e45393, XP055533610, DOI: 10.1371/journal.pone.0045393 * |
HIGGINSSHARP, CABIOS, vol. 5, 1989, pages 151 - 3 |
HIGGINSSHARP, GENE, vol. 73, 1988, pages 237 - 44 |
HUANG ET AL., COMP. APPL. BIOSCI., vol. 8, 1992, pages 155 - 65 |
JUMPER, J ET AL.: "Highly accurate protein structure prediction with AlphaFold", NATURE, 2021 |
KONCZAL ET AL., PLOS ONE, vol. 14, no. 4, 2019, pages e0215892, Retrieved from the Internet <URL:https://doi.org/10.1371/journal.pone.0215892> |
KUKREJA ET AL., J VIROL., vol. 88, no. 24, December 2014 (2014-12-01), pages 14105 - 15 |
MOLEDA ALEKSANDRA: "The Development and Characterisation of a Novel and Modular Virus-like Particle Platform", 1 February 2022 (2022-02-01), XP093084171, Retrieved from the Internet <URL:https://discovery.dundee.ac.uk/en/studentTheses/the-development-and-characterisation-of-a-novel-and-modular-virus> [retrieved on 20230921] * |
NEEDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
PEARSON ET AL., METHODS MOL. BIOL., vol. 24, 1994, pages 307 - 31 |
PEARSONLIPMAN, PROC. NATL. ACAD. SCI. U.S.A., vol. 85, 1988, pages 2444 |
PERBAL: "A Practical Guide to Molecular Cloning", 1984 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR |
SMITHWATERMAN, ADV. APPL. MATH., vol. 2, no. 482, 1981 |
TATIANA ET AL., FEMS MICROBIOL., vol. 174, 1999, pages 247 - 50 |
TATUSOVAMADDEN, FEMS MICROBIOL LETT, vol. 174, 1999, pages 247 - 250 |
VARADI, M ET AL.: "AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models", NUCLEIC ACIDS RESEARCH, 2021 |
WALKER ANDREAS ET AL: "SplitCore: An exceptionally versatile viral nanoparticle for native whole protein display regardless of 3D structure", SCIENTIFIC REPORTS, vol. 1, no. 1, 14 June 2011 (2011-06-14), XP093083978, Retrieved from the Internet <URL:https://www.nature.com/articles/srep00005.pdf> DOI: 10.1038/srep00005 * |
Also Published As
Publication number | Publication date |
---|---|
GB202210507D0 (en) | 2022-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113292640B (en) | Novel recombinant coronavirus RBD trimer protein vaccine capable of generating broad-spectrum cross-neutralization activity, and preparation method and application thereof | |
Brune et al. | New routes and opportunities for modular construction of particulate vaccines: stick, click, and glue | |
Liu et al. | A novel method for synthetic vaccine construction based on protein assembly | |
CN104583239B (en) | More specific monoclonal antibodies | |
CN107739736A (en) | Influenza nucleic acids molecule and vaccine prepared therefrom | |
US20230321260A1 (en) | Peptide tags and binding partners | |
JP2020502104A (en) | Production of seleno-biologics in genomically recoded organisms | |
CN110551187B (en) | Chemically synthesized H7N9 avian influenza virus NA protein extracellular region antigen segment, preparation method and application | |
KR20220082035A (en) | Carrier proteins with site-directed mutations and their use in the manufacture of vaccines | |
Wo et al. | Engineering probiotic-derived outer membrane vesicles as functional vaccine carriers to enhance immunity against SARS-CoV-2 | |
JP6172582B2 (en) | Vaccine to prevent swine edema disease | |
CN113347988A (en) | Modified inclusion bodies and uses thereof | |
US20160000901A1 (en) | Compositions and Methods for the Production of Virus-Like Particles | |
US20240093159A1 (en) | Virus-like particles and methods of production thereof | |
WO2023025287A1 (en) | Sars-cov-2 immunogenic substance and preparation method therefor and application thereof | |
JPWO2018074558A1 (en) | Complex polypeptide monomer, aggregate of monomer of complex polypeptide having cell permeation function, and norovirus component vaccine for subcutaneous, intradermal, transdermal or intramuscular administration comprising the aggregate as an active ingredient | |
WO2024018188A1 (en) | Virus-like particles, heterodimeric capsid proteins and methods of production thereof | |
US20230068497A1 (en) | Carrier matrix comprising dodecin protein | |
WO2024067182A1 (en) | Charge regulation-type antigen protein capable of enhancing synergistic immune efficacy with adjuvant | |
CN114502572A (en) | Self-assembled protein nanostructures displaying paramyxovirus and/or pneumovirus F proteins and uses thereof | |
CA2522221A1 (en) | Immunogen, composition for immunological use, and method of producing a ntibody using the same | |
Teow et al. | Production and purification of polymerization-competent HIV-1 capsid protein p24 (CA) in NiCo21 (DE3) Escherichia coli | |
WO2022047176A2 (en) | Single-chain coronavirus viral membrane protein complexes | |
AU2005292852B2 (en) | Method of immunizing animal, composition for immunization, method of producing antibody, method of producing hybridoma and method of producing monoclonal antibody | |
Karyagina et al. | Development of a platform for producing recombinant protein components of epitope vaccines for the prevention of COVID-19 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23748580 Country of ref document: EP Kind code of ref document: A1 |