US20230159595A1 - Polypeptides for use in self-assembling protein nanostructures - Google Patents
Polypeptides for use in self-assembling protein nanostructures Download PDFInfo
- Publication number
- US20230159595A1 US20230159595A1 US17/934,343 US202217934343A US2023159595A1 US 20230159595 A1 US20230159595 A1 US 20230159595A1 US 202217934343 A US202217934343 A US 202217934343A US 2023159595 A1 US2023159595 A1 US 2023159595A1
- Authority
- US
- United States
- Prior art keywords
- seq
- subunit
- nanostructure
- polypeptide sequence
- protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 161
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 161
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 161
- 239000002086 nanomaterial Substances 0.000 title claims abstract description 139
- 108090000623 proteins and genes Proteins 0.000 title claims description 100
- 102000004169 proteins and genes Human genes 0.000 title claims description 85
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 25
- 238000000034 method Methods 0.000 abstract description 19
- 235000018102 proteins Nutrition 0.000 description 84
- 108020004414 DNA Proteins 0.000 description 40
- 238000013461 design Methods 0.000 description 39
- 150000007523 nucleic acids Chemical class 0.000 description 29
- 102000053602 DNA Human genes 0.000 description 25
- 108020004682 Single-Stranded DNA Proteins 0.000 description 24
- 239000000203 mixture Substances 0.000 description 24
- 238000000429 assembly Methods 0.000 description 21
- 230000000712 assembly Effects 0.000 description 21
- 102000039446 nucleic acids Human genes 0.000 description 20
- 108020004707 nucleic acids Proteins 0.000 description 20
- 235000001014 amino acid Nutrition 0.000 description 19
- 210000004027 cell Anatomy 0.000 description 18
- 239000013604 expression vector Substances 0.000 description 18
- 239000013638 trimer Substances 0.000 description 18
- 230000014509 gene expression Effects 0.000 description 17
- 229940024606 amino acid Drugs 0.000 description 16
- 150000001413 amino acids Chemical group 0.000 description 16
- 238000010828 elution Methods 0.000 description 15
- 238000000338 in vitro Methods 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 239000000539 dimer Substances 0.000 description 14
- 239000000463 material Substances 0.000 description 14
- 239000000872 buffer Substances 0.000 description 13
- 239000013078 crystal Substances 0.000 description 13
- 239000002105 nanoparticle Substances 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- 239000000499 gel Substances 0.000 description 11
- 230000035772 mutation Effects 0.000 description 11
- 125000000539 amino acid group Chemical group 0.000 description 10
- 238000010186 staining Methods 0.000 description 10
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 108091029795 Intergenic region Proteins 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000002523 gelfiltration Methods 0.000 description 8
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 8
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 7
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- 229960005486 vaccine Drugs 0.000 description 7
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 6
- 102000002067 Protein Subunits Human genes 0.000 description 6
- 108010001267 Protein Subunits Proteins 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 230000002209 hydrophobic effect Effects 0.000 description 6
- 235000004252 protein component Nutrition 0.000 description 6
- 238000003259 recombinant expression Methods 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 5
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 5
- 230000028993 immune response Effects 0.000 description 5
- 238000009877 rendering Methods 0.000 description 5
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 4
- 239000004475 Arginine Substances 0.000 description 4
- 108010053770 Deoxyribonucleases Proteins 0.000 description 4
- 102000016911 Deoxyribonucleases Human genes 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 230000004186 co-expression Effects 0.000 description 4
- 230000021615 conjugation Effects 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 238000012377 drug delivery Methods 0.000 description 4
- 238000005538 encapsulation Methods 0.000 description 4
- 244000052769 pathogen Species 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000004448 titration Methods 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 108010089921 CTCGAG-specific type II deoxyribonucleases Proteins 0.000 description 3
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 3
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 3
- 150000008575 L-amino acids Chemical class 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 229940009098 aspartate Drugs 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 239000003599 detergent Substances 0.000 description 3
- 238000001493 electron microscopy Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 3
- 210000001163 endosome Anatomy 0.000 description 3
- 230000007717 exclusion Effects 0.000 description 3
- 229930195712 glutamate Natural products 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 239000011259 mixed solution Substances 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 238000002887 multiple sequence alignment Methods 0.000 description 3
- 229910052759 nickel Inorganic materials 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 239000011148 porous material Substances 0.000 description 3
- 238000001542 size-exclusion chromatography Methods 0.000 description 3
- 238000000235 small-angle X-ray scattering Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108010036162 GATC-specific type II deoxyribonucleases Proteins 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- 230000033289 adaptive immune response Effects 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000005094 computer simulation Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000009881 electrostatic interaction Effects 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 239000012216 imaging agent Substances 0.000 description 2
- 230000002519 immonomodulatory effect Effects 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 210000003712 lysosome Anatomy 0.000 description 2
- 230000001868 lysosomic effect Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 229920000620 organic polymer Polymers 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 238000000059 patterning Methods 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 239000013014 purified material Substances 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000001338 self-assembly Methods 0.000 description 2
- 239000004055 small Interfering RNA Substances 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 238000012800 visualization Methods 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 101710167800 Capsid assembly scaffolding protein Proteins 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229940033330 HIV vaccine Drugs 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108091007412 Piwi-interacting RNA Proteins 0.000 description 1
- 101710130420 Probable capsid assembly scaffolding protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710204410 Scaffold protein Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108091027967 Small hairpin RNA Proteins 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 238000005411 Van der Waals force Methods 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000001553 co-assembly Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000635 electron micrograph Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 108010021310 endodeoxyribonuclease NcoI Proteins 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000001597 immobilized metal affinity chromatography Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 239000008204 material by function Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000005060 membrane bound organelle Anatomy 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 238000005232 molecular self-assembly Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000001426 native polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000002439 negative-stain electron microscopy Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 210000004896 polypeptide structure Anatomy 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000004845 protein aggregation Effects 0.000 description 1
- 235000019624 protein content Nutrition 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 239000012521 purified sample Substances 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- 238000001464 small-angle X-ray scattering data Methods 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
Definitions
- the invention provides isolated polypeptides comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS:1-34.
- the invention provides nanostructures, comprising:
- each first assembly comprising a plurality of identical first polypeptides, wherein the first polypeptides comprise the polypeptide of claim 1 ;
- each second assembly comprising a plurality of identical second polypeptides, wherein the second polypeptides comprise the polypeptide of claim 1 , and wherein the second polypeptide differs from the first polypeptide;
- the present invention provides isolated nucleic acids encoding the polypeptides of the invention.
- the invention provides nucleic acid expression vectors comprising isolated nucleic acids of the invention.
- the present invention provides recombinant host cells, comprising a nucleic acid expression vector according to the invention.
- the present invention provides a kit, comprising one or more isolated nanostructures of the invention; one or more of the isolated proteins of the present invention or the assemblies of the present invention; one or more recombinant nucleic acids of the present invention; one or more recombinant expression vectors of the present invention; and/or one or more recombinant host cells of the present invention.
- FIG. 1 Overview of the design method utilized to produce the exemplary nanostructures and sequences, illustrated with the I53 icosahedral architecture.
- A A schematic illustration of icosahedral symmetry outlined with dashed lines, with the five-fold symmetry axes shown going through each vertex and three-fold symmetry axes going through each face of the icosahedron.
- B 12 pentamers (dark grey) and 20 trimers (light grey) are aligned along the 5-fold and 3-fold symmetry axes, respectively.
- Each oligomer possesses two rigid body degrees of freedom, one translational (r) and one rotational (w) that are systematically sampled to identify configurations with large interfaces and high densities of contacting residues suitable for protein-protein interface design.
- C Example of such a docked configuration with a large interface and high density of contacting residues suitable for protein-protein interface design.
- D Close-up of the docked interface between the pentameric and trimeric subunits, as outlined in panel C. Side chains atoms beyond the beta carbon are ignored at this stage of design.
- E New amino acid sequences are designed at the interface to stabilize the modeled configuration.
- FIG. 2 Design models of exemplary nanostructures. Computational models of the 11 exemplary nanomaterials, (A) I53-34, (B) I53-40, (C) I53-47, (D) I53-50, (E) I53-51, (F) I52-03, (G) I52-32, (H) I52-33, (I)I32-06, (J) I32-19, and (K) I32-28, are shown to scale (relative to the 30 nm scale bar), viewed down one of the 5-fold icosahedral symmetry axes with ribbon-style rendering of the protein backbone.
- Each I53 material comprises 12 identical pentamers (dark grey) and 20 identical trimers (light grey)
- each I52 material comprises 12 identical pentamers (dark grey) and 30 identical dimers (light grey)
- each I32 material comprises 20 identical trimers (dark grey) and 30 identical dimers (light grey), with the designed interface formed between these oligomeric building blocks. All renderings were generated using PyMOL® Schrödinger, LLC.
- FIG. 3 Sodium dodecyl sulfate polyacrylamide electrophoresis (SDS-PAGE) and mass spectrometry analysis. Pairs of proteins encoding each material were co-expressed (as described in the Methods of Production) in E. coli , lysed, and purified via nickel-affinity chromatography followed by gel filtration with a Superose® 6 10/300 GL column (GE Life Sciences). (A) The resulting samples were subjected to SDS-PAGE followed by staining with GelCode® Blue Stain Reagent (Pierce Biotechnology, Inc.).
- SDS-PAGE Sodium dodecyl sulfate polyacrylamide electrophoresis
- the left lane in each panel contains protein molecular weight standards; the approximate molecular weights in kilodaltons are indicated directly to the left of each band.
- the right lanes in each panel contain the purified samples. For all of the materials except I52-03, clear bands, of similar staining intensity and near the expected molecular weights of each protein subunit, are present for each of the two proteins comprising the purified materials.
- FIG. 4 Negative stain averages. Averages have been obtained of the (A) I53-40, (B) I53-50, (C) I52-03, and (D) I32-06 nanostructures and found to match well with the design models. Raw negative stain micrographs from which the averages were generated are shown on the left side of each panel. Averages (left), along with renderings from the design models (right), are shown on the right side of each panel. Views are shown corresponding approximately to the 5-fold, 3-fold, and 2-fold symmetry axes.
- FIG. 5 X-ray crystallography.
- X-ray crystal structures (bottom) ranging from 3.5 to 5.0 ⁇ resolution have been obtained for three of the designed materials, (A) I53-40, (B) I52-32, and (C) I32-28, and found to match closely with the design models (top).
- Each structure is shown using a ribbon-style rendering. Views of the I53 and I52 designs and crystal structures (panels A and B) are shown looking down one of the 5-fold symmetry axes, while the I32 design model and crystal structure (panel C) are shown looking down one of the 3-fold symmetry axes.
- Each crystal structure contains only a portion of the full icosahedron in the asymmetric unit.
- Crystal lattice symmetry was applied to generate the full icosahedra shown in the bottom panel.
- the I53-40 design model and crystal structure (panel A) comprise 12 pentamers (dark grey) and 20 trimers (light grey)
- the I52-32 design model and crystal structure (panel B) comprise 12 pentamers (dark grey) and 30 dimers (light grey)
- the I32-28 design model and crystal structure (panel C) comprise 20 trimers (dark grey) and 30 dimers (light grey). All renderings were generated using PyMOL® Schrödinger, LLC.
- FIG. 6 In vitro assembly of I53-50A.1PosT1+I53-50B.4PosT1 in the presence of 400 nucleotide (nt) ssDNA leads to encapsulation and protection of the ssDNA.
- Mixtures of 26 ng/ ⁇ L ssDNA and various proteins were analyzed by agarose gel electrophoretic mobility shift assay (EMSA) after incubation for 16 hours to determine the ability of mixtures of I53-50A.1PosT1+I53-50B.4PosT1 to encapsulate the ssDNA (left; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein).
- ESA agarose gel electrophoretic mobility shift assay
- Components titration are mixtures of I53-50A.1PosT1+I53-50B.4PosT1 at 2, 4, 6, 8, 10 and 12 ⁇ M) with the DNA shift the DNA such that it migrates similarly to SEC-purified I53-50A.1PosT1+I53-50B.4PosT1 nanoparticles (upper band), while mixtures of DNA with only one protein component or the other do not.
- the mixtures were then incubated with 25 ⁇ g/mL DNase I for 1 hour at room temperature in order to evaluate the ability of the in vitro-assembled nanoparticles to protect the ssDNA cargo from degradation (right; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein).
- the DNA that co-migrates with the protein in mixtures of both components is largely protected from DNase challenge, while free ssDNA and the mixture of ssDNA+I53-50B.4PosT1 are not.
- the mixture of ssDNA+I53-50A.1PosT1 is weakly protected, but migrates as a diffuse smear on the gel.
- FIG. 7 In vitro assembly of I53-50A.1PosT1+I53-50B.4PosT1 in the presence of 1600 nucleotide (nt) ssDNA leads to encapsulation and protection of the ssDNA.
- Mixtures of 35.2 ng/ ⁇ L ssDNA and various proteins were analyzed by agarose electrophoretic mobility shift assay (EMSA) after incubation for 16 hours to determine the ability of mixtures of I53-50A.1PosT1+I53-50B.4PosT1 to encapsulate the ssDNA (left; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein).
- ESA agarose electrophoretic mobility shift assay
- Components titration are mixtures of I53-50A.1PosT1+153-50B.4PosT1 at 2, 4, 6, 8, 10 and 12 ⁇ M) with the DNA shift the DNA such that it migrates similarly to SEC-purified I53-50A.1PosT1+I53-50B.4PosT1 nanoparticles (upper band), while mixtures of DNA with only one protein component or the other do not.
- the mixtures were then incubated with 25 ⁇ g/mL DNase I for 1 hour at room temperature in order to evaluate the ability of the in vitro-assembled nanoparticles to protect the ssDNA cargo from degradation (right; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein).
- the DNA that co-migrates with the protein in mixtures of both components is largely protected from DNase challenge, while free ssDNA and the mixture of ssDNA+I53-50B.4PosT1 are not.
- the mixture of ssDNA+I53-50A.1PosT1 is weakly protected, but migrates as a diffuse smear on the gel.
- amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).
- “about” means +/ ⁇ 5% of the recited parameter.
- the invention provides isolated polypeptide comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34.
- the isolated polypeptides of the invention can be used, for example, to prepare the nanostructures of the invention.
- the polypeptides of the invention were designed for their ability to self-assemble in pairs to form nanostructures, such as icosahedral nanostructures.
- the design involved design of suitable interface residues for each member of the polypeptide pair that can be assembled to form the nanostructure.
- the nanostructures of the invention include symmetrically repeated, non-natural, non-covalent polypeptide-polypeptide interfaces that orient a first assembly and a second assembly into a nanostructure, such as one with an icosahedral symmetry.
- Starting proteins were those derived from pentameric, trimeric, and dimeric crystal structures from the Protein Data Bank (PDB), along with a small number of crystal structures of de novo designed proteins not yet deposited in the PDB.
- PDB Protein Data Bank
- each of the polypeptides of the present invention includes one or more modifications at “interface residues” compared to the starting proteins, permitting the polypeptides of the invention to, for example, form icosahedral nanostructures as described herein.
- Table 1 provides the amino acid sequence of exemplary polypeptides of the invention; the right hand column in Table 1 identifies the residue numbers in each exemplary polypeptide that were identified as present at the interface of resulting assembled nanostructures (i.e.: “identified interface residues”). As can be seen, the number of interface residues for the exemplary polypeptides of SEQ ID NO:1-34 range from 4-13.
- the isolated polypeptides of the invention comprise an amino acid sequence that is at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical over its length, and identical at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 identified interface positions (depending on the number of interface residues for a given polypeptide), to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34.
- the isolated polypeptides of the invention comprise an amino acid sequence that is at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical over its length, and identical at least at 20%, 25%, 33%, 40%, 50%, 60%, 70%, 75%, 80%, 90%, or 100% of the identified interface positions, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34.
- the polypeptides of the invention comprise or consist of a polypeptide having the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS:1-40.
- I53-34A SEQ ID RTEKGLEALKALRKSGLLLGAGTVRSPKEAEAALEAGAAFLVSPGL 28, 32, 36, 37, NO: 1 LEEVAALAQARGVPYLPGVLTPIEVERALALGLSALKFFPAEPFQG 186, 188, 191, 192, 195 VRVLRAYAEVFPEVRFLPTGGIKEEHLPHYAALPNLLAVGGSWLLQ GDLAAVMKKVKAAKALLSPQAPG I53-34B MTKKVGIVDTTFARVDMAEAAIRTLKALSPNIKIIRKTVPGIKDLPV I53-34B: SEQ ID ACKKLLEEEGCDIVMALGMPGKAEKDKVCAHEASLGLMLAQLMT 19, 20, 23, 24, 27, 109, 113, 116, NO: 2 NKHIIEVFVHEDEA
- I53-40 A genus (SEQ ID NO: 35) MTKKVGIVDTTFARVDMASAAILTLKMESPNIKIIRKTVPGIK DLPVACKKLLEEEGCDIVMALGMPGK(A/K)EKDKVCAHEASL GLMLAQLMTNKHIIEVFVHEDEAKDDAELKILAARRAIEHALN VYYLLFKPEYLTRMAGKGLRQGFEDAGPARE I53-40 B genus (SEQ ID NO: 36) M(S/D)(T/D)INNQLK(A/R)LKVIPVIAIDNAEDIIPLGKV LAENGLPAAEITFRSSAAVKAIMLLRSAQPEMLIGAGTILNGV QALAAKEAGA(T/D)FVVSPGFNPNTVRACQIIGIDIVPGVNN PSTVE(A/Q)ALEMGLTTLKFFPAEASGGISMVKSLVGPYGDI RLMPTGGITP(S/D)NIDNYLAIPQVLACGGTWMV
- the polypeptides are expected to tolerate some variation in the designed sequences without disrupting subsequent assembly into nanostructures: particularly when such variation comprises conservative amino acid substitutions.
- conservative amino acid substitution means that: hydrophobic amino acids (Ala, Cys, Gly, Pro, Met, See, Sme, Val, Ile, Leu) can only be substituted with other hydrophobic amino acids; hydrophobic amino acids with bulky side chains (Phe, Tyr, Trp) can only be substituted with other hydrophobic amino acids with bulky side chains; amino acids with positively charged side chains (Arg, His, Lys) can only be substituted with other amino acids with positively charged side chains; amino acids with negatively charged side chains (Asp, Glu) can only be substituted with other amino acids with negatively charged side chains; and amino acids with polar uncharged side chains (Ser, Thr, Asn, Gln) can only be substituted with other amino acids with polar uncharged side chains.
- the ability to widely modify surface amino acid residues without disruption of the polypeptide structure permits many types of modifications to endow the resulting self-assembled nanostructures with a variety of functions.
- the polypeptides of the invention can be modified to facilitate covalent linkage to a “cargo” of interest.
- the polypeptides can be modified, such as by introduction of various cysteine residues at defined positions to facilitate linkage to one or more antigens of interest, such that a nanostructure of the polypeptides would provide a scaffold to provide a large number of antigens for delivery as a vaccine to generate an improved immune response.
- some or all native cysteine residues that are present in the polypeptides but not intended to be used for conjugation may be mutated to other amino acids to facilitate conjugation at defined positions.
- the polypeptides of the invention may be modified by linkage (covalent or non-covalent) with a moiety to help facilitate “endosomal escape.”
- endosome a membrane-bound organelle that is the entry point of the delivery vehicle into the cell. Endosomes mature into lysosomes, which degrade their contents.
- the polypeptides can be modified, for example, by introducing cysteine residues that will allow chemical conjugation of such a lipid or organic polymer to the monomer or resulting assemly surface.
- the polypeptides can be modified, for example, by introducing cysteine residues that will allow chemical conjugation of fluorophores or other imaging agents that allow visualization of the nanostructures of the invention in vitro or in vivo.
- polypeptides can be mutated in order to improve the stability or solubility of the protein subunits or the assembled nanostructures.
- a multiple sequence alignment of other proteins from that family can be used to guide the selection of amino acid mutations at non-conserved positions that can increase protein stability and/or solubility, a process referred to as consensus protein design (9).
- Surface amino acid residues on the polypeptides can be mutated to positively charged (Arg, Lys) or negatively charged (Asp, Glu) amino acids in order to endow the protein surface with an overall positive or overall negative charge.
- surface amino acid residues on the polypeptides can be mutated to endow the interior surface of the self-assembling nanostructure with a high net charge. Such a nanostructure can then be used to package or encapsulate a cargo molecule with the opposite net charge due to the electrostatic interaction between the nanostructure interior surface and the cargo molecule.
- surface amino acid residues on the polypeptides can be mutated primarily to Arginine or Lysine residues in order to endow the interior surface of the self-assembling nanostructure with a net positive charge.
- Solutions containing the polypeptides can then be mixed in the presence of a nucleic acid cargo molecule such as a dsDNA, ssDNA, dsRNA, ssRNA, cDNA, miRNA, siRNA, shRNA, piRNA, or other nucleic acid in order to encapsulate the nucleic acid inside the self-assembling nanostructure.
- a nucleic acid cargo molecule such as a dsDNA, ssDNA, dsRNA, ssRNA, cDNA, miRNA, siRNA, shRNA, piRNA, or other nucleic acid in order to encapsulate the nucleic acid inside the self-assembling nanostructure.
- a nanostructure could be used, for example, to protect, deliver, or concentrate nucleic acids.
- Table 2 lists surface amino acid residue numbers for each exemplary polypeptide of the invention denoted by SEQ ID NOS: 1-34.
- 1 or more at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, or more
- these surface residues may be modified in the polypeptides of the invention.
- polypeptides of the present invention can also tolerate non-conservative substitutions.
- the isolated polypeptides may be produced recombinantly or synthetically, using standard techniques in the art.
- the isolated polypeptides of the invention can be modified in a number of ways, including but not limited to the ways described above, either before or after assembly of the nanostructures of the invention.
- the term “polypeptide” is used in its broadest sense to refer to a sequence of subunit amino acids.
- the polypeptides of the invention may comprise L-amino acids, D-amino acids (which are resistant to L-amino acid-specific proteases in vivo), or a combination of D- and L-amino acids.
- the invention provides nanostructures, comprising:
- each first assembly comprising a plurality of identical first polypeptides, wherein the first polypeptides comprise the polypeptide of any embodiment or combination of embodiments of the first aspect of the invention
- each second assembly comprising a plurality of identical second polypeptides, wherein the second polypeptides comprise the polypeptide of any embodiment or combination of embodiments of the first aspect of the invention, wherein the second polypeptide differs from the first polypeptide;
- first polypeptides self-assemble to form a first assembly
- second polypeptides self-assemble to form a second assembly
- a plurality of these first and second assemblies then self-assemble non-covalently via the designed interfaces to produce the nanostructures of the invention.
- the designed interfaces on the polypeptides of the invention resembling natural protein-protein interfaces with well-packed cores composed primarily of hydrophobic amino acid side chains surrounded by a periphery composed primarily of hydrophilic and charged side chains, rigidly orient the assemblies within the nanostructures formed by self-assembly.
- the interaction between the first assembly and the second assembly is a non-covalent protein-protein interaction.
- Any suitable non-covalent interaction(s) can drive self-interaction of the assemblies to form the nanostructure, including but not limited to one or more of electrostatic interactions, 7 ⁇ -effects, van der Waals forces, hydrogen bonding, and hydrophobic effects.
- pentamers, trimers, and dimers of the first or second assemblies assemble relative to each other such that their 5-fold, 3-fold, and 2-fold symmetry axes are aligned along icosahedral 5-fold, 3-fold, and 2-fold symmetry axes, respectively.
- the nanostructures are between about 20 nanometers (nm) to about 40 nm in diameter, with interior lumens between about 15 nm to about 32 nm across and pore sizes in the protein shells between about 1 nm to about 14 nm in their longest dimensions ( FIG. 2 ).
- the nanostructures of the invention can be used for any suitable purpose, including but not limited to delivery vehicles, as the nanostructures can encapsulate molecules of interest and/or the first and/or second proteins can be modified to bind to molecules of interest (diagnostics, therapeutics, detectable molecules for imaging and other applications, etc.).
- the nanostructures of the invention are well suited for several applications, including vaccine design, targeted delivery of therapeutics, and bioenergy.
- the first polypeptides and the second polypeptides comprise polypeptides with the amino acid sequence selected from the following pairs, or modified versions thereof (i.e.: permissible modifications as disclosed for the polypeptides of the invention: isolated polypeptides comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence indicated by the SEQ ID NO.):
- the nanostructure has icosahedral symmetry.
- the nanostructure may comprise 60 copies of the first polypeptide and 60 copies of the second polypeptide.
- the number of identical first polypeptides in each first assembly is different than the number of identical second polypeptides in each second assembly.
- the nanostructure comprises twelve first assemblies and twenty second assemblies; in this embodiment, each first assembly may, for example, comprise five copies of the identical first polypeptide, and each second assembly may, for example, comprise three copies of the identical second polypeptide.
- the nanostructure comprises twelve first assemblies and thirty second assemblies; in this embodiment, each first assembly may, for example, comprise five copies of the identical first polypeptide, and each second assembly may, for example, comprise two copies of the identical second polypeptide.
- the nanostructure comprises twenty first assemblies and thirty second assemblies; in this embodiment, each first assembly may, for example, comprise three copies of the identical first polypeptide, and each second assembly may, for example, comprise two copies of the identical second polypeptide. All of these embodiments are capable of forming synthetic nanomaterials with regular icosahedral symmetry.
- oligomeric states of the first and second polypeptides are as follows:
- I53-34A trimer+I53-34B: pentamer
- I53-40A pentamer+I53-40B: trimer
- I53-47A trimer+I53-47B: pentamer
- I53-50A trimer+I53-50B: pentamer
- I53-51A trimer+I53-51B: pentamer
- I32-06A dimer+I32-06B: trimer
- I32-19A trimer+I32-19B: dimer
- I32-28A trimer+I32-28B: dimer
- I52-03A pentamer+I52-03B: dimer
- I52-32A dimer+I52-32B: pentamer
- I52-33A pentamer+I52-33B: dimer.
- the nanostructures form spontaneously when appropriate polypeptide pairs are co-expressed in E. coli cells, yielding milligram quantities of purified material per liter of cell culture using standard methods of immobilized metal-affinity chromatography and gel filtration.
- a poly-histidine purification tag is appended to just one of the two distinct polypeptide subunits (i.e.: the first and second polypeptides) comprising each nanostructure, the other subunit is found to co-purify with the tagged subunit.
- the nanostructure further comprises a cargo within the nanostructure.
- a “cargo” is any compound or material that can be incorporated on and/or within the nanostructure.
- polypeptide pairs suitable for nanostructure self-assembly can be expressed/purified independently; they can then be mixed in vitro in the presence of a cargo of interest to produce the nanostructure comprising a cargo. This feature, combined with the protein nanostructures' large lumens and relatively small pore sizes, makes them well suited for the encapsulation of a broad range of cargo including, but not limited to, small molecules, nucleic acids, polymers, and other proteins.
- the protein nanostructures of the present invention could be used for many applications in medicine and biotechnology, including targeted drug delivery and vaccine design.
- targeting moieties could be fused or conjugated to the protein nanostructure exterior to mediate binding and entry into specific cell populations and drug molecules could be encapsulated in the cage interior for release upon entry to the target cell or sub-cellular compartment.
- antigenic epitopes from pathogens could be fused or conjugated to the cage exterior to stimulate development of adaptive immune responses to the displayed epitopes, with adjuvants and other immunomodulatory compounds attached to the exterior and/or encapsulated in the cage interior to help tailor the type of immune response generated for each pathogen.
- the polypeptide components may be modified as noted above.
- the polypeptides can be modified, such as by introduction of various cysteine residues at defined positions to facilitate linkage to one or more antigens of interest as cargo, and the nanostructure could act as a scaffold to provide a large number of antigens for delivery as a vaccine to generate an improved immune response.
- Other modifications of the polypeptides as discussed above may also be useful for incorporating cargo into the nanostructure.
- the nanostructures may comprise one or more peptides configured to bind or fuse with desired immunogens.
- the nanostructure comprises one or more copies of variants designed to form a nanostructure of the trimeric proteins 1WOZ or 1WA3 (PDB ID codes), which have been demonstrated to be suitable for fusion with the trimeric HIV immunogen, BG505 SOSIP (4-6).
- PDB ID codes polymeric HIV immunogen
- Such nanostructures could be used as scaffolds for the design of an HIV vaccine capable of inducing protective immune responses against the virus.
- the nanostructures of the present invention could be useful as scaffolds for the attachment of enzymes on the interior and/or exterior of the cages. Such enzymes confer on the nanostructure the ability to catalyze biochemical pathways or other reactions.
- the cargo may comprise a detectable cargo.
- the nanostructures of the present invention could also be useful as single-cell or single-molecule imaging agents. The materials are large enough to be identified in cells by electron microscopy, and when tagged with fluorophores they are readily detectable by light microscopy. This feature makes them well-suited to the task of correlating images of the same cells taken by light microscopy and electron microscopy.
- the present invention provides isolated nucleic acids encoding a protein of the present invention.
- the isolated nucleic acid sequence may comprise RNA or DNA.
- isolated nucleic acids are those that have been removed from their normal surrounding nucleic acid sequences in the genome or in cDNA sequences.
- Such isolated nucleic acid sequences may comprise additional sequences useful for promoting expression and/or purification of the encoded protein, including but not limited to polyA sequences, modified Kozak sequences, and sequences encoding epitope tags, export signals, and secretory signals, nuclear localization signals, and plasma membrane localization signals. It will be apparent to those of skill in the art, based on the teachings herein, what nucleic acid sequences will encode the proteins of the invention.
- the present invention provides recombinant expression vectors comprising the isolated nucleic acid of any embodiment or combination of embodiments of the invention operatively linked to a suitable control sequence.
- “Recombinant expression vector” includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product.
- “Control sequences” operably linked to the nucleic acid sequences of the invention are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof.
- intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered “operably linked” to the coding sequence.
- Other such control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites.
- Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors.
- control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive).
- inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive.
- the construction of expression vectors for use in transfecting prokaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989; Gene Transfer and Expression Protocols, pp.
- the expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA.
- the expression vector comprises a plasmid.
- the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors.
- the present invention provides host cells that have been transfected with the recombinant expression vectors disclosed herein, wherein the host cells can be either prokaryotic or eukaryotic.
- the cells can be transiently or stably transfected.
- transfection of expression vectors into prokaryotic and eukaryotic cells can be accomplished via any technique known in the art, including but not limited to standard bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection.
- a method of producing a polypeptide according to the invention is an additional part of the invention.
- the method comprises the steps of (a) culturing a host according to this aspect of the invention under conditions conducive to the expression of the polypeptide, and (b) optionally, recovering the expressed polypeptide.
- kits comprising:
- the present invention provides methods of using the nanostructures of the present invention.
- both polypeptides comprising an assembly are capable of independent expression and purification, this enables control over assembly through mixing of purified components in vitro.
- This feature combined with the nanostructures' large lumens and relatively small pore sizes, makes them well suited for the encapsulation of a broad range of other materials including small molecules, nucleic acids, polymers, and other proteins, as discussed above.
- the nanostructures of the present invention could be used for many applications in medicine and biotechnology, including targeted drug delivery and vaccine design.
- targeting moieties could be fused or conjugated to the nanostructure exterior to mediate binding and entry into specific cell populations and drug molecules could be encapsulated in the cage interior for release upon entry to the target cell or sub-cellular compartment.
- antigenic epitopes from pathogens could be fused or conjugated to the nanostructure exterior to stimulate development of adaptive immune responses to the displayed epitopes, with adjuvants and other immunomodulatory compounds attached to the exterior and/or encapsulated in the cage interior to help tailor the type of immune response generated for each pathogen.
- Other uses will be clear to those of skill in the art based on the disclosure relating to polypeptide modifications, nanostructure design, and cargo incorporation.
- the icosahedral materials disclosed herein (amino acid sequences provided in Table 1), which comprise possible embodiments of the present invention, were produced as follows.
- the initial sequences and structures for the design process were derived from pentameric, trimeric, and dimeric crystal structures from the Protein Data Bank (PDB), along with a small number of crystal structures of de novo designed proteins not yet deposited in the PDB.
- PDB Protein Data Bank
- the PDB Accession numbers for the wild type scaffold proteins related to the exemplary polypeptides of the invention are as follows:
- Genes encoding the 71 pairs of I53 sequences were synthesized and cloned into a variant of the pET29b expression vector (Novagen, Inc.) between the NdeI and XhoI endonuclease restriction sites.
- Genes encoding the 44 pairs of I52 sequences and 68 pairs of I32 sequences were synthesized and cloned into a variant of the pET28b expression vector (Novagen, Inc.) between the NcoI and XhoI endonuclease restriction sites.
- the two protein coding regions in each DNA construct are connected by an intergenic region.
- the intergenic region in the I53 designs was derived from the pETDuet-1 vector (Novagen, Inc.) and includes a stop codon, T7 promoter/lac operator, and ribosome binding site.
- the intergenic region in the I52 and I32 designs only includes a stop codon and ribosome binding site.
- the sequences of the 153, 152 and I32 intergenic regions are as follows:
- the constructs for the I53 protein pairs thus possess the following set of elements from 5′ to 3′: NdeI restriction site, upstream gene, intergenic region, downstream gene, XhoI restriction site.
- the constructs for the I52 and I32 protein pairs possess the following set of elements from 5′ to 3′: NcoI restriction site, upstream gene, intergenic region, downstream gene, XhoI restriction site.
- the upstream genes encode components denoted with the suffix “A”;
- the downstream genes encode the “B” components (Table 1). This allows for co-expression of the designed protein pairs in which both the upstream and downstream genes have their own ribosome binding site, and in the case of the I53 designs, both genes also have their own T7 promoter/lac operator.
- each co-expression construct includes a 6x-histidine tag (HHHHHH) appended to the N- or C-terminus of one of the two protein coding regions.
- HHHHHH 6x-histidine tag
- the designed proteins were first screened for soluble expression and co-purification at small scale from 2 to 4 mL cultures by nickel affinity chromatography using His MultiTrap® FF nickel-coated filter plates (GE Healthcare). Purification products were analyzed by SDS-PAGE to identify those containing species near the expected molecular weight of both protein subunits (indicating co-purification). Those found to contain both subunits were subsequently subjected to native (non-denaturing) PAGE to identify slow migrating species further indicating assembly to higher order materials.
- the purified proteins were analyzed by size exclusion chromatography using a Superose® 6 10/300 column to assess their assembly states.
- major peaks were observed in the chromatograms near elution volumes of 8.5 to 12 mL, which correspond well with the expected elution volumes for the designed 120-subunit icosahedral nanostructures.
- the relative elution volumes correspond with the physical dimensions of the computational design models of the nanostructures, that is, proteins designed to assemble into relatively larger nanostructures yielded peaks at earlier elution volumes while those designed to assemble into relatively smaller nanostructures yielded peaks at later elution volumes.
- smaller secondary peaks were observed at slightly earlier elution volumes than the predominant peak, suggesting transient or low-affinity dimerization of the nanostructures.
- SAXS small angle X-ray scattering
- the computational models of designed I53 materials were redesigned by allowing optimization of the identities of relatively exposed residues (defined as having a solvent accessible surface area of greater than 20 square ⁇ ngstroms), excepting polar residues (Aspartate, Glutamate, Histidine, Lysine, Asparagine, Glutamine, and Arginine) and residues near the designed protein-protein interfaces between the pentameric and trimeric components. Mutations that resulted in losses of significant atomic packing interactions or side chain-backbone hydrogen bonds were discarded.
- PSSM position-specific scoring matrix
- the Rosetta macromolecular modeling suite was used to mutate manually selected amino acid positions to charged amino acids in order to generate variant nanoparticles featuring highly positively or negatively charged interior surfaces.
- mutations were limited to either Aspartate or Glutamate.
- mutations were limited to either Arginine or Lysine.
- Relevant score metrics for each mutation were independently assessed, and favorable mutations were sorted into two tiers based on their scores.
- each co-expression construct included a 6x-histidine tag (HHHHHH) appended to the N- or C-terminus of one of the two protein coding regions.
- each individual expression construct included a 6x-histidine tag appended to the N- or C-terminus of the protein coding region.
- the I53-40.1A and I53-40.1B protein variants were constructed by consensus protein design, in which multiple sequence alignments from protein families related to each protein subunit were used to guide the selection of amino acid residues at surface-exposed positions.
- the variant proteins were found to be more stable and soluble when purified independently than the original proteins, a property that enabled the formation of the designed nanostructure by simply mixing solutions containing the purified components in physiological buffers in a 1:1 molar ratio.
- the addition of 0.75% CHAPS, a zwitterionic detergent, to the buffer was found to further increase the stability and solubility of I53-40.1A and was therefore included during the purification of the protein prior to in vitro assembly.
- Size exclusion chromatograms from a run analyzing the mixed solution containing both components on a Superose 6 column revealed a single major peak at the elution volume expected for the 120-subunit designed icosahedral nanostructure.
- Analysis of the peak fractions by SDS-PAGE revealed bands at the expected molecular weight for the first and second polypeptides of the nanostructure in an apparent 1:1 stoichiometric ratio. The data demonstrate that when mixed, the two components co-assemble to the 120-subunit designed icosahedral nanostructure.
- the I53-47A.1, I53-47B.1, I53-50A.1, and I53-50B.1 protein variants were constructed by consensus protein design, in which multiple sequence alignments from protein families related to each protein subunit were used to guide the selection of amino acid residues at surface-exposed positions.
- the variant proteins were found to be more stable and soluble when purified independently than the original proteins, a property that enabled the formation of the designed nanostructure by simply mixing solutions containing the purified components in physiological buffers in a 1:1 molar ratio, a process referred to as in vitro assembly.
- additional negatively charged amino acid residues i.e., Aspartate and Glutamate
- ssDNA single-stranded DNA
- ssDNA single-stranded DNA
- samples were either analyzed by electrophoresis on a 1% agarose gel or DNase I was added to a final concentration of 25 pg/mL and incubated for one hour at room temperature before electrophoresis.
- Gels were stained with SybrGold® (ThermoFisher Scientific) and imaged to visualize nucleic acid, and were subsequently stained with GelCode® Blue (Pierce) and imaged again to visualize protein.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Inorganic Chemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
Abstract
Description
- This application claims priority to U.S. Provisional Patent Application Ser. No. 62/074,167 filed Nov. 3, 2014, incorporated by reference herein in its entirety.
- This invention was made with U.S. government support under CHE-1332907, awarded by the National Science Foundation, and DGE-0718124, awarded by the National Science Foundation. The U.S. Government has certain rights in the invention.
- Molecular self- and co-assembly of proteins into highly ordered, symmetric supramolecular complexes is an elegant and powerful means of patterning matter at the atomic scale. Recent years have seen advances in the development of self-assembling biomaterials, particularly those composed of nucleic acids. DNA has been used to create, for example, nanoscale shapes and patterns, molecular containers, and three-dimensional macroscopic crystals.
- Methods for designing self-assembling proteins have progressed more slowly, yet the functional and physical properties of proteins make them attractive as building blocks for the development of advanced functional materials.
- In a first aspect, the invention provides isolated polypeptides comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS:1-34.
- In a second aspect, the invention provides nanostructures, comprising:
- (a) a plurality of first assemblies, each first assembly comprising a plurality of identical first polypeptides, wherein the first polypeptides comprise the polypeptide of
claim 1; and - (b) a plurality of second assemblies, each second assembly comprising a plurality of identical second polypeptides, wherein the second polypeptides comprise the polypeptide of
claim 1, and wherein the second polypeptide differs from the first polypeptide; - wherein the plurality of first assemblies non-covalently interact with the plurality of second assemblies to form a nanostructure.
- In another aspect, the present invention provides isolated nucleic acids encoding the polypeptides of the invention. In a further aspect, the invention provides nucleic acid expression vectors comprising isolated nucleic acids of the invention. In another aspect, the present invention provides recombinant host cells, comprising a nucleic acid expression vector according to the invention.
- In a further aspect, the present invention provides a kit, comprising one or more isolated nanostructures of the invention; one or more of the isolated proteins of the present invention or the assemblies of the present invention; one or more recombinant nucleic acids of the present invention; one or more recombinant expression vectors of the present invention; and/or one or more recombinant host cells of the present invention.
- The foregoing aspects and many of the attendant advantages of this invention will become more readily appreciated as the same become better understood by reference to the following detailed description, when taken in conjunction with the accompanying drawings.
-
FIG. 1 . Overview of the design method utilized to produce the exemplary nanostructures and sequences, illustrated with the I53 icosahedral architecture. (A) A schematic illustration of icosahedral symmetry outlined with dashed lines, with the five-fold symmetry axes shown going through each vertex and three-fold symmetry axes going through each face of the icosahedron. (B) 12 pentamers (dark grey) and 20 trimers (light grey) are aligned along the 5-fold and 3-fold symmetry axes, respectively. Each oligomer possesses two rigid body degrees of freedom, one translational (r) and one rotational (w) that are systematically sampled to identify configurations with large interfaces and high densities of contacting residues suitable for protein-protein interface design. (C) Example of such a docked configuration with a large interface and high density of contacting residues suitable for protein-protein interface design. (D) Close-up of the docked interface between the pentameric and trimeric subunits, as outlined in panel C. Side chains atoms beyond the beta carbon are ignored at this stage of design. (E) New amino acid sequences are designed at the interface to stabilize the modeled configuration. -
FIG. 2 . Design models of exemplary nanostructures. Computational models of the 11 exemplary nanomaterials, (A) I53-34, (B) I53-40, (C) I53-47, (D) I53-50, (E) I53-51, (F) I52-03, (G) I52-32, (H) I52-33, (I)I32-06, (J) I32-19, and (K) I32-28, are shown to scale (relative to the 30 nm scale bar), viewed down one of the 5-fold icosahedral symmetry axes with ribbon-style rendering of the protein backbone. Each I53 material comprises 12 identical pentamers (dark grey) and 20 identical trimers (light grey), each I52 material comprises 12 identical pentamers (dark grey) and 30 identical dimers (light grey), and each I32 material comprises 20 identical trimers (dark grey) and 30 identical dimers (light grey), with the designed interface formed between these oligomeric building blocks. All renderings were generated using PyMOL® Schrödinger, LLC. -
FIG. 3 . Sodium dodecyl sulfate polyacrylamide electrophoresis (SDS-PAGE) and mass spectrometry analysis. Pairs of proteins encoding each material were co-expressed (as described in the Methods of Production) in E. coli, lysed, and purified via nickel-affinity chromatography followed by gel filtration with a Superose® 6 10/300 GL column (GE Life Sciences). (A) The resulting samples were subjected to SDS-PAGE followed by staining with GelCode® Blue Stain Reagent (Pierce Biotechnology, Inc.). The left lane in each panel contains protein molecular weight standards; the approximate molecular weights in kilodaltons are indicated directly to the left of each band. The right lanes in each panel contain the purified samples. For all of the materials except I52-03, clear bands, of similar staining intensity and near the expected molecular weights of each protein subunit, are present for each of the two proteins comprising the purified materials. (B) While only one band (near the expected molecular weight of 27 kDa for the dimer subunit) is clearly distinguishable for I52-03 via SDS-PAGE, mass spectrometry analysis shows that the other protein subunit is also present in the sample; the mass spectrometry peak at 21,029 Da matches closely with the expected molecular weight of 21,026 Da for the pentamer subunit with loss of the initiator methionine, a common post-translational modification. -
FIG. 4 . Negative stain averages. Averages have been obtained of the (A) I53-40, (B) I53-50, (C) I52-03, and (D) I32-06 nanostructures and found to match well with the design models. Raw negative stain micrographs from which the averages were generated are shown on the left side of each panel. Averages (left), along with renderings from the design models (right), are shown on the right side of each panel. Views are shown corresponding approximately to the 5-fold, 3-fold, and 2-fold symmetry axes. -
FIG. 5 . X-ray crystallography. X-ray crystal structures (bottom) ranging from 3.5 to 5.0 Å resolution have been obtained for three of the designed materials, (A) I53-40, (B) I52-32, and (C) I32-28, and found to match closely with the design models (top). Each structure is shown using a ribbon-style rendering. Views of the I53 and I52 designs and crystal structures (panels A and B) are shown looking down one of the 5-fold symmetry axes, while the I32 design model and crystal structure (panel C) are shown looking down one of the 3-fold symmetry axes. Each crystal structure contains only a portion of the full icosahedron in the asymmetric unit. Crystal lattice symmetry was applied to generate the full icosahedra shown in the bottom panel. The I53-40 design model and crystal structure (panel A) comprise 12 pentamers (dark grey) and 20 trimers (light grey), while the I52-32 design model and crystal structure (panel B) comprise 12 pentamers (dark grey) and 30 dimers (light grey), and the I32-28 design model and crystal structure (panel C) comprise 20 trimers (dark grey) and 30 dimers (light grey). All renderings were generated using PyMOL® Schrödinger, LLC. -
FIG. 6 . In vitro assembly of I53-50A.1PosT1+I53-50B.4PosT1 in the presence of 400 nucleotide (nt) ssDNA leads to encapsulation and protection of the ssDNA. Mixtures of 26 ng/μL ssDNA and various proteins were analyzed by agarose gel electrophoretic mobility shift assay (EMSA) after incubation for 16 hours to determine the ability of mixtures of I53-50A.1PosT1+I53-50B.4PosT1 to encapsulate the ssDNA (left; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein). Mixtures of both components (lanes labeled “Components titration” are mixtures of I53-50A.1PosT1+I53-50B.4PosT1 at 2, 4, 6, 8, 10 and 12 μM) with the DNA shift the DNA such that it migrates similarly to SEC-purified I53-50A.1PosT1+I53-50B.4PosT1 nanoparticles (upper band), while mixtures of DNA with only one protein component or the other do not. The mixtures were then incubated with 25 μg/mL DNase I for 1 hour at room temperature in order to evaluate the ability of the in vitro-assembled nanoparticles to protect the ssDNA cargo from degradation (right; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein). The DNA that co-migrates with the protein in mixtures of both components (I53-50A.1PosT1+I53-50B.4PosT1; lanes labeled “Components titration” are mixtures at 2, 4, 6, 8, 10 and 12 μM) is largely protected from DNase challenge, while free ssDNA and the mixture of ssDNA+I53-50B.4PosT1 are not. The mixture of ssDNA+I53-50A.1PosT1 is weakly protected, but migrates as a diffuse smear on the gel. Overall, the data show that the ssDNA is encapsulated in nanoparticles formed by I53-50A.1PosT1+I53-50B.4PosT1, which forms a barrier that prevents degradation of the ssDNA by DNase. -
FIG. 7 . In vitro assembly of I53-50A.1PosT1+I53-50B.4PosT1 in the presence of 1600 nucleotide (nt) ssDNA leads to encapsulation and protection of the ssDNA. Mixtures of 35.2 ng/μL ssDNA and various proteins were analyzed by agarose electrophoretic mobility shift assay (EMSA) after incubation for 16 hours to determine the ability of mixtures of I53-50A.1PosT1+I53-50B.4PosT1 to encapsulate the ssDNA (left; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein). Mixtures of both components (lanes labeled “Components titration” are mixtures of I53-50A.1PosT1+153-50B.4PosT1 at 2, 4, 6, 8, 10 and 12 μM) with the DNA shift the DNA such that it migrates similarly to SEC-purified I53-50A.1PosT1+I53-50B.4PosT1 nanoparticles (upper band), while mixtures of DNA with only one protein component or the other do not. The mixtures were then incubated with 25 μg/mL DNase I for 1 hour at room temperature in order to evaluate the ability of the in vitro-assembled nanoparticles to protect the ssDNA cargo from degradation (right; the upper image of the gel is after staining for DNA, while the lower image of the gel is after staining for protein). The DNA that co-migrates with the protein in mixtures of both components (I53-50A.1PosT1+I53-50B.4PosT1; lanes labeled “Components titration” are mixtures at 2, 4, 6, 8, 10 and 12 μM) is largely protected from DNase challenge, while free ssDNA and the mixture of ssDNA+I53-50B.4PosT1 are not. The mixture of ssDNA+I53-50A.1PosT1 is weakly protected, but migrates as a diffuse smear on the gel. Overall, the data show that the ssDNA is encapsulated in nanoparticles formed by I53-50A.1PosT1+I53-50B.4PosT1, which forms a barrier that prevents degradation of the ssDNA by DNase. - All references cited are herein incorporated by reference in their entirety. Within this application, unless otherwise stated, the techniques utilized may be found in any of several well-known references such as: Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San Diego, Calif.), “Guide to Protein Purification” in Methods in Enzymology (M. P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press, San Diego, Calif.), Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, N.Y.), Gene Transfer and Expression Protocols, pp. 109-128, ed. E. J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, Tex.).
- As used herein, the singular forms “a”, “an” and “the” include plural referents unless the context clearly dictates otherwise. “And” as used herein is interchangeably used with “or” unless expressly stated otherwise.
- As used herein, the amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gln; Q), glycine (Gly; G), histidine (His; H), isoleucine (Ile; I), leucine (Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V). As used herein, “about” means +/−5% of the recited parameter.
- All embodiments of any aspect of the invention can be used in combination, unless the context clearly dictates otherwise.
- Unless the context clearly requires otherwise, throughout the description and the claims, the words ‘comprise’, ‘comprising’, and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say, in the sense of “including, but not limited to”. Words using the singular or plural number also include the plural and singular number, respectively. Additionally, the words “herein,” “above,” and “below” and words of similar import, when used in this application, shall refer to this application as a whole and not to any particular portions of the application.
- The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While the specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize.
- In a first aspect, the invention provides isolated polypeptide comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34. The isolated polypeptides of the invention can be used, for example, to prepare the nanostructures of the invention. As described in the examples that follow, the polypeptides of the invention were designed for their ability to self-assemble in pairs to form nanostructures, such as icosahedral nanostructures. The design involved design of suitable interface residues for each member of the polypeptide pair that can be assembled to form the nanostructure. The nanostructures of the invention include symmetrically repeated, non-natural, non-covalent polypeptide-polypeptide interfaces that orient a first assembly and a second assembly into a nanostructure, such as one with an icosahedral symmetry. Starting proteins were those derived from pentameric, trimeric, and dimeric crystal structures from the Protein Data Bank (PDB), along with a small number of crystal structures of de novo designed proteins not yet deposited in the PDB. Thus, each of the polypeptides of the present invention includes one or more modifications at “interface residues” compared to the starting proteins, permitting the polypeptides of the invention to, for example, form icosahedral nanostructures as described herein. Table 1 provides the amino acid sequence of exemplary polypeptides of the invention; the right hand column in Table 1 identifies the residue numbers in each exemplary polypeptide that were identified as present at the interface of resulting assembled nanostructures (i.e.: “identified interface residues”). As can be seen, the number of interface residues for the exemplary polypeptides of SEQ ID NO:1-34 range from 4-13. In various embodiments, the isolated polypeptides of the invention comprise an amino acid sequence that is at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical over its length, and identical at least at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or 13 identified interface positions (depending on the number of interface residues for a given polypeptide), to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34. In other embodiments, the isolated polypeptides of the invention comprise an amino acid sequence that is at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical over its length, and identical at least at 20%, 25%, 33%, 40%, 50%, 60%, 70%, 75%, 80%, 90%, or 100% of the identified interface positions, to the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS: 1-34. In further embodiments, the polypeptides of the invention comprise or consist of a polypeptide having the amino acid sequence of a polypeptide selected from the group consisting of SEQ ID NOS:1-40.
-
TABLE 1 Name Amino Acid Sequence Identified interface residues I53-34A MEGMDPLAVLAESRLLPLLTVRGGEDLAGLATVLELMGVGALEITL I53-34A: SEQ ID RTEKGLEALKALRKSGLLLGAGTVRSPKEAEAALEAGAAFLVSPGL 28, 32, 36, 37, NO: 1 LEEVAALAQARGVPYLPGVLTPIEVERALALGLSALKFFPAEPFQG 186, 188, 191, 192, 195 VRVLRAYAEVFPEVRFLPTGGIKEEHLPHYAALPNLLAVGGSWLLQ GDLAAVMKKVKAAKALLSPQAPG I53-34B MTKKVGIVDTTFARVDMAEAAIRTLKALSPNIKIIRKTVPGIKDLPV I53-34B: SEQ ID ACKKLLEEEGCDIVMALGMPGKAEKDKVCAHEASLGLMLAQLMT 19, 20, 23, 24, 27, 109, 113, 116, NO: 2 NKHIIEVFVHEDEAKDDDELDILALVRAIEHAANVYYLLFKPEYLTR 117, 120, 124, 148 MAGKGLRQGREDAGPARE I53-40A MTKKVGIVDTTFARVDMASAAILTLKMESPNIKIIRKTVPGIKDLPV I53-40A: SEQ ID ACKKLLEEEGCDIVMALGMPGKAEKDKVCAHEASLGLMLAQLMT 20, 23, 24, 27, 28, 109, 112, 113, NO: 3 NKHIIEVFVHEDEAKDDAELKILAARRAIEHALNVYYLLFKPEYLIR 116, 120, 124 MAGKGLRQGFEDAGPARE I53-40B MSTINNQLKALKVIPVIAIDNAEDIIPLGKVLAENGLPAAEITFRSSAA I53-40B: 47, 51, 54, 58, 74, 102 SEQ ID VKAIMLLRSAQPEMLIGAGTILNGVQALAAKEAGATFVVSPGFNPN NO: 4 TVRACQIIGIDIVPGVNNPSTVEAALEMGLTTLKFFPAEASGGISMV KSLVGPYGDIRLMPTGGITPSNIDNYLAIPQVLACGGTWMVDKKLV TNGEWDEIARLTREIVEQVNP I53-47A MPIFTLNTNIKATDVPSDFLSLTSRLVGLILSKPGSYVAVHINTDQQL I53-47A: 22, 25, 29, 72, 79, 86, 87 SEQ ID SFGGSTNPAAFGTLMSIGGIEPSKNRDHSAVLFDHLNAMLGIPKNR NO: 5 MYIHFVNLNGDDVGWNGTTF I53-47B MNQHSHKDYETVRIAVVRARWHADIVDACVEAFEIAMAAIGGDRF I53-47B: SEQ ID AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEFV 28, 31, 35, 36, 39, 131, 132, 135, NO: 6 ASAVIDGMMNVQLSTGVPVLSAVLTPHRYRDSAEHHRFFAAHFAV 139, 146 KGVEAARACIEILAAREKIAA I53-50A MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVP I53-50A: 25, 29, 33, 54, 57 SEQ ID DADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLD NO: 7 EEISQFCKEKGVFYMPGVMTPTELVKAMKLGHTILKLFPGEVVGPQ FVKAMKGPFPNVKFVPTGGVNLDNVCEWFKAGVLAVGVGSALVK GTPDEVREKAKAFVEKIRGCTE I53-50B MNQHSHKDYETVRIAVVRARWHAEIVDACVSAFEAAMADIGGDR I53-50B: SEQ ID FAVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEF 24, 28, 36, 124, 125, 127, 128, 129, NO: 8 VASAVIDGMMNVQLSTGVPVLSAVLTPHRYRDSDAHTLLFLALFA 131, 132, 133, 135, 139 VKGMEAARACVEILAAREKIAA I53-51A MFTKSGDDGNTNVINKRVGKDSPLVNFLGDLDELNSFIGFAISKIPW I53-51 A: SEQ ID EDMKKDLERVQVELFEIGEDLSTQSSKKKIDESYVLWLLAATAIYRI 80, 83, 86, 87, 88, 90, 91, 94, 166, NO: 9 ESGPVKLFVIPGGSEEASVLHVTRSVARRVERNAVKYTKELPEINR 172, 176 MIIVYLNRLSSLLFAMALVANKRRNQSEKIYEIGKSW I53-51B MNQHSHKDYETVRIAVVRARWHADIVDQCVRAFEEAMADAGGDR I53-51B: SEQ ID FAVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEF 31, 35, 36, 40, 122, 124, 128, 131, NO: 10 VASAVIDGMMNVQLSTGVPVLSAVLTPHRYRSSREHHEFFREHFM 135, 139, 143, 146, 147 VKGVEAAAACITILAAREKIAA I52-03A MGHTKGPTPQQHDGSALRIGIVHARWNKTIIMPLLIGTIAKLLECGV I52-03A: 28, 32, 36, 39, 44, 49 SEQ ID KASNIVVQSVPGSWELPIAVQRLYSASQLQTPSSGPSLSAGDLLGSS NO: 11 TTDLTALPTTTASSTGPFDALIAIGVLIKGETMHFEYIADSVSHGLMR VQLDTGVPVIFGVLTVLTDDQAKARAGVIEGSHNHGEDWGLAAVE MGVRRRDWAAGKTE I52-03B MYEVDHADVYDLFYLGRGKDYAAEASDIADLVRSRTPEASSLLDV I52-03B: 94, 115, 116, 206, 213 SEQ ID ACGTGTHLEHFTKEFGDTAGLELSEDMLTHARKRLPDATLHQGDM NO: 12 RDFQLGRKFSAVVSMFSSVGYLKTVAELGAAVASFAEHLEPGGVV VVEPWWFPETFADGWVSADVVRRDGRTVARVSHSVREGNATRME VHFTVADPGKGVRHFSDVHLITLFHQREYEAAFMAAGLRVEYLEG GPSGRGLFVGVPA I52-32A MGMKEKFVLIITHGDFGKGLLSGAEVIIGKQENVHTVGLNLGDNIE I52-32A: SEQ ID KVAKEVMRIIIAKLAEDKEIIIVVDLFGGSPFNIALEMMKTFDVKVIT 47, 49, 53, 54, 57, 58, 61, 83, 87, NO: 13 GINMPMLVELLTSINVYDTTELLENISKIGKDGIKVIEKSSLKM 88 I52-32B MKYDGSKLRIGILHARWNLEIIAALVAGAIKRLQEFGVKAENIIIETV I52-32B: 19, 20, 23, 30, 40 SEQ ID PGSFELPYGSKLFVEKQKRLGKPLDAIIPIGVLIKGSTMHFEYICDSTT NO: 14 HQLMKLNFELGIPVIFGVLTCLTDEQAEARAGLIEGKMHNHGEDW GAAAVEMATKFN I52-33A MAVKGLGEVDQKYDGSKLRIGILHARWNRKIILALVAGAVLRLLEF I52-33A: 33, 41, 44, 50 SEQ ID GVKAENIIIETVPGSFELPYGSKLFVEKQKRLGKPLDAIIPIGVLIKGS NO: 15 TMHFEYICDSTTHQLMKLNFELGIPVIFGVLTCLTDEQAEARAGLIE GKMHNHGEDWGAAAVEMATKFN I52-33B MGANWYLDNESSRLSFTSTKNADIAEVHRFLVLHGKVDPKGLAEV I52-33B: SEQ ID EVETESISTGIPLRDMLLRVLVFQVSKFPVAQINAQLDMRPINNLAP 61, 63, 66, 67, 72, 147, 148, 154, NO: 16 GAQLELRLPLTVSLRGKSHSYNAELLATRLDERRFQVVTLEPLVIHA 155 QDFDMVRAFNALRLVAGLSAVSLSVPVGAVLIFTAR I32-06A MTDYIRDGSAIKALSFAIILAEADLRHIPQDLQRLAVRVIHACGMVD I32-06A: 9, 12, 13, 14, 20, 30, 33, SEQ ID VANDLAFSEGAGKAGRNALLAGAPILCDARMVAEGITRSRLPADN 34 NO: 17 RVIYTLSDPSVPELAKKIGNTRSAAALDLWLPHIEGSIVAIGNAPTAL FRLFELLDAGAPKPALIIGMPVGFVGAAESKDELAANSRGVPYVIVR GRRGGSAMTAAAVNALASERE I32-06B MITVFGLKSKLAPRREKLAEVIYSSLHLGLDIPKGKHAIRFLCLEKED I32-06B: SEQ ID FYYPFDRSDDYTVIEINLMAGRSEETKMLLIFLLFIALERKLGIRAHD 24, 71, 73, 76, 77, 80, 81, 84, NO: 18 VEITIKEQPAHCWGFRGRTGDSARDLDYDIYV 85, 88, 114, 118 I32-19A MGSDLQKLQRFSTCDISDGLLNVYNIPTGGYFPNLTAISPPQNSSIVG I32-19A: SEQ ID TAYTVLFAPIDDPRPAVNYIDSVPPNSILVLALEPHLQSQFHPFIKITQ 208, 213, 218, 222, 225, 226, 229, 23 NO: 19 AMYGGLMSTRAQYLKSNGTVVFGRIRDVDEHRTLNHPVFAYGVGS 3 CAPKAVVKAVGTNVQLKILTSDGVTQTICPGDYIAGDNNGIVRIPVQ ETDISKLVTYIEKSIEVDRLVSEAIKNGLPAKAAQTARRMVLKDYI I32-19B MSGMRVYLGADHAGYELKQAIIAFLKMTGHEPIDCGALRYDADDD I32-19B: SEQ ID YPAFCIAAATRTVADPGSLGIVLGGSGNGEQIAANKVPGARCALAW 20, 23, 24, 27, 117, 118, 122, 125 NO: 20 SVQTAALAREHNNAQLIGIGGRMHTLEEALRIVKAFVTTPWSKAQR HQRRIDILAEYERTHEAPPVPGAPA I32-28A MGDDARIAAIGDVDELNSQIGVLLAEPLPDDVRAALSAIQHDLFDL I32-28A: SEQ ID GGELCIPGHAAITEDHLLRLALWLVHYNGQLPPLEEFILPGGARGAA 60, 61, 64, 67, 68, 71, 110, 120, NO: 21 LAHVCRTVCRRAERSIKALGASEPLNIAPAAYVNLLSDLLFVLARVL 123, 124, 128 NRAAGGADVLWDRTRAH I32-28B MILSAEQSFTLRHPHGQAAALAFVREPAAALAGVQRLRGLDSDGE I32-28B: SEQ ID QVWGELLVRVPLLGEVDLPFRSEIVRTPQGAELRPLTLTGERAWVA 35, 36, 54, 122, 129, 137, 140, 141, NO: 22 VSGQATAAEGGEMAFAFQFQAHLATPEAEGEGGAAFEVMVQAAA 144, 148 GVTLLLVAMALPQGLAAGLPPA I53-40A.1 MTKKVGIVDTTFARVDMASAAILTLKMESPNIKIIRKTVPGIKDLPV I53-40A: SEQ ID ACKKLLEEEGCDIVMALGMPGKKEKDKVCAHEASLGLMLAQLMT 20, 23, 24, 27, 28, 109, 112, 113, NO: 23 NKHIIEVFVHEDEAKDDAELKILAARRAIEHALNVYYLLFKPEYLIR 116, 120, 124 MAGKGLRQGFEDAGPARE I53-40B.1 MDDINNQLKRLKVIPVIAIDNAEDIIPLGKVLAENGLPAAEITFRSSA I53-40B: 47, 51, 54, 58, 74, 102 SEQ ID AVKAIMLLRSAQPEMLIGAGTILNGVQALAAKEAGADFVVSPGFNP NO: 24 NTVRACQIIGIDIVPGVNNPSTVEQALEMGLTTLKFFPAEASGGISM VKSLVGPYGDIRLMPTGGITPDNIDNYLAIPQVLACGGTWMVDKKL VRNGEWDEIARLTREIVEQVNP I53-47A.1 MPIFTLNTNIKADDVPSDFLSLTSRLVGLILSKPGSYVAVHINTDQQL I53-47A: 22, 25, 29, 72, 79, 86, 87 SEQ ID SFGGSTNPAAFGTLMSIGGIEPDKNRDHSAVLFDHLNAMLGIPKNR NO: 25 MYIHFVNLNGDDVGWNGTTF I53- MPIFTLNTNIKADDVPSDFLSLTSRLVGLILSEPGSYVAVHINIDQQL I53-47A: 22, 25, 29, 72, 79, 86, 87 47A.1NegT SFGGSTNPAAFGTLMSIGGIEPDKNEDHSAVLFDHLNAMLGIPKNR 2 MYIHFVDLDGDDVGWNGTTF SEQ ID NO: 26 I53-47B.1 MNQHSHKDHETVRIAVVRARWHADIVDACVEAFEIAMAAIGGDRF I53-47B: SEQ ID AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEFV 28, 31, 35, 36, 39, 131, 132, NO: 27 ASAVIDGMMNVQLDTGVPVLSAVLTPHRYRDSDEHHRFFAAHFAV 135, 139, 146 KGVEAARACIEILNAREKIAA I53- MNQHSHKDHETVRIAVVRARWHADIVDACVEAFEIAMAAIGGDRF I53-47B: 47B.1NegT AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVDGGIYDHEFV 28, 31, 35, 36, 39, 131, 132, 2 ASAVIDGMMNVQLDTGVPVLSAVLTPHEYEDSDEDHEFFAAHFAV 135, 139, 146 SEQ ID KGVEAARACIEILNAREKIAA NO: 28 I53-50A.1 MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVP I53-50A: 25, 29, 33, 54, 57 SEQ ID DADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLD NO: 29 EEISQFCKEKGVFYMPGVMTPTELVKAMKLGHDILKLFPGEVVGPQ FVKAMKGPFPNVKFVPTGGVNLDNVCEWFKAGVLAVGVGDALVK GDPDEVREKAKKFVEKIRGCTE I53- MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVP I53-50A: 25, 29, 33, 54, 57 50A.1NegT DADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLD 2 EEISQFCKEKGVFYMPGVMTPTELVKAMKLGHDILKLFPGEVVGPE SEQ ID FVEAMKGPFPNVKFVPTGGVDLDDVCEWFDAGVLAVGVGDALVE NO: 30 GDPDEVREDAKEFVEEIRGCTE I53- MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEITFTVP I53-50A: 25, 29, 33, 54, 57 50A.lPosT DADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGAEFIVSPHLD 1 EEISQFCKEKGVFYMPGVMTPTELVKAMKLGHDILKLFPGEVVGPQ SEQ ID FVKAMKGPFPNVKFVPTGGVNLDNVCKWFKAGVLAVGVGKALV NO: 31 KGKPDEVREKAKKFVKKIRGCTE I53-50B.1 MNQHSHKDHETVRIAVVRARWHAEIVDACVSAFEAAMRDIGGDRF I53-50B: SEQ ID AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEFV 24, 28, 36, 124, 125, 127, 128, NO: 32 ASAVIDGMMNVQLDTGVPVLSAVLTPHRYRDSDAHTLLFLALFAV 129, 131, 132, 133, 135, 139 KGMEAARACVEILAAREKIAA I53- MNQHSHKDHETVRIAVVRARWHAEIVDACVSAFEAAMRDIGGDRF I53-50B: 50B.1NegT AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVDGGIYDHEFV 24, 28, 36, 124, 125, 127, 128, 2 ASAVIDGMMNVQLDTGVPVLSAVLTPHEYEDSDADTLLFLALFAV 129, 131, 132, 133, 135, 139 SEQ ID KGMEAARACVEILAAREKIAA NO: 33 I53- MNQHSHKDHETVRIAVVRARWHAEIVDACVSAFEAAMRDIGGDRF I53-50B: 50B.4PosT AVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVVNGGIYRHEFV 24, 28, 36, 124, 125, 127, 128, 1 ASAVINGMMNVQLNTGVPVLSAVLTPHNYDKSKAHTLLFLALFAV 129, 131, 132, 133, 135, 139 SEQ ID KGMEAARACVEILAAREKIAA NO: 34 -
I53-40 A genus (SEQ ID NO: 35) MTKKVGIVDTTFARVDMASAAILTLKMESPNIKIIRKTVPGIK DLPVACKKLLEEEGCDIVMALGMPGK(A/K)EKDKVCAHEASL GLMLAQLMTNKHIIEVFVHEDEAKDDAELKILAARRAIEHALN VYYLLFKPEYLTRMAGKGLRQGFEDAGPARE I53-40 B genus (SEQ ID NO: 36) M(S/D)(T/D)INNQLK(A/R)LKVIPVIAIDNAEDIIPLGKV LAENGLPAAEITFRSSAAVKAIMLLRSAQPEMLIGAGTILNGV QALAAKEAGA(T/D)FVVSPGFNPNTVRACQIIGIDIVPGVNN PSTVE(A/Q)ALEMGLTTLKFFPAEASGGISMVKSLVGPYGDI RLMPTGGITP(S/D)NIDNYLAIPQVLACGGTWMVDKKLV (T/R)NGEWDEIARLTREIVEQVNP I53-47A genus (SEQ ID NO: 37) MPIFTLNTNIKA(T/D)DVPSDFLSLTSRLVGLILS(K/E)PG SYVAVHINTDQQLSFGGSTNPAAFGTLMSIGGIEP(S/D)KN (R/E)DHSAVLFDHLNAMLGIPKNRMYIHFV(N/D)L(N/D)G DDVGWNGTTF I53-47B genus (SEQ ID NO: 38) MNQHSHKD(Y/H)ETVRIAVVRARWHADIVDACVEAFEIAMAA IGGDRFAVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFVV (N/D)GGIY(R/D)HEFVASAVIDGMMNVQL(S/D)TGVPVL SAVLTPH(R/E)Y(R/E)DS(A/D)E(H/D)H(R/E)FFAAH FAVKGVEAARACIEIL(A/N)AREKIAA I53-50A genus (SEQ ID NO: 39) MKMEELFKKHKIVAVLRANSVEEAIEKAVAVFAGGVHLIEIT FTVPDADTVIKALSVLKEKGAIIGAGTVTSVEQCRKAVESGA EFIVSPHLDEEISQFCKEKGVFYMPGVMTPTELVKAMKLGH (T/D)ILKLFPGEVVGP(Q/E)FV(K/E)AMKGPFPNVKFVP TGGV(N/D)LD(N/D)VC(E/K)WF(K/D)AGVLAVGVG (S/K/D)ALV(K/E)G(T/D/K)PDEVRE(K/D)AK(A/E/K) FV(E/K)(K/E)IRGCTE I53-50B genus (SEQ ID NO: 40) MNQHSHKD(Y/H)ETVRIAVVRARWHAEIVDACVSAFEAAM (A/R)DIGGDRFAVDVFDVPGAYEIPLHARTLAETGRYGAV LGTAFVV(N/D)GGIY(R/D)HEFVASAVI(D/N)GMMNVQL (S/D/N)TGVPVLSAVLTPH(R/E/N)Y(R/D/E)(D/K)S (D/K)A(H/D)TLLFLALFAVKGMEAARACVEILAAREKIAA - As is the case with proteins in general, the polypeptides are expected to tolerate some variation in the designed sequences without disrupting subsequent assembly into nanostructures: particularly when such variation comprises conservative amino acid substitutions. As used here, “conservative amino acid substitution” means that: hydrophobic amino acids (Ala, Cys, Gly, Pro, Met, See, Sme, Val, Ile, Leu) can only be substituted with other hydrophobic amino acids; hydrophobic amino acids with bulky side chains (Phe, Tyr, Trp) can only be substituted with other hydrophobic amino acids with bulky side chains; amino acids with positively charged side chains (Arg, His, Lys) can only be substituted with other amino acids with positively charged side chains; amino acids with negatively charged side chains (Asp, Glu) can only be substituted with other amino acids with negatively charged side chains; and amino acids with polar uncharged side chains (Ser, Thr, Asn, Gln) can only be substituted with other amino acids with polar uncharged side chains.
- As will be apparent to those of skill in the art, the ability to widely modify surface amino acid residues without disruption of the polypeptide structure permits many types of modifications to endow the resulting self-assembled nanostructures with a variety of functions. In one non-limiting embodiment, the polypeptides of the invention can be modified to facilitate covalent linkage to a “cargo” of interest. In one non-limiting example, the polypeptides can be modified, such as by introduction of various cysteine residues at defined positions to facilitate linkage to one or more antigens of interest, such that a nanostructure of the polypeptides would provide a scaffold to provide a large number of antigens for delivery as a vaccine to generate an improved immune response. In some embodiments, some or all native cysteine residues that are present in the polypeptides but not intended to be used for conjugation may be mutated to other amino acids to facilitate conjugation at defined positions. In another non-limiting embodiment, the polypeptides of the invention may be modified by linkage (covalent or non-covalent) with a moiety to help facilitate “endosomal escape.” For applications that involve delivering molecules of interest to a target cell, such as targeted delivery, a critical step can be escape from the endosome—a membrane-bound organelle that is the entry point of the delivery vehicle into the cell. Endosomes mature into lysosomes, which degrade their contents. Thus, if the delivery vehicle does not somehow “escape” from the endosome before it becomes a lysosome, it will be degraded and will not perform its function. There are a variety of lipids or organic polymers that disrupt the endosome and allow escape into the cytosol. Thus, in this embodiment, the polypeptides can be modified, for example, by introducing cysteine residues that will allow chemical conjugation of such a lipid or organic polymer to the monomer or resulting assemly surface. In another non-limiting example, the polypeptides can be modified, for example, by introducing cysteine residues that will allow chemical conjugation of fluorophores or other imaging agents that allow visualization of the nanostructures of the invention in vitro or in vivo.
- Surface amino acid residues on the polypeptides can be mutated in order to improve the stability or solubility of the protein subunits or the assembled nanostructures. As will be known to one of skill in the art, if the polypeptide has significant sequence homology to an existing protein family, a multiple sequence alignment of other proteins from that family can be used to guide the selection of amino acid mutations at non-conserved positions that can increase protein stability and/or solubility, a process referred to as consensus protein design (9).
- Surface amino acid residues on the polypeptides can be mutated to positively charged (Arg, Lys) or negatively charged (Asp, Glu) amino acids in order to endow the protein surface with an overall positive or overall negative charge. In one non-limiting embodiment, surface amino acid residues on the polypeptides can be mutated to endow the interior surface of the self-assembling nanostructure with a high net charge. Such a nanostructure can then be used to package or encapsulate a cargo molecule with the opposite net charge due to the electrostatic interaction between the nanostructure interior surface and the cargo molecule. In one non-limiting embodiment, surface amino acid residues on the polypeptides can be mutated primarily to Arginine or Lysine residues in order to endow the interior surface of the self-assembling nanostructure with a net positive charge. Solutions containing the polypeptides can then be mixed in the presence of a nucleic acid cargo molecule such as a dsDNA, ssDNA, dsRNA, ssRNA, cDNA, miRNA, siRNA, shRNA, piRNA, or other nucleic acid in order to encapsulate the nucleic acid inside the self-assembling nanostructure. Such a nanostructure could be used, for example, to protect, deliver, or concentrate nucleic acids.
- Table 2 lists surface amino acid residue numbers for each exemplary polypeptide of the invention denoted by SEQ ID NOS: 1-34. Thus, in various embodiments, 1 or more (at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, or more) of these surface residues may be modified in the polypeptides of the invention.
-
TABLE 2 Amino Acid Surface residues Name Sequence no tnearinterface I53-34A MEGMDPLAVLAESRLLPLLT I53-34A: SEQ ID VRGGEDLAGLATVLELMGVG 6, 8, 9, 12, 14, 22, 25, 48, 49, 50, NO: 1 ALEITLRTEKGLEALKALRK 52, 53, 56, 73, 74, 81, 94, 95, 101, SGLLLGAGTVRSPKEAEAAL 102, 103, 104, 119, 122, 137, 140, EAGAAFLVSPGLLEEVAALA 143, 147, 150, 151, 153, 161, 162, QARGVPYLPGVLTPIEVERA 163, 164, 166, 167, 170, 172, 184, LALGLSALKFFPAEPFQGVR 193, 198, 199, 200, 202 VLRAYAEVFPEVRFLPTGGI KEEHLPHYAALPNLLAVGGS WLLQGDLAAVMKKVKAAKAL LSPQAPG I53-34B MTKKVGIVDTTFARVDMAEA I53-34B: SEQ ID AIRTLKALSPNIKIIRKTVP 3, 12, 31, 33, 35, 36, 51, 54, 55, NO: 2 GIKDLPVACKKLLEEEGCDI 56, 59, 69, 70, 71, 74, 93, 103, VMALGMPGKAEKDKVCAHEA 106, 107, 108, 131, 132, 133, 134, SLGLMLAQLMTNKHIIEVFV 138, 142, 153 HEDEAKDDDELDILALVRAI EHAANVYYLLFKPEYLTRMA GKGLRQGREDAGPARE I53-40A MTKKVGIVDTTFARVDMASA I53-40A: SEQ ID AILTLKMESPNIKIIRKTVP 3, 4, 31, 33, 35, 36, 37, 51, 54, NO: 3 GIKDLPVACKKLLEEEGCDI 55, 56, 57, 59, 69, 70, 71, 74, VMALGMPGKAEKDKVCAHEA 93, 103, 106, 118, 127, 128, 131, SLGLMLAQLMTNKHIIEVFV 132, 133, 134, 135, 138, 139, 142, HEDEAKDDAELKILAARRAI 150, 153 EHALNVYYLLFKPEYLIRMA GKGLRQGFEDAGPARE I53-40B MSTINNQLKALKVIPVIAID I53-40B: SEQ ID NAEDIIPLGKVLAENGLPAA 2, 3, 7, 9, 10, 12, 20, 21, 23, 26, NO: 4 EITFRSSAAVKAIMLLRSAQ 27, 30, 34, 38, 45, 60, 62, 75, 85, PEMLIGAGTILNGVQALAAK 94, 95, 122, 124, 126, 134, 139, 143, EAGATFVVSPGFNPNTVRAC 151, 153, 161, 163, 166, 167, 170, QIIGIDIVPGVNNPSTVEAA 172, 180, 184, 185, 186, 189, 190, LEMGLTTLKFFPAEASGGIS 192, 193, 194, 195, 198, 201, 202, MVKSLVGPYGDIRLMPTGGI 205, 208, 209 TPSNIDNYLAIPQVLACGGT WMVDKKLVTNGEWDEIARLT REIVEQVNP I53-47A MPIFTLNTNIKATDVPSDFL I53-47A: SEQ ID SLTSRLVGLILSKPGSYVAV 11, 13, 14, 17, 34, 36, 37, 45, 47, NO: 5 HINTDQQLSFGGSTNPAAFG 54, 55, 56, 65, 69, 70, 71, 74, 91, TLMSIGGIEPSKNRDHSAVL 92, 93, 101, 103, 105, 109, 110, 112, FDHLNAMLGIPKNRMYIHFV 114 NLNGDDVGWNGTTF I53-47B MNQHSHKDYETVRIAVVRAR I53-47B: SEQ ID WHADIVDACVEAFEIAMAAI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, NO: 6 GGDRFAVDVFDVPGAYEIPL 24, 43, 44, 51, 63, 67, 70, 85, 87, HARTLAETGRYGAVLGTAFV 101, 105, 122, 123, 124, 125, 126, VNGGIYRHEFVASAVIDGMM 147, 152, 153, 154 NVQLSTGVPVLSAVLTPHRY RDSAEHHRFFAAHFAVKGVE AARACIEILAAREKIAA I53-50A MKMEELFKKHKIVAVLRANS I53-50A: SEQ ID VEEAIEKAVAVFAGGVHLIE 4, 5, 6, 8, 9, 11, 17, 19, 23, 37, 46, NO: 7 ITFTVPDADTVIKALSVLKE 47, 59, 74, 77, 78, 81, 94, 95, 98, KGAIIGAGTVTSVEQCRKAV 101, 102, 103, 106, 119, 122, 126, ESGAEFIVSPHLDEEISQFC 139, 142, 145, 149, 150, 152, 160, KEKGVFYMPGVMTPTELVKA 161, 162, 163, 166, 169, 179, 183, MKLGHTILKLFPGEVVGPQF 185, 188, 191, 192, 194, 198, 199 VKAMKGPFPNVKFVPTGGVN LDNVCEWFKAGVLAVGVGSA LVKGTPDEVREKAKAFVEKI RGCTE I53-50B MNQHSHKDYETVRIAVVRAR I53-50B: SEQ ID WHAEIVDACVSAFEAAMADI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, NO: 8 GGDRFAVDVFDVPGAYEIPL 34, 38, 39, 40, 43, 44, 48, 51, 63, HARTLAETGRYGAVLGTAFV 67, 70, 87, 101, 105, 118, 143, 147, VNGGIYRHEFVASAVIDGMM 152, 153, 154 NVQLSTGVPVLSAVLTPHRY RDSDAHTLLFLALFAVKGME AARACVEILAAREKIAA I53-51A MFTKSGDDGNTNVINKRVGK I53-51A: SEQ ID DSPLVNFLGDLDELNSFIGF 19, 20, 24, 28, 46, 47, 51, 70, 71, 73, NO: 9 AISKIPWEDMKKDLERVQVE 74, 75, 76, 102, 122, 130, 133, 134, 135, LFEIGEDLSTQSSKKKIDES 136, 137, 140, 162, 163, 164, 165, YVLWLLAATAIYRIESGPVK 169, 175, 177 LFVIPGGSEEASVLHVTRSV ARRVERNAVKYTKELPEINR MIIVYLNRLSSLLFAMALVA NKRRNQSEKIYEIGKSW I53-51B MNQHSHKDYETVRIAVVRAR I53-51B: SEQ ID WHADIVDQCVRAFEEAMADA 6, 7, 8, 9, 10, 11, 13, 18, 21, 27, 34, NO: 10 GGDRFAVDVFDVPGAYEIPL 38, 43, 48, 63, 67, 70, 85, 87, 101, HARTLAETGRYGAVLGTAFV 118, 125, 126, 129, 152, 153, 154 VNGGIYRHEFVASAVIDGMM NVQLSTGVPVLSAVLTPHRY RSSREHHEFFREHFMVKGVE AAAACITILAAREKIAA I52-03A MGHTKGPTPQQHDGSALRIG I52-03A: SEQ ID IVHARWNKTIIMPLLIGTIA 6, 9, 10, 11, 13, 15, 16, 26, 48, 69, NO: 11 KLLECGVKASNIVVQSVPGS 75, 76, 78, 79, 111, 125, 127, 142, WELPIAVQRLYSASQLQTPS 146, 159, 160, 161, 162, 171, 175, SGPSLSAGDLLGSSTTDLTA 193, 194, 196, 197, 199, 200 LPTTTASSTGPFDALIAIGV LIKGETMHFEYIADSVSHGL MRVQLDTGVPVIFGVLTVLI DDQAKARAGVIEGSHNHGED WGLAAVEMGVRRRDWAAGKT E I52-03B MYEVDHADVYDLFYLGRGKD I52-03B: SEQ ID YAAEASDIADLVRSRTPEAS 2, 3, 5, 6, 8, 15, 17, 20, 22, 23, 26, NO: 12 SLLDVACGTGTHLEHFTKEF 27, 30, 33, 34, 35, 37, 38, 40, 54, 55, GDTAGLELSEDMLTHARKRL 57, 58, 59, 61, 62, 68, 70, 71, 74, 77, PDATLHQGDMRDFQLGRKFS 78, 79, 81, 82, 84, 86, 87, 91, 96, 97, AVVSMFSSVGYLKTVAELGA 98, 111, 127, 130, 131, 132, 141, 144, AVASFAEHLEPGGVVVVEPW 145, 148, 150, 154, 157, 158, 159, 160, WFPETFADGWVSADVVRRDG 161, 171, 172, 173, 174, 177, 187, 189, RTVARVSHSVREGNATRMEV 192, 198, 199, 222, 223, 224, 236 HFTVADPGKGVRHFSDVHLI TLFHQREYEAAFMAAGLRVE YLEGGPSGRGLFVGVPA I52-32A MGMKEKFVLIITHGDFGKGL I52-32A: SEQ ID LSGAEVIIGKQENVHTVGLN 3, 5, 15, 18, 30, 32, 35, 40, 41, 42, NO: 13 LGDNIEKVAKEVMRIIIAKL 44, 45, 65, 73, 79, 91, 103, 106, 109, AEDKEIIIVVDLFGGSPFNI 110, 111, 112, 114, 115, 118, 122, 123, ALEMMKTFDVKVITGINMPM 125, 126, 129, 131 LVELLTSINVYDTTELLENI SKIGKDGIKVIEKSSLKM I52-32B MKYDGSKLRIGILHARWNLE I52-32B: SEQ ID IIAALVAGAIKRLQEFGVKA 4, 6, 7, 9, 17, 32, 35, 42, 59, 63, 64, NO: 14 ENIIIETVPGSFELPYGSKL 66, 67, 68, 69, 70, 71, 73, 83, 85, 90, FVEKQKRLGKPLDAIIPIGV 106, 119, 120, 121, 122, 125, 131, 133, LIKGSTMHFEYICDSTTHQL 134, 135, 136, 154 MKLNFELGIPVIFGVLTCLT DEQAEARAGLIEGKMHNHGE DWGAAAVEMATKFN I52-33A MAVKGLGEVDQKYDGSKLRI I52-33A: SEQ ID GILHARWNRKIILALVAGAV 12, 14, 16, 17, 19, 26, 27, 46, 69, 73, NO: 15 LRLLEFGVKAENIIIETVPG 74, 76, 77, 78, 80, 81, 83, 93, 95, 100, SFELPYGSKLFVEKQKRLGK 116, 129, 130, 131, 132, 145, 164 PLDAIIPIGVLIKGSTMHFE YICDSTTHQLMKLNFELGIP VIFGVLTCLTDEQAEARAGL IEGKMHNHGEDWGAAAVEMA TKFN I52-33B MGANWYLDNESSRLSFTSTK I52-33B: SEQ ID NADIAEVHRFLVLHGKVDPK 4, 6, 10, 20, 21, 23, 24, 31, 32, 34, 36, NO: 16 GLAEVEVETESISTGIPLRD 39, 40, 42, 44, 46, 48, 56, 73, 77, 79, MLLRVLVFQVSKFPVAQINA 81, 83, 85, 88, 89, 91, 92, 96, 97, 99, QLDMRPINNLAPGAQLELRL 101, 103, 109, 110, 111, 112, 114, 124, PLTVSLRGKSHSYNAELLAT 125, 138, 140, 143, 158, 175 RLDERRFQVVTLEPLVIHAQ DFDMVRAFNALRLVAGLSAV SLSVPVGAVLIFTAR I32-06A MTDYIRDGSAIKALSFAIIL I32-06A: SEQ ID AEADLRHIPQDLQRLAVRVI 24, 26, 27, 41, 47, 50, 51, 56, 60, 63, NO: 17 HACGMVDVANDLAFSEGAGK 64, 67, 68, 77, 84, 85, 86, 91, 93, 98, AGRNALLAGAPILCDARMVA 99, 100, 101, 102, 105, 108, 109, 114, EGITRSRLPADNRVIYTLSD 123, 124, 125, 127, 135, 142, 145, 148, PSVPELAKKIGNTRSAAALD 149, 152, 153, 169, 172, 173, 176, 177, LWLPHIEGSIVAIGNAPTAL 180, 187, 189 FRLFELLDAGAPKPALIIGM PVGFVGAAESKDELAANSRG VPYVIVRGRRGGSAMTAAAV NALASERE I32-06B MITVFGLKSKLAPRREKLAE I32-06B: SEQ ID VIYSSLHLGLDIPKGKHAIR 8, 9, 10, 13, 14, 15, 16, 17, 20, 34, 36, NO: 18 FLCLEKEDFYYPFDRSDDYT 45, 46, 47, 50, 51, 53, 54, 57, 67, 70, VIEINLMAGRSEETKMLLIF 91, 93, 95, 105, 112 LLFIALERKLGIRAHDVEIT IKEQPAHCWGFRGRTGDSAR DLDYDIYV I32-19A MGSDLQKLQRFSTCDISDGL I32-19A: SEQ ID LNVYNIPTGGYFPNLTAISP 3, 4, 6, 7, 9, 10, 25, 27, 36, 40, 42, NO: 19 PQNSSIVGTAYTVLFAPIDD 43, 44, 49, 58, 59, 61, 62, 63, 70, 72, PRPAVNYIDSVPPNSILVLA 73, 74, 82, 84, 88, 89, 109, 110, 112, LEPHLQSQFHPFIKITQAMY 126, 127, 129, 130, 132, 146, 155, 156, GGLMSTRAQYLKSNGTVVFG 157, 159, 166, 169, 172, 189, 190, 192, RIRDVDEHRTLNHPVFAYGV 194, 195, 198, 201, 204, 215, 232 GSCAPKAVVKAVGTNVQLKI LTSDGVTQTICPGDYIAGDN NGIVRIPVQETDISKLVTYI EKSIEVDRLVSEAIKNGLPA KAAQTARRMVLKDYI I32-19B MSGMRVYLGADHAGYELKQA I32-19B: SEQ ID IIAFLKMTGHEPIDCGALRY 4, 5, 31, 33, 38, 41, 42, 43, 55, 56, 59, NO: 20 DADDDYPAFCIAAATRTVAD 61, 62, 83, 93, 94, 101, 104, 113, 119, PGSLGIVLGGSGNGEQIAAN 129, 131, 134, 136, 137, 139, 140, 143, KVPGARCALAWSVQTAALAR 144, 146, 147, 150, 152, 153, 156, 158, EHNNAQLIGIGGRMHTLEEA 159 LRIVKAFVTTPWSKAQRHQR RIDILAEYERTHEAPPVPGA PA I32-28A MGDDARIAAIGDVDELNSQI I32-28A: SEQ ID GVLLAEPLPDDVRAALSAIQ 4, 6, 7, 10, 14, 27, 30, 31, 33, 34, 41, NO: 21 HDLFDLGGELCIPGHAAITE 44, 45, 51, 52, 53, 54, 55, 56, 59, 76, DHLLRLALWLVHYNGQLPPL 78, 79, 80, 81, 82, 83, 90, 103, 111, EEFILPGGARGAALAHVCRT 115, 116, 131, 134, 142, 145, 147, 150 VCRRAERSIKALGASEPLNI APAAYVNLLSDLLFVLARVL NRAAGGADVLWDRTRAH I32-28B MILSAEQSFTLRHPHGQAAA I32-28B: SEQ ID LAFVREPAAALAGVQRLRGL 3, 4, 6, 8, 12, 15, 17, 18, 22, 26, 28, NO: 22 DSDGEQVWGELLVRVPLLGE 32, 38, 39, 41, 43, 45, 46, 48, 50, 60, VDLPFRSEIVRTPQGAELRP 66, 68, 71, 73, 74, 79, 81, 82, 83, 84, LTLTGERAWVAVSGQATAAE 86, 87, 95, 100, 103, 105, 109, 111, 113, GGEMAFAFQFQAHLATPEAE 151, 152, 155, 156, 157 GEGGAAFEVMVQAAAGVTLL LVAMALPQGLAAGLPPA I53-40A.1 MTKKVGIVDTTFARVDMASA I53-40A: SEQ ID AILTLKMESPNIKIIRKTVP 3, 4, 31, 33, 35, 36, 37, 51, 54, 55, 56, NO: 23 GIKDLPVACKKLLEEEGCDI 57, 59, 69, 70, 71, 74, 93, 103, 106, 118, VMALGMPGKKEKDKVCAHEA 127, 128, 131, 132, 133, 134, 135, 138, SLGLMLAQLMTNKHIIEVFV 139, 142, 150, 153 HEDEAKDDAELKILAARRAI EHALNVYYLLFKPEYLIRMA GKGLRQGFEDAGPARE I53-40B.1 MDDINNQLKRLKVIPVIAID I53-40B: SEQ ID NAEDIIPLGKVLAENGLPAA 2, 3, 7, 9, 10, 12, 20, 21, 23, 26, 27, NO: 24 EITFRSSAAVKAIMLLRSAQ 30, 34, 38, 45, 60, 62, 75, 85, 94, 95, PEMLIGAGTILNGVQALAAK 122, 124, 126, 134, 139, 143, 151, 153, EAGADFVVSPGFNPNTVRAC 161, 163, 166, 167, 170, 172, 180, 184, QIIGIDIVPGVNNPSTVEQA 185, 186, 189, 190, 192, 193, 194, 195, LEMGLTTLKFFPAEASGGIS 198, 201, 202, 205, 208, 209 MVKSLVGPYGDIRLMPTGGI TPDNIDNYLAIPQVLACGGT WMVDKKLVRNGEWDEIARLT REIVEQVNP I53-47A.1 MPIFTLNTNIKADDVPSDFL I53-47A: SEQ ID SLTSRLVGLILSKPGSYVAV 11, 13, 14, 17, 34, 36, 37, 45, 47, 54, NO: 25 HINTDQQLSFGGSTNPAAFG 55, 56, 65, 69, 70, 71, 74, 91, 92, 93, TLMSIGGIEPDKNRDHSAVL 101, 103, 105, 109, 110, 112, 114 FDHLNAMLGIPKNRMYIHFV NLNGDDVGWNGTTF I53- MPIFTLNTNIKADDVPSDFL I53-47A: 47A.1NegT SLTSRLVGLILSEPGSYVAV 11, 13, 14, 17, 34, 36, 37, 45, 47, 54, 2 HINIDQQLSFGGSTNPAAFG 55, 56, 65, 69, 70, 71, 74, 91, 92, 93, SEQ ID TLMSIGGIEPDKNEDHSAVL 101, 103, 105, 109, 110, 112, 114 NO: 26 FDHLNAMLGIPKNRMYIHFV DLDGDDVGWNGTTF I53-47B.1 MNQHSHKDHETVRIAVVRAR I53-47B: SEQ ID WHADIVDACVEAFEIAMAAI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, 24, NO: 27 GGDRFAVDVFDVPGAYEIPL 43, 44, 51, 63, 67, 70, 85, 87, 101, HARTLAETGRYGAVLGTAFV 105, 122, 123, 124, 125, 126, 147, 152, VNGGIYRHEFVASAVIDGMM 153, 154 NVQLDTGVPVLSAVLTPHRY RDSDEHHRFFAAHFAVKGVE AARACIEILNAREKIAA I53- MNQHSHKDHETVRIAVVRAR I53-47B: 47B.1NegT WHADIVDACVEAFEIAMAAI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, 24, 2 GGDRFAVDVFDVPGAYEIPL 43, 44, 51, 63, 67, 70, 85, 87, 101, SEQ ID HARTLAETGRYGAVLGTAFV 105, 122, 123, 124, 125, 126, 147, 152, NO: 28 VDGGIYDHEFVASAVIDGMM 153, 154 NVQLDTGVPVLSAVLTPHEY EDSDEDHEFFAAHFAVKGVE AARACIEILNAREKIAA I53-50A.1 MKMEELFKKHKIVAVLRANS I53-50A: SEQ ID VEEAIEKAVAVFAGGVHLIE 4, 5, 6, 8, 9, 11, 17, 19, 23, 37, 46, NO: 29 ITFTVPDADTVIKALSVLKE 47, 59, 74, 77, 78, 81, 94, 95, 98, 101, KGAIIGAGTVTSVEQCRKAV 102, 1013, 106, 119, 122, 126, 139, 142, ESGAEFIVSPHLDEEISQFC 145, 49, 150, 152, 160, 161, 162, 163, KEKGVFYMPGVMTPTELVKA 166, 169, 179, 183, 185, 188, 191, 192, MKLGHDILKLFPGEVVGPQF 194, 198, 199 VKAMKGPFPNVKFVPTGGVN LDNVCEWFKAGVLAVGVGDA LVKGDPDEVREKAKKFVEKI RGCTE I53- MKMEELFKKHKIVAVLRANS I53-50A: 50A.1NegT VEEAIEKAVAVFAGGVHLIE 4, 5, 6, 8, 9, 11, 17, 19, 23, 37, 46, 2 ITFTVPDADTVIKALSVLKE 47, 59, 74, 77, 78, 81, 94, 95, 98, 101, SEQ ID KGAIIGAGTVTSVEQCRKAV 102, 103, 106, 119, 122, 126, 139, 142, NO: 30 ESGAEFIVSPHLDEEISQFC 145, 149, 150, 152, 160, 161, 162, 163, KEKGVFYMPGVMTPTELVKA 166, 169, 179, 183, 185, 188, 191, 192, MKLGHDILKLFPGEVVGPEF 194, 198, 199 VEAMKGPFPNVKFVPTGGVD LDDVCEWFDAGVLAVGVGDA LVEGDPDEVREDAKEFVEEI RGCTE I53- MKMEELFKKHKIVAVLRANS I53-50A: 50A.lPosT VEEAIEKAVAVFAGGVHLIE 4, 5, 6, 8, 9, 11, 17, 19, 23, 37, 46, 1 ITFTVPDADTVIKALSVLKE 47, 59, 74, 77, 78, 81, 94, 95, 98, 101, SEQ ID KGAIIGAGTVTSVEQCRKAV 102, 103, 106, 119, 122, 126, 139, 142, NO: 31 ESGAEFIVSPHLDEEISQFC 145, 419, 150, 152, 160, 161, 162, 163, KEKGVFYMPGVMTPTELVKA 166, 169, 179, 183, 185, 188, 191, 192, MKLGHDILKLFPGEVVGPQF 194, 198, 199 VKAMKGPFPNVKFVPTGGVN LDNVCKWFKAGVLAVGVGKA LVKGKPDEVREKAKKFVKKI RGCTE I53-50B.1 MNQHSHKDHETVRIAVVRAR I53-50B: SEQ ID WHAEIVDACVSAFEAAMRDI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, 34, NO: 32 GGDRFAVDVFDVPGAYEIPL 38, 93, 40, 43, 44, 48, 51, 63, 67, 70, HARTLAETGRYGAVLGTAFV 87, 101, 105, 118, 143, 147, 152, 153, VNGGIYRHEFVASAVIDGMM 154 NVQLDTGVPVLSAVLTPHRY RDSDAHTLLFLALFAVKGME AARACVEILAAREKIAA I53- MNQHSHKDHETVRIAVVRAR I53-50B: 50B.1NegT WHAEIVDACVSAFEAAMRDI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, 34, 2 GGDRFAVDVFDVPGAYEIPL 38, 39, 40, 43, 44, 48, 51, 63, 67, 70, SEQ ID HARTLAETGRYGAVLGTAFV 87, 101, 105, 118, 143, 147, 152, 153, NO: 33 VDGGIYDHEFVASAVIDGMM 154 NVQLDTGVPVLSAVLTPHEY EDSDADTLLFLALFAVKGME AARACVEILAAREKIAA I53- MNQHSHKDHETVRIAVVRAR I53-50B: 50B.4PosT WHAEIVDACVSAFEAAMRDI 6, 7, 8, 9, 10, 11, 13, 18, 20, 21, 34, 1 GGDRFAVDVFDVPGAYEIPL 38, 39, 40, 43, 44, 48, 51, 63, 67, 70, SEQ ID HARTLAETGRYGAVLGTAFV 87, 101, 105, 118, 143, 147, 152, 153, NO: 34 VNGGIYRHEFVASAVINGMM 154 NVQLNTGVPVLSAVLTPHNY DKSKAHTLLFLALFAVKGME AARACVEILAAREKIAA - In certain instances, the polypeptides of the present invention can also tolerate non-conservative substitutions. The isolated polypeptides may be produced recombinantly or synthetically, using standard techniques in the art. The isolated polypeptides of the invention can be modified in a number of ways, including but not limited to the ways described above, either before or after assembly of the nanostructures of the invention. As used throughout the present application, the term “polypeptide” is used in its broadest sense to refer to a sequence of subunit amino acids. The polypeptides of the invention may comprise L-amino acids, D-amino acids (which are resistant to L-amino acid-specific proteases in vivo), or a combination of D- and L-amino acids.
- In another aspect, the invention provides nanostructures, comprising:
- (a) a plurality of first assemblies, each first assembly comprising a plurality of identical first polypeptides, wherein the first polypeptides comprise the polypeptide of any embodiment or combination of embodiments of the first aspect of the invention; and
- (b) a plurality of second assemblies, each second assembly comprising a plurality of identical second polypeptides, wherein the second polypeptides comprise the polypeptide of any embodiment or combination of embodiments of the first aspect of the invention, wherein the second polypeptide differs from the first polypeptide;
- wherein the plurality of first assemblies non-covalently interact with the plurality of second assemblies to form a nanostructure.
- As described in the examples that follow, a plurality (2, 3, 4, 5, 6, or more) of first polypeptides self-assemble to form a first assembly, and a plurality (2, 3, 4, 5, 6, or more) of second polypeptides self-assemble to form a second assembly. A plurality of these first and second assemblies then self-assemble non-covalently via the designed interfaces to produce the nanostructures of the invention. The designed interfaces on the polypeptides of the invention, resembling natural protein-protein interfaces with well-packed cores composed primarily of hydrophobic amino acid side chains surrounded by a periphery composed primarily of hydrophilic and charged side chains, rigidly orient the assemblies within the nanostructures formed by self-assembly. As will be understood by those of skill in the art, the interaction between the first assembly and the second assembly is a non-covalent protein-protein interaction. Any suitable non-covalent interaction(s) can drive self-interaction of the assemblies to form the nanostructure, including but not limited to one or more of electrostatic interactions, 7π-effects, van der Waals forces, hydrogen bonding, and hydrophobic effects. In various embodiments, pentamers, trimers, and dimers of the first or second assemblies assemble relative to each other such that their 5-fold, 3-fold, and 2-fold symmetry axes are aligned along icosahedral 5-fold, 3-fold, and 2-fold symmetry axes, respectively.
- In various other embodiments, the nanostructures are between about 20 nanometers (nm) to about 40 nm in diameter, with interior lumens between about 15 nm to about 32 nm across and pore sizes in the protein shells between about 1 nm to about 14 nm in their longest dimensions (
FIG. 2 ). The nanostructures of the invention can be used for any suitable purpose, including but not limited to delivery vehicles, as the nanostructures can encapsulate molecules of interest and/or the first and/or second proteins can be modified to bind to molecules of interest (diagnostics, therapeutics, detectable molecules for imaging and other applications, etc.). The nanostructures of the invention are well suited for several applications, including vaccine design, targeted delivery of therapeutics, and bioenergy. - In various embodiments of the nanostructure of the invention, the first polypeptides and the second polypeptides comprise polypeptides with the amino acid sequence selected from the following pairs, or modified versions thereof (i.e.: permissible modifications as disclosed for the polypeptides of the invention: isolated polypeptides comprising an amino acid sequence that is at least 75% identical over its length, and identical at least at one identified interface position, to the amino acid sequence indicated by the SEQ ID NO.):
- (i) SEQ ID NO:1 and SEQ ID NO:2 (I53-34A and I53-34B);
- (ii) SEQ ID NO:3 and SEQ ID NO:4 (I53-40A and I53-40B);
- (iii) SEQ ID NO:3 and SEQ ID NO:24 (I53-40A and I53-40B.1);
- (iv) SEQ ID NO:23 and SEQ ID NO:4 (I53-40A.1 and I53-40B);
- (v) SEQ ID NO:35 and SEQ ID NO:36 (I53-40A genus and I53-40B genus);
- (vi) SEQ ID NO:5 and SEQ ID NO:6 (I53-47A and I53-B);
- (vii) SEQ ID NO:5 and SEQ ID NO:27 (I53-47A and I53-47B.1);
- (viii) SEQ ID NO:5 and SEQ ID NO:28 (I53-47A and I53-47B.1NegT2);
- (ix) SEQ ID NO:25 and SEQ ID NO:6 (I53-47A.1 and I53-47B);
- (x) SEQ ID NO:25 and SEQ ID NO:27 (I53-47A.1 and I53-47B.1);
- (xi) SEQ ID NO:25 and SEQ ID NO:28 (I53-47A.1 and I53-47B.1NegT2);
- (xii) SEQ ID NO:26 and SEQ ID NO:6 (I53-47A.1NegT2 and I53-47B);
- (xiii) SEQ ID NO:26 and SEQ ID NO:27 (I53-47A.1NegT2 and I53-47B.1);
- (xiv) SEQ ID NO:26 and SEQ ID NO:28 (I53-47A.1NegT2 and I53-47B.1NegT2);
- (xv) SEQ ID NO:37 and SEQ ID NO:38 (I53-47A genus and I53-47B genus);
- (xvi) SEQ ID NO:7 and SEQ ID NO:8 (I53-50A and I53-50B);
- (xvii) SEQ ID NO:7 and SEQ ID NO:32 (I53-50A and I53-50B.1);
- (xix) SEQ ID NO:7 and SEQ ID NO:33 (I53-50A and I53-50B.1NegT2);
- (xx) SEQ ID NO:7 and SEQ ID NO:34 (I53-50A and I53-50B.4PosT1);
- (xxi) SEQ ID NO:29 and SEQ ID NO:8 (I53-50A.1 and I53-50B);
- (xxii) SEQ ID NO:29 and SEQ ID NO:32 (I53-50A.1 and I53-50B.1);
- (xxiii) SEQ ID NO:29 and SEQ ID NO:33 (I53-50A.1 and I53-50B.1NegT2);
- (xxiv) SEQ ID NO:29 and SEQ ID NO:34 (I53-50A.1 and I53-50B.4PosT1);
- (xxv) SEQ ID NO:30 and SEQ ID NO:8 (I53-50A.1NegT2 and I53-50B);
- (xxvi) SEQ ID NO:30 and SEQ ID NO:32 (I53-50A.1NegT2 and I53-50B.1);
- (xxvii) SEQ ID NO:30 and SEQ ID NO:33 (I53-50A.1NegT2 and I53-50B.1NegT2);
- (xxviii)SEQ ID NO:30 and SEQ ID NO:34 (I53-50A.1NegT2 and I53-50B.4PosT1);
- (xxix) SEQ ID NO:31 and SEQ ID NO:8 (I53-50A.1PosT1 and I53-50B);
- (xxx) SEQ ID NO:31 and SEQ ID NO:32 (I53-50A.1PosT1 and I53-50B.1);
- (xxxi) SEQ ID NO:31 and SEQ ID NO:33 (I53-50A.1PosT1 and I53-50B.1NegT2);
- (xxxii) SEQ ID NO:31 and SEQ ID NO:34 (I53-50A.1PosT1 and I53-50B.4PosT1);
- (xxxiii) SEQ ID NO:39 and SEQ ID NO:40 (I53-50A genus and I53-50B genus);
- (xxxiv) SEQ ID NO:9 and SEQ ID NO:10 (I53-51A and I53-51B);
- (xxxv) SEQ ID NO:11 and SEQ ID NO:12 (I52-03A and I52-03B);
- (xxxvi) SEQ ID NO:13 and SEQ ID NO:14 (I52-32A and I52-32B);
- (xxxv) SEQ ID NO:15 and SEQ ID NO:16 (I52-33A and I52-33B)
- (xxxvi) SEQ ID NO:17 and SEQ ID NO:18 (I32-06A and I32-06B);
- (xxxvii) SEQ ID NO:19 and SEQ ID NO:20 (I32-19A and I32-19B);
- (xxxviii) SEQ ID NO:21 and SEQ ID NO:22 (I32-28A and I32-28B); and
- (xxxix) SEQ ID NO:23 and SEQ ID NO:24 (I53-40A.1 and I53-40B.1).
- In one embodiment, the nanostructure has icosahedral symmetry. In this embodiment, the nanostructure may comprise 60 copies of the first polypeptide and 60 copies of the second polypeptide. In one such embodiment, the number of identical first polypeptides in each first assembly is different than the number of identical second polypeptides in each second assembly. For example, in one embodiment, the nanostructure comprises twelve first assemblies and twenty second assemblies; in this embodiment, each first assembly may, for example, comprise five copies of the identical first polypeptide, and each second assembly may, for example, comprise three copies of the identical second polypeptide. In another embodiment, the nanostructure comprises twelve first assemblies and thirty second assemblies; in this embodiment, each first assembly may, for example, comprise five copies of the identical first polypeptide, and each second assembly may, for example, comprise two copies of the identical second polypeptide. In a further embodiment, the nanostructure comprises twenty first assemblies and thirty second assemblies; in this embodiment, each first assembly may, for example, comprise three copies of the identical first polypeptide, and each second assembly may, for example, comprise two copies of the identical second polypeptide. All of these embodiments are capable of forming synthetic nanomaterials with regular icosahedral symmetry. In various further embodiments, oligomeric states of the first and second polypeptides are as follows:
- I53-34A: trimer+I53-34B: pentamer;
- I53-40A: pentamer+I53-40B: trimer;
- I53-47A: trimer+I53-47B: pentamer;
- I53-50A: trimer+I53-50B: pentamer;
- I53-51A: trimer+I53-51B: pentamer;
- I32-06A: dimer+I32-06B: trimer;
- I32-19A: trimer+I32-19B: dimer;
- I32-28A: trimer+I32-28B: dimer;
- I52-03A: pentamer+I52-03B: dimer;
- I52-32A: dimer+I52-32B: pentamer; and
- I52-33A: pentamer+I52-33B: dimer.
- As disclosed in the examples that follow, the nanostructures form spontaneously when appropriate polypeptide pairs are co-expressed in E. coli cells, yielding milligram quantities of purified material per liter of cell culture using standard methods of immobilized metal-affinity chromatography and gel filtration. When a poly-histidine purification tag is appended to just one of the two distinct polypeptide subunits (i.e.: the first and second polypeptides) comprising each nanostructure, the other subunit is found to co-purify with the tagged subunit.
- In one embodiment, the nanostructure further comprises a cargo within the nanostructure. As used herein, a “cargo” is any compound or material that can be incorporated on and/or within the nanostructure. For example, polypeptide pairs suitable for nanostructure self-assembly can be expressed/purified independently; they can then be mixed in vitro in the presence of a cargo of interest to produce the nanostructure comprising a cargo. This feature, combined with the protein nanostructures' large lumens and relatively small pore sizes, makes them well suited for the encapsulation of a broad range of cargo including, but not limited to, small molecules, nucleic acids, polymers, and other proteins. In turn, the protein nanostructures of the present invention could be used for many applications in medicine and biotechnology, including targeted drug delivery and vaccine design. For targeted drug delivery, targeting moieties could be fused or conjugated to the protein nanostructure exterior to mediate binding and entry into specific cell populations and drug molecules could be encapsulated in the cage interior for release upon entry to the target cell or sub-cellular compartment. For vaccine design, antigenic epitopes from pathogens could be fused or conjugated to the cage exterior to stimulate development of adaptive immune responses to the displayed epitopes, with adjuvants and other immunomodulatory compounds attached to the exterior and/or encapsulated in the cage interior to help tailor the type of immune response generated for each pathogen. The polypeptide components may be modified as noted above. In one non-limiting example, the polypeptides can be modified, such as by introduction of various cysteine residues at defined positions to facilitate linkage to one or more antigens of interest as cargo, and the nanostructure could act as a scaffold to provide a large number of antigens for delivery as a vaccine to generate an improved immune response. Other modifications of the polypeptides as discussed above may also be useful for incorporating cargo into the nanostructure.
- In certain embodiments, the nanostructures may comprise one or more peptides configured to bind or fuse with desired immunogens. In certain further embodiments, the nanostructure comprises one or more copies of variants designed to form a nanostructure of the trimeric proteins 1WOZ or 1WA3 (PDB ID codes), which have been demonstrated to be suitable for fusion with the trimeric HIV immunogen, BG505 SOSIP (4-6). Such nanostructures could be used as scaffolds for the design of an HIV vaccine capable of inducing protective immune responses against the virus. In another embodiment, the nanostructures of the present invention could be useful as scaffolds for the attachment of enzymes on the interior and/or exterior of the cages. Such enzymes confer on the nanostructure the ability to catalyze biochemical pathways or other reactions. Such patterning has been shown to be important in natural systems in order to increase local substrate concentrations, sequester toxic intermediates, and/or reduce the rates of undesirable side reactions (7, 8). In another embodiment, the cargo may comprise a detectable cargo. For example, the nanostructures of the present invention could also be useful as single-cell or single-molecule imaging agents. The materials are large enough to be identified in cells by electron microscopy, and when tagged with fluorophores they are readily detectable by light microscopy. This feature makes them well-suited to the task of correlating images of the same cells taken by light microscopy and electron microscopy.
- In another aspect, the present invention provides isolated nucleic acids encoding a protein of the present invention. The isolated nucleic acid sequence may comprise RNA or DNA. As used herein, “isolated nucleic acids” are those that have been removed from their normal surrounding nucleic acid sequences in the genome or in cDNA sequences. Such isolated nucleic acid sequences may comprise additional sequences useful for promoting expression and/or purification of the encoded protein, including but not limited to polyA sequences, modified Kozak sequences, and sequences encoding epitope tags, export signals, and secretory signals, nuclear localization signals, and plasma membrane localization signals. It will be apparent to those of skill in the art, based on the teachings herein, what nucleic acid sequences will encode the proteins of the invention.
- In a further aspect, the present invention provides recombinant expression vectors comprising the isolated nucleic acid of any embodiment or combination of embodiments of the invention operatively linked to a suitable control sequence. “Recombinant expression vector” includes vectors that operatively link a nucleic acid coding region or gene to any control sequences capable of effecting expression of the gene product. “Control sequences” operably linked to the nucleic acid sequences of the invention are nucleic acid sequences capable of effecting the expression of the nucleic acid molecules. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered “operably linked” to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, termination signals, and ribosome binding sites. Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors. The control sequence used to drive expression of the disclosed nucleic acid sequences in a mammalian system may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive). The construction of expression vectors for use in transfecting prokaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989; Gene Transfer and Expression Protocols, pp. 109-128, ed. E. J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, Tex.). The expression vector must be replicable in the host organisms either as an episome or by integration into host chromosomal DNA. In a preferred embodiment, the expression vector comprises a plasmid. However, the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors.
- In another aspect, the present invention provides host cells that have been transfected with the recombinant expression vectors disclosed herein, wherein the host cells can be either prokaryotic or eukaryotic. The cells can be transiently or stably transfected. Such transfection of expression vectors into prokaryotic and eukaryotic cells can be accomplished via any technique known in the art, including but not limited to standard bacterial transformations, calcium phosphate co-precipitation, electroporation, or liposome mediated-, DEAE dextran mediated-, polycationic mediated-, or viral mediated transfection. (See, for example, Molecular Cloning: A Laboratory Manual (Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press; Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed. (R. I. Freshney. 1987. Liss, Inc. New York, N.Y.). A method of producing a polypeptide according to the invention is an additional part of the invention. The method comprises the steps of (a) culturing a host according to this aspect of the invention under conditions conducive to the expression of the polypeptide, and (b) optionally, recovering the expressed polypeptide.
- In a further aspect, the present invention provides kits comprising:
- (a) one or more of the isolated polypeptides, polypeptide assemblies, or nanostructures of the invention;
- (b) one or more recombinant nucleic acids of the invention;
- (c) one or more recombinant expression vectors comprising recombinant nucleic acids of the invention; and/or
- (d) one or more recombinant host cell, comprising recombinant expression vectors of the invention.
- In yet a further aspect, the present invention provides methods of using the nanostructures of the present invention. In cases where both polypeptides comprising an assembly are capable of independent expression and purification, this enables control over assembly through mixing of purified components in vitro. This feature, combined with the nanostructures' large lumens and relatively small pore sizes, makes them well suited for the encapsulation of a broad range of other materials including small molecules, nucleic acids, polymers, and other proteins, as discussed above. In turn, the nanostructures of the present invention could be used for many applications in medicine and biotechnology, including targeted drug delivery and vaccine design. For targeted drug delivery, targeting moieties could be fused or conjugated to the nanostructure exterior to mediate binding and entry into specific cell populations and drug molecules could be encapsulated in the cage interior for release upon entry to the target cell or sub-cellular compartment. For vaccine design, antigenic epitopes from pathogens could be fused or conjugated to the nanostructure exterior to stimulate development of adaptive immune responses to the displayed epitopes, with adjuvants and other immunomodulatory compounds attached to the exterior and/or encapsulated in the cage interior to help tailor the type of immune response generated for each pathogen. Other uses will be clear to those of skill in the art based on the disclosure relating to polypeptide modifications, nanostructure design, and cargo incorporation.
- Methods of production: The icosahedral materials disclosed herein (amino acid sequences provided in Table 1), which comprise possible embodiments of the present invention, were produced as follows. The initial sequences and structures for the design process were derived from pentameric, trimeric, and dimeric crystal structures from the Protein Data Bank (PDB), along with a small number of crystal structures of de novo designed proteins not yet deposited in the PDB.
- The PDB Accession numbers for the wild type scaffold proteins related to the exemplary polypeptides of the invention are as follows:
- SEQ ID NO:1 (I53-34A): 2yw3;
- SEQ ID NO:2 (I53-34B): 2b98;
- SEQ ID NO:3 (I53-40A): 2b98;
- SEQ ID NO:4 (I53-40B): 4e38;
- SEQ ID NO:5 (I53-47A): 1hfo;
- SEQ ID NO:6 (I53-47B): 2obx;
- SEQ ID NO:7 (I53-50A): 1 wa3;
- SEQ ID NO:8 (I53-50B): 2obx;
- SEQ ID NO:9 (I53-51A): 1woz;
- SEQ ID NO:10 (I53-51B): 2obx;
- SEQ ID NO:11 (I52-03A): 1c41;
- SEQ ID NO:12 (I52-03B): 3bxo;
- SEQ ID NO:13 (I52-32A): 31fh;
- SEQ ID NO:14 (I52-32B): 2jfb;
- SEQ ID NO:15 (I52-33A): 2jfb;
- SEQ ID NO:16 (I52-33B): 3q34;
- SEQ ID NO:17 (I32-06A): 3e7d;
- SEQ ID NO:18 (I32-06B): 1mww;
- SEQ ID NO:19 (I32-19A): 2c5q;
- SEQ ID NO:20 (I32-19B): 2vvp;
- SEQ ID NO:21 (I32-28A): 2zhz; and
- SEQ ID NO:22 (I32-28B): 3nqn.
- 15,552 pairs of pentamers and trimers, 50,400 pairs of pentamers and dimers, and 344,825 pairs of trimers and dimers were arranged in icosahedral symmetry with the 5-fold symmetry axes of the pentamers, 3-fold symmetry axes of the trimers, and 2-fold symmetry axes of the dimers aligned along the 5-fold, 3-fold, and 2-fold icosahedral symmetry axes, respectively. While maintaining perfect icosahedral symmetry, rotations and translations along these axes were sampled to identify configurations predicted to be suitable for protein-protein interface design. In total, 68,983 153, 35,468 152, and 177,252 132 configurations were designed, yielding 71 pairs of 153 protein sequences, 44 pairs of I52 protein sequences, and 68 pairs of I32 protein sequences predicted to fold and assemble into the modeled icosahedral complexes.
- Genes encoding the 71 pairs of I53 sequences were synthesized and cloned into a variant of the pET29b expression vector (Novagen, Inc.) between the NdeI and XhoI endonuclease restriction sites. Genes encoding the 44 pairs of I52 sequences and 68 pairs of I32 sequences were synthesized and cloned into a variant of the pET28b expression vector (Novagen, Inc.) between the NcoI and XhoI endonuclease restriction sites.
- The two protein coding regions in each DNA construct are connected by an intergenic region. The intergenic region in the I53 designs was derived from the pETDuet-1 vector (Novagen, Inc.) and includes a stop codon, T7 promoter/lac operator, and ribosome binding site.
- The intergenic region in the I52 and I32 designs only includes a stop codon and ribosome binding site. The sequences of the 153, 152 and I32 intergenic regions are as follows:
- I53 intergenic region DNA sequence:
-
(SEQ ID NO: 41) 5′- TAATGCTTAAGTCGAACAGAAAGTAATC GTATTGTACACGGCCGCATAATCGAAAT TAATACGACTCACTATAGGGGAATTGTG AGCGGATAACAATTCCCCATCTTAGTAT ATTAGTTAAGTATAAGAAGGAGATATAC TT-3′ - I52 intergenic region DNA sequence:
-
(SEQ ID NO: 42) 5′-TAAAGAAGGAGATATCAT-3′ - I32 intergenic region DNA sequence:
-
(SEQ ID NO: 43) 5′-TGAGAAGGAGATATCAT-3′ - The constructs for the I53 protein pairs thus possess the following set of elements from 5′ to 3′: NdeI restriction site, upstream gene, intergenic region, downstream gene, XhoI restriction site. The constructs for the I52 and I32 protein pairs possess the following set of elements from 5′ to 3′: NcoI restriction site, upstream gene, intergenic region, downstream gene, XhoI restriction site. In each case, the upstream genes encode components denoted with the suffix “A”; the downstream genes encode the “B” components (Table 1). This allows for co-expression of the designed protein pairs in which both the upstream and downstream genes have their own ribosome binding site, and in the case of the I53 designs, both genes also have their own T7 promoter/lac operator.
- For purification purposes, each co-expression construct includes a 6x-histidine tag (HHHHHH) appended to the N- or C-terminus of one of the two protein coding regions.
- Expression plasmids were transformed into BL21(DE3) E. coli cells. Cells were grown in LB medium supplemented with 50 mg L−1 of kanamycin (Sigma) at 37° C. until an OD600 of 0.8 was reached. Protein expression was induced by addition of 0.5 mM isopropyl-thio-β-D-galactopyranoside (Sigma) and allowed to proceed for either 5 h at 22° C. or 3 h at 37° C. before cells were harvested by centrifugation.
- The designed proteins were first screened for soluble expression and co-purification at small scale from 2 to 4 mL cultures by nickel affinity chromatography using His MultiTrap® FF nickel-coated filter plates (GE Healthcare). Purification products were analyzed by SDS-PAGE to identify those containing species near the expected molecular weight of both protein subunits (indicating co-purification). Those found to contain both subunits were subsequently subjected to native (non-denaturing) PAGE to identify slow migrating species further indicating assembly to higher order materials. Those designs appearing to co-purify and yielding slowly migrating species by native PAGE were subsequently expressed at larger scale (1 to 12 liters of culture) and purified by nickel affinity chromatography via gravity columns with nickel-NTA resin (Qiagen) or HisTrap® HP columns (GE Healthcare). Fractions containing the designed proteins were pooled, concentrated using centrifugal filter devices (Sartorius Stedim Biotech), and further purified on a Superose® 6 10/300 gel filtration column (GE Healthcare).
- The purified proteins were analyzed by size exclusion chromatography using a Superose® 6 10/300 column to assess their assembly states. For each of the exemplary proteins described here, major peaks were observed in the chromatograms near elution volumes of 8.5 to 12 mL, which correspond well with the expected elution volumes for the designed 120-subunit icosahedral nanostructures. Within this set of exemplary proteins, the relative elution volumes correspond with the physical dimensions of the computational design models of the nanostructures, that is, proteins designed to assemble into relatively larger nanostructures yielded peaks at earlier elution volumes while those designed to assemble into relatively smaller nanostructures yielded peaks at later elution volumes. In some cases, smaller secondary peaks were observed at slightly earlier elution volumes than the predominant peak, suggesting transient or low-affinity dimerization of the nanostructures.
- Gel filtration fractions containing pure protein in the desired assembly state were analyzed by negative stain electron microscopy as described previously (2). Electron micrographs showing fields of particles of the expected size and shape have been obtained for 10 of the nanostructures. In one case (I32-19), the nanostructure appears to be unstable in the conditions encountered during grid preparation, precluding visualization by electron microscopy.
- To further validate the structures of our materials, small angle X-ray scattering (SAXS) data was obtained for several of the designed nanostructures. Scattering measurements were performed at the SIBYLS® 12.3.1 beamline at the Advanced Light Source, LBNL, on 20 microliter samples loaded into a helium-purged sample chamber (10). Data were collected on gel filtration fractions and samples concentrated ˜2x-10x from individual fractions, with the gel filtration buffer and concentrator eluates used for buffer subtraction. Sequential exposures ranging from 0.5 to 5 seconds were taken at 12 keV to maximize signal to noise, with visual checks for radiation-induced damage to the protein. The FOXS® algorithm (11, 12) was then used to calculate scattering profiles from our design models and fit them to the experimental data. The major features of the I53-34, I53-40, I53-47, I53-50, I52-03, I52-32, I52-33, I32-06, I32-19, and I32-28 design models were all found to match well with the experimental data, supporting the conclusion that the nanostructures assemble to the intended assembly state and three-dimensional configuration in solution. Graphs of the log of the scattering intensity, I(q), as a function of scattering angle, q, show multiple large dips in the scattering intensity in the low q region between 0.015 A−1 and 0.15 A−1, each of which is closely recapitulated in the theoretical profiles calculated from the design models. Although the I53-51 design model was not found to match well with the SAXS data, this appears likely to be due to low stability of the designed material, which caused it to be primarily unassembled at the concentrations used for the SAXS measurements; this result is consistent with our findings from gel filtration of I53-51, in which significant peaks were observed corresponding to the unassembled pentamers and trimers in addition to the presumed 120-subunit assembly peak.
- Using the Rosetta macromolecular modeling suite, the computational models of designed I53 materials were redesigned by allowing optimization of the identities of relatively exposed residues (defined as having a solvent accessible surface area of greater than 20 square Ångstroms), excepting polar residues (Aspartate, Glutamate, Histidine, Lysine, Asparagine, Glutamine, and Arginine) and residues near the designed protein-protein interfaces between the pentameric and trimeric components. Mutations that resulted in losses of significant atomic packing interactions or side chain-backbone hydrogen bonds were discarded. A position-specific scoring matrix (PSSM) based on homologous protein sequences was used to augment the Rosetta scorefunction to favor residues that appear frequently at a given position in homologous proteins, a design approach referred to as consensus protein design (9). Multiple design trajectories were performed with varying weights on the contribution of the PSSM, and mutations to polar residues that appeared favorable across all design trajectories were selected for inclusion in the variant protein. These variants were designated by the addition of “0.1” to the end of their names (e.g., I53-50A. 1).
- The Rosetta macromolecular modeling suite was used to mutate manually selected amino acid positions to charged amino acids in order to generate variant nanoparticles featuring highly positively or negatively charged interior surfaces. To generate negatively charged nanoparticles (denoted by the letters “Neg” in their names), mutations were limited to either Aspartate or Glutamate. To generate positively charged nanoparticles (denoted by the letters “Pos” in their names), mutations were limited to either Arginine or Lysine. Relevant score metrics for each mutation were independently assessed, and favorable mutations were sorted into two tiers based on their scores. Two new nanoparticle variants sequences were then designed for each individual protein for each type of charge, one including only the
Tier 1 mutations (named “T1”) and the other including both theTier 1 andTier 2 mutations (named “T2”). In most cases, the charged mutations were incorporated into the consensus redesign variants described above. - Genes encoding the I53 “0.1” and charged variant proteins were synthesized and cloned into the pET29b expression vector (Novagen, Inc.) between the NdeI and XhoI endonuclease restriction sites. Constructs were produced in two formats. In the first, the two proteins were encoded in a bicistronic arrangement on a single expression plasmid as described above for co-expression in E. coli. In the second, each protein component (i.e., the pentameric component and the trimeric component) were cloned individually into pET29b for expression in the absence of the other component.
- For purification purposes, each co-expression construct included a 6x-histidine tag (HHHHHH) appended to the N- or C-terminus of one of the two protein coding regions. Similarly, each individual expression construct included a 6x-histidine tag appended to the N- or C-terminus of the protein coding region.
- The “0.1” and charged variant proteins were expressed and purified as described above with two differences. First, expression at 18° C. was evaluated in addition to expression at 37° C. at small scale for all variants, and, in some cases, expression at 18° C. was used to produce the proteins at multi-liter scale. Second, for some variants, the detergent 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate (CHAPS) was included in all purification buffers at a concentration of 0.75% weight/volume to prevent protein aggregation.
- After purification of individually expressed protein components, pairs of components designed to co-assemble into a nanoparticle (e.g., I53-40.1A and I53-40.1B) were mixed in equimolar amounts in buffer and allowed to incubate at room temperature for 1-24 hours, a procedure we refer to as “in vitro assembly.” For assemblies including charged components, the buffer included 500 mM NaCl; in all other cases the buffer included 150 mM NaCl. The mixtures were fractionated and analyzed on a Superose® 6 10/300 gel filtration column (GE Healthcare), and fractions were analyzed by SDS-PAGE to determine the protein contents of each elution peak.
- In one exemplary embodiment, the I53-40.1A and I53-40.1B protein variants, based off of I53-40A and I53-40B, respectively, were constructed by consensus protein design, in which multiple sequence alignments from protein families related to each protein subunit were used to guide the selection of amino acid residues at surface-exposed positions. The variant proteins were found to be more stable and soluble when purified independently than the original proteins, a property that enabled the formation of the designed nanostructure by simply mixing solutions containing the purified components in physiological buffers in a 1:1 molar ratio. The addition of 0.75% CHAPS, a zwitterionic detergent, to the buffer was found to further increase the stability and solubility of I53-40.1A and was therefore included during the purification of the protein prior to in vitro assembly. Size exclusion chromatograms from a run analyzing the mixed solution containing both components on a Superose 6 column revealed a single major peak at the elution volume expected for the 120-subunit designed icosahedral nanostructure. Analysis of the peak fractions by SDS-PAGE revealed bands at the expected molecular weight for the first and second polypeptides of the nanostructure in an apparent 1:1 stoichiometric ratio. The data demonstrate that when mixed, the two components co-assemble to the 120-subunit designed icosahedral nanostructure.
- In another exemplary embodiment, the I53-47A.1, I53-47B.1, I53-50A.1, and I53-50B.1 protein variants, based off of I53-47A, I53-47B, I53-50A, and I53-50B, respectively, were constructed by consensus protein design, in which multiple sequence alignments from protein families related to each protein subunit were used to guide the selection of amino acid residues at surface-exposed positions. The variant proteins were found to be more stable and soluble when purified independently than the original proteins, a property that enabled the formation of the designed nanostructure by simply mixing solutions containing the purified components in physiological buffers in a 1:1 molar ratio, a process referred to as in vitro assembly. The addition of 0.75% CHAPS, a zwitterionic detergent, to the buffer was found to further increase the stability and solubility of I53-47B.1 and I53-50B.1 and was therefore included during the purification of the proteins prior to in vitro assembly. Size exclusion chromatograms from a run analyzing the mixed solution containing both I53-47A.1 and I53-47B.1 on a Superose 6 column revealed a major peak at the elution volume expected for the 120-subunit designed icosahedral nanostructure as well as a smaller secondary peak at a later elution volume. Analysis of the peak fractions corresponding to the 120-subunit nanostructure by SDS-PAGE revealed bands at the expected molecular weight for the first and second polypeptides of the nanostructure in an apparent 1:1 stoichiometric ratio. Analysis of the secondary peak at the later elution volume revealed that this peak comprises only the trimeric subunit, suggesting that the in vitro assembly mixture actually contained an excess of this polypeptide. Similarly, size exclusion chromatograms from a run analyzing the mixed solution containing both I53-50A.1 and I53-50B.1 on a Superose 6 column revealed a peak at the elution volume expected for the 120-subunit designed icosahedral nanostructure as well as two secondary peaks at later elution volumes. Analysis of the peak fractions corresponding to the 120-subunit nanostructure by SDS-PAGE revealed bands at the expected molecular weight for the first and second polypeptides of the nanostructure in an apparent 1:1 stoichiometric ratio. Analysis of the secondary peaks at the later elution volumes revealed that the first of the two comprises only the pentameric subunit, while the second of the two comprises only the trimeric subunit, suggesting that for this pair of proteins, in vitro assembly is somewhat inefficient. Together, the data demonstrate that when mixed, the two components of each nanostructure (i.e., I53-47A.1 and I53-47B.1 or I53-50A.1 and I53-50B.1) co-assemble to the 120-subunit designed icosahedral nanostructures.
- In another exemplary embodiment, the protein variants I53-47A.1NegT2, I53-47B.1NegT2, I53-50A.1NegT2, and I53-50B.1NegT2, based off of I53-47A.1, I53-47B.1, I53-50A.1, and I53-50B.1, respectively, bear mutations that introduce additional negatively charged amino acid residues (i.e., Aspartate and Glutamate) on their surfaces such that the nanostructures formed through the assembly of these proteins have highly charged interior surfaces. After the two independently purified proteins I53-47A.1NegT2 and I53-47B.1NegT2 were mixed together in an in vitro assembly reaction in a buffer with a concentration of 150 mM NaCl, no assembly was observed when the mixture was analyzed on a Superose 6 size exclusion chromatography column; only unassembled I53-47A.1NegT2 and I53-47B.1NegT2 proteins eluted from the column. In contrast, if the in vitro assembly reaction was performed in the presence of 0.5 M NaCl, robust assembly to the designed nanostructure was observed, with some remaining unassembled proteins eluting later as smaller secondary elution peaks. Similarly, after the two independently purified proteins I53-50A.1NegT2 and I53-50B.1NegT2 were mixed together in an in vitro assembly reaction in a buffer with a concentration of 150 mM NaCl, no assembly was observed when the mixture was analyzed on a Superose® 6 size exclusion chromatography column; only unassembled I53-50A.1NegT2 and I53-50B.1NegT2 proteins eluted from the column. In contrast, if the in vitro assembly reaction was performed in the presence of 0.5 M NaCl, assembly to the designed nanostructure was observed, with some remaining unassembled proteins eluting later. Together, the data demonstrate that when mixed, the two components of each highly charged 120-subunit designed icosahedral nanostructure assemble to the target structure only in the presence of high ionic strength.
- In order to package nucleic acids, pairs of individually purified protein components designed to co-assemble into a nanoparticle were combined with single-stranded DNA (ssDNA) in buffer and allowed to incubate overnight. ssDNA was present at a final concentration of 26 ng/μL (200 pM) for 400 nucleotide (nt) strands, and 35.2 ng/μL (66.7 pM) for 1600 nt strands. Individual protein components were added at final equimolar concentrations ranging from 2-12 and the final NaCl concentration was 150 mM. After overnight incubation, samples were either analyzed by electrophoresis on a 1% agarose gel or DNase I was added to a final concentration of 25 pg/mL and incubated for one hour at room temperature before electrophoresis. Gels were stained with SybrGold® (ThermoFisher Scientific) and imaged to visualize nucleic acid, and were subsequently stained with GelCode® Blue (Pierce) and imaged again to visualize protein.
- The above definitions and explanations are meant and intended to be controlling in any future construction unless clearly and unambiguously modified in the following examples or when application of the meaning renders any construction meaningless or essentially meaningless. In cases where the construction of the term would render it meaningless or essentially meaningless, the definition should be taken from Webster's Dictionary, 3rd Edition or a dictionary known to those of skill in the art, such as the Oxford Dictionary of Biochemistry and Molecular Biology (Ed. Anthony Smith, Oxford University Press, Oxford, 2004).
- The above description provides specific details for a thorough understanding of, and enabling description for, embodiments of the disclosure. However, one skilled in the art will understand that the disclosure may be practiced without these details. In other instances, well-known structures and functions have not been shown or described in detail to avoid unnecessarily obscuring the description of the embodiments of the disclosure. The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize.
- Aspects of the disclosure can be modified, if necessary, to employ the systems, functions and concepts of the above references and application to provide yet further embodiments of the disclosure. These and other changes can be made to the disclosure in light of the detailed description.
- Specific elements of any of the foregoing embodiments can be combined or substituted for elements in other embodiments. Furthermore, while advantages associated with certain embodiments of the disclosure have been described in the context of these embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the disclosure.
- The above detailed description describes various features and functions of the disclosed systems, devices, and methods with reference to the accompanying figures. In the figures, similar symbols typically identify similar components, unless context dictates otherwise. The illustrative embodiments described in the detailed description, figures, and claims are not meant to be limiting. Other embodiments can be utilized, and other changes can be made, without departing from the spirit or scope of the subject matter presented herein. It will be readily understood that the aspects of the present disclosure, as generally described herein, and illustrated in the figures, can be arranged, substituted, combined, separated, and designed in a wide variety of different configurations, all of which are explicitly contemplated herein.
- Numerous modifications and variations of the present disclosure are possible in light of the above teachings. Unless otherwise indicated, all numbers expressing quantities of ingredients, properties such as molecular weight, reaction conditions, and so forth used in the specification and claims are to be understood as being modified in all instances by the term “about.” Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the invention are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. Any numerical value, however, inherently contains certain errors necessarily resulting from the standard deviation found in their respective testing measurements. Groupings of alternative elements or embodiments of the invention disclosed herein are not to be construed as limitations. Each group member may be referred to and claimed individually or in any combination with other members of the group or other elements found herein. It is anticipated that one or more members of a group may be included in, or deleted from, a group for reasons of convenience and/or patentability. When any such inclusion or deletion occurs, the specification is deemed to contain the group as modified thus fulfilling the written description of all Markush groups used in the appended claims.
- Furthermore, numerous references have been made to patents and printed publications throughout this specification. Each of the above-cited references and printed publications are individually incorporated herein by reference in their entirety.
- It is to be understood that the embodiments of the invention disclosed herein are illustrative of the principles of the present invention. Other modifications that may be employed are within the scope of the invention. Thus, by way of example, but not of limitation, alternative configurations of the present invention may be utilized in accordance with the teachings herein.
- Accordingly, the present invention is not limited to that precisely as shown and described. The particulars shown herein are by way of example and for purposes of illustrative discussion of the preferred embodiments of the present invention only and are presented in the cause of providing what is believed to be the most useful and readily understood description of the principles and conceptual aspects of various embodiments of the invention. In this regard, no attempt is made to show structural details of the invention in more detail than is necessary for the fundamental understanding of the invention, the description taken with the drawings and/or examples making apparent to those skilled in the art how the several forms of the invention may be embodied in practice.
-
- 1. N. P. King et al., Computational design of self-assembling protein nanomaterials with atomic level accuracy. Science 336, 1171 (Jun. 1, 2012).
- 2. N. P. King et al., Accurate design of co-assembling multi-component protein nanomaterials. Nature 510, 103 (Jun. 5, 2014).
- 3. S. Raman et al., Design of Peptide Nanoparticles Using Simple Protein Oligomerization Domains. The
Open Nanomedicine Journal 2, 15 (2009). - 4. J. P. Julien et al., Crystal structure of a soluble cleaved HIV-1 envelope trimer. Science 342, 1477 (Dec. 20, 2013).
- 5. D. Lyumkis et al., Cryo-EM structure of a fully glycosylated soluble cleaved HIV-1 envelope trimer. Science 342, 1484 (Dec. 20, 2013).
- 6. M. Pancera et al., Structure and immune recognition of trimeric pre-fusion HIV-1 Env. Nature, (Oct. 8, 2014).
- 7. T. O. Yeates, C. S. Crowley, S. Tanaka, Bacterial microcompartment organelles: protein shell structure and evolution. Annu Rev Biophys 39, 185 (2010).
- 8. P. Kumar, M. Singh, S. Karthikeyan, Crystal structure analysis of icosahedral lumazine synthase from Salmonella typhimurium, an antibacterial drug target. Acta Crystallogr D Biol Crystallogr 67, 131 (February, 2011).
- 9. C. Jäckel, J. D. Bloom, P. Kast, F. H. Arnold, D. Hilvert. Consensus Protein Design without Phylogenetic Bias J. Mol. Biol., 399 (2010), pp. 541-546.
- 10. Hura, G. L. et al. Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS). Nat. Methods 6, 606-612 (2009).
- 11. Schneidman-Duhovny, D., Hammel, M., Tainer, J. A. & Sali, A. Accurate SAXS Profile Computation and its Assessment by Contrast Variation Experiments. Biophys. 1 105, 962-974 (2013).
- 12. Schneidman-Duhovny, D., Hammel, M. & Sali, A. FoXS: a web server for rapid computation and fitting of SAXS profiles. Nucleic Acids Res. 38, W540-W544 (2010).
Claims (41)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/934,343 US20230159595A1 (en) | 2014-11-03 | 2022-09-22 | Polypeptides for use in self-assembling protein nanostructures |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462074167P | 2014-11-03 | 2014-11-03 | |
US14/930,792 US9630994B2 (en) | 2014-11-03 | 2015-11-03 | Polypeptides for use in self-assembling protein nanostructures |
US15/490,351 US10351603B2 (en) | 2014-11-03 | 2017-04-18 | Polypeptides for use in self-assembling protein nanostructures |
US16/427,493 US11485759B2 (en) | 2014-11-03 | 2019-05-31 | Polypeptides for use in self-assembling protein nanostructures |
US17/934,343 US20230159595A1 (en) | 2014-11-03 | 2022-09-22 | Polypeptides for use in self-assembling protein nanostructures |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/427,493 Continuation US11485759B2 (en) | 2014-11-03 | 2019-05-31 | Polypeptides for use in self-assembling protein nanostructures |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230159595A1 true US20230159595A1 (en) | 2023-05-25 |
Family
ID=55851914
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/930,792 Active US9630994B2 (en) | 2014-11-03 | 2015-11-03 | Polypeptides for use in self-assembling protein nanostructures |
US15/490,351 Active 2036-07-21 US10351603B2 (en) | 2014-11-03 | 2017-04-18 | Polypeptides for use in self-assembling protein nanostructures |
US16/427,493 Active 2036-03-11 US11485759B2 (en) | 2014-11-03 | 2019-05-31 | Polypeptides for use in self-assembling protein nanostructures |
US17/934,343 Pending US20230159595A1 (en) | 2014-11-03 | 2022-09-22 | Polypeptides for use in self-assembling protein nanostructures |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/930,792 Active US9630994B2 (en) | 2014-11-03 | 2015-11-03 | Polypeptides for use in self-assembling protein nanostructures |
US15/490,351 Active 2036-07-21 US10351603B2 (en) | 2014-11-03 | 2017-04-18 | Polypeptides for use in self-assembling protein nanostructures |
US16/427,493 Active 2036-03-11 US11485759B2 (en) | 2014-11-03 | 2019-05-31 | Polypeptides for use in self-assembling protein nanostructures |
Country Status (1)
Country | Link |
---|---|
US (4) | US9630994B2 (en) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102159626B1 (en) | 2014-09-26 | 2020-09-25 | 베쓰 이스라엘 디코니스 메디칼 센터 인크 | Methods and Compositions for Inducing Protective Immunity Against Human Immunodeficiency Virus Infection |
US9630994B2 (en) | 2014-11-03 | 2017-04-25 | University Of Washington | Polypeptides for use in self-assembling protein nanostructures |
WO2016138525A1 (en) * | 2015-02-27 | 2016-09-01 | University Of Washington | Polypeptide assemblies and methods for the production thereof |
DK3390430T3 (en) | 2015-12-15 | 2019-11-18 | Janssen Vaccines & Prevention Bv | HUMAN IMMUNDEFECT VIRUS ANTIGENES, VECTORS, COMPOSITIONS AND METHODS OF USING IT |
US10188136B2 (en) | 2016-02-16 | 2019-01-29 | Indian Institute Of Science Education And Research | Hydrophobin mimics: process for preparation thereof |
AU2017286375B2 (en) | 2016-06-16 | 2019-04-18 | Janssen Vaccines & Prevention B.V. | HIV vaccine formulation |
AU2017318689A1 (en) | 2016-09-02 | 2019-04-11 | Beth Israel Deaconess Medical Center, Inc. | Methods for inducing an immune response against human immunodeficiency virus infection in subjects undergoing antiretroviral treatment |
JP6957039B2 (en) * | 2016-09-06 | 2021-11-02 | 学校法人慶應義塾 | Fusion proteins, structures, collectors, methods of collection, DNA, and vectors |
IL265186B (en) | 2016-09-15 | 2022-09-01 | Janssen Vaccines Prevention B V | Trimer stabilizing hiv envelope protein mutations |
WO2018170362A2 (en) * | 2017-03-16 | 2018-09-20 | The Penn State Research Foundation | Versatile display scaffolds for proteins |
RU2019130167A (en) | 2017-04-04 | 2021-05-05 | Юниверсити Оф Вашингтон | SELF-ASSEMBLY PROTEIN NANOSTRUCTURES EXPOSING F-PROTEINS OF PARAMIXOVIRUS AND / OR PNEUMOVIRUS AND THEIR APPLICATION |
CN110958887B (en) | 2017-06-15 | 2023-10-31 | 扬森疫苗与预防公司 | Poxvirus vectors encoding HIV antigens and methods of use thereof |
CN110891601A (en) | 2017-07-19 | 2020-03-17 | 扬森疫苗与预防公司 | Trimer-stable HIV envelope protein mutations |
US20210380641A1 (en) * | 2017-11-09 | 2021-12-09 | University Of Washington | Self-assembling protein structures and components thereof |
CN112088014A (en) * | 2018-02-28 | 2020-12-15 | 华盛顿大学 | Self-assembling nanostructured vaccines |
JP2021532803A (en) * | 2018-08-07 | 2021-12-02 | イーティーエイチ・チューリッヒ | Polypeptides that self-assemble into nanoparticles |
JP2021533772A (en) | 2018-08-13 | 2021-12-09 | ヤンセン ファッシンズ アンド プリベンション ベーフェーJanssen Vaccines & Prevention B.V. | Stabilized trimer of filovirus glycoprotein |
CN113423718A (en) | 2019-02-08 | 2021-09-21 | 美国政府(由卫生和人类服务部的部长所代表) | Nanoparticle-based influenza virus vaccines and uses thereof |
JP2023513592A (en) * | 2020-02-14 | 2023-03-31 | ユニバーシティ オブ ワシントン | Polypeptides and their uses (cross-reference to related applications) |
US20240000919A1 (en) | 2020-11-12 | 2024-01-04 | Xiamen University | Modified envelope protein of human immunodeficiency virus and use thereof |
WO2022103967A1 (en) | 2020-11-13 | 2022-05-19 | Icosavax, Inc. | Protein-based nanoparticle vaccine for metapneumovirus |
JP2024509769A (en) | 2021-02-23 | 2024-03-05 | ヤンセン ファッシンズ アンド プリベンション ベーフェー | Trimer-stabilized HIV envelope protein mutations |
KR102527221B1 (en) * | 2021-10-15 | 2023-05-02 | 에스케이바이오사이언스 주식회사 | A method for purification of target protein |
KR102524839B1 (en) | 2021-11-04 | 2023-04-25 | 에스케이바이오사이언스(주) | Vaccine composition for preventing or treating infection of SARS-CoV-2 |
KR20230100689A (en) | 2021-12-27 | 2023-07-05 | 에스케이바이오사이언스(주) | SARS-CoV-2 vaccine booster composition |
WO2023156505A1 (en) | 2022-02-17 | 2023-08-24 | Janssen Vaccines & Prevention B.V. | Trimer stabilizing hiv envelope protein mutations r304v, n302m and t320l |
WO2023225562A1 (en) | 2022-05-17 | 2023-11-23 | Icosavax, Inc. | Multivalent vaccine for paramyxoviruses and uses thereof |
WO2024014943A1 (en) | 2022-07-12 | 2024-01-18 | 에스케이바이오사이언스 주식회사 | Sars-cov-2 vaccine booster composition |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9630994B2 (en) * | 2014-11-03 | 2017-04-25 | University Of Washington | Polypeptides for use in self-assembling protein nanostructures |
US11192926B2 (en) * | 2017-04-04 | 2021-12-07 | University Of Washington | Self-assembling protein nanostructures displaying paramyxovirus and/or pneumovirus F proteins and their use |
US11771755B2 (en) * | 2018-02-28 | 2023-10-03 | University Of Washington | Self-asssembling nanostructure vaccines |
Family Cites Families (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE4238904A1 (en) * | 1992-11-19 | 1994-05-26 | Basf Ag | Riboflavin synthesis in yeast |
WO1999045041A1 (en) * | 1998-03-04 | 1999-09-10 | Exxon Chemical Patents Inc. | High temperature olefin polymerization process |
US6303767B1 (en) * | 1998-11-05 | 2001-10-16 | Kosan Biosciences, Inc. | Nucleic acids encoding narbonolide polyketide synthase enzymes from streptomyces narbonensis |
JP2000152791A (en) * | 1998-11-11 | 2000-06-06 | E I Du Pont De Nemours & Co | Lumazine synthase and riboflavin synthase |
US20040005672A1 (en) * | 2002-02-22 | 2004-01-08 | Santi Daniel V. | Heterologous production of polyketides |
AU2003291619A1 (en) * | 2002-07-31 | 2004-04-30 | Kosan Biosciences, Inc. | Production of glycosylated macrolides in e. coli |
JP2008500364A (en) | 2004-05-25 | 2008-01-10 | キメラコア, インコーポレイテッド | Self-assembling nanoparticle drug delivery system |
WO2006034292A2 (en) | 2004-09-21 | 2006-03-30 | Medimmune, Inc. | Antibodies against and methods for producing vaccines for respiratory syncytial virus |
EP4206231A1 (en) | 2007-12-24 | 2023-07-05 | ID Biomedical Corporation of Quebec | Recombinant rsv antigens |
US9102526B2 (en) * | 2008-08-12 | 2015-08-11 | Imiplex Llc | Node polypeptides for nanostructure assembly |
WO2010019725A2 (en) | 2008-08-12 | 2010-02-18 | Salemme Francis R | Node polypeptides for nanostructure assembly |
GB0817507D0 (en) | 2008-09-24 | 2008-10-29 | Ucl Business Plc | Protein cages |
MX2011006205A (en) | 2008-12-09 | 2011-09-01 | Novavax Inc | Modified rsv f proteins and methods of their use. |
US20110200560A1 (en) | 2009-05-26 | 2011-08-18 | Massachusetts Institute Of Technology | Non-ionic self-assembling peptides and uses thereof |
WO2011017835A1 (en) | 2009-08-11 | 2011-02-17 | Nanjing University | Preparation method of protein or peptide nanoparticles for in vivo drug delivery by unfolding and refolding |
CA2771289A1 (en) | 2009-08-14 | 2011-02-17 | Allergan, Inc. | Methods of treating cancer using galanin retargeted endpeptidases |
US9487593B2 (en) | 2010-07-07 | 2016-11-08 | Artificial Cell Technologies, Inc | Respiratory syncytial virus antigenic compositions and methods |
PL2707385T3 (en) | 2011-05-13 | 2018-03-30 | Glaxosmithkline Biologicals Sa | Pre-fusion rsv f antigens |
US9441019B2 (en) | 2011-09-23 | 2016-09-13 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Influenza hemagglutinin protein-based vaccines |
SG10201602434UA (en) | 2011-09-30 | 2016-05-30 | Novavax Inc | Recombinant nanoparticle rsv f vaccine for respiratory syncytial virus |
EP2766030A4 (en) | 2011-10-12 | 2015-09-16 | Univ Washington | Engineered outer domain (eod) of hv gp120 and mutants thereof |
AU2013245950B2 (en) | 2012-04-10 | 2016-04-21 | The Trustees Of The University Of Pennsylvania | Human respiratory syncytial virus consensus antigens, nucleic acid constructs and vaccines made therefrom, and methods of using same |
US8969521B2 (en) | 2012-04-11 | 2015-03-03 | University Of Washington Through Its Center For Commercialization | General method for designing self-assembling protein nanomaterials |
WO2014005643A1 (en) | 2012-07-05 | 2014-01-09 | Okairos Ag | Novel prime-boosting regimens involving immunogenic polypeptides encoded by polynucleotides |
JP6523955B2 (en) | 2012-08-01 | 2019-06-05 | バヴァリアン・ノルディック・アクティーゼルスカブ | Recombinant modified vaccinia virus Ankara (MVA) RS virus (RSV) vaccine |
WO2014124301A1 (en) | 2013-02-07 | 2014-08-14 | University Of Washington Through Its Center For Commercialization | Self-assembling protein nanostructures |
US9738689B2 (en) | 2013-03-13 | 2017-08-22 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Prefusion RSV F proteins and their use |
WO2014160463A1 (en) | 2013-03-13 | 2014-10-02 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Prefusion rsv f proteins and their use |
JP6462599B2 (en) | 2013-03-15 | 2019-01-30 | シャアメン ユニバーシティ | Epitope of RSV fusion protein and antibody recognizing the epitope |
CA2925201A1 (en) | 2013-09-24 | 2015-04-02 | Massachusetts Institute Of Technology | Self-assembled nanoparticle vaccines |
EP2974739A1 (en) | 2014-07-15 | 2016-01-20 | Novartis AG | RSVF trimerization domains |
CN107427571A (en) | 2014-12-31 | 2017-12-01 | 美利坚合众国,由健康及人类服务部部长代表 | Novel multivalent vaccine based on nano particle |
WO2016138525A1 (en) | 2015-02-27 | 2016-09-01 | University Of Washington | Polypeptide assemblies and methods for the production thereof |
US10744193B2 (en) | 2015-03-30 | 2020-08-18 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Immunogenic RSV polypeptides |
KR102638978B1 (en) | 2015-07-07 | 2024-02-22 | 얀센 백신스 앤드 프리벤션 비.브이. | Vaccine against RSV |
US10457708B2 (en) | 2015-07-07 | 2019-10-29 | Janssen Vaccines & Prevention B.V. | Stabilized soluble pre-fusion RSV F polypeptides |
WO2017040387A2 (en) | 2015-08-31 | 2017-03-09 | Technovax, Inc. | Human respiratory syncytial virus (hrsv) virus-like particles (vlps) based vaccine |
CN114796474A (en) | 2015-09-03 | 2022-07-29 | 诺瓦瓦克斯股份有限公司 | Vaccine compositions with improved stability and immunogenicity |
EP3368547A4 (en) | 2015-10-29 | 2019-06-05 | Emory University | Chimeric rsv, immunogenic compositions, and methods of use |
RU2723039C2 (en) | 2015-12-23 | 2020-06-08 | Пфайзер Инк. | Protein f rsv mutants |
AU2017242020B2 (en) | 2016-03-29 | 2021-08-05 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Substitutions-modified prefusion RSV F proteins and their use |
KR102506895B1 (en) | 2016-04-05 | 2023-03-08 | 얀센 백신스 앤드 프리벤션 비.브이. | Stabilized soluble pre-fusion RSV F protein |
WO2017207477A1 (en) | 2016-05-30 | 2017-12-07 | Janssen Vaccines & Prevention B.V. | Stabilized pre-fusion rsv f proteins |
KR102421049B1 (en) | 2016-05-30 | 2022-07-15 | 얀센 백신스 앤드 프리벤션 비.브이. | Stabilized pre-fusion RSV F protein |
US10961283B2 (en) | 2016-06-27 | 2021-03-30 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Self-assembling insect ferritin nanoparticles for display of co-assembled trimeric antigens |
MX2019006349A (en) | 2016-12-16 | 2019-08-22 | Inst Res Biomedicine | Novel recombinant prefusion rsv f proteins and uses thereof. |
-
2015
- 2015-11-03 US US14/930,792 patent/US9630994B2/en active Active
-
2017
- 2017-04-18 US US15/490,351 patent/US10351603B2/en active Active
-
2019
- 2019-05-31 US US16/427,493 patent/US11485759B2/en active Active
-
2022
- 2022-09-22 US US17/934,343 patent/US20230159595A1/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9630994B2 (en) * | 2014-11-03 | 2017-04-25 | University Of Washington | Polypeptides for use in self-assembling protein nanostructures |
US11485759B2 (en) * | 2014-11-03 | 2022-11-01 | University Of Washington | Polypeptides for use in self-assembling protein nanostructures |
US11192926B2 (en) * | 2017-04-04 | 2021-12-07 | University Of Washington | Self-assembling protein nanostructures displaying paramyxovirus and/or pneumovirus F proteins and their use |
US11732011B2 (en) * | 2017-04-04 | 2023-08-22 | University Of Washington | Self-assembling protein nanostructures displaying paramyxovirus and/or pneumovirus F proteins and their use |
US11771755B2 (en) * | 2018-02-28 | 2023-10-03 | University Of Washington | Self-asssembling nanostructure vaccines |
Also Published As
Publication number | Publication date |
---|---|
US20160122392A1 (en) | 2016-05-05 |
US10351603B2 (en) | 2019-07-16 |
US20190284242A1 (en) | 2019-09-19 |
US9630994B2 (en) | 2017-04-25 |
US20170218029A1 (en) | 2017-08-03 |
US11485759B2 (en) | 2022-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230159595A1 (en) | Polypeptides for use in self-assembling protein nanostructures | |
Stuible et al. | Rapid, high-yield production of full-length SARS-CoV-2 spike ectodomain by transient gene expression in CHO cells | |
US20230313167A1 (en) | Polypeptide Assemblies and Methods for the Production Thereof | |
US11866699B2 (en) | Genome editing reagents and their use | |
Kim et al. | Chaperna-mediated assembly of ferritin-based Middle East respiratory syndrome-coronavirus nanoparticles | |
US20190341124A1 (en) | Self-Assembling Protein Nanostructures | |
BR112021009231A2 (en) | modified extracellular vesicles and their uses | |
EP3757218A1 (en) | Fusion protein | |
CA3019594A1 (en) | Polypeptides capable of forming homo-oligomers with modular hydrogen bond network-mediated specificity and their design | |
CN105821078A (en) | MGMT-Based method for obtaining high yield of recombinant protein expression | |
US8969521B2 (en) | General method for designing self-assembling protein nanomaterials | |
US8859237B2 (en) | Diguanylate cyclase method of producing the same and its use in the manufacture of cyclic-di-GMP and analogues thereof | |
CN105246503A (en) | Novel binding proteins for pcsk9 | |
Rubinstein et al. | ATP synthase from Saccharomyces cerevisiae: location of subunit h in the peripheral stalk region | |
CN114560915A (en) | Modified high-titer SARS-CoV-2 pseudovirus | |
Bignon et al. | Halorubrum pleomorphic virus-6 membrane fusion is triggered by an S-layer component of its haloarchaeal host | |
CN115850507B (en) | Orthogonal and bi-component protein self-assembly coupling system and construction method thereof | |
Dowling et al. | Hierarchical design of pseudosymmetric protein nanoparticles | |
US10913777B2 (en) | In vitro assembly of bacterial microcompartments | |
WO2019094669A2 (en) | Self-assembling protein structures and components thereof | |
US20220162265A1 (en) | Self-assembling 2d arrays with de novo protein building blocks | |
US20220226485A1 (en) | De novo design of phosphorylation inducible protein switches (phospho-switches) | |
Goldbaum et al. | Crystallization and Preliminary X-Ray Diffraction Analysis of the Lumazine Synthase fromBrucella abortus | |
US20130023643A1 (en) | Nuclear localization signal peptides derived from vp2 protein of chicken anemia virus and uses of said peptides | |
TWI776048B (en) | Recombinant protein for preventing swine fever virus infection and composition and cell comprising the same. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: UNIVERSITY OF WASHINGTON, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:HOWARD HUGHES MEDICAL INSTITUTE;REEL/FRAME:061192/0923 Effective date: 20220418 Owner name: HOWARD HUGHES MEDICAL INSTITUTE, MARYLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BAKER, DAVID;REEL/FRAME:061192/0841 Effective date: 20150917 Owner name: UNIVERSITY OF WASHINGTON, WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BALE, JACOB B.;KING, NEIL P.;SHEFFLER, WILLIAM H.;AND OTHERS;SIGNING DATES FROM 20151118 TO 20160203;REEL/FRAME:061192/0756 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |