WO2024059686A2 - Squelettes d'arpine et échafaudages d'imagerie par microscopie électronique rigidifiée - Google Patents
Squelettes d'arpine et échafaudages d'imagerie par microscopie électronique rigidifiée Download PDFInfo
- Publication number
- WO2024059686A2 WO2024059686A2 PCT/US2023/074156 US2023074156W WO2024059686A2 WO 2024059686 A2 WO2024059686 A2 WO 2024059686A2 US 2023074156 W US2023074156 W US 2023074156W WO 2024059686 A2 WO2024059686 A2 WO 2024059686A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- protein
- subunit
- amino acid
- darpin
- Prior art date
Links
- 238000001493 electron microscopy Methods 0.000 title claims description 8
- 238000003384 imaging method Methods 0.000 title description 21
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 194
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 194
- 108700022150 Designed Ankyrin Repeat Proteins Proteins 0.000 claims abstract description 97
- 238000000034 method Methods 0.000 claims abstract description 19
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 46
- 239000003446 ligand Substances 0.000 claims description 43
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 39
- 108020001507 fusion proteins Proteins 0.000 claims description 31
- 102000037865 fusion proteins Human genes 0.000 claims description 31
- 125000000539 amino acid group Chemical group 0.000 claims description 26
- 239000002245 particle Substances 0.000 claims description 19
- 238000000635 electron micrograph Methods 0.000 claims description 10
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 9
- 230000027455 binding Effects 0.000 claims description 7
- 229940079593 drug Drugs 0.000 claims description 7
- 239000003814 drug Substances 0.000 claims description 7
- 239000007787 solid Substances 0.000 claims description 4
- 238000006467 substitution reaction Methods 0.000 claims description 4
- 238000007792 addition Methods 0.000 claims description 3
- 238000012217 deletion Methods 0.000 claims description 3
- 230000037430 deletion Effects 0.000 claims description 3
- 239000000126 substance Substances 0.000 claims description 3
- 239000011324 bead Substances 0.000 claims description 2
- 239000000758 substrate Substances 0.000 claims description 2
- 101150105104 Kras gene Proteins 0.000 description 18
- 150000001413 amino acids Chemical class 0.000 description 12
- 239000000203 mixture Substances 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 11
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 102000004196 processed proteins & peptides Human genes 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 239000000523 sample Substances 0.000 description 4
- 239000007983 Tris buffer Substances 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 102200006657 rs104894228 Human genes 0.000 description 3
- 102200006655 rs104894230 Human genes 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- NXQKSXLFSAEQCZ-SFHVURJKSA-N sotorasib Chemical compound FC1=CC2=C(N(C(N=C2N2[C@H](CN(CC2)C(C=C)=O)C)=O)C=2C(=NC=CC=2C)C(C)C)N=C1C1=C(C=CC=C1O)F NXQKSXLFSAEQCZ-SFHVURJKSA-N 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- RTAQQCXQSZGOHL-UHFFFAOYSA-N Titanium Chemical compound [Ti] RTAQQCXQSZGOHL-UHFFFAOYSA-N 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000002439 negative-stain electron microscopy Methods 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 102200006614 rs104894229 Human genes 0.000 description 2
- 238000001542 size-exclusion chromatography Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 101100180402 Caenorhabditis elegans jun-1 gene Proteins 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 102000011727 Caspases Human genes 0.000 description 1
- 108010076667 Caspases Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 101100378121 Drosophila melanogaster nAChRalpha1 gene Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- OTMSDBZUPAUEDD-UHFFFAOYSA-N Ethane Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 101000582320 Homo sapiens Neurogenic differentiation factor 6 Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102100030589 Neurogenic differentiation factor 6 Human genes 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108010034546 Serratia marcescens nuclease Proteins 0.000 description 1
- 108010003723 Single-Domain Antibodies Proteins 0.000 description 1
- COQLPRJCUIATTQ-UHFFFAOYSA-N Uranyl acetate Chemical compound O.O.O=[U]=O.CC(O)=O.CC(O)=O COQLPRJCUIATTQ-UHFFFAOYSA-N 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- SYELZBGXAIXKHU-UHFFFAOYSA-N dodecyldimethylamine N-oxide Chemical compound CCCCCCCCCCCC[N+](C)(C)[O-] SYELZBGXAIXKHU-UHFFFAOYSA-N 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000001000 micrograph Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000000116 mitigating effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 102000027450 oncoproteins Human genes 0.000 description 1
- 108091008819 oncoproteins Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 229940073531 sotorasib Drugs 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 108091005946 superfolder green fluorescent proteins Proteins 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000003325 tomography Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000004017 vitrification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N23/00—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00
- G01N23/22—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by measuring secondary emission from the material
- G01N23/225—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by measuring secondary emission from the material using electron or ion
- G01N23/2251—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by measuring secondary emission from the material using electron or ion using incident electron beams, e.g. scanning electron microscopy [SEM]
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N23/00—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00
- G01N23/22—Investigating or analysing materials by the use of wave or particle radiation, e.g. X-rays or neutrons, not covered by groups G01N3/00 – G01N17/00, G01N21/00 or G01N22/00 by measuring secondary emission from the material
- G01N23/2202—Preparing specimens therefor
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/35—Fusion polypeptide containing a fusion for enhanced stability/folding during expression, e.g. fusions with chaperones or thioredoxin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/73—Fusion polypeptide containing domain for protein-protein interaction containing coiled-coiled motif (leucine zippers)
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N1/00—Sampling; Preparing specimens for investigation
- G01N1/28—Preparing specimens for investigation including physical details of (bio-)chemical methods covered elsewhere, e.g. G01N33/50, C12Q
- G01N1/42—Low-temperature sample treatment, e.g. cryofixation
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2223/00—Investigating materials by wave or particle radiation
- G01N2223/60—Specific applications or type of materials
- G01N2223/612—Specific applications or type of materials biological material
Definitions
- Cryo-electron microscopy is a rapidly expanding method for determining the atomic structures of large molecular assemblies. It is, however, not suitable for determining the structures of small-to-medium sized molecules. A size of about 40-50 kDa. For comparison, the average eukaryotic protein chain is about 35 kDa in mass, while bacterial proteins are generally smaller.
- Nanobodies, antibody fragments, and DARPins have been employed as scaffolds with limited resolution, e.g., 3.8 ⁇ for a 27 kDa cargo protein; 2.49 ⁇ for a cargo protein of about 250 kDa; 3.03 ⁇ for a 57 kDa cargo protein; about 2.7 ⁇ for a cargo protein of about 52 kDa.
- the finest resolution achieved thus far in the prior art is about 3.5 ⁇ for a 22 kDa cargo protein.
- the invention is directed to a protein comprising (1) X1-X2-X3-X4-V-X5-X6-L-L-A-X7-G-A-D-V-N (SEQ ID NO: 1) wherein X1 is Y, Q, I or D, preferably Y, Q, or I, more preferably, Y or Q, most preferably Y; X2 is D or K, preferably D; X3 is D or Y, preferably D; X4 is Q, E, or A, preferably Q or E, more preferably Q; X5 is A or D, preferably A; X6 is A or W, preferably A; X7 is K or L, preferably K; and wherein when X5 is D then X6 is A, and when X6 is W
- the invention is directed to a protein comprising or consisting of K-K-L-L-E-A-X-X-X-X-X-X1-X2-X3-X4-V-X5-X6-L-L-A-X7-G-A-D-V-N- X-X-X-X-X-X-P-L-X-L-A-X-X-X-X-H-X8-E-I-V-X9-V-L-L-X10-R-G-X11-D- X12-X13-X-X-X-X-X-X-P-L-H-L-A-X-X-X-X-X-H-L-E-I-X14-E-X15-L-L-X16- X17-G-A-D-V-N (SEQ ID NO: 4) wherein X1 is Y, Q, I or D, preferably Y, Q,
- the protein comprises or consists of a sequence selected from the group consisting of SEQ ID NO: 51 to SEQ ID NO: 71. In some embodiments, the protein comprises or consists of a sequence selected from the group consisting of SEQ ID NO: 5 to SEQ ID NO: 17. [0017] In some embodiments, the invention is directed to a fusion protein comprising or consisting of the protein as described above, e.g., having SEQ ID NO: 4, linked to a subunit protein of a self-assembling protein cage.
- the fusion protein comprises a protein having SEQ ID NO: 4 fused to a subunit protein of a self- assembling protein cage, said subunit protein comprises or consists of a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18.
- the protein is fused to the subunit protein via a peptide linker.
- the peptide linker is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 amino acid residues long.
- the peptide linker is not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long.
- the peptide linker (a) is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 amino acid residues long, and (b) not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long.
- the peptide linker joins the N-terminus of the protein to the C-terminus of the subunit protein.
- the peptide linker forms an alpha-helical structure.
- the peptide linker has an amino acid sequence comprising KEELD (SEQ ID NO: 37), PAPAP (SEQ ID NO: 38), KEELD (SEQ ID NO: 39), KDELD (SEQ ID NO: 40), RDERN (SEQ ID NO: 41), an (E(A)3K)n motif, an (E4(R/K)4)n motif, or an (AP)n motif.
- the subunit protein comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a subunit sequence selected from the group consisting of SEQ ID NO: 18 to SEQ ID NO: 36.
- the subunit protein comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18.
- the fusion protein comprises or consists of a sequence selected from the group consisting of SEQ NO: 42 to SEQ ID NO: 50.
- the invention is directed to a protein particle comprising multiple copies of a fusion protein as described herein, e.g., a fusion protein comprising or consisting of the protein as described above, e.g., having SEQ ID NO: 4, linked to a subunit protein of a self-assembling protein cage.
- the fusion protein comprises a protein having SEQ ID NO: 4 fused to a subunit protein of a self- assembling protein cage, said subunit protein comprises or consists of a sequence having at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18.
- the protein is fused to the subunit protein via a peptide linker.
- the peptide linker (a) is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 amino acid residues long.
- the peptide linker is not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long. In some embodiments, the peptide linker joins the N-terminus of the protein to the C-terminus of the subunit protein. In some embodiments, the peptide linker forms an alpha-helical structure.
- the peptide linker has an amino acid sequence comprising KEELD (SEQ ID NO: 37), PAPAP (SEQ ID NO: 38), KEELD (SEQ ID NO: 39), KDELD (SEQ ID NO: 40), RDERN (SEQ ID NO: 41), an (E(A) 3 K)n motif, an (E 4 (R/K) 4 )n motif, or an (AP)n motif.
- the subunit protein comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a subunit sequence selected from the group consisting of SEQ ID NO: 18 to SEQ ID NO: 36.
- the subunit protein comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18.
- the fusion protein comprises or consists of a sequence selected from the group consisting of SEQ NO: 42 to SEQ ID NO: 50.
- the protein particle further comprises one or more additional subunit proteins of the self-assembling protein cage, wherein the one or more additional subunit proteins lack a DARPin backbone having SEQ ID NO: 4.
- the one or more additional subunit proteins comprise or consist of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a subunit sequence selected from the group consisting of SEQ ID NO: 18 to SEQ ID NO: 36.
- the one or more additional subunit proteins comprise or consist of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 19.
- the one or more additional subunit proteins have an amino acid sequence that differs from subunit protein that is fused to a DARPin backbone having SEQ ID NO: 4.
- the present invention is directed to a method of assaying the structure of a target protein of interest which comprises binding the target protein to the protein particle as described herein to result in a complex, and obtaining an image of the complex.
- the target protein is bound to a moiety such as a solid substrate (e.g., a bead), another protein, or a chemical (e.g., a drug).
- the image is obtained by cryo-electron microscopy.
- the methods comprise assaying the structure of the target protein of interest when bound to a given ligand (e.g., another protein or chemical such as a drug of interest), which comprise forming a complex comprising the given ligand bound to the target protein and the fusion protein bound to the given ligand and/or the target protein; and obtaining a cryo-electron micrograph of the complex.
- the methods further comprise comparing the cryo-electron micrograph to a reference cryo- electron micrograph, said reference cryo-electron micrograph is of (a) a complex comprising the fusion protein bound to the target protein, said complex excludes the presence of the given ligand or another ligand bound thereto; (b) a complex comprising (i) the given ligand bound to a mutant of the target protein and (ii) the fusion protein bound to the given ligand and/or the mutant, said mutant having at least one amino acid substitution, deletion, or addition as compared to the target protein; or (c) a complex comprising (i) a second ligand bound to the target protein and (ii) the fusion protein bound to the given ligand and/or the target protein, wherein said second ligand is different the given ligand.
- the fusion protein is provided in the form of a REMIS cage as described herein.
- Figure 1 Schematically shows the structure of a REMIS cage.
- FIG. 2 Schematically shows the structure of a prior art cage protein having DARPins attached thereto, which DARPins lack a DARPin backbone comprising SEQ ID NO: 4.
- Figure 3 Cryo-EM micrograph of imaging scaffold bound to KRas (left), 2D classes from the selected particles (right).
- Figure 4 Cryo-EM imaging of KRas bound to the drug molecule AMG510 while bound to the REMIS cage RCG-33. The molecular model is shown on the left. A surface rendering of the 3-D cryo-EM density map that formed the basis for the molecular model is shown on the right.
- DARPin backbones Disclosed herein are DARPin backbones. DARPins having DARPin backbones as described herein are capable of forming multimers when in close proximity. Also disclosed herein are self-assembling protein cages which comprise the DARPin backbones. Adjacent DARPin backbones on the protein cages form multimers and thereby “rigidify” the protein cage structure.
- the rigidified protein cages when used as a scaffold for Cyro-EM imaging, the rigidified protein cages enable 3 ⁇ resolution imaging of proteins smaller than 20 kDa.
- the rigidified protein cages are referred to herein as Rigid Electron Microscopy Imaging Scaffold cages (“REMIS cages”) and the fusion proteins that comprise a DARPin backbone fused to a subunit of a self- assembling protein cage are referred to herein as “REMIS subunits”.
- REMIS cages Rigid Electron Microscopy Imaging Scaffold cages
- REMIS subunits fusion proteins that comprise a DARPin backbone fused to a subunit of a self- assembling protein cage.
- the exemplified REMIS cages present 12 copies of the target cargo, thereby tending to provide EM images with very large numbers of individual views of the cargo protein.
- the high symmetry of the REMIS cage means that an individual particle (REMIS cage plus its bound target cargo) presents the target cargo in 12 different orientations, thereby strongly mitigating preferred orientation effects that challenge many cryo-EM studies.
- DARPIN BACKBONES have an amino acid sequence that comprises (1) X1-X2-X3-X4-V-X5-X6-L-L-A-X7-G-A-D-V-N (SEQ ID NO: 1) wherein X1 is Y, Q, I or D, preferably Y, Q, or I, more preferably, Y or Q, most preferably Y; X2 is D or K, preferably D; X3 is D or Y, preferably D; X4 is Q, E, or A, preferably Q or E, more preferably Q; X5 is A or D, preferably A; X6 is A or W, preferably A; X7 is K or L, preferably K; and wherein when X5 is D then X6 is A, and when X6 is W then X7 is L; (2) H-X8-E-I-V-X9-V-L-L-X10-R-
- amino acid residues X5, X6, 9L of SEQ ID NO: 1, X7, 11R of SEQ ID NO: 2, X15, X16, and X17 are key in the formation of multimers
- amino acid residues 1Y and 4Q of SEQ ID NO: 1, and 15I of SEQ ID NO: 2 improve multimer formation and/or multimer stability or rigidity
- amino acid residues X8 and X14 are based on variability at corresponding locations in known DARPins, and are not expected to disrupt multimer formation, stability, or rigidity.
- the DARPin backbones have an amino acid sequence that comprises or consists of K-K-L-L-E-A-X-X-X-X-X1-X2-X3-X4-V-X5-X6-L-L-A-X7-G-A-D-V-N-X-X-X-X-X- X-X-X-P-L-X-L-A-X-X-X-X-X-H-X8-E-I-V-X9-V-L-L-X10-R-G-X11-D-X12-X13-X-X- X-X-X-X-X-P-L-H-L-A-X-X-X-X-X-X-H-L-E-I-X14-E-X15-L-L-X16-X17-G-A-D-V-N (SEQ ID NO: 4) wherein X1 is
- amino acid residues X5, X6, 19L, X7, 54R, X15, X16, and X17 of SEQ ID NO: 4 are key in the formation of multimers
- amino acid residues 11Y, 14Q, and 58I of SEQ ID NO: 4 improve multimer formation and/or multimer stability or rigidity
- amino acid residues X8 and X14 of SEQ ID NO: 4 are based on variability at corresponding locations in known DARPins, and are not expected to disrupt multimer formation, stability, or rigidity.
- Exemplary DARPins having a DARPin backbone include DARPin against GFP: KKLLEAARAGYDDQVAALLAKGADVNAADDVGVTPLHLAAQRGHLEIVEVLLKRGADINAADLWG QTPLHLAATAGHLEIVELLLRWGADVNARDNIGHTPLHLAAWAGHLEIVEVLLKYGADVNAQDKF GKTPFDLAIDNGNEDIAEVLQKAA (SEQ ID NO: 5) DARPin against GFP: KKLLEAARAGDKYAVDALLAKGADVNAADDVGVTPLHLAAQRGHLEIVEVLLKRGWDINAADLWG QTPLHLAATAGHLEIVELLLWYGADVNARDNIGHTPLHLAAWAGHLEIVEVLLKYGADVNAQDKF GKTPFDLAIDNGNEDIAEVLQKAA (SEQ ID NO: 6) DARPin against GFP: KKLLEAARAGYDDQVA
- Proteins comprising a DARPin backbone will typically comprise between 4 and 6 tandem copies of the 33 amino acid motif (“repeat unit”) that identifies the protein as belonging to the DARPin protein family. In each repeat unit, the amino acid residues of the loop regions are variable and may be selected based on the intended cargo protein to be bound thereby. See, e.g., Plückthun (2015). Proteins comprising a DARPin backbone may comprise “capping repeats”, i.e., the first and last repeat units that are typical of DARPins.
- a protein having a DARPin backbone may be recombinantly linked to a subunit of a self-assembling protein cage (“protein cage”) to result in a REMIS subunit.
- Subunits of protein cages include those described in US8969521, US9066870, US9630994, US10248758, US10501733, US20200397886, US20210163540, and WO2020/220044 and the following: Example, Subunit B: MFTRRGDQGETDLANRARVGKDSPVVEVQGTIDELNSFIGYALVLSRWDDIRNDLFRIQNDLFVL GEDVSTGGKGRTVTMDMIIYLIKRSVEMKAEIGKIELFVVPGGSVESASLHMARAVSRRLERRIK AASELTEINANVLLYANMLSNILFMHALISNKR (SEQ ID NO: 18) Cage Protein T33-51, Subunit A: MRITTKVGDKGSTRLFGGEEVWKDDPI
- the peptide linker will typically be a sequence that is predicted to favor an alpha-helical conformation in accordance with rules and patterns in the art. See, e.g., Padilla et al. (2001). [0043] In some embodiments, the peptide linker is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 and not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long. In some embodiments, the peptide linker comprises an amino acid sequence or amino acid motif that forms an alpha-helical structure.
- the amino acid sequence of the peptide linker comprises KEELD (SEQ ID NO: 37), PAPAP (SEQ ID NO: 38), KEELD (SEQ ID NO: 39), KDELD (SEQ ID NO: 40), RDERN (SEQ ID NO: 41), (E(A) 3 K)n motif, (E4(R/K)4)n motif, or (AP)n motif.
- the amino acid sequence of the subunit has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a sequence selected from SEQ ID NO: 18 to SEQ ID NO: 36.
- REMIS subunits comprise (A) a protein having a DARPin backbone that (1) comprises SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, or (2) comprises or consists of SEQ ID NO: 4, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, and each X is independently any amino acid; (B) a peptide linker that is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 and not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long, said peptide linker preferably forms an alpha-helical structure and/or has an amino acid sequence comprising KEELD (SEQ ID NO: 37), PAP
- the peptide linker joins the N- terminus of the DARPin backbone to the C-terminus of the subunit.
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- REMIS subunits comprise (A) a protein having a DARPin backbone that (1) comprises SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, or (2) comprises or consists of SEQ ID NO: 4, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, and each X is independently any amino acid; (B) a peptide linker that comprises or consists of KEELD (SEQ ID NO: 32), and (C) a subunit having an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18, wherein the peptide linker joins the N-terminus of the DARPin back
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- REMIS subunits comprise (A) a protein having a DARPin backbone that comprises or consists of a sequence selected from the group consisting of SEQ ID NO: 4, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, and SEQ ID NO: 71; (B) a protein having a DARPin backbone that comprises
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- the REMIS subunit comprises or consists of one of the following: RCG-10, Subunit B, which binds GFP: MFTRRGDQGETDLANRARVGKDSPVVEVQGTIDELNSFIGYALVLSRWDDIRNDLFRIQNDLFVL GEDVSTGGKGRTVTMDMIIYLIKRSVEMKAEIGKIELFVVPGGSVESASLHMARAVSRRLERRIK AASELTEINANVLLYANMLSNILFMHALISNKRKEELDKKLLEAARAGYDDQVAALLAKGADVNA ADDVGVTPLHLAAQRGHLEIVEVLLKRGADINAADLWGQTPLHLAATAGHLEIVELLLRWGADVN ARDNIGHTPLHLAAWAGHLEIVEVLLKYGADVNAQDKFGKTPFDLAIDNGNEDIAEVL
- REMIS cages also comprise multiple copies of an additional subunit, which additional subunit does not contain a protein having a DARPin backbone linked thereto.
- the additional subunit comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to a sequence selected from SEQ ID NO: 18 to SEQ ID NO: 36.
- the additional subunit comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 19.
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- the REMIS cages comprise or consist of (I) a REMIS subunit that comprises or consists of (A) a protein having a DARPin backbone that (1) comprises SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, or (2) comprises or consists of SEQ ID NO: 4, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, and each X is independently any amino acid; (B) a peptide linker that is at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 and not more than 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or 5 amino acid residues long, said peptide linker preferably forms an alpha-helical structure and/or has an
- the sequence of the additional subunit is different from that of the subunit of the REMIS subunit.
- the subunit of the REMIS subunit and the additional subunit are counterparts, i.e., subunit A and subunit B, of a known self-assembling protein cage.
- the additional subunit comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 19.
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- the REMIS cages comprise or consist of (I) a REMIS subunit that comprises or consists of (A) a protein having a DARPin backbone that (1) comprises SEQ ID NO: 1, SEQ ID NO: 2, and SEQ ID NO: 3, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, or (2) comprises or consists of SEQ ID NO: 4, wherein X1 to X17 are the same as that indicated for the DARPin Backbone Formula set forth above, and each X is independently any amino acid; (B) a peptide linker that comprises or consists of KEELD (SEQ ID NO: 32), and (C) a subunit having an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 18, wherein the
- the sequence of the additional subunit is different from that of the subunit of the REMIS subunit.
- the subunit of the REMIS subunit and the additional subunit are counterparts, i.e., subunit A and subunit B, of a known self-assembling protein cage.
- the additional subunit comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 19.
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- the REMIS cages comprise or consist of (I) a REMIS subunit that comprises or consists of (A) a protein having a DARPin backbone that comprises or consists of a sequence selected from the group consisting of SEQ ID NO: 4, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 53, SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59, SEQ ID NO: 60, SEQ ID NO: 61, SEQ ID NO: 62, SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68, SEQ ID NO: 69, SEQ ID NO: 70, and SEQ ID NO: 71; (B) a peptide linker that comprises or consists of KEELD (SEQ ID NO: 32
- the sequence of the additional subunit is different from that of the subunit of the REMIS subunit.
- the subunit of the REMIS subunit and the additional subunit are counterparts, i.e., subunit A and subunit B, of a known self- assembling protein cage.
- the additional subunit comprises or consists of an amino acid sequence that has at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to SEQ ID NO: 19.
- the protein having the DARPin backbone comprises or consists of a sequence selected from SEQ ID NO: 5 to SEQ ID NO: 17.
- the subunits (the REMIS subunit and, if present, the additional subunit) are typically present in 12, 24, or 60 copies in the assembled REMIS cage, according to the symmetries of Platonic solids. In its assembled form, a REMIS cage would typically be in the range of about 400 kDa up to about 3 MDa.
- the DARPin backbones of REMIS subunits protrude from REMIS cages like “arms”.
- a REMIS cage is schematically shown in Figure 1 as solid shapes and ribbon diagrams.
- Figure 2 schematically shows the “prior art”, i.e., protein cages having DARpins that lack a DARPin backbone as described herein. As shown in Figure 2, the DARpins do not bind each other.
- DARPin backbones, REMIS subunits, and REMIS cages described herein may be made using methods known in the art including chemical synthesis, biosynthesis or in vitro synthesis using recombinant DNA methods, and solid phase synthesis.
- DARPin backbones, REMIS subunits, and REMIS cages may be purified using protein purification techniques known in the art such as reverse phase high- performance liquid chromatography (HPLC), ion-exchange or immunoaffinity chromatography, filtration or size exclusion, or electrophoresis. See, e.g., Olsnes and Pihl (1973) Biochem 12(16):3121-3126; and Scopes (1982) Protein Purification, Springer-Verlag, NY, which are herein incorporated by reference. Polynucleotides that encode the DARPin backbones, REMIS subunits, and REMIS cages described herein are also contemplated herein.
- the DARPin backbones, REMIS subunits, and REMIS cages are isolated.
- the DARPin backbones and REMIS subunits are substantially purified.
- a “substantially purified” compound refers to a compound that is removed from its natural environment and/or is at least about 60% free, preferably about 75% free, and more preferably about 90% free, and most preferably about 95-100% free from other macromolecular components or compounds with which the compound is associated with in nature or from its synthesis.
- a protein comprising a DARPin backbone specifically binds the given target it was designed to bind.
- a protein comprising a DARPin backbone “specifically binds” its given target if it reacts or associates more frequently, more rapidly, with greater duration, and/or with greater binding affinity with the given target than it does with a given alternative, and/or indiscriminate binding that gives rise to non-specific binding and/or background binding.
- non-specific binding and background binding refer to an interaction that is not dependent on the presence of a specific structure (e.g., a given epitope).
- the proteins were expressed in E. coli BL21(DE3) cells (New England Biolabs) in Terrific Broth at 18 °C overnight upon 1 mM IPTG induction at OD 1.0.
- pellets were resuspended in resuspension buffer (50 mM Tris, 300 mM NaCl, 20 mM imidazole, pH 8.0) supplemented with benzonase nuclease, 1 mM PMSF, protease inhibitor cocktail EDTA-free (Thermo Scientific) and 0.1% LDAO and lysed using an EmulsiFlex C3 homogenizer (Avestin).
- resuspension buffer 50 mM Tris, 300 mM NaCl, 20 mM imidazole, pH 8.0
- benzonase nuclease 1 mM PMSF
- protease inhibitor cocktail EDTA-free Thermo Scientific
- 0.1% LDAO EmulsiFlex
- the cell lysate was cleared by centrifugation at 20,000 xg for 20 minutes at 4 °C, the resulting supernatant was recovered and centrifuged at 10,000 xg for 10 minutes at 4 °C and then loaded onto a HisTrap column (GE Healthcare) pre-equilibrated with the same resuspension buffer.
- the imaging scaffold was eluted with a linear gradient to 300 mM imidazole. Upon elution, 5 mM EDTA and 5 mM BME were added immediately. The eluted protein was concentrated using Amicon Ultra-15100 kDa MWCO for the imaging scaffold and 3 kDa MWCO for the GFP.
- the concentrated protein was further purified by size exclusion chromatography using a Superose 6 Increase column, eluted with 20 mM Tris pH 8.0, 100 mM NaCl, 5 mM BME, 5 mM EDTA. Fractions were analyzed by SDS-PAGE and negative stain EM for the presence of the imaging scaffold. [0063] KRas G12V and KRas G13C proteins were prepared using methods in the art. [0064] Negative Stain EM [0065] The concentration of a 3.5 ⁇ l sample of fresh Supersoe 6 Increase eluent was adjusted to about 100 ⁇ g/ml, applied to glow-discharged for one minute and blotted away.
- Cryo-EM Data Processing and Model Building [0069] Motion correction, CTF correction, particle picking and 2D classification were performed in cryoSPARC v.3.2 using methods in the art. [0070] An initial set of particles was automatically picked using a blob-picker. The extracted particles were 2D classified and then an ab initio reconstruction was generated. The model was used to obtain a 3D reconstruction enforcing T symmetry. The 3D structure was used to generate 2D projections of the particles and then used to repick the particles from the images using a template picker. The picked particles were extracted from the micrographs and then 3D reconstructed enforcing T symmetry. These were further refined using focused 3D classification and local refinements.
- the models were subjected to multiple rounds of refinement with Coot (Emsley et al., 2010) and PHENIX (Afonine et al., 2018) [0072] CYRO-EM IMAGING Five different REMIS cages were made based on the original T33-51 protein cage.
- the original T33-51 protein cage comprises subunit A (SEQ ID NO: 19) and subunit B (SEQ ID NO: 20) in a stoichiometry of A 12 B 12 .
- the REMIS cages contained a REMIS subunit comprising SEQ ID NO: 18 (instead of SEQ ID NO: 20) and a DARPin backbone (i.e., SEQ ID NO: 4) linked thereto via a peptide linker (i.e., SEQ ID NO: 37) in the same stoichiometry.
- the REMIS cages were RCG-10, RCG-5, RCG- 8, RCG-13, and RCG-14, and their REMIS subunits were SEQ ID NO: 42 to SEQ ID NO: 46, respectively.
- Each REMIS cage assembled according to their elution volumes in size exclusion chromatography and bound their intended target (“target cargo”).
- RCG- 10 was used as the reference scaffold upon which DARPin sequence changes could be made to present other proteins of interest.
- the oncogenic protein KRas (19.4 kDa) and MBP were selected as the target cargo and the loop regions of SEQ ID NO: 4 were substituted accordingly.
- target cargo was added to REMIS cages and then purified by SEC prior to cryo-EM analysis.
- Three KRas mutants were studied: G12V, G12C, and G13C.
- the KRas mutants contained GDP as a bound ligand and were bound by RCG-33 (SEQ ID NO: 50).
- the DARPin represented by SEQ ID NO: 49 binds KRas including the KRas mutants.
- cyro-EM imaging using the REMIS cages can be used to characterize the structures of proteins as small as about 19.4 kDa and also identify and/or distinguish proteins having single amino acid differences from one another.
- cryo-EM imaging revealed the conformation of the drug molecule while bound to KRas. See Figure 4.
- the resolution of REMIS cage plus MBP cargo was similar. See Figure 5. Therefore, cryo-EM imaging using the REMIS cages can be used to elucidate the conformation of protein- ligand interactions, i.e., the structure of the target protein when bound to a given ligand of interest.
- protein-ligand e.g., drug interactions between target protein and a given ligand of interest can be assayed and/or characterized by comparing the cryo-EM image of the complex comprising (i) the given ligand bound to the target protein and (ii) the REMIS cage bound to the target protein and/or the given ligand.
- the reference cryo-electron micrograph is of (a) a complex comprising the fusion protein bound to the target protein, said complex excludes the presence of the given ligand or another ligand bound thereto; (b) a complex comprising (i) the given ligand bound to a mutant of the target protein and (ii) the fusion protein bound to the given ligand and/or the mutant, said mutant having at least one amino acid substitution, deletion, or addition as compared to the target protein; or (c) a complex comprising (i) a second ligand bound to the target protein and (ii) the fusion protein bound to the given ligand and/or the target protein, wherein said second ligand is different the given ligand.
- Differences between the cryo-EM image and one or more reference cryo-electron micrographs can then be used to characterize the protein-ligand interaction between a target protein and given ligand, identify the target protein or a ligand that binds the target protein, distinguish the target protein from other proteins, or distinguish a ligand bound to the target protein from other ligands that also bind the target protein.
- Structural analysis indicates that the key amino acid residues for multimer formation between DARPins comprising a DARPin backbone are at positions 16, 17, 19, 21, 54, 83, 86, and 87 of SEQ ID NO: 4 and amino acid residues a positions 11Y, 14Q, and 58I of SEQ ID NO: 4 improve the formation and/or structural integrity of multimers.
- REMIS cages having SEQ ID NO: 4, wherein X5 is A, X6 is A, X7 is K, X15 is L, X16 is R, and X17 is W were found to be more rigid than those where: ⁇ X5 is D, X6 is A, X7 is K, X15 is L, X16 is W, and X17 is Y; ⁇ X5 is A, X6 is W, X7 is L, X15 is K, X16 is R, and X17 is C; ⁇ X5 is A, X6 is A, X7 is K, X15 is L, X16 is R, and X17 is C; and ⁇ X5 is D, X6 is W, X7 is L, X15 is L, X16 is R, and X17 is C.
- DARPin backbones, REMIS subunits, and REMIS cages comprise or consist of: SEQ ID NO: 51: KKLLEAXXXXX1X2X3X4VAALLAKGADVNXXXXXXXXPLXLAXXXXHX8EIVX9VLLX10RGX1 1DX12X13XXXXXXXXPLHLAXXXXHLEIX14ELLLRWGADVN; SEQ ID NO: 52: KKLLEAXXXXX1X2X3X4VDALLAKGADVNXXXXXXXXXPLXLAXXXXHX8EIVX9VLLX10RGX1 1DX12X13XXXXXXXXXPLHLAXXXXXHLEIX14ELLLWYGADVN; SEQ ID NO: 53: KKLLEAXXXXX1X2X3X4VAWLLALGADVNXXXXX
- REMIS cages having SEQ ID NO: 4, wherein X1 is Y, X4 is Q, X5 is A, X6 is A, X7 is K, X12 is I, X15 is L, X16 is R, and X17 is W were found to be more rigid than those where: ⁇ X1 is D, X4 is A, X5 is D, X6 is A, X7 is K, X12 is I, X15 is L, X16 is W, and X17 is Y; ⁇ X1 is Y, X4 is K, X5 is A, X6 is W, X7 is L, X12 is I, X15 is K, X16 is R, and X17 is C; ⁇ X1 is I, X4 is A, X5 is A, X6 is A, X7 is K, X12 is I, X15 is L, X16 is R, and X17 is C;
- DARPin backbones, REMIS subunits, and REMIS cages comprise or consist of: SEQ ID NO: 56: KKLLEAXXXXYX2X3QVAALLAKGADVNXXXXXXXXXPLXLAXXXXHX8EIVX9VLLX10RGX11D IX13XXXXXXXXPLHLAXXXXHLEIX14ELLLRWGADVN; SEQ ID NO: 57: KKLLEAXXXXDX2X3AVDALLAKGADVNXXXXXXXXXPLXLAXXXXHX8EIVX9VLLX10RGX11D IX13XXXXXXXXXPLHLAXXXXXHLEIX14ELLLWYGADVN; SEQ ID NO: 58: KKLLEAXXXYX2X3KVAWLLALGADVNXXXXXXXXPLXLAX
- DARPin backbones, REMIS subunits, and REMIS cages comprise or consist of: SEQ ID NO: 63: KKLLEAXXXXYDDQVAALLAKGADVNXXXXXXXXPLXLAXXXXHLEIVEVLLKRGADINXXXXXX XXPLHLAXXXXHLEIVELLLRWGADVN; SEQ ID NO: 64: KKLLEAXXXXDKYAVDALLAKGADVNXXXXXXXXXXPLXLAXXXXHLEIVEVLLKRGWDINXXXXXXX XPLHLAXXXXHLEIVELLLWYGADVN; SEQ ID NO: 65: KKLLEAXXXYDDKVAWLLALGADVNXXXXXXXXPLXLAXXXXHLEIVEVLLKRGADINXXXXXXX XPLHLAXXXXXHLEIVEVLLKRGAD
- REMIS cages having SEQ ID NO: 63 were found to be the most rigid.
- preferred DARPin backbones, REMIS subunits, and REMIS cages comprise or consist of SEQ ID NO: 63.
- REFERENCES The following references are herein incorporated by reference in their entirety with the exception that, should the scope and meaning of a term conflict with a definition explicitly set forth herein, the definition explicitly set forth herein controls: Afonine et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr D Struct Biol.2018 Jun 1;74(Pt 6):531-544.
- a and/or B means “A, B, or both A and B” and “A, B, C, and/or D” means “A, B, C, D, or a combination thereof” and said “A, B, C, D, or a combination thereof” means any subset of A, B, C, and D, for example, a single member subset (e.g., A or B or C or D), a two-member subset (e.g., A and B; A and C; etc.), or a three-member subset (e.g., A, B, and C; or A, B, and D; etc.), or all four members (e.g., A, B, C, and D).
- a single member subset e.g., A or B or C or D
- a two-member subset e.g., A and B; A and C; etc.
- a three-member subset e.g., A, B, and C; or A, B, and D; etc.
- the phrase “one or more of”, e.g., “one or more of A, B, and/or C” means “one or more of A”, “one or more of B”, “one or more of C”, “one or more of A and one or more of B”, “one or more of B and one or more of C”, “one or more of A and one or more of C” and “one or more of A, one or more of B, and one or more of C”.
- the phrase “comprises or consists of A” is used as a tool to avoid excess page and translation fees and means that in some embodiments the given thing at issue: comprises A or consists of A.
- the sentence “In some embodiments, the composition comprises or consists of A” is to be interpreted as if written as the following two separate sentences: “In some embodiments, the composition comprises A. In some embodiments, the composition consists of A.”
- a sentence reciting a string of alternates is to be interpreted as if a string of sentences were provided such that each given alternate was provided in a sentence by itself.
- the sentence “In some embodiments, the composition comprises A, B, or C” is to be interpreted as if written as the following three separate sentences: “In some embodiments, the composition comprises A. In some embodiments, the composition comprises B.
- the composition comprises C.”
- the sentence “In some embodiments, the composition comprises at least A, B, or C” is to be interpreted as if written as the following three separate sentences: “In some embodiments, the composition comprises at least A. In some embodiments, the composition comprises at least B. In some embodiments, the composition comprises at least C.”
- sample is used in its broadest sense and includes specimens and cultures obtained from any source, as well as biological samples and environmental samples.
- an “isolated” compound refers to a compound that is isolated from its native environment.
- an isolated polynucleotide is a one which does not have the bases normally flanking the 5’ end and/or the 3’ end of the polynucleotide as it is found in nature.
- an isolated polypeptide is a one which does not have its native amino acids, which correspond to the full-length polypeptide, flanking the N-terminus, C-terminus, or both.
- protein protein
- polypeptide and “peptide” are used interchangeably to refer to two or more amino acids linked together. Groups or strings of amino acid abbreviations are used to represent peptides.
- nucleic acid sequences are indicated with the 5’ end on the left and the sequences are written from 5’ to 3’.
- sequence identity refers to the percentage of nucleotides or amino acid residues that are the same between sequences, when compared and optimally aligned for maximum correspondence over a given comparison window, as measured by visual inspection or by a sequence comparison algorithm in the art, such as the BLAST algorithm, which is described in Altschul et al., (1990) J Mol Biol 215:403-410.
- Software for performing BLAST (e.g., BLASTP and BLASTN) analyses is publicly available through the National Center for Biotechnology Information (ncbi.nlm.nih.gov).
- the comparison window can exist over a given portion, e.g., a functional domain, or an arbitrarily selection a given number of contiguous nucleotides or amino acid residues of one or both sequences.
- the comparison window can exist over the full length of the sequences being compared.
- a given comparison window e.g., over 80% of the given sequence
- the recited sequence identity is over 100% of the given sequence.
- the percentages are determined using BLASTP 2.8.0+, scoring matrix BLOSUM62, and the default parameters available at blast.ncbi.nlm.nih.gov/Blast.cgi.
- Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv Appl Math 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J Mol Biol 48:443 (1970), by the search for similarity method of Pearson & Lipman, PNAS USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual inspection.
Landscapes
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Peptides Or Proteins (AREA)
Abstract
L'invention concerne des squelettes de DARPin qui forment des cages de protéines multimères et à auto-assemblage qui comprennent des protéines, qui comprennent des squelettes de DARPin, fusionnés à des protéines de sous-unités des cages de protéines à auto-assemblage et des procédés de fabrication et d'utilisation de ceux-ci.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263406976P | 2022-09-15 | 2022-09-15 | |
US63/406,976 | 2022-09-15 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2024059686A2 true WO2024059686A2 (fr) | 2024-03-21 |
WO2024059686A3 WO2024059686A3 (fr) | 2024-05-23 |
Family
ID=90275915
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/074156 WO2024059686A2 (fr) | 2022-09-15 | 2023-09-14 | Squelettes d'arpine et échafaudages d'imagerie par microscopie électronique rigidifiée |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024059686A2 (fr) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NZ592591A (en) * | 2008-11-03 | 2012-04-27 | Molecular Partners Ag | Binding proteins comprising ankyrin repeast domains that inhibit the vegf-a receptor interaction |
KR20140039203A (ko) * | 2011-04-29 | 2014-04-01 | 얀센 바이오테크 인코포레이티드 | Il4/il13 결합 반복 단백질 및 용도 |
-
2023
- 2023-09-14 WO PCT/US2023/074156 patent/WO2024059686A2/fr unknown
Also Published As
Publication number | Publication date |
---|---|
WO2024059686A3 (fr) | 2024-05-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Fontana et al. | Structure of cytoplasmic ring of nuclear pore complex by integrative cryo-EM and AlphaFold | |
Yoder et al. | The His-Gly motif of acid-sensing ion channels resides in a reentrant ‘loop’implicated in gating and ion selectivity | |
Stuttfeld et al. | Architecture of the human mTORC2 core complex | |
Hinshaw et al. | The structure of the Ctf19c/CCAN from budding yeast | |
Yao et al. | Fusion of DARPin to aldolase enables visualization of small protein by cryo-EM | |
Choi et al. | NSF-mediated disassembly of on-and off-pathway SNARE complexes and inhibition by complexin | |
Dong et al. | Structure of human M-type phospholipase A2 receptor revealed by cryo-electron microscopy | |
Chen et al. | Structure of an endogenous mycobacterial MCE lipid transporter | |
Tao et al. | Cryo-EM structure of the KvAP channel reveals a non-domain-swapped voltage sensor topology | |
Hallett et al. | Cryo-EM structure of the Smc5/6 holo-complex | |
Sverzhinsky et al. | Membrane protein complex ExbB4-ExbD1-TonB1 from Escherichia coli demonstrates conformational plasticity | |
Lukoyanova et al. | 3D reconstruction of mammalian septin filaments | |
Fregoso et al. | Molecular mechanism of Arp2/3 complex inhibition by Arpin | |
Nörpel et al. | Structure of the human C9orf72-SMCR8 complex reveals a multivalent protein interaction architecture | |
WO2024059686A2 (fr) | Squelettes d'arpine et échafaudages d'imagerie par microscopie électronique rigidifiée | |
Meissner et al. | Quaternary structure of the European spiny lobster (Palinurus elephas) 1× 6-mer hemocyanin from cryoEM and amino acid sequence data | |
Pinotsis et al. | Evidence for a dimeric assembly of two titin/telethonin complexes induced by the telethonin C-terminus | |
Javadi et al. | Assembly mechanisms of the bacterial cytoskeletal protein FilP | |
Cai et al. | Structural basis of CST-Polα/Primase recruitment and regulation by POT1 at telomeres | |
James et al. | Structure of a proton-powered molecular motor that drives protein transport and gliding motility | |
Ford et al. | CFTR three-dimensional structure | |
Yu et al. | Yeast R2TP interacts with extended termini of client protein Nop58p | |
Castells-Graells et al. | Rigidified scaffolds for 3 angstrom resolution cryo-EM of small therapeutic protein targets | |
US20230083273A1 (en) | Hepadnavirus capsid protein heterodimers and virus-like particles | |
Dutka et al. | Structure and function of the Dot/Icm T4SS |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23866460 Country of ref document: EP Kind code of ref document: A2 |